4-model average
building category history
loading
3pt spread
The three supporting readings tell you how much weight to put on the probability: confidence reflects category-level track record, stability tracks how the estimate has moved over time, models shows whether the four agree.
Loading belief trajectory...
Given the pace of technological advancement in AI, inference token costs have been trending downward, but a decline of more than 60% YoY seems overly ambitious. For example, recent trends show reductions in costs ranging around 30-40% YoY, suggesting that a collapse beyond that is unlikely unless significant breakthroughs occur. Meanwhile, enterprise AI revenue is projected to grow moderately, with estimates around 30-35% YoY growth, aligning with the current market's cautious optimism.
While inference token costs have seen rapid declines (e.g., from ~$0.03/1k tokens for GPT-3.5 to under $0.001/1k tokens for some models), a >60% YoY decline in 2026 is aggressive given current diffusion rates and the increasing complexity and scale of models. Enterprise AI revenue growth is projected to remain robust, likely exceeding 40% YoY in 2026, driven by widespread adoption.
This event requires two simultaneous conditions: inference token costs declining >60% YoY (unprecedented compression given only ~20-30% annual declines historically) while enterprise AI revenue grows <40% YoY (below current 50-70% CAGR rates). The inverse relationship between these metrics works against joint occurrence—significant cost reductions typically *drive* higher revenue growth by expanding addressable markets. For costs to fall 60%+ would require major architectural breakthroughs (e.g., MoE scaling, drastically improved efficiency) that would simultaneously accelerate enterprise adoption, making the revenue constraint harder to meet. Historical precedent shows inference costs have declined ~25-30% annually (2023-2025), making a doubling of that rate unlikely without demand collapsing, which would contradict the 40%+ revenue growth condition.
Inference costs have already fallen 50-70% YoY in 2024-25 from GPT-4 ($0.03/1k tokens) to Claude 3.5/GPT-4o levels, but sustaining >60% further declines requires continued hardware scaling and algorithmic gains that face diminishing returns as models approach 10^26 FLOP training budgets. Enterprise AI revenue grew ~45% YoY in 2024 (Microsoft Azure AI +$10B, Google Cloud +$7B) and is unlikely to drop below 40% given committed capex from hyperscalers exceeding $200B combined in 2025-26. The combination of plateauing inference efficiency gains and sustained enterprise spending creates low odds of both conditions holding simultaneously.