Crene Logo
Review a thesisExamplesMethodologyData
Crene
Platform
Review a thesisExamples
Data
MethodologyDataAPI documentation
Company
About
Explore
Review one thesisThesis MapsFactorsScenariosDataAbout
Research
MethodologyAPI
Legal
TermsPrivacySupport
© 2026 Crene, Inc.
Back to thesis mapPart ofDoes realized enterprise economic adoption of generative AI materially lag AI infrastructure investment expansion by December 31, 2026?
Event · UNIT ECONOMICS & PRICING

Will inference token costs decline by more than 60% YoY in 2026 while enterprise AI revenue grows less than 40% YoY?

Resolves Dec 31, 2026
Probability
26%

4-model average

Confidence
LOW

building category history

Stability
—

loading

Models
Aligned

3pt spread

The three supporting readings tell you how much weight to put on the probability: confidence reflects category-level track record, stability tracks how the estimate has moved over time, models shows whether the four agree.

Belief trajectory

Loading belief trajectory...

Per-model probabilities

GPT-4o
25%-1 vs avg

Given the pace of technological advancement in AI, inference token costs have been trending downward, but a decline of more than 60% YoY seems overly ambitious. For example, recent trends show reductions in costs ranging around 30-40% YoY, suggesting that a collapse beyond that is unlikely unless significant breakthroughs occur. Meanwhile, enterprise AI revenue is projected to grow moderately, with estimates around 30-35% YoY growth, aligning with the current market's cautious optimism.

Key uncertaintyThe emergence of a new generative AI architecture that drastically reduces inference costs could significantly alter both token costs and revenue growth forecasts.
Gemini
25%-1 vs avg

While inference token costs have seen rapid declines (e.g., from ~$0.03/1k tokens for GPT-3.5 to under $0.001/1k tokens for some models), a >60% YoY decline in 2026 is aggressive given current diffusion rates and the increasing complexity and scale of models. Enterprise AI revenue growth is projected to remain robust, likely exceeding 40% YoY in 2026, driven by widespread adoption.

Key uncertaintyThe pace of algorithmic efficiency improvements and the emergence of significantly more cost-effective model architectures could accelerate cost declines beyond current projections.
Claude
28%+2 vs avg

This event requires two simultaneous conditions: inference token costs declining >60% YoY (unprecedented compression given only ~20-30% annual declines historically) while enterprise AI revenue grows <40% YoY (below current 50-70% CAGR rates). The inverse relationship between these metrics works against joint occurrence—significant cost reductions typically *drive* higher revenue growth by expanding addressable markets. For costs to fall 60%+ would require major architectural breakthroughs (e.g., MoE scaling, drastically improved efficiency) that would simultaneously accelerate enterprise adoption, making the revenue constraint harder to meet. Historical precedent shows inference costs have declined ~25-30% annually (2023-2025), making a doubling of that rate unlikely without demand collapsing, which would contradict the 40%+ revenue growth condition.

Key uncertaintyWhether a fundamental efficiency breakthrough (e.g., novel model compression, new inference paradigm) could decouple cost improvements from revenue acceleration, allowing both conditions to be technically satisfied through market saturation or budget constraints in 2026.
GrokGrok
28%+2 vs avg

Inference costs have already fallen 50-70% YoY in 2024-25 from GPT-4 ($0.03/1k tokens) to Claude 3.5/GPT-4o levels, but sustaining >60% further declines requires continued hardware scaling and algorithmic gains that face diminishing returns as models approach 10^26 FLOP training budgets. Enterprise AI revenue grew ~45% YoY in 2024 (Microsoft Azure AI +$10B, Google Cloud +$7B) and is unlikely to drop below 40% given committed capex from hyperscalers exceeding $200B combined in 2025-26. The combination of plateauing inference efficiency gains and sustained enterprise spending creates low odds of both conditions holding simultaneously.

Key uncertaintyWhether OpenAI/Anthropic release models with 5-10x inference efficiency in 2026 versus incremental 2x gains.

Resolution criteria

SourceOpenAI, Anthropic, Google AI Studio published pricing; enterprise revenue from earnings calls and disclosed ARR statements
CRENE-AIER-04-20261231Generated Jun 17, 2026