Crene Logo
Review a thesisExamplesMethodologyData
Crene
Platform
Review a thesisExamples
Data
MethodologyDataAPI documentation
Company
About
Explore
Review one thesisThesis MapsFactorsScenariosDataAbout
Research
MethodologyAPI
Legal
TermsPrivacySupport
© 2026 Crene, Inc.
Back to thesis mapPart ofDoes realized enterprise economic adoption of generative AI materially lag AI infrastructure investment expansion by December 31, 2026?
Event · UNIT ECONOMICS & PRICING

Will GPT-4-class input token prices decline by more than 70% YoY by Q4 2026?

Resolves Dec 31, 2026
Probability
45%

4-model average

Confidence
LOW

building category history

Stability
—

loading

Models
Diverging

40pt spread

The three supporting readings tell you how much weight to put on the probability: confidence reflects category-level track record, stability tracks how the estimate has moved over time, models shows whether the four agree.

Belief trajectory

Loading belief trajectory...

Per-model probabilities

GPT-4o
30%-15 vs avg

Current AI model deployment and development costs remain high, with significant investments in infrastructure, including GPUs. As of Q3 2023, input token prices have been stable, and the demand for advanced AI capabilities continues to grow, making a 70% decline by Q4 2026 unlikely. Historical trends show that while prices for technology tend to decline, such a steep drop is rare unless there’s a radical shift in technology or production efficiency that significantly reduces costs.

Key uncertaintyA breakthrough in quantum computing or alternative AI architectures could drastically lower the cost of AI inputs, potentially supporting a significant decline in token prices.
Gemini
55%+10 vs avg

The rapid pace of AI hardware innovation and increased competition among major AI providers (OpenAI, Google, Anthropic, Microsoft) are strong drivers for cost reduction. Historically, nascent technologies like GPUs and cloud computing have seen significant price drops after initial adoption, with input token costs likely to follow a similar trajectory as economies of scale and efficiency improvements are realized. However, the current high demand and complexity of training and running GPT-4 class models introduce significant cost pressures that could temper the rate of decline.

Key uncertaintyThe actual total cost of ownership and operational complexity of deploying and maintaining GPT-4-class models at scale, which is not yet fully understood or publicly disclosed by major players, will significantly influence pricing.
Claude
28%-17 vs avg

Historical precedent shows 40-50% price declines over similar timeframes, with the incremental 20+ percentage points to reach 70% requiring exceptional competitive pressure or strategic pricing changes by late 2026. While infrastructure costs decline steadily and competition intensifies, maintaining >70% YoY decline from Q4

Key uncertaintyNot specified
GrokGrok
68%+23 vs avg

GPT-4-class input pricing fell from $30/M tokens (Mar 2023) to $2.50/M (GPT-4o, May 2024), an 8× or 87.5% decline in 14 months; training runs at 5–6× prior FLOP scale now occur every 10–12 months while inference hardware efficiency improves ~2.5×/year (NVIDIA H100→B200 roadmap). Sustained capex at $50–70 B/yr by Microsoft/OpenAI and continued 30–40% gross-margin targets imply further 3–4× cost cuts are required by late 2026, making an additional 70% YoY drop from the 2025 baseline likely.

Key uncertaintyWhether next-generation inference accelerators deliver the projected 3–4× tokens/Joule gain or stall at 1.5–2× due to memory-bandwidth or yield constraints.
Key disagreementGrok (68%) vs Claude (28%): Different weighting of factors

Resolution criteria

SourceOpenAI, Anthropic, Google published pricing
CRENE-AIER-C026-20261231Generated Jun 28, 2026