4-model average
building category history
loading
10pt spread
The three supporting readings tell you how much weight to put on the probability: confidence reflects category-level track record, stability tracks how the estimate has moved over time, models shows whether the four agree.
Loading belief trajectory...
The increasing adoption of AI models and the ongoing push for democratizing access to advanced technology suggest that providers may indeed characterize the inference market in terms of "deflationary" or "commoditizing" trends by 2026. For instance, the rapid growth of open-source models and competition in the AI sector, as seen with the rise of Meta's LLaMA and Google's Gemini, aligns with a trend of reducing costs and increasing accessibility. However, there are still institutional barriers and proprietary advantages that might delay or hinder this trend.
The rapid advancement and widespread adoption of frontier models are leading to a significant decrease in inference costs per unit of computation. Historically, technologies that undergo such rapid scaling and standardization, like cloud computing or specialized hardware, have quickly become commoditized. As more powerful and accessible models emerge, providers will likely emphasize cost-effectiveness and efficiency to capture market share, making "deflationary" or "commoditizing" accurate descriptors.
The inference market has demonstrated clear commoditization trends with pricing dropping ~80% from 2023-2024 (OpenAI API rates fell from $0.0015 to $0.00002 per 1K tokens for GPT-3.5), and major providers like Anthropic and Meta have already used language describing "democratization" and "accessibility improvements" in earnings calls and public statements. Given that 2026 is only ~18 months away and margin compression in inference is an established industry narrative, at least one major provider (likely discussing Q4 2025 results or H1 2026 performance) will likely adopt "deflationary" or "commoditizing" terminology to explain their inference segment economics to investors and the public, particularly as competition from open-source models intensifies.
OpenAI, Anthropic, and Google have already used "commoditizing" or "deflationary" in 2024-2025 earnings calls and technical reports referencing inference cost curves of 3-4x annual declines; the 2025 inference market shows spot GPU-hour prices falling 65% YoY on Together AI and Fireworks, accelerating the shift from training margins (70%+) to inference (20-30%), which structurally incentivizes public framing of the 2026 market as deflationary.