Crene Logo
Review a thesisExamplesMethodologyData
Crene
Platform
Review a thesisExamples
Data
MethodologyDataAPI documentation
Company
About
Explore
Review one thesisThesis MapsFactorsScenariosDataAbout
Research
MethodologyAPI
Legal
TermsPrivacySupport
© 2026 Crene, Inc.
Back to thesis mapPart ofDoes realized enterprise economic adoption of generative AI materially lag AI infrastructure investment expansion by December 31, 2026?
Event · UNIT ECONOMICS & PRICING

Will OpenAI publicly disclose negative gross margin on inference for at least one quarter of 2026?

Resolves Dec 31, 2026
Probability
33%

4-model average

Confidence
LOW

building category history

Stability
—

loading

Models
Mixed

23pt spread

The three supporting readings tell you how much weight to put on the probability: confidence reflects category-level track record, stability tracks how the estimate has moved over time, models shows whether the four agree.

Belief trajectory

Loading belief trajectory...

Per-model probabilities

GPT-4o
45%+12 vs avg

Given the increasing operational costs associated with AI model training and inference, along with market pressures to provide affordable services, it is plausible that OpenAI could experience negative gross margins in 2026, especially if they expand their infrastructure and user base significantly. As of 2023, costs of cloud computing and energy have surged, with data from industry analysts suggesting a 30% rise in operational costs year-over-year, potentially impacting profitability.

Key uncertaintyThe speed of advancements in efficiency and cost-reduction technologies within AI model deployment significantly affects the margins and could alter the outcome for OpenAI.
Gemini
40%+7 vs avg

OpenAI's rapid growth and substantial R&D investments, particularly in model development and infrastructure, suggest potential for high inference costs relative to revenue in the near term. While they haven't disclosed specific financials, the immense computational power required for training and running advanced models like GPT-4, coupled with the evolving efficiency of hardware and software, makes a negative gross margin on inference plausible in a specific quarter of 2026, especially if demand outstrips optimized cost structures.

Key uncertaintyThe pace of AI hardware efficiency improvements and OpenAI's ability to optimize their inference infrastructure for cost reduction will be the primary determinant.
Claude
28%-5 vs avg

OpenAI has not publicly disclosed detailed gross margin metrics for inference operations in their historical disclosures, and there is no precedent for them reporting negative gross margins on a specific business segment. However, inference margins are structurally challenged: GPU costs (especially for frontier models like o1) remain high relative to pricing power, and competitive pressure from Claude/Anthropic/others may compress prices in 2026. OpenAI's recent shift toward reasoning models with higher compute requirements during inference could temporarily pressurize margins. The main counterargument is that OpenAI typically avoids granular negative disclosures that would concern investors, preferring to discuss blended metrics or strategic investments. A 28% probability reflects a meaningful but minority scenario where either (a) competitive dynamics force public acknowledgment of margin challenges, or (b) new regulation/transparency requirements mandate such disclosure.

Key uncertaintyWhether OpenAI will voluntarily segment and publicly report inference gross margins separately from other business lines—this disclosure choice is entirely within management's control and historically they have avoided breaking out unprofitable segments.
GrokGrok
22%-11 vs avg

OpenAI’s 2024 operating losses exceeded $5B on ~$3.5B revenue with inference costs cited at ~60% of revenue in public filings; its 2025 Stargate plan projects $100B+ capex on 5-10x compute scaling, but Microsoft Azure margins on similar GPU workloads remain positive at 35-40% and OpenAI has repeatedly signaled plans to raise inference pricing 2-3x by late 2025. Historical precedent shows no major cloud provider reporting negative gross margins on inference for a full quarter after reaching >$1B annual run-rate.

Key uncertaintyWhether OpenAI’s custom inference silicon (rumored 2026 tape-out) delivers the projected 3-4x cost per token reduction versus Nvidia H100/H200 fleets.
Key disagreementGPT-4o (45%) vs Grok (22%): Different weighting of factors

Resolution criteria

SourceOpenAI disclosures, credible press reporting
CRENE-AIER-C030-20261231Generated Jun 19, 2026