Part of: Does realized enterprise economic adoption of generative AI materially lag AI infrastructure investment expansion by December 31, 2026?

Event · TECHNOLOGY

Will the frontier model performance gap (top-1 vs top-5 on standard benchmarks) compress to less than 5 percentage points by Q4 2026?

Resolves Dec 31, 2026

48% probability

4-model average

LOW confidence

building category history

— stability

Diverging models

42pt spread

The three supporting readings tell you how much weight to put on the probability: confidence reflects the category-level track record, stability tracks how the estimate has moved over time, and models shows whether the four models agree.

Per-model probabilities

GPT-4o: 30% (-18 vs avg)

Gemini: 70% (+22 vs avg)

Claude: 28% (-20 vs avg)

Grok: 65% (+17 vs avg)

Key disagreement: Gemini (70%) vs Claude (28%). Different weighting of factors.
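
The headline figures can be reproduced from the per-model probabilities above. The page does not state how the four estimates are combined, so the sketch below assumes a simple unweighted mean and assumes the displayed values are rounded to whole points. The function name `summarize` and the dictionary layout are illustrative, not part of the site.

```python
from statistics import mean

# Per-model probabilities in percent, as listed on the page.
probabilities = {"GPT-4o": 30, "Gemini": 70, "Claude": 28, "Grok": 65}


def summarize(probs):
    """Return the average, per-model deviations, spread, and widest pair.

    Assumption: the "4-model average" is an unweighted arithmetic mean.
    """
    avg = mean(probs.values())
    # Signed distance of each model from the average, in percentage points.
    deviations = {name: p - avg for name, p in probs.items()}
    highest = max(probs, key=probs.get)
    lowest = min(probs, key=probs.get)
    return {
        "average": avg,
        "deviations": deviations,
        "spread": probs[highest] - probs[lowest],
        "widest_pair": (highest, lowest),
    }


summary = summarize(probabilities)
print(f"average: {summary['average']:.2f}%")
for name, delta in summary["deviations"].items():
    print(f"{name}: {delta:+.2f} vs avg")
print(f"spread: {summary['spread']}pt between {summary['widest_pair']}")
```

Under these assumptions the unrounded mean is 48.25%, which rounds to the 48% shown, and the gap between the highest and lowest estimate is the 42pt spread between Gemini and Claude.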

Resolution criteria

Source: Standard benchmark leaderboards 2026

CRENE-AIER-C085-20261231 · Generated May 11, 2026