Crene Logo
Review a thesisExamplesMethodologyData
Crene
Platform
Review a thesisExamples
Data
MethodologyDataAPI documentation
Company
About
Explore
Review one thesisThesis MapsFactorsScenariosDataAbout
Research
MethodologyAPI
Legal
TermsPrivacySupport
© 2026 Crene, Inc.
Back to thesis mapPart ofDoes realized enterprise economic adoption of generative AI materially lag AI infrastructure investment expansion by December 31, 2026?
Event · POLICY & GEOPOLITICS

Will the EU AI Office formally classify any frontier model as "systemic risk" during 2026?

Resolves Dec 31, 2026
Probability
41%

4-model average

Confidence
LOW

building category history

Stability
—

loading

Models
Diverging

42pt spread

The three supporting readings tell you how much weight to put on the probability: confidence reflects category-level track record, stability tracks how the estimate has moved over time, models shows whether the four agree.

Belief trajectory

Loading belief trajectory...

Per-model probabilities

GPT-4o
30%-11 vs avg

The EU has increasingly focused on regulating AI, particularly with the AI Act set to be fully implemented by 2026. While the European Commission has indicated a cautionary approach towards labeling AI systems as "systemic risk," recent statements suggest a growing acknowledgment of the potential societal impacts of frontier models, evidenced by their classification frameworks and the emphasis on high-risk categories. However, actual classifications may depend heavily on ongoing evaluations and consensus among EU member states regarding what constitutes systemic risk.

Key uncertaintyThe extent to which public pressure and technological developments in the AI sector influence the EU's regulatory stance leading up to 2026 could significantly alter the likelihood of such classifications.
Gemini
70%+29 vs avg

The EU AI Act establishes a clear framework for identifying and regulating high-risk AI systems, with a specific category for "systemic risk" models. Given the rapid advancement and widespread potential impact of frontier models, it's highly probable that the EU AI Office will identify at least one such model as systemic by 2026 to enforce the Act's stringent requirements. Current discussions and the preparatory work by the AI Office indicate a proactive approach to managing these powerful AI systems.

Key uncertaintyThe pace of development and deployment of novel, highly capable frontier models that demonstrably exhibit systemic risks beyond current understanding.
Claude
28%-13 vs avg

The EU AI Act's Article 4(4) establishes "systemic risk" classification mechanisms, but formal designations require sustained evidence of real-world harm and regulatory consensus. As of late 2024, no frontier model has been formally classified despite significant regulatory scrutiny of models like GPT-4 and Claude—suggesting high evidentiary thresholds exist. The EU AI Office, operational since 2024, has focused on compliance monitoring rather than systemic designations. For 2026 classification to occur, a frontier model would need to demonstrate documented systemic harms (e.g., widespread infrastructure failures, election interference, or severe financial system disruption) substantial enough to trigger coordinated EU action within approximately 12 months. Current frontier models show capability risks rather than demonstrated systemic impact, and regulatory bodies historically move cautiously on novel designations.

Key uncertaintyWhether a significant AI-enabled security incident (cyberattack, critical infrastructure disruption, or synthetic media election interference) occurs in 2025-early 2026 that directly implicates a specific frontier model's capabilities, which could accelerate formal classification through emergency provisions.
GrokGrok
38%-3 vs avg

The EU AI Office must first finalize the thresholds in the implementing act (expected Q1 2026) and then complete model evaluations under Article 51; only four labs currently publish ≥10^26 FLOP training runs, and none have disclosed public evidence meeting the 10^25 cumulative-compute systemic-risk criterion set out in the December 2023 trilogue text. The Commission’s March 2024 impact assessment projected the first classification decision no earlier than late 2026, giving a compressed nine-month window after the Office becomes operational in August 2025.

Key uncertaintyWhether the final implementing act lowers the FLOP threshold or adds non-compute criteria (e.g., dangerous capability benchmarks) before the 2026 deadline.
Key disagreementGemini (70%) vs Claude (28%): Different weighting of factors

Resolution criteria

SourceEU AI Office classifications 2026
CRENE-AIER-C094-20261231Generated Jun 28, 2026