Crene is investment thesis review infrastructure. The platform decomposes macro, labor, AI transition, and investment thesis questions into falsifiable components using four frontier AI models (Claude, GPT, Gemini, Grok). The system produces three product types: thesis maps (binary decomposition into assumptions), factors (continuous decomposition into drivers), and scenarios (hybrid decomposition into coherent pathways). Cross model disagreement is the primary signal. Calibrated against 1,161 resolved events with a Brier score of 0.114.

How does AI prediction calibration work at Crene?

Calibration applies at the consensus layer, not at the underlying model layer. Crene queries four frontier models in isolation, computes a cross model median consensus, and tracks resolution against a tiered source allowlist. Every resolved event becomes a permanent calibration point. Per tier accuracy, per model Brier scores, and per domain accuracy are computed live and published at https://crene.com/methodology.

What is a Brier score?

A Brier score is the proper scoring rule for probability forecasts on binary outcomes. It is the mean squared error between the forecast probability and the realized outcome (0 or 1). Lower scores indicate better calibration. The no skill baseline is 0.25 (always predicting 0.5); a perfect forecaster scores 0.0. Crene publishes the consensus Brier score and per model Brier scores across the resolved event corpus.

Crene serves investment teams that need a repeatable thesis review process: hedge funds, asset managers, macro PMs, CIO offices, family offices, and thematic investment teams. A team brings one thesis it is actively debating. Crene maps what the thesis depends on, where models disagree, what changed each week, and what would force a rethink. Data and API access remain available as the proof layer behind the workflow.

What is the difference between an event, a thesis map, and a factor at Crene?

Crene has four product types. Events are base binary outcomes with scalar probabilities. Thesis maps decompose binary anchor questions into child situations. Factor maps decompose continuous variables into drivers. Scenarios map long horizon investment questions into coherent pathways across live systems including Empire by Default, AI Labor Transition, European Rearmament, and India as the Third Growth Pole.

Bring your own thesis

Get this same map built on a thesis your team is debating.

Assumptions, model disagreement, and rethink triggers, updated weekly before your PM, risk, or IC discussion. Currently accepting one macro thesis and one AI-economy thesis for July.

Map one thesis

Back to thesis map Part ofWill AI generated content exceed human generated content on the US internet before 2030?

Event · INFRASTRUCTURE & GENERATION COST

Will the cost of generating AI content decrease by over 95% from 2025 levels by 2028?

Resolves Dec 31, 2028

Probability

34%

4-model average

Confidence

LOW

building category history

Stability

—

Models

Diverging

53pt spread

Current read

Consensus sits at 34% across the four models and is — (loading). Models are diverging at a 53pt spread, so the average conceals live disagreement worth inspecting below. Category confidence is LOW (building category history).

Confidence reflects category-level track record. Stability tracks estimate movement. Models shows whether the four agree.

Belief trajectory

Loading belief trajectory...

Per-model probabilities

GPT-4o

25%-9 vs avg

Given the trajectory of AI development and the current investments, the cost of generating AI content is expected to continue decreasing, but a reduction exceeding 95% from 2025 levels is overly aggressive. As of 2023, leading AI models, like OpenAI's GPT-4, have operating costs that are still substantial, with estimates suggesting that generating high-quality content remains in the range of $0.002 to $0.01 per word. Scaling down this cost by over 95% by 2028 would require extraordinary advancements in efficiency and reduction in power consumption that have not historically been observed at that scale within such a brief time frame.

Key uncertaintyThe pace of technological advancement in AI hardware and software, particularly breakthroughs in efficiency and energy costs, could significantly alter this outlook.

Gemini

15%-19 vs avg

While AI content generation costs have seen rapid decreases, a 95% drop from 2025 to 2028 (a 3-year period) is exceptionally aggressive. Current trends suggest continued cost reduction due to algorithmic improvements and hardware advancements, but the diminishing returns of Moore's Law and the increasing complexity of advanced AI models will likely slow this rate of decline. Achieving such a drastic reduction would require a fundamental, unforeseen breakthrough in AI efficiency or hardware architecture.

Key uncertaintyThe emergence of a novel, vastly more efficient AI architecture or a breakthrough in quantum computing applicable to AI generation.

Claude

28%-6 vs avg

AI inference costs have declined approximately 50-60% from 2022-2025 due to improved model efficiency, hardware optimization, and competition (e.g., GPT-4 API pricing dropped ~90% from initial launch, but from an artificially high baseline). A 95% reduction from 2025 levels by 2028 would require cumulative cost declines of 20x over 3 years, implying ~75% annual cost reductions—substantially faster than the historical 40-50% annual improvements observed in the 2023-2025 period. While semiconductor advances (improved chip yields, architectural efficiency) and algorithmic improvements continue, physical compute constraints and training data scaling limitations suggest diminishing returns. The most likely scenario involves 60-80% cost reductions by 2028 (significant but not 95%), with further gains bottlenecking on energy costs and hardware manufacturing constraints.

Key uncertaintyBreakthrough developments in model efficiency (sparse activation, novel architectures, or radical improvements in inference optimization) could accelerate cost declines beyond current trajectories, though no clear evidence suggests such breakthroughs are imminent as of early 2025.

Grok

68%+34 vs avg

Historical cost declines for frontier LLMs show 4-6x reductions every 18-24 months (GPT-3 at $60/M tokens in 2020 to GPT-4o at ~$2.50/M tokens by 2024), driven by 3-4x hardware efficiency gains per Nvidia generation and algorithmic improvements like mixture-of-experts reducing active parameters by 5-10x. Current 2025 baseline of ~$0.50-1.00/M output tokens faces continued pressure from TSMC 2nm ramp in 2026 and Google/Amazon custom silicon targeting 3-5x inference efficiency, though energy and data constraints may slow the final 20x drop needed by 2028.

Key uncertaintyWhether training data exhaustion forces a shift to synthetic data loops that could increase rather than decrease marginal inference costs.

Key disagreementGrok (68%) vs Gemini (15%): Different weighting of factors

CRENE-AICD-IC02Generated Jul 3, 2026