240_016predictionAIAI-scaling

Everyone is underestimating the next year of AI improvements (more than 2x gain)

Predictor: Dave Blundin · ep#240 "NVIDIA's $1 Trillion Prediction, Anthropic Beats OpenAI, Tesla vs. TSMC & The CS Job Collapse" · source

Prior probability

60.0%

Current probability

35.7%

evolves via intake + LBP

Conviction

4/5

Signal quality

Resolution

pending

Window

2026-01-01 – 2027-11-30

Edges in / out

10 / 5

Tickers exposed

Prediction text

Everyone is underestimating the next year of AI improvements (more than 2x gain) | that's also why everyone's really underestimating the next year because we did a 1000x like what are you expecting in the next year? Oh 2x like what are you talking about?

Verbatim quote

From episode "NVIDIA's $1 Trillion Prediction, Anthropic Beats OpenAI, Tesla vs. TSMC & The CS Job Collapse"

that's also why everyone's really underestimating the next year because we did a 1000x like what are you expecting in the next year? Oh 2x like what are you talking about?

Predictor: Dave Blundin

κ + Brier as of 2026-07-04

Full calibration →

κ (discount)

0.821

Brier

0.0491

excellent

Hits / Misses

3 / 2

of 9 resolved

Hit rate

33.3%

Calibration plot (stated vs observed)

Evidence about this node from Dave Blundin is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).

Reference class

Not linked

This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.

Probability over time

5 prob_history rows

intake v2milestone miss sweeplbp propagationreference class assignedlegacy v1prior_prob (analyst seed)current = 35.7%

Milestone chain

Pre-event signals (upstream prereqs + window checkpoints) → resolution event → downstream cascades. Status/dates update from linked nodes; re-derive nightly via scripts/ops/derive_milestones.py.

Leading chain: 7 fired ✓ · 1 overdue ⏱ · 1 pending

2026-04-15hitARC-AGI-2 doubles year-over-year
How: Best ARC-AGI-2 score reaches 77.1% (Gemini 3.1 Pro), double its predecessor
Source: LM Council Benchmarks — 'Gemini 3.1 Pro 77.1% on ARC-AGI-2, double predecessor'conf 95%
2026-04-23hitGPT-5.5 ships with massive coding/reasoning benchmark gains
How: OpenAI ships GPT-5.5 scoring 82.7% on Terminal-Bench 2.0 (up from 75.1% GPT-5.4) and 74.0% on 1M-token MRCR v2 (up from 36.6%)
Source: Artificial Analysis — 'OpenAI's GPT-5.5 is the new leading AI model'conf 99%
Notes: HIT — direct evidence of >2x improvement on long-context reasoning benchmark.
2026-04-29hitNvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.
2026-04-29hitNvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon.
2026-04-29hitNvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025).
2026-04-29hitNvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) a
2026-04-30hitHumanity's Last Exam top score crosses 50% threshold
How: Top frontier model achieves >=50% on Humanity's Last Exam vs 38.3% Stanford 2025 baseline (Claude Opus 4.6 / Gemini 3.1 Pro)
Source: IEEE Spectrum — 'Stanford's AI Index 2026' + LM Council Benchmarksconf 95%
2026-06-25overdueNvidia agreed to remit 15% of China chip-sale revenue directly to US government in exchange for reversing specific AI chip export bans.
2026-06-01 → 2027-06-30pendingAI investment exceeds $700B annualized run-rate
How: Stanford AI Index or equivalent confirms 2026 AI investment >=$700B (vs $581B in 2025, $253B in 2024) — ~20% YoY growth above already-record base
Source: Build Fast — 'Best AI Models April 2026' + Stanford Index trajectoryconf 70%
2027-05-14pendingEveryone is underestimating the next year of AI improvements (more than 2x gain)
2027-04-01 → 2027-12-31pendingTop-tier benchmark performance growth >=2x by mid-2027
How: Top-of-leaderboard scores on SWE-bench Verified, ARC-AGI-2, and HLE all show >=2x improvement vs April 2026 baselines
Source: Trend extrapolation from 2025-2026 capability curveconf 55%
Notes: Validates Blundin's '>2x next year' thesis if borne out across major benchmarks.
2028-06-25pendingWe're exiting the industrial age permanently as recursive self-improvement unfolds.
2028-07-20pendingSuperhuman AI will make BCI-enhanced humans irrelevant compared to AI 2 years from today.
2030-09-27pendingMost large companies' business models will be disrupted in 2-5 years
2063-06-21pendingPeter's 14-year-old son Milan will never get a driver's license.

What if this resolves?

Clamp this prediction TRUE or FALSE and run a counterfactual Gibbs sample. Surfaces the predictions whose marginals shift most under that assumption.

(live posterior: 36%)

Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"

Evidence chain

Every probability update with full Bayesian provenance — chronological, latest first

metadata_milestone_miss_sweep2026-07-03T22:12:25Z35.7%-13.8pp

metadata_milestone_miss_sweep bayesian_v2 n=1 inside=0.357 blend=0.357 LLR=-0.569 κ=0.82 no_blend

Raw metadata

{
  "trf": 0.7364967146797728,
  "kappa": 0.8214,
  "base_rate": null,
  "predictor": "Dave Blundin",
  "total_llr": -0.6931471805599453,
  "grace_days": 7,
  "bayesian_v2": true,
  "prior_logit": -0.01913257734403183,
  "bayes_factor": "1.8:1 against",
  "blend_reason": "no reference_class linked",
  "inside_prior": 0.4952170015666818,
  "kappa_source": "predictor_table",
  "n_milestones": 1,
  "blend_applied": false,
  "contributions": [
    {
      "llr": -0.6931471805599453,
      "kind": "prereq",
      "kappa": 0.8214,
      "label": "Nvidia agreed to remit 15% of China chip-sale revenue directly to US government in exchange for reversing specific AI chip export bans.",
      "weight": 0.5,
      "strength": "moderate",
      "confidence": null,
      "source_url": null,
      "adjusted_llr": -0.5693510941119391,
      "expected_date": "2026-06-25",
      "measurement_criterion": null
    }
  ],
  "evidence_kind": "metadata_milestone_miss_sweep",
  "inside_source": "history_v2",
  "inside_weight": 0.484452299724159,
  "outside_weight": 0.5155477002758411,
  "posterior_prob": 0.3569828460671014,
  "posterior_logit": -0.5884836714559709,
  "predictor_brier": 0.0491,
  "inside_posterior": 0.3569828460671014,
  "blended_posterior": 0.3569828460671014,
  "reference_class_id": null,
  "total_adjusted_llr": -0.5693510941119391,
  "predictor_n_resolved": 9
}

LBP2026-05-10T02:00:02Z49.5%-1.2pp

Network propagation: 50.7% → 49.5%

6-iter LBP, residual 0.00584 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run e5c18d29

LBP2026-05-03T02:00:01Z50.7%-2.2pp

Network propagation: 52.9% → 50.7%

6-iter LBP, residual 0.00677 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run 1a683ac9

LBP2026-04-30T16:39:51Z52.9%-2.9pp

Network propagation: 55.8% → 52.9%

5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v2 · run 0c8a4ea3

LBP2026-04-30T02:18:57Z55.8%-4.2pp

Network propagation: 60.0% → 55.8%

5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v1 · run 592311ef

Network propagation neighbors

Top edges sorted by latest LBP cross-impact

All propagation →

Top incoming (parents)

Edges that influence THIS node's belief

Kind	Node	Their prob	P(c\|s=T)	P(c\|s=F)	Δ implied
killer	TK03 AI Regulatory Moratorium (EU/US Capability Freeze)	10.0%	0.050	0.600	+0.188
killer	TK02 AI Compute Supply Shock (TSMC/Taiwan Disruption)	12.0%	0.050	0.600	+0.177
prereq	SEM_014 Nvidia's Arizona-based TSMC factory successfully fabricated — Jensen Huang	86.1%	0.600	0.050	+0.162
killer	TK01 AGI Capability Plateau (2026-27 Training Stall)	15.0%	0.050	0.600	+0.161
prereq	SEM_011 Nvidia became the world's first $5 trillion company (late 20 — Jensen Huang	85.5%	0.600	0.050	+0.160

Top outgoing (children)

Predictions THIS node influences

Kind	Node	Their prob	P(c\|s=T)	P(c\|s=F)	Δ implied
prereq	232_055 We're exiting the industrial age permanently as recursive se — Peter Diamandis	18.0%	0.700	0.050	+0.140
prereq	248_033 Superhuman AI will make BCI-enhanced humans irrelevant compa — Dave Blundin	36.7%	0.600	0.050	-0.089
prereq	244_019 Peter's son won't need a driver's license in 2 years — Peter Diamandis	48.4%	0.920	0.050	-0.073
prereq	242_031 Most large companies' business models will be disrupted in 2 — Peter Diamandis	23.5%	0.650	0.050	+0.064
prereq	230_020 Peter's 14-year-old son Milan will never get a driver's lice — Peter Diamandis	34.7%	0.650	0.050	-0.047

Ticker exposure

37 ticker(s) linked

Beneficiaries (24)

MU WULF IREN EQIX ALAB APLD ASMIY ASML PLAB NVDA NBIS CRWV AAPL AMT AMZN DELL GOOGL IRM LNVGY META MSFT ORCL SFTBY STX

Adverse (6)

ACN GEN CHGG IBM WNS LRN

Prerequisites (10)

Predictions that must hit first

Type	Pred	Title	Domain	Lag
prereq	SEM_011	Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.	Capital Markets	—
prereq	SEM_027	Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon.	Capital Markets	—
prereq	SEM_014	Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025).	Manufacturing	—
prereq	SEM_012	Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) across engineering.	AI/Manufacturing	—
prereq	SEM_015	Nvidia agreed to remit 15% of China chip-sale revenue directly to US government in exchange for reversing specific AI chip export bans.	Policy/Semis	—
killer	TK09	Energy Grid Cap (Data Center Power Wall)	—	—
killer	TK05	Rate Regime Persistence (10y > 5% through 2028)	—	—
killer	TK01	AGI Capability Plateau (2026-27 Training Stall)	—	—
killer	TK02	AI Compute Supply Shock (TSMC/Taiwan Disruption)	—	—
killer	TK03	AI Regulatory Moratorium (EU/US Capability Freeze)	—	—

Dependents (5)

Predictions enabled by this

Type	Pred	Title	Domain	Lag
prereq	244_019	Peter's son won't need a driver's license in 2 years	Auto/Transport	—
prereq	232_055	We're exiting the industrial age permanently as recursive self-improvement unfolds.	AI	—
prereq	242_031	Most large companies' business models will be disrupted in 2-5 years	Markets/Stocks	—
prereq	230_020	Peter's 14-year-old son Milan will never get a driver's license.	Auto/Transport	—
prereq	248_033	Superhuman AI will make BCI-enhanced humans irrelevant compared to AI 2 years from today.	AI	—

Linked documents (6)

Auto-generated by cosine similarity from Polymarket / Manifold / EDGAR / GDELT

Sim	Source	Title	Market prob	Polarity	Reviewed	Published
0.620	manifold	Will "Maybe I was too harsh on deep learning theory..." make the top fifty posts in LessWrong's 2026 Annual Review?	11%	mentions	pending	2026-05-11
0.618	github_release	facebookresearch/neuroai v0.2.1	—	mentions	pending	2026-05-14
0.611	arxiv	Negation Neglect: When models fail to learn negations in training	—	mentions	pending	2026-05-13
0.605	github_release	pytorch/pytorch v2.2.0	—	mentions	pending	2024-01-30
0.579	manifold	Will "Forecasting is Way Overrated, and We Should S..." make the top fifty posts in LessWrong's 2026 Annual Review?	20%	mentions	pending	2026-04-26
0.566	github_release	pytorch/pytorch v1.13.1	—	mentions	pending	2022-12-16

Raw metadata

From Thesis_Timeline_v1.0_FINAL workbook

{
  "nia": false,
  "qty": ">>2x",
  "url": "https://www.youtube.com/watch?v=uOGHXAfvK8w",
  "mode": "PREDICTION",
  "role": "Host",
  "context": "that's also why everyone's really underestimating the next year because we did a 1000x like what are you expecting in the next year? Oh 2x like what are you talking about? Yeah, it's not it's not going to happen that way.",
  "to_year": 2027,
  "verbatim": "that's also why everyone's really underestimating the next year because we did a 1000x like what are you expecting in the next year? Oh 2x like what are you talking about?",
  "conv_cues": "not going to happen that way",
  "direction": "DOWN",
  "from_year": 2026,
  "timeframe": "Next year (2026-2027)",
  "conv_level": "HIGH",
  "milestones": [
    {
      "kind": "llm_pre_event",
      "label": "ARC-AGI-2 doubles year-over-year",
      "source": "LM Council Benchmarks — 'Gemini 3.1 Pro 77.1% on ARC-AGI-2, double predecessor'",
      "status": "hit",
      "weight": 0.4,
      "ordinal": -9,
      "source_id": null,
      "confidence": 0.95,
      "source_url": "https://lmcouncil.ai/benchmarks",
      "expected_date": "2026-04-15",
      "observed_date": "2026-04-15",
      "hit_emitted_at": "2026-06-08T13:04:02.341521+00:00",
      "research_origin": "deep_research",
      "measurement_criterion": "Best ARC-AGI-2 score reaches 77.1% (Gemini 3.1 Pro), double its predecessor"
    },
    {
      "kind": "llm_pre_event",
      "label": "GPT-5.5 ships with massive coding/reasoning benchmark gains",
      "notes": "HIT — direct evidence of >2x improvement on long-context reasoning benchmark.",
      "source": "Artificial Analysis — 'OpenAI's GPT-5.5 is the new leading AI model'",
      "status": "hit",
      "weight": 0.4,
      "ordinal": -8,
      "source_id": null,
      "confidence": 0.99,
      "source_url": "https://artificialanalysis.ai/articles/openai-gpt5-5-is-the-new-leading-AI-model",
      "expected_date": "2026-04-23",
      "observed_date": "2026-04-23",
      "hit_emitted_at": "2026-06-08T13:04:02.341521+00:00",
      "research_origin": "deep_research",
      "measurement_criterion": "OpenAI ships GPT-5.5 scoring 82.7% on Terminal-Bench 2.0 (up from 75.1% GPT-5.4) and 74.0% on 1M-token MRCR v2 (up from 36.6%)"
    },
    {
      "kind": "prereq",
      "label": "Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -7,
      "source_id": "SEM_011",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29",
      "hit_emitted_at": "2026-06-08T13:04:02.341521+00:00"
    },
    {
      "kind": "prereq",
      "label": "Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon.",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -6,
      "source_id": "SEM_027",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29",
      "hit_emitted_at": "2026-06-08T13:04:02.341521+00:00"
    },
    {
      "kind": "prereq",
      "label": "Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025).",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -5,
      "source_id": "SEM_014",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29",
      "hit_emitted_at": "2026-06-08T13:04:02.341521+00:00"
    },
    {
      "kind": "prereq",
      "label": "Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) a",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -4,
      "source_id": "SEM_012",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29",
      "hit_emitted_at": "2026-06-08T13:04:02.341521+00:00"
    },
    {
      "kind": "llm_pre_event",
      "label": "Humanity's Last Exam
... (truncated)