Forward calibration & shadow scorecard

The honest end-state of the FPT evidence loop. Belief-writers default to SHADOWand only graduate to live writes through a forward-Brier promotion gate. Cohort-split calibration, the per-channel shadow scorecard, recent shadow activity, and belief-motion freshness. No single headline "skill" number, by design.

Forward skill: not yet demonstrated. The thesiscohort's high hit rate is largely backfilled survivorship — terminal milestones resolve long-horizon predictions as hits, so the resolved sample is not a forward, out-of-sample test of belief motion. The pmci_v1_trading cohort is quarantined and scored separately. Promotion of any shadow channel requires the forward-Brier gate below to beat both the independent-sample climatology and the frozen prior on non-circular resolutions — until then every belief-writer stays in shadow.
Promotable channels
0
of 1 evaluated
Shadow writes (7d)
0
0 channel(s)
Resolved (thesis)
88
backfilled survivorship
Resolved (pmci_v1)
302
quarantined trading

Cohort-split calibration

fpt_calibration_v · prior = stated-at-creation probability, current = post-evidence-loop probability
CohortResolved (n)Hit rateBrier (prior)Brier (current)Δ Brier
PMCI v1 (quarantined trading)30220.9%0.19530.1775-0.0178
Thesis (backfilled survivorship)8896.6%0.02940.0978+0.0684

Δ Brier < 0 (green) means the evidence loop improved calibration vs the stated-at-creation probability; Δ Brier > 0 (red) means it degraded it. A degraded thesis Brier is expected while the loop is in shadow and is not evidence of forward skill either way.

Shadow-channel scorecard

fpt_control['promotion_status'] · min_n=40 · need 2 consecutive · generated 2026-07-04 22:13
ChannelInd (n)Circ / SuspBrier (shadow)vs climatologyvs priorVerdictReason
milestone_hit04 / 0shadow
⚠ structurally unpromotable
insufficient_n

A channel is promotable only when, on its independent (non-circular, non-suspect) forward-resolved sample, its Brier beats both the climatology and the frozen prior at a corrected significance level, for the required consecutive runs. Circular pairs (the channel co-authored the resolution label) and suspect pairs are excluded. milestone_hit is structurally unpromotable while the milestone-derived thesis resolver is the only resolver — external channels are the real promotion candidates.

Recent shadow activity

evidence_shadow · last 7 days · grouped by channel + gate status
ChannelGate statusWrites (n)Avg |Δ prob|Latest
No shadow writes in the last 7 days.

Belief-motion freshness

prob_history · last real probability move per reason-class (first token of reason)
Reason classMoves (n)Avg |Δ prob|Last move
metadata_milestone_miss_sweep5740.09795h ago
milestone_miss_sweep140.15773d ago
resolution_terminal1180.188919d ago
lbp_propagation51900.034142d ago
intake:7afeeb9a-f217-4dd2-b910-24ff14bdfc39460.110144d ago
auto_consensus:dc47127b-c217-49d2-97c6-ce58983ee59910.054860d ago
intake:515b84c4-6b29-4d57-8dd6-d41dac0675ec90.104763d ago
reference_class_override:10.619765d ago
intake:99aa73db-75b1-4b1e-8470-a11f87b23937310.137165d ago
reference_class_assigned4460.143065d ago
intake:568095f6-eb44-4f96-92d3-13455b79e33710.331566d ago
intake:c130222f-ab93-4d94-b596-5f4dc7adcd0b:weak10.006066d ago

A reason class whose last move is > 7 days old (amber) indicates that source of belief motion has gone quiet — useful for spotting silently-broken writers.