Verification

Verification now reflects what we can actually prove.

HazardPulse freezes every live forecast into replay artifacts, tracks raw append-only ledgers, and now separates exact model benchmarks, pending maturity windows, and scoring backlogs. If a live model is not scored yet, this page says so directly.

Built 2026-04-15T20:44:14Z · Score as of 2026-04-15T20:43:04Z

198
Frozen forecasts

Replay artifacts preserved across all live hazards.

131
Scoring backlog

Matured windows waiting for an evaluator or scoring run.

0
Hash mismatches

Prev-hash continuity failures in raw append-only ledgers.

1
Exact benchmarks

Live model versions with an attached exact benchmark.

Control alerts

  • 131 matured forecast windows are waiting for scoring.
  • Earthquake: no exact benchmark is attached to the current live model version.
  • Tornado: no exact benchmark is attached to the current live model version.

Hazard by hazard

These cards tell you whether each live model has exact scores, only related research benchmarks, or just frozen forecasts waiting for scoring.

Earthquake

Waiting

Prospective earthquake logging is live; the 30-day windows have not matured yet.

Live modeleq_coherence_v1_0
Latest forecasteq_fcst_20260415_1900
Storage52 replays · horizon 30 days
Maturity0 matured · 0 scored
Related benchmarksame-location AUC 0.797 | global AUC 0.907
Raw ledger63 rows · 0 mismatches

Keep freezing every earthquake forecast. Once the first 30-day windows mature, run the prospective scorer and use those scores to tune thresholds and calibration.

Hurricane

Backlog

Matured hurricane forecasts exist, but no live advisory-to-outcome scorer is wired yet.

Live modelhurricane_ri_v8_1
Latest forecasthu_fcst_20260415_1253
Storage12 replays · horizon 24 hours
Maturity10 matured · 0 scored
Primary metricAUC 0.938 · Brier 0.034
Exact benchmarkAUC 0.938 · Brier 0.034
Raw ledgerNot implemented for this hazard yet

Implement an advisory-to-outcome scorer that joins frozen hurricane forecasts to realized 24-hour intensity change before using the model for calibration or promotion decisions.

Tornado

Backlog

Matured tornado storm-object forecasts exist, but no live outcome scorer is wired yet.

Live modeltornado_storm_v1_0
Latest forecastto_fcst_20260415_2043
Storage134 replays · horizon 24 hours
Maturity121 matured · 0 scored
Related benchmarkAUC 0.894
Raw ledger200 rows · 0 mismatches

Bind each frozen tornado storm-object forecast to a matched outcome definition and write a 24-hour scorer before using the live model for calibration or threshold changes.

Benchmark provenance

Exact benchmarks are safe to cite for the current live model version. Related benchmarks are useful for research context, but not as proof of live performance.

HazardTypeModelMetricsSource updated
EarthquakeRelated Research Benchmarkearthquake_honest_regional_suitesame-location AUC 0.797 | global AUC 0.907--
HurricaneExact Model Benchmarkhurricane_ri_v8_1AUC 0.938 | Brier 0.0342026-03-13T03:00:00Z
TornadoRelated Research Benchmarkhazardpulse_tornado_definitive_v1AUC 0.8942026-03-31T04:07:24Z

Storage and audit surfaces

This surface is intentionally stricter than marketing copy. A billion-dollar company needs a page that tells operators what is frozen, what is scored, what is only a research benchmark, and what still needs engineering work before it can influence model adjustment or promotion.

Use the evidence ledger for artifact-level traceability and this page for scoring readiness and benchmark discipline.