3.0 KiB
3.0 KiB
Phase-0 Backend Drift Check
Generated at: 2026-02-27T19:34:29.957Z Artifacts: /home/will/lab/flynn/docs/plans/artifacts Backends: pi_embedded, native Freshness max age (hours): 36 Overall gate: PASS
Thresholds
- requireBaselineHistory: false
- minCandidateSampledEvents: 10
- maxSampledEventsDropPct: 80
- maxRunOutcomesDropPct: 80
- maxCompletionRateDropPp: 35
- maxCancelRateIncreasePp: 25
- maxErrorRateIncreasePp: 25
- maxCancelLatencyP95IncreaseMs: 6000
pi_embedded
- status: PASS
- candidate: tag=2026-02-27-193429 file=/home/will/lab/flynn/docs/plans/artifacts/phase0_baseline_live_backend_pi_embedded_2026-02-27-193429.json
- candidate generated_at: 2026-02-27T19:34:29.727Z
- baseline: tag=2026-02-27-184726 file=/home/will/lab/flynn/docs/plans/artifacts/phase0_baseline_live_backend_pi_embedded_2026-02-27-184726.json
- baseline generated_at: 2026-02-27T18:47:26.396Z
- candidate snapshot: sampled=60 outcomes=27 completion=100% cancel=0% error=0% cancel_p95_ms=n/a
- baseline snapshot: sampled=59 outcomes=26 completion=100% cancel=0% error=0% cancel_p95_ms=n/a
- deltas: sampled_event_count_pct=+1.69% run_total_outcomes_pct=+3.85% completion_rate_pp=0 cancel_rate_pp=0 error_rate_pp=0 cancel_latency_p95_ms=n/a reaction_match_rate_pp=0 reaction_skip_rate_pp=0
- freshness gate: PASS (age_hours=0 threshold=36)
- drift gate: PASS PASS candidate_sampled_events actual=60 threshold=>= 10 PASS sampled_events_drop_pct actual=0 threshold=<= 80 PASS run_outcomes_drop_pct actual=0 threshold=<= 80 PASS completion_rate_drop_pp actual=0 threshold=<= 35 PASS cancel_rate_increase_pp actual=0 threshold=<= 25 PASS error_rate_increase_pp actual=0 threshold=<= 25 PASS cancel_latency_p95_increase_ms actual=n/a threshold=<= 6000
native
- status: PASS
- candidate: tag=2026-02-27-193429 file=/home/will/lab/flynn/docs/plans/artifacts/phase0_baseline_live_backend_native_2026-02-27-193429.json
- candidate generated_at: 2026-02-27T19:34:29.861Z
- baseline: tag=2026-02-27-184726 file=/home/will/lab/flynn/docs/plans/artifacts/phase0_baseline_live_backend_native_2026-02-27-184726.json
- baseline generated_at: 2026-02-27T18:47:26.506Z
- candidate snapshot: sampled=15 outcomes=2 completion=100% cancel=0% error=0% cancel_p95_ms=n/a
- baseline snapshot: sampled=15 outcomes=2 completion=100% cancel=0% error=0% cancel_p95_ms=n/a
- deltas: sampled_event_count_pct=0% run_total_outcomes_pct=0% completion_rate_pp=0 cancel_rate_pp=0 error_rate_pp=0 cancel_latency_p95_ms=n/a reaction_match_rate_pp=0 reaction_skip_rate_pp=0
- freshness gate: PASS (age_hours=0 threshold=36)
- drift gate: PASS PASS candidate_sampled_events actual=15 threshold=>= 10 PASS sampled_events_drop_pct actual=0 threshold=<= 80 PASS run_outcomes_drop_pct actual=0 threshold=<= 80 PASS completion_rate_drop_pp actual=0 threshold=<= 35 PASS cancel_rate_increase_pp actual=0 threshold=<= 25 PASS error_rate_increase_pp actual=0 threshold=<= 25 PASS cancel_latency_p95_increase_ms actual=n/a threshold=<= 6000