Files
flynn/docs/plans/remaining-phases-rollout-quality-gates.md
T
2026-02-12 22:47:28 -08:00

1.6 KiB

Remaining Phases Rollout Quality Gates

Created: 2026-02-12 Applies to: Phase 2 and Phase 3 implementation PRs

Regression Safety

  • All existing tests pass before/after each PR.
  • No public interface break in core abstractions (ModelClient, ChannelAdapter, Tool).
  • Existing configs load without migration failures.
  • Existing command flows continue working unchanged.

Routing Determinism

  • Sender/channel/default route resolution remains deterministic.
  • Intent and confidence routing produce stable decisions for same input.
  • Unknown/low-confidence inputs always fall back to normal orchestrator path.
  • Routing decisions are logged for debugging.

Memory Correctness

  • Legacy and category memory reads/writes remain compatible.
  • Search results respect namespace/category filters.
  • Prompt memory injection remains token-bounded.
  • Compaction preserves high-value turns under weighted selection.

Latency Budgets

  • New routing checks add negligible overhead for non-command messages.
  • Memory relevance scoring remains bounded (time + token budgets).
  • History search defaults are capped (limit, thresholds) to avoid spikes.

Policy Enforcement Verifiability

  • Tool allow/deny and autonomy decisions are test-covered.
  • Denied/overridden actions are audit-logged with explicit reason.
  • Confirmation behavior is deterministic by mode.

Pre-Merge Checks Per PR

pnpm typecheck
pnpm test:run
pnpm lint
pnpm build
  • Add targeted test commands from each PR checklist.
  • Update docs/plans/state.json with progress and test counts when implementation lands.
  • Keep commits atomic and scoped to one checklist PR at a time.