Remaining Phases Rollout Quality Gates
Created: 2026-02-12
Applies to: Phase 2 and Phase 3 implementation PRs
Regression Safety
- All existing tests pass before/after each PR.
- No public interface break in core abstractions (ModelClient, ChannelAdapter, Tool).
- Existing configs load without migration failures.
- Existing command flows continue working unchanged.
Routing Determinism
- Sender/channel/default route resolution remains deterministic.
- Intent and confidence routing produce stable decisions for same input.
- Unknown/low-confidence inputs always fall back to normal orchestrator path.
- Routing decisions are logged for debugging.
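One way to make these gates testable is to keep route resolution a pure function of its inputs. The sketch below is illustrative only: the type names, the sender-then-channel-then-default precedence, the `orchestrator` route name, and the `0.7` confidence threshold are all assumptions, not taken from the codebase.

```typescript
// Hypothetical types; the real routing tables and names may differ.
type Route = string;

interface RouteTable {
  bySender: ReadonlyMap<string, Route>;
  byChannel: ReadonlyMap<string, Route>;
  defaultRoute: Route;
}

interface IntentGuess {
  intent: string;     // e.g. "unknown" when the classifier cannot decide
  confidence: number; // expected range 0..1
}

interface RoutingDecision {
  route: Route;
  reason: "sender" | "channel" | "default" | "intent" | "fallback";
}

const ORCHESTRATOR: Route = "orchestrator"; // assumed name of the normal path
const MIN_CONFIDENCE = 0.7;                 // assumed threshold

// Fixed precedence (assumed): sender match, then channel match, then default.
function resolveRoute(table: RouteTable, sender: string, channel: string): RoutingDecision {
  const senderRoute = table.bySender.get(sender);
  if (senderRoute !== undefined) return { route: senderRoute, reason: "sender" };
  const channelRoute = table.byChannel.get(channel);
  if (channelRoute !== undefined) return { route: channelRoute, reason: "channel" };
  return { route: table.defaultRoute, reason: "default" };
}

// Unknown or low-confidence intents always fall back to the orchestrator.
// Every decision is passed to `log` so it can be inspected later.
function routeByIntent(
  guess: IntentGuess,
  intentRoutes: ReadonlyMap<string, Route>,
  log: (decision: RoutingDecision) => void,
): RoutingDecision {
  const target = intentRoutes.get(guess.intent);
  const decision: RoutingDecision =
    target !== undefined && guess.confidence >= MIN_CONFIDENCE
      ? { route: target, reason: "intent" }
      : { route: ORCHESTRATOR, reason: "fallback" };
  log(decision);
  return decision;
}
```

Because neither function reads clocks, random state, or mutable globals, a determinism test can call each one twice with the same input and compare the results.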
Memory Correctness
- Legacy and category memory reads/writes remain compatible.
- Search results respect namespace/category filters.
- Prompt memory injection remains token-bounded.
- Compaction preserves high-value turns under weighted selection.
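The token bound and the weighted selection can both be checked against a small, pure selection function. The following is a minimal sketch under stated assumptions: `MemoryEntry`, its precomputed `tokens` field, and the greedy highest-weight-first strategy are hypothetical and may not match the actual compaction logic.

```typescript
// Hypothetical shape; assumes each entry carries a precomputed token count.
interface MemoryEntry {
  id: string;
  weight: number; // higher means more valuable
  tokens: number; // cost of injecting this entry into the prompt
}

// Greedy selection: take entries in descending weight order and skip any
// entry that would push the running total over the budget.
// Ties are broken by id so the result is stable for the same input.
function selectWithinBudget(entries: readonly MemoryEntry[], maxTokens: number): MemoryEntry[] {
  const ranked = [...entries].sort((a, b) => {
    if (b.weight !== a.weight) return b.weight - a.weight;
    return a.id < b.id ? -1 : a.id > b.id ? 1 : 0;
  });
  const picked: MemoryEntry[] = [];
  let used = 0;
  for (const entry of ranked) {
    if (used + entry.tokens > maxTokens) continue; // does not fit, try the next one
    picked.push(entry);
    used += entry.tokens;
  }
  return picked;
}

const sampleEntries: MemoryEntry[] = [
  { id: "a", weight: 0.9, tokens: 50 },
  { id: "b", weight: 0.5, tokens: 60 },
  { id: "c", weight: 0.2, tokens: 30 },
];
// With a budget of 100 tokens, "a" fits, "b" would exceed the budget, "c" fits.
const sampleSelection = selectWithinBudget(sampleEntries, 100).map((e) => e.id); // ["a", "c"]
```

A test for this gate would assert two properties: the summed `tokens` of the result never exceeds the budget, and the highest-weight entry that fits is always retained.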
Latency Budgets
- New routing checks add negligible overhead for non-command messages.
- Memory relevance scoring remains bounded (time + token budgets).
- History search defaults are capped (limit, thresholds) to avoid spikes.
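Capped defaults are easiest to verify when option normalization lives in one place. The sketch below assumes a hypothetical `HistorySearchOptions` shape; the option names and the numeric defaults and caps are placeholders, not values from the project.

```typescript
// Hypothetical options; names and numbers are illustrative placeholders.
interface HistorySearchOptions {
  limit?: number;    // maximum number of results
  minScore?: number; // relevance threshold in the range 0..1
}

const DEFAULT_LIMIT = 20;
const MAX_LIMIT = 100;
const DEFAULT_MIN_SCORE = 0.3;

// Fill in defaults and clamp caller-supplied values so that no request
// can ask for an unbounded result set.
function normalizeSearchOptions(opts: HistorySearchOptions = {}): Required<HistorySearchOptions> {
  const requestedLimit = opts.limit ?? DEFAULT_LIMIT;
  const limit = Number.isFinite(requestedLimit)
    ? Math.min(Math.max(1, Math.floor(requestedLimit)), MAX_LIMIT)
    : DEFAULT_LIMIT;

  const requestedScore = opts.minScore ?? DEFAULT_MIN_SCORE;
  const minScore = Number.isFinite(requestedScore)
    ? Math.min(Math.max(0, requestedScore), 1)
    : DEFAULT_MIN_SCORE;

  return { limit, minScore };
}
```

Under these assumptions, `normalizeSearchOptions({ limit: 5000 })` yields a limit of `100`, and an omitted or non-finite limit falls back to `20`.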
Policy Enforcement Verifiability
- Tool allow/deny and autonomy decisions are test-covered.
- Denied/overridden actions are audit-logged with explicit reason.
- Confirmation behavior is deterministic by mode.
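A policy check that returns a verdict and records a reason in the same step keeps the audit gate verifiable in unit tests. This is a hedged sketch: `ToolPolicy`, `AuditRecord`, the deny-overrides-allow rule, and the default-deny behavior for unlisted tools are assumptions made for illustration.

```typescript
type Verdict = "allow" | "deny";

// Hypothetical policy and audit shapes.
interface ToolPolicy {
  allow: ReadonlySet<string>;
  deny: ReadonlySet<string>;
}

interface AuditRecord {
  tool: string;
  verdict: Verdict;
  reason: string;
}

// Assumed rules: the deny list wins over the allow list, and a tool on
// neither list is denied. Every denial is appended to the audit log with
// an explicit reason.
function decideToolUse(policy: ToolPolicy, tool: string, audit: AuditRecord[]): Verdict {
  let record: AuditRecord;
  if (policy.deny.has(tool)) {
    record = { tool, verdict: "deny", reason: "tool is on the deny list" };
  } else if (policy.allow.has(tool)) {
    record = { tool, verdict: "allow", reason: "tool is on the allow list" };
  } else {
    record = { tool, verdict: "deny", reason: "tool is not on the allow list" };
  }
  if (record.verdict === "deny") audit.push(record);
  return record.verdict;
}
```

A test can then assert both the verdict and the exact audit entry, which covers the allow/deny and audit-logging gates in a single case.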
Pre-Merge Checks Per PR
pnpm typecheck
pnpm test:run
pnpm lint
pnpm build
- Add targeted test commands from each PR checklist.
- Update docs/plans/state.json with progress and test counts when implementation lands.
- Keep commits atomic and scoped to one checklist PR at a time.