# Remaining Phases Rollout Quality Gates

Created: 2026-02-12
Applies to: Phase 2 and Phase 3 implementation PRs

## Regression Safety

- All existing tests pass before and after each PR.
- No breaking changes to the public interfaces of core abstractions (`ModelClient`, `ChannelAdapter`, `Tool`).
- Existing configs load without migration failures.
- Existing command flows continue working unchanged.

## Routing Determinism

- Sender/channel/default route resolution remains deterministic.
- Intent and confidence routing produce stable decisions for the same input.
- Unknown/low-confidence inputs always fall back to the normal orchestrator path.
- Routing decisions are logged for debugging.

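
The routing gates above can be illustrated with a minimal sketch. It assumes a fixed precedence (sender, then channel, then default) and a confidence threshold below which routing falls back to the orchestrator. The names `RouteTable`, `resolveStaticRoute`, `resolveIntentRoute`, `ORCHESTRATOR`, and the `0.7` threshold are hypothetical and do not come from the codebase.

```typescript
// Illustrative sketch only: these types and names are assumptions, not the project's API.
type RouteTarget = string;

interface RouteTable {
  bySender: Record<string, RouteTarget>;
  byChannel: Record<string, RouteTarget>;
  defaultRoute: RouteTarget;
}

interface RouteDecision {
  target: RouteTarget;
  reason: "sender" | "channel" | "default" | "intent" | "fallback";
}

const ORCHESTRATOR: RouteTarget = "orchestrator"; // assumed name for the normal path
const MIN_CONFIDENCE = 0.7; // assumed threshold

function lookup(map: Record<string, RouteTarget>, key: string): RouteTarget | undefined {
  // Own-property check so keys such as "toString" never match inherited members.
  return Object.prototype.hasOwnProperty.call(map, key) ? map[key] : undefined;
}

// Fixed precedence (sender, then channel, then default) keeps resolution deterministic.
function resolveStaticRoute(table: RouteTable, sender: string, channel: string): RouteDecision {
  const senderTarget = lookup(table.bySender, sender);
  if (senderTarget !== undefined) return { target: senderTarget, reason: "sender" };
  const channelTarget = lookup(table.byChannel, channel);
  if (channelTarget !== undefined) return { target: channelTarget, reason: "channel" };
  return { target: table.defaultRoute, reason: "default" };
}

// Unknown intents and low (or NaN) confidence fall back to the orchestrator.
function resolveIntentRoute(
  intents: Record<string, RouteTarget>,
  intent: string,
  confidence: number,
  log: (decision: RouteDecision) => void,
): RouteDecision {
  const target = confidence >= MIN_CONFIDENCE ? lookup(intents, intent) : undefined;
  const decision: RouteDecision =
    target !== undefined
      ? { target, reason: "intent" }
      : { target: ORCHESTRATOR, reason: "fallback" };
  log(decision); // every decision is passed to the logger for debugging
  return decision;
}
```

Because both functions are pure apart from the injected `log` callback, a determinism test can call them repeatedly with the same input and compare the results directly.
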
## Memory Correctness

- Legacy and category memory reads/writes remain compatible.
- Search results respect namespace/category filters.
- Prompt memory injection remains token-bounded.
- Compaction preserves high-value turns under weighted selection.

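
One way to picture the category-filter and token-bound gates is a greedy selection that filters by category, ranks by score, and stops adding entries once a token budget would be exceeded. This is only a sketch: `MemoryEntry`, `selectForPrompt`, and `estimateTokens` are hypothetical names, and the four-characters-per-token estimate is a crude stand-in for a real tokenizer.

```typescript
// Illustrative sketch only: entry shape and function names are assumptions.
interface MemoryEntry {
  id: string;
  text: string;
  score: number; // higher means more relevant
  category?: string;
}

// Crude estimate (about 4 characters per token); a real tokenizer should replace this.
function estimateTokens(text: string): number {
  return Math.ceil(text.length / 4);
}

function selectForPrompt(entries: MemoryEntry[], maxTokens: number, category?: string): MemoryEntry[] {
  const candidates = entries
    .filter((e) => category === undefined || e.category === category)
    // Sort by score descending; break ties by id so the order is stable across runs.
    .sort((a, b) => b.score - a.score || (a.id < b.id ? -1 : a.id > b.id ? 1 : 0));

  const picked: MemoryEntry[] = [];
  let used = 0;
  for (const entry of candidates) {
    const cost = estimateTokens(entry.text);
    if (used + cost > maxTokens) continue; // skip anything that would exceed the budget
    picked.push(entry);
    used += cost;
  }
  return picked;
}
```

A test for the token-bound gate can then assert that the summed estimate of the returned entries never exceeds `maxTokens`, whatever the input.
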
## Latency Budgets

- New routing checks add negligible overhead for non-command messages.
- Memory relevance scoring remains bounded (time + token budgets).
- History search defaults are capped (`limit`, thresholds) to avoid spikes.

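
The capped-defaults gate could be enforced by normalizing search options before any query runs. The sketch below is an assumption about how that might look: only `limit` appears in this document, while `minScore`, `normalizeSearchOptions`, and the specific default and maximum values are hypothetical.

```typescript
// Illustrative sketch only: option names other than `limit` and all numeric values are assumptions.
interface HistorySearchOptions {
  limit?: number;
  minScore?: number;
}

const DEFAULT_LIMIT = 20;
const MAX_LIMIT = 100;
const DEFAULT_MIN_SCORE = 0.3;

function clamp(value: number, min: number, max: number): number {
  return Math.min(Math.max(value, min), max);
}

// Missing or non-finite values fall back to defaults; everything else is clamped into range.
function normalizeSearchOptions(options: HistorySearchOptions = {}): Required<HistorySearchOptions> {
  const limit = options.limit;
  const minScore = options.minScore;
  return {
    limit:
      limit !== undefined && isFinite(limit)
        ? clamp(Math.floor(limit), 1, MAX_LIMIT)
        : DEFAULT_LIMIT,
    minScore:
      minScore !== undefined && isFinite(minScore)
        ? clamp(minScore, 0, 1)
        : DEFAULT_MIN_SCORE,
  };
}
```

Clamping at a single entry point means a caller that passes an oversized `limit` gets a bounded query instead of a latency spike.
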
## Policy Enforcement Verifiability

- Tool allow/deny and autonomy decisions are test-covered.
- Denied/overridden actions are audit-logged with an explicit reason.
- Confirmation behavior is deterministic by mode.

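
To make these gates concrete, here is a rough sketch of a policy check that is easy to cover with tests. It assumes that the deny list takes precedence over the allow list and that there are two autonomy modes. `ToolPolicy`, `AutonomyMode`, `decideToolUse`, and the mode names are hypothetical and may not match the real policy layer.

```typescript
// Illustrative sketch only: policy shape, mode names, and precedence are assumptions.
interface ToolPolicy {
  allow: string[];
  deny: string[];
}

type AutonomyMode = "autonomous" | "confirm"; // hypothetical mode names

interface PolicyDecision {
  allowed: boolean;
  requiresConfirmation: boolean;
  reason: string;
}

interface AuditRecord {
  tool: string;
  reason: string;
}

function decideToolUse(
  tool: string,
  policy: ToolPolicy,
  mode: AutonomyMode,
  audit: AuditRecord[],
): PolicyDecision {
  let denialReason: string | undefined;
  if (policy.deny.indexOf(tool) !== -1) {
    denialReason = "tool is on the deny list"; // deny takes precedence over allow (assumed)
  } else if (policy.allow.indexOf(tool) === -1) {
    denialReason = "tool is not on the allow list";
  }

  if (denialReason !== undefined) {
    audit.push({ tool, reason: denialReason }); // every denial carries an explicit reason
    return { allowed: false, requiresConfirmation: false, reason: denialReason };
  }

  // Confirmation depends only on the mode, so the same mode always yields the same answer.
  return { allowed: true, requiresConfirmation: mode === "confirm", reason: "allowed by policy" };
}
```

Passing the audit sink in as an argument lets a test assert on both the decision and the recorded reason without touching real logging.
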
## Pre-Merge Checks Per PR

```bash
pnpm typecheck
pnpm test:run
pnpm lint
pnpm build
```

- Add targeted test commands from each PR checklist.
- Update `docs/plans/state.json` with progress and test counts when implementation lands.
- Keep commits atomic and scoped to one checklist PR at a time.