docs(eval): enforce min-sample gates in canary artifacts and decision log
This commit is contained in:
@@ -32,8 +32,8 @@
|
||||
|
||||
| Category | Count | Percent |
|
||||
| --- | ---: | ---: |
|
||||
| loaded pi module does not expose a supported session factory (expected one of: c | 1 | 50.00% |
|
||||
| pi agent runtime produced no assistant text | 1 | 50.00% |
|
||||
| empty_assistant_text | 1 | 50.00% |
|
||||
| pi_module_interface | 1 | 50.00% |
|
||||
|
||||
## Top Fallback Reasons
|
||||
|
||||
@@ -43,6 +43,9 @@
|
||||
## Gate Evaluation
|
||||
|
||||
- Gate result: HOLD
|
||||
- [x] Minimum target routes: actual=8, threshold=>= 8
|
||||
- [x] Minimum baseline routes: actual=2, threshold=>= 2
|
||||
- [x] Minimum target external attempts: actual=8, threshold=>= 8
|
||||
- [x] Completion rate delta (target - baseline): actual=0.00pp, threshold=>= -2.00pp
|
||||
- [ ] P50 latency delta (target - baseline): actual=259ms, threshold=<= 250ms
|
||||
- [ ] P95 latency delta (target - baseline): actual=5695ms, threshold=<= 700ms
|
||||
|
||||
Reference in New Issue
Block a user