zap
6642964ae6
docs(cost): add inference cost optimization plan — 4 phases
Phase 1: Enable prompt caching (cacheRetention: long on Claude models)
Phase 2: Heartbeat cache warming (25m main, 55m default)
Phase 3: Context pruning (cache-ttl mode, 1h TTL)
Phase 4: Cheaper models for subagents (GLM-4.7 free tier for bulk work)
All config-only, no OpenClaw code changes, fully reversible.
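
The timing relationship behind Phases 2 and 3 can be sketched as a quick check. This is a minimal illustration, not OpenClaw code: it assumes the 1h TTL from Phase 3 is also the lifetime of a cached prompt, and that each heartbeat resets that lifetime. The names `Heartbeat`, `stays_warm`, and `idle_margin_min` are hypothetical.

```python
from dataclasses import dataclass

# Assumption: cached prompts live for 60 minutes (the 1h TTL named in Phase 3)
# and every heartbeat request resets that clock.
CACHE_TTL_MIN = 60


@dataclass(frozen=True)
class Heartbeat:
    """Hypothetical model of one heartbeat schedule from Phase 2."""
    name: str
    interval_min: int


def stays_warm(hb: Heartbeat, ttl_min: int = CACHE_TTL_MIN) -> bool:
    """True if the heartbeat fires before the cache entry would expire."""
    return hb.interval_min < ttl_min


def idle_margin_min(hb: Heartbeat, ttl_min: int = CACHE_TTL_MIN) -> int:
    """Minutes of slack between a heartbeat and cache expiry."""
    return ttl_min - hb.interval_min


# Intervals taken from the commit message: 25m main, 55m default.
heartbeats = [Heartbeat("main", 25), Heartbeat("default", 55)]

for hb in heartbeats:
    print(hb.name, stays_warm(hb), idle_margin_min(hb))
```

Under these assumptions both intervals fire before the cache would expire, with the default schedule leaving only a few minutes of slack, so a delayed heartbeat there would likely cause a cache miss.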
2026-03-05 20:20:03 +00:00