zap 6642964ae6 docs(cost): add inference cost optimization plan — 4 phases
Phase 1: Enable prompt caching (cacheRetention: long on Claude models)
Phase 2: Heartbeat cache warming (25m main, 55m default)
Phase 3: Context pruning (cache-ttl mode, 1h TTL)
Phase 4: Cheaper models for subagents (GLM-4.7 free tier for bulk work)

All config-only, no OpenClaw code changes, fully reversible.
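
The heartbeat intervals in Phase 2 only help if each one fires before the cached prompt expires. A minimal sketch of that check, assuming the 1h TTL named in Phase 3 is also the window the heartbeats must stay under (the plan does not state this explicitly). The names `keeps_cache_warm`, `CACHE_TTL`, and `HEARTBEATS` are hypothetical and are not OpenClaw config keys:

```python
from datetime import timedelta


def keeps_cache_warm(heartbeat: timedelta, cache_ttl: timedelta) -> bool:
    """Return True if a heartbeat at this interval fires before the cache entry expires."""
    return heartbeat < cache_ttl


# Assumption: the 1h TTL from Phase 3 is the cache lifetime to stay under.
CACHE_TTL = timedelta(hours=1)

# Intervals taken from Phase 2 of the plan (25m main, 55m default).
HEARTBEATS = {
    "main": timedelta(minutes=25),
    "default": timedelta(minutes=55),
}

if __name__ == "__main__":
    for name, interval in HEARTBEATS.items():
        margin = CACHE_TTL - interval  # slack left before expiry
        print(f"{name}: warm={keeps_cache_warm(interval, CACHE_TTL)} margin={margin}")
```

Under that assumption both intervals stay inside the TTL, with the default heartbeat leaving a five-minute margin.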
2026-03-05 20:20:03 +00:00
591 KiB
Languages
Shell 61%
Python 28%
JavaScript 6.2%
Clojure 3.5%
TypeScript 1.3%