Commit Graph

3 Commits

Author SHA1 Message Date
zap
23782735a1 docs(cost): update plan with corrections from official docs
- Phase 1: clarify cacheRetention only applies to Claude models; GPT auto-caches; GLM has none
- Phase 1: add TTL reality check (short=5min, long=1h) and implications for heartbeat timing
- Phase 2: explain why long TTL + 25m heartbeat is the right combo
- Phase 4: replace generic prompt tips with model-specific guidance from official Anthropic/OpenAI docs
- Added prompt structure notes for cache efficiency, GLM-4.7 tighter prompting requirements
- References: memory/references/*.md
2026-03-05 20:37:32 +00:00
zap
c2fe8155e3 docs(cost): add upstream safety rules to cost optimization plan
Hard constraints: no dist edits, no undocumented config keys, no middleware.
Safe: documented openclaw.json knobs + workspace files only.
Includes update workflow and pre-implementation verification checklist.
2026-03-05 20:25:59 +00:00
zap
6642964ae6 docs(cost): add inference cost optimization plan — 4 phases
Phase 1: Enable prompt caching (cacheRetention: long on Claude models)
Phase 2: Heartbeat cache warming (25m main, 55m default)
Phase 3: Context pruning (cache-ttl mode, 1h TTL)
Phase 4: Cheaper models for subagents (GLM-4.7 free tier for bulk work)

All config-only, no OpenClaw code changes, fully reversible.
2026-03-05 20:20:03 +00:00