swarm-zap

will/swarm-zap

Commit Graph

Author

SHA1

Message

Date

zap

23782735a1

docs(cost): update plan with corrections from official docs

- Phase 1: clarify cacheRetention only applies to Claude models; GPT auto-caches; GLM has none
- Phase 1: add TTL reality check (short=5min, long=1h) and implications for heartbeat timing
- Phase 2: explain why long TTL + 25m heartbeat is the right combo
- Phase 4: replace generic prompt tips with model-specific guidance from official Anthropic/OpenAI docs
- Added prompt structure notes for cache efficiency, GLM-4.7 tighter prompting requirements
- References: memory/references/*.md

2026-03-05 20:37:32 +00:00
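
The TTL note in this commit (short=5min, long=1h) is the reason the heartbeat intervals only make sense with long retention. A minimal sketch of that arithmetic follows, assuming each heartbeat counts as a cache hit that resets the TTL timer; the names `CACHE_TTL` and `keeps_cache_warm` are hypothetical and are not part of OpenClaw or any provider API.

```python
from datetime import timedelta

# TTL values as stated in the commit message (short=5min, long=1h).
CACHE_TTL = {
    "short": timedelta(minutes=5),
    "long": timedelta(hours=1),
}


def keeps_cache_warm(heartbeat: timedelta, retention: str) -> bool:
    """Return True if a heartbeat at this interval arrives before the cache expires.

    Assumption: every heartbeat reads the cached prefix and thereby
    restarts the TTL timer, so the cache stays warm as long as the
    heartbeat interval is strictly shorter than the TTL.
    """
    return heartbeat < CACHE_TTL[retention]


if __name__ == "__main__":
    # Heartbeat intervals taken from the plan: 25m (main) and 55m (default).
    for minutes in (25, 55):
        for retention in ("short", "long"):
            ok = keeps_cache_warm(timedelta(minutes=minutes), retention)
            print(f"{minutes}m heartbeat, {retention} retention: warm={ok}")
```

Under that assumption, both the 25m and 55m heartbeats stay inside the 1h window, while neither can keep a 5min cache alive.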

zap

c2fe8155e3

docs(cost): add upstream safety rules to cost optimization plan

Hard constraints: no dist edits, no undocumented config keys, no middleware.
Safe: documented openclaw.json knobs + workspace files only.
Includes update workflow and pre-implementation verification checklist.

2026-03-05 20:25:59 +00:00

zap

6642964ae6

docs(cost): add inference cost optimization plan — 4 phases

Phase 1: Enable prompt caching (cacheRetention: long on Claude models)
Phase 2: Heartbeat cache warming (25m main, 55m default)
Phase 3: Context pruning (cache-ttl mode, 1h TTL)
Phase 4: Cheaper models for subagents (GLM-4.7 free tier for bulk work)

All config-only, no OpenClaw code changes, fully reversible.

2026-03-05 20:20:03 +00:00

3 Commits