23782735a1
- Phase 1: clarify cacheRetention only applies to Claude models; GPT auto-caches; GLM has none
- Phase 1: add TTL reality check (short=5min, long=1h) and implications for heartbeat timing
- Phase 2: explain why long TTL + 25m heartbeat is the right combo
- Phase 4: replace generic prompt tips with model-specific guidance from official Anthropic/OpenAI docs
- Added prompt structure notes for cache efficiency, GLM-4.7 tighter prompting requirements
- References: memory/references/*.md