Document voice transcription hardening and storage behavior

This commit is contained in:
William Valentin
2026-02-22 18:56:26 -08:00
parent db4e52dd7e
commit 12802b9b24
3 changed files with 118 additions and 1 deletions
+7
View File
@@ -428,6 +428,13 @@ docker run -d \
# docker compose up -d
```
Audio persistence and diagnostics:
- Latest inbound voice bytes are stored per-session in `~/.local/share/flynn/sessions.db` under `session_config.key = "lastAudioAttachment"` (used to safely hydrate `audio.transcribe` calls).
- `/reset` clears session history and session config for that chat, including `lastAudioAttachment`.
- When Flynn rewrites bad model-provided audio tool args, it emits audit event `tool.args_rewritten`.
- Runbook: `docs/runbooks/VOICE_TRANSCRIPTION_DEBUG.md`.
### Text-to-Speech (TTS) Reply Audio
Flynn can attach synthesized voice replies (OpenAI-compatible `/v1/audio/speech`) alongside text responses.