will/flynn

Author	SHA1	Message	Date
William Valentin	948d4ac6d8	chore(lint): burn down remaining warnings to zero	2026-02-15 23:14:21 -08:00
William Valentin	32e1a2724a	feat(audio): add native audio support to type system and model clients - Add AudioSource interface and 'audio' variant to MessageContentPart union - Update buildUserMessage() to create audio content parts from attachments - Add attachmentToAudioSource(), hasAudio(), stripAudioParts() helpers - Gemini: native audio via inlineData (same format as images) - OpenAI/GitHub: native audio via input_audio content parts - Anthropic/Bedrock: graceful fallback to transcript text - Update getMessageTextWithTools() to handle audio blocks for local models	2026-02-11 18:17:33 -08:00
William Valentin	6090508bad	style: auto-fix ESLint issues (curly braces and formatting) - Add curly braces to all if/else/for/while statements - Fix indentation and trailing spaces - Auto-fixed 372 linting errors using eslint --fix - Remaining issues are warnings only (non-null assertions, explicit any types)	2026-02-11 10:30:24 -08:00
William Valentin	6761dca1c2	fix: normalize message roles for local model backends (llama.cpp, Ollama) Local backends using strict chat templates (e.g. Mistral 3) rejected Flynn's Anthropic-style tool_use/tool_result content blocks, causing 'roles must alternate' errors. Added getMessageTextWithTools() and normalizeMessagesForLocal() to serialize structured blocks to plain text, drop empty messages, and merge consecutive same-role messages. Also fixed compaction to ensure kept messages start with user role.	2026-02-10 22:04:17 -08:00
William Valentin	2a962abcd0	feat: add audio transcription pipeline for voice messages Adds Whisper-compatible audio transcription via configurable endpoint. New functions: isSupportedAudio(), mimeToExtension(), transcribeAudio(), buildUserMessageWithAudio(). Config schema gains audio section with transcription_endpoint, api_key, and model. Daemon wires transcription into the message router. Channel adapters extract audio from voice/audio messages (Telegram voice+audio, Discord audio/, Slack audio/, WhatsApp ptt+audio). Includes 57 media tests (was 25, now covers all audio paths).	2026-02-07 09:09:13 -08:00
William Valentin	a515912537	feat: add multimodal media pipeline for image support across all providers and channels Widen Message.content from string to string | MessageContentPart[] to support multimodal content. Add Attachment type to channel layer, media conversion utilities, and image extraction to all channel adapters (Telegram, Discord, Slack, WhatsApp). Update all model clients (Anthropic, OpenAI, Gemini, Bedrock) to convert structured content to provider-specific formats. Fix downstream consumers (tokens, compaction, TUI, local models) to handle the widened type via getMessageText() helper.	2026-02-06 17:17:21 -08:00

6 Commits
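
Commit 6761dca1c2 describes normalizing messages for local backends in three steps: serialize structured blocks to plain text, drop empty messages, and merge consecutive same-role messages. The following is a minimal sketch of that idea, not the repository's actual code. The type shapes, the bracketed serialization format, and the names `flattenContent` and `normalizeForLocal` are assumptions made for illustration; the real `getMessageTextWithTools()` and `normalizeMessagesForLocal()` may differ.

```typescript
// Hypothetical types: the real Flynn definitions are not shown in this log.
type Role = "system" | "user" | "assistant";

type ContentPart =
  | { type: "text"; text: string }
  | { type: "tool_use"; name: string; input: unknown }
  | { type: "tool_result"; content: string };

interface Message {
  role: Role;
  content: string | ContentPart[];
}

interface PlainMessage {
  role: Role;
  content: string;
}

// Step 1: serialize structured blocks to plain text.
// The bracketed format is an assumption, chosen only for readability.
function flattenContent(content: string | ContentPart[]): string {
  if (typeof content === "string") {
    return content;
  }
  return content
    .map((part) => {
      switch (part.type) {
        case "text":
          return part.text;
        case "tool_use":
          return `[tool_use ${part.name}: ${JSON.stringify(part.input)}]`;
        case "tool_result":
          return `[tool_result: ${part.content}]`;
      }
    })
    .join("\n");
}

function normalizeForLocal(messages: Message[]): PlainMessage[] {
  const out: PlainMessage[] = [];
  for (const message of messages) {
    const text = flattenContent(message.content).trim();
    // Step 2: drop messages that are empty after flattening.
    if (text === "") {
      continue;
    }
    const last = out[out.length - 1];
    if (last !== undefined && last.role === message.role) {
      // Step 3: merge consecutive same-role messages so roles alternate.
      last.content += "\n" + text;
    } else {
      out.push({ role: message.role, content: text });
    }
  }
  return out;
}
```

Dropping empty messages before merging matters here: an empty message sitting between two same-role messages would otherwise be the only thing separating them, and removing it afterwards would leave two adjacent messages with the same role.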