flynn

will/flynn

Author	SHA1	Message	Date
William Valentin	1159fac640	feat(skills): add confirmed no-op execution receipts	2026-02-12 18:44:13 -08:00
William Valentin	f0fc806f95	refactor(skills): centralize install action modes	2026-02-12 18:36:09 -08:00
William Valentin	e8d5d01d4d	feat(skills): add installer execution stub command	2026-02-12 18:26:09 -08:00
William Valentin	1bb791c7dd	feat(skills): support install preflight-only mode	2026-02-12 18:17:46 -08:00
William Valentin	601844c50e	feat(skills): preview installer plan during install	2026-02-12 18:15:42 -08:00
William Valentin	d3ba1328f2	feat(skills): add installer plan command output	2026-02-12 18:11:38 -08:00
William Valentin	bd754d520e	feat(skills): add dry-run installer planning surface	2026-02-12 17:56:51 -08:00
William Valentin	81d04357a1	feat(skills): validate manifest installer specs	2026-02-12 17:52:53 -08:00
William Valentin	bd29afeaff	chore(skills): improve watcher event observability	2026-02-12 17:40:41 -08:00
William Valentin	333e33f30f	feat(skills): target watcher updates with safe fallback	2026-02-12 17:36:32 -08:00
William Valentin	2fb5c9adab	feat(skills): reload registry on watcher change events	2026-02-12 17:30:23 -08:00
William Valentin	b773e2bbf3	feat(skills): enable watcher wiring through daemon lifecycle	2026-02-12 17:18:22 -08:00
William Valentin	95091cc198	feat(skills): add debounced watcher foundation for phase 2	2026-02-12 17:15:46 -08:00
William Valentin	0a19f01639	feat(doctor): surface skill directory health in diagnostics	2026-02-12 17:05:04 -08:00
William Valentin	fc3d2ab4d8	feat(skills): add refresh summary for discovery health	2026-02-12 17:02:23 -08:00
William Valentin	2d753321b3	feat(skills): guard uninstall with explicit confirmation	2026-02-12 16:59:50 -08:00
William Valentin	d5b7d72e5d	feat(skills): add install dispatch for local skill setup	2026-02-12 16:50:25 -08:00
William Valentin	0d84a6bccc	feat(skills): add info command for skill inspection	2026-02-12 16:44:46 -08:00
William Valentin	b3e5aee333	feat(skills): expose list command for skill visibility	2026-02-12 16:42:00 -08:00
William Valentin	90ce622080	feat(policy): enforce truthfulness and autonomy guardrails Add runtime truthfulness modes and autonomy-level tool gating with audit metadata for overrides/denials. Wire policy through prompt assembly, tool execution context, and daemon/gateway agent paths; update tests and planning state for Phase 3 PR #2 completion.	2026-02-12 16:06:45 -08:00
William Valentin	125af4e832	refactor(backend): use systemd for daemon management Replace manual process management with systemctl --user commands. Uses ollama.service and llama-server.service units for proper lifecycle management, VRAM cleanup, and integration with system services.	2026-02-12 00:24:43 -08:00
William Valentin	1c8da30905	fix(backend): only kill processes started by TUI Track PIDs of backends started by /backend command and only kill those specific PIDs. Previous implementation used pkill which would kill all Ollama/llama-server processes including those started by the user or systemd services. Now we only terminate processes we started.	2026-02-12 00:19:26 -08:00
William Valentin	05037a917e	feat(backend): auto-stop/start daemon when switching backends - Add local_providers with ollama and llamacpp configurations - /backend command now stops current daemon before starting new one - Start backends as detached processes to avoid blocking TUI - Wait 500ms for daemon to initialize before switching	2026-02-12 00:13:59 -08:00
William Valentin	0b44adbaea	fix(audio): add SSRF protection, MIME type fix, and tests for audio-transcribe tool - Add URL validation blocking localhost, private IPs, and non-http protocols - Use response Content-Type header instead of hardcoded audio/wav for URL downloads - Add 25 tests covering validation, SSRF, config errors, transcription paths, and error handling Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 21:57:45 -08:00
William Valentin	a8a2c59313	feat: implement model persistence with per-session overrides - Add session_config SQLite table for per-session settings - Update routing to support session override → agent config → global default resolution chain - Upgrade WebChat SessionBridge from NativeAgent to AgentOrchestrator - Add /model, /local, /cloud commands to Telegram adapter - Add /model command to WebChat gateway handlers - Clear session overrides on /reset command - Pass memoryStore and config through to SessionBridge - Add comprehensive tests for all new functionality Fixes model persistence bug where TUI model changes didn't affect WebChat/Telegram sessions. Now: - TUI /model sets global default (persists across restarts, affects all new sessions) - WebChat/Telegram /model sets session override (only that conversation, cleared on /reset) - WebChat sessions gain AgentOrchestrator features (delegation, compaction, memory)	2026-02-11 21:51:38 -08:00
William Valentin	2e235213d9	fix(audio): handle voice message failures gracefully - Send user feedback when voice/audio download fails instead of silent failure - Send graceful message when audio transcription is not configured instead of empty text which crashes API	2026-02-11 19:44:42 -08:00
William Valentin	148219153e	feat(audio): add tests, token estimation, and config override for native audio - Add capabilities.test.ts (18 tests) for supportsAudioInput() - Add 15 audio tests to media.test.ts (hasAudio, stripAudioParts, attachmentToAudioSource) - Add estimateAudioTokens() to tokens.ts (base64→bytes→duration→tokens) - Update estimateMessageTokens() to include audio content parts - Add 5 audio token tests to tokens.test.ts - Add supports_audio config override to model schema - Wire supports_audio from tier config through routing to capability check Total tests: 1369 (was 1331, +38 audio-related)	2026-02-11 18:27:19 -08:00
William Valentin	32ac4df20a	feat(audio): add smart routing for native vs transcribed audio - Create capabilities.ts with supportsAudioInput() detection - Gemini, OpenAI, and GitHub Models get native audio passthrough - Anthropic, Bedrock, Ollama, llama.cpp fall back to Whisper transcription - routing.ts now checks model capability before deciding to transcribe - Audio attachments are stripped for non-native models (only transcript text passed) - Remove deprecated audioConfig from createMessageRouter deps (read from config.audio)	2026-02-11 18:20:10 -08:00
William Valentin	32e1a2724a	feat(audio): add native audio support to type system and model clients - Add AudioSource interface and 'audio' variant to MessageContentPart union - Update buildUserMessage() to create audio content parts from attachments - Add attachmentToAudioSource(), hasAudio(), stripAudioParts() helpers - Gemini: native audio via inlineData (same format as images) - OpenAI/GitHub: native audio via input_audio content parts - Anthropic/Bedrock: graceful fallback to transcript text - Update getMessageTextWithTools() to handle audio blocks for local models	2026-02-11 18:17:33 -08:00
William Valentin	a875bcc4ae	feat(audio): add audio.transcribe tool with Whisper-compatible API support - Add createAudioTranscribeTool with OpenAI/Groq/Ollama/llama.cpp provider support - Refactor audio config schema to nested audio.enabled + audio.provider structure - Move audio tool registration to initTools() for conditional enablement - Fix duplication bug in audio-transcribe.ts URL download handler - Support base64 data and URL-based audio input with format detection	2026-02-11 18:13:19 -08:00
William Valentin	2dddae8f9b	feat(audit): Add automation component logging Add audit logging to: - WebhookHandler: connect/disconnect, receive, not_found, denied, HMAC verified - HeartbeatMonitor: start/stop, cycle, check, fail, recover - GmailWatcher: connect/disconnect lifecycle events All automation events now captured in audit log with proper context	2026-02-11 16:04:33 -08:00
William Valentin	d62e836b5d	feat(audit): Add core audit logging infrastructure - Add AuditLogger class with rotation support - Add audit configuration to config schema - Instrument tool execution with full audit logging - Instrument session lifecycle (create, message, delete, transfer, compact) - Add audit logger initialization in daemon - Add cron scheduler audit logging Audit events captured: - tool.start/success/error/denied - session.create/message/delete/transfer/compact - cron.trigger/add/remove All logs go to ~/.local/share/flynn/audit.log (JSON lines) with rotation (10MB files, 30-day retention)	2026-02-11 15:58:07 -08:00
William Valentin	6090508bad	style: auto-fix ESLint issues (curly braces and formatting) - Add curly braces to all if/else/for/while statements - Fix indentation and trailing spaces - Auto-fixed 372 linting errors using eslint --fix - Remaining issues are warnings only (non-null assertions, explicit any types)	2026-02-11 10:30:24 -08:00
William Valentin	85d7a6bfec	test: add stopReason edge case tests; update state.json with recent fixes - Added tests for finish_reason 'tool_calls' with empty array → 'end_turn' - Added test for finish_reason 'length' → 'max_tokens' - Updated state.json with 4 new entries for today's fixes (SOUL.md, message normalization, agent loop resilience, stopReason normalization) - Test count: 1329 → 1331	2026-02-11 09:51:19 -08:00
William Valentin	01c3175fdb	fix: normalize OpenAI/GitHub finish_reason to Flynn stopReason conventions OpenAI-compatible providers return 'stop' and 'tool_calls' as finish_reason values, but Flynn's agent loop expects Anthropic-style 'end_turn' and 'tool_use'. This caused the agent to exit the tool loop prematurely when falling back to GitHub Copilot (due to Anthropic API quota exhaustion). - openai.ts: Map 'stop' → 'end_turn', 'length' → 'max_tokens', tool_calls with actual tools → 'tool_use', tool_calls without tools → 'end_turn' - github.ts: Handle edge case where finish_reason is 'tool_calls' but no tools were parsed - agent.ts: Accept both 'tool_use' and 'tool_calls' as valid stop reasons (belt-and-suspenders), extract toolCalls to local variable for TS narrowing - openai.test.ts: Update expectations to match new normalized values	2026-02-11 09:49:36 -08:00
William Valentin	1aab006a7f	feat: improve agent loop resilience — same-tool nudging and error handling - agent.ts: track consecutive calls to the same tool (ignoring args) and inject a nudge after 4 repeats telling the model to summarize and respond, preventing local models from endlessly retrying searches with slight query variations - agent.ts: wrap the entire tool loop iteration in try-catch so model/network errors don't crash the daemon — returns a descriptive error message instead - Tests for both: nudge triggers after 4 same-tool calls, error recovery persists to history	2026-02-11 09:33:30 -08:00
William Valentin	c01de7d097	feat: native tool calling message normalization for Ollama and llama.cpp - ollama.ts: add normalizeMessagesForOllama() converting Anthropic-style tool_use/tool_result blocks to Ollama's native tool_calls + role:tool format - llamacpp.ts: add normalizeMessagesForLlamaCpp() with hybrid approach — assistant tool_calls in native format, but tool results as structured user messages (many GGUF templates silently drop role:tool messages) - llamacpp.ts: add configurable requestTimeout with AbortController (default 3min) - Both use fast-path when no tool blocks are present (zero overhead) - Full test coverage for both normalizers: plain text passthrough, tool_use conversion, tool_result mapping, multi-tool round trips, error results	2026-02-11 09:33:21 -08:00
William Valentin	5270234bbb	feat: improve tool usage guidance in SOUL.md and add cron.create/cron.delete tools - SOUL.md: list all available tools (web.search, memory., cron., etc.) and add Tool Usage Rules section enforcing 'act, don't narrate' - cron.ts: add getJob(), addJob(), removeJob() to CronScheduler for runtime (ephemeral) cron job management - cron tools: add cron.create and cron.delete tools, enhance cron.list to show schedule/output/message details - policy.ts: add cron tools to messaging and coding profiles, add group:cron to tool groups Fixes issue where models would narrate tool intent ('let me search...') then stop without actually calling tools.	2026-02-11 09:32:36 -08:00
William Valentin	eea7ca62a8	chore: increase GmailWatcher default poll interval from 60s to 300s	2026-02-11 08:43:48 -08:00
William Valentin	60b214e7c4	feat: add per-cron-job model tier selection Allow cron jobs to specify a `model_tier` field that controls which LLM tier handles the job, without needing separate agent configs. Precedence: cron job model_tier > agent config > global primary_tier > 'default'. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-10 22:31:18 -08:00
William Valentin	6761dca1c2	fix: normalize message roles for local model backends (llama.cpp, Ollama) Local backends using strict chat templates (e.g. Mistral 3) rejected Flynn's Anthropic-style tool_use/tool_result content blocks, causing 'roles must alternate' errors. Added getMessageTextWithTools() and normalizeMessagesForLocal() to serialize structured blocks to plain text, drop empty messages, and merge consecutive same-role messages. Also fixed compaction to ensure kept messages start with user role.	2026-02-10 22:04:17 -08:00
William Valentin	2f6d045e2a	feat: load .env file at startup using Node built-in loadEnvFile Adds process.loadEnvFile() to CLI entry point so API keys (ZHIPUAI_API_KEY, OPENROUTER_API_KEY, XAI_API_KEY, etc.) can be stored in a project .env file instead of shell environment or systemd service config. Uses Node >= 20.12 built-in — no dotenv dependency needed. Silent no-op if .env doesn't exist. Updates .env.example with placeholders for all provider API keys.	2026-02-10 21:43:09 -08:00
William Valentin	5c90640e2a	fix: clear error messages for missing API keys on provider switch Previously, switching to zhipuai/openrouter/xai via /model would throw a confusing 'OPENAI_API_KEY missing' error from the OpenAI SDK. Now createClientFromConfig validates API keys before constructing the client, throwing errors that name the correct env var (e.g. ZHIPUAI_API_KEY). Also fixes the misleading 'as anthropic' type cast in the /model handler to validate against MODEL_PROVIDERS and use the ModelProvider type.	2026-02-10 21:32:44 -08:00
William Valentin	aaaf4a361a	fix(webchat): move action buttons outside message bubble Wrap each message in a .message-wrapper div and render copy/edit buttons below the bubble instead of overlapping inside it. Improves readability and prevents buttons from covering message content.	2026-02-10 21:26:22 -08:00
William Valentin	25482b8516	feat: sync PROVIDER_NAMES with config schema and update README docs Extract MODEL_PROVIDERS const from config schema as single source of truth for provider names. PROVIDER_NAMES in TUI commands now imports from schema instead of maintaining a hardcoded list. Adds tests verifying sync. Updates README TUI Commands section with /model hot-swap documentation, supported providers, and runtime model switching examples.	2026-02-10 21:26:18 -08:00
William Valentin	27ee3b2c10	feat(webchat): add copy and edit buttons on chat messages Copy button on all messages (clipboard API with checkmark feedback). Edit button on user messages populates the input textarea. Buttons appear on hover (desktop) or always visible (mobile).	2026-02-10 20:53:49 -08:00
William Valentin	4c8ba3f20c	feat(webchat): add slash commands, autocomplete popup, and web search button Add 6 slash commands (/help, /reset, /compact, /usage, /status, /model) with autocomplete popup (arrow keys, Enter/Tab/Escape navigation). Search button toggles web search mode by prepending instruction to message. Backend agent.send extended with metadata for server-side command routing.	2026-02-10 20:45:14 -08:00
William Valentin	7a69794418	fix: sync model tier between TUI and WebChat when switching models ModelRouter now supports multiple tier-change listeners via addOnTierChange(), SessionBridge subscribes to tier changes and propagates them to all WebChat agents (both existing and newly created), and the fullscreen TUI now also updates the agent's tier when switching models (matching minimal TUI behavior).	2026-02-10 20:22:40 -08:00
William Valentin	bf9ca690f3	fix(agent): detect repeated tool call loops and make max_iterations configurable Local LLMs often get stuck calling the same tool repeatedly because they lack the sophistication to synthesize results. The agent loop had no safeguard — it re-executed whatever the model requested up to 10 times. Add fingerprint-based loop detection: if the same tool+args combination repeats 3 consecutive times, break the loop and return the last results. Also add agents.max_iterations to the config schema so the iteration limit is user-configurable (default: 10).	2026-02-10 19:35:09 -08:00
William Valentin	4ce8e81c01	fix(gmail): sanitize HTML entities and tags in tool output Gmail API returns snippets with HTML entities (&, ', <br>, etc.) that leaked into LLM responses as raw HTML. Added shared sanitizeHtml() utility in src/utils/html.ts and applied it to gmail tool snippets, HTML body fallback, and gmail watcher snippets.	2026-02-10 16:30:14 -08:00

... 3 4 5 6 7 ...

423 Commits