flynn

will/flynn

Author	SHA1	Message	Date
William Valentin	35a0061de9	feat(01-02): extract channel adapter registration into src/daemon/channels.ts - Move Telegram, Discord, Slack, WhatsApp, WebChat adapter setup to channels.ts - Move CronScheduler, WebhookHandler, GmailWatcher registration to channels.ts - Clean up index.ts imports (remove unused adapter value imports) - index.ts calls registerChannels() and receives cronScheduler for tool wiring	2026-02-09 20:14:23 -08:00
William Valentin	fb1199a1da	refactor(01-01): extract tool registration into src/daemon/tools.ts - Create initTools() factory encapsulating ToolRegistry, allBuiltinTools, web search tools, ProcessManager, BrowserManager, ToolExecutor, and ToolPolicy - Replace ~70 lines of inline tool setup in startDaemon() with single initTools() call - Clean up tool-specific imports from daemon/index.ts (ToolPolicy, allBuiltinTools, createWebSearchTools, createProcessTools, ProcessManager, createBrowserTools) - Tier 1 agent tools (session, agents list, message send, cron) remain in daemon/index.ts as intended - daemon/index.ts reduced to 457 lines (from 1088 baseline)	2026-02-09 20:12:46 -08:00
William Valentin	efceb38cb6	feat(01-02): extract agent config and sandbox setup into src/daemon/agents.ts - Create initAgents() function encapsulating AgentConfigRegistry, AgentRouter, SandboxManager init - Replace ~26 lines in startDaemon() with single initAgents() call - Lifecycle shutdown handler for sandbox cleanup included in agents.ts - Zero type errors, routing tests pass	2026-02-09 20:11:32 -08:00
William Valentin	00f8f74aac	refactor(01-01): extract memory initialization into src/daemon/memory.ts - Create initMemory() factory encapsulating MemoryStore, VectorStore, HybridSearch, background indexer, and memory tools registration - Replace ~65 lines of inline memory init in startDaemon() with single initMemory() call - Clean up memory-specific imports from daemon/index.ts (MemoryStore, VectorStore, HybridSearch, createEmbeddingProvider, chunkText, contentHash, createMemoryTools)	2026-02-09 20:10:49 -08:00
William Valentin	08f5b6b8e7	feat(01-02): extract message routing into src/daemon/routing.ts - Move createMessageRouter function (~220 lines) to dedicated routing module - Add import from ./routing.js in daemon/index.ts - routing.test.ts passes without modification - Zero type errors	2026-02-09 20:09:28 -08:00
William Valentin	86cda91f6b	refactor(01-01): extract model client logic into src/daemon/models.ts - Move createClientFromConfig, anthropicToGitHubModel, createAutoFallbackClient, createModelRouter to dedicated module - Add re-exports from daemon/index.ts for backward compatibility - clientFactory.test.ts passes without modification - Reduces daemon/index.ts by ~248 lines	2026-02-09 20:06:27 -08:00
William Valentin	1e29da4da2	feat: complete DM pairing codes with channel adapters, gateway handlers, and TUI command (Tier 4 feature 4)	2026-02-09 18:28:10 -08:00
William Valentin	4413c4dc7c	feat: add gateway lock, shell completion, and tailscale serve (Tier 4 features 1-3)	2026-02-09 13:29:59 -08:00
William Valentin	9be8f76bc7	feat: implement Tier 3 features — lane queue, credential redaction, token dashboard, xAI, Voyage AI - Lane Queue: per-session FIFO queue in gateway replacing reject-when-busy (9 tests) - Credential Redaction: redactConfig() expanded to cover 18+ secret fields (16 tests) - Web UI Token Dashboard: system.tokenUsage endpoint + Usage page with summary cards - xAI (Grok) Provider: OpenAI-compatible client with model pricing - Voyage AI Embeddings: new embedding provider with configurable dimensions (5 tests) - Update gap analysis: 90→95 match (70%→74%), Tier 3 section marked DONE - Update state.json: test count 1001→1034, add tier3_completion entry Total: 1034 tests passing across 85 files, typecheck clean	2026-02-09 10:32:57 -08:00
William Valentin	1d126cddfb	feat: add Zhipu AI (GLM) model provider support Adds zhipuai as a new provider using the OpenAI-compatible API at api.z.ai. Supports api_key config or ZHIPUAI_API_KEY env var, with optional endpoint override. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 09:55:13 -08:00
William Valentin	06438bb44f	feat: add Gmail Pub/Sub watcher for inbound email automation New ChannelAdapter that monitors Gmail via Google Cloud Pub/Sub push notifications with polling fallback. Supports OAuth2 auth, configurable watch labels, template rendering with email metadata placeholders (from, to, subject, snippet, date, id, labels). Wired into daemon lifecycle and gateway (POST /gmail/push endpoint). Includes 16 tests covering auth, templates, push notifications, and channel routing.	2026-02-07 15:39:24 -08:00
William Valentin	88731a50e3	feat: add heartbeat monitor and vector memory search (Tier 2) Heartbeat: - HeartbeatMonitor with 5 checks: gateway, model, channels, memory, disk - Configurable interval, failure threshold, notification channel - Recovery notifications when health restores - 25 new tests Vector Memory Search: - EmbeddingProvider interface with OpenAI, Gemini, Ollama, LlamaCpp backends - SQLite-backed VectorStore with cosine similarity search - Text chunker with paragraph-aware splitting and overlap - HybridSearch merging keyword + vector results with configurable weight - Background indexer with dirty-namespace tracking - Graceful fallback to keyword search when embeddings unavailable - 51 new tests Config: automation.heartbeat + memory.embedding schema sections Total: 950 tests passing, all types clean	2026-02-07 14:45:11 -08:00
William Valentin	b50c140d25	feat: add Docker support and inbound webhooks (Tier 2) - Dockerfile: multi-stage build (node:22-alpine), better-sqlite3 native deps handled - .dockerignore + docker-compose.yml for deployment - FLYNN_DATA_DIR env var support in daemon, CLI, and TUI - WebhookHandler: ChannelAdapter for HTTP POST /webhooks/:name - Per-webhook HMAC auth, template rendering ({{body}}, {{json.field}}) - Config schema: automation.webhooks array with name/secret/message/output - Gateway routes webhook requests before static files (bypasses gateway auth) - 23 new tests for webhook functionality, 874 total tests passing	2026-02-07 14:36:05 -08:00
William Valentin	b322e8f29c	fix: GitHub Copilot fallback — remove stale API version header and fix model name mapping Two issues prevented the GitHub Models fallback from working: 1. The X-GitHub-Api-Version: 2022-11-28 header caused '400 invalid apiVersion' errors. The Copilot chat completions endpoint does not use this header — removed from both constructor and rebuildClient. 2. The anthropicToGitHubModel mapping was incomplete: it only knew three models and the generic date-stripping fallback produced wrong names (e.g. 'claude-sonnet-4-5' instead of 'claude-sonnet-4.5'). GitHub Copilot uses dots for sub-versions, not hyphens. Updated with explicit mappings for all current models (sonnet 4, 4.5; opus 4, 4.5, 4.6; haiku 4.5) and a smarter generic fallback that converts digit-hyphen-digit to digit.digit at the end. 3. createClientFromConfig now auto-maps Anthropic-style model names when the provider is 'github', so users can copy model names from their Anthropic config into fallback blocks without manual renaming.	2026-02-07 14:04:54 -08:00
William Valentin	e12eb3a0be	fix: TUI now uses shared model router with auto-fallback support The TUI was building its own ModelRouter with a duplicated client factory that lacked auto same-model fallback, local_providers resolution, retry config, and per-tier fallback logic. When Anthropic failed, it skipped GitHub Models and fell straight to the local Ollama model. Replace the duplicated ~50-line createClient + router setup in tui.ts with a single call to the daemon's createModelRouter(), which already handles all of these correctly. This removes ~50 lines of duplicated code and ensures TUI and daemon have identical fallback behavior.	2026-02-07 13:58:34 -08:00
William Valentin	5984c42bfd	feat: auto same-model fallback via GitHub Models when primary Anthropic provider fails When a tier uses the Anthropic provider and has no user-configured inline fallback, automatically insert a GitHub Models client for the equivalent model as a tier fallback. This ensures the same model is tried via an alternative provider before degrading to the global fallback chain (which may be a much weaker local model). Mapping: claude-sonnet-4-20250514 → claude-sonnet-4, etc.	2026-02-07 13:52:53 -08:00
William Valentin	1c2f54fae3	feat: implement tier 1 quick wins (tool groups, typing, pruning, verbose, think) Five additive features with no breaking changes: - Tool groups: group:fs, group:runtime, group:web, group:memory syntactic sugar for allow/deny lists in tool policy config - Typing indicators: Discord sendTyping() and WhatsApp sendStateTyping() on message receipt for better UX feedback - Session pruning: TTL-based auto-cleanup via sessions.ttl config with hourly daemon timer and SQLite GROUP BY pruning - /verbose command: TUI command parser toggle for raw streaming display - !!think prefix: per-message extended thinking mode wired through Anthropic (budget_tokens), OpenAI/GitHub (reasoning_effort), and Gemini (thinkingConfig) providers Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 13:35:00 -08:00
William Valentin	f0e3987d1c	feat: wire per-tier fallbacks in daemon model router setup Reads the optional fallback field from each tier's config and builds a tierFallbacks map passed to ModelRouter at startup. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 12:10:17 -08:00
William Valentin	22230a3e3f	feat: add web UI dashboard SPA with dashboard, chat, sessions, and settings pages - Add SPA shell with hash-based router, sidebar navigation, and WebSocket RPC client - Add dashboard page with system health cards, channel status, and auto-refresh - Add chat page with session selector, streaming tool events, and markdown rendering - Add sessions page with list, history viewer, and delete functionality - Add settings page with hook pattern editor, tool list, and config viewer - Add backend handlers: sessions.delete, sessions.switch, system.channels, system.usage - Wire channelRegistry into gateway server for channel status reporting - Extend static file server with .mjs, .png, .ico, .woff2 content types	2026-02-07 10:07:45 -08:00
William Valentin	2a962abcd0	feat: add audio transcription pipeline for voice messages Adds Whisper-compatible audio transcription via configurable endpoint. New functions: isSupportedAudio(), mimeToExtension(), transcribeAudio(), buildUserMessageWithAudio(). Config schema gains audio section with transcription_endpoint, api_key, and model. Daemon wires transcription into the message router. Channel adapters extract audio from voice/audio messages (Telegram voice+audio, Discord audio/, Slack audio/, WhatsApp ptt+audio). Includes 57 media tests (was 25, now covers all audio paths).	2026-02-07 09:09:13 -08:00
William Valentin	d4530a7034	feat: add runtime provider/model switching via /model <tier> <provider/model> - ModelRouter: add setClient(), labels map, getLabel(), getAllLabels() - TUI commands: parse /model <tier> <provider/model> syntax with autocompletion - TUI minimal: handle provider switching via createClientFromConfig factory - Daemon: wire initial labels into router config - Fix /model alias mappings (opus=complex, sonnet=default, haiku=fast) - Add design doc and update state.json with feature status	2026-02-06 23:42:14 -08:00
William Valentin	73fc5d173d	feat: add auto-login for GitHub Copilot when no token is available GitHubModelsClient now lazily resolves tokens at first API call. If no token exists (env var, stored OAuth, or config), it triggers the OAuth device flow automatically via an onLoginRequired callback wired in both the TUI and daemon entry points.	2026-02-06 22:33:48 -08:00
William Valentin	f363717f5f	feat: add GitHub Copilot model provider with OAuth device flow Add a new 'github' model provider backed by the Copilot API (api.githubcopilot.com), with OAuth device flow for authentication. - New src/auth/github.ts: device flow login, token storage at ~/.config/flynn/auth.json with 0600 permissions - New src/models/github.ts: OpenAI-compatible client with streaming, tool calling, and Copilot-specific headers - Add 'github' to provider enum in config schema - Register provider in daemon factory and TUI client factory - Refactor TUI to use provider-agnostic client factory (was hardcoded to AnthropicClient for all tiers) - Add /login command to TUI for interactive OAuth authorization - Add Copilot model cost tracking entries	2026-02-06 22:26:52 -08:00
William Valentin	a515912537	feat: add multimodal media pipeline for image support across all providers and channels Widen Message.content from string to string \| MessageContentPart[] to support multimodal content. Add Attachment type to channel layer, media conversion utilities, and image extraction to all channel adapters (Telegram, Discord, Slack, WhatsApp). Update all model clients (Anthropic, OpenAI, Gemini, Bedrock) to convert structured content to provider-specific formats. Fix downstream consumers (tokens, compaction, TUI, local models) to handle the widened type via getMessageText() helper.	2026-02-06 17:17:21 -08:00
William Valentin	880744846f	feat: wire new providers, auth, mention-gating, and browser into daemon Update config schema with server auth fields (token, tailscale_identity, auth_http), channel mention settings, browser config, and openrouter/bedrock provider enum values. Wire GeminiClient, BedrockClient, OpenRouter into createClientFromConfig. Initialize BrowserManager and register browser tools in daemon startup. Pass auth config and channel mention settings through to gateway and adapters. Add puppeteer-core, @google/generative-ai, and @aws-sdk/client-bedrock-runtime dependencies.	2026-02-06 16:52:18 -08:00
William Valentin	4dfa242716	feat: wire Docker sandboxing and agent routing into daemon	2026-02-06 16:04:14 -08:00
William Valentin	ee0af0cc06	feat: add tool allow/deny profiles with per-agent and per-provider filtering Implements configurable tool filtering with four built-in profiles (minimal, messaging, coding, full), global and per-agent/per-provider allow/deny lists with glob pattern support, and defense-in-depth enforcement at both tool listing and execution time. New: src/tools/policy.ts (ToolPolicy engine), src/tools/policy.test.ts (37 tests) Modified: config schema, tool registry, tool executor, NativeAgent, AgentOrchestrator, daemon wiring, gateway tool handler, test mocks	2026-02-06 15:30:34 -08:00
William Valentin	4316dbd3be	feat: add P2 features — retry policy, prompt templating, usage tracking, tech debt cleanup - Extract shared splitMessage() into channels/utils.ts (dedup 4 adapters) - Add Slack user name resolution with caching (users.info API) - Add withRetry() with exponential backoff + jitter, isRetryable() filter - Wire retry config into ModelRouter.chat() (non-streaming only) - Add assembleSystemPrompt() multi-file template system (SOUL/AGENTS/IDENTITY/USER/TOOLS.md) - Add usage tracking accumulators in NativeAgent + AgentOrchestrator - Add estimateCost() with per-model pricing table - Add /usage TUI command with full usage report formatting - Add retrySchema and promptSchema to config schema Tests: 569 passing, typecheck clean	2026-02-06 15:12:35 -08:00
William Valentin	de68deb1b2	feat: add WhatsApp channel adapter (Phase 3c)	2026-02-06 14:42:07 -08:00
William Valentin	7a35b22458	feat: wire up all Phase 2-6 features into daemon and config Integrate all new features into the shared infrastructure: - Config schema: add memory, discord, slack, process, web_search schemas - Daemon wiring: memory store init, tool registration, channel adapters - Orchestrator: memory injection into system prompt, extraction on compaction - Agent: add setSystemPrompt() for dynamic prompt updates - Channel/tool index: export new adapters and tool factories - Add @slack/bolt, discord.js, turndown, linkedom, @mozilla/readability deps - Update state.json with Phase 3b completion (494 tests passing)	2026-02-06 14:24:39 -08:00
William Valentin	306e11bd2e	feat: add multi-model delegation (Phase 0) and context compaction (Phase 1) Phase 0 — Multi-Model Delegation: - AgentOrchestrator wraps NativeAgent with delegate() for stateless single-turn calls to any model tier (fast/default/complex/local) - DelegationConfig maps task types (compaction, classification, etc.) to model tiers - Delegation prompts for compaction, memory extraction, classification, and tool summarisation - Per-tier usage tracking for cost visibility - Config schema: agents.delegation and agents.primary_tier Phase 1 — Context Compaction: - Token estimation (char/4 heuristic) with context window lookup - shouldCompact() threshold check against context window percentage - compactHistory() splits old/recent messages, delegates summary to fast tier, returns CompactionResult - Automatic compaction in AgentOrchestrator.process() when configured - Force-compact via orchestrator.compact() with session persistence - Session.replaceHistory() with atomic SQLite transaction - /compact TUI command with feedback on compacted token counts - Config schema: compaction.enabled, threshold_pct, keep_turns, summary_max_tokens Tests: 385 passing across 50 files (22 new tests in 2 new test files)	2026-02-06 13:17:02 -08:00
William Valentin	e4b7f96d33	fix: provider-aware model routing with fallback visibility - Extract createClientFromConfig() to dispatch on provider field instead of hardcoding all tiers as AnthropicClient - Add fallback/fallbackReason metadata to ChatResponse and ChatStreamEvent so callers know when a fallback model was used - Enhance doctor check to report full model stack and warn on missing API keys for cloud providers - Log fallback warnings in NativeAgent and display them in TUI - Support tier names and local_providers entries in fallback_chain - Add 8 tests for createClientFromConfig covering all provider types	2026-02-06 09:58:56 -08:00
William Valentin	ba15b36e49	feat(daemon): wire CronScheduler into channel registry Registers CronScheduler as a channel adapter when automation.cron jobs are configured, enabling scheduled message delivery through the agent pipeline.	2026-02-05 22:23:03 -08:00
William Valentin	7c41ffad71	feat: add skills system for extensible capability packages Implement a three-tier skill system (bundled/managed/workspace) that extends Flynn's abilities via SKILL.md instructions injected into the system prompt. - SkillManifest/Skill types with requirements gating (OS, binaries, env) - Loader: discovers skills from directories, validates manifests, checks system requirements, infers manifest from SKILL.md if missing - SkillRegistry: holds skills, generates system prompt additions, supports override by name (workspace > managed > bundled) - SkillInstaller: copies/removes skills in managed directory with upgrade support - Config: add skills.workspace_dir, managed_dir, bundled_dir options - Daemon: loads all skills at startup, injects available skill instructions into the system prompt - Tests: 45 new tests (loader 22, registry 11, installer 12)	2026-02-05 20:20:03 -08:00
William Valentin	cd839c7f0c	feat: add MCP integration for external tool servers Implement Model Context Protocol (MCP) support so Flynn can spawn MCP server processes, discover their tools, and make them available to the agent alongside builtin tools. - McpClient: wraps @modelcontextprotocol/sdk with StdioClientTransport for process lifecycle, tool discovery (listTools), and invocation (callTool) - McpManager: lifecycle management for multiple MCP servers with startAll/stopAll/restart, tool bridging into ToolRegistry - Bridge: converts MCP tools to Flynn Tool interface with mcp:<server>:<tool> namespacing to avoid collisions with builtin tools - Config: add env and cwd fields to mcp server schema - ToolRegistry: add unregister() method for MCP server cleanup - Daemon: wire McpManager into startup and shutdown lifecycle - Tests: 28 new tests (bridge, manager, registry unregister)	2026-02-05 20:10:37 -08:00
William Valentin	aa95f2132c	feat: add channel adapter abstraction with Telegram and WebChat adapters Implement Phase 3 channel adapters that decouple message sources from the agent via a uniform ChannelAdapter interface and ChannelRegistry. - Add ChannelAdapter/InboundMessage/OutboundMessage types - Add ChannelRegistry for adapter lifecycle and message routing - Add TelegramAdapter (grammy bot, auth middleware, confirmations, chunking) - Add WebChatAdapter (thin shim over GatewayServer) - Refactor daemon to use ChannelRegistry with per-channel-per-user agents - Add config.get/config.patch gateway handlers (Phase 2 loose end) - Add system.restart gateway handler (Phase 2 loose end) - Add implementation plans and design docs Tests: 225 passing (33 new channel adapter + gateway handler tests)	2026-02-05 20:00:36 -08:00
William Valentin	f30a8bc318	feat(gateway): add WebSocket gateway with JSON-RPC protocol and auth Phase 2 of the Flynn roadmap. Adds a WebSocket gateway server that starts alongside the Telegram bot, providing real-time API access to the agent, sessions, and tools. Protocol: JSON-RPC-like (request/response/event) over WebSocket. 8 methods: agent.send, agent.cancel, sessions.list, sessions.history, sessions.create, tools.list, tools.invoke, system.health. Auth: Bearer token + Tailscale identity header support. Session bridge: per-connection agent instances with shared model router. New files: src/gateway/ (protocol, router, server, auth, session-bridge, handlers for agent/sessions/tools/system). 57 new tests (181 total), typecheck clean.	2026-02-05 19:11:25 -08:00
William Valentin	b9601b50ab	feat(daemon): wire tool registry and executor into agent	2026-02-05 17:49:32 -08:00
William Valentin	b00706325b	feat: add tool framework foundation (types, registry, executor, shell tool, model types, SOUL.md) - Task 0: SOUL.md + loadSystemPrompt() in daemon - Task 1: Tool type definitions (Tool, ToolCall, ToolResult, etc.) - Task 2: ToolRegistry with Anthropic/OpenAI serialization - Task 3: ToolExecutor with hooks, timeout, truncation - Task 4: shell.exec builtin tool - Task 8: Model types updated for tool use (ToolDefinition, ModelToolCall, etc.) - Task 15: Model index exports for tool types	2026-02-05 17:39:40 -08:00
William Valentin	5558687ab9	feat: wire up num_gpu config and updated clients to daemon and TUI	2026-02-05 15:51:30 -08:00
William Valentin	d86710577d	feat: wire up LlamaCppClient to model router Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-05 13:20:20 -08:00
William Valentin	f891c7aee8	fix: add API key/auth token support across all model clients	2026-02-05 10:56:40 -08:00
William Valentin	fb7575f850	refactor: integrate SessionManager into daemon and agent Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-05 00:43:09 -08:00
William Valentin	6e6c263e14	feat: integrate model router, session persistence, and hook engine - NativeAgent now loads/saves messages to SessionStore - Daemon creates ModelRouter with fallback chain support - Telegram bot handles confirmation callbacks from HookEngine - Session data stored in ~/.local/share/flynn/sessions.db Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-05 00:05:42 -08:00
William Valentin	b6b85f07d0	feat: wire daemon, agent, and telegram bot together Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-02 21:02:50 -08:00
William Valentin	70c3960527	feat: add daemon skeleton with lifecycle management Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-02 20:56:45 -08:00

46 Commits