flynn

will/flynn

Author	SHA1	Message	Date
William Valentin	148219153e	feat(audio): add tests, token estimation, and config override for native audio - Add capabilities.test.ts (18 tests) for supportsAudioInput() - Add 15 audio tests to media.test.ts (hasAudio, stripAudioParts, attachmentToAudioSource) - Add estimateAudioTokens() to tokens.ts (base64→bytes→duration→tokens) - Update estimateMessageTokens() to include audio content parts - Add 5 audio token tests to tokens.test.ts - Add supports_audio config override to model schema - Wire supports_audio from tier config through routing to capability check Total tests: 1369 (was 1331, +38 audio-related)	2026-02-11 18:27:19 -08:00
William Valentin	a875bcc4ae	feat(audio): add audio.transcribe tool with Whisper-compatible API support - Add createAudioTranscribeTool with OpenAI/Groq/Ollama/llama.cpp provider support - Refactor audio config schema to nested audio.enabled + audio.provider structure - Move audio tool registration to initTools() for conditional enablement - Fix duplication bug in audio-transcribe.ts URL download handler - Support base64 data and URL-based audio input with format detection	2026-02-11 18:13:19 -08:00
William Valentin	d62e836b5d	feat(audit): Add core audit logging infrastructure - Add AuditLogger class with rotation support - Add audit configuration to config schema - Instrument tool execution with full audit logging - Instrument session lifecycle (create, message, delete, transfer, compact) - Add audit logger initialization in daemon - Add cron scheduler audit logging Audit events captured: - tool.start/success/error/denied - session.create/message/delete/transfer/compact - cron.trigger/add/remove All logs go to ~/.local/share/flynn/audit.log (JSON lines) with rotation (10MB files, 30-day retention)	2026-02-11 15:58:07 -08:00
William Valentin	eea7ca62a8	chore: increase GmailWatcher default poll interval from 60s to 300s	2026-02-11 08:43:48 -08:00
William Valentin	60b214e7c4	feat: add per-cron-job model tier selection Allow cron jobs to specify a `model_tier` field that controls which LLM tier handles the job, without needing separate agent configs. Precedence: cron job model_tier > agent config > global primary_tier > 'default'. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-10 22:31:18 -08:00
William Valentin	25482b8516	feat: sync PROVIDER_NAMES with config schema and update README docs Extract MODEL_PROVIDERS const from config schema as single source of truth for provider names. PROVIDER_NAMES in TUI commands now imports from schema instead of maintaining a hardcoded list. Adds tests verifying sync. Updates README TUI Commands section with /model hot-swap documentation, supported providers, and runtime model switching examples.	2026-02-10 21:26:18 -08:00
William Valentin	bf9ca690f3	fix(agent): detect repeated tool call loops and make max_iterations configurable Local LLMs often get stuck calling the same tool repeatedly because they lack the sophistication to synthesize results. The agent loop had no safeguard — it re-executed whatever the model requested up to 10 times. Add fingerprint-based loop detection: if the same tool+args combination repeats 3 consecutive times, break the loop and return the last results. Also add agents.max_iterations to the config schema so the iteration limit is user-configurable (default: 10).	2026-02-10 19:35:09 -08:00
William Valentin	f204ff1dd7	feat(tools): add Google Docs, Drive, and Tasks read-only tools Add three new Google service integrations following the established Gmail/GCal pattern: - Google Docs (docs.list, docs.search, docs.read): list, search, and read document content as plain text via Docs + Drive APIs - Google Drive (drive.list, drive.search, drive.read): list, search, and read files with export support for Workspace files (Docs→text, Sheets→CSV, Slides→text) - Google Tasks (tasks.lists, tasks.list): list task lists and tasks with status, due dates, and notes Each service has its own config section, OAuth auth command, tool policy group, and test suite (53 new tests). The setup wizard now offers to configure all Google services together and run OAuth auth flows automatically after saving config. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-10 12:59:15 -08:00
William Valentin	94264e848c	feat(tools): add Google Calendar tools and register Gmail/GCal in daemon Add calendar.today, calendar.list, calendar.search tools mirroring the Gmail tool pattern. Includes gcal-auth CLI command, config schema, tool policy entries (messaging/coding profiles + group:gcal), and 17 tests. Also wires up gmail and gcal tool registration in the daemon and TUI. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-10 11:40:53 -08:00
William Valentin	213dba855a	refactor: make telegram config optional for non-telegram setups Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-10 09:27:18 -08:00
William Valentin	35f4cab0dc	feat: add log-level system to suppress noisy fallback debug output Replace console.debug/log/warn calls in model router, retry, and daemon startup with a structured logger that respects a configurable log_level. Default level is 'info', suppressing verbose fallback debug messages in the TUI while keeping them available via config when needed. - Add src/logger.ts with debug/info/warn/error/silent levels - Wire log_level into config schema (default: 'info') - Initialize log level in both daemon and TUI startup paths - Convert all console.debug in router.ts and retry.ts to logger.debug - Convert console.log/warn in daemon/models.ts to logger.info/warn	2026-02-09 21:23:07 -08:00
William Valentin	c2cc052694	feat(02-01): implement deepMerge and overlay-aware loadConfig with tests - Add deepMerge utility for recursive object merging (arrays replace, not concat) - Extend loadConfig with optional overlayPath parameter - Merge happens before env var expansion and Zod validation - Add 6 deepMerge unit tests and 4 overlay integration tests - Re-export deepMerge from config/index.ts - All 1087 existing tests still pass	2026-02-09 20:56:29 -08:00
William Valentin	1e29da4da2	feat: complete DM pairing codes with channel adapters, gateway handlers, and TUI command (Tier 4 feature 4)	2026-02-09 18:28:10 -08:00
William Valentin	4413c4dc7c	feat: add gateway lock, shell completion, and tailscale serve (Tier 4 features 1-3)	2026-02-09 13:29:59 -08:00
William Valentin	9be8f76bc7	feat: implement Tier 3 features — lane queue, credential redaction, token dashboard, xAI, Voyage AI - Lane Queue: per-session FIFO queue in gateway replacing reject-when-busy (9 tests) - Credential Redaction: redactConfig() expanded to cover 18+ secret fields (16 tests) - Web UI Token Dashboard: system.tokenUsage endpoint + Usage page with summary cards - xAI (Grok) Provider: OpenAI-compatible client with model pricing - Voyage AI Embeddings: new embedding provider with configurable dimensions (5 tests) - Update gap analysis: 90→95 match (70%→74%), Tier 3 section marked DONE - Update state.json: test count 1001→1034, add tier3_completion entry Total: 1034 tests passing across 85 files, typecheck clean	2026-02-09 10:32:57 -08:00
William Valentin	1d126cddfb	feat: add Zhipu AI (GLM) model provider support Adds zhipuai as a new provider using the OpenAI-compatible API at api.z.ai. Supports api_key config or ZHIPUAI_API_KEY env var, with optional endpoint override. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 09:55:13 -08:00
William Valentin	06438bb44f	feat: add Gmail Pub/Sub watcher for inbound email automation New ChannelAdapter that monitors Gmail via Google Cloud Pub/Sub push notifications with polling fallback. Supports OAuth2 auth, configurable watch labels, template rendering with email metadata placeholders (from, to, subject, snippet, date, id, labels). Wired into daemon lifecycle and gateway (POST /gmail/push endpoint). Includes 16 tests covering auth, templates, push notifications, and channel routing.	2026-02-07 15:39:24 -08:00
William Valentin	88731a50e3	feat: add heartbeat monitor and vector memory search (Tier 2) Heartbeat: - HeartbeatMonitor with 5 checks: gateway, model, channels, memory, disk - Configurable interval, failure threshold, notification channel - Recovery notifications when health restores - 25 new tests Vector Memory Search: - EmbeddingProvider interface with OpenAI, Gemini, Ollama, LlamaCpp backends - SQLite-backed VectorStore with cosine similarity search - Text chunker with paragraph-aware splitting and overlap - HybridSearch merging keyword + vector results with configurable weight - Background indexer with dirty-namespace tracking - Graceful fallback to keyword search when embeddings unavailable - 51 new tests Config: automation.heartbeat + memory.embedding schema sections Total: 950 tests passing, all types clean	2026-02-07 14:45:11 -08:00
William Valentin	b50c140d25	feat: add Docker support and inbound webhooks (Tier 2) - Dockerfile: multi-stage build (node:22-alpine), better-sqlite3 native deps handled - .dockerignore + docker-compose.yml for deployment - FLYNN_DATA_DIR env var support in daemon, CLI, and TUI - WebhookHandler: ChannelAdapter for HTTP POST /webhooks/:name - Per-webhook HMAC auth, template rendering ({{body}}, {{json.field}}) - Config schema: automation.webhooks array with name/secret/message/output - Gateway routes webhook requests before static files (bypasses gateway auth) - 23 new tests for webhook functionality, 874 total tests passing	2026-02-07 14:36:05 -08:00
William Valentin	1c2f54fae3	feat: implement tier 1 quick wins (tool groups, typing, pruning, verbose, think) Five additive features with no breaking changes: - Tool groups: group:fs, group:runtime, group:web, group:memory syntactic sugar for allow/deny lists in tool policy config - Typing indicators: Discord sendTyping() and WhatsApp sendStateTyping() on message receipt for better UX feedback - Session pruning: TTL-based auto-cleanup via sessions.ttl config with hourly daemon timer and SQLite GROUP BY pruning - /verbose command: TUI command parser toggle for raw streaming display - !!think prefix: per-message extended thinking mode wired through Anthropic (budget_tokens), OpenAI/GitHub (reasoning_effort), and Gemini (thinkingConfig) providers Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 13:35:00 -08:00
William Valentin	c8c3c74fde	feat: add per-tier fallback field to model config schema Each model tier (fast, default, complex, local) can now specify an optional fallback provider config that the router will try before falling through to the global fallback chain. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 12:08:17 -08:00
William Valentin	2a962abcd0	feat: add audio transcription pipeline for voice messages Adds Whisper-compatible audio transcription via configurable endpoint. New functions: isSupportedAudio(), mimeToExtension(), transcribeAudio(), buildUserMessageWithAudio(). Config schema gains audio section with transcription_endpoint, api_key, and model. Daemon wires transcription into the message router. Channel adapters extract audio from voice/audio messages (Telegram voice+audio, Discord audio/, Slack audio/, WhatsApp ptt+audio). Includes 57 media tests (was 25, now covers all audio paths).	2026-02-07 09:09:13 -08:00
William Valentin	f363717f5f	feat: add GitHub Copilot model provider with OAuth device flow Add a new 'github' model provider backed by the Copilot API (api.githubcopilot.com), with OAuth device flow for authentication. - New src/auth/github.ts: device flow login, token storage at ~/.config/flynn/auth.json with 0600 permissions - New src/models/github.ts: OpenAI-compatible client with streaming, tool calling, and Copilot-specific headers - Add 'github' to provider enum in config schema - Register provider in daemon factory and TUI client factory - Refactor TUI to use provider-agnostic client factory (was hardcoded to AnthropicClient for all tiers) - Add /login command to TUI for interactive OAuth authorization - Add Copilot model cost tracking entries	2026-02-06 22:26:52 -08:00
William Valentin	880744846f	feat: wire new providers, auth, mention-gating, and browser into daemon Update config schema with server auth fields (token, tailscale_identity, auth_http), channel mention settings, browser config, and openrouter/bedrock provider enum values. Wire GeminiClient, BedrockClient, OpenRouter into createClientFromConfig. Initialize BrowserManager and register browser tools in daemon startup. Pass auth config and channel mention settings through to gateway and adapters. Add puppeteer-core, @google/generative-ai, and @aws-sdk/client-bedrock-runtime dependencies.	2026-02-06 16:52:18 -08:00
William Valentin	daf8cac3fe	feat: add sandbox, agent_configs, and routing config schemas	2026-02-06 15:48:55 -08:00
William Valentin	ee0af0cc06	feat: add tool allow/deny profiles with per-agent and per-provider filtering Implements configurable tool filtering with four built-in profiles (minimal, messaging, coding, full), global and per-agent/per-provider allow/deny lists with glob pattern support, and defense-in-depth enforcement at both tool listing and execution time. New: src/tools/policy.ts (ToolPolicy engine), src/tools/policy.test.ts (37 tests) Modified: config schema, tool registry, tool executor, NativeAgent, AgentOrchestrator, daemon wiring, gateway tool handler, test mocks	2026-02-06 15:30:34 -08:00
William Valentin	4316dbd3be	feat: add P2 features — retry policy, prompt templating, usage tracking, tech debt cleanup - Extract shared splitMessage() into channels/utils.ts (dedup 4 adapters) - Add Slack user name resolution with caching (users.info API) - Add withRetry() with exponential backoff + jitter, isRetryable() filter - Wire retry config into ModelRouter.chat() (non-streaming only) - Add assembleSystemPrompt() multi-file template system (SOUL/AGENTS/IDENTITY/USER/TOOLS.md) - Add usage tracking accumulators in NativeAgent + AgentOrchestrator - Add estimateCost() with per-model pricing table - Add /usage TUI command with full usage report formatting - Add retrySchema and promptSchema to config schema Tests: 569 passing, typecheck clean	2026-02-06 15:12:35 -08:00
William Valentin	de68deb1b2	feat: add WhatsApp channel adapter (Phase 3c)	2026-02-06 14:42:07 -08:00
William Valentin	7a35b22458	feat: wire up all Phase 2-6 features into daemon and config Integrate all new features into the shared infrastructure: - Config schema: add memory, discord, slack, process, web_search schemas - Daemon wiring: memory store init, tool registration, channel adapters - Orchestrator: memory injection into system prompt, extraction on compaction - Agent: add setSystemPrompt() for dynamic prompt updates - Channel/tool index: export new adapters and tool factories - Add @slack/bolt, discord.js, turndown, linkedom, @mozilla/readability deps - Update state.json with Phase 3b completion (494 tests passing)	2026-02-06 14:24:39 -08:00
William Valentin	306e11bd2e	feat: add multi-model delegation (Phase 0) and context compaction (Phase 1) Phase 0 — Multi-Model Delegation: - AgentOrchestrator wraps NativeAgent with delegate() for stateless single-turn calls to any model tier (fast/default/complex/local) - DelegationConfig maps task types (compaction, classification, etc.) to model tiers - Delegation prompts for compaction, memory extraction, classification, and tool summarisation - Per-tier usage tracking for cost visibility - Config schema: agents.delegation and agents.primary_tier Phase 1 — Context Compaction: - Token estimation (char/4 heuristic) with context window lookup - shouldCompact() threshold check against context window percentage - compactHistory() splits old/recent messages, delegates summary to fast tier, returns CompactionResult - Automatic compaction in AgentOrchestrator.process() when configured - Force-compact via orchestrator.compact() with session persistence - Session.replaceHistory() with atomic SQLite transaction - /compact TUI command with feedback on compacted token counts - Config schema: compaction.enabled, threshold_pct, keep_turns, summary_max_tokens Tests: 385 passing across 50 files (22 new tests in 2 new test files)	2026-02-06 13:17:02 -08:00
William Valentin	c607ff4a4f	feat(config): export CronJobConfig type from config index	2026-02-05 22:23:25 -08:00
William Valentin	e157bc6102	feat(config): add automation.cron schema for scheduled jobs	2026-02-05 22:12:12 -08:00
William Valentin	7c41ffad71	feat: add skills system for extensible capability packages Implement a three-tier skill system (bundled/managed/workspace) that extends Flynn's abilities via SKILL.md instructions injected into the system prompt. - SkillManifest/Skill types with requirements gating (OS, binaries, env) - Loader: discovers skills from directories, validates manifests, checks system requirements, infers manifest from SKILL.md if missing - SkillRegistry: holds skills, generates system prompt additions, supports override by name (workspace > managed > bundled) - SkillInstaller: copies/removes skills in managed directory with upgrade support - Config: add skills.workspace_dir, managed_dir, bundled_dir options - Daemon: loads all skills at startup, injects available skill instructions into the system prompt - Tests: 45 new tests (loader 22, registry 11, installer 12)	2026-02-05 20:20:03 -08:00
William Valentin	cd839c7f0c	feat: add MCP integration for external tool servers Implement Model Context Protocol (MCP) support so Flynn can spawn MCP server processes, discover their tools, and make them available to the agent alongside builtin tools. - McpClient: wraps @modelcontextprotocol/sdk with StdioClientTransport for process lifecycle, tool discovery (listTools), and invocation (callTool) - McpManager: lifecycle management for multiple MCP servers with startAll/stopAll/restart, tool bridging into ToolRegistry - Bridge: converts MCP tools to Flynn Tool interface with mcp:<server>:<tool> namespacing to avoid collisions with builtin tools - Config: add env and cwd fields to mcp server schema - ToolRegistry: add unregister() method for MCP server cleanup - Daemon: wire McpManager into startup and shutdown lifecycle - Tests: 28 new tests (bridge, manager, registry unregister)	2026-02-05 20:10:37 -08:00
William Valentin	dbf1acd822	feat: add streaming support and num_gpu option to Ollama client	2026-02-05 15:51:28 -08:00
William Valentin	0528b895b0	feat: add local_providers to config schema	2026-02-05 13:33:50 -08:00
William Valentin	f891c7aee8	fix: add API key/auth token support across all model clients	2026-02-05 10:56:40 -08:00
William Valentin	4adf172c25	feat: add config schema and loader with env var expansion	2026-02-02 20:54:19 -08:00

38 Commits