will/flynn

Author	SHA1	Message	Date
William Valentin	46538e71a8	chore(lint): restore zero-error eslint baseline	2026-02-15 22:25:29 -08:00
William Valentin	2eccd3e8eb	test(tools): verify timeout abort prevents post-timeout side effects	2026-02-15 22:17:49 -08:00
William Valentin	b4006e91ff	feat(tools): extend cancellation to browser, web, and process tools	2026-02-15 22:12:03 -08:00
William Valentin	2cdfb66071	feat(tools): propagate timeout abort signals to tool execution	2026-02-15 22:05:43 -08:00
William Valentin	4cdad8eee9	fix(tools): clear timeout timers and update audit state	2026-02-15 21:44:40 -08:00
William Valentin	81c97a9df1	feat(memory): add experimental qmd search backend	2026-02-15 19:33:43 -08:00
William Valentin	ab89378fce	feat(security): enforce elevated mode and sandbox execution	2026-02-15 17:02:05 -08:00
William Valentin	735f14d0b1	feat(security): add /elevate command and audit events	2026-02-15 16:56:49 -08:00
William Valentin	3451df41b9	feat(tools): enforce skill capabilities and secret scopes	2026-02-15 10:16:51 -08:00
William Valentin	955b9e28e0	feat: add OpenAI OAuth, strict model overrides, and Gmail pull mode	2026-02-13 14:55:40 -08:00
William Valentin	9f81c01603	feat(session): persist model tier overrides per session Store per-session config in SQLite and route /model and /reset through command fast-paths so channel sessions keep independent model selection across reconnects and restarts.	2026-02-13 01:04:26 -08:00
William Valentin	90ce622080	feat(policy): enforce truthfulness and autonomy guardrails Add runtime truthfulness modes and autonomy-level tool gating with audit metadata for overrides/denials. Wire policy through prompt assembly, tool execution context, and daemon/gateway agent paths; update tests and planning state for Phase 3 PR #2 completion.	2026-02-12 16:06:45 -08:00
William Valentin	0b44adbaea	fix(audio): add SSRF protection, MIME type fix, and tests for audio-transcribe tool - Add URL validation blocking localhost, private IPs, and non-http protocols - Use response Content-Type header instead of hardcoded audio/wav for URL downloads - Add 25 tests covering validation, SSRF, config errors, transcription paths, and error handling Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 21:57:45 -08:00
William Valentin	a875bcc4ae	feat(audio): add audio.transcribe tool with Whisper-compatible API support - Add createAudioTranscribeTool with OpenAI/Groq/Ollama/llama.cpp provider support - Refactor audio config schema to nested audio.enabled + audio.provider structure - Move audio tool registration to initTools() for conditional enablement - Fix duplication bug in audio-transcribe.ts URL download handler - Support base64 data and URL-based audio input with format detection	2026-02-11 18:13:19 -08:00
William Valentin	d62e836b5d	feat(audit): Add core audit logging infrastructure - Add AuditLogger class with rotation support - Add audit configuration to config schema - Instrument tool execution with full audit logging - Instrument session lifecycle (create, message, delete, transfer, compact) - Add audit logger initialization in daemon - Add cron scheduler audit logging Audit events captured: - tool.start/success/error/denied - session.create/message/delete/transfer/compact - cron.trigger/add/remove All logs go to ~/.local/share/flynn/audit.log (JSON lines) with rotation (10MB files, 30-day retention)	2026-02-11 15:58:07 -08:00
William Valentin	6090508bad	style: auto-fix ESLint issues (curly braces and formatting) - Add curly braces to all if/else/for/while statements - Fix indentation and trailing spaces - Auto-fixed 372 linting errors using eslint --fix - Remaining issues are warnings only (non-null assertions, explicit any types)	2026-02-11 10:30:24 -08:00
William Valentin	5270234bbb	feat: improve tool usage guidance in SOUL.md and add cron.create/cron.delete tools - SOUL.md: list all available tools (web.search, memory.*, cron.*, etc.) and add Tool Usage Rules section enforcing 'act, don't narrate' - cron.ts: add getJob(), addJob(), removeJob() to CronScheduler for runtime (ephemeral) cron job management - cron tools: add cron.create and cron.delete tools, enhance cron.list to show schedule/output/message details - policy.ts: add cron tools to messaging and coding profiles, add group:cron to tool groups Fixes issue where models would narrate tool intent ('let me search...') then stop without actually calling tools.	2026-02-11 09:32:36 -08:00
William Valentin	eea7ca62a8	chore: increase GmailWatcher default poll interval from 60s to 300s	2026-02-11 08:43:48 -08:00
William Valentin	4ce8e81c01	fix(gmail): sanitize HTML entities and tags in tool output Gmail API returns snippets with HTML entities (&amp;, &#39;, <br>, etc.) that leaked into LLM responses as raw HTML. Added shared sanitizeHtml() utility in src/utils/html.ts and applied it to gmail tool snippets, HTML body fallback, and gmail watcher snippets.	2026-02-10 16:30:14 -08:00
William Valentin	f204ff1dd7	feat(tools): add Google Docs, Drive, and Tasks read-only tools Add three new Google service integrations following the established Gmail/GCal pattern: - Google Docs (docs.list, docs.search, docs.read): list, search, and read document content as plain text via Docs + Drive APIs - Google Drive (drive.list, drive.search, drive.read): list, search, and read files with export support for Workspace files (Docs→text, Sheets→CSV, Slides→text) - Google Tasks (tasks.lists, tasks.list): list task lists and tasks with status, due dates, and notes Each service has its own config section, OAuth auth command, tool policy group, and test suite (53 new tests). The setup wizard now offers to configure all Google services together and run OAuth auth flows automatically after saving config. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-10 12:59:15 -08:00
William Valentin	50471d63af	feat(tools): add gmail.read tool for full email content The existing gmail.list and gmail.search tools only return snippets. gmail.read fetches the full message by ID using format: 'full', decodes base64url body parts (preferring text/plain, falling back to stripped HTML), and returns headers + body text. This enables workflows like searching for invoices and extracting amounts from the full content. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-10 12:01:49 -08:00
William Valentin	94264e848c	feat(tools): add Google Calendar tools and register Gmail/GCal in daemon Add calendar.today, calendar.list, calendar.search tools mirroring the Gmail tool pattern. Includes gcal-auth CLI command, config schema, tool policy entries (messaging/coding profiles + group:gcal), and 17 tests. Also wires up gmail and gcal tool registration in the daemon and TUI. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-10 11:40:53 -08:00
William Valentin	8bf88049bf	feat: add runtime context awareness — system.info tool + date/time in system prompt - assembleSystemPrompt() now injects '# Runtime Context' with current date/time - New system.info tool: date, time, hostname, platform, arch, uptime, memory, Node.js version - Tool available in all profiles (minimal/messaging/coding/full) - 983 tests passing (+7 new)	2026-02-07 16:22:17 -08:00
William Valentin	131d23989c	feat: add file.patch tool for multi-hunk structured patches Implements apply_patch equivalent: a single tool call can make multiple line-based edits (replacements, insertions, deletions) across one or more files. Hunks are applied bottom-up to preserve line numbers. Includes 10 tests covering replacement, multi-hunk, insertion, deletion, multi-file, overlapping hunks error, OOB error, and edge cases.	2026-02-07 15:39:15 -08:00
William Valentin	88731a50e3	feat: add heartbeat monitor and vector memory search (Tier 2) Heartbeat: - HeartbeatMonitor with 5 checks: gateway, model, channels, memory, disk - Configurable interval, failure threshold, notification channel - Recovery notifications when health restores - 25 new tests Vector Memory Search: - EmbeddingProvider interface with OpenAI, Gemini, Ollama, LlamaCpp backends - SQLite-backed VectorStore with cosine similarity search - Text chunker with paragraph-aware splitting and overlap - HybridSearch merging keyword + vector results with configurable weight - Background indexer with dirty-namespace tracking - Graceful fallback to keyword search when embeddings unavailable - 51 new tests Config: automation.heartbeat + memory.embedding schema sections Total: 950 tests passing, all types clean	2026-02-07 14:45:11 -08:00
William Valentin	1c2f54fae3	feat: implement tier 1 quick wins (tool groups, typing, pruning, verbose, think) Five additive features with no breaking changes: - Tool groups: group:fs, group:runtime, group:web, group:memory syntactic sugar for allow/deny lists in tool policy config - Typing indicators: Discord sendTyping() and WhatsApp sendStateTyping() on message receipt for better UX feedback - Session pruning: TTL-based auto-cleanup via sessions.ttl config with hourly daemon timer and SQLite GROUP BY pruning - /verbose command: TUI command parser toggle for raw streaming display - !!think prefix: per-message extended thinking mode wired through Anthropic (budget_tokens), OpenAI/GitHub (reasoning_effort), and Gemini (thinkingConfig) providers Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 13:35:00 -08:00
William Valentin	6bb424cddc	feat: add agent tools and sanitize tool names for Anthropic API Add 8 new agent-callable tools (sessions.list/history/create/delete, agents.list, message.send, cron.list/trigger) and sanitize tool names at the API boundary (dots → underscores) to comply with Anthropic's `^[a-zA-Z0-9_-]{1,128}` requirement. Reverse-maps sanitized names back to internal names for hook callbacks and tool execution. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 12:23:09 -08:00
William Valentin	b9bfee9c5b	feat: add outbound attachment support with media.send tool Introduces OutboundAttachment type on OutboundMessage, an OutboundAttachmentCollector (push/drain pattern), and a media.send tool that queues files for outbound delivery. Each channel adapter (Telegram, Discord, Slack, WhatsApp) sends attachments after the text reply. Includes 15 tests for collector and tool.	2026-02-07 09:09:00 -08:00
William Valentin	1e6f6bb5a4	feat: add image.analyze tool for vision model analysis Provides a factory createImageAnalyzeTool(modelClient) that sends images to a vision-capable model and returns a textual analysis. Includes 15 tests covering base64, URL, multi-image, error, and edge cases.	2026-02-07 09:08:53 -08:00
William Valentin	8c56a5a1a8	feat: add Chrome DevTools Protocol browser tools Add BrowserManager (puppeteer-core) with page pool and auto-detection of Chrome/Chromium. Six tools: browser.navigate, browser.screenshot, browser.click, browser.type, browser.content, browser.eval. Feature is opt-in (browser.enabled defaults to false). Add to coding tool profile. Includes 22 unit tests for manager and all tools.	2026-02-06 16:52:03 -08:00
William Valentin	1314ac0163	feat: add ToolRegistry.clone() and replace() for per-session registries	2026-02-06 15:58:19 -08:00
William Valentin	ee0af0cc06	feat: add tool allow/deny profiles with per-agent and per-provider filtering Implements configurable tool filtering with four built-in profiles (minimal, messaging, coding, full), global and per-agent/per-provider allow/deny lists with glob pattern support, and defense-in-depth enforcement at both tool listing and execution time. New: src/tools/policy.ts (ToolPolicy engine), src/tools/policy.test.ts (37 tests) Modified: config schema, tool registry, tool executor, NativeAgent, AgentOrchestrator, daemon wiring, gateway tool handler, test mocks	2026-02-06 15:30:34 -08:00
William Valentin	7a35b22458	feat: wire up all Phase 2-6 features into daemon and config Integrate all new features into the shared infrastructure: - Config schema: add memory, discord, slack, process, web_search schemas - Daemon wiring: memory store init, tool registration, channel adapters - Orchestrator: memory injection into system prompt, extraction on compaction - Agent: add setSystemPrompt() for dynamic prompt updates - Channel/tool index: export new adapters and tool factories - Add @slack/bolt, discord.js, turndown, linkedom, @mozilla/readability deps - Update state.json with Phase 3b completion (494 tests passing)	2026-02-06 14:24:39 -08:00
William Valentin	6d9e27a591	feat: enhance web-fetch with HTML-to-markdown extraction (Phase 6) Add turndown + readability for clean content extraction: - HTML-to-markdown conversion with smart article extraction - Format parameter (markdown/text/html) - Response caching for repeated fetches - 10 tests	2026-02-06 14:24:28 -08:00
William Valentin	6af26f407c	feat: add web search and background process tools (Phases 4-5) Phase 4 - Web search tool: - Brave Search API + SearXNG fallback - Configurable provider, max results - 14 tests Phase 5 - Background process management: - ProcessManager with start/status/output/kill/list tools - Configurable max concurrent, max runtime, buffer size - 28 tests	2026-02-06 14:24:23 -08:00
William Valentin	2e1071230a	feat: add persistent memory system (Phase 2) Implement file-based persistent memory with read/write/search tools: - MemoryStore with namespace-scoped JSON storage - memory-read, memory-write, memory-search builtin tools - Auto-extraction of facts during context compaction - Configurable via memory.enabled, memory.dir, memory.max_context_tokens	2026-02-06 14:23:59 -08:00
William Valentin	cd839c7f0c	feat: add MCP integration for external tool servers Implement Model Context Protocol (MCP) support so Flynn can spawn MCP server processes, discover their tools, and make them available to the agent alongside builtin tools. - McpClient: wraps @modelcontextprotocol/sdk with StdioClientTransport for process lifecycle, tool discovery (listTools), and invocation (callTool) - McpManager: lifecycle management for multiple MCP servers with startAll/stopAll/restart, tool bridging into ToolRegistry - Bridge: converts MCP tools to Flynn Tool interface with mcp:<server>:<tool> namespacing to avoid collisions with builtin tools - Config: add env and cwd fields to mcp server schema - ToolRegistry: add unregister() method for MCP server cleanup - Daemon: wire McpManager into startup and shutdown lifecycle - Tests: 28 new tests (bridge, manager, registry unregister)	2026-02-05 20:10:37 -08:00
William Valentin	b686eb3fa3	test: add end-to-end tool integration tests Tests the full agent -> tool execution -> model feedback loop: - Shell tool execution with mock model - Multi-tool chaining (file.write -> file.read) across iterations - Verification that tool results are correctly passed back to model	2026-02-05 17:50:17 -08:00
William Valentin	c96165fb2f	feat(tools): add barrel exports and allBuiltinTools list	2026-02-05 17:40:13 -08:00
William Valentin	b00706325b	feat: add tool framework foundation (types, registry, executor, shell tool, model types, SOUL.md) - Task 0: SOUL.md + loadSystemPrompt() in daemon - Task 1: Tool type definitions (Tool, ToolCall, ToolResult, etc.) - Task 2: ToolRegistry with Anthropic/OpenAI serialization - Task 3: ToolExecutor with hooks, timeout, truncation - Task 4: shell.exec builtin tool - Task 8: Model types updated for tool use (ToolDefinition, ModelToolCall, etc.) - Task 15: Model index exports for tool types	2026-02-05 17:39:40 -08:00
William Valentin	32dd3ad728	feat(tools): add file read/write/edit/list builtin tools	2026-02-05 17:39:20 -08:00

41 Commits
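
Note on 6bb424cddc: the message describes sanitizing tool names at the API boundary (dots → underscores) and reverse-mapping them for hook callbacks and tool execution. A minimal sketch of that idea follows. It is not the repository's code: `buildNameMaps`, `API_NAME_PATTERN`, and the collision check are hypothetical, and the trailing `$` anchor on the pattern is an assumption (the commit message quotes the pattern without it).

```typescript
// Pattern as quoted in the commit message, with a trailing `$` added (assumption).
const API_NAME_PATTERN = /^[a-zA-Z0-9_-]{1,128}$/;

interface NameMaps {
  toApi: Map<string, string>;      // internal name -> sanitized API name
  toInternal: Map<string, string>; // sanitized API name -> internal name
}

// Hypothetical helper: builds forward and reverse maps for internal tool names.
function buildNameMaps(internalNames: string[]): NameMaps {
  const toApi = new Map<string, string>();
  const toInternal = new Map<string, string>();

  for (const name of internalNames) {
    // Dots are replaced with underscores, as the commit message describes.
    const apiName = name.replace(/\./g, "_");

    if (!API_NAME_PATTERN.test(apiName)) {
      throw new Error(`Tool name cannot be sanitized: ${name}`);
    }

    // Two internal names must not collapse to the same API name,
    // otherwise the reverse mapping would be ambiguous.
    const existing = toInternal.get(apiName);
    if (existing !== undefined && existing !== name) {
      throw new Error(`Name collision: ${existing} and ${name} both map to ${apiName}`);
    }

    toApi.set(name, apiName);
    toInternal.set(apiName, name);
  }

  return { toApi, toInternal };
}

// Usage with tool names taken from the commit message.
const maps = buildNameMaps(["sessions.list", "cron.trigger", "message.send"]);
const outbound = maps.toApi.get("sessions.list");     // name sent to the API
const inbound = maps.toInternal.get("cron_trigger");  // name used for execution
```

Keeping an explicit reverse map, rather than replacing underscores with dots on the way back, avoids corrupting internal names that already contain underscores.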