will/flynn

Files

T

William Valentin 51ff5523ae docs(plan): record microsoft teams gap completion

2026-02-16 02:00:19 -08:00

18 KiB

Raw Blame History

Flynn vs OpenClaw — Feature Gap Analysis

Date: 2026-02-06 Last updated: 2026-02-14 (synthetic model provider implemented) Purpose: Comprehensive comparison of Flynn's current implementation against OpenClaw's feature set, to guide prioritisation of future work.

Legend

MATCH — Flynn has equivalent functionality
PARTIAL — Flynn has some implementation but incomplete
MISSING — Not implemented in Flynn

1. Channels / Frontends

Feature	OpenClaw	Flynn	Status
Telegram	grammY bot	grammY bot (allowlists, mention gating, group support)	MATCH
WhatsApp	Baileys (WhatsApp Web)	whatsapp-web.js (allowlists, mention gating, groups)	MATCH
Discord	discord.js	discord.js (guild/channel allowlists, mention gating)	MATCH
Slack	Bolt SDK	Bolt SDK Socket Mode (channel allowlists, mention gating)	MATCH
Signal	signal-cli	Signal adapter	MATCH
iMessage / BlueBubbles	imsg + BlueBubbles	--	MISSING
Google Chat	Chat API	--	MISSING
Microsoft Teams	Bot Framework	Teams adapter	MATCH
Matrix	Extension	--	MISSING
Zalo / Zalo Personal	Extension	--	MISSING
WebChat	Gateway-served	Full WebSocket + SPA dashboard	MATCH
TUI (terminal)	`openclaw tui`	Minimal + Fullscreen (React/Ink)	MATCH
LINE / Feishu / Mattermost	Extensions/plugins	--	MISSING

Flynn has 6 of ~15 channels (Telegram, WhatsApp, Discord, Slack, WebChat, TUI).

2. Model Providers

Feature	OpenClaw	Flynn	Status
Anthropic (Claude)	Full + OAuth	Full (API key + auth token)	MATCH
OpenAI	Full + OAuth + Codex	Full (OpenAI SDK)	MATCH
Ollama (local)	Supported	Full (host, num_gpu config)	MATCH
Llama.cpp (local)	Supported	Full (endpoint, auth_token, context_window)	MATCH
Gemini / Google	Full provider	Full (Gemini SDK, vision support)	MATCH
OpenRouter	Supported	Full (via OpenAI-compatible client, custom baseURL)	MATCH
Amazon Bedrock	Supported	Full (Bedrock SDK, region/credentials)	MATCH
GitHub Models	Supported	Full (device flow auth, Codex models)	MATCH
Zhipu AI (GLM)	Supported	Full (OpenAI-compatible client, GLM models)	MATCH
MiniMax / Moonshot	Supported	--	MISSING
xAI (Grok)	Supported (v2026.2.6)	Full (OpenAI-compatible client, xai provider)	MATCH
Vercel AI Gateway	Supported	--	MISSING
Voyage AI embeddings	Supported (v2026.2.6)	Full (Voyage AI provider, configurable dimensions)	MATCH
Synthetic provider	Supported	Full (provider: `synthetic`, deterministic echo/fixed/error)	MATCH
OAuth subscription auth	Anthropic + OpenAI	API keys only	MISSING
Model failover chains	Full (fallback + rotation)	Full (configurable fallback chain + retry)	MATCH
Model tier routing	Per-agent, per-provider	default/fast/complex/local with per-agent override	MATCH
Provider-specific tool policy	Per-provider tool filtering	Full (per-provider allow/deny in tools config)	MATCH

3. Agent Runtime & Tools

Feature	OpenClaw	Flynn	Status
Tool loop with streaming	RPC mode + block streaming	Tool loop (max iterations, streaming)	MATCH
`exec` / shell	Full (background, pty, timeout, elevated)	Full (bash -c, configurable timeout, background via process tools)	MATCH
`read` / file read	Full (line ranges)	Full (line offset/limit)	MATCH
`write` / file write	Full	Full (auto-mkdir)	MATCH
`edit` / file edit	Full	Full (exact match, replace_all)	MATCH
`apply_patch`	Multi-hunk structured patches	Full (`file.patch` — multi-file, multi-hunk, bottom-up)	MATCH
`file.list` / glob	--	Full (glob filtering)	MATCH
`web_fetch`	Full (markdown/text extract, caching)	Full (HTML-to-markdown, readability, caching)	MATCH
`web.search`	Brave Search API	Full (Brave + SearXNG providers)	MATCH
Browser control	Full CDP (Chromium profiles, snapshots, actions)	Full CDP (Puppeteer, navigate/click/type/screenshot/evaluate)	MATCH
Lane Queue (serial exec)	Concurrency control for sessions	Full (per-session FIFO queue in gateway)	MATCH
Canvas / A2UI	Agent-driven visual workspace	--	MISSING
`process.*` tools	Background exec management (poll/log/write/kill)	Full (start/output/status/kill/list)	MATCH
`image.analyze` tool	Image analysis with configurable model	Full (multi-provider vision analysis)	MATCH
`message.send` tool	Cross-channel messaging + actions	Full (send to any registered channel)	MATCH
`cron.*` tools	Runtime cron management	Full (list/trigger)	MATCH
`sessions.*` tools	List/history/send/spawn across sessions	Full (list/history/create/delete)	MATCH
`agents.list` tool	Sub-agent discovery	Full (list configs with tiers/profiles)	MATCH
Tool profiles	minimal/coding/messaging/full	Full (4 profiles + per-agent + per-provider)	MATCH
Tool groups	`group:fs`, `group:runtime`, etc.	Full (group:fs, group:runtime, group:web, group:memory)	MATCH
Tool allow/deny lists	Global + per-agent + per-provider	Full (global + per-agent + per-provider allow/deny)	MATCH
System context / date awareness	--	`system.info` tool + automatic date/time injection in system prompt	MATCH

4. Session Management

Feature	OpenClaw	Flynn	Status
Session persistence	JSONL files	SQLite	MATCH (different storage)
Session isolation	Per-sender + group isolation	`{frontend}:{userId}` with agent config key	MATCH
Session transfer	Between channels	Between frontends	MATCH
Multi-agent routing	Isolated workspaces per agent	Full (AgentRouter + per-agent config/sandbox/tools)	MATCH
Session pruning	Tool result trimming (in-memory)	TTL-based auto-cleanup (configurable sessions.ttl)	MATCH
`/new` / `/reset`	Full	Full	MATCH
`/status`	Full (model + tokens + cost)	Full (model + tokens + cost)	MATCH

5. Context Window & Compaction

Feature	OpenClaw	Flynn	Status
Auto-compaction	Full (summarise older history)	Full (threshold-based, delegated to fast tier)	MATCH
Manual `/compact`	Full (with instructions)	Full (via command metadata)	MATCH
Pre-compaction memory flush	Silent agentic turn	Full (auto-extract memory before compaction)	MATCH
Token tracking	Full (per-response, cost)	Full (per-tier, per-call, estimated cost)	MATCH

6. Memory System

Feature	OpenClaw	Flynn	Status
Markdown memory files	`MEMORY.md` + daily logs	Namespace-based flat files (user/global/session)	MATCH
`memory.search` tool	Semantic vector search	Full (keyword search across namespaces)	MATCH
`memory.read` tool	Read memory files	Full (read by namespace)	MATCH
`memory.write` tool	Write memory files	Full (write/append to namespace)	MATCH
Vector embeddings	OpenAI/Gemini/local	Full (OpenAI, Gemini, Ollama, LlamaCpp providers)	MATCH
Hybrid search (BM25 + vector)	Full	Full (keyword + vector with configurable hybrid weight)	MATCH
QMD backend	Experimental	Full (experimental markdown-native backend configurable via `memory.qmd`)	MATCH

7. MCP (Model Context Protocol)

Feature	OpenClaw	Flynn	Status
MCP tool servers	Not emphasised	Full (stdio transport)	MATCH
MCP tool bridging	Not emphasised	Full (`mcp:{server}:{tool}`)	MATCH
MCP server lifecycle	Not emphasised	Full (start/stop/restart)	MATCH

Flynn actually has MCP support that OpenClaw doesn't emphasise — OpenClaw relies on its own native tool system and plugins instead.

8. Security & Safety

Feature	OpenClaw	Flynn	Status
Tool confirmation hooks	Full	Full (confirm/log/silent patterns)	MATCH
Chat ID allowlists	Per-channel	Full (Telegram, Discord, Slack, WhatsApp all have allowlists)	MATCH
DM pairing (unknown senders)	Full (pairing codes)	Full (PairingManager with TTL codes, SQLite persistence, channel adapter integration, gateway handlers, TUI /pair command)	MATCH
Credential redaction	Config responses redacted (v2026.2.6)	Full (18+ secret fields redacted from config API)	MATCH
Skill/plugin code safety scanner	Static analysis (v2026.2.6)	--	MISSING
Docker sandboxing	Full (per-session/agent/shared)	Full (per-agent sandbox via SandboxManager + Docker)	MATCH
Elevated mode	Host exec escape hatch	--	MISSING
Tool execution timeouts	Full (configurable)	Full (configurable per-process + shell)	MATCH
Output truncation	Full	51KB	MATCH
Gateway auth (token/password)	Full	Full (bearer token + Tailscale identity + HTTP auth)	MATCH

9. Automation & Scheduling

Feature	OpenClaw	Flynn	Status
Cron jobs	Full (runtime + config)	Full (YAML config + runtime trigger via tools)	MATCH
Webhooks	Full (inbound triggers)	Full (HTTP POST /webhooks/:name, HMAC auth, template rendering)	MATCH
Gmail Pub/Sub	Full	Full (OAuth2, Pub/Sub push + polling fallback, template rendering)	MATCH
Heartbeat	Full	Full (5 checks, configurable interval, failure threshold, notifications)	MATCH

10. Apps & Companion Devices

Feature	OpenClaw	Flynn	Status
macOS menu bar app	Full	--	MISSING
iOS node	Full (Canvas, Voice, Camera)	--	MISSING
Android node	Full (Canvas, Talk, Camera)	--	MISSING
Voice Wake / Talk Mode	Full (ElevenLabs)	--	MISSING
Camera / screen capture	Via nodes	--	MISSING
Location access	Via nodes	--	MISSING

11. Skills & Plugins

Feature	OpenClaw	Flynn	Status
Skills system	Bundled/managed/workspace	Full (bundled/managed/workspace tiers)	MATCH
Skill manifest	Full	Full (requirements, versioning, manifest.json)	MATCH
Skill installer	Registry install/upgrade/uninstall	Full (directory-based install/upgrade/uninstall)	MATCH
ClawHub registry	Community skill registry	--	MISSING
Workspace prompt injection	AGENTS.md, SOUL.md, TOOLS.md	Full (SOUL.md, AGENTS.md via prompt template system)	MATCH

12. Gateway & Infrastructure

Feature	OpenClaw	Flynn	Status
WebSocket control plane	Full	Full (JSON-RPC protocol, session bridge, handlers)	MATCH
Control UI (web dashboard)	Full	Full SPA (dashboard, chat, sessions, settings)	MATCH
Tailscale Serve/Funnel	Full integration	Full (auto-start/stop tailscale serve, availability check, daemon lifecycle)	MATCH
Remote gateway access	SSH tunnels + tailnet	Tailscale-only binding + Tailscale Serve	MATCH
Health checks / doctor	10+ checks	10 checks	MATCH
`onboard` wizard	Full guided setup	Full (`flynn setup` + first-run auto-trigger)	MATCH
Docker deployment	Full	Full (multi-stage Dockerfile, docker-compose.yml)	MATCH
Nix deployment	Full	Full (flake + package + optional NixOS module)	MATCH
Shell completion	Auto-detect + cached (v2026.2.3)	Full (bash/zsh/fish generation + --install flag)	MATCH
Announce delivery mode	Isolated job delivery (v2026.2.3)	--	MISSING
Fly.io / Railway / Render	Supported	Full (docs + templates + PORT override support)	MATCH
Bonjour/mDNS discovery	Full	--	MISSING
Gateway lock	Full	Full (single-client mode, code 4003, UI detection)	MATCH

13. Chat Commands

Feature	OpenClaw	Flynn	Status
`/status`	Full	Full	MATCH
`/new` / `/reset`	Full	Full	MATCH
`/compact`	Full	Full (manual via command)	MATCH
`/think <level>`	Full (off to xhigh)	`!!think` prefix (per-message, multi-provider)	MATCH
`/verbose`	Full	Full (toggle raw streaming + tool output)	MATCH
`/usage`	Full (off/tokens/full)	Full (per-tier breakdown + cost)	MATCH
`/local` / `/cloud`	--	Full	Flynn-unique
`/model`	--	Full (tier switching)	Flynn-unique

14. Miscellaneous

Feature	OpenClaw	Flynn	Status
Streaming & chunking	Full (per-channel limits)	Full (streaming + per-channel chunking)	MATCH
Typing indicators	Full	Telegram, Discord, WhatsApp (per-adapter)	MATCH
Presence tracking	Full	Full (`system.presence` with online/offline inference from recent sender activity)	MATCH
Web UI token dashboard	Usage visualization (v2026.2.6)	Full (Usage page with summary cards, per-session table, auto-refresh)	MATCH
Usage tracking / cost	Full	Full (per-tier tokens, estimated cost via MODEL_COSTS)	MATCH
Markdown rendering	Per-channel formatting	Full (TUI markdown renderer + channel-specific)	MATCH
Media pipeline	Images/audio/video/transcription	Full (image analysis, audio transcription, media.send)	MATCH
Group chat support	Full (mention gating, routing)	Full (all channels support mention gating + group filtering)	MATCH
Retry policy	Full (configurable)	Full (configurable retries, backoff, delay caps)	MATCH
System prompt templating	AGENTS.md, SOUL.md, IDENTITY.md, USER.md	Full (SOUL.md, AGENTS.md, configurable search dirs + extra sections)	MATCH

Summary Scorecard

Category	Items	Match	Partial	Missing
Channels	13	6	0	7
Model Providers	18	14	0	4
Agent & Tools	22	21	0	1
Sessions	7	7	0	0
Context/Compaction	4	4	0	0
Memory	7	6	0	1
MCP	3	3	0	0
Security	10	8	0	2
Automation	4	4	0	0
Companion Apps	6	0	0	6
Skills/Plugins	5	4	0	1
Gateway/Infra	13	8	0	5
Chat Commands	6	6	0	0
Misc	10	10	0	0
TOTAL	128	101 (79%)	0 (0%)	27 (21%)

Note: Match rate improved from 77% to 78% after implementing setup wizard (flynn setup + first-run auto-trigger).

Remaining True Gaps (prioritized)

Tier 1 — Quick Wins (DONE — implemented 2026-02-07)

All five Tier 1 items implemented: !!think prefix, /verbose command, typing indicators (Discord/WhatsApp), session pruning (TTL), tool groups. See docs/plans/2026-02-07-tier1-quick-wins-design.md.

Tier 2 — Meaningful New Features (DONE — implemented 2026-02-07)

All four Tier 2 items implemented: inbound webhooks (HTTP POST /webhooks/:name with HMAC auth), vector memory search (hybrid keyword+vector with OpenAI/Gemini/Ollama/LlamaCpp embeddings), Dockerfile (multi-stage build), heartbeat monitor (5 checks with notification). See docs/plans/2026-02-07-tier2-implementation-plan.md.

Tier 3 — Practical Improvements (DONE — implemented 2026-02-09)

All five Tier 3 items implemented: Lane Queue (per-session FIFO in gateway), credential redaction (18+ secret fields), Web UI token dashboard (usage page with summary cards), xAI Grok provider (OpenAI-compatible), Voyage AI embeddings (configurable dimensions). +33 new tests.

Tier 4 — Additional Channels (if desired) → Tier 4 — Implemented Features + Remaining Channels

Implemented (2026-02-09):

Gateway lock (single-client mode with code 4003)
Shell completion (bash/zsh/fish with --install flag)
Tailscale Serve/Funnel (auto-start/stop with daemon lifecycle)
DM pairing codes (PairingManager with TTL codes, all 4 channel adapters, gateway handlers, TUI /pair command)

Remaining channels (if desired): 6. Google Chat (Chat API)

Tier 5 — Deferred / Niche

Companion apps (macOS/iOS/Android) — massive scope
LINE, Feishu, Mattermost — niche audience
iMessage/BlueBubbles — Apple ecosystem only
Canvas/A2UI — experimental visual workspace
Voice Wake / Talk Mode — ElevenLabs TTS integration
Nix/Fly.io/Railway deployment — platform-specific
OAuth subscription auth — complex
Skill/plugin safety scanner — static analysis
Announce delivery mode — isolated job delivery
Bonjour/mDNS discovery — LAN-only use case
MiniMax/Moonshot — regional providers
Synthetic provider — testing/mock
Elevated mode — sandbox escape hatch
~~Onboard wizard — guided setup~~ (DONE — flynn setup + first-run auto-trigger, 2026-02-10)
ClawHub/skill registry — community marketplace

What Flynn Has That OpenClaw Doesn't Emphasise

Full MCP protocol support with stdio transport, tool bridging, and server lifecycle management
Model tier switching via chat commands (/local, /cloud, /model)
10 model providers (Anthropic, OpenAI, Gemini, Ollama, Llama.cpp, OpenRouter, Bedrock, GitHub, Zhipu, xAI)
SQLite session storage (vs OpenClaw's JSONL files)
Configurable retry policy with exponential backoff
Skill installer with managed directory + upgrade support
Audio transcription pipeline for voice messages
Per-message extended thinking via !!think prefix (multi-provider: Anthropic, OpenAI, Gemini, GitHub)
Tool groups for policy config (group:fs, group:runtime, group:web, group:memory)

18 KiB Raw Blame History