chore(vault): capture generated Obsidian updates

2026-05-27 14:31:39 -07:00
parent a5c105129e
commit 2e751d6ff5
22 changed files with 584 additions and 71 deletions
@@ -0,0 +1,26 @@
+# Daily Hermes + AI Research Brief — 2026-05-23
+
+## Important updates
+
+- **Hermes v0.14.0 is the big local-agent upgrade to test next.** The latest GitHub release (May 16) says Hermes added SuperGrok OAuth with 1M-token Grok 4.3 context, an OpenAI-compatible local proxy for OAuth-authed providers, first-class `x_search`, Microsoft Teams plumbing, lighter/lazy installs, PyPI install support, ~19s faster cold start, 180x faster browser CDP calls, LINE/SimpleX gateways, `/handoff`, LSP diagnostics on writes, improved `computer_use`, Windows beta, OpenRouter Pareto Code router, and trusted Hugging Face skills taps. This directly affects Atlas provider routing, gateway reach, browser automation, and local coding-agent ergonomics. Source: [Hermes releases](https://github.com/NousResearch/hermes-agent/releases).
+
+- **Hermes v0.13.0’s durable Kanban board is still the most relevant architectural feature for Atlas/swarm work.** The release notes describe Kanban as a durable multi-agent board with heartbeat, reclaim, zombie detection, auto-block, dependency DAGs, dispatch daemon, and worker-gated tools. That maps closely to Will’s local swarm / Pi agent harness needs: durable task state beats fragile background chats. Source: [Hermes v0.13 release](https://github.com/NousResearch/hermes-agent/blob/main/RELEASE_v0.13.0.md).
+
+- **OpenAI Codex is moving toward long-running, remote, cross-device agents.** The May 21 Codex changelog says Goal mode is now available across the app, IDE extension, and CLI for objectives that can run “hours or even days”; Codex also added Appshots, remote computer use after a Mac locks, plugin marketplace sharing with skills/MCP/hooks bundles, and browser-use reliability/structured extraction improvements. Useful pattern for Hermes/Atlas: explicit goal mode + remote host continuity + shareable tool bundles. Source: [Codex changelog](https://developers.openai.com/codex/changelog).
+
+- **Codex now supports trusted non-interactive automation tokens.** OpenAI’s May 5 changelog entry says Enterprise admins can allow members to create Codex access tokens for scripts, schedulers, and private CI runners. Even if Will does not use Codex Enterprise, this is a strong signal: scheduled/CI agent auth is becoming a first-class surface, and Hermes cron/gateway auth should keep separating human OAuth, job identity, and least-privilege automation tokens. Source: [Codex changelog](https://developers.openai.com/codex/changelog).
+
+- **Agent traffic gateways are converging around LLM + MCP + A2A + Kubernetes in one data plane.** Agentgateway positions itself as a Linux Foundation project for “LLM, MCP, A2A, and HTTP in one data plane,” with integrations for Claude Code, Codex, OpenCode, Hermes, MCP servers, OpenTelemetry, Kubernetes, and major providers. For CoreWeave-style GPU/k8s work, watch this class of gateway as the possible control plane between local agents, hosted inference, MCP tools, and observability. Source: [agentgateway](https://agentgateway.dev/).
+
+## Actionable ideas for us
+
+- **[quick] Upgrade-check Hermes locally** with `hermes update` / `hermes doctor`, then specifically smoke-test v0.14 features that matter: PyPI/lazy deps, browser CDP speed, LSP diagnostics, `/handoff`, and the OpenRouter Pareto Code router.
+- **[experiment] Prototype a small Hermes Kanban board** for Atlas jobs: one dispatcher task, two worker profiles, heartbeat/reclaim enabled, and Obsidian note output as the completion artifact.
+- **[experiment] Test Hermes’ OpenAI-compatible local proxy** as a unifying endpoint for Codex/Aider/Cline/Continue against existing OAuth-backed providers; compare latency, context caching, and failure modes.
+- **[watch] Track agentgateway’s MCP/A2A routing model** for whether it can front local MCP servers and GPU-hosted inference without overcomplicating Atlas’ current setup.
+
+## Worth ignoring
+
+- Generic “Claude Code vs Codex” comparison posts unless they include reproducible benchmarks, config details, or concrete failure cases.
+- Broad “top LLMOps tools” listicles; most are vendor SEO unless they show real routing, eval, cost, or Kubernetes deployment examples.
+- Consumer-only agent demos that do not expose tool permissions, auth, logs, or recoverability.
@@ -0,0 +1,27 @@
+# Daily Hermes + AI Research Brief — 2026-05-24
+
+## Important updates
+
+- **Hermes Agent v0.14.0 shipped May 16 as the “Foundation Release.”** Highlights: PyPI install path, ~19s faster cold start, 180x faster Browser CDP calls, native Windows beta, OpenAI-compatible local proxy for OAuth-authed providers, xAI/SuperGrok OAuth with 1M-context Grok, LINE/SimpleX/Teams platform work, `/handoff`, LSP diagnostics on writes, and new optional skills. This matters for Atlas because it reduces install friction and opens a useful “Hermes as local provider proxy” pattern for Codex/Aider/Cline/Continue experiments. Source: [GitHub releases](https://github.com/NousResearch/hermes-agent/releases).
+
+- **Hermes v0.13.0’s durable Kanban/multi-agent board remains the most relevant recent Hermes architecture change.** The release notes describe durable heartbeats, reclaim/zombie detection, auto-blocking, task lifecycle, and worker/orchestrator patterns. This is directly applicable to Will’s Atlas/local-swarm work: use the board as the durable coordination boundary instead of ad-hoc background sessions. Source: [Hermes releases](https://github.com/NousResearch/hermes-agent/releases).
+
+- **LiteLLM opened an alpha Managed Agents Platform on May 8.** It is a self-hosted platform for running multiple agents in production, focused on per-team/context sandboxes and session persistence across pod restarts/upgrades. Worth watching as a comparison point for Hermes Kanban + profiles + gateway on Kubernetes. Source: [LiteLLM blog](https://docs.litellm.ai/blog/agent-platform-alpha).
+
+- **vLLM’s latest developer docs now document experimental disaggregated prefilling.** The page describes splitting prefill and decode across separate vLLM instances and transferring KV cache via connectors such as NIXL, OffloadingConnector, and FlexKV; it was updated May 6. This is practical for CoreWeave-style GPU/k8s serving experiments where prefill/decode bottlenecks need separate scaling. Source: [vLLM disaggregated prefilling docs](https://docs.vllm.ai/en/latest/features/disagg_prefill/).
+
+- **MCP governance/security capacity is expanding.** The MCP project added Clare Liguori as Core Maintainer and Den Delimarsky as Lead Maintainer; the post emphasizes production agent runtimes, triggers/events, authorization, RFC 8707 resource indicators, and security work. This matters because MCP is becoming long-lived infra, not just a tool-call fad; Hermes MCP integration should track auth/security spec changes closely. Source: [MCP blog](https://blog.modelcontextprotocol.io/posts/2026-04-08-expanding-maintainer-team/).
+
+## Actionable ideas for us
+
+- **[quick]** Test `pip install hermes-agent` in a clean disposable venv/container and note any Atlas bootstrap simplifications vs the current install path.
+- **[experiment]** Prototype the new Hermes OpenAI-compatible local proxy with one external coding client (Aider/Cline/Codex) and one OAuth provider; record latency, auth refresh behavior, and whether prompt caching survives the proxy path.
+- **[experiment]** Map Hermes Kanban concepts to the Pi/Atlas swarm: task table = durable queue, profile = worker role, heartbeat/reclaim = supervisor; avoid inventing a parallel scheduler until this is evaluated.
+- **[watch]** Track vLLM disaggregated prefill + LiteLLM componentized/agent platform work for a future GPU/k8s inference control plane design.
+
+## Worth ignoring
+
+- Generic “agent framework star race” and listicle posts unless they include reproducible architecture details or benchmarks.
+- Consumer-only “AI assistant” announcements without tool protocol, deployment, privacy, or local-first implications.
+
+Saved to Obsidian: Atlas/Daily Research/2026-05-24 - Hermes AI Brief.md
@@ -0,0 +1,24 @@
+# Daily Hermes + AI Research Brief — 2026-05-25
+
+## Important updates
+
+- **Hermes v0.14.0 is the big local-agent upgrade to study.** The May 16 release claims PyPI install support, ~19s faster cold start, 180× faster browser CDP calls, native Windows beta, LINE/SimpleX support, LSP diagnostics on write, prompt caching, `/handoff`, `computer_use`, and a local OpenAI-compatible proxy for OAuth-authed providers. This directly maps to Atlas goals: faster cron/gateway loops, cross-tool provider routing, and better spawned-agent ergonomics. [Source: GitHub releases](https://github.com/NousResearch/hermes-agent/releases)
+- **Hermes provider/tooling surface is expanding beyond “chat agent” into router/proxy glue.** The release notes highlight xAI SuperGrok OAuth, Grok 4.3 1M context, `x_search`, Microsoft Teams plumbing, Zed ACP registry integration via `uvx`, OpenRouter Pareto Code router, and `huggingface/skills` as a trusted tap. For Will, this is a reason to treat Hermes/Atlas as a control plane for heterogeneous model/tool backends rather than a single CLI. [Source: Hermes v0.14.0 release](https://github.com/NousResearch/hermes-agent/releases)
+- **OpenAI is pushing Codex toward durable, cross-app agent work.** Codex now has background computer use, an in-app browser, image generation, 90+ plugins including MCP servers, multiple terminals, SSH devbox support, reusable automation threads, scheduling, and preview memory. This is highly relevant because it validates the same design space Atlas is using: cron + memory + tools + multi-agent task carryover. [Source: OpenAI](https://openai.com/index/codex-for-almost-everything/)
+- **Agent governance is becoming an infra layer, not an afterthought.** Databricks’ Unity AI Gateway release adds MCP governance, on-behalf-of permissions, LLM/MCP observability, cost attribution, model fallback/rate-limit/guardrail policies, and consistent controls across OpenAI/Anthropic/Google/open-source models. Practical takeaway: Atlas/Hermes should log model+tool calls as first-class audit events, especially for Pi swarm and GPU/k8s agents. [Source: Databricks](https://www.databricks.com/blog/ai-gateway-governance-layer-agentic-ai)
+- **MCP is continuing to standardize enterprise tool use.** Red Hat’s MCP guidance frames MCP as the layer for tool discovery, policy, and auditable actions, moving beyond one-off RAG/tool-call glue. This supports investing in MCP-compatible wrappers for local services instead of bespoke integrations. [Source: Red Hat Developer](https://developers.redhat.com/articles/2026/01/08/building-effective-ai-agents-mcp)
+- **vLLM v0.21.0 has inference changes worth tracking for local/GPU backends.** The release includes KV offload + Hybrid Memory Allocator work, speculative decoding that respects reasoning/thinking budgets, new Blackwell attention backend work for DeepSeek/Kimi-style models, more model/tool-parser support, and a breaking C++20 build requirement. This matters for any CoreWeave-style or home GPU inference path. [Source: vLLM releases](https://github.com/vllm-project/vllm/releases)
+
+## Actionable ideas for us
+
+- **[quick]** Check whether the running Hermes install is already on v0.14.0; if not, schedule a manual upgrade window and verify cron/gateway after restart.
+- **[quick]** Add a short “agent audit event” note/template for Atlas: model, provider, tool/MCP server, data touched, outcome, and cost/latency where available.
+- **[experiment]** Prototype Hermes’ local OpenAI-compatible proxy with one external coding tool (Codex/Aider/Cline/Continue) to see if Atlas can centralize provider auth and routing.
+- **[watch]** Track vLLM v0.21.x adoption notes before upgrading any production-ish inference service because C++20/Transformers v5 changes may break builds.
+
+## Worth ignoring
+
+- Generic “2026 agent landscape” listicles unless they include reproducible implementation details or protocol/tooling changes.
+- Consumer-only Codex/Claude comparisons without concrete workflow, eval, or infra takeaways.
+
+Saved to Obsidian: Atlas/Daily Research/2026-05-25 - Hermes AI Brief.md
@@ -0,0 +1,28 @@
+# Daily Hermes + AI Research Brief — 2026-05-26
+
+## Important updates
+
+- **Hermes Agent v0.14.0 is the big local-agent update to study.** The May 16 release notes say PyPI install now works, cold start is ~19s faster, Browser CDP calls are 180× faster, native Windows is in beta, LINE/SimpleX were added, `x_search` landed, an OpenAI-compatible local proxy can expose OAuth-backed providers to Codex/Aider/Cline/Continue, and `/handoff`, prompt caching, LSP diagnostics, video generation, and provider-agnostic computer-use improved. This is directly relevant to Atlas as a multi-channel, local-first, tool-heavy agent stack. Source: [NousResearch/hermes-agent releases](https://github.com/NousResearch/hermes-agent/releases).
+
+- **Hermes' May 7 “Tenacity” release introduced durable multi-agent coordination primitives.** The release search result reports Kanban as a durable multi-agent board with heartbeat, reclaim, zombie detection, and auto-block behavior. That maps closely to Will's swarm/worker orchestration needs: durable task state beats pure chat-thread delegation for long-running local agents. Source: [Hermes Agent v2026.5.7 release](https://github.com/NousResearch/hermes-agent/releases) / [NewReleases mirror](https://newreleases.io/project/github/NousResearch/hermes-agent/release/v2026.5.7).
+
+- **GitHub Copilot is pushing hard on cloud-agent ergonomics and routing.** The Copilot changelog lists May updates for auto model selection in VS Code, semantic issue search, applying code-review feedback with Copilot cloud agent, one-click fixes for failing Actions, cheaper/simple-task models, REST API auditing of cloud-agent config, and remote control for Copilot CLI sessions. Useful pattern for Atlas: expose agent state, config, and fix loops as first-class APIs/UI actions. Source: [GitHub Copilot changelog](https://github.blog/changelog/label/copilot/).
+
+- **OpenAI's Ramp case study is a concrete benchmark for agent usefulness: code review + on-call assistant.** Ramp says Codex with GPT-5.5 gives substantive PR feedback “in minutes instead of hours” and is being used to build an internal on-call assistant for complex incident workflows. Practical takeaway: measure Atlas/Hermes value around latency-to-first-review, incident context retention, and whether agents reduce interruption cost—not just task completion demos. Source: [OpenAI: Ramp engineers accelerate code review with Codex](https://openai.com/index/ramp/).
+
+- **Microsoft Agent Framework is converging on the same building blocks Hermes already emphasizes.** Microsoft documents agents that call tools and MCP servers, workflows with type-safe routing/checkpointing/human-in-loop, model clients, agent sessions, memory providers, middleware, and MCP clients; supported backends include Foundry, Anthropic, Azure OpenAI, OpenAI, Ollama, and more. This is worth watching for interoperability ideas and enterprise patterns around middleware/checkpointing. Source: [Microsoft Agent Framework overview](https://learn.microsoft.com/en-us/agent-framework/overview/).
+
+- **Anthropic's agent platform direction continues to validate MCP + generated client surfaces.** Anthropic's May 18 announcement says it acquired Stainless; the search snippet frames the move around MCP and agent connectivity. Even if product details are thin from the public page, the signal is clear: typed API/client generation plus MCP-style tool connectivity is becoming core agent infrastructure. Source: [Anthropic acquires Stainless](https://www.anthropic.com/news/anthropic-acquires-stainless).
+
+## Actionable ideas for us
+
+- **[quick] Upgrade-check Hermes and read v0.14 notes against Atlas pain points.** Prioritize PyPI install, CDP speedups, `/handoff`, local proxy, prompt caching, LSP diagnostics, and `x_search`.
+- **[experiment] Prototype a Hermes Kanban-backed worker lane** for one recurring Atlas task: create → claim → heartbeat → complete/block. Compare with current cron/session-only flow.
+- **[experiment] Test the OpenAI-compatible local proxy** as a unifying endpoint for Aider/Cline/Continue/Codex-style tools using existing Hermes auth/provider routing.
+- **[watch] Track Copilot's cloud-agent API/config surface** for ideas: REST-auditable agent config, remote CLI control, semantic issue search, and auto model routing are all patterns Atlas could mirror locally.
+
+## Worth ignoring
+
+- Generic “best AI coding agents in 2026” listicles unless they include reproducible benchmarks or specific workflow design ideas.
+- Funding/enterprise-adoption announcements without concrete APIs, open-source code, or operational lessons.
+- Consumer AI stories unless they improve local/private agent workflows or tool orchestration.
@@ -0,0 +1,56 @@
+---
+type: agent-audit-event
+project: "Atlas Capability Upgrade Program"
+status: accepted
+owner: "Atlas"
+created: 2026-05-25
+updated: 2026-05-25
+source: "Telegram request: prototype Hermes local OpenAI-compatible proxy with one external coding tool"
+tags: [atlas, audit, agent-event, hermes, proxy, codex]
+---
+
+# Agent Audit Event — Codex + Hermes local proxy experiment
+
+## Summary
+
+- **When:** 2026-05-25
+- **Actor:** Atlas / Hermes default Telegram session
+- **Trigger:** User request from daily Hermes + AI research brief follow-up
+- **Outcome:** partial
+- **One-line result:** Codex can be pointed at a local OpenAI-compatible base URL, but the current Hermes proxy is not ready for a complete Codex-through-Hermes run without a logged-in proxy upstream and/or endpoint compatibility work.
+
+## Runtime
+
+- **Model:** gpt-5.5
+- **Provider:** OpenAI Codex backend for Hermes session
+- **Profile/session:** default Hermes Telegram session
+- **Tools used:** terminal, file/search/read/write, process
+- **MCP servers / external services:** Hermes local proxy CLI; Codex CLI 0.129.0; local loopback capture server
+
+## Data touched
+
+- **Local paths:** `/home/will/.hermes/hermes_cli/proxy/*`, `/home/will/.codex/config.toml`, temporary files under `/tmp`
+- **Remote systems:** none for the prototype request path; no real upstream completion was performed
+- **Secrets/PII exposure:** Codex used its own local auth while probing a local capture endpoint; do not use raw request capture servers for future authenticated probes unless auth is disabled with a temporary `CODEX_HOME`.
+- **Writes/side effects:** created and removed/terminated a temporary local HTTP capture process; wrote this audit note
+
+## Cost and latency
+
+- **Approx. cost:** no successful model completion through the proxy; cost unknown / likely none for the failed local capture run
+- **Latency / runtime:** quick local smoke, under a few minutes
+- **Notable performance notes:** Codex retried `/v1/responses` several times after websocket upgrade failures before falling back to POST.
+
+## Evidence
+
+- **Hermes proxy status:** `hermes proxy status` showed `nous` and `xai` adapters available but not logged in on this profile.
+- **Hermes proxy allowed paths:**
+  - Nous: `/chat/completions`, `/completions`, `/embeddings`, `/models`
+  - xAI: `/chat/completions`, `/completions`, `/embeddings`, `/models`, `/responses`
+- **Codex endpoint behavior:** with `openai_base_url="http://127.0.0.1:8765/v1"`, Codex 0.129.0 probed `GET /v1/models?client_version=0.129.0`, then attempted websocket `GET /v1/responses`, then `POST /v1/responses`.
+- **Compatibility finding:**
+  - Codex is not a good first full-success target for the Nous proxy path because Codex uses Responses API, while the Nous proxy adapter only forwards chat/completions-style paths.
+  - Codex may be plausible through the xAI proxy adapter because xAI allows `/responses`, but the adapter was not logged in and websocket behavior still needs verification.
+  - Aider is likely the better first full-success target for Nous because it uses OpenAI-compatible chat completions, but it is not installed on this machine.
+- **Follow-up:** For a clean full prototype, either:
+  1. authenticate `hermes proxy --provider xai` and retry Codex with a temporary `CODEX_HOME`, or
+  2. install/use Aider against `hermes proxy --provider nous` once Nous Portal proxy auth is available.
@@ -0,0 +1,70 @@
+---
+title: Obsidian Data Accuracy Audit
+area: atlas-reports
+tags: [obsidian, data-quality, infrastructure, audit]
+created: 2026-05-27
+status: completed
+---
+
+# Obsidian Data Accuracy Audit
+
+## Scope
+
+Pass over the shared Obsidian vault for stale infrastructure/model claims, prompted by the wrong local LLM model in [[Infrastructure/Architecture]].
+
+Vault path checked:
+
+`/home/will/lab/swarm/swarm-common/obsidian-vault/will/will-shared-zap`
+
+## Live evidence checked on 2026-05-27
+
+- Local llama.cpp service: `llama-server.service` active on `18806`.
+- `/v1/models` on `http://127.0.0.1:18806` returned current model id:
+  - `gemma-4-26B-A4B-it-UD-IQ2_M.gguf`
+- Ollama on `18807` includes `nomic-embed-text:latest` for embeddings and older chat models; it is not the direct local LLM endpoint used by llama.cpp.
+- Hermes default model config: `openai-codex` / `gpt-5.5`.
+- Current host IPs include `192.168.153.130`, `192.168.153.141`, and Tailscale `100.123.88.127`.
+- Vault path is on host btrfs (`/home`), not currently a VM `virtiofs` mount.
+- n8n on `18808` returned HTTP 200.
+- MinIO health at `192.168.153.253:9000` returned HTTP 200.
+- Hermes cron job `atlas-minio-self-backup` is enabled, scheduled daily at 02:50, and last status was `ok`.
+
+## Updated during this pass
+
+- [[Infrastructure/Architecture]]
+  - Corrected local LLM from Gemma 3 to `gemma-4-26B-A4B-it-UD-IQ2_M.gguf`.
+  - Corrected Hermes model from `gpt-5.4` to `gpt-5.5` via `openai-codex`.
+  - Marked OpenClaw VM / `virtiofs` details as historical.
+  - Corrected current host IPs, MinIO bucket purpose, Gitea repo references, and local model verification command.
+- [[Infrastructure/Automation/Cron Jobs]]
+  - Marked the old OpenClaw cron inventory as stale/historical.
+  - Added current Hermes cron/MinIO backup details from live `cronjob list`.
+- [[Resources/Service Catalog]]
+  - Refreshed high-risk local service rows: SearXNG, LiteLLM, Brave MCP, whisper containers, direct local LLM model id.
+
+## Previously blocked edits completed
+
+Will repaired the permissions on the previously blocked files. On the follow-up pass, ownership was verified as `will:will` and the stale active-note claims were corrected in:
+
+- [[Infrastructure/Services/Docker Services]]
+  - Updated old LAN host references from `192.168.153.113` / `.117` to current local/LAN endpoints.
+  - Updated llama.cpp model from `gemma-3-12b-it-q4_0.gguf` to `gemma-4-26B-A4B-it-UD-IQ2_M.gguf`.
+  - Updated MinIO backup wording from current OpenClaw `zap` backups to current Atlas/Hermes `atlas` backups with legacy `zap` noted.
+- [[Infrastructure/Automation/n8n IMAP Triage Pipeline]]
+  - Updated model and endpoint to the current Gemma 4 / `18806` endpoint.
+  - Marked old `claw`/`openclaw` VM ownership checks as historical.
+- [[Infrastructure/Automation/n8n Nightly Vault Sync]]
+  - Updated local LLM and health URLs to current local endpoints.
+
+## Historical snapshot notes
+
+Many `Notes/YYYY-MM-DD Nightly Vault Sync.md` files contain the old Gemma 3 model id and old VM/virtiofs/IP state. I did **not** rewrite those because they are dated snapshots. They should be read as historical breadcrumbs, not canonical current-state docs.
+
+## Search patterns used
+
+Searched current vault markdown for local-model and infrastructure drift terms including:
+
+- `gemma-3-12b`, `gemma 3`, `gpt-5.4`
+- `192.168.153.113`, `192.168.153.117`
+- `~/.openclaw`, `openclaw.json`, `bucket zap`
+- local service ports `18801`–`18814`, `27123`
@@ -0,0 +1,34 @@
+---
+type: daily-review
+date: 2026-05-23
+tags: [type/daily-review, automation/n8n]
+---
+
+# Daily Review 2026-05-23
+
+## Top priorities
+
+- [ ] 
+- [ ] 
+- [ ] 
+
+## Inbox sweep
+
+- [ ] Review [[Inbox]]
+- [ ] Promote useful captures into [[Projects Home]], [[Resources Home]], [[Decisions Home]], or [[Runbooks Home]]
+
+## Open loops
+
+- [ ] Check [[Projects Home]]
+- [ ] Check [[Meetings Home]] action items
+- [ ] Check [[Runbooks Home]] for procedures that need updates
+
+## Notes / log
+
+- 
+
+## End-of-day reflection
+
+- What moved forward?
+- What is blocked?
+- What should start tomorrow?
@@ -0,0 +1,34 @@
+---
+type: daily-review
+date: 2026-05-24
+tags: [type/daily-review, automation/n8n]
+---
+
+# Daily Review 2026-05-24
+
+## Top priorities
+
+- [ ] 
+- [ ] 
+- [ ] 
+
+## Inbox sweep
+
+- [ ] Review [[Inbox]]
+- [ ] Promote useful captures into [[Projects Home]], [[Resources Home]], [[Decisions Home]], or [[Runbooks Home]]
+
+## Open loops
+
+- [ ] Check [[Projects Home]]
+- [ ] Check [[Meetings Home]] action items
+- [ ] Check [[Runbooks Home]] for procedures that need updates
+
+## Notes / log
+
+- 
+
+## End-of-day reflection
+
+- What moved forward?
+- What is blocked?
+- What should start tomorrow?
@@ -0,0 +1,34 @@
+---
+type: daily-review
+date: 2026-05-25
+tags: [type/daily-review, automation/n8n]
+---
+
+# Daily Review 2026-05-25
+
+## Top priorities
+
+- [ ] 
+- [ ] 
+- [ ] 
+
+## Inbox sweep
+
+- [ ] Review [[Inbox]]
+- [ ] Promote useful captures into [[Projects Home]], [[Resources Home]], [[Decisions Home]], or [[Runbooks Home]]
+
+## Open loops
+
+- [ ] Check [[Projects Home]]
+- [ ] Check [[Meetings Home]] action items
+- [ ] Check [[Runbooks Home]] for procedures that need updates
+
+## Notes / log
+
+- 
+
+## End-of-day reflection
+
+- What moved forward?
+- What is blocked?
+- What should start tomorrow?
@@ -0,0 +1,34 @@
+---
+type: daily-review
+date: 2026-05-26
+tags: [type/daily-review, automation/n8n]
+---
+
+# Daily Review 2026-05-26
+
+## Top priorities
+
+- [ ] 
+- [ ] 
+- [ ] 
+
+## Inbox sweep
+
+- [ ] Review [[Inbox]]
+- [ ] Promote useful captures into [[Projects Home]], [[Resources Home]], [[Decisions Home]], or [[Runbooks Home]]
+
+## Open loops
+
+- [ ] Check [[Projects Home]]
+- [ ] Check [[Meetings Home]] action items
+- [ ] Check [[Runbooks Home]] for procedures that need updates
+
+## Notes / log
+
+- 
+
+## End-of-day reflection
+
+- What moved forward?
+- What is blocked?
+- What should start tomorrow?
@@ -0,0 +1,34 @@
+---
+type: daily-review
+date: 2026-05-27
+tags: [type/daily-review, automation/n8n]
+---
+
+# Daily Review 2026-05-27
+
+## Top priorities
+
+- [ ] 
+- [ ] 
+- [ ] 
+
+## Inbox sweep
+
+- [ ] Review [[Inbox]]
+- [ ] Promote useful captures into [[Projects Home]], [[Resources Home]], [[Decisions Home]], or [[Runbooks Home]]
+
+## Open loops
+
+- [ ] Check [[Projects Home]]
+- [ ] Check [[Meetings Home]] action items
+- [ ] Check [[Runbooks Home]] for procedures that need updates
+
+## Notes / log
+
+- 
+
+## End-of-day reflection
+
+- What moved forward?
+- What is blocked?
+- What should start tomorrow?
@@ -1,6 +1,6 @@
-# Decision / Runbook Suggestions 2026-05-18
+# Decision / Runbook Suggestions 2026-05-25

-Generated: 2026-05-18T15:29:06.051Z
+Generated: 2026-05-25T15:00:25.654Z

 Review candidates and promote useful items into durable Decision or Runbook notes. This note is overwritten weekly.

@@ -0,0 +1,30 @@
+# Inbox Triage 2026-05-24
+
+Generated: 2026-05-24T01:00:00.048Z
+
+## Inbox items
+
+- [ ] [[Chat Summaries]] — classify as Project / Resource / Decision / Runbook / Archive
+- [ ] [[Inbox Home]] — classify as Project / Resource / Decision / Runbook / Archive
+- [ ] [[Inbox]] — classify as Project / Resource / Decision / Runbook / Archive
+- [ ] [[Weekly Review]] — classify as Project / Resource / Decision / Runbook / Archive
+
+## Promote to Projects
+
+- [ ] 
+
+## Promote to Resources
+
+- [ ] 
+
+## Promote to Decisions
+
+- [ ] 
+
+## Promote to Runbooks
+
+- [ ] 
+
+## Archive / Delete / Defer
+
+- [ ] 
@@ -0,0 +1,30 @@
+# Inbox Triage 2026-05-25
+
+Generated: 2026-05-25T01:00:12.199Z
+
+## Inbox items
+
+- [ ] [[Chat Summaries]] — classify as Project / Resource / Decision / Runbook / Archive
+- [ ] [[Inbox Home]] — classify as Project / Resource / Decision / Runbook / Archive
+- [ ] [[Inbox]] — classify as Project / Resource / Decision / Runbook / Archive
+- [ ] [[Weekly Review]] — classify as Project / Resource / Decision / Runbook / Archive
+
+## Promote to Projects
+
+- [ ] 
+
+## Promote to Resources
+
+- [ ] 
+
+## Promote to Decisions
+
+- [ ] 
+
+## Promote to Runbooks
+
+- [ ] 
+
+## Archive / Delete / Defer
+
+- [ ] 
@@ -0,0 +1,30 @@
+# Inbox Triage 2026-05-26
+
+Generated: 2026-05-26T01:00:00.134Z
+
+## Inbox items
+
+- [ ] [[Chat Summaries]] — classify as Project / Resource / Decision / Runbook / Archive
+- [ ] [[Inbox Home]] — classify as Project / Resource / Decision / Runbook / Archive
+- [ ] [[Inbox]] — classify as Project / Resource / Decision / Runbook / Archive
+- [ ] [[Weekly Review]] — classify as Project / Resource / Decision / Runbook / Archive
+
+## Promote to Projects
+
+- [ ] 
+
+## Promote to Resources
+
+- [ ] 
+
+## Promote to Decisions
+
+- [ ] 
+
+## Promote to Runbooks
+
+- [ ] 
+
+## Archive / Delete / Defer
+
+- [ ] 
@@ -0,0 +1,30 @@
+# Inbox Triage 2026-05-27
+
+Generated: 2026-05-27T01:00:00.703Z
+
+## Inbox items
+
+- [ ] [[Chat Summaries]] — classify as Project / Resource / Decision / Runbook / Archive
+- [ ] [[Inbox Home]] — classify as Project / Resource / Decision / Runbook / Archive
+- [ ] [[Inbox]] — classify as Project / Resource / Decision / Runbook / Archive
+- [ ] [[Weekly Review]] — classify as Project / Resource / Decision / Runbook / Archive
+
+## Promote to Projects
+
+- [ ] 
+
+## Promote to Resources
+
+- [ ] 
+
+## Promote to Decisions
+
+- [ ] 
+
+## Promote to Runbooks
+
+- [ ] 
+
+## Archive / Delete / Defer
+
+- [ ] 
@@ -3,7 +3,7 @@ title: Architecture
 area: infrastructure
 tags: [infrastructure, homelab, assistant, integrations, automation, diagrams]
 created: 2026-03-18
-updated: 2026-05-19
+updated: 2026-05-27
 status: active
 related: [[Infrastructure/Services/Docker Services]], [[Infrastructure/Automation/Cron Jobs]], [[Infrastructure/Automation/n8n Workflows]]
 ---
@@ -12,7 +12,7 @@ related: [[Infrastructure/Services/Docker Services]], [[Infrastructure/Automatio

 High-level map of Will's homelab + assistant stack. For operational details (IPs, credential paths, commands), see TOOLS.md in the assistant workspace.

-Operational note: the shared Obsidian vault lives on a `virtiofs` mount and is touched by both the host Obsidian user (`claw`) and the VM assistant user (`openclaw`); collaborative note folders need write permissions that work for both sides.
+Operational note: current vault path is `/home/will/lab/swarm/swarm-common/obsidian-vault/will/will-shared-zap` on the host btrfs filesystem. Older notes about a VM `virtiofs` mount and `claw`/`openclaw` ownership are historical unless the VM is re-enabled.

 See sub-notes for details:
 - [[Infrastructure/Services/Docker Services]]
@@ -21,16 +21,15 @@ See sub-notes for details:

 ## Current deployment

- The assistant currently runs in a VM on Will's laptop.
- Planned later move: main host.
+- Atlas/Hermes currently runs from Will's host-side `~/.hermes` checkout on the laptop; older OpenClaw VM notes are historical.
+- OpenClaw VM infrastructure is intentionally dormant unless Will re-enables it.
 - Shared note workspace lives in `will/will-shared-zap/` inside the Obsidian vault tree.

 ## Networking

- **Primary host LAN IP:** `192.168.153.130`
- **Secondary host LAN IP:** `192.168.153.140` (stale/unreachable from VM as of 2026-04-15)
- **Previous host LAN IPs:** `192.168.153.113`, `192.168.153.117`
+- **Primary host LAN IPs:** `192.168.153.130`, `192.168.153.141`
 - **Tailscale IP:** `100.123.88.127`
+- **Previous host LAN IPs:** `192.168.153.113`, `192.168.153.117`
 - **Separate MinIO endpoint:** `192.168.153.253:9000`

 ## Diagram set
@@ -62,9 +61,9 @@ Why Canvas: it gives much better layout control in Obsidian than Mermaid for thi
 - **Ollama embeddings** on `18807` for memory search

 ### Automation / orchestration
- **OpenClaw runtime:** main assistant in a VM
- **Current main interactive session model:** `gpt-5.4`
- **Local LLM runtime:** llama.cpp on `18806` serving `gemma-3-12b-it-q4_0.gguf`
+- **Atlas/Hermes runtime:** default Hermes profile, host-side `~/.hermes` checkout; Telegram/Discord gateway active.
+- **Current main interactive session model:** `gpt-5.5` via `openai-codex`.
+- **Local LLM runtime:** llama.cpp on `18806` serving `gemma-4-26B-A4B-it-UD-IQ2_M.gguf` (Gemma 4 26B GGUF, ctx 32768).
 - **n8n-agent:** `18808` for scheduled/API-native workflows

 ### Speech / media
@@ -74,14 +73,14 @@ Why Canvas: it gives much better layout control in Obsidian than Mermaid for thi
 ## Storage / repos

 ### MinIO
- Bucket: `zap`
- Purpose: full `~/.openclaw` backups every 6 hours
+- Bucket: `atlas` for current Atlas/Hermes laptop backups; legacy `zap` bucket still exists for older OpenClaw-era artifacts.
+- Purpose: daily `~/.hermes`/Atlas backup snapshots via Hermes cron script `atlas-backup-to-minio-cron.sh`
 - Accessed at `192.168.153.253:9000`

 ### Gitea
- Repo host: `gitea-http.taildb3494.ts.net`
- Repo: `will/swarm-zap.git`
- Purpose: workspace backup / review / sync point
+- Repo host: `gitea-gitea-ssh.taildb3494.ts.net`
+- Swarm repo: `will/swarm-master.git`
+- Atlas/Hermes repo: Gitea `will/atlas-code` (authoritative) with upstream GitHub mirror/backup as documented in Hermes memory.

 ## Kubernetes

@@ -93,8 +92,8 @@ Why Canvas: it gives much better layout control in Obsidian than Mermaid for thi

 - **Telegram:** primary interrupt / reminder / mobile channel
 - **Discord:** deeper technical work and scoped threads
- **Obsidian:** shared vault `will/will-shared-zap/` under `/mnt/swarm-common/obsidian-vault/`
- **Google Workspace:** Gmail / Calendar / Drive / Tasks via `gog`
+- **Obsidian:** shared vault `will/will-shared-zap/` at `/home/will/lab/swarm/swarm-common/obsidian-vault/will/will-shared-zap`; currently on host btrfs, not a VM `virtiofs` mount.
+- **Google Workspace:** Gmail / Calendar / Drive via Hermes/Google Workspace tooling; old `gog`/OpenClaw paths in historical notes may be stale.

 ## Operating decisions worth keeping

@@ -102,12 +101,10 @@ Why Canvas: it gives much better layout control in Obsidian than Mermaid for thi
 - n8n vs cron: use n8n when it has the right native node or API path, keep shell-heavy local tasks in OpenClaw or OS cron
 - Shared vault is human-readable layer; workspace memory remains source of truth

-## OpenClaw local model provider migration
- Date: 2026-04-15
- Local llama.cpp Gemma model is now exposed as `local/gemma-3-12b-local`
- Previous namespace was `litellm/gemma-3-12b-local`
- Migration required two config changes in `~/.openclaw/openclaw.json`:
-  - move the model from `models.providers.litellm.models` to a dedicated `models.providers.local` pointing at `http://192.168.153.130:18806/v1`
-  - update stale explicit model ref in `agents.defaults.models` from `litellm/gemma-3-12b-local` to `local/gemma-3-12b-local`
- Lesson: changing provider blocks alone may not change the live model catalog or Control UI if old `provider/model` refs remain in defaults/fallback maps
- Verify with: `openclaw models list | grep -i -E "gemma|local/"`
+## Local model provider notes
+- Date checked: 2026-05-27
+- Current direct local model endpoint: `http://127.0.0.1:18806/v1` / `http://192.168.153.130:18806/v1`.
+- Current served model id from `/v1/models`: `gemma-4-26B-A4B-it-UD-IQ2_M.gguf`.
+- Ollama on `18807` is for embeddings (`nomic-embed-text`) and also has older chat models installed; do not describe the direct local LLM as Gemma 3.
+- The 2026-04-15 OpenClaw `local/gemma-3-12b-local` migration was historical and should not be used as current Atlas/Hermes config guidance.
+- Verify with: `curl -s http://127.0.0.1:18806/v1/models`.
@@ -3,16 +3,16 @@ title: Cron Jobs
 area: automation
 tags: [automation, health, assistant, cron]
 created: 2026-03-18
-updated: 2026-04-28
+updated: 2026-05-27
 status: active
 related: [[Infrastructure/Architecture]], [[Infrastructure/Automation/n8n Workflows]], [[Plans/Daily Schedule]]
 ---

-# OpenClaw Cron Jobs
+# Hermes Cron Jobs

-This note reflects the **live OpenClaw cron inventory** as checked on 2026-04-13.
+This note reflects a partial **live Hermes cron inventory** checked on 2026-05-27. Older OpenClaw job names below are historical/stale unless they also appear in the current Hermes `cronjob list`.

-All jobs run via the OpenClaw cron scheduler.
+All current jobs run via the Hermes cron scheduler.

 ## Enabled jobs

@@ -87,18 +87,14 @@ All jobs run via the OpenClaw cron scheduler.

 ## OS cron (VM)

-### MinIO backup
- **Schedule:** every 6 hours at minute `17`
- **Purpose:** full `~/.openclaw` backup to MinIO bucket `zap`
- **Behavior note:** backup script now tolerates benign hot-backup `tar` race warnings like `file changed as we read it`, but fails hard on non-benign archive creation errors so broken archives are less likely to be silently uploaded
- **Failure alerting:** cron should call `scripts/backup-to-minio-wrapper.sh`, which appends output to `logs/minio-backup.log` and sends a Discord alert to `#general` if the backup exits non-zero
+### Atlas/Hermes MinIO backup
+- **Schedule:** daily at `02:50` America/Los_Angeles via Hermes job `atlas-minio-self-backup` (`c15ee395a38d`)
+- **Purpose:** Atlas/Hermes backup to MinIO bucket `atlas` using `atlas-backup-to-minio-cron.sh`
+- **Last live check:** job enabled and last status `ok` on 2026-05-27

-### Session transcript cleanup
- **Schedule:** daily at `03:17` (America/Los_Angeles)
- **Script:** `scripts/cleanup-session-transcripts.sh`
- **Purpose:** delete session `.jsonl` files older than 90 days from `~/.openclaw/agents/main/sessions/`
- **Log:** `logs/cleanup-sessions.log`
- **Added:** 2026-03-31
+### System / local automation watchdogs
+- **Examples live on 2026-05-27:** `system threshold watchdog` every 10m, `blocked kanban escalation` every 30m, `local-ai-automation-watchdog` daily 03:20, `agent-ops-watchdog` daily 03:40, `hermes-live-checkout-kanban-guard` every 5m.
+- **Delivery:** mostly Discord `#ops-alerts` or origin, depending on job.

 ## Operational notes

@@ -110,4 +106,4 @@ All jobs run via the OpenClaw cron scheduler.

 ## Note

-Earlier versions of this note listed several jobs that are not present in the current live cron inventory anymore. This file was refreshed from the actual runtime state on 2026-03-27.
+Earlier versions of this note listed several OpenClaw jobs that are not present in the current Hermes cron inventory anymore. Treat this file as refreshed for high-risk/current infra items on 2026-05-27, not a complete replacement for `hermes cron list` / the Hermes `cronjob` tool.
@@ -3,7 +3,7 @@ title: n8n Nightly Vault Sync
 area: infrastructure
 tags: [n8n, obsidian, llm, automation, nightly]
 created: 2026-03-27
-updated: 2026-03-27 23:39 UTC
+updated: 2026-05-27
 status: active
 related: [[Infrastructure/Automation/n8n Workflows]], [[Infrastructure/Architecture]], [[Infrastructure/Automation/Cron Jobs]], [[Infrastructure/Services/Docker Services]]
 ---
@@ -16,7 +16,7 @@ related: [[Infrastructure/Automation/n8n Workflows]], [[Infrastructure/Architect
 - **Workflow ID:** `75JCevkdgkyCr2qH`
 - **Status:** active
 - **Trigger:** nightly cron expression `45 23 * * *`
- **Local LLM:** `gemma-3-12b-it-q4_0.gguf`
+- **Local LLM:** `gemma-4-26B-A4B-it-UD-IQ2_M.gguf`

 ## Purpose

@@ -31,9 +31,9 @@ Each run:
   - `Infrastructure/Automation/Cron Jobs.md`
   - `Infrastructure/Services/Docker Services.md`
 2. checks live health from:
-   - `http://192.168.153.113:18808/healthz`
-   - `http://192.168.153.113:18806/v1/models`
-3. sends that context to the local Gemma model
+   - `http://127.0.0.1:18808/healthz`
+   - `http://127.0.0.1:18806/v1/models`
+3. sends that context to the local Gemma 4 model
 4. writes a nightly note to:
   - `Notes/YYYY-MM-DD Nightly Vault Sync.md`

@@ -3,14 +3,14 @@ title: Docker Services
 area: infrastructure
 tags: [infrastructure, homelab, docker]
 created: 2026-03-18
-updated: 2026-03-27
+updated: 2026-05-27
 status: active
 related: [[Infrastructure/Architecture]], [[Infrastructure/Automation/n8n Workflows]]
 ---

 # Docker Services

-Most service containers run on the LAN host at `192.168.153.113`. Some adjacent AI/runtime services run as host-level user services instead of Docker; those are noted separately below.
+Most service containers run on Will's laptop/host network and publish local/LAN ports. Current host LAN IPs include `192.168.153.130` and `192.168.153.141`; older `192.168.153.113` / `.117` URLs should be treated as historical unless re-verified.

 ## Dockerized services

@@ -19,7 +19,7 @@ Most service containers run on the LAN host at `192.168.153.113`. Some adjacent
 - **Image:** `searxng/searxng:latest`
 - **Port:** `18803 -> 8080`
 - **Role:** preferred local-first web search backend
- **URLs:** `http://192.168.153.113:18803`, `http://192.168.153.117:18803`
+- **URLs:** `http://127.0.0.1:18803`, `http://192.168.153.130:18803`

 ### brave-search (MCP)
 - **Container:** `brave-search`
@@ -33,8 +33,8 @@ Most service containers run on the LAN host at `192.168.153.113`. Some adjacent
 - **Image:** `docker.n8n.io/n8nio/n8n:latest`
 - **Port:** `18808 -> 5678`
 - **Role:** automation workflows and integration runtime
- **Primary URL:** `http://192.168.153.113:18808`
- **Health:** `/healthz` returned `{"status":"ok"}` during live check on 2026-03-27
+- **Primary URL:** `http://127.0.0.1:18808` / `http://192.168.153.130:18808`
+- **Health:** `/healthz` returned HTTP 200 during live check on 2026-05-27
 - **Workflows:** see [[Infrastructure/Automation/n8n Workflows]]

 ### whisper-server
@@ -58,9 +58,9 @@ Most service containers run on the LAN host at `192.168.153.113`. Some adjacent

 ### llama.cpp
 - **Port:** `18806`
- **Model:** `gemma-3-12b-it-q4_0.gguf`
+- **Model:** `gemma-4-26B-A4B-it-UD-IQ2_M.gguf`
 - **Role:** local OpenAI-compatible LLM endpoint for private/offline reasoning and workflow inference
- **Live check:** model endpoint and chat-completion path re-verified on 2026-03-27
+- **Live check:** `/v1/models` returned Gemma 4 26B GGUF on 2026-05-27

 ### Ollama
 - **Port:** `18807`
@@ -71,9 +71,9 @@ Most service containers run on the LAN host at `192.168.153.113`. Some adjacent

 ### MinIO
 - **Endpoint:** `192.168.153.253:9000`
- **Bucket:** `zap`
- **Role:** object storage for OpenClaw backups
+- **Bucket:** `atlas` for current Atlas/Hermes backups; legacy `zap` bucket still exists for old OpenClaw-era artifacts
+- **Role:** object storage for Atlas/Hermes backups

 ## Operational note

-The shared Obsidian vault is not a Docker service, but it is a critical part of this stack. Its Local REST API path was re-verified on 2026-03-27 after fixing cross-user write permissions on the shared `virtiofs` mount.
+The shared Obsidian vault is not a Docker service, but it is a critical part of this stack. Current path is `/home/will/lab/swarm/swarm-common/obsidian-vault/will/will-shared-zap`; `findmnt` showed it on host btrfs on 2026-05-27. Older VM/`virtiofs` permission notes are historical.
@@ -1,12 +1,12 @@
 # Obsidian Automation Health

-Updated: 2026-05-23T01:00:49.446Z
+Updated: 2026-05-27T21:00:30.749Z

 ## Status

 - Overall: Needs attention
 - Obsidian REST: OK
- Reindex trigger: {"status":"ok","mode":"incremental","vault_path":"/home/will/lab/swarm/swarm-common/obsidian-vault/will/will-shared-zap","note_count":162,"added":0,"changed":0,"removed":0,"chunks_indexed":0,"vector_count":321,"state_file":"/home/will/.hermes/data/rag-search/obsidian_index_state.json","chroma_path":"/home/will/.hermes/data/rag-search/chroma"}
- RAG/embedding check: {"ok":false,"status":"failed","exitCode":127,"output":"RAG health script missing: /home/will/.hermes/scripts/rag_embedding_health.py","durationMs":0}
+- Reindex trigger: {"error":{"message":"500 - \"{\\n  \\\"error\\\": \\\"reindex failed\\\",\\n  \\\"exit_code\\\": 3,\\n  \\\"stderr\\\": \\\"\\\"\\n}\"","name":"AxiosError","stack":"AxiosError: Request failed with status code 500\n    at settle (/usr/local/lib/node_modules/n8n/node_modules/.pnpm/axios@1.13.5/node_modules/axios/lib/core/settle.js:19:12)\n    at RedirectableRequest.handleResponse (/usr/local/lib/node_modules/n8n/node_modules/.pnpm/axios@1.13.5/node_modules/axios/lib/adapters/http.js:746:9)\n    at
+- RAG/embedding check: {"ok":false,"status":"failed","exitCode":1,"output":"{\n  \"checks\": {\n    \"obsidian_semantic\": {\n      \"error\": \"HTTPError: HTTP Error 503: Service Unavailable\",\n      \"ok\": false\n    },\n    \"ollama\": {\n      \"embedding_dim\": 768,\n      \"error\": null,\n      \"model\": \"nomic-embed-text\",\n      \"ok\": true\n    }\n  },\n  \"durationMs\": 3135,\n  \"failed\": [\n    \"obsidian_semantic\"\n  ],\n  \"ok\": false,\n  \"status\": \"failed\"\n}","durationMs":3187}

 This note is automatically overwritten by n8n.
@@ -1,7 +1,7 @@
 ---
 type: service-catalog
 created: 2026-05-14T14:50:46-07:00
-updated: 2026-05-14T14:50:46-07:00
+updated: 2026-05-27T12:12:06-07:00
 tags:
  - service-catalog
  - swarm
@@ -13,7 +13,7 @@ tags:

 Canonical index of local services, automation tools, Hermes capabilities, and where to find their operational docs.

-> Generated/updated by Atlas from live system inventory on `2026-05-14T14:50:46-07:00`. Secrets are intentionally omitted.
+> Generated by Atlas from live system inventory on `2026-05-14T14:50:46-07:00`; high-risk local AI/service rows refreshed on `2026-05-27T12:12:06-07:00`. Secrets are intentionally omitted.

 ## Quick links

@@ -41,9 +41,9 @@ Canonical index of local services, automation tools, Hermes capabilities, and wh

 | Service | Port | Status | Purpose | Health / base URL |
 | --- | --- | --- | --- | --- |
-| Brave Search MCP | 18802 | HTTPError | Brave Search MCP server for Hermes MCP tools | `http://127.0.0.1:18802/mcp` |
-| SearXNG | 18803 | TimeoutError | SearXNG metasearch | `http://127.0.0.1:18803/search?q=test&format=json` |
-| LiteLLM | 18804 | TimeoutError | LiteLLM OpenAI-compatible model proxy | `http://127.0.0.1:18804/health/liveliness` |
+| Brave Search MCP | 18802 | HTTP 406 on plain GET `/mcp` | Brave Search MCP server for Hermes MCP tools | `http://127.0.0.1:18802/mcp` |
+| SearXNG | 18803 | OK 200 | SearXNG metasearch | `http://127.0.0.1:18803/search?q=test&format=json` |
+| LiteLLM | 18804 | no listener / HTTP 000 on 2026-05-27 | LiteLLM OpenAI-compatible model proxy | `http://127.0.0.1:18804/health/liveliness` |
 | Kokoro TTS | 18805 | OK 200 | Kokoro local TTS | `http://127.0.0.1:18805/health` |
 | llama.cpp | 18806 | OK 200 | llama.cpp local LLM | `http://127.0.0.1:18806/v1/models` |
 | Ollama embeddings | 18807 | OK 200 | Ollama embeddings API | `http://127.0.0.1:18807/api/version` |
@@ -61,9 +61,8 @@ Canonical index of local services, automation tools, Hermes capabilities, and wh
 | Container | Status | Ports |
 | --- | --- | --- |
 | n8n-agent | Up 21 hours (healthy) | 0.0.0.0:18808->5678/tcp, [::]:18808->5678/tcp |
-| whisper-server | Up 25 hours (healthy) | 0.0.0.0:18811->8080/tcp, [::]:18811->8080/tcp |
-| litellm | Up 4 hours (healthy) | 0.0.0.0:18804->4000/tcp, [::]:18804->4000/tcp |
-| litellm-db | Up 25 hours (healthy) | 5432/tcp |
+| whisper-server-gpu | Up 27 hours (healthy) | 0.0.0.0:18801->8080/tcp, [::]:18801->8080/tcp |
+| whisper-server | Up 27 hours (healthy) | 0.0.0.0:18811->8080/tcp, [::]:18811->8080/tcp |
 | kokoro-tts | Up 25 hours | 0.0.0.0:18805->8880/tcp, [::]:18805->8880/tcp |
 | brave-search | Up 25 hours | 0.0.0.0:18802->8000/tcp, [::]:18802->8000/tcp |
 | searxng | Up 25 hours | 0.0.0.0:18803->8080/tcp, [::]:18803->8080/tcp |
@@ -246,7 +245,7 @@ Profile          Model                        Gateway      Alias        Distribu
 | --- | --- | --- |
 | Web search | SearXNG `18803` or Brave MCP `18802` | Hermes web search and MCP Brave Search are both available |
 | Model proxy | LiteLLM `18804` | Use for OpenAI-compatible routed models |
-| Direct local LLM | llama.cpp `18806` | Current model: Gemma 4 26B GGUF; useful for n8n/local automation |
+| Direct local LLM | llama.cpp `18806` | Current model id: `gemma-4-26B-A4B-it-UD-IQ2_M.gguf`; useful for n8n/local automation |
 | Embeddings | Ollama `18807` | Use raw Ollama API root, not `/v1`, for `/api/embed` |
 | Text-to-speech | Kokoro `18805` / Hermes TTS tool | Local speech generation |
 | Speech-to-text | Whisper `18811` and wrappers | Local transcription fallback |