flynn

will/flynn

Author	SHA1	Message	Date
William Valentin	6f5dd741a9	docs: add llama.cpp integration design Design for adding LlamaCppClient to support local LLM inference via llama-server with CUDA. Target model: Qwen 2.5 14B Q4_K_M. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-05 13:05:58 -08:00