Explore Help

Register Sign In

will/flynn

1

0

You've already forked flynn

Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity

Files

2cadff901dcc0d93211b2bc862d29540e6f70f48

flynn/docs/plans/2026-02-15-skill-safety-scanner-checklist.md

T

William Valentin f2cdd1abd2 docs: add safety docs and OpenClaw gap roadmap

2026-02-15 10:17:07 -08:00

4.3 KiB

Raw Blame History

Skill/Plugin Safety Scanner — Implementation Checklist

Date: 2026-02-15

Parent roadmap: docs/plans/2026-02-15-openclaw-gap-roadmap.md

Goal: Close the gap item "Skill/plugin code safety scanner" by adding static analysis gates for skills so unsafe skill packages are rejected (or marked unavailable) before they can be injected into prompts or used in routing.

Scope

In scope

Add a static scanner for skill directories.
Run the scanner during skill load (covers daemon startup, watcher reloads, and CLI skill operations).
Optionally run the scanner during install/upgrade (pre-copy) to avoid persisting unsafe content.
Emit audit events (pass/fail + reason) without leaking sensitive content.

Out of scope (for this milestone)

Full SAST for arbitrary languages.
Code signing / provenance chains.
Remote registries.

Baseline

Current skill load/install entry points:

Load/validate: src/skills/loader.ts (loadSkill())
Install/upgrade: src/skills/installer.ts
CLI workflows: src/cli/skills.ts
Prompt injection: src/skills/registry.ts -> src/daemon/services.ts (# Available Skills section)

Scanner Policy (MVP)

File system safety

Deny any symlinks inside a skill directory.
Deny files above a size threshold (default: 1MB) unless allowlisted.
Deny binary blobs (heuristic: NUL bytes or high non-text ratio) for SKILL.md and manifest.json.

Manifest safety

manifest.json must parse as JSON if present.
If a skill is intended for routing (i.e. referenced by an intent target), require manifest.json.permissions.
- This aligns with the deny-by-default runtime enforcement: skills without permissions should not be routable.

Prompt content safety (lightweight)

Scan SKILL.md for obvious prompt-injection patterns:
- "ignore previous" / "system prompt" / "exfiltrate" / "send secrets" etc.
Treat these as warnings or failures (recommend: failure for now).

Implementation Strategy

Scanner API

Create a small scanner module:

src/skills/scanner.ts
- scanSkillDirectory(dir): { ok: boolean; issues: SkillScanIssue[] }
- SkillScanIssue = { severity: 'error' | 'warn'; code: string; message: string; path?: string }

Config knobs (optional in MVP):

skills.scan.enabled (default true)
skills.scan.max_file_size_bytes (default 1_000_000)
skills.scan.fail_on_warnings (default false)

Enforce during load

In src/skills/loader.ts inside loadSkill():
- run scanner before returning a Skill
- on failure:
  - either return null (hard fail), OR
  - return Skill with available=false and add reasons

Recommendation: mark skill unavailable (not null) so the user can see it in flynn skills list with reasons.

Enforce during install (optional but recommended)

In src/skills/installer.ts:
- scan sourceDir before copying
- fail install if scanner errors

Audit events

Add audit event types for skill scans:
- skills.scan.pass
- skills.scan.fail
Ensure messages do not include raw secret content; include issue codes and counts.

PR Breakdown

PR 1 — Scanner module + loader integration

Checklist:

Add src/skills/scanner.ts with MVP rules.
Integrate into src/skills/loader.ts.
Update src/skills/loader.test.ts with fixtures:
- symlink skill rejected
- oversized file rejected
- injection marker in SKILL.md rejected

Acceptance:

flynn doctor / skill load does not crash on bad skills.
Bad skills become unavailable with clear reasons.

Tests:

pnpm test:run src/skills/loader.test.ts

PR 2 — Installer integration

Checklist:

Add scan preflight to src/skills/installer.ts install().
Add tests for install failure on scan errors.

Tests:

pnpm test:run src/skills/installer.test.ts (add if missing)

PR 3 — Audit events

Checklist:

Extend src/audit/types.ts to include skill scan events.
Extend src/audit/logger.ts helpers.
Emit events from loader/install paths.
Add targeted tests validating event shape (if audit logger has test coverage).

Acceptance:

Audit logs show scan pass/fail with counts and stable issue codes.

Final Checks

pnpm typecheck
pnpm test:run
Update docs/plans/state.json entry to completed once implemented (include test status).

Reference in New Issue View Git Blame Copy Permalink

Powered by Gitea Version: 1.26.1 Page: 353ms Template: 5ms

Auto

English

Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語简体中文繁體中文（台灣）繁體中文（香港） 한국어

Licenses API