# Plan: K8s Agent Production Validation
## Goal
Address the identified gaps in k8s agent production readiness:
1. Create a lightweight k8s skill for quick checks (complements the existing workflow rather than replacing it)
2. Validate the agents against a live cluster using read-only operations only
## Context
**Already exists:**
- `/cluster-status` command → `cluster-health-check` workflow (multi-step, 4 agents)
- All k8s agents are defined with proper boundaries and formats
**Gap:** There is no single-agent skill for an ultra-quick status check (the k8s equivalent of `sysadmin-health` for Linux).
---
## Phase 1: Create K8s Quick-Status Skill
Create `~/.claude/skills/k8s-quick-status/SKILL.md`
**Purpose:** Lightweight single-agent check (haiku) for an instant cluster pulse. Faster than the full workflow when only the basics are needed.
**Checks:**
- `kubectl get nodes` - node readiness
- `kubectl get pods -A --field-selector=status.phase!=Running,status.phase!=Succeeded` - unhealthy pods (excludes completed Job pods, which are not failures)
- `kubectl get events -A --field-selector=type=Warning --sort-by='.lastTimestamp'` - recent warnings
- `argocd app list` - sync status at a glance
**Format:** Match `sysadmin-health` structure exactly (YAML frontmatter + Markdown).
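
To illustrate how the node-readiness check could be interpreted programmatically, the sketch below parses the JSON form of the first command (`kubectl get nodes -o json`) and lists nodes whose `Ready` condition is not `True`. The function name `not_ready_nodes` and the sample payload are assumptions made for illustration; the skill itself may simply read the plain-text table. For the sample shown, `not_ready_nodes(SAMPLE_NODES)` returns `["node-b"]`.

```python
import json


def not_ready_nodes(nodes_json: str) -> list[str]:
    """Return names of nodes whose Ready condition is missing or not "True"."""
    flagged = []
    for node in json.loads(nodes_json).get("items", []):
        conditions = node.get("status", {}).get("conditions", [])
        # Each node reports a list of conditions; pick the one of type "Ready".
        ready = next((c for c in conditions if c.get("type") == "Ready"), None)
        if ready is None or ready.get("status") != "True":
            flagged.append(node["metadata"]["name"])
    return flagged


# Hypothetical, trimmed payload in the shape of `kubectl get nodes -o json`.
SAMPLE_NODES = json.dumps({
    "items": [
        {"metadata": {"name": "node-a"},
         "status": {"conditions": [{"type": "Ready", "status": "True"}]}},
        {"metadata": {"name": "node-b"},
         "status": {"conditions": [{"type": "Ready", "status": "Unknown"}]}},
    ]
})
```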
---
## Phase 2: Live Cluster Validation
Run read-only commands to validate connectivity and output quality.
### Test Sequence
| Agent | Commands | Purpose |
|-------|----------|---------|
| k8s-diagnostician | `kubectl get nodes -o wide`<br>`kubectl get pods -A`<br>`kubectl top nodes` | Cluster connectivity |
| prometheus-analyst | `curl 'localhost:9090/api/v1/query?query=up'`<br>`curl localhost:9090/api/v1/alerts` | Metrics access |
| argocd-operator | `argocd app list` | GitOps connectivity |
| git-operator | `git -C <repo-path> log --oneline -n 5` | Repo access |
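
One way to drive the test sequence is a small runner that executes each read-only command with a timeout and records pass or fail per agent. This is a sketch, not the agents' actual invocation path: `CheckResult`, `run_check`, `run_all`, and the 30-second timeout are assumptions. Only the `kubectl` and `argocd` rows are listed; the `git` row is left out because the plan does not give the repository path.

```python
import subprocess
from dataclasses import dataclass


@dataclass
class CheckResult:
    agent: str
    command: list[str]
    ok: bool
    detail: str


def run_check(agent: str, command: list[str], timeout: float = 30.0) -> CheckResult:
    """Run one read-only command; ok means it exited with status 0."""
    try:
        proc = subprocess.run(command, capture_output=True, text=True, timeout=timeout)
    except OSError as exc:
        # Covers a missing or non-executable binary.
        return CheckResult(agent, command, False, f"could not start: {exc}")
    except subprocess.TimeoutExpired:
        return CheckResult(agent, command, False, f"timed out after {timeout}s")
    detail = proc.stderr.strip() if proc.returncode != 0 else ""
    return CheckResult(agent, command, proc.returncode == 0, detail)


# Subset of the table above, written as argv lists so no shell is involved.
CHECKS = [
    ("k8s-diagnostician", ["kubectl", "get", "nodes", "-o", "wide"]),
    ("k8s-diagnostician", ["kubectl", "get", "pods", "-A"]),
    ("k8s-diagnostician", ["kubectl", "top", "nodes"]),
    ("argocd-operator", ["argocd", "app", "list"]),
]


def run_all(checks=CHECKS) -> list[CheckResult]:
    """Run every check in order and return the collected results."""
    return [run_check(agent, command) for agent, command in checks]
```

Calling `run_all()` returns one `CheckResult` per command; passing argv lists instead of shell strings avoids quoting problems and keeps the commands exactly as listed.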
### Success Criteria
- All commands exit with status 0
- kubectl has a valid kubeconfig
- Prometheus is reachable
- ArgoCD CLI is authenticated
- Git repo is accessible
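
The "Prometheus is reachable" criterion can be made more concrete by inspecting the body of the `up` query instead of relying only on curl's exit status. The sketch below assumes the standard Prometheus HTTP API response envelope (`status`, `data.result`); the helper name `targets_down` and the sample payload are hypothetical. An empty list means every scraped target reported `up == 1`.

```python
import json


def targets_down(query_response: str) -> list[str]:
    """Return instance labels whose `up` sample is not 1.

    Raises ValueError if the API did not report status "success".
    """
    body = json.loads(query_response)
    if body.get("status") != "success":
        raise ValueError(f"query failed: {body.get('error', 'unknown error')}")
    down = []
    for series in body["data"]["result"]:
        # Instant-vector samples are [unix_timestamp, "value-as-string"].
        _, value = series["value"]
        if float(value) != 1.0:
            down.append(series["metric"].get("instance", "<no instance label>"))
    return down


# Hypothetical response in the shape returned by /api/v1/query?query=up.
SAMPLE_UP = json.dumps({
    "status": "success",
    "data": {
        "resultType": "vector",
        "result": [
            {"metric": {"__name__": "up", "instance": "10.0.0.1:9100", "job": "node"},
             "value": [1700000000.0, "1"]},
            {"metric": {"__name__": "up", "instance": "10.0.0.2:9100", "job": "node"},
             "value": [1700000000.0, "0"]},
        ],
    },
})
```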
---
## Phase 3: Document Results
Update `~/.claude/state/future-considerations.json` with:
- Validation status per agent
- Any connectivity issues discovered
- Recommendations for fixes
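
The file's existing schema is not shown in this plan, so the following is only a sketch of one way to merge results in without disturbing other content. It assumes the file holds a top-level JSON object; the key `k8s_agent_validation` and the per-agent fields are hypothetical and should be adapted to whatever structure `future-considerations.json` already uses. A call might look like `record_validation(Path.home() / ".claude/state/future-considerations.json", {"k8s-diagnostician": {"status": "pass", "issues": [], "recommendations": []}})`.

```python
import json
from pathlib import Path


def record_validation(state_file: Path, results: dict[str, dict]) -> dict:
    """Merge per-agent results under one key, keeping all other content."""
    # Start from the existing file if present, otherwise from an empty object.
    state = json.loads(state_file.read_text()) if state_file.exists() else {}
    section = state.setdefault("k8s_agent_validation", {})  # assumed key name
    section.update(results)
    state_file.write_text(json.dumps(state, indent=2) + "\n")
    return state
```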
---
## Files to Create/Modify
| File | Action |
|------|--------|
| `~/.claude/skills/k8s-quick-status/SKILL.md` | Create |
| `~/.claude/state/future-considerations.json` | Update |
---
## Execution Order
1. Create `k8s-quick-status` skill
2. Run kubectl connectivity test
3. Run Prometheus connectivity test
4. Run ArgoCD connectivity test
5. Run Git repo connectivity test
6. Document findings in `future-considerations.json`
7. Report summary