feat: Implement Phase 3 automation for K8s agent system
Automation components for scheduled and event-driven workflows:
Scheduler:
- scheduler.sh for cron-based workflow execution
- Logs workflow runs to ~/.claude/logs/workflows/
- Notifies dashboard on completion
Alertmanager Integration:
- webhook-receiver.sh for processing alerts
- Dashboard endpoint /api/webhooks/alertmanager
- Example alertmanager-config.yaml with routing rules
- Maps alerts to workflows (crashloop, node issues, resources)
New Incident Workflows:
- node-issue-response.yaml: Handle NotReady/unreachable nodes
- resource-pressure-response.yaml: Respond to memory/CPU overcommit
- argocd-sync-failure.yaml: Investigate and fix sync failures
Dashboard Updates:
- POST /api/webhooks/alertmanager endpoint
- POST /api/workflows/{name}/complete endpoint
- Alerts create pending actions for visibility
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
@@ -55,6 +55,10 @@ func main() {
|
||||
r.Get("/history", api.GetActionHistory(s))
|
||||
r.Get("/workflows", api.GetWorkflows(s))
|
||||
r.Post("/workflows/{name}/run", api.RunWorkflow(s))
|
||||
r.Post("/workflows/{name}/complete", api.CompleteWorkflow(s))
|
||||
|
||||
// Webhook endpoints
|
||||
r.Post("/webhooks/alertmanager", api.AlertmanagerWebhook(s))
|
||||
})
|
||||
|
||||
// Static files
|
||||
|
||||
Reference in New Issue
Block a user