A curated list of AI-powered DevOps & SRE (Site Reliability Engineering) agents, tools, and resources for automating and enhancing reliability practices
- Control Flow (Open Source)
- Obot (Open Source)
- RunWhen
- KubeStellar Console - Open source AI-powered multi-cluster Kubernetes dashboard with AI chat for cluster operations, real-time observability, and CNCF integrations (Argo, Kyverno, Prometheus, and 20+ others).
- Komodor Klaudia AI
- Agent SRE
- Cleric
- Holmes GPT
- k8s-GPT (Open Source)
- Nudge Bee
- Parity
- SRE.ai
- Resolve AI
- Sherlocks.ai
- IncidentFox (Open Source)
- Open SRE Agent (Open Source)