AI SRE Landscape

Companies building AI SRE agents, organized by approach.

Agentic AI SRE platforms

Standalone AI agents built specifically to investigate and act on production issues across multiple tools and systems.

Incident management platforms with AI

Platforms that started as incident coordination tools and added AI investigation on top.

Observability platforms with AI

Large observability vendors adding AI investigation with direct access to the telemetry they already collect.

General-purpose agents (DIY)

General-purpose AI agents (Claude Code, Cursor, or custom builds) connected to observability and infrastructure APIs. This is often the fastest way to get something running: a working prototype can come together in days, and for well-documented systems with contained architectures, these agents deliver real value on investigation tasks. They are best suited to environments where the problem space is well-scoped. Without broader system understanding or a way to verify results, accuracy tends to plateau for most teams, regardless of the underlying model.
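To make the DIY pattern concrete, here is a minimal sketch of an agent wired to two tools, with a simple verification step. Everything in it is an assumption for illustration: `query_error_rate` and `recent_deploys` are hypothetical stand-ins for real observability and infrastructure API clients, the data is hard-coded, and a fixed rule takes the place of the model that would normally decide which tools to call and how to read the results.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


# Hypothetical stand-in for a metrics API client (hard-coded sample data).
def query_error_rate(service: str) -> float:
    fake_metrics = {"checkout": 0.12, "search": 0.01}
    return fake_metrics.get(service, 0.0)


# Hypothetical stand-in for a deployment-history API client.
def recent_deploys(service: str) -> List[str]:
    fake_deploys = {"checkout": ["v42 deployed 10 minutes ago"]}
    return fake_deploys.get(service, [])


# Tool registry: the set of APIs the agent is allowed to call.
TOOLS: Dict[str, Callable] = {
    "error_rate": query_error_rate,
    "deploys": recent_deploys,
}


@dataclass
class Finding:
    service: str
    hypothesis: str
    evidence: List[str]
    verified: bool


def investigate(service: str, threshold: float = 0.05) -> Finding:
    """Gather evidence from each tool, then form and check a hypothesis.

    A fixed rule stands in for the model here (an assumption of this sketch).
    """
    error_rate = TOOLS["error_rate"](service)
    deploys = TOOLS["deploys"](service)
    evidence = [f"error_rate={error_rate:.2f}"] + deploys

    elevated = error_rate > threshold
    if elevated and deploys:
        hypothesis = "recent deploy likely caused elevated errors"
    elif elevated:
        hypothesis = "elevated errors, cause unknown"
    else:
        hypothesis = "no anomaly detected"

    # Verification: mark the finding as verified only when a second,
    # independent signal (a recent deploy) supports the anomaly.
    verified = elevated and bool(deploys)
    return Finding(service, hypothesis, evidence, verified)


if __name__ == "__main__":
    print(investigate("checkout"))
    print(investigate("search"))
```

The sketch works because the system is small and the tools are known in advance, which matches the well-scoped case described above. The `verified` flag is the part a quick prototype tends to skip: without a second signal to check a hypothesis against, the agent has no way to tell a plausible answer from a correct one.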

For background on the category, see What is AI SRE?