AI SRE

Feed

Articles, talks, and writing about AI agents in production.

2026

Fixing Claude with Claude: Anthropic reports on AI SRE Mar 19

The Register · theregister.com

Your Data is Made Powerful By Context Mar 9

Charity Majors · charity.wtf

We Automated Everything Except Knowing What's Going On Mar 2

Kenneth Eversole · eversole.dev

Everyone is a junior engineer in the age of AI Mar 1

Kelsey Hightower · thenewstack.io

Why Your On-Call Engineer Is Your Most Expensive Bottleneck Mar 1

Pranav Shil · medium.com

Practical Considerations for AI Incident Reviews Mar 1

Fischer · fgj.codes

The Picture They Paint of You Feb 23

Fred Hebert · ferd.ca

Building An Elite AI Engineering Culture In 2026 Feb 18

Chris Roth · cjroth.com

Lots of AI SRE, no AI incident management Feb 14

Lorin Hochstein · surfingcomplexity.blog

Are bugs and incidents inevitable with AI coding agents? Jan 28

David Loker · stackoverflow.blog

Bring Back Ops Pride Jan 19

Charity Majors · charity.wtf

Software engineering when the machine writes the code Jan 19

Shayon Mukherjee · shayon.dev

How we built an AI SRE agent that investigates like a team of engineers Jan 12

Daniel Shan, Tristan Ratchford · datadoghq.com

Software Acceleration and Desynchronization Jan 5

Fred Hebert · ferd.ca

Your AI SRE needs better observability, not bigger models Jan 1

ClickHouse · clickhouse.com

Building internal agents Jan 1

Will Larson · lethain.com

Human-Centred AI for SRE: Multi-Agent Incident Response without Full Automation Jan 1

InfoQ · infoq.com

Tribal Knowledge Kills On-Call Jan 1

Angelika Alashwah · medium.com

2025

End-of-Year Observability Retrospective with Charity Majors Dec 1

Horovits · medium.com

Facilitating AI adoption at Imprint Dec 1

Will Larson · lethain.com

AI and the Ironies of Automation Nov 21

Uwe Friedrichsen · ufried.com

Notes from the 2025 'AI Agents in Production' Conference Nov 18

Mark Torres · markptorres.com

From 4 Hours to 8 Minutes with AI Agents That Transform SRE Oct 1

P. Jausovec · usenix.org

Ongoing Tradeoffs, and Incidents as Landmarks Sep 20

Fred Hebert · ferd.ca

The Future of AI in SRE: Preventing Failures, Not Fixing Them Jun 1

The New Stack · thenewstack.io

The naked truth about AI-assisted coding Jun 1

Krasimir Tsonev · krasimirtsonev.com

On-Call Is Ruining My Life and Other Tales Jun 1

SREcon25 Americas · youtube.com

Stop Building AI Tools Backwards Jun 1

Hazel Weakly · hazelweakly.me

Another observability 3.0 appears on the horizon Mar 24

Charity Majors · charity.wtf

What Progress In Learning From Incidents Actually Looks Like Feb 28

John Allspaw · adaptivecapacitylabs.com

Observability: the present and future, with Charity Majors Jan 1

Gergely Orosz · newsletter.pragmaticengineer.com

2024

LLMs won't save us Dec 12

Niall Murphy · blog.relyabilit.ie

Learning from Major Incidents: The Opportunities We're Missing Jul 22

Nora Jones · pagerduty.com

Generative AI is not going to build your engineering team for you Jun 10

Charity Majors · charity.wtf

Alert on symptoms, not causes Mar 6

Galo Navarro · varoa.net