AI SRE

Feed

Articles, talks, and writing about AI agents in production.

2026

Fixing Claude with Claude: Anthropic reports on AI SRE Mar 19
The Register · theregister.com
Your Data is Made Powerful By Context Mar 9
Charity Majors · charity.wtf
We Automated Everything Except Knowing What's Going On Mar 2
Kenneth Eversole · eversole.dev
Everyone is a junior engineer in the age of AI Mar 1
Kelsey Hightower · thenewstack.io
Why Your On-Call Engineer Is Your Most Expensive Bottleneck Mar 1
Pranav Shil · medium.com
Practical Considerations for AI Incident Reviews Mar 1
Fischer · fgj.codes
The Picture They Paint of You Feb 23
Fred Hebert · ferd.ca
Building An Elite AI Engineering Culture In 2026 Feb 18
Chris Roth · cjroth.com
Lots of AI SRE, no AI incident management Feb 14
Lorin Hochstein · surfingcomplexity.blog
Are bugs and incidents inevitable with AI coding agents? Jan 28
David Loker · stackoverflow.blog
Bring Back Ops Pride Jan 19
Charity Majors · charity.wtf
Software engineering when the machine writes the code Jan 19
Shayon Mukherjee · shayon.dev
How we built an AI SRE agent that investigates like a team of engineers Jan 12
Daniel Shan, Tristan Ratchford · datadoghq.com
Software Acceleration and Desynchronization Jan 5
Fred Hebert · ferd.ca
Your AI SRE needs better observability, not bigger models Jan 1
ClickHouse · clickhouse.com
Building internal agents Jan 1
Will Larson · lethain.com
Human-Centred AI for SRE: Multi-Agent Incident Response without Full Automation Jan 1
InfoQ · infoq.com
Tribal Knowledge Kills On-Call Jan 1
Angelika Alashwah · medium.com

2025

End-of-Year Observability Retrospective with Charity Majors Dec 1
Horovits · medium.com
Facilitating AI adoption at Imprint Dec 1
Will Larson · lethain.com
AI and the Ironies of Automation Nov 21
Uwe Friedrichsen · ufried.com
Notes from the 2025 'AI Agents in Production' Conference Nov 18
Mark Torres · markptorres.com
From 4 Hours to 8 Minutes with AI Agents That Transform SRE Oct 1
P. Jausovec · usenix.org
Ongoing Tradeoffs, and Incidents as Landmarks Sep 20
Fred Hebert · ferd.ca
The Future of AI in SRE: Preventing Failures, Not Fixing Them Jun 1
The New Stack · thenewstack.io
The naked truth about AI-assisted coding Jun 1
Krasimir Tsonev · krasimirtsonev.com
On-Call Is Ruining My Life and Other Tales Jun 1
SREcon25 Americas · youtube.com
Stop Building AI Tools Backwards Jun 1
Hazel Weakly · hazelweakly.me
Another observability 3.0 appears on the horizon Mar 24
Charity Majors · charity.wtf
What Progress In Learning From Incidents Actually Looks Like Feb 28
John Allspaw · adaptivecapacitylabs.com
Observability: the present and future, with Charity Majors Jan 1
Gergely Orosz · newsletter.pragmaticengineer.com

2024

LLMs won't save us Dec 12
Niall Murphy · blog.relyabilit.ie
Learning from Major Incidents: The Opportunities We're Missing Jul 22
Nora Jones · pagerduty.com
Generative AI is not going to build your engineering team for you Jun 10
Charity Majors · charity.wtf
Alert on symptoms, not causes Mar 6
Galo Navarro · varoa.net