Stop Grepping, Start Reasoning: Skill-Based Agentic SRE ๐
Codemotion Milan 2026
Status: submitted
Letโs be honest: debugging production at scale is manual labor. ๐ซ At Google, we decided making SREs act like human search engines was a bug, not a feature. Enter the Agentic SRE Extensionโa skill-based framework that lives in your terminal and knows its way around Kubernetes. ๐ In this deep dive, Iโll show you how we use modular skills and MCP to chain diagnostic tools, analyze metric regressions, and execute safe mitigations without "deleted production" anxiety. ๐ฅ Weโll dissect the "Outage Investigator" agent's logic loop, see it draft a technical postmortem in seconds, and discuss why we built this as a portable framework whose skills can be leveraged by any modern AI harness. Youโll leave with the code to wire up your own Kubernetes stack and let the agent's skills do the heavy lifting while you drink espresso. โ๏ธ No fluff, no 101s. Just AI agents, specialized skills, and less toil.
