Conf42 Site Reliability Engineering (SRE) 2025 - Online

- premiere 5PM GMT

AI-Driven Self-Healing Infrastructure: The Next Evolution of SRE

Abstract

Site Reliability Engineering (SRE) has evolved from manual incident response to automated workflows, and AI is now enabling the next major shift: self-healing infrastructure. What if failures could be predicted, prevented, and resolved without human intervention?

Keynote Overview

In this keynote, I will explore:

  • AI-Driven Failure Prediction
    How AI is enabling the prediction, prevention, and resolution of failures in infrastructure.

  • Automated Remediation and Self-Adaptive Environments
    The shift from reactive alert-based monitoring to predictive, self-healing reliability engineering.

  • Real-World Insights
    Case studies and practical examples of how AI is being applied to reduce Mean Time to Recovery (MTTR) and automate resilience.

  • The Implications of AI-Native SRE
    Long-term impacts on engineers and organizations, and how to prepare for this evolution.

As AI continues to transform infrastructure reliability, this talk will outline practical strategies for embracing these advancements and preparing for the next wave of SRE innovation.

...

Vijaybhasker Pagidoju

Lead Site Reliability Engineer @ Centene Corporation



