Conf42 Site Reliability Engineering (SRE) 2025 - Online

- premiere 5PM GMT

AI and Chaos Engineering: Smarter Failure Testing for Resilient Systems

Abstract

Modern cloud systems are becoming increasingly complex, making traditional failure testing methods inefficient and reactive. AI-driven Chaos Engineering introduces automation, intelligence, and adaptability to fault injection, enabling predictive failure detection and self-healing capabilities. By leveraging machine learning, SREs can identify failure patterns, optimize chaos experiments dynamically, and accelerate incident response. This talk explores how AI enhances Chaos Engineering, reducing downtime, improving resilience, and enabling proactive reliability strategies. Attendees will gain insights into real-world implementations of AI-powered failure testing and how to integrate it into their SRE practices.

...

Rahul Amte

Senior Cloud Engineer @ Nivid Technologies

Rahul Amte's LinkedIn account



Join the community!

Learn for free, join the best tech learning community for a price of a pumpkin latte.

Annual
Monthly
Newsletter
$ 0 /mo

Event notifications, weekly newsletter

Delayed access to all content

Immediate access to Keynotes & Panels

Community
$ 8.34 /mo

Immediate access to all content

Courses, quizes & certificates

Community chats

Join the community (7 day free trial)