During a Chaos experiments, the system might withstand the injected failure condition or a weakness might be exposed. This talk will cover what I have learnt after conducting chaos experiments on different types of applications and infrastructure. - Commonly identified scenarios - Best practices to handle the identified weakness - Knowledge sharing with the results from the Chaos experiments
This talk is about a new enterprise SRE adoption framework, named Arctic. Given the growing focus on infrastructure and service/application reliability, more and more enterprises are adopting Site Reliability Engineering (SRE). It will be beneficial for enterprises to use a framework for SRE adoption like Scrum, XP or Kanban that exists for Agile adoption. Without the availability of...
Priority access to all content
Video hallway track
Community chat
Exclusive promotions and giveaways