In the dynamic landscape of event-based systems, ensuring optimal performance is crucial for delivering a seamless user experience. This talk dives into the fascinating world of Service Level Objectives (SLOs) and explores their application in the context of event-driven architectures.
In cloud-native applications, resilience is paramount. This talk delves into a comprehensive toolkit of deployment and runtime patterns, equipping you with the knowledge to design and implement resilient systems that can withstand and gracefully recover from disruptions.
This talk will detail the anti-pattern where traditional operations are rebranded as “SRE” but everything else stays nearly the same. The same tools, processes, and interactions persist. SRE is seen as a mandate without any real change. In practice, the only change is the team’s renaming.
If you were tasked with implementing “reliability engineering”, how would you approach it? The last few years saw a lot of interest in Site Reliability Engineering (SRE). Ever since Google published the first SRE book (https://sre.google/books/), there has been an “explosion” of interest in the subject. Many saw it as an approach to implement DevOps. Others said it was a...
Technology ecosystems are complex and it is really important to understand every change and how it affects our systems, as well as the service provided. Users expect systems to be up, responsive, fast, consistent, and reliable. Reliability for systems means that they are doing what their users need them to do. A system's reliability is essentially how happy users are and we know those happy...
Assessing your system's reliability through SLOs is a great way to really understand and measure how happy users are with your service(s). Error Budgets give you the amount of reliability you have left before users are unhappy. Ideally, you want to be alerted way before users are dissatisfied and take the appropriate measures to ensure they aren't. How can you achieve that? That's where...
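As a rough illustration of the error budget idea mentioned above, here is a minimal Python sketch. It assumes an event-based SLO (good events divided by total events over a window); the function name `error_budget` and the sample numbers are hypothetical and do not come from the talk.

```python
def error_budget(slo_target: float, total_events: int, bad_events: int) -> dict:
    """Summarize error budget usage for an event-based SLO (illustrative only).

    slo_target   -- target fraction of good events, e.g. 0.999 for "99.9%"
    total_events -- events observed in the SLO window
    bad_events   -- events in that window that failed the SLO
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_events <= 0 or bad_events < 0:
        raise ValueError("total_events must be positive and bad_events non-negative")

    # The budget is the number of bad events the SLO tolerates in the window.
    allowed_bad = (1.0 - slo_target) * total_events
    return {
        "allowed_bad": allowed_bad,
        "remaining": allowed_bad - bad_events,   # negative means the SLO is breached
        "consumed_fraction": bad_events / allowed_bad,
    }


if __name__ == "__main__":
    # Hypothetical window: 1,000,000 requests, 250 failures, 99.9% target.
    print(error_budget(0.999, 1_000_000, 250))
```

With these made-up numbers, roughly a quarter of the budget is consumed. Alerting on how quickly `consumed_fraction` grows, rather than on the SLO breach itself, is one common way to be notified before users become dissatisfied.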
GitOps is a paradigm or a set of practices that empowers developers to perform tasks which typically (only) fall under the purview of operations. It’s a way to do Kubernetes cluster management and application delivery by using Git as a single source of truth for declarative infrastructure and applications. With Git at the center of delivery pipelines, engineers use familiar tools to make pull...