Conf42: online tech events

Conf42: Site Reliability Engineering (SRE) 2025

What Can We Learn from Formula 1 Incident Management

How can software and SRE teams learn about incident management from Formula 1? This talk will discuss the key takeaways from a real-life incident where Red Bull Racing performed a miraculous repair on Max Verstappen’s car in Hungary, turning a potential disaster into a podium finish.

Ricardo Castro

FanDuel / Blip.pt

Conf42: Site Reliability Engineering (SRE) 2024

SLOs for Event-Based Systems: Navigating the Triad of Availability, Freshness, and Correctness

In the dynamic landscape of event-based systems, ensuring optimal performance is crucial for delivering a seamless user experience. This talk dives into the fascinating world of Service Level Objectives (SLOs) and explores their application in the context of event-driven architectures.

Ricardo Castro

FanDuel

Conf42: Cloud Native 2024

Architecting Resilient Cloud Native Applications: A Practical Guide to Deployment and Runtime Patterns

In cloud-native applications, resilience is paramount. This talk delves into a comprehensive toolkit of deployment and runtime patterns, equipping you with the knowledge to design and implement resilient systems that can withstand and gracefully recover from disruptions.

Ricardo Castro

FanDuel / Blip

Conf42: Site Reliability Engineering 2023

Overcoming SRE Anti-Pattern Roadblocks: Rebranding the Operations Team

This talk will detail the anti-pattern where traditional operations are rebranded as “SRE” but everything else stays nearly the same. The same tools, processes, and interactions persist. SRE is seen as a mandate without any real change. In practice, the only change is the team’s renaming.

Ricardo Castro

FanDuel/Blip.pt

Conf42: Cloud Native 2023

CI/CD > Build/Deploy Automation

Many organizations set up an automation server, build a few pipelines and advertise they’re doing CI/CD, failing to capitalize on its real value.

Ricardo Castro

FanDuel/Blip.pt

Conf42: DevOps 2023

Baking in Reliability

If you were tasked with implementing “reliability engineering” how would you approach? The last few years saw a log of interest in Site Reliability Engineering (SRE). Ever since Google publish the first SRE book (https://sre.google/books/) there was an “explosion” in the interest on the subject. Many saw it as an approach to implement DevOps. Others said it was a...

Ricardo Castro

FanDuel / Blip.pt

Conf42: Incident Management 2022

Relia...bility?

Technology ecosystems are complex and it is really important to understand every change and how it affects our systems, as well as the service provided. Users expect systems to be up, responsive, fast, consistent, and reliable. Reliability for systems means that they are doing what their users need them to do. A system's reliability is essentially how happy users are and we know those happy...

Ricardo Castro

Anova

Conf42: Site Reliability Engineering 2022

Alerting on SLOs and Error Budget Policies

Assessing your system's reliability through SLOs is a great way to really understand and measure how happy users are with your service(s). Error Budgets give you the amount of reliability you have left before users are unhappy. Ideally, you want to be alerted way before users are dissatisfied and take the appropriate measures to ensure they aren't. How can you achieve that? That's where...

Ricardo Castro

Anova

Conf42: Site Reliability Engineering 2021

GitOps: yea or nay?

GitOps is a paradigm or a set of practices that empowers developers to perform tasks which typically (only) fall under the purview of operations. It’s a way to do Kubernetes cluster management and application delivery by using Git as a single source of truth for declarative infrastructure and applications. Being Git at the center of delivery pipelines, engineers use familiar tools to make pull...

Ricardo Castro

FARFETCH

Newsletter

$ 0 /mo

Event notifications, weekly newsletter

Delayed access to all content

Immediate access to Keynotes & Panels

Email address

First Name

Last Name

Company

Job Title

Phone Number

Country

Community

$ 8.34 /mo

Access to Circle community platform

Immediate access to all content

Live events!

Regular office hours, Q&As, CV reviews

Courses, quizes & certificates

Community chats

Join the community (7 day free trial)

Ricardo Castro

Conf42 Speaker profile

Conf42: Site Reliability Engineering (SRE) 2025

What Can We Learn from Formula 1 Incident Management

Ricardo Castro

Conf42: Site Reliability Engineering (SRE) 2024

SLOs for Event-Based Systems: Navigating the Triad of Availability, Freshness, and Correctness

Ricardo Castro

Conf42: Cloud Native 2024

Architecting Resilient Cloud Native Applications: A Practical Guide to Deployment and Runtime Patterns

Ricardo Castro

Conf42: Site Reliability Engineering 2023

Overcoming SRE Anti-Pattern Roadblocks: Rebranding the Operations Team

Ricardo Castro