April 17 2025 - premiere 5PM GMT

Zero Downtime ML Deployments: SRE Techniques for Seamless Reliability

Abstract

Ensuring zero-downtime ML deployments is a challenge for SREs. Traditional observability falls short for ML at scale. This talk explores the ML-SRE gap, breaking down systems and introducing key techniques to enhance observability, monitor performance, and ensure seamless, reliable deployments.

See all 109 talks at this event!

Join the community!

Learn for free, join the best tech learning community for a price of a pumpkin latte.

Newsletter

$ 0 /mo

Event notifications, weekly newsletter

Delayed access to all content

Immediate access to Keynotes & Panels

Email address

First Name

Last Name

Company

Job Title

Phone Number

Country

Community

$ 8.34 /mo

Access to Circle community platform

Immediate access to all content

Live events!

Regular office hours, Q&As, CV reviews

Courses, quizes & certificates

Community chats

Join the community (7 day free trial)

Conf42 Site Reliability Engineering (SRE) 2025 - Online

April 17 2025 - premiere 5PM GMT

Zero Downtime ML Deployments: SRE Techniques for Seamless Reliability

Abstract

Payal Godhani

Principal Engineer @ Oracle Cloud Infrastructure

Join the community!

Featured event

2025

2024

Info

Conf42 Site Reliability Engineering (SRE) 2025 - Online

April 17 2025 - premiere 5PM GMT

Zero Downtime ML Deployments: SRE Techniques for Seamless Reliability

Abstract

Payal Godhani

Principal Engineer @ Oracle Cloud Infrastructure

Join the community!