As Site Reliability Engineers it is our mission to ensure our services are highly available, secure, and scalable. With hundreds or thousands of different metrics across a (potentially) distributed system that you could monitor and alert on, where do we begin? How do we define what it means for a service to be "healthy"? This lightning talk focuses on the four golden signals of monitoring that...
Priority access to all content
Video hallway track
Community chat
Exclusive promotions and giveaways