July 12
02:30pm–03:10pm
Service level objectives and error budgets are the cornerstone of site reliability engineering and a critical tool for organizations to find an appropriate balance between reliability and rates of feature development. In this talk, you will learn from the Google Customer Reliability Engineering team how to set and measure useful service level indicators and objectives for needs ranging from interactive, latency-sensitive, query-based systems to batch throughput-oriented systems. You will learn how to set high-signal-to-noise-ratio alerting based on the error budget, and how to make longer-term changes to development priorities if your budget is overspent or underspent.