Practical ways to implement Site Reliability Engineering
Notes:
"Companion to the bestselling SRE book"--Cover.
Includes bibliographical references and index.
Contents:
How SRE relates to DevOps
Part 1. Foundations: Implementing SLOs; SLO engineering case studies; Monitoring; Alerting on SLOs; Eliminating toil; Simplicity
Part 2. Practices: On-call; Incident response; Postmortem culture : learning from failure; Managing load; Introducing non-abstract large system design; Data processing pipelines; Configuration design and best practices; Configuration specifics; Canarying releases
Part 3. Processes: Identifying and recovering from overload; SRE engagement model; SRE : reaching beyond your walls; SRE team lifecycles; Organizational change management in SRE
Example SLO document; Example error budget policy; Results of postmortem analysis