SRE Principles and Practices
Learn what SRE is, explore differences between SRE and DevOps, and discover key SRE principles and practices.
Service Level Objectives and Error Budgets
Gain a deeper understanding of service level objectives (SLOs), error budgets, and error budget policies.
Reducing Toil
Understand the negative effects of toil and explore human and organisational opportunities to diminish it.
Monitoring and Service Level Indicators
Discover Service Level Indicators (SLIs) and how they relate to Service Level Objectives (SLOs), the monitoring landscape, observability and setting measurable service objectives.
SRE Tools and Automation
Learn what automation is, explore DevOps and SRE automation focus, discover types of SRE automation, automation for security, progressive deployments, AIOps, VSM Platforms, Platform Engineering, Generative AI use cases, and gain an overview of the tooling landscape.
Anti-Fragility and Learning from Failure
Discover the benefits of learning from failure, explore the concepts of anti-fragility, chaos engineering, and shifting organizational balance.
Organisational Impact of SRE
Learn the benefits of embracing SRE for organisations, explore SRE adoption patterns, the organisational impact of SRE, sustainable incident response, blameless post-mortems and scaling SRE.
SRE, Other Frameworks, The Future
Explore SRE, DevOps, and related frameworks, including Agile, ITSM, VSM, and Platform Engineering. Learn about the evolution of SRE with spinoffs such as Network Reliability Engineering and Customer Reliability Engineering.
Download the Certification Blueprint to learn more about the various topics, principles and practices covered in this certification.