O’Reilly – Infrastructure and Ops Hour Sre With Tammy Butow

O’Reilly – Infrastructure and Ops Hour Sre With Tammy Butow-iLLiTERATE
English | Size: 915.89 MB
Category: Tutorial


Join us for a special conversation on site reliability engineering with Sam Newman and Gremlin principal SRE Tammy Butow.

Cloud Academy – SRE Reducing Toil-STM

Cloud Academy – SRE Reducing Toil-STM
English | Size: 373.96 MB
Category: Tutorial


This course looks at what toil is, and why having less of it is a good thing. Toil, as quoted here by Google, is “the kind of work tied to running a production service that tends to be manual, repetitive, automatable, tactical devoid of enduring value, and that scales linearly as the service grows.” By the end of this course, you will have a clear understanding of what toil is, how to recognize it and how to address and replace it with automation

Cloud Academy – SRE Monitoring and Service Level Indicators

Cloud Academy – SRE Monitoring and Service Level Indicators-STM
English | Size: 481.78 MB
Category: Tutorial


This course explores the subject of monitoring and service Level Indicators and how both work together to allow you to measure and track whether stated service level objectives are being met or not. By the end of this course, you’ll have a clear understanding of monitoring and SLIs, and how to apply both of them correctly within your own organization

Cloud Academy – SRE Tools and Automation

Cloud Academy – SRE Tools and Automation-STM
English | Size: 372.81 MB
Category: Tutorial


This course delves into the subject of tools and automation within site reliability engineering (SRE). Automation is carried out in SRE to solve practical problems typically those identified as toil. And having the right tools for the right job is important when performing SRE By the end of this course, you’ll have a clear understanding of the available tools and practices and how to apply each to a particular SRE automation requirement If you have any feedback relating to this course,

Cloud Academy – SRE Principles and Practices

Cloud Academy – SRE Principles and Practices-STM
English | Size: 217.59 MB
Category: Tutorial


Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems. This course introduces you to Site Reliability Engineering and takes you through its important features. It’ll answer the fundamental question “What is Site Reliability Engineering?” before moving onto explaining the key differences between SRE and DevOps, and then finish by reviewing SRE principles and practices