Monitoring and Alerting with Prometheus
Become a DevOps monitoring expert using Prometheus and Grafana, monitor your infrastructure and applications as a pro.
AWS MasterClass: Monitoring and DevOps with AWS CloudWatch
AWS Master Class – Master Monitoring and Alerting Services in Amazon Cloud Using AWS CloudWatch & SNS for DevOps and Ops
Site Reliability Engineering: Measuring and Managing Reliability
Service level indicators (SLIs) and service level objectives (SLOs) are fundamental tools for measuring and managing reliability. In this course, students learn approaches for devising appropriate SLIs and SLOs and managing reliability through the use of an error budget.