

Monitoring and Alerting with Prometheus
Paid Course
https://www.udemy.com/course/monitoring-and-alerting-with-prometheus/
Become a DevOps monitoring expert using Prometheus and Grafana, monitor your infrastructure and applications as a pro.

AWS MasterClass: Monitoring and DevOps with AWS CloudWatch
Paid Course
https://www.udemy.com/course/aws-monitoring-alerting-with-aws-cloudwatch-and-aws-sns/
AWS Master Class – Master Monitoring and Alerting Services in Amazon Cloud Using AWS CloudWatch & SNS for DevOps and Ops

Site Reliability Engineering: Measuring and Managing Reliability
Free Course
https://www.coursera.org/learn/site-reliability-engineering-slos
Service level indicators (SLIs) and service level objectives (SLOs) are fundamental tools for measuring and managing reliability. In this course, students learn approaches for devising appropriate SLIs and SLOs and managing reliability through the use of an error budget.