Skip to main content
Intermediate ~14 hours

SLOs and Error Budgets for a Real Service

Define and implement SLOs + error budgets for an existing service. Drive a real reliability conversation.

Prometheus or DatadogGrafanaService mesh or proxy for SLI collection

About this project

SLOs are the SRE lingua franca. This project teaches: SLI selection (what to measure), SLO target setting (with stakeholder buy-in), burn-rate alerting, and the error-budget policy that drives release decisions. Pick a service (yours or open-source) and implement the full SLO setup.

Why build this in 2026?

SLO discipline is what separates senior SREs from juniors. Most candidates can't articulate the math.

What you'll ship

  • SLO document (1-2 pages)
Live dashboards for each SLO
Burn-rate alert rules

Sign up to see the full project brief

Full deliverables, success criteria, and AI Career Tutor support — free.

You'll unlock:Complete project brief, AI tutor that knows this project, and progress tracking when you start.

Skills you'll practice

monitoringincident responsedistributed systems