SLOs and Error Budgets for a Real Service
Define and implement SLOs + error budgets for an existing service. Drive a real reliability conversation.
Prometheus or DatadogGrafanaService mesh or proxy for SLI collection
About this project
SLOs are the SRE lingua franca. This project teaches: SLI selection (what to measure), SLO target setting (with stakeholder buy-in), burn-rate alerting, and the error-budget policy that drives release decisions. Pick a service (yours or open-source) and implement the full SLO setup.
Why build this in 2026?
SLO discipline is what separates senior SREs from juniors. Most candidates can't articulate the math.
What you'll ship
- SLO document (1-2 pages)
Live dashboards for each SLO
Burn-rate alert rules
Sign up to see the full project brief
Full deliverables, success criteria, and AI Career Tutor support — free.
You'll unlock:Complete project brief, AI tutor that knows this project, and progress tracking when you start.
Skills you'll practice
monitoringincident responsedistributed systems