Intermediate · ~24 hours

Modern Data Stack in a Box

Set up the full modern data stack locally: dbt + DuckDB or Snowflake, Airflow, Metabase. The dominant 2026 stack.

dbt, DuckDB or Snowflake, Apache Airflow, Metabase, Python

About this project

Most data engineering tutorials are 5+ years old. This project teaches the actual 2026 stack: dbt for transformations, DuckDB (or the Snowflake free tier) as the warehouse, Airflow for orchestration, and Metabase for BI. You will build a real pipeline: scrape or fetch a public dataset (Strava, weather, stocks), transform it with dbt, schedule it with Airflow, and visualize it in Metabase.
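
To make the shape of that pipeline concrete, here is a minimal sketch of the load and transform stages using only the Python standard library. It rests on several assumptions that are not part of the project brief: the sample rows, the table names (raw_readings, city_avg_temp), and the function names are all hypothetical, and SQLite stands in for DuckDB purely so the sketch runs without extra installs. In the real project the SELECT would live in a dbt model, the warehouse would be DuckDB or Snowflake, and Airflow would call the steps on a schedule.

```python
import sqlite3

# Hypothetical sample rows standing in for a fetched public dataset (e.g. weather).
RAW_READINGS = [
    ("2026-01-01", "Oslo", -3.0),
    ("2026-01-01", "Lisbon", 14.0),
    ("2026-01-02", "Oslo", -5.0),
    ("2026-01-02", "Lisbon", 16.0),
]


def load_raw(conn, rows):
    """Load step: land the raw rows in a staging table."""
    conn.execute("CREATE TABLE raw_readings (day TEXT, city TEXT, temp_c REAL)")
    conn.executemany("INSERT INTO raw_readings VALUES (?, ?, ?)", rows)


def transform(conn):
    """Transform step: the kind of SELECT a dbt model would hold."""
    conn.execute(
        """
        CREATE TABLE city_avg_temp AS
        SELECT city, AVG(temp_c) AS avg_temp_c, COUNT(*) AS n_readings
        FROM raw_readings
        GROUP BY city
        """
    )


def run_pipeline(rows):
    """Run load then transform, and return the modeled table (what a BI tool would read)."""
    conn = sqlite3.connect(":memory:")  # SQLite as a stand-in for the warehouse
    load_raw(conn, rows)
    transform(conn)
    result = conn.execute(
        "SELECT city, avg_temp_c, n_readings FROM city_avg_temp ORDER BY city"
    ).fetchall()
    conn.close()
    return result


if __name__ == "__main__":
    for row in run_pipeline(RAW_READINGS):
        print(row)
```

Keeping load and transform as separate functions mirrors how the stack splits responsibilities: each function maps roughly to one task an orchestrator could schedule and retry independently.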

Why build this in 2026?

dbt + warehouse + orchestrator is the dominant 2026 data stack, and a data engineering interview is likely to probe it.

What you'll ship

  • GitHub repo with dbt project + Airflow DAGs
  • README with architecture diagram
  • Metabase dashboard screenshots

Skills you'll practice

SQL, Python, dbt, Airflow