
cpisani47/trusted-data-pipeline

 
 


trusted-data-pipeline

This project is the demo for my talk, Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb, at PyCon Australia 2023.

The slides for the workshop are available at this link.

Data pipelines and architecture

Dagster global assets

Set up the Python environment

git clone git@github.com:danhphan/trusted-data-pipeline.git
cd trusted-data-pipeline
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
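
The environment setup above can also be scripted with Python's standard library, which may be convenient in CI or on machines where sourcing an activate script is awkward. This is a minimal sketch, not part of the repository: the helper name `create_env` is hypothetical, and it assumes the `venv` directory name and `requirements.txt` file used in the commands above.

```python
from __future__ import annotations

import subprocess
import sys
import venv
from pathlib import Path


def create_env(
    env_dir: Path,
    requirements: Path | None = None,
    with_pip: bool = True,
) -> Path:
    """Create a virtual environment and optionally install requirements.

    Hypothetical helper. Returns the path to the environment's interpreter.
    """
    # Equivalent to `python3 -m venv <env_dir>`.
    venv.create(env_dir, with_pip=with_pip)

    # The interpreter lives in Scripts\ on Windows and bin/ elsewhere.
    if sys.platform == "win32":
        python = env_dir / "Scripts" / "python.exe"
    else:
        python = env_dir / "bin" / "python"

    if requirements is not None:
        # Calling pip through the environment's own interpreter has the same
        # effect as activating the environment and running `pip install -r`.
        subprocess.run(
            [str(python), "-m", "pip", "install", "-r", str(requirements)],
            check=True,
        )
    return python
```

From the repository root, `create_env(Path("venv"), Path("requirements.txt"))` would mirror the three shell commands above.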

Set up dbt

cd ./dbt_jaffle_shop/
dbt deps
dbt build

Run and test dbt models

dbt run
dbt test
dbt test --select customers
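
To run these steps unattended, the commands can be chained so that a failure stops the sequence. The following is a sketch under stated assumptions: `run_dbt_steps` and `DBT_STEPS` are hypothetical names, the `dbt` executable is assumed to be on `PATH`, and `project_dir` is assumed to point at `dbt_jaffle_shop`. The `runner` parameter exists only so the sequence can be dry-run without dbt installed.

```python
from __future__ import annotations

import subprocess
from pathlib import Path
from typing import Callable, Sequence

# The dbt invocations from the steps above, in order.
DBT_STEPS: list[list[str]] = [
    ["dbt", "run"],
    ["dbt", "test"],
    ["dbt", "test", "--select", "customers"],
]


def run_dbt_steps(
    project_dir: Path,
    steps: Sequence[Sequence[str]] = DBT_STEPS,
    runner: Callable[[Sequence[str]], int] | None = None,
) -> list[list[str]]:
    """Run each command in order and stop at the first non-zero exit code.

    Hypothetical helper. Returns the commands that completed successfully.
    """
    if runner is None:
        # Default: execute the command inside the dbt project directory.
        def runner(cmd: Sequence[str]) -> int:
            return subprocess.run(list(cmd), cwd=project_dir).returncode

    completed: list[list[str]] = []
    for cmd in steps:
        if runner(cmd) != 0:
            raise RuntimeError(f"command failed: {' '.join(cmd)}")
        completed.append(list(cmd))
    return completed
```

Passing `runner=lambda cmd: 0` performs a dry run that only returns the planned commands.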

Generate and view docs

dbt docs generate
dbt docs serve --port 8081

dbt show --inline "select count(*) from {{ ref('stg_customers') }}"

Set up Dagster

cd ./dagster_dbt_jaffle/
dagster dev # or: dagit
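
`dagster dev` keeps running in the foreground and serves the web UI, by default at http://localhost:3000. When the command is started from a script, it can help to wait until the UI accepts connections before opening it. A minimal sketch, assuming the default port is unchanged; `wait_for_port` is a hypothetical helper, not part of Dagster.

```python
import socket
import time


def wait_for_port(
    host: str = "127.0.0.1",
    port: int = 3000,  # assumption: the default `dagster dev` port
    timeout: float = 30.0,
) -> bool:
    """Return True once a TCP connection succeeds, or False on timeout."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            # A successful connect means something is listening on the port.
            with socket.create_connection((host, port), timeout=1.0):
                return True
        except OSError:
            # Not listening yet; retry after a short pause.
            time.sleep(0.5)
    return False
```

This only checks that the port is open, not that the Dagster definitions loaded without errors.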
