Skip to content

tanega/data-duck-pond

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

The Duck Pond

Birds of New York(1910).Art by Louis Agassiz Fuertes

experimental

Introduction

A experimental poor man's datalake made with Dagster for data orchestration, DuckDB as an high-performance analytical database system, Metabase for data vizualisation and MinIO for persistent data storage.

Everything is shipped as Docker containers with Traefik as a reverse-proxy.

Prerequisites

  • Having Docker ( v20.10 or latest) installed on your system
  • Active ** Python environment** (ideally Python 3.9.8 for consistency) with Poetry installed (for more information, see Dagster Container README)
  • Install NPM and Node to run launch scripts; See these instructions

Getting started locally

Build and run containers

    # Build images
    npm run d:build
    # Start containers with detach daemon
    npm run d:up-d
    # Alternatively keep them attached
    npm run d:up
    # When you're done with, shut down...
    npm run d:down
    # ...remove volumes if needed
    npm run d:down:all

Alternatively, you can run Docker scripts directly

    # make scripts/docker.sh executable
    chmod u+x ./scripts/docker.sh
    # list all running composed containers
    ./scripts/docker.sh ps
    # build images
    ./scripts/docker.sh build
    # launch containers
    ./scripts/docker.sh up -d
    # remove containers and network
    ./scripts/docker.sh down
    # ...and so on!

Take a look at the running services

Contributing

Contributions guidelines will be posted soon!

Credits

  • Cover image from Eaton, Elon Howard. Birds of New York. pt. 1 (1910). Art by Louis Agassiz Fuertes. Contributed in BHL from Eaton, Elon Howard. Birds of New York. pt. 1 (1910). Art by Louis Agassiz Fuertes. Contributed in BHL from Gerstein Science Information Centre (https://s.si.edu/2LlwjIL)
  • Special thanks to MileTwo for his Docker multi stage build template for Dagster.
  • Every building blocks of this repo are Open Source projects. s/o to every contributors involved !