Not Only Cloud (Services) Data Stack, for maximum productivity and reproducibility in local and cloud environments

ousmaneTra/nocloud-data-stack

Not Only Cloud Data Stack

Requirements

  • docker
  • Linux shell (you can use WSL/Debian on Windows)
  • make

Components

Data warehouse: Clickhouse
Data integration, orchestrator: Mage
Datalake: Minio
Dataviz: Apache Superset
Data Exploration/Science: JupyterLab

Configurations

Set environment variables: choose the sensitive values carefully

$ cp .env.template .env && vim .env # set your env values
$ make .env.make && source .env.make # load env into current shell
$ cp ./superset/$STACK_ENV/.env.template ./superset/$STACK_ENV/.env && vim ./superset/$STACK_ENV/.env # edit superset env
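
If a variable is missing after sourcing .env.make, paths such as ./superset/$STACK_ENV/.env expand to something unintended. The helper below is a minimal sketch for catching that early. It is not part of this repository: `require_vars` is a hypothetical name, and STACK_ENV is the only variable name taken from the commands above.

```shell
# Hypothetical helper (not shipped with the stack): report which of the
# named variables are unset or empty. Returns 0 if all are set, 1 otherwise.
require_vars() {
  missing=0
  for name in "$@"; do
    # Indirect lookup that also works in plain POSIX sh.
    eval "value=\${$name:-}"
    if [ -z "$value" ]; then
      echo "missing: $name" >&2
      missing=1
    fi
  done
  return "$missing"
}

# Usage, after `source .env.make`:
#   require_vars STACK_ENV && echo "environment looks loaded"
```

Pass any other variable names from your .env file as extra arguments.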

Volumes: in production, it is recommended to store data on separate disks

$ cp volumes.template.yaml volumes-$STACK_ENV.yaml
$ edit volumes-$STACK_ENV.yaml # set 'device' path on volumes
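
Before starting the stack, it can help to confirm that every 'device' path in the volumes file points to a directory that exists. The sketch below assumes each entry is written on its own line as `device: /some/path` with no trailing comment. That layout is an assumption about the template, and `list_devices` and `check_devices` are hypothetical helpers, not part of this repository.

```shell
# Hypothetical helpers. Assumes lines of the form "device: /some/path".

# Print every 'device:' value found in the given file, one per line,
# with surrounding quotes removed.
list_devices() {
  sed -n 's/^[[:space:]]*device:[[:space:]]*//p' "$1" | tr -d "\"'"
}

# Return 1 and list the offending paths if any device path is not a directory.
check_devices() {
  missing=$(list_devices "$1" | while IFS= read -r path; do
    [ -d "$path" ] || echo "$path"
  done)
  if [ -n "$missing" ]; then
    echo "missing directories:" >&2
    echo "$missing" >&2
    return 1
  fi
}

# Usage:
#   check_devices "volumes-$STACK_ENV.yaml"
```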

Verify the final configuration in final_compose.yaml

$ make verify # then open final_compose.yaml to verify

Run the stack

$ make # or make start :)

Stop the stack

$ make stop

Note: the folder ./superset/docker is copied from https://github.com/apache/superset/tree/master/docker

Have Fun!

Clickhouse: http://localhost:8123/play

Mage: http://localhost:6789/pipelines

Minio: http://localhost:9090/login

Superset: http://localhost:8088/login/

JupyterLab: http://localhost:8888/ (get the token from the container logs in Docker)
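
Jupyter normally prints a login URL containing `token=<hex string>` when it starts, so the token can be pulled out of the container logs with a small filter. This is a sketch under that assumption: `extract_token` is a hypothetical helper, and the container name has to be looked up with `docker ps` because this README does not state it.

```shell
# Hypothetical helper: read log lines on stdin and print the first
# Jupyter token found (the hex string after "token=").
extract_token() {
  sed -n 's/.*token=\([0-9a-f][0-9a-f]*\).*/\1/p' | head -n 1
}

# Usage (replace <container> with the JupyterLab container name from `docker ps`):
#   docker logs <container> 2>&1 | extract_token
```

The `2>&1` matters because Jupyter writes its startup messages to stderr.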

Optional: Expose Superset

We can securely expose a given service using Nginx + Certbot + Let's Encrypt.

Example for Superset:

$ vim .env # set DOMAIN_NAME and CERT_EMAIL
$ make .env.make && source .env.make # load env into current shell
$ make nginx_init
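
A certificate request can fail if DOMAIN_NAME or CERT_EMAIL is empty or malformed, so a quick sanity check before `make nginx_init` may save a retry. The sketch below is not part of this repository: `nginx_preflight` is a hypothetical name, and the rule that DOMAIN_NAME should be a bare host name (no scheme or path) is an assumption about how the Makefile uses it.

```shell
# Hypothetical pre-flight check for the two variables named above.
# Returns 0 if both look usable, 1 otherwise.
nginx_preflight() {
  if [ -z "${DOMAIN_NAME:-}" ]; then
    echo "DOMAIN_NAME is not set" >&2
    return 1
  fi
  if [ -z "${CERT_EMAIL:-}" ]; then
    echo "CERT_EMAIL is not set" >&2
    return 1
  fi
  # Assumption: a bare host name is expected; a scheme or path contains a slash.
  case "$DOMAIN_NAME" in
    */*) echo "DOMAIN_NAME should be a bare host name" >&2; return 1 ;;
  esac
  # Very loose shape check: something@something.something
  case "$CERT_EMAIL" in
    *@*.*) ;;
    *) echo "CERT_EMAIL does not look like an email address" >&2; return 1 ;;
  esac
}

# Usage, after `source .env.make`:
#   nginx_preflight && make nginx_init
```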

Demo: https://demo.graphzit.com/