Skip to content

sturc/jupyter_with_yarn

Repository files navigation

Jupyter Notebook with Yarn / Hadoop

Combines a Jupyter Notebook with an Hadoop / Yarn Container.

The Hadoop image is based on https://hub.docker.com/r/harisekhon/hadoop.

As Jupyter Notebook the pyspark notebook is used https://github.com/jupyter/docker-stacks/tree/main/images/pyspark-notebook. Images are hosted on https://quay.io/repository/jupyter/pyspark-notebook

Both containers are connected via a bridge network.

Start, Stop, Delete

Start the Jupyter Notebook and the Hadoop Cluster by executing: docker-compose up

Stop the Jupter Notebook with: docker-compose stop

Delete the Containers with: docker-compose down

Login to Jupyter

Execute one of the following commands to display the required login token:

For the traditional Jupyter Server: docker exec -it notebook jupyter notebook list

For the Jupyter Lab Server: docker exec -it notebook jupyter server list

The notebook is reachable with http://localhost:8888.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Packages

No packages published