dockerparis/datascience-dockerized


Datascience-dockerized
======================

Remote Hackathon

A data science environment with IPython Notebooks and a Spark cluster, via Docker.

Spins up a container with Spark installed and an IPython Notebook server.

Clone the repository.

Go into pyspark-docker.

Build your Docker image: **sh pyspark-docker/build-images**

Run the container with the notebook server:

  **docker run -d -p 8888:8888 samklr/pyspark-notebook ipython notebook --profile=pyspark**
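If port 8888 is already taken on the host, the same command can be run with a different host port. The following is a minimal sketch, not part of this repository: the helper names `notebook_cmd` and `start_notebook` are hypothetical, and only the image name and the `docker run` arguments come from the command above.

```shell
#!/bin/sh
# Hypothetical helpers (not shipped with this repository).

# Image name as used in the README.
IMAGE="samklr/pyspark-notebook"

# Print the README's `docker run` command, with the host port as an
# optional first argument (defaults to 8888). The container port stays 8888.
notebook_cmd() {
    printf 'docker run -d -p %s:8888 %s ipython notebook --profile=pyspark\n' \
        "${1:-8888}" "$IMAGE"
}

# Print the command, then run it. Requires Docker and the built image.
start_notebook() {
    cmd="$(notebook_cmd "${1:-8888}")"
    echo "Running: $cmd"
    eval "$cmd"
}

# Example (uncomment to start the notebook server on host port 9999):
# start_notebook 9999
```

Calling `notebook_cmd` on its own only prints the command, which makes it easy to check the port mapping before starting anything.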

If you just want to play with Spark from the command line:

  **sudo docker run -i -t samklr/pyspark-notebook pyspark**

  **sudo docker run -i -t samklr/pyspark-notebook spark-shell**

Head to your browser: *http://[your_IP_address or localhost]:8888*
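The notebook server may take a few seconds to come up after the container starts. As a rough sketch, assuming `curl` is installed on the host, the address can be polled before opening the browser. The function names `notebook_url` and `wait_for_notebook` are hypothetical and not part of this repository.

```shell
#!/bin/sh
# Hypothetical helpers (not shipped with this repository). Assumes curl.

# Build the notebook URL from a host and port (defaults: localhost, 8888).
notebook_url() {
    printf 'http://%s:%s\n' "${1:-localhost}" "${2:-8888}"
}

# Poll the URL once per second until the server sends any HTTP response
# or the attempts run out (third argument, default 10).
# Returns 0 if the server answered, 1 otherwise.
wait_for_notebook() {
    url="$(notebook_url "${1:-localhost}" "${2:-8888}")"
    attempts="${3:-10}"
    i=0
    while [ "$i" -lt "$attempts" ]; do
        if curl -s -o /dev/null --max-time 2 "$url"; then
            echo "Notebook server is up at $url"
            return 0
        fi
        i=$((i + 1))
        sleep 1
    done
    echo "No response from $url after $attempts attempts" >&2
    return 1
}

# Example: wait_for_notebook localhost 8888 30
```

If the container runs on a remote machine, pass that machine's IP address as the first argument instead of `localhost`.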

Next: Full web app with embedded notebooks and control of environments.

*Support for Scala via iscala and Andy's spark notebook*

*Full support for multiple clusters via Kubernetes (work in progress)*

*Deploy script to Mesos/Marathon*

Sam Bessalah

@samklr
