
A Big Data Analytics VM for doing Data Science. It provides a huge kickstart to those working with the Big Data Analytics side of Data Science. Essentially, this project automates the creation of the Big Data Scientist's toolbox on a virtual machine (VM). In a few minutes one can begin working with a fully configured data science lab instead of …


rickfarmer/data-science-vm


Data Science VM

## Install the required Vagrant plugins

The following Vagrant plugins need to be installed:

vagrant plugin install vagrant-omnibus

vagrant plugin install vagrant-env
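The plugin installation can be made repeatable by skipping plugins that are already present. The following is a minimal sketch, assuming `vagrant plugin list` prints one plugin per line in the form `name (version, ...)`. The function name `missing_plugins` is hypothetical and is not part of this repository.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repository): reads the output of
# `vagrant plugin list` on stdin and prints each requested plugin that is absent.
missing_plugins() {
  installed=$(cat)
  for plugin in "$@"; do
    # Assumes each installed plugin appears as "name (version, ...)" at line start.
    if ! printf '%s\n' "$installed" | grep -q "^${plugin} "; then
      printf '%s\n' "$plugin"
    fi
  done
}

# Usage (requires Vagrant on PATH):
#   vagrant plugin list | missing_plugins vagrant-omnibus vagrant-env |
#     while read -r p; do vagrant plugin install "$p" < /dev/null; done
```

Reading the plugin list from stdin keeps the check separate from the Vagrant call, so the helper can be exercised with sample text on a machine that does not have Vagrant installed.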

Users

Accounts are listed as username/password:

- root/vagrant
- joe/joe
- chuck/chuck
- cloudera/cloudera

Hive Embedded DB

- Database: PostgreSQL
- Host: e63:7432
- DB name: hive
- Username: hive
- Password: 8xlpmpA6NE
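To inspect the Hive metastore database directly, the settings above can be combined into a PostgreSQL connection URI. This is a sketch under stated assumptions: the `psql` client is available, the hostname `e63` resolves from where the command is run, and the function name `hive_db_uri` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: builds a libpq connection URI from the settings listed above.
# Arguments (all optional): host, port, database name, username.
hive_db_uri() {
  host=${1:-e63}
  port=${2:-7432}
  db=${3:-hive}
  user=${4:-hive}
  printf 'postgresql://%s@%s:%s/%s\n' "$user" "$host" "$port" "$db"
}

# Usage (assumes psql is installed; the password is supplied through the
# PGPASSWORD environment variable rather than on the command line):
#   PGPASSWORD='<password>' psql "$(hive_db_uri)" -c '\dt'
```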

Hue

- URL: http://e63:8888/
- Login: hdfs/hdfs

Test the Cluster

sudo -u hdfs hadoop jar /opt/cloudera/parcels/CDH/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar pi 10 100
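In the command above, `pi 10 100` runs the bundled pi estimator with 10 map tasks and 100 samples per map. The check can be scripted by parsing the job's output. The sketch below assumes the example prints a line of the form `Estimated value of Pi is <number>` on standard output. The function names `extract_pi_estimate` and `pi_looks_sane` are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: reads the job's output on stdin and prints the estimate.
# Assumes a result line of the form "Estimated value of Pi is <number>".
extract_pi_estimate() {
  sed -n 's/^Estimated value of Pi is \([0-9.]*\)$/\1/p'
}

# Hypothetical check: succeeds only when the given estimate starts with "3.".
pi_looks_sane() {
  case "$1" in
    3.*) return 0 ;;
    *)   return 1 ;;
  esac
}

# Usage (run on the VM):
#   est=$(sudo -u hdfs hadoop jar \
#     /opt/cloudera/parcels/CDH/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar \
#     pi 10 100 | extract_pi_estimate)
#   pi_looks_sane "$est" && echo "cluster OK: pi is approximately $est"
```

With so few samples the estimate is coarse, so the check only confirms that the job completed and produced a plausible value.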
