Skip to content

Spark Cluster provisioned using docker compose. Purpose is to test Neo4j GDS + Spark

Notifications You must be signed in to change notification settings

runfourestrun/Spark-Docker-Compose

Repository files navigation

Spark Infrastructure provisioned using docker-compose

Helpful Commands:

  • docker build -t cluster-apache-spark:3.0.2 .

  • docker-compose up -d

  • psql -U postgres -h 0.0.0.0 -p 5432

  • Spark Master Port: 9090

  • Spark Worker1 Port: 9091

  • Spark Worker2 Port: 909

Need to run this command:

  • pyspark --packages graphframes:graphframes-0.8.1-spark2.4-s_2.11 --jars graphframes-0.8.1-spark2.4-s_2.11.jar

Example pyspark submit

  • /opt/spark/bin/spark-submit --master spark://spark-master:7077
    --driver-memory 1G
    --executor-memory 1G
    /opt/spark-apps/main.py

About

Spark Cluster provisioned using docker compose. Purpose is to test Neo4j GDS + Spark

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published