kafka-spark-cassandra

Chef repository to install/config/execute the following servers:

Apache Kafka
Apache Spark
Spark Jobserver
Apache Cassandra

Cookbooks

This repository contains the following cookbooks (some are just wrappers):

##java-wrapper Wrapper of the java cookbook to install Java 8.

##scala-wrapper Wrapper of the scala cookbook to install Scala 2.11.7.

##zookeeper-wrapper Wrapper of the zookeeper-cluster cookbook to install Zookeeper 3.5.0.

##kafka Simplified version of the apache-kafka cookbook to install Kafka 0.8.2.1.

##spark Cookbook to install Spark 1.4.1.

##spark-jobserver Cookbook to install Spark Jobserver version 0.5.2.

##cassandra Cookbook to install the Cassandra from the datastax stable release.

Databags

`zookeeper.json`

Contains all the zookeeper nodes ip. We assume they are also Kafka brokers.

`spark.json`

Constains the spark master ip and the workers ip.

`cassandra.json`

Constains the cassandra seed servers.

Roles

`Basic Server`

Base role for all the cluster nodes.

Installed Cookbooks

java_wrapper

Sample

/nodes/<server-ip>.json

{
  "run_list": [
     "role[basic-server]"
  ],
  "automatic": {
    "ipaddress": "<server-ip>"
  }
}

`Zookeeper Cluster`

Role for all the zookeeper nodes.

Installed Cookbooks

java_wrapper
zookeeper_wrapper

Sample

/nodes/<server-ip>.json

{
  "run_list": [
	"role[zookeeper-cluster]"
  ],
  "automatic": {
    "ipaddress": "<server-ip>"
  }
}

/data_bags/zookeeper.json

{
  "id": "zookeeper",
  "nodes": [
    "<server-ip>"
  ]
}

Note: You can change the databag name, and then just override the attribute default[zookeeper-cluster][databag] with the new name.

`Kafka Cluster`

Role for all the Kafka brokers.

Installed Cookbooks

java_wrapper
zookeeper_wrapper
kafka

Sample

/nodes/<server-ip>.json

{
  "run_list": [
	 "role[kafka-cluster]"
  ],
  "automatic": {
    "ipaddress": "<server-ip>"
  },
  "apache_kafka": {
    "broker.id": 0
  }  
}

Note:

The broker-id must be different for each of the cluster brokers.
For now we assume that the kafka brokers are the same as the zookeeper nodes, so, we are using the zookeeper databag.

`Spark Master`

Role for the spark master.

Installed Cookbooks

java_wrapper
scala_wrapper
spark

Sample

/nodes/<server-ip>.json

{
  "run_list": [
  	"role[spark-master]"
  ],
  "automatic": {
    "ipaddress": "<server-ip>"
  }
}

/data_bags/spark.json

{
  "id": "spark",
  "master": "<server-ip>"
}

`Spark Worker`

Role for all the spark workers.

Installed Cookbooks

java_wrapper
scala_wrapper
spark

Sample

/nodes/<server-ip>.json

{
  "run_list": [
  	"role[spark-worker]"
  ],
  "automatic": {
    "ipaddress": "<server-ip>"
  }
}

/data_bags/spark.json

{
  "id": "spark",
  "master": "<server-ip>"
}

`Spark Jobserver`

Role for the spark jobserver.

Installed Cookbooks

java_wrapper
scala_wrapper
spark_jobserver

Sample

/nodes/<server-ip>.json

{
  "run_list": [
  	"role[spark-jobserver]"
  ],
  "automatic": {
    "ipaddress": "<server-ip>"
  }
}

`Cassandra`

Role for all the cassandra nodes.

Installed Cookbooks

java_wrapper
monit_wrapper
cassandra

Sample

/nodes/<server-ip>.json

{
  "run_list": [
  	"role[cassandra-cluster]"
  ],
  "automatic": {
    "ipaddress": "<server-ip>"
  }
}

/data_bags/cassandra.conf

{
  "id": "cassandra",
  "seeds": [
    "<server-ip>"
  ]
}

Note: The seeds attribute must contain all the cassandra seed servers.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
data_bags		data_bags
environments		environments
nodes		nodes
roles		roles
site-cookbooks		site-cookbooks
.gitignore		.gitignore
.kitchen.yml		.kitchen.yml
Cheffile		Cheffile
Gemfile		Gemfile
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

kafka-spark-cassandra

Cookbooks

Databags

`zookeeper.json`

`spark.json`

`cassandra.json`

Roles

`Basic Server`

Installed Cookbooks

Sample

`Zookeeper Cluster`

Installed Cookbooks

Sample

`Kafka Cluster`

Installed Cookbooks

Sample

`Spark Master`

Installed Cookbooks

Sample

`Spark Worker`

Installed Cookbooks

Sample

`Spark Jobserver`

Installed Cookbooks

Sample

`Cassandra`

Installed Cookbooks

Sample

About

Releases

Packages

Contributors 2

Languages

passworks/kafka-spark-cassandra

Folders and files

Latest commit

History

Repository files navigation

kafka-spark-cassandra

Cookbooks

Databags

zookeeper.json

spark.json

cassandra.json

Roles

Basic Server

Installed Cookbooks

Sample

Zookeeper Cluster

Installed Cookbooks

Sample

Kafka Cluster

Installed Cookbooks

Sample

Spark Master

Installed Cookbooks

Sample

Spark Worker

Installed Cookbooks

Sample

Spark Jobserver

Installed Cookbooks

Sample

Cassandra

Installed Cookbooks

Sample

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

`zookeeper.json`

`spark.json`

`cassandra.json`

`Basic Server`

`Zookeeper Cluster`

`Kafka Cluster`

`Spark Master`

`Spark Worker`

`Spark Jobserver`

`Cassandra`

Packages