Kafka Sink Connect Google Cloud (GCP) Bigtable

Apache Kafka Sink Only Connect can stream messages from Apache Kafka to Google Cloud Platform (GCP) wide column store Bigtable.

What is Apache Kafka

Apache Kafka is an open-source stream processing platform developed by the Apache Software Foundation and written in Scala and Java. The project aims to provide a unified, high-throughput, low-latency platform for real-time data feeds. Please look at Apache Kafka home page.

What is Google Cloud Bigtable

Bigtable is a compressed, high performance, proprietary data storage system built on Google File System, Chubby Lock Service, SSTable and a few other Google technologies. On May 6, 2015, a public version of Bigtable was made available as a service in the Google Cloud Platform. For more details, please refer to GCP Bigtable home page.

High Level Architecture

This project leverages bigtable-client-core library (NO HBase) to stream data to GCP Bigtable. bigtable-client-core internally use the gRPC framework to talk to GCP Bigtable.

Prerequisites

You have Apache ZooKeeper and Apache Kafka installed and running on your computer. Please refer to the respective sites to download and start ZooKeeper and Kafka. You also need Java version 11 or above.

Tested Software Versions

Software	Version	Note
Java	11	Tested using Java 11.
Kafka	3.3.1	Please refer. Tested using kafka_2.13-3.3.1, should work with older versions.
bigtable-client-core	1.27.1	Please refer.
Kafka connect-api	3.3.1	Please refer.
grpc-netty-shaded	1.51.0	Please refer.

Configurations

Please refer to project Wiki

Constraints

The current configuration system supports streaming messages from a given topic to a table. You can subscribe to any number of topics, but a topic can be pointed to one and only one table. Say, for example, if you subscribed from a topic named demo-topic, you should have a yml file named demo-topic.yml. That yml file contains all the configuration required to transform and write data into Bigtable.

How to build the artifact

Please refer to project Wiki

How to deploy the connector

Please refer to project Wiki

How to start the connector in stand-alone mode

Please refer to project Wiki

Questions

Either create issues in this project or send it to [email protected]. Thanks!

Name		Name	Last commit message	Last commit date
Latest commit History 255 Commits
.github/workflows		.github/workflows
etc		etc
src		src
.editorconfig		.editorconfig
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
codecov.yml		codecov.yml
kafka-connect-bigtable.png		kafka-connect-bigtable.png
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kafka Sink Connect Google Cloud (GCP) Bigtable

What is Apache Kafka

What is Google Cloud Bigtable

High Level Architecture

Prerequisites

Tested Software Versions

Configurations

Constraints

How to build the artifact

How to deploy the connector

How to start the connector in stand-alone mode

Questions

License

About

Releases

Packages

Contributors 3

Languages

License

sanjuthomas/kafka-connect-gcp-bigtable

Folders and files

Latest commit

History

Repository files navigation

Kafka Sink Connect Google Cloud (GCP) Bigtable

What is Apache Kafka

What is Google Cloud Bigtable

High Level Architecture

Prerequisites

Tested Software Versions

Configurations

Constraints

How to build the artifact

How to deploy the connector

How to start the connector in stand-alone mode

Questions

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages