whoisandy/k2s3

k2s3

Kafka to S3 Streamer

Streams JSON messages from Kafka into partitioned Parquet files stored on S3.

TODO:

  • Add Kafka consumer logic
  • Expose partition columns as a CLI option
  • Allow compaction to be enabled or disabled
  • Alter the table dynamically as new partitions appear
  • Allow fields to be included in or excluded from the JSON payload
  • Convert message batches to partitioned Parquet tables
  • Add or update Kafka offset checkpoint files alongside the partitioned Parquet files
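
As a rough illustration of how several of the items above could fit together, here is a minimal sketch in Python using only the standard library. It is not the project's implementation: the function names (`partition_path`, `group_batch`), the `(offset, raw_json)` message shape, and the Hive-style `col=value` prefix layout are all assumptions made for the example. It groups a batch of JSON messages by partition columns, applies include/exclude field filtering, and tracks the last offset seen so it could be checkpointed.

```python
import json
from collections import defaultdict


def partition_path(record, partition_cols):
    """Build a Hive-style prefix such as 'day=2024-01-01' (assumed layout)."""
    return "/".join(f"{col}={record[col]}" for col in partition_cols)


def group_batch(messages, partition_cols, include=None, exclude=()):
    """Group a batch of JSON messages by partition prefix.

    messages: iterable of (offset, raw_json) pairs (hypothetical shape).
    include:  optional collection of field names to keep; None keeps all.
    exclude:  field names to drop.
    Returns (groups, last_offset), where groups maps prefix -> list of rows.
    """
    groups = defaultdict(list)
    last_offset = None
    for offset, raw in messages:
        record = json.loads(raw)
        # Compute the prefix before filtering so partition columns
        # are always available, even if they are later excluded.
        prefix = partition_path(record, partition_cols)
        row = {
            key: value
            for key, value in record.items()
            if (include is None or key in include) and key not in exclude
        }
        groups[prefix].append(row)
        last_offset = offset
    return dict(groups), last_offset


if __name__ == "__main__":
    batch = [
        (0, '{"day": "2024-01-01", "user": "a", "debug": 1}'),
        (1, '{"day": "2024-01-02", "user": "b", "debug": 2}'),
        (2, '{"day": "2024-01-01", "user": "c", "debug": 3}'),
    ]
    groups, last_offset = group_batch(batch, ["day"], exclude=("debug",))
    for prefix, rows in groups.items():
        print(prefix, rows)
    print("checkpoint offset:", last_offset)
```

In a fuller pipeline, each group would presumably be written as one Parquet file under its prefix on S3, with the last offset stored in a checkpoint file next to it so a restart can resume from that point.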
