Skip to content

MUmarAmanat/dlt_aws

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 

Repository files navigation

Delta Live table + AWS Kinesis

This repository contains code for producing data for kinesis and then reading from AWS Kinesis using Databricks Delta Live Tables.

Requirements

  1. AWS credentials and sufficient rights for creating a Kinesis data stream
  2. Databricks Premium Tier account
  3. Databricks rights for running delta live tables
  4. Youtube Data from kaggle (https://www.kaggle.com/datasets/datasnaek/youtube-new)

How to run?

  1. Clone https://github.com/MUmarAmanat/dlt_aws.git into Databricks Repos
  2. First execute StreamDataProducer Notebook
  3. Execute StreamDataProducer Notebook from Workflow/Delta Live Tables as DLT pipeline

About

Repository for DLT and kinesis integration

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages