Using Delta Live Tables Sink as a Custom Materialization in dbt #948

Open
datahunter1205 opened this issue Feb 27, 2025 · 2 comments
Labels
enhancement (New feature or request)

Comments

@datahunter1205

Describe the feature

We are using Databricks and would like to implement all data transformations—both streaming and batch—entirely within dbt. Currently, dbt supports Streaming Tables in Databricks, which internally create Delta Live Tables (DLT). Additionally, as per Databricks documentation, it is possible to write stream output to a Kafka topic using writeStream in a Delta Live Tables pipeline.

Describe alternatives you've considered


Additional context

The outlined approach includes:

  • Setting up Kafka configurations (broker URL, topic, security settings)
  • Creating a DLT pipeline
  • Defining a streaming source (files, Delta tables, etc.)
  • Using writeStream with Kafka options to publish the data

The introduction of the new Sinks API in DLT addresses the need to write processed data to external event streams, such as Apache Kafka and Azure Event Hubs, as well as writing to a Delta Table. These features are currently in Public Preview, with plans for further expansion.
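The steps outlined above could be sketched in a DLT pipeline notebook roughly as follows. This is a minimal sketch, not a dbt feature and not a definitive implementation: the helper names (`build_kafka_sink_options`, `define_kafka_sink_flow`), the sink and flow names, and the broker, topic, and table values are all hypothetical, and the `dlt.create_sink` / `dlt.append_flow` calls assume the Public Preview Sinks API behaves as described in the Databricks documentation.

```python
from typing import Dict, Optional


def build_kafka_sink_options(
    bootstrap_servers: str,
    topic: str,
    security_protocol: Optional[str] = None,
) -> Dict[str, str]:
    """Assemble Spark Kafka writer options (broker URL, topic, security)."""
    if not bootstrap_servers or not topic:
        raise ValueError("bootstrap_servers and topic are required")
    options = {
        "kafka.bootstrap.servers": bootstrap_servers,
        "topic": topic,
    }
    if security_protocol:
        # Example value: "SASL_SSL"; credentials would be added separately.
        options["kafka.security.protocol"] = security_protocol
    return options


def define_kafka_sink_flow(spark, source_table: str, options: Dict[str, str]) -> None:
    """Register a Kafka sink and a flow that appends a streaming source to it.

    Assumption: this runs inside a DLT pipeline, where the `dlt` module is
    available. It is imported lazily so the helper above works anywhere.
    """
    import dlt
    from pyspark.sql.functions import struct, to_json

    # Hypothetical sink name; "kafka" is the sink format for Kafka topics.
    dlt.create_sink(name="kafka_sink", format="kafka", options=options)

    @dlt.append_flow(name="publish_to_kafka", target="kafka_sink")
    def publish_to_kafka():
        # The Kafka writer expects a `value` column; here each row is
        # serialized to a JSON string.
        return (
            spark.readStream.table(source_table)
            .select(to_json(struct("*")).alias("value"))
        )
```

Inside a pipeline, this might be wired up as `define_kafka_sink_flow(spark, "my_catalog.my_schema.events", build_kafka_sink_options("broker:9092", "events", "SASL_SSL"))`, where every argument is a placeholder.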

Who will this benefit?

This feature will benefit users who are looking to integrate their data pipelines entirely within dbt and require seamless publishing of streaming data to external platforms such as Kafka and Azure Event Hubs. Specific use cases include real-time data processing and integration with external event streaming platforms for further analytics and monitoring.

Are you interested in contributing this feature?

--

datahunter1205 added the enhancement label Feb 27, 2025
@benc-db
Collaborator

benc-db commented Feb 27, 2025

Today, we only support DLT capabilities that are part of the DBSQL API. We do not create pipelines directly.

@amychen1776

@benc-db I wonder if there's a world where you could create sinks as a custom materialization. (Materialize supports this in its dbt adapter, but it can do that because it is 100% SQL-based.)
