Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Repoint to Snowplow's new-generation data models #2

Open
alexanderdean opened this issue Apr 30, 2021 · 2 comments
Open

Repoint to Snowplow's new-generation data models #2

alexanderdean opened this issue Apr 30, 2021 · 2 comments
Labels
enhancement New feature or request

Comments

@alexanderdean
Copy link

The Dbt Snowplow model is years out of date at this point; the modern Snowplow data models are here:

https://github.com/snowplow/data-models

/cc @paulboocock @carabaestlein

@buremba
Copy link
Contributor

buremba commented Apr 30, 2021

Hey @alexanderdean, we model the main events table and make use of the dbt's Snowplow package for sessionization but dbt also doesn't seem to support the new data models.

We actually deprecated our in-house data modeling language in favor of our new dbt integration which extends dbt's resource files in order to build something similar to LookML but for behavioral data.

Looking at the repository that you shared, my understanding is that Snowplow takes care of the transformation itself so we don't need dbt transformation. In that case, I see two ways to make Rakam work with the new data models:

  1. If there is a way to get the Iglu schemas for a Snowplow deployment, we can generate dbt sources from JSON Schema using a macro in Rakam. We were already planning to build something similar for Iteratively & Avo integration.

  2. If Snowplow creates tables & columns with the descriptions defined in Iglu schema, we already have a way to create dbt sources from database tables so it might be the easier way to integrate with the new models. We implemented Segment and Firebase to Rakam running a metadata query to extract the event type & property definitions and create the models with the metrics that are relevant to the event types.

I will try out Snowplow's new version this weekend in order to understand how the new data models work. Also, it would be great to discuss it with your team before actually implementing the new models.

@buremba buremba added the enhancement New feature or request label Apr 30, 2021
@alexanderdean
Copy link
Author

Thanks for the detailed note @buremba - will leave Cara and Paul to follow up from our side!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants