incremental_partitions materialization #40

Open
wants to merge 1 commit into
base: master

Conversation

timnon

@timnon timnon commented Nov 1, 2021

Hi, I wonder whether this materialization, called "incremental_partitions", is useful. It adds an extra partition level called data_creation_ts to every table it creates, holding the timestamp (in milliseconds) at which a data row was created. To increment a table, the "insert" command is used, which also writes the newest timestamp, so the same data can end up under different timestamps. Therefore, a cleanup operation runs in the final post-hook and makes sure that for each partition only the latest data_creation_ts partition is kept.

This means a table can be updated at the partition level later on, for instance if a new run is invoked for a certain date partition. The partitions are not hardcoded, however, and can be arbitrary.
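
To make the insert-then-cleanup idea concrete, here is a minimal Python sketch of the logic as described above. It is not the materialization's actual Jinja/SQL: the in-memory list standing in for the table and the helper names `insert_run` and `cleanup` are hypothetical, chosen only for illustration.

```python
import time


def insert_run(table, rows, now_ms=None):
    """Append rows to the table, stamping each with this run's data_creation_ts.

    `table` is a plain list of dicts standing in for the real table
    (an assumption made for this sketch). The timestamp is in milliseconds.
    """
    ts = int(time.time() * 1000) if now_ms is None else now_ms
    for row in rows:
        table.append({**row, "data_creation_ts": ts})
    return ts


def cleanup(table, partition_keys):
    """Keep, per user-defined partition, only rows with the latest data_creation_ts."""
    latest = {}
    for row in table:
        key = tuple(row[k] for k in partition_keys)
        ts = row["data_creation_ts"]
        if key not in latest or ts > latest[key]:
            latest[key] = ts
    return [
        row
        for row in table
        if row["data_creation_ts"] == latest[tuple(row[k] for k in partition_keys)]
    ]


# First run loads two date partitions; a later run reloads only one of them.
table = []
insert_run(table, [{"date": "2021-10-31", "value": 1},
                   {"date": "2021-11-01", "value": 2}], now_ms=1000)
insert_run(table, [{"date": "2021-11-01", "value": 3}], now_ms=2000)
table = cleanup(table, ["date"])
```

After the cleanup, the untouched date partition still holds the rows from the first run, while the re-run partition holds only the rows from the second run. Because `partition_keys` is a parameter, the same logic applies to any partitioning scheme.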

It works pretty well for me and takes fewer than 50 lines of code.

Is this already covered by the existing materializations? They have far more code.
