chore: limit the amount of context data we parse #684
We have too much data in the context_base table, so performance is poor. Data volume is increasing over time: the last 6 months hold more data than everything before them. This is likely because more users are on newer versions of Meltano that send our rich unstructured events, and because overall usage has grown.
I manually truncated the context_base incremental table to remove all data from before this year, and made a backup table of the original. The context_base table is transient but the backup is not, so the processed historical data will be properly persisted if we ever need it. Since context_base will continue to grow and we'll have to manually prune it periodically, I created this PR, which limits all downstream tables to filter for only the last 6 months of data, so their performance should stay relatively static even as the base table grows. A rough sketch of both steps is below.
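For illustration only, a minimal sketch of what these two steps could look like in Snowflake SQL. The event_created_at column and the context_base_backup name are assumptions rather than the actual schema, and the real downstream models may apply the filter differently.

```sql
-- One-off manual prune (already done by hand, not part of this PR).
-- Hypothetical names: event_created_at and context_base_backup are
-- assumptions about the schema.

-- Back up the full table first; a plain CREATE TABLE is permanent
-- (non-transient), so the backup persists even though context_base
-- is transient.
create table if not exists context_base_backup as
select * from context_base;

-- Remove everything from before the current year.
delete from context_base
where event_created_at < date_trunc('year', current_date);
```

And the recurring part this PR adds: each downstream table selects only a rolling 6-month window, so its input size stays roughly constant as context_base grows.

```sql
-- Hypothetical shape of the filter added to each downstream model.
select *
from context_base
where event_created_at >= dateadd('month', -6, current_date);
```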