Reduce size of test db, anonymize unneeded fields #1693
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
We want a new version of the development sample database because it's been a long time since the last one was made.
We decided to reduce the number of sounds and downloads that we keep, in order to make the db smaller.
Additionally, delete tables that contain private information that shouldn't be shared, and anonymise a few further columns.
After removing sounds/downloads, but before cleaning the additional tables, the dump was about 800mb (60 compressed). I noticed that most of the space was used by old silk logs and moderation tickets that were no longer connected to any sounds or users. After deleting this, the database was down to 60mb uncompressed/7mb compressed.
Because it's so small, we could probably get away with increasing the number of sounds a bit more if we want. We'd need to evaluate this against the size of the sound archive.