Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support JSONL and incremental importing #35

Open
PeterJCLaw opened this issue Jan 8, 2024 · 1 comment
Open

Support JSONL and incremental importing #35

PeterJCLaw opened this issue Jan 8, 2024 · 1 comment

Comments

@PeterJCLaw
Copy link
Collaborator

Currently the memory usage of devdata during an import can be fairly large -- it scales with the size of the export being imported. This can result in an import failing part way through due to the process being OOMKilled. While ideally devdata would only have smallish data to work with, that isn't always practical.

It would be great if devdata supported exporting to JSONL (i.e: newline separated JSON) as this can be parsed incrementally and could thus should be able to imported incrementally too.

@PeterJCLaw
Copy link
Collaborator Author

Had a quick look at this; it's a bit more complicated than might be hoped due to the way that we work with the tree of exported files -- we'd need to indicate or detect the format in several places, meaning there's potentially a bunch of passing through which might need to happen if we don't want to just rely on e.g: file extensions (which might be a reasonable way to go if we wanted to).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant