Faster recognition of experiments that have already been run (or skipped) #28

Open
jjc2718 opened this issue Oct 5, 2020 · 0 comments
Labels
enhancement New feature or request

Comments

Member

jjc2718 commented Oct 5, 2020

Right now, it's straightforward to recognize experiments that have already been run: we just check for the output file, and if it exists we skip that experiment.

However, recognizing experiments that were skipped on a previous run (e.g. because of too few positive labels) is slower, since we have to preprocess the data again to reach the same conclusion. It would be faster to write a file for each gene listing which experiments were skipped; on rerun we could load that file (if it exists) and skip those experiments again without repeating the preprocessing.
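Here's a rough sketch of what the per-gene skip file could look like. This isn't existing code in the repo: the function names, the `{gene}_skipped.json` filename, the JSON layout, and the idea of keying on an experiment identifier string are all assumptions made for illustration.

```python
import json
from pathlib import Path


def skip_file_for(output_dir, gene):
    """Path of the (hypothetical) per-gene record of skipped experiments."""
    return Path(output_dir) / f"{gene}_skipped.json"


def load_skipped(output_dir, gene):
    """Return {experiment_id: reason} from earlier runs, or {} if no file exists."""
    path = skip_file_for(output_dir, gene)
    if not path.exists():
        return {}
    with path.open() as f:
        return json.load(f)


def record_skip(output_dir, gene, experiment_id, reason):
    """Add one skipped experiment to the gene's record and write it back."""
    skipped = load_skipped(output_dir, gene)
    skipped[experiment_id] = reason
    path = skip_file_for(output_dir, gene)
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("w") as f:
        json.dump(skipped, f, indent=2)


def should_skip(output_dir, gene, experiment_id, output_file):
    """True if the experiment already ran or was skipped on an earlier run."""
    # Existing check: an output file means the experiment already ran.
    if Path(output_file).exists():
        return True
    # Proposed check: consult the skip record instead of preprocessing again.
    return experiment_id in load_skipped(output_dir, gene)
```

The idea would be to call `should_skip` before preprocessing, and `record_skip` at the point where an experiment is currently abandoned (e.g. too few positive labels). One caveat with this approach: if the skip criteria or the input data change, the skip files would be stale and would need to be deleted or ignored.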

I don't think this will improve runtime that much, but it should help a bit.

jjc2718 added the enhancement (New feature or request) label Oct 5, 2020