WIP: use manifest CSVs with MultiCollection etc. #11

Open: wants to merge 28 commits into base: main

ctb (Member) commented Oct 13, 2024

With sourmash-bio/sourmash_plugin_branchwater#430, we can now use lists of .sig.zip files and/or standalone manifest CSVs.

So, I ran the benchmarks with sourmash-bio/sourmash_plugin_branchwater#471, which includes both sourmash-bio/sourmash_plugin_branchwater#430 and some other speedups, and got a really nice outcome: it's faster than ever. It also uses more memory, presumably because the speedup means it's keeping many more metagenomes in memory simultaneously.

| plugin version | time | memory |
| --- | --- | --- |
| v0.8.6 | 814s | 18.3 GB |
| v0.9.0 | 1332s | 27.1 GB |
| v0.9.6 | 17523s | 27.2 GB |
| unreleased w/430/471 | 1096.7s | 16.5 GB |
| unreleased w/430/471* | 559.7s | 38.3 GB |

* this run used sig.zip files and manifest CSVs in the benchmarking
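
To make the time/memory trade-off in the table easier to read, here is a small sketch that expresses each run relative to a baseline. The figures are copied from the table as given; picking v0.8.6 as the baseline and the helper name `relative_to_baseline` are my own choices for illustration, not part of the benchmark setup.

```python
# Benchmark figures copied from the table above: (label, seconds, GB).
runs = [
    ("v0.8.6", 814.0, 18.3),
    ("v0.9.0", 1332.0, 27.1),
    ("v0.9.6", 17523.0, 27.2),
    ("unreleased w/430/471", 1096.7, 16.5),
    ("unreleased w/430/471*", 559.7, 38.3),
]


def relative_to_baseline(runs, baseline="v0.8.6"):
    """Return {label: (speedup, memory_ratio)} relative to the baseline run.

    speedup > 1 means faster than the baseline;
    memory_ratio > 1 means more memory than the baseline.
    The baseline choice is an assumption made for this sketch.
    """
    _, base_time, base_mem = next(r for r in runs if r[0] == baseline)
    return {label: (base_time / t, m / base_mem) for label, t, m in runs}


ratios = relative_to_baseline(runs)
for label, (speedup, mem_ratio) in ratios.items():
    print(f"{label:<24} {speedup:5.2f}x speed  {mem_ratio:5.2f}x memory")
```

Read this way, the sig.zip + manifest CSV run comes out at roughly 1.45x the speed of v0.8.6 for roughly 2.1x the memory.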
