[MRG] Update the sourmash doc with gather and tax #79

Open
wants to merge 2 commits into
base: main

ccbaumler
Contributor

Fixes #61
Fixes #28

Changed the lca command to gather. Added some detailed text and a bonus task.


```
curl -L https://osf.io/4f8n3/download -o genbank-k31.lca.json.gz
gunzip genbank-k31.lca.json.gz
curl -JLO https://farm.cse.ucdavis.edu/~ctbrown/sourmash-db/gtdb-rs214/gtdb-rs214-reps.k31.lca.json.gz
```
Member

assuming we're running this on farm, you can just point people at /group/ctbrowngrp/sourmash-db/gtdb-rs214!
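
A minimal sketch of what pointing people at the shared copy could look like, assuming the lesson runs on farm and the directory above is readable: link it into the working directory instead of downloading. The link name `gtdb-rs214` is only an illustrative choice.

```shell
# Shared sourmash database directory on farm (path from the comment above).
DB_DIR=/group/ctbrowngrp/sourmash-db/gtdb-rs214

# Create a symbolic link named gtdb-rs214 in the current directory.
# The link name is an illustrative choice, not something the lesson prescribes.
ln -s "$DB_DIR" gtdb-rs214

# Print where the link points to confirm it was created.
readlink gtdb-rs214
```

A symbolic link avoids copying a large database into every user's directory; everyone reads the same shared files.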

Member

Also, with gather there's no need to use the LCA database (which uses more memory, though it may be faster). You could just as easily use the regular zip.
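
For illustration, a gather run against a zip database could look like the sketch below. The query signature name `sample.sig`, the output name `gather.csv`, and the zip file name are assumptions, not names taken from the lesson. The guard only keeps the snippet from failing when sourmash or the input files are missing.

```shell
# Hypothetical query signature (e.g. produced earlier with `sourmash sketch`).
QUERY=sample.sig
# Assumed file name for the regular zip database; adjust to the real file.
DB=gtdb-rs214-reps.k31.zip

if command -v sourmash >/dev/null 2>&1 && [ -f "$QUERY" ] && [ -f "$DB" ]; then
    # Search the query against the zip database and save the matches as CSV.
    sourmash gather "$QUERY" "$DB" -o gather.csv
else
    echo "skipping gather: need sourmash, $QUERY and $DB"
fi
```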

Contributor Author

> assuming we're running this on farm, you can just point people at /group/ctbrowngrp/sourmash-db/gtdb-rs214!

Would that be a good time to introduce `ln -s` as well?

> Also, with gather there's no need to use the LCA database (which uses more memory, though it may be faster). You could just as easily use the regular zip.

I did add a bonus goal at the end of the document to use the standard zip database, but I think I should switch them around: introduce the zip db in the main document and add a bonus about using other database types...?

Member

> > assuming we're running this on farm, you can just point people at /group/ctbrowngrp/sourmash-db/gtdb-rs214!
>
> Would that be a good time to introduce `ln -s` as well?

sure!

> > Also, with gather there's no need to use the LCA database (which uses more memory, though it may be faster). You could just as easily use the regular zip.
>
> I did add a bonus goal at the end of the document to use the standard zip database, but I think I should switch them around: introduce the zip db in the main document and add a bonus about using other database types...?

Zip is better. Not sure why other database types are needed here, but sure, in the bonus section... :)

Successfully merging this pull request may close these issues.

Sourmash lca gather not a command anymore
update lca gather to just gather
2 participants