-
Notifications
You must be signed in to change notification settings - Fork 81
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
upgrade search
to display more information?
#2002
Comments
👍🏻 . I definitely want
My only issue with using I also think we need to be a bit clearer about how |
I, uhh, have no idea :). I kind of like the idea of Hmm, do we even allow cos/angular similarity on num sketches? I'm not sure we should. |
as far as I can tell, we do, so I kept it enabled for |
Note new plugin |
Note that cos similarity can be accurately estimated by FracMinHash per https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11160586/! Two specific thoughts:
|
As I was thinking about the ANI stuff #1967 #2001 I came up with an idea. 💡
right now, search outputs largely useless CSV files, with minimal information. (see #1390 and #1555 for relevant issues.) As long as we support num MinHashes in search (which will be forever, probably, per #1354) in sourmash, we are stuck with some command that does command-line comparison with Jaccard.
since search is useless, I've found myself using
prefetch
a lot more , because it outputs so much more information in the CSV. it does not give good human readable output.so, back to search: the problem is that search is the first thing people are going to try out, because it's so ...obviously the command you want to use! 'search'! you're not going to use prefetch to do a search!
SO.
BUT.
what if we:
search
tojaccard
(and upgrade it with ANI output, as per display ANI in search results? #2001);prefetch
tosearch
and upgraded its output to by default ANI (and then aliased it to prefetch);I think we could add
jaccard
and do theprefetch
upgrade (without the renames) as part of this next release, and then do theprefetch
->search
rename as of sourmash 5.0 with a deprecation warning forsearch
now.this is in line with our increasingly solid belief that FracMinHash/scaled sketches are the way to go, and it also makes ANI nice and visible in prefetch, which I like (again, #2001). note that after
compute
is removed in #1286, you will have to work hard to buildnum
sketches anyway, assourmash sketch
builds scaled sketches by default.@phiweger @luizirber @bluegenes @taylorreiter any thoughts, hot takes, etc?
The text was updated successfully, but these errors were encountered: