-
Notifications
You must be signed in to change notification settings - Fork 750
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
update module: diamond/blastp #7350
Comments
Small comment, switch statements are "deprecated". Nextflow vscode docs - |
I made a PR (#7381) with most of the requested changes that we can work on before merging. |
Oh, that's kind of you. I meant to do this after we had discussed things!
Sure, the output format needs to be set. See my comment below.
I'm not sure if it works to provide a
This is a matter of taste, I suppose. Personally, I find it much more explicit to look up the output formats supported by the tool and call the module with the number, and let the pipeline calculate a filename than let the filename suffix decide the output format. (As I mention above, the |
@vagkaratzas I'm waiting a moment with the review to let you reply. No hurry, I have a local module for the time being. |
Ok, I agree with the --compress mode and I made the respective changes in the module. I also added a relevant nf-test to check that this works as intended. Feel free to look/comment at the PR. For the outfmt, I suggest we get some more input/opinions from others before making any changes. The module is not currently being used by any nf-core pipelines, so I don't think we will break anything ;) |
Is there an existing module for this?
Is there an open PR for this?
Is there an open issue for this?
Are you going to work on this?
Assignees
to facilitate tracking who is working on the moduleI have some issues with the current module, some easy to solve, some that need a discussion since they potentially change the interface of the module.
--include-lineage
parameter supported by Diamond 2.10 but not the current 2.8. Easy to fix of course.--compress 1
. Seems like a good feature to support, but is made difficult by the current third parameter being the file suffix which in turn decides the output format. My suggestion is to change the third param to the numerical output format and set the suffix from this instead. If$args
contains--compress 1
, a.gz
can be added to the file suffix (except for.daa
). (Overall, I also find the use of a suffix to decide the output format a bit too implicit for my taste. I definitely consider the blast output format a.tsv
and would use that if I didn't know about the taxonomy (102) format.)process_high
instead of medium.find
to get the path of the input.dmnd
but I don't understand why since this is param two to the module.Update: I realize I need to match output channels from this module with the database channel (
meta2
) in my pipeline before calling the next module, i.e. join the output of this module with a channel that is indexed with database names (meta2.id
). As I see this, I can either return both meta objects or add a field (e.g.db
) containingmeta2.id
to themeta
object before returning it. An alternative is perhaps to modifymeta.id
to contain the database name before calling this module.I need a module like my description above for #metatdenovo, so I've made a local while we discuss. At the moment it looks like the below but hasn't been properly tested yet (i.e. WIP). I'm happy to contribute this to nf-core/modules if and when we agree.
The text was updated successfully, but these errors were encountered: