[WIP] add rules for protein mapping #159

bluegenes · 2022-02-10T21:16:35Z

This PR introduces rules to allow mapping nucleotide reads to protein references using Paladin.

Not functional yet.

Main questions at this point:

Do we want to do this only when the user selects protein sourmash? Or do we want to enable running both protein and nucleotide sourmash within the same grist output folder?
- mostly what I'm getting at here is whether or not we want to include the moltype in the gather output filename, because we expect folks might want to run both moltypes. I know I want to run both, but I'm not sure if this is a general use case.
Do we want to store proteomes in the same folder as genbank genomes? Or in a separate folder, e.g. proteomes?
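
To make the two layout questions above concrete, here is a minimal sketch of what "moltype in the gather filename" and "separate proteomes folder" could look like. It is illustrative only, not code from this PR: the function names, the `gather` subfolder, the `.gather.csv` suffix, and the `genbank_genomes` folder name are all assumptions.

```python
from pathlib import Path


def gather_csv(outdir, sample, moltype=None):
    """Hypothetical gather output path; tags the filename with moltype if given."""
    tag = f".{moltype}" if moltype else ""
    return Path(outdir) / "gather" / f"{sample}{tag}.gather.csv"


def reference_dir(outdir, moltype, separate=True):
    """Hypothetical reference folder: proteomes get their own folder if `separate`."""
    if moltype == "protein" and separate:
        return Path(outdir) / "proteomes"
    # Assumed name for the existing folder of downloaded GenBank genomes.
    return Path(outdir) / "genbank_genomes"


# With a moltype tag, protein and nucleotide runs do not overwrite each other.
print(gather_csv("outputs", "sample1", "protein"))
print(gather_csv("outputs", "sample1", "DNA"))
print(reference_dir("outputs", "protein"))
```

Tagging the filename is what would let both moltypes share one output folder; without the tag, the second run would overwrite the first.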

To do:

get the checkpoints --> download proteomes step working
add a new checkpoint that runs Prodigal to predict a proteome when one is not downloadable
try BBMerge for read merging; fall back to PEAR if we don't like the results
add tests
add reporting and visualization
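
The download-or-predict fallback in the list above could be sketched roughly as follows. This is an assumption-laden illustration rather than the PR's implementation: `proteome_step`, the `proteomes` folder, and the `_protein.faa` suffix are hypothetical, and the only Prodigal options used are `-i` (input genome) and `-a` (protein translations output).

```python
def proteome_step(ident, genome_fasta, downloadable, proteome_dir="proteomes"):
    """Decide how to obtain a proteome for one genome (hypothetical helper)."""
    target = f"{proteome_dir}/{ident}_protein.faa"
    if downloadable:
        # A proteome exists upstream; the download itself is handled elsewhere.
        return {"action": "download", "output": target, "cmd": None}
    # No proteome to download: predict proteins from the genome with Prodigal.
    cmd = ["prodigal", "-i", genome_fasta, "-a", target]
    return {"action": "predict", "output": target, "cmd": cmd}


# Build the plan only; nothing is executed here.
step = proteome_step("genomeA", "genbank_genomes/genomeA_genomic.fna", downloadable=False)
print(step["action"], " ".join(step["cmd"]))
```

Both branches write to the same target path, so downstream mapping rules would not need to know which route produced the proteome.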

taylorreiter · 2022-03-01T18:36:42Z

hot takes

Do we want to just do this when the user selects protein sourmash? Or do we want to run both protein and nucleotide sourmash at the same time (e.g. have different gather checkpoints)?

I think when a user selects protein sourmash would be a good default, and perhaps a good-enough-for-now. It could be cool to have a --protein flag, so that when a user uses nucleotide sourmash, they can still map in protein space. But I'm not sure how much snakemake work this is vs. the amount of gain for enabled use cases.

Do we want to store proteomes in the same folder as genbank genomes? Or in a separate folder, e.g. proteomes?

Personally, I think I would prefer a proteomes directory. I'm willing to be persuaded otherwise, though :)

bluegenes added 3 commits February 8, 2022 12:20

init

8714d69

init protein mapping rules

5211630

cleanup

4c6e271

bluegenes added 2 commits May 13, 2022 19:38

reftype

86b3f38

Merge branch 'latest' into add-protein-mapping

6cf751a
