Annotate and rank SNVs per family #502

fellen31 · 2024-11-12T16:00:45Z

This PR:

Annotates and ranks SNVs per family instead of per project. This is because genmod does not support compound scoring with multiple families in the VCF. Someone that still wants annotated variants per sample can run each sample as a separate family.
Changes the output documentation and structure to match sample and family for all variants
Puts the validating of the samplesheet into functions, and removed the use of ifEmpty. This is because these error messages would always show up when you had an error, even if that error was unrelated.
Removed support for automatically creating an echvar database with SNVs and INDELs, since this requires all variants to be combined into one VCF. Might add this back into the future.
Removes the containts_affected logic from the snv-calling workflow, since this was previously changed to be checked before pipeline start, and is now done in one of the functions described in point 3.

Closes #501 and #276.

PR checklist

Lucpen

Just a tiny typo and one question:
why not make a switch for this?
Removed support for automatically creating an echvar database with SNVs and INDELs, since this requires all variants to be combined into one VCF. Might add this back into the future.

subworkflows/local/short_variant_calling/main.nf

Co-authored-by: Lucía Peña-Pérez <[email protected]>

fellen31 · 2024-11-15T11:50:04Z

Just a tiny typo and one question: why not make a switch for this? Removed support for automatically creating an echvar database with SNVs and INDELs, since this requires all variants to be combined into one VCF. Might add this back into the future.

My idea initially was that it would be nice if you could create a reference database or panel of normals while running the pipeline. For SNVs and INDELs, yes it would be fairly easy to make a switch. But we need to add a module to merge VCFs together before creating a database (and then it should really be a workflow).

But for creating SVs, CNVs, STRs and methylation databases we would need a different merging strategies. It would perhaps be nice, but I need to think about it some more. It might work for a 100 samples, but perhaps you would never want to start 1000+ samples at once, and then creating the databases within the pipeline becomes unnecessary, because you would need to combine samples from multiple pipeline runs anyway.

In short, I'm removing the functionality because it's easier at the moment. It's not something that we need in production, and not something I'm sure is desirable to have in the pipeline in the future.

fellen31 force-pushed the variants-per-family branch 8 times, most recently from 5c783b3 to 2e36008 Compare November 12, 2024 16:23

fellen31 linked an issue Nov 12, 2024 that may be closed by this pull request

Run RANK_VARIANTS per family #276

Open

fellen31 force-pushed the variants-per-family branch 2 times, most recently from 279ecc2 to 2dbe94d Compare November 12, 2024 17:17

fellen31 marked this pull request as ready for review November 12, 2024 17:40

fellen31 requested a review from a team as a code owner November 12, 2024 17:40

fellen31 added 3 commits November 15, 2024 09:14

Annotate and rank SNVs per family

5ec6ba5

Update CHANGELOG again

22cb3a9

remove mistakenly added file

c20890e

fellen31 force-pushed the variants-per-family branch from edf2162 to c20890e Compare November 15, 2024 08:14

Update output.md

8b60003

Lucpen reviewed Nov 15, 2024

View reviewed changes

subworkflows/local/short_variant_calling/main.nf Outdated Show resolved Hide resolved

Update subworkflows/local/short_variant_calling/main.nf

ab6910b

Co-authored-by: Lucía Peña-Pérez <[email protected]>

Lucpen approved these changes Nov 15, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Annotate and rank SNVs per family #502

Annotate and rank SNVs per family #502

fellen31 commented Nov 12, 2024 •

edited

Loading

Lucpen left a comment

fellen31 commented Nov 15, 2024 •

edited

Loading

Annotate and rank SNVs per family #502

Are you sure you want to change the base?

Annotate and rank SNVs per family #502

Conversation

fellen31 commented Nov 12, 2024 • edited Loading

PR checklist

Lucpen left a comment

Choose a reason for hiding this comment

fellen31 commented Nov 15, 2024 • edited Loading

fellen31 commented Nov 12, 2024 •

edited

Loading

fellen31 commented Nov 15, 2024 •

edited

Loading