Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sfitz update call sSNV to 8.0.0-rc.1 #153

Merged
merged 35 commits into from
Dec 22, 2023
Merged

Conversation

sorelfitzgibbon
Copy link
Contributor

@sorelfitzgibbon sorelfitzgibbon commented Dec 14, 2023

  • I have read the code review guidelines and the code review best practice on GitHub check-list.

  • The name of the branch is meaningful and well formatted following the standards, using [AD_username (or 5 letters of AD if AD is too long)-[brief_description_of_branch].

  • I have set up or verified the branch protection rule following the github standards before opening this pull request.

  • I have added my name to the contributors listings in the
    metadata.yaml and the manifest block in the nextflow.config as part of this pull request, am listed
    already, or do not wish to be listed. (This acknowledgement is optional.)

  • I have added the changes included in this pull request to the CHANGELOG.md under the next release version or unreleased, and updated the date.

  • [x]] I have updated the version number in the metadata.yaml and manifest block of the nextflow.config file following semver, or the version number has already been updated. (Leave it unchecked if you are unsure about new version number and discuss it with the infrastructure team in this PR.)

  • I have tested the pipeline on at least one A-mini sample.

  • Update to use call-sSNV v8.0.0-rc.1, including removing id from the call-sSNV input yamls.

  • nftest expected files were update to new versions after careful inspection to ensure no variant changes. This was to deal with sample order changes.

  • nftest.yaml paths were updated to use wild cards for the actual file paths and to reflect the new sample ID (extracted from BAMs) in the expected file paths.

Strangely!!!, a new variant showed up for MuSE in one of the test runs.
MuSE-2.0.4_LUAD0000034_CPT0053040010_SNV.vcf.gz:
chr21 8033328 . G T . PASS SOMATIC GT:DP:AD:BQ:SS 0/1:29:22,7:29,30:2 0/0:12:12,0:28,0:.
I kept the run in the output directory, under extra-MuSE-variant-chr21-8033328

Closes #123

Testing Results

nftest run test-call-sSNV
output dir: /hot/software/pipeline/metapipeline-DNA/Nextflow/development/unreleased/sfitz-update-call-sSNV
log: /hot/software/pipeline/metapipeline-DNA/Nextflow/development/unreleased/sfitz-update-call-sSNV/log-nftest-20231221T222740Z.log

Base automatically changed from yashpatel-allow-pipeline-ordering to main December 21, 2023 19:11
@sorelfitzgibbon
Copy link
Contributor Author

Testing completed after merging in main. Test results updated with new log. Note the weird MuSE variant described above.

Copy link
Collaborator

@yashpatel6 yashpatel6 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good! Any concerns @zhuchcn @nwiltsie @tyamaguchi-ucla ?

@yashpatel6
Copy link
Collaborator

The new variant is interesting, the only MuSE change is from 2.0.3 to 2.0.4; this sample is different from the one used for testing in call-sSNV so we didn't see it there?

@nwiltsie
Copy link
Member

The new variant is interesting, the only MuSE change is from 2.0.3 to 2.0.4;

There's also virtually no change between those two versions: wwylab/MuSE@v2.0.3...v2.0.4 ... very weird.

@yashpatel6
Copy link
Collaborator

Also, the variant only showed up in one of the test runs?

@sorelfitzgibbon
Copy link
Contributor Author

Also, the variant only showed up in one of the test runs?

I think so, although I can't say for sure. NFtest assertions weren't working for much of the testing and I dutifully deleted all the failed runs for that day, on that day.

Copy link
Member

@zhuchcn zhuchcn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Don't have any concern, also no idea about the new variant! Does MuSE use any random process so the output isn't always reproducible (I doubt it does)?

@sorelfitzgibbon
Copy link
Contributor Author

It sure looks stochastic! One run had the extra variant, the next run did not. I concatenated all of the log.command.sh for the MuSE processes and found no difference between the two runs. I launched with nftest run test-call-sSNV both times.

@nwiltsie
Copy link
Member

It sure looks stochastic! One run had the extra variant, the next run did not.

Oh I do not like that! Would you mind posting the output paths for those two runs?

@sorelfitzgibbon
Copy link
Contributor Author

/hot/software/pipeline/metapipeline-DNA/Nextflow/development/unreleased/sfitz-update-call-sSNV/test-call-sSNV
/hot/software/pipeline/metapipeline-DNA/Nextflow/development/unreleased/sfitz-update-call-sSNV/extra-MuSE-variant-chr21-8033328/test-call-sSNV

@yashpatel6
Copy link
Collaborator

I ran the test just now and the output did not contain the new variant: /hot/software/pipeline/metapipeline-DNA/Nextflow/development/unreleased/yashpatel-test-call-ssnv-update 🤔

@tyamaguchi-ucla
Copy link
Contributor

If there's stochasticity in the algorithm, we can suggest that our collaborator (Shuangxi) implement an option to set a seed.

@yashpatel6
Copy link
Collaborator

Agreed, it would be worth confirming with Shuangxi whether stochasticity is expected in MuSE and then request a seeding option if there is

Copy link
Contributor

@tyamaguchi-ucla tyamaguchi-ucla left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks generally good to me!
I'm not sure if there's been any discussion about updating the final output structure of Intersect-BCFtools-1.17, but I think organizing it a bit more would be helpful for end users. For instance, should we create a new directory for storing figures, or can they be placed under /QC, considering the increasing number of pipelines generating figures?

@yashpatel6
Copy link
Collaborator

yashpatel6 commented Dec 22, 2023

Intersect-BCFtools-1.17, but I think organizing it a bit more would be helpful for end users. For instance, should we create a new directory for storing figures, or can they be placed under /QC, considering the increasing number of pipelines generating figures?

We've had a little bit of discussion about what should be in the QC directory and what shouldn't but not fully formalized yet; we'll formalize it in the next NF WG

@yashpatel6
Copy link
Collaborator

I confirmed that the commands bring run are the same between the two tests so if no other concerns from @nwiltsie, we can merge this and follow up with Shuangxi about stochasticity?

@nwiltsie
Copy link
Member

I confirmed that the commands bring run are the same between the two tests so if no other concerns from @nwiltsie, we can merge this and follow up with Shuangxi about stochasticity?

Sounds good to me!

@sorelfitzgibbon
Copy link
Contributor Author

OK merging! I have sent an email to Shuangxi.

@sorelfitzgibbon sorelfitzgibbon merged commit b455ea6 into main Dec 22, 2023
1 check passed
@sorelfitzgibbon sorelfitzgibbon deleted the sfitz-update-call-sSNV branch December 22, 2023 23:23
@tyamaguchi-ucla
Copy link
Contributor

OK merging! I have sent an email to Shuangxi.

Also, maybe consider creating a GitHub issue in the MuSE repo?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

MuSE gets stuck for several days
5 participants