-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
annotation bug fix and document updates
- Loading branch information
1 parent
b53e255
commit 968f274
Showing
24 changed files
with
238 additions
and
250 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -34,8 +34,11 @@ The pipeline has 6 main steps: Indel calling using platypus, basic annotations, | |
ANNOVAR ([`Annovar`](https://annovar.openbioinformatics.org/en/latest/)) | ||
: annotate_variation.pl is used to annotate variants. The tool makes classifications for intergenic, intogenic, nonsynoymous SNP, frameshift deletion or large-scale duplication regions. | ||
|
||
ENSEMBL VEP(['ENSEBL VEP'](https://www.ensembl.org/info/docs/tools/vep/index.html)) :can also be used alternative to annovar. Gene annotations will be extracted. | ||
|
||
Reliability and confidation annotations: It is an optional ste for mapability, hiseq, selfchain and repeat regions checks for reliability and confidence of those scores. | ||
|
||
|
||
3. Deep Annotation (--runIndelDeepAnnotation True): | ||
|
||
If basic annotations are applied, an extra optional step for number of extra indel annotations like enhancer, cosmic, mirBASE, encode databases can be applied too. | ||
|
@@ -68,6 +71,20 @@ The pipeline has 6 main steps: Indel calling using platypus, basic annotations, | |
annotate_variation.pl -downdb wgEncodeGencodeBasicV19 humandb/ -build hg19 | ||
``` | ||
|
||
Gene annotation is also possible with ENSEMBL VEP tool, for test purposes only, it can be used online. But for big analysis, it is recommended to either download cache file or use --download_cache flag in parameters. | ||
|
||
Follow the documentation [here](https://www.ensembl.org/info/docs/tools/vep/script/vep_cache.html#cache) | ||
|
||
Example: | ||
|
||
Download [cache](https://ftp.ensembl.org/pub/release-110/variation/indexed_vep_cache/) | ||
|
||
```console | ||
cd $HOME/.vep | ||
curl -O https://ftp.ensembl.org/pub/release-110/variation/indexed_vep_cache/homo_sapiens_vep_110_GRCh38.tar.gz | ||
tar xzf homo_sapiens_vep_110_GRCh38.tar.gz | ||
``` | ||
|
||
4. Download the pipeline and test it on a minimal dataset with a single command: | ||
|
||
```console | ||
|
@@ -121,6 +138,8 @@ Note that some form of configuration will be needed so that Nextflow knows how t | |
Annotations are optional for the user. | ||
All VCF and BED files need to be indexed with tabix and should be in the same folder! | ||
|
||
The reference set bundle which is used in PCAWG study can be found and downloaded [here](https://dcc.icgc.org/api/v1/download?fn=/PCAWG/reference_data/pcawg-dkfz/dkfz-workflow-dependencies_150318_0951.tar.gz). (NOTE: only in hg19) | ||
|
||
**Basic Annotation Files** | ||
|
||
- dbSNP INDELs (vcf) | ||
|
@@ -176,18 +195,23 @@ Please read [usage](https://github.com/ghga-de/nf-platypusindelcalling/blob/main | |
|
||
## Credits | ||
|
||
nf-platypusindelcalling was originally written by Kuebra Narci [email protected]. | ||
nf-platypusindelcalling was originally translated from roddy-based pipeline by Kuebra Narci [email protected]. | ||
|
||
The pipeline is originally written in workflow management language Roddy. [Inspired github page](https://github.com/DKFZ-ODCF/IndelCallingWorkflow) | ||
|
||
The Indel calling workflow was in the pan-cancer analysis of whole genomes (PCAWG) and can be cited in the following publication: | ||
|
||
Pan-cancer analysis of whole genomes. The ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium. Nature volume 578, pages 82–93 (2020). DOI 10.1038/s41586-020-1969-6 | ||
|
||
We thank the following people for their extensive assistance in the development of this pipeline: | ||
|
||
- Nagarajan Paramasivam (@NagaComBio) [email protected] | ||
|
||
|
||
**TODO** | ||
|
||
<!-- TODO nf-core: If applicable, make list of people who have also contributed --> | ||
|
||
Nagarajan Paramasivam @NagaComBio [email protected] | ||
|
||
## Contributions and Support | ||
|
||
If you would like to contribute to this pipeline, please see the [contributing guidelines](.github/CONTRIBUTING.md). | ||
|
This file was deleted.
Oops, something went wrong.
Binary file not shown.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.