Skip to content
This repository has been archived by the owner on Apr 16, 2021. It is now read-only.

Repository for CCR analysis as part of Dr. Aaron Quinlan's lab rotation

Notifications You must be signed in to change notification settings

jkunisak/quinlan_lab_rotation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

quinlan_lab_rotation

This is the repository that contains the data and files needed to generate the 2nd version of CCRs. This version includes variants from gnomad exome and whole genome sequencing samples.

The goal of this project is to generate a new model for CCRs that will account for different SIFT/PolyPhen scores and VAF. This will allow for the new CCR model to identify regions enriched in Clinvar pathogenic variants.

Required files include:

  1. new_CCR_jason.ipynb = jupyter notebook for generation of new CCR models
  2. exter.py function to read the gff file
  3. pathoscore.py function from PathoScore repository
  4. PathoScore Clinvar truth sets: see here to make the Clinvary truth-set directory
  5. gnomad exome and wgs vcf files
  6. GFF transcript file

Output files:

  1. pathoscore results will be provided in the pathoscore_results output directory
  2. new_CCR.bed.gz = bed file with the CCR windows
  3. new_CCR.bed.gz.tbi = indexed bed file
  4. new_CCR.txt = txt file with the CCR windows

About

Repository for CCR analysis as part of Dr. Aaron Quinlan's lab rotation

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages