Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Fixes made:
CNA Gene Merge Tool: CNA Gene Merge Tool #653
Script that will take a CNA file and perform the following gene merges:
CDKN2Ap16/14 -> CDKN2A
MLLs -> KMTs (MLL, MLL2, MLL3 and MLL4 -> KMT2A KMT2B KMT2C KMT2D)
FAM123B -> AMER1
MYCL1 -> MYCL
Remove duplicate variants from MAF
Script to remove duplicate maf records based on the 8 key columns (Entrez_Gene_Id, Chromosome, Start_Position, End_Position, Variant_Classification, Tumor_Seq_Allele2, Tumor_Sample_Barcode, HGVSp_Short)
Calculates VAF for each record and picks the record with high VAF
Formula for VAF = t_alt_count / (t_ref_count + t_alt_count)
Missing patient ids in patient file GENIE submission process - patient ids with no attributes are missing in patient file. #654
Patients that have no attributes only appear in the sample clinical data file.
Vital status location change GENIE submission process - vital status location change #657
Patient vital status attributes were moved from the patient file into vital_status.txt. Since they GENIE has reversed that decision and now wants vital status information back in the patient clinical file.
Change value for clinical attributes with no value GENIE submission process - change value for clinical attributes with no value. #656
AGE_AT_SEQ_REPORT, ONCOTREE_CODE, BIRTH_YEAR, YEAR_DEATH, YEAR_CONTACT, INT_CONTACT, INT_DOD cannot be NA or blank. All attribute values should be "Unknown" when the patient/attribute or sample/attribute pair is missing a value.
Also, for attributes that take on NAACR values, like SEX, PRIMARY_RACE, SECONDARY_RACE, TERTIARY_RACE, ETHNICITY, they should take on the NAACR value for NA - (SEX = 9, RACE = 99, ETHNICITY = 9)
Gene panel names missing prefix in clinical file GENIE submission process - gene panel names missing prefix in clinical file. #655
In the "Sequence Assay ID" column of the data_clinical_sample.txt file, IMPACT panel names should be prefixed with an "MSK-".