Cannot use FastaVariant without genotype #152

victorlin · 2019-07-18T02:05:05Z

I am trying to generate a consensus with the ClinVar VCF file (FTP link: ftp://ftp.ncbi.nlm.nih.gov/pub/clinvar/vcf_GRCh37/clinvar_20190715.vcf.gz).
However, the file does not contain genotype information (no samples). This seems to trigger an IndexError while parsing in __init__.py (line 1111).

I'm fairly new to working with VCF files in general - please let me know if I'm overlooking anything.

The text was updated successfully, but these errors were encountered:

mdshw5 · 2019-07-18T14:51:58Z

This might be the wrong tool for the task. Currently, FastaVariant only incorporates genotypes from individual samples in a VCF, and only SNPs and MNPs, not indels (see #84). If you're trying to add ClinVar alleles prior to alignment, I might suggest you use something like hisat2 to build a graph-based index. Otherwise, if you want to make a consensus FASTA from this VCF, I think FastaAlternateReferenceMaker will do the trick.

victorlin · 2019-07-21T22:18:02Z

Thanks for the pointers. I'll look into the other tools you've mentioned.

mdshw5 added the wontfix label Jul 18, 2019

mdshw5 self-assigned this Jul 18, 2019

victorlin closed this as completed Jul 21, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cannot use FastaVariant without genotype #152

Cannot use FastaVariant without genotype #152

victorlin commented Jul 18, 2019

mdshw5 commented Jul 18, 2019

victorlin commented Jul 21, 2019

Cannot use FastaVariant without genotype #152

Cannot use FastaVariant without genotype #152

Comments

victorlin commented Jul 18, 2019

mdshw5 commented Jul 18, 2019

victorlin commented Jul 21, 2019