Replies: 2 comments
-
From Jana Obšteter (pasted from a slack thread)
|
Beta Was this translation helpful? Give feedback.
0 replies
-
There's a thread at #523 too |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi everyone,
Sorry for the long questions. I am trying to prepare data for running
tsinfer
(Kelleher et al 2019) on some malaria parasite WGS data, but got stuck with details in inferring the ancestral states and filtering the results. I learned from the tsinfer paper and Jana’s talk that we could useest-sfs
(Keightley and Jackson, 2018) to estimate the ancestral allele state. It seems not very hard to run the program itself. However, it is not clear to me:est-sfs
: Keightley’s paper used EPO to align 6 primates. I do not know if there is a similar tool for non-human species. Can I just try to find multiple WGS raws reads for the focal species and one high-quality WGS reads for each outgroup species, and align them to the reference genome of the focal species using bowtie2 or bwa? Or should I just use some multi-aligner like GSalign (by Lin et al 2020) to directly align the outgroup species’ reference gnomes to the reference gnome of the focal species?est-sfs
results (i.e what threshold do we use to say we are confident about the inferred ancestral allele).est-sfs
generates a result file contains the probability of the major allele being ancestral for each site. It ranges from 0 to 1. We will be confident about the ancestral state if the probability is close 0 or 1 but hard to determine if the probability is in the middle. Do we have any suggestions on what threshold should be used to filter the sites based on the probability of major allele being ancestral?Any suggestions or comments would be appreciated! Thank you so much!
Beta Was this translation helpful? Give feedback.
All reactions