-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Accuracy #59
Comments
Hi Ralph, stringMLST should be deterministic given the same reads and kmer db. If you update the database that has the possibility of calling a different ST because additional gene sequences and alleles are available - this I don't think is super surprising. I would be surprised if running stringMLST on the same sample, with the same db resulted in different results. This should really only happen if there's significant contamination and even then be rare.
|
I appreciate the clarification. I have 2 more questions if you don't mind.
|
Technically yes, though it's sensitive to read errors, which tend to be more prevalent in ONT reads, and converges to a solution better with higher read count which can sometimes be an issue with ONT reads. If you have high coverage ONT, error corrected data you should be fine.
Variant calling provides much more fine grained data - remember an ST is 7 data points, variant calls could be hundreds or thousands. STs are an approximation of genetic relatedness (more phenotype than genotype). They may not cluster in the same branch but they'll likely appear in the same subtree. |
Hi, I am using stringMLST and I noticed that I got different results for running the same sample more than once. The database changed (updated). What was surprising that some of the assigned ST's in the first run were completely different in the second. is this normal?
The text was updated successfully, but these errors were encountered: