Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[QUESTION] mapping to reference stringency #48

Open
ccr25 opened this issue Jul 1, 2022 · 1 comment
Open

[QUESTION] mapping to reference stringency #48

ccr25 opened this issue Jul 1, 2022 · 1 comment

Comments

@ccr25
Copy link

ccr25 commented Jul 1, 2022

Hi,

Is it possible to increase the stringency of the chunk mapping to the reference for enrichment? We are getting a few thousands reads that map to our reference from UNCALLED but when we run the data through kraken we get very few accurate reads to our bacterial genome we are enriching for. I realize we need to allow for errors but can we adjust the number of mismatches allowed in a chunk? Thanks,

Chandler

@skovaka
Copy link
Owner

skovaka commented Jul 7, 2022

When you say a few thousand reads, out of how many? If it's 1,000 out of 1 million, for example, that's a 0.1% error rate, which is about as good as you can expect.

Are you enriching for a single bacterial genome, or multiple? Repeats and low-complexity sequences are the main cause of miss-classification for UNCALLED. Besides that, reducing the number of chunks to attempt mapping is the main way to improve precision, at the cost of sensitivity of course. Hope that helps!

Thanks,
Sam

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants