Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

using a fragmented genome as an input #68

Open
dcopetti opened this issue Jun 17, 2019 · 0 comments
Open

using a fragmented genome as an input #68

dcopetti opened this issue Jun 17, 2019 · 0 comments

Comments

@dcopetti
Copy link

Hello,

I am dealing with a complex plant genome (higly heterozygous, haploid genome size 2.5 Gb, I have a diploid assembly of 5 Gb, N50 200 kb, N90 5 kb) and I would like to extend its contiguity with ONT reads.
I thought of using the assembly (has scaffolds, but only 1.5% Ns) as anonther input with the ONT data in the minimap stage, to create extensions of the scaffolds. The ONT data (20x of 5 Gb, N50 9 kb) could be selected to have reads only above 2 kb, for example.

My question is whether the minimap2 alignment or graph construction steps will be affected by having two types of data: highly-accurate scaffolds and ~85% accurate ONT reads. Do you think this will be a good strategy?

Basically, the scaffolds (highly accurate at the nt level, but of coverage 1x) will be a baseline set of sequences to be extended with the longer (with coverage) ONT reads.
Do you think it is worth a try?
Thanks,

Dario

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant