Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Read length is not restricted to the set limit #23

Open
schmeing opened this issue May 12, 2020 · 2 comments
Open

Read length is not restricted to the set limit #23

schmeing opened this issue May 12, 2020 · 2 comments
Assignees

Comments

@schmeing
Copy link

schmeing commented May 12, 2020

I run the simulation using the following commands:

./generate_reads.py -d -r ref.fa -a abundance.txt -o uniform-reads -t 18000000  -l 100 -i 164 -s 30
./trim_reads.pl -i reads1.fastq -f uniform-reads.1.fasta -o bear1.fq -r dri_read1.txt.per -q reads1.fastq.err.qual -m reads2.fastq.err.matr
./trim_reads.pl -i reads2.fastq -f uniform-reads.2.fasta -o bear2.fq -r dri_read2.txt.per -q reads2.fastq.err.qual -m reads2.fastq.err.matr

Looking at the cigar string of the mappings, the reason seems to be InDels that are inserted after the sequence was cut to the desired length:

66M1D33M
89M1I11M

If it helps I can supply the input files, but I assume this is general and not limited to those.

@sej917 sej917 self-assigned this May 13, 2020
@sej917
Copy link
Owner

sej917 commented May 13, 2020

Oof, yes, this needs to be fixed. Back when I wrote BEAR, a million years ago, non-uniform read length distributions were a lot more common (at least with the folks I was working with), so this specific behaviour wouldn't really be noticeable. Clearly not the case these days.

if you wouldn't mind, could you supply the input files? Let's just limit it to the files for your second command. Your diagnosis is almost certainly correct, but I'd like to be 100% certain.

@schmeing
Copy link
Author

Can you send me an email here, so I can reply with the download link, please?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants