-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Empty counts files #69
Comments
I suggest that the count file ( So there is an error happening upstream. So you should check all output files of the previous processes to see where the pipeline creates empty counts. My suggestion is that the assignment did not work (process |
Any news here? Could you solve it? |
Hi, yes and no. I figured out that my barcode reads had extra bases, and I think that was causing it to not be able to match them to barcodes properly. So I trimmed those and ran it again. Now it fails at this point:
In a previous post where you aided me I got this error with a different data set and you said it could be ignored and was a result of my not having replicates to calculate correlations with. That is the case with this dataset as well. However, unlike that previous data, the outputs generated by this point don't make any sense. There are large groups of barcodes with identical count results, and in the case of one set of data every RNA count is exactly 25,000 except one that is 50,000 and one that is 100,000. Looking at raw DNA counts showed small whole numbers that were grouped (ie: a handful of barcodes with 4, a handful with 16, a handful with 2) instead of the more continuous distribution we expected. It is unclear if something is still going wrong in the processing, or if something is weird with our sequencing data itself. Any insights? We are really excited to use this software for some larger scale projects, but so far we haven't gotten it to work. Also happy to correspond by email if you're willing to take the conversation off of your issues page. Thanks in advance for your continued assistance. |
ok. the first part is because of missing replicates as you already figured out by yourself. But the missing replicates will make the second part very dificult to find out. This is a really strange behaviour. Maybe you have PCR artefacts. Did you use UMIs? Than you should check the duplication rate. I am just guessing but I think this large very even numbers like 50,000 or 10,000 are the issue of some counting artefact. Im not sure what exactly causes the issue but e.g. it can be possible that the command I really do not know if is an issue wih the assignment (e.g. if most barcodes are assigned to specific oligos) or the with your count data (e.g. PCR artefacts), or your experiment design in general (having elements in taht have an enourmous activity compared to the other. |
I got the pipeline to work with a previous set of sequencing data but now it is finishing with the following error message, and leaving empty count files in the outputs folder. All that I've changed is the data directory, and an updated experiment.csv file to account for the new data. Any idea where the breakdown might be occurring?
Thanks!
The text was updated successfully, but these errors were encountered: