Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

running on 10x + iso-seq #1

Open
dabitz opened this issue Jun 1, 2021 · 6 comments
Open

running on 10x + iso-seq #1

dabitz opened this issue Jun 1, 2021 · 6 comments

Comments

@dabitz
Copy link

dabitz commented Jun 1, 2021

Hi,

I wonder if bcctools also works correcting barcodes from 10x library followed by iso-seq instead of illumina seq? It would be nice if it works, since I am seeing a lot of unexpected barcodes in my sample. I guess because of some errors in barcodes.

Best,
André

@bkehr
Copy link
Contributor

bkehr commented Jun 7, 2021

Hi André,

I'd be happy to implement an option for Iso-Seq data in principle.

However, bcctools only corrects substitution errors and no indel errors, which is why I don't recommend using bcctools for barcode correction in long read data in general.

Best regards
Birte

@dabitz
Copy link
Author

dabitz commented Jun 8, 2021

Hi Birte,

Thanks for the reply. I guess with HiFi pacbio Iso-Seq reads indel issues should be minor. Thus, it would be nice to have a way to correct at least the substitution errors. I would be happy to try it, as I have currently experienced and issue with our combined 10X-Iso-Seq library that seems to be split in too much barcodes...

Best,
André

@bkehr
Copy link
Contributor

bkehr commented Jun 8, 2021

Hi André,

in this case I'll be happy to give it a try. I'd suggest to add an option to bcctools that allows you to specify your UMI/barcode design, with the restriction that your barcodes cannot be longer than 16 bp (I guess they are 16 bp long in your 10x library).

Do you currently process your data with the analysis steps recommended by PacBio? If yes, my suggestion would be to replace the isoseq3 tag command (Step 3) by bcctools.

Best,
Birte

@dabitz
Copy link
Author

dabitz commented Jun 8, 2021

Hi Birte,

Thanks for implementing this option. I think it will be very nice to have this correction step on the pipeline. And yes, my barcode is 16bp + 12 UMI (10X v3). I am currently, using this pipeline from cupcakehttps://github.com/Magdoll/cDNA_Cupcake/wiki/Iso-Seq-Single-Cell-Analysis:-Recommended-Analysis-Guidelines

but I stop at step 5, since later I need to split the fasta by barcodes and then map the best cell individually. Therefore, it would be nice to have an option to add the barcode as prefix for the read names. Thus, I can easily demultiplex by name.

Cheers
André

@bkehr
Copy link
Contributor

bkehr commented Jun 8, 2021

Excellent, thanks. I should be able to work with this information and will get back to you once I have something working.

@dabitz
Copy link
Author

dabitz commented Jun 8, 2021

Great! Looking forward!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants