Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handling computed copy numbers with spike-ins #13

Open
LukaP-BB opened this issue Jan 4, 2024 · 2 comments
Open

Handling computed copy numbers with spike-ins #13

LukaP-BB opened this issue Jan 4, 2024 · 2 comments

Comments

@LukaP-BB
Copy link

LukaP-BB commented Jan 4, 2024

Hi, I'm considering using COMPASS with my data, but I'm a bit reluctant to use it with the raw read counts as suggested in the readme. My data has spike-ins that I manually identify in my workflow, allowing me to compute and segment CNVs in my data. I end up with calls that are way less noisy than the raw data, that I would rather use.

Would using ploidy values work well with the algorithm ?
And is there a way to label the cells beforehand, at least the spike-ins, to help with the tree inference ?

Many thanks !

@e-sollier
Copy link
Collaborator

Hi,

If you already have copy numbers for each region and each cell, I would use in place of the raw read counts something like int(50*copy_number). Using the copy numbers directly won't work well because COMPASS assumes that the read counts come from a negative binomial distribution, so if you use copy numbers as read counts you would end up with very low read counts, which would be inferred to be rather unreliable.

Cells cannot be labelled. I don't know which protocol you are using, what do you mean by spike-in cells ?

@LukaP-BB
Copy link
Author

LukaP-BB commented Jan 8, 2024

I see, thanks for your reply !
In our protocol, we add "normal" cells to the mix, that serve as reference for the copy numbers. The first step of the bioinformatics workflow is to identify these cells to compute the ploidy using the dedicated method in the mosaic library.
These cells could be considered as the "root" of the phylogenic tree, since they have no CNAs or SNVs.

I'll give the 50*CN method a try and come back to you if I encounter any issues.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants