Q: Using exclusion mask feature vs excluding regions in downstream analyses #1290
Replies: 1 comment 1 reply
-
Well, let's see - we're planning to make a release sometime this summer. I see from your other issue you're trying out the github version; that's fine as long as you're OK with things possibly changing. (We have no plans to change the masking stuff, though.)
Well, that depends what you do, I think? The masked tree sequence has just "missing data" in the masked regoins; so as long as your analyses do the right thing with that then you're OK. For instance, if you write out to VCF then that will be equivalent to writing out the un-masked VCF and removing rows in the masked region. To compute statistics using the tree sequence itself correctly you need to do some extra stuff (since at present these don't deal with missing data appropriately), and I'm happy to help with that (maybe over as a tskit discussion?).
Well, the second thing is more stable and certain, so maybe masking it downstream? |
Beta Was this translation helpful? Give feedback.
-
Hi,
I became aware of the
exclusion_mask
while reading through your github issues. It was recommended for excluding the low recombination regions when we use realistic recombination maps like HapMap. I want to use the exclusion mask functionality but it is currently not a part of a stable release.I have a few questions:
Thanks!
Best,
Isin
Beta Was this translation helpful? Give feedback.
All reactions