Q: How many fragments are generally sufficient for hmmratac function? #687

leqi0001 · 2025-02-12T02:58:06Z

Hi,

I have a big cohort of scATACseq data, totaling 1 billion fragments from 24 pools of experiments (24 billion total). I've been trying to run hmmratac on these 24 fragment files with --cutoff-analysis-only, but it has been a week. The log says it downsampled to 800 million fragments for training, but the step to generate short, mono-, di-, and tri-nucleosomal signals has run for 3 days. Should I downsample the fragments files before hmmratac? There must be diminishing returns with more fragments, but I'm not sure how to decide the degree of downsampling.

Appreciate any suggestions!

taoliu · 2025-02-12T20:47:25Z

800 millions reads is still too much for one single run on human genome... You can further down-sample to about 50million.
As for scATAC, you may want to select a subset of cells to call peaks.
Did you use MACS3 v3.0.2?

leqi0001 · 2025-02-12T21:06:19Z

@taoliu
Thanks for your reply! I'm trying downsampling and see what happens. Is there any downside with using too many reads other than memory/time consumption, such as increased noise?

Yes I used v3.0.2.

taoliu · 2025-02-20T04:52:40Z

@leqi0001 First, it will be a waste of $ if you sample is already saturated. Secondly, it will take more time and memory to process -- especially for hmmratac. Lastly, for method calling peaks based on p-value cutoff, such as callpeak, p-value will be overly optimistic when the sample size is large. In this case, effective size (using foldchange) should be considered together with p-value cutoff.

leqi0001 added the General Question label Feb 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Q: How many fragments are generally sufficient for hmmratac function? #687

Q: How many fragments are generally sufficient for hmmratac function? #687

leqi0001 commented Feb 12, 2025 •

edited

Loading

taoliu commented Feb 12, 2025

leqi0001 commented Feb 12, 2025 •

edited

Loading

taoliu commented Feb 20, 2025

Q: How many fragments are generally sufficient for hmmratac function? #687

Q: How many fragments are generally sufficient for hmmratac function? #687

Comments

leqi0001 commented Feb 12, 2025 • edited Loading

taoliu commented Feb 12, 2025

leqi0001 commented Feb 12, 2025 • edited Loading

taoliu commented Feb 20, 2025

leqi0001 commented Feb 12, 2025 •

edited

Loading

leqi0001 commented Feb 12, 2025 •

edited

Loading