The train step needs large memory #51

jiangzy26 · 2024-03-29T11:25:55Z

Hi,
When I run ReLERNN_TRAIN with default settings, the step is killed because it uses too much memory. How can I deal with this? Could you help? Thank you very much.

jiangzy26 · 2024-04-01T08:13:50Z

Or could I use the same command line to simulate each chromosome separately, and then train on each chromosome to get the predicted result?

andrewkern · 2024-05-15T22:36:45Z

how much memory are you using?

willright28 · 2024-07-17T01:51:36Z

Hi, @andrewkern and all users,

I have the same problem. The training step was fine when no demographic history was set, but with a demographic history it took 3 TB or more of memory (the exact peak was not measured, because the job was killed).

My code is:
# A.phased.vcf: only filtered for missing SNPs, using vcftools --max-missing 0.9
# A_two-epoch_unfold.final.summary: output of stairwayplot2
ReLERNN/ReLERNN_SIMULATE \
  -v A.phased.vcf \
  --phased \
  -g fasta.fai \
  -d ./A \
  -n A_two-epoch_unfold.final.summary \
  -l 1 \
  -m reference.missing.bed \
  -t 80

Any clue on how to solve this?

Thanks in advance!

Best,
willright28

andrewkern · 2024-07-17T14:50:42Z

hi @willright28 -- what does your demographic history look like? I'm guessing a contracting population size moving to the present?

willright28 · 2024-07-18T01:49:11Z

Hi @andrewkern
Thanks for your reply. The demographic history looks like this: a strong bottleneck, then a rebound.

andrewkern · 2024-07-18T20:55:32Z

Just so I'm oriented here: is the y-axis correct? You have Ne going down to 0.1? Are these in relative units?

willright28 · 2024-07-19T01:04:19Z

Sorry for the confusion; the y-axis is log-transformed. The lowest Ne is ~1300.

andrewkern · 2024-07-19T19:04:29Z

Okay, thanks for the clarification. I'm looking at your code: it looks like your -g flag is pointing to a .fai file. Is that an indexed FASTA or a BED file?

willright28 · 2024-07-20T02:42:49Z

The .fai file is actually a BED file, formatted as: chr1 0 100000 (the end coordinate is the length of the chromosome)
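
For anyone whose file really is a samtools-style .fai index (sequence name in column 1, length in column 2) rather than a BED file, a BED in the layout described above could be derived roughly like this. This is only a sketch: fai_to_bed and the file names are made up for illustration and are not part of ReLERNN.

```shell
# Sketch: convert a samtools .fai index (name, length, ...) into a
# three-column BED (chrom, start=0, end=length).
# fai_to_bed is a hypothetical helper name, not a ReLERNN command.
fai_to_bed() {
    awk 'BEGIN { OFS = "\t" } { print $1, 0, $2 }' "$1"
}

# Example with placeholder file names:
# fai_to_bed reference.fa.fai > genome.bed
```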

willright28 · 2024-07-20T02:46:32Z

Maybe I can set --maxsites to a reasonable number to avoid this problem?

andrewkern · 2024-07-22T18:56:20Z

This shouldn't be too big. Can you provide your input files so I can poke around?

willright28 · 2024-07-29T03:05:17Z

I tested running a single chromosome, chr1, which is ~10% of the whole genome. The program worked fine and took only ~0.1 TB of memory.
Is it OK to run each chromosome separately, using the same demographic history for the -n parameter?

andrewkern · 2024-07-31T17:46:44Z

yes it should be fine to run each chromosome separately
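
A rough sketch of what a per-chromosome setup could look like, in case it helps others who hit the same memory limit. It splits an uncompressed VCF, the genome BED, and the mask BED by chromosome and prints one ReLERNN_SIMULATE command per chromosome, reusing only the flags from the command posted earlier in this thread. The function name per_chrom_commands, the output layout, and the "project" directory name are assumptions for illustration, and whether ReLERNN accepts an empty mask file for a chromosome with no masked regions has not been checked here.

```shell
# Sketch only: split inputs per chromosome and print (not run) one
# ReLERNN_SIMULATE command per chromosome. Flags and values are copied
# from the command earlier in the thread; everything else is hypothetical.
per_chrom_commands() {
    vcf=$1; genome_bed=$2; mask_bed=$3; demog=$4; outdir=$5
    cut -f1 "$genome_bed" | sort -u | while read -r chrom; do
        mkdir -p "$outdir/$chrom"
        # Keep header lines plus the records on this chromosome
        # (assumes a plain-text, uncompressed VCF).
        awk -v c="$chrom" '/^#/ || $1 == c' "$vcf" > "$outdir/$chrom/$chrom.vcf"
        # Keep only this chromosome's rows of the genome and mask BED files.
        awk -v c="$chrom" '$1 == c' "$genome_bed" > "$outdir/$chrom/$chrom.bed"
        awk -v c="$chrom" '$1 == c' "$mask_bed" > "$outdir/$chrom/$chrom.mask.bed"
        # Same demographic history file (-n) for every chromosome.
        echo "ReLERNN/ReLERNN_SIMULATE -v $outdir/$chrom/$chrom.vcf --phased" \
             "-g $outdir/$chrom/$chrom.bed -d $outdir/$chrom/project" \
             "-n $demog -l 1 -m $outdir/$chrom/$chrom.mask.bed -t 80"
    done
}

# Example with the file names from this thread (genome.bed is a placeholder):
# per_chrom_commands A.phased.vcf genome.bed reference.missing.bed \
#     A_two-epoch_unfold.final.summary ./by_chrom
```

The commands are only printed so they can be inspected first, then run one at a time or submitted as separate jobs; the later steps would then be run once per project directory.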
