GPU memory consumption increases every epoch during training #55

Open
jhkwag970 opened this issue Jan 23, 2025 · 2 comments

@jhkwag970

Hello,

Thank you for sharing your work!

As I am training on ImageNet1K, I noticed that GPU memory consumption increases by approximately 254 MB with each epoch. If this trend continues over the 300 training epochs, total usage would reach roughly 254 MB × 300 ≈ 76.2 GB.

Is this the intended behavior?

Thank you!
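
A minimal sketch of how this per-epoch growth can be tracked, assuming the standard torch.cuda memory counters (the helper below is illustrative, not part of the MambaVision training script):

```python
import torch

def log_gpu_memory(epoch: int) -> None:
    # Report allocated vs. reserved memory (in MB) on the current CUDA device.
    # A steady rise in "allocated" across epochs suggests tensors are being kept alive.
    allocated = torch.cuda.memory_allocated() / 1024**2
    reserved = torch.cuda.memory_reserved() / 1024**2
    print(f"epoch {epoch}: allocated={allocated:.1f} MB, reserved={reserved:.1f} MB")
```

Calling this at the end of each epoch makes it easy to see whether the growth comes from allocated tensors or only from the caching allocator's reserved pool.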

@jhkwag970 jhkwag970 changed the title from GPU memory consumption increases every epoch to GPU memory consumption increases every epoch during training on Jan 23, 2025
@ahatamiz
Collaborator

ahatamiz commented Feb 9, 2025

Hi @jhkwag970

Could you please provide more details? Specifically, whether you are using this version of the codebase or another.

I have not faced this issue before in any of the training runs.

Best

@jhkwag970
Author

@ahatamiz
Hello, thank you for your response. I am using the current MambaVision repo. After validation, when memory is allocated again for training, consumption is higher than in the previous epoch. I am calling torch.cuda.empty_cache() as a workaround for now; I just wanted to make sure this will not cause any problems during training.
