With the default --cutoff_len=256 --mbatch_size 1 --batch_size 1 --lr 2e-4, finetuning uses about 45315 MiB (44.2 GiB) on an A100, not 40G. With --cutoff_len=512 it uses 61843 MiB.
With a larger cutoff_len such as 768 it runs out of memory (OOM), but I really do need a larger cutoff_len.
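To put the two measurements in perspective, here is a rough sketch that extrapolates memory use to cutoff_len=768. It assumes memory grows linearly with cutoff_len between and beyond the two reported points, which is only an approximation (attention activations can grow faster than linearly). The helper names `mib_to_gib` and `extrapolate_linear` are made up for this illustration.

```python
# Peak GPU memory reported in this issue: cutoff_len -> MiB.
measured = {256: 45315, 512: 61843}


def mib_to_gib(mib: float) -> float:
    """Convert MiB to GiB (1 GiB = 1024 MiB)."""
    return mib / 1024


def extrapolate_linear(cutoff_len: int) -> float:
    """Estimate memory in MiB, ASSUMING linear growth through the two measured points."""
    (x0, y0), (x1, y1) = sorted(measured.items())
    slope = (y1 - y0) / (x1 - x0)  # MiB per extra token of cutoff_len
    return y1 + slope * (cutoff_len - x1)


for n in (256, 512, 768):
    est = extrapolate_linear(n)
    print(f"cutoff_len={n}: ~{est:.0f} MiB ({mib_to_gib(est):.1f} GiB)")
```

Under this linear assumption the estimate for cutoff_len=768 is 78371 MiB. The issue does not state the card's memory size, but if it is an 80 GB A100 that leaves very little headroom, which would be consistent with the OOM.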
better629 changed the title from "65b-4bit OOM with larger cutoff_len" to "finetune 65b-4bit OOM with larger cutoff_len" on May 16, 2023