When fine-tuning llama-7b, approximately how much GPU memory is required for training? #2

Open
zty07 opened this issue Oct 15, 2023 · 4 comments

zty07 commented Oct 15, 2023

When fine-tuning LLaMA-7B, approximately how much GPU memory is required for training?

MJ10 (Contributor) commented Feb 13, 2024

Hi @zty07, sorry for the extremely late response. Could you please clarify which experiment you are interested in running? The memory requirement depends on the task (specifically the sequence length). The quantization code is unfortunately somewhat broken but will be fixed soon, which should help lower the memory requirements.
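
For rough scale: full fine-tuning of a 7B model with Adam in mixed precision takes on the order of 16 bytes per parameter for weights, gradients, and optimizer states, i.e. over 100 GB before activations, so quantizing the base weights and training only small adapters is the usual way to fit on a single GPU. Below is a minimal sketch of such a setup using the Hugging Face transformers and peft libraries; this is not this repo's quantization code, and the checkpoint name and LoRA settings are illustrative.

```python
# Sketch: load LLaMA-7B in 4-bit and train only LoRA adapters.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # quantize base weights to 4-bit NF4
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,  # matmuls run in bf16
)

model = AutoModelForCausalLM.from_pretrained(
    "huggyllama/llama-7b",                  # assumed checkpoint name
    quantization_config=bnb_config,
    device_map="auto",
)

lora_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],    # a common choice for LLaMA blocks
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()          # only the LoRA weights require grads
```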

isaacbmiller commented

@MJ10 Did the quantization code ever get fixed?


abdalgader-a commented Apr 28, 2024

@MJ10 -- running the next-sentence code with a 2B/3B-parameter model throws an OOM error. Any suggestions for resolving this?
(PS: I used 8× A100 80GB GPUs.)

isaacbmiller commented

@abdalgader-a I have managed to get it running on a single A100, but my num_samples is way less than 20.
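
A generic sketch of the usual OOM levers in this situation, assuming a standard transformers causal-LM training setup; this is not the repo's training script, and `num_samples` is taken from the comment above rather than from its config.

```python
# Sketch: common ways to lower peak training memory (assumptions noted inline).
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "gpt2",                               # stand-in checkpoint for illustration
    torch_dtype=torch.bfloat16,
)
model.gradient_checkpointing_enable()     # recompute activations in the backward pass
model.config.use_cache = False            # KV cache is unnecessary during training

# Fewer generated samples per step and a small per-device batch with gradient
# accumulation hold fewer activations in memory while preserving the effective
# batch size. The names below are illustrative, not the repo's flags.
num_samples = 4                           # e.g. well below 20, as noted above
per_device_batch_size = 1
gradient_accumulation_steps = 8
```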
