Question about GPU Usage, Training Time, and Dataset Size #6

aoliao12138 · 2024-11-29T11:40:34Z

Nice work! I have a few questions regarding the details of your experiment.

How many GPUs did you use for training?
How much time did it take to train the model?
Could you share the size of the dataset you used?

Thanks in advance!

SHYuanBest · 2024-11-30T03:28:02Z

Thank you for your interest in ConsisID. We used 40 NVIDIA H100 GPUs with a total batch size of 80, and trained for 1800 steps. The training dataset consists of approximately 140,000 video clips. More details can be found in our report.

SHYuanBest · 2024-12-01T02:52:28Z

but only a single 80G graphics card is needed for training

tyrink · 2024-12-11T12:07:09Z

but only a single 80G graphics card is needed for training

Since the above mentions that 40 NVIDIA H100 GPUs are used for training, how long will it take to train on a single 80G gpu such as A100? Will it affect the model performance?

SHYuanBest · 2024-12-11T12:11:53Z

Fro Q1, the specific speed difference may require actual testing. For Q2, ideally, it will not affect performance, but in reality, the global batch size (40x GPU vs 1x GPU) may affect convergence.

1151368613 · 2025-01-21T14:10:42Z

Thank you for your interest in ConsisID. We used 40 NVIDIA H100 GPUs with a total batch size of 80, and trained for 1800 steps. The training dataset consists of approximately 140,000 video clips. More details can be found in our report.

请问您用40张H100GPU训练了多长时间大概。

SHYuanBest · 2025-01-21T14:46:17Z

Thank you for your interest in ConsisID. We used 40 NVIDIA H100 GPUs with a total batch size of 80, and trained for 1800 steps. The training dataset consists of approximately 140,000 video clips. More details can be found in our report.

请问您用40张H100GPU训练了多长时间大概。

About 7～8 hours.

1151368613 · 2025-01-22T10:37:38Z

好的感谢您的回答，然后我还想问下，请问你们训练的时候是单精度训练还是双精度训练，因为我想用H800显卡跑，但是H800在双精度上性能比较差。

SHYuanBest · 2025-01-22T10:51:37Z

好的感谢您的回答，然后我还想问下，请问你们训练的时候是单精度训练还是双精度训练，因为我想用H800显卡跑，但是H800在双精度上性能比较差。

用的bf16训练

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about GPU Usage, Training Time, and Dataset Size #6

Question about GPU Usage, Training Time, and Dataset Size #6

aoliao12138 commented Nov 29, 2024

SHYuanBest commented Nov 30, 2024

SHYuanBest commented Dec 1, 2024

tyrink commented Dec 11, 2024

SHYuanBest commented Dec 11, 2024

1151368613 commented Jan 21, 2025

SHYuanBest commented Jan 21, 2025

1151368613 commented Jan 22, 2025

SHYuanBest commented Jan 22, 2025

Question about GPU Usage, Training Time, and Dataset Size #6

Question about GPU Usage, Training Time, and Dataset Size #6

Comments

aoliao12138 commented Nov 29, 2024

SHYuanBest commented Nov 30, 2024

SHYuanBest commented Dec 1, 2024

tyrink commented Dec 11, 2024

SHYuanBest commented Dec 11, 2024

1151368613 commented Jan 21, 2025

SHYuanBest commented Jan 21, 2025

1151368613 commented Jan 22, 2025

SHYuanBest commented Jan 22, 2025