【Question】What is the minimum number of GPUs required to train deepseek 671B with GRPO? How about using LoRA? #5911
Triggered via issue
February 25, 2025 00:58
Status
Success
Total duration
11s
Artifacts
–