-
Notifications
You must be signed in to change notification settings - Fork 4.5k
Issues: hpcaitech/ColossalAI
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[BUG]:The program froze while attempting R1 Lora
bug
Something isn't working
#6234
opened Feb 28, 2025 by
ygxw0909
2 tasks done
[BUG]: cannot import name 'CpuAdamArmExtension' from 'colossalai.kernel.extensions' (unknown location)
bug
Something isn't working
#6233
opened Feb 28, 2025 by
Fence
2 tasks done
[BUG]: how to fine tune DeepSeek-R1-Distill-Qwen-7B with lora
bug
Something isn't working
#6232
opened Feb 28, 2025 by
AI-HR
2 tasks done
[BUG]: 【R1 SFT Bug,loss should start from 1】
bug
Something isn't working
#6227
opened Feb 27, 2025 by
447428054
2 tasks done
[BUG]: RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn
bug
Something isn't working
#6225
opened Feb 27, 2025 by
klompn
2 tasks done
[BUG]: Lora load error
bug
Something isn't working
#6221
opened Feb 25, 2025 by
447428054
2 tasks done
[BUG]: EP16 negative split
bug
Something isn't working
#6220
opened Feb 25, 2025 by
447428054
2 tasks done
【Question】What is the minimum number of GPUs required to train deepseek 671B with GRPO? How about using LoRA?
#6219
opened Feb 25, 2025 by
LiuShixing
[BUG]: /bin/bash: line 0: export: `NPU-VISIBLE-DEVICES=0,1,2,3,4,5,6,7': not a valid identifier
bug
Something isn't working
#6217
opened Feb 24, 2025 by
Gera001
2 tasks done
Respecting regulations and stabilizing the ecosystem by activists
bug
Something isn't working
#6216
opened Feb 24, 2025 by
MASIHMIRSALI
2 tasks done
[BUG]: Precision overflow occurs when moe forward is performed
bug
Something isn't working
#6212
opened Feb 21, 2025 by
zh2333
2 tasks done
[BUG]: failed to install coati in npu docker environment
bug
Something isn't working
#6209
opened Feb 20, 2025 by
wangyuan249
2 tasks done
[BUG]: 该如何安装colossal到NPU上,看项目有相关描述,但没找到相关教程
bug
Something isn't working
#6205
opened Feb 20, 2025 by
obj12
2 tasks done
[DOC]: Update the documentation of ShardConfig for 1D, 2D, 2.5D, 3D tensor parallelism
documentation
Improvements or additions to documentation
#6197
opened Feb 18, 2025 by
giriprasad51
[FEATURE]: Expert Parallel for qwen/deepseek
enhancement
New feature or request
#6180
opened Jan 12, 2025 by
Guodanding
[BUG]: RuntimeError: mat1 and mat2 must have the same dtype, but got Float and BFloat16
bug
Something isn't working
#6169
opened Dec 25, 2024 by
balcklive
1 task done
[BUG]: Gemini saved an additional portion of the weights while using tie_word_embeddings=True
bug
Something isn't working
#6160
opened Dec 13, 2024 by
ericxsun
1 task done
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.