-
Notifications
You must be signed in to change notification settings - Fork 855
Issues: NVIDIA/nccl
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
why NCCL_MAX_NCHANNELS cannot limit ncclDevKernel_SendRecv grid size
#1598
opened Feb 8, 2025 by
Graham1025
Is there a relationship between ncclTopoSearchNextGpuSort and followPath
#1596
opened Feb 7, 2025 by
zhangdexin
Question about algorithms used for all-gathers and reduce-scatters
#1594
opened Feb 5, 2025 by
siddharth9820
Design question: what is
comm->planner.tmpCollWorkQueue
used for?
#1592
opened Jan 30, 2025 by
YconquestY
NCCL internal error for ncclCommInitRank when using infiniband
#1591
opened Jan 29, 2025 by
SzymonOzog
NCCL Ignores Specified SOCKET_IFNAME Configuration on Worker Nodes in Multi-Node Setup
#1581
opened Jan 18, 2025 by
rachid2198
NCCL_SOCKET_IFNAME has no effect during pytorch distributed training with multiple NICs
#1580
opened Jan 18, 2025 by
hanruijiang
BusBW of 2-node tree-based Allreduce exceeds the theoretical limit
#1576
opened Jan 16, 2025 by
JK-Jiagn
Potential group\collective life-time management issue in profiler plugin.
#1569
opened Jan 9, 2025 by
wiryls
[Hopper/NVLINK4] Origin of failure of fabric manager manifested through NCCL-based codes
#1562
opened Jan 3, 2025 by
vitduck
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.