nccl
Here are 35 public repositories matching this topic...
Safe Rust wrapper around the CUDA toolkit
Updated Sep 6, 2024 - Rust
Distributed and decentralized training framework for PyTorch over a graph
Updated Jul 25, 2024 - Python
An open collection of methodologies to help with successful training of large language models.
Updated Feb 15, 2024 - Python
An open collection of implementation tips, tricks, and resources for training large language models
Updated Mar 8, 2023 - Python
Federated Learning Utilities and Tools for Experimentation
Updated Jan 11, 2024 - Python
Best practices and guides for writing distributed PyTorch training code
Updated Nov 5, 2024 - Python
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
Updated Nov 15, 2023 - C++
NCCL examples from the official NVIDIA NCCL Developer Guide.
Updated May 29, 2018 - CMake
Examples of how to call collective operation functions in multi-GPU environments, covering broadcast, reduce, allGather, reduceScatter, and sendRecv operations (a minimal sketch follows below).
Updated Aug 28, 2023
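For readers new to the API, here is a minimal sketch of a single-process broadcast across two GPUs. It is not taken from the repository above; the device count and buffer size N are arbitrary choices for illustration, and error checking is omitted for brevity.

```c
#include <cuda_runtime.h>
#include <nccl.h>

#define N (1 << 20)  /* elements per buffer; arbitrary for illustration */

int main(void) {
  const int nDev = 2;            /* assumes at least two visible GPUs */
  int devs[2] = {0, 1};
  ncclComm_t comms[2];
  float* buff[2];
  cudaStream_t streams[2];

  /* Allocate one buffer and one stream per device. */
  for (int i = 0; i < nDev; ++i) {
    cudaSetDevice(devs[i]);
    cudaMalloc((void**)&buff[i], N * sizeof(float));
    cudaStreamCreate(&streams[i]);
  }

  /* One communicator per device within a single process. */
  ncclCommInitAll(comms, nDev, devs);

  /* Broadcast rank 0's buffer to every rank. Grouping the per-device
     calls lets NCCL launch them as one collective without deadlock. */
  ncclGroupStart();
  for (int i = 0; i < nDev; ++i)
    ncclBroadcast(buff[i], buff[i], N, ncclFloat, 0, comms[i], streams[i]);
  ncclGroupEnd();

  /* Wait for the collective to finish on every device, then clean up. */
  for (int i = 0; i < nDev; ++i) {
    cudaSetDevice(devs[i]);
    cudaStreamSynchronize(streams[i]);
    cudaFree(buff[i]);
    ncclCommDestroy(comms[i]);
  }
  return 0;
}
```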
Distributed non-negative matrix factorization in Python with custom clustering
Updated Aug 22, 2023 - Python
Installation script that automatically installs the NVIDIA driver and CUDA on Ubuntu
Updated Apr 24, 2022 - Shell
Blink+: Increase GPU group bandwidth by utilizing cross-tenant NVLink.
Updated Jun 22, 2022 - Jupyter Notebook
Uses ncclSend and ncclRecv to implement ncclSendrecv, ncclGather, ncclScatter, and ncclAlltoall (a sendrecv sketch follows below).
Updated Mar 1, 2022 - Cuda
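As a rough illustration of the idea (a hypothetical helper, not code from that repository): a bidirectional exchange can be composed from NCCL's two point-to-point primitives by grouping a send and a receive so both directions are scheduled together.

```c
#include <nccl.h>

/* Hypothetical sendrecv built from NCCL point-to-point primitives:
   exchange `count` elements with `peer`. Grouping the send and the
   receive is what keeps the two ranks from deadlocking on each other. */
ncclResult_t sendrecv(const void* sendbuf, void* recvbuf, size_t count,
                      ncclDataType_t type, int peer,
                      ncclComm_t comm, cudaStream_t stream) {
  ncclGroupStart();
  ncclSend(sendbuf, count, type, peer, comm, stream);
  ncclRecv(recvbuf, count, type, peer, comm, stream);
  return ncclGroupEnd();
}
```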
Distributed deep learning framework based on PyTorch, Numba, NCCL, and ZeroMQ.
Updated Aug 10, 2023 - Python