You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A thread on Zulip mentioned some additional care required for NCCL within a Legion task. Rohan spotted a problem in FlexFlow's use of ncclAllReduce. You may need to add concurrent_task_barrier before and after the call, and call set_concurrent_barrier on the task. More info is in the comment for that barrier.
The text was updated successfully, but these errors were encountered:
flexflow-train/lib/kernels/src/cuda/optimizer_kernel.cu
Line 83 in de7fa32
A thread on Zulip mentioned some additional care required for NCCL within a Legion task. Rohan spotted a problem in FlexFlow's use of
ncclAllReduce
. You may need to addconcurrent_task_barrier
before and after the call, and callset_concurrent_barrier
on the task. More info is in the comment for that barrier.The text was updated successfully, but these errors were encountered: