Mojo function
init_comms
init_comms(ngpus: Int)
Pre-initialize NCCL/RCCL communicators.
Must be called from a single thread before using allreduce from multiple threads. _get_global_comms has a check-then-create race: two threads seeing null simultaneously would both call ncclCommInitAll and one would leak its communicators.
Raises:
If the NCCL/RCCL communicator initialization fails.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!