Skip to main content

Mojo function

init_comms

init_comms(ngpus: Int)

Pre-initialize NCCL/RCCL communicators.

Must be called from a single thread before using allreduce from multiple threads. _get_global_comms has a check-then-create race: two threads seeing null simultaneously would both call ncclCommInitAll and one would leak its communicators.

Raises:

If the NCCL/RCCL communicator initialization fails.

Was this page helpful?