Skip to main content

Mojo module

reducescatter

Multi-GPU reducescatter implementation for distributed tensor reduction across GPUs.

comptime values​

elementwise_epilogue_type​

comptime elementwise_epilogue_type = def[dtype: DType, width: Int, *, alignment: Int, ?, .element_types.values: KGENParamList[CoordLike], .element_types`1: TypeList[values]](Coord[element_types], SIMD[dtype, width]) capturing -> None ``

Structs​

Functions​