Mojo module

reducescatter

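Multi-GPU reduce-scatter implementation for distributed tensor reduction: the inputs held by each GPU are reduced across GPUs, and each GPU receives one partition of the reduced result.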
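To make the reduce-scatter semantics concrete, here is a minimal reference sketch in plain Python rather than Mojo. It is not this module's API: the function name `reduce_scatter_reference`, the nested-list matrix representation, the choice of sum as the reduction, and the requirement that the scatter axis divide evenly among devices are all assumptions made for illustration. The actual partitioning rules are defined by `ReduceScatterConfig`.

```python
from typing import List

Matrix = List[List[float]]


def reduce_scatter_reference(inputs: List[Matrix], axis: int = 0) -> List[Matrix]:
    """Hypothetical reference semantics, not the Mojo API.

    Sums the per-device 2-D inputs elementwise, then gives device i the
    i-th equal slice of that sum along `axis` (0 = rows, 1 = columns).
    """
    if axis not in (0, 1):
        raise ValueError("axis must be 0 or 1 for a 2-D input")

    num_devices = len(inputs)
    rows, cols = len(inputs[0]), len(inputs[0][0])

    # Step 1 (reduce): elementwise sum over all devices' inputs.
    total = [
        [sum(m[r][c] for m in inputs) for c in range(cols)]
        for r in range(rows)
    ]

    # Assumption: the scatter axis splits evenly among devices.
    extent = rows if axis == 0 else cols
    if extent % num_devices != 0:
        raise ValueError("scatter axis must divide evenly among devices")
    chunk = extent // num_devices

    # Step 2 (scatter): device `dev` keeps only its slice along `axis`.
    outputs = []
    for dev in range(num_devices):
        lo, hi = dev * chunk, (dev + 1) * chunk
        if axis == 0:
            outputs.append([row[:] for row in total[lo:hi]])
        else:
            outputs.append([row[lo:hi] for row in total])
    return outputs


# Two simulated devices, each holding a 2x2 input.
device_inputs = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[10.0, 20.0], [30.0, 40.0]],
]
by_rows = reduce_scatter_reference(device_inputs, axis=0)
by_cols = reduce_scatter_reference(device_inputs, axis=1)
```

With `axis=0` each simulated device ends up with one row of the summed matrix; with `axis=1` each ends up with one column. A real multi-GPU implementation would avoid materializing the full sum on any single device, which is the main reason to use reduce-scatter instead of an all-reduce followed by a local slice.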

`comptime` values

`elementwise_epilogue_type`

`comptime elementwise_epilogue_type = def[dtype: DType, width: SIMDSize, *, alignment: Int](Coord[_], SIMD[dtype, width]) capturing -> None`

Structs

ReduceScatterConfig: Configuration for axis-aware reduce-scatter partitioning.

Functions

reducescatter: Per-device reducescatter operation with axis-aware scatter.
