Mojo module
reducescatter
Multi-GPU reducescatter implementation for distributed tensor reduction across GPUs.
comptime values
elementwise_epilogue_type
comptime elementwise_epilogue_type = def[dtype: DType, width: SIMDSize, *, alignment: Int](Coord[_], SIMD[dtype, width]) capturing -> None
Structs
- ReduceScatterConfig: Configuration for axis-aware reduce-scatter partitioning.
Functions
- reducescatter: Per-device reducescatter operation with axis-aware scatter.
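To make the reduce-scatter semantics concrete, here is a minimal conceptual sketch in plain Python. It is not this module's API: the function name `reduce_scatter`, the use of a sum reduction, the 1-D buffers, and the even split across devices are all assumptions chosen for illustration. Each "device" contributes a buffer of the same length, the buffers are reduced elementwise, and device `r` keeps only chunk `r` of the result.

```python
def reduce_scatter(inputs):
    """Simulate a sum reduce-scatter over equally sized per-device buffers.

    Hypothetical helper for illustration only; not the Mojo `reducescatter` API.
    """
    num_devices = len(inputs)
    length = len(inputs[0])
    if any(len(buf) != length for buf in inputs):
        raise ValueError("all device buffers must have the same length")
    if length % num_devices != 0:
        # Assumption: even partitioning. A real implementation may handle
        # remainders or partition along a chosen axis instead.
        raise ValueError("buffer length must divide evenly across devices")

    chunk = length // num_devices

    # Step 1 (reduce): elementwise sum across all device buffers.
    reduced = [sum(values) for values in zip(*inputs)]

    # Step 2 (scatter): device r keeps only the r-th contiguous chunk.
    return [reduced[r * chunk:(r + 1) * chunk] for r in range(num_devices)]


inputs = [
    [1, 2, 3, 4],      # buffer held by device 0
    [10, 20, 30, 40],  # buffer held by device 1
]
outputs = reduce_scatter(inputs)
print(outputs)  # [[11, 22], [33, 44]]
```

After the operation, each device holds one slice of the fully reduced result rather than the whole tensor, which is what distinguishes reduce-scatter from an all-reduce.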