For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).

Mojo function

rms_norm_gpu_warp_tiling

def rms_norm_gpu_warp_tiling[mut: Bool, LayoutType: TensorLayout, origin: Origin[mut=mut], dtype: DType, rank: Int, //, simd_width: Int, max_warps_per_block: Int, chunks_per_thread: Int, exact_fit: Bool, input_fn: def[width: Int, rank: Int](IndexList[rank]) capturing -> SIMD[dtype, width], output_fn: def[width: SIMDSize, alignment: Int](IndexList[rank], SIMD[dtype, width]) capturing -> None, multiply_before_cast: Bool](shape: IndexList[rank], gamma: TileTensor[dtype, LayoutType, origin], epsilon: Scalar[dtype], weight_offset: Scalar[dtype], num_cols: Int)