Skip to main content
Log in

Mojo function

get_mma_shape

get_mma_shape[input_type: DType, accum_type: DType, shape_id: Int = 0]() -> Index[3]

Returns the appropriate matrix multiply-accumulate (MMA) shape for tensor core operations.

Selects the optimal MMA shape based on the GPU architecture, input data type, accumulation data type, and optional shape identifier. This function handles different configurations for both NVIDIA and AMD GPUs.

Parameters:

  • input_type (DType): The data type of the input matrices (A and B).
  • accum_type (DType): The data type used for accumulation (C and D).
  • shape_id (Int): Optional identifier to select between multiple valid shapes (default: 0).

Returns:

An IndexList[3] containing the MMA dimensions in the format [M, N, K], where M×N is the output matrix size and K is the reduction dimension.