For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).

Mojo struct

MMAConfig

struct MMAConfig[InType: DType, OutType: DType, mma_shape: IndexList[3], transpose_b: Bool = True]

Implemented traits

AnyType, Copyable, ImplicitlyCopyable, ImplicitlyDeletable, Movable, RegisterPassable, TrivialRegisterPassable

`comptime` members

`k_group_size_a`

comptime k_group_size_a = (MMAConfig[InType, OutType, mma_shape, transpose_b].simd_width // MMAConfig[InType, OutType, mma_shape, transpose_b].registers_per_thread_a)

`k_group_size_b`

comptime k_group_size_b = (MMAConfig[InType, OutType, mma_shape, transpose_b].simd_width // MMAConfig[InType, OutType, mma_shape, transpose_b].registers_per_thread_b)

`mma`

comptime mma = TensorCore()

`registers_per_thread_a`

comptime registers_per_thread_a = num_matrix_reg[mma_shape[0], mma_shape[2]]()

`registers_per_thread_b`

comptime registers_per_thread_b = num_matrix_reg[mma_shape[1], mma_shape[2]]()

`simd_width`

comptime simd_width = simd_width_of[InType]()

Methods

`adjusted_mma_k_shape_a`

static def adjusted_mma_k_shape_a() -> Int

Returns:

Int

`adjusted_mma_k_shape_b`

static def adjusted_mma_k_shape_b() -> Int

Returns:

Int

Implemented traits​

comptime members​

k_group_size_a​

k_group_size_b​

mma​

registers_per_thread_a​

registers_per_thread_b​

simd_width​

Methods​

adjusted_mma_k_shape_a​

adjusted_mma_k_shape_b​