Mojo struct

MatmulConfig

@register_passable(trivial) struct MatmulConfig[a_type: DType, b_type: DType, c_type: DType, transpose_b: Bool = True]

Static configuration of GPU matmul.

Fields

cta_group (Int):
mma_shape (IndexList[3]):
cluster_shape (IndexList[3]):
AB_swapped (Bool):
block_swizzle_size (Int):
raster_order (RasterOrder):
block_tile_shape (IndexList[3]):
num_split_k (Int):
num_pipeline_stages (UInt):
num_clc_pipeline_stages (UInt):
num_accum_pipeline_stages (UInt):
num_output_stages (UInt):
output_tile_shape (IndexList[2]):
a_swizzle (TensorMapSwizzle):
b_swizzle (TensorMapSwizzle):
c_swizzle (TensorMapSwizzle):
k_group_size (UInt):

Implemented traits

AnyType, Copyable, Equatable, Hashable, ImplicitlyCopyable, ImplicitlyDestructible, Movable, Stringable, Writable

`comptime` members

`copyinitis_trivial`

comptime __copyinit__is_trivial = True

`delis_trivial`

comptime __del__is_trivial = True

`moveinitis_trivial`

comptime __moveinit__is_trivial = True

`accum_type`

comptime accum_type = get_accum_type[a_type]()

Methods

`init`

__init__(*, cta_group: Int = 2, mma_shape: IndexList[3] = get_mma_shape[a_type, MatmulConfig[a_type, b_type, c_type, transpose_b].accum_type](), cluster_shape: IndexList[3] = Index(2, 1, 1), AB_swapped: Bool = False, num_split_k: Int = 1, block_swizzle_size: Int = 0, raster_order: RasterOrder = RasterOrder.AlongM, k_group_size: UInt = 1, num_pipeline_stages: Optional[UInt] = None, num_accum_pipeline_stages: UInt = 2, num_clc_pipeline_stages: UInt = 2) -> Self

`eq`

__eq__(self, other: Self) -> Bool

Returns:

Bool

`swap_AB_type`

swap_AB_type(self) -> MatmulConfig[b_type, a_type, c_type, transpose_b]

Returns:

MatmulConfig

`str`

__str__(self) -> String

Returns:

String

`write_to`

write_to(self, mut writer: T)

`repr`

__repr__(self) -> String

Returns:

String

`hash`

__hash__[H: Hasher](self, mut hasher: H)

Updates hasher with the underlying bytes.

Parameters:

H (Hasher): The hasher type.

Args:

hasher (H): The hasher instance.

Fields​

Implemented traits​

comptime members​

__copyinit__is_trivial​

__del__is_trivial​

__moveinit__is_trivial​

accum_type​

Methods​

__init__​

__eq__​

swap_AB_type​

__str__​

write_to​

__repr__​

__hash__​

Fields

Implemented traits

`comptime` members

`copyinitis_trivial`

`delis_trivial`

`moveinitis_trivial`

`accum_type`

Methods

`init`

`eq`

`swap_AB_type`

`str`

`write_to`

`repr`

`hash`