Mojo function
swapAB_largeM_clustered
swapAB_largeM_clustered[a_type: DType, b_type: DType, c_type: DType, transpose_b: Bool = True](m: Scalar[DType.uint], pdl_level: PDLLevel) -> MatmulConfig[a_type, b_type, c_type, transpose_b]
Config for m in [129, 240] range with cluster=(2,1,1).
Pattern: - BN = 72 + ((m - 129) // 16) * 8 - Stages: 12 for m<=160, 10 for m<=224, 8 otherwise - cluster = (2,1,1), k_group_size = 2, swapAB = True
Returns:
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!