Mojo function
get_accumulator_dims
get_accumulator_dims[*, c_smem_dim1: Int, block_tile_shape: IndexList[3], mma_shape: IndexList[3], cta_group: Int]() -> IndexList[2]
Compute register accumulator dimensions for blockwise FP8.
Returns (num_stages, num_elements) for the register tile shape.
Returns:
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!