Mojo function

output_gmem_to_smem_STMatrix

output_gmem_to_smem_STMatrix[
    output_type: DType,
    accum_type: DType,
    num_m_mmas: Int,
    o_frag_size: Int,
    //,
    BM: Int,
    padded_depth: Int,
    swizzle: Swizzle,
    num_consumer: Int
](
    warp_group_thread_idx: UInt32,
    local_warp_group_idx: UInt32,
    output_reg_tile: LayoutTensor[accum_type, Layout.row_major(num_m_mmas, o_frag_size), MutableAnyOrigin, address_space=AddressSpace(5)],
    accum_smem_tile: LayoutTensor[output_type, Layout.row_major(BM, padded_depth), MutableAnyOrigin, address_space=AddressSpace(3)]
)
