Mojo function
output_reg_to_smem_st_matrix
output_reg_to_smem_st_matrix[output_type: DType, accum_type: DType, num_m_mmas: Int, padded_depth: Int, o_frag_size: Int, //, BM: Int, swizzle: Swizzle, num_consumer: Int](warp_group_thread_idx: UInt32, local_warp_group_idx: UInt32, output_reg_tile: TileTensor[accum_type, Layout[ComptimeInt[num_m_mmas], ComptimeInt[o_frag_size], ComptimeInt[o_frag_size], ComptimeInt[1]], MutAnyOrigin, address_space=AddressSpace.LOCAL], accum_smem_tile: TileTensor[output_type, Layout[ComptimeInt[BM], ComptimeInt[padded_depth], ComptimeInt[padded_depth], ComptimeInt[1]], MutAnyOrigin, address_space=AddressSpace.SHARED])
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!