Mojo function
output_reg_to_smem
output_reg_to_smem[output_type: DType, accum_type: DType, num_m_mmas: Int, o_frag_size: Int, //, BM: Int, BN: Int, padded_depth: Int, swizzle: Swizzle, num_consumer: Int](tid: UInt32, local_warp_group_idx: UInt32, warp_y: UInt32, q_smem: UnsafePointer[Scalar[output_type], address_space=AddressSpace(3)], output_reg_tile: LayoutTensor[accum_type, Layout.row_major(num_m_mmas, o_frag_size), MutableAnyOrigin, address_space=AddressSpace(5)]) -> LayoutTensor[output_type, Layout.row_major(BM, padded_depth), MutableAnyOrigin, address_space=AddressSpace(3)]
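Returns:
LayoutTensor[output_type, Layout.row_major(BM, padded_depth), MutableAnyOrigin, address_space=AddressSpace(3)]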