Mojo function
store_matrix_d
store_matrix_d[dtype: DType, //, m: Int, n: Int, k: Int](d_ptr: UnsafePointer[SIMD[dtype, 1]], d: SIMD[dtype, 4], tile_row: Int, tile_col: Int, ldm: Int)
Stores matrix D tile from registers to memory in specific order after performing tensor core based warp sync mma op.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!