Mojo function

st_matrix

st_matrix[type: DType, //, simd_width: Int, *, transpose: Bool = False](ptr: UnsafePointer[SIMD[type, 1], address_space=AddressSpace(3)], d: SIMD[type, simd_width])

Performs a warp-synchronous copy from registers to shared memory. The data in `d` is stored to the shared-memory location `ptr` in a layout that tensor core MMA instructions can use directly.