Mojo function
store_fragment_to_smem
store_fragment_to_smem[swizzle: Swizzle, stageN: Int, transpose_c: Bool = False, c_swizzle: TensorMapSwizzle = TensorMapSwizzle.SWIZZLE_128B](vec: SIMD[dtype, size], dst: LayoutTensor[_dtype, layout, MutAnyOrigin, address_space=AddressSpace.SHARED, element_layout=element_layout, layout_int_type=layout_int_type, linear_idx_type=linear_idx_type, masked=masked, alignment=alignment], warp_offset: UInt32 = 0)
Store a fragment to shared memory using st.matrix.
This function provides a static interface compatible with stsm_helper, delegating to the underlying st.matrix operations.
Template Parameters: swizzle: Pre-computed swizzle pattern. stageN: Stage width in elements. transpose_c: Whether output is transposed. c_swizzle: TMA swizzle mode (for configuration).
Args:
- โvec (
SIMD): Source SIMD fragment. - โdst (
LayoutTensor): Destination shared memory tile. - โwarp_offset (
UInt32): Additional warp-based offset for transpose mode.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!