Mojo function
ds_read_tr16_b64
ds_read_tr16_b64[dtype: DType, //](shared_ptr: UnsafePointer[Scalar[dtype], address_space=AddressSpace(3), alignment=alignment, mut=mut, origin=origin]) -> SIMD[dtype, 4]
Reads a 64-bit LDS transpose block using TR16 layout and returns SIMD[dtype, 4] of 16-bit types.
Notes:
- Only supported on AMD GPUs.
- Maps directly to llvm.amdgcn.ds.read.tr16.b64 intrinsic.
- Result width is fixed to 4 elements of dtype.
Args:
- shared_ptr (
UnsafePointer
): Pointer to the LDS transpose block.
Returns:
SIMD
: SIMD[dtype, 4] of 16-bit types.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!