Mojo function
global_load_dword
global_load_dword[dtype: DType, width: Int](sgpr_address: SIMD[DType.uint32, 2], vgpr_offset: UInt32) -> SIMD[dtype, width]
Loads 8 or 16 bytes from global memory to a SIMD vector. Each load is asynchronous and needs to be waited on with wait_vmcount.
Parameters:
- βdtype (
DType
): The data type of global memory. - βwidth (
Int
): The width of the SIMD vector to load.
Args:
- βsgpr_address (
SIMD
): Two consecutive SGPRs that contain the address of the global memory ptr. - βvgpr_offset (
UInt32
): The offset of the address that this thread will load from.
Returns:
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!