Mojo function
async_copy_wait_group
async_copy_wait_group(n: SIMD[int32, 1])
Waits for the completion of n
most recently committed cp.async-groups.
This function blocks execution until the specified number of previously committed cp.async-groups have completed their memory transfers.
Note: - Only supported on NVIDIA GPUs - Maps to the cp.async.wait.group PTX instruction - Provides fine-grained control over asynchronous transfer synchronization - Can be used to implement a pipeline of asynchronous transfers
Args:
- n (
SIMD[int32, 1]
): The number of pending cp.async-groups to wait for. Must be > 0.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!