Mojo function
async_copy_wait_group
async_copy_wait_group(n: SIMD[int32, 1])
Waits for the completion of n
most recently committed cp.async-groups.
This function blocks execution until the specified number of previously committed cp.async-groups have completed their memory transfers.
Notes:
- Only supported on NVIDIA GPUs.
- Maps to the cp.async.wait.group PTX instruction.
- Provides fine-grained control over asynchronous transfer synchronization.
- Can be used to implement a pipeline of asynchronous transfers.
Args:
- n (
SIMD[int32, 1]
): The number of pending cp.async-groups to wait for. Must be > 0.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!