Mojo function
async_copy_wait_all
async_copy_wait_all()
Waits for completion of all committed cp.async-groups.
This function blocks execution until all previously committed cp.async-groups have completed their memory transfers. It provides a barrier to ensure all asynchronous copies are finished.
Notes:
- Only supported on NVIDIA GPUs.
- Maps to the cp.async.wait.all PTX instruction.
- Ensures all outstanding asynchronous transfers are complete.
- More coarse-grained than
async_copy_wait_group()
.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!