Mojo function
async_copy_wait_all
async_copy_wait_all()
Waits for completion of all committed cp.async-groups.
This function blocks execution until all previously committed cp.async-groups have completed their memory transfers. It provides a barrier to ensure all asynchronous copies are finished.
Note:
- Only supported on NVIDIA GPUs
- Maps to the cp.async.wait.all PTX instruction
- Ensures all outstanding asynchronous transfers are complete
- More coarse-grained than async_copy_wait_group()
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!