Skip to main content
Log in

Mojo function

async_copy_wait_group

async_copy_wait_group(n: SIMD[int32, 1])

Waits for the completion of n most recently committed cp.async-groups.

This function blocks execution until the specified number of previously committed cp.async-groups have completed their memory transfers.

Note: - Only supported on NVIDIA GPUs - Maps to the cp.async.wait.group PTX instruction - Provides fine-grained control over asynchronous transfer synchronization - Can be used to implement a pipeline of asynchronous transfers

Args:

  • n (SIMD[int32, 1]): The number of pending cp.async-groups to wait for. Must be > 0.