Mojo function

async_copy_wait_group

async_copy_wait_group(n: SIMD[int32, 1])

Waits for the completion of the n most recently committed cp.async-groups.

This function blocks execution until the specified number of previously committed cp.async-groups have completed their memory transfers.

Notes:

  • Only supported on NVIDIA GPUs.
  • Maps to the cp.async.wait_group PTX instruction.
  • Provides finer-grained synchronization than waiting for every outstanding asynchronous transfer.
  • Can be used to implement a pipeline of asynchronous transfers.

Args:

  • n (SIMD[int32, 1]): The number of pending cp.async-groups to wait for. Must be > 0.
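
To illustrate the pipelining note above, here is a minimal host-side sketch in Python. It is not GPU code and does not call the Mojo API. It is a toy model of the per-thread group bookkeeping, assuming the function follows the semantics of the PTX instruction it maps to, where `cp.async.wait_group N` returns once at most N of the most recently committed groups are still pending. The names `CopyGroupModel` and `double_buffered` are hypothetical and exist only for this illustration.

```python
from collections import deque


class CopyGroupModel:
    """Toy model of cp.async group bookkeeping for one thread (hypothetical)."""

    def __init__(self):
        self.open_group = []    # copies issued since the last commit
        self.pending = deque()  # committed groups, oldest first
        self.done = []          # groups guaranteed complete after a wait

    def async_copy(self, label):
        # Issue a copy; it joins the group that the next commit will close.
        self.open_group.append(label)

    def commit_group(self):
        # Close the current group; an empty group is allowed.
        self.pending.append(tuple(self.open_group))
        self.open_group = []

    def wait_group(self, n):
        # Assumed PTX semantics: return once at most n groups are pending.
        # Real hardware may finish groups earlier; the model only records
        # what the wait guarantees.
        while len(self.pending) > n:
            self.done.append(self.pending.popleft())


def double_buffered(num_tiles):
    """Prefetch tile i+1 while tile i is consumed, using wait_group(1)."""
    model = CopyGroupModel()
    consumed = []
    model.async_copy("tile0")  # prologue: start the first transfer
    model.commit_group()
    for i in range(num_tiles):
        if i + 1 < num_tiles:
            model.async_copy(f"tile{i + 1}")  # prefetch the next tile
        model.commit_group()   # empty on the last iteration
        model.wait_group(1)    # tile i is complete; the newest group may not be
        assert (f"tile{i}",) in model.done
        consumed.append(f"tile{i}")
    return consumed, list(model.pending)


consumed, still_pending = double_buffered(3)
print(consumed, still_pending)
```

Committing a group on every iteration, even when it is empty, keeps the argument to the wait call fixed at 1 for the whole loop. In this model the only group left pending at the end is the final empty one.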