Mojo module
warp
This module includes intrinsics for NVIDIA GPUs shuffle instructions.
Aliases
-
FULL_MASK = ((2 ** _resolve_warp_size()) + -1)
:
Structs
Functions
-
broadcast
: -
lane_group_max
: -
lane_group_max_and_broadcast
: -
lane_group_min
: -
lane_group_reduce
: Takes in an input function to computes warp shuffle based reduction operation. -
lane_group_sum
: -
lane_group_sum_and_broadcast
: -
max
: -
min
: -
reduce
: Takes in an input function to computes warp shuffle based reduction operation. -
shuffle_down
: Copies values from other lanes in the warp. -
shuffle_idx
: Copies a value from a source lane to other lanes in a warp. -
shuffle_up
: Copies values from other lanes in the warp. -
shuffle_xor
: Copies values from between lanes (butterfly pattern). -
sum
:
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!