Mojo function
block_reduce_dual_sum
block_reduce_dual_sum[dtype: DType, max_warps_per_block: Int](val0: Scalar[dtype], val1: Scalar[dtype]) -> Tuple[Scalar[dtype], Scalar[dtype]]
Combined block reduction for two sums using only 2 barriers.
Returns:
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!