Mojo function

reduce

reduce[shuffle: fn[DType, Int](val: SIMD[$0, $1], offset: SIMD[uint32, 1]) -> SIMD[$0, $1], func: fn[DType, Int](SIMD[$0, $1], SIMD[$0, $1]) capturing -> SIMD[$0, $1], val_type: DType, simd_width: Int](val: SIMD[val_type, simd_width]) -> SIMD[val_type, simd_width]

Takes in an input function to computes warp shuffle based reduction operation.