Skip to main content

Mojo function

broadcast_pull_1stage_kernel

broadcast_pull_1stage_kernel[dtype: DType, layout: TensorLayout, BLOCK_SIZE: Int, ngpus: Int, simd_width: Int = simd_width_of[dtype, get_gpu_target()](), pdl_level: PDLLevel = PDLLevel()](output: TileTensor[dtype, layout, MutAnyOrigin], input: TileTensor[dtype, layout, ImmutAnyOrigin], rank_sigs: InlineArray[UnsafePointer[Signal, MutAnyOrigin], 8], my_rank: Int)

Was this page helpful?