Mojo function
group_norm_gpu
group_norm_gpu[dtype: DType, rank: Int, //, input_fn: def[width: Int, rank: Int](IndexList[rank]) capturing -> SIMD[dtype, width], gamma_fn: def[width: Int](IndexList[1]) capturing -> SIMD[dtype, width], beta_fn: def[width: Int](IndexList[1]) capturing -> SIMD[dtype, width]](shape: IndexList[rank, element_type=shape.element_type], epsilon: Scalar[dtype], output: TileTensor[dtype, output.LayoutType, output.origin, address_space=output.address_space, linear_idx_type=output.linear_idx_type, element_size=output.element_size], num_groups: Int, ctx: DeviceContext)
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!