Mojo function
quantize_dynamic_scaled_fp4_async
quantize_dynamic_scaled_fp4_async[input_dtype: DType, output_dtype: DType, scales_dtype: DType, //, SF_VECTOR_SIZE: Int](ctx: DeviceContext, output_tensor_tile: TileTensor[output_dtype, output_tensor_tile.LayoutType, output_tensor_tile.origin, linear_idx_type=output_tensor_tile.linear_idx_type, element_size=output_tensor_tile.element_size], scales_tensor_tile: TileTensor[scales_dtype, scales_tensor_tile.LayoutType, scales_tensor_tile.origin, linear_idx_type=scales_tensor_tile.linear_idx_type, element_size=scales_tensor_tile.element_size], input_tensor_tile: TileTensor[input_dtype, input_tensor_tile.LayoutType, input_tensor_tile.origin, linear_idx_type=input_tensor_tile.linear_idx_type, element_size=input_tensor_tile.element_size], tensor_sf: Float32 = 1)
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!