Skip to main content

Mojo function

quantize_dynamic_block_scaled

quantize_dynamic_block_scaled[out_dtype: DType, scales_dtype: DType, in_dtype: DType, //, *, SF_VECTOR_SIZE: Int, target: StringSlice[StaticConstantOrigin] = "cpu"](output_device: NDBuffer[out_dtype, 2, MutAnyOrigin, output_device.shape], scales_device: NDBuffer[scales_dtype, 5, MutAnyOrigin, scales_device.shape], input_device: NDBuffer[in_dtype, 2, MutAnyOrigin, input_device.shape], tensor_sf: Float32, ctx: DeviceContext)

Was this page helpful?