Mojo function
quantize_dynamic_scaled_fp4_async
quantize_dynamic_scaled_fp4_async[input_dtype: DType, output_dtype: DType, scales_dtype: DType, input_layout: Layout, output_layout: Layout, scales_layout: Layout, //, SF_VECTOR_SIZE: Int](ctx: DeviceContext, output_tensor: LayoutTensor[output_dtype, output_layout, MutAnyOrigin], scales_tensor: LayoutTensor[scales_dtype, scales_layout, MutAnyOrigin], input_tensor: LayoutTensor[input_dtype, input_layout, MutAnyOrigin], tensor_sf: Float32 = 1)
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!