Mojo function
quantize_static_scaled_fp8
quantize_static_scaled_fp8[out_dtype: DType, in_dtype: DType, scale_is_inverted: Bool = True](out_tensor: TileTensor[out_dtype, out_tensor.LayoutType, out_tensor.origin, address_space=out_tensor.address_space, linear_idx_type=out_tensor.linear_idx_type, element_size=out_tensor.element_size], in_tensor: TileTensor[in_dtype, in_tensor.LayoutType, in_tensor.origin, address_space=in_tensor.address_space, linear_idx_type=in_tensor.linear_idx_type, element_size=in_tensor.element_size], scale: Float32, context: DeviceContext)
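TileTensor implementation of static scaled FP8 quantization. Quantizes the elements of in_tensor (of type in_dtype) into out_tensor (of type out_dtype) using a single caller-supplied Float32 scale.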
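
The following is a minimal Python sketch of the scale-and-saturate step that static scaled quantization generally involves. It is not the kernel's implementation. The helper name `scale_and_clamp`, the constant `FP8_E4M3_MAX`, and the reading of `scale_is_inverted` (True is taken to mean the caller already passes the reciprocal scale, so values are multiplied rather than divided) are all assumptions made for illustration. The final rounding to the FP8 grid is not modelled.

```python
# Assumed largest finite value of an E4M3-style FP8 format; the kernel's
# actual out_dtype may have a different range.
FP8_E4M3_MAX = 448.0


def scale_and_clamp(values, scale, scale_is_inverted=True, fp8_max=FP8_E4M3_MAX):
    """Apply one static scale to every value, then saturate to the FP8 range.

    Assumption: when scale_is_inverted is True, `scale` is already the
    reciprocal factor and is multiplied in; otherwise the value is divided
    by `scale`. Rounding to representable FP8 values is left out.
    """
    factor = scale if scale_is_inverted else 1.0 / scale
    return [max(-fp8_max, min(fp8_max, v * factor)) for v in values]


if __name__ == "__main__":
    print(scale_and_clamp([1.0, -2.0, 1000.0], 0.5))
```

Because the scale is static, it is fixed before the call rather than derived from the input data, which is why a single Float32 argument is enough.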