Mojo module
fp8_utils
Shared FP8 quantization utilities.
Provides common functions for FP8 scale computation and quantization used across fused normalization kernels and standalone quantization kernels.
Functions
- compute_dynamic_fp8_scale: Compute dynamic FP8 scale factor and its reciprocal from a row max.
- fp8_quantize: Quantize values to FP8, optionally clamping to the representable range.
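To make the two steps concrete, here is a minimal conceptual sketch in plain Python. It is not the Mojo API: this page does not show the real signatures, so the `_sketch` function names, their parameters, the `eps` guard, and the convention `scale = row_max / fp8_max` are all assumptions for illustration. It also assumes the E4M3 format, whose largest finite value is 448; the module may target a different FP8 variant.

```python
import math

# Assumption: E4M3 (4 exponent bits, 3 mantissa bits), largest finite value 448.
FP8_E4M3_MAX = 448.0


def compute_dynamic_fp8_scale_sketch(row_max, fp8_max=FP8_E4M3_MAX, eps=1e-12):
    """Return (scale, inv_scale) so that row_max * inv_scale lands on fp8_max.

    Hypothetical convention: scale maps FP8 values back to the original range.
    eps keeps an all-zero row from producing a division by zero.
    """
    scale = max(abs(row_max), eps) / fp8_max
    return scale, 1.0 / scale


def round_to_e4m3_sketch(v):
    """Round an in-range value to the nearest E4M3-representable value."""
    if v == 0.0:
        return 0.0
    a = abs(v)
    _, e = math.frexp(a)        # a = m * 2**e with 0.5 <= m < 1
    exp = max(e - 1, -6)        # below 2**-6 the format is subnormal
    step = 2.0 ** (exp - 3)     # 3 mantissa bits give 8 steps per binade
    # round() on a float rounds halves to even, like typical hardware conversion.
    return math.copysign(round(a / step) * step, v)


def fp8_quantize_sketch(x, inv_scale, clamp=True, fp8_max=FP8_E4M3_MAX):
    """Scale x into FP8 range, optionally clamp, then round to the FP8 grid."""
    y = x * inv_scale
    if clamp:
        y = max(-fp8_max, min(fp8_max, y))
    return round_to_e4m3_sketch(y)


row = [0.5, -2.0, 3.5]
scale, inv_scale = compute_dynamic_fp8_scale_sketch(max(abs(v) for v in row))
quantized = [fp8_quantize_sketch(v, inv_scale) for v in row]
dequantized = [q * scale for q in quantized]
```

With a dynamic scale derived from the row's own maximum, scaled values should already fall within the representable range, which is one plausible reason clamping is optional. With a static or externally supplied scale, clamping guards against out-of-range inputs.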