Skip to main content

Mojo module

fp8_utils

Shared FP8 quantization utilities.

Provides common functions for FP8 scale computation and quantization used across fused normalization kernels and standalone quantization kernels.

NOTE: comm/allreduce_rmsnorm_fp8.mojo inlines copies of these functions to avoid a circular dependency (linalg depends on comm). If you change the logic here, update that copy too. See KERN-2477.

Functionsโ€‹

Was this page helpful?