Mojo module

mxfp4_dequant

MXFP4 dequantization kernel for H100 (SM90).

Converts packed MXFP4 weights (uint8, 2 FP4 values per byte) with E8M0 block scales into float8_e4m3fn or bfloat16.

Scales are stored in a 2D layout [N, K/SF_VECTOR_SIZE], where each scale covers SF_VECTOR_SIZE (32) consecutive elements along K.
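
To make the data format concrete, the following is a minimal CPU reference sketch in plain Python, not the module's GPU kernel. It relies on several assumptions not stated on this page: the low nibble of each byte holds the even-indexed element, FP4 values use the standard E2M1 encoding, and an E8M0 scale byte `e` decodes to 2^(e - 127), with 0xFF reserved for NaN. The function names `decode_e2m1`, `decode_e8m0`, and `dequant_row` are hypothetical and do not come from the module. The final narrowing to float8_e4m3fn or bfloat16 is omitted; the sketch returns ordinary Python floats.

```python
import math

SF_VECTOR_SIZE = 32  # elements covered by one E8M0 scale, per the text above

# FP4 E2M1 magnitudes indexed by the low 3 bits of a nibble; bit 3 is the sign.
E2M1_MAGNITUDES = (0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0)


def decode_e2m1(nibble):
    """Decode one 4-bit E2M1 code to a float."""
    magnitude = E2M1_MAGNITUDES[nibble & 0x7]
    return -magnitude if nibble & 0x8 else magnitude


def decode_e8m0(byte):
    """Decode an E8M0 scale: a biased power-of-two exponent (assumed bias 127)."""
    if byte == 0xFF:
        return math.nan  # assumed NaN encoding
    return 2.0 ** (byte - 127)


def dequant_row(packed, scales):
    """Dequantize one row of K/2 packed bytes using K/SF_VECTOR_SIZE scales.

    Assumes the low nibble of each byte is the even-indexed element.
    """
    out = []
    for i, byte in enumerate(packed):
        # Elements 2*i and 2*i + 1 always fall in the same 32-element block.
        scale = decode_e8m0(scales[(2 * i) // SF_VECTOR_SIZE])
        out.append(decode_e2m1(byte & 0x0F) * scale)
        out.append(decode_e2m1(byte >> 4) * scale)
    return out
```

For a full [N, K/2] weight matrix, `dequant_row` would be applied once per row with the matching row of the [N, K/SF_VECTOR_SIZE] scale array.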

Functions
