IMPORTANT: To view this page as Markdown, append `.md` to the URL (e.g. /max/get-started.md). For the complete documentation index, see llms.txt.
Skip to main content
For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).

Mojo module

mxfp4_dequant

MXFP4 dequantization kernel for H100 (SM90).

Converts packed MXFP4 weights (uint8, 2 FP4 values per byte) with E8M0 block scales into float8_e4m3fn or bfloat16.

Scales are in 2D layout [N, K/SF_VECTOR_SIZE] where each scale covers SF_VECTOR_SIZE (32) consecutive elements.

Functions