IMPORTANT: To view this page as Markdown, append `.md` to the URL (e.g. /max/get-started.md). For the complete documentation index, see llms.txt.
Skip to main content
For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).

Mojo module

mxfp4_matmul_sm90

MXFP4 matmul on H100 (SM90) via dequant-to-FP8 + FP8 GEMM.

Dequantizes MXFP4 weights to FP8, then uses the SM90 warp-specialized FP8 GEMM. Activations (BF16) are cast to FP8 on-the-fly.

Functions