Skip to main content

Mojo module

mxfp4_matmul_sm90

MXFP4 matmul on H100 (SM90) via dequant-to-FP8 + FP8 GEMM.

Dequantizes MXFP4 weights to FP8, then uses the SM90 warp-specialized FP8 GEMM. Activations (BF16) are cast to FP8 on-the-fly.

Functions

Was this page helpful?