
Mojo module

grouped_matmul_block_scaled_dispatch

General dispatch for grouped block-scaled matmul.

Routes to format-specific grouped matmul implementations based on the input dtype and target GPU architecture. Currently supports NVFP4, MXFP4, and MXFP8 on SM100.
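The routing described above can be pictured as a table lookup keyed on the target architecture and the input dtypes. The following Python sketch is illustrative only and is not the module's Mojo API: the names `BlockScaledFormat` and `select_format`, the dtype strings, and the choice to key on the scale dtype as well as the element dtype are all assumptions made for this example. The block sizes and dtypes follow the commonly published definitions of the three formats (NVFP4: E2M1 elements with E4M3 scales per 16 elements; MXFP4 and MXFP8: E8M0 scales per 32 elements).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BlockScaledFormat:
    """Hypothetical descriptor for a block-scaled format (not a Mojo type)."""
    name: str
    element_dtype: str  # dtype of the quantized matrix elements
    scale_dtype: str    # dtype of the per-block scale factors
    block_size: int     # number of elements sharing one scale factor


# Parameters as commonly published for each format; MXFP8 is shown with
# E4M3 elements only, which is a simplification for this sketch.
NVFP4 = BlockScaledFormat("NVFP4", "float4_e2m1", "float8_e4m3", 16)
MXFP4 = BlockScaledFormat("MXFP4", "float4_e2m1", "float8_e8m0", 32)
MXFP8 = BlockScaledFormat("MXFP8", "float8_e4m3", "float8_e8m0", 32)

# Hypothetical dispatch table: (arch, element dtype, scale dtype) -> format.
# NVFP4 and MXFP4 share an element dtype, so this sketch assumes the scale
# dtype is what tells them apart.
_DISPATCH = {
    ("sm100", fmt.element_dtype, fmt.scale_dtype): fmt
    for fmt in (NVFP4, MXFP4, MXFP8)
}


def select_format(arch: str, element_dtype: str, scale_dtype: str) -> BlockScaledFormat:
    """Return the format for this combination, or raise if unsupported."""
    key = (arch.lower(), element_dtype, scale_dtype)
    try:
        return _DISPATCH[key]
    except KeyError:
        raise NotImplementedError(
            f"no grouped block-scaled matmul for {key}"
        ) from None


if __name__ == "__main__":
    fmt = select_format("sm100", "float4_e2m1", "float8_e4m3")
    print(fmt.name, fmt.block_size)
```

In this sketch an unsupported combination (for example, any architecture other than SM100) raises `NotImplementedError` rather than silently falling back to another kernel. How the actual module reports unsupported inputs is not stated on this page.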

Functions
