Skip to main content

Mojo module

grouped_matmul_block_scaled_dispatch

General dispatch for grouped block-scaled matmul.

Routes to format-specific grouped matmul implementations based on the input dtype and target GPU architecture. Currently supports NVFP4 on SM100; MXFP4 and MXFP8 will be added in subsequent commits.

Functions

Was this page helpful?