IMPORTANT: To view this page as Markdown, append `.md` to the URL (e.g. /max/get-started.md). For the complete documentation index, see llms.txt.
Skip to main content
For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).

Mojo module

mha_decode

RDNA Wave32 MHA decode kernel.

Same recipe as prefill, plus split-K partitioning of the KV span across blocks for grid-level parallelism.