Skip to main content

Mojo module

mha_structured

MHA prefill kernel for gfx950 with structured scheduling.

Supports depth=64, 128, 256. Uses TileTensor throughout for register and SMEM tile management, with TiledMmaOp for MMA dispatch.

Functions​

Was this page helpful?