Mojo module
mla_prefill_per_token_scale
Per-token-scale MLA prefill kernel.
Thin dispatch wrapper that creates scale TMA tiles and calls the generic MLA kernel's per-token-scale variant (mla_prefill_kernel_per_token_scale).
Structs
Functions
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!