For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).
Mojo module
softmax_warp
Softmax warp group logic for FA4 (SM100 Flash Attention).
Functionsβ
- β
fa4_lse_combine_write: LSE-combine two TMEM_O fragments and TMA-store a depth-column slice. - β
fa4_scale_write_output: - β
fa4_softmax:
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!