For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).
Mojo module
mha_decode_partition_heuristic
Functionsβ
- β
cuda_mha_decoding_max_num_partitions: - β
cuda_mha_decoding_num_partitions: - β
hip_mha_decoding_num_partitions: Wave-aligned split-K target for MI355X MHA + MLA decode. - β
mha_decoding_max_num_partitions: - β
mha_decoding_num_partitions:
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!