Mojo function
mha_decoding_num_partitions
def mha_decoding_num_partitions(batch_size: Int, num_keys: Int, heads_per_group: Int, ctx: DeviceContext, is_mla: Bool = False) -> Int
Returns: Int