For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).
Mojo function
mla_prefill_plan_kernel
def mla_prefill_plan_kernel[BufferRowOffsetsLayoutType: TensorLayout, CacheOffsetsLayoutType: TensorLayout, BufferLengthsLayoutType: TensorLayout, InputRowOffsetsLayoutType: TensorLayout, cache_t: KVCacheT](buffer_row_offsets: TileTensor[DType.uint32, BufferRowOffsetsLayoutType, MutUntrackedOrigin], cache_offsets: TileTensor[DType.uint32, CacheOffsetsLayoutType, MutUntrackedOrigin], buffer_lengths: TileTensor[DType.int32, BufferLengthsLayoutType, MutUntrackedOrigin], input_row_offsets: TileTensor[DType.uint32, InputRowOffsetsLayoutType, ImmutUntrackedOrigin], k_cache: cache_t, buffer_token_size: UInt32)
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!