For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).
Mojo function
fp8_index_kernel
def fp8_index_kernel[dtype: DType, OutputLT: TensorLayout, QLT: TensorLayout, QSLT: TensorLayout, k_operand_type: MHAOperand, ks_operand_type: MHAOperand, block_tile_shape: InlineArray[Int, Int(2)], VLLT: TensorLayout, num_heads: Int, depth: Int, _is_cache_length_accurate: Bool = False](output_tt: TileTensor[DType.float32, OutputLT, MutAnyOrigin], q_tt: TileTensor[dtype, QLT, ImmutAnyOrigin], q_s_tt: TileTensor[DType.float32, QSLT, MutAnyOrigin], k_operand: k_operand_type, ks_operand: ks_operand_type, valid_length_tt: TileTensor[DType.uint32, VLLT, ImmutAnyOrigin])
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!