Mojo module
mha_fa3_utils
comptime values
KVTMATile
comptime KVTMATile[dtype: DType, swizzle_mode: TensorMapSwizzle, *, BN: Int, depth: Int, BK: Int = depth] = TMATensorTile[dtype, _split_last_layout[dtype](IndexList[3, DType.int64](BN, 1, BK, Tuple[]()), swizzle_mode, True), _ragged_desc_layout[dtype](IndexList[3, DType.int64](BN, 1, BK, Tuple[]()), swizzle_mode)]
Parameters
QTMATile
comptime QTMATile[dtype: DType, swizzle_mode: TensorMapSwizzle, *, BM: Int, depth: Int, group: Int, decoding: Bool] = TMATensorTile[dtype, _split_last_layout[dtype](q_smem_shape[dtype, swizzle_mode, BM=BM, group=group, depth=depth, decoding=decoding](), swizzle_mode, True), _ragged_desc_layout[dtype](q_smem_shape[dtype, swizzle_mode, BM=BM, group=group, depth=depth, decoding=decoding](), swizzle_mode)]
Parameters
Structs
-
MHAPosition: Position of the MHA-kernel. Whendecoding=False,q_head_stride == q_num_heads. Whendecoding=True,q_head_stride == 1. -
NonNullPointer: -
NullPointer: -
Pack: -
PositionSummary:
Traits
Functions
-
get_q_head_idx: -
get_seq_info: -
kv_coord: -
output_reg_to_smem: -
output_reg_to_smem_st_matrix: -
produce: -
q_coord: Returns the coordinates for a tma load on theQmatrix. This load can be 3D, 4D, or 5D. -
q_gmem_shape: -
q_smem_shape: -
q_tma:
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!