For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).
Mojo struct
AMDStructuredConfig
struct AMDStructuredConfig[config: MHAConfig[config.dtype], group: Int, token_gen: Bool = False, mla_mode: Bool = False]
Implemented traitsβ
AnyType,
Copyable,
ImplicitlyCopyable,
ImplicitlyDeletable,
Movable
comptime membersβ
depth_paddedβ
comptime depth_padded = False
double_bufferβ
comptime double_buffer = not token_gen
double_buffer_k_onlyβ
comptime double_buffer_k_only = token_gen and (config.block_n() <= Int(64))
full_kvβ
comptime full_kv = True
shared_kvβ
comptime shared_kv = token_gen and (config > Int(256))
Methodsβ
heads_per_tileβ
q_head_idxβ
q_tile_idxβ
kv_head_idxβ
get_mma_shapeβ
get_q_offsetβ
get_output_offsetβ
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!