Mojo struct
MHAAttentionConfigRDNA
struct MHAAttentionConfigRDNA[token_gen: Bool, config: MHAConfig[config.dtype], group: Int]
RDNA-specific attention configuration for Wave32 WMMA.
This config always uses:
- MMA shape: 16x16x16 (only shape supported by RDNA WMMA)
- k_group_size: 1
- shared_kv: False (P reuses K's shared memory)
- full_kv: False
- depth_padded: True
- double_buffer: False
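The fixed settings listed above can be pictured as compile-time constants on a type. The following is a minimal sketch, not the library's definition: `RDNAFixedSettingsSketch` and its `mma_m`, `mma_n`, `mma_k`, and `k_group_size` members are hypothetical names introduced only for illustration, and the M, N, K labeling of the 16x16x16 shape is an assumption. It assumes a Mojo release that accepts the `comptime` member syntax shown on this page.

```mojo
# Hypothetical stand-in for illustration; this is NOT MHAAttentionConfigRDNA.
struct RDNAFixedSettingsSketch:
    # MMA shape fixed at 16x16x16 (M, N, K labeling is an assumption).
    comptime mma_m = 16
    comptime mma_n = 16
    comptime mma_k = 16
    # Remaining fixed settings, mirroring the list above.
    comptime k_group_size = 1
    comptime shared_kv = False
    comptime full_kv = False
    comptime depth_padded = True
    comptime double_buffer = False


def main():
    # The constants are read from the type itself; no instance is constructed.
    print(RDNAFixedSettingsSketch.mma_m, RDNAFixedSettingsSketch.k_group_size)
    print(RDNAFixedSettingsSketch.depth_padded, RDNAFixedSettingsSketch.double_buffer)
```

Because the values are compile-time members, code that branches on them (for example on `double_buffer`) can be resolved at compile time rather than at run time.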
Implemented traits
AnyType,
AttentionConfig,
Copyable,
ImplicitlyCopyable,
ImplicitlyDestructible,
Movable
comptime members
depth_padded
comptime depth_padded = True
double_buffer
comptime double_buffer = False
full_kv
comptime full_kv = False
shared_kv
comptime shared_kv = False
Methods
q_head_idx
q_tile_idx
kv_head_idx
get_mma_shape
get_q_offset
get_output_offset