Mojo module
mha_mask
Aliases
MASK_VALUE
alias MASK_VALUE = -10000
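A large negative constant like this is commonly added to attention scores at masked positions so that softmax drives their weight to zero. The sketch below illustrates that idea in plain Python; it is not the Mojo kernel code. The `softmax` helper, the sample scores, and the additive use of the constant are all assumptions made for illustration.

```python
import math

MASK_VALUE = -10000  # same value as the alias above


def softmax(scores):
    """Numerically stable softmax over a list of floats (hypothetical helper)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical raw scores for one query against three keys.
scores = [2.0, 1.0, 3.0]

# Assumption: the third key is masked out by adding MASK_VALUE to its score.
masked = [scores[0], scores[1], scores[2] + MASK_VALUE]

probs = softmax(masked)
print(probs)
```

With these inputs the masked position's exponent underflows to zero in double precision, so all of the probability mass is shared between the two unmasked keys.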
Structs
- AndMask: Mask that's the AND of two masks.
- CausalMask: MHA causal mask ensures a token is only affected by previous tokens.
- ChunkedMask: Mask implementing Chunked attention.
- MaskName: A tile's masking status.
- MaterializedMask: Mask that's backed by a materialized tensor.
- NullMask: Mask that's effectively a noop.
- OrMask: Mask that's the OR of two masks.
- SlidingWindowCausalMask: Mask implementing Sliding Window attention.
- TileMaskStatus: A tile's masking status.
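To make the mask descriptions above concrete, here is a rough sketch in plain Python that models each mask as a predicate over a (query index, key index) pair and shows how AND/OR composition might work. None of these function names come from the `mha_mask` module, and the exact window and chunk semantics used here (window counts the current token, chunks are fixed-size and aligned at index 0) are assumptions, since this page does not specify them. In the printed grids, 1 means "may attend" and 0 means "masked".

```python
def causal(q, k):
    """A query may attend to itself and to earlier keys only."""
    return k <= q


def sliding_window_causal(window):
    """Causal, and limited to the most recent `window` positions (assumed semantics)."""
    def pred(q, k):
        return k <= q and q - k < window
    return pred


def chunked(chunk_size):
    """Query and key must fall in the same fixed-size chunk (assumed semantics)."""
    def pred(q, k):
        return q // chunk_size == k // chunk_size
    return pred


def null_mask(q, k):
    """Masks nothing: every position may attend to every other."""
    return True


def and_mask(a, b):
    """Attend only where both masks allow it."""
    return lambda q, k: a(q, k) and b(q, k)


def or_mask(a, b):
    """Attend where either mask allows it."""
    return lambda q, k: a(q, k) or b(q, k)


def materialize(pred, n):
    """Expand a predicate into an explicit n x n grid of 0/1 entries."""
    return [[1 if pred(q, k) else 0 for k in range(n)] for q in range(n)]


# A chunked causal mask expressed as the AND of a chunked mask and a causal mask.
chunked_causal = and_mask(chunked(2), causal)

for name, pred in [
    ("causal", causal),
    ("sliding window (2)", sliding_window_causal(2)),
    ("chunked causal (2)", chunked_causal),
]:
    print(name)
    for row in materialize(pred, 4):
        print(" ", row)
```

Writing masks as predicates keeps composition simple, while `materialize` shows the trade-off a tensor-backed mask makes: it stores every entry explicitly instead of computing it from indices.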
Traits
- MHAMask: The MHAMask trait describes masks for MHA kernels, such as the causal mask.
Functions
- ChunkedCausalMask: Mask implementing Chunked Causal attention for Llama4 models.