Mojo module
mha_mask
Aliases
MASK_VALUE
alias MASK_VALUE = -10000
Structs
- AndMask: Mask that's the AND of two masks.
- CausalMask: MHA causal mask ensures a token is only affected by previous tokens.
- ChunkedMask: Mask implementing Chunked attention.
- MaskName: A tile's masking status.
- MaterializedMask: Mask that's backed by a materialized tensor.
- NullMask: Mask that's effectively a noop.
- OrMask: Mask that's the OR of two masks.
- SlidingWindowCausalMask: Mask implementing Sliding Window attention.
- TileMaskStatus: A tile's masking status.
Traits
- MHAMask: The MHAMask trait describes masks for MHA kernels, such as the causal mask.
Functions
- ChunkedCausalMask: Mask implementing Chunked Causal attention for Llama4 models.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!
