Mojo module
mha_mask
Aliases
MASK_VALUE
alias MASK_VALUE = -10000
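The page does not say how MASK_VALUE is applied. A common pattern, assumed here rather than taken from the module, is to overwrite masked attention scores with a large negative constant before the softmax so that they receive effectively zero weight. The helper `masked_softmax` and its `keep` flags are hypothetical names, written in Python purely for illustration.

```python
import math

MASK_VALUE = -10000  # same constant as the alias above


def masked_softmax(scores, keep):
    """Softmax over `scores`; entries whose `keep` flag is False are
    overwritten with MASK_VALUE first (assumed usage, not the module's code)."""
    masked = [s if k else MASK_VALUE for s, k in zip(scores, keep)]
    peak = max(masked)  # subtract the maximum for numerical stability
    exps = [math.exp(m - peak) for m in masked]
    total = sum(exps)
    return [e / total for e in exps]


# The third score is masked out, so its weight underflows to zero.
probs = masked_softmax([1.0, 2.0, 3.0], [True, True, False])
print(probs)
```

With a constant this negative, the exponential of a masked score underflows to zero in floating point, so the remaining weights still sum to one.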
Structs
- AndMask: Mask that's the AND of two masks.
- CausalMask: MHA causal mask ensures a token is only affected by previous tokens.
- ChunkedMask: Mask implementing Chunked attention.
- MaskName: A tile's masking status.
- MaterializedMask: Mask that's backed by a materialized tensor.
- NullMask: Mask that's effectively a noop.
- OrMask: Mask that's the OR of two masks.
- SlidingWindowCausalMask: Mask implementing Sliding Window attention.
- TileMaskStatus: A tile's masking status.
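To make the mask kinds listed above concrete, here is a minimal Python sketch that models each one as a predicate over a (query index, key index) pair. It is not the module's API. Every function name is hypothetical, `True` is assumed to mean "the query may attend to the key", the window is assumed to include the current token, and the AND/OR combinators are assumed to combine visibility rather than maskedness.

```python
def null_mask(q, k):
    """No-op mask: every key is visible to every query."""
    return True


def causal(q, k):
    """A query sees only keys at or before its own position."""
    return k <= q


def sliding_window_causal(window):
    """Causal, and limited to the most recent `window` positions."""
    def visible(q, k):
        return k <= q and q - k < window
    return visible


def chunked(chunk):
    """A query sees only keys that fall in the same fixed-size chunk."""
    def visible(q, k):
        return q // chunk == k // chunk
    return visible


def and_mask(a, b):
    """Visible only where both masks allow attention."""
    return lambda q, k: a(q, k) and b(q, k)


def or_mask(a, b):
    """Visible where either mask allows attention."""
    return lambda q, k: a(q, k) or b(q, k)


def grid(visible, n):
    """Render an n x n mask: 'x' = visible, '.' = masked; rows are queries."""
    return ["".join("x" if visible(q, k) else "." for k in range(n))
            for q in range(n)]


for name, mask in [("causal", causal),
                   ("sliding window (2)", sliding_window_causal(2)),
                   ("causal AND chunked (2)", and_mask(causal, chunked(2)))]:
    print(name)
    print("\n".join(grid(mask, 4)))
```

Rendering a predicate with `grid` is loosely analogous to materializing a mask into a table, whereas the predicates themselves are evaluated on demand.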
Traits
- MHAMask: The MHAMask trait describes masks for MHA kernels, such as the causal mask.
Functions
- ChunkedCausalMask: Mask implementing Chunked Causal attention for Llama4 models.
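As a rough illustration of chunked causal attention, and of why a per-tile masking status is useful, the Python sketch below classifies rectangular tiles of the attention matrix. It assumes a chunked causal rule of "same chunk and not in the future". The function names and the status labels `no_mask`, `partial_mask`, and `full_mask` are hypothetical and are not taken from the module.

```python
def chunked_causal(q, k, chunk):
    """True when query q may attend to key k: both lie in the same
    chunk and k is not after q (assumed rule)."""
    return q // chunk == k // chunk and k <= q


def tile_status(q0, k0, rows, cols, chunk):
    """Classify the rows x cols tile whose top-left corner is (q0, k0)."""
    visible = [chunked_causal(q, k, chunk)
               for q in range(q0, q0 + rows)
               for k in range(k0, k0 + cols)]
    if all(visible):
        return "no_mask"       # nothing to mask; the check can be skipped
    if not any(visible):
        return "full_mask"     # the whole tile can be skipped
    return "partial_mask"      # the mask must be applied per element


# Chunk size 4, tiles of 2 x 2 elements.
for corner in [(2, 0), (0, 0), (0, 2), (4, 0)]:
    print(corner, tile_status(corner[0], corner[1], 2, 2, chunk=4))
```

A kernel that knows a tile's status up front can skip fully masked tiles and avoid per-element checks on unmasked ones, which is the usual motivation for tracking it.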