Mojo module
mha_mask
Aliases
- MASK_VALUE = -10000
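To illustrate why a large negative constant such as `MASK_VALUE` is useful, here is a minimal sketch in plain Python (not the Mojo API). It assumes that masked attention scores are replaced with the constant before the softmax; how the kernels actually apply it is not stated on this page. The helper names `softmax` and `apply_mask` are hypothetical.

```python
import math

# Mirrors the alias listed above. Using it as a fill value for masked
# scores is an assumption made for this illustration.
MASK_VALUE = -10000.0


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def apply_mask(scores, keep):
    """Replace the score at every masked-out position with MASK_VALUE."""
    return [s if k else MASK_VALUE for s, k in zip(scores, keep)]


scores = [1.0, 2.0, 3.0]
keep = [True, True, False]  # the last key is masked out
probs = softmax(apply_mask(scores, keep))
```

Under this assumption the masked position receives a probability that underflows to zero, while the remaining positions still sum to one.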
Structs
- AndMask: Mask that's the AND of two masks.
- CausalMask: MHA causal mask, which ensures a token is only affected by previous tokens.
- ChunkedMask: Mask implementing chunked attention.
- MaterializedMask: Mask that's backed by a materialized tensor.
- NullMask: Mask that's effectively a no-op.
- OrMask: Mask that's the OR of two masks.
- SlidingWindowCausalMask: Mask implementing sliding window attention.
- TileMaskStatus: A tile's masking status.
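The mask kinds listed above can be pictured as predicates over a (query index, key index) pair. The following is a conceptual sketch in plain Python, not the Mojo API: all function names are hypothetical, and the exact boundary conventions (a token attending to itself, the window including the current position) are assumptions chosen for illustration.

```python
def null(q, k):
    """No-op mask: every position is kept."""
    return True


def causal(q, k):
    """Keep keys at or before the query position (assumed convention)."""
    return k <= q


def sliding_window_causal(window):
    """Causal mask limited to the most recent `window` keys (assumed)."""
    def keep(q, k):
        return k <= q and q - k < window
    return keep


def chunked(chunk):
    """Keep keys that fall in the same fixed-size chunk as the query."""
    def keep(q, k):
        return q // chunk == k // chunk
    return keep


def and_mask(a, b):
    """Keep a position only if both masks keep it."""
    return lambda q, k: a(q, k) and b(q, k)


def or_mask(a, b):
    """Keep a position if either mask keeps it."""
    return lambda q, k: a(q, k) or b(q, k)


def materialize(keep, n):
    """Expand a predicate into an explicit n x n grid of 0/1 entries."""
    return [[1 if keep(q, k) else 0 for k in range(n)] for q in range(n)]


grid = materialize(sliding_window_causal(2), 4)
```

Here `materialize` plays the role of a mask backed by an explicit tensor, while the other functions compute the mask on the fly from indices.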
Traits
- MHAMask: The MHAMask trait describes masks for MHA kernels, such as the causal mask.
Functions
- ChunkedCausalMask: Mask implementing chunked causal attention for Llama4 models.
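One way to picture chunked causal attention is as the intersection of a chunked mask and a causal mask: a query sees only keys in its own chunk that are not in its future. The sketch below is plain Python rather than the Mojo API, and it assumes that reading of the description. The name `chunked_causal` and the chunk size of 3 are hypothetical.

```python
def chunked_causal(chunk):
    """Predicate for an assumed chunked causal pattern.

    A key is kept when it is in the same chunk as the query and is not
    after the query position.
    """
    def keep(q, k):
        same_chunk = q // chunk == k // chunk
        return same_chunk and k <= q
    return keep


keep = chunked_causal(3)

# Render a 6 x 6 pattern: "x" marks a kept position, "." a masked one.
rows = ["".join("x" if keep(q, k) else "." for k in range(6)) for q in range(6)]
```

With a chunk size of 3, positions 0 to 2 and 3 to 5 each form an independent lower-triangular block, so attention never crosses a chunk boundary.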