Skip to main content

Mojo struct

KVCacheStaticParams

struct KVCacheStaticParams

Fields​

  • ​num_heads (Int):
  • ​head_size (Int):
  • ​is_mla (Bool):

Implemented traits​

AnyType, Copyable, Equatable, ImplicitlyCopyable, ImplicitlyDestructible, Movable, RegisterPassable, TrivialRegisterPassable

Methods​

__init__​

__init__(num_heads: Int, head_size: Int, is_mla: Bool = False) -> Self

Initialize KVCacheStaticParams. Args: num_heads (Int): Number of attention heads. head_size (Int): Size of each attention head. is_mla (Bool, optional): Whether to use Multi-Linear Attention (MLA) mode. If true, we only store k cache. If False, we store k and v cache. Defaults to False.