IMPORTANT: To view this page as Markdown, append `.md` to the URL (e.g. /max/get-started.md). For the complete documentation index, see llms.txt.
Skip to main content
For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).

Mojo package

kv_cache

Contains implementations for several types of key-value caches.

KV caches are used in transformer models to store key-value tensors output from self-attention layers.

These APIs are used in the higher-level functions in the nn package.

Modulesโ€‹