
Python function

load_kv_manager

load_kv_manager()

max.pipelines.kv_cache.load_kv_manager(params, max_batch_size, max_seq_len, session, available_cache_memory)


Loads a KV cache manager from the given params.

Accepts both KVCacheParams (single cache) and MultiKVCacheParams (multiple caches). The returned PagedKVCacheManager natively handles all caches with a single BlockManager and KVConnector.

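Parameters:

params: the cache parameters, either KVCacheParams (single cache) or MultiKVCacheParams (multiple caches)
max_batch_size: the maximum batch size
max_seq_len: the maximum sequence length
session: the session used to load the manager
available_cache_memory: the memory available for the KV cache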

Return type:

PagedKVCacheManager
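
As a rough illustration of the call shape, the sketch below wraps load_kv_manager in a small helper. The helper name build_kv_manager is hypothetical and not part of MAX. The sketch assumes that params and session have already been constructed elsewhere, that the function accepts its arguments by keyword using the names in the signature above, and that the three numeric arguments are plain integers. None of those assumptions is confirmed on this page.

```python
from typing import Any


def build_kv_manager(
    params: Any,
    session: Any,
    *,
    max_batch_size: int,
    max_seq_len: int,
    available_cache_memory: int,
) -> Any:
    """Hypothetical helper: forward prepared arguments to load_kv_manager."""
    # Imported lazily so this module can be imported without MAX installed.
    from max.pipelines.kv_cache import load_kv_manager

    # `params` may be a KVCacheParams or a MultiKVCacheParams; per the
    # description above, either one yields a single PagedKVCacheManager.
    # Passing by keyword is an assumption based on the documented names.
    return load_kv_manager(
        params=params,
        max_batch_size=max_batch_size,
        max_seq_len=max_seq_len,
        session=session,
        available_cache_memory=available_cache_memory,
    )
```

Because the same entry point accepts both parameter types, a caller written this way should not need a separate code path for single-cache and multi-cache models.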