Skip to main content

Python function

load_kv_manager

load_kv_manager()

max.kv_cache.load_kv_manager(params, max_batch_size, max_seq_len, session, available_cache_memory)

source

Loads a KV cache manager from the given params.

Accepts both KVCacheParams (single cache) and MultiKVCacheParams (multiple caches). The returned PagedKVCacheManager natively handles all caches with a single BlockManager and KVConnector.

Parameters:

Return type:

PagedKVCacheManager