Skip to main content

Python module

registry

estimate_kv_cache_size()โ€‹

max.kv_cache.registry.estimate_kv_cache_size(params, max_batch_size, max_seq_len, num_layers, available_cache_memory, devices, **kwargs)

Parameters:

Return type:

int

infer_optimal_batch_size()โ€‹

max.kv_cache.registry.infer_optimal_batch_size(params, max_seq_len, num_layers, available_cache_memory, devices, **kwargs)

Parameters:

Return type:

int

load_kv_manager()โ€‹

max.kv_cache.registry.load_kv_manager(params, max_batch_size, max_seq_len, num_layers, devices, session, available_cache_memory=None, page_size=512)

Parameters:

Return type:

PagedKVCacheManager | NullKVCacheManager

Was this page helpful?