Skip to main content

Python class

KVCacheParamInterface

KVCacheParamInterface

class max.nn.kv_cache.KVCacheParamInterface(*args, **kwargs)

source

Bases: Protocol

Interface for KV cache parameters.

bytes_per_block

property bytes_per_block: int

source

Number of bytes per cache block.

data_parallel_degree

data_parallel_degree: int

source

enable_kvcache_swapping_to_host

enable_kvcache_swapping_to_host: bool

source

get_symbolic_inputs()

get_symbolic_inputs()

source

Returns the symbolic inputs for the KV cache.

Return type:

FlattenableInputSymbols

host_kvcache_swap_space_gb

host_kvcache_swap_space_gb: float | None

source

n_devices

n_devices: int

source

page_size

page_size: int

source