Python class
KVCacheInputs
KVCacheInputs
class max.nn.kv_cache.KVCacheInputs(inputs)
Bases: Generic[_Tensor, _Buffer]
Symbolic graph input types for all devices’ paged KV cache.
-
Parameters:
-
inputs (Sequence[KVCacheInputsPerDevice[_Tensor, _Buffer]])
flatten()
flatten()
-
Return type:
-
list[_Tensor | _Buffer]
inputs
inputs: Sequence[KVCacheInputsPerDevice[_Tensor, _Buffer]]
unflatten()
unflatten(it)
-
Parameters:
-
Return type:
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!