Mojo module
types
Structs
- ContinuousBatchingKVCache: Wrapper for the ContinuousKVCache of a given layer in the transformer model.
- ContinuousBatchingKVCacheCollection: This is a "view" of the cache for the given sequences in the batch.
- KVCacheStaticParams
- PagedKVCache: The PagedKVCache is a wrapper around the KVCache blocks for a given layer. It is used to access the KVCache blocks for PagedAttention.
- PagedKVCacheCollection
- PagedKVCacheCollectionFA3Fallback
- PagedKVCacheFA3Fallback: The PagedKVCache is a wrapper around the KVCache blocks for a given layer. It is used to access the KVCache blocks for PagedAttention.
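The paged structs above describe access to KVCache blocks for PagedAttention. As a rough illustration of the general idea only, the Python sketch below maps a logical token position to a physical block through a per-sequence block table. Every name in it (PagedCacheSketch, page_size, block_table, append, load) is hypothetical and is not taken from this module's API; it assumes fixed-size blocks allocated on demand.

```python
from dataclasses import dataclass, field


@dataclass
class PagedCacheSketch:
    """Hypothetical illustration of paged lookup; not the Mojo PagedKVCache API."""

    page_size: int
    # Physical block pool: block id -> per-token entries stored in that block.
    blocks: list = field(default_factory=list)
    # Block table: sequence id -> ordered list of physical block ids.
    block_table: dict = field(default_factory=dict)

    def append(self, seq_id, entry):
        table = self.block_table.setdefault(seq_id, [])
        # Allocate a new block if the sequence has none or its last block is full.
        if not table or len(self.blocks[table[-1]]) == self.page_size:
            self.blocks.append([])
            table.append(len(self.blocks) - 1)
        self.blocks[table[-1]].append(entry)

    def load(self, seq_id, token_idx):
        # Split the logical position into (block index within sequence, offset).
        block_idx, offset = divmod(token_idx, self.page_size)
        return self.blocks[self.block_table[seq_id][block_idx]][offset]


# Interleave two sequences so each one ends up with non-contiguous blocks.
cache = PagedCacheSketch(page_size=2)
for t in range(3):
    cache.append("a", ("a", t))
    cache.append("b", ("b", t))

print(cache.block_table["a"], cache.load("a", 2))
```

The point of the sketch is that callers address tokens by logical position while the storage for one sequence can be scattered across the block pool.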
Traits
- KVCacheT: Trait for different KVCache types and implementations.
- KVCollectionT