For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).
Mojo function
print_kv_cache_cont_batch_generic_gpu
def print_kv_cache_cont_batch_generic_gpu[target: StringSlice[StaticConstantOrigin], dtype: DType, kv_params: KVCacheStaticParams](valid_lengths: LayoutTensor[DType.uint32, element_layout=valid_lengths.element_layout, layout_int_type=valid_lengths.layout_int_type, linear_idx_type=valid_lengths.linear_idx_type, masked=valid_lengths.masked, alignment=valid_lengths.alignment], kv_collection: ContinuousBatchingKVCacheCollection[dtype, kv_params], layer_idx: UInt32, is_print_compact: Bool, context: DeviceContext)
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!