Python function
unflatten_ragged_attention_inputs
unflatten_ragged_attention_inputs()โ
max.nn.kv_cache.unflatten_ragged_attention_inputs(kv_inputs_flat, *, n_devices)
Unmarshals flattened KV graph inputs into typed cache values.
-
Parameters:
-
Return type:
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!