For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).
Python function
ragged_token_merger
ragged_token_merger()
max.pipelines.speculative.ragged_token_merger(device)
Builds a graph that merges prompt and draft tokens into a single ragged sequence.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!