Python function
build_greedy_acceptance_sampler_graph
max.pipelines.sampling.build_greedy_acceptance_sampler_graph(device)
Builds a graph that implements strict greedy acceptance for multi-token prediction (MTP).
Draft tokens are accepted only when they match the argmax of the target logits at each position. The graph always produces a recovered token for every draft position, plus a bonus token from the final (+1) target position.
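The acceptance rule described above can be illustrated with a minimal pure-Python sketch for a single sequence. This is not the MAX graph or its API: the names `greedy_accept` and `argmax`, the list-based inputs, and the returned tuple are all hypothetical and chosen only for illustration. The sketch also assumes the common speculative-decoding convention that acceptance stops at the first mismatch, which the description above does not state explicitly.

```python
from typing import List, Tuple


def argmax(row: List[float]) -> int:
    """Return the index of the largest value; ties go to the lowest index."""
    return max(range(len(row)), key=lambda i: row[i])


def greedy_accept(
    draft_tokens: List[int],
    target_logits: List[List[float]],
) -> Tuple[int, List[int], int]:
    """Hypothetical reference for one sequence (not the MAX implementation).

    draft_tokens:  K proposed token ids.
    target_logits: K + 1 rows of vocabulary logits from the target model.
    Returns (num_accepted, recovered_tokens, bonus_token).
    """
    k = len(draft_tokens)
    if len(target_logits) != k + 1:
        raise ValueError("expected one more target position than draft tokens")

    # Greedy choice of the target model at every position.
    target_tokens = [argmax(row) for row in target_logits]

    # One recovered token per draft position, plus a bonus from position K.
    recovered = target_tokens[:k]
    bonus = target_tokens[k]

    # Assumption: count the leading run of draft tokens that match the argmax.
    num_accepted = 0
    for draft, target in zip(draft_tokens, recovered):
        if draft != target:
            break
        num_accepted += 1
    return num_accepted, recovered, bonus


draft = [2, 0, 1]
logits = [
    [0.1, 0.2, 0.9],  # argmax 2, matches draft[0]
    [0.3, 0.8, 0.1],  # argmax 1, differs from draft[1]
    [0.0, 1.0, 0.5],  # argmax 1
    [0.7, 0.1, 0.2],  # argmax 0, the bonus position
]
result = greedy_accept(draft, logits)
```

In this example only the first draft token is accepted. A caller would typically substitute the recovered token at the first rejected position, and use the bonus token only when every draft token is accepted; how the actual graph exposes these outputs is not specified here.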