For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).
Python class
RejectionRunner
RejectionRunner
class max.pipelines.sampling.RejectionRunner(session, device_ref)
Bases: Protocol
Interface for rejection sampling runners.
-
Parameters:
-
- session (InferenceSession)
- device_ref (DeviceRef)
run()
run(draft_tokens, draft_logits, target_logits, target_logit_offsets, all_draft_logits, context_batch)
Run the rejection sampler.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!