Python class
UnifiedEagleOutputs
UnifiedEagleOutputsβ
class max.pipelines.lib.UnifiedEagleOutputs(*, logits=None, next_token_logits=None, logit_offsets=None, hidden_states=None, num_accepted_draft_tokens, next_tokens, next_draft_tokens)
Bases: ModelOutputs
Outputs from a unified EAGLE graph execution.
-
Parameters:
hidden_statesβ
hidden_states: None = None
Optional hidden states for text generation.
Single-device shape is [T_h, H] where:
- none mode: NONE (default)
- last-token mode:
T_h = B - all-token mode:
T_h = total_input_tokens
For data parallel models, the hs will be on the first gpu since it is replicated.
logit_offsetsβ
logit_offsets: None = None
Cumulative row offsets into logits for text generation.
Shape is [B + 1]. Per-sequence logits are:
logits[logit_offsets[i]:logit_offsets[i + 1], :].
logitsβ
Primary logits buffer.
For text generation this has shape [T, V] where:
- last-token mode:
T = B(default) - all-token mode:
T = total_input_tokens - variable mode:
T = logit_offsets[-1](typicallyB * return_n_logits)
next_draft_tokensβ
next_draft_tokens: Buffer
next_token_logitsβ
next_token_logits: None = None
Next-token logits for text generation, shape [B, V] when present.
next_tokensβ
next_tokens: Buffer
num_accepted_draft_tokensβ
num_accepted_draft_tokens: Buffer
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!