Skip to main content

Python class

SamplingParamsInput

SamplingParamsInput

class max.interfaces.SamplingParamsInput(top_k=None, top_p=None, min_p=None, temperature=None, frequency_penalty=None, presence_penalty=None, repetition_penalty=None, max_new_tokens=None, min_new_tokens=None, ignore_eos=None, stop=None, stop_token_ids=None, detokenize=None, seed=None, logits_processors=None)

source

Bases: object

Input dataclass for creating SamplingParams instances.

All fields are optional, allowing partial specification with None values indicating “use default”. This enables static type checking while maintaining the flexibility to specify only the parameters you want to override.

Parameters:

detokenize

detokenize: bool | None = None

source

Whether to convert output token IDs back to text. Defaults to None (use class default).

frequency_penalty

frequency_penalty: float | None = None

source

The penalty applied proportionally to token frequency in the generated text. Defaults to None (use class default).

ignore_eos

ignore_eos: bool | None = None

source

Whether to continue generating past end-of-sequence tokens. Defaults to None (use class default).

logits_processors

logits_processors: Sequence[Callable[[ProcessorInputs], None]] | None = None

source

Callables applied to model logits before sampling. Defaults to None (use class default).

max_new_tokens

max_new_tokens: int | None = None

source

The maximum number of tokens to generate. Defaults to None (use class default).

min_new_tokens

min_new_tokens: int | None = None

source

The minimum number of tokens to generate before stopping. Defaults to None (use class default).

min_p

min_p: float | None = None

source

The minimum probability threshold for a token relative to the most likely token. Defaults to None (use class default).

presence_penalty

presence_penalty: float | None = None

source

The flat penalty applied to tokens that have appeared at least once. Defaults to None (use class default).

repetition_penalty

repetition_penalty: float | None = None

source

The factor by which logits of repeated tokens are divided. Defaults to None (use class default).

seed

seed: int | None = None

source

The random seed for reproducible sampling. Defaults to None (use class default).

stop

stop: list[str] | None = None

source

A list of strings that, when generated, will stop the generation. Defaults to None (use class default).

stop_token_ids

stop_token_ids: list[int] | None = None

source

A list of token IDs that, when generated, will stop the generation. Defaults to None (use class default).

temperature

temperature: float | None = None

source

The temperature for controlling output randomness. Defaults to None (use class default).

top_k

top_k: int | None = None

source

The number of most probable tokens to keep when sampling. Defaults to None (use class default).

top_p

top_p: float | None = None

source

The cumulative probability threshold for nucleus sampling. Defaults to None (use class default).