
Python module

max.pipelines.architectures.olmo_modulev3

OLMo transformer architecture for text generation.

OlmoConfig

class max.pipelines.architectures.olmo_modulev3.OlmoConfig(*, hidden_size, num_attention_heads, num_key_value_heads, num_hidden_layers, rope_theta, rope_scaling_params, max_seq_len, intermediate_size, interleaved_rope_weights, vocab_size, dtype, kv_params, return_logits=ReturnLogits.LAST_TOKEN, norm_method='rms_norm', attention_bias=False, rms_norm_eps=None, tie_word_embeddings=False, stacked_mlp=False, stacked_qkv=False, attention_multiplier, embedding_multiplier, residual_multiplier, devices, clip_qkv=None, norm_elementwise_affine=False, longrope_scaling_params=None, logits_scaling=1.0, return_hidden_states=ReturnHiddenStates.NONE)

Bases: Llama3Config

Model configuration for Olmo graph construction/execution.

Parameters:

finalize()

finalize(huggingface_config, state_dict, return_logits, return_hidden_states=ReturnHiddenStates.NONE, norm_method='rms_norm', attention_bias=False)

Define parameters that can't be determined just from the pipeline config.

Parameters:

Return type:

None

norm_elementwise_affine

norm_elementwise_affine: bool = False

OlmoModel

class max.pipelines.architectures.olmo_modulev3.OlmoModel(pipeline_config, session, devices, kv_cache_config, weights, adapter=None, return_logits=ReturnLogits.LAST_TOKEN, return_hidden_states=ReturnHiddenStates.NONE)

Bases: Llama3Model

Olmo pipeline model implementation.

Parameters:

config_class

config_class

alias of OlmoConfig

norm_method

norm_method: Literal['rms_norm'] | Literal['layer_norm'] = 'layer_norm'
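
The two values accepted by `norm_method` name standard normalization schemes. The plain-Python sketch below illustrates how they differ mathematically. It is not the MAX implementation: the `normalize` function, its `weight` and `eps` parameters, and the `NormMethod` alias are hypothetical names introduced only for illustration. The optional `weight` stands in for a learned elementwise scale, which is an assumption about what a flag such as `norm_elementwise_affine` would typically control.

```python
import math
from typing import Literal, Optional, Sequence

# Hypothetical alias mirroring the Literal type shown for norm_method.
NormMethod = Literal["rms_norm", "layer_norm"]


def normalize(
    x: Sequence[float],
    method: NormMethod = "layer_norm",
    weight: Optional[Sequence[float]] = None,
    eps: float = 1e-5,
) -> list[float]:
    """Normalize one vector with layer norm or RMS norm (illustrative only)."""
    n = len(x)
    if method == "layer_norm":
        # Layer norm: subtract the mean, then divide by the standard deviation.
        mean = sum(x) / n
        centered = [v - mean for v in x]
        var = sum(v * v for v in centered) / n
        out = [v / math.sqrt(var + eps) for v in centered]
    elif method == "rms_norm":
        # RMS norm: divide by the root mean square, with no mean subtraction.
        mean_sq = sum(v * v for v in x) / n
        out = [v / math.sqrt(mean_sq + eps) for v in x]
    else:
        raise ValueError(f"unknown norm method: {method!r}")

    # Assumed meaning of an elementwise affine: an optional per-element scale.
    if weight is not None:
        out = [o * w for o, w in zip(out, weight)]
    return out


if __name__ == "__main__":
    sample = [1.0, 2.0, 3.0, 4.0]
    print(normalize(sample, method="layer_norm"))
    print(normalize(sample, method="rms_norm"))
```

With `weight=None` the sketch applies no learned scale, so the output depends only on the input vector and `eps`.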
