IMPORTANT: To view this page as Markdown, append `.md` to the URL (e.g. /max/get-started.md). For the complete documentation index, see llms.txt.
Skip to main content
For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).

Python module

max.pipelines.architectures

MAX includes built-in support for a wide range of model architectures. Each architecture module registers a SupportedArchitecture instance that tells the pipeline system how to load, configure, and execute a particular model family.

Text generationโ€‹

deepseekV2DeepSeek-V2 mixture-of-experts architecture for text generation.
deepseekV3DeepSeek-V3 mixture-of-experts architecture for text generation.
deepseekV3_2DeepSeek-V3.2 mixture-of-experts architecture for text generation.
deepseekV3_nextnDeepSeek-V3 NextN multi-token prediction draft model for speculative decoding.
dflash_llama3DFlash draft model for Llama3-family targets.
eagle3_deepseekV3DeepseekV3 + Eagle3 speculator pipeline.
eagle_llama3EAGLE speculative decoding draft model for Llama 3.
exaoneEXAONE architecture, builds on llama3.
exaone_modulev3EXAONE (eager) architecture, builds on llama3_modulev3.
gemma3Gemma 3 transformer architecture for text generation.
gemma3_modulev3Gemma 3 transformer architecture for text generation.
gemma3multimodalGemma 3 vision-language architecture for multimodal text generation.
gemma3multimodal_modulev3Gemma 3 vision-language architecture for multimodal text generation.
gemma4Gemma 4 vision-language architecture for multimodal text generation.
gpt_ossGPT-OSS mixture-of-experts architecture for text generation.
gpt_oss_modulev3GPT-OSS mixture-of-experts architecture for text generation.
graniteGranite architecture, builds on llama3.
granite_modulev3Granite (eager) architecture, builds on llama3_modulev3.
idefics3Idefics3 vision-language architecture for multimodal text generation.
idefics3_modulev3Idefics3 vision-language architecture for multimodal text generation.
internvlInternVL vision-language architecture for multimodal text generation.
kimik2_5Kimi K2.5 mixture-of-experts architecture for text generation.
lfm2
llama3Llama 3 transformer architecture for text generation.
llama3_modulev3Llama 3 transformer architecture for text generation.
mambaMamba state-space architecture for text generation.
minimax_m2
mistralMistral transformer architecture for text generation.
mistral3Mistral 3 vision-language architecture for multimodal text generation.
olmoOLMo transformer architecture for text generation.
olmo2OLMo 2 transformer architecture for text generation.
olmo2_modulev3OLMo 2 transformer architecture for text generation.
olmo3OLMo 3 transformer architecture for text generation.
olmo_modulev3OLMo transformer architecture for text generation.
phi3Phi-3 transformer architecture for text generation.
phi3_modulev3Phi-3 transformer architecture for text generation.
pixtralPixtral vision-language architecture for multimodal text generation.
pixtral_modulev3Pixtral vision-language architecture for multimodal text generation.
qwen2Qwen2 transformer architecture for text generation.
qwen2_5vlQwen2.5-VL vision-language architecture for multimodal text generation.
qwen3Qwen3 transformer architecture for text generation.
qwen3_5
qwen3vl_moeQwen3-VL vision-language architecture for multimodal text generation.
step3p5
unified_dflash_llama3DFlash speculative decoding for Llama3 with unified graph compilation.
unified_eagle_llama3EAGLE speculative decoding draft model for Llama 3 with unified graph compilation.
unified_mtp_deepseekV3DeepSeek-V3 multi-token prediction draft model for speculative decoding with unified graph compilation.

Embeddingsโ€‹

bertBERT sentence transformer architecture for embeddings generation.
mpnetMPNet sentence transformer architecture for embeddings generation.
mpnet_modulev3MPNet sentence transformer architecture for embeddings generation.
qwen3_embeddingQwen3 architecture for embeddings generation.
qwen3_embedding_modulev3Qwen3 architecture for embeddings generation.

Image generationโ€‹

flux2FLUX.2 diffusion architecture for image generation.
qwen_imageQwen-Image diffusion architecture for image generation.
qwen_image_editQwen-Image-Edit diffusion architecture for image editing.
wanWan diffusion architecture for video generation.
z_image_modulev3Z-Image diffusion architecture for image generation.