Python module

max.pipelines.architectures

MAX includes built-in support for a wide range of model architectures. Each architecture module registers a SupportedArchitecture instance that tells the pipeline system how to load, configure, and execute a particular model family.
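The registration idea described above can be illustrated with a small, self-contained sketch. This is not MAX's actual API: `ArchitectureSpec`, `register`, `lookup`, and the `task` and `build` fields are hypothetical names invented for illustration, and the real `SupportedArchitecture` class may carry different fields. The sketch only shows the general pattern in which each architecture contributes one descriptor to a shared registry, and the pipeline looks that descriptor up by name.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class ArchitectureSpec:
    """Hypothetical stand-in for a registered architecture descriptor."""

    name: str                      # architecture name, e.g. "llama3"
    task: str                      # e.g. "text_generation"
    build: Callable[[dict], str]   # illustrative factory: config -> model summary


# Shared registry that maps an architecture name to its descriptor.
_REGISTRY: dict[str, ArchitectureSpec] = {}


def register(spec: ArchitectureSpec) -> None:
    """Add a descriptor to the registry, rejecting duplicate names."""
    if spec.name in _REGISTRY:
        raise ValueError(f"architecture already registered: {spec.name!r}")
    _REGISTRY[spec.name] = spec


def lookup(name: str) -> ArchitectureSpec:
    """Return the descriptor registered under `name`."""
    try:
        return _REGISTRY[name]
    except KeyError:
        raise KeyError(f"unknown architecture: {name!r}") from None


# Each "architecture module" would register exactly one descriptor.
register(
    ArchitectureSpec(
        name="llama3",
        task="text_generation",
        build=lambda config: f"llama3 model with {config['num_layers']} layers",
    )
)
register(
    ArchitectureSpec(
        name="bert",
        task="embeddings_generation",
        build=lambda config: f"bert model with {config['num_layers']} layers",
    )
)

# The pipeline side only needs the name to find out how to build the model.
spec = lookup("llama3")
description = spec.build({"num_layers": 32})
```

Keeping the registry keyed by name means a new model family can be supported by adding one module that registers its own descriptor, without changing the lookup code.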

Text generation

deepseekV2: DeepSeek-V2 mixture-of-experts architecture for text generation.
deepseekV3: DeepSeek-V3 mixture-of-experts architecture for text generation.
deepseekV3_2: DeepSeek-V3.2 mixture-of-experts architecture for text generation.
deepseekV3_nextn: DeepSeek-V3 NextN multi-token prediction draft model for speculative decoding.
eagle_llama3: EAGLE speculative decoding draft model for Llama 3.
exaone: EXAONE architecture, builds on llama3.
exaone_modulev3: EXAONE (eager) architecture, builds on llama3_modulev3.
gemma3: Gemma 3 transformer architecture for text generation.
gemma3_modulev3: Gemma 3 transformer architecture for text generation.
gemma3multimodal: Gemma 3 vision-language architecture for multimodal text generation.
gemma3multimodal_modulev3: Gemma 3 vision-language architecture for multimodal text generation.
gemma4: Gemma 4 vision-language architecture for multimodal text generation.
gpt_oss: GPT-OSS mixture-of-experts architecture for text generation.
gpt_oss_modulev3: GPT-OSS mixture-of-experts architecture for text generation.
granite: Granite architecture, builds on llama3.
granite_modulev3: Granite (eager) architecture, builds on llama3_modulev3.
idefics3: Idefics3 vision-language architecture for multimodal text generation.
idefics3_modulev3: Idefics3 vision-language architecture for multimodal text generation.
internvl: InternVL vision-language architecture for multimodal text generation.
kimik2_5: Kimi K2.5 mixture-of-experts architecture for text generation.
llama3: Llama 3 transformer architecture for text generation.
llama3_modulev3: Llama 3 transformer architecture for text generation.
mamba: Mamba state-space architecture for text generation.
minimax_m2
mistral: Mistral transformer architecture for text generation.
mistral3: Mistral 3 vision-language architecture for multimodal text generation.
olmo: OLMo transformer architecture for text generation.
olmo2: OLMo 2 transformer architecture for text generation.
olmo2_modulev3: OLMo 2 transformer architecture for text generation.
olmo3: OLMo 3 transformer architecture for text generation.
olmo_modulev3: OLMo transformer architecture for text generation.
phi3: Phi-3 transformer architecture for text generation.
phi3_modulev3: Phi-3 transformer architecture for text generation.
pixtral: Pixtral vision-language architecture for multimodal text generation.
pixtral_modulev3: Pixtral vision-language architecture for multimodal text generation.
qwen2: Qwen2 transformer architecture for text generation.
qwen2_5vl: Qwen2.5-VL vision-language architecture for multimodal text generation.
qwen3: Qwen3 transformer architecture for text generation.
qwen3vl_moe: Qwen3-VL vision-language architecture for multimodal text generation.
unified_eagle_llama3: EAGLE speculative decoding draft model for Llama 3 with unified graph compilation.
unified_mtp_deepseekV3: DeepSeek-V3 multi-token prediction draft model for speculative decoding with unified graph compilation.

Embeddings

bert: BERT sentence transformer architecture for embeddings generation.
mpnet: MPNet sentence transformer architecture for embeddings generation.
mpnet_modulev3: MPNet sentence transformer architecture for embeddings generation.
qwen3_embedding: Qwen3 architecture for embeddings generation.
qwen3_embedding_modulev3: Qwen3 architecture for embeddings generation.

Image generation

flux1_modulev3: FLUX.1 diffusion architecture for image generation.
flux2: FLUX.2 diffusion architecture for image generation.
flux2_modulev3: FLUX.2 diffusion architecture for image generation.
qwen_image: Qwen-Image diffusion architecture for image generation.
qwen_image_edit: Qwen-Image-Edit diffusion architecture for image editing.
wan: Wan diffusion architecture for video generation.
z_image_modulev3: Z-Image diffusion architecture for image generation.