Welcome to

MAX simplifies the process to deploy your own AI endpoint. Try it now with these tutorials:

Explore

What is MAX?

An overview of MAX, what it does, and how to use it.

Tutorials

Step-by-step programming guides using MAX APIs.

API reference

Python, C, and Mojo API libraries for MAX.

MAX Engine

An introduction to the features and technology in MAX Engine.

MAX Serve

An introduction to our model serving library called MAX Serve.

Extensibility

Coming soon: an API to write custom ops for both CPUs and GPUs.

Quantization

Learn more about our support for quantized models.

Go to blog

Evaluating Llama Guard with MAX 24.6 and Hugging Face

Build a Continuous Chat Interface with Llama 3 and MAX Serve

Introducing MAX 24.6: A GPU Native Generative AI Platform

MAX GPU: State of the Art Throughput on a New GenAI platform

Chat with Documents Using Llama3.1, RAG, and MAX

Why Magic?