
Get started with MAX Engine

Welcome to the MAX Engine setup guide!

Within a matter of minutes, you’ll install the MAX SDK Preview and run inference with some of our code examples.

Preview release

We're excited to share this preview version of the MAX SDK! For details about what's included, see the MAX changelog, and for details about what's yet to come, see the roadmap and known issues.

Requirements

First, make sure your system meets these requirements:

  • Linux Ubuntu 20.04/22.04 LTS
  • x86-64 CPU (with SSE4.2 or newer) or AWS Graviton2/3 CPU
  • Minimum 8 GiB RAM
  • Python 3.8 - 3.11
  • g++/clang++ C++ compiler

We'll add support for macOS and Windows in future releases.

1. Install the MAX SDK

By downloading the MAX SDK, you understand and agree to the MAX software license.

Updating?

If you already installed MAX, see the update guide instead.

  1. Open a terminal and install the modular command line tool with this helper script:

    curl -s https://get.modular.com | sh -
    Or, run these manual install commands instead:
    apt-get install -y apt-transport-https &&
    keyring_location=/usr/share/keyrings/modular-installer-archive-keyring.gpg &&
    curl -1sLf 'https://dl.modular.com/bBNWiLZX5igwHXeu/installer/gpg.0E4925737A3895AD.key' | gpg --dearmor >> ${keyring_location} &&
    curl -1sLf 'https://dl.modular.com/bBNWiLZX5igwHXeu/installer/config.deb.txt?distro=debian&codename=wheezy' > /etc/apt/sources.list.d/modular-installer.list &&
    apt-get update &&
    apt-get install -y modular
  2. Sign into your Modular account:

    modular auth
  3. Install the MAX SDK:

    modular install max
  4. Install the MAX Engine Python package:

    MAX_PATH=$(modular config max.path) \
    && python3 -m pip install --find-links $MAX_PATH/wheels max-engine
  5. Set environment variables so you can access the max and mojo CLIs:

    If you're using Bash, run this command:

    MAX_PATH=$(modular config max.path) \
    && BASHRC=$( [ -f "$HOME/.bash_profile" ] && echo "$HOME/.bash_profile" || echo "$HOME/.bashrc" ) \
    && echo 'export MODULAR_HOME="'$HOME'/.modular"' >> "$BASHRC" \
    && echo 'export PATH="'$MAX_PATH'/bin:$PATH"' >> "$BASHRC" \
    && source "$BASHRC"

Okay, the MAX SDK is now installed and configured!

The MAX SDK includes:

  • The MAX Engine runtime
  • The Python, C, and Mojo API bindings
  • The max CLI tool, which you can use to benchmark and visualize your models
  • The complete Mojo SDK, including the mojo CLI tool
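
To quickly confirm that the Python package installed correctly, try importing it. This is a minimal sanity check, and it assumes the max-engine wheel exposes the max.engine module described in our Python API documentation:

    # sanity_check.py -- minimal check that the MAX Engine Python package imports.
    # Assumes the max.engine module and InferenceSession class from the Python API docs.
    from max import engine

    session = engine.InferenceSession()
    print("MAX Engine Python API is ready:", session)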

2. Run your first model

Let's start with something boring, similar to a "Hello world," just to make sure MAX Engine is working.

First, clone the code examples:

git clone https://github.com/modularml/max.git

Now run an example that performs inference with our Python API, using a model from PyTorch or TensorFlow. The one below uses PyTorch (TorchScript):

This example uses a version of BERT that's trained to predict the masked words in a sentence.

  1. Starting from where you cloned the repo, go into the example and install the Python requirements:

    cd max/examples/inference/bert-python-torchscript
    python3 -m pip install -r requirements.txt
  2. Download and run the model with this script:

    bash run.sh

    This script downloads the BERT model and runs it with some input text.

You should see results like this:

input text: Paris is the [MASK] of France.
filled mask: Paris is the capital of France.

Cool, it works! (If it didn't work, let us know.)

Compile time

The first time you run an example, it will take some time to compile the model. This might seem strange if you're used to "eager execution" in PyTorch or TensorFlow, but this is where MAX Engine optimizes the graph to deliver more performance. This happens only when you load the model, and it's an up-front cost that pays dividends with major latency savings at run time.
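
If you're curious, you can see this cost yourself by timing the load step separately from inference. Here's a minimal sketch, assuming the InferenceSession and load names from our Python API guide, with a placeholder model path:

    import time
    from max import engine

    session = engine.InferenceSession()

    start = time.perf_counter()
    model = session.load("path/to/model")  # graph compilation happens here, once per load
    print(f"Load + compile: {time.perf_counter() - start:.1f} s")
    # Subsequent model.execute() calls run on the already-compiled graph.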

This wasn't meant to blow your mind with performance. It's just an API example that shows how to use our Python API to load and run a PyTorch (TorchScript) or TensorFlow (SavedModel) model, so there's no benchmark measurement.

Rest assured, MAX Engine does execute models very fast: sometimes more than 3x faster than the stock frameworks, without any model changes. For example, check out our performance with Mistral 7B in figure 1.

Figure 1. MAX Engine latency with Mistral-7B vs PyTorch (lower is better). P50, P90, P95, and P99 latencies are the average latency for the 50th, 90th, 95th, and 99th percentile of inferences across a fixed period of time. For more charts like this, check out performance.modular.com.

But seeing is believing. So, we created a program that runs MAX Engine head-to-head with TensorFlow and PyTorch.

3. Run the performance showcase

The premise for this program is simple: It runs the same model (downloaded from HuggingFace) in TensorFlow, PyTorch, and MAX Engine, and measures the average execution time over several inferences.
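
Under the hood, each measurement boils down to timing a fixed number of inference calls and dividing by the elapsed time. Here's a rough sketch of that idea (run_inference is a hypothetical stand-in for any one framework's predict call, not a function from the showcase code):

    import time

    def measure_qps(run_inference, warmup=10, iterations=100):
        # Warm up first so one-time costs (caching, lazy init) don't skew the result.
        for _ in range(warmup):
            run_inference()
        start = time.perf_counter()
        for _ in range(iterations):
            run_inference()
        elapsed = time.perf_counter() - start
        return iterations / elapsed  # queries per second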

Let's go!

  1. Starting again from where you cloned the repo, change directories and install the requirements:

    cd max/examples/performance-showcase
    python3 -m pip install -r requirements.txt
  2. Now start the showcase by specifying the model to run:

    python3 run.py -m roberta

This might take a few minutes the first time you run it.

When it's done, you'll see the inference queries per second (QPS; higher is better) listed for each runtime, like this (results vary based on hardware):

Running with TensorFlow
.............................................................. QPS: 15.07

Running with PyTorch
.............................................................. QPS: 18.41

Running with MAX Engine
Compiling model.
Done!
.............................................................. QPS: 33.11

MAX Performance

There are no tricks here! (See the code for yourself.) MAX Engine wins because our compiler uses next-generation technology to optimize the graph and extract more performance, without any accuracy loss. And our performance will only get faster and faster in future versions! If you got slow results, see this answer.

To start using MAX Engine in your own project, just drop in the MAX Engine API and start calling it for each inference request. For details, see how to run inference with Python or with C.
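
As a rough illustration, here's what that drop-in pattern can look like with our Python API. This is a sketch rather than code from the repo: it assumes the InferenceSession, load, and execute names from the Python inference guide, and the model path and input tensor name are placeholders you'd replace with your own:

    import numpy as np
    from max import engine

    session = engine.InferenceSession()

    # Load (and compile) the model once at startup.
    # TorchScript models may also need input specs -- see the Python API guide.
    model = session.load("path/to/your_model")

    def handle_request(input_array: np.ndarray):
        # Run one inference per request; the keyword must match your
        # model's input tensor name ("input" here is just a placeholder).
        return model.execute(input=input_array)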

But, maybe you're thinking we're showing only the models that make us look good here. Well, see for yourself by benchmarking any model!

4. Benchmark any model

With the benchmark tool, you can benchmark any compatible model with an MLPerf scenario. Just pass it a TensorFlow SavedModel, PyTorch TorchScript, or ONNX model; the tool runs the model several times with generated inputs (or inputs you provide) and prints the results.

note

TensorFlow models must be in SavedModel format and PyTorch models must be in TorchScript format.

For example, here’s how to benchmark an example model from HuggingFace:

  1. Download the model with this script in our GitHub repo:

    cd max/examples/tools/common/resnet50-tensorflow
    bash download-model.sh --output resnet50
  2. Then benchmark the model:

    max benchmark resnet50

This compiles the model, runs it several times, and prints the benchmark results. (Again, it might take a few minutes to compile the model before benchmarking it.)

The results are a bit long, so this is just part of what you should see (results vary based on hardware):

================================================
Additional Stats
================================================
QPS w/ loadgen overhead : 79.653
QPS w/o loadgen overhead : 79.718

Min latency (ns) : 12261815
Max latency (ns) : 17839301
Mean latency (ns) : 12544217
50.00 percentile latency (ns) : 12502976
90.00 percentile latency (ns) : 12726310
95.00 percentile latency (ns) : 12824830
97.00 percentile latency (ns) : 12919271
99.00 percentile latency (ns) : 13486430
99.90 percentile latency (ns) : 17839301

Now try benchmarking your own model!

Just be aware that the benchmark tool needs to know the model's input shapes so it can generate inputs, and not all models provide input shape metadata. If your model doesn't include that metadata, then you need to specify the input shapes. Or, you can provide your own input data in a NumPy file. Learn more in the benchmark guide.
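
For example, here's one way you might generate an input file with NumPy (the shape, dtype, and file name below are placeholders for illustration, not values the benchmark tool requires):

    import numpy as np

    # One ResNet-50-style image batch: adjust the shape/dtype to match your model's input spec.
    batch = np.random.rand(1, 224, 224, 3).astype(np.float32)
    np.save("input_batch.npy", batch)
    # Then point the benchmark tool at this file, as described in the benchmark guide.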

Share Feedback

We’d love to hear about your experience benchmarking other models. If you have any issues, let us know.

Next steps

That's not all you can do. There's plenty more documentation to explore.

And this is just the beginning!

In the coming months, we'll add support for GPU hardware, more quantized models, MAX SDK for macOS and Windows, and more production-ready solutions in the Enterprise Edition.

Also, we're aware that MAX has some sharp edges, some features aren't quite done, and others don't exist yet. For details about the known issues and features we're working on, please see the roadmap and known issues.

Join the discussion

Get in touch with other MAX developers, ask questions, and share feedback on Discord and GitHub.