3 docs tagged with "gcp"

Deploy Llama 3 on GPU with MAX

Learn how to deploy MAX pipelines to cloud

Create a GPU-enabled Kubernetes cluster with the cloud provider of your choice and deploy Llama 3.1 with MAX using Helm.

Learn how to deploy Llama 3 on Google Cloud Run using MAX for serverless GPU inferencing