max
The max command line tool allows you to create an OpenAI-compatible endpoint
with a simple max serve command. It also includes a command to benchmark your
endpoint using built-in datasets or your own dataset.
To install the max CLI, install the modular package as shown
in the install guide.
Usage
max [OPTIONS] COMMAND [ARGS]...Options
-
--log-level <log_level>Set logging level explicitly (ignored if –verbose or –quiet is used).
-
Options:
-
DEBUG | INFO | WARNING | ERROR
-
-
--versionShow the MAX version and exit.
Commands
-
Run benchmark tests on a serving model.
-
Encode text input into model embeddings.
-
Generate text using the specified model.
-
list:List available pipeline configurations and…
-
Start a model serving endpoint for inference.
-
Load and compile the model to prepare caches.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!