Python module
max.pipelines.lora
LoRA adapter management for MAX pipelines.
Adapter management
| Name | Description |
|---|---|
| LoRAConfig | Configuration for LoRA (Low-Rank Adaptation) inference. |
| LoRAManager | Manages multiple LoRA models and buffers for the forward pass. |
| LoRAModel | Manages LoRA weights and configuration for a single adapter. |
LoRA types
| Name | Description |
|---|---|
| LoRAOperation | Enum of the different LoRA operations. |
| LoRARequest | Container for LoRA adapter requests. |
| LoRAResponse | Response from LoRA operations. |
| LoRAStatus | Enum for LoRA operation status. |
| LoRAType | Enum of LoRA types. |
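| Name | Description |
|---|---|
| LORA_REQUEST_ENDPOINT | String constant (str). Judging by its name, the endpoint for LoRA requests; its value is not listed on this page. |
| LORA_RESPONSE_ENDPOINT | String constant (str). Judging by its name, the endpoint for LoRA responses; its value is not listed on this page. |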
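
This page lists names without signatures, so the sketch below does not call any of them. It only checks which of the listed names an installed copy of max.pipelines.lora actually exports. The helper check_lora_exports and the EXPECTED_NAMES list are hypothetical and exist only for this illustration. If the module cannot be imported, the helper returns an empty dict.

```python
import importlib

# Names copied from the tables on this page; nothing else is assumed
# about their signatures or behavior.
EXPECTED_NAMES = [
    "LoRAConfig",
    "LoRAManager",
    "LoRAModel",
    "LoRAOperation",
    "LoRARequest",
    "LoRAResponse",
    "LoRAStatus",
    "LoRAType",
    "LORA_REQUEST_ENDPOINT",
    "LORA_RESPONSE_ENDPOINT",
]


def check_lora_exports(module_name="max.pipelines.lora"):
    """Hypothetical helper: map each listed name to whether the module has it.

    Returns an empty dict when the module is not importable.
    """
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return {}
    return {name: hasattr(module, name) for name in EXPECTED_NAMES}


if __name__ == "__main__":
    exports = check_lora_exports()
    if not exports:
        print("max.pipelines.lora is not importable in this environment")
    for name, present in exports.items():
        print(f"{name}: {'found' if present else 'missing'}")
```

For the real constructor arguments and methods, call help() on the imported class or see the per-class reference pages.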