Model
#include "max/c/model.h"
Functionsβ
M_newCompileConfig()
β
M_CompileConfig *M_newCompileConfig()
Creates an object you can use to configure model compilation.
You need M_CompileConfig
as an argument for several functions, including M_setModelPath()
, M_setTorchInputSpecs()
, and M_compileModel()
.
-
Returns:
A pointer to a new compilation configuration. You are responsible for the memory associated with the pointer returned. You can deallocate the memory by calling
M_freeCompileConfig()
. This compilation configuration can only be used for a single compilation call. Any subsequent compilations must be passed a newM_CompileConfig
(created by callingM_newCompileConfig()
again).
M_cloneCompileConfig()
β
M_CompileConfig *M_cloneCompileConfig(M_CompileConfig *other)
Clones an object you can use to configure model compilation.
-
Returns:
A pointer to a deep-cloned compilation configuration. You are responsible for the memory associated with the pointer returned. You can deallocate the memory by calling
M_freeCompileConfig()
. This compilation configuration can only be used for a single compilation call. Any subsequent compilations must be passed a newM_CompileConfig
(created by callingM_newCompileConfig()
orM_cloneCompileConfig()
again).
M_setModelPath()
β
void M_setModelPath(M_CompileConfig *compileConfig, const char *path)
Sets the path to a model.
You must call this before you call M_compileModel()
. Otherwise, M_compileModel()
returns an error in status
.
Note: PyTorch models must be in TorchScript format.
-
Parameters:
- compileConfig β The compilation configuration for your model, from
M_newCompileConfig()
. - path β The path to your model. The model does not need to exist on the filesystem at this point. This follows the same semantics and expectations as
std::filesystem::path
.
- compileConfig β The compilation configuration for your model, from
M_newModelSource()
β
M_ModelSource *M_newModelSource(void *source, M_FrameworkFormat format)
Creates an opaque torchscript model representation.
-
Parameters:
- source β A pointer to the model representation.
- format β The framework format matching the model representation.
-
Returns:
A pointer to the opaque model representation. You are responsible for the memory associated with the pointer returned. You can deallocate the memory by calling
M_freeModelSource()
.
M_setModelSource()
β
void M_setModelSource(M_CompileConfig *compileConfig, M_ModelSource *modelSource)
Sets the opaque representation of the model for compilation.
You must call this or M_setModelPath()
before you call M_compileModel()
. Otherwise, M_compileModel()
returns an error in status
.
-
Parameters:
- compileConfig β The compilation configuration for your model, from
M_newCompileConfig()
. - M_ModelSource β The opaque representation of your model.
- compileConfig β The compilation configuration for your model, from
M_enableVisualization()
β
void M_enableVisualization(M_CompileConfig *compileConfig, const char *path)
Enables visualization.
When enabled, a maxviz file is generated and saved to a specified output directory. If no output directory is specified, the output file is saved to present working directory. The output file can be used as input to Netron to visualize a model graph.
-
Parameters:
- compileConfig β The compilation configuration for your model, from
M_newCompileConfig()
. - path β The path specified for the output visualization directory. This follows the same semantics and expectations as
std::filesystem::path
.
- compileConfig β The compilation configuration for your model, from
M_compileModel()
β
M_AsyncCompiledModel *M_compileModel(const M_RuntimeContext *context, M_CompileConfig **compileConfig, M_Status *status)
Compiles a model.
This immediately returns an M_AsyncCompiledModel
, with compilation happening asynchronously. If you need to block to await compilation, you can then call M_waitForCompilation()
.
You must call M_setModelPath()
before you call this. For example:
M_CompileConfig *compileConfig = M_newCompileConfig();
M_setModelPath(compileConfig, modelPath);
M_AsyncCompiledModel *compiledModel =
M_compileModel(context, &compileConfig, status);
if (M_isError(status)) {
logError(M_getError(status));
return EXIT_FAILURE;
}
When using a TorchScript model, you must also specify the input shapes via M_setTorchInputSpecs()
before you compile it.
The M_AsyncCompiledModel
returned here is not ready for inference yet. You need to then initialize the model with M_initModel()
.
-
Parameters:
- context β The runtime context, from
M_newRuntimeContext()
. - compileConfig β Address of compilation configuration for your model created with
M_newCompileConfig()
, and with the model set viaM_setModelPath()
. Ownership of configuration is handed over to API. - status β The status used to report errors in the case of failures during model compilation.
- context β The runtime context, from
-
Returns:
A pointer to an
M_AsyncCompiledModel
. You are responsible for the memory associated with the pointer returned. You can deallocate the memory by callingM_freeCompiledModel()
. If the config is invalid, it returns aNULL
pointer. If the model compilation fails, the pointer isNULL
and thestatus
parameter contains an error message.compileConfig
will be reset toNULL
after this call irrespective of status and cannot be reused, and any subsequent calls must take a newM_CompileConfig
.
M_waitForCompilation()
β
void M_waitForCompilation(M_AsyncCompiledModel *compiledModel, M_Status *status)
Blocks execution until the model is compiled.
This waits for the async compiled model to be complete after calling M_compileModel()
. When this function returns, the model is resolved to either a compiled model or an error.
-
Parameters:
- compiledModel β The model received from
M_compileModel()
. - status β The status used to report errors in the case of failures.
- compiledModel β The model received from
M_compileModelSync()
β
M_AsyncCompiledModel *M_compileModelSync(const M_RuntimeContext *context, M_CompileConfig **compileConfig, M_Status *status)
Synchronously compiles a model.
Unlike M_compileModel()
, this blocks until model compilation is complete. It returns an M_AsyncCompiledModel
without needing to call M_waitForCompilation()
. All other setup and usage is identical to M_compileModel()
.
-
Parameters:
- context β The runtime context, from
M_newRuntimeContext()
. - compileConfig β Address of compilation configuration for your model created with
M_newCompileConfig()
, and with the model set viaM_setModelPath()
. Ownership of configuration is handed over to API. - status β The status used to report errors in the case of failures during model compilation.
- context β The runtime context, from
-
Returns:
A pointer to an
M_AsyncCompiledModel
. You are responsible for the memory associated with the pointer returned. You can deallocate the memory by callingM_freeCompiledModel()
. If the config is invalid, it returns aNULL
pointer. If the model compilation fails, the pointer isNULL
and thestatus
parameter contains an error message.compileConfig
will be reset toNULL
after this call irrespective of status and cannot be reused, and any subsequent calls must take a newM_CompileConfig
.
M_initModel()
β
M_AsyncModel *M_initModel(const M_RuntimeContext *context, const M_AsyncCompiledModel *compiledModel, const M_WeightsRegistry *weightsRegistry, M_Status *status)
Sets up a model for execution.
You can call this immediately after M_compileModel()
βyou donβt need to wait for the async compilation.
This function also returns immediately with model initialization happening asynchronously. For example:
M_AsyncModel *model = M_initModel(
context, compiledModel, weightsRegistry, status);
if (M_isError(status)) {
logError(M_getError(status));
return EXIT_FAILURE;
}
If you want to block until M_AsyncModel
is initialized, you can call M_waitForModel()
, but thatβs not necessary and you can immediately call M_executeModelSync()
.
-
Parameters:
- context β The runtime context, from
M_newRuntimeContext()
. - compiledModel β The compiled model, from
M_compileModel()
. - weightsRegistry β A mapping from weightsβ names to their data. The weights registry is used to update weights or otherwise pass weights to the model init block at runtime, without recompiling the model graph. If the model doesnβt use the weights registry, it is safe to pass as NULL
- status β The status used to report errors in the case of failures. The status contains an error only if the given context or compiled model is invalid. Other errors will not surface until the next synchronization point.
- context β The runtime context, from
-
Returns:
A pointer to an
M_AsyncModel
that holds an async value to a compiled model. You are responsible for the memory associated with the pointer returned. You can deallocate the memory by callingM_freeModel()
. If model initialization fails, thestatus
parameter contains an error message.
M_getInputNames()
β
M_TensorNameArray *M_getInputNames(const M_AsyncCompiledModel *model, M_Status *status)
Gets all input tensor names.
-
Parameters:
- model β The compiled model.
- status β The status used to report errors in the case of failures. The status contains an error only if the given model is invalid.
-
Returns:
An array of input tensor names or a
NULL
pointer if the model is invalid. IfNULL
, thestatus
parameter contains an error message. Callers are responsible for freeing the returned array by callingM_freeTensorNameArray()
.
M_getOutputNames()
β
M_TensorNameArray *M_getOutputNames(const M_AsyncCompiledModel *model, M_Status *status)
Gets all output tensor names
-
Parameters:
- model β The compiled model.
- status β The status used to report errors in the case of failures. The status contains an error only if the given model is invalid.
-
Returns:
An array of output tensor names or a
NULL
pointer if the model is invalid. IfNULL
, thestatus
parameter contains an error message. Callers are responsible for freeing the returned array by callingM_freeTensorNameArray()
.
M_getTensorNameAt()
β
const char *M_getTensorNameAt(const M_TensorNameArray *tensorNameArray, size_t index)
Gets the tensor name in tensorNameArray
at index
.
-
Parameters:
- tensorNameArray β The tensor name array.
- index β The index of the tensor name to get.
-
Returns:
A pointer to the tensor name at
index
or aNULL
pointer if the index is out of bounds, or iftensorNameArray
isNULL
. The returned string is owned bytensorNameArray
. The returned string is null terminated.
M_getModelInputSpecByName()
β
M_TensorSpec *M_getModelInputSpecByName(const M_AsyncCompiledModel *model, const char *tensorName, M_Status *status)
Gets the specifications for an input tensor by the tensorβs name.
-
Parameters:
- model β The compiled model.
- tensorName β The name of the input tensor.
- status β The status used to report errors in the case of failures. The status contains an error only if the given model or
tensorName
is invalid.
-
Returns:
A pointer to an
M_TensorSpec
, or aNULL
pointer if the model or index is invalid. IfNULL
, thestatus
parameter contains an error message.
M_getModelOutputSpecByName()
β
M_TensorSpec *M_getModelOutputSpecByName(const M_AsyncCompiledModel *model, const char *tensorName, M_Status *status)
Gets the specifications for an output tensor by the tensorβs name.
-
Parameters:
- model β The compiled model.
- tensorName β The name of the output tensor.
- status β The status used to report errors in the case of failures. The status contains an error only if the given model or
tensorName
is invalid.
-
Returns:
A pointer to an
M_TensorSpec
, or aNULL
pointer if the model or index is invalid. IfNULL
, thestatus
parameter contains an error message.
M_waitForModel()
β
void M_waitForModel(M_AsyncModel *model, M_Status *status)
Blocks execution until the model is initialized.
This waits for the model setup to finish in M_initModel()
.
-
Parameters:
- model β The model.
- status β The status used to report errors in the case of failures.
M_executeModelSync()
β
M_AsyncTensorMap *M_executeModelSync(const M_RuntimeContext *context, M_AsyncModel *initializedModel, M_AsyncTensorMap *inputs, M_Status *status)
Executes a model synchronously.
The inputs and outputs are M_AsyncTensorMap
objects to allow chaining of inference. This operation is blocking and waits until the output results are ready.
For a complete code example, see the guide to Get started in C.
-
Parameters:
- context β The runtime context.
- initializedModel β The model to execute, from
M_initModel()
. Although that function is async, you can pass theM_AsyncModel
here immediately. - inputs β The tensor inputs.
- status β The status used to report errors in the case of failures. This includes failures encountered while running the model; there is no need for an explicit synchronization point.
-
Returns:
A pointer to an
M_AsyncTensorMap
that holds the output tensors. These tensors are in a resolved state. You are responsible for the memory associated with the pointer returned. You can deallocate the memory by callingM_freeAsyncTensorMap()
. In the case that executing the model fails, thestatus
parameter contains an error message.
M_getNumModelInputs()
β
size_t M_getNumModelInputs(const M_AsyncCompiledModel *model, M_Status *status)
Gets the number of inputs for the model.
If the model is not yet resolved/ready, this function blocks execution.
You should call M_compileModel()
before calling this.
-
Parameters:
- model β The compiled model.
- status β The status used to report errors in the case of failures.
-
Returns:
The number of inputs for the model, or
0
if there is an error in getting the model metadata. If0
, thestatus
parameter contains an error message.
M_getNumModelOutputs()
β
size_t M_getNumModelOutputs(const M_AsyncCompiledModel *model, M_Status *status)
Gets the number of outputs for the model.
If the model is not yet resolved/ready, this function blocks execution.
You should call M_compileModel()
before calling this.
-
Parameters:
- model β The compiled model.
- status β The status used to report errors in the case of failures.
-
Returns:
The number of outputs for the model, or
0
if there is an error in getting the model metadata. If0
, thestatus
parameter contains an error message.
M_validateInputTensorSpec()
β
void M_validateInputTensorSpec(const M_AsyncCompiledModel *model, M_AsyncTensorMap *tensors, M_Status *status)
Validate input tensor specs for compatibility with the compiled model.
The status message shows which validation check failed for the input.
-
Parameters:
- model β The compiled model.
- tensors β The tensors whose specs need to be validated
- status β The status used to report errors in the case of failures.
-
Returns:
True if the
tensors
has valid specs for themodel
M_freeModel()
β
void M_freeModel(M_AsyncModel *model)
Deallocates the memory for the model. No-op if model
is NULL
.
-
Parameters:
model β The model to deallocate.
M_freeCompiledModel()
β
void M_freeCompiledModel(M_AsyncCompiledModel *model)
Deallocates the memory for the compiled model. No-op if model
is NULL
.
-
Parameters:
model β The compiled model to deallocate.
M_freeCompileConfig()
β
void M_freeCompileConfig(M_CompileConfig *config)
Deallocates the memory for the compile config. No-op if config
is NULL
.
-
Parameters:
config β The compilation configuration to deallocate.
M_freeModelSource()
β
void M_freeModelSource(M_ModelSource *modelSource)
Deallocates the memory for the model source. No-op if modelSource
is NULL
.
-
Parameters:
modelSource β The model source to deallocate.
M_exportCompiledModel()
β
void M_exportCompiledModel(M_AsyncCompiledModel *model, const char *path, M_Status *status)
Exports a compiled model as a MEF to a given path.
-
Parameters:
- model β The model instance to export.
- path β The path of the MEF file to export.
- status β The status used to report errors in the case of failures.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!
If you'd like to share more information, please report an issue on GitHub
π What went wrong?