Python class
AudioGenerationOutput
class max.interfaces.AudioGenerationOutput(final_status, steps_executed, audio_data=<factory>, buffer_speech_tokens=None, metadata=<factory>)
Bases: Struct
Represents a response from the audio generation API.
This class encapsulates the result of an audio generation request, including the final status, generated audio data, and optional buffered speech tokens.
Parameters:
audio_data
The generated audio data, if available.
buffer_speech_tokens
buffer_speech_tokens: ndarray[tuple[Any, ...], dtype[integer[Any]]] | None
Buffered speech tokens, if available.
final_status
final_status: GenerationStatus
The final status of the generation process.
is_done
property is_done: bool
Indicates whether the audio generation process is complete.
Returns:
True if generation is done, False otherwise.
metadata
metadata: AudioGenerationMetadata
Metadata associated with the audio generation, such as chunk information, prompt details, or other relevant context.
steps_executed
steps_executed: int
The number of steps previously executed.
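As a rough illustration of how a caller might consume a sequence of these outputs, the sketch below gathers audio chunks until one reports is_done. It does not import max.interfaces: StubStatus and StubAudioOutput are hypothetical stand-ins that only mirror the field names documented above, the status member names are assumed, plain lists stand in for array data, and the rule that "done" means "no longer active" is an assumption rather than documented behavior.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Optional


class StubStatus(Enum):
    """Hypothetical stand-in for GenerationStatus; member names are assumed."""
    ACTIVE = "active"
    FINISHED = "finished"


@dataclass
class StubAudioOutput:
    """Hypothetical stand-in mirroring the documented field names."""
    final_status: StubStatus
    steps_executed: int
    audio_data: list = field(default_factory=list)  # list used in place of array data
    buffer_speech_tokens: Optional[list] = None      # may be absent, per the docs
    metadata: dict = field(default_factory=dict)     # dict used in place of AudioGenerationMetadata

    @property
    def is_done(self) -> bool:
        # Assumption: generation is complete once the status is not ACTIVE.
        return self.final_status is not StubStatus.ACTIVE


def collect_audio(outputs):
    """Gather audio samples from each output, stopping at the first finished one."""
    samples = []
    consumed = 0
    for out in outputs:
        samples.extend(out.audio_data)
        consumed += 1
        if out.is_done:
            break
    return samples, consumed


chunks = [
    StubAudioOutput(StubStatus.ACTIVE, steps_executed=2, audio_data=[0.1, 0.2]),
    StubAudioOutput(StubStatus.FINISHED, steps_executed=1, audio_data=[0.3]),
    StubAudioOutput(StubStatus.ACTIVE, steps_executed=5, audio_data=[9.9]),
]
samples, consumed = collect_audio(chunks)
print(samples, consumed)
```

The loop checks is_done after appending the chunk, so the audio carried by the final output is kept and anything arriving afterwards is ignored.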