For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).

Mojo struct

RMSNormFusedQuantizeDynamicScaledFP8

struct RMSNormFusedQuantizeDynamicScaledFP8

Registers the mo.composite.rms_norm_fused_quantize_dynamic_scaled_fp8 graph op with the graph compiler.

Implemented traits

AnyType, ImplicitlyDeletable

Methods

`execute`

static def execute[input_dtype: DType, output_dtype: DType, scale_dtype: DType, rank: Int, target: StringSlice[ImmStaticOrigin]](output: ManagedTensorSlice[IOSpec[_, _].Output, static_spec=output.static_spec], scales: ManagedTensorSlice[IOSpec[_, _].Output, static_spec=scales.static_spec], input: ManagedTensorSlice[IOSpec[_, _].FusedInput, static_spec=input.static_spec], gamma: ManagedTensorSlice[IOSpec[_, _].Input, static_spec=gamma.static_spec], epsilon: Float32, weight_offset: Scalar[input_dtype], scale_ub: Float32, ctx: DeviceContext)

Implemented traits​

Methods​

execute​

Implemented traits

Methods

`execute`