For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).

Mojo struct

Struct_batched_matmul_dynamic_scaled_fp8

struct Struct_batched_matmul_dynamic_scaled_fp8

Registers the mo.batched.matmul.dynamic.scaled.fp8 graph op with the graph compiler.

Implemented traits

AnyType, ImplicitlyDeletable

Methods

`execute`

static def execute[c_type: DType, a_type: DType, b_type: DType, a_scales_type: DType, b_scales_type: DType, //, input_scale_granularity: StringSlice[ImmStaticOrigin], weight_scale_granularity: StringSlice[ImmStaticOrigin], m_scale_granularity: Int, n_scale_granularity: Int, k_scale_granularity: Int, target: StringSlice[ImmStaticOrigin]](c: ManagedTensorSlice[IOSpec[_, _].Output, static_spec=c.static_spec], a: ManagedTensorSlice[IOSpec[_, _].Input, static_spec=a.static_spec], b: ManagedTensorSlice[IOSpec[_, _].Input, static_spec=b.static_spec], a_scales: ManagedTensorSlice[IOSpec[_, _].Input, static_spec=a_scales.static_spec], b_scales: ManagedTensorSlice[IOSpec[_, _].Input, static_spec=b_scales.static_spec], context: DeviceContext)

Implemented traits​

Methods​

execute​

Implemented traits

Methods

`execute`