IMPORTANT: To view this page as Markdown, append `.md` to the URL (e.g. /max/get-started.md). For the complete documentation index, see llms.txt.
Skip to main content
For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).

Mojo struct

DecodeSM100PVSS_FP8

struct DecodeSM100PVSS_FP8[operand_type: DType, accum_type: DType, *, config: MLA_SM100_Decode_Config, p_swizzle: TensorMapSwizzle = TensorMapSwizzle.SWIZZLE_64B]

Implemented traits​

AnyType, Copyable, ImplicitlyCopyable, ImplicitlyDeletable, Movable, RegisterPassable, TrivialRegisterPassable

comptime members​

BK​

comptime BK = config.BK_PV

BM​

comptime BM = config.BM

BN_PV​

comptime BN_PV = DecodeSM100PVSS_FP8[operand_type, accum_type, config=config, p_swizzle=p_swizzle].MMA_N

MMA_K​

comptime MMA_K = 32

MMA_M​

comptime MMA_M = config.MMA_M

MMA_N​

comptime MMA_N = config.MMA_PV_N

num_k_mmas​

comptime num_k_mmas = (config // Int(32))

operand_size​

comptime operand_size = size_of[operand_type]()

UMMAPVSS​

comptime UMMAPVSS = UMMAInsDescriptor.create[accum_type, operand_type, operand_type, Index[Int, Int, dtype=DType.uint32](config, config), transpose_b=False]()

Methods​

descriptor_v_block​

static def descriptor_v_block(kv_smem: UnsafePointer[Scalar[operand_type], MutAnyOrigin, address_space=AddressSpace.SHARED]) -> MMASmemDescriptorPair

Returns:

MMASmemDescriptorPair

descriptor_p_block​

static def descriptor_p_block(p_smem: UnsafePointer[Scalar[operand_type], MutAnyOrigin, address_space=AddressSpace.SHARED]) -> MMASmemDescriptorPair

Returns:

MMASmemDescriptorPair

mma​

static def mma[*, stage_idx: Int = Int(0)](a: MMASmemDescriptorPair, b: MMASmemDescriptorPair, c: UInt32, *, c_scale: UInt32, elect: Int32)