Skip to main content

Mojo function

dequant_mxfp4

dequant_mxfp4[*, SF_VECTOR_SIZE: Int = 32](ctx: DeviceContext, output: TileTensor[address_space=output.address_space, linear_idx_type=output.linear_idx_type, element_size=output.element_size], input: TileTensor[address_space=input.address_space, linear_idx_type=input.linear_idx_type, element_size=input.element_size], scales: TileTensor[address_space=scales.address_space, linear_idx_type=scales.linear_idx_type, element_size=scales.element_size], num_rows: Int, num_cols: Int, pdl_level: PDLLevel = PDLLevel())

Dequantize MXFP4 packed weights to FP8 or BF16.

Args: