Mojo module

fp4_utils

comptime values

E2M1_TO_FLOAT32

comptime E2M1_TO_FLOAT32 = SIMD(0, 0.5, 1, 1.5, 2, 3, 4, 6, -0.0, -0.5, -1, -1.5, -2, -3, -4, -6, Tuple())
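The sixteen entries above follow the E2M1 layout (1 sign bit, 2 exponent bits, 1 mantissa bit, exponent bias 1), indexed by the 4-bit code. The following is a minimal Python sketch, not the module's Mojo implementation, that mirrors the table and checks it against the bit-level formula. The helper names `decode_e2m1_bits` and `unpack_byte` are hypothetical, and the low-nibble-first packing order is an assumption, not something this page states.

```python
# Python mirror of the E2M1_TO_FLOAT32 lookup table (index = 4-bit code).
E2M1_TO_FLOAT32 = [
    0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0,
    -0.0, -0.5, -1.0, -1.5, -2.0, -3.0, -4.0, -6.0,
]


def decode_e2m1_bits(code):
    """Decode a 4-bit E2M1 code arithmetically (hypothetical helper)."""
    sign = -1.0 if code & 0b1000 else 1.0
    exponent = (code >> 1) & 0b11
    mantissa = code & 0b1
    if exponent == 0:
        # Subnormal range: no implicit leading 1, so the value is 0 or 0.5.
        magnitude = 0.5 * mantissa
    else:
        # Normal range: implicit leading 1, exponent bias of 1.
        magnitude = (1.0 + 0.5 * mantissa) * 2.0 ** (exponent - 1)
    return sign * magnitude


def unpack_byte(byte):
    """Split one byte into two FP4 values.

    Assumption: the low nibble holds the first element.
    """
    low = byte & 0x0F
    high = (byte >> 4) & 0x0F
    return E2M1_TO_FLOAT32[low], E2M1_TO_FLOAT32[high]
```

A table lookup avoids the branch on the subnormal case, which is presumably why the module exposes the values as a constant vector.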

MXFP4_SF_DTYPE

comptime MXFP4_SF_DTYPE = DType.float8_e8m0fnu

MXFP4_SF_VECTOR_SIZE

comptime MXFP4_SF_VECTOR_SIZE = 32

MXFP8_SF_DTYPE

comptime MXFP8_SF_DTYPE = DType.float8_e8m0fnu

MXFP8_SF_VECTOR_SIZE

comptime MXFP8_SF_VECTOR_SIZE = 32

NVFP4_SF_DTYPE

comptime NVFP4_SF_DTYPE = DType.float8_e4m3fn

NVFP4_SF_VECTOR_SIZE

comptime NVFP4_SF_VECTOR_SIZE = 16
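The vector-size constants suggest block scaling: each run of SF_VECTOR_SIZE consecutive elements shares one scale factor. The Python sketch below illustrates this for the E8M0 scale type used by the MXFP4 and MXFP8 constants, assuming the standard E8M0 encoding (8 exponent bits, bias 127, 0xFF reserved for NaN). The function names are hypothetical, and decoding of the NVFP4 `float8_e4m3fn` scale is not shown.

```python
# Values copied from the constants on this page.
MXFP4_SF_VECTOR_SIZE = 32
NVFP4_SF_VECTOR_SIZE = 16


def decode_e8m0(byte):
    """Decode an E8M0 scale factor: a pure power of two with bias 127."""
    if byte == 0xFF:
        return float("nan")
    return 2.0 ** (byte - 127)


def num_scale_factors(k, sf_vector_size):
    """Scale factors needed for k elements (ceiling division)."""
    return -(-k // sf_vector_size)


def dequantize_block(values, scale):
    """Apply one shared scale factor to a block of decoded elements."""
    return [v * scale for v in values]
```

For example, under these assumptions a row of 64 elements would need 2 scale factors with a vector size of 32 and 4 with a vector size of 16.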

SF_ATOM_K

comptime SF_ATOM_K = 4

SF_ATOM_M

comptime SF_ATOM_M = Tuple(VariadicPack(32, 4))

SF_K_GROUP_SIZE

comptime SF_K_GROUP_SIZE[SF_VECTOR_SIZE: Int] = (4 * SF_VECTOR_SIZE)

Parameters

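  • SF_VECTOR_SIZE (Int): The number of elements that share one scale factor, for example MXFP4_SF_VECTOR_SIZE or NVFP4_SF_VECTOR_SIZE.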

SF_MN_GROUP_SIZE

comptime SF_MN_GROUP_SIZE = ((load_from_mem SF_ATOM_M.__getitem__[0]()) * (load_from_mem SF_ATOM_M.__getitem__[1]()))
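The two group-size constants reduce to simple arithmetic: SF_MN_GROUP_SIZE is 32 * 4 = 128, and SF_K_GROUP_SIZE is 4 * SF_VECTOR_SIZE (64 for NVFP4, 128 for MXFP4 and MXFP8). The Python sketch below reproduces those values. The `scale_factor_groups` helper is hypothetical: it assumes that a scale-factor tensor is tiled into groups of these sizes along the M and K dimensions, which this page does not state.

```python
# Values copied from the constants on this page.
SF_ATOM_M = (32, 4)
SF_ATOM_K = 4
SF_MN_GROUP_SIZE = SF_ATOM_M[0] * SF_ATOM_M[1]  # 32 * 4 = 128


def sf_k_group_size(sf_vector_size):
    """Mirror of SF_K_GROUP_SIZE[SF_VECTOR_SIZE] = 4 * SF_VECTOR_SIZE."""
    return 4 * sf_vector_size


def ceildiv(a, b):
    """Ceiling division for non-negative integers."""
    return -(-a // b)


def scale_factor_groups(m, k, sf_vector_size):
    """Hypothetical helper: groups needed to cover an M x K operand.

    Assumption: one group spans SF_MN_GROUP_SIZE rows and
    sf_k_group_size(sf_vector_size) elements along K.
    """
    return (
        ceildiv(m, SF_MN_GROUP_SIZE),
        ceildiv(k, sf_k_group_size(sf_vector_size)),
    )
```

Under that assumption, a 256 x 512 operand with a vector size of 16 would be covered by 2 groups along M and 8 along K.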

Functions