Skip to main content
Log in

Mojo module

encodings

Implementations of quantization encodings.

Aliases

  • K_SCALE_SIZE = 12: Size of superblock scales and mins, in bytes.
  • QK_K = 256: Size of superblock quantized elements, in bytes.

Structs

Was this page helpful?