Skip to main content

module

encodings

Implementations of quantization encodings.

Aliases

  • QK_K = 256: Size of superblock quantized elements, in bytes.
  • K_SCALE_SIZE = 12: Size of superblock scales and mins, in bytes.

Structs