For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).
Mojo struct
block_Q4_K
struct block_Q4_K
Fieldsβ
- βbase_scale (
Float16): - βbase_min (
Float16): - βq_scales_and_mins (
InlineArray[UInt8, 12]): - βq_bits (
InlineArray[UInt8, 128]):
Implemented traitsβ
comptime membersβ
group_countβ
comptime group_count = 8
group_sizeβ
comptime group_size = 32
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!