IMPORTANT: To view this page as Markdown, append `.md` to the URL (e.g. /get-started.md). For the complete documentation index, see llms.txt.

Mojo module

quantization

`comptime` values

`logger`

comptime logger = Logger(stdout, prefix=String(""), source_location=False)

Structs

GGMLQ40Dequantize
GGMLQ4KDequantize
GGMLQ6KDequantize
QMatmulGPU_b4_g128
QMatmulGPU_b4_g32
QMatmulGPURepackGGUF
QMatmulGPURepackGPTQ_b4_g128
QMatmulGPURepackGPTQ_b4_g128_desc_act
QuantizeDynamicScaledFloat8
QuantizeStaticScaledFloat8
QuantizeTensorDynamicScaledFloat8
ResizeBicubic
ResizeLinear
ResizeNearest
RMSNormFusedQuantizeDynamicScaledFP8
Struct_dequant_mxfp4
Struct_grouped_quantize_dynamic_block_scaled
Struct_interleave_block_scales
Struct_mxfp4_preshuffle_b_5d: Run the AMD CDNA4 MXFP4 B 5D preshuffle as a custom op.
Struct_mxfp4_preshuffle_scale_4d_per_expert: Per-step A-scale preshuffle for the AMD CDNA4 preb grouped matmul.
Struct_quantize_dynamic_block_scaled
Struct_quantize_dynamic_block_scaled_mxfp4
Struct_unfused_qkv_matmul_ragged_paged_gguf_quantized
VroomQ40Matmul
VroomQ40RepackWeights
VroomQ4KMatmul
VroomQ4KRepackWeights
VroomQ6KMatmul
VroomQ6KRepackWeights

Functions
