Mojo package
quantization
This package contains a set of APIs for quantizing tensor data.
Quantization is a technique used to reduce the precision of floating-point numbers, which are used in most neural networks. Quantization is a type of lossy compression, which means that some precision is lost, but the resulting tensors take less memory and computations are faster.
Modules
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!