This article provides an overview of how to quantize a network using PyTorch.
This document provides an overview of the shortcomings of previous quantization methods and how the new standard provided by Google works.
This tutorial contains the basic concepts of quantization in deep learning and the steps of how to transfer an existing network to a quantized version. Some Quantization-related APIs will also be introduced.