This article gives an overview of how to quantize a neural network using PyTorch.
It covers the basic concepts of quantization in deep learning and the steps for converting an existing network to a quantized version. Several quantization-related APIs are also introduced.
It also reviews the shortcomings of earlier quantization methods and explains how the newer standard from Google works.