jrcavani

Quantize Resnet50 model with TensorRT

Intro

TensorRT supports two approaches to prepare model for Quantization - Calibration or Training

First we need to add/replace regular model nn.Layers with TRT pytorch_quantization.nn layers. Quantization layers will gather statistics required for quantization.

Once the model is modified we can use the following approaches to gather statistics before quantization:

Calibrate pre-trainer model
Train (1 epoch) pre-trainer model

	In [1]: import faiss

	In [2]: import numpy as np

	In [4]: d = 32

	In [5]: m = 4

	In [6]: c = 6