linear is fine only if you have enough resolution; if most of your weights are close to zero, you risk quantizing all of them to 0 (or, even worse, to some non-zero value if you quantize linearly over the min-max range)
of course, everything depends on the structure of the data to be quantized
but generally speaking, I'd pick a quantization scheme that lowers MSE any time
removing outliers seems dangerous, but I may be wrong
quantization-aware training seems like a great idea though
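to make both failure modes concrete, here is a toy sketch (the weight distribution, the 4-bit width, and both helper functions are made up for illustration): with a couple of outliers stretching the range, linear min-max quantization collapses the whole near-zero bulk onto a single level that isn't zero, while a symmetric zero-preserving scheme collapses the bulk to exactly 0.0 instead

```python
import numpy as np

rng = np.random.default_rng(42)
# made-up weight vector: a bulk near zero plus two outliers
w = np.concatenate([rng.normal(0.0, 0.01, 1000), [-1.0, 3.0]])

def dequant_minmax(w, bits=4):
    # linear quantization over [min, max]; zero need not be representable
    lo, hi = w.min(), w.max()
    step = (hi - lo) / (2**bits - 1)
    return np.round((w - lo) / step) * step + lo

def dequant_symmetric(w, bits=4):
    # symmetric quantization around zero; 0.0 always maps exactly to 0.0
    levels = 2**(bits - 1) - 1
    scale = np.abs(w).max() / levels
    return np.clip(np.round(w / scale), -levels, levels) * scale

bulk_minmax = dequant_minmax(w)[:1000]
bulk_sym = dequant_symmetric(w)[:1000]
print(np.unique(bulk_minmax))  # the whole bulk lands on one non-zero level
print(np.unique(bulk_sym))     # the whole bulk lands on exactly 0.0
```

either way the bulk loses all its information at this resolution, which is why a scheme chosen to minimize MSE on the actual weight distribution matters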
Neural network quantization
Moderators: hgm, Rebel, chrisw
- Posts: 2559
- Joined: Fri Nov 26, 2010 2:00 pm
- Location: Czech Republic
- Full name: Martin Sedlak
Re: Neural network quantization
Martin Sedlak
- Posts: 217
- Joined: Fri Apr 11, 2014 10:45 am
- Full name: Fabio Gobbato
Re: Neural network quantization
I have tried using quantized weights in training, but only when I calculate the error of the network, and it seems to work. I still have to try int8, but it works well with int32.
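a minimal sketch of this idea as I read it, in the straight-through style: keep float master weights, quantize them only when computing the network error, and apply the resulting gradient to the float weights. The toy regression setup, the `fake_quant` helper, and all hyperparameters are assumptions for illustration, not the actual training code.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(256, 4))
true_w = np.array([0.5, -0.3, 0.8, 0.1])
y = X @ true_w

def fake_quant(w, bits=8):
    # round weights to a symmetric int grid and back to float
    levels = 2**(bits - 1) - 1
    scale = max(np.abs(w).max(), 1e-12) / levels
    return np.round(w / scale) * scale

w = np.zeros(4)            # float master weights
lr = 0.1
for _ in range(200):
    wq = fake_quant(w)     # quantized weights used only for the error
    err = X @ wq - y
    grad = X.T @ err / len(X)  # straight-through: gradient w.r.t. wq applied to w
    w -= lr * grad

print(np.mean((X @ fake_quant(w) - y)**2))
```

with 8 bits the residual error after training stays small; the interesting test is how far this degrades at int8 activations or lower bit widths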
Thank you!