CEG4N: Counter-Example Guided Neural Network Quantization Refinement

2022-07-09 09:25:45

João Batista P. Matos Jr., Iury Bessa, Edoardo Manino, Xidan Song, Lucas C. Cordeiro

arXiv_AI

arXiv_AI Pose Quantization

Abstract
Abstract (translated)
URL
PDF

Abstract

Neural networks are essential components of learning-based software systems. However, their high compute, memory, and power requirements make using them in low resources domains challenging. For this reason, neural networks are often quantized before deployment. Existing quantization techniques tend to degrade the network accuracy. We propose Counter-Example Guided Neural Network Quantization Refinement (CEG4N). This technique combines search-based quantization and equivalence verification: the former minimizes the computational requirements, while the latter guarantees that the network's output does not change after quantization. We evaluate CEG4N~on a diverse set of benchmarks, including large and small networks. Our technique successfully quantizes the networks in our evaluation while producing models with up to 72% better accuracy than state-of-the-art techniques.

Abstract (translated)

URL

https://arxiv.org/abs/2207.04231

PDF

https://arxiv.org/pdf/2207.04231.pdf