Binary Neural Networks as a general-propose compute paradigm for on-device computer vision

2022-02-08 08:38:22

Guhong Nie (1), Lirui Xiao (1), Menglong Zhu (1), Dongliang Chu (1), Yue Shen (1), Peng Li (1), Kang Yang (1), Li Du (2), Bo Chen (1) ((1) DJI Innovations Inc, (2) School of Electronic Science and Engineering, Nanjing University)

arXiv_CV

arXiv_CV Segmentation Detection Super_Resolution Classification Inference Pose Quantization Matching

Abstract

For binary neural networks (BNNs) to become the mainstream on-device computer vision algorithm, they must achieve a better speed-vs-accuracy tradeoff than 8-bit quantization and establish a similar degree of general applicability across vision tasks. To this end, we propose a BNN framework comprising 1) a minimalistic inference scheme for hardware-friendliness, 2) an over-parameterized training scheme for high accuracy, and 3) a simple procedure to adapt to different vision tasks. The resulting framework overtakes 8-bit quantization in the speed-vs-accuracy tradeoff for classification, detection, segmentation, super-resolution and matching: our BNNs not only retain the accuracy levels of their 8-bit baselines but also achieve 1.3-2.4$\times$ higher FPS on mobile CPUs. Similar conclusions can be drawn for prototypical systolic-array-based AI accelerators, where our BNNs promise 2.8-7$\times$ fewer execution cycles than 8-bit quantization and 2.1-2.7$\times$ fewer cycles than alternative BNN designs. These results suggest that the time for large-scale BNN adoption could be upon us.
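
The abstract does not describe the inference scheme itself. As general background on why binarized layers can be cheaper than 8-bit ones, the following is a minimal sketch of the commonly used XNOR-and-popcount dot product, assuming weights and activations are constrained to {-1, +1} and packed one bit per value. The helper names `pack_signs` and `binary_dot` are hypothetical, and this is not claimed to be the paper's implementation.

```python
def pack_signs(values):
    """Pack a sequence of +1/-1 values into an int; bit i is set when values[i] is +1."""
    word = 0
    for i, v in enumerate(values):
        if v > 0:
            word |= 1 << i
    return word


def binary_dot(a_bits, b_bits, n):
    """Dot product of two length-n {-1, +1} vectors given their packed bits."""
    mask = (1 << n) - 1
    agree = ~(a_bits ^ b_bits) & mask    # XNOR: bit is 1 where the signs match
    matches = bin(agree).count("1")      # popcount of matching positions
    # Each match contributes +1 and each mismatch -1: matches - (n - matches).
    return 2 * matches - n


a = [1, -1, 1, 1, -1, -1, 1, -1]
b = [1, 1, -1, 1, -1, 1, 1, -1]

reference = sum(x * y for x, y in zip(a, b))               # ordinary dot product
result = binary_dot(pack_signs(a), pack_signs(b), len(a))  # bitwise equivalent
print(reference, result)
```

Both values agree, which illustrates the general idea: a full multiply-accumulate over n values collapses into one XNOR and one popcount per machine word.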

URL

https://arxiv.org/abs/2202.03716

PDF

https://arxiv.org/pdf/2202.03716.pdf