Multigrid-in-Channels Neural Network Architectures

Abstract
Abstract (translated)
URL
PDF

Abstract

We present a multigrid-in-channels (MGIC) approach that tackles the quadratic growth of the number of parameters with respect to the number of channels in standard convolutional neural networks (CNNs). It has been shown that there is a redundancy in standard CNNs, as networks with light or sparse convolution operators yield similar performance to full networks. However, the number of parameters in the former networks also scales quadratically in width, while in the latter case, the parameters typically have random sparsity patterns, hampering hardware efficiency.Our approach for building CNN architectures scales linearly with respect to the network's width while retaining full coupling of the channels as in standard this http URL this end, we replace each convolution block with its MGIC block utilizing a hierarchy of lightweight convolutions. Our extensive experiments on image classification, segmentation, and point cloud classification show that applying this strategy to different architectures like ResNet and MobileNetV3 considerably reduces the number of parameters while obtaining similar or better accuracy. For example, we obtain 76.1% top-1 accuracy on ImageNet with a lightweight network with similar parameters and FLOPs to MobileNetV3.

Abstract (translated)

URL

https://arxiv.org/abs/2011.09128

PDF

https://arxiv.org/pdf/2011.09128.pdf