Towards Practical Control of Singular Values of Convolutional Layers

2022-11-24 19:09:44

Alexandra Senderovich, Ekaterina Bulatova, Anton Obukhov, Maxim Rakhuba

arXiv_CV

Abstract

In general, convolutional neural networks (CNNs) are easy to train, but their essential properties, such as generalization error and adversarial robustness, are hard to control. Recent research demonstrated that singular values of convolutional layers significantly affect such elusive properties and offered several methods for controlling them. Nevertheless, these methods present an intractable computational challenge or resort to coarse approximations. In this paper, we offer a principled approach to alleviating constraints of the prior art at the expense of an insignificant reduction in layer expressivity. Our method is based on the tensor-train decomposition; it retains control over the actual singular values of convolutional mappings while providing a structurally sparse and hardware-friendly representation. We demonstrate the improved properties of modern CNNs with our method and analyze its impact on the model performance, calibration, and adversarial robustness. The source code is available via the link on the arXiv abstract page.

URL

https://arxiv.org/abs/2211.13771

PDF

https://arxiv.org/pdf/2211.13771.pdf
