Generalized Jensen-Shannon Divergence Loss for Learning with Noisy Labels

2021-05-10 17:19:38

Erik Englesson, Hossein Azizpour

arXiv_CV

arXiv_CV Regularization Pose

Abstract
Abstract (translated)
URL
PDF

Abstract

We propose two novel loss functions based on Jensen-Shannon divergence for learning under label noise. Following the work of Ghosh et al. (2017), we argue about their theoretical robustness. Furthermore, we reveal several other desirable properties by drawing informative connections to various loss functions, e.g., cross entropy, mean absolute error, generalized cross entropy, symmetric cross entropy, label smoothing, and most importantly consistency regularization. We conduct extensive and systematic experiments using both synthetic (CIFAR) and real (WebVision) noise and demonstrate significant and consistent improvements over other loss functions. Also, we conduct several informative side experiments that highlight the different theoretical properties.

Abstract (translated)

URL

https://arxiv.org/abs/2105.04522

PDF

https://arxiv.org/pdf/2105.04522.pdf