Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection

2021-04-12 17:58:30

Nadine Chang, Zhiding Yu, Yu-Xiong Wang, Anima Anandkumar, Sanja Fidler, Jose M. Alvarez

arXiv_CV Segmentation Recognition Detection Classification Pose

Abstract

Training on datasets with long-tailed distributions has been challenging for major recognition tasks such as classification and detection. To deal with this challenge, image resampling is typically introduced as a simple but effective approach. However, we observe that long-tailed detection differs from classification since multiple classes may be present in one image. As a result, image resampling alone is not enough to yield a sufficiently balanced distribution at the object level. We address object-level resampling by introducing an object-centric memory replay strategy based on dynamic, episodic memory banks. Our proposed strategy has two benefits: 1) convenient object-level resampling without significant extra computation, and 2) implicit feature-level augmentation from model updates. We show that image-level and object-level resampling are both important, and thus unify them in a joint resampling strategy (RIO). Our method outperforms state-of-the-art long-tailed detection and segmentation methods on LVIS v0.5 across various backbones.
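
The contrast between the two resampling levels can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which image-level sampler is used, so the sketch assumes repeat-factor sampling (a common choice on LVIS), and the names `image_repeat_factors`, `ObjectMemoryBank`, `capacity_per_class`, and `target_per_class` are hypothetical. The memory bank stores opaque "features" per class and replays them for classes that are under-represented in the current batch.

```python
import math
import random
from collections import Counter, defaultdict, deque


def image_repeat_factors(image_labels, threshold=0.5):
    """Image-level resampling (assumed here: repeat-factor sampling).

    image_labels: one list of class ids per image.
    Returns one repeat factor per image; values > 1 mean oversample.
    """
    n = len(image_labels)
    # Count the number of images in which each class appears.
    freq = Counter(c for labels in image_labels for c in set(labels))
    class_rf = {
        c: max(1.0, math.sqrt(threshold / (count / n)))
        for c, count in freq.items()
    }
    # An image inherits the largest factor among the classes it contains.
    return [
        max((class_rf[c] for c in set(labels)), default=1.0)
        for labels in image_labels
    ]


class ObjectMemoryBank:
    """Object-level resampling: a bounded, per-class store of object features.

    Hypothetical sketch. Old entries are evicted first, so the bank stays
    "episodic" and reflects reasonably recent model states.
    """

    def __init__(self, capacity_per_class=4):
        self.banks = defaultdict(lambda: deque(maxlen=capacity_per_class))

    def push(self, class_id, feature):
        self.banks[class_id].append(feature)

    def replay(self, batch_counts, target_per_class, rng=random):
        """Return (class_id, feature) pairs for classes below the target."""
        replayed = []
        for class_id, bank in self.banks.items():
            deficit = target_per_class - batch_counts.get(class_id, 0)
            if deficit > 0 and bank:
                k = min(deficit, len(bank))
                replayed.extend((class_id, f) for f in rng.sample(list(bank), k))
        return replayed
```

In this sketch an image that contains one rare object and many frequent ones is repeated as a whole, which also repeats the frequent objects; the memory bank instead adds only the rare objects, which is the imbalance the abstract points to.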

URL

https://arxiv.org/abs/2104.05702

PDF

https://arxiv.org/pdf/2104.05702.pdf