Many-shot from Low-shot: Learning to Annotate using Mixed Supervision for Object Detection

2020-08-21 22:06:43

Carlo Biffi, Steven McDonagh, Philip Torr, Ales Leonardis, Sarah Parisot

arXiv_CV

Abstract
Abstract (translated)
URL
PDF

Abstract

Annotating such datasets is highly time consuming and expensive, which motivates the development of weakly supervised and few-shot object detection methods. However, these methods largely underperform with respect to their strongly supervised counterpart, as weak training signals \emph{often} result in partial or oversized detections. Towards solving this problem we introduce, for the first time, an online annotation module (OAM) that learns to generate a many-shot set of \emph{reliable} annotations from a larger volume of weakly labelled images. Our OAM can be jointly trained with any fully supervised two-stage object detection method, providing additional training annotations on the fly. This results in a fully end-to-end strategy that only requires a low-shot set of fully annotated images. The integration of the OAM with Fast(er) R-CNN improves their performance by $17\%$ mAP, $9\%$ AP50 on PASCAL VOC 2007 and MS-COCO benchmarks, and significantly outperforms competing methods using mixed supervision.

Abstract (translated)

URL

https://arxiv.org/abs/2008.09694

PDF

https://arxiv.org/pdf/2008.09694.pdf