Modification method for single-stage object detectors that allows to exploit the temporal behaviour of a scene to improve detection accuracy

Abstract

A simple modification method is proposed for single-stage generic object detection neural networks, such as YOLO and SSD, that improves detection accuracy on video data by exploiting the temporal behavior of the scene in the detection pipeline. Using this method, the detection accuracy of the base network can be considerably improved, especially for occluded and hidden objects. A modified network also tends to detect hidden objects with higher confidence than an unmodified one. In addition, a weakly supervised training method is proposed that allows a modified network to be trained without any additional annotated data.

URL

https://arxiv.org/abs/2009.01617

PDF

https://arxiv.org/pdf/2009.01617.pdf