Abstract
Learning in data-scarce settings has recently gained significant attention in the research community. Semi-supervised object detection(SSOD) aims to improve detection performance by leveraging a large number of unlabeled images alongside a limited number of labeled images(a.k.a.,few-shot learning). In this paper, we present a comprehensive comparison of three state-of-the-art SSOD approaches, including MixPL, Semi-DETR and Consistent-Teacher, with the goal of understanding how performance varies with the number of labeled images. We conduct experiments using the MS-COCO and Pascal VOC datasets, two popular object detection benchmarks which allow for standardized evaluation. In addition, we evaluate the SSOD approaches on a custom Beetle dataset which enables us to gain insights into their performance on specialized datasets with a smaller number of object categories. Our findings highlight the trade-offs between accuracy, model size, and latency, providing insights into which methods are best suited for low-data regimes.
Abstract (translated)
在数据稀缺环境下进行学习最近引起了研究社区的广泛关注。半监督目标检测(SSOD)旨在通过利用大量未标记图像和少量已标注图像(即,少样本学习)来提高检测性能。在这篇论文中,我们对三种最先进的SSOD方法进行了全面比较,包括MixPL、Semi-DETR 和 Consistent-Teacher,并且我们的研究目的是了解这些方法在不同数量的标签图片下的表现差异。我们在实验中使用了两个流行的目标检测基准数据集——MS-COCO和Pascal VOC,以实现标准化评估。此外,我们还在一个自定义的甲壳虫(Beetle)数据集上对SSOD方法进行了评估,该数据集使我们可以了解这些方法在具有较少对象类别的专业化数据集上的性能表现。我们的研究结果强调了准确率、模型大小和延迟之间的权衡,并提供了关于哪些方法最适合低数据环境的见解。
URL
https://arxiv.org/abs/2601.13380