The revenge of BiSeNet: Efficient Multi-Task Image Segmentation

Abstract
Abstract (translated)
URL
PDF

Abstract

Recent advancements in image segmentation have focused on enhancing the efficiency of the models to meet the demands of real-time applications, especially on edge devices. However, existing research has primarily concentrated on single-task settings, especially on semantic segmentation, leading to redundant efforts and specialized architectures for different tasks. To address this limitation, we propose a novel architecture for efficient multi-task image segmentation, capable of handling various segmentation tasks without sacrificing efficiency or accuracy. We introduce BiSeNetFormer, that leverages the efficiency of two-stream semantic segmentation architectures and it extends them into a mask classification framework. Our approach maintains the efficient spatial and context paths to capture detailed and semantic information, respectively, while leveraging an efficient transformed-based segmentation head that computes the binary masks and class probabilities. By seamlessly supporting multiple tasks, namely semantic and panoptic segmentation, BiSeNetFormer offers a versatile solution for multi-task segmentation. We evaluate our approach on popular datasets, Cityscapes and ADE20K, demonstrating impressive inference speeds while maintaining competitive accuracy compared to state-of-the-art architectures. Our results indicate that BiSeNetFormer represents a significant advancement towards fast, efficient, and multi-task segmentation networks, bridging the gap between model efficiency and task adaptability.

Abstract (translated)

近年来，在图像分割领域的进步主要集中在提高模型的实时应用需求，尤其是边缘设备。然而，现有的研究主要集中在单任务设置，尤其是在语义分割上，导致对不同任务的冗余努力和专业架构。为了克服这一局限，我们提出了一个名为BiSeNetFormer的新型多任务图像分割架构，能够处理各种分割任务，同时不牺牲效率或准确性。我们引入了BiSeNetFormer，它利用了两流语义分割架构的效率，并将其扩展到掩码分类框架。我们的方法保持了捕捉详细和语义信息的高效空间和上下文路径，同时利用了高效的可转换基分割头计算二进制掩码和类概率。通过轻松支持多个任务，包括语义和视网膜分割，BiSeNetFormer为多任务分割提供了一个通用的解决方案。我们在流行的数据集（城市风光和ADE20K）上评估我们的方法，证明了我们令人印象深刻的推理速度，同时保持与最先进架构的竞争准确性。我们的结果表明，BiSeNetFormer在快速、高效和多任务分割网络方面取得了显著的进展，缩小了模型效率和任务适应性的差距。

URL

https://arxiv.org/abs/2404.09570

PDF

https://arxiv.org/pdf/2404.09570.pdf

The revenge of BiSeNet: Efficient Multi-Task Image Segmentation

Abstract

Abstract (translated)

URL

PDF Copy

PDF