Abstract
The last decade of computer vision has been dominated by Deep Learning architectures, thanks to their unparalleled success. Their performance, however, often comes at the cost of explainability owing to their highly non-linear nature. Consequently, a parallel field of eXplainable Artificial Intelligence (XAI) has developed with the aim of generating insights regarding the decision making process of deep learning models. An important problem in XAI is that of the generation of saliency maps. These are regions in an input image which contributed most towards the model's final decision. Most work in this regard, however, has been focused on image classification, and image segmentation - despite being a ubiquitous task - has not received the same attention. In the present work, we propose MiSuRe (Minimally Sufficient Region) as an algorithm to generate saliency maps for image segmentation. The goal of the saliency maps generated by MiSuRe is to get rid of irrelevant regions, and only highlight those regions in the input image which are crucial to the image segmentation decision. We perform our analysis on 3 datasets: Triangle (artificially constructed), COCO-2017 (natural images), and the Synapse multi-organ (medical images). Additionally, we identify a potential usecase of these post-hoc saliency maps in order to perform post-hoc reliability of the segmentation model.
Abstract (translated)
过去十年的计算机视觉领域,深度学习架构占据了主导地位,得益于它们无与伦比的的成功。然而,由于它们高度非线性的特点,其性能往往代价高昂。因此,一个旨在生成关于深度学习模型决策过程的洞察的并行领域——可解释性人工智能(XAI)应运而生。XAI的一个关键问题就是生成 saliency 地图。这些是输入图像中对于模型最终决策起最大贡献的 regions。然而,在这方面,大部分工作都集中在图像分类和图像分割——尽管这是一项普遍存在的任务,但并未得到同样的关注。在本文中,我们提出了 MiSuRe(最小充分区域)算法,用于为图像分割生成 saliency 地图。MiSuRe 生成的 saliency 地图的目标是消除无关区域,仅关注输入图像中关键用于分割决策的区域。我们对三个数据集进行了分析:Triangle(人工构造数据集),COCO-2017(自然图像数据集)和 Synapse multi-organ(医学图像数据集)。此外,我们还发现了这些后置 saliency 地图的一个潜在应用,以便对分割模型的后置可靠性进行评估。
URL
https://arxiv.org/abs/2406.12173