Reveal to Revise: An Explainable AI Life Cycle for Iterative Bias Correction of Deep Models

2023-03-22 15:23:09
Frederik Pahde, Maximilian Dreyer, Wojciech Samek, Sebastian Lapuschkin

Abstract

State-of-the-art machine learning models often learn spurious correlations embedded in the training data. This poses risks when deploying these models for high-stakes decision-making, such as in medical applications like skin cancer detection. To tackle this problem, we propose Reveal to Revise (R2R), a framework entailing the entire eXplainable Artificial Intelligence (XAI) life cycle, enabling practitioners to iteratively identify, mitigate, and (re-)evaluate spurious model behavior with a minimal amount of human interaction. In the first step (1), R2R reveals model weaknesses by finding outliers in attributions or through inspection of latent concepts learned by the model. In the second step (2), the responsible artifacts are detected and spatially localized in the input data, and this localization is then leveraged to (3) revise the model behavior. Concretely, we apply the methods of RRR, CDEP and ClArC for model correction, and (4) (re-)evaluate the model's performance and remaining sensitivity towards the artifact. Using two medical benchmark datasets for melanoma detection and bone age estimation, we apply our R2R framework to VGG, ResNet and EfficientNet architectures and thereby reveal and correct real dataset-intrinsic artifacts, as well as synthetic variants in a controlled setting. Completing the XAI life cycle, we demonstrate multiple R2R iterations to mitigate different biases. Code is available on this https URL.
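To make steps (3) and (4) of the abstract more concrete, here is a minimal sketch of a projection-style correction in latent space, loosely in the spirit of the ClArC family. It is not the authors' implementation. All function names, the difference-of-means estimate of the artifact direction, and the toy two-dimensional activations are assumptions made for illustration only.

```python
from statistics import fmean


def mean_vector(rows):
    """Column-wise mean of a list of equal-length activation vectors."""
    return [fmean(col) for col in zip(*rows)]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def artifact_direction(artifact_acts, clean_acts):
    """Assumed estimate of the artifact direction: the normalized
    difference between mean activations with and without the artifact."""
    diff = [a - c for a, c in zip(mean_vector(artifact_acts),
                                  mean_vector(clean_acts))]
    norm = dot(diff, diff) ** 0.5
    if norm == 0:
        raise ValueError("artifact and clean activations have the same mean")
    return [d / norm for d in diff]


def suppress_artifact(act, direction, clean_mean):
    """Step (3), sketched: shift the activation along the artifact
    direction until its projection matches that of the clean mean."""
    shift = dot(act, direction) - dot(clean_mean, direction)
    return [a - shift * d for a, d in zip(act, direction)]


def artifact_sensitivity(act, direction, clean_mean):
    """Step (4), sketched: remaining offset along the artifact direction."""
    return abs(dot(act, direction) - dot(clean_mean, direction))


# Toy latent activations (hypothetical): the second dimension encodes the artifact.
clean = [[1.0, 0.0], [1.0, 0.2]]
artifact = [[1.0, 2.0], [1.0, 2.2]]

direction = artifact_direction(artifact, clean)
clean_mean = mean_vector(clean)

sample = artifact[0]
corrected = suppress_artifact(sample, direction, clean_mean)

before = artifact_sensitivity(sample, direction, clean_mean)
after = artifact_sensitivity(corrected, direction, clean_mean)
```

In this sketch only the component along the estimated artifact direction changes. The rest of the activation is left untouched, which is the design intent behind projection-based corrections: remove what the model reads from the artifact without retraining on the remaining features.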

URL

https://arxiv.org/abs/2303.12641

PDF

https://arxiv.org/pdf/2303.12641.pdf

