Paper Reading AI Learner

Standardized CycleGAN training for unsupervised stain adaptation in invasive carcinoma classification for breast histopathology

2023-01-30 18:07:09
Nicolas Nerrienet, Rémy Peyret, Marie Sockeel, Stéphane Sockeel

Abstract

Generalization is one of the main challenges of computational pathology. Slide preparation heterogeneity and the diversity of scanners lead to poor model performance when used on data from medical centers not seen during training. In order to achieve stain invariance in breast invasive carcinoma patch classification, we implement a stain translation strategy using cycleGANs for unsupervised image-to-image translation. We compare three cycleGAN-based approaches to a baseline classification model obtained without any stain invariance strategy. Two of the proposed approaches use cycleGAN's translations at inference or training in order to build stain-specific classification models. The last method uses them for stain data augmentation during training. This constrains the classification model to learn stain-invariant features. Baseline metrics are set by training and testing the baseline classification model on a reference stain. We assessed performances using three medical centers with H&E and H&E&S staining. Every approach tested in this study improves baseline metrics without needing labels on target stains. The stain augmentation-based approach produced the best results on every stain. Each method's pros and cons are studied and discussed in this paper. However, training highly performing cycleGANs models in itself represents a challenge. In this work, we introduce a systematical method for optimizing cycleGAN training by setting a novel stopping criterion. This method has the benefit of not requiring any visual inspection of cycleGAN results and proves superiority to methods using a predefined number of training epochs. In addition, we also study the minimal amount of data required for cycleGAN training.

Abstract (translated)

泛化是计算病理学中的主要挑战之一。Slide preparation的不一致性和扫描仪的多样性在使用训练期间未曾看到的医疗中心数据时会导致模型性能不佳。为了在乳腺癌 patch 分类中实现染色不变异,我们采用了无监督图像到图像转换的循环GAN策略。我们比较了三种循环GAN基 approach 与在没有染色不变异策略的情况下获得的基准分类模型。提议的两种 approach 使用循环GAN的推断或训练 translation 来构建特定的染色分类模型。最后一种方法在训练期间使用它们来增加目标染色的数据。这限制了分类模型学习染色不变异特征。基准指标是通过训练和测试基准分类模型在参考染色上进行设置。我们使用三个医疗中心 H&E 和 H&E&S 染色评估了表现。在所有方法中,该研究提高了基准指标,无需对目标染色标签。基于染色增加的方法在每个染色上取得了最佳结果。每种方法的优点和缺点在本文中研究和讨论。然而,训练高性能的循环GAN模型本身 represents a challenge。在本研究中,我们介绍了一种系统的方法,通过设置一种新的停止准则来优化循环GAN训练。这种方法无需对循环GAN结果进行视觉检查,并证明了使用事先定义的训练 epochs 的方法的优越性。此外,我们还研究了循环GAN训练所需的最小数据量。

URL

https://arxiv.org/abs/2301.13128

PDF

https://arxiv.org/pdf/2301.13128.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot