Unpaired Image-to-Image Translation for Segmentation and Signal Unmixing

2025-05-27 05:36:50
Nikola Andrejic, Milica Spasic, Igor Mihajlovic, Petra Milosavljevic, Djordje Pavlovic, Filip Milisavljevic, Uros Milivojevic, Danilo Delibasic, Ivana Mikic, Sinisa Todorovic

Abstract

This work introduces Ui2i, a novel model for unpaired image-to-image translation, trained on content-wise unpaired datasets to enable style transfer across domains while preserving content. Building on CycleGAN, Ui2i incorporates key modifications to better disentangle content and style features, and preserve content integrity. Specifically, Ui2i employs U-Net-based generators with skip connections to propagate localized shallow features deep into the generator. Ui2i removes feature-based normalization layers from all modules and replaces them with approximate bidirectional spectral normalization -- a parameter-based alternative that enhances training stability. To further support content preservation, channel and spatial attention mechanisms are integrated into the generators. Training is facilitated through image scale augmentation. Evaluation on two biomedical tasks -- domain adaptation for nuclear segmentation in immunohistochemistry (IHC) images and unmixing of biological structures superimposed in single-channel immunofluorescence (IF) images -- demonstrates Ui2i's ability to preserve content fidelity in settings that demand more accurate structural preservation than typical translation tasks. To the best of our knowledge, Ui2i is the first approach capable of separating superimposed signals in IF images using real, unpaired training data.
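The abstract describes spectral normalization as a parameter-based alternative to feature-based normalization layers: the weights are rescaled, not the activations. As a rough illustration only, the sketch below shows ordinary spectral normalization by power iteration on a small weight matrix in plain Python. It is not the paper's approximate bidirectional variant, whose details the abstract does not give, and all function names here (`spectral_norm`, `spectrally_normalize`, and the helpers) are hypothetical.

```python
import math
import random


def matvec(W, v):
    """Return W @ v for a matrix stored as a list of rows."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def matvec_t(W, u):
    """Return W^T @ u."""
    return [sum(W[i][j] * u[i] for i in range(len(W))) for j in range(len(W[0]))]


def normalize(v, eps=1e-12):
    """Scale v to (approximately) unit Euclidean length."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / (n + eps) for x in v]


def spectral_norm(W, n_iters=50, seed=0):
    """Estimate the largest singular value of W by power iteration."""
    rng = random.Random(seed)
    u = normalize([rng.gauss(0.0, 1.0) for _ in W])
    for _ in range(n_iters):
        v = normalize(matvec_t(W, u))  # right singular vector estimate
        u = normalize(matvec(W, v))    # left singular vector estimate
    # sigma is approximately u^T W v
    return sum(a * b for a, b in zip(u, matvec(W, v)))


def spectrally_normalize(W, n_iters=50):
    """Divide W by its estimated spectral norm so the result has norm ~1."""
    sigma = spectral_norm(W, n_iters)
    return [[w / sigma for w in row] for row in W]


# Toy weight matrix whose singular values are 3 and 1.
W = [[3.0, 0.0], [0.0, 1.0]]
sigma = spectral_norm(W)
W_sn = spectrally_normalize(W)
```

Because the rescaling depends only on the weights, it does not compute per-image or per-batch feature statistics, which is the sense in which the abstract contrasts it with feature-based normalization. In practice a deep learning framework would run one power-iteration step per training update instead of iterating to convergence as this toy version does.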

URL

https://arxiv.org/abs/2505.20746

PDF

https://arxiv.org/pdf/2505.20746.pdf

