Paper Reading AI Learner

RainDiffusion:When Unsupervised Learning Meets Diffusion Models for Real-world Image Deraining

2023-01-23 13:34:01
Mingqiang Wei, Yiyang Shen, Yongzhen Wang, Haoran Xie, Fu Lee Wang

Abstract

What will happen when unsupervised learning meets diffusion models for real-world image deraining? To answer it, we propose RainDiffusion, the first unsupervised image deraining paradigm based on diffusion models. Beyond the traditional unsupervised wisdom of image deraining, RainDiffusion introduces stable training of unpaired real-world data instead of weakly adversarial training. RainDiffusion consists of two cooperative branches: Non-diffusive Translation Branch (NTB) and Diffusive Translation Branch (DTB). NTB exploits a cycle-consistent architecture to bypass the difficulty in unpaired training of standard diffusion models by generating initial clean/rainy image pairs. DTB leverages two conditional diffusion modules to progressively refine the desired output with initial image pairs and diffusive generative prior, to obtain a better generalization ability of deraining and rain generation. Rain-Diffusion is a non adversarial training paradigm, serving as a new standard bar for real-world image deraining. Extensive experiments confirm the superiority of our RainDiffusion over un/semi-supervised methods and show its competitive advantages over fully-supervised ones.

Abstract (translated)

当 unsupervised learning 与扩散模型用于实际图像抑制时,会发生什么?为了回答这个问题,我们提出了 RainDiffusion,它是第一个基于扩散模型的 unsupervised 图像抑制范式。除了传统的图像抑制 unsupervised 智慧外, RainDiffusion 引入了稳定的配对真实数据的稳定训练,而不是弱对抗训练。 RainDiffusion 由两个合作分支组成:非扩散翻译分支(NTB)和扩散翻译分支(DTB)。 NTB利用循环一致性架构,通过生成初始清洁/雨水图像对,绕过标准扩散模型配对训练的难点。DTB利用两个条件扩散模块,逐步用初始图像对和扩散生成前向传播模型,逐渐优化期望输出,实现更好的抑制和雨生成泛化能力。 Rain-Diffusion 是一种无对抗训练范式,作为实际图像抑制的新标准。广泛的实验确认我们的 RainDiffusion 比无/半监督方法优越,并展示了它与完全监督方法的竞争优势。

URL

https://arxiv.org/abs/2301.09430

PDF

https://arxiv.org/pdf/2301.09430.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot