Paper Reading AI Learner

Decoupled Data Consistency with Diffusion Purification for Image Restoration

2024-03-10 00:47:05
Xiang Li, Soo Min Kwon, Ismail R. Alkhouri, Saiprasad Ravishanka, Qing Qu

Abstract

Diffusion models have recently gained traction as a powerful class of deep generative priors, excelling in a wide range of image restoration tasks due to their exceptional ability to model data distributions. To solve image restoration problems, many existing techniques achieve data consistency by incorporating additional likelihood gradient steps into the reverse sampling process of diffusion models. However, the additional gradient steps pose a challenge for real-world practical applications as they incur a large computational overhead, thereby increasing inference time. They also present additional difficulties when using accelerated diffusion model samplers, as the number of data consistency steps is limited by the number of reverse sampling steps. In this work, we propose a novel diffusion-based image restoration solver that addresses these issues by decoupling the reverse process from the data consistency steps. Our method involves alternating between a reconstruction phase to maintain data consistency and a refinement phase that enforces the prior via diffusion purification. Our approach demonstrates versatility, making it highly adaptable for efficient problem-solving in latent space. Additionally, it reduces the necessity for numerous sampling steps through the integration of consistency models. The efficacy of our approach is validated through comprehensive experiments across various image restoration tasks, including image denoising, deblurring, inpainting, and super-resolution.

Abstract (translated)

扩散模型最近因能够在图像修复任务中建模数据分布而获得了广泛关注,成为了一种强大的深度生成priors。由于其出色的数据分布建模能力,扩散模型在广泛的图像修复任务中表现出色。为解决图像修复问题,许多现有技术通过将扩散模型的反向采样过程引入 additional likelihood gradient 步骤来实现数据一致性。然而,这些额外的梯度步骤对现实世界的实际应用造成了较大的计算开销,从而增加了推理时间。当使用加速扩散模型采样器时,它们还提出了其他困难,因为数据一致性步骤的数量限制了反向采样步骤的数量。在本文中,我们提出了一种新型的基于扩散的图像修复求解器,通过将反向过程与数据一致性步骤解耦,从而解决了这些问题。我们的方法包括重建阶段和优化阶段。在重建阶段,我们维持数据一致性,通过扩散净化来实施先验。在优化阶段,我们通过扩散除杂来强制实施先验。通过将一致性模型集成到我们的方法中,我们减少了需要进行的采样步骤数量。我们通过在各种图像修复任务上进行全面的实验来验证我们方法的效力,包括图像去噪、去模糊、修复和超分辨率。

URL

https://arxiv.org/abs/2403.06054

PDF

https://arxiv.org/pdf/2403.06054.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot