Paper Reading AI Learner

SeNM-VAE: Semi-Supervised Noise Modeling with Hierarchical Variational Autoencoder

2024-03-26 09:03:40
Dihan Zheng, Yihang Zou, Xiaowen Zhang, Chenglong Bao

Abstract

The data bottleneck has emerged as a fundamental challenge in learning based image restoration methods. Researchers have attempted to generate synthesized training data using paired or unpaired samples to address this challenge. This study proposes SeNM-VAE, a semi-supervised noise modeling method that leverages both paired and unpaired datasets to generate realistic degraded data. Our approach is based on modeling the conditional distribution of degraded and clean images with a specially designed graphical model. Under the variational inference framework, we develop an objective function for handling both paired and unpaired data. We employ our method to generate paired training samples for real-world image denoising and super-resolution tasks. Our approach excels in the quality of synthetic degraded images compared to other unpaired and paired noise modeling methods. Furthermore, our approach demonstrates remarkable performance in downstream image restoration tasks, even with limited paired data. With more paired data, our method achieves the best performance on the SIDD dataset.

Abstract (translated)

数据瓶颈已成为基于图像修复方法的学习中的一个基本挑战。研究人员试图通过成对或非成对样本来生成合成训练数据来解决这个挑战。本研究提出了一种半监督噪声建模方法——SeNM-VAE,该方法利用成对和未成对数据集来生成真实 degradation数据。我们的方法基于使用专门设计的图形模型建模降解和清洁图像的条件分布。在变分推理框架下,我们开发了一个处理成对和未成对数据的共同目标函数。我们将该方法应用于真实世界图像去噪和超分辨率任务。与其它未成对和成对噪声建模方法相比,我们的方法在合成降解图像的质量方面具有卓越的表现。此外,即使只有很少的成对数据,我们的方法在下游图像修复任务中也表现出优异的性能。随着更多成对数据的增加,我们的方法在SIDD数据集上实现最佳性能。

URL

https://arxiv.org/abs/2403.17502

PDF

https://arxiv.org/pdf/2403.17502.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot