Paper Reading AI Learner

Implicit Neural Representation for Cooperative Low-light Image Enhancement

2023-03-21 10:24:29
Shuzhou Yang, Moxuan Ding, Yanmin Wu, Zihan Li, Jian Zhang

Abstract

The following three factors restrict the application of existing low-light image enhancement methods: unpredictable brightness degradation and noise, inherent gap between metric-favorable and visual-friendly versions, and the limited paired training data. To address these limitations, we propose an implicit Neural Representation method for Cooperative low-light image enhancement, dubbed NeRCo. It robustly recovers perceptual-friendly results in an unsupervised manner. Concretely, NeRCo unifies the diverse degradation factors of real-world scenes with a controllable fitting function, leading to better robustness. In addition, for the output results, we introduce semantic-orientated supervision with priors from the pre-trained vision-language model. Instead of merely following reference images, it encourages results to meet subjective expectations, finding more visual-friendly solutions. Further, to ease the reliance on paired data and reduce solution space, we develop a dual-closed-loop constrained enhancement module. It is trained cooperatively with other affiliated modules in a self-supervised manner. Finally, extensive experiments demonstrate the robustness and superior effectiveness of our proposed NeRCo. Our code is available at this https URL.

Abstract (translated)

以下是三个因素限制了现有低光图像增强方法的应用:不可预测的亮度下降和噪声,metrics favorable和视觉友好版本的固有差异,以及有限的配对训练数据。为了解决这些问题,我们提出了一种隐含的神经网络表示方法,称为NeRCo,它以无监督的方式 robustly 恢复认知友好的结果。具体来说,NeRCo将真实场景的不同退化因素与可控制适应函数相结合,导致更好的鲁棒性。此外,对于输出结果,我们引入了语义导向的监督,从预训练的视觉语言模型中获取先验。相反,它不再仅仅跟随参考图像,而是鼓励结果满足主观期望,找到更多的视觉友好解决方案。进一步,为了减轻依赖配对数据并减少解决方案空间,我们开发了双重闭环限制增强模块。它与其他相关模块合作训练自我监督。最后,广泛的实验证明了我们提出的NeRCo的鲁棒性和优越性能。我们的代码可用在这个httpsURL上。

URL

https://arxiv.org/abs/2303.11722

PDF

https://arxiv.org/pdf/2303.11722.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot