Paper Reading AI Learner

Specularity Factorization for Low-Light Enhancement

2024-04-02 14:41:42
Saurabh Saini, P J Narayanan

Abstract

We present a new additive image factorization technique that treats images to be composed of multiple latent specular components which can be simply estimated recursively by modulating the sparsity during decomposition. Our model-driven {\em RSFNet} estimates these factors by unrolling the optimization into network layers requiring only a few scalars to be learned. The resultant factors are interpretable by design and can be fused for different image enhancement tasks via a network or combined directly by the user in a controllable fashion. Based on RSFNet, we detail a zero-reference Low Light Enhancement (LLE) application trained without paired or unpaired supervision. Our system improves the state-of-the-art performance on standard benchmarks and achieves better generalization on multiple other datasets. We also integrate our factors with other task specific fusion networks for applications like deraining, deblurring and dehazing with negligible overhead thereby highlighting the multi-domain and multi-task generalizability of our proposed RSFNet. The code and data is released for reproducibility on the project homepage.

Abstract (translated)

我们提出了一种新的附加图像因素分解技术,该技术处理由多个潜在极化子组件组成的图像。这些因素可以通过在分解过程中对稀疏度的调节来简单地递归估计。我们的模型驱动的{\em RSFNet}通过将优化展开到仅需要学习几个标量来处理的网络层中来估计这些因素。由此产生的因素可以通过网络或通过用户在可控制的方式进行融合,用于不同的图像增强任务。基于RSFNet,我们详细介绍了一个无需配对或非配对监督的零参考低光增强(LLE)应用。我们的系统在标准基准上提高了最先进的性能,并在多个其他数据集上取得了更好的泛化能力。我们还将我们的因素与其他任务特定的融合网络集成,用于诸如去雾、去噪和去雾等应用。通过显著的 overhead,提高了我们提出的RSFNet的多领域和多任务通用性。代码和数据发布在项目主页上以进行可重复性。

URL

https://arxiv.org/abs/2404.01998

PDF

https://arxiv.org/pdf/2404.01998.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot