Paper Reading AI Learner

A Simple Adaptive Unfolding Network for Hyperspectral Image Reconstruction

2023-01-24 18:28:21
Junyu Wang, Shijie Wang, Wenyu Liu, Zengqiang Zheng, Xinggang Wang

Abstract

We present a simple, efficient, and scalable unfolding network, SAUNet, to simplify the network design with an adaptive alternate optimization framework for hyperspectral image (HSI) reconstruction. SAUNet customizes a Residual Adaptive ADMM Framework (R2ADMM) to connect each stage of the network via a group of learnable parameters to promote the usage of mask prior, which greatly stabilizes training and solves the accuracy degradation issue. Additionally, we introduce a simple convolutional modulation block (CMB), which leads to efficient training, easy scale-up, and less computation. Coupling these two designs, SAUNet can be scaled to non-trivial 13 stages with continuous improvement. Without bells and whistles, SAUNet improves both performance and speed compared with the previous state-of-the-art counterparts, which makes it feasible for practical high-resolution HSI reconstruction scenarios. We set new records on CAVE and KAIST HSI reconstruction benchmarks. Code and models are available at this https URL.

Abstract (translated)

我们介绍了一个简单、高效、可扩展的展开网络SAUNet,以简化超分辨率图像(HSI)重建网络的设计,采用自适应交替优化框架。SAUNet定制了Residual Adaptive ADMM框架(R2ADMM),通过一组可学习参数连接网络的各个阶段,促进使用先验掩模,极大地稳定了训练和解决精度下降问题。此外,我们引入了一个简单的卷积调制块(CMB),导致高效的训练、容易扩展和较少的计算。结合这两个设计,SAUNet可以无限地扩展到重要的13个阶段,并保持不断改进。相比之前的最先进的 counterpart,SAUNet在性能和速度方面都取得了改进,使其适用于实际的高分辨率HSI重建场景。我们在CAVE和KAIST HSI重建基准数据上创造了新的记录。代码和模型可在该httpsURL上获取。

URL

https://arxiv.org/abs/2301.10208

PDF

https://arxiv.org/pdf/2301.10208.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot