Paper Reading AI Learner

Experimental Results of Underwater Sound Speed Profile Inversion by Few-shot Multi-task Learning

2023-10-18 04:53:01
Wei Huang, Fan Gao, Junting Wang, Hao Zhang

Abstract

Underwater Sound Speed Profile (SSP) distribution has great influence on the propagation mode of acoustic signal, thus the fast and accurate estimation of SSP is of great importance in building underwater observation systems. The state-of-the-art SSP inversion methods include frameworks of matched field processing (MFP), compressive sensing (CS), and feedforeward neural networks (FNN), among which the FNN shows better real-time performance while maintain the same level of accuracy. However, the training of FNN needs quite a lot historical SSP samples, which is diffcult to be satisfied in many ocean areas. This situation is called few-shot learning. To tackle this issue, we propose a multi-task learning (MTL) model with partial parameter sharing among different traning tasks. By MTL, common features could be extracted, thus accelerating the learning process on given tasks, and reducing the demand for reference samples, so as to enhance the generalization ability in few-shot learning. To verify the feasibility and effectiveness of MTL, a deep-ocean experiment was held in April 2023 at the South China Sea. Results shows that MTL outperforms the state-of-the-art methods in terms of accuracy for SSP inversion, while inherits the real-time advantage of FNN during the inversion stage.

Abstract (translated)

水下声速剖面(SSP)分布对声波传播模式有很大的影响,因此快速和准确地估计SSP对构建水下观测系统非常重要。最先进的SSP反演方法包括匹配场处理(MFP)框架、压缩感知(CS)和前馈神经网络(FNN)等。其中,FNN在保持相同准确性的同时具有更好的实时性能。然而,为了训练FNN,需要相当多的历史SSP样本,这在许多海洋区域中是难以满足的。这种情况称为欠样本学习。为了解决这个问题,我们提出了一个多任务学习(MTL)模型,其中不同训练任务之间共享部分参数。通过MTL,可以提取共同特征,从而加速在给定任务上的学习过程,并减少对参考样本的需求,从而增强在欠样本学习中的泛化能力。为了验证MTL的可行性和有效性,2023年4月在南海进行了一个深海实验。结果表明,MTL在SSP反演方面的准确性超过了最先进的方法,而在反演阶段,FNN具有实时优势。

URL

https://arxiv.org/abs/2310.11708

PDF

https://arxiv.org/pdf/2310.11708.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot