Importance-Aware Data Augmentation for Document-Level Neural Machine Translation

2024-01-27 09:27:47
Minghao Wu, Yufei Wang, George Foster, Lizhen Qu, Gholamreza Haffari

Abstract

Document-level neural machine translation (DocNMT) aims to generate translations that are both coherent and cohesive, in contrast to its sentence-level counterpart. However, due to its longer input length and limited availability of training data, DocNMT often faces the challenge of data sparsity. To overcome this issue, we propose a novel Importance-Aware Data Augmentation (IADA) algorithm for DocNMT that augments the training data based on token importance information estimated by the norm of hidden states and training gradients. We conduct comprehensive experiments on three widely used DocNMT benchmarks. Our empirical results show that our proposed IADA outperforms strong DocNMT baselines as well as several data augmentation approaches, with statistical significance on both sentence-level and document-level BLEU.
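
The abstract says token importance is estimated from the norm of hidden states and training gradients, but it does not say how those scores drive the augmentation. The sketch below is only an illustration of the norm-based scoring idea, not the paper's algorithm: the function names, the normalization to a probability-like distribution, and the choice to mask the highest-scoring tokens are all assumptions made for this example. The same scoring function could be applied to per-token gradient vectors instead of hidden states.

```python
import math


def l2_norm(vec):
    """Euclidean norm of one per-token vector (hidden state or gradient)."""
    return math.sqrt(sum(x * x for x in vec))


def token_importance(vectors):
    """Score each token by the L2 norm of its vector, normalized to sum to 1.

    Assumption: normalizing by the total norm is a choice made for this
    sketch; the abstract does not specify a normalization.
    """
    norms = [l2_norm(v) for v in vectors]
    if not norms:
        return []
    total = sum(norms)
    if total == 0:
        # All vectors are zero: fall back to uniform importance.
        return [1.0 / len(norms)] * len(norms)
    return [n / total for n in norms]


def mask_most_important(tokens, scores, k, mask="<mask>"):
    """Hypothetical augmentation step: replace the k highest-scoring tokens.

    Assumption: masking the top-k tokens is illustrative only; the paper may
    perturb tokens in a different way.
    """
    ranked = sorted(range(len(tokens)), key=lambda i: scores[i], reverse=True)
    chosen = set(ranked[:k])
    return [mask if i in chosen else tok for i, tok in enumerate(tokens)]


# Toy input: three tokens with made-up 2-dimensional hidden states.
tokens = ["the", "cat", "sat"]
hidden = [[0.0, 1.0], [3.0, 4.0], [0.0, 2.0]]

scores = token_importance(hidden)
augmented = mask_most_important(tokens, scores, k=1)
```

In this toy input the vector norms are 1, 5, and 2, so "cat" receives the largest score and is the token replaced when `k=1`.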

URL

https://arxiv.org/abs/2401.15360

PDF

https://arxiv.org/pdf/2401.15360.pdf

