Paper Reading AI Learner

Quality Estimation based Feedback Training for Improving Pronoun Translation

2025-01-06 13:34:51
Harshit Dhankhar, Baban Gain, Asif Ekbal, Yogesh Mani Tripathi

Abstract

Pronoun translation is a longstanding challenge in neural machine translation (NMT), often requiring inter-sentential context to ensure linguistic accuracy. To address this, we introduce ProNMT, a novel framework designed to enhance pronoun and overall translation quality in context-aware machine translation systems. ProNMT leverages Quality Estimation (QE) models and a unique Pronoun Generation Likelihood-Based Feedback mechanism to iteratively fine-tune pre-trained NMT models without relying on extensive human annotations. The framework combines QE scores with pronoun-specific rewards to guide training, ensuring improved handling of linguistic nuances. Extensive experiments demonstrate significant gains in pronoun translation accuracy and general translation quality across multiple metrics. ProNMT offers an efficient, scalable, and context-aware approach to improving NMT systems, particularly in translating context-dependent elements like pronouns.

Abstract (translated)

代词翻译一直是神经机器翻译(NMT)领域的一个长期挑战,通常需要跨句子的上下文信息来确保语言准确性。为了解决这一问题,我们引入了ProNMT——一个旨在通过利用质量估计(QE)模型和基于生成可能性的独特反馈机制,在不依赖大量人工标注的情况下,迭代地优化预训练NMT模型以提高代词翻译质量和整体翻译准确性的新型框架。该框架结合了QE分数与特定于代词的奖励来指导训练过程,从而更好地处理语言中的细微差别。 广泛实验表明,ProNMT在多项指标上显著提高了代词翻译准确性以及总体翻译质量。这一方法提供了一种高效、可扩展且上下文感知的方式来改进NMT系统,特别是在翻译依赖于上下文的语言元素(如代词)时表现尤为突出。

URL

https://arxiv.org/abs/2501.03008

PDF

https://arxiv.org/pdf/2501.03008.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Time_Series Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot