Tag-less Back-Translation

2019-12-22 19:20:10

Idris Abdulmumin, Bashir Shehu Galadanci, Aliyu Garba

arXiv_CL

arXiv_CL NMT

Abstract
Abstract (translated)
URL
PDF

Abstract

An effective method to generate a large number of parallel sentences for training improved neural machine translation (NMT) systems is the use of back-translations of the target-side monolingual data. Tagging, or using gates, has been used to enable translation models to distinguish between synthetic and natural data. This improves standard back-translation and also enables the use of iterative back-translation on language pairs that underperformed using standard back-translation. This work presents a simplified approach of differentiating between the two data using pretraining and finetuning. The approach - tag-less back-translation - trains the model on the synthetic data and finetunes it on the natural data. Preliminary experiments have shown the approach to continuously outperform the tagging approach on low resource English-Vietnamese neural machine translation. While the need for tagging (noising) the dataset has been removed, the approach outperformed the tagged back-translation approach by an average of 0.4 BLEU.

Abstract (translated)

URL

https://arxiv.org/abs/1912.10514

PDF

https://arxiv.org/pdf/1912.10514.pdf