Capturing Longer Context for Document-level Neural Machine Translation: A Multi-resolutional Approach

2020-10-18 11:18:29

Zewei Sun, Mingxuan Wang, Hao Zhou, Chengqi Zhao, Shujian Huang, Jiajun Chen, Lei Li

arXiv_CL

arXiv_CL Transformer Pose

Abstract

Discourse context has been proven useful when translating documents. Incorporating long document context into prevailing neural machine translation models such as the Transformer remains a challenge. In this paper, we propose multi-resolutional (MR) Doc2Doc, a method to train a neural sequence-to-sequence model for document-level translation. A single trained model can translate both sentence by sentence and a document as a whole. We evaluate our method and several recent approaches on nine document-level datasets and two sentence-level datasets across six languages. Experiments show that MR Doc2Doc outperforms sentence-level models and previous methods on a comprehensive set of metrics, including BLEU, four lexical indices, three newly proposed auxiliary linguistic indicators, and human evaluation.
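
The abstract does not spell out how training data at several resolutions is built. One plausible reading, offered here only as an assumption and not as the authors' implementation, is that each sentence-aligned document is split into progressively finer contiguous chunks, from the whole document down to single sentences, and every chunk pair becomes one training example. The function names, the default resolutions, and the space separator below are all hypothetical.

```python
from typing import List, Sequence, Tuple

Pair = Tuple[str, str]


def split_evenly(items: Sequence[str], k: int) -> List[List[str]]:
    """Split items into k contiguous chunks of near-equal size (never empty)."""
    k = min(k, len(items))
    base, extra = divmod(len(items), k)
    chunks, start = [], 0
    for i in range(k):
        # The first `extra` chunks take one additional item.
        end = start + base + (1 if i < extra else 0)
        chunks.append(list(items[start:end]))
        start = end
    return chunks


def multi_resolution_pairs(
    src_sents: Sequence[str],
    tgt_sents: Sequence[str],
    resolutions: Sequence[int] = (1, 2, 4, 8),  # assumed values, not from the source
    sep: str = " ",  # assumed sentence separator
) -> List[Pair]:
    """Build (source, target) training pairs from one sentence-aligned document.

    Resolution k means "split the document into k chunks"; k = 1 is the whole
    document, and the sentence-level resolution is always included.
    """
    if len(src_sents) != len(tgt_sents):
        raise ValueError("document must be sentence-aligned")
    n = len(src_sents)
    if n == 0:
        return []
    # Clamp to the document length and deduplicate, then add sentence level.
    ks = sorted({min(k, n) for k in resolutions} | {n})
    pairs: List[Pair] = []
    for k in ks:
        for s, t in zip(split_evenly(src_sents, k), split_evenly(tgt_sents, k)):
            pairs.append((sep.join(s), sep.join(t)))
    return pairs
```

Under this assumption, a four-sentence document with resolutions (1, 2) yields one whole-document pair, two half-document pairs, and four sentence pairs, so a model trained on the result sees both short and long sequences.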

URL

https://arxiv.org/abs/2010.08961

PDF

https://arxiv.org/pdf/2010.08961.pdf