Regressive Ensemble for Machine Translation Quality Evaluation

2021-09-15 12:22:52

Michal Štefánik, Vít Novotný, Petr Sojka

arXiv_CL

arXiv_CL Relation Zero-Shot

Abstract
Abstract (translated)
URL
PDF

Abstract

This work introduces a simple regressive ensemble for evaluating machine translation quality based on a set of novel and established metrics. We evaluate the ensemble using a correlation to expert-based MQM scores of the WMT 2021 Metrics workshop. In both monolingual and zero-shot cross-lingual settings, we show a significant performance improvement over single metrics. In the cross-lingual settings, we also demonstrate that an ensemble approach is well-applicable to unseen languages. Furthermore, we identify a strong reference-free baseline that consistently outperforms the commonly-used BLEU and METEOR measures and significantly improves our ensemble's performance.

Abstract (translated)

URL

https://arxiv.org/abs/2109.07242

PDF

https://arxiv.org/pdf/2109.07242.pdf