On the ability of monolingual models to learn language-agnostic representations

2021-09-04 22:09:44

Leandro Rodrigues de Souza, Rodrigo Nogueira, Roberto Lotufo

arXiv_CL Zero-Shot

Abstract

Pretrained multilingual models have become the de facto default approach for zero-shot cross-lingual transfer. Previous work has shown that these models can learn cross-lingual representations when pretrained on two or more languages with shared parameters. In this work, we provide evidence that a model can learn language-agnostic representations even when pretrained on a single language. That is, we find that monolingual models pretrained on one language and finetuned on another achieve performance competitive with models pretrained and finetuned on the same target language. Surprisingly, the models perform similarly on the same task regardless of the pretraining language. For example, models pretrained on distant languages such as German and Portuguese perform similarly on English tasks.

URL

https://arxiv.org/abs/2109.01942

PDF

https://arxiv.org/pdf/2109.01942.pdf