BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese

2021-09-20 17:14:22

Nguyen Luong Tran, Duong Minh Le, Dat Quoc Nguyen

arXiv_CL

Abstract

We present BARTpho in two versions -- BARTpho_word and BARTpho_syllable -- which are the first public large-scale monolingual sequence-to-sequence models pre-trained for Vietnamese. BARTpho uses the "large" architecture and pre-training scheme of the sequence-to-sequence denoising model BART, which makes it especially suitable for generative NLP tasks. Experiments on the downstream task of Vietnamese text summarization show that, in both automatic and human evaluations, BARTpho outperforms the strong baseline mBART and improves the state of the art. We release BARTpho to facilitate future research on and applications of generative Vietnamese NLP tasks. Our BARTpho models are available at: this https URL

URL

https://arxiv.org/abs/2109.09701

PDF

https://arxiv.org/pdf/2109.09701.pdf