Joint Retrieval and Generation Training for Grounded Text Generation

2021-05-14 00:11:38

Yizhe Zhang, Siqi Sun, Xiang Gao, Yuwei Fang, Chris Brockett, Michel Galley, Jianfeng Gao, Bill Dolan

arXiv_AI

arXiv_AI Language_Model Text_Generation Transformer Pose Dialog Chat

Abstract
Abstract (translated)
URL
PDF

Abstract

Recent advances in large-scale pre-training such as GPT-3 allow seemingly high quality text to be generated from a given prompt. However, such generation systems often suffer from problems of hallucinated facts, and are not inherently designed to incorporate useful external information. Grounded generation models appear to offer remedies, but their training typically relies on rarely-available parallel data where corresponding documents are provided for context. We propose a framework that alleviates this data constraint by jointly training a grounded generator and document retriever on the language model signal. The model learns to retrieve the documents with the highest utility in generation and attentively combines them in the output. We demonstrate that by taking advantage of external references our approach can produce more informative and interesting text in both prose and dialogue generation.

Abstract (translated)

URL

https://arxiv.org/abs/2105.06597

PDF

https://arxiv.org/pdf/2105.06597.pdf