Multi-Task Learning for Situated Multi-Domain End-to-End Dialogue Systems

2021-10-11 12:36:30

Po-Nien Kung, Chung-Cheng Chang, Tse-Hsuan Yang, Hsin-Kai Hsu, Yu-Jia Liou, Yun-Nung Chen

arXiv_AI

arXiv_AI Language_Model Transformer Pose Dialog Chat

Abstract
Abstract (translated)
URL
PDF

Abstract

Task-oriented dialogue systems have been a promising area in the NLP field. Previous work showed the effectiveness of using a single GPT-2 based model to predict belief states and responses via causal language modeling. In this paper, we leverage multi-task learning techniques to train a GPT-2 based model on a more challenging dataset with multiple domains, multiple modalities, and more diversity in output formats. Using only a single model, our method achieves better performance on all sub-tasks, across domains, compared to task and domain-specific models. Furthermore, we evaluated several proposed strategies for GPT-2 based dialogue systems with comprehensive ablation studies, showing that all techniques can further improve the performance.

Abstract (translated)

URL

https://arxiv.org/abs/2110.05221

PDF

https://arxiv.org/pdf/2110.05221.pdf