InferEM: Inferring the Speaker's Intention for Empathetic Dialogue Generation

2022-12-13 05:12:40

Guoqing Lv, Xiaoping Wang, Jiang Li, Zhigang Zeng

arXiv_CL

arXiv_CL Attention Prediction Pose Dialog Chat

Abstract

Current approaches to empathetic response generation typically encode the entire dialogue history directly and feed the output into a decoder to generate friendly feedback. These methods focus on modelling contextual information but neglect the speaker's direct intention. We argue that, empirically, the last utterance in the dialogue conveys the intention of the speaker. Consequently, we propose a novel model named InferEM for empathetic response generation. We encode the last utterance separately and fuse it with the entire dialogue through a multi-head-attention-based intention fusion module to capture the speaker's intention. In addition, we use the previous utterances to predict the last utterance, which simulates how humans guess in advance what the interlocutor may say. To balance the optimization rates of utterance prediction and response generation, a multi-task learning strategy is designed for InferEM. Experimental results demonstrate the plausibility and validity of InferEM in improving empathetic expression.
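
To make the fusion idea concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the abstract does not say which side supplies the queries, so this sketch assumes the dialogue-context tokens act as queries over the separately encoded last-utterance tokens, omits the learned projection matrices that a real multi-head attention layer would have, and adds the result back to the context as a residual. The names `fuse_intention`, `attention_head`, and `split_heads` are hypothetical.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attention_head(queries, keys, values):
    """Scaled dot-product attention for one head (lists of float vectors)."""
    d = len(keys[0])
    out = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in keys]
        weights = softmax(scores)
        out.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return out


def split_heads(vectors, num_heads):
    """Slice every vector into num_heads equal chunks, grouped by head."""
    d_head = len(vectors[0]) // num_heads
    return [
        [v[h * d_head:(h + 1) * d_head] for v in vectors]
        for h in range(num_heads)
    ]


def fuse_intention(context, last_utterance, num_heads=2):
    """Assumed fusion: context tokens attend over last-utterance tokens.

    No learned projections are applied (a simplification); the attended
    vectors are concatenated across heads and added to the context.
    """
    assert len(context[0]) % num_heads == 0
    q_heads = split_heads(context, num_heads)
    kv_heads = split_heads(last_utterance, num_heads)
    head_outputs = [attention_head(q, kv, kv) for q, kv in zip(q_heads, kv_heads)]
    fused = []
    for i, ctx in enumerate(context):
        attended = [x for head in head_outputs for x in head[i]]
        fused.append([c + a for c, a in zip(ctx, attended)])
    return fused


# Toy stand-ins for encoder outputs: 5 context tokens, 2 last-utterance tokens.
random.seed(0)
context = [[random.uniform(-1, 1) for _ in range(4)] for _ in range(5)]
last_utterance = [[random.uniform(-1, 1) for _ in range(4)] for _ in range(2)]
fused = fuse_intention(context, last_utterance)
```

The output has one fused vector per context token, so it can replace the plain context encoding as decoder input. In a trained model the queries, keys, and values would each pass through learned linear layers first.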

URL

https://arxiv.org/abs/2212.06373

PDF

https://arxiv.org/pdf/2212.06373.pdf