Multi-Modal Sarcasm Detection Based on Contrastive Attention Mechanism

2021-09-30 14:17:51

Xiaoqiang Zhang, Ying Chen, Guangyuan Li

arXiv_CL

Abstract

Over the past decade, sarcasm detection has been studied intensively in textual settings. With the spread of video communication, analysis in multi-modal settings has received growing attention in recent years. As a result, multi-modal sarcasm detection, which aims to detect sarcasm in video conversations, has become an increasingly active topic in both the natural language processing and multi-modal analysis communities. In this paper, considering that sarcasm is often conveyed through incongruity between modalities (e.g., the text expresses a compliment while the acoustic tone indicates a grumble), we construct a Contrastive-Attention-based Sarcasm Detection (ConAttSD) model, which uses an inter-modality contrastive attention mechanism to extract several contrastive features for an utterance. Each contrastive feature represents the incongruity of information between two modalities. Our experiments on MUStARD, a benchmark multi-modal sarcasm dataset, demonstrate the effectiveness of the proposed ConAttSD model.
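
The abstract does not give the exact formulation of the inter-modality contrastive attention, so the following is only a minimal sketch of one plausible reading, not the ConAttSD implementation. It assumes scaled dot-product attention from a query in one modality (e.g., a text vector) over a sequence in another modality (e.g., acoustic frames), and then derives "contrastive" weights by inverting and renormalizing the conventional weights, so that the least aligned positions contribute most. The function name `contrastive_attention` and the inversion rule `(1 - a) / (n - 1)` are assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def contrastive_attention(query, keys, values):
    """Sketch of attention that emphasizes incongruent positions.

    query  : feature vector from modality A (list of floats)
    keys   : feature vectors from modality B (list of lists)
    values : vectors aggregated with the contrastive weights
    Returns (conventional weights, contrastive weights, contrastive feature).
    """
    n = len(keys)
    if n < 2:
        raise ValueError("need at least two positions to form contrastive weights")
    d = len(query)
    # Conventional scaled dot-product attention: high weight = high agreement.
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in keys]
    attn = softmax(scores)
    # Assumed inversion rule: the values (1 - a_i) sum to n - 1, so dividing
    # by n - 1 yields a distribution that favors low-agreement positions.
    contrast = [(1.0 - a) / (n - 1) for a in attn]
    # Contrastive feature: values weighted by the inverted distribution.
    dim = len(values[0])
    feature = [sum(w * v[j] for w, v in zip(contrast, values)) for j in range(dim)]
    return attn, contrast, feature


if __name__ == "__main__":
    text_vec = [1.0, 0.0]                                  # hypothetical text feature
    audio_frames = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]   # hypothetical acoustic features
    attn, contrast, feature = contrastive_attention(text_vec, audio_frames, audio_frames)
    print("conventional:", attn)
    print("contrastive: ", contrast)
    print("feature:     ", feature)
```

In this toy input, the acoustic frame that points opposite to the text vector receives the smallest conventional weight and the largest contrastive weight, which matches the intuition of a feature that captures incongruity between two modalities.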

URL

https://arxiv.org/abs/2109.15153

PDF

https://arxiv.org/pdf/2109.15153.pdf