Abstract
Humor is a unique and creative communicative behavior displayed during social interactions. It is produced multimodally, through words (text), gestures (vision), and prosodic cues (acoustic). Understanding humor from these three modalities falls within the boundaries of multimodal language, a recent research trend in natural language processing that models natural language as it happens in face-to-face communication. Although humor detection is an established research area in NLP, it remains understudied in a multimodal context. This paper presents a diverse multimodal dataset, called UR-FUNNY, to open the door to understanding the multimodal language used in expressing humor. The dataset and accompanying studies present a framework for multimodal humor detection for the natural language processing community. UR-FUNNY is publicly available for research.
URL
https://arxiv.org/abs/1904.06618