DialogueNeRF: Towards Realistic Avatar Face-to-face Conversation Video Generation

2022-03-15 14:16:49

Zanwei Zhou, Zi Wang, Shunyu Yao, Yichao Yan, Chen Yang, Guangtao Zhai, Junchi Yan, Xiaokang Yang

arXiv_CV

arXiv_CV Face Attention Quantitative Pose Dialog Chat

Abstract
Abstract (translated)
URL
PDF

Abstract

Conversation is an essential component of virtual avatar activities in the metaverse. With the development of natural language processing, textual and vocal conversation generation has achieved a significant breakthrough. Face-to-face conversations account for the vast majority of daily conversations. However, this task has not acquired enough attention. In this paper, we propose a novel task that aims to generate a realistic human avatar face-to-face conversation process and present a new dataset to explore this target. To tackle this novel task, we propose a new framework that utilizes a series of conversation signals, e.g. audio, head pose, and expression, to synthesize face-to-face conversation videos between human avatars, with all the interlocutors modeled within the same network. Our method is evaluated by quantitative and qualitative experiments in different aspects, e.g. image quality, pose sequence trend, and naturalness of the rendering videos. All the code, data, and models will be made publicly available.

Abstract (translated)

URL

https://arxiv.org/abs/2203.07931

PDF

https://arxiv.org/pdf/2203.07931.pdf