Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer

2022-06-26 10:12:59

Ailin Huang, Zhewei Huang, Shuchang Zhou

arXiv_CV

arXiv_CV Regularization Face

Abstract
Abstract (translated)
URL
PDF

Abstract

This paper reports our solution for MultiMedia ViCo 2022 Conversational Head Generation Challenge, which aims to generate vivid face-to-face conversation videos based on audio and reference images. Our solution focuses on training a generalized audio-to-head driver using regularization and assembling a high visual quality renderer. We carefully tweak the audio-to-behavior model and post-process the generated video using our foreground-background fusion module. We get first place in the listening head generation track and second place in the talking head generation track in the official ranking. Our code will be released.

Abstract (translated)

URL

https://arxiv.org/abs/2206.12837

PDF

https://arxiv.org/pdf/2206.12837.pdf

Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer

Abstract

Abstract (translated)

URL

PDF Copy

PDF