Fixing Gaussian Mixture VAEs for Interpretable Text Generation

2019-06-16 15:41:07

Wenxian Shi, Hao Zhou, Ning Miao, Shenjian Zhao, Lei Li

arXiv_CL

Abstract

Variational auto-encoders (VAEs) with Gaussian priors are effective in text generation. To improve controllability and interpretability, we propose using a Gaussian mixture distribution as the prior for the VAE (GMVAE), since it includes an extra discrete latent variable in addition to the continuous one. Unfortunately, training GMVAE with the standard variational approximation often leads to mode collapse. We theoretically analyze the root cause: maximizing the evidence lower bound of GMVAE implicitly aggregates the means of the multiple Gaussian priors. We propose Dispersed-GMVAE (DGMVAE), an improved model for text generation. It introduces two extra terms to alleviate mode collapse and to induce a better-structured latent space. Experimental results show that DGMVAE outperforms strong baselines on several language modeling and text generation benchmarks.
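To make the prior structure concrete, here is a minimal sketch of sampling from a Gaussian mixture prior: a discrete component index c is drawn first, then a continuous vector z from that component's diagonal Gaussian. The function names, parameters, and the `mean_dispersion` statistic are hypothetical and chosen for illustration only; in particular, the dispersion statistic is a simple diagnostic for how far apart the component means are, not one of the two extra terms the abstract mentions.

```python
import math
import random


def sample_gm_prior(means, log_vars, weights, n, seed=0):
    """Draw n (c, z) pairs from a Gaussian mixture prior.

    c ~ Categorical(weights)
    z | c ~ Normal(means[c], diag(exp(log_vars[c])))
    All argument names are illustrative assumptions.
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        # Discrete latent variable: which mixture component to use.
        c = rng.choices(range(len(weights)), weights=weights, k=1)[0]
        # Continuous latent variable: mean plus scaled standard normal noise.
        z = [
            m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(means[c], log_vars[c])
        ]
        samples.append((c, z))
    return samples


def mean_dispersion(means):
    """Average squared distance of the component means from their centroid.

    A value near zero indicates the means have merged, which is the
    symptom the abstract describes as mode collapse.
    """
    k, d = len(means), len(means[0])
    centroid = [sum(m[j] for m in means) / k for j in range(d)]
    return sum(
        sum((m[j] - centroid[j]) ** 2 for j in range(d)) for m in means
    ) / k
```

For example, two components with means (-3, 0) and (3, 0) are well separated, whereas two components sharing the mean (1, 1) are indistinguishable and the discrete variable carries no information about z.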

URL

https://arxiv.org/abs/1906.06719

PDF

https://arxiv.org/pdf/1906.06719.pdf