Paper Reading AI Learner

D-VAE: A Variational Autoencoder for Directed Acyclic Graphs

2019-04-24 22:22:57
Muhan Zhang, Shali Jiang, Zhicheng Cui, Roman Garnett, Yixin Chen

Abstract

Graph structured data are abundant in the real world. Among different graph types, directed acyclic graphs (DAGs) are of particular interests to machine learning researchers, as many machine learning models are realized as computations on DAGs, including neural networks and Bayesian networks. In this paper, we study deep generative models for DAGs, and propose a novel DAG variational autoencoder (D-VAE). To encode DAGs into the latent space, we leverage graph neural networks. We propose a DAG-style asynchronous message passing scheme that allows encoding the computations defined by DAGs, rather than using existing simultaneous message passing schemes to encode the graph structures. We demonstrate the effectiveness of our proposed D-VAE through two tasks: neural architecture search and Bayesian network structure learning. Experiments show that our model not only generates novel and valid DAGs, but also produces a smooth latent space that facilitates searching for DAGs with better performance through Bayesian optimization.

Abstract (translated)

图结构数据在现实世界中非常丰富。在不同的图类型中,有向无环图(DAG)是机器学习研究者特别感兴趣的,因为许多机器学习模型都是通过对DAG的计算来实现的,包括神经网络和贝叶斯网络。本文研究了DAG的深生成模型,提出了一种新型的DAG变分自动编码器(D-VAE)。为了将DAG编码到潜在空间,我们利用了图神经网络。我们提出了一种DAG风格的异步消息传递方案,它允许对DAG定义的计算进行编码,而不是使用现有的同步消息传递方案来对图形结构进行编码。我们通过两个任务来证明我们提出的D-VAE的有效性:神经架构搜索和贝叶斯网络结构学习。实验表明,该模型不仅生成了新颖有效的DAG,而且通过贝叶斯优化产生了一个平滑的潜在空间,便于搜索性能更好的DAG。

URL

https://arxiv.org/abs/1904.11088

PDF

https://arxiv.org/pdf/1904.11088.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot