Paper Reading AI Learner

AlphaGAN: Generative adversarial networks for natural image matting

2018-07-26 12:17:22
Sebastian Lutz, Konstantinos Amplianitis, Aljosa Smolic

Abstract

We present the first generative adversarial network (GAN) for natural image matting. Our novel generator network is trained to predict visually appealing alphas with the addition of the adversarial loss from the discriminator that is trained to classify well-composited images. Further, we improve existing encoder-decoder architectures to better deal with the spatial localization issues inherited in convolutional neural networks (CNN) by using dilated convolutions to capture global context information without downscaling feature maps and losing spatial information. We present state-of-the-art results on the alphamatting online benchmark for the gradient error and give comparable results in others. Our method is particularly well suited for fine structures like hair, which is of great importance in practical matting applications, e.g. in film/TV production.

Abstract (translated)

我们提出了第一个用于自然图像消光的生成对抗网络(GAN)。我们的新型发电机网络经过培训,可以预测视觉上吸引人的α,同时增加来自鉴别器的对抗性损失,该鉴别器经过训练以对良好合成的图像进行分类。此外,我们改进现有的编码器 - 解码器架构,以通过使用扩张的卷积来捕获全局上下文信息而不缩减特征映射和丢失空间信息,从而更好地处理卷积神经网络(CNN)中继承的空间定位问题。我们在梯度误差的alphamatting在线基准测试中提供了最先进的结果,并在其他方面给出了可比较的结果。我们的方法特别适用于头发等精细结构,这在实际的消光应用中非常重要,例如,在电影/电视制作。

URL

https://arxiv.org/abs/1807.10088

PDF

https://arxiv.org/pdf/1807.10088.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot