Paper Reading AI Learner

ChildGAN: Large Scale Synthetic Child Facial Data Using Domain Adaptation in StyleGAN

2023-07-25 18:04:52
Muhammad Ali Farooq, Wang Yao, Gabriel Costache, Peter Corcoran

Abstract

In this research work, we proposed a novel ChildGAN, a pair of GAN networks for generating synthetic boys and girls facial data derived from StyleGAN2. ChildGAN is built by performing smooth domain transfer using transfer learning. It provides photo-realistic, high-quality data samples. A large-scale dataset is rendered with a variety of smart facial transformations: facial expressions, age progression, eye blink effects, head pose, skin and hair color variations, and variable lighting conditions. The dataset comprises more than 300k distinct data samples. Further, the uniqueness and characteristics of the rendered facial features are validated by running different computer vision application tests which include CNN-based child gender classifier, face localization and facial landmarks detection test, identity similarity evaluation using ArcFace, and lastly running eye detection and eye aspect ratio tests. The results demonstrate that synthetic child facial data of high quality offers an alternative to the cost and complexity of collecting a large-scale dataset from real children.

Abstract (translated)

在本研究中,我们提出了一种新的ChildGAN,即从StyleGAN2中提取合成男孩和女孩面部数据的GAN网络,ChildGAN通过使用转移学习实现平滑域转换,提供逼真高质量的数据样本。它提供了多种智能面部变换,包括面部表情、年龄进展、眨眼效应、头部姿势、皮肤和头发颜色的变化,以及多种照明条件。该数据集包含超过300k个不同的数据样本。此外,通过运行不同的计算机视觉应用程序测试,包括基于卷积神经网络的儿童性别分类器、面部定位和面部地标检测测试、使用ArcFace进行身份相似度评估,最后运行眼检测和眼 aspect ratio测试,结果验证渲染面部特征的独特性和特征,证明高质量的合成儿童面部数据可以替代从真实儿童收集大规模数据的成本和复杂性。

URL

https://arxiv.org/abs/2307.13746

PDF

https://arxiv.org/pdf/2307.13746.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot