Paper Reading AI Learner

Unifying Unsupervised Domain Adaptation and Zero-Shot Visual Recognition

2019-03-25 21:34:39
Qian Wang, Penghui Bu, Toby P. Breckon

Abstract

Unsupervised domain adaptation aims to transfer knowledge from a source domain to a target domain so that the target domain data can be recognized without any explicit labelling information for this domain. One limitation of the problem setting is that testing data, despite having no labels, from the target domain is needed during training, which prevents the trained model being directly applied to classify unseen test instances. We formulate a new cross-domain classification problem arising from real-world scenarios where labelled data is available for a subset of classes (known classes) in the target domain, and we expect to recognize new samples belonging to any class (known and unseen classes) once the model is learned. This is a generalized zero-shot learning problem where the side information comes from the source domain in the form of labelled samples instead of class-level semantic representations commonly used in traditional zero-shot learning. We present a unified domain adaptation framework for both unsupervised and zero-shot learning conditions. Our approach learns a joint subspace from source and target domains so that the projections of both data in the subspace can be domain invariant and easily separable. We use the supervised locality preserving projection (SLPP) as the enabling technique and conduct experiments under both unsupervised and zero-shot learning conditions, achieving state-of-the-art results on three domain adaptation benchmark datasets: Office-Caltech, Office31 and Office-Home.

Abstract (translated)

无监督域适应的目的是将知识从源域转移到目标域,这样目标域数据就可以被识别,而无需对此域进行任何明确的标记信息。问题设置的一个限制是,在培训期间,尽管没有标签,但仍需要来自目标域的测试数据,这会阻止直接应用培训模型对未看到的测试实例进行分类。我们提出了一个新的跨域分类问题,该问题源于现实场景,其中标记数据可用于目标域中类(已知类)的子集,我们希望在学习模型后识别属于任何类(已知类和未知类)的新样本。这是一个广义的零镜头学习问题,其中边信息以标记样本的形式来自源域,而不是传统零镜头学习中常用的类级语义表示。我们提出了一个统一的领域适应框架的无监督和零镜头学习条件。我们的方法从源域和目标域学习一个联合子空间,这样子空间中的两个数据的投影都可以是域不变的,并且容易分离。我们以受监督的位置保持投影(SLPP)为启用技术,在无监督和零镜头学习条件下进行实验,在三个领域适应基准数据集(Office Caltech、Office31和Office Home)上获得最新的结果。

URL

https://arxiv.org/abs/1903.10601

PDF

https://arxiv.org/pdf/1903.10601.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot