Simple and effective localized attribute representations for zero-shot learning

2020-06-10 16:46:12

Shiqi Yang, Kai Wang, Luis Herranz, Joost van de Weijer

arXiv_CV

arXiv_CV Detection Attention Relation Pose Zero-Shot

Abstract

Zero-shot learning (ZSL) aims to discriminate images from unseen classes by exploiting relations to seen classes via their semantic descriptions. Some recent papers have shown the importance of localized features, together with fine-tuning the feature extractor, for obtaining discriminative and transferable features. However, these methods require complex attention or part detection modules to perform explicit localization in the visual space. In contrast, in this paper we propose localizing representations in the semantic/attribute space, with a simple but effective pipeline where localization is implicit. Focusing on attribute representations, we show that our method obtains state-of-the-art performance on the CUB and SUN datasets and achieves competitive results on the AWA2 dataset, outperforming generally more complex methods with explicit localization in the visual space. Our method is easy to implement and can serve as a new baseline for zero-shot learning.
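
The abstract does not spell out the pipeline, so the following is only a rough illustration of what localizing in attribute space could look like, not the authors' implementation. It assumes a convolutional feature map of shape (H, W, D), a hypothetical linear projection `attribute_filters` that scores every spatial location for every attribute, global max pooling so that each attribute picks its own strongest location, and cosine similarity against per-class attribute vectors. All function and parameter names are invented for this sketch.

```python
import numpy as np


def attribute_scores(feature_map, attribute_filters):
    """Pool one score per attribute from a spatial feature map.

    feature_map:       (H, W, D) array of local visual features (assumed input).
    attribute_filters: (D, A) array, a hypothetical linear map to A attributes.
    Returns an (A,) array of attribute scores.
    """
    # Per-location attribute responses, shape (H, W, A).
    maps = feature_map @ attribute_filters
    # Global max pooling over all locations: each attribute keeps the response
    # at its own best location, so no explicit part detector is involved.
    return maps.reshape(-1, maps.shape[-1]).max(axis=0)


def predict_class(scores, class_attributes):
    """Return the index of the class whose attribute vector best matches.

    class_attributes: (C, A) array of semantic descriptions, one row per class.
    Matching uses cosine similarity (an assumption of this sketch).
    """
    eps = 1e-12  # guards against division by zero for all-zero vectors
    s = scores / (np.linalg.norm(scores) + eps)
    norms = np.linalg.norm(class_attributes, axis=1, keepdims=True)
    c = class_attributes / (norms + eps)
    return int(np.argmax(c @ s))
```

Because the class attribute matrix is only consulted at prediction time, rows for unseen classes can be swapped in without retraining, which is the property zero-shot learning relies on.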

URL

https://arxiv.org/abs/2006.05938

PDF

https://arxiv.org/pdf/2006.05938.pdf