Semantic-aware Knowledge Distillation for Few-Shot Class-Incremental Learning

2021-03-06 08:07:26

Ali Cheraghian, Shafin Rahman, Pengfei Fang, Soumava Kumar Roy, Lars Petersson, Mehrtash Harandi

arXiv_CV

arXiv_CV Attention Embedding Knowledge Pose Few-Shot

Abstract
Abstract (translated)
URL
PDF

Abstract

Few-shot class incremental learning (FSCIL) portrays the problem of learning new concepts gradually, where only a few examples per concept are available to the learner. Due to the limited number of examples for training, the techniques developed for standard incremental learning cannot be applied verbatim to FSCIL. In this work, we introduce a distillation algorithm to address the problem of FSCIL and propose to make use of semantic information during training. To this end, we make use of word embeddings as semantic information which is cheap to obtain and which facilitate the distillation process. Furthermore, we propose a method based on an attention mechanism on multiple parallel embeddings of visual data to align visual and semantic vectors, which reduces issues related to catastrophic forgetting. Via experiments on MiniImageNet, CUB200, and CIFAR100 dataset, we establish new state-of-the-art results by outperforming existing approaches.

Abstract (translated)

URL

https://arxiv.org/abs/2103.04059

PDF

https://arxiv.org/pdf/2103.04059.pdf