Vector Projection Network for Few-shot Slot Tagging in Natural Language Understanding

Abstract

Few-shot slot tagging has become appealing for rapid domain transfer and adaptation, motivated by the tremendous development of conversational dialogue systems. In this paper, we propose a vector projection network for few-shot slot tagging, which uses the projections of contextual word embeddings onto each target label vector as word-label similarities. Essentially, this approach is equivalent to a normalized linear model with an adaptive bias. A contrastive experiment demonstrates that our proposed vector-projection-based similarity metric significantly surpasses other variants. Specifically, in the five-shot setting on the SNIPS and NER benchmarks, our method outperforms the strongest few-shot learning baseline by $6.30$ and $13.79$ points in F$_1$ score, respectively. Our code will be released.
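The projection-based similarity described in the abstract can be illustrated with a minimal sketch. This is not the authors' released code: the function names, the toy 2-dimensional embeddings, and the choice to build each label vector by averaging its support-set embeddings are all assumptions made for illustration. The `bias` argument is a hypothetical placeholder for the adaptive bias the abstract mentions, whose exact form is not given here, so it defaults to zero.

```python
import math


def label_prototypes(support_embeddings, support_labels):
    """Average the support embeddings of each label.

    Assumption: label vectors are per-label means (prototype style);
    the abstract does not specify how label vectors are built.
    """
    sums, counts = {}, {}
    for vec, label in zip(support_embeddings, support_labels):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
            counts[label] = 0
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {lab: [s / counts[lab] for s in sums[lab]] for lab in sums}


def projection_score(word_vec, label_vec, bias=0.0):
    """Scalar projection of word_vec onto label_vec, plus an optional bias.

    Dividing the dot product by the label norm is the same as a linear
    model whose weight vector is normalized to unit length.
    """
    norm = math.sqrt(sum(c * c for c in label_vec))
    dot = sum(x * c for x, c in zip(word_vec, label_vec))
    return dot / norm + bias


def tag_word(word_vec, prototypes):
    """Return the best-scoring label and the full score table."""
    scores = {lab: projection_score(word_vec, vec)
              for lab, vec in prototypes.items()}
    return max(scores, key=scores.get), scores


# Toy support set: three embedded words with their slot labels.
protos = label_prototypes(
    [[3.0, 0.0], [1.0, 0.0], [0.0, 2.0]],
    ["B-city", "B-city", "O"],
)
label, scores = tag_word([1.0, 0.5], protos)
print(label, scores)
```

Because the score divides by the label norm only, it is insensitive to the scale of the label vector but still grows with the length of the word embedding, which distinguishes it from cosine similarity. The sketch does not guard against a zero-norm label vector.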

URL

https://arxiv.org/abs/2009.09568

PDF

https://arxiv.org/pdf/2009.09568.pdf