Hard negative examples are hard, but useful

2020-07-24 19:34:58

Hong Xuan, Abby Stylianou, Xiaotong Liu, Robert Pless

arXiv_AI

arXiv_AI Embedding Image_Retrieval


Abstract

Triplet loss is an extremely common approach to distance metric learning. Representations of images from the same class are optimized to be mapped closer together in an embedding space than representations of images from different classes. Much work on triplet losses focuses on selecting the most useful triplets of images to consider, with strategies that select dissimilar examples from the same class or similar examples from different classes. The consensus of previous research is that optimizing with the hardest negative examples leads to bad training behavior. That's a problem: these hardest negatives are literally the cases where the distance metric fails to capture semantic similarity. In this paper, we characterize the space of triplets and derive why hard negatives make triplet loss training fail. We offer a simple fix to the loss function and show that, with this fix, optimizing with hard negative examples becomes feasible. This leads to more generalizable features, and image retrieval results that outperform the state of the art for datasets with high intra-class variance.
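
For readers unfamiliar with the setup, the following is a minimal NumPy sketch of a standard margin-based triplet loss with in-batch hardest-negative selection, the setting the abstract describes as prone to bad training behavior. It is not the fix proposed in the paper, which the abstract does not spell out. The function name, the margin value, the use of L2-normalized embeddings, and the choice of the farthest same-class example as the positive are all assumptions made for illustration.

```python
import numpy as np


def hardest_negative_triplet_loss(embeddings, labels, margin=0.2):
    """Mean margin-based triplet loss over a batch (illustrative sketch).

    For each anchor, the positive is the farthest same-class example and
    the negative is the closest different-class example (the "hardest"
    negative). The margin of 0.2 is an arbitrary illustrative value.
    """
    embeddings = np.asarray(embeddings, dtype=float)
    labels = np.asarray(labels)

    # Assumption: embeddings are L2-normalized before computing distances.
    emb = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)

    # Pairwise Euclidean distances, shape (batch, batch).
    diff = emb[:, None, :] - emb[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))

    same_class = labels[:, None] == labels[None, :]
    not_self = ~np.eye(len(labels), dtype=bool)
    pos_mask = same_class & not_self
    neg_mask = ~same_class

    # Hardest positive: largest distance among same-class examples.
    pos_dist = np.where(pos_mask, dist, -np.inf).max(axis=1)
    # Hardest negative: smallest distance among different-class examples.
    neg_dist = np.where(neg_mask, dist, np.inf).min(axis=1)

    # Only anchors with at least one positive and one negative contribute.
    valid = pos_mask.any(axis=1) & neg_mask.any(axis=1)
    if not valid.any():
        return 0.0

    losses = np.maximum(pos_dist - neg_dist + margin, 0.0)
    return float(losses[valid].mean())


# Four toy 2-D embeddings forming two tight clusters.
points = np.array([[1.0, 0.0], [1.0, 0.01], [0.0, 1.0], [0.01, 1.0]])

# Labels that agree with the clusters: every triplet satisfies the margin.
easy_loss = hardest_negative_triplet_loss(points, np.array([0, 0, 1, 1]))

# Labels that cut across the clusters: each anchor's nearest neighbor is a
# negative, so the hardest negative is closer than the positive.
hard_loss = hardest_negative_triplet_loss(points, np.array([0, 1, 0, 1]))
```

In the second call every anchor's hardest negative lies closer than its positive, which is the kind of triplet the abstract identifies as a case where the distance metric fails to capture semantic similarity.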

URL

https://arxiv.org/abs/2007.12749

PDF

https://arxiv.org/pdf/2007.12749.pdf