LOCL: Learning Object-Attribute Composition using Localization

2022-10-07 18:48:45

Satish Kumar, ASM Iftekhar, Ekta Prashnani, B.S.Manjunath

arXiv_CV

arXiv_CV Weakly_Supervised

Abstract
Abstract (translated)
URL
PDF

Abstract

This paper describes LOCL (Learning Object Attribute Composition using Localization) that generalizes composition zero shot learning to objects in cluttered and more realistic settings. The problem of unseen Object Attribute (OA) associations has been well studied in the field, however, the performance of existing methods is limited in challenging scenes. In this context, our key contribution is a modular approach to localizing objects and attributes of interest in a weakly supervised context that generalizes robustly to unseen configurations. Localization coupled with a composition classifier significantly outperforms state of the art (SOTA) methods, with an improvement of about 12% on currently available challenging datasets. Further, the modularity enables the use of localized feature extractor to be used with existing OA compositional learning methods to improve their overall performance.

Abstract (translated)

URL

https://arxiv.org/abs/2210.03780

PDF

https://arxiv.org/pdf/2210.03780.pdf