FashionCLIP: Connecting Language and Images for Product Representations

2022-04-08 10:01:39

Patrick John Chia, Giuseppe Attanasio, Federico Bianchi, Silvia Terragni, Ana Rita Magalhães, Diogo Goncalves, Ciro Greco, Jacopo Tagliabue

arXiv_CL

arXiv_CL Classification Contrastive_Learning

Abstract
Abstract (translated)
URL
PDF

Abstract

The steady rise of online shopping goes hand in hand with the development of increasingly complex ML and NLP models. While most use cases are cast as specialized supervised learning problems, we argue that practitioners would greatly benefit from more transferable representations of products. In this work, we build on recent developments in contrastive learning to train FashionCLIP, a CLIP-like model for the fashion industry. We showcase its capabilities for retrieval, classification and grounding, and release our model and code to the community.

Abstract (translated)

URL

https://arxiv.org/abs/2204.03972

PDF

https://arxiv.org/pdf/2204.03972.pdf