FastCLIPStyler: Towards fast text-based image style transfer using style representation

2022-10-07 11:16:36

Ananda Padhmanabhan Suresh, Sanjana Jain, Pavit Noinongyao, Ankush Ganguly

arXiv_AI

arXiv_AI Style_Transfer Inference

Abstract
Abstract (translated)
URL
PDF

Abstract

Artistic style transfer is usually performed between two images, a style image and a content image. Recently, a model named CLIPStyler demonstrated that a natural language description of style could replace the necessity of a reference style image. They achieved this by taking advantage of the CLIP model, which can compute the similarity between a text phrase and an image. In this work, we demonstrate how combining CLIPStyler with a pre-trained, purely vision-based style transfer model can significantly reduce the inference time of CLIPStyler. We call this model FastCLIPStyler. We do a qualitative exploration of the stylised images from both models and argue that our model also has merits in terms of the visual aesthetics of the generated images. Finally, we also point out how FastCLIPStyler can be used to further extend this line of research to create a generalised text-to-style model that does not require optimisation at inference time, which both CLIPStyler and FastCLIPStyler do currently.

Abstract (translated)

URL

https://arxiv.org/abs/2210.03461

PDF

https://arxiv.org/pdf/2210.03461.pdf