Paper Reading AI Learner

TimbreCLIP: Connecting Timbre to Text and Images

2022-11-21 07:40:01

Nicolas Jonason, Bob L.T. Sturm

arXiv_SD

arXiv_SD Embedding

Abstract
Abstract (translated)
URL
PDF

Abstract

We present work in progress on TimbreCLIP, an audio-text cross modal embedding trained on single instrument notes. We evaluate the models with a cross-modal retrieval task on synth patches. Finally, we demonstrate the application of TimbreCLIP on two tasks: text-driven audio equalization and timbre to image generation.

Abstract (translated)

URL

https://arxiv.org/abs/2211.11225

PDF

https://arxiv.org/pdf/2211.11225.pdf