Information Retrieval for ZeroSpeech 2021: The Submission by University of Wroclaw

2021-06-22 08:30:41

Jan Chorowski, Grzegorz Ciesielski, Jarosław Dzikowski, Adrian Łańcucki, Ricard Marxer, Mateusz Opala, Piotr Pusz, Paweł Rychlikowski, Michał Stypułkowski

arXiv_SD

arXiv_SD GAN Language_Model Unsupervised Pose Matching Speech

Abstract
Abstract (translated)
URL
PDF

Abstract

We present a number of low-resource approaches to the tasks of the Zero Resource Speech Challenge 2021. We build on the unsupervised representations of speech proposed by the organizers as a baseline, derived from CPC and clustered with the k-means algorithm. We demonstrate that simple methods of refining those representations can narrow the gap, or even improve upon the solutions which use a high computational budget. The results lead to the conclusion that the CPC-derived representations are still too noisy for training language models, but stable enough for simpler forms of pattern matching and retrieval.

Abstract (translated)

URL

https://arxiv.org/abs/2106.11603

PDF

https://arxiv.org/pdf/2106.11603.pdf