Abstract
In this paper, we continue our previous work on the Dirichlet mixture model (DMM)-based VQ to derive the performance bound of the LSF VQ. The LSF parameters are transformed into the $\Delta$LSF domain and the underlying distribution of the $\Delta$LSF parameters are modelled by a DMM with finite number of mixture components. The quantization distortion, in terms of the mean squared error (MSE), is calculated with the high rate theory. The mapping relation between the perceptually motivated log spectral distortion (LSD) and the MSE is empirically approximated by a polynomial. With this mapping function, the minimum required bit rate for transparent coding of the LSF is estimated.
Abstract (translated)
在本文中,我们继续我们之前关于基于Dirichlet混合模型(DMM)的VQ的工作,以推导出LSF VQ的性能界限。 LSF参数被转换为$ \ Delta $ LSF域,$ \ Delta $ LSF参数的基础分布由具有有限数量的混合成分的DMM建模。用均方误差(MSE)表示的量化失真是用高速率理论计算的。感知动机对数谱失真(LSD)与MSE之间的映射关系通过多项式凭经验近似。利用该映射功能,估计LSF的透明编码所需的最小比特率。
URL
https://arxiv.org/abs/1808.00818