Abstract
In this paper, we propose a highly efficient method to estimate an image's mean opinion score (MOS) from a single opinion score (SOS). Assuming that each SOS is an observed sample from a normal distribution whose unknown expectation is the MOS, we formulate MOS inference as a maximum likelihood estimation problem, in which the perceptual correlation between pairs of images is taken into account when modeling the likelihood of the SOS. More specifically, using the quality-aware representations learned by a self-supervised backbone, we introduce a learnable relative quality measure that predicts the MOS difference between two images. The maximum likelihood estimate of the current image's MOS is then expressed as the sum of a reference image's estimated MOS and the relative quality between the two images. Ideally, the estimated MOS of the current image should remain unchanged no matter which image is selected as the reference; we term this perceptual constancy constrained calibration (PC3). Finally, we alternately optimize the parameters of the relative quality measure and the estimated MOS of the current image, via backpropagation and Newton's method respectively. Experiments show that the proposed method is efficient in calibrating the biased SOS and significantly improves IQA model learning when only SOSs are available.
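The calibration idea can be illustrated with a small numerical sketch. This is not the paper's objective or implementation: the abstract does not give the exact likelihood, so the quadratic loss below, the weight `lam`, the function names, and the use of an oracle relative-quality matrix (standing in for the learned measure) are all assumptions made for illustration. The sketch keeps each estimate close to its SOS (a Gaussian term with a shared variance that is absorbed into `lam`) while asking that the difference between any two estimates match their relative quality, and it solves for the estimates with Newton's method.

```python
import numpy as np


def toy_gradient(q, sos, rel_quality, lam):
    """Gradient of an assumed toy objective (not the paper's likelihood):
    0.5 * sum_i (q_i - s_i)^2 + 0.5 * lam * sum_{i,j} (q_i - q_j - d_ij)^2
    """
    # r[i, j] = q_i - q_j - d_ij, the pairwise consistency residual
    r = q[:, None] - q[None, :] - rel_quality
    return (q - sos) + lam * (r.sum(axis=1) - r.sum(axis=0))


def newton_calibrate(sos, rel_quality, lam=1.0, steps=1):
    """Refine MOS estimates with Newton steps on the toy objective."""
    n = sos.shape[0]
    q = np.asarray(sos, dtype=float).copy()  # start from the raw SOS
    # The toy objective is quadratic, so its Hessian is constant:
    # I + 2 * lam * (n * I - 1 1^T)
    hess = np.eye(n) + 2.0 * lam * (n * np.eye(n) - np.ones((n, n)))
    for _ in range(steps):
        grad = toy_gradient(q, sos, rel_quality, lam)
        q = q - np.linalg.solve(hess, grad)
    return q


def rmse(a, b):
    return float(np.sqrt(np.mean((a - b) ** 2)))


# Synthetic data: hypothetical ground-truth MOS and one noisy rating per image
rng = np.random.default_rng(0)
mos_true = rng.uniform(1.0, 5.0, size=8)
sos = mos_true + rng.normal(0.0, 0.5, size=8)

# Oracle relative quality d_ij = MOS_i - MOS_j; in the paper this role is
# played by a learned measure, which this sketch does not train
rel_quality = mos_true[:, None] - mos_true[None, :]

mos_hat = newton_calibrate(sos, rel_quality, lam=1.0)
print("RMSE of raw SOS:        ", rmse(sos, mos_true))
print("RMSE of calibrated MOS: ", rmse(mos_hat, mos_true))
```

Because this toy objective is quadratic, a single Newton step reaches its minimizer. In the setting the abstract describes, the relative quality measure is itself learned, so its parameters and the MOS estimates would be updated in alternation rather than solved in one shot.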
URL
https://arxiv.org/abs/2404.19595