Abstract
Image degradation is a prevalent issue in various real-world applications, affecting visual quality and downstream processing tasks. In this study, we propose a novel framework that employs a Vision-Language Model (VLM) to automatically classify degraded images into predefined categories. The VLM categorizes an input image into one of four degradation types: (A) super-resolution degradation (including noise, blur, and JPEG compression), (B) reflection artifacts, (C) motion blur, or (D) no visible degradation (high-quality image). Once classified, images assigned to categories A, B, or C undergo targeted restoration using dedicated models tailored for each specific degradation type. The final output is a restored image with improved visual quality. Experimental results demonstrate the effectiveness of our approach in accurately classifying image degradations and enhancing image quality through specialized restoration models. Our method presents a scalable and automated solution for real-world image enhancement tasks, leveraging the capabilities of VLMs in conjunction with state-of-the-art restoration techniques.
Abstract (translated)
图像退化是各种实际应用中的一个常见问题,它会影响视觉质量并影响后续处理任务。在这项研究中,我们提出了一种新颖的框架,该框架采用视觉-语言模型(VLM)自动将受损图像分类为预定义类别。VLM将输入图像归类为四种降质类型之一:(A) 超分辨率退化(包括噪声、模糊和JPEG压缩),(B) 反射伪影,(C) 运动模糊,或 (D) 无明显退化(高质量图像)。一旦分类完成,被分配到类别 A、B 或 C 的图像将使用针对每种特定降质类型量身定制的模型进行目标修复。最终输出是视觉质量得到提升的恢复后的图像。 实验结果证明了我们的方法在准确分类图像退化和通过专门的修复技术提高图像质量方面的有效性。本方法为实际图像增强任务提供了一种可扩展且自动化的解决方案,利用了VLM与最先进的修复技术相结合的能力。
URL
https://arxiv.org/abs/2506.05450