Paper Reading AI Learner

Revisiting SVD and Wavelet Difference Reduction for Lossy Image Compression: A Reproducibility Study

2025-12-06 00:00:51
Alena Makarova

Abstract

This work presents an independent reproducibility study of a lossy image compression technique that integrates singular value decomposition (SVD) and wavelet difference reduction (WDR). The original paper claims that combining SVD and WDR yields better visual quality and higher compression ratios than JPEG2000 and standalone WDR. I re-implemented the proposed method, carefully examined missing implementation details, and replicated the original experiments as closely as possible. I then conducted additional experiments on new images and evaluated performance using PSNR and SSIM. In contrast to the original claims, my results indicate that the SVD+WDR technique generally does not surpass JPEG2000 or WDR in terms of PSNR, and only partially improves SSIM relative to JPEG2000. The study highlights ambiguities in the original description (e.g., quantization and threshold initialization) and illustrates how such gaps can significantly impact reproducibility and reported performance.

Abstract (translated)

这项工作提出了一种对一种结合奇异值分解(SVD)和小波差分减少(WDR)的有损图像压缩技术进行独立再现性研究。原论文声称,将SVD与WDR相结合能够比JPEG2000和单独使用WDR提供更好的视觉质量和更高的压缩比率。我重新实现了所提出的方法,仔细审查了缺失的实现细节,并尽可能地复制了原始实验。然后,我在新的图像上进行了额外的实验,并利用PSNR(峰值信噪比)和SSIM(结构相似性指数)对性能进行了评估。 与原论文中的说法相反,我的结果表明SVD+WDR技术在PSNR方面通常不如JPEG2000或WDR,在SSIM方面的改善也仅部分优于JPEG2000。这项研究强调了原始描述中存在的模糊之处(例如量化和阈值初始化),并展示了这些缺口如何显著影响再现性和报告的性能。

URL

https://arxiv.org/abs/2512.06221

PDF

https://arxiv.org/pdf/2512.06221.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Time_Series Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot