Paper Reading AI Learner

HySim: An Efficient Hybrid Similarity Measure for Patch Matching in Image Inpainting

2024-03-21 10:59:44
Saad Noufel, Nadir Maaroufi, Mehdi Najib, Mohamed Bakhouya

Abstract

Inpainting, for filling missing image regions, is a crucial task in various applications, such as medical imaging and remote sensing. Trending data-driven approaches efficiency, for image inpainting, often requires extensive data preprocessing. In this sense, there is still a need for model-driven approaches in case of application constrained with data availability and quality, especially for those related for time series forecasting using image inpainting techniques. This paper proposes an improved modeldriven approach relying on patch-based techniques. Our approach deviates from the standard Sum of Squared Differences (SSD) similarity measure by introducing a Hybrid Similarity (HySim), which combines both strengths of Chebychev and Minkowski distances. This hybridization enhances patch selection, leading to high-quality inpainting results with reduced mismatch errors. Experimental results proved the effectiveness of our approach against other model-driven techniques, such as diffusion or patch-based approaches, showcasing its effectiveness in achieving visually pleasing restorations.

Abstract (translated)

修复缺失图像区域是各种应用中的一项关键任务,如医学成像和遥感。趋势数据驱动的方法在图像修复方面效率很高,但通常需要进行大量的数据预处理。在应用受限数据可用性和质量的情况下,尤其是在与时间序列预测相关的应用中,模型驱动方法仍然有必要。本文提出了一种基于补丁技术的改进模型驱动方法。我们的方法与标准的平方差相似度(SSD)相似度度量方法有所区别,通过引入混合相似性(HySim),结合了切比雪夫距离和Minkowski距离的优势。这种杂糅增强了补丁选择,导致修复结果质量高,匹配误差降低。实验结果证明,我们的方法对其他模型驱动方法(如扩散或基于补丁的方法)的有效性进行了展示,突出了在实现观感良好的修复效果方面的有效性。

URL

https://arxiv.org/abs/2403.14292

PDF

https://arxiv.org/pdf/2403.14292.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot