Diversity-Measurable Anomaly Detection

2023-03-09 05:52:42
Wenrui Liu, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen

Abstract

Reconstruction-based anomaly detection models achieve their purpose by suppressing the generalization ability for anomalies. However, diverse normal patterns are consequently not well reconstructed either. Although some efforts have been made to alleviate this problem by modeling sample diversity, they suffer from shortcut learning due to the undesired transmission of abnormal information. In this paper, to better handle this tradeoff, we propose the Diversity-Measurable Anomaly Detection (DMAD) framework to enhance reconstruction diversity while avoiding undesired generalization on anomalies. To this end, we design the Pyramid Deformation Module (PDM), which models diverse normal patterns and measures the severity of an anomaly by estimating multi-scale deformation fields from the reconstructed reference to the original input. Integrated with an information compression module, PDM essentially decouples deformation from the prototypical embedding and makes the final anomaly score more reliable. Experimental results on both surveillance videos and industrial images demonstrate the effectiveness of our method. In addition, DMAD works equally well in the presence of contaminated data and anomaly-like normal samples.
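
The abstract gives no formulas, so the following is only a rough illustration of the idea of scoring a sample by how far a reconstructed reference must be deformed to match the input. Everything in it is an assumption made for illustration and not the authors' implementation: the function names, the nearest-neighbour warping, the summation of coarse and fine offset fields, and the weighting factor alpha are all hypothetical.

```python
import numpy as np

def upsample(field, factor):
    """Nearest-neighbour upsampling of an (h, w, 2) offset field."""
    return np.repeat(np.repeat(field, factor, axis=0), factor, axis=1)

def compose_fields(fields, size):
    """Sum offset fields of different resolutions at full resolution.

    Assumes square fields whose side length divides `size`.
    """
    total = np.zeros((size, size, 2))
    for f in fields:
        total += upsample(f, size // f.shape[0])
    return total

def warp(reference, field):
    """Sample `reference` at (row + dy, col + dx), clamped to the border."""
    h, w = reference.shape
    rows, cols = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    src_r = np.clip(np.rint(rows + field[..., 0]).astype(int), 0, h - 1)
    src_c = np.clip(np.rint(cols + field[..., 1]).astype(int), 0, w - 1)
    return reference[src_r, src_c]

def anomaly_score(x, reference, fields, alpha=1.0):
    """Hypothetical score: residual error after warping plus mean offset size."""
    field = compose_fields(fields, x.shape[0])
    recon_err = float(np.mean((warp(reference, field) - x) ** 2))
    deform = float(np.mean(np.linalg.norm(field, axis=-1)))
    return recon_err + alpha * deform

# Toy usage: an input that equals the reference shifted by one column.
reference = np.arange(16, dtype=float).reshape(4, 4)
coarse = np.zeros((2, 2, 2))   # coarse level: no offset
fine = np.zeros((4, 4, 2))
fine[..., 1] = 1.0             # fine level: shift one column
x = warp(reference, compose_fields([coarse, fine], 4))
score = anomaly_score(x, reference, [coarse, fine])
```

In this toy case the warped reference matches the input exactly, so the score comes entirely from the deformation term. In the paper the fields are estimated by a learned module, whereas here they are supplied by hand.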

URL

https://arxiv.org/abs/2303.05047

PDF

https://arxiv.org/pdf/2303.05047.pdf

