Paper Reading AI Learner

Calibrated Out-of-Distribution Detection with a Generic Representation

2023-03-23 10:03:12
Tomas Vojir, Jan Sochman, Rahaf Aljundi, Jiri Matas

Abstract

Out-of-distribution detection is a common issue in deploying vision models in practice and solving it is an essential building block in safety critical applications. Existing OOD detection solutions focus on improving the OOD robustness of a classification model trained exclusively on in-distribution (ID) data. In this work, we take a different approach and propose to leverage generic pre-trained representations. We first investigate the behaviour of simple classifiers built on top of such representations and show striking performance gains compared to the ID trained representations. We propose a novel OOD method, called GROOD, that achieves excellent performance, predicated by the use of a good generic representation. Only a trivial training process is required for adapting GROOD to a particular problem. The method is simple, general, efficient, calibrated and with only a few hyper-parameters. The method achieves state-of-the-art performance on a number of OOD benchmarks, reaching near perfect performance on several of them. The source code is available at this https URL.

Abstract (translated)

分布不平衡检测是在实践中部署视觉模型的常见问题,并且解决它是安全关键应用中不可或缺的基本构建块。现有的分布不平衡检测解决方案专注于改善专门训练在分布(ID)数据上的分类模型的分布不平衡鲁棒性。在本研究中,我们采用了不同的方法,并提出了利用通用预训练表示的方法。我们首先研究了这些表示上简单的分类器的行为方式,并比ID训练表示表现出惊人的性能提升。我们提出了一种称为GROOD的新分布不平衡检测方法,它通过使用良好的通用表示实现卓越的性能。只需要一个简单的训练过程即可适应GROOD以特定问题。该方法简单、通用、高效、校准,且只需要几个超参数。该方法在多个分布不平衡基准测试中实现了最先进的性能,并在其中的几个测试中表现出几乎完美的性能。源代码可在这个httpsURL上获取。

URL

https://arxiv.org/abs/2303.13148

PDF

https://arxiv.org/pdf/2303.13148.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot