Paper Reading AI Learner

Looking Beyond Label Noise: Shifted Label Distribution Matters in Distantly Supervised Relation Extraction

2019-04-19 20:23:27
Maosen Zhang, Qinyuan Ye, Liyuan Liu, Xiang Ren

Abstract

In recent years there is surge of interest in applying distant supervision (DS) to automatically generate training data for relation extraction. However, despite extensive efforts have been done on constructing advanced neural models, our experiments reveal that these neural models demonstrate only similar (or even worse) performance as compared with simple, feature-based methods. In this paper, we conduct thorough analysis to answer the question what other factors limit the performance of DS-trained neural models? Our results show that shifted labeled distribution commonly exists on real-world DS datasets, and impact of such issue is further validated using synthetic datasets for all models. Building upon the new insight, we develop a simple yet effective adaptation method for DS methods, called bias adjustment, to update models learned over source domain (i.e., DS training set) with label distribution statistics estimated on target domain (i.e., evaluation set). Experiments demonstrate that bias adjustment achieves consistent performance gains on all methods, especially on neural models, with up to a 22% relative F1 improvement.

Abstract (translated)

近年来,人们对应用远程监控(DS)自动生成用于关系提取的训练数据越来越感兴趣。然而,尽管我们在构建高级神经模型方面做了大量的努力,我们的实验表明,与简单的基于特征的方法相比,这些神经模型仅表现出相似(甚至更差)的性能。在本文中,我们进行了深入的分析来回答这个问题:哪些其他因素限制了DS训练神经模型的性能?我们的研究结果表明,在真实的DS数据集上通常存在着移位标记分布,这一问题的影响通过对所有模型的合成数据集得到进一步验证。基于新的认识,我们开发了一种简单而有效的DS方法自适应方法,称为偏差调整,用目标域(即评估集)上估计的标签分布统计信息更新源域(即DS训练集)上学习的模型。实验证明,偏差调整在所有方法上,特别是在神经模型上,都能获得一致的性能提高,F1的相对改善率高达22%。

URL

https://arxiv.org/abs/1904.09331

PDF

https://arxiv.org/pdf/1904.09331.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot