Factual or Biased? Predicting Sentence-Level Factuality and Bias of News

2023-01-27 16:56:24
Francielle Vargas, Fabiana Góes, Thiago A. S. Pardo, Fabrício Benevenuto

Abstract

We present a study on sentence-level factuality and bias of news articles across domains. While prior work in NLP has mainly focused on predicting the article-level factuality of news reporting and the political-ideological bias of news media, we investigate the effects of framing bias in factual reporting across domains so as to predict factuality and bias at the sentence level, which may explain the overall reliability of an entire document more accurately. First, we manually produced a large sentence-level annotated dataset, titled FactNews, composed of 6,191 sentences from 100 news stories, each reported by three different outlets, for a total of 300 news articles. Further, we studied how biased and factual spans surface in news articles from different media outlets and different domains. Lastly, we present a baseline model for factual sentence prediction, obtained by fine-tuning BERT. We also provide a detailed analysis of the data, demonstrating the reliability of the annotation and models.

URL

https://arxiv.org/abs/2301.11850

PDF

https://arxiv.org/pdf/2301.11850.pdf