Tracing and Removing Data Errors in Natural Language Generation Datasets

2022-12-21 02:28:07

Faisal Ladhak, Esin Durmus, Tatsunori Hashimoto

arXiv_CL

arXiv_CL Summarization Pose

Abstract
Abstract (translated)
URL
PDF

Abstract

Recent work has identified noisy and misannotated data as a core cause of hallucinations and unfaithful outputs in Natural Language Generation (NLG) tasks. Consequently, identifying and removing these examples is a key open challenge in creating reliable NLG systems. In this work, we introduce a framework to identify and remove low-quality training instances that lead to undesirable outputs, such as faithfulness errors in text summarization. We show that existing approaches for error tracing, such as gradient-based influence measures, do not perform reliably for detecting faithfulness errors in summarization. We overcome the drawbacks of existing error tracing methods through a new, contrast-based estimate that compares undesired generations to human-corrected outputs. Our proposed method can achieve a mean average precision of 0.91 across synthetic tasks with known ground truth and can achieve a two-fold reduction in hallucinations on a real entity hallucination evaluation on the NYT dataset.

Abstract (translated)

URL

https://arxiv.org/abs/2212.10722

PDF

https://arxiv.org/pdf/2212.10722.pdf