PURR: Efficiently Editing Language Model Hallucinations by Denoising Language Model Corruptions

2023-05-24 08:59:00
Anthony Chen, Panupong Pasupat, Sameer Singh, Hongrae Lee, Kelvin Guu

Abstract

The remarkable capabilities of large language models have been accompanied by a persistent drawback: the generation of false and unsubstantiated claims commonly known as "hallucinations". To combat this issue, recent research has introduced approaches that involve editing and attributing the outputs of language models, particularly through prompt-based editing. However, the inference cost and speed of using large language models for editing currently bottleneck prompt-based methods. These bottlenecks motivate the training of compact editors, which is challenging due to the scarcity of training data for this purpose. To overcome these challenges, we exploit the power of large language models to introduce corruptions (i.e., noise) into text and subsequently fine-tune compact editors to denoise the corruptions by incorporating relevant evidence. Our methodology is entirely unsupervised and provides us with faux hallucinations for training in any domain. Our Petite Unsupervised Research and Revision model, PURR, not only improves attribution over existing editing methods based on fine-tuning and prompting, but also achieves faster execution times by orders of magnitude.
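
The corrupt-then-denoise idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `toy_corrupt` function is a hypothetical rule-based stand-in for the large language model that introduces noise, and the `EditorExample` type, the `build_denoising_pairs` helper, and the "claim: ... evidence: ..." input format are all assumptions made for illustration. The sketch only shows how clean text, a corruption step, and evidence could be combined into (input, target) pairs for fine-tuning a compact editor.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class EditorExample:
    """One (input, target) pair for fine-tuning a compact editor (hypothetical type)."""
    source: str  # corrupted text plus evidence: what the editor reads
    target: str  # original clean text: what the editor should produce


def toy_corrupt(text: str) -> str:
    """Hypothetical stand-in for the LLM corruptor: swaps one known fact."""
    swaps = {"Paris": "Lyon", "1889": "1925"}
    for old, new in swaps.items():
        if old in text:
            return text.replace(old, new, 1)
    return text  # nothing to corrupt


def build_denoising_pairs(
    clean_texts: List[str],
    evidence: List[str],
    corrupt: Callable[[str], str] = toy_corrupt,
) -> List[EditorExample]:
    """Pair each corrupted text (with its evidence) with the clean original."""
    pairs = []
    for text, ev in zip(clean_texts, evidence):
        noisy = corrupt(text)
        if noisy == text:
            continue  # skip examples where no noise was introduced
        # Assumed input format; the paper's actual format may differ.
        source = f"claim: {noisy} evidence: {ev}"
        pairs.append(EditorExample(source=source, target=text))
    return pairs


if __name__ == "__main__":
    examples = build_denoising_pairs(
        ["The Eiffel Tower is in Paris."],
        ["The Eiffel Tower is a landmark in Paris, France."],
    )
    for ex in examples:
        print(ex.source, "->", ex.target)
```

Because the clean text serves as its own target, no human-labeled edits are needed, which is the sense in which the abstract calls the approach unsupervised. In a real pipeline the resulting pairs would feed a sequence-to-sequence fine-tuning loop, which is omitted here.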

URL

https://arxiv.org/abs/2305.14908

PDF

https://arxiv.org/pdf/2305.14908.pdf
