Astro-NER -- Astronomy Named Entity Recognition: Is GPT a Good Domain Expert Annotator?

2024-05-04 08:04:39
Julia Evans, Sameer Sadruddin, Jennifer D'Souza

Abstract

In this study, we address one of the challenges of developing NER models for scholarly domains, namely the scarcity of suitable labeled data. We experiment with an approach using predictions from a fine-tuned LLM to aid non-domain experts in annotating scientific entities within astronomy literature, with the goal of uncovering whether such a collaborative process can approximate domain expertise. Our results reveal moderate agreement between a domain expert and the LLM-assisted non-experts, as well as fair agreement between the domain expert and the LLM's predictions. In an additional experiment, we compare the performance of fine-tuned and default LLMs on this task. We also introduce a specialized scientific entity annotation scheme for astronomy, validated by a domain expert. Our approach adopts a scholarly research contribution-centric perspective, focusing exclusively on scientific entities relevant to the research theme. The resultant dataset, containing 5,000 annotated astronomy article titles, is made publicly available.
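The abstract reports "moderate" and "fair" agreement without naming the statistic behind those labels. These terms match the conventional Landis and Koch bands for Cohen's kappa, so the sketch below assumes, purely for illustration, that agreement is measured as kappa over per-token labels. The function names and the placeholder labels "A" and "B" are hypothetical and are not taken from the paper or its annotation scheme.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labeled the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both annotators must label the same non-empty item list")
    n = len(labels_a)
    # Observed agreement: fraction of items given identical labels.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of each annotator's marginal label frequencies.
    counts_a, counts_b = Counter(labels_a), Counter(labels_b)
    expected = sum(
        counts_a[label] * counts_b[label]
        for label in counts_a.keys() | counts_b.keys()
    ) / (n * n)
    if expected == 1.0:
        # Both annotators used one identical label throughout.
        return 1.0
    return (observed - expected) / (1.0 - expected)


def landis_koch_band(kappa):
    """Map a kappa value to the conventional Landis and Koch description."""
    if kappa < 0:
        return "poor"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


# Hypothetical labels for four tokens from two annotators.
expert = ["A", "A", "B", "B"]
assisted = ["A", "B", "B", "B"]
kappa = cohen_kappa(expert, assisted)
print(kappa, landis_koch_band(kappa))
```

Here the annotators agree on three of four tokens (observed agreement 0.75) against a chance agreement of 0.5, giving a kappa of 0.5, which falls in the "moderate" band.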


URL

https://arxiv.org/abs/2405.02602

PDF

https://arxiv.org/pdf/2405.02602.pdf
