Temporal Relation Extraction in Clinical Texts: A Span-based Graph Transformer Approach

2025-03-23 14:34:49
Rochana Chaturvedi, Peyman Baghershahi, Sourav Medya, Barbara Di Eugenio

Abstract

Temporal information extraction from unstructured text is essential for contextualizing events and deriving actionable insights, particularly in the medical domain. We address the task of extracting clinical events and their temporal relations using the well-studied I2B2 2012 Temporal Relations Challenge corpus. This task is inherently challenging due to complex clinical language, long documents, and sparse annotations. We introduce GRAPHTREX, a novel method that integrates span-based entity-relation extraction, clinical large pre-trained language models (LPLMs), and Heterogeneous Graph Transformers (HGT) to capture local and global dependencies. Our HGT component propagates information across the document through global landmarks that bridge distant entities. Our method improves on the state of the art, with a 5.5% gain in the tempeval $F_1$ score over the previous best and a gain of up to 8.9% on long-range relations, which present a formidable challenge. This work not only advances temporal information extraction but also lays the groundwork for improved diagnostic and prognostic models through enhanced temporal reasoning.
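
The abstract does not spell out how the global landmarks are built, so the following is only a toy sketch of the general idea of bridging distant entities, not the GRAPHTREX implementation. It assumes a hypothetical document graph in which each entity links only to its immediate neighbors, then adds a single hypothetical `LANDMARK` node connected to every entity and compares shortest-path lengths. The names `build_graph`, `hops`, and `window` are illustrative assumptions.

```python
from collections import deque


def build_graph(num_entities, window=1, landmark=False):
    """Adjacency sets for a toy document graph (hypothetical construction)."""
    graph = {i: set() for i in range(num_entities)}
    # Local edges: each entity links to entities at most `window` positions away.
    for i in range(num_entities):
        for j in range(i + 1, min(i + window, num_entities - 1) + 1):
            graph[i].add(j)
            graph[j].add(i)
    if landmark:
        # Assumption: one global landmark node connected to every entity.
        graph["LANDMARK"] = set(range(num_entities))
        for i in range(num_entities):
            graph[i].add("LANDMARK")
    return graph


def hops(graph, src, dst):
    """Breadth-first search; returns the edge count of the shortest path, or None."""
    seen = {src}
    queue = deque([(src, 0)])
    while queue:
        node, dist = queue.popleft()
        if node == dst:
            return dist
        for nxt in graph[node]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return None


local_only = build_graph(10)
with_landmark = build_graph(10, landmark=True)
print(hops(local_only, 0, 9))     # 9 hops along the local chain
print(hops(with_landmark, 0, 9))  # 2 hops via the landmark node
```

In a message-passing model, a shorter path means information from one entity can reach a distant one in fewer layers, which is one plausible reading of why landmarks would help on long-range relations.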

URL

https://arxiv.org/abs/2503.18085

PDF

https://arxiv.org/pdf/2503.18085.pdf

