Paper Reading AI Learner

Improving Disease Detection from Social Media Text via Self-Augmentation and Contrastive Learning

2024-04-30 15:24:54
Pervaiz Iqbal Khan, Andreas Dengel, Sheraz Ahmed

Abstract

Detecting diseases from social media has diverse applications, such as public health monitoring and disease spread detection. While language models (LMs) have shown promising performance in this domain, there remains ongoing research aimed at refining their discriminating representations. In this paper, we propose a novel method that integrates Contrastive Learning (CL) with language modeling to address this challenge. Our approach introduces a self-augmentation method, wherein hidden representations of the model are augmented with their own representations. This method comprises two branches: the first branch, a traditional LM, learns features specific to the given data, while the second branch incorporates augmented representations from the first branch to encourage generalization. CL further refines these representations by pulling pairs of original and augmented versions closer while pushing other samples away. We evaluate our method on three NLP datasets encompassing binary, multi-label, and multi-class classification tasks involving social media posts related to various diseases. Our approach demonstrates notable improvements over traditional fine-tuning methods, achieving up to a 2.48% increase in F1-score compared to baseline approaches and a 2.1% enhancement over state-of-the-art methods.

Abstract (translated)

检测社交媒体上的疾病具有多种应用,例如公共卫生监测和疾病传播检测。虽然语言模型(LMs)在此领域显示出有前景的表现,但仍然有持续的研究旨在改进它们的区分性表示。在本文中,我们提出了一种将对比学习(CL)与语言建模相结合的新方法,以解决这个挑战。我们的方法引入了一种自增强方法,其中模型的隐藏表示被自身表示增强。这个方法包括两个分支:第一个分支是一个传统的LM,学习给定数据的特征,而第二个分支从第一个分支中引入增强表示,以鼓励泛化。CL通过将原始和增强版本之间的对对距离拉近,同时将其他样本推开,进一步优化这些表示。我们在三个涵盖二元、多标签和多分类任务的社会媒体帖子相关的NLP数据集上评估我们的方法。我们的方法在传统微调方法上取得了显著的改进,与基线方法相比,F1分数提高了2.48%,与最先进的 methods相比,F1分数提高了2.1%。

URL

https://arxiv.org/abs/2405.01597

PDF

https://arxiv.org/pdf/2405.01597.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot