Advancing Minority Stress Detection with Transformers: Insights from the Social Media Datasets

2025-09-03 00:27:01
Santosh Chapagain, Cory J Cascalheira, Shah Muhammad Hamdi, Soukaina Filali Boubrahimi, Jillian R. Scheer

Abstract

Individuals from sexual and gender minority groups experience disproportionately high rates of poor health outcomes and mental disorders compared to their heterosexual and cisgender counterparts, largely as a consequence of minority stress as described by Meyer's (2003) model. This study presents the first comprehensive evaluation of transformer-based architectures for detecting minority stress in online discourse. We benchmark multiple transformer models, including ELECTRA, BERT, RoBERTa, and BART, against traditional machine learning baselines and graph-augmented variants. We further evaluate zero-shot and few-shot learning paradigms to gauge their applicability to underrepresented datasets. Experiments are conducted on the two largest publicly available Reddit corpora for minority stress detection, comprising 12,645 and 5,789 posts, and are repeated over five random seeds to ensure robustness. Our results demonstrate that integrating graph structure consistently improves the detection performance of transformer-only models and that supervised fine-tuning with relational context outperforms zero-shot and few-shot approaches. Theoretical analysis reveals that modeling social connectivity and conversational context via graph augmentation sharpens the models' ability to identify key linguistic markers such as identity concealment, internalized stigma, and calls for support, suggesting that graph-enhanced transformers offer the most reliable foundation for digital health interventions and public health policy.

URL

https://arxiv.org/abs/2509.02908

PDF

https://arxiv.org/pdf/2509.02908.pdf

