Paper Reading AI Learner

ASCenD-BDS: Adaptable, Stochastic and Context-aware framework for Detection of Bias, Discrimination and Stereotyping

2025-02-04 07:44:20
Rajiv Bahl, Venkatesan N, Parimal Aglawe, Aastha Sarasapalli, Bhavya Kancharla, Chaitanya kolukuluri, Harish Mohite, Japneet Hora, Kiran Kakollu, Rahul Diman, Shubham Kapale, Sri Bhagya Kathula, Vamsikrishna Motru, Yogeshwar Reddy

Abstract

The rapid evolution of Large Language Models (LLMs) has transformed natural language processing but raises critical concerns about biases inherent in their deployment and use across diverse linguistic and sociocultural contexts. This paper presents a framework named ASCenD BDS (Adaptable, Stochastic and Context-aware framework for Detection of Bias, Discrimination and Stereotyping). The framework presents approach to detecting bias, discrimination, stereotyping across various categories such as gender, caste, age, disability, socioeconomic status, linguistic variations, etc., using an approach which is Adaptive, Stochastic and Context-Aware. The existing frameworks rely heavily on usage of datasets to generate scenarios for detection of Bias, Discrimination and Stereotyping. Examples include datasets such as Civil Comments, Wino Gender, WinoBias, BOLD, CrowS Pairs and BBQ. However, such an approach provides point solutions. As a result, these datasets provide a finite number of scenarios for assessment. The current framework overcomes this limitation by having features which enable Adaptability, Stochasticity, Context Awareness. Context awareness can be customized for any nation or culture or sub-culture (for example an organization's unique culture). In this paper, context awareness in the Indian context has been established. Content has been leveraged from Indian Census 2011 to have a commonality of categorization. A framework has been developed using Category, Sub-Category, STEM, X-Factor, Synonym to enable the features for Adaptability, Stochasticity and Context awareness. The framework has been described in detail in Section 3. Overall 800 plus STEMs, 10 Categories, 31 unique SubCategories were developed by a team of consultants at Saint Fox Consultancy Private Ltd. The concept has been tested out in SFCLabs as part of product development.

Abstract (translated)

大型语言模型(LLMs)的快速演变已经改变了自然语言处理领域,但同时也引发了关于其在多样化的语言和社会文化背景中部署和使用时内在偏见的重要担忧。本文提出了一种名为ASCenD BDS(可适应、随机化及上下文感知框架用于检测偏见、歧视与刻板印象)的框架。该框架提供了一种方法来检测跨多个类别(如性别、种姓、年龄、残疾、经济和社会地位、语言变化等)的偏见、歧视和刻板印象,其方法是可适应性、随机化及上下文感知。 现有的框架主要依赖于使用数据集生成检测偏见、歧视与刻板印象的情景。例如,Civil Comments、Wino Gender、WinoBias、BOLD、CrowS Pairs 和 BBQ 数据集等。然而,这种方法只提供点对点解决方案。因此,这些数据集仅提供了有限数量的评估情景。目前提出的框架通过具备可适应性、随机化及上下文感知功能克服了这一限制。 上下文感知可以根据任何国家或文化(例如一个组织的独特文化)进行定制。在本文中,已在印度语境下建立了上下文感知概念。内容利用来自2011年印度人口普查的数据来实现分类的一致性。通过类别、子类别、STEM(情境-主题-情感)、X-Factor 和同义词开发了一种框架,以支持可适应性、随机化及上下文感知的功能。该框架已在第3节中详细描述。 总体而言,圣福咨询私人有限公司的顾问团队开发了800多个STEM、10个类别和31个独特的子类别。这一概念已在SFCLabs的产品研发过程中进行了测试。

URL

https://arxiv.org/abs/2502.02072

PDF

https://arxiv.org/pdf/2502.02072.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Time_Series Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot