Paper Reading AI Learner

Mitigating Negative Side Effects in Multi-Agent Systems Using Blame Assignment

2024-05-07 22:42:04
Pulkit Rustagi, Sandhya Saisubramanian

Abstract

When agents that are independently trained (or designed) to complete their individual tasks are deployed in a shared environment, their joint actions may produce negative side effects (NSEs). As their training does not account for the behavior of other agents or their joint action effects on the environment, the agents have no prior knowledge of the NSEs of their actions. We model the problem of mitigating NSEs in a cooperative multi-agent system as a Lexicographic Decentralized Markov Decision Process with two objectives. The agents must optimize the completion of their assigned tasks while mitigating NSEs. We assume independence of transitions and rewards with respect to the agents' tasks but the joint NSE penalty creates a form of dependence in this setting. To improve scalability, the joint NSE penalty is decomposed into individual penalties for each agent using credit assignment, which facilitates decentralized policy computation. Our results in simulation on three domains demonstrate the effectiveness and scalability of our approach in mitigating NSEs by updating the policies of a subset of agents in the system.

Abstract (translated)

当在共享环境中部署了那些经过独立训练(或设计)以完成各自任务的代理程序时,它们的联合行动可能会产生负面副作用(NSEs)。由于它们的训练没有考虑到其他代理程序的行为或它们联合行动对环境的影响,代理程序没有关于其行动NSEs的先验知识。我们将缓解NSEs的问题建模为合作多代理系统中的Lexicographic Decentralized Markov Decision Process,具有两个目标。代理程序必须在缓解NSEs的同时完成其分配的任务。我们假设与代理程序任务相关的转移和奖励是相互独立的,但联合NSE惩罚在某种程度上导致了这种设置中的一种形式上的依赖关系。为了提高可扩展性,联合NSE惩罚通过信用分配分解为每个代理程序的单独惩罚,这有助于促进分布式策略计算。我们在三个领域的模拟结果表明,通过更新系统中的部分代理程序策略,缓解NSEs的有效性和可扩展性。

URL

https://arxiv.org/abs/2405.04702

PDF

https://arxiv.org/pdf/2405.04702.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot