Paper Reading AI Learner

Inverse Mixed Strategy Games with Generative Trajectory Models

2025-02-05 16:53:34
Max Muchen Sun, Pete Trautman, Todd Murphey

Abstract

Game-theoretic models are effective tools for modeling multi-agent interactions, especially when robots need to coordinate with humans. However, applying these models requires inferring their specifications from observed behaviors -- a challenging task known as the inverse game problem. Existing inverse game approaches often struggle to account for behavioral uncertainty and measurement noise, and leverage both offline and online data. To address these limitations, we propose an inverse game method that integrates a generative trajectory model into a differentiable mixed-strategy game framework. By representing the mixed strategy with a conditional variational autoencoder (CVAE), our method can infer high-dimensional, multi-modal behavior distributions from noisy measurements while adapting in real-time to new observations. We extensively evaluate our method in a simulated navigation benchmark, where the observations are generated by an unknown game model. Despite the model mismatch, our method can infer Nash-optimal actions comparable to those of the ground-truth model and the oracle inverse game baseline, even in the presence of uncertain agent objectives and noisy measurements.

Abstract (translated)

游戏理论模型是模拟多智能体交互的有效工具,尤其是在机器人需要与人类进行协调时。然而,应用这些模型需要从观察到的行为中推断其规范——这是一个被称为逆向游戏问题的艰巨任务。现有的逆向游戏方法通常难以应对行为不确定性及测量噪声,并且依赖于离线和在线数据。为了克服这些限制,我们提出了一种结合生成轨迹模型与可微混合策略博弈框架的逆向游戏方法。通过用条件变分自动编码器(CVAE)表示混合策略,我们的方法可以从嘈杂的测量中推断出高维、多模态的行为分布,并实时适应新的观察结果。 我们在一个模拟导航基准测试中全面评估了该方法,在这个基准测试中,观测数据是由未知游戏模型生成的。即使存在模型不匹配的问题,当面对不确定的目标和噪声测量时,我们的方法仍然能够推断出与真实模型及Oracle逆向博弈基线相当的纳什最优行动。 这种方法在处理复杂、不确定性高的多智能体系统中展现出强大的潜力,尤其是涉及人类机器人交互的应用场景中。

URL

https://arxiv.org/abs/2502.03356

PDF

https://arxiv.org/pdf/2502.03356.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Time_Series Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot