
Airy: Reading Robot Intent through Height and Sky

2025-10-09 16:07:30
Baoyang Chen, Xian Xu, Huamin Qu

Abstract

As industrial robots move into shared human spaces, their opaque decision-making threatens safety, trust, and public oversight. This artwork, Airy, asks whether complex multi-agent AI can become intuitively understandable by staging a competition between two reinforcement-trained robot arms that snap a bedsheet skyward. The installation builds on three design principles: competition as a clear metric (who lifts higher), embodied familiarity (audiences recognize fabric snapping), and sensor-to-sense mapping (robot cooperation or rivalry shown through forest and weather projections). Together these give viewers a visceral way to read machine intent. Observations from five international exhibitions indicate that audiences consistently read the robots' strategies, conflict, and cooperation in real time, with emotional reactions that mirror the system's internal state. The project shows how sensory metaphors can turn a black box into a public interface.

URL

https://arxiv.org/abs/2510.08381

PDF

https://arxiv.org/pdf/2510.08381.pdf

