Paper Reading AI Learner

Learning Manipulation Tasks in Dynamic and Shared 3D Spaces

2024-04-26 19:40:19
Hariharan Arunachalam, Marc Hanheide, Sariah Mghames

Abstract

Automating the segregation process is a need for every sector experiencing a high volume of materials handling, repetitive and exhaustive operations, in addition to risky exposures. Learning automated pick-and-place operations can be efficiently done by introducing collaborative autonomous systems (e.g. manipulators) in the workplace and among human operators. In this paper, we propose a deep reinforcement learning strategy to learn the place task of multi-categorical items from a shared workspace between dual-manipulators and to multi-goal destinations, assuming the pick has been already completed. The learning strategy leverages first a stochastic actor-critic framework to train an agent's policy network, and second, a dynamic 3D Gym environment where both static and dynamic obstacles (e.g. human factors and robot mate) constitute the state space of a Markov decision process. Learning is conducted in a Gazebo simulator and experiments show an increase in cumulative reward function for the agent further away from human factors. Future investigations will be conducted to enhance the task performance for both agents simultaneously.

Abstract (translated)

自动化分离过程是每个经历大量材料操作、重复且繁琐的操作,以及高风险暴露的行业的需要。通过在职场和人类操作员之间引入协作自主系统(例如操作器),可以高效地学习自动化选择和放置任务,从双臂操作器共享工作空间的多分类项目中学到。在本文中,我们提出了一个深度强化学习策略,从双臂操作器与多目标之间共享工作空间中学习多分类项目的放置位置,假设选择已经完成。学习策略利用随机演员-评论框架训练代理程序的策略网络,并利用动态3D Gym环境,其中静态和动态障碍(例如人为因素和机器人伴侣)构成一个马尔可夫决策过程的状态空间。学习是通过Gazebo仿真器进行的,实验结果表明,代理远离人类因素时,累积奖励函数增加。未来研究将同时对两个代理进行任务性能的提升。

URL

https://arxiv.org/abs/2404.17673

PDF

https://arxiv.org/pdf/2404.17673.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot