Paper Reading AI Learner

Knowledge Transfer for Cross-Domain Reinforcement Learning: A Systematic Review

2024-04-26 20:36:58
Sergio A. Serrano, Jose Martinez-Carranza, L. Enrique Sucar

Abstract

Reinforcement Learning (RL) provides a framework in which agents can be trained, via trial and error, to solve complex decision-making problems. Learning with little supervision causes RL methods to require large amounts of data, which renders them too expensive for many applications (e.g. robotics). By reusing knowledge from a different task, knowledge transfer methods present an alternative to reduce the training time in RL. Given how severe data scarcity can be, there has been a growing interest for methods capable of transferring knowledge across different domains (i.e. problems with different representation) due to the flexibility they offer. This review presents a unifying analysis of methods focused on transferring knowledge across different domains. Through a taxonomy based on a transfer-approach categorization, and a characterization of works based on their data-assumption requirements, the objectives of this article are to 1) provide a comprehensive and systematic revision of knowledge transfer methods for the cross-domain RL setting, 2) categorize and characterize these methods to provide an analysis based on relevant features such as their transfer approach and data requirements, and 3) discuss the main challenges regarding cross-domain knowledge transfer, as well as ideas of future directions worth exploring to address these problems.

Abstract (translated)

强化学习(RL)提供了一个框架,让智能体通过尝试和错误,训练来解决复杂的决策问题。少量的监督学习导致RL方法需要大量数据,这使得它们对许多应用(如机器人学)来说过于昂贵。通过将不同任务中的知识进行重用,知识迁移方法提供了一种减少RL培训时间的方法。由于数据稀缺的严重程度,人们对能够在不同领域之间转移知识的方法产生了浓厚兴趣,因为它们提供了灵活性。本文对关注跨领域知识传递的方法进行了统一分析。通过基于迁移方法分类的树状结构,以及根据数据假设要求对作品进行特征描述,本文的目的是1)提供一个全面的关于跨领域RL设置中的知识传递方法的全面和系统的回顾,2)对这类方法进行分类和定性,以便根据其迁移方法和数据需求提供有关其相关特征的分析,3)讨论跨领域知识传递的主要挑战以及值得探索的未来方向来解决这些问题。

URL

https://arxiv.org/abs/2404.17687

PDF

https://arxiv.org/pdf/2404.17687.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot