Reinforcement_Learning
Reinforcement_Learning
2023-02-01
Efficient Multi-Task Reinforcement Learning via Selective Behavior Sharing
Grace Zhang, Ayush Jain, Injune Hwang, Shao-Hua Sun, Joseph J. Lim
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2023-02-01
Off-the-Grid MARL: a Framework for Dataset Generation with Baselines for Cooperative Offline Multi-Agent Reinforcement Learning
Claude Formanek, Asad Jeewa, Jonathan Shock, Arnu Pretorius
arXiv_AI
arXiv_AI
Reinforcement_Learning
Sparse
Autonomous
PDF
2023-02-01
Alphazzle: Jigsaw Puzzle Solver with Deep Monte-Carlo Tree Search
Marie-Morgane Paumard, Hedi Tabia, David Picard
arXiv_CV
arXiv_CV
Reinforcement_Learning
Optimization
PDF
2023-02-01
Selective Uncertainty Propagation in Offline RL
Sanath Kumar Krishnamurthy, Tanmay Gangwani, Sumeet Katariya, Branislav Kveton, Anshuka Rangi
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2023-02-01
Internally Rewarded Reinforcement Learning
Mengdi Li, Xufeng Zhao, Jae Hee Lee, Cornelius Weber, Stefan Wermter
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2023-02-01
Bridging Physics-Informed Neural Networks with Reinforcement Learning: Hamilton-Jacobi-Bellman Proximal Policy Optimization
Amartya Mukherjee, Jun Liu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Action
PDF
2023-01-31
NASiam: Efficient Representation Learning using Neural Architecture Search for Siamese Networks
Alexandre Heuillet, Hedi Tabia, Hichem Arioui
arXiv_AI
arXiv_AI
NAS
Reinforcement_Learning
Represenation_Learning
Self-Supervised
Contrastive_Learning
Classification
Deep_Learning
Attention
CNN
Image_Classification
PDF
2023-01-31
Execution-based Code Generation using Deep Reinforcement Learning
Parshin Shojaee, Aneesh Jain, Sindhu Tipirneni, Chandan K. Reddy
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Knowledge
Pose
Text_Generation
PDF
2023-01-31
Learning Roles with Emergent Social Value Orientations
Wenhao Li, Xiangfeng Wang, Bo Jin, Jingyi Lu, Hongyuan Zha
arXiv_AI
arXiv_AI
Embedding
Reinforcement_Learning
Pose
PDF
2023-01-31
Learning, Fast and Slow: A Goal-Directed Memory-Based Approach for Dynamic Environments
Tan Chong Min John, Mehul Motani
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Prediction
PDF
2023-01-31
Toward Efficient Gradient-Based Value Estimation
Arsalan Sharifnassab, Richard Sutton
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2023-01-31
Retrosynthetic Planning with Dual Value Networks
Guoqing Liu, Di Xue, Shufang Xie, Yingce Xia, Austin Tripp, Krzysztof Maziarz, Marwin Segler, Tao Qin, Zongzhang Zhang, Tie-Yan Liu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2023-01-31
Reinforcement learning and decision making via single-photon quantum walks
Fulvio Flamini, Marius Krumm, Lukas J. Fiderer, Thomas Müller, Hans J. Briegel
arXiv_AI
arXiv_AI
Transfer_Learning
Reinforcement_Learning
Pose
PDF
2023-01-31
Spyker: High-performance Library for Spiking Deep Neural Networks
Shahriar Rezghi Shirsavar, Mohammad-Reza A. Dehaqani
arXiv_CV
arXiv_CV
Reinforcement_Learning
Knowledge
Pose
PDF
2023-01-31
Anti-Exploration by Random Network Distillation
Alexander Nikulin, Vladislav Kurenkov, Denis Tarasov, Sergey Kolesnikov
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2023-01-31
CRC-RL: A Novel Visual Feature Representation Architecture for Unsupervised Reinforcement Learning
Darshita Jain, Anima Majumder, Samrat Dutta, Swagat Kumar
arXiv_AI
arXiv_AI
Reconstruction
Unsupervised
Reinforcement_Learning
Represenation_Learning
Pose
Action
PDF
2023-01-31
Learning Vision-based Robotic Manipulation Tasks Sequentially in Offline Reinforcement Learning Settings
Sudhir Pratap Yadav, Rajendra Nagar, Suril V. Shah
arXiv_RO
arXiv_RO
Reinforcement_Learning
Knowledge
Pose
Action
Deep_Learning
PDF
2023-01-31
Scaling laws for single-agent reinforcement learning
Jacob Hilton, Jie Tang, John Schulman
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
Relation
PDF
2023-01-31
Optimal Transport Perturbations for Safe Reinforcement Learning with Robustness Guarantees
James Queeney, Erhan Can Ozcan, Ioannis Ch. Paschalidis, Christos G. Cassandras
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2023-01-31
Few-Shot Image-to-Semantics Translation for Policy Transfer in Reinforcement Learning
Rei Sato, Kazuto Fukuchi, Jun Sakuma, Youhei Akimoto
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Few-Shot
PDF
2023-01-30
V2N Service Scaling with Deep Reinforcement Learning
Cyril Shih-Huan Hsu, Jorge Martín-Pérez, Chrysa Papagianni, Paola Grosso
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2023-01-30
Incorporating Recurrent Reinforcement Learning into Model Predictive Control for Adaptive Control in Autonomous Driving
Yuan Zhang, Joschka Boedecker, Chuxuan Li, Guyue Zhou
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Attention
Autonomous
PDF
2023-01-30
Online Learning Based Mobile Robot Controller Adaptation for Slip Reduction
Huidong Gao, Rui Zhou, Masayoshi Tomizuka, Zhuo Xu
arXiv_RO
arXiv_RO
Tracking
Reinforcement_Learning
Pose
PDF
2023-01-30
Learning Coordination Policies over Heterogeneous Graphs for Human-Robot Teams via Recurrent Neural Schedule Propagation
Batuhan Altundas, Zheyuan Wang, Joshua Bishop, Matthew Gombolay
arXiv_AI
arXiv_AI
Reinforcement_Learning
RNN
Knowledge
Pose
Action
Deep_Learning
Attention
PDF
2023-01-30
Emergence of Maps in the Memories of Blind Navigation Agents
Erik Wijmans, Manolis Savva, Irfan Essa, Stefan Lee, Ari S. Morcos, Dhruv Batra
arXiv_AI
arXiv_AI
Reinforcement_Learning
RNN
Pose
Detection
GAN
PDF
2023-01-30
Optimal Decision Tree Policies for Markov Decision Processes
Daniël Vos, Sicco Verwer
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2023-01-30
Learning Control from Raw Position Measurements
Fabio Amadio, Alberto Dalla Libera, Daniel Nikovski, Ruggero Carli, Diego Romeres
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Activity
PDF
2023-01-30
Communication Drives the Emergence of Language Universals in Neural Agents: Evidence from the Word-order/Case-marking Trade-off
Yuchen Lian, Arianna Bisazza, Tessa Verhoef
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2023-01-30
Guided Deep Reinforcement Learning for Articulated Swimming Robots
Jiaheng Hu, Tony Dear
arXiv_RO
arXiv_RO
Reinforcement_Learning
PDF
2023-01-30
Winning Solution of Real Robot Challenge III
Qiang Wang, Robert McCarthy, David Cordova Bulens, Stephen J. Redmond
arXiv_RO
arXiv_RO
Reinforcement_Learning
Classification
Matching
PDF
2023-01-30
Hierarchical Imitation Learning with Vector Quantized Models
Kalle Kujanpää, Joni Pajarinen, Alexander Ilin
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2023-01-30
Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs
Guan-Ting Liu, En-Pei Hu, Pu-Jen Cheng, Hung-Yi Lee, Shao-Hua Sun
arXiv_AI
arXiv_AI
Embedding
Reinforcement_Learning
Pose
PDF
2023-01-30
Guiding Online Reinforcement Learning with Action-Free Offline Pretraining
Deyao Zhu, Yuhui Wang, Jürgen Schmidhuber, Mohamed Elhoseiny
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Knowledge
Action
PDF
2023-01-30
Designing an offline reinforcement learning objective from scratch
Gaon An, Junhyeok Lee, Xingdong Zuo, Norio Kosaka, Kyung-Min Kim, Hyun Oh Song
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
Contrastive_Learning
Action
PDF
2023-01-30
Transferring Multiple Policies to Hotstart Reinforcement Learning in an Air Compressor Management Problem
Hélène Plisnier, Denis Steckelmacher, Jeroen Willems, Bruno Depraetere, Ann Nowé
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
PDF
2023-01-30
Passivizing learned policies and learning passive policies with virtual energy tanks in robotics
Riccardo Zanella, Gianluca Palli, Stefano Stramigioli, Federico Califano
arXiv_RO
arXiv_RO
Reinforcement_Learning
PDF
2023-01-30
Automatic Intersection Management in Mixed Traffic Using Reinforcement Learning and Graph Neural Networks
Marvin Klimke, Benjamin Völz, Michael Buchholz
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Action
PDF
2023-01-30
Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents
Wenhao Xu, Xuefeng Gao, Xuedong He
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2023-01-30
Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning
James Queeney, Mouhacine Benosman
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2023-01-29
Sample Efficient Deep Reinforcement Learning via Local Planning
Dong Yin, Sridhar Thiagarajan, Nevena Lazic, Nived Rajaraman, Botao Hao, Csaba Szepesvari
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2023-01-28
Towards Learning Rubik's Cube with N-tuple-based Reinforcement Learning
Wolfgang Konen
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2023-01-28
Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling
Kolby Nottingham, Prithviraj Ammanabrolu, Alane Suhr, Yejin Choi, Hannaneh Hajishirzi, Sameer Singh, Roy Fox
arXiv_CL
arXiv_CL
Reinforcement_Learning
Sparse
Knowledge
Pose
Few-Shot
Language_Model
PDF
2023-01-28
STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning
Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Mengdi Wang, Furong Huang, Dinesh Manocha
arXiv_AI
arXiv_AI
Reinforcement_Learning
Sparse
Pose
PDF
2023-01-27
A Memory Efficient Deep Reinforcement Learning Approach For Snake Game Autonomous Agents
Md. Rafat Rahman Tushar, Shahnewaz Siddique
arXiv_AI
arXiv_AI
Reinforcement_Learning
CNN
Autonomous
PDF
2023-01-27
Outcome-directed Reinforcement Learning by Uncertainty & Temporal Distance-Aware Curriculum Goal Generation
Daesol Cho, Seungjae Lee, H. Jin Kim
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Pose
Quantitative
Matching
PDF
2023-01-27
Behaviour Discriminator: A Simple Data Filtering Method to Improve Offline Policy Learning
Qiang Wang, Robert McCarthy, David Cordova Bulens, Kevin McGuinness, Noel E. O'Connor, Francisco Roldan Sanchez, Stephen J. Redmond
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Action
PDF
2023-01-27
Single-Trajectory Distributionally Robust Reinforcement Learning
Zhipeng Liang, Xiaoteng Ma, Jose Blanchet, Jiheng Zhang, Zhengyuan Zhou
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2023-01-27
ARiADNE: A Reinforcement learning approach using Attention-based Deep Networks for Exploration
Yuhong Cao, Tianxiang Hou, Yizhuo Wang, Xian Yi, Guillaume Sartoretti
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Action
Attention
Autonomous
PDF
2023-01-27
SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning
Dongseok Shim, Seungjae Lee, H. Jin Kim
arXiv_AI
arXiv_AI
Reinforcement_Learning
3D
CNN
PDF
2023-01-27
Neural Episodic Control with State Abstraction
Zhuo Li, Derui Zhu, Yujing Hu, Xiaofei Xie, Lei Ma, Yan Zheng, Yan Song, Yingfeng Chen, Jianjun Zhao
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2023-01-27
Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence
Lingwei Zhu, Zheng Chen, Takamitsu Matsubara, Martha White
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Regularization
Pose
PDF
2023-01-26
Principled Reinforcement Learning with Human Feedback from Pairwise or $K$-wise Comparisons
Banghua Zhu, Jiantao Jiao, Michael I. Jordan
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2023-01-26
Deep Laplacian-based Options for Temporally-Extended Exploration
Martin Klissarov, Marlos C. Machado
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2023-01-26
Double Deep Reinforcement Learning Techniques for Low Dimensional Sensing Mapless Navigation of Terrestrial Mobile Robots
Linda Dotto de Moraes, Victor Augusto Kich, Alisson Henrique Kolling, Jair Augusto Bottega, Raul Steinmetz, Emerson Cassiano da Silva, Ricardo Bedin Grando, Anselmo Rafael Cuckla, Daniel Fernando Tello Gamarra
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2023-01-26
Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout
Takuya Hiraoka, Takashi Onishi, Yoshimasa Tsuruoka
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2023-01-26
Learning from Multiple Independent Advisors in Multi-agent Reinforcement Learning
Sriram Ganapathi Subramanian, Matthew E. Taylor, Kate Larson, Mark Crowley
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Action
PDF
2023-01-26
Privacy-Preserving Joint Edge Association and Power Optimization for the Internet of Vehicles via Federated Multi-Agent Reinforcement Learning
Yan Lin, Jinming Bao, Yijin Zhang, Jun Li, Feng Shu, Lajos Hanzo
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
Face
PDF
2023-01-26
Multi-Agent congestion cost minimization with linear function approximation
Prashant Trivedi, Nandyala Hemachandra
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2023-01-26
Efficient Trust Region-Based Safe Reinforcement Learning with Low-Bias Distributional Actor-Critic
Dohyeong Kim, Kyungjae Lee, Songhwai Oh
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2023-01-26
Predicting Parameters for Modeling Traffic Participants
Ahmadreza Moradipari, Sangjae Bae, Mahnoosh Alizadeh, Ehsan Moradi Pari, David Isele
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Autonomous
Prediction
PDF
2023-01-26
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
Mingqi Yuan, Bo Li, Xin Jin, Wenjun Zeng
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2023-01-25
DreamWaQ: Learning Robust Quadrupedal Locomotion With Implicit Terrain Imagination via Deep Reinforcement Learning
I Made Aswin Nahrendra, Byeongho Yu, Hyun Myung
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
PDF
2023-01-25
An Incremental Inverse Reinforcement Learning Approach for Motion Planning with Human Preferences
Armin Avaei, Linda van der Spaa, Luka Peternel, Jens Kober
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
PDF
2023-01-24
AutoCost: Evolving Intrinsic Cost for Zero-violation Reinforcement Learning
Tairan He, Weiye Zhao, Changliu Liu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2023-01-24
PushWorld: A benchmark for manipulation planning with tools and movable obstacles
Ken Kansky, Skanda Vaidyanath, Scott Swingle, Xinghua Lou, Miguel Lazaro-Gredilla, Dileep George
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2023-01-24
NeSIG: A Neuro-Symbolic Method for Learning to Generate Planning Problems
Carlos Núñez-Molina, Pablo Mesejo, Juan Fernández-Olivares
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Pose
PDF
2023-01-24
Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning
Safa Alver, Doina Precup
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Deep_Learning
PDF
2023-01-24
Story Shaping: Teaching Agents Human-like Behavior with Stories
Xiangyu Peng, Christopher Cui, Wei Zhou, Renee Jia, Mark Riedl
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Knowledge_Graph
Pose
Action
PDF
2023-01-24
Large Language Models as Fiduciaries: A Case Study Toward Robustly Communicating With Artificial Intelligence Through Legal Standards
John J. Nay
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
Relation
Autonomous
Language_Model
PDF
2023-01-24
Intrinsic Motivation in Model-based Reinforcement Learning: A Brief Review
Artem Latyshev, Aleksandr I. Panov
arXiv_AI
arXiv_AI
Reinforcement_Learning
Review
Pose
Autonomous
PDF
2023-01-24
ASQ-IT: Interactive Explanations for Reinforcement-Learning Agents
Yotam Amitai, Guy Avni, Ofra Amir
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Face
PDF
2023-01-24
Effective Baselines for Multiple Object Rearrangement Planning in Partially Observable Mapped Environments
Engin Tekin, Elaheh Barati, Nitin Kamra, Ruta Desai
arXiv_AI
arXiv_AI
Recognition
Reinforcement_Learning
PDF
2023-01-24
SMART: Self-supervised Multi-task pretrAining with contRol Transformers
Yanchao Sun, Shuang Ma, Ratnesh Madaan, Rogerio Bonatti, Furong Huang, Ashish Kapoor
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Self-Supervised
Pose
Action
PDF
2023-01-24
Constrained Reinforcement Learning for Dexterous Manipulation
Abhineet Jain, Jack Kolb, Harish Ravichandar
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
Action
PDF
2023-01-23
A deep reinforcement learning approach to assess the low-altitude airspace capacity for urban air mobility
Asal Mehditabrizi, Mahdi Samadzad, Sina Sabzekar
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Autonomous
PDF
2023-01-23
On The Convergence Of Policy Iteration-Based Reinforcement Learning With Monte Carlo Policy Evaluation
Anna Winnicki, R. Srikant
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2023-01-23
Two-Stage Learning For the Flexible Job Shop Scheduling Problem
Wenbo Chen, Reem Khir, Pascal Van Hentenryck
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
Deep_Learning
Prediction
PDF
2023-01-23
Asymptotic Convergence and Performance of Multi-Agent Q-Learning Dynamics
Aamal Abbas Hussain, Francesco Belardinelli, Georgios Piliouras
arXiv_AI
arXiv_AI
Reinforcement_Learning
Autonomous
PDF
2023-01-23
Learning to View: Decision Transformers for Active Object Detection
Wenhao Ding, Nathalie Majcherczyk, Mohit Deshpande, Xuewei Qi, Ding Zhao, Rajasimman Madhivanan, Arnie Sen
arXiv_CV
arXiv_CV
Transformer
Reinforcement_Learning
Pose
Detection
Object_Detection
PDF
2023-01-22
Deep Reinforcement Learning for Concentric Tube Robot Path Planning
Keshav Iyengar, Sarah Spurgeon, Danail Stoyanov
arXiv_RO
arXiv_RO
Tracking
Reinforcement_Learning
Pose
Deep_Learning
PDF
2023-01-20
Neural Architecture Search: Insights from 1000 Papers
Colin White, Mahmoud Safari, Rhea Sukthanker, Binxin Ru, Thomas Elsken, Arber Zela, Debadeepta Dey, Frank Hutter
arXiv_AI
arXiv_AI
NAS
Recognition
Reinforcement_Learning
Speech
Survey
Deep_Learning
GAN
Speech_Recognition
PDF
2023-01-20
AccDecoder: Accelerated Decoding for Neural-enhanced Video Analytics
Tingting Yuan, Liang Mi, Weijun Wang, Haipeng Dai, Xiaoming Fu
arXiv_CV
arXiv_CV
Surveillance
Reinforcement_Learning
Super_Resolution
Relation
Inference
PDF
2023-01-20
Modeling Moral Choices in Social Dilemmas with Multi-Agent Reinforcement Learning
Elizaveta Tennant, Stephen Hailes, Mirco Musolesi
arXiv_AI
arXiv_AI
Embedding
Reinforcement_Learning
Pose
Action
PDF
2023-01-20
Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Haoxuan Pan (1 and 2), Deheng Ye (2), Xiaoming Duan (1), Qiang Fu (2), Wei Yang (2), Jianping He (1), Mingfei Sun (3) ((1) Shanghai Jiaotong University, (2) Tencent Inc, (3) The University of Manchester)
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Regularization
PDF
2023-01-19
Investigating the Impact of Direct Punishment on the Emergence of Cooperation in Multi-Agent Reinforcement Learning Systems
Nayana Dasgupta, Mirco Musolesi
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2023-01-19
Multi-Agent Interplay in a Competitive Survival Environment
Andrea Fanti
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2023-01-19
Effective Diversity in Unsupervised Environment Design
Wenjun Li, Pradeep Varakantham, Dexun Li
arXiv_AI
arXiv_AI
Unsupervised
Reinforcement_Learning
Pose
PDF
2023-01-19
Remote patient monitoring using artificial intelligence: Current state, applications, and challenges
Thanveer Shaik, Xiaohui Tao, Niall Higgins, Lin Li, Raj Gururajan, Xujuan Zhou, U. Rajendra Acharya
arXiv_AI
arXiv_AI
Reinforcement_Learning
Review
Classification
Activity
PDF
2023-01-18
Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement Prioritization
Lucas N. Alegre, Ana L. C. Bazzan, Diederik M. Roijers, Ann Nowé, Bruno C. da Silva
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2023-01-18
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team, Jakob Bauer, Kate Baumli, Satinder Baveja, Feryal Behbahani, Avishkar Bhoopchand, Nathalie Bradley-Schmieg, Michael Chang, Natalie Clay, Adrian Collister, Vibhavari Dasagi, Lucy Gonzalez, Karol Gregor, Edward Hughes, Sheleem Kashem, Maria Loks-Thompson, Hannah Openshaw, Jack Parker-Holder, Shreya Pathak, Nicolas Perez-Nieves, Nemanja Rakicevic, Tim Rocktäschel, Yannick Schroecker, Jakub Sygnowski, Karl Tuyls, Sarah York, Alexander Zacherl, Lei Zhang
arXiv_AI
arXiv_AI
Reinforcement_Learning
3D
Knowledge
Self-Supervised
Attention
PDF
2023-01-18
Autonomous Slalom Maneuver Based on Expert Drivers' Behavior Using Convolutional Neural Network
Shafagh A. Pashaki, Ali Nahvi, Ahmad Ahmadi, Sajad Tavakoli, Shahin Naeemi, Salar H. Shamchi
arXiv_RO
arXiv_RO
Reinforcement_Learning
CNN
Autonomous
PDF
2023-01-18
PIRLNav: Pretraining with Imitation and RL Finetuning for ObjectNav
Ram Ramrakhya, Dhruv Batra, Erik Wijmans, Abhishek Das
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2023-01-18
Multi-compartment Neuron and Population Encoding improved Spiking Neural Network for Deep Distributional Reinforcement Learning
Yinqian Sun, Yi Zeng, Feifei Zhao, Zhuoya Zhao
arXiv_AI
arXiv_AI
Embedding
Reinforcement_Learning
Pose
Action
PDF
2023-01-17
Heterogeneous Multi-Robot Reinforcement Learning
Matteo Bettini, Ajay Shankar, Amanda Prorok
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2023-01-17
Consciousness is learning: predictive processing systems that learn by binding may perceive themselves as conscious
V.A. Aksyuk
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Attention
Inference
Prediction
PDF
2023-01-17
Adversarial Robust Deep Reinforcement Learning Requires Redefining Robustness
Ezgi Korkmaz
arXiv_AI
arXiv_AI
Reinforcement_Learning
Adversarial
Action
PDF
2023-01-17
The SwaNNFlight System: On-the-Fly Sim-to-Real Adaptation via Anchored Learning
Bassel El Mabsout, Shahin Roozkhosh, Siddharth Mysore, Kate Saenko, Renato Mancuso
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
PDF
2023-01-17
Learning to solve arithmetic problems with a virtual abacus
Flavio Petruzzellis, Ling Xuan Chen, Alberto Testolin
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Pose
Action
PDF
2023-01-17
Show me what you want: Inverse reinforcement learning to automatically design robot swarms by demonstration
Ilyes Gharbi, Jonas Kuckling, David Garzón Ramos, Mauro Birattari
arXiv_RO
arXiv_RO
Reinforcement_Learning
PDF
2023-01-17
A reinforcement learning path planning approach for range-only underwater target localization with autonomous vehicles
Ivan Masmitja, Mario Martin, Kakani Katija, Spartacus Gomariz, Joan Navarro
arXiv_RO
arXiv_RO
Tracking
Reinforcement_Learning
Autonomous
PDF
2023-01-17
DQNAS: Neural Architecture Search using Reinforcement Learning
Anshumaan Chauhan, Siddhartha Bhattacharyya, S. Vadivel
arXiv_AI
arXiv_AI
NAS
Tracking
Recognition
Reinforcement_Learning
Knowledge
Pose
Face
Classification
Detection
Face_Recognition
CNN
PDF
2023-01-16
DRL-VO: Learning to Navigate Through Crowded Dynamic Scenes Using Velocity Obstacles
Zhanteng Xie, Philip Dames
arXiv_RO
arXiv_RO
Reinforcement_Learning
3D
Pose
Autonomous
PDF
2023-01-16
HiFlash: Communication-Efficient Hierarchical Federated Learning with Adaptive Staleness Control and Heterogeneity-aware Client-Edge Association
Qiong Wu, Xu Chen, Tao Ouyang, Zhi Zhou, Xiaoxi Zhang, Shusen Yang, Junshan Zhang
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2023-01-16
Neuro-Symbolic World Models for Adapting to Open World Novelty
Jonathan Balloch, Zhiyu Lin, Robert Wright, Xiangyu Peng, Mustafa Hussain, Aarun Srinivas, Julia Kim, Mark O. Riedl
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2023-01-15
Modeling Human Cognition with a Hybrid Deep Reinforcement Learning Agent
Songlin Xu, Xinyu Zhang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Pose
Quantitative
PDF
2023-01-14
Deep-Reinforcement-Learning-based Path Planning for Industrial Robots using Distance Sensors as Observation
Teham Bhuiyan, Linh Kästner, Yifan Hu, Benno Kutschank, Jens Lambrecht
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2023-01-14
Semantic and Effective Communication for Remote Control Tasks with Dynamic Feature Compression
Pietro Talli, Francesco Pase, Federico Chiariotti, Andrea Zanella, Michele Zorzi
arXiv_AI
arXiv_AI
Quantization
Reinforcement_Learning
Pose
PDF
2023-01-14
PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning in Financial Markets
Shuo Sun, Molei Qin, Xinrun Wang, Bo An
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Attention
PDF
2023-01-14
First Three Years of the International Verification of Neural Networks Competition
Christopher Brix, Mark Niklas Müller, Stanley Bak, Taylor T. Johnson, Changliu Liu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Classification
Image_Classification
Autonomous
PDF
2023-01-14
RMM: Reinforced Memory Management for Class-Incremental Learning
Yaoyao Liu, Bernt Schiele, Qianru Sun
arXiv_CV
arXiv_CV
Reinforcement_Learning
Pose
Action
PDF
2023-01-13
Mean-Field Control based Approximation of Multi-Agent Reinforcement Learning in Presence of a Non-decomposable Shared Global State
Washim Uddin Mondal, Vaneet Aggarwal, Satish V. Ukkusuri
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
Relation
PDF
2023-01-13
Decentralized model-free reinforcement learning in stochastic games with average-reward objective
Romain Cravic, Nicolas Gast, Bruno Gaujal
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2023-01-13
Multi-Target Landmark Detection with Incomplete Images via Reinforcement Learning and Shape Prior
Kaiwen Wan, Lei Li, Dengqiang Jia, Shangqi Gao, Wei Qian, Yingzhi Wu, Huandong Lin, Xiongzheng Mu, Xin Gao, Sijia Wang, Fuping Wu, Xiahai Zhuang
arXiv_CV
arXiv_CV
Reinforcement_Learning
3D
Knowledge
Pose
Detection
Medical
PDF
2023-01-13
A Constrained-Optimization Approach to the Execution of Prioritized Stacks of Learned Multi-Robot Tasks
Gennaro Notomista
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
Pose
PDF
2023-01-13
TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Problems
Matteo Gallici, Mario Martin, Ivan Masmitja
arXiv_AI
arXiv_AI
Transformer
Transfer_Learning
Reinforcement_Learning
Zero-Shot
Action
PDF
2023-01-12
Language-Informed Transfer Learning for Embodied Household Activities
Yuqian Jiang, Qiaozi Gao, Govind Thattai, Gaurav Sukhatme
arXiv_AI
arXiv_AI
Embedding
Transfer_Learning
Reinforcement_Learning
Pose
Action
Activity
Language_Model
PDF
2023-01-12
Learning to Control and Coordinate Hybrid Traffic Through Robot Vehicles at Complex and Unsignalized Intersections
Dawei Wang, Weizi Li, Lei Zhu, Jia Pan
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
PDF
2023-01-12
Asynchronous training of quantum reinforcement learning
Samuel Yen-Chi Chen
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2023-01-12
Approximate Information States for Worst-Case Control and Learning in Uncertain Systems
Aditya Dave, Nishanth Venkatesh, Andreas A. Malikopoulos
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Action
PDF
2023-01-12
Safe Policy Improvement for POMDPs via Finite-State Controllers
Thiago D. Simão, Marnix Suilen, Nils Jansen
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2023-01-12
Predictive World Models from Real-World Partial Observations
Robin Karlsson, Alexander Carballo, Keisuke Fujii, Kento Ohtani, Kazuya Takeda
arXiv_CV
arXiv_CV
Reinforcement_Learning
Prediction
PDF
2023-01-11
Switchable Lightweight Anti-symmetric Processing with CNN to Reduce Sample Size and Speed up Learning -- Application in Gomoku Reinforcement Learning
Chi-Hang Suen (City, University of London)
arXiv_AI
arXiv_AI
Transfer_Learning
Reinforcement_Learning
Pose
CNN
PDF
2023-01-11
MotorFactory: A Blender Add-on for Large Dataset Generation of Small Electric Motors
Chengzhi Wu, Kanran Zhou, Jan-Philipp Kaiser, Norbert Mitschke, Jan-Felix Klein, Julius Pfrommer, Jürgen Beyerer, Gisela Lanza, Michael Heizmann, Kai Furmans
arXiv_CV
arXiv_CV
Segmentation
Point_Cloud
Reinforcement_Learning
3D
Pose
Classification
Detection
Object_Detection
PDF
2023-01-11
SoK: Adversarial Machine Learning Attacks and Defences in Multi-Agent Reinforcement Learning
Maxwell Standen, Junae Kim, Claudia Szabo
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Adversarial
Pose
Survey
PDF
2023-01-11
Adversarial Online Multi-Task Reinforcement Learning
Quan Nguyen, Nishant A. Mehta
arXiv_AI
arXiv_AI
Reinforcement_Learning
Adversarial
PDF
2023-01-10
ORBIT: A Unified Simulation Framework for Interactive Robot Learning Environments
Mayank Mittal, Calvin Yu, Qinxi Yu, Jingzhou Liu, Nikita Rudin, David Hoeller, Jia Lin Yuan, Pooria Poorsarvi Tehrani, Ritvik Singh, Yunrong Guo, Hammad Mazhar, Ajay Mandlekar, Buck Babich, Gavriel State, Marco Hutter, Animesh Garg
arXiv_AI
arXiv_AI
Reinforcement_Learning
Represenation_Learning
Action
GAN
PDF
2023-01-10
schlably: A Python Framework for Deep Reinforcement Learning Based Scheduling Experiments
Constantin Waubert de Puiseau, Jannik Peters, Christian Dörpelkus, Tobias Meisen
arXiv_AI
arXiv_AI
Reinforcement_Learning
Attention
PDF
2023-01-10
Mastering Diverse Domains through World Models
Danijar Hafner, Jurgis Pasukonis, Jimmy Ba, Timothy Lillicrap
arXiv_AI
arXiv_AI
Reinforcement_Learning
3D
Knowledge
Action
PDF
2023-01-10
Deep Reinforcement Learning for Autonomous Ground Vehicle Exploration Without A-Priori Maps
Shathushan Sivashangaran, Azim Eskandarian
arXiv_RO
arXiv_RO
Reinforcement_Learning
Knowledge
Pose
Action
Autonomous
PDF
2023-01-10
Learning to Perceive in Deep Model-Free Reinforcement Learning
Gonçalo Querido, Alberto Sardinha, Francisco Melo
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
RNN
Pose
Action
Attention
PDF
2023-01-09
Locomotion-Action-Manipulation: Synthesizing Human-Scene Interactions in Complex 3D Environments
Jiye Lee, Hanbyul Joo
arXiv_CV
arXiv_CV
Reinforcement_Learning
3D
Action
Quantitative
Matching
PDF
2023-01-09
Learning-based Design and Control for Quadrupedal Robots with Parallel-Elastic Actuators
Filip Bjelonic, Joonho Lee, Philip Arm, Dhionis Sako, Davide Tateo, Jan Peters, Marco Hutter
arXiv_RO
arXiv_RO
Tracking
Reinforcement_Learning
Optimization
PDF
2023-01-09
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Chao Yu, Xinyi Yang, Jiaxuan Gao, Jiayu Chen, Yunfei Li, Jijia Liu, Yunfei Xiang, Ruixin Huang, Huazhong Yang, Yi Wu, Yu Wang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2023-01-09
Tuning Path Tracking Controllers for Autonomous Cars Using Reinforcement Learning
Ana Carrasco, João Sequeira
arXiv_AI
arXiv_AI
Tracking
Reinforcement_Learning
Pose
Autonomous
PDF
2023-01-09
Enabling AI-Generated Content Services in Wireless Edge Networks
Hongyang Du, Zonghang Li, Dusit Niyato, Jiawen Kang, Zehui Xiong, Xuemin (Sherman) Shen, Dong In Kim
arXiv_AI
arXiv_AI
Reinforcement_Learning
Review
Pose
Action
Relation
PDF
2023-01-08
A Survey on Transformers in Reinforcement Learning
Wenzhe Li, Hao Luo, Zichuan Lin, Chongjie Zhang, Zongqing Lu, Deheng Ye
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Review
Face
Survey
PDF
2023-01-08
Learning Symbolic Representations for Reinforcement Learning of Non-Markovian Behavior
Phillip J.K. Christoffersen, Andrew C. Li, Rodrigo Toro Icarte, Sheila A. McIlraith
arXiv_AI
arXiv_AI
Reinforcement_Learning
Sparse
Knowledge
Action
PDF
2023-01-07
Markov Chain Concentration with an Application in Reinforcement Learning
Debangshu Banerjee
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2023-01-07
RLAS-BIABC: A Reinforcement Learning-Based Answer Selection Using the BERT Model Boosted by an Improved ABC Algorithm
Hamid Gharagozlou, Javad Mohammadzadeh, Azam Bastanfard, Saeed Shiry Ghidary
arXiv_AI
arXiv_AI
Transformer
Embedding
Reinforcement_Learning
Bert
RNN
Pose
Classification
Attention
QA
PDF
2023-01-07
LAGA: A Learning Adaptive Genetic Algorithm for Earth Electromagnetic Satellite Scheduling Problem
Yanjie Song, Jie Chun, Qinwen Yang, Junwei Ou, Lining Xing, Yingwu Chen
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Detection
Autonomous
PDF
2023-01-06
Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response of Residential Loads
Vincent Mai, Philippe Maisonneuve, Tianyu Zhang, Hadi Nekoei, Liam Paull, Antoine Lesage-Landry
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
Action
PDF
2023-01-06
IMKGA-SM: Interpretable Multimodal Knowledge Graph Answer Prediction via Sequence Modeling
Yilin Wen, Biao Luo, Yuqian Zhao
arXiv_AI
arXiv_AI
Recognition
Reinforcement_Learning
OCR
Optimization
Sparse
Optical_Character
Knowledge
Knowledge_Graph
Pose
Inference
Prediction
PDF
2023-01-06
Provable Reset-free Reinforcement Learning by No-Regret Reduction
Hoai-An Nguyen, Ching-An Cheng
arXiv_RO
arXiv_RO
Reinforcement_Learning
Knowledge
Pose
PDF
2023-01-05
Extreme Q-Learning: MaxEnt RL without Entropy
Divyansh Garg, Joey Hejna, Matthieu Geist, Stefano Ermon
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2023-01-05
DRL-GAN: A Hybrid Approach for Binary and Multiclass Network Intrusion Detection
Caroline Strickland, Chandrika Saha, Muhammad Zakar, Sareh Nejad, Noshin Tasnim, Daniel Lizotte, Anwar Haque
arXiv_AI
arXiv_AI
Reinforcement_Learning
Adversarial
Pose
Face
Detection
GAN
PDF
2023-01-05
Scalable Communication for Multi-Agent Reinforcement Learning via Transformer-Based Email Mechanism
Xudong Guo, Daming Shi, Wenhui Fan
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Pose
PDF
2023-01-05
Reinforcement Learning-Based Air Traffic Deconfliction
Denis Osipychev, Dragos Margineantu, Girish Chowdhary
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
Face
Action
PDF
2023-01-04
Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations
Sagnik Majumder, Hao Jiang, Pierre Moulon, Ethan Henderson, Paul Calamia, Kristen Grauman, Vamsi Krishna Ithapu
arXiv_CV
arXiv_CV
Reinforcement_Learning
3D
PDF
2023-01-04
Emergent collective intelligence from massive-agent cooperation and competition
Hanmo Chen, Stone Tao, Jiaxin Chen, Weihan Shen, Xihui Li, Sikai Cheng, Xiaolong Zhu, Xiu Li
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
GAN
PDF
2023-01-04
Robofriend: An Adpative Storytelling Robotic Teddy Bear - Technical Report
Ido Glanz, Matan Weksler, Erez Karpas, Tzipi Horowitz-Kraus
arXiv_AI
arXiv_AI
Reinforcement_Learning
Attention
PDF
2023-01-04
Quantum Multi-Agent Actor-Critic Neural Networks for Internet-Connected Multi-Robot Coordination in Smart Factory Management
Won Joon Yun, Jae Pyoung Kim, Soyi Jung, Jae-Hyun Kim, Joongheon Kim
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Attention
Autonomous
PDF
2023-01-03
A Succinct Summary of Reinforcement Learning
Sanjeevan Ahilan
arXiv_AI
arXiv_AI
Reinforcement_Learning
Review
PDF
2023-01-03
Contextual Conservative Q-Learning for Offline Reinforcement Learning
Ke Jiang, Jiayu Yao, Xiaoyang Tan
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Attention
PDF
2023-01-03
e-Inu: Simulating A Quadruped Robot With Emotional Sentience
Abhiruph Chakravarty, Jatin Karthik Tripathy, Sibi Chakkaravarthy S, Aswani Kumar Cherukuri, S. Anitha, Firuz Kamalov, Annapurna Jonnalagadda
arXiv_RO
arXiv_RO
Reinforcement_Learning
Speech
Emotion
Detection
PDF
2023-01-03
Temporal Difference Learning with Compressed Updates: Error-Feedback meets Reinforcement Learning
Aritra Mitra, George J. Pappas, Hamed Hassani
arXiv_AI
arXiv_AI
Quantization
Gradient_Descent
Reinforcement_Learning
Optimization
PDF
2023-01-03
Distributed Machine Learning for UAV Swarms: Computing, Sensing, and Semantics
Yahao Ding, Zhaohui Yang, Quoc-Viet Pham, Zhaoyang Zhang, Mohammad Shikh-Bahaei
arXiv_AI
arXiv_AI
Surveillance
Tracking
Reinforcement_Learning
Face
Survey
Autonomous
Inference
PDF
2023-01-03
Efficient Robustness Assessment via Adversarial Spatial-Temporal Focus on Videos
Wei Xingxing, Wang Songping, Yan Huanqian
arXiv_CV
arXiv_CV
Recognition
Reinforcement_Learning
Adversarial
Pose
Action_Recognition
Action
Prediction
PDF
2023-01-03
Safe Reinforcement Learning for an Energy-Efficient Driver Assistance System
Habtamu Hailemichael, Beshah Ayalew, Lindsey Kerbel, Andrej Ivanco, Keith Loiselle
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
Pose
Action
PDF
2023-01-02
Deep reinforcement learning for irrigation scheduling using high-dimensional sensor feedback
Yuji Saikai, Allan Peake, Karine Chenu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Sparse
Pose
Action
PDF
2023-01-02
Large-Scale Traffic Signal Control by a Nash Deep Q-network Approach
Yuli.Zhang, Shangbo.Wang, Ruiyuan.Jiang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2023-01-02
Fairness Guaranteed and Auction-based x-haul and Cloud Resource Allocation in Multi-tenant O-RANs
Sourav Mondal, Marco Ruffini
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Face
PDF
2023-01-02
A RL-based Policy Optimization Method Guided by Adaptive Stability Certification
Shengjie Wang, Fengbo Lan, Xiang Zheng, Yuxue Cao, Oluwatosin Oseni, Haotian Xu, Yang Gao, Tao Zhang
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
Pose
PDF
2023-01-02
Deep Reinforcement Learning for Asset Allocation: Reward Clipping
Jiwon Kim, Moon-Ju Kang, KangHun Lee, HyungJun Moon, Bo-Kwan Jeon
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
PDF
2023-01-01
Human-in-the-loop Embodied Intelligence with Interactive Simulation Environment for Surgical Robot Learning
Yonghao Long, Wang Wei, Tao Huang, Yuehao Wang, Qi Dou
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2023-01-01
Optimization of Image Transmission in a Cooperative Semantic Communication Networks
Wenjing Zhang, Yining Wang, Mingzhe Chen, Tao Luo, Dusit Niyato
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
Action
Relation
PDF
2023-01-01
Goal-guided Transformer-enabled Reinforcement Learning for Efficient Autonomous Navigation
Wenhui Huang, Yanxin Zhou, Xiangkun He, Chen Lv
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Pose
Autonomous
PDF
2023-01-01
Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits
Ruibo Liu, Chenyan Jia, Ge Zhang, Ziyu Zhuang, Tony X Liu, Soroush Vosoughi
arXiv_AI
arXiv_AI
Transfer_Learning
Reinforcement_Learning
Few-Shot
Language_Model
PDF
2022-12-31
MERLIN: Multi-agent offline and transfer learning for occupant-centric energy flexible operation of grid-interactive communities using smart meter data and CityLearn
Kingsley Nweye, Siva Sankaranarayanan, Zoltan Nagy
arXiv_AI
arXiv_AI
Transfer_Learning
Reinforcement_Learning
PDF
2022-12-31
New Challenges in Reinforcement Learning: A Survey of Security and Privacy
Yunjiao Lei, Dayong Ye, Sheng Shen, Yulei Sui, Tianqing Zhu, Wanlei Zhou
arXiv_AI
arXiv_AI
Reinforcement_Learning
Review
Survey
Action
Autonomous
PDF
2022-12-31
Self-Activating Neural Ensembles for Continual Reinforcement Learning
Sam Powers, Eliot Xing, Abhinav Gupta
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
PDF
2022-12-31
Cost-Effective Two-Stage Network Slicing for Edge-Cloud Orchestrated Vehicular Networks
Wen Wu, Kaige Qu, Peng Yang, Ning Zhang, Xuemin (Sherman) Shen, Weihua Zhuang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-12-31
Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning
Wen Wu, Peng Yang, Weiting Zhang, Conghao Zhou, Xuemin (Sherman) Shen
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
Inference
PDF
2022-12-31
Situation-Aware Deep Reinforcement Learning for Autonomous Nonlinear Mobility Control in Cyber-Physical Loitering Munition Systems
Hyunsoo Lee, Soohyun Park, Won Joon Yun, Soyi Jung, Joongheon Kim
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Drone
Autonomous
PDF
2022-12-30
Task-Guided IRL in POMDPs that Scales
Franck Djeumou, Christian Ellis, Murat Cubuktepe, Craig Lennon, Ufuk Topcu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-12-30
Learning from Guided Play: Improving Exploration for Adversarial Imitation Learning with Simple Auxiliary Tasks
Trevor Ablett, Bryan Chan, Jonathan Kelly
arXiv_AI
arXiv_AI
Reinforcement_Learning
Adversarial
Action
PDF
2022-12-30
Bayesian Learning for Dynamic Inference
Aolin Xu, Peng Guan
arXiv_AI
arXiv_AI
Reinforcement_Learning
Inference
PDF
2022-12-30
Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search
Wenqing Zheng, S P Sharan, Zhiwen Fan, Kevin Wang, Yihan Xi, Zhangyang Wang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Knowledge
Pose
Action
PDF
2022-12-30
Hybrid Deep Reinforcement Learning and Planning for Safe and Comfortable Automated Driving
Dikshant Gupta, Mathias Klusch
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Pose
Prediction
PDF
2022-12-30
Risk-Sensitive Policy with Distributional Reinforcement Learning
Thibaut Théate, Damien Ernst
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2022-12-30
Reinforcement Learning with Success Induced Task Prioritization
Maria Nesterova, Alexey Skrynnik, Aleksandr Panov
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-12-30
RL and Fingerprinting to Select Moving Target Defense Mechanisms for Zero-day Attacks in IoT
Alberto Huertas Celdrán, Pedro Miguel Sánchez Sánchez, Jan von der Assen, Timo Schenk, Gérôme Bovet, Gregorio Martínez Pérez, Burkhard Stiller
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Face
PDF
2022-12-30
Transformer in Transformer as Backbone for Deep Reinforcement Learning
Hangyu Mao, Rui Zhao, Hao Chen, Jianye Hao, Yiqun Chen, Dong Li, Junge Zhang, Zhen Xiao
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Optimization
RNN
Pose
Attention
PDF
2022-12-30
POMRL: No-Regret Learning-to-Plan with Increasing Horizons
Khimya Khetarpal, Claire Vernade, Brendan O'Donoghue, Satinder Singh, Tom Zahavy
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-12-29
On the Geometry of Reinforcement Learning in Continuous State and Action Spaces
Saket Tiwari, Omer Gottesman, George Konidaris
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-12-29
Visual CPG-RL: Learning Central Pattern Generators for Visually-Guided Quadruped Navigation
Guillaume Bellegarda, Auke Ijspeert
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-12-29
On Deep Recurrent Reinforcement Learning for Active Visual Tracking of Space Noncooperative Objects
Dong Zhou, Guanghui Sun, Zhao Zhang, Ligang Wu
arXiv_RO
arXiv_RO
Tracking
Reinforcement_Learning
Pose
Attention
Autonomous
PDF
2022-12-29
Tuning Synaptic Connections instead of Weights by Genetic Algorithm in Spiking Policy Network
Duzhen Zhang, Tielin Zhang, Shuncheng Jia, Qingyu Wang, Bo Xu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2022-12-29
Invariance to Quantile Selection in Distributional Continuous Control
Felix Grün, Muhammad Saif-ur-Rehman, Tobias Glasmachers, Ioannis Iossifidis
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2022-12-29
Backward Curriculum Reinforcement Learning
KyungMin Ko, Sajad Khodadadian, Siva Theja Maguluri
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-12-29
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu, Li Shen, Ya Zhang, Yixin Chen, Dacheng Tao
arXiv_AI
arXiv_AI
Transformer
Enhancement
Reinforcement_Learning
Optimization
Review
Survey
Action
Autonomous
PDF
2022-12-28
Improving a sequence-to-sequence nlp model using a reinforcement learning policy algorithm
Jabri Ismail, Aboulbichr Ahmed, El ouaazizi Aziza
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
QA
PDF
2022-12-28
Certifying Safety in Reinforcement Learning under Adversarial Perturbation Attacks
Junlin Wu, Hussein Sibai, Yevgeniy Vorobeychik
arXiv_AI
arXiv_AI
Reinforcement_Learning
Adversarial
Pose
Prediction
PDF
2022-12-28
Towards automating Codenames spymasters with deep reinforcement learning
Sherman Siu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
PDF
2022-12-28
On the Convergence of Discounted Policy Gradient Methods
Chris Nota
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-12-28
Towards Learning Abstractions via Reinforcement Learning
Erik Jergéus, Leo Karlsson Oinonen, Emil Carlsson, Moa Johansson
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2022-12-28
On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations
Tim G. J. Rudner, Cong Lu, Michael A. Osborne, Yarin Gal, Yee Whye Teh
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-12-28
Don't do it: Safer Reinforcement Learning With Rule-based Guidance
Ekaterina Nikonova, Cheng Xue, Jochen Renz
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Relation
PDF
2022-12-27
Data-driven control of COVID-19 in buildings: a reinforcement-learning approach
Ashkan Haji Hosseinloo, Saleh Nabi, Anette Hosoi, Munther A. Dahleh
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-12-27
Strangeness-driven Exploration in Multi-Agent Reinforcement Learning
Ju-Bong Kim, Ho-Bin Choi, Youn-Hee Han
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-12-27
Traceable Automatic Feature Transformation via Cascading Actor-Critic Agents
Meng Xiao, Dongjie Wang, Min Wu, Ziyue Qiao, Pengfei Wang, Kunpeng Liu, Yuanchun Zhou, Yanjie Fu
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-12-27
Learning Individual Policies in Large Multi-agent Systems through Local Variance Minimization
Tanvi Verma, Pradeep Varakantham
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-12-26
Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error
Bumgeun Park, Taeyoung Kim, Woohyeon Moon, Luiz Felipe Vecchietti, Dongsoo Har
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-12-26
Simultaneously Optimizing Perturbations and Positions for Black-box Adversarial Patch Attacks
Xingxing Wei, Ying Guo, Jie Yu, Bo Zhang
arXiv_CV
arXiv_CV
Recognition
Reinforcement_Learning
Adversarial
Pose
Face
Face_Recognition
PDF
2022-12-25
Novel Reinforcement Learning Algorithm for Suppressing Synchronization in Closed Loop Deep Brain Stimulators
Harsh Agarwal, Heena Rathore
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
Activity
PDF
2022-12-24
SHIRO: Soft Hierarchical Reinforcement Learning
Kandai Watanabe, Mathew Strong, Omer Eldar
arXiv_RO
arXiv_RO
Reinforcement_Learning
PDF
2022-12-24
Automated Gadget Discovery in Science
Lea M. Trenkwalder, Andrea López Incera, Hendrik Poulsen Nautrup, Fulvio Flamini, Hans J. Briegel
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-12-24
Deep Reinforcement Learning for Heat Pump Control
Tobias Rohrer, Lilli Frison, Lukas Kaupenjohann, Katrin Scharf, Elke Hergenrother
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2022-12-24
Structure-Enhanced DRL for Optimal Transmission Scheduling
Jiazheng Chen, Wanchun Liu, Daniel E. Quevedo, Saeed R. Khosravirad, Yonghui Li, Branka Vucetic
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-12-24
On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective
Ying Wen, Ziyu Wan, Ming Zhou, Shufang Hou, Zhe Cao, Chenyang Le, Jingxiao Chen, Zheng Tian, Weinan Zhang, Jun Wang
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Text_Generation
Caption
Autonomous
PDF
2022-12-23
Proximal Policy Optimization with Graph Neural Networks for Optimal Power Flow
Ángela López-Cardona, Guillermo Bernárdez, Pere Barlet-Ros, Albert Cabellos-Aparicio
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-12-23
Generalised agent for solving higher board states of tic tac toe using Reinforcement Learning
Bhavuk Kalra
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-12-22
A Learned Simulation Environment to Model Student Engagement and Retention in Automated Online Courses
N. Imstepf, S. Senn, A. Fortin, B. Russell, C. Horn
arXiv_AI
arXiv_AI
Reinforcement_Learning
Prediction
PDF
2022-12-22
Decoding surface codes with deep reinforcement learning and probabilistic policy reuse
Elisha Siddiqui Matekole, Esther Ye, Ramya Iyer, Samuel Yen-Chi Chen
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Pose
Face
PDF
2022-12-22
Towards Causal Credit Assignment
Mátyás Schubert
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-12-22
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
Aleksandar Krnjaic, Jonathan D. Thomas, Georgios Papoudakis, Lukas Schäfer, Peter Börsting, Stefano V. Albrecht
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-12-21
Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Yiren Lu, Justin Fu, George Tucker, Xinlei Pan, Eli Bronstein, Becca Roelofs, Benjamin Sapp, Brandyn White, Aleksandra Faust, Shimon Whiteson, Dragomir Anguelov, Sergey Levine
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Autonomous
PDF
2022-12-21
Knowledge-driven Scene Priors for Semantic Audio-Visual Embodied Navigation
Gyan Tatiya, Jonathan Francis, Luca Bondi, Ingrid Navarro, Eric Nyberg, Jivko Sinapov, Jean Oh
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
3D
Knowledge
Knowledge_Graph
Pose
Relation
PDF
2022-12-21
Deep Reinforcement Learning for Trajectory Path Planning and Distributed Inference in Resource-Constrained UAV Swarms
Marwan Dhuheir, Emna Baccour, Aiman Erbad, Sinan Sabeeh Al-Obaidi, Mounir Hamdi
arXiv_RO
arXiv_RO
Tracking
Reinforcement_Learning
Optimization
Inference
PDF
2022-12-21
Lifelong Reinforcement Learning with Modulating Masks
Eseoghene Ben-Iwhiwhu, Saptarshi Nath, Praveen K. Pilly, Soheil Kolouri, Andrea Soltoggio
arXiv_AI
arXiv_AI
Reinforcement_Learning
Sparse
Knowledge
Classification
PDF
2022-12-21
On Reinforcement Learning for the Game of 2048
Hung Guei
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-12-21
Cooperative Flight Control Using Visual-Attention -- Air-Guardian
Lianhao Yin, Tsun-Hsuan Wang, Makram Chahine, Tim Seyde, Mathias Lechner, Ramin Hasani, Daniela Rus
arXiv_AI
arXiv_AI
Reinforcement_Learning
Salient
Pose
Attention
Drone
Autonomous
PDF
2022-12-21
Critic-Guided Decoding for Controlled Text Generation
Minbeom Kim, Hwanhee Lee, Kang Min Yoo, Joonsuk Park, Hwaran Lee, Kyomin Jung
arXiv_CL
arXiv_CL
Reinforcement_Learning
Zero-Shot
Pose
Sentiment
Text_Generation
Language_Model
PDF
2022-12-21
A Memetic Algorithm with Reinforcement Learning for Sociotechnical Production Scheduling
Felix Grumbach, Nour Eldin Alaa Badr, Pascal Reusch, Sebastian Trojahn
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-12-21
Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization
Dongmin Hyun, Xiting Wang, Chanyoung Park, Xing Xie, Hwanjo Yu
arXiv_AI
arXiv_AI
Unsupervised
Reinforcement_Learning
Pose
Summarization
PDF
2022-12-21
Reward Bonuses with Gain Scheduling Inspired by Iterative Deepening Search
Taisuke Kobayashi
arXiv_AI
arXiv_AI
Reinforcement_Learning
Sparse
Pose
PDF
2022-12-21
The Internet of Senses: Building on Semantic Communications and Edge Intelligence
Roghayeh Joda, Medhat Elsayed, Hatem Abou-zeid, Ramy Atawia, Akram Bin Sediq, Gary Boudreau, Melike Erol-Kantarci, Lajos Hanzo
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-12-21
Neighboring state-based RL Exploration
Jeffery Cheng, Kevin Li, Justin Lin, Pedro Pachuca
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Survey
Action
PDF
2022-12-20
METEOR Guided Divergence for Video Captioning
Daniel Lukas Rothenpieler, Shahin Amiriparian
arXiv_CV
arXiv_CV
Transformer
Reinforcement_Learning
Video_Caption
Pose
Action
Attention
Caption
Activity
PDF
2022-12-20
Variational Quantum Soft Actor-Critic for Robotic Arm Control
Alberto Acuto, Paola Barillà, Ludovico Bozzolo, Matteo Conterno, Mattia Pavese, Antonio Policicchio
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-12-20
A survey on text generation using generative adversarial networks
Gustavo Henrique de Rosa, João Paulo Papa
arXiv_AI
arXiv_AI
Reinforcement_Learning
Review
Adversarial
Survey
GAN
Text_Generation
PDF
2022-12-20
Reinforced Clarification Question Generation with Defeasibility Rewards for Disambiguating Social and Moral Situations
Valentina Pyatkin, Jena D. Hwang, Vivek Srikumar, Ximing Lu, Liwei Jiang, Yejin Choi, Chandra Bhagavatula
arXiv_CL
arXiv_CL
Reinforcement_Learning
Salient
Action
PDF
2022-12-20
Adapting the Exploration Rate for Value-of-Information-Based Reinforcement Learning
Isaac J. Sledge, Jose C. Principe
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Action
PDF
2022-12-20
AdverSAR: Adversarial Search and Rescue via Multi-Agent Reinforcement Learning
Aowabin Rahman, Arnab Bhattacharya, Thiagarajan Ramachandran, Sayak Mukherjee, Himanshu Sharma, Ted Fujimoto, Samrat Chatterjee
arXiv_RO
arXiv_RO
Reinforcement_Learning
Knowledge
Adversarial
Pose
Action
Autonomous
PDF
2022-12-20
An AI Dungeon Master's Guide: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons
Pei Zhou, Andrew Zhu, Jennifer Hu, Jay Pujara, Xiang Ren, Chris Callison-Burch, Yejin Choi, Prithviraj Ammanabrolu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Prediction
PDF
2022-12-20
Bandit approach to conflict-free multi-agent Q-learning in view of photonic implementation
Hiroaki Shinkawa, Nicolas Chauvet, André Röhm, Takatomo Mihana, Ryoichi Horisaki, Guillaume Bachelier, Makoto Naruse
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-12-19
Inverse Reinforcement Learning for Text Summarization
Yu Fu, Deyi Xiong, Yue Dong
arXiv_CL
arXiv_CL
Reinforcement_Learning
Optimization
Summarization
PDF
2022-12-19
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance
Kelvin Xu, Zheyuan Hu, Ria Doshi, Aaron Rovinsky, Vikash Kumar, Abhishek Gupta, Sergey Levine
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
Autonomous
PDF
2022-12-19
Human-in-the-loop Abstractive Dialogue Summarization
Jiaao Chen, Mohan Dodda, Diyi Yang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Salient
Pose
Attention
Summarization
PDF
2022-12-19
Optimizing Prompts for Text-to-Image Generation
Yaru Hao, Zewen Chi, Li Dong, Furu Wei
arXiv_CV
arXiv_CV
Reinforcement_Learning
Pose
Language_Model
PDF
2022-12-19
Learning Latent Representations to Co-Adapt to Humans
Sagar Parekh, Dylan P. Losey
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
Prediction
PDF
2022-12-19
Near-optimal Policy Identification in Active Reinforcement Learning
Xiang Li, Viraj Mehta, Johannes Kirschner, Ian Char, Willie Neiswanger, Jeff Schneider, Andreas Krause, Ilija Bogunovic
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-12-19
Quantum policy gradient algorithms
Sofiene Jerbi, Arjan Cornelissen, Māris Ozols, Vedran Dunjko
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2022-12-18
Empirical Analysis of AI-based Energy Management in Electric Vehicles: A Case Study on Reinforcement Learning
Jincheng Hu, Yang Lin, Jihao Li, Zhuoran Hou, Dezong Zhao, Quan Zhou, Jingjing Jiang, Yuanjian Zhang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
PDF
2022-12-18
Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents
Minghuan Liu, Zhengbang Zhu, Menghui Zhu, Yuzheng Zhuang, Weinan Zhang, Jianye Hao
arXiv_AI
arXiv_AI
Reinforcement_Learning
Zero-Shot
Optimization
Knowledge
Pose
Action
Few-Shot
PDF
2022-12-18
Neural Coreference Resolution based on Reinforcement Learning
Yu Wang, Hongxia Jin
arXiv_CL
arXiv_CL
Reinforcement_Learning
Bert
Pose
Detection
PDF
2022-12-18
Risk-Sensitive Reinforcement Learning with Exponential Criteria
Erfaun Noorani, Christos Mavridis, John Baras
arXiv_AI
arXiv_AI
Reinforcement_Learning
Regularization
Pose
PDF
2022-12-17
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning
Kun Huang, Edward S. Hu, Dinesh Jayaraman
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Action
PDF
2022-12-17
Comparison of Model-Free and Model-Based Learning-Informed Planning for PointGoal Navigation
Yimeng Li, Arnab Debnath, Gregory J. Stein, Jana Kosecka
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
Reinforcement_Learning
3D
Pose
PDF
2022-12-17
Level-$k$ Meta-Learning for Pedestrian-Aware Self-Driving
Haozhe Lei, Quanyan Zhu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
Autonomous
PDF
2022-12-17
Conditional Predictive Behavior Planning with Inverse Reinforcement Learning for Human-like Autonomous Driving
Zhiyu Huang, Haochen Liu, Jingda Wu, Chen Lv
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Autonomous
Prediction
PDF
2022-12-16
Safe Evaluation For Offline Learning: Are We Ready To Deploy?
Hager Radi, Josiah P. Hanna, Peter Stone, Matthew E. Taylor
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2022-12-16
Offline Reinforcement Learning for Visual Navigation
Dhruv Shah, Arjun Bhorkar, Hrish Leen, Ilya Kostrikov, Nick Rhinehart, Sergey Levine
arXiv_CV
arXiv_CV
Reinforcement_Learning
PDF
2022-12-16
A Simple Decentralized Cross-Entropy Method
Zichen Zhang, Jun Jin, Martin Jagersand, Jun Luo, Dale Schuurmans
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
PDF
2022-12-16
Offline Robot Reinforcement Learning with Uncertainty-Guided Human Expert Sampling
Ashish Kumar, Ilya Kuzovkin
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
PDF
2022-12-16
Multi-Agent Patrolling with Battery Constraints through Deep Reinforcement Learning
Chenhao Tong, Aaron Harwood, Maria A. Rodriguez, Richard O. Sinnott
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Autonomous
PDF
2022-12-16
Reinforcement Learning for Agile Active Target Sensing with a UAV
Harsh Goel, Laura Jarin Lipschitz, Saurav Agarwal, Sandeep Manjanna, Vijay Kumar
arXiv_AI
arXiv_AI
Reinforcement_Learning
Classification
PDF
2022-12-15
Combining information-seeking exploration and reward maximization: Unified inference on continuous state and action spaces under partial observability
Parvin Malekzadeh, Konstantinos N. Plataniotis
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Attention
Inference
PDF
2022-12-15
Emergent Behaviors in Multi-Agent Target Acquisition
Piyush K. Sharma, Erin Zaroukian, Derrik E. Asher, Bryson Howell
arXiv_AI
arXiv_AI
Reinforcement_Learning
Adversarial
Classification
PDF
2022-12-15
Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer
Hang Lai, Weinan Zhang, Xialin He, Chen Yu, Zheng Tian, Yong Yu, Jun Wang
arXiv_RO
arXiv_RO
Transformer
Reinforcement_Learning
Pose
PDF
2022-12-15
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management
Yuandong Ding, Mingxiao Feng, Guozi Liu, Wei Jiang, Chuheng Zhang, Li Zhao, Lei Song, Houqiang Li, Yan Jin, Jiang Bian
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-12-15
Constitutional AI: Harmlessness from AI Feedback
Yuntao Bai, Saurav Kadavath, Sandipan Kundu, Amanda Askell, Jackson Kernion, Andy Jones, Anna Chen, Anna Goldie, Azalia Mirhoseini, Cameron McKinnon, Carol Chen, Catherine Olsson, Christopher Olah, Danny Hernandez, Dawn Drain, Deep Ganguli, Dustin Li, Eli Tran-Johnson, Ethan Perez, Jamie Kerr, Jared Mueller, Jeffrey Ladish, Joshua Landau, Kamal Ndousse, Kamile Lukosuite, Liane Lovitt, Michael Sellitto, Nelson Elhage, Nicholas Schiefer, Noemi Mercado, Nova DasSarma, Robert Lasenby, Robin Larson, Sam Ringer, Scott Johnston, Shauna Kravec, Sheer El Showk, Stanislav Fort, Tamera Lanham, Timothy Telleen-Lawton, Tom Conerly, Tom Henighan, Tristan Hume, Samuel R. Bowman, Zac Hatfield-Dodds, Ben Mann, Dario Amodei, Nicholas Joseph, Sam McCandlish, Tom Brown, Jared Kaplan
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-12-15
Residual Policy Learning for Powertrain Control
Lindsey Kerbel, Beshah Ayalew, Andrej Ivanco, Keith Loiselle
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Action
PDF
2022-12-15
Driver Assistance Eco-driving and Transmission Control with Deep Reinforcement Learning
Lindsey Kerbel, Beshah Ayalew, Andrej Ivanco, Keith Loiselle
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Knowledge
Pose
Action
PDF
2022-12-14
Robust Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman, Yexiang Xue
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Regularization
Pose
Action
PDF
2022-12-14
Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction
Brahma S. Pavse, Josiah P. Hanna
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
Action
PDF
2022-12-14
Cross-Domain Transfer via Semantic Skill Imitation
Karl Pertsch, Ruta Desai, Vikash Kumar, Franziska Meier, Joseph J. Lim, Dhruv Batra, Akshara Rai
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Action
PDF
2022-12-14
Quantum Control based on Deep Reinforcement Learning
Zhikang Wang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
PDF
2022-12-14
APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning
Jiashuo Sun, Hang Zhang, Chen Lin, Yeyun Gong, Jian Guo, Nan Duan
arXiv_CL
arXiv_CL
Reinforcement_Learning
Pose
QA
PDF
2022-12-14
Reinforcement Learning in System Identification
Jose Antonio Martin H., Oscar Fernandez Vicente, Sergio Perez, Anas Belfadil, Cristina Ibanez-Llano, Freddy Jose Perozo Rondon, Jose Javier Valle, Javier Arechalde Pelaz
arXiv_AI
arXiv_AI
Reinforcement_Learning
Face
Action
Prediction
PDF
2022-12-14
Safety Correction from Baseline: Towards the Risk-aware Policy in Robotics via Dual-agent Reinforcement Learning
Linrui Zhang, Zichen Yan, Li Shen, Shoujie Li, Xueqian Wang, Dacheng Tao
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
Knowledge
Pose
Action
PDF
2022-12-14
Efficient Exploration in Resource-Restricted Reinforcement Learning
Zhihai Wang, Taoxing Pan, Qi Zhou, Jie Wang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-12-14
Explaining Agent's Decision-making in a Hierarchical Reinforcement Learning Scenario
Hugo Muñoz, Ernesto Portugal, Angel Ayala, Bruno Fernandes, Francisco Cruz
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Pose
Action
PDF
2022-12-13
Enabling the Wireless Metaverse via Semantic Multiverse Communication
Jihong Park, Jinho Choi, Seong-Lyun Kim, Mehdi Bennis
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-12-13
Proximal Policy Optimization Based Reinforcement Learning for Joint Bidding in Energy and Frequency Regulation Markets
Muhammad Anwar, Changlong Wang, Frits de Nijs, Hao Wang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-12-13
Model-Free Approach to Fair Solar PV Curtailment Using Reinforcement Learning
Zhuo Wei, Frits de Nijs, Jinhao Li, Hao Wang
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-12-13
Improving generalization in reinforcement learning through forked agents
Olivier Moulin, Vincent Francois-Lavet, Mark Hoogendoorn
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-12-13
Collision probability reduction method for tracking control in automatic docking / berthing using reinforcement learning
Kouki Wakita, Youhei Akimoto, Dimas M. Rachman, Yoshiki Miyauchi, Umeda Naoya, Atsuo Maki
arXiv_RO
arXiv_RO
Tracking
Reinforcement_Learning
Bert
Pose
PDF
2022-12-13
Single Cell Training on Architecture Search for Image Denoising
Bokyeung Lee, Kyungdeuk Ko, Jonghwan Hong, Hanseok Ko
arXiv_CV
arXiv_CV
NAS
Reinforcement_Learning
Restoration
Pose
Denoising
Matching
PDF
2022-12-13
PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration
Qisheng Zhang, Zhen Guo, Audun Jøsang, Lance M. Kaplan, Feng Chen, Dong H. Jeong, Jin-Hee Cho
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-12-12
Variance-Reduced Conservative Policy Iteration
Naman Agarwal, Brian Bullins, Karan Singh
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-12-12
Reinforced Approximate Exploratory Data Analysis
Shaddy Garg, Subrata Mitra, Tong Yu, Yash Gadhia, Arjun Kashettiwar
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-12-12
Verifiably Safe Reinforcement Learning with Probabilistic Guarantees via Temporal Logic
Hanna Krasowski, Prithvi Akella, Aaron Ames, Matthias Althoff
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Action
PDF
2022-12-12
A Survey on Reinforcement Learning Security with Application to Autonomous Driving
Ambra Demontis, Maura Pintor, Luca Demetrio, Kathrin Grosse, Hsiao-Ying Lin, Chengfang Fang, Battista Biggio, Fabio Roli
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Survey
Autonomous
PDF
2022-12-12
Where to go: Agent Guidance with Deep Reinforcement Learning in A City-Scale Online Ride-Hailing Service
Jiyao Li, Vicki H. Allan
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-12-12
Evaluating Model-free Reinforcement Learning toward Safety-critical Tasks
Linrui Zhang, Qin Zhang, Li Shen, Bo Yuan, Xueqian Wang, Dacheng Tao
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
Pose
Face
Action
Autonomous
PDF
2022-12-12
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
Nicklas Hansen, Yixin Lin, Hao Su, Xiaolong Wang, Vikash Kumar, Aravind Rajeswaran
arXiv_AI
arXiv_AI
Reinforcement_Learning
Sparse
Action
PDF
2022-12-12
Decentralized cooperative perception for autonomous vehicles: Learning to value the unknown
Maxime Chaveroche, Franck Davoine, Véronique Cherfaoui
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Pose
Autonomous
Prediction
PDF
2022-12-11
Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks
Altun Rzayev, Vahid Tavakol Aghaei
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-12-11
Hierarchical Deep Reinforcement Learning for VWAP Strategy Optimization
Xiaodong Li, Pangjing Wu, Chenxin Zou, Qing Li
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
Action
Deep_Learning
PDF
2022-12-11
Molecular Graph Generation by Decomposition and Reassembling
Masatsugu Yamada, Mahito Sugiyama
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-12-10
Effects of Spectral Normalization in Multi-agent Reinforcement Learning
Kinal Mehta, Anuj Mahajan, Pawan Kumar
arXiv_AI
arXiv_AI
Reinforcement_Learning
Sparse
Action
PDF
2022-12-10
Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning
Chen Chen, Yuchen Hu, Qiang Zhang, Heqing Zou, Beier Zhu, Eng Siong Chng
arXiv_CV
arXiv_CV
Recognition
Reinforcement_Learning
Speech
Pose
Face
Speech_Recognition
PDF
2022-12-10
Relate to Predict: Towards Task-Independent Knowledge Representations for Reinforcement Learning
Thomas Schnürer, Malte Probst, Horst-Michael Gross
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Pose
PDF
2022-12-10
AutoDRIVE: A Comprehensive, Flexible and Integrated Cyber-Physical Ecosystem for Enhancing Autonomous Driving Research and Education
Tanmay Vilas Samak, Chinmay Vilas Samak, Sivanathan Kandhasamy, Venkat Krovi, Ming Xie
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Face
Autonomous
PDF
2022-12-09
Expeditious Saliency-guided Mix-up through Random Gradient Thresholding
Minh-Long Luu, Zeyi Huang, Eric P. Xing, Yong Jae Lee, Haohan Wang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Weakly_Supervised
Salient
Adversarial
PDF
2022-12-09
Frugal Reinforcement-based Active Learning
Sebastien Deschamps, Hichem Sahbi
arXiv_CV
arXiv_CV
Reinforcement_Learning
Pose
Classification
Image_Classification
PDF
2022-12-09
Physically Plausible Animation of Human Upper Body from a Single Image
Ziyuan Huang, Zhengping Zhou, Yung-Yu Chuang, Jiajun Wu, C. Karen Liu
arXiv_AI
arXiv_AI
Reinforcement_Learning
3D
Pose
Action
PDF
2022-12-09
Near-Optimal Differentially Private Reinforcement Learning
Dan Qiao, Yu-Xiang Wang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Action
PDF
2022-12-09
Reinforcement Learning for Predicting Traffic Accidents
Injoon Cho, Praveen Kumar Rajendran, Taeyoung Kim, Dongsoo Har
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Deep_Learning
Attention
Autonomous
Prediction
PDF
2022-12-08
Confidence-Conditioned Value Functions for Offline Reinforcement Learning
Joey Hong, Aviral Kumar, Sergey Levine
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-12-08
System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games
Indranil Sur, Zachary Daniels, Abrar Rahman, Kamil Faber, Gianmarco J. Gallardo, Tyler L. Hayes, Cameron E. Taylor, Mustafa Burak Gurbuz, James Smith, Sahana Joshi, Nathalie Japkowicz, Michael Baron, Zsolt Kira, Christopher Kanan, Roberto Corizzo, Ajay Divakaran, Michael Piacentino, Jesse Hostetler, Aswin Raghavan
arXiv_AI
arXiv_AI
Reinforcement_Learning
Quantitative
PDF
2022-12-08
Learning Options via Compression
Yiding Jiang, Evan Zheran Liu, Benjamin Eysenbach, Zico Kolter, Chelsea Finn
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-12-08
PALMER: Perception-Action Loop with Memory for Long-Horizon Planning
Onur Beker, Mohammad Mohammadi, Amir Zamir
arXiv_AI
arXiv_AI
Embedding
Reinforcement_Learning
Represenation_Learning
Pose
Action
Deep_Learning
PDF
2022-12-08
HERD: Continuous Human-to-Robot Evolution for Learning from Human Demonstration
Xingyu Liu, Deepak Pathak, Kris M. Kitani
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-12-08
A Scale-Arbitrary Image Super-Resolution Network Using Frequency-domain Information
Jing Fang, Yinbo Yu, Zhongyuan Wang, Xin Ding, Ruimin Hu
arXiv_CV
arXiv_CV
Reinforcement_Learning
Super_Resolution
PDF
2022-12-08
Design and Planning of Flexible Mobile Micro-Grids Using Deep Reinforcement Learning
Cesare Caputo (Imperial College London), Michel-Alexandre Cardin (Imperial College London), Pudong Ge (Imperial College London), Fei Teng (Imperial College London), Anna Korre (Imperial College London), Ehecatl Antonio del Rio Chanona (Imperial College London)
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-12-08
Enhanced method for reinforcement learning based dynamic obstacle avoidance by assessment of collision risk
Fabian Hart, Ostap Okhrin
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Drone
Autonomous
PDF
2022-12-08
Elixir: A system to enhance data quality for multiple analytics on a video stream
Sibendu Paul, Kunal Rao, Giuseppe Coviello, Murugan Sankaradas, Oliver Po, Y. Charlie Hu, Srimat T. Chakradhar
arXiv_CV
arXiv_CV
Reinforcement_Learning
Face
PDF
2022-12-08
A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces
Charline Le Lan, Joshua Greaves, Jesse Farebrother, Mark Rowland, Fabian Pedregosa, Rishabh Agarwal, Marc G. Bellemare
arXiv_AI
arXiv_AI
Gradient_Descent
Reinforcement_Learning
Image_Compression
PDF
2022-12-07
Low Variance Off-policy Evaluation with State-based Importance Sampling
David M. Bossens, Philip Thomas
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-12-07
Combining Planning, Reasoning and Reinforcement Learning to solve Industrial Robot Tasks
Matthias Mayr, Faseeh Ahmad, Konstantinos Chatzilygeroudis, Luigi Nardi, Volker Krueger
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
Knowledge
PDF
2022-12-07
Selector-Enhancer: Learning Dynamic Selection of Local and Non-local Attention Operation for Speech Enhancement
Xinmeng Xu, Weiping Tu, Yuhong Yang
arXiv_SD
arXiv_SD
Enhancement
Reinforcement_Learning
Speech
Pose
Deep_Learning
Denoising
Attention
CNN
PDF
2022-12-06
Few-Shot Preference Learning for Human-in-the-Loop RL
Joey Hejna, Dorsa Sadigh
arXiv_AI
arXiv_AI
Reinforcement_Learning
Few-Shot
PDF
2022-12-06
Solving the Side-Chain Packing Arrangement of Proteins from Reinforcement Learned Stochastic Decision Making
Chandrajit Bajaj, Conrad Li, Minh Nguyen
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Prediction
PDF
2022-12-06
Understanding Self-Predictive Learning for Reinforcement Learning
Yunhao Tang, Zhaohan Daniel Guo, Pierre Harvey Richemond, Bernardo Ávila Pires, Yash Chandak, Rémi Munos, Mark Rowland, Mohammad Gheshlaghi Azar, Charline Le Lan, Clare Lyle, András György, Shantanu Thakoor, Will Dabney, Bilal Piot, Daniele Calandriello, Michal Valko
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Represenation_Learning
Pose
Prediction
PDF
2022-12-06
Variable-Decision Frequency Option Critic
Amirmohammad Karimi, Jun Jin, Jun Luo, A. Rupam Mahmood, Martin Jagersand, Samuele Tosatto
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-12-06
First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation
Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat
arXiv_AI
arXiv_AI
Reinforcement_Learning
Sparse
PDF
2022-12-06
Switching to Discriminative Image Captioning by Relieving a Bottleneck of Reinforcement Learning
Ukyo Honda, Taro Watanabe, Yuji Matsumoto
arXiv_CV
arXiv_CV
Image_Caption
Reinforcement_Learning
Pose
Classification
Caption
PDF
2022-12-06
ISAACS: Iterative Soft Adversarial Actor-Critic for Safety
Kai-Chieh Hsu, Duy Phuong Nguyen, Jaime Fernández Fisac
arXiv_RO
arXiv_RO
Reinforcement_Learning
Adversarial
Pose
Action
PDF
2022-12-06
Adaptive Risk-Aware Bidding with Budget Constraint in Display Advertising
Zhimeng Jiang, Kaixiong Zhou, Mi Zhang, Rui Chen, Xia Hu, Soo-Hyun Choi
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Self-Supervised
Pose
Relation
PDF
2022-12-06
Active Classification of Moving Targets with Learned Control Policies
Álvaro Serra-Gómez (1), Eduardo Montijano (2), Wendelin Böhmer (3), Javier Alonso-Mora (1) ((1) Department of Cognitive Robotics, Delft University of Technology, (2) Department of Informatics and Systems Engineering, Universidad de Zaragoza, (3) Department of Software Technology, Delft University of Technology)
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Classification
Deep_Learning
Relation
Attention
Drone
PDF
2022-12-06
Towards a more efficient computation of individual attribute and policy contribution for post-hoc explanation of cooperative multi-agent systems using Myerson values
Giorgio Angelotti, Natalia Díaz-Rodríguez
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Knowledge_Graph
Pose
Quantitative
Relation
PDF
2022-12-06
Reinforcement Learning for UAV control with Policy and Reward Shaping
Cristian Millán-Arias, Ruben Contreras, Francisco Cruz, Bruno Fernandes
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Pose
Drone
Autonomous
PDF
2022-12-06
State Space Closure: Revisiting Endless Online Level Generation via Reinforcement Learning
Ziqi Wang, Tianye Shu, Jialin Liu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-12-06
Scalable Planning and Learning Framework Development for Swarm-to-Swarm Engagement Problems
Umut Demir, A. Sadik Satir, Gulay Goktas Sever, Cansu Yikilmaz, Nazim Kemal Ure
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Attention
PDF
2022-12-06
PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement
Wanqi Xue, Qingpeng Cai, Zhenghai Xue, Shuo Sun, Shuchang Liu, Dong Zheng, Peng Jiang, Bo An
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Optimization
Knowledge
Pose
Recommendation
PDF
2022-12-06
Safe Inverse Reinforcement Learning via Control Barrier Function
Yue Yang, Letian Chen, Matthew Gombolay
arXiv_AI
arXiv_AI
Gradient_Descent
Reinforcement_Learning
3D
Pose
Attention
Drone
PDF
2022-12-06
Curriculum Learning for Relative Overgeneralization
Lin Shi, Bei Peng
arXiv_AI
arXiv_AI
Transfer_Learning
Reinforcement_Learning
Knowledge
Pose
Action
PDF
2022-12-06
A Novel Deep Reinforcement Learning Based Automated Stock Trading System Using Cascaded LSTM Networks
Jie Zou, Jiashu Lou, Baohua Wang, Sixue Liu
arXiv_AI
arXiv_AI
Reinforcement_Learning
RNN
Pose
PDF
2022-12-06
Efficient Learning of Voltage Control Strategies via Model-based Deep Reinforcement Learning
Ramij R. Hossain, Tianzhixi Yin, Yan Du, Renke Huang, Jie Tan, Wenhao Yu, Yuan Liu, Qiuhua Huang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-12-06
What is the Solution for State Adversarial Multi-Agent Reinforcement Learning?
Songyang Han, Sanbao Su, Sihong He, Shuo Han, Haizhao Yang, Fei Miao
arXiv_AI
arXiv_AI
Gradient_Descent
Reinforcement_Learning
Adversarial
Pose
PDF
2022-12-06
Learning Locally, Communicating Globally: Reinforcement Learning of Multi-robot Task Allocation for Cooperative Transport
Kazuki Shibata, Tomohiko Jimbo, Tadashi Odashima, Keisuke Takeshita, Takamitsu Matsubara
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Action
PDF
2022-12-05
PEANUT: Predicting and Navigating to Unseen Targets
Albert J. Zhai, Shenlong Wang
arXiv_AI
arXiv_AI
Reinforcement_Learning
3D
Prediction
PDF
2022-12-05
PowRL: A Reinforcement Learning Framework for Robust Management of Power Networks
Anandsingh Chauhan, Mayank Baranwal, Ansuma Basumatary
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2022-12-05
Bi-Level Optimization Augmented with Conditional Variational Autoencoder for Autonomous Driving in Dense Traffic
Arun Kumar Singh, Jatan Shrestha, Nicola Albarella
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
Autonomous
PDF
2022-12-05
L2SR: Learning to Sample and Reconstruct for Accelerated MRI
Pu Yang, Bin Dong
arXiv_CV
arXiv_CV
Reconstruction
Reinforcement_Learning
Sparse
Pose
PDF
2022-12-05
Physics-Informed Model-Based Reinforcement Learning
Adithya Ramesh, Balaraman Ravindran
arXiv_RO
arXiv_RO
Reinforcement_Learning
PDF
2022-12-05
Accelerating Interactive Human-like Manipulation Learning with GPU-based Simulation and High-quality Demonstrations
Malte Mosbach, Kara Moraw, Sven Behnke
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Face
Action
PDF
2022-12-05
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets
Yuanying Cai, Chuheng Zhang, Li Zhao, Wei Shen, Xuyun Zhang, Lei Song, Jiang Bian, Tao Qin, Tieyan Liu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Regularization
Action
PDF
2022-12-05
A Machine with Short-Term, Episodic, and Semantic Memory Systems
Taewoon Kim, Michael Cochez, Vincent François-Lavet, Mark Neerincx, Piek Vossen
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Knowledge_Graph
PDF
2022-12-05
Learning Physically Realizable Skills for Online Packing of General 3D Shapes
Hang Zhao, Zherong Pan, Yang Yu, Kai Xu
arXiv_RO
arXiv_RO
Reinforcement_Learning
3D
Pose
Action
PDF
2022-12-05
A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat
Jiajun Chai, Wenzhang Chen, Yuanheng Zhu, Zong-xin Yao, Dongbin Zhao
arXiv_AI
arXiv_AI
Tracking
Reinforcement_Learning
Optimization
Pose
Action
PDF
2022-12-05
E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance
Can Chang, Ni Mu, Jiajun Wu, Ling Pan, Huazhe Xu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Zero-Shot
Pose
Action
PDF
2022-12-05
Deep reinforcement learning of event-triggered communication and consensus-based control for distributed cooperative transport
Kazuki Shibata, Tomohiko Jimbo, Takamitsu Matsubara
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
PDF
2022-12-04
Learning Bifunctional Push-grasping Synergistic Strategy for Goal-agnostic and Goal-oriented Tasks
Dafa Ren, Shuang Wu, Xiaofan Wang, Yan Peng, Xiaoqiang Ren
arXiv_RO
arXiv_RO
Transformer
Reinforcement_Learning
Pose
Action
PDF
2022-12-04
RLogist: Fast Observation Strategy on Whole-slide Images with Deep Reinforcement Learning
Boxuan Zhao, Jun Zhang, Deheng Ye, Jian Cao, Xiao Han, Qiang Fu, Wei Yang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Sparse
Pose
Classification
Detection
PDF
2022-12-03
Active learning using adaptable task-based prioritisation
Shaheer U. Saeed, João Ramalhinho, Mark Pinnock, Ziyi Shen, Yunguan Fu, Nina Montaña-Brown, Ester Bonmati, Dean C. Barratt, Stephen P. Pereira, Brian Davidson, Matthew J. Clarkson, Yipeng Hu
arXiv_CV
arXiv_CV
Segmentation
Reinforcement_Learning
Pose
GAN
Medical
PDF
2022-12-03
XTENTH-CAR: A Proportionally Scaled Experimental Vehicle Platform for Connected Autonomy and All-Terrain Research
Shathushan Sivashangaran, Azim Eskandarian
arXiv_RO
arXiv_RO
Reinforcement_Learning
Autonomous
PDF
2022-12-03
A Hierarchical Approach for Strategic Motion Planning in Autonomous Racing
Rudolf Reiter, Jasper Hoffmann, Joschka Boedecker, Moritz Diehl
arXiv_RO
arXiv_RO
Tracking
Reinforcement_Learning
Autonomous
PDF
2022-12-03
Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward
Yanjiang Guo, Jingyue Gao, Zheng Wu, Chengming Shi, Jianyu Chen
arXiv_AI
arXiv_AI
Reinforcement_Learning
Sparse
Pose
PDF
2022-12-02
Selecting Mechanical Parameters of a Monopode Jumping System with Reinforcement Learning
Andrew Albright, Joshua Vaughan
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-12-02
Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery
Yiqin Yang, Hao Hu, Wenzhe Li, Siyuan Li, Jun Yang, Qianchuan Zhao, Chongjie Zhang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Quantitative
PDF
2022-12-02
STL-Based Synthesis of Feedback Controllers Using Reinforcement Learning
Nikhil Kumar Singh, Indranil Saha
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Quantitative
Attention
PDF
2022-12-02
Prim-LAfD: A Framework to Learn and Adapt Primitive-Based Skills from Demonstrations for Insertion Tasks
Zheng Wu, Wenzhao Lian, Changhao Wang, Mengxi Li, Stefan Schaal, Masayoshi Tomizuka
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
Pose
PDF
2022-12-01
Karolos: An Open-Source Reinforcement Learning Framework for Robot-Task Environments
Christian Bitter, Timo Thun, Tobias Meisen
arXiv_AI
arXiv_AI
Enhancement
Reinforcement_Learning
Optimization
PDF
2022-12-01
Modeling Mobile Health Users as Reinforcement Learning Agents
Eura Shin, Siddharth Swaroop, Weiwei Pan, Susan Murphy, Finale Doshi-Velez
arXiv_AI
arXiv_AI
Reinforcement_Learning
Relation
PDF
2022-12-01
Reward Function Optimization of a Deep Reinforcement Learning Collision Avoidance System
Cooper Cone, Michael Owen, Luis Alvarez, Marc Brittain
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
PDF
2022-12-01
Exploiting Socially-Aware Tasks for Embodied Social Navigation
Enrico Cancelli, Tommaso Campari, Luciano Serafini, Angel X. Chang, Lamberto Ballan
arXiv_AI
arXiv_AI
Reinforcement_Learning
3D
Pose
Action
PDF
2022-12-01
Safe Reinforcement Learning with Probabilistic Control Barrier Functions for Ramp Merging
Soumith Udatha, Yiwei Lyu, John Dolan
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Autonomous
PDF
2022-12-01
Kick-motion Training with DQN in AI Soccer Environment
Bumgeun Park, Jihui Lee, Taeyoung Kim, Dongsoo Har
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2022-12-01
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin, Tongtong Yu, Shengqi Shen, Jun Yang, Meijing Zhao, Kaiqi Huang, Bin Liang, Liang Wang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Review
Pose
Survey
PDF
2022-12-01
Five Properties of Specific Curiosity You Didn't Know Curious Machines Should Have
Nadia M. Ady, Roshan Shariff, Johannes Günther, Patrick M. Pilarski
arXiv_AI
arXiv_AI
Reinforcement_Learning
Survey
Activity
PDF
2022-11-30
Reinforcement Learning for Signal Temporal Logic using Funnel-Based Approach
Naman Saxena, Gorantla Sandeep, Pushpak Jagtap
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-11-30
Safe Model-Free Reinforcement Learning using Disturbance-Observer-Based Control Barrier Functions
Yikun Cheng, Pan Zhao, Naira Hovakimyan
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Action
Attention
PDF
2022-11-30
ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format
Qi Zhu, Christian Geishauser, Hsien-chin Lin, Carel van Niekerk, Baolin Peng, Zheng Zhang, Michael Heck, Nurul Lubis, Dazhen Wan, Xiaochen Zhu, Jianfeng Gao, Milica Gašić, Minlie Huang
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Knowledge
Few-Shot
PDF
2022-11-30
Targets in Reinforcement Learning to solve Stackelberg Security Games
Saptarashmi Bandyopadhyay, Chenqi Zhu, Philip Daniel, Joshua Morrison, Ethan Shay, John Dickerson
arXiv_AI
arXiv_AI
Reinforcement_Learning
Review
PDF
2022-11-30
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning
Yizhou Zhang, Guannan Qu, Pan Xu, Yiheng Lin, Zaiwei Chen, Adam Wierman
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Attention
PDF
2022-11-30
Reinforcement Learning for Multi-Truck Vehicle Routing Problems
Randall Correll (1), Sean J. Weinberg (1), Fabio Sanches (1), Takanori Ide (2), Takafumi Suzuki (3) ((1) QC Ware Corp Palo Alto, (2) AISIN CORPORATION Tokyo, (3) Aisin Technical Center of America San Jose)
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Attention
PDF
2022-11-30
The Cost of Learning: Efficiency vs. Efficacy of Learning-Based RRM for 6G
Seyyidahmed Lahmer, Federico Chiariotti, Andrea Zanella
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-11-30
Rephrasing the Reference for Non-Autoregressive Machine Translation
Chenze Shao, Jinchao Zhang, Jie Zhou, Yang Feng
arXiv_CL
arXiv_CL
Transformer
Reinforcement_Learning
Inference
PDF
2022-11-30
Towards Improving Exploration in Self-Imitation Learning using Intrinsic Motivation
Alain Andres, Esther Villar-Rodriguez, Javier Del Ser
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Sparse
Pose
PDF
2022-11-30
Reinforced Language Modeling for End-to-End Task Oriented Dialog
Xiao Yu, Qingyang Wu, Kun Qian, Zhou Yu
arXiv_CL
arXiv_CL
Reinforcement_Learning
Pose
Language_Model
PDF
2022-11-30
General policy mapping: online continual reinforcement learning inspired on the insect brain
Angel Yanguas-Gil, Sandeep Madireddy
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2022-11-30
Real-time Bidding Strategy in Display Advertising: An Empirical Analysis
Mengjuan Liu, Zhengning Hu, Zhi Lai, Daiwei Zheng, Xuyun Nie
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
Quantitative
Attention
PDF
2022-11-30
Policy Optimization over General State and Action Spaces
Guanghui Lan
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Knowledge
Action
PDF
2022-11-30
Efficient Reinforcement Learning : Targeted Exploration Through Action Saturation
Loris Di Natale, Bratislav Svetozarevic, Philipp Heer, Colin N. Jones
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Pose
Action
PDF
2022-11-30
Automatic Discovery of Multi-perspective Process Model using Reinforcement Learning
Sunghyun Sim, Ling Liu, Hyerim Bae
arXiv_AI
arXiv_AI
Enhancement
Reinforcement_Learning
Pose
Activity
PDF
2022-11-30
Welfare and Fairness in Multi-objective Reinforcement Learning
Zimeng Fan, Nianli Peng, Muhang Tian, Brandon Fain
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Action
PDF
2022-11-29
Multi-Agent Reinforcement Learning for Microprocessor Design Space Exploration
Srivatsan Krishnan, Natasha Jaques, Shayegan Omidshafiei, Dan Zhang, Izzeddin Gur, Vijay Janapa Reddi, Aleksandra Faust
arXiv_AI
arXiv_AI
NAS
Reinforcement_Learning
Optimization
Pose
PDF
2022-11-29
Symmetry Detection in Trajectory Data for More Meaningful Reinforcement Learning Representations
Marissa D'Alonzo, Rebecca Russell
arXiv_AI
arXiv_AI
Reinforcement_Learning
RNN
Knowledge
Detection
PDF
2022-11-29
Automated Play-Testing Through RL Based Human-Like Play-Styles Generation
Pierre Le Pelletier de Woillemont, Rémi Labory, Vincent Corruble
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-29
Configurable Agent With Reward As Input: A Play-Style Continuum Generation
Pierre Le Pelletier de Woillemont, Rémi Labory, Vincent Corruble
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-29
Autotuning PID control using Actor-Critic Deep Reinforcement Learning
Vivien van Veldhuizen
arXiv_AI
arXiv_AI
Reinforcement_Learning
Prediction
PDF
2022-11-29
Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning
Guoxi Zhang, Hisashi Kashima
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
PDF
2022-11-29
Offline Reinforcement Learning with Closed-Form Policy Improvement Operators
Jiachen Li, Edwin Zhang, Ming Yin, Qinxun Bai, Yu-Xiang Wang, William Yang Wang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-11-29
The Surprising Effectiveness of Latent World Models for Continual Reinforcement Learning
Samuel Kessler, Piotr Miłoś, Jack Parker-Holder, Stephen J. Roberts
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
PDF
2022-11-29
Finding mixed-strategy equilibria of continuous-action games without gradients using randomized policy networks
Carlos Martin, Tuomas Sandholm
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Knowledge
Action
PDF
2022-11-29
Discrete Control in Real-World Driving Environments using Deep Reinforcement Learning
Avinash Amballa, Advaith P., Pradip Sasmal, Sumohana Channappayya
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Pose
PDF
2022-11-29
Multi-robot Social-aware Cooperative Planning in Pedestrian Environments Using Multi-agent Reinforcement Learning
Zichen He, Chunwei Song, Lu Dong
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Relation
Attention
PDF
2022-11-29
Peano: Learning Formal Mathematical Reasoning
Gabriel Poesia, Noah D. Goodman
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2022-11-29
Continuous Neural Algorithmic Planners
Yu He, Petar Veličković, Pietro Liò, Andreea Deac
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2022-11-28
CLAS: Coordinating Multi-Robot Manipulation with Central Latent Action Spaces
Elie Aljalbout, Maximilian Karl, Patrick van der Smagt
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Action
PDF
2022-11-28
A Visual Active Search Framework for Geospatial Exploration
Anindya Sarkar, Michael Lanier, Scott Alfeld, Roman Garnett, Nathan Jacobs, Yevgeniy Vorobeychik
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Activity
PDF
2022-11-28
Is Conditional Generative Modeling all you need for Decision-Making?
Anurag Ajay, Yilun Du, Abhi Gupta, Joshua Tenenbaum, Tommi Jaakkola, Pulkit Agrawal
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-28
Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning
Leo Ardon, Alberto Pozanco, Daniel Borrajo, Sumitra Ganesh
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Pose
Action
Autonomous
PDF
2022-11-28
Tackling Visual Control via Multi-View Exploration Maximization
Mingqi Yuan, Xin Jin, Bo Li, Wenjun Zeng
arXiv_AI
arXiv_AI
Reinforcement_Learning
Represenation_Learning
Knowledge
Action
PDF
2022-11-28
Continuous Episodic Control
Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat
arXiv_AI
arXiv_AI
Reinforcement_Learning
Sparse
Pose
Action
PDF
2022-11-28
GraphPNAS: Learning Distribution of Good Neural Architectures via Deep Graph Generative Models
Muchen Li, Jeffrey Yunfan Liu, Leonid Sigal, Renjie Liao
arXiv_AI
arXiv_AI
NAS
Reinforcement_Learning
RNN
Pose
Relation
PDF
2022-11-28
State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning
Chen Chen, Hongyao Tang, Yi Ma, Chao Wang, Qianli Shen, Dong Li, Jianye Hao
arXiv_AI
arXiv_AI
Reinforcement_Learning
Regularization
Pose
PDF
2022-11-28
Quantile Constrained Reinforcement Learning: A Reinforcement Learning Framework Constraining Outage Probability
Whiyoung Jung, Myungsik Cho, Jongeui Park, Youngchul Sung
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-11-28
AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning
Hongjie Zhang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-11-28
Multiagent Reinforcement Learning for Autonomous Routing and Pickup Problem with Adaptation to Variable Demand
Daniel Garces, Sushmita Bhattacharya, Stephanie Gil, Dimitri Bertsekas
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
Autonomous
PDF
2022-11-27
CorrectNet: Robustness Enhancement of Analog In-Memory Computing for Neural Networks by Error Suppression and Compensation
Amro Eldebiky, Grace Li Zhang, Georg Boecherer, Bing Li, Ulf Schlichtmann
arXiv_CV
arXiv_CV
Enhancement
Reinforcement_Learning
Regularization
Pose
Inference
PDF
2022-11-27
Reinforcement Learning from Simulation to Real World Autonomous Driving using Digital Twin
Kevin Voogd, Jean Pierre Allamaa, Javier Alonso-Mora, Tong Duy Son
arXiv_RO
arXiv_RO
Transfer_Learning
Reinforcement_Learning
Pose
Autonomous
PDF
2022-11-27
Domain Generalization for Robust Model-Based Offline Reinforcement Learning
Alan Clark, Shoaib Ahmed Siddiqui, Robert Kirk, Usman Anwar, Stephen Chung, David Krueger
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-11-26
Evaluation Beyond Task Performance: Analyzing Concepts in AlphaZero in Hex
Charles Lovering, Jessica Zosa Forde, George Konidaris, Ellie Pavlick, Michael L. Littman
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-26
Computational Co-Design for Variable Geometry Truss
Jianzhe Gu, Lining Yao
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-11-26
How Crucial is Transformer in Decision Transformer?
Max Siebenborn, Boris Belousov, Junning Huang, Jan Peters
arXiv_RO
arXiv_RO
Transformer
Reinforcement_Learning
RNN
Pose
Action
PDF
2022-11-26
Transfer RL via the Undo Maps Formalism
Abhi Gupta, Ted Moskovitz, David Alvarez-Melis, Aldo Pacchiano
arXiv_AI
arXiv_AI
Transfer_Learning
Reinforcement_Learning
Optimization
Knowledge
Pose
Matching
PDF
2022-11-26
A Critical Review of Traffic Signal Control and A Novel Unified View of Reinforcement Learning and Model Predictive Control Approaches for Adaptive Traffic Signal Control
Xiaoyu Wang (1), Scott Sanner (2), Baher Abdulhai (1) ((1) Department of Civil Engineering, University of Toronto, (2) Department of Mechanical and Industrial Engineering, University of Toronto)
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Review
Pose
PDF
2022-11-25
Pac-Man Pete: An extensible framework for building AI in VEX Robotics
Jacob Zietek, Nicholas Wade, Cole Roberts, Aref Malek, Manish Pylla, Will Xu, Sagar Patil
arXiv_AI
arXiv_AI
Reinforcement_Learning
Autonomous
PDF
2022-11-25
Towards Improving Proactive Dialog Agents Using Socially-Aware Reinforcement Learning
Matthias Kraus, Nicolas Wagner, Ron Riekenbrauck, Wolfgang Minker
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Relation
Activity
PDF
2022-11-25
Assistive Teaching of Motor Control Tasks to Humans
Megha Srivastava, Erdem Biyik, Suvir Mirchandani, Noah Goodman, Dorsa Sadigh
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-11-25
Operator Splitting Value Iteration
Amin Rakhsha, Andrew Wang, Mohammad Ghavamzadeh, Amir-massoud Farahmand
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-24
Discovering Generalizable Spatial Goal Representations via Graph-based Active Reward Learning
Aviv Netanyahu, Tianmin Shu, Joshua Tenenbaum, Pulkit Agrawal
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Relation
PDF
2022-11-24
SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration
Giulia Vezzani, Dhruva Tirumala, Markus Wulfmeier, Dushyant Rao, Abbas Abdolmaleki, Ben Moran, Tuomas Haarnoja, Jan Humplik, Roland Hafner, Michael Neunert, Claudio Fantacci, Tim Hertweck, Thomas Lampe, Fereshteh Sadeghi, Nicolas Heess, Martin Riedmiller
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
PDF
2022-11-24
Assessing Quality-Diversity Neuro-Evolution Algorithms Performance in Hard Exploration Problems
Felix Chalumeau, Thomas Pierrot, Valentin Macé, Arthur Flajolet, Karim Beguir, Antoine Cully, Nicolas Perrin-Gilbert
arXiv_AI
arXiv_AI
Reinforcement_Learning
GAN
PDF
2022-11-24
A Benchmark Environment Motivated by Industrial Control Problems
Daniel Hein, Stefan Depeweg, Michel Tokic, Steffen Udluft, Alexander Hentschel, Thomas A. Runkler, Volkmar Sterzing
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-24
Explainable and Safe Reinforcement Learning for Autonomous Air Mobility
Lei Wang, Hongyu Yang, Yi Lin, Suwan Yin, Yuankai Wu
arXiv_AI
arXiv_AI
Enhancement
Reinforcement_Learning
Adversarial
Pose
Attention
Autonomous
PDF
2022-11-24
Multi-Job Intelligent Scheduling with Cross-Device Federated Learning
Ji Liu, Juncheng Jia, Beichen Ma, Chendi Zhou, Jingbo Zhou, Yang Zhou, Huaiyu Dai, Dejing Dou
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-11-24
Visual Simulation Software Demonstration for Quantum Multi-Drone Reinforcement Learning
Chanyoung Park, Jae Pyoung Kim, Won Joon Yun, Soyi Jung, Joongheon Kim
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Attention
Drone
Autonomous
PDF
2022-11-24
MaskPlace: Fast Chip Placement via Reinforced Visual Representation Learning
Yao Lai, Yao Mu, Ping Luo
arXiv_CV
arXiv_CV
Reinforcement_Learning
Sparse
Represenation_Learning
Action
Autonomous
PDF
2022-11-23
Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning
Tingting Zhao, Ying Wang, Wei Sun, Yarui Chen, Gang Niub, Masashi Sugiyama
arXiv_AI
arXiv_AI
Unsupervised
Reinforcement_Learning
Represenation_Learning
Pose
Action
Deep_Learning
PDF
2022-11-23
Learning to Imitate Object Interactions from Internet Videos
Austin Patel, Andrew Wang, Ilija Radosavovic, Jitendra Malik
arXiv_CV
arXiv_CV
Reconstruction
Reinforcement_Learning
3D
Action
PDF
2022-11-23
Enhancing team performance with transfer-learning during real-world human-robot collaboration
Athanasios C. Tsitos, Maria Dagioglou
arXiv_AI
arXiv_AI
Transfer_Learning
Reinforcement_Learning
Knowledge
Action
PDF
2022-11-23
Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
Kevin Frans, Phillip Isola
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-23
Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective Reinforcement Learning
Conor F. Hayes, Mathieu Reymond, Diederik M. Roijers, Enda Howley, Patrick Mannion
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Medical
PDF
2022-11-23
Reinforcement Learning Agent Design and Optimization with Bandwidth Allocation Model
Rafael F. Reale, Joberto S. B. Martins
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
PDF
2022-11-23
Reinforcement learning for traffic signal control in hybrid action space
Haoqing Luo, sheng jin
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Knowledge
Pose
Action
PDF
2022-11-23
Introspection-based Explainable Reinforcement Learning in Episodic and Non-episodic Scenarios
Niclas Schroeter, Francisco Cruz, Stefan Wermter
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-11-23
Automatic Generation of Socratic Subquestions for Teaching Math Word Problems
Kumar Shridhar, Jakub Macina, Mennatallah El-Assady, Tanmay Sinha, Manu Kapur, Mrinmaya Sachan
arXiv_CL
arXiv_CL
Reinforcement_Learning
OCR
Pose
Language_Model
PDF
2022-11-23
Masked Autoencoding for Scalable and Generalizable Decision Making
Fangchen Liu, Hao Liu, Aditya Grover, Pieter Abbeel
arXiv_AI
arXiv_AI
Reinforcement_Learning
Zero-Shot
Self-Supervised
Action
Language_Model
Prediction
PDF
2022-11-23
Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition
Shunyu Liu, Yihe Zhou, Jie Song, Tongya Zheng, Kaixuan Chen, Tongtian Zhu, Zunlei Feng, Mingli Song
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Contrastive_Learning
PDF
2022-11-22
Safe Control and Learning Using Generalized Action Governor
Nan Li, Yutong Li, Ilya Kolmanovsky, Anouck Girard, H. Eric Tseng, Dimitar Filev
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-11-22
Efficient Exploration using Model-Based Quality-Diversity with Gradients
Bryan Lim, Manon Flageat, Antoine Cully
arXiv_AI
arXiv_AI
Reinforcement_Learning
Sparse
Pose
PDF
2022-11-22
Monte Carlo Forest Search: UNSAT Solver Synthesis via Reinforcement learning
Chris Cameron, Jason Hartford, Taylor Lundy, Tuan Truong, Alan Milligan, Rex Chen, Kevin Leyton-Brown
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-22
The impact of moving expenses on social segregation: a simulation with RL and ABM
Xinyu Li
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-22
A Deep Reinforcement Learning Approach to Rare Event Estimation
Anthony Corso, Kyu-Young Kim, Shubh Gupta, Grace Gao, Mykel J. Kochenderfer
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
Autonomous
PDF
2022-11-22
Reinforcement Causal Structure Learning on Order Graph
Dezhi Yang, Guoxian Yu, Jun Wang, Zhengtian Wu, Maozu Guo
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-11-22
imitation: Clean Imitation Learning Implementations
Adam Gleave, Mohammad Taufeeque, Juan Rocamonde, Erik Jenner, Steven H. Wang, Sam Toyer, Maximilian Ernestus, Nora Belrose, Scott Emmons, Stuart Russell
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-22
Learning-based social coordination to improve safety and robustness of cooperative autonomous vehicles in mixed traffic
Rodolfo Valiente, Behrad Toghi, Mahdi Razzaghpour, Ramtin Pedarsani, Yaser P. Fallah
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Action
Autonomous
PDF
2022-11-22
Don't Watch Me: A Spatio-Temporal Trojan Attack on Deep-Reinforcement-Learning-Augment Autonomous Driving
Yinbo Yu, Jiajia Liu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Classification
Attention
Image_Classification
Autonomous
PDF
2022-11-22
A Reinforcement Learning Approach to Optimize Available Network Bandwidth Utilization
Hasibul Jamil, Elvis Rodrigues, Jacob Goldverg, Tevfik Kosar
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
PDF
2022-11-21
TEMPERA: Test-Time Prompting via Reinforcement Learning
Tianjun Zhang, Xuezhi Wang, Denny Zhou, Dale Schuurmans, Joseph E. Gonzalez
arXiv_AI
arXiv_AI
Reinforcement_Learning
Zero-Shot
Knowledge
Pose
Action
Classification
Sentiment
Few-Shot
Inference
Language_Model
PDF
2022-11-21
Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks
Anton Dereventsov, Andrew Starnes, Clayton G. Webster
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Action
PDF
2022-11-21
Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning
Alex Beeson, Giovanni Montana
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2022-11-21
Last-Mile Embodied Visual Navigation
Justin Wasserman, Karmesh Yadav, Girish Chowdhary, Abhinav Gupta, Unnat Jain
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-11-21
Visual Dexterity: In-hand Dexterous Manipulation from Depth
Tao Chen, Megha Tippur, Siyang Wu, Vikash Kumar, Edward Adelson, Pulkit Agrawal
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
PDF
2022-11-21
Reinforcement Learning-Enhanced Control Barrier Functions for Robot Manipulators
Stephen McIlvanna, Nhat Nguyen Minh, Yuzhu Sun, Mien Van, Wasif Naeem
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
PDF
2022-11-21
Data-Driven Offline Decision-Making via Invariant Representation Learning
Han Qi, Yi Su, Aviral Kumar, Sergey Levine
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Optimization
Represenation_Learning
Action
Prediction
PDF
2022-11-21
A Low Latency Adaptive Coding Spiking Framework for Deep Reinforcement Learning
Lang Qin, Rui Yan, Huajin Tang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Attention
PDF
2022-11-21
Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning
Junjie Sheng, Lu Wang, Fangkai Yang, Bo Qiao, Hang Dong, Xiangfeng Wang, Bo Jin, Jun Wang, Si Qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-11-21
BBReach: Tight and Scalable Black-Box Reachability Analysis of Deep Reinforcement Learning Systems
Jiaxu Tian, Dapeng Zhi, Si Liu, Peixin Wang, Guy Katz, Min Zhang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-11-20
Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows
Dmitriy Akimov, Vladislav Kurenkov, Alexander Nikulin, Denis Tarasov, Sergey Kolesnikov
arXiv_AI
arXiv_AI
Reinforcement_Learning
Regularization
Pose
Action
Inference
PDF
2022-11-20
Revealing Robust Oil and Gas Company Macro-Strategies using Deep Multi-Agent Reinforcement Learning
Dylan Radovic, Lucas Kruitwagen, Christian Schroeder de Witt, Ben Caldecott, Shane Tomlinson, Mark Workman
arXiv_AI
arXiv_AI
Reinforcement_Learning
Adversarial
Pose
PDF
2022-11-20
Adversarial Cheap Talk
Chris Lu, Timon Willi, Alistair Letcher, Jakob Foerster
arXiv_AI
arXiv_AI
Reinforcement_Learning
Adversarial
Pose
Action
PDF
2022-11-20
Safe Reinforcement Learning using Data-Driven Predictive Control
Mahmoud Selim, Amr Alanwar, M. Watheq El-Kharashi, Hazem M. Abbas, Karl H. Johansson
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Action
PDF
2022-11-20
Real-time Local Feature with Global Visual Information Enhancement
Jinyu Miao, Haosong Yue, Zhong Liu, Xingming Wu, Zaojun Fang, Guilin Yang
arXiv_CV
arXiv_CV
Image_Caption
Enhancement
Reinforcement_Learning
Pose
Deep_Learning
CNN
Matching
PDF
2022-11-20
Efficient Representations of Object Geometry for Reinforcement Learning of Interactive Grasping Policies
Malte Mosbach, Sven Behnke
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-11-20
Learning to Search for Job Shop Scheduling via Deep Reinforcement Learning
Cong Zhang, Wen Song, Zhiguang Cao, Jie Zhang, Puay Siew Tan, Chi Xu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-11-20
SafeLight: A Reinforcement Learning Method toward Collision-free Traffic Signal Control
Wenlu Du, Junyi Ye, Jingyi Gu, Jing Li, Hua Wei, Guiling Wang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Knowledge
Pose
PDF
2022-11-20
Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation
Zhizhou Ren, Anji Liu, Yitao Liang, Jian Peng, Jianzhu Ma
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
Few-Shot
PDF
2022-11-20
Structure-Enhanced Deep Reinforcement Learning for Optimal Transmission Scheduling
Jiazheng Chen, Wanchun Liu, Daniel E. Quevedo, Yonghui Li, Branka Vucetic
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-11-19
PIC4rl-gym: a ROS2 modular framework for Robots Autonomous Navigation with Deep Reinforcement Learning
Mauro Martini, Andrea Eirale, Simone Cerrato, Marcello Chiaberge
arXiv_AI
arXiv_AI
Reinforcement_Learning
Autonomous
PDF
2022-11-19
ReInform: Selecting paths with reinforcement learning for contextualized link prediction
Marina Speranskaya, Sameh Methias, Benjamin Roth
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Pose
Prediction
PDF
2022-11-19
Evaluating the Perceived Safety of Urban City via Maximum Entropy Deep Inverse Reinforcement Learning
Yaxuan Wang, Zhixin Zeng, Qijun Zhao
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Quantitative
Prediction
PDF
2022-11-19
Prediction-aware and Reinforcement Learning based Altruistic Cooperative Driving
Rodolfo Valiente, Mahdi Razzaghpour, Behrad Toghi, Ghayoor Shah, Yaser P. Fallah
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
Knowledge
Pose
Action
Autonomous
Prediction
PDF
2022-11-19
Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
Clément Bonnet, Laurence Midgley, Alexandre Laterre
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-11-18
Provable Defense against Backdoor Policies in Reinforcement Learning
Shubham Kumar Bharti, Xuezhou Zhang, Adish Singla, Xiaojin Zhu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-11-18
Building a Subspace of Policies for Scalable Continual Learning
Jean-Baptiste Gaya, Thang Doan, Lucas Caccia, Laure Soulier, Ludovic Denoyer, Roberta Raileanu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Autonomous
PDF
2022-11-18
GoSum: Extractive Summarization of Long Documents by Reinforcement Learning and Graph Organized discourse state
Junyi Bian, Xiaodi Huang, Hong Zhou, Shanfeng Zhu
arXiv_CL
arXiv_CL
Reinforcement_Learning
Pose
GAN
Summarization
PDF
2022-11-18
Language-Conditioned Reinforcement Learning to Solve Misunderstandings with Action Corrections
Frank Röder, Manfred Eppe
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Action
PDF
2022-11-18
Pandering in a Flexible Representative Democracy
Xiaolin Sun, Jacob Masur, Ben Abramowitz, Nicholas Mattei, Zizhan Zheng
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Action
PDF
2022-11-17
AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process
Kevin Du, Ian Gemp, Yi Wu, Yingying Wu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Autonomous
PDF
2022-11-17
Introduction to Online Nonstochastic Control
Elad Hazan, Karan Singh
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
PDF
2022-11-17
Learning to Communicate with Intent: An Introduction
Miguel Angel Gutierrez-Estevez, Yiqun Wu, Chan Zhou
arXiv_AI
arXiv_AI
Reconstruction
Reinforcement_Learning
Pose
Action
Classification
PDF
2022-11-17
A Reinforcement Learning Approach for Process Parameter Optimization in Additive Manufacturing
Susheel Dharmadhikari, Nandana Menon, Amrita Basak
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-11-17
DexPoint: Generalizable Point Cloud Reinforcement Learning for Sim-to-Real Dexterous Manipulation
Yuzhe Qin, Binghao Huang, Zhao-Heng Yin, Hao Su, Xiaolong Wang
arXiv_CV
arXiv_CV
Point_Cloud
Reinforcement_Learning
Knowledge
Pose
PDF
2022-11-17
Planning Irregular Object Packing via Hierarchical Reinforcement Learning
Sichao Huang, Ziwei Wang, Jie Zhou, Jiwen Lu
arXiv_CV
arXiv_CV
Reinforcement_Learning
Self-Supervised
Pose
Autonomous
PDF
2022-11-16
The Surprising Effectiveness of Equivariant Models in Domains with Latent Symmetry
Dian Wang, Jung Yeon Park, Neel Sortur, Lawson L.S. Wong, Robin Walters, Robert Platt
arXiv_RO
arXiv_RO
Reinforcement_Learning
PDF
2022-11-16
AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders
Wele Gedara Chaminda Bandara, Naman Patel, Ali Gholami, Mehdi Nikkhah, Motilal Agrawal, Vishal M. Patel
arXiv_AI
arXiv_AI
Transformer
Reconstruction
Reinforcement_Learning
Pose
Action
Classification
PDF
2022-11-16
Learning Reward Functions for Robotic Manipulation by Observing Humans
Minttu Alakuijala, Gabriel Dulac-Arnold, Julien Mairal, Jean Ponce, Cordelia Schmid
arXiv_RO
arXiv_RO
Embedding
Reinforcement_Learning
Sparse
Pose
Action
PDF
2022-11-16
Giving Feedback on Interactive Student Programs with Meta-Exploration
Evan Zheran Liu, Moritz Stephan, Allen Nie, Chris Piech, Emma Brunskill, Chelsea Finn
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-16
Model Based Residual Policy Learning with Applications to Antenna Control
Viktor Eriksson Möllerstedt, Alessio Russo, Maxime Bouton
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-16
Reward Gaming in Conditional Text Generation
Richard Yuanzhe Pang, Vishakh Padmakumar, Thibault Sellam, Ankur P. Parikh, He He
arXiv_AI
arXiv_AI
Reinforcement_Learning
Relation
Text_Generation
PDF
2022-11-16
LEMMA: Bootstrapping High-Level Mathematical Reasoning with Learned Symbolic Abstractions
Zhening Li, Gabriel Poesia, Omar Costilla-Reyes, Noah Goodman, Armando Solar-Lezama
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-11-16
Addressing the issue of stochastic environments and local decision-making in multi-objective reinforcement learning
Kewen Ding
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
PDF
2022-11-15
APT: Adaptive Perceptual quality based camera Tuning using reinforcement learning
Sibendu Paul, Kunal Rao, Giuseppe Coviello, Murugan Sankaradas, Oliver Po, Y. Charlie Hu, Srimat Chakradhar
arXiv_CV
arXiv_CV
Reinforcement_Learning
Pose
Detection
Object_Detection
PDF
2022-11-15
Universal Distributional Decision-based Black-box Adversarial Attack with Reinforcement Learning
Yiran Huang, Yexu Zhou, Michael Hefenbrock, Till Riedel, Likun Fang, Michael Beigl
arXiv_AI
arXiv_AI
Reinforcement_Learning
Adversarial
Pose
PDF
2022-11-15
Offline Reinforcement Learning with Adaptive Behavior Regularization
Yunfan Zhou, Xijun Li, Qingyu Qu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Regularization
Pose
Action
PDF
2022-11-15
Physics-Informed Machine Learning: A Survey on Problems, Methods and Applications
Zhongkai Hao, Songming Liu, Yichi Zhang, Chengyang Ying, Yao Feng, Hang Su, Jun Zhu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Review
Pose
Survey
Inference
PDF
2022-11-15
Contextual Transformer for Offline Meta Reinforcement Learning
Runji Lin, Ye Li, Xidong Feng, Zhaowei Zhang, Xian Hong Wu Fung, Haifeng Zhang, Jun Wang, Yali Du, Yaodong Yang
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Self-Supervised
Pose
Action
PDF
2022-11-15
Coordination for Connected and Automated Vehicles at Non-signalized Intersections: A Value Decomposition-based Multiagent Deep Reinforcement Learning Approach
Zihan Guo, Yan Wu, Lifang Wang, Junzhi Zhang
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
PDF
2022-11-15
Automatic Evaluation of Excavator Operators using Learned Reward Functions
Pranav Agarwal, Marek Teichmann, Sheldon Andrews, Samira Ebrahimi Kahou
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Prediction
PDF
2022-11-15
Explainable Action Advising for Multi-Agent Reinforcement Learning
Yue Guo, Joseph Campbell, Simon Stepputtis, Ruiyu Li, Dana Hughes, Fei Fang, Katia Sycara
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Action
PDF
2022-11-15
Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach
Siddhant Bhambri, Amrita Bhattacharjee, Dimitri Bertsekas
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-15
General Intelligence Requires Rethinking Exploration
Minqi Jiang, Tim Rocktäschel, Edward Grefenstette
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Pose
PDF
2022-11-15
Agent-State Construction with Auxiliary Inputs
Ruo Yu Tao, Adam White, Marlos C. Machado
arXiv_AI
arXiv_AI
Reinforcement_Learning
RNN
Action
Summarization
PDF
2022-11-14
Legged Locomotion in Challenging Terrains using Egocentric Vision
Ananye Agarwal, Ashish Kumar, Jitendra Malik, Deepak Pathak
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Face
PDF
2022-11-14
Redeeming Intrinsic Rewards via Constrained Optimization
Eric Chen, Zhang-Wei Hong, Joni Pajarinen, Pulkit Agrawal
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-11-14
Interactively Learning to Summarise Timelines by Reinforcement Learning
Yuxuan Ye, Edwin Simpson
arXiv_CL
arXiv_CL
Reinforcement_Learning
Pose
Action
PDF
2022-11-14
NeurIPS 2022 Competition: Driving SMARTS
Amir Rasouli, Randy Goebel, Matthew E. Taylor, Iuliia Kotseruba, Soheil Alizadeh, Tianpei Yang, Montgomery Alban, Florian Shkurti, Yuzheng Zhuang, Adam Scibior, Kasra Rezaee, Animesh Garg, David Meger, Jun Luo, Liam Paull, Weinan Zhang, Xinyu Wang, Xi Chen
arXiv_CV
arXiv_CV
Reinforcement_Learning
Pose
Action
Autonomous
PDF
2022-11-14
Parallel Automatic History Matching Algorithm Using Reinforcement Learning
Omar S. Alolayan, Abdullah O. Alomar, John R. Williams
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Matching
PDF
2022-11-14
Dynamic Collaborative Multi-Agent Reinforcement Learning Communication for Autonomous Drone Reforestation
Philipp Dominic Siedler
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Drone
Autonomous
PDF
2022-11-13
Goal-Conditioned Reinforcement Learning in the Presence of an Adversary
Carlos Purves, Pietro Liò, Cătălina Cangea
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-11-13
Learning Heterogeneous Agent Cooperation via Multiagent League Training
Qingxu Fu, Xiaolin Ai, Jianqiang Yi, Tenghai Qiu, Wanmai Yuan, Zhiqiang Pu
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-11-13
Adversarial and Random Transformations for Robust Domain Adaptation and Generalization
Liang Xiao, Jiaolong Xu, Dawei Zhao, Erke Shang, Qi Zhu, Bin Dai
arXiv_CV
arXiv_CV
Transformer
Reinforcement_Learning
Adversarial
Pose
PDF
2022-11-12
A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges
Yunpeng Qing, Shunyu Liu, Jie Song, Mingli Song
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Review
Pose
Survey
Deep_Learning
Summarization
PDF
2022-11-12
CACTO: Continuous Actor-Critic with Trajectory Optimization -- Towards global optimality
Gianluigi Grandesso, Gastone P. Rosati Papini, Patrick M. Wensing, Andrea Del Prete
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
PDF
2022-11-12
Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning
Katherine Metcalf, Miguel Sarabia, Barry-John Theobald
arXiv_AI
arXiv_AI
Reinforcement_Learning
Self-Supervised
Pose
Action
PDF
2022-11-11
Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling
Daniel Lawson, Ahmed H. Qureshi
arXiv_RO
arXiv_RO
Transformer
Reinforcement_Learning
Zero-Shot
Pose
PDF
2022-11-11
Global and Local Analysis of Interestingness for Competency-Aware Deep Reinforcement Learning
Pedro Sequeira, Jesse Hostetler, Melinda Gervasio
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Deep_Learning
PDF
2022-11-11
Controlling Commercial Cooling Systems Using Reinforcement Learning
Jerry Luo, Cosmin Paduraru, Octavian Voicu, Yuri Chervonyi, Scott Munns, Jerry Li, Crystal Qian, Praneet Dutta, Jared Quincy Davis, Ningjia Wu, Xingwei Yang, Chu-Ming Chang, Ted Li, Rob Rose, Mingyan Fan, Hootan Nakhost, Tinglin Liu, Brian Kirkman, Frank Altamura, Lee Cline, Patrick Tonker, Joel Gouker, Dave Uden, Warren Buddy Bryan, Jason Law, Deeni Fatiha, Neil Satra, Juliet Rothenberg, Molly Carlin, Satish Tallapaka, Sims Witherspoon, David Parish, Peter Dolan, Chenyu Zhao, Daniel J. Mankowitz
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
GAN
PDF
2022-11-11
Emergency action termination for immediate reaction in hierarchical reinforcement learning
Michał Bortkiewicz, Jakub Łyskawa, Paweł Wawrzyński, Mateusz Ostaszewski, Artur Grudkowski, Tomasz Trzciński
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Activity
PDF
2022-11-11
Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization
Burcu Küçükoğlu, Walraaf Borkent, Bodo Rueckauer, Nasir Ahmad, Umut Güçlü, Marcel van Gerven
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
RNN
PDF
2022-11-11
Fleet Rebalancing for Expanding Shared e-Mobility Systems: A Multi-agent Deep Reinforcement Learning Approach
Man Luo, Bowen Du, Wenzhe Zhang, Tianyou Song, Kun Li, Hongming Zhu, Mark Birkin, Hongkai Wen
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Knowledge
Pose
Action
PDF
2022-11-11
Gradient Imitation Reinforcement Learning for General Low-Resource Information Extraction
Xuming Hu, Shiao Meng, Chenwei Zhang, Xiangli Yang, Lijie Wen, Irwin King, Philip S. Yu
arXiv_CL
arXiv_CL
Unsupervised
Gradient_Descent
Recognition
Reinforcement_Learning
Optimization
Regularization
Knowledge
Action
Relation
Few-Shot
Relation_Extraction
PDF
2022-11-11
Efficient Domain Coverage for Vehicles with Second Order Dynamics via Multi-Agent Reinforcement Learning
Xinyu Zhao, Razvan C. Fetecau, Mo Chen
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
Pose
Attention
Autonomous
PDF
2022-11-11
Deep Reinforcement Learning Microgrid Optimization Strategy Considering Priority Flexible Demand Side
Jinsong Sang, Hongbin Sun, Lei Kou
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
Face
Relation
PDF
2022-11-11
pyRDDLGym: From RDDL to Gym Environments
Ayal Taitler, Michael Gimelfarb, Sriram Gopalakrishnan, Martin Mladenov, Xiaotian Liu, Scott Sanner
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Action
PDF
2022-11-10
Robust N-1 secure HV Grid Flexibility Estimation for TSO-DSO coordinated Congestion Management with Deep Reinforcement Learning
Zhenqi Wang, Sebastian Wende-von Berg, Martin Braun
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
Face
PDF
2022-11-10
Reinforcement Learning in an Adaptable Chess Environment for Detecting Human-understandable Concepts
Patrik Hammersborg, Inga Strümke
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
Autonomous
PDF
2022-11-10
RARE: Renewable Energy Aware Resource Management in Datacenters
Vanamala Venkataswamy, Jake Grigsby, Andrew Grimshaw, Yanjun Qi
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-09
Vision-based navigation and obstacle avoidance via deep reinforcement learning
Paul Blum, Peter Crowley, George Lykotrafitis
arXiv_RO
arXiv_RO
Reinforcement_Learning
Knowledge
PDF
2022-11-09
RL-DWA Omnidirectional Motion Planning for Person Following in Domestic Assistance and Monitoring
Andrea Eirale, Mauro Martini, Marcello Chiaberge
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-09
Interpretable Deep Reinforcement Learning for Green Security Games with Real-Time Information
Vishnu Dutt Sharma, John P. Dickerson, Pratap Tokekar
arXiv_AI
arXiv_AI
Reinforcement_Learning
Prediction
PDF
2022-11-09
Leveraging Offline Data in Online Reinforcement Learning
Andrew Wagenmaker, Aldo Pacchiano
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Action
PDF
2022-11-09
Foundation Models for Semantic Novelty in Reinforcement Learning
Tarun Gupta, Peter Karkus, Tong Che, Danfei Xu, Marco Pavone
arXiv_AI
arXiv_AI
Embedding
Reinforcement_Learning
Sparse
Knowledge
PDF
2022-11-09
Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration
Alexandre Chenu, Olivier Serris, Olivier Sigaud, Nicolas Perrin-Gilbert
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Autonomous
PDF
2022-11-08
Learning to Follow Instructions in Text-Based Games
Mathieu Tuli, Andrew C. Li, Pashootan Vaezipoor, Toryn Q. Klassen, Scott Sanner, Sheila A. McIlraith
arXiv_CL
arXiv_CL
Reinforcement_Learning
Action
PDF
2022-11-08
ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies with Offline Data
Tengyang Xie, Mohak Bhardwaj, Nan Jiang, Ching-An Cheng
arXiv_AI
arXiv_AI
Reinforcement_Learning
Adversarial
Pose
PDF
2022-11-08
Active Example Selection for In-Context Learning
Yiming Zhang, Shi Feng, Chenhao Tan
arXiv_CL
arXiv_CL
Reinforcement_Learning
Pose
Language_Model
PDF
2022-11-08
Reinforcement Learning with Stepwise Fairness Constraints
Zhun Deng, He Sun, Zhiwei Steven Wu, Linjun Zhang, David C. Parkes
arXiv_AI
arXiv_AI
Reinforcement_Learning
PDF
2022-11-08
Doubly Inhomogeneous Reinforcement Learning
Liyuan Hu, Mengbing Li, Chengchun Shi, Zhenke Wu, Piotr Fryzlewicz
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Detection
PDF
2022-11-08
Pretraining in Deep Reinforcement Learning: A Survey
Zhihui Xie, Zichuan Lin, Junyou Li, Shuai Li, Deheng Ye
arXiv_AI
arXiv_AI
Reinforcement_Learning
Knowledge
Review
Face
Survey
Deep_Learning
Attention
PDF
2022-11-07
A Transfer Learning Approach for UAV Path Design with Connectivity Outage Constraint
Gianluca Fontanesi, Anding Zhu, Mahnaz Arvaneh, Hamed Ahmadi
arXiv_RO
arXiv_RO
Transfer_Learning
Reinforcement_Learning
Pose
Action
Autonomous
PDF
2022-11-07
Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification
Takumi Tanabe, Rei Sato, Kazuto Fukuchi, Jun Sakuma, Youhei Akimoto
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
PDF
2022-11-06
ProtoX: Explaining a Reinforcement Learning Agent via Prototyping
Ronilo J. Ragodos, Tong Wang, Qihang Lin, Xun Zhou
arXiv_CV
arXiv_CV
Reinforcement_Learning
Self-Supervised
Pose
Contrastive_Learning
Action
PDF
2022-11-06
Wall Street Tree Search: Risk-Aware Planning for Offline Reinforcement Learning
Dan Elbaz, Gal Novik, Oren Salzman
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Action
PDF
2022-11-06
Graph Reinforcement Learning Application to Co-operative Decision-Making in Mixed Autonomy Traffic: Framework, Survey, and Challenges
Qi Liu, Xueyuan Li, Zirui Li, Jingda Wu, Guodong Du, Xin Gao, Fan Yang, Shihua Yuan
arXiv_RO
arXiv_RO
Reinforcement_Learning
Review
Pose
Survey
Autonomous
PDF
2022-11-04
Evaluating and Improving Factuality in Multimodal Abstractive Summarization
David Wan, Mohit Bansal
arXiv_CV
arXiv_CV
Reinforcement_Learning
Bert
Zero-Shot
Pose
Detection
Relation
Summarization
PDF
2022-11-04
De novo PROTAC design using graph-based deep generative models
Divya Nori, Connor W. Coley, Rocío Mercado
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Sparse
Pose
Activity
PDF
2022-11-04
Robotic Assembly Control Reconfiguration Based on Transfer Reinforcement Learning for Objects with Different Geometric Features
Yuhang Gai, Bing Wang, Jiwen Zhang, Dan Wu, Ken Chen
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
PDF
2022-11-04
Emergent Quantized Communication
Boaz Carmeli, Ron Meir, Yonatan Belinkov
arXiv_AI
arXiv_AI
Quantization
Reinforcement_Learning
Pose
Deep_Learning
PDF
2022-11-04
Path Planning Using Wassertein Distributionally Robust Deep Q-learning
Cem Alpturk, Venkatraman Renganathan
arXiv_RO
arXiv_RO
Reinforcement_Learning
Optimization
Pose
Action
PDF
2022-11-04
Mixline: A Hybrid Reinforcement Learning Framework for Long-horizon Bimanual Coffee Stirring Task
Zheng Sun, Zhiqi Wang, Junjia Liu, Miao Li, Fei Chen
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
PDF
2022-11-04
Residual Skill Policies: Learning an Adaptable Skill-based Action Space for Reinforcement Learning for Robotics
Krishan Rana, Ming Xu, Brendan Tidd, Michael Milford, Niko Sünderhauf
arXiv_RO
arXiv_RO
Reinforcement_Learning
Knowledge
Pose
Action
PDF
2022-11-04
Benchmarking Quality-Diversity Algorithms on Neuroevolution for Reinforcement Learning
Manon Flageat, Bryan Lim, Luca Grillotti, Maxime Allard, Simón C. Smith, Antoine Cully
arXiv_RO
arXiv_RO
Reinforcement_Learning
Relation
PDF
2022-11-03
Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation
Siddharth Nayak, Kenneth Choi, Wenqi Ding, Sydney Dolan, Karthik Gopalakrishnan, Hamsa Balakrishnan
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
PDF
2022-11-03
Contrastive Value Learning: Implicit Models for Simple Offline RL
Bogdan Mazoure, Benjamin Eysenbach, Ofir Nachum, Jonathan Tompson
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
PDF
2022-11-03
Oracle Inequalities for Model Selection in Offline Reinforcement Learning
Jonathan N. Lee, George Tucker, Ofir Nachum, Bo Dai, Emma Brunskill
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-11-03
lilGym: Natural Language Visual Reasoning with Reinforcement Learning
Anne Wu, Kianté Brantley, Noriyuki Kojima, Yoav Artzi
arXiv_CL
arXiv_CL
Reinforcement_Learning
PDF
2022-11-03
Leveraging Fully Observable Policies for Learning under Partial Observability
Hai Nguyen, Andrea Baisero, Dian Wang, Christopher Amato, Robert Platt
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Action
PDF
2022-11-03
A Posterior Sampling Framework for Interactive Decision Making
Han Zhong, Wei Xiong, Sirui Zheng, Liwei Wang, Zhaoran Wang, Zhuoran Yang, Tong Zhang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-11-03
Theta-Resonance: A Single-Step Reinforcement Learning Method for Design Space Exploration
Masood S. Mortazavi, Tiancheng Qin, Ning Yan
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-11-03
Sensor Control for Information Gain in Dynamic, Sparse and Partially Observed Environments
J. Brian Burns, Aravind Sundaresan, Pedro Sequeira, Vidyasagar Sadhu
arXiv_AI
arXiv_AI
Enhancement
Reinforcement_Learning
Sparse
Pose
Autonomous
Prediction
PDF
2022-11-02
Learning to Grasp the Ungraspable with Emergent Extrinsic Dexterity
Wenxuan Zhou, David Held
arXiv_RO
arXiv_RO
Reinforcement_Learning
Zero-Shot
Pose
Face
PDF
2022-11-02
Deep Reinforcement Learning for IRS Phase Shift Design in Spatiotemporally Correlated Environments
Spilios Evmorfos, Athina P. Petropulu, H. Vincent Poor
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Face
Relation
PDF
2022-11-02
Multi-vehicle Conflict Resolution in Highly Constrained Spaces by Merging Optimal Control and Reinforcement Learning
Xu Shen, Francesco Borrelli
arXiv_RO
arXiv_RO
Reinforcement_Learning
Action
PDF
2022-11-02
Over-communicate no more: Situated RL agents learn concise communication protocols
Aleksandra Kalinowska, Elnaz Davoodi, Florian Strub, Kory W Mathewson, Ivana Kajic, Michael Bowling, Todd D Murphey, Patrick M Pilarski
arXiv_CL
arXiv_CL
Reinforcement_Learning
Pose
Action
PDF
2022-11-02
Dual Generator Offline Reinforcement Learning
Quan Vuong, Aviral Kumar, Sergey Levine, Yevgen Chebotar
arXiv_AI
arXiv_AI
Reinforcement_Learning
Adversarial
Action
GAN
PDF
2022-11-02
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Peng Zhang, Yawen Huang, Bingzhang Hu, Shizheng Wang, Haoran Duan, Noura Al Moubayed, Yefeng Zheng, Yang Long
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Attention
PDF
2022-11-02
Causal Counterfactuals for Improving the Robustness of Reinforcement Learning
Tom He, Jasmina Gajcin, Ivana Dusparic
arXiv_RO
arXiv_RO
Reinforcement_Learning
Pose
Autonomous
Inference
PDF
2022-11-02
DynamicLight: Dynamically Tuning Traffic Signal Duration with DRL
Liang Zhang, Qiang Wu, Jun Shen, Linyuan Lü, Bo Du, Akbar Telikani, Jianqing Wu, Shubin Xie
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
PDF
2022-11-02
Spatial-temporal recurrent reinforcement learning for autonomous ships
Martin Waltz, Ostap Okhrin
arXiv_RO
arXiv_RO
Reinforcement_Learning