Planning with Abstract Learned Models While Learning Transferable Subtasks

2019-12-16 17:47:57

John Winder, Stephanie Milani, Matthew Landen, Erebus Oh, Shane Parr, Shawn Squire, Marie desJardins, Cynthia Matuszek

arXiv_AI

arXiv_AI Reinforcement_Learning Action

Abstract
Abstract (translated)
URL
PDF

Abstract

We introduce an algorithm for model-based hierarchical reinforcement learning to acquire self-contained transition and reward models suitable for probabilistic planning at multiple levels of abstraction. We call this framework Planning with Abstract Learned Models (PALM). By representing subtasks symbolically using a new formal structure, the lifted abstract Markov decision process (L-AMDP), PALM learns models that are independent and modular. Through our experiments, we show how PALM integrates planning and execution, facilitating a rapid and efficient learning of abstract, hierarchical models. We also demonstrate the increased potential for learned models to be transferred to new and related tasks.

Abstract (translated)

URL

https://arxiv.org/abs/1912.07544

PDF

https://arxiv.org/pdf/1912.07544.pdf