Improving adaptability to new environments and removing catastrophic forgetting in Reinforcement Learning by using an eco-system of agents

2022-04-13 17:52:54

Olivier Moulin, Vincent Francois-Lavet, Paul Elbers, Mark Hoogendoorn

arXiv_AI

Abstract
Abstract (translated)
URL
PDF

Abstract

Adapting a Reinforcement Learning (RL) agent to an unseen environment is a difficult task due to typical over-fitting on the training environment. RL agents are often capable of solving environments very close to the trained environment, but when environments become substantially different, their performance quickly drops. When agents are retrained on new environments, a second issue arises: there is a risk of catastrophic forgetting, where the performance on previously seen environments is seriously hampered. This paper proposes a novel approach that exploits an ecosystem of agents to address both concerns. Hereby, the (limited) adaptive power of individual agents is harvested to build a highly adaptive ecosystem. This allows to transfer part of the workload from learning to inference. An evaluation of the approach on two distinct distributions of environments shows that our approach outperforms state-of-the-art techniques in terms of adaptability/generalization as well as avoids catastrophic forgetting.

Abstract (translated)

URL

https://arxiv.org/abs/2204.06550

PDF

https://arxiv.org/pdf/2204.06550.pdf