Multi-Agent Path Finding via Tree LSTM

2022-10-24 03:22:20

Yuhao Jiang, Kunjie Zhang, Qimai Li, Jiaxin Chen, Xiaolong Zhu

arXiv_AI

arXiv_AI RNN Reinforcement_Learning Attention Pose Agent

Abstract
Abstract (translated)
URL
PDF

Abstract

In recent years, Multi-Agent Path Finding (MAPF) has attracted attention from the fields of both Operations Research (OR) and Reinforcement Learning (RL). However, in the 2021 Flatland3 Challenge, a competition on MAPF, the best RL method scored only 27.9, far less than the best OR method. This paper proposes a new RL solution to Flatland3 Challenge, which scores 125.3, several times higher than the best RL solution before. We creatively apply a novel network architecture, TreeLSTM, to MAPF in our solution. Together with several other RL techniques, including reward shaping, multiple-phase training, and centralized control, our solution is comparable to the top 2-3 OR methods.

Abstract (translated)

URL

https://arxiv.org/abs/2210.12933

PDF

https://arxiv.org/pdf/2210.12933.pdf