Abstract
For an autonomous vehicle to plan a path in its environment, it must be able to accurately forecast the trajectory of all dynamic objects in its proximity. While many traditional methods encode observations in the scene to solve this problem, there are few approaches that consider the effect of the ego vehicle's behavior on the future state of the world. In this paper, we introduce VRD, a vectorized world model-inspired approach to the multi-agent motion forecasting problem. Our method combines a traditional open-loop training regime with a novel dreamed closed-loop training pipeline that leverages a kinematic reconstruction task to imagine the trajectory of all agents, conditioned on the action of the ego vehicle. Quantitative and qualitative experiments are conducted on the Argoverse 2 multi-world forecasting evaluation dataset and the intersection drone (inD) dataset to demonstrate the performance of our proposed model. Our model achieves state-of-the-art performance on the single prediction miss rate metric on the Argoverse 2 dataset and performs on par with the leading models for the single prediction displacement metrics.
Abstract (translated)
为使自动驾驶汽车规划其周围环境中的路径,它必须能够准确预测其附近所有动态物体的轨迹。虽然许多传统方法将观测结果编码到场景中以解决这个问题,但很少有方法考虑自车行为对世界未来状态的影响。在本文中,我们介绍了VRD,一种基于多代理物运动预测的多代理物建模方法。我们的方法结合了传统的开环训练模式和一种新颖的梦境闭环训练管道,利用运动重建任务想象所有代理物的轨迹,条件自车行动。我们对ArgoVeRse 2多世界预测评估数据集和 intersection Drone (inD) 数据集进行了定量实验和定性实验,以评估我们提出的模型的性能。我们的模型在ArgoVeRse数据集上的单个预测失真率指标上实现了最先进的性能,与领先模型的单个预测位移指标相当。
URL
https://arxiv.org/abs/2406.14415