Paper Reading AI Learner

CGD: Constraint-Guided Diffusion Policies for UAV Trajectory Planning

2024-05-02 21:50:26
Kota Kondo, Andrea Tagliabue, Xiaoyi Cai, Claudius Tewari, Olivia Garcia, Marcos Espitia-Alvarez, Jonathan P. How

Abstract

Traditional optimization-based planners, while effective, suffer from high computational costs, resulting in slow trajectory generation. A successful strategy to reduce computation time involves using Imitation Learning (IL) to develop fast neural network (NN) policies from those planners, which are treated as expert demonstrators. Although the resulting NN policies are effective at quickly generating trajectories similar to those from the expert, (1) their output does not explicitly account for dynamic feasibility, and (2) the policies do not accommodate changes in the constraints different from those used during training. To overcome these limitations, we propose Constraint-Guided Diffusion (CGD), a novel IL-based approach to trajectory planning. CGD leverages a hybrid learning/online optimization scheme that combines diffusion policies with a surrogate efficient optimization problem, enabling the generation of collision-free, dynamically feasible trajectories. The key ideas of CGD include dividing the original challenging optimization problem solved by the expert into two more manageable sub-problems: (a) efficiently finding collision-free paths, and (b) determining a dynamically-feasible time-parametrization for those paths to obtain a trajectory. Compared to conventional neural network architectures, we demonstrate through numerical evaluations significant improvements in performance and dynamic feasibility under scenarios with new constraints never encountered during training.

Abstract (translated)

传统的优化规划器虽然在效果上很有效,但计算成本很高,导致轨迹生成速度较慢。成功减少计算时间的方法之一是使用模仿学习(IL)从这些规划器中开发快速神经网络(NN)策略,将它们视为专家演示者。尽管生成的NN策略在快速生成类似于专家轨迹方面非常有效,但(1)它们的输出没有明确考虑到动态可行性,(2)这些策略没有考虑到训练过程中约束的变化。为了克服这些限制,我们提出了约束引导扩散(CGD),一种新型的IL-基轨迹规划方法。CGD利用了一种结合扩散策略和代理高效优化问题的混合学习/在线优化方案,使得可以生成无碰撞、动态可行轨迹。CGD的关键思想包括将专家通过 IL 解决的原始具有挑战性的优化问题划分为两个更容易管理子问题:(a)高效地找到无碰撞路径,(b)为这些路径确定一个动态可行的时间参数,以获得轨迹。与传统的神经网络架构相比,我们通过数值评估展示了在训练过程中从未遇到过的新的约束条件下,性能和动态可行性都有显著的提高。

URL

https://arxiv.org/abs/2405.01758

PDF

https://arxiv.org/pdf/2405.01758.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot