Context-aware LLM-based Safe Control Against Latent Risks

2024-03-18 15:17:15
Quan Khanh Luu, Xiyu Deng, Anh Van Ho, Yorie Nakahira

Abstract

It is challenging for autonomous control systems to perform complex tasks in the presence of latent risks. Motivated by this challenge, this paper proposes an integrated framework that involves Large Language Models (LLMs), stochastic gradient descent (SGD), and optimization-based control. In the first phase, the proposed framework breaks down complex tasks into a sequence of smaller subtasks, whose specifications account for contextual information and latent risks. In the second phase, these subtasks and their parameters are refined through a dual process involving LLMs and SGD. LLMs are used to generate rough guesses and failure explanations, and SGD is used to fine-tune parameters. The proposed framework is tested using simulated case studies of robots and vehicles. The experiments demonstrate that the proposed framework can mediate actions based on the context and latent risks and learn complex behaviors efficiently.
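The division of labor described in the second phase (a rough guess from an LLM, then fine-tuning by SGD) can be illustrated with a toy sketch. This is not the authors' implementation: `llm_rough_guess` is a hypothetical stub standing in for a real language-model call, and the quadratic cost, the `target` value, the learning rate, and the noise level are all assumptions chosen only to make the example runnable.

```python
import random


def llm_rough_guess(task: str) -> float:
    """Hypothetical stand-in for an LLM call that proposes a coarse parameter.

    Assumption: a real system would query a language model with the task
    description; here a fixed value is returned so the sketch is runnable.
    """
    return 1.0


def noisy_gradient(theta: float, target: float, rng: random.Random,
                   noise: float = 0.1) -> float:
    """Noisy gradient of the assumed cost (theta - target)**2."""
    return 2.0 * (theta - target) + rng.gauss(0.0, noise)


def sgd_refine(theta0: float, target: float, steps: int = 200,
               lr: float = 0.05, seed: int = 0) -> float:
    """Fine-tune a single parameter with plain stochastic gradient descent."""
    rng = random.Random(seed)
    theta = theta0
    for _ in range(steps):
        theta -= lr * noisy_gradient(theta, target, rng)
    return theta


if __name__ == "__main__":
    # Rough guess first, then SGD refinement toward the assumed optimum.
    rough = llm_rough_guess("keep a safe distance from the obstacle")
    refined = sgd_refine(rough, target=2.0)
    print(f"rough guess: {rough:.3f}, refined: {refined:.3f}")
```

In this sketch the rough guess only sets the starting point, and the gradient steps do the fine adjustment. The failure explanations that the abstract also attributes to the LLM are not modeled here.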

URL

https://arxiv.org/abs/2403.11863

PDF

https://arxiv.org/pdf/2403.11863.pdf

