Paper Reading AI Learner

Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning

2024-05-04 20:59:06
Weiye Zhao, Tairan He, Feihan Li, Changliu Liu

Abstract

Deep reinforcement learning (DRL) has demonstrated remarkable performance in many continuous control tasks. However, a significant obstacle to the real-world application of DRL is the lack of safety guarantees. Although DRL agents can satisfy system safety in expectation through reward shaping, designing agents to consistently meet hard constraints (e.g., safety specifications) at every time step remains a formidable challenge. In contrast, existing work in the field of safe control provides guarantees on persistent satisfaction of hard safety constraints. However, these methods require explicit analytical system dynamics models to synthesize safe control, which are typically inaccessible in DRL settings. In this paper, we present a model-free safe control algorithm, the implicit safe set algorithm, for synthesizing safeguards for DRL agents that ensure provable safety throughout training. The proposed algorithm synthesizes a safety index (barrier certificate) and a subsequent safe control law solely by querying a black-box dynamic function (e.g., a digital twin simulator). Moreover, we theoretically prove that the implicit safe set algorithm guarantees finite time convergence to the safe set and forward invariance for both continuous-time and discrete-time systems. We validate the proposed algorithm on the state-of-the-art Safety Gym benchmark, where it achieves zero safety violations while gaining $95\% \pm 9\%$ cumulative reward compared to state-of-the-art safe DRL methods. Furthermore, the resulting algorithm scales well to high-dimensional systems with parallel computing.

Abstract (translated)

深度强化学习(DRL)在许多连续控制任务中表现出显著的性能。然而,DRL在现实世界应用中的一大障碍是缺乏安全性保证。尽管DRL代理可以通过奖励塑造满足系统安全,但设计一个在每一步都确保达到严格约束的安全代理仍然具有挑战性。相比之下,该领域现有的安全控制方法提供了对严格安全约束的持续满足的保证。然而,这些方法需要显式地分析系统动态模型来合成安全控制,这在DRL环境中通常是不可访问的。在本文中,我们提出了一个模型-无关的安全控制算法,称为隐式安全集算法,用于为DRL代理合成训练过程中的安全保障。所提出的算法通过查询黑盒动态函数(例如数字双胞胎模拟器)生成安全指数和安全控制律。此外,我们理论证明,隐式安全集算法保证连续时间和离散时间系统的有限时间收敛和前馈不变性。我们在最新的Safety Gym基准上验证了所提出的算法,该算法在实现零安全违规的同时,与最先进的 safe DRL 方法相比获得了95% ± 9%的累积奖励。此外,该算法具有良好的扩展性,可应用于高维系统,通过并行计算实现。

URL

https://arxiv.org/abs/2405.02754

PDF

https://arxiv.org/pdf/2405.02754.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot