Paper Reading AI Learner

FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation

2024-04-19 05:24:24
Tianfu Wang, Qilin Fan, Chao Wang, Long Yang, Leilei Ding, Nicholas Jing Yuan, Hui Xiong

Abstract

Virtual network embedding (VNE) is an essential resource allocation task in network virtualization, aiming to map virtual network requests (VNRs) onto physical infrastructure. Reinforcement learning (RL) has recently emerged as a promising solution to this problem. However, existing RL-based VNE methods are limited by the unidirectional action design and one-size-fits-all training strategy, resulting in restricted searchability and generalizability. In this paper, we propose a FLexible And Generalizable RL framework for VNE, named FlagVNE. Specifically, we design a bidirectional action-based Markov decision process model that enables the joint selection of virtual and physical nodes, thus improving the exploration flexibility of solution space. To tackle the expansive and dynamic action space, we design a hierarchical decoder to generate adaptive action probability distributions and ensure high training efficiency. Furthermore, to overcome the generalization issue for varying VNR sizes, we propose a meta-RL-based training method with a curriculum scheduling strategy, facilitating specialized policy training for each VNR size. Finally, extensive experimental results show the effectiveness of FlagVNE across multiple key metrics. Our code is available at GitHub (this https URL).

Abstract (translated)

虚拟网络嵌入(VNE)是网络虚拟化中一个关键的资源分配任务,旨在将虚拟网络请求(VNRs)映射到物理基础设施。最近,强化学习(RL)已成为解决这个问题的一种有前景的解决方案。然而,现有的基于RL的VNE方法受到单向动作设计和高薪策略的限制,导致搜索性和泛化能力受限。在本文中,我们提出了一个灵活且可扩展的RL框架,名为FlagVNE。具体来说,我们设计了一个双向动作基于马尔可夫决策过程模型,使得虚拟和物理节点能够联合选择,从而提高解决方案空间的探索灵活性。为了应对广泛和动态的动作空间,我们设计了一个分层的解码器,用于生成自适应的动作概率分布,并确保高训练效率。此外,为了克服不同VNR大小下的泛化问题,我们提出了一种基于元RL的培训方法,通过课程计划策略实现针对每个VNR大小的专用策略训练。最后,大量实验结果表明,FlagVNE在多个关键指标上具有有效性。我们的代码可于GitHub上获取(此https链接)。

URL

https://arxiv.org/abs/2404.12633

PDF

https://arxiv.org/pdf/2404.12633.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot