Paper Reading AI Learner

Decentralized Personalized Federated Learning based on a Conditional Sparse-to-Sparser Scheme

2024-04-24 16:03:34
Qianyu Long, Qiyuan Wang, Christos Anagnostopoulos, Daning Bi

Abstract

Decentralized Federated Learning (DFL) has become popular due to its robustness and avoidance of centralized coordination. In this paradigm, clients actively engage in training by exchanging models with their networked neighbors. However, DFL introduces increased costs in terms of training and communication. Existing methods focus on minimizing communication often overlooking training efficiency and data heterogeneity. To address this gap, we propose a novel \textit{sparse-to-sparser} training scheme: DA-DPFL. DA-DPFL initializes with a subset of model parameters, which progressively reduces during training via \textit{dynamic aggregation} and leads to substantial energy savings while retaining adequate information during critical learning periods. Our experiments showcase that DA-DPFL substantially outperforms DFL baselines in test accuracy, while achieving up to $5$ times reduction in energy costs. We provide a theoretical analysis of DA-DPFL's convergence by solidifying its applicability in decentralized and personalized learning. The code is available at:this https URL

Abstract (translated)

去中心化联邦学习(DFL)因其稳健性和避免集中协调而变得流行。在这种范式中,客户端通过与网络邻居交换模型来积极参与训练。然而,DFL在训练和通信方面引入了增加的成本。现有的方法通常关注最小化通信,而忽视了训练效率和数据异质性。为了填补这一空白,我们提出了一个新颖的\textit{稀疏到稀疏}训练方案:DA-DPFL。DA-DPFL以部分模型参数为基础进行初始化,在训练过程中通过动态聚合逐渐减少,从而在关键学习期间保留足够的信息。我们的实验展示了DA-DPFL在测试准确率方面明显优于DFL基线,同时实现能源成本降低至原来的5倍。我们通过固化DA-DPFL在去中心化和个性化学习上的收敛性,提供了理论分析。代码可在此处访问:https://this URL。

URL

https://arxiv.org/abs/2404.15943

PDF

https://arxiv.org/pdf/2404.15943.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot