Paper Reading AI Learner

Addressing Loss of Plasticity and Catastrophic Forgetting in Continual Learning

2024-03-31 19:57:38
Mohamed Elsayed, A. Rupam Mahmood

Abstract

Deep representation learning methods struggle with continual learning, suffering from both catastrophic forgetting of useful units and loss of plasticity, often due to rigid and unuseful units. While many methods address these two issues separately, only a few currently deal with both simultaneously. In this paper, we introduce Utility-based Perturbed Gradient Descent (UPGD) as a novel approach for the continual learning of representations. UPGD combines gradient updates with perturbations, where it applies smaller modifications to more useful units, protecting them from forgetting, and larger modifications to less useful units, rejuvenating their plasticity. We use a challenging streaming learning setup where continual learning problems have hundreds of non-stationarities and unknown task boundaries. We show that many existing methods suffer from at least one of the issues, predominantly manifested by their decreasing accuracy over tasks. On the other hand, UPGD continues to improve performance and surpasses or is competitive with all methods in all problems. Finally, in extended reinforcement learning experiments with PPO, we show that while Adam exhibits a performance drop after initial learning, UPGD avoids it by addressing both continual learning issues.

Abstract (translated)

深度表示学习方法在持续学习方面存在困难,常常由于刚性和无用的单元而受到损失。虽然许多方法分别解决了这两个问题,但只有少数方法同时处理这两个问题。在本文中,我们引入了一种名为 Utility-based Perturbed Gradient Descent (UPGD) 的新的方法,作为用于连续学习表示的新颖方法。UPGD 将梯度更新与扰动相结合,对更有用的单元应用较小的修改,以保护它们不遗忘,对更无用的单元应用较大的修改,以恢复它们的塑料性。我们使用具有挑战性的流式学习设置,其中连续学习问题具有数百个非平稳性和未知的任务边界。我们证明了大多数现有方法至少存在一个这些问题,主要表现在它们在任务上的准确性下降。另一方面,UPGD 在所有问题上都持续改进并超过了或与所有方法竞争。最后,在扩展的强化学习实验中使用PPO,我们证明了,尽管Adam在初始学习后表现下降,但UPGD通过解决连续学习和无用性问题来避免这种情况。

URL

https://arxiv.org/abs/2404.00781

PDF

https://arxiv.org/pdf/2404.00781.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot