Paper Reading AI Learner

Deep Residual Compensation Convolutional Network without Backpropagation

2023-01-27 11:45:09
Mubarakah Alotaibi, Richard Wilson

Abstract

PCANet and its variants provided good accuracy results for classification tasks. However, despite the importance of network depth in achieving good classification accuracy, these networks were trained with a maximum of nine layers. In this paper, we introduce a residual compensation convolutional network, which is the first PCANet-like network trained with hundreds of layers while improving classification accuracy. The design of the proposed network consists of several convolutional layers, each followed by post-processing steps and a classifier. To correct the classification errors and significantly increase the network's depth, we train each layer with new labels derived from the residual information of all its preceding layers. This learning mechanism is accomplished by traversing the network's layers in a single forward pass without backpropagation or gradient computations. Our experiments on four distinct classification benchmarks (MNIST, CIFAR-10, CIFAR-100, and TinyImageNet) show that our deep network outperforms all existing PCANet-like networks and is competitive with several traditional gradient-based models.

Abstract (translated)

PCANet及其变体在分类任务中提供了良好的准确性结果。然而,尽管网络深度在实现良好的分类精度方面非常重要,但这些网络最多只训练了九层。在本文中,我们介绍了一种残留卷积神经网络,它是第一个在提高分类精度的同时训练数百层的PCANet-like网络。 proposed network的设计包括几个卷积层,每个卷积层后是 post-processing steps 和分类器。为了纠正分类错误并显著增加网络的深度,我们每个层训练从其前几个层残留的信息中提取的新标签。这种学习机制是通过在一次forward pass中穿越网络的层来实现的,而不需要反向传播或梯度计算。我们对四个不同的分类基准(米NIST、CIFAR-10、CIFAR-100和tinyImageNet)进行的试验表明,我们的深度网络在所有现有的PCANet-like网络中表现更好,并与几个传统的基于梯度的方法竞争。

URL

https://arxiv.org/abs/2301.11663

PDF

https://arxiv.org/pdf/2301.11663.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot