Paper Reading AI Learner

RepVGG-GELAN: Enhanced GELAN with VGG-STYLE ConvNets for Brain Tumour Detection

2024-05-06 15:02:16
Thennarasi Balakrishnan, Sandeep Singh Sengar

Abstract

Object detection algorithms particularly those based on YOLO have demonstrated remarkable efficiency in balancing speed and accuracy. However, their application in brain tumour detection remains underexplored. This study proposes RepVGG-GELAN, a novel YOLO architecture enhanced with RepVGG, a reparameterized convolutional approach for object detection tasks particularly focusing on brain tumour detection within medical images. RepVGG-GELAN leverages the RepVGG architecture to improve both speed and accuracy in detecting brain tumours. Integrating RepVGG into the YOLO framework aims to achieve a balance between computational efficiency and detection performance. This study includes a spatial pyramid pooling-based Generalized Efficient Layer Aggregation Network (GELAN) architecture which further enhances the capability of RepVGG. Experimental evaluation conducted on a brain tumour dataset demonstrates the effectiveness of RepVGG-GELAN surpassing existing RCS-YOLO in terms of precision and speed. Specifically, RepVGG-GELAN achieves an increased precision of 4.91% and an increased AP50 of 2.54% over the latest existing approach while operating at 240.7 GFLOPs. The proposed RepVGG-GELAN with GELAN architecture presents promising results establishing itself as a state-of-the-art solution for accurate and efficient brain tumour detection in medical images. The implementation code is publicly available at this https URL.

Abstract (translated)

物体检测算法,尤其是基于YOLO的算法,已经在平衡速度和精度方面取得了显著的效率。然而,在肿瘤检测应用中,它们的应用仍然没有被充分利用。这项研究提出了RepVGG-GELAN,一种新颖的YOLO架构,通过在RepVGG上进行重新参数化卷积,特别是在医学图像中的肿瘤检测。RepVGG-GELAN利用RepVGG架构来提高检测肿瘤的速度和准确性。将RepVGG集成到YOLO框架中旨在实现计算效率和检测性能的平衡。本研究包括一个基于空间金字塔池化的全局 efficient层聚合网络(GELAN)架构,进一步增强了RepVGG的检测能力。在肿瘤数据集上进行的实验评估表明,RepVGG-GELAN在准确性和速度方面超过了现有的RCS-YOLO。具体来说,RepVGG-GELAN在最新的现有方法的基础上实现了4.91%的增加的准确性和2.54%的增加的AP50。所提出的RepVGG-GELAN与GELAN架构相结合,为准确且高效的肿瘤检测在医学图像中提供了有前景的解决方案。实现代码可在此链接的公开URL中获取。

URL

https://arxiv.org/abs/2405.03541

PDF

https://arxiv.org/pdf/2405.03541.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot