Paper Reading AI Learner

AC-MAMBASEG: An adaptive convolution and Mamba-based architecture for enhanced skin lesion segmentation

2024-05-05 17:37:50
Viet-Thanh Nguyen, Van-Truong Pham, Thi-Thao Tran

Abstract

Skin lesion segmentation is a critical task in computer-aided diagnosis systems for dermatological diseases. Accurate segmentation of skin lesions from medical images is essential for early detection, diagnosis, and treatment planning. In this paper, we propose a new model for skin lesion segmentation namely AC-MambaSeg, an enhanced model that has the hybrid CNN-Mamba backbone, and integrates advanced components such as Convolutional Block Attention Module (CBAM), Attention Gate, and Selective Kernel Bottleneck. AC-MambaSeg leverages the Vision Mamba framework for efficient feature extraction, while CBAM and Selective Kernel Bottleneck enhance its ability to focus on informative regions and suppress background noise. We evaluate the performance of AC-MambaSeg on diverse datasets of skin lesion images including ISIC-2018 and PH2; then compare it against existing segmentation methods. Our model shows promising potential for improving computer-aided diagnosis systems and facilitating early detection and treatment of dermatological diseases. Our source code will be made available at: this https URL.

Abstract (translated)

皮肤病变分割是计算机辅助诊断系统皮肤疾病诊断中的一项关键任务。准确从医学图像中分割皮肤病变是早期诊断、诊断和治疗规划的必要条件。在本文中,我们提出了一个名为AC-MambaSeg的新模型用于皮肤病变分割,这是一种增强模型,具有混合CNN-Mamba骨干网络和高级组件,如卷积块注意模块(CBAM)、注意门和选择性内核瓶颈。AC-MambaSeg利用Vision Mamba框架进行高效的特征提取,而CBAM和选择性内核瓶颈则增强了其关注有信息区域并抑制背景噪声的能力。我们在包括ISIC-2018和PH2等多样数据集的皮肤病变图像上评估AC-MambaSeg的性能,然后与现有分割方法进行比较。我们的模型在改善计算机辅助诊断系统和促进早期诊断和治疗皮肤疾病方面具有令人鼓舞的潜力。我们的源代码将在此处公布:https://this URL。

URL

https://arxiv.org/abs/2405.03011

PDF

https://arxiv.org/pdf/2405.03011.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot