Paper Reading AI Learner

CardioXNet: A Novel Lightweight CRNN Framework for Classifying Cardiovascular Diseases from Phonocardiogram Recordings

2020-10-03 17:07:42
Samiul Based Shuvo, Shams Nafisa Ali, Soham Irtiza Swapnil

Abstract

The alarmingly high mortality rate and increasing global prevalence of cardiovascular diseases (CVDs) signify the crucial need for early detection schemes. Phonocardiogram(PCG) signals has been historically applied in this domain owing to its simplicity and cost-effectiveness. However, insufficiency of expert physicians and human subjectivity affect the applicability of this technique, especially in the low-resource settings. For resolving this issue, in this paper, we introduce CardioXNet,a novel lightweight CRNN architecture for automatic detection of five classes of cardiac auscultation namely normal, aortic stenosis, mitral stenosis, mitral regurgitation and mitral valve prolapse using raw PCG signal. The process has been automated by the involvement of two learning phases namely, representation learning and sequence residual learning. The first phase mainly focuses on automated feature extraction and it has been implemented in a modular way with three parallel CNN pathways i.e., frequency feature extractor (FFE), pattern extractor (PE) and adaptive feature extractor (AFE). 1D-CNN based FFE and PE respectively learn the coarse and fine-grained features from the PCG while AFE explores the salient features from variable receptive fields involving 2D-CNN based squeezeexpansion. Thus, in the representation learning phase, the network extracts efficient time-invariant features and converges with great rapidity. In the sequential residual learning phase,because of the bidirectional-LSTMs and the skip connection, the network can proficiently extract temporal features. The obtained results demonstrate that the proposed end-to-end architecture yields outstanding performance in all the evaluation metrics compared to the previous state-of-the-art methods with up to 99.6% accuracy, 99.6% precision, 99.6% recall and 99.4% F1-score on an average while being computationally comparable.

Abstract (translated)

URL

https://arxiv.org/abs/2010.01392

PDF

https://arxiv.org/pdf/2010.01392.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot