A Transversal Study of Fundamental Frequency Contours in Parkinsonian Voices

2024-02-09 13:08:58
Pablo Rodriguez-Perez, Ruben Fraile, Miguel Garcia-Escrig, Nicolas Saenz-Lechon, Juana M. Gutierrez-Arriola, Victor Osma-Ruiz

Abstract

A transversal study of the pitch variability of parkinsonian voices in read speech is presented. 30 patients suffering from Parkinson's disease (PD) and 32 healthy speakers were recorded while reading a text without voiceless phonemes. The fundamental frequency contours were calculated from the recordings, and the following measures were used for describing them: mean, minimum, maximum, and standard deviation of the estimated fundamental frequencies. Results based on these measures indicate that the influence of PD on some aspects of intonation can be masked by the effects of aging, especially for male voices. However, some parameters such as the relative fundamental frequency range exhibit lower correlations with age than with PD stage, as evaluated using the Hoehn and Yahr scale. These correlations between relative fundamental frequency range and PD stage reach moderate-to-high values in the case of women. Additionally, three parameters describing the form of the fundamental frequency modulation spectrum were investigated for correlation with age and PD stage. The study of this modulation spectrum provides some insight into the ability of the speakers to plan the intonation of full phrases. For both male and female populations, significant correlations were found between parameters obtained from the modulation spectrum of fundamental frequency and the PD stage. Nevertheless, the quantitative assessment of the performance of regression models built from these modulation parameters and fundamental frequency range suggests that such measures are likely to be of limited value in the early diagnosis of PD due to inter-speaker variability.
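As a rough illustration of the measures named in the abstract, the sketch below summarises a fundamental frequency contour (mean, minimum, maximum, standard deviation) and computes a simple magnitude spectrum of the contour's modulation. It is not the authors' implementation. The abstract does not define the relative fundamental frequency range or the three modulation-spectrum parameters, so the formula (max - min) / mean, the function names, and the plain DFT used here are all assumptions made for illustration.

```python
import cmath
import math
import statistics


def f0_summary(f0_hz):
    """Summarise an F0 contour given as per-frame estimates in Hz.

    Frames that are NaN or non-positive are treated as unvoiced and dropped.
    """
    voiced = [f for f in f0_hz if not math.isnan(f) and f > 0]
    if len(voiced) < 2:
        raise ValueError("need at least two voiced frames")
    mean = statistics.fmean(voiced)
    lowest, highest = min(voiced), max(voiced)
    return {
        "mean": mean,
        "min": lowest,
        "max": highest,
        "std": statistics.stdev(voiced),  # sample standard deviation
        # ASSUMPTION: range normalised by the mean; the abstract gives no formula.
        "relative_range": (highest - lowest) / mean,
    }


def modulation_spectrum(f0_hz, frame_rate_hz):
    """Magnitude spectrum of the mean-removed F0 contour.

    ASSUMPTION: a plain DFT over the whole contour; the paper may use a
    different estimator. Returns (modulation frequencies in Hz, magnitudes).
    """
    n = len(f0_hz)
    mean = statistics.fmean(f0_hz)
    centered = [f - mean for f in f0_hz]
    freqs, mags = [], []
    for k in range(n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * i / n)
            for i, x in enumerate(centered)
        )
        freqs.append(k * frame_rate_hz / n)
        mags.append(abs(coeff))
    return freqs, mags
```

For example, a contour sampled at 100 frames per second whose pitch rises and falls twice per second should show its strongest modulation component at 2 Hz. Slow components of this kind are what would reflect phrase-level intonation planning, which is the aspect the abstract associates with the modulation spectrum.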


URL

https://arxiv.org/abs/2402.06387

PDF

https://arxiv.org/pdf/2402.06387.pdf

