Investigating voice as a biomarker: Deep phenotyping methods for early detection of Parkinson's disease

https://doi.org/10.1016/j.jbi.2019.103362Get rights and content
Under an Elsevier user license
open archive

Highlights

  • Mild Parkinson’s can be distinguished from no Parkinson’s using voice features.

  • Identity confounding leads to overestimates of model performance.

  • Gradient boosted model outperforms Random Forest and Logistic Regression models.

Abstract

Voice technology has grown tremendously in recent years and using voice as a biomarker has also been gaining evidence. We demonstrate the potential of voice in serving as a deep phenotype for Parkinson’s Disease (PD), the second most common neurodegenerative disorder worldwide, by presenting methodology for voice signal processing for clinical analysis. Detection of PD symptoms typically requires an exam by a movement disorder specialist and can be hard to access and inconsistent in findings. A vocal digital biomarker could supplement the cumbersome existing manual exam by detecting and quantifying symptoms to guide treatment. Specifically, vocal biomarkers of PD are a potentially effective method of assessing symptoms and severity in daily life, which is the focus of the current research. We analyzed a database of PD patient and non-PD subjects containing voice recordings that were used to extract paralinguistic features, which served as inputs to machine learning models to predict PD severity. The results are presented here and the limitations are discussed given the nature of the recordings. We note that our methodology only advances biomarker research and is not cleared for clinical use. Specifically, we demonstrate that conventional machine learning models applied to voice signals can be used to differentiate participants with PD who exhibit little to no symptoms from healthy controls. This work highlights the potential of voice to be used for early detection of PD and indicates that voice may serve as a deep phenotype for PD, enabling precision medicine by improving the speed, accuracy, accessibility, and cost of PD management.

Keywords

Parkinson's disease
Deep phenotype
Feature selection
Voice technology
Audio features
Voice biomarkers

Cited by (0)