Quantitative Assessment of Speech in Cerebellar Ataxia Using Magnitude and Phase Based Cepstrum

Kashyap, Bipasha; Pathirana, Pubudu N.; Horne, Malcolm; Power, Laura; Szmulewicz, David

doi:10.1007/s10439-020-02455-7

Original Article
Published: 21 January 2020

Volume 48, pages 1322–1336, (2020)

Annals of Biomedical Engineering

Bipasha Kashyap¹ (ORCID: orcid.org/0000-0002-9469-858X),
Pubudu N. Pathirana¹,
Malcolm Horne²,
Laura Power³ &
David Szmulewicz²,³,⁴

Abstract

The clinical assessment of speech abnormalities in Cerebellar Ataxia (CA) is time-consuming and inconsistent. We have developed an automated, objective system to quantify CA severity and thereby facilitate remote monitoring and optimisation of therapeutic interventions. A quantitative acoustic assessment could prove to be a viable biomarker for this purpose. Our study explores the use of phase-based cepstral features extracted from the modified group delay function as a complement to the features obtained from the magnitude cepstrum. We selected a combination of 15 acoustic measurements using the RELIEF feature selection algorithm during the feature optimisation process. These features were used to separate ataxic speakers from normal speakers (controls) and to grade the ataxic speakers objectively by severity. The approach was evaluated in a clinical study involving 42 patients diagnosed with CA and 23 age-matched controls. A support vector machine (SVM) classifier with a radial basis function kernel achieved a classification accuracy of 84.6% in CA–Control discrimination [area under the ROC curve (AUC) of 0.97] and 74% in the modified 3-level CA severity estimation (AUC of 0.90) deduced from the clinical ratings. The strong classification ability of the selected features and the SVM model supports this scheme’s suitability for monitoring CA-related speech motor abnormalities.
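
As a rough illustration of the two feature families named in the abstract, the sketch below computes a magnitude (real) cepstrum and modified-group-delay cepstral coefficients for a single frame, using only the Python standard library. It follows the commonly cited formulation of the modified group delay function (Murthy and Gadde, 2003), not necessarily the exact pipeline of this article. The function names, the values of alpha, gamma and lifter, the number of coefficients, and the synthetic frame are all assumptions made for illustration.

```python
import cmath
import math


def dft(x):
    """Naive O(N^2) discrete Fourier transform; adequate for short frames."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Naive inverse DFT matching dft() above."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
        for t in range(n)
    ]


def real_cepstrum(frame, eps=1e-12):
    """Magnitude cepstrum: inverse DFT of the log magnitude spectrum."""
    log_mag = [math.log(abs(v) + eps) for v in dft(frame)]
    return [c.real for c in idft(log_mag)]


def smoothed_magnitude(frame, lifter=8, eps=1e-12):
    """Cepstrally smoothed magnitude spectrum (low quefrencies only)."""
    ceps = real_cepstrum(frame, eps)
    n = len(ceps)
    # Keep the low-quefrency part and its mirror image so the result stays real.
    kept = [c if (q < lifter or q > n - lifter) else 0.0 for q, c in enumerate(ceps)]
    return [math.exp(v.real) for v in dft(kept)]


def modified_group_delay(frame, alpha=0.4, gamma=0.9, lifter=8):
    """Modified group delay per DFT bin.

    tau = (X_R*Y_R + X_I*Y_I) / S^(2*gamma), then sign(tau) * |tau|^alpha,
    where Y is the DFT of n*x[n] and S is the smoothed magnitude spectrum.
    alpha, gamma and lifter are illustrative values (assumptions).
    """
    n = len(frame)
    x_spec = dft(frame)
    y_spec = dft([t * frame[t] for t in range(n)])
    smooth = smoothed_magnitude(frame, lifter)
    mgd = []
    for xk, yk, sk in zip(x_spec, y_spec, smooth):
        tau = (xk.real * yk.real + xk.imag * yk.imag) / (sk ** (2 * gamma))
        mgd.append(math.copysign(abs(tau) ** alpha, tau))
    return mgd


def dct2(values, n_coeffs):
    """Unnormalised DCT-II, used to turn a spectrum-like curve into cepstral coefficients."""
    n = len(values)
    return [
        sum(v * math.cos(math.pi * k * (2 * i + 1) / (2 * n)) for i, v in enumerate(values))
        for k in range(n_coeffs)
    ]


# Hypothetical 64-sample frame: a decaying sinusoid standing in for voiced speech.
frame = [math.exp(-0.05 * t) * math.sin(0.3 * t) for t in range(64)]
magnitude_features = real_cepstrum(frame)[:12]
phase_features = dct2(modified_group_delay(frame), 12)
```

In practice the two coefficient vectors would be concatenated per frame and passed to a feature selector and classifier. A windowed FFT from a numerical library would replace the naive DFT for real recordings.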

Acknowledgments

This research is supported by the Royal Victorian Eye and Ear Hospital (RVEEH), the Florey Institute of Neuroscience and Mental Health, Melbourne, Australia, through the National Health and Medical Research Council (NHMRC, Grants GNT1101304 and APP1129595) and CSIRO Data61.

Author information

Authors and Affiliations

Networked and Sensing Control (NSC) Lab, School of Engineering, Deakin University, Waurn Ponds, Victoria, Australia
Bipasha Kashyap & Pubudu N. Pathirana
Florey Institute of Neuroscience and Mental Health, Parkville, Victoria, Australia
Malcolm Horne & David Szmulewicz
Balance Disorders and Ataxia Service, Royal Victorian Eye and Ear Hospital, St Andrews Place, East Melbourne, Victoria, Australia
Laura Power & David Szmulewicz
Cerebellar Ataxia Clinic, Alfred Hospital, Prahran, Melbourne, Victoria, Australia
David Szmulewicz

Corresponding author

Correspondence to Bipasha Kashyap.

Additional information

Associate Editor Eiji Tanaka oversaw the review of this article.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Kashyap, B., Pathirana, P.N., Horne, M. et al. Quantitative Assessment of Speech in Cerebellar Ataxia Using Magnitude and Phase Based Cepstrum. Ann Biomed Eng 48, 1322–1336 (2020). https://doi.org/10.1007/s10439-020-02455-7

Received: 03 September 2019
Accepted: 08 January 2020
Published: 21 January 2020
Issue Date: April 2020
DOI: https://doi.org/10.1007/s10439-020-02455-7
