Automatic Recognition System for Dysarthric Speech Based on MFCC’s, PNCC’s, JITTER and SHIMMER Coefficients

Zaidi, Brahim-Fares; Boudraa, Malika; Selouani, Sid-Ahmed; Addou, Djamel; Yakoub, Mohammed Sidi

doi:10.1007/978-3-030-17798-0_40

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 944)

Included in the following conference series:

Science and Information Conference

Abstract

The aim of this work is to improve the automatic recognition of dysarthric speech. To this end, we compare two speech parameterization techniques: the recently proposed Power Normalized Cepstral Coefficients (PNCCs) and Mel-Frequency Cepstral Coefficients (MFCCs). We concatenate several variants of jitter and shimmer with each parameterization to improve the automatic recognition of dysarthric words. The goal is to assist vulnerable people with speech impairments (dysarthric voice) and to help doctors make a first diagnosis of the patient’s disease. For this purpose, we developed an automatic recognition system for continuous pathological speech based on Hidden Markov Models (HMMs), built with the Hidden Markov Model Toolkit (HTK). For our tests, we used the Nemours database, which contains 11 speakers with dysarthric voices.
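As an illustration of the feature concatenation described in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes the common "local" definitions of jitter and shimmer (mean absolute difference between consecutive pitch periods or peak amplitudes, divided by their mean), which may differ from the variants used in the chapter. It also assumes that one utterance-level value of each is repeated on every frame of an already computed cepstral matrix. The function names and input arrays are hypothetical.

```python
import numpy as np


def local_jitter(periods):
    """Mean absolute difference of consecutive pitch periods, relative to the mean period."""
    periods = np.asarray(periods, dtype=float)
    return np.mean(np.abs(np.diff(periods))) / np.mean(periods)


def local_shimmer(amplitudes):
    """Mean absolute difference of consecutive peak amplitudes, relative to the mean amplitude."""
    amplitudes = np.asarray(amplitudes, dtype=float)
    return np.mean(np.abs(np.diff(amplitudes))) / np.mean(amplitudes)


def append_perturbation(cepstra, periods, amplitudes):
    """Append utterance-level jitter and shimmer to every frame.

    cepstra: (n_frames, n_coeffs) matrix of MFCCs or PNCCs computed elsewhere.
    Returns an (n_frames, n_coeffs + 2) matrix.
    Repeating one value per utterance on all frames is an assumption of this sketch.
    """
    cepstra = np.asarray(cepstra, dtype=float)
    extra = np.array([local_jitter(periods), local_shimmer(amplitudes)])
    return np.hstack([cepstra, np.tile(extra, (cepstra.shape[0], 1))])


# Toy data standing in for real measurements (hypothetical values).
cepstra = np.zeros((3, 13))                # 3 frames, 13 cepstral coefficients
periods = [0.010, 0.011, 0.010, 0.011]     # pitch periods in seconds
amplitudes = [1.0, 0.9, 1.0, 0.9]          # peak amplitude per pitch cycle
features = append_perturbation(cepstra, periods, amplitudes)
```

In this toy case each frame grows from 13 to 15 values; the last two columns hold the jitter and shimmer ratios. Pitch periods and peak amplitudes would have to come from a separate pitch-marking step, which is outside this sketch.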

Author information

Authors and Affiliations

Laboratory of Speech Communication and Signal Processing (LSCSP), U.S.T.H.B University, Algiers, Algeria
Brahim-Fares Zaidi, Malika Boudraa & Djamel Addou
Laboratory of Research in Human-System Interaction (LARHSI), University of Moncton, Shippagan Campus, Moncton, Canada
Sid-Ahmed Selouani & Mohammed Sidi Yakoub

Corresponding author

Correspondence to Brahim-Fares Zaidi.

Editor information

Editors and Affiliations

Saga University, Saga, Saga, Japan
Kohei Arai
The Science and Information (SAI) Organization, Bradford, West Yorkshire, UK
Supriya Kapoor

Cite this paper

Zaidi, BF., Boudraa, M., Selouani, SA., Addou, D., Yakoub, M.S. (2020). Automatic Recognition System for Dysarthric Speech Based on MFCC’s, PNCC’s, JITTER and SHIMMER Coefficients. In: Arai, K., Kapoor, S. (eds) Advances in Computer Vision. CVC 2019. Advances in Intelligent Systems and Computing, vol 944. Springer, Cham. https://doi.org/10.1007/978-3-030-17798-0_40

DOI: https://doi.org/10.1007/978-3-030-17798-0_40
Published: 24 April 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-17797-3
Online ISBN: 978-3-030-17798-0
eBook Packages: Intelligent Technologies and Robotics (R0)
