Evaluation of Automatic Glottal Source Analysis

Kane, John; Gobl, Christer

doi:10.1007/978-3-642-38847-7_1

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7911)

Included in the following conference series:

International Conference on Nonlinear Speech Processing

Abstract

This paper documents a comprehensive evaluation of automatic glottal inverse filtering and glottal source parameterisation methods. The experiments consist of an analysis of a wide variety of synthetic vowels and an assessment of how well the derived parameters differentiate voice qualities across the breathy-to-tense range. One striking finding is that glottal model-based parameters compared favourably with parameters measured directly from the glottal source signal in terms of separating breathy to tense voice. In addition, certain combinations of inverse filtering and parameterisation methods were more robust than others.

Author information

Authors and Affiliations

Phonetics and Speech Laboratory, School of Linguistic, Speech and Communication Sciences, Trinity College Dublin, Ireland
John Kane & Christer Gobl

Editor information

Editors and Affiliations

TCTS Lab, University of Mons, 31, Boulevard Dolez, 7000, Mons, Belgium
Thomas Drugman
TCTS Lab, University of Mons, 31, Boulevard Dolez, 7000, Mons, Belgium
Thierry Dutoit

Cite this paper

Kane, J., Gobl, C. (2013). Evaluation of Automatic Glottal Source Analysis. In: Drugman, T., Dutoit, T. (eds) Advances in Nonlinear Speech Processing. NOLISP 2013. Lecture Notes in Computer Science, vol 7911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38847-7_1

DOI: https://doi.org/10.1007/978-3-642-38847-7_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38846-0
Online ISBN: 978-3-642-38847-7
eBook Packages: Computer Science, Computer Science (R0)
