Abstract
This paper documents a comprehensive evaluation carried out on automatic glottal inverse filtering and glottal source parameterisation methods. The experiments consist of analysis of a wide variety of synthetic vowels and assessment of the ability of derived parameters to differentiate breathy to tense voice. One striking finding is that glottal model-based parameters compared favourably to parameters measured directly from the glottal source signal, in terms of separation of breathy to tense voice. Also, certain combinations of inverse filtering and parameterisation methods were more robust than others.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Degottex, G., Lanchantin, P., Roebel, A., Rodet, X.: Mixed source model and its adapted vocal-tract filter estimate for voice transformation and synthesis. Speech Communication 55(2), 278–294 (2013)
Degottex, G., Roebel, A., Rodet, X.: Pitch transposition and breathiness modification using a glottal source model and its adapted vocal-tract filter. In: Proceedings of ICASSP, pp. 5128–5131 (2011)
Drugman, T., Dubuisson, T., Dutoit, T.: On the mutual information between source and filter contributions for voice pathology detection. In: Proceedings of Interspeech, pp. 1463–1466 (2009)
Lugger, M., Yang, B.: The relevance of voice quality features in speaker independent emotion recognition. In: Proceedings of ICASSP, pp. 17–20 (2007)
Walker, J., Murphy, P.: A review of glottal waveform analysis. In: Stylianou, Y., Faundez-Zanuy, M., Esposito, A. (eds.) COST 277. LNCS, vol. 4391, pp. 1–21. Springer, Heidelberg (2007)
Lin, Q.: Speech production theory and articulatory speech synthesis, Ph. D. Thesis (1990)
Székely, É., Kane, J., Scherer, S., Gobl, C., Carson-Berndsen, J.: Detecting a targeted voice style in an audiobook using voice quality features. In: Proceedings of ICASSP, pp. 4593–4596 (2012)
Drugman, T., Bozkurt, B., Dutoit, T.: A comparative study of glottal source estimation techniques. Computer Speech and Language 26, 20–34 (2011)
Alku, P.: Glottal inverse filtering analysis of human voice production - A review of estimation and parameterization methods of the glottal excitation and their applications. Sadhana 36(5), 623–650 (2011)
Yegnanarayana, B., Veldhius, R.: Extraction of vocal-tract system characteristics from speech signals. IEEE Transactions on Audio Speech and Language Processing 6(4), 313–327 (1998)
Alku, P., Bäckström, T., Vilkman, E.: Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering. Speech Communication 11(2-3), 109–118 (1992)
Drugman, T., Bozkurt, B., Dutoit, T.: Complex cepstrum-based decomposition of speech for glottal source estimation. In: Proceedings of Interspeech, pp. 116–119 (2009)
Kane, J., Gobl, C.: Evaluation of glottal closure instant detection in a range of voice qualities. Speech Communication 55(2), 295–314 (2013)
Drugman, T., Thomas, M., Gudnason, J., Naylor, P., Dutoit, T.: Detection of Glottal Closure Instants From Speech Signals: A Quantitative Review. IEEE Transactions on Audio Speech and Language processing 20(3), 994–1006 (2012)
Alku, P., Bäckström, T., Vilkman, E.: Normalized amplitude quotient for parameterization of the glottal flow. Journal of the Acoustical Society of America 112(2), 701–710 (2002)
Hacki, T.: Klassifizierung von Glottisdysfunktionen mit Hilfe der Elektroglottographie. Folia Phoniatrica 41, 43–48 (1989)
Airas, M., Alku, P.: Comparison of multiple voice source parameters in different phonation types. In: Proceedings of Interspeech, pp. 1410–1413 (2007)
Strik, H.: Automatic parameterization of differentiated glottal flow: Comparing methods by means of synthetic flow pulses. Journal of the Acoustical Society of America, 2659–2669 (1998)
Gobl, C., Ní Chasaide, A.: Amplitude-based source parameters for measuring voice quality. In: Proceedings of the ISCA Tutorial and Research Workshop VOQUAL 2003 on Voice Quality: Functions, Analysis and Synthesis, Geneva, pp. 151–156 (2003)
Degottex, G., Roebel, A., Rodet, X.: Phase minimization for glottal model estimation. IEEE Transactions on Audio Speech and Language processing 19(5), 1080–1090 (2011)
Hanson, H.M.: Glottal Characteristics of female speakers: Acoustic Correlates. Journal of the Acoustical Society of America 10(1), 466–481 (1997)
Fant, G., Liljencrants, J., Lin, Q.: A four-parameter model of glottal flow. In: STL-QPSR, Speech, Music, and Hearing, KTH, Stockholm, vol. 26(4), pp. 1–13 (1985)
Kane, J., Kane, M., Gobl, C.: A spectral LF model based approach to voice source parameterisation. In: Proceedings of Interspeech, pp. 2606–2609 (2010)
Laver, J.: The Phonetic Description of Voice Quality. Cambridge University Press (1980)
Gobl, C.: Modelling aspiration noise during phonation using the LF voice source model. In: Proceedings of Interspeech, pp. 965–968 (2006)
Kane, J., Gobl, C.: Automating manual user strategies for precise voice source analysis. Speech Communication 55(3), 397–414 (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kane, J., Gobl, C. (2013). Evaluation of Automatic Glottal Source Analysis. In: Drugman, T., Dutoit, T. (eds) Advances in Nonlinear Speech Processing. NOLISP 2013. Lecture Notes in Computer Science(), vol 7911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38847-7_1
Download citation
DOI: https://doi.org/10.1007/978-3-642-38847-7_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38846-0
Online ISBN: 978-3-642-38847-7
eBook Packages: Computer ScienceComputer Science (R0)