A Review of Glottal Waveform Analysis

Walker, Jacqueline; Murphy, Peter

doi:10.1007/978-3-540-71505-4_1

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 4391)

Abstract

Glottal inverse filtering is of potential use in a wide range of speech processing applications. Because voice production is, to a first-order approximation, a source-filter process, obtaining the source and filter components provides a flexible representation of the speech signal for use in processing applications. In certain applications the need for accurate inverse filtering is more immediately obvious: for example, in assessing laryngeal aspects of voice quality and in correlating acoustics with vocal fold dynamics, the resonances of the vocal tract must first be removed. Similarly, for assessment of vocal performance, trained singers may wish to obtain quantitative data or feedback regarding their voice at the level of the larynx.
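
The source-filter idea in the abstract can be illustrated with a minimal sketch. It is not the chapter's method: the two-pole "vocal tract", its coefficients, the impulse-train "source" and the function names are all assumptions chosen for illustration, and the filter is taken as known, whereas practical glottal inverse filtering has to estimate it from the speech signal itself.

```python
def all_pole_filter(source, a):
    """Pass a source through 1/A(z), where A(z) = 1 + a[0] z^-1 + a[1] z^-2 + ..."""
    out = []
    for n, x in enumerate(source):
        y = x
        for k, ak in enumerate(a, start=1):
            if n - k >= 0:
                y -= ak * out[n - k]  # feedback from past outputs (resonance)
        out.append(y)
    return out


def inverse_filter(speech, a):
    """Apply the FIR filter A(z), which cancels the all-pole filter 1/A(z)."""
    out = []
    for n, s in enumerate(speech):
        e = s
        for k, ak in enumerate(a, start=1):
            if n - k >= 0:
                e += ak * speech[n - k]  # remove the resonance contribution
        out.append(e)
    return out


# Hypothetical toy values, not taken from the chapter.
a = [-1.2, 0.8]  # one stable resonance (pole radius about 0.89)
source = [1.0 if n % 50 == 0 else 0.0 for n in range(400)]  # impulse train

speech = all_pole_filter(source, a)    # "voice production": source through filter
recovered = inverse_filter(speech, a)  # "inverse filtering": undo the filter

max_error = max(abs(r - s) for r, s in zip(recovered, source))
print(f"largest reconstruction error: {max_error:.2e}")
```

With the filter known exactly, the recovered signal matches the source up to rounding error. The difficulty in real glottal waveform analysis lies in estimating the filter when only the speech signal is observed.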


Author information

Authors and Affiliations

Department of Electronic and Computer Engineering, University of Limerick, Limerick, Ireland
Jacqueline Walker & Peter Murphy

Editor information

Yannis Stylianou, Marcos Faundez-Zanuy, Anna Esposito

About this chapter

Cite this chapter

Walker, J., Murphy, P. (2007). A Review of Glottal Waveform Analysis. In: Stylianou, Y., Faundez-Zanuy, M., Esposito, A. (eds) Progress in Nonlinear Speech Processing. Lecture Notes in Computer Science, vol 4391. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71505-4_1

DOI: https://doi.org/10.1007/978-3-540-71505-4_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71503-0
Online ISBN: 978-3-540-71505-4
eBook Packages: Computer Science, Computer Science (R0)
