Abstract
Glottal inverse filtering is of potential use in a wide range of speech processing applications. Because voice production is, to a first-order approximation, a source-filter process, separating the source and filter components provides a flexible representation of the speech signal for subsequent processing. In certain applications the need for accurate inverse filtering is more immediately obvious: for example, in assessing laryngeal aspects of voice quality, or in correlating acoustics with vocal fold dynamics, the resonances of the vocal tract should first be removed. Similarly, for the assessment of vocal performance, trained singers may wish to obtain quantitative data or feedback regarding their voice at the level of the larynx.
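As a rough illustration of the source-filter idea, the following minimal sketch passes a pulse-train "source" through a single-resonance all-pole "vocal tract" and then removes that resonance with the matching inverse filter. It is not a method from the chapter. The sampling rate, resonance frequency, pole radius, pulse period, and the function names `all_pole_filter` and `inverse_filter` are all assumptions chosen for illustration. The filter coefficients are known exactly here, whereas a practical inverse-filtering method must estimate them from the speech signal itself.

```python
import numpy as np

def all_pole_filter(source, a):
    """Toy vocal-tract filter 1/A(z): y[n] = x[n] - sum_{k>=1} a[k] * y[n-k]."""
    y = np.zeros(len(source))
    for n in range(len(source)):
        acc = source[n]
        for k in range(1, len(a)):
            if n - k >= 0:
                acc -= a[k] * y[n - k]
        y[n] = acc
    return y

def inverse_filter(speech, a):
    """Inverse filter A(z): e[n] = sum_k a[k] * speech[n-k]."""
    return np.convolve(speech, a)[:len(speech)]

# Assumed values for illustration only (not taken from the chapter).
fs = 8000            # sampling rate in Hz
r, f = 0.95, 500.0   # pole radius and resonance frequency in Hz
a = np.array([1.0, -2.0 * r * np.cos(2.0 * np.pi * f / fs), r * r])

source = np.zeros(400)
source[::80] = 1.0   # idealised excitation: one pulse every 80 samples

speech = all_pole_filter(source, a)     # source shaped by the resonance
recovered = inverse_filter(speech, a)   # resonance removed again

print(np.allclose(recovered, source))
```

Because the inverse filter here is the exact reciprocal of the synthesis filter, the recovered signal matches the original pulse train up to floating-point error. With real speech the filter is unknown, and errors in estimating it appear as residual formant ripple on the estimated source.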
Copyright information
© 2007 Springer Berlin Heidelberg
About this chapter
Cite this chapter
Walker, J., Murphy, P. (2007). A Review of Glottal Waveform Analysis. In: Stylianou, Y., Faundez-Zanuy, M., Esposito, A. (eds) Progress in Nonlinear Speech Processing. Lecture Notes in Computer Science, vol 4391. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71505-4_1
DOI: https://doi.org/10.1007/978-3-540-71505-4_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71503-0
Online ISBN: 978-3-540-71505-4
eBook Packages: Computer Science, Computer Science (R0)