Abstract
In this paper, we present a fundamental frequency determination method dependent on the autocorrelation compression of the multi-scale product of speech signal. It is based on the multiplication of compressed copies of the original autocorrelation operated on the multi-scale product. The multi-scale product is based on realising the product of the speech wavelet transform coefficients at three successive dyadic scales. We use the quadratic spline wavelet function. We compress the autocorrelation of the multi-scale product a number of times by integer factors (downsampling). Hence, when the obtained functions are multiplied, we obtain a peak with a clear maximum corresponding to the fundamental frequency. We have evaluated our method on the Keele database. Experimental results show the effectiveness of our method presenting a good performance surpassing other algorithms. Besides, the proposed approach is robust in noisy environment.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Hess, W.J.: Pitch Determination of Speech Signals, pp. 373–383. Springer (1983)
Shahnaz, C., Wang, W.P., Ahmad, M.O.: A spectral Matching Method for Pitch Estimation from Noise-corrupted Speech. In: IEEE International Midwest Symposium on Circuits and Systems, pp. 1413–1416. IEEE Press, Taiwan (2009)
Chu, C., Alwan, A.: A SAFE: A Statistical Approach to F0 Estimation Under Clean and Noisy Conditions. IEEE Trans. Audio, Speech and Language Process. 20, 933–944 (2012)
Kadambe, S., Boudreaux-Bartels, G.F.: Application of the Wavelet Transform for Pitch Detection of Speech Signals. IEEE Trans. Information Theory 38, 917–924 (1992)
Ben Messaoud, M.A., Bouzid, A., Ellouze, N.: Autocorrelation of the Speech Multi-scale Product for Voicing Decision and Pitch Estimation. Springer Cognitive Computation 2, 151–159 (2010)
Meyer, G., Plante, F., Ainsworth, W.A.: A Pitch Extraction Reference Database. In: 4th European Conference on Speech Communication and Technology EUROSPEECH 1995, Madrid, pp. 837–840 (1995)
Rabiner, L., Cheng, M., Rosenberg, A., McGonegal, C.: A comparative performance study of several pitch detection algorithms. IEEE Trans. on Acoustic, Speech, and Signal. Process. 24, 399–418 (1976)
Camacho, A.: SWIPE: a Sawtooth Waveform Inspired Pitch Estimator for Speech and Music, Ph.D. dissertation, Dept. Elect. Eng., Florida Univ., USA (2007)
Ben Messaoud, M.A., Bouzid, A., Ellouze, N.: Using Multi-scale Product Spectrum for Single and Multi-pitch Estimation. IET Signal Process. Journal 5, 344–355 (2011)
Varga, A.: Assessment for Automatic Speech Recognition: II. Noisex-92: A Database and an Experiment to Study the Effect of Additive Noise on Speech Recognition Systems. Speech Communication 12, 247–251 (1993)
Joho, D., Bennewitz, M., Behnke, S.: Pitch Estimation Using Models of Voiced Speech on Three Levels. In: 4th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2007, pp. 1077–1080. IEEE Press, Honolulu (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ben Messaoud, M.A., Bouzid, A., Ellouze, N. (2013). An Efficient Method for Fundamental Frequency Determination of Noisy Speech. In: Drugman, T., Dutoit, T. (eds) Advances in Nonlinear Speech Processing. NOLISP 2013. Lecture Notes in Computer Science(), vol 7911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38847-7_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-38847-7_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38846-0
Online ISBN: 978-3-642-38847-7
eBook Packages: Computer ScienceComputer Science (R0)