Abstract
Detection, localization, and enhancement of a generic acoustic message produced in a noisy environment can be accomplished by means of a microphone array. Crosspower Spectrum Phase analysis and the corresponding Coherence Measure allow accurate time delay estimation, which is then used for source localization. Once the source position is estimated, an enhanced version of the original acoustic message is derived, which can serve as input to a speech recognition system. Preliminary talker localization results are presented.
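The time delay estimation step can be illustrated with a minimal sketch. It assumes that the Crosspower Spectrum Phase analysis keeps only the phase of the cross-power spectrum of the two microphone signals and takes the peak of its inverse transform as the delay estimate (a formulation commonly known as GCC-PHAT); the abstract does not give the exact formulation used in the chapter. The names `csp_delay` and `delay_to_angle`, the regularization constant `eps`, the far-field geometry, and the speed of sound value are all illustrative assumptions, not taken from the chapter.

```python
import numpy as np


def csp_delay(x1, x2, max_lag=None, eps=1e-12):
    """Estimate the delay of x2 relative to x1, in samples.

    Hypothetical helper: whitens the cross-power spectrum so that only
    its phase remains, then searches the inverse transform for a peak.
    Returns (lag, coherence), where coherence[k] corresponds to the lag
    k - max_lag.
    """
    x1 = np.asarray(x1, dtype=float)
    x2 = np.asarray(x2, dtype=float)
    n = len(x1) + len(x2)              # zero-pad to avoid circular wrap-around
    X1 = np.fft.rfft(x1, n=n)
    X2 = np.fft.rfft(x2, n=n)
    cross = X2 * np.conj(X1)           # cross-power spectrum
    cross /= np.abs(cross) + eps       # keep the phase only (eps is an assumption)
    coherence = np.fft.irfft(cross, n=n)
    if max_lag is None:
        max_lag = min(len(x1), len(x2)) - 1
    # Reorder so that the lags run from -max_lag to +max_lag.
    coherence = np.concatenate((coherence[-max_lag:], coherence[:max_lag + 1]))
    lag = int(np.argmax(coherence)) - max_lag
    return lag, coherence


def delay_to_angle(lag, fs, spacing, c=343.0):
    """Convert a delay to a far-field arrival angle in radians.

    Assumes a plane wave and a single microphone pair `spacing` metres
    apart; the angle is measured from the broadside direction.
    """
    sin_theta = np.clip(c * lag / (fs * spacing), -1.0, 1.0)
    return float(np.arcsin(sin_theta))


# Synthetic check: the second channel is the first one delayed by 7 samples,
# with independent noise added to both channels.
rng = np.random.default_rng(0)
source = rng.standard_normal(1024)
true_delay = 7
mic1 = source + 0.1 * rng.standard_normal(1024)
mic2 = np.concatenate((np.zeros(true_delay), source[:-true_delay]))
mic2 = mic2 + 0.1 * rng.standard_normal(1024)

lag, coherence = csp_delay(mic1, mic2, max_lag=32)
angle = delay_to_angle(lag, fs=16000, spacing=0.5)
print(lag, angle)
```

A single pair yields only a direction under the far-field assumption; with two microphone pairs, as in the chapter title, two such directions could in principle be intersected to obtain a position, although the abstract does not describe how the chapter combines them.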
This work was partially supported by the DIMUS-ESPRIT project.
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Omologo, M., Svaizer, P. (1995). Talker Tracking using two Microphone Pairs and a CrosspowerSpectrum Phase Analysis. In: Ayuso, A.J.R., Soler, J.M.L. (eds) Speech Recognition and Coding. NATO ASI Series, vol 147. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-57745-1_43
DOI: https://doi.org/10.1007/978-3-642-57745-1_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-63344-7
Online ISBN: 978-3-642-57745-1