Single- and Multi-Microphone Spectral Amplitude Estimation Using a Super-Gaussian Speech Model

Lotter, Thomas

doi:10.1007/3-540-27489-8_4

Thomas Lotter⁴

Part of the book series: Signals and Communication Technology ((SCT))

2468 Accesses
2 Citations

Abstract

In this contribution, MAP spectral amplitude estimators for speech enhancement are presented. For single-microphone applications, efficient MAP estimators with a super-Gaussian speech model, that can be adapted with high accuracy towards the real distribution in a given system, are introduced. For multi-microphone applications, joint MAP estimators that also exploit spatial properties of speech and noise are derived. Both the integration of the more accurate speech model as well as the multi-microphone joint spectral amplitude estimation improve the performance of a common DFT domain speech enhancement system.

This work was carried out while being with the Institute of Communication Systems and Data Processing (IND) at the RWTH Aachen University, Germany

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

Maximum A Posteriori Spectral Estimation with Source Log-Spectral Priors for Multichannel Speech Enhancement

Statistically Optimal Joint Multimicrophone MAP Estimators Under Super-Gaussian Assumption

Article 04 November 2023

A Comparative Study of Speech Processing in Microphone Arrays with Multichannel Alignment and Zelinski Post-Filtering

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

S. F. Boll, “Suppression of Acoustic Noise in Speech Using Spectral Subtraction,” IEEE Trans. Acoust., Speech and Signal Processing, vol. 27, pp. 113–120, 1979.
Article Google Scholar
Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator,” IEEE Trans. Acoust., Speech and Signal Processing, vol. 32, pp. 1109–1121, 1984.
Article Google Scholar
R. Martin, “Speech enhancement using MMSE short time spectral estimation with gamma distributed priors,” in Proc. IEEE ICASSP, 2002, pp. 253–256.
Google Scholar
P. J. Wolfe and S. J. Godsill, “Eficient Alternatives to the Ephraim-Malah Suppression Rule for Audio Signal Enhancement,” EURASIP Journal on Applied Signal Processing, Special Issue: Digital Audio for Multimedia Communications, vol. 2003, no. 11, pp. 1043–1051, 2003.
MATH Google Scholar
R. Martin, “Noise power spectral density estimation based on optimal smoothing and minimum statistics,” IEEE Trans. Speech and Audio Processing, vol. 9, pp. 504–512, 2001.
Article Google Scholar
I. Cohen, “On the decision-directed estimation approach of Ephraim and Malah,” in Proc. IEEE ICASSP, 2004.
Google Scholar
R. J. McAulay and M. L. Malpass, “Speech enhancement using a soft-decision noise suppression filter,” IEEE Trans. Acoust., Speech and Signal Processing, pp. 137–145, Apr. 1980.
Google Scholar
D. R. Brillinger, Time Series, Data Analysis and Theory. McGraw-Hill, 1981.
Google Scholar
H. Brehm and W. Stammler, “Description and Generation of Spherically Invariant Speech-Model Signals,” Elsevier Signal Processing, vol. 12, pp. 119–141, 1987.
Article Google Scholar
R. Martin and C. Breithaupt, “Speech Enhancement in the DFT Domain using Laplacian Speech Priors,” in Proc. IWAENC, 2003, pp. 87–90.
Google Scholar
T. Lotter, Single and Multichannel Speech Enhancement for Hearing Aids. Ph.D. Dissertation, Aachener Beiträge zu Digitalen Nachrichtensystemen (ed. P. Vary), RWTH Aachen University, 2004.
Google Scholar
P. Vary, “Noise suppression by spectral magnitude estimation — Mechanisms and theoretical limits,” Signal Processing, vol. 8, pp. 387–400, 1985.
Article Google Scholar
D. L. Wang and J. S. Jim, “The unimportance of phase in speech enhancement,” IEEE Trans. Acoust., Speech and Signal Processing, vol. ASSP-30, pp. 679–681, 1982.
Article Google Scholar
I. S. Gradshteyn and I. M. Ryzhik, Table of Integrals, Series, and Products. Academic Press, Inc., 1994.
Google Scholar
S. Kullback, Information Theory and Statistics. Dover Publication, 1968.
Google Scholar
J. L. Melsa and D. L. Cohn, Decision and Estimation Theory. McGraw-Hill, 1978.
Google Scholar
Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean-square error log-spectral amplitude estimator,” IEEE Trans. Acoust., Speech and Signal Processing, vol. 33, pp. 443–445, 1985.
Article Google Scholar
T. Lotter, C. Benien, and P. Vary, “Multichannel direction-independent speech enhancement using spectral amplitude Estimation,” EURASIP Journal on Applied Signal Processing, Special Issue: Signal Processing for Acoustic Communication Systems, vol. 2003, no. 11, pp. 1147–1157, 2003.
MATH Google Scholar
G. W. Elko, “Spatial coherence functions for differential microphone in isotropic noise fields,” in Microphone Arrays, edited by M. Brandstein and D. B. Ward, Springer-Verlag, pp. 61–86, 2001.
Google Scholar
D. Malah, R. V. Cox, and A. J. Accardi, “Tracking speech presence uncertainty to improve speech enhancement in non-stationary noise environments,” in Proc. IEEE ICASSP, 1999.
Google Scholar

Download references

Author information

Authors and Affiliations

Siemens Audiological Engineering Group, Erlangen, Germany
Thomas Lotter

Authors

Thomas Lotter
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Lotter, T. (2005). Single- and Multi-Microphone Spectral Amplitude Estimation Using a Super-Gaussian Speech Model. In: Speech Enhancement. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-27489-8_4

Download citation

DOI: https://doi.org/10.1007/3-540-27489-8_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24039-6
Online ISBN: 978-3-540-27489-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Single- and Multi-Microphone Spectral Amplitude Estimation Using a Super-Gaussian Speech Model

Abstract

Chapter PDF

Similar content being viewed by others

Maximum A Posteriori Spectral Estimation with Source Log-Spectral Priors for Multichannel Speech Enhancement

Statistically Optimal Joint Multimicrophone MAP Estimators Under Super-Gaussian Assumption

A Comparative Study of Speech Processing in Microphone Arrays with Multichannel Alignment and Zelinski Post-Filtering

Keywords

References

Author information

Authors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Single- and Multi-Microphone Spectral Amplitude Estimation Using a Super-Gaussian Speech Model

Abstract

Chapter PDF

Similar content being viewed by others

Maximum A Posteriori Spectral Estimation with Source Log-Spectral Priors for Multichannel Speech Enhancement

Statistically Optimal Joint Multimicrophone MAP Estimators Under Super-Gaussian Assumption

A Comparative Study of Speech Processing in Microphone Arrays with Multichannel Alignment and Zelinski Post-Filtering

Keywords

References

Author information

Authors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation