Statistical Methods for the Enhancement of Noisy Speech

Martin, Rainer

doi:10.1007/3-540-27489-8_3

Rainer Martin⁴

Part of the book series: Signals and Communication Technology ((SCT))

2533 Accesses
14 Citations

Abstract

Speech signals are frequently disturbed by statistically independent additive noise signals. When the power fluctuation of the noise signal is significantly slower than that of the speech signal, a single-microphone approach may be successfully used to reduce the level of the disturbing noise. This chapter outlines algorithms for noise reduction which are based on short term spectral representations of speech and on optimal estimation techniques. We present some of the more prominent estimation methods for complex spectral coefficients, for the amplitude and phase of spectral coefficients, and for related parameters such as the a priori signal-to-noise ratio. We interpret these algorithms in terms of their input-output characteristics. Some recent developments such as the use of super-Gaussian speech models and the properties of the resulting estimators are highlighted. Furthermore, we discuss the estimation of the background noise power and the application of these techniques in conjunction with a low bit rate speech coder.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

A brief overview of speech enhancement with linear filtering

Article Open access 13 November 2014

Maximum A Posteriori Spectral Estimation with Source Log-Spectral Priors for Multichannel Speech Enhancement

Acoustic Signal Processing

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

S. Boll, “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Trans. Acoustics, Speech and Signal Processing, vol. 27, pp. 113–120, 1979.
Article Google Scholar
M. Berouti, R. Schwartz, and J. Makhoul, “Enhancement of speech corrupted by acoustic noise,” in Proc. IEEE ICASSP, 1979, pp. 208–211.
Google Scholar
J. Lim, ed., Speech Enhancement. Prentice-Hall, 1983.
Google Scholar
Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator,” IEEE Trans. Acoustics, Speech and Signal Processing, vol. 32, pp. 1109–1121, Dec. 1984.
Article Google Scholar
D. Van Compernolle, “DSP techniques for speech enhancement,” in Proc. Speech Processing in Adverse Conditions, 1992, pp. 21–30.
Google Scholar
R. Martin, “Statistical methods for the enhancement of noisy speech,” in Proc. IWAENC, 2003, pp. 1–6.
Google Scholar
Y. Ephraim and I. Cohen, “Recent advancements in speech enhancement,” book chapter, CRC Press, 2004.
Google Scholar
O. Cappé, “Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor,” IEEE Trans. Speech and Audio Processing, vol. 2, pp. 345–349, Apr. 1994.
Article Google Scholar
Y. Ephraim and H. Van Trees, “A signal subspace approach for speech enhancement,” IEEE Trans. Speech and Audio Processing, vol. 3, no. 4, pp. 251–266, 1995.
Article Google Scholar
T. Gülzow and A. Engelsberg, “Comparison of a discrete wavelet transformation and a nonuniform polyphase filterbank applied to spectral subtraction speech enhancement,” Signal Processing, Elsevier, vol. 64, no. 1, pp. 5–19, 1998.
Article MATH Google Scholar
S. Gustafsson, P. Jax, and P. Vary, “A novel psychoacoustically motivated audio enhancement algorithm preserving background noise characteristics,” in Proc. IEEE ICASSP, 1998, pp. 397–400.
Google Scholar
S. Gustafsson, R. Martin, P. Jax, and P. Vary, “A psychoacoustic approach to combined acoustic echo cancellation and noise reduction,” IEEE Trans. Speech and Audio Processing, vol. 10, no. 5, pp. 245–256, 2002.
Article Google Scholar
D. Griffin and J. Lim, “Signal estimation from modified short-time Fourier transform,” IEEE Trans. Acoustics, Speech and Signal Processing, vol. 32, pp. 236–243, Apr. 1984.
Article Google Scholar
R. Martin and R. Cox, “New speech enhancement techniques for Low bit rate speech coding,” in Proc. IEEE Workshop on Speech Coding, 1999, pp. 165–167.
Google Scholar
P. Scalart and J. Vieira Filho, “Speech enhancement based on a priori signal to noise estimation,” in Proc. IEEE ICASSP, 1996, pp. 629–632.
Google Scholar
K. Linhard and T. Haulick, “Noise subtraction with parametric recursive gain curves,” in Proc. EUROSPEECH, vol. 6, 1999, pp. 2611–2614.
Google Scholar
C. Beaugeant and P. Scalart, “Speech enhancement using a minimum least-squares amplitude estimator,” in Proc. IWAENC, 2001, pp. 191–194.
Google Scholar
I. Cohen and B. Berdugo, “Speech enhancement for non-stationary noise environments,” Signal Processing, Elsevier, vol. 81, pp. 2403–2418, 2001.
Article MATH Google Scholar
I. Cohen, “Speech enhancement using a noncausal a priori SNR estimator,” IEEE Signal Processing Letters, vol. 11, pp. 725–728, 2004.
Article Google Scholar
D. Wang and J. Lim, “The unimportance of phase in speech enhancement,” IEEE Trans. Acoustics, Speech and Signal Processing, vol. 30, no. 4, pp. 679–681, 1982.
Article Google Scholar
I. Gradshteyn and I. Ryzhik, Table of Integrals, Series, and Products. Academic Press, 5th ed., 1994.
Google Scholar
R. McAulay and M. Malpass, “Speech enhancement using a soft-decision noise suppression filter,” IEEE Trans. Acoustics, Speech and Signal Processing, vol. 28, pp. 137–145, Dec. 1980.
Article Google Scholar
P. Wolfe and S. Godsill, “Simple alternatives to the Ephraim and Malah suppression rule for speech enhancement,” in IEEE Workshop on Statistical Signal Processing, 2001, pp. 496–499.
Google Scholar
T. Lotter and P. Vary, “Noise reduction by maximum a posteriori spectral amplitude estimation with supergaussian speech modeling,” in Proc. IWAENC, 2003, pp. 83–86.
Google Scholar
Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean-square error log-spectral amplitude estimator,” IEEE Trans. Acoustics, Speech and Signal Processing, vol. 33, pp. 443–445, Apr. 1985.
Article Google Scholar
D. O’shaughnessy, Speech Communications. IEEE Press, 2 ed., 2000.
Google Scholar
J. Porter and S. Boll, “Optimal estimators for spectral restoration of noisy speech,” in Proc. IEEE ICASSP, 1984, pp. 18A.2.1–18A.2.4.
Google Scholar
R. Martin, “Speech enhancement using MMSE short time spectral estimation with Gamma distributed speech priors,” in Proc. IEEE ICASSP, vol. I, 2002, pp. 253–256.
Google Scholar
D. Brillinger, Time Series: Data Analysis and Theory. Holden-Day, 1981.
Google Scholar
R. Martin, “Speech enhancement based on minimum mean square error estimation and supergaussian priors,” IEEE Trans. Speech and Audio Processing, to appear, 2005.
Google Scholar
C. Breithaupt and R. Martin, “MMSE estimation of magnitude-squared DFT coefficients with supergaussian priors,” in Proc. IEEE ICASSP, vol. I, 2003, pp. 848–851.
Google Scholar
R. Martin and C. Breithaupt, “Speech enhancement in the DFT domain using Laplacian speech priors,” in Proc. IWAENC, 2003, pp. 87–90.
Google Scholar
D. Van Compernolle, “Noise adaptation in a hidden Markov model speech recognition system,” Computer Speech and Language, vol. 3, pp. 151–167, 1989.
Article Google Scholar
J. Sohn and W. Sung, “A Voice activity detector employing soft decision based noise spectrum adaptation,” in Proc. IEEE ICASSP, vol. 1, 1998, pp. 365–368.
Google Scholar
D. Malah, R. Cox, and A. Accardi, “Tracking speech-presence uncertainty to improve speech enhancement in non-stationary noise environments,” in Proc. IEEE ICASSP, 1999, pp. 789–792.
Google Scholar
R. Martin, “Spectral subtraction based on minimum statistics,” in Proc. EUSIPCO, 1994, pp. 1182–1185.
Google Scholar
R. Martin, “Noise power spectral density estimation based on optimal smoothing and minimum statistics,” IEEE Trans. Speech and Audio Processing, vol. 9, pp. 504–512, July 2001.
Article Google Scholar
I. Cohen, “Noise estimation in adverse environments: improved minima controlled recursive averaging,” IEEE Trans. Speech and Audio Processing, vol. 11, pp. 466–475, Sept. 2003.
Article Google Scholar
T. Wang, K. Koishida, V. Cuperman, A. Gersho, and J. Collura, “A 1200/2400 BPS coding suite based on MELP,” in IEEE Workshop on Speech Coding, 2002, pp. 90–92.
Google Scholar
R. Martin, D. Malah, R. Cox, and A. Accardi, “A noise reduction preprocessor for Mobile voice communication,” EURASIP Journal on Applied Signal Processing, vol. 2004, pp. 1046–1058, Aug. 2004.
Article MATH Google Scholar
G. Elko, “Microphone array systems for hands-free telecommunication,” in Proc. IWAENC, 1995, pp. 31–38.
Google Scholar
M. Brandstein and D. B. Ward, eds., Microphone Arrays. Springer-Verlag, Berlin, 2001.
Google Scholar
R. Zelinski, “A Microphone array with adaptive post-filtering for noise reduction in reverberant rooms,” in Proc. IEEE ICASSP, 1988, pp. 2578–2581.
Google Scholar
C. Marro, Y. Mahieux, and K. Simmer, “Analysis of noise reduction and dereverberation techniques based on microphone arrays with postfiltering,” IEEE Trans. Speech and Audio Processing, vol. 6, no. 3, pp. 240–259, 1998.
Article Google Scholar
J. Bitzer, K. Simmer, and K.-D. Kammeyer, “Multi-microphone noise reduction by post-filter and superdirective beamformer,” in Proc. IWAENC, 1999, pp. 100–103.
Google Scholar
R. Martin, “Small microphone arrays with postfilters for noise and acoustic echo reduction,” in Microphone Arrays (M. Brandstein and D. B. Ward, eds.), Springer-Verlag, Berlin, 2001.
Google Scholar
R. Balan and J. Rosca, “Microphone array speech enhancement by Bayesian estimation of spectral amplitude and phase,” in Proc. IEEE Sensor Array and Multichannel Signal Processing Workshop, 2002.
Google Scholar
T. Lotter, C. Benien, and P. Vary, “Multichannel speech enhancement using Bayesian spectral amplitude estimation,” in Proc. IEEE ICASSP, 2003.
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Communication Acoustics, Ruhr-Universität Bochum, Bochum, 44780, Germany
Rainer Martin

Authors

Rainer Martin
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Martin, R. (2005). Statistical Methods for the Enhancement of Noisy Speech. In: Speech Enhancement. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-27489-8_3

Download citation

DOI: https://doi.org/10.1007/3-540-27489-8_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24039-6
Online ISBN: 978-3-540-27489-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Statistical Methods for the Enhancement of Noisy Speech

Abstract

Chapter PDF

Similar content being viewed by others

A brief overview of speech enhancement with linear filtering

Maximum A Posteriori Spectral Estimation with Source Log-Spectral Priors for Multichannel Speech Enhancement

Acoustic Signal Processing

Keywords

References

Author information

Authors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Statistical Methods for the Enhancement of Noisy Speech

Abstract

Chapter PDF

Similar content being viewed by others

A brief overview of speech enhancement with linear filtering

Maximum A Posteriori Spectral Estimation with Source Log-Spectral Priors for Multichannel Speech Enhancement

Acoustic Signal Processing

Keywords

References

Author information

Authors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation