Low Delay Filter-Banks for Speech and Audio Processing

Löllmann, Heinrich W.; Vary, Peter

doi:10.1007/978-3-540-70602-1_2

Part of the book series: Signals and Communication Technology (SCT)

Digital filter-banks are an integral part of many speech and audio processing algorithms used in today’s communication systems. They are commonly employed for adaptive subband filtering, for example, to perform acoustic echo cancellation in hands-free communication devices or multi-channel dynamic-range compression in digital hearing aids, e.g., [34,81]. Another frequent task is speech enhancement by noise reduction, e.g., [4,81], which eases communication in adverse environments where acoustic background noise impairs the intelligibility and fidelity of the transmitted speech signal. A noise reduction system can also improve the performance of speech coding and speech recognition systems, e.g., [41].

The choice of filter-bank has a significant influence on the performance of such systems in terms of signal quality, computational complexity, and signal delay. Accordingly, the filter-bank design has to fulfill different, partly conflicting requirements depending on the application considered.

One prominent example is speech and audio processing for digital hearing aids. The restricted battery capacity and the small size of the chipset limit the available computational power. Moreover, a low overall processing delay is required to avoid disturbing artifacts and echo effects, e.g., [1,75]. Such distortions can occur when the hearing aid user is talking: the processed speech can then interfere with the original speech signal, which reaches the cochlea with minimal delay via bone conduction or through the hearing aid vent. To prevent this, the algorithmic signal delay of the filter-bank used for signal enhancement must be considerably lower than the tolerable processing delay, i.e., the latency between the analog input and output signals of the system. In addition, a filter-bank with a non-uniform time-frequency resolution similar to that of the human auditory system is desirable, so that multi-channel dynamic-range compression and noise reduction can be performed with a small number of frequency bands.
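The delay requirement above can be made concrete with a small calculation. The following Python sketch is an illustration only and is not taken from the chapter: it assumes that the analysis and synthesis stages each behave like a linear-phase FIR filter of the same length, so that each contributes a delay of (L - 1)/2 samples. The function names, the delay budget, and the share of the budget granted to the filter-bank are hypothetical parameters chosen for the example.

```python
def fir_cascade_delay_ms(prototype_length: int, sample_rate_hz: float) -> float:
    """Delay in milliseconds of two cascaded linear-phase FIR filters.

    Assumption for this sketch: the analysis and the synthesis stage each
    act as a linear-phase FIR filter of length `prototype_length`.
    """
    if prototype_length < 1 or sample_rate_hz <= 0:
        raise ValueError("need prototype_length >= 1 and sample_rate_hz > 0")
    # A linear-phase FIR filter of length L delays the signal by (L - 1) / 2
    # samples; two such filters in cascade add up to L - 1 samples.
    delay_samples = prototype_length - 1
    return 1000.0 * delay_samples / sample_rate_hz


def fits_delay_budget(prototype_length: int, sample_rate_hz: float,
                      budget_ms: float, share: float = 0.5) -> bool:
    """Check whether the filter-bank delay stays within a share of the budget.

    `budget_ms` is the tolerable overall processing delay and `share` is the
    fraction reserved for the filter-bank; both are hypothetical inputs.
    """
    delay_ms = fir_cascade_delay_ms(prototype_length, sample_rate_hz)
    return delay_ms <= share * budget_ms


if __name__ == "__main__":
    for length in (65, 257):
        delay = fir_cascade_delay_ms(length, 16000.0)
        ok = fits_delay_budget(length, 16000.0, budget_ms=10.0)
        print(f"L={length}: {delay:.1f} ms, within budget: {ok}")
```

Under these assumptions, a longer prototype filter improves frequency selectivity but increases the delay linearly, which is the conflict between signal quality and signal delay described above.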

References

J. Agnew, J. M. Thornton: Just noticeable and objectionable group delays in digital hearing aids, Journal of the American Academy of Audiology, 11(6), 330–336, 2000.
K. G. Beauchamp: Walsh Functions and Their Applications, London, GB: Academic Press, 1975.
M. G. Bellanger, G. Bonnerot, M. Coudreuse: Digital filtering by polyphase network: application to sample-rate alteration and filter banks, IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-24(2), 109–114, April 1976.
J. Benesty, S. Makino, J. Chen: Speech Enhancement, Berlin, Germany: Springer, 2005.
S. F. Boll: Suppression of acoustic noise in speech using spectral subtraction, IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-27(2), 113–120, April 1979.
C. Braccini, A. V. Oppenheim: Unequal bandwidth spectral analysis using digital frequency warping, IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-22(4), 236–244, August 1974.
C. S. Burrus, R. A. Gopinath, H. Guo: Introduction to Wavelets and Wavelet Transforms: A Primer, Upper Saddle River, NJ, USA: Prentice-Hall, 1998.
O. Cappé: Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor, IEEE Trans. on Speech and Audio Processing, 2(2), 345–349, April 1994.
I. Cohen: Enhancement of speech using Bark-scaled wavelet packet decomposition, Proc. EUROSPEECH ’01, 1933–1936, Aalborg, Denmark, September 2001.
A. G. Constantinides: Frequency transformation for digital filters, IEE Electronics Letters, 3(11), 487–489, November 1967.
R. E. Crochiere: A weighted overlap-add method of short-time Fourier analysis/synthesis, IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-28(1), 99–102, February 1980.
R. E. Crochiere, L. R. Rabiner: Multirate Digital Signal Processing, Upper Saddle River, NJ, USA: Prentice-Hall, 1983.
Z. Cvetković, J. D. Johnston: Nonuniform oversampled filter banks for audio signal processing, IEEE Trans. on Speech and Audio Processing, 11(5), 393–399, September 2003.
R. Czarnach: Recursive processing by noncausal digital filters, IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-30(3), 363–370, June 1982.
I. Daubechies, W. Sweldens: Factoring Wavelet transforms into lifting steps, Journal of Fourier Analysis and Applications, 4(3), 247–269, May 1998.
Y. Deng, V. J. Mathews, B. Farhang-Boroujeny: Low-delay nonuniform pseudo-QMF banks with application to speech enhancement, IEEE Trans. on Signal Processing, 55(5), 2110–2121, May 2007.
G. Doblinger: An efficient algorithm for uniform and nonuniform digital filter banks, Proc. ISCAS ’91, 1, 646–649, Singapore, June 1991.
B. Dumitrescu, R. Bregović, T. Saramäki, R. Niemistö: Low-delay nonuniform oversampled filterbanks for acoustic echo control, Proc. EUSIPCO ’06, Florence, Italy, September 2006.
A. Engelsberg: Transformation-Based Systems for Single-Channel Noise Reduction in Speech Signals, PhD thesis, Christian-Albrechts University, Ulrich Heute (ed.), Kiel, Germany: Shaker Verlag, 1998 (in German).
Y. Ephraim, D. Malah: Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator, IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-32(6), 1109–1121, December 1984.
C. Feldbauer, G. Kubin: Critically sampled frequency-warped perfect reconstruction filterbank, Proc. ECCTD ’03, Krakow, Poland, September 2003.
S. Franz, S. K. Mitra, J. C. Schmidt, G. Doblinger: Warped discrete Fourier transform: a new concept in digital signal processing, Proc. ICASSP ’02, 2, 1205–1208, Orlando, FL, USA, May 2002.
E. Galijašević: Allpass-Based Near-Perfect-Reconstruction Filter Banks, PhD thesis, Christian-Albrechts University, Ulrich Heute (ed.), Kiel, Germany: Shaker Verlag, 2002.
E. Galijašević, J. Kliewer: Design of allpass-based non-uniform oversampled DFT filter banks, Proc. ICASSP ’02, 2, 1181–1184, Orlando, FL, USA, May 2002.
R. C. Gonzalez, P. Wintz: Digital Image Processing, London, GB: Addison-Wesley, 1977.
T. Gülzow, A. Engelsberg, U. Heute: Comparison of a discrete Wavelet transformation and a nonuniform polyphase filterbank applied to spectral-subtraction speech enhancement, Signal Processing, Elsevier, 64(1), 5–19, January 1998.
T. Gülzow, T. Ludwig, U. Heute: Spectral-subtraction speech enhancement in multirate systems with and without non-uniform and adaptive bandwidths, Signal Processing, Elsevier, 83(8), 1613–1631, August 2003.
H. Gustafsson, S. E. Nordholm, I. Claesson: Spectral subtraction using reduced delay convolution and adaptive-averaging, IEEE Trans. on Speech and Audio Processing, 9(8), 799–807, November 2001.
A. Härmä: Implementation of recursive filters having delay free loops, Proc. ICASSP ’98, 3, 1261–1264, Seattle, WA, USA, May 1998.
A. Härmä: Implementation of frequency-warped recursive filters, Signal Processing, Elsevier, 80(3), 543–548, March 2000.
U. Heute: Noise reduction, in E. Hänsler, G. Schmidt (eds.), Topics in Acoustic Echo and Noise Control, 325–384, Berlin, Germany: Springer, 2006.
Y. Hu, P. C. Loizou: Subjective comparison of speech enhancement algorithms, Proc. ICASSP ’06, Toulouse, France, May 2006.
ITU-T Rec. P.862: Perceptual evaluation of speech quality (PESQ): an objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs, February 2001.
M. Kahrs, K. Brandenburg: Applications of Digital Signal Processing to Audio and Acoustics, Boston, MA, USA: Kluwer, 1998.
M. Kappelan: Characteristics of Allpass Chains and their Application for Non-Equispaced Spectral Analysis and Synthesis, PhD thesis, RWTH Aachen University, Peter Vary (ed.), Aachener Beiträge zu Digitalen Nachrichtensystemen, Aachen, Germany: Mainz Verlag, 1998 (in German).
M. Kappelan, B. Strauß, P. Vary: Flexible nonuniform filter banks using allpass transformation of multiple order, Proc. EUSIPCO ’96, 3, 1745–1748, Trieste, Italy, 1996.
T. Karp, A. Mertins: Lifting schemes for biorthogonal modulated filter banks, Proc. of Intl. Conf. on Digital Signal Processing (DSP) ’97, 1, 443–446, Santorini, Greece, July 1997.
T. Karp, A. Mertins, G. Schuller: Efficient biorthogonal cosine-modulated filter banks, Signal Processing, Elsevier, 81(5), 997–1016, May 2001.
J. M. Kates, K. H. Arehart: Multichannel dynamic-range compression using digital frequency warping, EURASIP Journal on Applied Signal Processing, 18, 3003–3014, 2005.
J. Kliewer, A. Mertins: Oversampled cosine-modulated filter banks with arbitrary system delay, IEEE Trans. on Signal Processing, 46(4), 941–955, April 1998.
A. M. Kondoz: Digital Speech: Coding for Low Bit Rate Communication Systems, Chichester, UK: Wiley, 2004.
T.-Y. Leou, J. K. Aggarwal: Recursive implementation of LTV filters – frozen-time transfer function versus generalized transfer function, Proc. of the IEEE, 72(7), 980–981, July 1984.
J. Li, T. Q. Nguyen, S. Tantaratana: A simple design for near-perfect-reconstruction nonuniform filter banks, IEEE Trans. on Signal Processing, 45(8), 2105–2109, August 1997.
H. W. Löllmann, P. Vary: Efficient non-uniform filter-bank equalizer, Proc. EUSIPCO ’05, Antalya, Turkey, September 2005.
H. W. Löllmann, P. Vary: Generalized filter-bank equalizer for noise reduction with reduced signal delay, Proc. INTERSPEECH ’05, 2105–2108, Lisbon, Portugal, September 2005.
H. W. Löllmann, P. Vary: Low delay filter for adaptive noise reduction, Proc. IWAENC ’05, 205–208, Eindhoven, The Netherlands, September 2005.
H. W. Löllmann, P. Vary: A warped low delay filter for speech enhancement, Proc. IWAENC ’06, Paris, France, September 2006.
H. W. Löllmann, P. Vary: Parametric phase equalizers for warped filter-banks, Proc. EUSIPCO ’06, Florence, Italy, September 2006.
H. W. Löllmann, P. Vary: Improved design of oversampled allpass transformed DFT filter-banks with near-perfect reconstruction, Proc. EUSIPCO ’07, Poznan, Poland, September 2007.
T. Lotter, P. Vary: Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model, EURASIP Journal on Applied Signal Processing, 7, 1110–1126, May 2005.
A. Makur, S. K. Mitra: Warped discrete Fourier transform: theory and application, IEEE Trans. on Circuits and Systems I, 48(9), 1086–1093, September 2001.
D. Malah, R. V. Cox, A. J. Accardi: Tracking speech-presence uncertainty to improve speech enhancement in non-stationary noise environments, Proc. ICASSP ’99, 789–792, Phoenix, AZ, USA, May 1999.
R. Martin, H.-G. Kang, R. V. Cox: Low delay analysis synthesis schemes for joint speech enhancement and low bit rate speech coding, Proc. EUROSPEECH ’99, 3, 1463–1466, Budapest, Hungary, 1999.
R. Martin: Noise power spectral density estimation based on optimal smoothing and minimum statistics, IEEE Trans. on Speech and Audio Processing, 9(5), 504–512, July 2001.
R. Martin: Statistical methods for the enhancement of noisy speech, in J. Benesty, S. Makino, J. Chen (eds.), Speech Enhancement, 43–65, Berlin, Germany: Springer, 2005.
S. K. Mitra, C. D. Creusere, H. Babic: A novel implementation of perfect reconstruction QMF banks using IIR filters for infinite length signals, Proc. ISCAS ’92, 2312–2315, San Diego, CA, USA, May 1992.
D. R. Morgan, J. C. Thi: A delayless subband adaptive filter architecture, IEEE Trans. on Signal Processing, 43(8), 1819–1830, August 1995.
A. V. Oppenheim, D. Johnson, K. Steiglitz: Computation of spectra with unequal resolution using the fast Fourier transform, Proc. of the IEEE, 59(2), 299–301, February 1971.
A. V. Oppenheim, R. W. Schafer, J. R. Buck: Discrete-Time Signal Processing, 2nd edition, Upper Saddle River, NJ, USA: Prentice-Hall, 1999.
T. W. Parks, C. S. Burrus: Digital Filter Design, Chichester, GB: Wiley, 1987.
A. Petrovsky, M. Parfieniuk, A. Borowicz: Warped DFT based perceptual noise reduction system, Convention Paper of Audio Engineering Society, Berlin, Germany, May 2004.
W. H. Press, S. A. Teukolsky, W. T. Vetterling, B. P. Flannery: Numerical Recipes in C, 2nd edition, Cambridge, GB: Cambridge University Press, 1992.
J. G. Proakis, D. G. Manolakis: Digital Signal Processing: Principles, Algorithms, and Applications, 3rd edition, Upper Saddle River, NJ, USA: Prentice-Hall, 1996.
S. R. Quackenbush, T. P. Barnwell III, M. A. Clements: Objective Measures of Speech Quality, Upper Saddle River, NJ, USA: Prentice-Hall, 1988.
C. M. Rader: An improved algorithm for high-speed autocorrelation with application to spectral estimation, IEEE Trans. on Audio and Electroacoustics, 18(4), 439–441, December 1970.
K. R. Rao, P. Yip: Discrete Cosine Transform, New York, NY, USA: Academic Press, 1990.
M. Renfors, T. Saramäki: Recursive Nth-band digital filters – part I: design and properties, IEEE Trans. on Circuits and Systems, 34(1), 24–39, January 1987.
M. Schönle, C. Beaugeant, K. Steinert, H. W. Löllmann, B. Sauert, P. Vary: Hands-free audio and its application to telecommunication terminals, Proc. of Intl. Conf. on Audio for Mobile and Handheld Devices (AES), Seoul, Korea, September 2006.
G. D. T. Schuller, T. Karp: Modulated filter banks with arbitrary system delay: efficient implementation and the time-varying case, IEEE Trans. on Signal Processing, 48(3), 737–748, March 2000.
H. W. Schüßler, W. Winkelnkemper: Variable digital filters, Archiv der Elektrischen Übertragung (AEÜ), 24(11), 524–525, 1970.
H. W. Schüßler: Implementation of variable digital filters, Proc. EUSIPCO ’80, 123–129, Lausanne, Switzerland, September 1980.
B. Shankar M. R., A. Makur: Allpass delay chain-based IIR PR filterbank and its application to multiple description subband coding, IEEE Trans. on Signal Processing, 50(4), 814–823, April 2002.
J. O. Smith, J. S. Abel: Bark and ERB bilinear transforms, IEEE Trans. on Speech and Audio Processing, 7(6), 697–708, November 1999.
K. Steiglitz: A note on variable recursive digital filters, IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-28(1), 111–112, February 1980.
M. A. Stone, B. C. J. Moore: Tolerable hearing aid delays II: estimation of limits imposed during speech production, Ear and Hearing, 23(4), 325–338, 2002.
W. Sweldens: The lifting scheme: a custom-design construction of biorthogonal wavelets, Applied and Computational Harmonic Analysis, 3(2), 186–200, 1996.
P. P. Vaidyanathan: Multirate Systems and Filter Banks, Upper Saddle River, NJ, USA: Prentice-Hall, 1993.
P. Vary: On the design of digital filter banks based on a modified principle of polyphase, AEÜ (Archive for Electronics and Communications), 33, 293–300, 1979.
P. Vary: Digital filter banks with unequal resolution, Short Communication Digest of European Signal Processing Conf. (EUSIPCO), 41–42, Lausanne, Switzerland, September 1980.
P. Vary: Noise suppression by spectral magnitude estimation – mechanism and theoretical limits, Signal Processing, Elsevier, 8(4), 387–400, July 1985.
P. Vary, R. Martin: Digital Speech Transmission: Enhancement, Coding and Error Concealment, Chichester, GB: Wiley, 2006.
P. Vary: An adaptive filter-bank equalizer for speech enhancement, Signal Processing, Elsevier, 86(6), 1206–1214, June 2006.
M. Vetterli, J. Kovačević: Wavelets and Subband Coding, Upper Saddle River, NJ, USA: Prentice-Hall, 1995.
E. Zwicker, H. Fastl: Psychoacoustics: Facts and Models, 2nd edition, Berlin, Germany: Springer, 1999.

Author information

Authors and Affiliations

Institute of Communication Systems and Data Processing, RWTH Aachen University, Germany
Heinrich W. Löllmann & Peter Vary

Editor information

Editors and Affiliations

Technische Universität, Darmstadt, Germany
Eberhard Hänsler
Harman/Becker Automotive Systems, Ulm, Germany
Gerhard Schmidt

About this chapter

Cite this chapter

Löllmann, H.W., Vary, P. (2008). Low Delay Filter-Banks for Speech and Audio Processing. In: Hänsler, E., Schmidt, G. (eds) Speech and Audio Processing in Adverse Environments. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70602-1_2

DOI: https://doi.org/10.1007/978-3-540-70602-1_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70601-4
Online ISBN: 978-3-540-70602-1
eBook Packages: Engineering, Engineering (R0)
