Digital filter-banks are an integral part of many speech and audio processing algorithms used in today’s communication systems. They are commonly employed for adaptive subband filtering, for example, to perform acoustic echo cancellation in hands-free communication devices or multi-channel dynamic-range compression in digital hearing aids, e.g., [34,81]. Another frequent task is speech enhancement by noise reduction, e.g., [4,81]. This eases the communication in adverse environments where acoustic background noise impairs the intelligibility and fidelity of the transmitted speech signal. A noise reduction system is also beneficial to improve the performance of speech coding and speech recognition systems, e.g., [41].
The choice of the filter-bank has a significant influence on the performance of such systems in terms of signal quality, computational complexity, and signal delay. Accordingly, the filter-bank design has to fulfill different, partly conflicting requirements in dependence of the considered application.
One prominent example is speech and audio processing for digital hearing aids. The restricted capacity of the battery and the small size of the chip set limit the available computational power. Moreover, a low overall processing delay is required to avoid disturbing artifacts and echo effects, e.g., [1,75]. Such distortions can occur when the hearing aid user is talking. In this case, the processed speech can interfere with the original speech signal, which reaches the cochlea with minimal delay via bone conduction or through the hearing aid vent. To prevent this, the algorithmic signal delay of the filter-bank used for the signal enhancement must be considerably lower than the tolerable processing delay, i.e., the latency between the analog input and output signal of the system. In addition, a filter-bank with non-uniform time-frequency resolution, which is similar to that of the human auditory system, is desirable to perform multi-channel dynamic-range compression and noise reduction with a small number of frequency bands.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
J. Agnew, J. M. Thornton: Just noticeable and objectionable group delays in digital hearing aids, Journal of the American Academy of Audiology, 11(6), 330–336, 2000.
K. G. Beauchamp: Walsh Functions and Their Applications, London, GB: Academic Press, 1975.
M. G. Bellanger, G. Bonnerot, M. Coudreuse: Digital filtering by polyphase network: application to sample-rate alteration and filter banks, IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-24(2), 109–114, April 1976.
J. Benesty, S. Makino, J. Chen: Speech Enhancement, Berlin, Germany: Springer, 2005.
S. F. Boll: Suppression of acoustic noise in speech using spectral subtraction, IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-27(2), 113–120, April 1979.
C. Braccini, A. V. Oppenheim: Unequal bandwidth spectral analysis using digital frequency warping, IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-22(4), 236–244, August 1974.
C. S. Burrus, R. A. Gopinath, H. Guo: Introduction to Wavelets and Wavelet Transforms: A Primer, Upper Saddle River, NJ, USA: Prentice-Hall, 1998.
O. Cappé: Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor, IEEE Trans. on Speech and Audio Processing, 2(2), 345–349, April 1994.
I. Cohen: Enhancement of speech using Bark-scaled wavelet packet decomposition, Proc. EUROSPEECH ’01, 1933–1936, Aalborg, Denmark, September 2001.
A. G. Constantinides: Frequency transformation for digital filters, IEE Electronic Letters, 3(11), 487–489, November 1967.
R. E. Crochiere: A weighted overlap-add method of short-time Fourier analysis/synthesis, IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-28(10), 99–102, February 1980.
R. E. Crochiere, L. R. Rabiner: Multirate Digital Signal Processing, Upper Saddle River, NJ, USA: Prentice-Hall, 1983.
Z. Cvetković, J. D. Johnston: Nonuniform oversampled filter banks for audio signal processing, IEEE Trans. on Speech and Audio Processing, 11(5), 393–399, September 2003.
R. Czarnach: Recursive processing by noncausal digital filters, IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-30(3), 363–370, June 1982.
I. Daubechies, W. Sweldens: Factoring Wavelet transforms into lifting steps, Journal of Fourier Analysis and Applications, 4(3), 247–269, May 1998.
Y. Deng, V. J. Mathews, B. Farhang-Boroujeny: Low-delay nonuniform pseudo-QMF banks with application to speech enhancement, IEEE Trans. on Signal Processing, 55(5), 2110–2121, May 2007.
G. Doblinger: An efficient algorithm for uniform and nonuniform digital filter banks, Proc. ISCAS ’91, 1, 646–649, Singapore, June 1991.
B. Dumitrescu, R. Bregović, T. Saramäki, R. Niemistö: Low-delay nonuniform oversampled filterbanks for acoustic echo control, Proc. EUSIPCO ’06, Florence, Italy, September 2006.
A. Engelsberg: Transformation-Based Systems for Single-Channel Noise Reduction in Speech Signals, PhD thesis, Christian-Albrechts University, Ulrich Heute (ed.), Kiel, Germany: Shaker Verlag, 1998 (in German).
Y. Ephraim, D. Malah: Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator, IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-32(6), 1109–1121, December 1984.
C. Feldbauer, G. Kubin: Critically sampled frequency-warped perfect reconstruction filterbank, Proc. ECCTD ’03, Krakow, Poland, September 2003.
S. Franz, S. K. Mitra, J. C. Schmidt, G. Doblinger: Warped discrete Fourier transform: a new concept in digital signal processing, Proc. ICASSP ’02, 2, 1205–1208, Orlando, FL, USA, May 2002.
E. Galijašević: Allpass-Based Near-Perfect-Reconstruction Filter Banks, PhD thesis, Christian-Albrechts University, Ulrich Heute (ed.), Kiel, Germany: Shaker Verlag, 2002.
E. Galijašević, J. Kliewer: Design of allpass-based non-uniform oversampled DFT filter banks, Proc. ICASSP ’02, 2, 1181–1184, Orlando, FL, USA, May 2002.
R. C. Gonzalez, P. Wintz: Digital Image Processing, London, GB: Addison-Wesley, 1977.
T. Gülzow, A. Engelsberg, U. Heute: Comparison of a discrete Wavelet transformation and a nonuniform polyphase filterbank applied to spectral-subtraction speech enhancement, Signal Processing, Elsevier, 64(1), 5–19, January 1998.
T. Gülzow, T. Ludwig, U. Heute: Spectral-subtraction speech enhancement in multirate systems with and without non-uniform and adaptive bandwidths, Signal Processing, Elsevier, 83(8), 1613–1631, August 2003.
H. Gustafsson, S. E. Nordholm, I. Claesson: Spectral subtraction using reduced delay convolution and adaptive-averaging, IEEE Trans. on Speech and Audio Processing, 9(8), 799–807, November 2001.
A. Härmä: Implementation of recursive filters having delay free loops, Proc. ICASSP ’98, 3, 1261–1264, Seattle, WA, USA, May 1998.
A. Härmä: Implementation of frequency-warped recursive filters, Signal Processing, Elsevier, 80(3), 543–548, March 2000.
U. Heute: Noise reduction, in E. Hänsler, G. Schmidt (eds.), Topics in Acoustic Echo and Noise Control, 325–384, Berlin, Germany: Springer, 2006.
Y. Hu, P. C. Loizou: Subjective comparison of speech enhancement algorithms, Proc. ICASSP ’06, Tolouse, France, May 2006.
ITU-T Rec. P.862: Perceptual evaluation of speech quality (PESQ): an objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs, February 2001.
M. Kahrs, K. Brandenburg: Applications of Digital Signal Processing to Audio and Acoustics, Boston, MA, USA: Kluwer, 1998.
M. Kappelan: Characteristics of Allpass Chains and their Application for Non-Equispaced Spectral Analysis and Synthesis, PhD thesis, RWTH Aachen University, Peter Vary (ed.), Aachener Beiträge zu Digitalen Nachrichtensystemen, Aachen, Germany: Mainz Verlag, 1998 (in German).
M. Kappelan, B. Strauß, P. Vary: Flexible nonuniform filter banks using allpass transformation of multiple order, Proc. EUSIPCO ’96, 3, 1745–1748, Trieste, Italy, 1996.
T. Karp, A. Mertins: Lifting schemes for biorthogonal modulated filter banks, Proc. of Intl. Conf. on Digital Signal Processing (DSP) ’97, 1, 443–446, Santorini, Greece, July 1997.
T. Karp, A. Mertins, G. Schuller: Efficient biorthogonal cosine-modulated filter banks, Signal Processing, Elsevier, 81(5), 997–1016, May 2001.
J. M. Kates, K. H. Arehart: Multichannel dynamic-range compression using digital frequency warping, EURASIP Journal on Applied Signal Processing, 18, 3003-3014, 2005.
J. Kliewer, A. Mertins: Oversampled cosine-modulated filter banks with arbitrary system delay, IEEE Trans. on Signal Processing, 46(4), 941–955, April 1998.
A. M. Kondoz: Digital Speech - Coding for Low Bit Rate Communication Systems, Chichester, UK: Wiley, 2004.
T.-Y. Leou, J. K. Aggarwal: Recursive implementation of LTV filters – frozen-time transfer function versus generalized transfer function, Proc. of the IEEE, 72(7), 980–981, July 1984.
J. Li, T. Q. Nguyen, S. Tantaratana: A simple design for near-perfect-reconstruction nonuniform filter banks, IEEE Trans. on Signal Processing, 45(8), 2105–2109, August 1997.
H. W. Löllmann, P. Vary: Efficient non-uniform filter-bank equalizer, Proc. EUSIPCO ’05, Antalya, Turkey, September 2005.
H. W. Löllmann, P. Vary: Generalized filter-bank equalizer for noise reduction with reduced signal delay, Proc. INTERSPEECH ’05, 2105–2108, Lisbon, Portugal, September 2005.
H. W. Löllmann, P. Vary: Low delay filter for adaptive noise reduction, Proc. IWAENC ’05, 205–208, Eindhoven, The Netherlands, September 2005.
H. W. Löllmann, P. Vary: A warped low delay filter for speech enhancement, Proc. IWAENC ’06, Paris, France, September 2006.
H. W. Löllmann, P. Vary: Parametric phase equalizers for warped filter-banks, Proc. EUSIPCO ’06, Florence, Italy, September 2006.
H. W. Löllmann, P. Vary: Improved design of oversampled allpass transformed DFT filter-banks with near-perfect reconstruction, Proc. EUSIPCO ’07, Poznan, Poland, September 2007.
T. Lotter, P. Vary: Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model, EURASIP Journal on Applied Signal Processing, 7, 1110–1126, May 2005.
A. Makur, S. K. Mitra: Warped discrete Fourier transform: theory and application, IEEE Trans. on Circuits and Systems I, 48(9), 1086–1093, September 2001.
D. Malah, R. V. Cox, A. J. Accardi: Tracking speech-presence uncertainty to improve speech enhancement in non-stationary noise environments, Proc. ICASSP ’99, 789–792, Phoenix, AR, USA, May 1999.
R. Martin, H.-G. Kang, R. V. Cox: Low delay analysis synthesis schemes for joint speech enhancement and low bit rate speech coding, Proc. EUROSPEECH ’99, 3, 1463–1466, Budapest, Hungary, 1999.
R. Martin: Noise power spectral density estimation based on optimal smoothing and minimum statistics, IEEE Trans. on Speech and Audio Processing, 9(5), 504–512, July 2001.
R. Martin: Statistical methods for the enhancement of noisy speech, in J. Benesty, S. Makino, J. Chen (eds.), Speech Enhancement, 43–65, Berlin, Germany: Springer, 2005.
S. K. Mitra, C. D. Creusere, H. Babic: A novel implementation of perfect reconstruction QMF banks using IIR filters for infinite length signals, Proc. ISCAS ’92, 2312–2315, San Diego, CA, USA, May 1992.
D. R. Morgan, J. C. Thi: A delayless subband adaptive filter architecture, IEEE Trans. on Signal Processing, 43(8), 1819–1830, August 1995.
A. V. Oppenheim, D. Johnson, K. Steiglitz: Computation of spectra with unequal resolution using the fast Fourier transform, Proc. of the IEEE, 59(2), 299–301, February 1971.
A. V. Oppenheim, R. W. Schafer, J. R. Buck: Discrete-Time Signal Processing, 2nd edition, Upper Saddle River, NJ, USA: Prentice-Hall, 1999.
T. W. Parks, C. S. Burrus: Digital Filter Design, Chichester, GB: Wiley, 1987.
A. Petrovsky, M. Parfieniuk, A. Borowicz: Warped DFT based perceptual noise reduction system, Convention Paper of Audio Engineering Society, Berlin, Germany, May 2004.
W. H. Press, S. A. Teukolsky, W. T. Vetterling, B. P. Flannery: Numerical Recipes in C, 2nd edition, Cambridge, GB: Cambridge University Press, 1992.
J. G. Proakis, D. G. Manolakis: Digital Signal Processing: Principles, Algorithms, and Applications, 3rd edition, Upper Saddle River, NJ, USA: Prentice-Hall, 1996.
S. R. Quackenbush and T. P. Barnwell III and M. A. Clements: Objective Measures of Speech Quality, Upper Saddle River, NJ, USA: Prentice-Hall, 1988.
C. M. Rader: An improved algorithm for high-speed autocorrelation with application to spectral estimation, IEEE Trans. on Audio and Electroacoustics, 18(4), 439–441, December 1970.
K. R. Rao and P. Yip: Discrete Cosine Transform, New York, NY, USA: Academic Press, 1990.
M. Renfors, T. Saramäki: Recursive Nth-band digital filters – part I: design and properties, IEEE Trans. on Circuits and Systems, 34(1), 24–39, January 1987.
M. Schönle, C. Beaugeant, K. Steinert, H. W. Löllmann, B. Sauert, P. Vary: Hands-free audio and its application to telecommunication terminals, Proc. of Intl. Conf. on Audio for Mobile and Handheld Devices (AES), Seoul, Korea, September 2006.
G. D. T. Schuller and T. Karp: Modulated filter banks with arbitrary system delay: efficient implementation and the time-varying case, IEEE Trans. on Signal Processing, 48(3), 737–748, March 2000.
H. W. Schüßler, W. Winkelnkemper: Variable digital filters, Archiv der Elektrischen Übertragung (AEÜ), 24(11), 524–525, 1970.
H. W. Schüßler: Implementation of variable digital filters, Proc. EUSIPCO ’80, 123–129, Lausanne, Switzerland, September 1980.
B. Shankar M. R., A. Makur: Allpass delay chain-based IIR PR filterbank and its application to multiple description subband coding, IEEE Trans. on Signal Processing, 50(4), 814–823, April 2002.
J. O. Smith, J. S. Abel: Bark and ERB bilinear transforms, IEEE Trans. on Speech and Audio Processing, 7(6), 697–708, November 1999.
K. Steiglitz: A note on variable recursive digital filters, IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-28(1), 111–112, February 1980.
M. A. Stone, B. C. J. Moore: Tolerable hearing aid delays II: estimation of limits imposed during speech production, Ear and Hearing, 32(4), 325–338, 2002.
W. Sweldens: The lifting scheme: a custom-design construction of biorthogonal wavelets, Applied and Computational Harmonic Analysis, 3(2), 186–200, 1996.
P. P. Vaidyanathan: Multirate Systems and Filter Banks, Upper Saddle River, NJ, USA: Prentice-Hall, 1993.
P. Vary: On the design of digital filter banks based on a modified principle of polyphase, AEÜ (Archive for Electronics and Communications), 33, 293–300, 1979.
P. Vary: Digital filter banks with unequal resolution, Short Communication Digest of European Signal Processing Conf. (EUSIPCO), 41–42, Lausanne, Switzerland, September 1980.
P. Vary: Noise suppression by spectral magnitude estimation – mechanism and theoretical limits, Signal Processing, Elsevier, 8(4), 387–400, July 1985.
P. Vary, R. Martin: Digital Speech Transmission: Enhancement, Coding and Error Concealment, Chichester, GB: Wiley, 2006.
P. Vary: An adaptive filter-bank equalizer for speech enhancement, Signal Processing, Elsevier, 86(6), 1206–1214, June 2006.
M. Vetterli, J. Kovačević: Wavelets and Subband Coding, Upper Saddle River, NJ, USA: Prentice-Hall, 1995.
E. Zwicker, H. Fastl: Psychoacoustics: Facts and Models, 2nd edition, Berlin, Germany: Springer, 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Löllmann, H.W., Vary, P. (2008). Low Delay Filter-Banks for Speech and Audio Processing. In: Hänsler, E., Schmidt, G. (eds) Speech and Audio Processing in Adverse Environments. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70602-1_2
Download citation
DOI: https://doi.org/10.1007/978-3-540-70602-1_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70601-4
Online ISBN: 978-3-540-70602-1
eBook Packages: EngineeringEngineering (R0)