Abstract
This chapter describes the principles of audio coding, with emphasis on low-delay audio coding. Audio coding exploits psycho-acoustic masking effects, which are computed by psycho-acoustic models. Filter banks are used to exploit these masking effects and to obtain a good compression ratio. The principles of psycho-acoustics and of filter bank design are presented. Finally, a new low-delay audio coding scheme based on prediction is introduced.
Copyright information
© 2004 Kluwer Academic Publishers
About this chapter
Cite this chapter
Schuller, G. (2004). Audio Coding. In: Huang, Y., Benesty, J. (eds) Audio Signal Processing for Next-Generation Multimedia Communication Systems. Springer, Boston, MA. https://doi.org/10.1007/1-4020-7769-6_11
DOI: https://doi.org/10.1007/1-4020-7769-6_11
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4020-7768-5
Online ISBN: 978-1-4020-7769-2
eBook Packages: Springer Book Archive