Audio Coding

Schuler, Gerald

doi:10.1007/1-4020-7769-6_11

Gerald Schuler³

1010 Accesses

Abstract

In this chapter, the principles of audio coding will be described, with emphasis on low delay audio coding. Audio coding is based on psycho-acoustic masking effects, as computed by psycho-acoustic models. To use the masking effects and to obtain a good compression ratio, filter banks are used. The principles of psycho-acoustics and of the design of filter banks are presented. Further a new low delay audio coding scheme based on prediction is shown.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Softcover Book: USD 249.99; Price excludes VAT (USA)

Hardcover Book: USD 249.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Audio Transmission

Acoustic Signal Processing

Fourier-Time-Transformation (FTT), Analysis of Sound and Auditory Perception

References

Technical Council of the AES: CD “Perceptual audio coders: what to listen for,” Audio Engineering Society, New York.
Google Scholar
N. Kitawaki and K. Itoh, “Pure delay effects on speech quality in telecommunications,” IEEE J. Sel. Areas in Comm., vol. 9, pp. 586–593, May 1991.
Google Scholar
J.-H. Chen, R. V. Cox, Y.-C. Lin, N. Jayant, and M. J. Melchner, “A low-delay CELP coder for the CCITT 16 kb/s speech coding standard,” IEEE J. Sel Areas in Comm., vol. 10, pp. 830–849, June 1992.
Google Scholar
B. Edler and G. Schuller, “Audio coding using a psychoacoustic pre-and post-filter,” ICASSP 2000, Istanbul, Turkey, pp. 11–881–884.
Google Scholar
S. Haykin, Adaptive Filter Theory. Englewood Cliffs, N.J.: Prentice Hall, 1999.
Google Scholar
A. Härmä, U. K. Laine, and M. Karjalainen, “Backward adaptive warped lattice for wideband stereo coding,” in Proc. of EUSIPCO’98, (Greece), 1998.
Google Scholar
B. Edler, C. Faller, G. Schuller, “Perceptual Audio Coding Using a Time-Varying Linear Pre-and Post-filter,” AES Symposium, Los Angeles, CA, Sept. 2000
Google Scholar
G. Schuller, B. Yu, D. Huang, “Lossless coding of audio signals using cascaded prediction,” in Proc. ICASSP, Salt Lake City, Utah, May 2001
Google Scholar
S. Dorward, D. Huang, S. A. Savari, G. Schuller, and B. Yu, “Low Delay Perceptually Lossless Coding of Audio Signals,” Data Compression Conference, Snowbird, UT, March 2001, pp. 312–320
Google Scholar
V. Madisetti, D. B. Williams, eds., The Digital Signal Processing Handbook, Chapter 42, D. Sinha et al., “The Perceptual Audio Coder (PAC),” CRC Press, Boca Raton, Fl., 1998.
Google Scholar
ITU-R, “Methods for the subjective assessment of small impairments in audio systems including multichannel sound systems,” Rec. ITU-R BS. 1116-1, Geneva, 1997
Google Scholar
U. Zölzer, Digital Audio Signal Processing, John Wiley & Sons, 1997.
Google Scholar
J. D. Johnston, “Estimation of perceptual entropy using noise masking criteria,” in Proc. ICASSP, pp. 2524–2527, Apr. 1988.
Google Scholar
M. Bosi and R. E. Goldberg, Introduction to Digital Audio Coding and Standards, Kluwer Academic Publishers, 2002.
Google Scholar
E. Zwicker, H. Fastl, and H. Frater, Psychoacoustics: Facts and Models, Springer Verlag; 2nd edition, 1999.
Google Scholar
P. P. Vaidyanathan, Multirate Systems and Filter Banks, Prentice Hall, 1993.
Google Scholar
G. Schuller and T. Karp, “Modulated Filter Banks with Arbitrary system delay: efficient implementations and the time-varying Case,” IEEE Transactions on Signal Processing, pp. 737–748, Mar. 2000.
Google Scholar
G. Schuller, “Time-varying filter banks with low delay for audio coding,” 105th AES Convention, San Francisco, CA, Sept. 26–29, 1998.
Google Scholar
G. Schuller and M. J. T. Smith, “New framework for modulated perfect reconstruction filter banks,” IEEE Transactions on Signal Processing, vol.44, pp. 1941–1954, Aug. 1996.
Google Scholar
V. Madisetti and D. B. Williams (Editors) The Digital Signal Processing Handbook, by CRC Press, Book and CD-ROM edition, 1997.
Google Scholar
G. M. Phillips, “Echo and its effects on the telephone user,” Bell Laboratories Record, vol. 32, pp. 281–284, Aug. 1954.
Google Scholar
G. Schuller and A. Harma, “Low delay audio compression using predictive coding,” in Proc. ICASSP, Orlando, FL, May 13–17, 2002.
Google Scholar
J. Herre, “Temporal noise shaping, quantization and coding methods in perceptual audio coding: a tutorial introduction,” in AES 17th International Conference, Florence, Italy, Sept. 2–5, 1999.
Google Scholar
E. Allamanche, R. Geiger, J. Herre, and T. Sporer, “MPEG-4 Low Delay Audio Coding based on the AAC Codec,” 106th AES Convention, Munich, Germany, May, 1999.
Google Scholar
N. S. Jayant and P. Noll, Digital Coding of Waveforms, Prentice Hall, Englewood Cliffs, New Jersey, 1984.
Google Scholar
F. K. Soong and B.-H. Juang, “Line spectrum pair (LSP) and speech data compression,” in Proc. ICASSP, 1984, pp. 1.10.1–1.10.4.
Google Scholar
G. Schuller, B. Yu, D. Huang, and B. Edler, “Perceptual audio coding using adaptive Pre-and post-filters and lossless compression,” IEEE Trans. Speech Audio Processing, pp. 379–390, Sept. 2002.
Google Scholar
A. Gelman, H. Stein, and D. Rubin, Bayesian Data Analysis, New York: Chapman & Hall, 1995.
Google Scholar

Download references

Author information

Authors and Affiliations

Fraunhofer AEMT, Germany
Gerald Schuler

Authors

Gerald Schuler
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Bell Laboratories, Lucent Technologies, USA
Yiteng Huang
INRS-EMT, Université du Québec, Canada
Jacob Benesty

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Schuler, G. (2004). Audio Coding. In: Huang, Y., Benesty, J. (eds) Audio Signal Processing for Next-Generation Multimedia Communication Systems. Springer, Boston, MA. https://doi.org/10.1007/1-4020-7769-6_11

Download citation

DOI: https://doi.org/10.1007/1-4020-7769-6_11
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4020-7768-5
Online ISBN: 978-1-4020-7769-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Audio Coding

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Audio Transmission

Acoustic Signal Processing

Fourier-Time-Transformation (FTT), Analysis of Sound and Auditory Perception

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Audio Coding

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Audio Transmission

Acoustic Signal Processing

Fourier-Time-Transformation (FTT), Analysis of Sound and Auditory Perception

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation