Abstract
The ways in which people communicate are changing rapidly. The standard voice call over a wired network is but one means of communications, which already includes cordless and wireless voice calls, video calls, beeper service, FAX service, e-mail service, and data services. This revolution in communications is being fueled by several sources, including the availability of low cost, low power, computation in both DSP and RISC chips, larger and cheaper memory chips, improved algorithms for communications (e.g. modems, signalling) and signal processing, and finally the creation of world-wide standards for transmission, signal compression, and communication protocols. The broad goal of the communications revolution is to provide seamless and high quality communications between people (or groups of people), anywhere, anytime, and at a reasonable price. Although there are many technologies that form the bases for the communications environment of the twenty-first century, one of the key technologies for making the vision a reality is speech processing. In this paper we attempt to show, by example, how speech processing has been applied to specific problems in telecommunications, and how it will grow to become an even more essential component of the communications systems of the twenty-first century.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Speech Coding
Atal, B. S., Speech Processing Based on Linear Prediction. Encyclopedia of Physical Science and Technology, Vol. 13, Academic Press, New York, 1987, pp. 219–230.
Chen, J. H., Cox, R. V., Lin, Y. C, Jayant, N. S., and Melchner, M. J., A Low-Delay CELP Coder for the CCITT 16 Kb/s Speech Coding Standard, IEEE J. Sel Areas Commun., 10(2):830–849 (June 1992).
Jayant, N. S., and Noll, P., Digital Coding of Waveforms, Prentice-Hall, Englewood Cliffs, NJ, 1984.
Kroon, P., and Atal, B. S., Strategies for Improving the Performance of CELP Coders at Low Bit Rates, Proc. IEEE Intl. Conf Acoustics, Speech, and Signal Proc, 151–154 (April 1988).
Schroeder, M, R., Atal, B. S., and Hall, J. L., Optimizing Digital Speech Coders by Exploiting Masking Properties of the Human Ear, J. Acoust. Soc. Am., 66:1647–1652 (1979).
Shoham, Y, Low-rate Speech Coding Based on Time-Frequency Interpolation, Proc. Int. Conf Spoken Lang. Proc. (ICLSF92), 37–49, (Oct. 1992).
Speech Synthesis
Allen, J., Synthesis of Speech from Unrestricted Text, Proc. IEEE, 64:422–433 (1976).
Allen, J., Hunnicutt, S., and Klatt, D. H., From Text to Speech: The MIT Talk System, Cambridge University Press, Cambridge, England, 1987.
Coker, C. H., A Modele of Articulatory Dynamics and Control, Proc. IEEE, 64:452–459 (1976).
Flanagan, J. L., Computers that Talk and Listen: Man-Machine Communication by Voice, Proc. IEEE, 64:405–415 (1976).
Flanagan, J. L., and Rabiner, L. R., Speech Synthesis, Dowden, Hutchinson and Ross, Stroudsberg, PA, 1973.
Holmes, J. N., Mattingly, I. G., and Shearine, J. N., Speech Synthesis by Rule, Language and Speech, 7:127–143 (1964).
Klatt, D. H., Review of Text-to-Speech Conversion for English, J. Acoust. Soc. Am., 82(3):737–793 (September 1987).
Olive, J. R, and Liberman, M. Y., A Set of Concatenative Units for Speech Synthesis. In: Speech Communication Papers Presented at the 97th Meeting of the Acoustical Society of America (J. J. Wolf and D. H. Klatt, eds.), American Institute of Physics, New York, 1979, pp. 515–518.
O’Malley, M. H., Larkin, D. K., and Peters, E. W., Beyond the Reading Machine: What the Next Generation of Intelligent Text-to-Speech Systems Should Do for the User, Proc. Speech Tech. ′56, 216–219(1986).
Pisoni, D. B., Nusbaum, H. C., and Greene, B. G., Perception of Synthetic Speech Generated by Rule, Proc. IEEE, 73:1665–1676(1985).
Speech Recognition
Bahl, L. R., Jelinek, E, and Mercer, R., A Maximum Likelihood Approach to Continuous Speech Recognition, IEEE Trans. Pattern Analysis and Machine Intelligence, PAMI-5(2): 179–190 (1983).
Lee, K. E, Automatic Speech Recognition, the Development of the SPHINX System, Kluwer Academic Publishers, Boston, 1989.
Ney, H., The Use of a One Stage Dynamic Programming Algorithm for Connected Word Recognition, IEEE Trans. Acoustics, Speech, and Signal Proc, ASSP-32(2):263–271 (1984).
Rabiner, L. R., A Tutorial on Hidden Markov Models and Its Applications to Speech Recognition, Proc. IEEE, 72(2):257–286(1989).
Rabiner, L. R., and Juang, B. H., Fundamentals of Speech Recognition, Prentice-Hall Inc., Englewood Cliffs, NJ, 1993.
Rabiner, L. R., and Levinson, S. E., Isolated and Connected Word Recognition — Theory and Selected Applications, IEEE Trans. Commun., COM-29(5):621–659 (1981).
Reddy, D. R., Speech Recognition Machine: A Review, Proc. IEEE, 64:502–531 (1976).
Weibel, A., and Lee, K. F, (eds.), Readings in Speech Recognition, Morgan Kaufman, San Mateo, CA, 1990.
Zue, V. W., The Use of Speech Knowledge in Automatic Speech Recognition, Proc. IEEE, 73(11): 1602–1615 (1985).
Speaker Verification
Rosenberg, A. E., Automatic Speaker Verification: A Review, Proc. IEEE, 64:475–487 (1976).
Rosenberg, A. E., and Soong, F. K., Evaluation of a Vector Quantization Talker Recognition System in Text Independent and Text Dependent Modes, Computer, Speech, and Language, 22:143–157 (1987).
Soong, F. K., Rosenberg, A. E., Juang, B. H., and Rabiner, L. R., A Vector Quantization Approach to Speaker Recognition, AT&T Tech. J., 66:14–26 (1987).
Language Translation
Roe, D. B., Moreno, P. J., Sproat, R. W., Pereira, F. C, Riley, M. D., and Macarron, A., A Spoken Language Translator for Restricted-Domain Context-Free Languages, Speech Communication, 11:311–319 (1992).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rabiner, L.R. (1995). Telecommunications Applications of Speech Processing. In: Ayuso, A.J.R., Soler, J.M.L. (eds) Speech Recognition and Coding. NATO ASI Series, vol 147. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-57745-1_41
Download citation
DOI: https://doi.org/10.1007/978-3-642-57745-1_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-63344-7
Online ISBN: 978-3-642-57745-1
eBook Packages: Springer Book Archive