Statistical and Discriminative Methods for Speech Recognition

Juang, B. H.; Chou, Wu; Lee, C. H.

doi:10.1007/978-3-642-57745-1_4

Part of the book series: NATO ASI Series (NATO ASI F, volume 147)

Abstract

In this paper, we discuss speech recognizer training from a broad perspective rooted in classical Bayes decision theory. We distinguish classifier design via distribution estimation from discriminative training, motivated by the fact that in many realistic applications, such as speech recognition, the true form of the signal distribution is rarely known precisely. We argue that traditional methods relying on distribution estimation are suboptimal when the assumed distribution form is not the true one, and that “optimality” in distribution estimation does not automatically translate into “optimality” in classifier design. We compare the two methods in the context of hidden Markov modeling for speech recognition, and we show the superiority of the discriminative method over the distribution estimation method by citing the results of several key speech recognition experiments. In general, the discriminative method provides a 30–50% reduction in recognition errors.
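
The gap between "optimal" distribution estimation and "optimal" classifier design can be seen in a toy sketch. The example below is not the authors' method (which concerns hidden Markov models); it is a minimal 1-D illustration under stated assumptions: the data values, the equal-prior shared-variance Gaussian model, and the function names `ml_threshold` and `discriminative_threshold` are all hypothetical. The assumed Gaussian form does not match the heavy-tailed data of class b, so the plug-in decision boundary is misplaced, while a threshold chosen to minimize training errors directly is not.

```python
from statistics import mean

# Hypothetical 1-D training data. Class b has one far-out sample that a
# shared-variance Gaussian (the assumed, incorrect form) cannot represent.
class_a = [0.0, 1.0, 2.0, 3.0]
class_b = [4.0, 5.0, 6.0, 30.0]


def error_count(threshold, a, b):
    """Count errors of the rule: x < threshold -> class a, else class b."""
    return sum(x >= threshold for x in a) + sum(x < threshold for x in b)


def ml_threshold(a, b):
    """Plug-in Bayes rule under the assumed model.

    With equal priors and two Gaussians sharing one variance, the maximum
    likelihood mean estimates are the sample means, and the resulting
    decision boundary is the midpoint between them.
    """
    return (mean(a) + mean(b)) / 2.0


def discriminative_threshold(a, b):
    """Pick the threshold that directly minimizes the training error count."""
    points = sorted(a + b)
    # Candidate boundaries: midpoints between neighbouring training samples.
    candidates = [(lo + hi) / 2.0 for lo, hi in zip(points, points[1:])]
    return min(candidates, key=lambda t: error_count(t, a, b))


t_ml = ml_threshold(class_a, class_b)
t_disc = discriminative_threshold(class_a, class_b)
print(t_ml, error_count(t_ml, class_a, class_b))      # 6.375 3
print(t_disc, error_count(t_disc, class_a, class_b))  # 3.5 0
```

On this contrived data the distribution-estimation boundary misclassifies three of eight samples, while the directly optimized boundary misclassifies none. The numbers illustrate the argument only and say nothing about the error reductions reported for speech recognition.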

Author information

Authors and Affiliations

Speech Research, AT&T Bell Laboratories, 600 Mountain Avenue, Murray Hill, NJ 07974, USA
C. H. Lee

Editor information

Editors and Affiliations

Department of Electronics and Technology of Computers Faculty of Sciences, University of Granada, E-18071, Granada, Spain
Antonio J. Rubio Ayuso & Juan M. López Soler

About this paper

Cite this paper

Juang, B.H., Chou, W., Lee, C.H. (1995). Statistical and Discriminative Methods for Speech Recognition. In: Ayuso, A.J.R., Soler, J.M.L. (eds) Speech Recognition and Coding. NATO ASI Series, vol 147. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-57745-1_4

DOI: https://doi.org/10.1007/978-3-642-57745-1_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-63344-7
Online ISBN: 978-3-642-57745-1
eBook Packages: Springer Book Archive
