Abstract
We are concerned with feed-forward non-linear networks (multi-layer perceptrons, or MLPs) with multiple outputs. We wish to treat the outputs of the network as probabilities of alternatives (e.g. pattern classes), conditioned on the inputs. We look for appropriate output non-linearities and for appropriate criteria for adaptation of the parameters of the network (e.g. weights). We explain two modifications: probability scoring, which is an alternative to squared error minimisation, and a normalised exponential (softmax) multi-input generalisation of the logistic non-linearity. The two modifications together result in quite simple arithmetic, and hardware implementation is not difficult either. The use of radial units (squared distance instead of dot product) immediately before the softmax output stage produces a network which computes posterior distributions over class labels based on an assumption of Gaussian within-class distributions. However, the training, which uses cross-class information, can result in better performance at class discrimination than the usual within-class training method, unless the within-class distribution assumptions are actually correct.
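As a concrete illustration (not code from the paper), the three ingredients named in the abstract can be sketched in a few lines of NumPy. The function names are mine, and the radial-unit stage assumes uniform class priors and identity within-class covariances, the simplest case of the Gaussian assumption described above:

```python
import numpy as np

def softmax(a):
    """Normalised exponential: maps real-valued scores to a
    probability distribution over classes."""
    e = np.exp(a - a.max())        # subtract max for numerical stability
    return e / e.sum()

def probability_score(p, target):
    """'Probability scoring': negative log of the probability assigned
    to the correct class, minimised in place of squared error."""
    return -np.log(p[target])

def radial_softmax_outputs(x, centres):
    """Radial units (negative half squared distance to each class
    centre) feeding softmax. Under identity-covariance Gaussian
    within-class distributions and uniform priors (assumptions of
    this sketch), the outputs are the class posteriors."""
    scores = -0.5 * np.array([np.sum((x - c) ** 2) for c in centres])
    return softmax(scores)
```

For example, an input lying nearer one class centre than another receives the higher posterior from `radial_softmax_outputs`, and `probability_score` penalises the network in proportion to how little probability it assigned to the true class.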
© 1990 Springer-Verlag Berlin Heidelberg
Bridle, J.S. (1990). Probabilistic Interpretation of Feedforward Classification Network Outputs, with Relationships to Statistical Pattern Recognition. In: Soulié, F.F., Hérault, J. (eds) Neurocomputing. NATO ASI Series, vol 68. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-76153-9_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-76155-3
Online ISBN: 978-3-642-76153-9