Large Ensemble Averaging

Horn, David; Naftaly, Ury; Intrator, Nathan

doi:10.1007/3-540-49430-8_7

David Horn⁶,
Ury Naftaly⁶ &
Nathan Intrator⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1524))

5664 Accesses

Abstract

Averaging over many predictors leads to a reduction of the variance portion of the error. We present a method for evaluating the mean squared error of an infinite ensemble of predictors from finite (small size) ensemble information. We demonstrate it on ensembles of networks with difierent initial choices of synaptic weights.We find that the optimal stopping criterion for large ensembles occurs later in training time than for single networks. We test our method on the suspots data set and obtain excellent results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 74.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Bootstrap bias corrections for ensemble methods

Article 30 November 2016

Multivariate analysis of short time series in terms of ensembles of correlation matrices

Article Open access 02 October 2018

Diversity in Classifier Ensembles: Fertile Concept or Dead End?

References

J.L. Elman and D. Zipser. Learning the Hidden Structure of Speech. J. Acoust. Soc. Amer. 83, 1615–1626. 1988.
Article Google Scholar
S. Geman, E. Bienenstock and R. Doursat. Neural networks and the bias/variance dilemma. Neural Comp., 4(1):1–58. 1992.
Article Google Scholar
W.P. Lincoln and J. Skrzypek. Synergy of clustering multiple back propagation networks. In Touretzky, D. S, editors, Advances in Neural Information Processing Systems 2, pages 650–657, SanMateo, CA. Morgan Kaufmann 1990.
Google Scholar
J. Morris Forecasting the sunspot cycle. J. Roy. Stat. Soc. Ser. A, 140, 437–447 1977.
Google Scholar
U. Naftaly, N. Intrator and D. Horn. Optimal Ensemble Averaging of Neural Networks. Network, Comp. Neural Sys., 8, 283–296 1997.
Article MATH Google Scholar
S.J. Nowlan and G.E. Hinton. Simplifying neural networks by soft weight-sharing. Neural Computation. 4, 473–493 1992.
Article Google Scholar
P.M. Perrone. Improving regression estimation: averaging methods for variance reduction with extensions to general convex measure optimization. PhD thesis BrownUniversity, Institute for Brain and Neural Systems, 1993.
Google Scholar
H. Pi and C. Peterson. Finding the Embedding Dimension and Variable Dependencies in Time Series. Neural Comp. 6, 509–520 1994.
Article Google Scholar
M.B. Priestley. Spectral Analysis and Time Series. Academic Press. 1981.
Google Scholar
A.S. Weigend, B.A. Huberman and D. Rumelhart. Predicting the future: A connectionist approach. Int. J. Neural Syst. 1, 193–209 1990.
Article Google Scholar
D. H. Wolpert. Stacked generalization. Neural Networks 5:241–259 1992
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Physics and Astronomy, Tel Aviv University, Tel Aviv, 69978, Israel
David Horn & Ury Naftaly
School of Mathematical Sciences Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv, 69978, Israel
Nathan Intrator

Authors

David Horn
View author publications
You can also search for this author in PubMed Google Scholar
Ury Naftaly
View author publications
You can also search for this author in PubMed Google Scholar
Nathan Intrator
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Willamette University, Salem, OR, 97301, USA
Genevieve B. Orr
GMD First (Forschungszentrum Informationstechnik), Rudower Chaussee 5, D-12489, Berlin, Germany
Klaus-Robert Müller

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Horn, D., Naftaly, U., Intrator, N. (1998). Large Ensemble Averaging. In: Orr, G.B., Müller, KR. (eds) Neural Networks: Tricks of the Trade. Lecture Notes in Computer Science, vol 1524. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49430-8_7

Download citation

DOI: https://doi.org/10.1007/3-540-49430-8_7
Published: 28 March 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65311-0
Online ISBN: 978-3-540-49430-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Large Ensemble Averaging

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Bootstrap bias corrections for ensemble methods

Multivariate analysis of short time series in terms of ensembles of correlation matrices

Diversity in Classifier Ensembles: Fertile Concept or Dead End?

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Large Ensemble Averaging

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Bootstrap bias corrections for ensemble methods

Multivariate analysis of short time series in terms of ensembles of correlation matrices

Diversity in Classifier Ensembles: Fertile Concept or Dead End?

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation