Abstract
In this era of big data, analyzing large scale data efficiently and accurately has become a challenge problem. Online sequential extreme learning machine is one of ELM variants, which provides a method to analyze data. Ensemble method provides a way to learn data more accurately. MapReduce provides a simple, scalable and fault-tolerant framework, which can be utilized for large scale learning. In this paper, we propose an ensemble OS-ELM framework which supports ensemble methods including Bagging, subspace partitioning and cross validating. Further we design a parallel ensemble of online sequential extreme learning machine (PEOS-ELM) algorithm based on MapReduce for large scale learning. PEOS-ELM algorithm is evaluated with real and synthetic data with the maximum number of training data 5120K and the maximum number of attributes 512. The speedup of this algorithm can reach as high as 40 on a cluster with maximum 80 cores. The accuracy of PEOS-ELM algorithm is at the same level as that of ensemble OS-ELM running on a single machine, which is higher than that of the original OS-ELM.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Arndt, H., Bundschus, M., Naegele, A.: Towards a next-generation matrix library for java. In: 33rd Annual IEEE International Computer Software and Applications Conference, COMPSAC 2009, vol. 1, pp. 460–467. IEEE (2009)
He, Q., Shang, T., Zhuang, F., Shi, Z.: Parallel extreme learning machine for regression based on mapreduce. Neurocomput. 102, 52–58 (2013)
Huang, G.-B., Chen, L.: Convex incremental extreme learning machine. Neurocomputing 70, 3056–3062 (2007)
Huang, G.-B., Zhou, H., Ding, X., Zhang, R.: Extreme learning machine for regression and multiclass classification. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics 42(2), 513–529 (2012)
Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine. In: Technical Report ICIS/03/2004. School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore (January 2004)
Lan, Y., Soh, Y.C., Huang, G.-B.: Ensemble of online sequential extreme learning machine. Neurocomputing 72(13), 3391–3395 (2009)
Liang, P.N.-Y., Huang, G.-B., Saratchandran, Sundararajan, N.: A fast and accurate online sequential learning algorithm for feedforward networks. IEEE Transactions on Neural Networks 17(6), 1411–1423 (2006)
Liu, N., Wang, H.: Ensemble based extreme learning machine. IEEE Signal Processing Letters 17(8), 754–757 (2010)
Rokach, L.: Ensemble-based classifiers. Artificial Intelligence Review 33(1-2), 1–39 (2010)
Wang, B., Huang, S., Qiu, J., Liu, Y., Wang, G.: Parallel online sequential extreme learning machine based on mapreduce. In: The International Conference on Extreme Learning Machines (ELM 2013), Beijing, China, October 15-17 (2013)
Xin, J., Wang, Z., Chen, C., Ding, L., Wang, G., Zhao, Y.: Elm*: distributed extreme learning machine with mapreduce. In: World Wide Web, pp. 1–16 (2013)
Zhai, J.-h., Xu, H.-y., Wang, X.-z.: Dynamic ensemble extreme learning machine based on sample entropy. Soft Comput. 16(9), 1493–1502 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Huang, S., Wang, B., Qiu, J., Yao, J., Wang, G., Yu, G. (2015). Parallel Ensemble of Online Sequential Extreme Learning Machine Based on MapReduce. In: Cao, J., Mao, K., Cambria, E., Man, Z., Toh, KA. (eds) Proceedings of ELM-2014 Volume 1. Proceedings in Adaptation, Learning and Optimization, vol 3. Springer, Cham. https://doi.org/10.1007/978-3-319-14063-6_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-14063-6_3
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-14062-9
Online ISBN: 978-3-319-14063-6
eBook Packages: EngineeringEngineering (R0)