Abstract
Although ensembles of classifiers are one of the most popular tools to deal with data streams classification task [1,2,3,4,5], in the literature there is a lack of new approaches to creating ensembles of regression estimators [6, 7]. Most of the latest developments focus on the application of the regression estimators to solve very important real-world problems. In [8] the authors propose to create an ensemble composed of decision trees, gradient boosted trees and random forest to forecast electricity consumptions. The algorithm uses different weights for each component based on its previous performance. As it was shown in [9], the regression can be applied to enhanced prediction of occurrence of the concept-drift. The authors propose an ensemble method which utilizes constrained penalized regression as a combiner to track a drifting concept in a classification setting. The data stream approach to system fault prediction has been examined in [10]. In this paper different data-stream-based linear regression prediction methods have been tested and compared with a newly developed fault detection system. The applied and evaluated data stream mining algorithms were: grid-based classifier, polygon-based method, and one-class support vector machines. The results showed that the linear regression method generally achieved good performance in predicting short-term data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Pietruczuk, L., Rutkowski, L., Jaworski, M., Duda, P.: A method for automatic adjustment of ensemble size in stream data mining. In: 2016 International Joint Conference on Neural Networks (IJCNN), pp. 9–15 (2016)
Pietruczuk, L., Rutkowski, L., Jaworski, M., Duda, P.: How to adjust an ensemble size in stream data mining? Inf. Sci. 381, 46–54 (2017)
Street, W.N., Kim, Y.: A streaming ensemble algorithm (sea) for large-scale classification. In: Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 377–382. ACM (2001)
Minku, L.L., Yao, X.: DDD: a new ensemble approach for dealing with concept drift. IEEE Trans. Knowl. Data Eng. 24(4), 619–633 (2012)
Sun, J., Fujita, H., Chen, P., Li, H.: Dynamic financial distress prediction with concept drift based on time weighting combined with adaboost support vector machine ensemble. Knowl. Based Syst. 120, 4–14 (2017)
Ikonomovska, E., Gama, J., Džeroski, S.: Online tree-based ensembles and option trees for regression on evolving data streams. Neurocomputing 150, 458–470 (2015)
Duda, P., Jaworski, M., Rutkowski, L.: Online GRNN-based ensembles for regression on evolving data streams. In: International Symposium on Neural Networks, pp. 221–228. Springer, Berlin (2018)
Galicia, A., Talavera-Llames, R., Troncoso, A., Koprinska, I., Martínez-Álvarez, F.: Multi-step forecasting for big data time series based on ensemble learning. Knowl. Based Syst. (2018)
Wang, L.-Y., Park, C., Choi, H., Yeon, K.: A classifier ensemble for concept drift using a constrained penalized regression combiner. Procedia Comput. Sci. 91, 252–259 (2016)
Alzghoul, A., Löfstrand, M., Backe, B.: Data stream forecasting for system fault prediction. Comput. Ind. Eng. 62(4), 972–978 (2012)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Rutkowski, L., Jaworski, M., Duda, P. (2020). Regression. In: Stream Data Mining: Algorithms and Their Probabilistic Properties. Studies in Big Data, vol 56. Springer, Cham. https://doi.org/10.1007/978-3-030-13962-9_14
Download citation
DOI: https://doi.org/10.1007/978-3-030-13962-9_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-13961-2
Online ISBN: 978-3-030-13962-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)