Abstract
OLAP techniques provide efficient solutions to navigate through data cubes. However, they are not equipped with frameworks that empower user investigation of interesting information. They are restricted to exploration tasks.
Recently, various studies have been trying to extend OLAP to new capabilities by coupling it with data mining algorithms. However, most of these algorithms are not designed to deal with sparsity, which is an unavoidable consequence of the multidimensional structure of OLAP cubes.
In [1], we proposed a novel approach that embeds Multilayer Perceptrons into OLAP environment to extend it to prediction. This approach has largely met its goals with limited sparsity cubes. However, its performances have decreased progressively with the increase of cube sparsity.
In this paper, we propose a substantially modified version of our previous approach called NAP-SC (Neural Approach for Prediction over Sparse Cubes). Its main contribution consists in minimizing sparsity effect on measures prediction process through the application of a cube transformation step, based on a dedicated aggregation technique.
Carried out experiments demonstrate the effectiveness and the robustness of NAP-SC against high sparsity data cubes.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
References
Abdelbaki, W., Ben Messaoud, R., Ben Yahia, S.: A Neural-Based Approach for Extending OLAP to Prediction. In: Cuzzocrea, A., Dayal, U. (eds.) DaWaK 2012. LNCS, vol. 7448, pp. 117–129. Springer, Heidelberg (2012)
Inmon, W.H.: Building the Data Warehouse. John Wiley & Sons (1996)
Agarwal, D., Chen, B.C.: Latent OLAP: Data Cubes Over Latent Variables. In: Proceedings of the 2011 International Conference on Management of Data, SIGMOD 2011, pp. 877–888. ACM, New York (2011)
Codd, E.F., Codd, S.B., Salley, C.T.: Providing OLAP (on-line Analytical Processing) to User-analysts: An IT Mandate, vol. 32. Codd & Date, Inc. (1993)
Olson, D., Delen, D.: Advanced Data Mining Techniques. Springer (2008)
Sarawagi, S., Agrawal, R., Megiddo, N.: Discovery-Driven Exploration of OLAP Data Cubes. In: Schek, H.-J., Saltor, F., Ramos, I., Alonso, G. (eds.) EDBT 1998. LNCS, vol. 1377, pp. 168–182. Springer, Heidelberg (1998)
Ben Messaoud, R., Loudcher-Rabaseda, S.: OLEMAR: An On-Line Environment for Mining Association Rules in Multidimensional Data. In: Advances in Data Warehousing and Mining, vol. 2. Idea Group Publishing (2007)
Bodin-Niemczuk, A., Ben Messaoud, R., Rabaséda, S.L., Boussaid, O.: Vers l’intégration de la prédiction dans les cubes OLAP. In: EGC, pp. 203–204 (2008)
Niemi, T., Nummenmaa, J., Thanisch, P.: Normalising OLAP Cubes for Controlling Sparsity. Data & Knowledge Engineering 46(3), 317–343 (2003)
Kriegel, H.P., Borgwardt, K.M., Kröger, P., Pryakhin, A., Schubert, M., Zimek, A.: Future Trends in Data Mining. Data Min. Knowl. Discov. 15(1), 87–97 (2007)
Juyoung, K., Hwanseung, Y., Yoshifumi, M.: Classification of Sparsity Patterns and Performance Evaluation in OLAP Systems. IPSJ SIG Notes 67, 449–455 (2002)
Goil, S., Choudhary, A.: High Performance Multidimensional Analysis and Data Mining. In: Proceedings of the High Performance Networking and Computing Conference (SC 1998), Orlando, Florida, USA (November 1998)
Palpanas, T., Koudas, N., Mendelzon, A.: Using Datacube Aggregates for Approximate Querying and Deviation Detection. IEEE Trans. on Knowl. and Data Eng. 17, 1465–1477 (2005)
Chen, Y., Dong, G., Han, J., Pei, J., Wah, B.W., Wang, J.: Regression Cubes with Lossless Compression and Aggregation. IEEE Trans. on Knowl. and Data Eng. 18 (December 2006)
Chen, B.C., Chen, L., Lin, Y., Ramakrishnan, R.: Prediction Cubes. In: Proceedings of the 31st International Conference on Very large Data Bases, VLDB 2005, pp. 982–993 (2005)
Thomsen, E.: Olap Solutions: Building Multidimensional Information Systems. Wiley Computer Publishing (2002)
Hornik, K., Stinchcombe, M., White, H.: Multilayer Feedforward Networks are Universal Approximators. Neural Networks 2(5), 359–366 (1989)
Haykin, S.: Neural Networks: a Comprehensive Foundation. Prentice Hall International Editions Series. Prentice Hall (1999)
Hotelling, H.: Analysis of a Complex of Statistical Variables into Principal Components. Journal of Educational Psychology 24(7), 498–520 (1933)
Rubin, D.B.: Multiple Imputations in Sample Surveys: a Phenomenological Bayesian Approach to Nonresponse. In: Proceedings of the Survey Research Methods Section, pp. 20–28 (1978)
Ben Messaoud, R., Boussaid, O., Rabaséda, S.L.: Using a Factorial Approach for Efficient Representation of Relevant OLAP Facts. In: Proceedings of the 7th International Baltic Conference on Databases and Information Systems (DB&IS 2006), pp. 98–105. IEEE Communications Society, Vilnius (2006)
Rumelhart, D., McClelland, J., University of California, S.D.P.R.G: Parallel Distributed Processing: Explorations in the Microstructure of Cognition. Foundations. Computational Models of Cognition and Perception. MIT Press (1986)
Ben Othman, L., Rioult, F., Ben Yahia, S., Crémilleux, B.: Missing Values: Proposition of a Typology and Characterization with an Association Rule-Based Model. In: Pedersen, T.B., Mohania, M.K., Tjoa, A.M. (eds.) DaWaK 2009. LNCS, vol. 5691, pp. 441–452. Springer, Heidelberg (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abdelbaki, W., Ben Yahia, S., Ben Messaoud, R. (2012). NAP-SC: A Neural Approach for Prediction over Sparse Cubes. In: Zhou, S., Zhang, S., Karypis, G. (eds) Advanced Data Mining and Applications. ADMA 2012. Lecture Notes in Computer Science(), vol 7713. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35527-1_29
Download citation
DOI: https://doi.org/10.1007/978-3-642-35527-1_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35526-4
Online ISBN: 978-3-642-35527-1
eBook Packages: Computer ScienceComputer Science (R0)