Abstract
The paper shows the solution of the query selectivity estimation problem for certain types of database queries with a selection condition based on several table attributes. The selectivity parameter allows for estimating a size of data satisfying a query condition. An estimator of a multidimensional probability density function is required for an accurate selectivity calculation for conditions involving many attributes and correlated attribute values. Using multidimensional histogram as a nonparametric density function estimator is mostly too much storage-consuming. The implementation of the known unconventional storage-efficient approach based on Discrete Cosine Transform spectrum of a multidimensional histogram is presented. This solution extends functionality of the Oracle DBMS cost-based query optimizer. The method of experimental obtaining error-optimal parameters values of spectrum storage for typical attributes value distributions is considered.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
References
Bruno, N., Chaudhuri, S., Gravano, L.: STHoles: a multidimensional workload-aware histogram. In: Proceedings of ACM SIGMOD International Conference on Management of Data, New York, US, pp. 211–222 (2001)
Chakrabarti, K., Garofalakis, M., Rastogi, R., Shim, K.: Approximate query processing using wavelets. The Very Large DataBases Journal 10(2-3), 199–223 (2001)
Getoor, L., Taskar, B., Koller, D.: Selectivity estimation using probabilistic models. In: Proceedings of ACM SIGMOD International Conference on Management of Data, New York, US, pp. 461–472 (2001)
Gunopulos, D., Kollios, G., Tsortas, V.J., Domeniconi, C.: Selectivity estimator for multidimensional range queries over real attributes. The Very Large DataBases Journal 14(2), 137–154 (2005)
Lee, L., Deok-Hwan, K., Chin-Wan, C.: Multi-dimensional selectivity estimation using compressed histogram estimation information. In: Proceedings of ACM SIGMOD International Conference on Management of Data, Philadelphia, US, pp. 205–214 (1999)
Oracle: Oracle 10g. using extensible optimizer, http://download.oracle.com/docs/cd/B14117_01/appdev.101/b10800/dciextopt.htm
Possala, V., Ioannidis, Y.E.: Selectivity estimation without the attribute value independence assumption. In: Proceedings of the 23rd International Conference on Very Large Databases, Athens, Greece, pp. 486–495 (1997)
Scott, D.W., Sain, S.R.: Multi-dimensional Density Estimator. Handbook of Statistics, vol. 24. North-Holland Publishing Company, Amsterdam (2004)
Yan, F., Hou, W.C., Jiang, Z., Luo, C., Zhu, Q.: Selectivity estimation of range queries based on data density approximation via cosine series. Data & Knowledge Engineering 63(3), 855–878 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Augustyn, D.R. (2009). Applying Advanced Methods of Query Selectivity Estimation in Oracle DBMS. In: Cyran, K.A., Kozielski, S., Peters, J.F., Stańczyk, U., Wakulicz-Deja, A. (eds) Man-Machine Interactions. Advances in Intelligent and Soft Computing, vol 59. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00563-3_61
Download citation
DOI: https://doi.org/10.1007/978-3-642-00563-3_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00562-6
Online ISBN: 978-3-642-00563-3
eBook Packages: EngineeringEngineering (R0)