Abstract
Ranking-aware queries have recently gained much attention in applications such as search engines and data streams. They are not restricted to such applications, however, and are also very useful in OLAP applications. In this paper, we introduce aggregation ranking queries in OLAP data cubes, motivated by an online advertisement tracking data warehouse application. These queries aggregate information over a specified range and then return the ranked order of the aggregated values. They differ from range aggregate queries, which are mainly concerned with applying an aggregate operator such as SUM or MIN/MAX over the selected ranges of all dimensions in the data cube. Existing techniques for range aggregate queries cannot process aggregation ranking queries efficiently. Hence, we propose new algorithms to handle this problem. The proposed algorithms use both ranking and cumulative information to progressively rank aggregation results. We also evaluate our techniques empirically, and the experimental results show that the query cost is improved significantly.
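To make the query type concrete, the following is a minimal baseline sketch and not the progressive algorithms proposed in the paper. It assumes a hypothetical two-dimensional cube in which each row is an advertiser and each column is a day, precomputes per-row cumulative sums so that any range SUM is a difference of two prefix values, and then ranks the rows by that sum. The names `build_prefix_sums` and `rank_range_aggregates`, and the sample data, are illustrative assumptions.

```python
from itertools import accumulate
import heapq


def build_prefix_sums(cube):
    """Per-row cumulative sums with a leading 0, so that
    sum(row[lo:hi + 1]) == prefix[hi + 1] - prefix[lo]."""
    return [[0] + list(accumulate(row)) for row in cube]


def rank_range_aggregates(prefix, lo, hi, k):
    """Return the k (row_index, range_sum) pairs with the largest sums
    over columns lo..hi (inclusive), in descending order of sum."""
    sums = ((i, p[hi + 1] - p[lo]) for i, p in enumerate(prefix))
    return heapq.nlargest(k, sums, key=lambda pair: pair[1])


# Hypothetical data: rows are advertisers, columns are days.
cube = [
    [5, 1, 4, 2],
    [3, 3, 3, 3],
    [0, 9, 0, 1],
]
prefix = build_prefix_sums(cube)

# Top 2 advertisers by total over days 1..2.
top2 = rank_range_aggregates(prefix, 1, 2, 2)
print(top2)
```

This baseline still computes the aggregate for every row before ranking, which is the cost that a progressive approach would aim to avoid.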
This research is supported by NSF grants IIS-23022, CNF-0423336, and EIA-00-80134.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, HG., Yu, H., Agrawal, D., El Abbadi, A. (2005). Progressive Ranking of Range Aggregates. In: Tjoa, A.M., Trujillo, J. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2005. Lecture Notes in Computer Science, vol 3589. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11546849_18
DOI: https://doi.org/10.1007/11546849_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28558-8
Online ISBN: 978-3-540-31732-6
eBook Packages: Computer Science (R0)