Abstract
In this work, we present a novel approach for the efficient materialization of dynamic web pages in e-commerce applications such as an online retail store with millions of items, hundreds of HTTP requests per second and tens of dynamic web page types. In such applications, user satisfaction, as measured in terms of response time (QoS) and content freshness (QoD), determines their success especially under heavy workload. The novelty of our materialization approach over existing ones is that, it considers the data dependencies between content fragments of a dynamic web page. We introduce two new semantic-based data freshness metrics that capture the content dependencies and propose two materialization algorithms that balance QoS and QoD. In our evaluation, we use a real-world experimental system that resembles an online bookstore and show that our approach outperforms existing QoS-QoD balancing approaches in terms of server-side response time (throughput), data freshness and scalability.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Arlitt, M.: Characterizing web user sessions. SIGMETRICS Perform. Eval. Rev. 28(2), 50–63 (2000)
Arlitt, M.F., Williamson, C.L.: Internet web servers: workload characterization and performance implications. IEEE/ACM Trans. Networking 5(5), 631–645 (1997)
Bright, L., Raschid, L.: Using latency-recency profiles for data delivery on the web. In: VLDB, pp. 550–561 (2002)
Cunha, C., Bestavros, A., Crovella, M.: Characteristics of www client-based traces. Boston University, Tech. Rep. TR-95-010 (1995)
Guirguis, S., Sharaf, M.A., Chrysanthis, P.K., Labrinidis, A., Pruhs, K.: Adaptive scheduling of web transactions. In: ICDE, pp. 357–368 (2009)
Labrinidis, A., Roussopoulos, N.: Webview materialization. SIGMOD Rec. 29(2), 367–378 (2000)
Labrinidis, A., Roussopoulos, N.: Exploring the tradeoff between performance and data freshness in database-driven web servers. VLDB J. 13(3), 240–255 (2004)
Labrinidis, A., Luo, Q., Xu, J., Xue, W.: Caching and materialization for web databases. Foundations and Trends in Databases 2(3), 169–266 (2009)
Larson, P.-Å., Goldstein, J., Zhou, J.: Mtcache: transparent mid-tier database caching in sql server. In: ICDE, pp. 177–189 (2004)
Li, W.-S., Po, O., Hsiung, W.-P., Candan, K.S., Agrawal, D.: Engineering and hosting adaptive freshness-sensitive web applications on data centers. In: WWW, pp. 587–598 (2003)
Luo, Q., Krishnamurthy, S., Mohan, C., Pirahesh, H., Woo, H., Lindsay, B.G., Naughton, J.F.: Middle-tier database caching for e-business. In: SIGMOD, pp. 600–611 (2002)
Mah, B.A.: An empirical model of http network traffic. In: INFOCOM, p. 592 (1997)
Menascé, D.A.: Testing e-commerce site scalability with tpc-w. In: CMG Conference, pp. 457–466 (2001)
Olshefski, D.P., Nieh, J., Nahum, E.: Ksniffer: determining the remote client perceived response time from live packet streams. In: OSDI, pp. 333–346 (2004)
Oke, A., Bunt, R.B.: Hierarchical workload characterization for a busy web server. In: TOOLS, pp. 309–328 (2002)
Padmanabhan, V.N., Qiu, L.: The content and access dynamics of a busy web site: findings and implications. SIGCOMM Comput. Commun. Rev. 30(4), 111–123 (2000)
Papastavrou, S.: Semantics-based Metrics and Algorithms for Dynamic Content in Web Database Applications. Ph.D. dissertation, University of Cyprus (2009)
Papastavrou, S., Samaras, G., Evripidou, P., Chrysanthis, P.K.: A decade of dynamic web content: a structured survey on past and present practices and future trends. IEEE Communications Surveys & Tutorials 8(2), 52–60 (2006)
Papastavrou, S., Chrysanthis, P.K., Samaras, G.: Exploring content dependencies to better balance performance and freshness in web database applications. In: WISE, pp. 512–525 (2012)
Paulson, L.D.: Building rich web applications with ajax. IEEE Comput. 38(10), 14–17 (2005)
Qu, H., Labrinidis, A.: Preference-aware query and update scheduling in web-databases. In: ICDE, pp. 1–10 (2007)
Schmitt, B., Oberlnder, S.: Access evaluation of digital libraries: characteristics and performance of web opacs. In: Second Int. Workshop on New Developments in Digital Libraries (2002)
Schroeder, B., Harchol-Balter, M.: Web servers under overload: how scheduling can help. ACM Trans. Internet Technol. 6(1), 20–52 (2006). doi:10.1145/1125274.1125276
Srivastava, J., Cooley, R., Deshpande, M., Tan, P.-N.: Web usage mining: discovery and applications of usage patterns from web data. SIGKDD Explor. Newsl. 1(2), 12–23 (2000)
Wang, Q., Makaroff, D., Edwards, H.K., Thompson, R.: Workload characterization for an e-commerce web site. In: CASCON, pp. 313–327 (2003)
Wilson, T.: Review of gmail. Inf. Res. 10(1) (2004). http://www.informationr.net/ir/reviews/sofrev17/sofrev17.html
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was part of the first author’s doctoral dissertation at the University of Cyprus and was partially supported by the USA National Science Foundation Award OIA-1028162 and co-funded by the EU Project CONET (INFSO-ICT-224053) and the project FireWatch (#0609-BIE/09), sponsored by the Cyprus Research Promotion Foundation.
Rights and permissions
About this article
Cite this article
Papastavrou, S., Chrysanthis, P.K. & Samaras, G. Performance vs. freshness in web database applications. World Wide Web 17, 969–995 (2014). https://doi.org/10.1007/s11280-013-0262-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11280-013-0262-0