Abstract
Workload flows in enterprise systems that use the multi-tier paradigm are often characterized as bursty, i.e., exhibit a form of temporal dependence. Burstiness often results in dramatic degradation of the perceived user performance, which is extremely difficult to capture with existing capacity planning models. The main reason behind this deficiency of traditional capacity planning models is that the user perceived performance is the result of the complex interaction of a very complex workload with a very complex system. In this paper, we propose a simple and effective methodology for detecting burstiness symptoms in multi-tier systems rather than identifying the low-level exact cause of burstiness as traditional models would require. We provide an effective way to incorporate this information into a surprisingly simple and effective modeling methodology. This new modeling methodology is based on the index of dispersion of the service process at a server, which is inferred by observing the number of completions within the concatenated busy periods of that server. The index of dispersion together with other measurements that reflect the “estimated” mean and the 95th percentile of service times are used to derive a Markov-modulated process that captures well burstiness and variability of the true service process, despite inevitable inaccuracies that result from inexact and limited measurements. Detailed experimentation on a TPC-W testbed where all measurements are obtained by HP (Mercury) Diagnostics, a commercially available tool, shows that the proposed technique offers a simple yet powerful solution to the difficult problem of inferring accurate descriptors of the service time process from coarse measurements of a given system. Experimental and model prediction results are in excellent agreement and argue strongly for the effectiveness of the proposed methodology under both bursty and non-bursty workloads.
This work is partially supported by NSF grants CNS-0720699 and CCF-0811417, and a gift from HPLabs. A short version of this paper titled “How to Parameterize Models with Bursty Workloads” appeared in the HotMetrics 2008 Workshop (non-copyrighted) [5].
Chapter PDF
Similar content being viewed by others
Keywords
References
Andersen, A.T., Nielsen, B.F.: On the statistical implications of certain random permutations in markovian arrival processes (MAPs) and second-order self-similar processes. Perf. Eval. 41(2-3), 67–82 (2000)
Balbo, G., Serazzi, G.: Asymptotic analysis of multiclass closed queueing networks: Common bottlenecks. Perf. Eval. 26(1), 51–72 (1996)
Banga, G., Druschel, P.: Measuring the capacity of a web server under realistic loads. World Wide Web 2(1-2), 69–83 (1999)
Beran, J.: Statistics for Long-Memory Processes. Chapman & Hall, New York (1994)
Casale, G., Mi, N., Cherkasova, L., Smirni, E.: How to parameterize models with bursty workloads. In: Proceedings of First Workshop on Hot Topics in Measurement & Modeling of Computer Systems, HotMetrics 2008 (2008)
Casale, G., Mi, N., Smirni, E.: Bound analysis of closed queueing networks with workload burstiness. In: Proceedings of ACM SIGMETRICS, pp. 13–24 (2008)
Casale, G., Zhang, E., Smirni, E.: Interarrival times characterization and fitting for markovian traffic analysis. Number WM-CS-2008-02 (2008), http://www.wm.edu/computerscience/techreport/2008/WM-CS-2008-02.pdf
Cox, D., Lewis, P.: The Statistical Analysis of Series of Events. John Wiley and Sons, New York (1966)
Ferng, H., Chang, J.: Connection-wise end-to-end performance analysis of queueing networks with MMPP inputs. Perf. Eval. 43(1), 39–62 (2001)
Garcia, D., Garcia, J.: TPC-W E-commerce benchmark evaluation. In: IEEE Computer, pp. 42–48 (February 2003)
Gusella, R.: Characterizing the variability of arrival processes with indexes of dispersion. IEEE JSAC 19(2), 203–211 (1991)
Heindl, A.: Traffic-Based Decomposition of General Queueing Networks with Correlated Input Processes. Ph.D. Thesis. Shaker Verlag, Aachen (2001)
Lazowska, E.D., Zahorjan, J., Graham, G.S., Sevcik, K.C.: Quantitative System Performance. Prentice-Hall, Englewood Cliffs (1984)
Liu, Z., Niclausse, N., Jalpa-Villanueva, C.: Traffic model and performance evaluation of web servers. Perform. Eval. 46(2-3) (2001)
Liu, Z., Wynter, L., Xia, C.H., Zhang, F.: Parameter inference of queueing models for it systems using end-to-end measurements. Perf. Eval. 63(1), 36–60 (2006)
Menascé, D.A., Almeida, V.A.F.: Scaling for E-Business: Technologies, Models, Performance, and Capacity Planning. Prentice-Hall, Inc., Englewood Cliffs (2000)
Menascé, D.A., Almeida, V.A.F., Dowdy, L.W.: Capacity planning and performance modeling: from mainframes to client-server systems. Prentice-Hall, Inc., Englewood Cliffs (1994)
Mi, N., Zhang, Q., Riska, A., Smirni, E., Riedel, E.: Performance impacts of autocorrelated flows in multi-tiered systems. Perf. Eval. 64(9-12), 1082–1101 (2007)
Neuts, M.F.: Structured Stochastic Matrices of M/G/1 Type and Their Applications. Marcel Dekker, New York (1989)
Ranjan, S., Rolia, J., Fu, H., Knightly, E.: Qos-driven server migration for internet data centers. In: The 10th International Workshop on Quality of Service (IWQoS 2002) (2002)
Reiser, M.: Mean-value analysis and convolution method for queue-dependent servers in closed queueing networks. Perf. Eval. 1, 7–18 (1981)
Reiser, M., Lavenberg, S.S.: Mean-value analysis of closed multichain queueing networks. JACM 27(2), 312–322 (1980)
Robertazzi, T.G.: Computer Networks and Systems. Springer, Heidelberg (2000)
Rolia, J., Vetland, V.: Correlating resource demand information with arm data for application services. In: Proceedings of WOSP 1998, pp. 219–230. ACM, New York (1998)
Sriram, K., Whitt, W.: Characterizing superposition arrival processes in packet multiplexers for voice and data. IEEE JSAC 4(6), 833–846 (1986)
Urgaonkar, B., Pacifici, G., Shenoy, P., Spreitzer, M., Tantawi, A.: An analytical model for multi-tier internet services and its applications. In: Proceedings of the ACM SIGMETRICS Conference, Banff, Canada, pp. 291–302 (June 2005)
Urgaonkar, B., Shenoy, P., Chandra, A., Goyal, P.: Dynamic provisioning of multi-tier internet applications. In: ICAC 2005: Proceedings of the Second International Conference on Automatic Computing, pp. 217–228 (2005)
Villela, D., Pradhan, P., Rubenstein, D.: Provisioning servers in the application tier for e-commerce systems. ACM Trans. Interet Technol. 7(1), 7 (2007)
Mercury Diagnostics, www.mercury.com/us/products/diagnostics
Zhang, Q., Cherkasova, L., Mathews, G., Greene, W., Smirni, E.: R-capriccio: A capacity planning and anomaly detection tool for enterprise services with live workloads. In: Proceedings of Middleware, Newport Beach, CA, pp. 244–265 (2007)
Zhang, Q., Cherkasova, L., Smirni, E.: A regression-based analytic model for dynamic resource provisioning of multi-tier applications. In: Proceedings of ICAC 2007, p. 27 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 IFIP International Federation for Information Processing
About this paper
Cite this paper
Mi, N., Casale, G., Cherkasova, L., Smirni, E. (2008). Burstiness in Multi-tier Applications: Symptoms, Causes, and New Models. In: Issarny, V., Schantz, R. (eds) Middleware 2008. Middleware 2008. Lecture Notes in Computer Science, vol 5346. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89856-6_14
Download citation
DOI: https://doi.org/10.1007/978-3-540-89856-6_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89855-9
Online ISBN: 978-3-540-89856-6
eBook Packages: Computer ScienceComputer Science (R0)