Abstract
In this paper we consider a multiclass G/G/1 queueing system. When the server is idle, we use a balanced routing control policy to determine what kind of customer is served. Under such a balanced policy, we derive the diffusion limits of the queue length processes and the workload processes. The diffusion limits are the same for these processes regardless the choice of c as long as c ≥ 2. We further show that the proposed balanced routing policy for any fixed c ≥ 2 is asymptotically optimal in the sense that it minimizes the workload over all time in the diffusion limit.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
We consider a multiclass G/G/1 queueing system. When the server is idle, we use a balanced routing control policy to determine what kind of customer is served. This problem arises in a number of important applications. For example, it can be viewed as a order model of Van Mieghem [1]. It is interesting to think of our system as modeling order fulfillment at a firm which dynamically receives orders from customers for several different types or classes of goods and services. In order not to keep one kind of customer’s orders in the firm for too long, we need to settle the problem of routing control in such a system. The purpose of this paper is to show how the firm should sequence the different orders that are competing for its scarce resources.
Load balancing is a canonical method for sharing resources among different customers efficiently. Two versions of the load balancing problem have been studied in the literature: static and dynamic. The static version was first analyzed by Azar et al. [2] using the balls-and-bins model. In this paper, we are interested in the dynamic version, in Vvednskaya et al. [3], a routing policy that is based on the partial queue length information is proposed. The model was also studied by Mitzenmacher [4].
Chen and Ye [5] extends and complements the previous works in several ways. Firstly, based on Mitzenmacher’s, they establish the asymptotic optimality of the balanced routing under more general queueing systems. Secondly, they complements the work of He and Down [6], they rigorously establish the heavy traffic limit theorem for the queueing system under balance routing. Compared to Chen and Ye [5], the distinguishing feature of our analysis is that here we consider a multiclass G/G/1 queueing system. We extend the balanced routing control policy to multiclass G/G/1 queueing system.
The paper is organized as follows. In the next section we present our model. Section 3 analyze the fluid limit of the queueing processes, and establish the uniform attraction property of the fluid limit. In Sect. 4, we show that for any \( c \ge 2, \) the balanced routing control is asymptotically optimal. We conclude in Sect. 5 with extensions and discussion.
2 The Model
We consider a general single-server multiclass queueing system. Customers are categorized into \( L \) different classes depending on their specific arrival patterns. Class \( l \in \mathcal{L} = \left\{ {1, \ldots ,L} \right\} \) customers arrive at the system following a renewal process with arrival rate \( \lambda_{l} \). At the server, the customers are served at a rate of \( \mu_{l} \) according to the first-in first-out (FIFO) discipline.
For class \( l \) customers, denote the interarrival times between consecutive customers as \( u_{l,k} ,\;l \in \mathcal{L};\;k \ge 1 \), and denote the service time of \( k{\text{th}} \) customer at the server by \( v_{l,k} ,\;l \in \mathcal{L};\;k \ge 1 \). We assume that \( \left\{ {u_{l,k} ,\;k \ge 1} \right\},\;l \in \mathcal{L} \), and \( \left\{ {v_{l,k} ,\;k \ge 1} \right\},l \in \mathcal{L} \), are i.i.d. random sequence, all with finite second moments. In particular, \( u_{l,k} \) are random variables with mean \( {1 \mathord{\left/ {\vphantom {1 {\lambda_{l} }}} \right. \kern-0pt} {\lambda_{l} }} \) and variance \( a_{l}^{2} ,l \in \mathcal{L} \); and \( v_{l,k} \) are random variables with mean \( {1 \mathord{\left/ {\vphantom {1 {\mu_{l} }}} \right. \kern-0pt} {\mu_{l} }} \) and variance \( b_{l}^{2} ,l \in \mathcal{L} \). Denote the traffic intensity as \( \rho_{l} = {{\lambda_{l} } \mathord{\left/ {\vphantom {{\lambda_{l} } {\mu_{l} }}} \right. \kern-0pt} {\mu_{l} }},l \in \mathcal{L} \). For any \( \mathcal{L}^{\prime } \subseteq \mathcal{L}, \) let \( \lambda_{{\mathcal{L}^{\prime } }} = \sum\limits_{{l \in \mathcal{L}^{\prime } }} {\lambda_{l} } \) and \( \mu_{{\mathcal{L}^{\prime } }} = \sum\limits_{{l \in \mathcal{L}^{\prime } }} {\mu_{l} } \). Let \( \lambda_{0} = {{\lambda_{\mathcal{L}} } \mathord{\left/ {\vphantom {{\lambda_{\mathcal{L}} } L}} \right. \kern-0pt} L} \) denote the average arrival rate and \( \mu_{0} = {{\mu_{\mathcal{L}} } \mathord{\left/ {\vphantom {{\mu_{\mathcal{L}} } L}} \right. \kern-0pt} L} \) the average service rate.
We call \( A\left( t \right) = \left( {A_{l} \left( t \right)} \right)_{{l \in \mathcal{L}}} \) the arrival process, where \( A_{l} (t) \) represents the number of class \( l \) customers that have arrived during the time interval \( \left[ {0,t} \right] \); and call \( S\left( t \right) = \left( {S_{l} \left( t \right)} \right)_{{l \in \mathcal{L}}} \) the service process, where \( S_{l} (t) \) is the number of class \( l \) customers that are served during the first \( t \) time units that the server devotes to class \( l \).
Let \( q = \left( {q_{1} , \ldots ,q_{L} } \right)^{\prime } \) be a nonnegative integer valued vector (with \( q_{l} \) denote the queue length of class \( l \) customers). Let \( \pi (q) = \left( {j_{1} ,j_{2} , \ldots ,j_{L} } \right)^{\prime } \) be a permutation of indices \( \mathcal{L} \) such that \( q_{{j_{1} }} \ge q_{{j_{2} }} \ge \cdots \ge q_{{j_{L} }} \), and \( \pi_{l} (q) \) be the set of the first \( l \) components of \( \pi (q) \). Let \( \phi (j,k;\pi ) \) be a zero–one random variable, and \( \phi (j,k;\pi ) = 1 \) represents that the \( k{\text{th}} \) service is dispatched to the class \( j \) customers. For \( \forall q \) and \( \forall \mathcal{L}^{\prime} \subseteq \mathcal{L} \) that satisfy \( \pi_{l} (q) = \mathcal{L}^{\prime } \) for some \( l \) \( \left( {1 \le l \le L} \right), \)
We assume that the routing sequence is independent of the arrival process and the service time process. For ease of analysis, we introduce
3 Fluid Limit and Uniform Attraction
Here we start with presenting the main processes in this paper. First, let \( Q(t) = \left\{ {Q_{l} (t)} \right\}_{{l \in \mathcal{L}}} \) be the queue length at time \( t \), where \( Q_{l} (t) \) denote the total number of customers of class \( l \) in the system at time \( t \). Then, \( B_{l} (t) \) is the number of customers of class \( l \) that are served during \( [0,t] \), \( T_{l} (t) \) represents total amount of the time during \( [0,t] \) that the server allocates to class \( l \). The above can be described as follows:
The derived process that characterizes the dynamics of the queueing system is the queue-length process:
\( Q_{l} (0) \) is the initial number of customers of class \( l \) in the system, \( l \in \mathcal{L} \).
Now define the workload process and the idling process: \( W\left( t \right) = \left( {W_{l} \left( t \right)} \right)_{{l \in \mathcal{L}}} \) and \( Y\left( t \right) = \left( {Y_{l} \left( t \right)} \right)_{{l \in \mathcal{L}}} , \) where
and \( Y_{l} (t) = t - \sum\limits_{{l \in \mathcal{L}}} {T_{l} (t)} \). In addition, we define \( W_{0} (t) \) and \( Y_{0} (t) \) as follows,
Theorem 1 (Fluid Limit)
Let \( M \) be a given positive constant, and suppose \( \left| {\bar{Q}^{n} (0)} \right| = \sum\nolimits_{{l \in \mathcal{L}}} {\bar{Q}_{l}^{n} (0)} \le M \) for all \( n \). Then, for any subsequence of fluid scaled processes, there exists a further subsequence, denoted by \( \mathcal{N}, \) such that, along \( \mathcal{N}, \)
Theorem 2 (Uniform Attraction)
Let \( M \) be a given positive constant, and suppose \( \left| {\bar{Q}^{n} (0)} \right| = \sum\nolimits_{{l \in \mathcal{L}}} {\bar{Q}_{l}^{n} (0)} \le M \) for all \( n. \) Assume that the heavy traffic condition \( \rho = 1 \) holds. Then, there exists a time \( T_{M} > 0 \) such that, all fluid levels are the same after the time \( T_{M} \) and the fluid levels remain fixed afterward: for all \( l \in \mathcal{L}, \)
4 Diffusion Limit and Asymptotic Optimality
We apply the standard diffusion scaling (along with centering) to the derived process:
We can rewrite the unscaled system workload process for the \( n{\text{th}} \) network, and apply the diffusion scaling to the equation, we have
where
For each \( n, \) we also have
In view of (4) and with the centering procedure as in the above, we can write the following equality for the workload process,
By the functional central limit theorem for the renewal process (refer to Chen and Yao [7], Chap. 5), we have
For ease of exposition, assume that
It follows from Theorem 1 that as \( n \to \infty , \)
It follows from the random time-change theorem (refer to Chen and Yao [7], Chap. 5), the process \( \hat{X}^{n} \left( t \right) \) converges weakly as follows,
where the limit \( \bar{X}_{0} = \left\{ {\hat{X}_{0} \left( t \right),\;t \ge 0} \right\} \) is a Brownian motion.
We show that the derived diffusion scaled process approach to some limits described by \( \hat{W}_{0} \left( t \right),\;\hat{Y}_{0} \left( t \right) \) in the next theorem. The limiting process are characterized by the following relations for all \( t \ge 0 \) and \( l \in \mathcal{L}: \)
Theorem 3
Suppose that the heavy traffic condition \( \rho = 1 \) holds. Under the balanced routing policy, we have the following results.
-
(a)
Diffusion Limit: The following weak convergence holds when \( n \to \infty : \)
in particular, \( \hat{W}_{0} \) is a single-dimensional RBM, \( \hat{Y}_{0} \) is the associated regulator, and \( \hat{Q}\left( t \right) = Q^{*} \left( {\hat{W}_{0} \left( t \right)} \right), \) is the fixed point with all components equal to \( \mu_{0} \hat{W}_{0} \left( t \right). \)
-
(b)
Asymptotic optimality: The balanced routing is asymptotically optimal in the following sense: Let \( \hat{W}_{0}^{n,G} \) and \( \hat{Q}^{n,G} \) denote the scaled workload and queue length process associated with any feasible routing scheme \( G. \). Then, for all \( t \ge 0 \) and \( u \ge 0, \) we have
5 Concluding Remarks
In this paper, we extend the balanced routing control policy to multiclass G/G/1 queueing system. We prove that, under the diffusion scale to minimize the total workload process and the maximum queue length process, for any \( c \ge 2, \) the balanced routing control is asymptotically optimal. In heavy traffic, the multi-server simplifies to a single server with service capacity equal to the sum of the parallel servers, and the analysis still applies.
References
Stolyar AL (2001) Max-weight scheduling in a generalized switch: state space collapse and workload minimization in heavy traffic. Ann Appl Probab 14x:1–53
Azar Y, Broder A, Karlin A, Upfal E (1999) Balanced allocations. SIAM J Comput 29:1180–1200
Vvednskaya ND, Dobrushin RL, Karpelevich FI (1996) Queueing system with selection of the shortest of two queues: an asymptotic approach. Probl Inf Transm 32:15–27
Mitzenmacher M (2001) The power of two choices in randomized load balancing. IEEE Trans Parallel Distrib Comput 12:1094–1104
Chen H, Ye HQ (2012) Asymptotic optimality of balanced routing. Oper Res 60(1):163–179
He YT and Down DG (2008) Limited choice and locality considerations for load balancing. Perform Eval 65:670–687
Chen H, Yao DD (2001) Fundamentals of Queueing networks: performance, asymptotics, and optimization. Springer, New York
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xiao, X., Wu, L. (2013). Asymptotic Optimality of Balanced Routing in a Multiclass G/G/1 Queue. In: Yin, Z., Pan, L., Fang, X. (eds) Proceedings of The Eighth International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA), 2013. Advances in Intelligent Systems and Computing, vol 212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37502-6_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-37502-6_9
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37501-9
Online ISBN: 978-3-642-37502-6
eBook Packages: EngineeringEngineering (R0)