Keywords

There are various mechanisms and approaches to improve the energy efficiency of the cloud computing system. One way to improve energy efficiency is to upgrade network equipment by implementing power-saving modes and adaptive transmission rates to achieve proportional power usage. Another example of an approach to improve energy efficiency is scheduling and load balancing the servers, VMs, and applications [4]. Speaking about VMs, several approaches can be taken for improving energy efficiency. For example, VM self-adaption and hardware adjustment and choosing the most efficient physical machines for VM placements. Finally, VM migrations allow dynamic consolidation of physical machines by moving underutilized VMs onto fewer hosts and powering off the unused ones, that permits reduced resource consumption at any given moment [3]. Similar to virtual machines, it is possible to manage the power consumption of cloud computing servers.

The servers can be put into standby state in order to improve the energy efficiency of a cloud system in case of light load. On the one hand, the switching to standby mode allows to reduce power consumption, and on the other hand, it leads to extra power usage to turn on/off the server. Therefore, it is important to understand under what conditions it will be advantageous to put the server in standby state, and under what conditions it is more profitable to leave it in the operating mode. In this regard, cloud providers employ various mechanisms that decrease server switching number. In this article we consider three mechanisms to improve the energy efficiency of the cloud computing server: the shutdown delay mechanism, the switch on delay mechanism and the threshold-based switch mechanism.

1 Analysis of Mathematical Models of Mechanisms for Increasing the Energy Efficiency of Cloud Systems

We proceed from the fact that arriving customers are distributed evenly on the virtual machine. Since we assumed that the computing resources are distributed evenly across all servers, we will consider a system consisting of one server with Processor Sharing policy.

We consider a base model [6] as a single-server queuing system with C virtual machines. Customers arrive according to the Poisson law with rate \(\lambda \). Service times, switch on and switch off durations are exponentially distributed with the parameters \(\mu \), \(\alpha \) and \(\beta \), respectively. The system state is described by the vector (sk), where k is the number of customers in the system, s is the server state. Here s = 0 means that the system is in the standby mode, s = 1 reflects switch-on mode and s = 2 and s = 3 represent operating and switch off modes, respectively. Arrival of a customer in an empty system cause change of the system state to the switch on mode. After exponentially distributed time with rate \(\alpha \), the system switches to the operating mode, in which serving of customers is started. When the system remains empty in the operating mode, it switches off immediately.

For the base model, the set of states \(S_1\) is represented in the following form: \(S_1=\{ (s,k)\vert s=1,2,~ 1\le k\le C \} \cup \{ (s,k)\vert s=3,~ 0\le k\le C \} \cup (0,0) \).

We derive the system of equilibrium equations, based on the transition intensity diagram [5, 6], which makes it possible to obtain stationary probabilities \(p_{s,k}\) that the system is in (s,k) state:

$$\begin{aligned} p_{3,0} =\frac{\lambda }{\beta } p_{0,0} ; \end{aligned}$$
(1)
$$\begin{aligned} p_{1,1} =\frac{\lambda }{\left( \lambda +\alpha \right) } p_{0,0} +\frac{\beta }{\left( \lambda +\alpha \right) } p_{3,1} ; \end{aligned}$$
(2)
$$\begin{aligned} p_{1,k} =\frac{\lambda }{\left( \lambda +\alpha \right) } p_{1,k-1} +\frac{\beta }{\left( \lambda +\alpha \right) } p_{3,k} ,\quad 2\le k\le C-1; \end{aligned}$$
(3)
$$\begin{aligned} p_{1,C} =\frac{\lambda }{\alpha } p_{1,C-1} +\frac{\beta }{\alpha } p_{3,C} ; \end{aligned}$$
(4)
$$\begin{aligned} p_{2,k} =\frac{\left( \lambda +\mu \right) }{\mu } p_{2,k-1} -\frac{\lambda }{\mu } p_{2,k-2} -\frac{\alpha }{\mu } p_{1,k-1} ,\quad 3\le k\le C; \end{aligned}$$
(5)
$$\begin{aligned} p_{2,C} =\frac{\alpha }{\mu } p_{1,C} +\frac{\lambda }{\mu } p_{2,C-1} ; \end{aligned}$$
(6)
$$\begin{aligned} p_{2,1} =\frac{\left( \lambda +\beta \right) }{\mu } p_{3,0} ; \end{aligned}$$
(7)
$$\begin{aligned} p_{2,2} =\frac{\left( \lambda +\mu \right) }{\mu } p_{2,1} -\frac{\alpha }{\mu } p_{1,1} ; \end{aligned}$$
(8)
$$\begin{aligned} p_{3,k} =\frac{\lambda }{\left( \lambda +\beta \right) } p_{3,k-1} ,\quad 1\le k\le C-1; \end{aligned}$$
(9)
$$\begin{aligned} p_{3,C} =\frac{\lambda }{\beta } p_{3,C-1} ; \end{aligned}$$
(10)
$$\begin{aligned} \sum _{i=0}^{3}\sum _{j=0}^{C}p_{i,j} =1. \end{aligned}$$
(11)

Due to the high energy consumption for shutting down the cloud server, in some cases it’s more beneficial to leave it in operating mode pending the arrival of new customers. In [5] we consider the model with server shutdown delay mechanism. In contrast to the base model, where it was assumed that the server shuts down as soon as it remains empty, in this model the system does not switch off immediately, but waits exponentially distributed time with rate \(\gamma \). If a customer arrives during that waiting period, then the system starts serving. Otherwise, the state is changed to the switch off mode. If a customer arrives during the switch off mode, then the system turns to the switch on mode immediately after the completion of the switch off. Otherwise, the system falls to the stand by mode. For this model, the set of states is represented in the following form: \(S_2=\{ (s,k)\vert s=1,~ 1\le k\le C \} \cup \{ (s,k)\vert s=2,3,~ 0\le k\le C \} \cup (0,0) \).

The system of equations for the model with shutdown delay mechanism differs from the system of equations for the base model by the following formulas: we added formula (12), and replaced formulas (7) and (8) by formulas (13) and (14).

$$\begin{aligned} p_{2,0} =\frac{\lambda +\beta }{\gamma } p_{3,0} ; \end{aligned}$$
(12)
$$\begin{aligned} p_{2,1} =\frac{\lambda +\gamma }{\mu } p_{2,0} ; \end{aligned}$$
(13)
$$\begin{aligned} p_{2,2} =\frac{\left( \lambda +\mu \right) }{\mu } p_{2,1} -\frac{\lambda }{\mu } p_{2,0} -\frac{\alpha }{\mu } p_{1,1} ; \end{aligned}$$
(14)

Also we consider the model with server switch on delay, as well as in base model, system passes in switch off mode at once after it remains empty. But it does not switch on immediately on arrival of a new customer, and waits exponentially distributed time with rate \(\theta \).

For this system, the set of states \(S_3\) is represented in the following form: \(S_3=\{ (s,k)\vert s=0,3,~ 0\le k\le C \} \cup \{ (s,k)\vert s=1,2,~ 1\le k\le C \} \).

The system of equations for the model with switch on delay mechanism differs from the system of equations for the base model by the following formulas: we added formulas (18) and (19), and replaced formulas (2)–(4) by formulas (15)–(17).

$$\begin{aligned} p_{1,1} =\frac{\theta }{\left( \lambda +\alpha \right) } p_{0,1} ; \end{aligned}$$
(15)
$$\begin{aligned} p_{1,k} =\frac{\theta }{\left( \lambda +\alpha \right) } p_{0,k} ,\quad 2\le k\le C-1; \end{aligned}$$
(16)
$$\begin{aligned} p_{1,C} =\frac{\lambda }{\alpha } p_{1,C-1} +\frac{\theta }{\alpha } p_{0,C} ; \end{aligned}$$
(17)
$$\begin{aligned} p_{0,k} =\frac{\lambda }{\lambda +\theta } p_{0,k-1} +\frac{\beta }{\lambda +\theta } p_{3,k} ,\quad 1\le k\le C-1; \end{aligned}$$
(18)
$$\begin{aligned} p_{0,C} =\frac{\lambda }{\theta } p_{0,C-1} +\frac{\beta }{\theta } p_{3,C} ; \end{aligned}$$
(19)

Then we consider the mode with the threshold-based switch mechanism, in which system passes from standby mode in switch on mode only after arrived of a certain number \(\kappa \) of customers.

For this system, the set of states \(S_4\) is represented in the following form: \(S_4=\{ (s,k)\vert s=0,~ 0\le k\le \kappa -1 \} \cup \{ (s,k)\vert s=1,~ \kappa \le k\le C \} \cup \{ (s,k)\vert s=2,~ 1\le k\le C \} \cup \{ (s,k)\vert s=3,~ 0\le k\le C \} \).

The system of equations for this model differs from the system of equations for the base model by the following formulas: we added formula (25), and replaced formulas (2)–(4) by formulas (20) and (21) and formulas (5) and (6) by formulas (22)–(24).

$$\begin{aligned} p_{1,\kappa } =\frac{\lambda }{\left( \lambda +\alpha \right) } p_{1,\kappa -1} +\frac{\beta }{\left( \lambda +\alpha \right) } p_{3,\kappa } ; \end{aligned}$$
(20)
$$\begin{aligned} p_{1,k} =\frac{\lambda }{\left( \lambda +\alpha \right) } p_{1,k-1} +\frac{\beta }{\left( \lambda +\alpha \right) } p_{3,k} ,\quad \kappa +1\le k\le C-1; \end{aligned}$$
(21)
$$\begin{aligned} p_{2,k+1} =\frac{\left( \lambda +\mu \right) }{\mu } p_{2,k} -\frac{\lambda }{\mu } p_{2,k-1} ,\quad 2\le k\le \kappa -1; \end{aligned}$$
(22)
$$\begin{aligned} p_{2,k+1} =\frac{\left( \lambda +\mu \right) }{\mu } p_{2,k} -\frac{\alpha }{\mu } p_{1,k} -\frac{\lambda }{\mu } p_{2,k-1} ,\quad \kappa \le k\le C-1; \end{aligned}$$
(23)
$$\begin{aligned} p_{2,C} =\frac{\alpha }{\mu } p_{1,C} +\frac{\lambda }{\mu } p_{2,C-1} ; \end{aligned}$$
(24)
$$\begin{aligned} \lambda p_{0,k} =\lambda p_{0,k-1} +\beta p_{3,k} ,\quad 1\le k\le \kappa -1; \end{aligned}$$
(25)

We derived the system of equilibrium equations for each model, based on the transition intensity diagrams, which makes it possible to obtain stationary probability distribution of the system. Taking into account the normalization condition and using matrix methods, the system of equilibrium equations can be solved numerically, but we represent the analytical solution in [5].

2 Energy Consumption Indicators and the Performance Characteristics of Cloud Systems

With the system stationary distribution, we calculate the energy consumption indicators. We will assume that in the switch on/off mode, the power consumption is constant and equal to the average values \(P_1\) and \(P_3\), respectively. In the operating mode, the power consumption \(P_{2,k}\) depends on the server occupancy. Through \(P_{2,max}\) we denoted the maximum value of the server’s power consumption in the operating mode, and through \(P_{2,min}\) we denoted the power consumption in idle mode. The energy consumption in the standby mode will be calculated by \(P_0\). By analogy with the formula given in [2], we derive the formula for the average server power consumption:

$$\begin{aligned} P=P_0 \sum _{i=0}^C p_{0,i}+P_1 \sum _{i=0}^C p_{1,i}+P_3 \sum _{i=0}^C p_{3,i}+\sum _{i=0}^C P_{2,i}p_{2,i} \end{aligned}$$
(26)

where

$$\begin{aligned} P_{2,k}=P_{2,min}+{{P_{2,max}-P_{2,min}}\over {C}}k \end{aligned}$$
(27)

According to Little’s law, the average number N of customers in the system is equal to the average effective arrival rate \(\lambda (1-\pi )\) multiplied by the average sojourn time T. Expressed algebraically the law is

$$\begin{aligned} N=\lambda (1-\pi )T \end{aligned}$$
(28)

where blocking probability \(\pi \) is

$$\begin{aligned} \pi =p_{0,C}+p_{1,C}+p_{2,C}+p_{3,C} \end{aligned}$$
(29)

The average number N of customers is given by

$$\begin{aligned} N=\sum _{k=0}^3 \sum _{i=1}^C ip_{k,i} \end{aligned}$$
(30)

The average response time T follows directly from formulas (28) and (30):

$$\begin{aligned} T={{\sum _{k=0}^3 \sum _{i=1}^C ip_{k,i}}\over {\lambda (1-\pi )}} \end{aligned}$$
(31)

3 Numeral analysis

On the energy profile of the cloud system installed at the University of Cardiff [1], it can be seen [1] that the inclusion of the server lasts 150 seconds, and the shutdown is 30 seconds. Further, for convenience, it was represented in minutes. The values of \(P_i\) were taken from [1], according to which \(P_0 = 10\) W, \(P_1 = 170\) W, \(P_3 = 120\) W, \(P_{2, min} = 105\) W and \(P_{2, max} = 268\) W.

The results of numerical analysis for the values C = 20, \(\mu \) = 20, \(\alpha \) = 1, \(\beta \) = 2 are presented in Figs. 1, 2, 3, 4, 5 and 6.

The plots of the server’s power consumption (Fig. 1) for base model show that the consumed power increases very fast for small values of the arrival flow intensity \(\lambda \), also note that with the increase of waiting time, during which the system doesn’t go into standby mode, the power consumption also increases.

Fig. 1.
figure 1

The dependence of the power consumption P on the arrival flow intensity \(\lambda \). Comparison of the model with the shutdown delay mechanism and the base model.

Fig. 2.
figure 2

The dependence of the average response time T on the arrival flow intensity \(\lambda \). Comparison of the model with the shutdown delay mechanism and the base model.

In Fig. 2 it is clearly seen that the greatest dependence of the average sojourn time T on the arrival flow intensity \(\lambda \) is observed at values of \(\gamma \) from 1 to 5.

Fig. 3.
figure 3

The dependence of the power consumption P on the arrival flow intensity \(\lambda \). Comparison of the model with the switch on delay mechanism and the base model.

Fig. 4.
figure 4

The dependence of the average response time T on the arrival flow intensity \(\lambda \). Comparison of the model with the switch on delay mechanism and the base model.

Fig. 5.
figure 5

The dependence of the power consumption P on the arrival flow intensity \(\lambda \). Comparison of the model with the threshold-based switch mechanism and the base model.

Fig. 6.
figure 6

The dependence of the average response time T on the arrival flow intensity \(\lambda \). Comparison of the model with the threshold-based switch mechanism and the base model.

The graph of the server power consumption P (Fig. 3) for the model with server shutdown delay increases most sharply at small values of the arrival flow intensity \(\lambda \), also note that with an increase in the exponential time \(\theta \) during which the system doesn’t go into the operating mode after the first customer is received, the value of the power consumption increases accordingly. Note that for \(\theta \) = 20, the graph of the model with server shutdown delay is most closely approximated to the base model’s graph, where the system was included immediately after the customer was received.

In Fig. 4 note that for small values of \(\theta \), the difference in the average sojourn time T is the greatest.

For each of the three models, it was noted that when \(\gamma \rightarrow \infty \) (see Figs. 1 and 2), \(\theta \rightarrow \infty \) (see Figs. 3 and 4) and \(\kappa \rightarrow 1\) (see Figs. 5 and 6), respectively, each of the three systems tends to the initial baseline model.

4 Conclusion

We constructed mathematical models of cloud computing systems taking into account various mechanisms for increasing energy efficiency in terms of queuing theory, and analytical expressions for the main characteristics of energy consumption and server performance metrics were obtained. We conducted a comparative numerical analysis of the mechanisms for improving the energy efficiency of cloud computing based on the initial data close to the real ones. Numerical analysis showed that the mechanism with server shutdown delay gives an improvement in both power and response time, for a mechanism with server enable delay, there is an improvement at almost all intervals of power consumption, except service time at low loads. The mechanism with server thresholding gives an improvement for power, but deterioration in time.