Introduction

China is a developing country in a period of rapid development. Electricity consumption forecasting is an important part of power economic planning, energy investment, and environmental protection (Lin and Liu 2016), and it has become an important research area in the operation and management of modern power systems (Kavousi-Fard et al. 2014). The accuracy of electricity consumption forecasting (Amber et al. 2018) is of great significance to economic development and power planning. Electricity consumption is affected by a series of factors, such as population (Hussain et al. 2016), economic growth (Lin and Liu 2016), power facilities (Khosravi et al. 2012), and climate (Hernández et al. 2013), making the prediction problem a challenging and complex task. To address these problems, in recent years many experts, scholars, and research institutions at home and abroad have carried out in-depth research on electricity consumption forecasting models. The main methods are nonlinear intelligent models (Bekiroglu et al. 2018; Hernandez et al. 2014), traditional statistical analysis models (Chui et al. 2009; Mohamed and Bodger 2005), and grey prediction models (Xiao et al. 2017).

Ghani and Ahmad (2010) used SPSS software to establish a linear regression model based on the multiple regression method to predict and analyze fish landings, demonstrating the effectiveness and feasibility of the method. Wang et al. (2018) used an autoregressive moving average (ARMA) model based on a time series algorithm to predict short-term wind power. Zhang et al. (2019) combined BP and RBF neural network methods to predict and analyze wind speed, verifying the effectiveness and accuracy of the combined prediction model. The above models usually require a large amount of data, so it is difficult to obtain accurate results with limited data. Grey theory (Julong 1982) is used to solve uncertainty problems with limited data and poor information, and it focuses on building grey prediction models from a small amount of information. Grey theory does not seek the statistical law of time series data; instead, it associates the random process with time and uses the accumulated generating operation to process the original data. This reduces the inherent randomness of the data, transforms irregular data into an approximately exponential form, generates a sequence with strong regularity, and allows the future direction of the data to be predicted. Grey systems are often used to discover the laws hidden in chaotic data. Wang et al. (1819) used the grey management degree and grey theory to establish a grey system model based on basic urban heating data and forecast heating demand. Guo et al. (2013) proposed a new comprehensive adaptive grey model, CAGM(1,N), which can be applied to practical forecasting problems and can obtain higher fitting and prediction accuracy than the traditional GM(1,N) model (Pai et al. 2008; Tien 2008).

These methods make predictions from raw data by maximizing the fitting accuracy, but they do not take into account the complex diversity of the underlying processes. A single grey model produces large errors in forecasting, and it is difficult to achieve the expected accuracy. The Markov model is suitable for predicting random problems; it can better describe the dynamic trend of randomly changing objects and can compensate for the shortcomings of the grey model. Therefore, the two prediction models are combined: the grey model is used to forecast, and the predicted values are corrected by the Markov chain to effectively improve the prediction accuracy, so as to achieve scientific prediction and analysis. So far, the combination of grey theory and Markov models has been applied in many areas of prediction. Yong and Yidan (1992) put forward the grey Markov model for the first time by combining the advantages of the grey model and Markov theory, and it has since been widely used in the prediction of traffic, natural disasters, energy consumption, and other fields. Kumar and Jain (2010) combined the grey Markov and time series models to predict energy consumption in India for the first time, which provided a feasible scheme for forecasting India’s energy consumption. Cao et al. (2019) explored the internal relationship between road transport accidents involving hazardous chemicals and traffic accidents in China based on the grey Markov model, and analyzed the grey Markov combined prediction model for safety accident prediction; the effectiveness and feasibility of the method were verified by experiments.

The above research shows that the grey Markov model has good prediction accuracy and ability. However, in practice it is found that, when faced with the prediction of small-sample data, the model still involves a certain degree of contingency. Therefore, according to the characteristics of China’s electricity consumption data, this paper optimizes and improves the traditional grey Markov model and proposes RGPMM(λ,1,1) to predict China’s electricity consumption more accurately, so as to provide more accurate information for the rational distribution of energy.

Objective

The work of this article is carried out in the following aspects:

  1) Grey power prediction model (Wang et al. 2011). It is a new type of nonlinear grey prediction model. Its power exponent reflects the nonlinear development characteristics of the data, so the model gives good prediction results when the underlying process develops nonlinearly.

  2) The power exponent λ is generally taken as an integer in traditional grey prediction models; for example, λ = 2 gives the Verhulst model. In this paper, λ belongs to the real numbers R, that is, λ may take fractional values, and the value of λ is estimated through optimization theory. A novel grey prediction model can then be established. In the modeling process, the robustness of λ is analyzed.

  3) The introduction of the rolling mechanism (Akay and Atak 2007). In the forecasting process, because data from the distant past have little effect on the forecast, the rolling mechanism is introduced to continuously update the input information. This breaks the constraint of a fixed initial value in the classic grey prediction model and complies with the principle of “new information priority” (Julong 1989).

  4) In this paper, the relative error of the grey power model is used as the index, and the weighted Markov theory (Liu et al. 2018) is used to correct the grey power model, which further improves the prediction accuracy and adaptability of the model.

Organization

The rest of this article is organized as follows: the “Basic knowledge” section briefly introduces the historical background of the grey model and the traditional GM(1,1) model, Markov theory, rolling mechanism, and the grey development zone. The “Methodology of improved grey prediction model” section introduces how to build the RGPMM(λ,1,1). The “Case studies on forecasting the total electricity consumption in China” section illustrates the practicability of the RGPMM(λ,1,1) by experiment, and forecasts the total electricity consumption in the next few years by this model. The “Conclusion” section contains the conclusions and suggestions for future work.

Basic knowledge

Basic GM(1,1)

In 1982, Professor Deng Julong first proposed the concept of the grey system and built the GM(1,1). The process of the GM(1,1) is as follows (Julong 1982; Lin et al. 2012; Zeng et al. 2020):

Step 1: Transforming the original data. Let the non-negative original sequence be \(X^{\left(0\right)}=\left\{x^{\left(0\right)}\left(1\right), x^{\left(0\right)}\left(2\right),\cdots ,x^{\left(0\right)}\left(n\right)\right\}\), \(n\geq 4\). The 1-AGO sequence is given by

$$ X^{\left( 1\right)}=\left\{x^{\left( 1\right)}\left( 1\right),x^{\left( 1\right)}\left( 2\right),\cdots,x^{\left( 1\right)}\left( n\right)\right\} $$
(1)

where, \(x^{\left (1\right )}\left (k\right )={\sum }_{i=1}^{k}x^{\left (0\right )}\left (i\right ), k=1,2,\cdots ,n.\)

Step 2: Based on the sequence \(X^{\left (1\right )}\), the whitening form equation of the prediction model can be established:

$$ \frac{dx^{\left( 1\right)}}{dt}+a\cdot x^{\left( 1\right)}=b. $$
(2)

In Formula (2), a and b are the parameters to be estimated. The grey differential equation is:

$$ x^{\left( 0\right)}\left( k\right)+a\cdot z^{\left( 1\right)}\left( k\right)=b, $$
(3)

where, \(z^{\left (1\right )}\left (k\right )=\frac {1}{2}\cdot \left [x^{\left (1\right )}\left (k\right )+x^{\left (1\right )}\left (k-1\right )\right ]\) is the background value and \(Z^{\left (1\right )}=\left \{z^{\left (1\right )}\left (2\right ),z^{\left (1\right )}\left (3\right ),\cdots ,z^{\left (1\right )}\left (n\right )\right \}\) is the mean sequence of \(X^{\left (1\right )}\) (Xiong et al. 2014).

Step 3: Estimating the model parameters. Set the parameters vector to be estimated as \(\begin {pmatrix}\hat {a}\\\hat {b}\end {pmatrix}\) and solve it according to the least squares method to obtain

$$ \left( \begin{array}{cc}\hat{a}\\\hat{b} \end{array}\right)=\left( \begin{array}{cc}B^{T}B \end{array}\right)^{-1}B^{T}Y, $$
(4)

where \(B=\left(\begin{array}{cc}-z^{\left(1\right)}\left(2\right)&1\\ -z^{\left(1\right)}\left(3\right)&1\\{\vdots} & {\vdots} \\ -z^{\left(1\right)}\left(n\right)&1 \end{array}\right)\), \(Y=\left(x^{\left(0\right)}\left(2\right),x^{\left(0\right)}\left(3\right),\cdots ,x^{\left(0\right)}\left(n\right)\right)^{T}\).

Step 4: Obtaining the time response function. Substituting the estimates from Eq. 4 into Eq. 2 and solving it, the time response function is computed as:

$$ \begin{array}{@{}rcl@{}} \hat{x}^{\left( 1\right)}\left( t+1\right)&=&\left( x^{\left( 0\right)}\left( 1\right)-\frac{b}{a}\right)\cdot e^{-at}+\frac{b}{a},\\ t&=&1,2,\cdots,n,n+1,\cdots. \end{array} $$

Step 5: Obtaining the fitted and predicted values in the original domain. The values in the original domain are restored by the inverse accumulation \(\hat{x}^{\left(0\right)}\left(t\right)=\hat{x}^{\left(1\right)}\left(t\right)-\hat{x}^{\left(1\right)}\left(t-1\right),\) namely,

$$ \begin{array}{@{}rcl@{}} \hat{x}^{\left( 0\right)}\left( t\right)&=&\left[x^{(0)}(1)-\frac{\hat{b}}{\hat{a}}\right]\cdot\left( 1-e^{\hat{a}}\right)\cdot e^{-\hat{a}(t-1)},\\ t&=&2,3,\cdots,n,n+1,\cdots. \end{array} $$

where \(\hat {x}^{(0)}(t) (t\le n)\) are called fitted values, and \(\hat {x}^{(0)}(t)(t>n)\) are called predicted values.
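For concreteness, the following is a minimal Python sketch of Steps 1–5 (NumPy assumed; the function name gm11 and the horizon argument are illustrative choices, not part of the original presentation):

```python
import numpy as np

def gm11(x0, horizon=0):
    """Fit GM(1,1) to a non-negative series x0; return fitted values plus `horizon` forecasts."""
    x0 = np.asarray(x0, dtype=float)
    n = len(x0)
    x1 = np.cumsum(x0)                              # Step 1: 1-AGO sequence
    z1 = 0.5 * (x1[1:] + x1[:-1])                   # background values z^(1)(k)
    B = np.column_stack([-z1, np.ones(n - 1)])      # Step 3: build B and Y
    Y = x0[1:]
    a, b = np.linalg.lstsq(B, Y, rcond=None)[0]     # (B^T B)^{-1} B^T Y
    t = np.arange(1, n + horizon)                   # Steps 4-5: time response + restoration
    x0_hat = (x0[0] - b / a) * (1 - np.exp(a)) * np.exp(-a * t)
    return np.concatenate(([x0[0]], x0_hat))        # the first value is reproduced exactly

# usage: fit the 2008-2015 data from Table 3 and forecast two further years
# series = [3.45414, 3.70322, 4.19345, 4.70009, 4.97626, 5.42034, 5.63837, 5.80200]
# print(gm11(series, horizon=2))
```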

The flow chart is shown in Fig. 1.

Fig. 1
figure 1

The flow chart of GM(1,1)

Markov process

The Markov process (Zhao et al. 2014) is a theory that studies the states of things and the transitions between them. A Markov process in which both time and state are discrete is called a Markov chain. Markov chain analysis is a statistical analysis method based on probability theory and stochastic process theory, using stochastic mathematical models to analyze the quantitative relationships of objective objects during their development and change. Its characteristic is the absence of after-effect, that is, the next state of the system depends only on the current state and has nothing to do with the earlier states.

Transition probability and transition probability matrix

In Markov process, the transition probability and the transition probability matrix of states need to be calculated, which are defined as follows:

Definition 1

Let \(\{X_{n}, n\in T\}\) be a Markov chain, and call the conditional probability \(p_{ij}=P\left(X_{n+1}=j\mid X_{n}=i\right)\), \(i,j\in T\), the one-step transition probability of the Markov chain \(\{X_{n}, n\in T\}\) at time n, referred to simply as the transition probability. That is, it is the conditional probability that the system is in state i at time n and moves to state j after one step. The matrix composed of the transition probabilities is the transition probability matrix. In a Markov chain, the state transition of the system can be represented by the transition probability matrix P as follows:

$$ P=\left( \begin{array}{cccc}p_{11}&p_{12}&\cdots&p_{1n}\\ p_{21}&p_{22}&\cdots&p_{2n}\\ {\vdots} & {\vdots} &{\ddots} &\vdots\\ p_{n1}&p_{n2}&\cdots&p_{nn} \end{array}\right) $$

The steps of Markov process

Markov process is introduced to obtain the transition probability of residual state, so as to determine the state of the residual when t > n. The steps are as follows:

Step 1: Determine the residual state;

Step 2: Calculate the state transition probability matrix P according to the residual state;

Step 3: Determine the initial state vector;

Step 4: According to the state transition formula, calculate the result of the t-th state transition, and take the state with the highest probability of occurrence as the predicted state.
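To illustrate Steps 1–4, the sketch below estimates a one-step transition matrix from a labelled residual-state sequence and selects the most probable next state (the state labels 0..m−1 and the helper names are illustrative assumptions):

```python
import numpy as np

def transition_matrix(states, m):
    """Estimate the one-step transition probability matrix from a state sequence."""
    P = np.zeros((m, m))
    for i, j in zip(states[:-1], states[1:]):       # count observed i -> j transitions
        P[i, j] += 1
    rows = P.sum(axis=1, keepdims=True)
    return np.divide(P, rows, out=np.zeros_like(P), where=rows > 0)

def predict_next_state(states, m):
    """Step 4: the most probable state after one transition from the current state."""
    P = transition_matrix(states, m)
    return int(np.argmax(P[states[-1]]))

# usage with a toy residual-state sequence of three states labelled 0, 1, 2
# print(predict_next_state([0, 1, 2, 1, 0, 1, 2, 2], m=3))
```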

Methodology of improved grey prediction model

The grey power model

The GPM(λ,1,1) is an extension of the traditional GM(1,1). In this paper, the power exponent of GPM(λ,1,1) is analyzed according to the information covering principle of grey system, and the following definitions are given.

Definition 2

(The grey power model) Assume \(X^{\left(0\right)}\) is a non-negative unimodal raw data sequence, \(X^{\left(1\right)}\) is the 1-AGO sequence of \(X^{\left(0\right)}\), and \(Z^{\left(1\right)}\) is the mean-generated (background) sequence of \(X^{\left(1\right)}\). Then the following nonlinear model, which meets the three conditions of grey modeling, is the grey power model:

$$ x^{\left( 0\right)}\left( k\right)+a\cdot z^{\left( 1\right)}\left( k\right)=b\cdot \left[z^{\left( 1\right)}\left( k\right)\right]^{\lambda} $$

The whitening equation of the grey power model is

$$ \frac{dx^{\left( 1\right)}}{dt}+a\cdot x^{\left( 1\right)}=b\cdot \left[x^{\left( 1\right)}\right]^{\lambda} $$
(5)

Solving the above model by the solution method of GM(1,1), the solution of the whitening equation is

$$ x^{\left( 1\right)}\left( t+1\right) = \left\{e^{-\left( 1-\lambda\right)at}\left[\left( 1 - \lambda\right)\int be^{\left( 1-\lambda\right)at}dt+c\right]\right\}^{\frac{1}{1-\lambda}}. $$
(6)

Parameter analysis of GPM(λ,1,1)

Parameter λ estimation method

The parameter λ is an important coefficient in the GPM(λ,1,1). According to the above formulas, since \(x^{\left(1\right)}\neq 0\), divide both sides of Eq. 5 by \(\left[x^{\left(1\right)}\right]^{\lambda}\) and then take the derivative with respect to t to get Eq. 7 as follows:

$$ \begin{array}{@{}rcl@{}} \frac{d^{2}x^{\left( 1\right)}}{dt^{2}}&\cdot& \left[x^{\left( 1\right)}\right]^{\lambda}-\lambda \cdot \left( \frac{dx^{\left( 1\right)}}{dt}\right)^{2}\left[x^{\left( 1\right)}\right]^{\lambda-1}=-a\left( 1-\lambda\right)\\&\cdot& \left[x^{\left( 1\right)}\right]^{\lambda}\cdot \frac{dx^{\left( 1\right)}}{dt} \end{array} $$
(7)

According to the information coverage principle of grey derivative, we cover \(\frac {dx^{\left (1\right )}}{dt}\) and \(\frac {d^{2}x^{\left (1\right )}}{dt^{2}}\) in Eq. 7 with the first grey derivatives and the second grey derivatives of \(x^{\left (1\right )}\), then we will get

$$ \begin{aligned} &\left[x^{\left( 0\right)}\left( t\right)-x^{\left( 0\right)}\left( t-1\right)\right]\cdot \left[z^{\left( 1\right)}\left( t\right)\right]^{\lambda}\\&-\lambda \cdot \left[x^{\left( 0\right)}\left( t\right)\right]^{2}\cdot\left[z^{\left( 1\right)}\left( t\right)\right]^{\lambda-1}\\ &=-a\left( 1-\lambda\right)\cdot \left[z^{\left( 1\right)}\left( t\right)\right]^{\lambda}\cdot x^{\left( 0\right)}\left( t\right) \end{aligned} $$
(8)

Dividing the Eq. 8 with t = k by the Eq. 8 with t = k + 1, we can eliminate the unknown parameter a and get

$$ \begin{aligned} &\frac{\left[x^{\left( 0\right)}\left( k\right)-x^{\left( 0\right)}\left( k-1\right)\right]\cdot \left[z^{\left( 1\right)}\left( k\right)\right]^{\lambda}-\lambda \cdot \left[x^{\left( 0\right)}\left( k\right)\right]^{2}\cdot\left[z^{\left( 1\right)}\left( k\right)\right]^{\lambda-1}}{\left[x^{\left( 0\right)}\left( k+1\right)-x^{\left( 0\right)}\left( k\right)\right]\cdot \left[z^{\left( 1\right)}\left( k+1\right)\right]^{\lambda}-\lambda \cdot \left[x^{\left( 0\right)}\left( k+1\right)\right]^{2}\cdot\left[z^{\left( 1\right)}\left( k+1\right)\right]^{\lambda-1}}\\ &=\frac{\left[z^{\left( 1\right)}\left( k\right)\right]^{\lambda}\cdot x^{\left( 0\right)}\left( k\right)}{\left[z^{\left( 1\right)}\left( k+1\right)\right]^{\lambda}\cdot x^{\left( 0\right)}\left( k+1\right)} \end{aligned} $$
(9)

It follows from Eq. 9 that

$$ \begin{aligned} \lambda=&\frac{\left[x^{\left( 0\right)}\left( k+1\right)-x^{\left( 0\right)}\left( k\right)\right]\cdot z^{\left( 1\right)}\left( k+1\right)\cdot z^{\left( 1\right)}\left( k\right)\cdot x^{\left( 0\right)}\left( k\right)}{\left[x^{\left( 0\right)}\left( k+1\right)\right]^{2}\cdot z^{\left( 1\right)}\left( k\right)\cdot x^{\left( 0\right)}\left( k\right)-\left[x^{\left( 0\right)}\left( k\right)\right]^{2}\cdot z^{\left( 1\right)}\left( k+1\right)\cdot x^{\left( 0\right)}\left( k+1\right)}\\ &-\frac{\left[x^{\left( 0\right)}\left( k\right)-x^{\left( 0\right)}\left( k-1\right)\right]\cdot z^{\left( 1\right)}\left( k\right)\cdot z^{\left( 1\right)}\left( k+1\right)\cdot x^{\left( 0\right)}\left( k+1\right)}{\left[x^{\left( 0\right)}\left( k+1\right)\right]^{2}\cdot z^{\left( 1\right)}\left( k\right)\cdot x^{\left( 0\right)}\left( k\right)-\left[x^{\left( 0\right)}\left( k\right)\right]^{2}\cdot z^{\left( 1\right)}\left( k+1\right)\cdot x^{\left( 0\right)}\left( k+1\right)} \end{aligned} $$
(10)

From the expression of λ, we can see that it reflects not only the grey derivative of the original data but also the role of the grey integral. When k = 2,3,⋯ ,n − 1, the corresponding \(\left(n-2\right)\) values of λ can be computed, denoted {λk}.

Let \(g\left (\lambda \right )=\sum \limits _{k=2}^{n-1}{\left (\lambda -\lambda _{k}\right )^{2}}\), the value of λ that makes \(g\left (\lambda \right )\) take the minimum value is the constant value to be determined.

Since \(g\left(\lambda\right)\) is an upward-opening parabola, according to the first-order condition for an unconstrained extremum, the optimal value of λ is

$$ \begin{aligned} \hat{\lambda}=\frac{1}{n-2}\sum\limits_{k=2}^{n-1}{\lambda_{k}}. \end{aligned} $$
(11)

In this case, \(g\left (\hat {\lambda }\right )\) takes the minimum value.
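A short Python sketch of Eqs. 10–11, computing the point estimates λ_k for k = 2,…,n−1 and their mean λ̂ (zero-based array indices; the function name is an illustrative assumption):

```python
import numpy as np

def estimate_lambda(x0):
    """Estimate the power exponent lambda as the mean of the point estimates of Eq. 10."""
    x0 = np.asarray(x0, dtype=float)
    n = len(x0)
    x1 = np.cumsum(x0)
    z1 = 0.5 * (x1[1:] + x1[:-1])                    # z^(1)(k) for k = 2..n (index k-2)
    lam = []
    for k in range(2, n):                            # k = 2, ..., n-1 in 1-based notation
        xk, xk1, xkm1 = x0[k - 1], x0[k], x0[k - 2]  # x^(0)(k), x^(0)(k+1), x^(0)(k-1)
        zk, zk1 = z1[k - 2], z1[k - 1]               # z^(1)(k), z^(1)(k+1)
        num = (xk1 - xk) * zk1 * zk * xk - (xk - xkm1) * zk * zk1 * xk1
        den = xk1 ** 2 * zk * xk - xk ** 2 * zk1 * xk1
        lam.append(num / den)                        # lambda_k of Eq. 10
    return float(np.mean(lam))                       # lambda_hat of Eq. 11
```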

Estimates of parameters a and b

After the optimal value of λ is determined, the parameters a and b can be estimated directly by the least squares method. Then, we obtain Theorem 1.

Theorem 1

Assuming \(X^{\left(0\right)}\), \(X^{\left(1\right)}\) and \(Z^{\left(1\right)}\) are as defined in Definition 2, the least squares estimate of the parameter vector in GPM(λ,1,1) is

$$ \begin{pmatrix}\hat{a}\\\hat{b}\end{pmatrix}=\begin{pmatrix}B^{T}B\end{pmatrix}^{-1}B^{T}Y, $$
(12)

where \(B=\begin{pmatrix}-z^{\left(1\right)}\left(2\right)&\left[z^{\left(1\right)}\left(2\right)\right]^{\hat{\lambda}}\\ -z^{\left(1\right)}\left(3\right)&\left[z^{\left(1\right)}\left(3\right)\right]^{\hat{\lambda}}\\{\vdots} & {\vdots} \\ -z^{\left(1\right)}\left(n\right)&\left[z^{\left(1\right)}\left(n\right)\right]^{\hat{\lambda}}\end{pmatrix}\), \(Y=\left(x^{\left(0\right)}\left(2\right),x^{\left(0\right)}\left(3\right),\cdots ,x^{\left(0\right)}\left(n\right)\right)^{T}\).

Solution of GPM(λ,1,1)

According to Eq. 12 and the estimation results of parameters, we can simplify it to get \(\hat {x}^{\left (1\right )}\left (t+1\right )=\left [c\cdot e^{-\left (1-\hat {\lambda }\right )\hat {a}t}+\left . \hat {b} \middle / \hat {a} \right .\right ]^{\frac {1}{1-\hat {\lambda }}}\). If the initial value \(\hat {x}^{\left (1\right )}\left (1\right )=x^{\left (0\right )}\left (1\right )\), then the solution of GPM(λ,1,1) is

$$ \begin{array}{@{}rcl@{}} \hat{x}^{\left( 1\right)}\left( t+1\right)&=&\left\{\left[\left( x^{\left( 0\right)}\left( 1\right)\right)^{1-\hat{\lambda}}-\left. \hat{b} \middle/ \hat{a} \right.\right]\right.\\&&\left.\cdot e^{-\left( 1-\hat{\lambda}\right)\hat{a}t}+\left. \hat{b} \middle/ \hat{a} \right.\right\}^{\frac{1}{1-\hat{\lambda}}}. \end{array} $$
(13)
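Combining Theorem 1 with Eq. 13, a sketch of fitting GPM(λ,1,1) and evaluating its time response might look as follows (it reuses the estimate_lambda sketch above; all names are illustrative):

```python
import numpy as np

def fit_gpm(x0):
    """Estimate (lambda, a, b) of GPM(lambda,1,1) via Eq. 11 and Theorem 1."""
    x0 = np.asarray(x0, dtype=float)
    x1 = np.cumsum(x0)
    z1 = 0.5 * (x1[1:] + x1[:-1])
    lam = estimate_lambda(x0)                       # power exponent lambda_hat
    B = np.column_stack([-z1, z1 ** lam])           # design matrix of Theorem 1
    Y = x0[1:]
    a, b = np.linalg.lstsq(B, Y, rcond=None)[0]
    return lam, a, b

def gpm_x1(x0, lam, a, b, t):
    """Time response of Eq. 13: x1_hat(t+1) for t = 0, 1, 2, ..."""
    c = x0[0] ** (1 - lam) - b / a
    return (c * np.exp(-(1 - lam) * a * t) + b / a) ** (1 / (1 - lam))

# original-domain values follow by differencing: x0_hat(t+1) = x1_hat(t+1) - x1_hat(t)
```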

Rolling modeling mechanism

In practical applications, a model built once on fixed data easily produces unacceptable errors. In order to reduce these errors, the rolling mechanism is introduced. The length of the training data set is set as c, and the prediction period of rolling modeling is set as d. The steps are as follows:

Step 1: The original sequence \(\left \{x^{\left (0\right )}\left (1\right ),x^{\left (0\right )}\right .\left (2\right ),\cdots ,\) \(\left .x^{\left (0\right )}\left (c\right )\right \}\) is used to model and the d-period prediction value \(\left \{\hat {x}^{\left (0\right )}\left (c+1\right ),\hat {x}^{\left (0\right )}\left (c+2\right ),\cdots ,\hat {x}^{\left (0\right )}\left (c+d\right )\right \}\) is obtained;

Step 2: When predicting the sequence \(\{\hat {x}^{\left (0\right )}(c+d+1),\hat {x}^{\left (0\right )}\left (c+d+2\right ),\cdots ,\hat {x}^{\left (0\right )}\left (c+2d\right )\}\), we use the latest c data points \(\left \{\hat {x}^{\left (0\right )}\left (d+1\right ),\hat {x}^{\left (0\right )}\left (d+2\right ),\cdots ,\hat {x}^{\left (0\right )}\left (d+c\right )\right \}\) to predict;

Step 3: Repeat step 2 and use the latest sequence to predict the next set of d data points until the required data points are predicted.
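A sketch of the rolling loop in Steps 1–3, parameterised by the window length c and the prediction period d; it can wrap any single-series forecaster, for example the gm11 sketch above (the exact window handling is an assumption consistent with the steps):

```python
def rolling_forecast(series, forecaster, c, d, n_periods):
    """Roll a window of length c forward, forecasting d points per period (Steps 1-3)."""
    history = list(series[-c:])                     # latest c data points
    forecasts = []
    for _ in range(n_periods):
        window = history[-c:]                       # "new information priority"
        new = list(forecaster(window, horizon=d))[-d:]   # d-step-ahead prediction
        forecasts.extend(new)
        history.extend(new)                         # slide the window forward
    return forecasts

# usage: two one-year-ahead rolling forecasts with the GM(1,1) sketch and c = 8
# print(rolling_forecast(series, gm11, c=8, d=1, n_periods=2))
```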

The flow chart is shown in Fig. 2.

Fig. 2
figure 2

The rolling mechanism

Building the RGPMM(λ,1,1)

Due to the complexity of the real situation, there will always be a certain difference between the fitted values obtained by GPM(λ,1,1) and the real values, so the grey fitting accuracy index is random and non-stationary. In order to correct the prediction results and improve the prediction accuracy of GPM(λ,1,1), the fluctuation of the grey fitting accuracy index is analyzed and predicted by Markov theory in this paper. Combined with the rolling mechanism, future electricity consumption is forecasted. The steps of RGPMM(λ,1,1) are as follows, and the flow chart is shown in Fig. 3.

Fig. 3
figure 3

The flow chart of RGPMM(λ,1,1)

Step 1: Calculate fitted values and predicted values

According to the time series \(X^{\left(0\right)}=\left\{x^{\left(0\right)}\left(1\right),x^{\left(0\right)}\left(2\right),\cdots ,x^{\left(0\right)}\left(n\right)\right\}\), GPM(λ,1,1) is established to obtain

$$ \begin{aligned} \hat{x}^{\left( 0\right)}\left( t+1\right)=&\left\{\left[\left( x^{\left( 0\right)}\left( 1\right)\right)^{1-\hat{\lambda}}-\frac{\hat{b}}{\hat{a}}\right]\cdot e^{-\left( 1-\hat{\lambda}\right)\hat{a}t}+\frac{\hat{b}}{\hat{a}}\right\}^{\frac{1}{1-\hat{\lambda}}}\\ & - \left\{\left[\left( x^{\left( 0\right)}\left( 1\right)\right)^{1-\hat{\lambda}} - \frac{\hat{b}}{\hat{a}}\right]\cdot e^{-\left( 1-\hat{\lambda}\right)\hat{a}(t-1)}+\frac{\hat{b}}{\hat{a}}\right\}^{\frac{1}{1-\hat{\lambda}}} \end{aligned} $$
(14)

where \(\hat {x}^{(0)}(t) (t\le n)\) are called fitted values, and \(\hat {x}^{(0)}(t)(t>n)\) are called predicted values.

Step 2: Calculate the grey fitting accuracy index

The grey fitting accuracy index is set to \(Y\left (t\right )=\left . x^{\left (0\right )}\left (t\right ) \middle / \hat {x}^{\left (0\right )}\left (t\right ) \right .\), which reflects the deviation degree of the data fitted by model from the original data.

Step 3: Division of state interval

Suppose \(Y\left(t\right)\) is divided into m states \(E_{i}=\left[\otimes_{1i}, \otimes_{2i}\right], i=1,2,\cdots ,m\). The grey elements ⊗1i and ⊗2i are the lower and upper bounds of the i-th state, where \(\otimes_{1i}=Y\left(t\right)+a_{i}\times\overline{Y}\), \(\otimes_{2i}=Y\left(t\right)+b_{i}\times \overline{Y}\), \(\overline{Y}=\frac{1}{n}\cdot {\sum}_{i=1}^{n}Y(i)\). Here ai and bi are constants that need to be determined based on experience and the data.

Considering the limited amount of electricity consumption data in this paper, it is more appropriate to use the cluster analysis to determine the number of classification classes and the classification interval.
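A minimal sketch of Steps 2–3: compute the grey fitting accuracy index Y(t) and assign each value to one of m states. The quantile-based bounds used here are only a stand-in for the Q-type cluster analysis adopted in the paper:

```python
import numpy as np

def accuracy_index(x0, x0_hat):
    """Grey fitting accuracy index Y(t) = x^(0)(t) / x_hat^(0)(t)."""
    return np.asarray(x0, dtype=float) / np.asarray(x0_hat, dtype=float)

def divide_states(Y, m):
    """Split Y into m states; quantile bounds stand in for the cluster-based division."""
    bounds = np.quantile(Y, np.linspace(0, 1, m + 1))
    states = np.digitize(Y, bounds[1:-1])           # state labels 0..m-1
    intervals = list(zip(bounds[:-1], bounds[1:]))  # (lower, upper) bound of each state
    return states, intervals
```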

Step 4: Establish a state transition matrix

The transition probability of state Ei to state Ej is

$$ p_{ij}\left( \omega\right)=\frac{M_{ij}\left( \omega\right)}{M_{i}}. $$

where, \(M_{ij}\left (\omega \right )\) is the number of samples \(Y\left (t\right )\) transferred from the state of Ei to the state of Ej through ω steps; Mi is the total number of occurrences of Ei, and satisfies \({\sum }_{j=1}^{m}p_{ij}\left (\omega \right )=1\), i,j = 1,2,⋯ ,m. Therefore, \(p_{ij}\left (\omega \right )\) reflects the probability of transition from Ei to Ej through ω steps.

The state transition probability matrix is

$$ R\left( \omega\right)=\left( \begin{array}{cccc} p_{11}\left( \omega\right)&p_{12}\left( \omega\right)&\cdots&p_{1m}\left( \omega\right)\\ p_{21}\left( \omega\right)&p_{22}\left( \omega\right)&\cdots&p_{2m}\left( \omega\right)\\ {\vdots} & {\vdots} & {\ddots} & {\vdots} \\ p_{m1}\left( \omega\right)&p_{m2}\left( \omega\right)&\cdots&p_{mm}\left( \omega\right) \end{array}\right) $$

The \(R\left (\omega \right )\) reflects the transfer law between the various states of the system. By examining \(R\left (\omega \right )\) and the current state, we can predict the future development and change of the system.

The autocorrelation coefficient of each order is

$$ r_{\omega}=\frac{\sum\limits_{l=1}^{n-\omega}\left[Y\left( l\right)-\overline{Y}\right]\cdot \left[Y\left( l+\omega\right)-\overline{Y}\right]}{\sum\limits_{l=1}^{n}{\left[Y\left( l\right)-\overline{Y}\right]^{2}}} $$

By normalizing rω, the Markov weight of each order is

$$ \theta_{\omega}=\frac{|r_{\omega}|}{\sum\limits_{\omega=1}^{m}|r_{\omega}|}, \omega \le m. $$

where 𝜃ω is the Markov weight of the ω-th order, and the maximum order m is generally taken as the largest order ω for which |rω| ≥ 0.3.
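A sketch of the weighting computation: the lag-ω autocorrelations r_ω of the accuracy index and the normalised Markov weights θ_ω, with the maximum order taken as the largest lag satisfying |r_ω| ≥ 0.3 (this reading of the cut-off rule is an assumption consistent with the case study below):

```python
import numpy as np

def markov_weights(Y, threshold=0.3, max_lag=None):
    """Autocorrelations r_w of the accuracy index and normalised Markov weights theta_w."""
    Y = np.asarray(Y, dtype=float)
    n = len(Y)
    Ybar = Y.mean()
    denom = np.sum((Y - Ybar) ** 2)
    max_lag = max_lag or n - 1
    r = np.array([np.sum((Y[: n - w] - Ybar) * (Y[w:] - Ybar)) / denom
                  for w in range(1, max_lag + 1)])
    # maximum order m: largest lag whose |r_w| reaches the threshold (all lags if none do)
    m = max((w + 1 for w in range(len(r)) if abs(r[w]) >= threshold), default=len(r))
    theta = np.abs(r[:m]) / np.abs(r[:m]).sum()     # normalised weights theta_w
    return r[:m], theta
```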

Step 5: Calculate a more accurate predicted value

The transition probability matrix is used to predict the state interval Ei of the grey fitting precision index. The interval interpolation is used to determine the predicted value. Therefore, \(\tilde {x}^{\left (0\right )}(n+1)=\hat {x}^{(0)}(n+1)\ast \widehat {Y}\left (n+1\right )\), where

$$ \widehat{Y}\left( n+1\right)=\otimes_{1i} \times {\frac{p_{i-1}}{p_{i-1}+ p_{i+1}}}+\otimes_{2i}\times{\frac{p_{i+1}}{p_{i-1}+ p_{i+1}}} $$
(15)
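A sketch of Steps 4–5: combine the ω-step transition rows with the Markov weights, select the most probable state, and interpolate inside its interval as in Eq. 15. Following the worked example later in the paper, the row of the current state is read from each ω-step matrix; this choice and the helper names are assumptions:

```python
import numpy as np

def transition_matrix_w(states, m, w):
    """Empirical w-step transition matrix p_ij(w) = M_ij(w) / M_i (row-normalised counts)."""
    P = np.zeros((m, m))
    for i, j in zip(states[:-w], states[w:]):       # count E_i -> E_j transitions over w steps
        P[i, j] += 1
    rows = P.sum(axis=1, keepdims=True)
    return np.divide(P, rows, out=np.zeros_like(P), where=rows > 0)

def weighted_markov_correction(states, intervals, theta, x0_hat_next, m):
    """Weighted-Markov correction of the next grey prediction (Steps 4-5)."""
    p = np.zeros(m)
    for w, th in enumerate(theta, start=1):
        p += th * transition_matrix_w(states, m, w)[states[-1]]
    i = int(np.argmax(p))                           # most probable state E_i
    lo, hi = intervals[i]
    p_lo = p[i - 1] if i > 0 else 0.0               # probabilities of the adjacent states
    p_hi = p[i + 1] if i < m - 1 else 0.0
    if p_lo + p_hi == 0:                            # fall back to the interval midpoint
        y_hat = 0.5 * (lo + hi)
    else:
        y_hat = (lo * p_lo + hi * p_hi) / (p_lo + p_hi)   # interpolation of Eq. 15
    return x0_hat_next * y_hat                      # corrected predicted value
```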

Step 6: Calculate more accurate predicted values in the next few years

Through the rolling mechanism, the input data are updated and the RGPMM(λ,1,1) is re-established to forecast the next year’s value. Continue the above process until all desired values have been forecasted.

Evaluation metrics

Three kinds of evaluation metrics are used to evaluate the prediction accuracy. Only when all three metrics are passed can the RGPMM(λ,1,1) be used for prediction and its predicted values be regarded as meaningful references. The three evaluation metrics are as follows:

A: The residual test

Three statistical indicators are determined, namely MAE (Mean Absolute Error) (Hamzaçebi 2007), MAPE (Mean Absolute Percentage Error) (Azadeh et al. 2008), and RMSE (Root Mean Squared Error) (Geem and Roper 2009). The formulas for MAE, MAPE, and RMSE are as follows:

$$ \renewcommand{\arraystretch}{1} \begin{array}{rl} &MAE=\frac{1}{n}\cdot{\sum}_{i=1}^{n}{\left|x^{(0)}(i)-\hat{x}^{(0)}(i)\right|},\\&MAPE=\frac{1}{n}\cdot{\sum}_{i=1}^{n}{\left|\frac{x^{(0)}(i)-\hat{x}^{(0)}(i)}{x^{(0)}(i)}\right|}\\ &RMSE=\sqrt{\frac{1}{n}\cdot{\sum}_{i=1}^{n}{\big(x^{(0)}(i)-\hat{x}^{(0)}(i)\big)^{2}}} \end{array} $$

where x(0)(i) is the original value at time i, and the \(\hat {x}^{(0)}(i)\) is the fitted value at time i. Table 1 shows criteria of forecasting performance.

Table 1 MAPE criteria for model evaluation
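The three residual metrics can be computed directly; a short sketch assuming NumPy:

```python
import numpy as np

def residual_metrics(actual, fitted):
    """MAE, MAPE (in percent) and RMSE of the fitted values against the original data."""
    actual = np.asarray(actual, dtype=float)
    fitted = np.asarray(fitted, dtype=float)
    err = actual - fitted
    mae = np.mean(np.abs(err))
    mape = np.mean(np.abs(err / actual)) * 100.0
    rmse = np.sqrt(np.mean(err ** 2))
    return mae, mape, rmse
```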

B: The correlation degree

The correlation degree is tested by computing γg:

$$ \begin{array}{@{}rcl@{}} \gamma_{g}&=&\frac{1}{n-1}\cdot{\sum}_{i=1}^{n}\gamma_{g}(i)\\ &=&\frac{1}{n-1}\cdot {\sum}_{i=1}^{n}\frac{\min{\{\Delta(i)\}}+\rho\cdot\max{\{\Delta(i)\}}}{\Delta(i)+\rho\cdot\max{\{\Delta(i)\}}} \end{array} $$

where, ρ = 0.5, \({\Delta }(i)=\left |\hat {x}^{(0)}(i)-x^{(0)}(i)\right |\), i = 1,2,⋯ , n, n is the number of samples.
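A sketch of the correlation degree computation, keeping the 1/(n−1) prefactor exactly as written above:

```python
import numpy as np

def grey_correlation(actual, fitted, rho=0.5):
    """Correlation degree gamma_g between the original and fitted series."""
    delta = np.abs(np.asarray(fitted, dtype=float) - np.asarray(actual, dtype=float))
    gamma_i = (delta.min() + rho * delta.max()) / (delta + rho * delta.max())
    return gamma_i.sum() / (len(gamma_i) - 1)       # 1/(n-1) prefactor as in the text
```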

C: The posterior error test

The process of posterior error test is as follows:

Step 1: Calculate \(S_{0}=\sqrt {\frac {{\sum }_{i=1}^{n}(X^{(0)}(i)-\overline {X}^{(0)})^{2}}{n-1}}\) of {X(0)(i)} and \(S_{1}=\sqrt {\frac {{\sum }_{i=1}^{n}(\varepsilon ^{(0)}(i)-\overline {\varepsilon }^{(0)})^{2}}{n-1}}\) of {ε(0)(i)}, where \(\overline {X}^{(0)}=\frac {1}{n}\cdot {\sum }_{i=1}^{n}X^{(0)}(i)\), \(\varepsilon ^{(0)}(i)=x^{(0)}(i)-\hat {x}^{(0)}(i)\), \(\overline {\varepsilon }^{(0)}=\frac {1}{n}\cdot {\sum }_{i=1}^{n}\varepsilon ^{(0)}(i)\), i = 1,2,⋯ ,n

Step 2: The standard deviation ratio \(C=\frac {S_{1}}{S_{0}}\);

Step 3: The small error probability \(P=P\left\{\left|\varepsilon^{(0)}(i)-\overline{\varepsilon}^{(0)}\right|<0.6745\times S_{0}\right\}\);

Step 4: The discrimination rules are shown in Table 2.

Table 2 The gradation of prediction accuracy
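A sketch of the posterior error test, computing the standard deviation ratio C and the small error probability P defined in Steps 1–3:

```python
import numpy as np

def posterior_error_test(actual, fitted):
    """Posterior error ratio C = S1/S0 and small error probability P."""
    actual = np.asarray(actual, dtype=float)
    fitted = np.asarray(fitted, dtype=float)
    eps = actual - fitted                           # residual sequence epsilon^(0)(i)
    S0 = actual.std(ddof=1)                         # standard deviation of the original data
    S1 = eps.std(ddof=1)                            # standard deviation of the residuals
    C = S1 / S0
    P = np.mean(np.abs(eps - eps.mean()) < 0.6745 * S0)
    return C, P
```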

Case studies on forecasting the total electricity consumption in China

Since forecasting electricity consumption is important for the dispatch and operation of the power system, a forecast of total electricity consumption at the national level is carried out. To demonstrate the prediction accuracy of the RGPMM(λ,1,1) developed in the “Methodology of improved grey prediction model” section, it is compared with the GM(1,1), GM(1,1,Xn) (Bahrami et al. 2014; Dang et al. 2004), OICGM(1,1) (Akdi et al. 2020), and NOGM(1,1) (Ding et al. 2018) models.

Experimental data

The experimental data are from the China Statistical Yearbook published by the National Bureau of Statistics of China (Bureau 2019). The experimental data set includes the total electricity consumption from 2008 to 2017, as shown in Table 3.

Table 3 Total electricity energy consumption in China from 2008 to 2017 (10^12 kWh)

Selecting the input data set and determining its length are prerequisites that affect the accuracy of the prediction model. In this paper, the optimized subset method proposed by Wang et al. (2011) is used to determine the optimal length of the input data set, which gives c = 8. Eight data points are used as input, and the data of the last two years are used to test the prediction performance of the model.

The prediction results of experiment

In Table 3, the original sequence is X(0) = (3.45414, 3.70322, 4.19345, 4.70009, 4.97626, 5.42034, 5.63837, 5.80200, 6.12971, 6.48210). The flow chart of using the RGPM(λ,1,1) model to predict China’s electricity consumption is shown in Fig. 4.

Fig. 4
figure 4

The flow chart of forecasting China’s electricity consumption with RGPM(λ,1,1)

According to Theorem 1 and Formula (11) in the “Methodology of improved grey prediction model” section, the parameters λ and \(\left(\begin{array}{cc} a\\b \end{array}\right)\) are estimated twice by MATLAB programming to predict the total electricity consumption in 2016 and 2017. The calculations and results are as follows:

In 2016, the estimates calculated by MATLAB are \(\hat {\lambda }=0.2539\) and \(\left (\begin {array}{cc} \hat {a}\\\hat {b} \end {array}\right )= \left (\begin {array}{cc} 0.0041\\2.4323 \end {array}\right )\).

Substitute the parameter estimates into Eq. 13, the time response sequence function by RGPM(λ,1,1) model in 2016 is

$$ \begin{array}{@{}rcl@{}} \hat{x}^{(1)}(t+1)&=&\left\{\left[\left( x^{(0)}(1)\right)^{(1-0.2539)}-\frac{2.4323}{0.0041}\right]\right.\\&&\left.\ast e^{-(1-0.2539)\ast 0.0041\ast t}+\frac{2.4323}{0.0041}\right\}^{\frac{1}{(1-0.2539)}} \end{array} $$

Similarly, by entering the new initial value, the estimated values of parameters in 2017 are \(\hat {\lambda }=-0.0035\) and \(\left (\begin {array}{cc} \hat {a}\\\hat {b} \end {array}\right )=\left (\begin {array}{cc} -0.0587\\4.0794 \end {array}\right )\).

Then, the time response sequence function of 2017 is

$$ \begin{array}{@{}rcl@{}} \hat{x}^{(1)}(t+1)&=&\left\{\left[\left( x^{(0)}(1)\right)^{(1+0.0035)}-\frac{4.0794}{-0.0587}\right]\right.\\&&\left.\ast e^{(1+0.0035) \ast 0.0587\ast t}+\frac{4.0794}{-0.0587}\right\}^{\frac{1}{(1+0.0035)}} \end{array} $$

The different parameter values of each step indicate that the RGPM(λ,1,1) can make the prediction result dynamic according to the characteristics of the input data. Figure 5 intuitively shows that the forecast results by RGPM(λ,1,1) are in good agreement with the real values.

Fig. 5
figure 5

The predicted value of RGPM(λ,1,1)

The prediction values of each grey model are shown in Table 4 and Figs. 6 and 7.

Table 4 Comparison of the original data and the predicted values of each model
Fig. 6
figure 6

Comparison of the original data and the predicted values of each model

Fig. 7
figure 7

The histogram of actual values and predicted values for each model

The related parameters and run time of each grey model are shown in Table 5.

Table 5 Related parameters and run time of each model

Table 4 and Figs. 6 and 7 show intuitively that the predicted results of RGPM(λ,1,1) fit the real values better than those of GM(1,1), GM(1,1,Xn), OICGM(1,1), and NOGM(1,1). This result can also be verified by the experimental test indicators described in the “Evaluation metrics” section. The prediction effect and the test results of each grey model are shown in Table 6.

Table 6 The absolute error and the inspection result of grey models

According to Table 6, the six detection indices of GM(1,1) are MAE = 0.6041, MAPE = 10.9339%, RMSE = 0.7006, rg = 0.4991, P = 0.9000, and C = 0.3870. Since MAPE = 10.9339% ∈ [10%, 20%], the prediction ability of GM(1,1) is weaker than that of the other four grey prediction models. Moreover, its P = 0.9000 < 1.0000, which further indicates that the predictive ability of GM(1,1) is weaker than that of GM(1,1,Xn), OICGM(1,1), NOGM(1,1), and RGPM(λ,1,1). This result further shows that the grey model with a non-integer power is more suitable for predicting total electricity consumption, and the intervention of the rolling mechanism further improves the accuracy of the prediction results.

In summary, RGPM(λ,1,1) is superior to the other four grey models in all detection indices; that is, its prediction performance is better than that of the other four models. MAE and MAPE are reduced by several percentage points, indicating that the prediction accuracy of RGPM(λ,1,1) is greatly improved compared with the other four grey models. So, RGPM(λ,1,1) is a better choice for forecasting the country’s total electricity consumption. Moreover, the more accurate the prediction of national electricity consumption, the more useful the forecast information provided to power planners. In order to further improve the prediction performance of the grey power model, the weighted Markov model is used to modify it.

The prediction results of RGPMM(λ,1,1)

The RGPM(λ,1,1) is used to obtain the fitted values \(\hat{x}^{(0)}(t)\), t = 1,2,⋯ ,8, where t = 1–8 correspond to the years 2008–2015, respectively. The grey fitting accuracy index is obtained according to \(Y(t)=\frac{x^{(0)}(t)}{\hat{x}^{(0)}(t)}\) presented in the “Methodology of improved grey prediction model” section, as shown in Table 7.

Table 7 Grey fitting accuracy index and state division

From Table 7, it can be found that the grey fitting accuracy index has a strong volatility, and the weighted Markov model can be used to predict the state of the grey fitting accuracy.

The Q-type clustering (Narasimhan et al. 2005) is used to divide the states. According to the cluster diagram, the grey fitting accuracy indices can be divided into 5 states, denoted {E1, E2, E3, E4, E5}. The clustering results are shown in Fig. 8 and Table 8. Then, the Markov state intervals can be obtained as follows:

$$ \renewcommand{\arraystretch}{1.5} \begin{array}{rl} &E_{1}\in \left[95.7350\%,96.98035\%\right),\\&E_{2}\in \left[96.98035\%,99.5000\%\right),\\&E_{3}\in \left[99.5000\%,101.0000\%\right),\\ &E_{4}\in \left[101.0000\%,101.7210\%\right),\\&E_{5}\in \left[101.7210\%,102.2500\%\right). \end{array} $$
Fig. 8
figure 8

The figure of Q-cluster for 2008–2015

Table 8 The correlation coefficients (r) and Q-cluster results during 2008-2015

According to the grey fitting accuracy index sequence, the autocorrelation coefficients of each order can be calculated as r0 = 1.0000,r1 = 0.2306,r2 = − 0.0694,r3 = − 0.0412,r4 = − 0.4134,r5 = − 0.1910,r6 = − 0.0172, r7 = 0.0016. The autocorrelation diagram is shown in Fig. 9 and Table 8.

Fig. 9
figure 9

Autocorrelation coefficients of each order

According to Fig. 9, when m is taken as 4, the condition \(\left|r_{\omega}\right|\ge 0.3\) is satisfied at the largest lag, with r1 = 0.2306, r2 = − 0.0694, r3 = − 0.0412, and r4 = − 0.4134. Then the Markov weights of each order are obtained as follows:

$$ \renewcommand{\arraystretch}{1.5} \begin{array}{l} \theta_{1}=\frac{0.2306}{0.2306+0.0694+0.0412+0.4134}=0.3056,\\ \theta_{2}=\frac{0.0694}{0.2306+0.0694+0.0412+0.4134}=0.0920,\\ \theta_{3}=\frac{0.0412}{0.2306+0.0694+0.0412+0.4134}=0.0546,\\ \theta_{4}=\frac{0.4134}{0.2306+0.0694+0.0412+0.4134}=0.5478. \end{array} $$
Fig. 10
figure 10

The figure of Q-cluster for 2009–2016

The Markov transition probability matrices of each step are calculated as follows:

$$ \begin{array}{cc} p\left( 1\right)=\left( \begin{array}{ccccc} 0&1&0&0&0\\ 0&0&0&1&0\\ 0.5&0&0&0&0.5 \\ 0&0&1&0&0\\ 0&0&0&1&0 \end{array}\right),&p\left( 2\right)=\left( \begin{array}{ccccc} 0&0&0&1&0\\ 0&0&1&0&0\\ 0&0.5&0&0.5&0\\ 0&0&0&0&1\\ 0&0&1&0&0 \end{array}\right), \\ p\left( 3\right)=\left( \begin{array}{ccccc} 0&0&1&0&0\\ 0&0&0&0&1\\ 0&0&0.5&0.5&0\\ 0&0&0&1&0\\ 0&0&0&0&0 \end{array}\right),& p\left( 4\right)=\left( \begin{array}{ccccc} 0&0&0&0&1\\ 0&0&0&1&0\\ 0&0&1&0&0\\ 0&0&1&0&0\\ 0&0&0&0&0 \end{array}\right). \end{array} $$

The accuracy index of grey fitting is in the state of E3 in 2015, so we can get:

The one-step transition probability vector is (0.5,0,0, 0,0.5), and the corresponding Markov weight is 𝜃1 = 0.3056;

The two-step transition probability vector is (0,0.5,0, 0.5,0), and the Markov weight is 𝜃2 = 0.0920;

The three-step transition probability vector is (0,0,0.5, 0.5,0), and the Markov weight is 𝜃3 = 0.0546;

The four-step transition probability vector is (0,0,1, 0,0), and the Markov weight is 𝜃4 = 0.5478.

According to \(p_{i}=\sum \limits _{\omega =1}^{m}{\theta _{\omega }p_{i}(\omega )}\), the weighted Markov prediction probability is calculated, as shown in Table 9.

Table 9 Calculation results of weighted Markov transition probability

From Table 9, the weighted transition probabilities can be obtained. Based on these results, max{pi} = p3 = 0.5751; that is, the probability of being in state E3 in 2016 is the largest, and its adjacent states are E2 and E4. According to Formula (15), performing linear interpolation on the interval of state E3, the predicted value of the grey fitting accuracy is

$$ \begin{array}{@{}rcl@{}} \widehat{Y}(9)\!&=&\!\otimes_{1i} \times {\frac{0.0460}{0.0460+0.0733}}+\otimes_{2i}\times {\frac{0.0733}{0.0460+0.0733}}\\\!&=&\!100.4300\% . \end{array} $$

Through the correction of the grey fitting accuracy index, a more accurate predicted value for 2016 can be obtained as follows:

$$ \widetilde{x}^{(0)}(9)=\hat{x}^{(0)}(9)\ast \widehat{Y}(9)=6.0682\times 100.4300\%=6.0943 $$

In the modeling process, the rolling mechanism allows the model to make full use of the latest information when forecasting the electricity consumption in 2017. Using the latest data from 2009 to 2016, the proposed RGPMM(λ,1,1) model is built to obtain the predicted value for 2017. The results are as follows:

The grey fitting accuracy indices for 2017 are shown in Table 10.

Table 10 Grey fitting accuracy index of 2017

The clustering result and the autocorrelation diagram are shown in Table 11 and Figs. 10 and 11.

Table 11 The correlation coefficients (r) and Q-cluster results during 2009–2016
Fig. 11
figure 11

Autocorrelation coefficients of each order

After the weighted Markov correction, the more accurate predicted value for 2017 is 6.5258.

Comparing the forecasting performance of RGPM(λ,1,1) and RGPMM(λ,1,1)

The prediction performance of the RGPMM(λ,1,1) model is compared with that of the RGPM(λ,1,1) model. MSE and MAPE are selected to evaluate the predictive performance of the models, and their expressions are as follows:

$$ \renewcommand{\arraystretch}{1.5} \begin{array}{l} \mathit{MSE}=\frac{1}{2}\ast \sum\limits_{i=9}^{10}{(x_{i}-\hat{x}_{i})^{2}},\\ MAPE=\frac{1}{2}\ast \sum\limits_{i=9}^{10}{\left|\frac{x_{i}-\hat{x}_{i}}{x_{i}}\right|\times 100\%} \end{array} $$

MSE and MAPE of each model are calculated. The result is in Table 12.

Table 12 Comparison of prediction effects from the models

In Table 12, it is found that the values of MSE and MAPE by RGPMM(λ,1,1) are smaller than RGPM(λ,1,1). The result shows that the RGPMM(λ,1,1) further improves the accuracy.

Forecast the total electricity consumption in the next 6 years

Because RGPMM(λ,1,1) has been shown to provide accurate predictions, we use it to forecast China’s electricity consumption from 2018 to 2023. During this process, the Q-cluster results and the correlation coefficients of each year are given in Tables 13, 14, 15, 16, 17, and 18 and Figs. 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, and 23.

Table 13 The correlation coefficients (r) and Q-cluster results during 2010-2017
Table 14 The correlation coefficients (r) and Q-cluster results during 2011-2018
Table 15 The correlation coefficients (r) and Q-cluster results during 2012–2019
Table 16 The correlation coefficients (r) and Q-cluster results during 2013–2020
Table 17 The correlation coefficients (r) and Q-cluster results during 2014–2021
Table 18 The correlation coefficients (r) and Q-cluster results during 2015–2022
Fig. 12
figure 12

The figure of Q-cluster during 2010-2017

Fig. 13
figure 13

Autocorrelation coefficients of each order for 2018

Fig. 14
figure 14

The figure of Q-cluster during 2011–2018

Fig. 15
figure 15

Autocorrelation coefficients of each order for 2019

Fig. 16
figure 16

The figure of Q-cluster during 2012–2019

Fig. 17
figure 17

Autocorrelation coefficients of each order for 2020

Fig. 18
figure 18

The figure of Q-cluster during 2013–2020

Fig. 19
figure 19

Autocorrelation coefficients of each order for 2021

Fig. 20
figure 20

The figure of Q-cluster during 2014-2021

Fig. 21
figure 21

Autocorrelation coefficients of each order for 2022

Fig. 22
figure 22

The figure of Q-cluster during 2015–2022

Fig. 23
figure 23

Autocorrelation coefficients of each order for 2023

The prediction results modified by Markov theory for 2018–2023 are shown in Figs. 24, 25, 26, 27, 28, and 29.

Fig. 24
figure 24

The predicted value of RGPMM(λ,1,1) for 2018

Fig. 25
figure 25

The predicted value of RGPMM(λ,1,1) for 2019

Fig. 26
figure 26

The predicted value of RGPMM(λ,1,1) for 2020

Fig. 27
figure 27

The predicted value of RGPMM(λ,1,1) for 2021

Fig. 28
figure 28

The predicted value of RGPMM(λ,1,1) for 2022

Fig. 29
figure 29

The predicted value of RGPMM(λ,1,1) for 2023

The parameters and forecasted results are presented in Table 19.

Table 19 The predicted values (10^12 kWh) and the model parameters of RGPMM(λ,1,1) during 2018–2023

It can be seen from the experimental results in Table 19 that the total electricity consumption will maintain a growth trend. It is estimated that by 2023, electricity consumption will increase from 6.48210 × 10^12 kWh in 2017 to 9.9826 × 10^12 kWh, which is nearly 10^13 kWh and almost twice that of 2012. This prediction result is of great significance to energy planning and policy-making. In order to ensure a secure and stable energy and power supply, it is necessary to accelerate the establishment and improvement of auxiliary services for power allocation, continuously improve the capacity of the power system, and ensure sound power infrastructure in the supply chain. Meanwhile, it is worth noting that such huge consumption will place a heavy burden on energy planning and environmental protection. Therefore, the government needs to take appropriate actions and plans to meet the high energy demand in the future, so as to avoid the waste of resources caused by unreasonable resource arrangement.

Conclusion

Electricity consumption forecast is of great significance to the economic development and the guarantee of people’s life. However, due to the increase in electricity demand and in the complexity of the power system, it is becoming more and more difficult to accurately predict the power consumption. Therefore, it is meaningful to design a prediction method which is suitable for the limited data.

Aiming at the problem of forecasting power demand with limited data, a grey power-Markov forecasting model based on a rolling mechanism is proposed. It predicts electricity consumption on the basis of grey theory by introducing the rolling mechanism and Markov state prediction. According to the research results, the following conclusions can be drawn:

  1) In view of the problems faced by power system load forecasting, RGPMM(λ,1,1) is proposed. The example shows that it achieves a better prediction effect than the traditional GM(1,1) models and improves the prediction accuracy to a certain extent. This model provides a new way to forecast electricity consumption. The structure of the new grey prediction model is simple, the modeling process is easy to operate, and it can readily be applied in other fields.

  2) Total electricity consumption is not only an important indicator of economic development, but also an important input for formulating the energy strategy and related environmental protection policies. The forecast results indicate that China’s total electricity consumption will continue to maintain a strong growth trend in the next few years, which provides a useful reference for formulating the energy strategy and related environmental protection policies.

However, the grey prediction model proposed in this paper also has some limitations. It only solves the univariate prediction problem, and no corresponding solution is proposed for the multivariate prediction problem. Therefore, there is still much work to be done in the future: multivariate grey models can be considered, and the new model can also be used to forecast industrial electricity consumption, agricultural electricity consumption, residential electricity consumption, and other categories.