Abstract
Short-term load forecasting of the power grid can realize the optimal configuration of power generation and dispatch of the power grid which saves energy to the greatest extent and ensures the stable operation of the power system. The power load data is affected by many factors and presents complex volatility. It is difficult for a single prediction method to obtain accurate prediction results. In this paper, a combined optimization prediction method based on Hilbert-Huang transform (HHT) is proposed. By acquiring more regular component sequences of load data, its essential characteristics are explored and then combined with different neural network models for prediction to improve the accuracy and stability of short-term load forecasting. Simulation experiment results verify the prediction accuracy of the combined prediction method.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
With the continuous popularization and development of smart grid, the accuracy of load forecasting is vital for improving the scientificity of power generation and distribution and dispatching in the power system [1]. The short-term load forecasting is used to forecast the load data in the next day or next few days, which is affected by many factors. The forecasting method is continuously improved, and the forecasting accuracy also remains to be further improved. The method for short-term load forecasting is mainly classified as classical forecasting method and modern forecasting method. The classical forecasting method includes empirical forecasting method and traditional forecasting method, of which the machine learning algorithm is used in the modern forecasting method for analysis modeling through historical load data and relevant factors of impact load, such as air temperature and special event. The research methods include support vector machine, random forest and neural network at present [2].
In recent years, the dynamic equilibrium of power supply and demand is provided with higher requirement with the development and reform of power market, so that the requirement for accuracy of short-term load forecasting becomes higher and higher; at the same time, with the continuous renewal of research method in short-term load forecasting field, many experts put forward the method for combination forecasting. The combination forecasting model not only overcomes the limitation of single model algorithm, but also realizes the complementary advantages between different neural networks by virtue of multi-feature characteristics of load data. It is widely applied and deeply researched in the short-term load forecasting field at present. The common thoughts in combination forecasting model include: firstly, combine with different neural network models to realize combination forecasting for load data, such as CNN-LSTM and GRU-NN combination forecasting model [3, 4]; secondly, preprocess the original load sequence, take different methods to extract features, then establish the forecasting model respectively, such as variational mode decomposition (VMD) and local mean decomposition (LMD) selected [5, 6], and execute load forecasting in combination with different forecasting models.
On the basis of existing scientific research achievements of combination forecasting method and characteristics of periodicity, non-linearity, non-stationarity and strong randomness of short-term load forecasting sequence, a short-term load combination forecasting method for power grid based on Hilbert-Huang Transform is proposed in this paper. HHT is a time-frequency analysis method characterized by self-adaptability and decomposing signal locally. Firstly, the load data is decomposed by empirical mode decomposition (EMD) algorithm to get IMF component and get different components of instantaneous frequency by Hilbert conversion; secondly, the different load forecasting models are selected to forecast the high-frequency, medium-frequency and low-frequency components respectively. Such a combination model can not only make full use of the most of character of different components, but also realize the complementary advantage between different forecasting models. The rationality and effectiveness of the combination model are validated by test in this paper.
2 Theoretical Basis of Combination Model
2.1 Empirical Mode Decomposition (\({\varvec{EMD}}\))
HHT include EMD and Hilbert conversion. EMD decomposition is the core part of HHT conversion, as well as the truly innovative point. EMD process is an iterative process of data processing by envelope fit by extreme value. After the data column iterated meets the certain conditions, it will become the intrinsic mode function (IMF), calling as screening. IMF is characterized by two points: 1) In sequence, the number of extreme point is equal to that of zero crossing point or the difference is not greater than 1; 2) At any time, the local value of upper envelope and lower envelope defined by extreme point is zero [7]. The general steps of EMD decomposition include:
-
1)
The maximum value and minimum value of input signal \(x\left( t \right)\) are evaluated to fit the envelope. The curve fitting mentioned here is an important issue in \(EMD\). The effect of the interpolation method selected on envelope fitting will have a direct influence on \(EMD\) decomposition result, and the cubic spline interpolation method is used in this paper.
-
2)
The mean \(m\left( t \right)\) of upper envelope and lower envelope is evaluated, and \(h\left( t \right)\) is evaluated by \(h\left( t \right) = x\left( t \right) - m\left( t \right)\);
-
3)
\(IMF\) end condition of \(h\left( t \right)\) is judged. If two features of \(IMF\) mentioned above are met, \(h\left( t \right)\) is the first \(IMF\) evaluated by decomposition, recorded as \(h_{1} \left( t \right)\); if not, Step 1 and Step 2 are cycled and repeated through assuming \(h\left( t \right) = x\left( t \right)\) as the new sequence;
-
4)
The new sequence \(r\left( t \right)\) is gotten by \(r\left( t \right) = x\left( t \right) - h_{n} \left( t \right)\), and the decomposition end condition is judged. If it is met, the decomposition is ended; \(r\left( t \right)\) is the residual component, and n \(IMF\) components are gotten by decomposition; if not, the above steps are cycled and repeated through assuming \(r\left( t \right) = x\left( t \right)\).
A certain number of \(IMF\) and residual component \(r\) are gotten through decomposition of original sequence signal \(x\left( t \right)\), of which \(x\left( t \right)\) is expressed as:
wherein, \(h_{i} \left( t \right)\) is the \(i{ }IMF{ }\) component, and \(r\left( t \right){ }\) is the residual component. Each \(IMF\) component gotten here is the independent data sequence of a characteristic scale [8].
2.2 Hilbert Spectrum Analysis
After \(EMD\) decomposition screening, \(Hilbert\) conversion is applied in each independent \(IMF\) component, so as to get the instantaneous frequency and instantaneous amplitude to analyze the components. \(Hilbert\) conversion can realize the \(90^{^\circ }\) phase shift of base frequency and harmonic wave accurately, and remain constant amplitude [9]. With regard to the given signal \(x\left( t \right)\), \(Hilbert\) conversion may be defined as:
The analytic signal is gotten through \(Hilbert\) conversion for signal \(x\left( t \right)\):
So the instantaneous frequency \(\omega\):
Through \(Hilbert\) conversion, the analytic function of each IMF component \(h_{i} \left(t \right)\):
The instantaneous frequency and instantaneous amplitude of each \(IMF\) component are gotten from Eq. (4, 5). In essence, \(Hilbert\) conversion shows the optimal approximation degree of local signal and sine function, and its localization feature is further reinforced in the differential operation of solving instantaneous frequency [10].
2.3 RBF Neural Network
\(RBF\) neural network can approximate arbitrary nonlinear function with good function approximation function, and is widely applied in the load forecasting aspect by virtue of characteristics of simple structure and rapid learning convergence rate. \(RBF\) neural network based on Gaussian kernel is used [11]. It is assumed that the input vector is \(n\) dimension, and is denoted as \(x = \left( {x_{1} ,x_{2} , \ldots ,x_{n} } \right)^{T}\). In addition, there are \(k\) hidden nodes and \(m\) outputs in the model. \(h_{i} \left( x \right)\) represents the \(i\) hidden layer node. The Gaussian function is used for conversion of space mapping of input information as the kernel function of hidden layer neuron:
\(\delta\) is the extended constant. When the vector is input to neural network through \(Gaussian\) radial basis function, the output of the \(j\) node of hidden layer:
wherein, \(c_{j}\) is the center of Gaussian function of the \(j\) hidden layer; \(\parallel \cdot\parallel \) is Euclidean norm, and \(\sigma_{j}\) is the width of Gaussian function of the \(j\) hidden layer. The output of \(RBF\) neural network:
wherein, \(w_{j}\) is the network connection weight between the \(j\) hidden layer node and output layer [18]. \(RBF\) neural network forecasting model can turn the nonlinear mapping from input layer to hidden layer into the linear mapping on the other space, and forecast the signal with high frequency, large volatility and strong randomness better [19].
2.4 LSTM Recurrent Neural Network
LSTM is an improved structure proposed for easy gradient vanishing and gradient explosion of common \(RNN\) in practical training. It is a mechanism which leads in cell gate from neuron of standard \(RNN\) model, which consists of input gate, output gate and forget gate [12]. The forget gate is used to decide the forget and update of transitional information. LSTM cell structure is as shown in Fig. 1.
LSTM model can decide which information is forgotten and updated to constitute the long-term and short-term memory network through gate mechanism in cell structure. According to LSTM cell structure chart, \(C_{t}\) is the cell state at \(t\) time; \(x_{t}\) is the input at \(t{ }\) time; \(h_{t}\) is the output at \(t\) time, and \(f_{t}\), \(i_{t} \; and \; o_{t}\) are output of forget gate, input gate and output gate respectively. The operation process of concrete cell structure:
wherein, \(W_{f}\), \(W_{i}\), \(W_{c}\) and \(W_{o}\) are weight matrixes, and \(b_{f}\), \(b_{i}\), \(b_{c}\) and \(b_{o}\) are bias vectors. \(LSTM\) model is optimized by “forget gate” additionally, so as to control the convergence of gradient during training data, and solve the gradient vanishing or gradient explosion better.
3 Short-Term Load Forecasting Combination Model Based on HHT
The short-term load data of power grid is affected by human production and life, change of meteorological condition, economic factor, political factor, etc. The system load data includes multiple characteristics for analysis and forecasting, and it is difficult to obtain the essential characteristics. In order to further explore the inherent law of load data, the short-term load forecasting combination model of power grid based on \(HHT\) is established to decompose the load data as a certain amount of \(IMF\) by \(EMD\) algorithm, and then convert and process each component alone by \(Hilbert\) conversion, so as to get different instantaneous frequencies and instantaneous amplitudes. According to different characteristics of \(IMF\), the different neural network models are selected for forecasting, and the result is overlapped to get the forecasted value of load in the end. In the meantime, due to large influence of change of air temperature on fluctuation of load data, the accuracy of load forecasting is promoted in combination with the correlation between temperature data of the region and \(IMF\) component.
3.1 Hilbert-Huang Transform (HHT) of Load Data
The data sample in March 2021 of a region in East China is selected for test. The load data curve is shown in Fig. 2. Firstly, the load sequence is provided with \(EMD\) decomposition, then the envelope is fitted with cubic spline interpolation method, and a total of 7 \(IMF\) components and a residual component \(r\) are decomposed. The concrete result is as shown in Fig. 3.
The frequency of \(IMF_{1}\) component and \(IMF_{2}\) component is high in Fig. 3, but that of \(IMF_{3}\) to \(IMF_{7}\) is decreased progressively in contrast. In order to further analyze each component, \(Hilbert\) conversion is also applied to obtain the concrete instantaneous frequency curve chart of each component, as shown in Fig. 4.
The mean frequency of each \(IMF\) component is further calculated in Table 1. It shows that each \(IMF\) component owns different frequency characteristics in Fig. 4 and Table 1. It is decreased progressively, and the mean value calculated is also reduced in turn. \(IMF_{1}\) to \(IMF_{3}\) is characterized by large fluctuation, strong randomness and high frequency upon calculation as the random part of load; \(IMF_{4}\) to \(IMF_{5}\) is characterized by steady fluctuation trend and mean frequency decrease, which represents the periodicity of load; eventually, according to the calculation result of mean frequency in Table 1, \(IMF_{6}\), \(IMF_{7}\) and residual term \(r\) approach to zero, which represents the trend component of load. The components are divided into random component, periodic component and trend component according to their characteristics. On one hand, they reduce the difficulty of building forecasting model; on the other hand, they emphasize on different characteristics of each component. The model is built by combination of \(RBF\) neural network model and recurrent neural network based on \(LSTM\). It not only takes advantage of high learning rate of \(RBF\) neural network to process the data signal with large volatility and high frequency, but also processes the problems with strong periodicity and highly correlated with time sequence in combination with \(LSTM\) to effectively improve the forecasting accuracy.
3.2 Correlation Analysis of \({\varvec{IMF}}\) Component Temperature
The power load data includes multiple properties of power utilizations, i.e., industrial load, appliance load and transportation load. Different \(IMF\) components represent different properties of power utilization data, and the meteorological influence is also different, so it should be analyzed in preliminary data processing in combination with meteorological factors. For example, the air temperature is selected as the representation of meteorological factors, the meteorological factor is integrated to adjust the input data and model parameter through comparison with correlation between different \(IMF\) components and temperatures as well as neural network modeling for different \(IMF\) components.
The correlation coefficient of each \(IMF\) component and air temperature is defined as
wherein, \(c_{i} \left( t \right)\) is air temperature of the corresponding point (\(i = 1,2, \ldots ,n\); \(n\) is total number of \(IMF\) components). The difference in correlation of different seasons is obvious, of which the correlation coefficient of \(IMF\) component and temperature in summer and winter is high, and the load data in March is selected; the correlation of \(IMF\) component and air temperature data gotten by decomposition is small on the whole, and the curve chart of correlation coefficient in Fig. 5 is obtained. Specially, it is shown that the correlation between \(IMF_{1}\) component and \(IMF_{2}\) component and air temperature data scarcely exists. \(IMF_{3}\) and \(IMF_{4}\) are positively correlated with air temperature data, but \(IMF_{5}\) and \(IMF_{7}\) are negatively correlated with air temperature data, so the short-term load forecasting combination model is trained for different \(IMF\) components respectively. As an example of \(IMF_{4}\) component, it shows that \(IMF_{4}\) component is greatly affected by weather in contrast in Fig. 5, and the forecasting difficulty is high. In the short-term load forecasting combination modeling of power grid corresponding to \(IMF_{4}\) component, the proportion of training data, validation data and test data is about 90%, 5% and 5% respectively.
3.3 Short-Term Load Forecasting Combination Model of Power Grid
The load sequence of power system is characterized by volatility and special periodicity, and it is greatly affected by actual scenes, for example, the difference of geographic position and living habit of the southern and northern China, economic and social difference of the first-tier and second-tier cities and the third-tier and fourth-tier cities will cause different periodicities and volatilities of load data due to climate, major events, electricity price fluctuation, etc. The short-term load combination forecasting model of power grid based on \(HHT\) proposed herein is applied to study the essence of load data through decomposition of short-term load data of power grid, and then forecast in combination with the appropriate neural network forecasting model according to characteristics of different components, so as to improve the forecasting accuracy and stability.
The concrete steps of short-term load combination forecasting model of power grid based on \(HHT{ }\) is:
-
(1)
Preprocess the historical load data and specify the evaluation index;
-
(2)
Decompose the load data by \(EMD\) algorithm, provide \(Hilbert\) conversion for \(IMF\) component, and get the instantaneous frequency;
-
(3)
Apply appropriate neural network model for forecasting respectively according to characteristics of different frequencies of each component;
-
(4)
Add the forecasted result of each component to get the final result;
-
(5)
Eventually, get the accuracy index through comparison with the non-compositional method.
4 Simulated Analysis
The short-term load forecasting model is applied, and the load data in March 2021 of one region in eastern China is selected to forecast the load value for 24 h on April 1, 2021 as the training sample, and analyze the accuracy of forecasted result. The curve chart of actual load and forecasted load is as shown in Fig. 6. The error of concrete forecasted result value and relative percentage of forecasting is as shown in Table 2, of which the forecasting percentage error is defined as:
wherein, \(A_{t}\) is the real load value; \(P_{t}\) is the load value gotten by forecasting.
In order to compare with other methods, the mean absolute percentage error (\(MAPE\)) is selected to measure the forecasted result as the evaluation index of short-term load forecasting of power grid.
The result which shows the higher forecasting accuracy and better effect of the method proposed herein is presented in Table 3 by comparison of the method proposed herein and forecasted result of \(RNN\) recurrent neural network and \(LSTM\) recurrent neural network forecasting model for mean load of one region.
In contrast with \(MAPE\) result based on the combination method and network model method based on \(LSTM\) and \(RNN\), although the forecasting accuracy is fluctuated, \(MAPE\) index of combination forecasting method proposed in this paper is basically less than 2%, which is obviously superior to that of recurrent neural network forecasting model based on \(LSTM\) applied alone.
5 Conclusions
In this paper, the short-term load combination forecasting model of power grid based on \(HHT\) is studied. The original load sequence is decomposed by \(EMD\) decomposition algorithm, and then each \(IMF\) component is changed along by \(HT\). According to characteristics of different components and analysis on correlation of air temperature data, it is forecasted in combination with the forecasting model of neural network based on \(RBF\) and recurrent neural network based on \(LSTM\). It not only takes advantage of \(HHT\) to process nonlinear and non-stable signal, but also realizes the complementary advantages between different neural networks. It is discovered that the accuracy of short-term load forecasting combination method of power grid based on \(HHT\) is higher through experimental result. Certainly, the characteristic analysis on IMF component should remain to be further studied and explored in contrast with correlation analysis of other factors, such as social experience factors and holiday activity factors. The combined neural network model also remains to be further improved and tried to promote the load forecasting accuracy to a greater extent.
References
Yao, J.G., Yang, S.C., Gao, Z.H., et al.: Development trend prospects of power dispatching automation system. Autom. Electr. Power Syst. 31(13), 7–11 (2007)
Osman, Z.H., Awad, M.L., Mahmoud, T.K.: Neural network based approach for short-term load forecasting. In: 2009 IEEE/PES Power Systems Conference and Exposition, pp. 1–8. Seattle, WA (2009)
Lu, J.X., Zhang, Q.P., Yang, Z.H., et al.: Short-term load forecasting method based on CNN-LSTM hybrid neural network model. Autom. Electr. Power Syst. 43(8), 131–137 (2019)
Wu, L.Z., Kong, C., Hao, X.H., et al.: A short-term load forecasting method based on GRU-CNN hybrid neural network model. Math. Probl. Eng. 2020, 1–10 (2020)
Liang, Z., Sun, G.Q., Li, H.C., et al.: Short-term load forecasting based on VMD and PSO optimized deep belief network. Power Syst. Technol. 42(2), 598–606 (2018)
Kong, X.Y., Li, C., Zheng, F., et al.: Short-term load forecasting method based on empirical mode decomposition and feature correlation analysis. Autom. Electr. Power Syst. 43(5), 46–51 (2019)
Huang, N.E., Shen, Z., Long, S.R., et al.: The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. In: Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences, pp. 903–995 (1998)
Zhu, Z.H., Sun, Y.L., Ji, Y.: Short-term load forecasting based on EMD and SVM. High Volt. Eng. 33(5), 118–122 (2007)
Huang, C.G.: Hilbert transform and its applications. J. Chengdu Inst. Meteorol. 14(3), 273–276 (1999)
Munir, B.S., Reza, M., Trisetyarso, A., et al.: Feature extraction using Hilbert-Huang transform for power system oscillation measurements. In: 2017 4th International Conference on Information Technology, Computer, and Electrical Engineering (ICITACEE), pp. 93–96. Semarang, Indonesia (2017)
Schilling, R.J., Carroll, J.J., Al-Ajlouni, A.F.: Approximation of nonlinear systems with radial basis function neural networks. IEEE Trans. Neural Netw. 12(1), 1–15 (2001)
Gers, F.A., Schmidhuber, J., Cummins F.: Learning to forget: continual prediction with LSTM. In: Ninth International Conference on Artificial Neural Networks (ICANN), pp. 850–855. Edinburgh, UK (1999)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Zhang, Y., Xia, S., Chen, C., Yang, F., He, X. (2023). Combined Short-Term Load Forecasting Method Based on HHT. In: Deng, DJ., Chao, HC., Chen, JC. (eds) Smart Grid and Internet of Things. SGIoT 2022. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 497. Springer, Cham. https://doi.org/10.1007/978-3-031-31275-5_10
Download citation
DOI: https://doi.org/10.1007/978-3-031-31275-5_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-31274-8
Online ISBN: 978-3-031-31275-5
eBook Packages: Computer ScienceComputer Science (R0)