Predicting photovoltaic power generation using double-layer bidirectional long short-term memory-convolutional network

Sabri, Mohammed; El Hassouni, Mohammed

doi:10.1007/s40095-022-00530-4

Predicting photovoltaic power generation using double-layer bidirectional long short-term memory-convolutional network

Original Research
Published: 28 September 2022

Volume 14, pages 497–510, (2023)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

International Journal of Energy and Environmental Engineering Aims and scope Submit manuscript

Predicting photovoltaic power generation using double-layer bidirectional long short-term memory-convolutional network

Download PDF

Mohammed Sabri¹^na1 &
Mohammed El Hassouni²^na1

355 Accesses
4 Citations
Explore all metrics

Abstract

Accurate photovoltaic (PV) power prediction is critical for PV power plant safety and stability. The main restrictions influencing the accuracy of the PV power forecast are the variability and intermittency of solar energy. Therefore, this study proposes a hybrid deep learning model for PV power forecast that is successfully developed using the combination of the bidirectional long short-term memory (BLSTM) and convolutional neural network (CNN) and is applied to the actual dataset collected in the DKASC PV system in Alice Springs, Australia. The proposed architecture is a structure of two major branches. BLSTM is used first to extract the bidirectional temporal characteristics of PV power. Next, CNN was used to capture the spatial characteristics. The prediction results of the hybrid model are compared with those of the single model LSTM, BLSTM, CNN, gated recurrent unit, recurrent neural network (RNN), and the hybrid network (LSTM–CNN, CNN–LSTM) in order to demonstrate the higher performance of the proposed hybrid prediction model. By comparing statistical performance indicators such as root mean square error (RMSE), mean absolute error (MAE), mean square error (MSE), and coefficient of determination (R$^2$) values with other existing deep learning models, the performance of the proposed BLSTM–CNN model has been demonstrated. The results indicate that the BLSTM–CNN model has the highest precision with the lowest MSE of 0.0089, MAE of 0.0531, RMSE of 0.0944, and highest R² of 0.9993. BLSTM–CNN can enhance forecasting accuracy while also accurately capturing the various temporal–spatial characteristics of PV power.

A hybrid model of CNN and LSTM autoencoder-based short-term PV power generation forecasting

Article Open access 24 January 2024

Hybrid Deep Learning Architecture Approach for Photovoltaic Power Plant Output Prediction

Accurate photovoltaic power forecasting models using deep LSTM-RNN

Article 14 October 2017

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Energy is one of the most important factors in the process of industrialization and modernization, and it plays a critical role in technological and economic progress. Furthermore, massive population growth has exacerbated the global energy crisis. Electricity demand continues to rise, which has a negative influence on the environment [52]. As a result, several governments and regions are enacting laws to encourage the development of renewable energy [30]. Solar energy has become a crucial means of solving environmental and energy concerns due to its clean and plentiful properties [43]. The randomness and intermittent nature of PV power generation, on the other hand, makes integrating it into current energy networks extremely difficult [48]. Accurate PV power prediction is critical for ensuring the power grid’s security and storing alternative energy sources for a reasonable amount of time [2].

Different prediction approaches have been presented in recent years to forecast PV power output, which can be split into four groups: Physical method [36], statistical method [8], machine learning method [47], and hybrid method [32] are four types of PV power predicting techniques that have been established based on various forecasting principles.

The physical model forecasts PV power based on geological variables and meteorological data (i.e., air pressure, humidity, solar radiation, cloud volume, etc.) provided by the meteorological stations. After that, it creates a physical model based on the PV panel parameters, and then directly the PV generation power is calculated. The physical model is less reliant on previous data, but it is more complex to model because there are several unknown parameters [11].

Based on statistical data, time series models predict PV power. The impact of meteorological conditions is not considered throughout the forecasting phase; just the time factor is taken into account. To forecast PV output, a range of statistical approaches have been used, for example, the Markov Chain method [33, 42], gray theory [55], autoregressive and moving average (ARMA) models [6, 25]. A prior understanding of the PV system’s complicated photoelectric conversion link is not required for statistical modeling, but only a partial comprehension and realization using numerous data analysis approaches. However, the statistical approach uses a huge quantity of data to calculate and requires more time. With the rapid growth of artificial intelligence (AI) in recent years, Machine learning models with strong learning capacity and nonlinear mapping capability have been widely employed in PV power forecasting [18]. Furthermore, machine learning models can forecast PV generation power from easily available data, eliminating the need for complicated computations and other costly expenditures. For example, support vector machine (SVM) [31], back propagation neural network (BPNN) [19], extreme learning machine (ELM) [3] and Elman neural network [51]. These approaches forecast the generation capacity of PV power plants only based on historical data, without requiring any knowledge of PV power plants such as the number of panels, panel capacity [34]. Traditional single algorithms frequently neglect the fact that output power changes with a wide range of meteorological variables, which might lead to inaccurate forecasting [12]. Hybrid methods, which combine a variety of effective techniques, are more effective and efficient when compared to other ways for PV generation forecast [7]. Some examples of hybrid models used in PV power prediction are: support vector machine (SVM) and ant colony optimization (ACO) [37], convolution neural network (CNN) and gated recurrent unit (GRU) [40], bidirectional LSTM model with a genetic algorithm (GA) [58]. (SDA–GA–ELM) based on customized similar day analysis (SDA), genetic algorithm, and extreme learning machine [59].

The deep learning theory, introduced by Hinton et al. [15] has recently gained a lot of traction. With the advancement at a rapid pace of artificial intelligence techniques, deep learning models have a wider and more robust nonlinear network structure than classic machine learning models [28]. Some have already produced excellent results in predicting PV power generation, LSTM proposed by Hochreiter and Schmidhuber [16] has been extensively used to forecast PV power. The LSTM is a recursive neural network that can increase the network’s storage space and retain historical data for later use, it has the advantage of detecting long-term time series relationships. Gao et al. [13] proposed an LSTM model based on meteorological data to forecast the daily power production of PV power plants using weather categorization. Chen et al. [4] proposed a new method for very-short-term PV power prediction that combines similar time period collection using RCC (radiation classification coordinate) with LSTM. Lee et al. [27] proposed two models LSTM and GRU to predict PV power generation in a peak zone. The CNN is perfect for processing and analyzing high-dimensional data. For time series forecasting, some researchers employ CNN. For wind and solar energy forecasting, Díaz-Vico et al. [9] employed a CNN with input data from a numerical weather prediction system (NWP), the CNN’s excellent feature extraction capacity is demonstrated by the prediction results. Sabri et al. [41] proposed a new hybrid deep learning model (CNN–GRU) to predict the PV power output, the convolutional layer extracts the characteristics of the input data, while the GRU maintains the crucial details to improve prediction performance.

A new form of LSTM, known as the bidirectional LSTM (BLSTM), has recently been used for classification and regression problems. By structurally incorporating two forward and backward LSTM layers, the BLSTM can consider data from both the past and the future at the same time [24]. The BLSTM network has recently been utilized in electricity price forecasting [5], urban solid waste forecasting [21], and air pollution forecasting [35]. Aside from developing a time series forecasting model with BSLTM, other researchers have attempted to enhance BLSTM’s efficiency by combining the advantages of BLSTM and CNN to improve the prediction effect. Lawal et al. [26] proposed short-term wind forecasting at various elevations above ground level using a hybrid of 1D CNN and the BLSTM network. Unal et al. [46] proposed a spatiotemporal deep learning architecture to forecast energy consumption using the hybrid model CNN–BLSTM. Joseph et al. [22] proposed a novel hybrid deep learning model based on BLSTM and CNN for predicting traffic congestion in a smart city.

Weather forecast data has a restricted forecasting range, and historical PV power time series are non-periodic and non-stationary making classical AI algorithms ineffective. Specifically, to overcome current obstacles and attain the objectives of accurate PV power predicting. The following considerations contribute to the hypothesis of the proposed method in this paper: According to the current study, BLSTM has a high ability to extract bidirectional temporal characteristics, CNN can extract spatial characteristics. It is found that considering the combination of BLSTM and CNN model to predict PV power can achieve more accurate results.

Therefore, a new hybrid model of PV power forecasting, the BLSTM–CNN model, is suggested in this study based on the mechanistic characteristics of time series data. The main research contents of this paper are as follows:

1.
To get acceptable prediction results, the input dataset is placed through a preprocessing step where redundant, outlier, or missing values are eliminated.
2.
A hybrid PV power prediction network is proposed that takes into account the temporal–spatial characteristics extraction order.
3.
The bidirectional temporal characteristics of the data are extracted first using the BLSTM model and then the spatial characteristics of the data are extracted using the CNN model while considering the PV data features.

Methods and materials

Convolutional neural network

A major component of a convolutional neural network is the convolution layer [23]. Convolution layer C and numerous filters are coupled to the input matrix, with each filter holding an $i$ $\times$ $i$ weight matrix. Find the convolution matrix using a filtered scan of the input matrix. The CNN layer can extract local features from high-layer inputs and send them down to lower layers for more sophisticated features [54]. Equation (1) is the result of the vector $y^1_{ij}$ output from the first convolutional layer, where x is the input vector for power production, and n is the number of units per window. The output vector x of the previous layer is used to calculate y. w is the weight of the kernel, $\sigma$ is the activation function, $b^{1}_j$ represents the bias for the $j\mathrm{th}$ feature map, and m is the index value of the filter. The result of Eq. (2) is the vector $y^l_{ij}$ output from the $l_{th}$ convolutional layer.

$$\begin{aligned} y^1_{ij}= & {} \sigma \left( b^1_j+\sum _{m=1}^{M}w_{m.j}^1x^0_{i+m-1.j}\right) \end{aligned}$$

(1)

$$\begin{aligned} y^l_{ij}= & {} \sigma \left( b^l_j+\sum _{m=1}^{M}w_{m.j}^lx^0_{i+m-1.j}\right). \end{aligned}$$

(2)

The pooling layer is a crucial component of CNN, and it is utilized to minimize the convolution matrix’s dimension. Eq. (3) represents the max-pooling layer operation. T is the step that specifies how far the input data area will be relocated, and R is the pooling size that is smaller than y.

$$\begin{aligned} p^l_{ij} = \max _{r\in R } y^{l-1}_{i\times T+r.j} \end{aligned}$$

(3)

Bidirectional long short-term memory neural network

In 1982, Hopfield proposed the recurrent neural network (abbreviated as RNN) [17]. Because of its unique network structure, which differs from traditional neural networks, each component of the RNN maintains the hidden layer parameters, allowing the current component to retain the memory of the information produced by the pre-order components. Figure 1 illustrates a comprehensive overview of RNN. Nevertheless, there is a clear disadvantage to RNN when the data series is too long, or the time interval is too large. The continuous multiplication impact in gradient reverse multiplication causes the vanishing gradient issue, rendering RNNs unable to train effectively. This problem was well handled by Schmidhuber’s LSTM network, which he introduced in 1997 [16]. The forget gate, output gate, and input gate are used to build a memory unit in LSTM [56], which replaces the memory unit in RNN.

The process of the three gates specific to the LSTM is described in the following details:

Forget gate Unwanted data can be discarded if desired. By reading the current moment’s input $x_t$ and the previous moment’s output $h_{t-1}$, and assigning a weight between 0 and 1 to each data in the cell state $C_{t-1}$ at the previous moment, 0 denotes “all discarded” whereas 1 denotes “all retained.” The LSTM network can adjust this weight to improve the model through continual feedback learning. The following is the output $f_t$:

$$\begin{aligned} f_t = \sigma (W_f.[h_{t-1},x_t]+b_f). \end{aligned}$$

(4)

Input gate It is used to figure out what data should be saved in the cell state. The sigmoid layer defines what value needs to be updated, and the output of the input gate is designated as $i_t$. The tanh layer generates new $\tilde{C_{t}}$, that is, ready-to-add information to the cell state:

$$\begin{aligned} i_t= & {} \sigma (W_i.[h_{t-1},x_t]+b_i) \end{aligned}$$

(5)

$$\begin{aligned} \tilde{C_{t}}= & {} tanh(W_c.[h_{t-1},x_t]+b_c). \end{aligned}$$

(6)

Then, multiply the cell state at the previous moment by the forget gate output function $f_t$, and update the cell state at the previous moment. Add the newly generated candidate information and calculate the current unit state as follows:

$$\begin{aligned} C_{t} = f_t*c_{t-1}+i_t *\tilde{C_{t}}. \end{aligned}$$

(7)

Output gate Filter to ensure that just what is required in the cell state is output. It is also split into two layers: The tanh layer updates the cell state requirement to a value between $-1$ and 1. The output is designated as $o_t$, and the sigmoid layer defines which part of the cell state is output, and finally outputs $h_t$:

$$\begin{aligned} o_{t}= \sigma (W_o.[h_{t-1},x_t]+b_o) \end{aligned}$$

(8)

$$\begin{aligned} h_{t}= o_{t}*tanh{(C_{t})} \end{aligned}$$

(9)

where $o_t$ , $i_t$ ,$f_t$ are the output gate, input gate, and the output value of the forget gate, respectively. The $b_{f,i,o}$ and $W_{f,i,o}$ are the bias vectors and weight matrices. $\sigma$ is a sigmoid function.

BLSTM stands for bidirectional LSTM and is commonly employed for natural language processing. In terms of time series forecasting, BLSTM may outperform LSTM. BLSTM is made up of two fundamental LSTMs [14]: a forward LSTM that utilizes past information and a backward LSTM that utilizes future information, allowing information from time t-1 and time t+1 to be utilized at time t. Usually, BLSTM is more efficient than LSTM and RNN in general since both past and future information may be used.

The calculating equation of the $y_t$ :

$$\begin{aligned} y_{t}= g(W_y[h_t;h^{\prime}_t ]+b_y) \end{aligned}$$

(10)

$$\begin{aligned} h_{t}= f(W_t[c_{t-1};x]+b_t) \end{aligned}$$

(11)

$$\begin{aligned} h^{\prime}_{t}= f(W^{\prime}_t[c^{\prime}_{t-1};x]+b^{\prime}_t) \end{aligned}$$

(12)

where $h^{\prime}_{t}$ and $h_t$ is the hidden output of the backward LSTM cell and of the forward LSTM cell at time t, respectively, $W^{\prime}_t$ is the weight matrix of the backward LSTM cell, $W_t$ is the weight matrix of the forward LSTM cell at the time t, $b^{\prime}_t$ is the bias vectors of the backward LSTM cell at the time t, $b_t$ is the bias vectors of the forward LSTM cell at the time t.

BLSTM–CNN hybrid neural networks

Hybrid models, on average, outperform single models. Maintaining the utility of BLSTM and CNN in consideration. We leveraged the complementary capabilities of both models to construct a new operational temporal and spatial extracting features model to predict PV power generation more precisely. In this paper, a hybrid approach called BLSTM–CNN is suggested to forecast PV power generation using a series connection of BLSTM and CNN, as illustrated in Fig. 2. The suggested approach excels at pulling complex characteristics and patterns from weather factors obtained for PV power generation forecasting. The historical time series PV power data is initially fed into the BLSTM model as an input, and the temporal characteristics of the data are extracted utilizing the BLSTM model’s capability of processing time series data. The resulting temporal characteristics are then transmitted to the CNN model input layer to extract the data’s spatial characteristics. A CNN often contains numerous levels of convolutional-pooling layers, with many convolution operations conducted at each level to capture significant data. CNN applies weights to weather factors depending on their effect on PV power. Finally, a fully connected layer is employed to gather the data and forecast the PV power generation using extracted characteristics. The dropout layer is also introduced to the model to minimize model overfitting.

Dataset description

In this study, the PV data from 1B DKASC, Alice Springs PV system was chosen as a case study [10]. For this experiment, data from October 1, 2020, to January 27, 2021, with a resolution of 5 min were chosen. The input parameters are global horizontal radiation ($W/m^2 \times sr$), weather temperature Celsius ($^{\circ }\text {C}$), diffuse horizontal radiation ($W/m^2 \times sr$), current phase average (A), weather relative humidity ($\%$) and wind direction (${\hat{A}}^o$), while the output is set to active power data (kW). To increase the effectiveness and precision of the model forecast, the data must be preprocessed and filtered before being fed into it. Preprocessing involves eliminating abnormal data, completing missing values, and normalizing the data. The data is separated into two parts: 80% for training and 20% for testing. The BLSTM–CNN hybrid model has two primary parts. The first one is the bidirectional long-term dependencies are learned using the temporal modeling tool BLSTM after data preprocessing. The second one is 1D CNN, which is applied to extract the data’s spatial characteristics. In order to evaluate the effectiveness of the proposed BLSTM–CNN. Five single deep learning models CNN, GRU, LSTM, RNN, BLSTM, and two hybrid models LSTM–CNN and CNN–LSTM are also used as comparison models for predicting the output of PV power. The metrics used to measure model prediction efficiency and accuracy are RMSE, MSE, MAE, R$^2$. The experimental results were completed in Python 3.7 and a personal computer with a 64-bit operating system, Intel (R) Core (TM) i7-4600 CPU@2.10GHZ 2.70GHZ and 8.00 GB of RAM. The framework of PV power output forecasting is shown in Fig. 3.

Model evaluation indexes

To compare the performance of various predictive models, we utilize the mean absolute error (MAE), root mean square error (RMSE), mean square error (MSE), and coefficient of determination (R$^2$) [39]. Definitions of these evaluation indexes are as follows.

$$\begin{aligned} \text {MAE}= & {} \frac{1}{N}\sum _{i=1}^{N} \left|y_{i} - \tilde{y_{i}} \right|\end{aligned}$$

(13)

$$\begin{aligned} \text {MSE}= & {} \frac{1}{N}\sum _{i=1}^{N}(y_{i} - \tilde{y_{i}})^{2} \end{aligned}$$

(14)

$$\begin{aligned} \text {RMSE}= & {} \sqrt{\frac{1}{N}\sum _{i=1}^{n}({y_{i}} - \tilde{y_{i}})^{2}} \end{aligned}$$

(15)

$$\begin{aligned} R^2= & {} 1- \frac{\sum _{i=1}^{N}(y_i-\tilde{y_{i}})^2 }{\sum _{i=1}^{N}(y_i-\bar{y_i})^2} \end{aligned}$$

(16)

where $y_{i}$ is the real PV power generation value, $\tilde{y_{i}}$ predicted value and N is the number of $y_{i}$. $\bar{y_i}$ is the average of the real PV power generation in the test set.

Modeling results

Results and comparisons

This research proposes a hybrid model (BLSTM–CNN) for PV power prediction. The BLSTM model was used to extract bidirectional temporal features. Set up two hidden layers using the filtered index data in the BLSTM model, where Units = 128; Units = 256. The obtained temporal characteristics are then sent to the CNN model input layer, which uses the convolutional layer and pooling layer to extract spatial features of the dataset. In the CNN model, 2 layers of convolutional layers and 2 layers of pooling layers are used, there are 128 and 256 convolution kernels, respectively. In the convolutional layer, the kernel size is 3*3. The dropout layer [44] was also included in the model to avoid overfitting issues during training, which could reduce prediction accuracy. The batch size of the proposed model is 500. Finally, two layers of the fully connected layer with 512 and 256 neurons, respectively, output of the PV power generation forecasting result. The parameters settings of the suggested model in this work are shown in Table 1. Five single deep learning models CNN, GRU, LSTM, RNN, BLSTM, and two hybrid models LSTM–CNN and CNN–LSTM were used as comparison models for PV power output forecasting to verify the effectiveness of the suggested BLSTM–CNN model in this work. The RMSE, MSE, MAE, and R$^2$ metrics are used to evaluate model forecasting accuracy and effectiveness.

Table 1 Parameters setting of the proposed method

Full size table

Table 2 The results of the forecasting model

Full size table

Table 3 Statistical values of the experimental PV power (KW) data

Full size table

Table 4 Error evaluation results of BLSTM–CNN and other deep learning models in four datasets

Full size table

Table 2 shows the forecasting results of the eight models. Table 2 shows that the proposed BLSTM–CNN has lower values of RMSE, MSE, MAE values, and the highest value of R² than the other seven models. To make the data in Table 2 more visible. Figure 4 illustrates the results of the various models evaluation criteria. PV power generation data for one day in 2021 were chosen randomly as validation data to test the efficiency of the BLSTM–CNN model. Figure 5a shows a comparison of forecasted and actual values. For all times in the range, the forecasting curves show high consistency with the actual data. As shown in Fig. 5b, the error between the forecasted and actual values is illustrated by the rose curve.

Subsequently, the suggested forecasting model’s performance and stability are tested in four months to ensure the BLSTM–CNN forecasting reliability and efficacy. The collected data is separated into four months: October, November, December, and January. Each month’s data is divided into two parts: 80% for training and 20$\%$ for testing. Table 3 shows the partition of training and testing sets in four months.

Table 4 shows the results of various models CNN, GRU, RNN, LSTM, BLSTM, CNN–LSTM, LSTM–CNN, and BLSTM–CNN for the PV power forecasting in different months October, November, December, and January. The hybrid models outperform the single models in terms of prediction accuracy. In addition, the prediction effect of the BLSTM–CNN hybrid model is better than that of the two hybrid models (LSTM–CNN and CNN–LSTM). The result indicates that BLSTM–CNN outperforms other models in terms of prediction accuracy in four months, although various models can be used to forecast PV power generation, no single model consistently always outperforms the others, which confirms the proposed model’s ability to extract temporal–spatial features, which allows it to create a complicated relationship between input data and target PV power generation. For better visualization, the evaluation criteria result in different months of the various models is also illustrated in Fig. 6.

A four-day from each month was chosen at random for further examination. Figure 7 illustrates the prediction results for these sixteen days using the proposed model and seven comparable models. It is evident that all the models perform well in terms of predictions. The suggested hybrid deep learning model outperforms five single models and two hybrid models in terms of prediction accuracy. The BLSM-CNN curve and the actual value curve are very close and show better prediction performance, especially at night and during peak power.

Comparison of the proposed hybrid BLSTM–CNN model with state-of-the-art methods

The historical data in this paper comes from the DKASC in Australia, Different state-of-the-art methodologies were conducted and examined with solar producing plants from the DKASC in previous studies. The comparison results are shown in Fig. 8.

Chen et al. [4] proposed a simple and efficient RCC-LSTM model for PV power prediction. The RCC (radiation classification coordinate) method as a tool is used for gathering identical time periods, then LSTM is used to extract characteristics from time series PV power data. The data was gathered from the Yulara in Alice Springs for two years (2017-2018) with a resolution of the historical dataset as 5 min. The average MAE value obtained is 0.587.

Zhou et al. [29] proposed a hybrid deep learning model (WPD-LSTM) for short-term PV power forecasting. The data was gathered from DKASC, Alice Springs from June 1, 2014, to June 12, 2016. The average MAE value obtained is 0.2357. Zhou et al. [59] proposed a hybrid model (SDA–GA–ELM) based on extreme learning machine (ELM), genetic algorithm (GA), and customized similar day analysis (SDA) to forecast hourly PV power generation. The dataset was collected from Jan 14, 2017, to Oct 15, 2018, with a resolution of 1 h from DKASC. The average MAE value obtained is 0.2367.

Wang et al. [50] proposed a hybrid model (LSTM–CNN) for PV power forecasting. The data was collected from 1B DKASC, Alice Springs PV for half-year data with a resolution of the historical dataset as 5 min. The average MAE value obtained is 0.2210. Zhen et al. [58] proposed a hybrid model (GA-BLSTM)) for ultra-short-term PV power prediction. The data was collected from 8 PV plants ranging from 2017 to 2019 with resolution of the historical dataset as 5 min. The average MAE value obtained is 0.242. Abdel-Basset et al. [1] proposed a novel deep learning architecture namely (PV-Net), to enable efficient extraction of positional and temporal features in PV power sequences, the gates of the GRU are modified utilizing convolutional layers (named Conv-GRU) for forecasting short-term PV energy production. The data was collected from 1B DKASC, Alice Springs PV throughout five years (2015–2019) with a resolution of the historical dataset as 5 min. The average MAE value obtained is 0.398.

When the above research’s results are compared, the suggested BLSTM–CNN has the minimum MAE value. It is obvious that the suggested model outperforms prior researches and provides higher PV generation predicting performance.

However, the suggested model has a few flaws that must be investigated further. For example, in this research, the structure and training hyper-parameters of the model were found by experimentation, which is time-consuming. As a result, automated settings estimation approaches, like heuristic optimization algorithms, will be utilized in our future research to pick and improve the parameters of the neural network more effectively.

Conclusion

Accurate PV power forecasting plays an important role in the maintenance, control, management, and operation of PV power generation systems. In this research, a novel hybrid PV power generation forecasting model based on a deep learning algorithm namely BLSTM–CNN was suggested to increase the accuracy and reliability of PV power generation forecasting. More specifically, BLSTM automatically extracts bidirectional temporal correlation characteristics of PV data and CNN extracts spatial correlation characteristics of PV data to produce the final PV power forecasting results. Four evaluation measures and five single deep learning (CNN, GRU, RNN, LSTM, BLSTM) and two hybrid models (LSTM–CNN, CNN–LSTM) were employed for the experimental study to validate the proposed model’s predicting performance. The BLSTM–CNN model is proposed and used in a novel way in the field of PV power forecasting with the highest R$^2$ value of 0.9993, the lowest RMSE value of 0.0944, MAE value of 0.0531, MSE value of 0.0089. In terms of forecasting accuracy, the results indicate that the proposed model outperforms other traditional classical models. In the next study, the hybrid model will be combined with more sophisticated deep learning models to extract temporal and spatial features separately, resulting in more precise PV power forecast results. Moreover, the proposed model can also be enhanced and used in other domains, such as wind speed forecasting and residential load forecasting.

Abbreviations

AI:: Artificial intelligence
ARMA:: Autoregressive and moving average
BLSTM:: Bidirectional long short-term memory
BPNN:: Back propagation neural network
CNN:: Convolutional neural network
DKASC:: Desert Knowledge Australia Solar Centre
ELM:: Extreme learning machine
GA:: Genetic algorithm
GRU:: Gated recurrent unit
LSTM:: Long short-term memory
MAE:: Mean absolute error
MSE:: Mean square error
NWP:: Numerical weather prediction
PV:: Photovoltaic
RMSE:: Root mean square error
RCC:: Radiation classification coordinate
RNN:: Recurrent neural network
SVM:: Support vector machine
SDA:: Customized similar day analysis
WPD:: Wavelet packet decomposition
$b^{1}_{j}$ :: Bias for the $j\mathrm{th}$ feature map
$f_t$ :: Output of the forget gate at time t
m :: Index value of the filter
n :: Number of units per window
$i_t$ :: Output of the input gate at time t
$o_t$ :: Output of an LSTM block at time t
R :: Pooling size
$R^2$ :: Coefficient of determination
x :: Input vector for power production
$y^1_{ij}$ :: Output of the first convolutional layer
W :: Weight parameters of layers

References

Abdel-Basset, M., Hawash, H., Chakrabortty, R.K., et al.: Pv-net: an innovative deep learning approach for efficient forecasting of short-term photovoltaic energy production. J. Clean. Prod. 303(127), 037 (2021)
Google Scholar
Agoua, X.G., Girard, R., Kariniotakis, G.: Short-term spatio-temporal forecasting of photovoltaic power production. IEEE Trans. Sustain. Energy 9(2), 538–546 (2017)
Article Google Scholar
Behera, M.K., Majumder, I., Nayak, N.: Solar photovoltaic power forecasting using optimized modified extreme learning machine technique. Eng. Sci. Technol. Int. J. 21(3), 428–438 (2018)
Google Scholar
Chen, B., Lin, P., Lai, Y., et al.: Very-short-term power prediction for PV power plants using a simple and effective RCC-LSTM model based on short term multivariate historical datasets. Electronics 9(2), 289 (2020)
Article Google Scholar
Cheng, H., Ding, X., Zhou, W., et al.: A hybrid electricity price forecasting model with Bayesian optimization for German energy exchange. Int. J. Electr. Power Energy Syst. 110, 653–666 (2019)
Article Google Scholar
Chu, Y., Urquhart, B., Gohari, S.M., et al.: Short-term reforecasting of power output from a 48 MWe solar PV plant. Sol. Energy 112, 68–77 (2015)
Article Google Scholar
Das, U.K., Tey, K.S., Seyedmahmoudian, M., et al.: Forecasting of photovoltaic power generation and model optimization: a review. Renew. Sustain. Energy Rev. 81, 912–928 (2018)
Article Google Scholar
De Giorgi, M.G., Congedo, P.M., Malvoni, M.: Photovoltaic power forecasting using statistical methods: impact of weather data. IET Sci. Meas. Technol. 8(3), 90–97 (2014)
Article Google Scholar
Díaz-Vico, D., Torres-Barrán, A., Omari, A., et al.: Deep neural networks for wind and solar energy prediction. Neural Process. Lett. 46(3), 829–844 (2017)
Article Google Scholar
DKASC Alice Springs (2021) 1B: Trina. http://dkasolarcentre.com.au/locations/alice-springs?source=1B
Dolara, A., Leva, S., Manzolini, G.: Comparison of different physical models for PV power output prediction. Sol. Energy 119, 83–99 (2015)
Article Google Scholar
Du, P., Wang, J., Yang, W., et al.: Multi-step ahead forecasting in electrical power system using a hybrid forecasting system. Renew. Energy 122, 533–550 (2018)
Article Google Scholar
Gao, M., Li, J., Hong, F., et al.: Day-ahead power forecasting in a large-scale photovoltaic plant based on weather classification using LSTM. Energy 187(115), 838 (2019)
Google Scholar
He, Y.L., Chen, L., Gao, Y., et al.: Novel double-layer bidirectional LSTM network with improved attention mechanism for predicting energy consumption. ISA Trans. 127, 350–360 (2022). https://doi.org/10.1016/j.isatra.2021.08.030
Article Google Scholar
Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks. Science 313(5786), 504–507 (2006)
Article MathSciNet MATH Google Scholar
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Hopfield, J.J.: Neural networks and physical systems with emergent collective computational abilities. Proc. Natl. Acad. Sci. 79(8), 2554–2558 (1982)
Article MathSciNet MATH Google Scholar
Hossain, M., Mekhilef, S., Danesh, M., et al.: Application of extreme learning machine for short term output power forecasting of three grid-connected PV systems. J. Clean. Prod. 167, 395–405 (2017)
Article Google Scholar
Hu, Y., Lian, W., Han, Y., et al.: A seasonal model using optimized multi-layer neural networks to forecast power output of PV plants. Energies 11(2), 326 (2018)
Article Google Scholar
Jaihuni, M., Basak, J.K., Khan, F., et al.: A novel recurrent neural network approach in forecasting short term solar irradiance. ISA Trans. 121, 63–74 (2022)
Article Google Scholar
Jammeli, H., Ksantini, R., Abdelaziz, F.B., et al.: Sequential artificial intelligence models to forecast urban solid waste in the city of Sousse, Tunisia. IEEE Trans. Eng. Manag. (2021). https://doi.org/10.1109/TEM.2021.3081609
Article Google Scholar
Joseph, L.L., Goel, P., Jain, A., et al.: A novel hybrid deep learning algorithm for smart city traffic congestion predictions. In: 2021 6th International Conference on Signal Processing, pp. 561–565. Computing and Control (ISPCC), IEEE (2021)
Kim, T.Y., Cho, S.B.: Predicting residential energy consumption using CNN-LSTM neural networks. Energy 182, 72–81 (2019)
Article Google Scholar
Kulshrestha, A., Krishnaswamy, V., Sharma, M.: Bayesian BILSTM approach for tourism demand forecasting. Ann. Tour. Res. 83(102), 925 (2020)
Google Scholar
Lan, H., Zm, Liao, Zhao, Y.: ARMA model of the solar power station based on output prediction. Electr. Meas. Instrum. 48(2), 31–35 (2011)
Google Scholar
Lawal, A., Rehman, S., Alhems, L.M., et al.: Wind speed prediction using hybrid 1d CNN and BLSTM network. IEEE Access 9, 156–679 (2021)
Article Google Scholar
Lee, D., Kim, K.: PV power prediction in a peak zone using recurrent neural networks in the absence of future meteorological information. Renew. Energy 173, 1098–1110 (2021)
Article Google Scholar
Li, C., Tang, G., Xue, X., et al.: The short-term interval prediction of wind power using the deep learning model with gradient descend optimization. Renew. Energy 155, 197–211 (2020)
Article Google Scholar
Li, P., Zhou, K., Lu, X., et al.: A hybrid deep learning model for short-term PV power forecasting. Appl. Energy 259(114), 216 (2020)
Google Scholar
Liu, J.: China’s renewable energy law and policy: a critical review. Renew. Sustain. Energy Rev. 99, 212–219 (2019)
Article Google Scholar
Malvoni, M., De Giorgi, M.G., Congedo, P.M.: Data on support vector machines (SVM) model to forecast photovoltaic power. Data Brief 9, 13–16 (2016)
Article Google Scholar
Mellit, A., Massi Pavan, A., Ogliari, E., et al.: Advanced methods for photovoltaic output power forecasting: a review. Appl. Sci. 10(2), 487 (2020)
Article Google Scholar
Miao, S., Ning, G., Gu, Y., et al.: Markov chain model for solar farm generation and its application to generation performance evaluation. J. Clean. Prod. 186, 905–917 (2018)
Article Google Scholar
Nguyen, N.Q., Bui, L.D., Van Doan, B., et al.: A new method for forecasting energy output of a large-scale solar power plant based on long short-term memory networks a case study in Vietnam. Electr. Power Syst. Res. 199(107), 427 (2021)
Google Scholar
Nguyen Dinh, T., Phan Hoang, N.: Air pollution forecasting using regression models and lstm deep learning models for Vietnam. In: International Conference on Future Data and Security Engineering, Springer, 264–275 (2021)
Ogliari, E., Dolara, A., Manzolini, G., et al.: Physical and hybrid methods comparison for the day ahead PV output power forecast. Renew. Energy 113, 11–21 (2017)
Article Google Scholar
Pan, M., Li, C., Gao, R., et al.: Photovoltaic power forecasting based on a support vector machine with improved ant colony optimization. J. Clean. Prod. 277(123), 948 (2020)
Google Scholar
Peng, T., Zhang, C., Zhou, J., et al.: An integrated framework of Bi-directional long-short term memory (BiLSTM) based on sine cosine algorithm for hourly solar radiation forecasting. Energy 221(119), 887 (2021)
Google Scholar
Sabri, M., El Hassouni, M.: A comparative study of LSTM and RNN for photovoltaic power forecasting. In: Int. Conf. Adv. Technol. Humanity, pp. 265–274. Springer, Cham (2022)
Chapter Google Scholar
Sabri, M., El Hassouni, M.: A novel deep learning approach for short term photovoltaic power forecasting based on GRU-CNN model. In: E3S Web of Conferences, EDP Sciences, 00064 (2022)
Sabri, N.M., El Hassouni, M.: Accurate photovoltaic power prediction models based on deep convolutional neural networks and gated recurrent units. Energy Sourc. Part A Recovery Util. Environ. Eff. 44(3), 6303–6320 (2022)
Google Scholar
Sanjari, M.J., Gooi, H.: Probabilistic forecast of PV power generation based on higher order Markov chain. IEEE Trans. Power Syst. 32(4), 2942–2952 (2016)
Article Google Scholar
Shi, Y., He, W., Zhao, J., et al.: Expected output calculation based on inverse distance weighting and its application in anomaly detection of distributed photovoltaic power stations. J. Clean. Prod. 253, 119965 (2020)
Article Google Scholar
Srivastava, N., Hinton, G., Krizhevsky, A., et al.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929–1958 (2014)
MathSciNet MATH Google Scholar
Tovar, M., Robles, M., Rashid, F.: PV power prediction, using CNN-LSTM hybrid neural network model Case of study: Temixco-Morelos, Mexico. Energies 13(24), 6512 (2020)
Article Google Scholar
Ünal, F., Almalaq, A., Ekici, S.: A novel load forecasting approach based on smart meter data using advance preprocessing and hybrid deep learning. Appl. Sci. 11(6), 2742 (2021)
Article Google Scholar
Voyant, C., Notton, G., Kalogirou, S., et al.: Machine learning methods for solar radiation forecasting: a review. Renew. Energy 105, 569–582 (2017)
Article Google Scholar
Wang, H., Yi, H., Peng, J., et al.: Deterministic and probabilistic forecasting of photovoltaic power based on deep convolutional neural network. Energy Convers. Manag. 153, 409–422 (2017)
Article Google Scholar
Wang, K., Qi, X., Liu, H.: A comparison of day-ahead photovoltaic power forecasting models based on deep learning neural network. Appl. Energy 251(113), 315 (2019)
Google Scholar
Wang, K., Qi, X., Liu, H.: Photovoltaic power forecasting based LSTM-convolutional network. Energy 189(116), 225 (2019)
Google Scholar
Wang, S., Wang, Y., Cheng, Y., et al.: An improved model for power prediction of PV system based on Elman neural networks. In: 2020 Asia Energy and Electrical Engineering Symposium (AEEES), IEEE, 902–907 (2020)
Wang, Y.: The analysis of the impacts of energy consumption on environment and public health in China. Energy 35(11), 4473–4479 (2010)
Article Google Scholar
Wojtkiewicz, J., Hosseini, M., Gottumukkala, R., et al.: Hour-ahead solar irradiance forecasting using multivariate gated recurrent units. Energies 12(21), 4055 (2019)
Article Google Scholar
Wu, Q., Guan, F., Lv, C., et al.: Ultra-short-term multi-step wind power forecasting based on CNN-LSTM. IET Renew. Power Gener. 15(5), 1019–1029 (2021)
Article Google Scholar
Yamada, F., Wazawa, Y., Kobayashi, K., et al.: Prediction of next day solar power generation by gray theory and neural networks. Trans. Inst. Electr. Eng. Jpn B 134(6), 494–500 (2014)
Google Scholar
Yan, Y., Hy, Xing: A sea clutter detection method based on LSTM error frequency domain conversion. Alex. Eng. J. 61(1), 883–891 (2022)
Article Google Scholar
Zhen, H., Niu, D., Yu, M., et al.: A hybrid deep learning model and comparison for wind power forecasting considering temporal-spatial feature extraction. Sustainability 12(22), 9490 (2020)
Article Google Scholar
Zhen, H., Niu, D., Wang, K., et al.: Photovoltaic power forecasting based on GA improved Bi-LSTM in microgrid without meteorological information. Energy 231, 120908 (2021)
Article Google Scholar
Zhou, Y., Zhou, N., Gong, L., et al.: Prediction of photovoltaic power output based on similar day analysis, genetic algorithm and extreme learning machine. Energy 204, 117894 (2020)
Article Google Scholar

Download references

Author information

Mohammed Sabri and Mohammed El Hassouni have contributed equally to this work.

Authors and Affiliations

LRIT, Mohammed V University in Rabat, 1014, Rabat, Morocco
Mohammed Sabri
FLSH, Mohammed V University in Rabat, 1014, Rabat, Morocco
Mohammed El Hassouni

Authors

Mohammed Sabri
View author publications
You can also search for this author in PubMed Google Scholar
Mohammed El Hassouni
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

MS: writing—original draft preparation, methodology, visualization, software, validation, formal analysis. MEH: review and editing.

Corresponding author

Correspondence to Mohammed Sabri.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Sabri, M., El Hassouni, M. Predicting photovoltaic power generation using double-layer bidirectional long short-term memory-convolutional network. Int J Energy Environ Eng 14, 497–510 (2023). https://doi.org/10.1007/s40095-022-00530-4

Download citation

Received: 28 May 2022
Accepted: 07 September 2022
Published: 28 September 2022
Issue Date: September 2023
DOI: https://doi.org/10.1007/s40095-022-00530-4

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Predicting photovoltaic power generation using double-layer bidirectional long short-term memory-convolutional network

Abstract

Similar content being viewed by others

A hybrid model of CNN and LSTM autoencoder-based short-term PV power generation forecasting

Hybrid Deep Learning Architecture Approach for Photovoltaic Power Plant Output Prediction

Accurate photovoltaic power forecasting models using deep LSTM-RNN

Introduction