Keywords

1 Introduction

One of the important challenges we are facing is energy crisis. Out of different renewable sources of energy, as a clean and cost-free, solar energy is very much popular everywhere. One should be conversant with the various components of solar energy for its effective use. There are various components which attribute to the solar energy to different extents like the duration of sunshine, maximum ambient temperature, any day and month, global radiation, latitude and longitude to name a few. The availability of solar energy is predominantly controlled by the global solar radiation; however, the measurement of same is done only at a couple of locations due to the involvement of relatively expensive equipment and the subsequent maintenance charges. Researches to corelate the solar radiation with various climatic condition and geometrical factors have been done in the past and the major models were based on different inputs, equations and algorithms [1]. Models developed on the basis of the actual solar radiation reaching a given area outperformed the meteorological data model. Attempts were made to overcome the limitation of inaccessibility of insolation data in certain regions, and consequently, temperature-based models and other suitable modifications [2, 3] were proposed to overcome the shortcoming. ANN technique based on resilient propagation (RP) [4] with the physical features and atmospheric conditions as inputs resulted in a much better and consistent output compared to the classical modelling. Subsequently, soft computing technique was deployed to study the effectiveness in estimating the solar radiation and was found to be instrumental in dealing with practical problems where the nature is data is far more complex and dynamic because of the noise and the nonlinearity associated with it. AI-based approach which includes artificial neural network (ANN), fuzzy logic, adaptive neuro-fuzzy interface system (ANFIS) and data mining (DM) is some of the methods which have been effectively utilized to yield better results [5] than conventional modelling. Several models based on ANN and regression techniques [6] were compared and the former yielded in much better results.

Attempt has been made to devise an easier model for evaluating the total solar irradiation taking input parameters like month, day, time, latitude, longitude, elevation, slope, azimuth as inputs and global irradiance on the inclined plane (plane of the array) (W/m2) as output for three places in India which are Kolkata, Roorkee and Chennai.

1.1 Dataset

It is already mentioned that this study was carried out using the PVGIS database for three Indian cities Kolkata, Roorkee and Chennai [7]. The parameters used in this study are: day, month, latitude, longitude, elevation, slope, azimuth and global solar irradiance which were collected for six consecutive years (2011–2016). From the total dataset, two separate datasets were created among which the first dataset (for 2011–2015) was used for training, testing and validation and the second dataset (for 2016) has been used for prediction. From the first dataset, out of 1, 31,325 collected data, 70% (91,927) were used for training, 15% (19,698) were used for validation and the rest 15% (19,698) were used for testing. From the second dataset, 8755 data points for each city were used for prediction to evaluate the proposed ANN model. Table1 shows the geographical locations of these three cities Kolkata, Roorkee and Chennai.

Table 1 Geographic information of three cities

1.2 Artificial Neural Network

The ANN model is a mathematical simulation of the way in which the human brain works [8]. It is a highly interconnected network of neurons which is capable of processing the computational data in parallel. The input signal yi, for i = 0, 1, 2, …, n is processed through the intermediate layers of neurons which assign a weightage wij to the inputs followed by an application of sigmoid function resulting in an output equivalent to the weighted mean which is given by Eq. 1.

$$u_{j} = \mathop \sum \limits_{i = 0}^{n} w_{ij} y_{i}$$
(1)

The multi-layer feedforward mechanism (MLF) along with the back-propagation is the most commonly used ANN method used for estimating the solar radiation [9,10,11]. This model can be used for problems which are not linearly separable. Apart from input (i) and one output layer (k), it usually contains one or two intermediate/hidden layer (j) which are interconnected by weights Wij and Wjk and is shown in Fig. 1a, and the basic structure of a single artificial neuron is shown in Fig. 1b.

Fig. 1
figure 1

a Typical structure of ANN b basic structure of a single artificial neuron

A bias is added to the input and a nonlinearity transforms the sum into an output. This is called the activation function of the node. Linear activations are usually present in the output nodes. The logistic sigmoid function given in Eq. 2 is used for hidden node while the linear equation given by Eq. 3 is used for output node [11].

$$f\left( w \right) = 1/\left( {1 + {\text{e}}^{ - w} } \right)$$
(2)
$$f\left( x \right) = x$$
(3)

The synaptic weight is updated on the basis of a procedure called back-propagation in which the error generated at the output after running a cycle is propagated backward through the intermediate layers finally to the input layer [12]. The training cycle is then repeated to minimize the error, and the network gradually moves towards stability. The estimated error used here is the mean squared error (MSE). In addition to the back-propagation technique, other methods like gradient descent, conjugate gradient algorithms, resilient back-propagation, gradient descent with momentum and quasi-Newton algorithms can be used. The number of neurons, the number of intermediate layers and the training algorithm vary from model to model.

1.3 ANN Model Architecture

A three-layer feedforward neural networks (FFNN) simulation has been used for the purpose of investigation. The input variables are fed into the first layer which is basically the input layer followed by the intermediate or the hidden layer which forms the second layer and the third layer is the output layer. Many real-world function problem modelling has been done following this topology [13, 14]. The selection of the neurons in the intermediate layer is very crucial as the complexity of system to be modelled is related to it. For this particular investigation, the selection of optimum number of neurons in the intermediate layer was done by trial-and-error method. Using a range of neurons between 2 and 80 the system was evaluated for minimum error between the predicted and the actual output. For the intermediate layer, the logistic sigmoid function was considered for the transfer function while the output layer was based on the linear transfer function. Back-propagation (BP) with the Levenberg–Marquardt was then applied to train the neural network as it is one of the fastest and more accurate algorithms. The stability of the steepest descent method [15] is combined with the speed of Newton algorithm. This combination is used to compute the Jacobian matrices bypassing the hessian matrix. This approach gives a faster convergence with minimalistic error.

1.4 Model Performance Evaluation [16]

Using two statistical indicators, mean squared error (MSE) and coefficient of determination (regression value (R)), the performance of the models was evaluated.

$${\text{MSE}} = \frac{1}{n}\mathop \sum \limits_{i = 1}^{n} (H_{{\text{p}}} - H_{{\text{a}}} )^{2}$$
(4)

where Hp is the predicted value and Ha is the actual value

$$R^{2} = \frac{{\left[ {\mathop \sum \nolimits_{i = 1}^{n} \left( {H_{{\text{p}}} - H_{{\text{p,avg}}} } \right)\left( {H_{{\text{a}}} - H_{{\text{a,avg}}} } \right)} \right]^{2} }}{{\mathop \sum \nolimits_{i = 1}^{n} \left( {H_{{\text{p}}} - H_{{\text{p,avg}}} } \right)^{2} \mathop \sum \nolimits_{i = 1}^{n} \left( {H_{{\text{a}}} - H_{{\text{a,avg}}} } \right)^{2} }}$$
(5)

2 Results

In the first scenario, the artificial neural network was trained using 2011 to 2015 dataset consisting of the input parameters (day, month, time, latitude, longitude, slope and azimuth) and it was observed that the model achieved MSE value of 0.676 for training and 0.645 for testing which is shown in Table 2.

Table 2 Performance evaluation of the proposed ANN model on the training datasets

The regression plot [17] of training dataset has been shown in Fig. 2, from which it can be seen that the overall regression value was obtained of 0.94482 with the coefficients of regression line for slope and offset being 0.89 and 0.18, which indicates a good fit of the data. Also, the best validation performance of 0.665 was achieved after 148 iterations as shown in Fig. 4a. The regression plots of testing dataset are shown in Fig. 3 from which it is seen that the regression value was obtained of 0.93869 with the coefficients of regression line for slope and offset being 1.1 and 0.00013 for Kolkata.

Fig. 2
figure 2

Regression plot of training dataset

Fig. 3
figure 3

Regression plot of testing dataset for a Kolkata b Roorkee and c Chennai

Fig. 4
figure 4

a Best validation performance b error histogram of trained ANN

The regression value was obtained of 0.95262 with the coefficients of regression line for slope and offset being 1 and 0.014 for Roorkee. The same was obtained of 0.9076 with the coefficients of regression line for slope and offset being 0.99 and 0.1 for Chennai.

The error histogram demonstrates that the maximum instance of error occurred at 0.2272 signifying the accuracy of the trained ANN model in predicting the GSR value for unknown inputs as shown in Fig. 4b. The error histogram chart during tested data for Kolkata, Roorkee and Chennai is shown in Figs. 5, 6 and 7, respectively.

Fig. 5
figure 5

Error histogram for tested data of Kolkata

Fig. 6
figure 6

Error histogram for tested data of Roorkee

Fig. 7
figure 7

Error histogram for tested data of Chennai

The trained ANN model was used for the prediction of global solar radiation in three Indian cities (Kolkata, Roorkee and Chennai). The statistical performance measures (MSE and R2 value) for all the three cities are presented in Table 3 for both training and testing datasets.

Table 3 Performance evaluation of the proposed ANN model on the prediction dataset

From Table 3, it can be observed that the proposed ANN model can efficiently predict the GSR with low mean squared error and high R2 value for all three cities.

3 Discussion and Conclusion

In this study, multilayer feedforward (MLFF) ANN technique [18] based on back-propagation algorithm is developed, trained and tested to predict hourly global solar radiation for 2016 for three cities Kolkata, Roorkee and Chennai. The model is developed and proposed by including input parameters like month, day, time, latitude, longitude, elevation, slope, azimuth and global solar irradiation as output. Using the prepared ANN model created with the PVGIS 5 years solar radiation dataset for three cities and testing this one for the prediction of 2016 data revealed that the predictions of solar radiation using our proposed model shows a better result with the obtained PVGIS dataset for 2016. This developed model is suitable for predicting solar radiation for any locations in India for which solar radiation is required for site-specific solar energy applications especially for solar power generation and production.