1 Introduction

The knowledge of the amount of solar radiation falling on the surface of the earth is of prime importance to engineers and scientists involved in the design of solar energy systems, climate change, hydrology, and agricultural applications. The records of global solar radiation measurements are relatively scarce, due to the cost, maintenance, and calibration requirements of the measuring equipment. In many countries, air temperature is measured at a wide number of locations, and therefore, it is important to study the relation between the solar radiation data and the air temperature data. For example, in Saudi Arabia, the availability of solar radiation data from weather stations is more restricted than air temperature data. Therefore, it is useful to develop methods using the more available data, such as the air temperature data, to estimate the solar radiation.

Empirical models used for estimating solar radiation can be briefly classified as:

  • Sunshine-based model is used as an input sunshine duration (Angström 1924; Almorox and Hontoria 2004; Harrouni et al. 2005; Benghanem and Joraid 2007).

  • Temperature-based model is widely used since air temperature data are the most measured available meteorological parameter (Allen 1997; Meza and Varas 2000; Mahmood and Hubbar 2002; Bandyopadhyay et al. 2008; Liu et al. 2009a; Tulcan-Paulescu and Paulescu 2008; Wu and Liu 2012).

  • Other models using many parameters such as air temperature, humidity, precipitation saturation vapor pressures, and rainfall to estimate solar radiation data have been reported in Yin (1999), Liu and Scott (2001), Thornton and Running (1999), Winslow et al. (2001), and Benghanem and Mellit (2010).

The most investigated parameters are sunshine duration (S) and air temperature (T). However sunshine data are not always available in most Saudi’s locations. Since the air temperature is the most recorded meteorological variable in Saudi Arabia, solar radiation models based on the measured air temperature could be a good solution in this area. Therefore, it is important to develop an accurate solar radiation model using geographical location and some commonly available parameters such as maximum (T max) and minimum (T min) air temperature. However, existing empirical models using air temperature are not well adjusted in this location, Madinah, Saudi Arabia.

This paper aims to develop a simple calibrated model for estimating daily global solar radiation. The model accepts the difference between maximum and minimum air temperature (T max − T min) and extraterrestrial solar radiation (Rs0) as input. The main contribution consists of adjusting the possible coefficients (A, B, and C) in order to increase the accuracy of the model \( \left( {\mathrm{Rs}=A+B\cdot R{s_0}{{{\left( {{T_{\max }}-{T_{\min }}} \right)}}^C}} \right) \). The adjusted model has been then compared with some empirical models and a new introduced artificial neural network (ANN) \( \left( {\mathrm{R}\mathrm{s}=\widetilde{f}\left( {\mathrm{R}{{\mathrm{s}}_0},{T_{\max }}-{T_{\min }}} \right)} \right) \).

This paper is organized as follows: Database description and observed correlation are given in Section 2. A brief description of some empirical models is provided in Section 3. Section 4 provides an ANN-based model used for estimating of daily solar radiation. Section 5 deals with the presentation the new calibrated model. Results and discussion are reported in Section 6.

2 Database and observed correlation

The experimental data (global solar radiation, air temperature, relative humidity, etc.) are collected since January 2007 by Solar Energy Laboratory (Physics Department, Faculty of Science, Taibah University). The data have been measured using a meteorological station and collected by using the wireless data acquisition system developed by Benghanem (2010). The used sensors, their sensibility, and the quality of the measured are also reported in Benghanem (2010); the data quality depends on the accuracy of the used sensors. Figure 1a shows the weather monitor software used for collecting the data.

Fig. 1
figure 1

a Weather monitor software. b Daily evolution of measured global solar radiation (Madinah, Saudi Arabia). c Daily evolution of measured average air temperature (Madinah, Saudi Arabia). d Daily evolution of extraterrestrial solar radiation (Madinah, Saudi Arabia)

As an example, Fig. 1b, c shows the daily evolution of global solar radiation (Rs) on a horizontal plane and average daily air temperature in Madinah (latitude 24.55° N, longitude 39.70° E). The expression of daily extraterrestrial radiation (Rs0) is given as:

$$ \mathrm{R}{{\mathrm{s}}_0}=\frac{24 }{\pi }3,600\cdot {I_{\mathrm{s}\mathrm{c}}}\left[ {1+0.033\cdot \cos \left( {\frac{{360\cdot D}}{365 }} \right)} \right]\left( {\cos \lambda \cdot \cos \delta +\frac{{2\pi {\omega_s}}}{360}\sin \lambda \cdot \sin \delta } \right). $$
(1)

The only input required to calculate these daily values, for a specific day of the year, is the latitude of the location in degrees.

Where I sc is the solar constant (1,367 W/m2 day−1), λ is the latitude of the site in degrees, δ is the solar declination in degrees, ω s is the hour angle of the sun in degrees, and D is the number of days of the year starting from the first of January.

These values can be computed by the following equations:

$$ \delta =23.45.\sin \left( {\frac{{360\left( {D+284} \right)}}{365 }} \right). $$
(2)

The sunrise hour angle ω s can be calculated if the latitude λ of the site and the solar declination δ are known:

$$ {\omega_{\mathrm{s}}}={\cos^{-1 }}\left[ {-\tan \delta \cdot \tan \lambda } \right]. $$
(3)

Figure 1d shows the daily evolution of extraterrestrial solar radiation in Madinah, Saudi Arabia.

The correlation between the daily global solar radiation, air temperature T max − T min, and extraterrestrial solar radiation is given in Fig. 2b–d, and the correlation coefficients are 78, 57, and 90 % respectively.

Fig. 2
figure 2

a Correlation between daily global solar radiation and average air temperature (Madinah, Saudi Arabia) r = 0.57. b Correlation between daily global solar radiation and the difference air temperature (T max − T min) in Madinah, Saudi Arabia. c Correlation between daily global solar radiation and extraterrestrial solar irradiation (Madinah, Saudi Arabia)

3 Empirical models

Numerous models have been developed using empirical relationships to estimate global solar radiation at ground level from the difference between air temperature extremes. However, the accuracy of these models needs to be verified for each considered location. The investigated empirical models are given below.

3.1 Model 1

Hargreaves and Samani (1982) suggest that the daily global solar radiation could be estimated from the difference between the mean value of the daily maximum temperatures T max and the mean value of the daily minimum temperatures T min, by means of the following equation:

$$ \mathrm{Rs}=\mathrm{R}{{\mathrm{s}}_0}\cdot K\cdot {{\left( {{T_{\max }}-{T_{\min }}} \right)}^{0.5 }}. $$
(4)

The empirical parameter K was set to 0.17 for arid and semiarid regions. Later, Hargreaves (1994) recommended the use of K = 0.16 for interior regions and K = 0.19 for coastal regions. This coefficient should be derived at a site where data measurements are available. The equation assumes that the difference in maximum and minimum temperature is directly related to the fraction of extraterrestrial radiation received at the ground level. However, there are other factors than solar radiation that can influence the temperature difference. These factors include cloudiness, humidity, latitude, elevation, topography, or proximity to a large body of water.

3.2 Model 2

The new model developed by Hargreaves et al. (1985) is given as:

$$ \mathrm{Rs}=\mathrm{Rs}{}_0 \cdot \left[ {a\sqrt{{{T_{\max }}-{T_{\min }}}}+b} \right] $$
(5)

where a and b are empirical coefficients.

3.3 Model 3

Chen et al. (2004) include a logarithmic variation of air temperature as:

$$ \mathrm{Rs}{}_G=\mathrm{R}{{\mathrm{s}}_0}\cdot \left[ {a\cdot \ln \left( {{T_{\max }}-{T_{\min }}} \right)+b} \right] $$
(6)

where a and b are empirical coefficients.

3.4 Model 4

The modified Hargreaves and Samani (1982) model developed by Annandale et al. (2002) is given as:

$$ \mathrm{Rs}=\mathrm{R}{{\mathrm{s}}_0}\cdot \left[ {{A_n}\left( {1+2 \times {7^{{{10^{-5 }}}}}\cdot z} \right){{{\left( {{T_{\max }}-{T_{\min }}} \right)}}^{0.5 }}} \right] $$
(7)

where z is the altitude and A n is an empirical coefficient. The coefficient must be derived at a site where data measurements are available.

3.5 Model 5

Bristow and Campbell (1984) model proposed a different relation between daily values of the same variables Rs and (T max − T min), introducing a model with an exponential term:

$$ \mathrm{Rs}=\mathrm{R}{{\mathrm{s}}_0}\cdot A\cdot \left[ {1-\exp \left( {-B{{{\left( {{T_{\max }}-{T_{\min }}} \right)}}^C}} \right)} \right] $$
(8)

where A, B, and C are empirical coefficients.

In Eq.8, T min should be calculated for a generic day as the average of measured values in consecutive days as:

$$ {T_{\min }}=\frac{{{T_{\min }}(j)+{T_{\min }}\left( {j+1} \right)}}{2}. $$
(9)

where j is the current day and j + 1 is the next day.

3.6 Model 6

Donatelli and Campbell (1998) model estimates the global solar radiation Rs by estimating the extraterrestrial radiation Rs0 and multiplying by the transmissivity coefficient of the atmosphere:

$$ \mathrm{Rs}=\mathrm{R}{{\mathrm{s}}_0}\cdot {A_{\mathrm{DC}}}\cdot \left[ {1-\exp \left( {-{B_{\mathrm{DC}}}f\left( {{T_{\mathrm{avg}}}} \right){{{\left( {{T_{\max }}-{T_{\min }}} \right)}}^2}f\left( {{T_{\min }}} \right)} \right)} \right] $$
(10)
$$ {T_{\mathrm{avg}}}=\frac{{{T_{\min }}+{T_{\max }}}}{2} $$
(11)
$$ f\left( {{T_{\mathrm{avg}}}} \right)=0.017\cdot \exp \left[ {\exp \left( {-0.053{T_{\mathrm{avg}}}} \right)} \right] $$
(12)
$$ f\left( {{T_{\min }}} \right)=\exp \left[ {\frac{{{T_{\min }}}}{{{C_{\mathrm{DC}}}}}} \right] $$
(13)

where A DC stands for the clear sky transitivity, B DC and C DC are empirical coefficients, and f(T avg) and f(T min) are functions based on mean and minimum temperature.

4 ANN-based model

The well-known multilayer perceptron (MLP) (Mellit 2009; Mellit et al. 2013) is used to build an ANN-based model for estimating the daily global solar radiation. The employed MLP has three layers: an input layer, a single hidden layer, and an output layer. The input layer has two inputs, Rs0 and T max − T min, and its output layer has a single output node: Rs. Thus, the investigated relationship can be given as:

$$ \mathrm{Rs}=\tilde{f}\left( {\mathrm{R}{{\mathrm{s}}_0},{T_{\max }}-{T_{\min }}} \right) $$
(14)

where \( \widetilde{f} \) is an approximate function.

A set of 365 × 3 (three years, 2007–2009) has been used in training the network while a set of 365 (year, 2010) is used for testing the network. The performance function used for training the network is the mean-squared error. Preprocessing steps on dataset were carried out by using the following expression:

$$ y={y_{\min }}+\left( {x-{x_{\min }}} \right){{\left( {{x_{\max }}-{x_{\min }}} \right)}^{-1 }}\left( {{y_{\max }}-{y_{\min }}} \right) $$
(15)

where x ∈ [x min x max] and x ∈ [y min y max], x is the original data value, and y is the corresponding normalized variable. y min and y max have been assumed to be −1 and 1, respectively. This preprocessing step on the data (input/output) allows the network to perform more efficiently. Different architectures have been evaluated by trial and error, and the simplified one has 3 units within the hidden layer (see Fig. 3).

Fig. 3
figure 3

The simplified obtained MLP architecture

The simplified ANN-based model is given as:

$$ \mathrm{Rs}=\left( {4.6{\alpha_1}+0.05{\alpha_2}+1.15{\alpha_3}+3.65} \right){10^3} $$
(16)

Where

$$ \begin{array}{*{20}c} {{\alpha_1}\left( {\mathrm{R}{{\mathrm{s}}_0},{T_{\max }}-{T_{\min }}} \right)=\mathrm{Tansig}\left( {\mathrm{R}{{\mathrm{s}}_0},{T_{\max }}-{T_{\min }}} \right)=2/\left( {1+\exp \left( {-2\left( {-\mathrm{R}{{\mathrm{s}}_0}-({T_{\max }}-{T_{\min }})+1.6} \right)} \right)} \right)-1} \hfill \\ {{\alpha_2}\left( {\mathrm{R}{{\mathrm{s}}_0},{T_{\max }}-{T_{\min }}} \right)=\mathrm{Tansig}\left( {\mathrm{R}{{\mathrm{s}}_0},{T_{\max }}-{T_{\min }}} \right)=2/\left( {1+\exp \left( {-2\left( {2,131\mathrm{R}{{\mathrm{s}}_0}-\left( {{T_{\max }}-{T_{\min }}} \right)-338} \right)} \right)} \right)-1} \hfill \\ {{\alpha_3}\left( {\mathrm{R}{{\mathrm{s}}_0},{T_{\max }}-{T_{\min }}} \right)=\mathrm{Tansig}\left( {\mathrm{R}{{\mathrm{s}}_0},{T_{\max }}-{T_{\min }}} \right)=2/\left( {1+\exp \left( {2\left( {+423\mathrm{R}{{\mathrm{s}}_0}+\left( {{T_{\max }}-{T_{\min }}} \right)+45} \right)} \right)} \right)-1} \hfill \\ \end{array} $$

Tansig is a hyperbolic tangent sigmoid transfer function which is given as: \( \mathrm{Tansig}(x)=2/\left( {1+\exp \left( {-2\mathrm{x}} \right)} \right)-1. \)

5 New calibrated model

This section provides the different steps for a new calibrated model. From the above models, we can write the following relationship:

$$ \mathrm{Rs}=\mathrm{R}{{\mathrm{s}}_0}\cdot K\cdot f\left( {{T_{\max }}-{T_{\min }}} \right) $$
(17)

where K is a regression coefficient.

The first step aims to analyze the following equation:

$$ \mathrm{Rs}=\mathrm{R}{{\mathrm{s}}_0}\cdot A\cdot \left( {{T_{\max }}-{T_{\min }}} \right). $$
(18)

We found that for winter months, the average value of the coefficient A is 0.132, whereas for summer months, the value is 0.125. Thus, the average value of empirical coefficient A for Madinah site is 0.13. The correlation coefficient (r) between the measured and estimated daily global solar radiation is equal to 89 %. Figure 4 shows the daily evolution of the coefficient A.

Fig. 4
figure 4

Evolution of coefficient A (Eq. 18)

In the second step, we have analyzed the following equation:

$$ \mathrm{R}\mathrm{s}=B\cdot \mathrm{R}{{\mathrm{s}}_0}{{\left( {{T_{\max }}-{T_{\min }}} \right)}^2}. $$
(19)

It has been found that for winter months, the average value of the coefficient B is 0.0323, and for summer months, the value is 0.0267. The average value of empirical coefficient B is 0.03. The correlation coefficient is equal to 80 %. Figure 5 shows the evolution of the coefficient B during 1 year at Madinah site. However, for another site, this coefficient must be calculated at a site where data measurements are available.

Fig. 5
figure 5

Evolution of coefficient B (Eq.19)

A simple dimensional analysis outlook shows that Eqs. 18 and 19 would not have physical meaning because a function f(.) with unique physical variable, such as f(T max − T min), cannot be dimensionless. Therefore, the empirical coefficients should be considered as physical variables with units. By analyzing various forms with temperature variable and based on the observed data reported in Fig. 6, we suggest the following equation:

Fig. 6
figure 6

Observed data: the daily global solar radiation versus extraterrestrial solar radiation and difference of temperature T max − T min

$$ \mathrm{R}\mathrm{s}=A+B\cdot \mathrm{R}{{\mathrm{s}}_0}{{\left( {{T_{\max }}-{T_{\min }}} \right)}^C} $$
(20)

where A and B are the empirical coefficients deduced from linear regression analysis and C is a power coefficient which should be derived at the considered site. It is clearly shown that the new equation depends on the extraterrestrial solar radiation (Rs0) and the difference in maximum and minimum air temperature (T max − T min). A description of the mathematical expression of the estimation correlation can be deduced from observed data. In fact, by an iterative process, we have varied the power coefficient C, and by fitting data, we calculate the correlation coefficient between the measured and estimated daily global solar radiation.

Table 1 reports the different regression coefficients A and B, the power coefficient C, and the correlation coefficient (r). With reference to Table 1, it can be seen that, when the power coefficient C decrease, the correlation coefficient increases until the value of C = 0.25. After that, the correlation coefficient decreases. Consequently, the adjusted model can be given as:

Table 1 Regression coefficients A and B, power coefficient C, and correlation coefficient r
$$ \mathrm{R}\mathrm{s}=A+B\cdot \mathrm{R}{{\mathrm{s}}_0}{{\left( {{T_{\max }}-{T_{\min }}} \right)}^{0.25 }}. $$
(21)

The model performance was evaluated in terms of the following statistical error tests: correlation coefficient (r) where a model is more efficient when r is closer to 1, and it indicates also that measured and estimated data are very well correlated; root mean square error (RMSE) where a lower value indicates better estimation; mean bias error (MBE) indicates overall under- or overestimation. Therefore, a good estimation means that the value of MBE would be zero. A positive value of MBE indicates the tendency of the estimation model to underestimate the measured solar radiations, whereas a negative MBE indicates a tendency to overestimate the measured solar radiations; the mean absolute error (MAE) is a quantity used to measure how close estimates are to the eventual outcomes; and the mean percentage error (MPE) is the computed average of percentage errors by which estimated forecasts differ from actual values of the quantity being estimated. The employed errors are given in the Appendix (r, RMSE, MBE MAE, and MPE).

6 Results and discussion

Regression analysis of the experimental data and the derivation coefficients of the model involved with simple and multiple linear regressions have been developed by using Origin software and Matlab. Figure 7 shows the observed data versus the linear fitting of the calibrated model; as can be seen, good agreement is obtained.

Fig. 7
figure 7

Experimental data and linear fitting for the new calibrated model

To assess the effectiveness of the calibrated model, the above-mentioned goodness tests have been used. Table 2 reports the statistical test between observed and estimated daily global solar radiation for different value of the power coefficient C. With reference to this table, it has been found that the best correlation coefficient is 94 % for the calibrated model, which is validated by comparing estimated and measured daily global solar radiation (year 2010). The obtained value of MBE for the calibrated (Eq. 21) is −0.012. It means that the underestimation of daily global solar radiation is observed with lower value. The RMSE has been found the lowest for the proposed model 0.063. It means that good estimation of global solar radiation by using extraterrestrial radiation and the difference between maximum and minimum air temperature. The MAE is 3.87 Wh/m2/day and the MPE is less than 0.6 %. These errors indicate clearly the good accuracy of the model.

Table 2 Statistical test between observed and estimated daily global solar radiation for different value of the power coefficient C

It can be concluded that the estimation of global solar radiation can be performed with acceptable accuracy using all tested models. RMSE and r are the primary parameters used to assess the accuracy of the estimation of global solar radiation. The calibrated model makes better options to estimate the global solar radiation in Madinah location from air temperature data.

Estimated versus measured daily global solar irradiation for the Madinah site (year 2010) is shown in Fig. 8. As can be seen that the estimated solar radiation values by the model are very close the measured ones, with relatively high accuracy.

Fig. 8
figure 8

Estimated and measured daily global solar radiation data (Madinah, 2010)

In order to confirm the effectiveness of the calibrated model, six empirical models (Eqs. 48 and 10) have been used. Table 3 summarizes the six used models and the calculated coefficients for each model.

Table 3 Empirical models coefficients A, B, and C

A comparison between the six models, the introduced ANN model, and the new calibrated model is reported in Table 4. We have calculated the goodness indicators: r, RMSE, MBE, MAE, and MPE. With reference to these indicators, it can be concluded that the calibrated model performs better than other investigated models: since r = 94 %, it is close to 1, MAE = 3.87 (Wh/m2/day), MPE = 0.56 %, and RMSE = 0.063 (Wh/m2/day) have lower values, and the MBE = −0.012 which is close to 0. The ANN-based model provides also a good result with a correlation coefficient of 93.7 %.

Table 4 Performance evaluations of six empirical models, ANN-based model, and the new calibrated model (RMSE, MBE, r, MAE, and MPE)

7 Conclusion and perspectives

In this paper, a new simplified model was calibrated and evaluated for estimating the daily global solar radiation by using measured air temperature data. The calibrated model is basically based on the well-known Hargreaves et al. (1985) model.

It has been demonstrated that the new calibrated model is more accurate than other existing empirical models used in this work. In addition, ANN-based model has proven its accuracy compared to other empirical models. It has been verified that empirical model-based temperature provides good results if their parameters are correctly adjusted. In fact, the empirical coefficients A, B, and C should be estimated at the considered site.

Our future work will focus on the calibration of the model for other regions in Saudi Arabia as well as to test its effectiveness in the future years.