Introduction

Solar energy is considered as one of the most reliable sources of renewable energy in the world. The development of this source of energy facilities in a place is possible with the assessments made of the solar energy potential of that place. So, modeling and predicting the amount of energy emitted from the sun and high accuracy in evaluating this resource, has provided an important area of research and accelerates technology advancement by having a positive impact on decision-making and reducing uncertainty in development planning (Deo and Sahin 2017). Unfortunately, due to financial, technical or organizational constraints, there are no solar radiation stations in most parts of the world and some existing radiometers are not accurate enough and their data are erroneous. Therefore, in addition to the methods based on the measured values of radiation, several other methods including the use of artificial intelligence (Deo and Sahin 2017; Ghimire et al. 2019; Mohammadi and Aghashariatmadari 2020), satellite images (Qin et al. 2011; Kim and Liang 2010), physical and statistical modeling based on the measured values of meteorological factors (El-Metwally 2004; Liu et al. 2010; Demirhan et al. 2013; Sun et al. 2015), neural networks (Alsina et al. 2016; Renno et al. 2015) and radiation transfer models (Chen et al. 2016; Benajes et al. 2015; Chen et al. 2015) were proposed. Also, several large-scale models are used to fill the data measurement gap, extend data measurement periods, and estimate radiation at other meteorological stations where radiation is not measured. Although the data sets provided by this method contain a large amount of measured radiation data, their main part consists of modeled values (Muneer 2004). Additionally, over the years, remote sensing, satellites, and their data have revolutionized the field of meteorological and climatological studies. The ability to collect continuous temporal and spatial signals above the atmosphere is considered as one of the advantages of remote sensing that makes it possible to estimate radiation in remote areas where radiation stations are scattered. Another strength of satellite methods is their ability to accurately plot relative differences between adjacent areas, although absolute accuracy may not be perfect for each point. However, this method was proven to be a reliable reference for describing the microclimatic properties of radiant energy.

Despite the importance of solar radiation on ground processes, little effort has been made to develop and produce high-resolution geographical maps around the world. Due to the limitations of terrestrial database models, a number of models were developed to estimate radiation on the planet’s horizontal surfaces using solar or terrestrial satellites at different resolutions (Journee and Bertrand 2010). Many models of radiation estimation based on satellite imagery can produce daily radiation with relatively high accuracy compared to Pyranometric data. There are generally three different ways to estimate solar radiation from satellite data: A) Statistical modeling such as the HELIOSAT model for METEOSAT satellites (Rigollier et al. 2004) and the cloud-cover index model (Perez et al. 2002); B) physical modeling using a complex radiation transfer model (Kim and Liang 2010); and C) parameterized radiation transmission model (Yang et al. 2006). So far, the models designed to estimate radiation from satellite observations are in a diverse range from purely physical models to absolute empirical models including the practical model proposed by Perez et al. (2002) and Janjai et al. (2005). Pure physical models attempt to describe radiation reaching the Earth’s surface by the radiation transfer equation, which requires complete knowledge of tuning and calibration of satellite images and accurate information of the constituents of the atmosphere (Raschke and Preuss 1979; Gautier et al. 1980; Schillings et al. 2004a, b). Empirical/statistical models, on the other hand, can involve establishing a simple regression relationship between satellite observational values and terrestrial measurements (Tarpley 1979; Cano et al. 1986). Further, hybrid models use the approach of simple physical models with some fitting to the observations. Perez et al. (1997) and Zelenka et al. (1999) indicated that solar radiation data estimated by satellite imagery are more accurate than data interpolated through modern radiometric network measurements. In addition, Perez et al. (2002) reported that hourly satellite estimates are more accurate than empirical methods based on ground station measurements at a distance of 25 km from each other compared to field measurements. Lotfi (2012) estimated the net radiation values of Fars province using MODIS sensor images and showed that the. Based on the results, the estimated values show a ±13% error and concluded that this error rate is acceptable given the area of satellite cells (6.5 ha). Zhang et al. (2014) calculated the total amount of solar radiation for the Tibetan Plateau using MODIS sensor data and hybrid parametric model to provide the results with more spatial resolution for short-wavelength radiation than its predecessors. Ryu et al. (2018) combined an atmospheric radiative transfer model with an artificial neural network (ANN) to estimate shortwave solar radiation. A series of MODIS products were used as inputs to run the ANN. Results showed that the developed products will be useful in solar energy investigations. Ghimire et al. (2019) designed algorithms based on deep belief and neural networks and trained them with MODIS satellite data in Australia to predict the monthly ground solar radiation. Results showed that the proposed method can be adopted to estimate solar radiation in regions where it is impossible to setting up monitoring data acquisition instruments. Bamehr and Sabetghadam (2021) developed models using multiple linear regression method and artificial neural networks using some atmospheric properties extracted from MODIS and ozone monitoring instrument (OMI) in Mashhad. Results indicated that the estimates were more accurate using the artificial neural networks than the regression method and in both methods, the accuracy of estimation improves when cloud fraction is used as a predictor. Other studies in this area were conducted by López and Batlles (2014), Qin et al. (2011) and Gao et al. (2015).

Iran is among the countries with high received of solar radiation, which makes it possible to provide a significant part of its energy demands through this clean energy source (Aghashariatmadari 2011). However, the country’s radiation network is weak and the number of radiation stations, despite the great importance of this variable, is very small and the stations are scattered throughout the country. The use of experimental models to estimate radiation is common, but these models, despite their sometimes high errors, need to achieve some meteorological variables in the study period. By increasing the access to satellite data, an opportunity becomes available which should be used to the fullest, especially for an important variable such as solar radiation, which is an input for a large number of climatic and global models. The ability to collect continuous temporal and spatial signals above the atmosphere is considered as one of the advantages of remote sensing and satellite images. Moreover, the minimum data requirement is considered as another key advantages of this method, especially in data-scarce regions. This feature makes it possible to estimate radiation in remote areas where radiation stations are scattered. Accordingly, the importance of the present study is to evaluate and compare the efficiency of solar radiation empirical models on the ground with a selected satellite based model named SBDART presented by Ricchiazzi et al. 1998. In this regard, the data measured in four selected radiometric stations are compared with the radiation values obtained from the implementation of algorithms based on satellite images and a suitable algorithm for estimating radiation in the studied stations is introduced based on the analysis of statistical measures.

Materials and method

Study area and statistical specifications of the data

In this study, the synoptic stations in Zanjan, Kerman, Mashhad, and Yazd were selected as study stations, and the studies were performed from statistical quality control to the extraction of relationships using daily data from these stations during 2007–2015.

Daily weather data of four synoptic stations of Kerman (latitude 30.25°N, longitude 56.97°E, altitude 1753.8 meter above sea level), Mashhad (latitude 36.27°N, longitude 59.63°E, altitude 999.2 m asl), Zanjan (latitude 36.68°N, longitude 48.48°E, altitude 1663.0 m asl) and Yazd (latitude 31.90°N, longitude 54.28°E, altitude 1237.2 m asl) have been received in the 8-year period of 2015–2007 through the Iranian Meteorological Organization (IRIMO) (Fig. 1 and Table 1). The climate of stations varies from arid to semi-arid. Zanjan station represent the semi-arid climate type based on De Martonne's classification. The annual precipitation of the city is about 303.5 mm and the minimum and maximum mean annual temperatures are about 4.3°C and 18.4 °C, respectively. Kerman station represent the arid climate type. The annual precipitation of the city is about 142.5 mm and the minimum and maximum mean annual temperatures are about 7.1°C and 24.9°C, respectively. Mashhad is another station that its daily mean annual temperature amplitude is about 13.8°C. The total sum of its precipitation is 246.9 mm and located in semi-arid climate. The final station is Yazd. Its climate is defined by dry. The annual precipitation of the city is about 55.3 mm on average, and the maximum and minimum annual temperatures are about 26.8°C and 12.2°C, respectively. The predominant common crop of the studied cities, in addition to their specific crops, is wheat.

Fig. 1
figure 1

Solar radiation stations’ distribution

Table 1 Statistical specifications of the data used in the studied stations

The long-term statistical period of the radiation data is considered as a basis for selecting these stations. At this point, first, the long-term statistics of the meteorological stations on a daily basis were received from the meteorological organization and a yearly-classified database was formed. This bank includes data on total solar radiation (Rs), radiation received on the horizontal surface above the atmosphere (R0), sunshine hour fraction, horizontal visibility and cloudiness. Data quality control was performed based on Moradi et al. (2009) quality control algorithm. Only the variables related to radiation (total radiation and number of hours of sunshine) of all meteorological data received had statistical deficiencies. Therefore, regarding the purpose of the study, the quality control of radiation statistics was performed with high accuracy.

Empirical models

Angstrom relationship (1924)

Angstrom Relationship is a simple linear relation which expresses the amount of sunlight on a horizontal surface by its ratio to the amount of radiation on a clear, cloudless day with a clear sky. This relationship is the basis of many radiation estimation models. In his proposed relation, Angstrom introduces the radiation fraction (\( \frac{R_s}{R_c} \)) as a linear function of the fraction of sunshine hours (\( \frac{n}{N} \)).

$$ \frac{R_s}{R_c}=f\left(\frac{n}{N}\right)=a+b\frac{n}{N} $$
(1)

whereRc is the total radiation on a clear, cloudless day with a clear sky; Rsis the total radiation received from the sun on a horizontal surface on the earth's surface; n is the actual number of hours of solar radiation, Nis the number of theoretical hours of solar radiation calculated from astronomical relationships, and, a, bare the empirical coefficients which should be calculated with the help of the statistical period of measured radiation data and solar hours. The coefficients used in the Angstrom formula vary depending on atmospheric conditions (humidity, dust, etc.) and solar radiation (latitude and month of the year).

These relationships are used when the amount of Rc (radiation received on the horizontal surface of the earth in perfectly clear and transparent sky conditions) is known, which is difficult to determine. In order to overcome this limitation, several researchers suggested the use of received radiation on the horizontal surface above the atmosphere (R0) instead of (Rc).

$$ \frac{{\mathrm{R}}_{\mathrm{S}}}{{\mathrm{R}}_0}=\left(0.22+0.54\frac{\mathrm{n}}{\mathrm{N}}\right) $$
(2)

Accordingly, the equation introduced by Angstrom was first modified in 1940 by Prescott. In his proposed equation, Prescott (1940) substituted the radiation received from the sun above the atmosphere on the horizontal surfaces for the radiation in clear and transparent sky conditions, and the coefficients a, bwere 0.22 and 0.54, respectively, by using the data of Mont weather station (USA). The Angstrom equation was widely used after the Prescott correction as the Angstrom-Prescott equation.

In the cases which radiation data are not available and calibration is not performed to correct the parameters, the values a = 0.25, b = 0.5 are used (Allen et al. 1998). Considering the importance of pure solar radiation in the equation used to estimate evapotranspiration and water requirement of plants and other applications of solar radiation, it is necessary to calibrate and define the values of coefficients in each area according to the conditions of that area.

Swartman and Ogunlade model

Swartman and Ogunlade (1967) investigated the effect of relative humidity on the received radiation and introduced some equation in this respect. They showed that these equations give better estimates of solar radiation intensity than the single-parameter relationship in tropical countries.

$$ {R}_s=18.765{\left(\frac{n}{N}\right)}^{0.36}{(RH)}^{-0.15} $$
(3)
$$ {R}_s=14.451+174.593\left(\frac{n}{N}\right)-10.137(RH) $$
(4)

where RHis the relative humidity.

Hargreaves and Samani model

Hargreaves and Samani (1982) presented a simple empirical model to simulate daily solar radiation to estimate evapotranspiration when only minimum and maximum temperatures are available. They hypothesized that the transience of atmosphere on a given day was proportional to the second root of the difference between the minimum and maximum daily temperatures (Celsius):

$$ {\mathrm{R}}_{\mathrm{s}}={\mathrm{K}}_{\mathrm{r}}{\left({\mathrm{T}}_{\mathrm{max}}-{\mathrm{T}}_{\mathrm{min}}\right)}^{0.5}{\mathrm{R}}_{\mathrm{n}} $$
(5)

where Rn is the total radiation received from the sun on a horizontal surface above the earth atmosphere, Rs indicates the total radiation received from the sun on a horizontal surface on the earth surface, and Kr is considered as an experimental constant to consider the effect of dust in the atmosphere and the change in water vapor pressure with altitude.

Gopinathan model

Gopinathan (1988) proposed the equations to estimate the regression coefficientsaand bof angstrom type correlations for predicting monthly mean daily global solar radiation. These equations developed using average monthly values of altitude (m), \( \frac{n}{N} \) and \( \frac{R_S}{R_0} \) variables with the aim to be applicable to all parts of the world to compute global solar radiation.

$$ \frac{{\mathrm{R}}_{\mathrm{s}}}{{\mathrm{R}}_{\mathrm{n}}}=\mathrm{a}+\mathrm{b}\left(\frac{\mathrm{n}}{\mathrm{N}}\right) $$
(6)
$$ a=0.265+0.070h-0.135\left(\frac{n}{N}\right) $$
(7)
$$ b=0.401+0.108h-0.325\left(\frac{n}{N}\right) $$
(8)

El-Metwally model

El-Metwally (2004) proposed the 4 Parameter Polynomial model in Egypt. The methods use ground-based measurements of maximum and minimum temperature, daily mean of cloud cover and extraterrestrial global radiation.

$$ {\mathrm{R}}_{\mathrm{s}}=\mathrm{a}{\mathrm{R}}_{\mathrm{o}}+\mathrm{b}{\mathrm{T}}_{\mathrm{max}}+\mathrm{c}{\mathrm{T}}_{\mathrm{min}}+{\mathrm{d}}_{\mathrm{cm}}+\mathrm{e} $$
(9)

Where a, b, c, d, e are the experimental constants, TmaxandTmin the air temperature maximum and minimum (c) and cm is average cloud cover (Okta).

Garg model

Water vapor affects the permeability of the atmosphere. Calculations by Tamm and Thormalla (1992) indicated that increasing water vapor from 1 to 4 cm of water in a vertical bar reduces the average daily total radiation in a cloudless sky by 5.6%. Accordingly, Garg and Garg (1982) introduced their model by involving water vapor content in the structure of the relationship:

$$ \frac{{\mathrm{R}}_{\mathrm{S}}}{{\mathrm{R}}_0}=\left(\mathrm{E}+\mathrm{F}\left(\frac{\mathrm{n}}{\mathrm{N}}\right)+\mathrm{H}.{\mathrm{w}}_{\mathrm{at}}\right) $$
(10)

In this regard, RSthe average monthly total solar radiation(Mjm−2day−1), wat is the content of atmospheric water vapor per unit volume of air (grm−3)and E, F, Hare the coefficients of the relationship. wat is calculated based on the equation suggested by Hussain (1984):

$$ {\mathrm{w}}_{\mathrm{a}\mathrm{t}}=\mathrm{RH}\left(4.7923+0.3647{\mathrm{T}}_{\mathrm{a}}+0.0055{\mathrm{T}}_{\mathrm{a}}^2+0.0003{\mathrm{T}}_{\mathrm{a}}^3\right) $$
(11)

where RH indicates relative humidity (%) and Tais average air temperature (degrees Celsius).

Finally, the coefficients of selected empirical models of radiation estimation were calibrated based on meteorological and geographical conditions in Iran (Table 2).

Table 2 Calibrated models in Iran Radiation Measurement Network

MODIS sensor data

Data regarding measured radiation on the ground was used as well as MODIS sensor data. MODIS sensors are installed on Aqua and Terra satellites. The images used by the MODIS sensor of Aqua and Terra satellites in the form of MODIS products were received in the 8-year period of 2015–2007 through the websites http://modis.gsfc.nasa.gov and https://reverb.echo.nasa.gov. Due to the two sun-synchronous satellites, the Terra satellite images were taken at 10:30 a.m. and the Aqua satellite images at 13:30 local time on a daily basis. The studied days were selected from the days approved in the quality control tests in such a way that at least one day as its representative was checked for each month between mid-March and mid-August for a period of eight years. Satellite data were extracted from level-2 MODIS sensor products stored in HDF files, from which atmospheric and terrestrial specifications can be obtained. The total possible amount of precipitable water from the atmosphere was extracted from the data (MOD/MYD05) and the visual thickness of the cloud was derived from the products (MOD/MYD06). Similarly, MOD/MYD04 data were used due to lack of observation of variables such as aerosol values. In the case of high uncertainty in any of the required variables, the values ​​ are obtained from MOD / MYD08 products.

All of these files are saved in either GRID or Swath. After downloading the desired data, the obtained files with EOS-HDF extension are processed by HEG V2.12 software under Linux and the desired information layers are selected. These layers were then extracted into the Bash environment using the MODIStsp package under the R programming language and converted to files with the TIFF extension. All MODIS data were geo-registered and resampled by the nearest neighborhood interpolation method, which was also performed using HEG V2.12. The required information was extracted from the layers using arcpy and os modules under the Python 2.7 programming language.

The total amount of daily radiation is estimated by determining the amount of momentary radiation at half-hour intervals using the Santa Barbara DISORT Atmospheric Radiative Transfer model. The momentary values in this model have an internal database including parameters such as diffuse reflection and absorption for clouds with liquid and solid droplets under different conditions. This database is based on the Mie Scattering Code and ranges from an effective radius of 2 to 128 micrometers. This code, which is actually the atmospheric transfer equation of radiation and is written in FORTRAN language, has physical models for calculating radiation under the influence of standard atmosphere, cloud, haze and surface (Liu et al. 2010). In this study, the effective radius values according to Liu et al. (2010) are considered as 10 μm for water droplets in the cloud and 65 μm for ice droplets belonging to the cloud, and the model with 4 flows by 0.005 Micrometer separation is implemented. Ozone and haze are 0.349 atm cm for the rural type. The short wavelength range is between 0.3 and 3 micrometers in proportion to the Pyranometer used in the stations. For cloudy skies, the total atmospheric water vapor is assumed to be 1.418 gr/cc, and the cloud cover is homogeneously composed of parallel plates and is considered static during the morning and evening.

Based on these values and variables monitored at ground stations, the amount of momentary radiation at half-hour intervals is calculated. According to the surface altitude, atmospheric condition, solar tilt angle, and albedo of the earth surface for a given geographical position, the SBDART model is used to calculate direct and scattered radiation and the amounts of radiation in the form of downward shortwave flux for clear and cloudy skies are obtained. Atmospheric conditions include visibility, total water vapor, and visual thickness of the cloud. The model with a significant number of variables is designed so that there is a logical value instead of each variable which is not defined. The variables used in the present study includes Atmospheric Water Vapor (a.m. and p.m.), Cloud’s Optical Depth (a.m. and p.m.), White sky albedo, Black sky albedo, Horizontal visibility and Aerosol for Kerman, Mashhad, Yazd and Zanjan in the 8-year period of 2015-2007.

These values are all obtained from MODIS products, except for the horizontal visibility variable, for which data were from the Meteorological Organization. These values indicate the atmospheric characteristics at two time points per day (10:30 and 13:30). The inclination angle of sun was also calculated for all intervals. The daily albedos were calculated by using interpolation operations between the two values of Black-sky Albedo and White-sky Albedo, as well, according to the change in the Zenith angle. The two partial values of albedo represent the albedo for the direct radiation and indirect radiation, respectively. In this way, each day is divided into three parts including sunrise up to 10:30 local time (Terra satellite transit time), 10:30 to 13:30 local time (Aqua transit time), and 13:30 until sunset. Then, the atmospheric characteristics of Terra and Aqua satellites were used for sections 1 and 3, while an interpolation was used for section 2. Since the maximum amount of radiation occurs in section 2, this interpolation increases the accuracy of the model.

The introduced empirical models were implemented on study stations for the selected days, as well. Observational data were used for the valuesn/N, which are shown as Nn. Daily average cloud fraction (CF) data were used for estimating daily radiation by using Angstrom Prescott empirical model and the values of MOD/MYD08_D3 products were obtained. First, the images related to 50 days were downloaded for each station and their CF values were extracted. Since two values (morning and evening) were obtained for each day, the average for each day was taken. During the next step, by establishing a regression relation, an equation was obtained for calculating the value n/N using CF. The relations obtained for each station are presented in Table 3. Then, on the studied days, daily radiation values were obtained through the Angstrom-Prescott empirical model using CF values. The required coefficients of the Angstrom-Prescott empirical model for each station are considered according to Aghashariatmadari (2011).

Table 3 Regression relations between the average cloud fraction and sunny hours

Performance evaluation indices

There are various error indicators used to compare and assess the performance of models. In this research, in order to evaluate the performance of models against measured data, the most- routinely used indices including coefficient of determination (R2), the mean absolute error (MAE), the root mean square error (RMSE), and the relative root mean square error (RRMSE) were used as follows:

  1. (1)

    Root-Mean-Square Error (RMSE):

$$ RMSE={\left\{\left[{\sum}_1^n{\left({O}_i-{P}_i\right)}^2\right]/n\right\}}^{\raisebox{1ex}{$1$}\!\left/ \!\raisebox{-1ex}{$2$}\right.} $$
(12)
  1. (2)

    Mean Bias Error (MBE)

$$ MBE=\left[{\sum}_1^n\left({O}_i-{P}_i\right)\right]/n $$
(13)
  1. (3)

    Coefficient of determination (R2)

$$ {R}^2=1-\frac{RMSE^2}{\sigma^2} $$
(14)
  1. (4)

    Mean Absolute Error (MAE)

$$ MAE=\frac{1}{n}{\sum}_1^n\left|{O}_i-{P}_i\right| $$
(15)
  1. (5)

    Relative Root Mean Square Error (RRMSE)

$$ RRMSE=\frac{{\left\{\left[{\sum}_1^n{\left({O}_i-{P}_i\right)}^2\right]/n\right\}}^{\raisebox{1ex}{$1$}\!\left/ \!\raisebox{-1ex}{$2$}\right.}}{\raisebox{1ex}{$1$}\!\left/ \!\raisebox{-1ex}{$n$}\right.{\sum}_1^n{P}_i}\times 100 $$
(16)
  1. (6)

    Mean Absolute Percentage Error (MAPE):

$$ MAPE=\frac{1}{n}\sum \limits_{i=1}^n\left|\frac{P_i-{O}_i}{O_i}\right|\times 100 $$
(17)

where Pi indicates the estimated daily global solar radiation, Oi represents the observed values, σ is describes the standard deviation, and n considered as the number of the data pairs. The RMSE is a measure of random errors, which represents some information about the short-term performance of the regression. Knowing about the amount of RMSE and MAE is important because they represent errors in the units of the studied variables. The values of 1 in R2 represent the better performance and perfect fit of the model, which is contrariwise in the case of RMSE and MAE indices (Badescu 2014). The MAPE indicates the mean absolute percentage difference between the estimated and measured data. Also, the RRMSE is obtained by dividing the root mean square error to the average of observed or measured data. Some studies such as Deo et al. (2018) and Ghorbani et al. (2018) indicated that different limits of this index is used to determine the accuracy of the models such as excellent for RRMSE<10%, good for 10%<RRMSE<20%, fair for 20%<RRMSE<30%, and poor for RRMSE>30%.

Results and discussion

In this study, the efficiency of a radiation estimation model was investigated by using satellite images compared to the Angstrom-Prescott empirical relation and eight selected empirical models of solar radiation estimation (Angstrom-Prescott model) and calibrated Angstrom-Prescott model, Swartman, Hargreaves & Samani, Gopinathan, Garg, and El-Metwally). After collecting data and preparing inputs, the models were implemented and the desired statistical indicators were calculated according to the amount of radiation measured with Pyranometers.

Evaluation the efficiency of models

Figure 2 shows the comparison between the calculated and estimated values of radiation using empirical models and parametric model (SBDART) and the values of statistical error indices of the studied models in general and by station are presented in Table 4 and Fig. 2.

Fig. 2
figure 2

Output distribution of models versus the values observed at the studied stations

Table 4 Calculated statistical indices of models

As illustrated in Fig. 2 and Table 4, the coefficients of determination vary between 0.25 and 0.95, most of which are related to the SBDART (Parametric) model and the least of which are related to the Hargreaves model. After SBDART model, the highest R2 values are related to calibrated Angstrom and Angstrom-FAO models (R2 =0.89), Garg model (R2 = 0.88) and Gopinathan model (R2 = 0.86), respectively. In addition, the mean absolute error percentage of the SBDART model is significantly different from other models (MAPE = 4.3%). After the parametric model, the lowest MAPE is related to the Angstrom model (MAPE = 5.98%). The MAPE measurement value for other models is higher than 12% and the highest percentage of mean absolute error is related to the El-Metwally model with the rate 446.4. Furthermore, the relative root mean square error (RRMSE) is equal to 5.2% and 7.5% for the SBDART and calibrated Angstrom models, respectively. For other statistical error indicators, the SBDART model performs significantly better than other models.

Figure 2 shows how the calculated data are distributed around the bisector line. As shown, only the SBDART model and then the calibrated Angstrom model have little scattering of points around the bisector line. The highest scatter rate is related to Hargreaves and El-Metwally models, whose error indices were higher in Table 4 than other models. The RMSE values of different models vary based on the overlapped data of all stations studied in the range 13.5–1272.6 W/m2.day.

At Zanjan station, the RMSE values of different models change in the range of 10.4-1162.7 W/m2.day. The lowest mean absolute percentage error (MAPE) is equal to 3.9% and the highest rate of coefficient of determination (R2) is equal to 0.97 and both indicators are related to the SBDART model. The calibrated Angstrom model with values of MAPE =7.4% and R2 =0.9 after the parametric model has a better estimate of solar radiation at this station. At Yazd station, SBDART model with a considerable difference from other models with MAPE=3.9% and R2=0.97 have the best estimate of solar radiation in this station. Statistical indices of SBDART model of Mashhad station indicated a better estimate of this model than other models (MAPE =4.3% and R2 =0.96). In a similar situation for Kerman station, SBDART model with indices of MAPE =4.7% and R2 =0.95 has better performance than experimental models. In terms of RMSE index, the best performance is related to the parametric model, and the Angstrom-Prescott empirical model using satellite data has the highest error compared to the studied models like the coefficient of determination index. This is also not the case with Kerman station, and seems to be related to the fact that satellite data related to the daily average cloud fraction (CF) fit well with sunshine hour data (Table 3).

Combining empirical models and satellite data

The results of statistical analysis of empirical models show that the increase in model variables does not necessarily increase their accuracy and the estimation of models with less input such as Angstrom model is more accurate than other models. On the other hand, as shown in Table 5, by comparing the three models of Angstrom Prescott based on station observations, the parametric model and Angstrom based on satellite data on different days in terms of fraction of sunshine hours (cloudy and cloudless day), the parametric model is more efficient on cloudy days than on cloudless days so that the calculated RMSE and MBE indices for cloudy days are 9.07 W/m2.day and - 0.72 W/m2.day, and are 19.77 W/m2.day and 3.60 W/m2.day, respectively, for sunny days. In contrast, the empirical Angstrom-Prescott model had better results with observational data on cloudless days so that the indices were RMSE = 43.60 W/m2.day and MBE = 34.25 W/m2.day for cloudy days, and were RMSE = 29.56 W/m2.day and = MBE = 6.19 W/m2.day for cloudless days. With the calculated indices for the Angstrom-Prescott model with satellite data, it is not possible to judge whether it is different or insensitive to cloudy or sunny days. Furthermore, this model has not been very sensitive to nNchanges.

Table 5 Comparison of models in different sky conditions in terms of cloud cover

Sensitivity analysis of models

The sensitivity of empirical solar radiation models to meteorological variables was analyzed. The implemented method is based on changing the meteorological value by a defined percentage steps and computing the corresponding percentage change in Rs while keeping the other variables constant at their long-term averages. This was accomplished in the study area by changing the variables from −20 to + 20% with ± 5% increments that were deemed to influence Rs. These variables included: Tmax, Tmin, RHmean%, cloudiness, Hourly Vapor Pressure (hvp) and sunshine hours.

Figure 3 shows the relative change in Rs due to the relative change in a selected meteorological variable. As it can be seen in the figure, in Calibrated Angstrom-FAO model, in response to the change in n/N by 20%, Rs changed about 10%. This value was about 11% for n/N in main Angstrom-FAO model. As well as, for Gopinatan model, a 20% change in n/N led to a 7.1% change in Rs. In the case of El-Metwally model, this value was about 28% for the cloudiness and about 1.1% and 1.4% for Tmax and Tmin, respectively. Also, For Swartman model, with n/N and RHmean variables, Rs had the highest sensitivity to n/N with a change of 7%. Furthermore, it can be seen at the Garg model, in response to the change in hvp by 20%, Rs varied about 30%. This value is about 20% for n/N. And finally in Hargreaves model the highest changes of Rs is belong to Tmax by value of 34 % in comparison with 12.5% variation in response to a 20% change in Tmin.

Fig. 3
figure 3

Percent change in Rs vs. percent changes in meteorological variables

Conclusion

The present study proposed a satellite based method based on SBDART algorithm to predict the global solar radiation in target stations with more accuracy. The ability to collect continuous temporal and spatial signals above the atmosphere is considered as one of the advantages of remote sensing and satellite images. Also, the minimum data requirement is considered as another key advantage of this method, especially in data-scarce regions. This features makes it possible to estimate radiation in remote areas where radiation stations are scattered. In addition to being more accurate nationwide (rather than point-to-point), satellite data have far lower costs. Each of the two satellite based methods used here have advantages and disadvantages. The parametric model is accurate for different sky conditions (R2 = 95% and RRMSE =5.2%) and only uses satellite data without any need of other meteorological data. However, its complexity, having a large number of input variables, as well as being time consuming (especially due to the lack of access to high-speed computer systems) are considered as its weaknesses. On the other hand, the satellite based Angstrom-Prescott model has no complexity in the process and requires a small number of variables thus it can be easily used for different areas. This model needs accurate sunshine hour’s data and requires calibration of relationship coefficients for the study area. However, its lower accuracy relative to the parametric model (R2 = 0.57 and RRMSE= 16.9%) in some precise applications may not yield good results. Based on the findings of this study, the proposed satellite based method is highly efficient in predicting global solar radiation values. By considering the importance of solar radiation values, the performance of other modern models can be evaluated for simulating this variable. Due to the difference in model efficiency in different climatic conditions, it is suggested that the applicability of introduced parametric model is also evaluated in different climate conditions.