1 Introduction

Water resources in semiarid regions are vulnerable to the impacts of climate change and human activities, and the accurate estimation of reference evapotranspiration (ETref) is a primary tool in the management of water resources. Also, the estimation of ETref by using hydrological equations can be helpful in agriculture sectors (Celestin et al. 2020; Ndiaye et al. 2020; Yan et al. 2021). It has a key role in the management of water resources and the determination of crops’ water demands in the semidry regions (Berti et al. 2014; Ferreira et al. 2019; dos Santos Farias et al. 2020).

The most accurate evaluation of ETref is computed by the lysimeter method, but this method has high costs and requires complex instruments (Ahooghalandari et al. 2016; Ahooghalandari et al. 2017). Therefore, alternative techniques for indirect estimation of ETref were developed based on empirical equations. Numerous empirical equations have been introduced to estimate ETref. Despite the advantages of empirical equations such as ease of use, applicability due to the great variety of required parameters, and classification based on various climatic parameters, the low accuracy of some of these equations in estimating ETref is one of the main challenges in their application. In contrast, the FAO56 Penman-Monteith (PM-FAO56) equation is the standard combination-based model used to estimate the ETref in different climates and at different time scales (Güçlü et al. 2017; Saggi and Jain 2019; Shiri et al. 2019; Ndiaye et al. 2020; Sharafi and Mohammadi Ghaleni 2021). The accuracy of this equation is due to its consideration of all climatic parameters, including solar radiation, air temperature, wind speed, and relative humidity (Ndiaye et al. 2020). Furthermore, the equations for empirical models based on temperature, solar radiation, and mass transfer use fewer climatic parameters in the calculation of ETref. Therefore, the development and calibration of empirical model equations in different climates can be more effective for agricultural and hydrological projects where only a few climatic variables are available (Heydari and Heydari and Heydari 2014; Gafurov et al. 2018).

Several researchers have evaluated the dependence of different empirical ETref equations on various meteorological parameters over different climates. Gao et al. (2017) also assessed different empirical ETref equations in various climates and observed that the PT and Hargreaves (HG) equations worked best in dry and semidry climates, while the MK equation worked best in the humid climate of China. Sharafi and Ghaleni 2021) evaluated different empirical equations for ETref in different climates of Iran. Their results found that the simplest regression model (MLR) based on minimum and maximum temperature data was more precise than the empirical equations. They also recommended the solar radiation–based Irmak equation as the best substitute for the PM-FAO56 model, especially in dry and semidry climates.

Rahimikhoob et al. (2012) compared four temperature-based and solar radiation–based equations using data from eight stations in subtropical climates of Iran and confirmed the applicability of Priestley–Taylor (PT) and Hargreaves–Samani (HS) equations in those climates. The comparison results showed that the original PT and HS equations were more applicable in a humid climate. The performance of the PT and HS equations improved slightly after the calibration; however, the Trajkovic (TRAJ) and Makkink (MAKK) equations improved greatly. Tabari et al. (2013) compared different temperature-based, solar radiation–based, and mass transfer–based equations for modeling ETref in humid climates of Iran and found that the temperature-based Blaney–Criddle (BC) and HS equations surpassed the other temperature-based models. Cross-comparison of the 31 empirical equations showed that the five best equations as compared with the PM-FAO56 model were the two solar radiation–based equations developed, the temperature-based BC and Hargreves-M4 equations.

Farzanpour et al. (2019) conducted 20 ETref equations using daily meteorological data of 10 stations (12 years) in semidry climates of Iran. Their results revealed that the calibrated equations might be a good alternative for the PM-FAO56 equation. Bourletsikas et al. (2018) compared 24 different equations for estimating ETref in Greece and concluded that calibrating mass transfer-based equations is essential for improving their performances. Celestin et al. (2020) compared the 32 empirical ETref equations with the PM-FAO56 using data on temperature, solar radiation, and mass transfer in northwest China. They found that the World Meteorological Organization (WMO) and the Mahringer equations for the mass transfer–based model provided the best results.

Above all, in the present study, we have tried to introduce the best calibrated equations with the highest accuracy in different climates of Iran. Therefore, the goals of this study were (1) the use of monthly data of eight climatic variables measured in 41 synoptic stations over a period of 30 years (1990–2019); (2) the calibration of 32 empirical equations based on temperature-based, solar radiation–based, and mass transfer–based equations in four main climates (very dry, dry, semidry, and humid) in the study area; (3) the comparison of the results of the original and calibrated equations versus the PM-FAO56 equation using different statistical criteria; and (4) drawing an accurate map of the results of the best calibrated empirical equations in the study area.

2 Materials and methods

2.1 Time and location scales

Iran is in the northern hemisphere between 25 and 40° latitude. For this study, meteorological data recorded between 1990 and 2019 were collected from 41 synoptic stations in the country. These data were collected by the National Meteorological Organization of Iran and include the monthly mean of minimum, mean and maximum air temperature, relative humidity, wind speed measured at 2 m height, and solar radiation. The data were complete, and no data needed to be reconstructed.

According to the FAO56 index, Iran is classified into four climatic regions: very dry, dry, semidry, and humid (Fig. 1). Figure 1 shows the location and climate for each station used in this study. Six stations were in very dry climate, 17 stations in dry climate, 14 stations in semidry climate, and 4 stations in humid climate (Fig. 1).

Fig. 1
figure 1

Location and classification of studied stations based on the FAO56 method

2.2 Empirical ETref equations

Based on the type and importance of input variables used in each empirical equation to calculate the ETref, the models were divided into 4 categories: combination-based (1 equation), temperature-based (11 equations), solar radiation–based (11 equations), and mass transfer–based (10 equations). Table 1 lists the 33 empirical ETref equations used in this study and their respective references.

Table 1 The ETref estimated based on empirical equations

To calculate the PM-FAO56, measurements of the amount of total solar radiation at the Earth’s surface (Rs, MJ m-2 d-1), maximum and minimum temperature, wind speed (m s-1), and lack of vapor pressure (VPD, kPa) are required. Due to lack of access to Rs and VPD, the FAO method was used (Gholipoor 2009). Daily values of Rs were obtained from Hargreaves and Samani’s equation (Mehdizadeh et al. 2017) and the modified Allen et al. (2006) equation. Solar radiation reaching the land surface (Rn, MJ m-2 d-1) was first measured above the Earth’s atmosphere for each day of the year based on latitude and longitude and the solar constant (Allen et al. 2006). Then, Rs was calculated using Eq. 34:

$$ Rs={K}_{Rs}\times \left(1+2.7\times {10}^{-5}\times Alt\right)\times {\left({T}_{\mathrm{max}}-{T}_{\mathrm{min}}\right)}^{0.5}\times {R}_n $$
(34)

where Alt is altitude (m) and KRs is the empirical constant, considered equal to 0.16 (Gholipoor 2009). The es calculation is obtained from the difference between the daily saturated water vapor pressure (emax) and the actual water vapor pressure (ea). Relative humidity at temperature was assumed to be at least 100 percent and the values for ea were obtained from Eq. (35):

$$ {e}_a=0.6108\times \exp \frac{\left(17.27\times {T}_{\mathrm{min}}\right)}{\left({T}_{\mathrm{min}}+237.3\right)} $$
(35)

In very dry and dry climates, the relative humidity at the Tmin may never reach 100%. Therefore, it was assumed that in these regions, ea values would occur at Tmin > 10 °C and it was observed that in this case it had a minor effect on ETref (1–2%). As a result, the ETref was calculated assuming that the dew point was equal to the Tmin. Then, the Tmax saturated vapor pressure during the day (emax) depending on the Tmax was obtained from Eq. (36).

$$ {e}_{\mathrm{max}}=0.6108\times \exp \frac{\left(17.27\times {T}_{\mathrm{max}}\right)}{\left({T}_{\mathrm{max}}+237.3\right)} $$
(36)

where es is obtained from the mean ea and emax. Finally, es is calculated as the average between ea and emax in a part of the day when the air temperature is not at its maximum. However, other researchers have found that (emax-ea) × 0.75 is a more accurate estimate of ea (Tanner and Sinclair 1983; Allen et al. 1998). Therefore, this method is used in this current study. Calculations were performed using SAS software (Statistical Analysis System, Version 9.1, SAS Inst., Cary, NC).

The average annual rainfall over the last 30 years in Iran was reported to be 334 mm. The highest annual rainfall occurred at Bandar Anzali station (1791.78 mm), and the lowest annual rainfall occurred at Bam station (61.61 mm). The average annual rainfall in the very dry climate was 92.89 mm, which was 140.74, 266.41, and 1150.05 mm less than in dry, semidry, and humid climates, respectively (Fig. 2(a)). The annual average relative humidity in Iran was reported to be 55.34% with the highest relative humidity at Bandar Anzali station (84.71%) and the lowest relative humidity at Bam station (28.08%). The relative humidity in the very dry climate was 44.64%, which was 1.02, 7.9, and 35.93% less than in the dry, semidry, and humid climates, respectively (Fig. 2(b)).

Fig. 2
figure 2

The long period values of (a) rainfall, (b) relative humidity, (c) temperature, (d) solar radiation, and (e) wind speed of Iran’s climate (1990–2019)

The 30-year average air temperature in Iran was reported to be 17.54 °C. The hottest and coldest stations in this study were the Bandar Abbas and Ardabil stations (26.63 and 9.14 °C, respectively). The average air temperature in the very dry climate was 20.44 °C, which was 3.4, 10.15, and 6.04 °C warmer than in the dry, semidry, and humid climates, respectively (Fig. 2(c)). The average solar radiation received in Iran is reported to be 7.26 MJ m-2 day-1. The highest and lowest received solar radiations were observed in Bam and Rasht stations (9.06 and 4.17 MJ m-2 day-1, respectively). The average solar radiation received in the very dry climate was 8.55 MJ m-2 day-1, which increased by 0.36, 1.06, and 3.74 MJ m-2 day-1 in dry, semidry, and humid climates (Fig. 2(d)). The average wind speed in the country during the last 30 years was reported to be 4.35 m s-1, which has increased by about 0.52 m s-1 compared to the same period. The highest and lowest wind speeds were recorded in Zabol and Gorgan stations, respectively (10.62 and 1.45 m s-1). The mean wind speed in a very dry climate was 6.54 m s-1, which increased by 2.15, 2.63, and 3.97 m s-1 in dry, semidry, and humid climates (Fig. 2(e)).

2.3 Evaluation performance criteria

Until now, many performance criteria have been used to evaluate the results of the model for prediction of ETref. The equations were assessed for each station by means of six statistical measures used to evaluate the accuracy of each model in estimating the ETref: the coefficient of determination (R2), the root mean square error (RMSE), the average percentage error (APE), the mean bias error (MBE), the index of agreement (D), and the scatter index (SI). The explanations for the statistical measures appear in Table 2. These criteria are widely reported in the literature (Kisi 2014; Samaras et al. 2014; Celestin et al. 2020).

Table 2 The characteristics of evaluation performance criteria used in the study

The R2 coefficient, acquired by the least squared regression analysis, is a commonly used correlation measure. For the absolute and/or relative errors’ estimation, RMSE, APE, and MBE indices were also evaluated. The descriptive index of agreement (D) was used for the correlation between the equations, expressing the degree to which an equation’s predictions are error-free (Willmott 1982). According to Li et al. (2013), the range of SI for the accuracy of the models is excellent (SI < 0.1), good (0.1 < SI < 0.2), fair (0.2 < SI < 0.3), and poor (SI > 0.3).

In Eqs. (37) to (42), \( {ET}_{ref_i}^{PM_{FAO56}} \)and \( {ET}_{ref_i}^{\operatorname{mod} el} \)are the ETref based on PM-FAO56, and modeled ETref, \( {\overline{ET}}_{ref}^{PM_{FAO56}} \)and \( {\overline{ET}}_{ref}^{\operatorname{mod} el} \)are the mean values of ETref based on PM-FAO56 and modeled ETref and N is the number of data sets.

2.4 Empirical equation calibration

The basis of empirical equations used in estimating ETref is the regression relationship between the ETref equation as a dependent variable and meteorological parameters as independent variables. In the process of developing each of the empirical equations, one of two modifications may be made, either a change in meteorological parameters or a change in the coefficients of the equation. In this study, modification (optimization) of constant coefficients in empirical equations is the basis for increasing the accuracy of ETref estimation in different climates. The objective function of that change has been to minimize the RMSE error criterion by optimizing the constant coefficients of the equations as decision variables. For instance, in the HASA equation, the two coefficients a and b in Eq. (43) are optimized to minimize the amount of error between the estimated ETref and the PM-FAO56.

$$ {ET}_{ref}=\left[a\times {R}_a\left({T}_{mean}+17.8\right){\left({T}_{\mathrm{max}}-{T}_{\mathrm{min}}\right)}^b\right]/\lambda $$
(43)

The accuracy of empirical equations in estimating ETref before and after calibration was evaluated using error evaluation criteria separately for various empirical equations (temperature-based, solar radiation–based, and mass transfer–based) and for very dry, dry, semidry, and humid climates.

3 Results

3.1 Accuracy evaluation of empirical equations

To assess the 32 empirical equations for temperature-based, solar radiation–based, and mass transfer–based models in different climates, meteorological datasets from 1990 to 2019 were evaluated. Table 3 shows the values of R2 and RMSE for original and calibrated empirical equations based on temperature, solar radiation, and mass transfer methods in different climates. According to the results of the best values of R2 and RMSE in the temperature-based BARO equation for very dry (0.71 and 65.05), dry (0.63 and 74.69), semidry (0.46 and 60.25), and humid (0.59 and 73.35) climates observed in original BARO equation. These values after calibration were 0.82 and 32.79 in very dry climate, 0.73 and 42.12 in dry climate, 0.65 and 30.34 in semidry climate, and 0.71 and 35.29 in humid climate. The DRAL1 equation in dry climate (0.73 and 36.4), and BERT equation in semidry (0.66 and 31.16) and humid climates (0.7 and 30.58) had acceptable results (Table 3).

Table 3 The original and calibrated values of R2 and RMSE for empirical ETref equations in different climates of Iran

For solar radiation–based methods, the maximum R2 in very dry, dry, and humid climates derived by the original HARG equation was 0.6, 0.47, and 0.45, respectively, but the best RMSE in very dry climate obtained by the original IRMA1 equation was 67.51, and in dry and humid climates as obtained by the original JEHA equation was 65.92 and 66.77, respectively. The result for calibrated equations showed that in very dry (R2 = 0.84 and RMSE = 33.13), dry (R2 = 0.74 and RMSE = 34.9), and semidry (R2 = 0.63 and RMSE = 34.36) climates, the OUDI, ABTE2, and TATA3 equations yielded reliable estimates. In the humid climate, calibrated PRTA and MAKK equations showed the maximum R2 = 0.71 and the minimum RMSE = 33.71 (Table 3).

The results from mass transfer–based methods showed that the values of R2 for the original PEMN equation in very dry, dry, semidry, and humid climates were acceptable (0.74, 0.7, 0.53, and 0.64, respectively). The values of RMSE for the original PENM equation were reported as 54.17 for very dry, 53.13 for dry, and 46.46 for semidry climates, but the best value of RMSE for humid climates, 49.2, was obtained by the original ROMA equation.

The results from mass transfer–based methods showed that the values of R2 for the calibrated PEMN equation in very dry, dry, semidry, and humid climates were acceptable (0.85, 0.79, 0.71, and 0.77, respectively). The values of RMSE for the calibrated PEMN equation were 27.87 for very dry, 31.83 for dry, and 29.38 for semidry climates, but the best value of RMSE for humid climates, 30.14, was obtained by the calibrated ROMA equation (Table 3).

Radar charts in Fig. 3 compare the APE values of the \( {ET}_{ref}^{PM_{FAO56}} \) and the estimated ETref using the original and calibrated empirical equations for temperature-based, solar radiation–based, and mass transfer–based methods from 1990 to 2019. Based on the results of APE plots, calibration greatly improved the performance of all empirical equations in all investigated climates compared with the original empirical equations. After calibration, APE values are closer to zero.

Fig. 3
figure 3

The APE performance of original and calibrated empirical equations in different climates. (a) Temperature-based equations. (b) Solar radiation–based equations. (c) Mass transfer–based equations

A reduction in values of APE for the calibrated empirical equations was found in very dry, dry, semidry, and humid climates in temperature-based (1, 1.5, 2.2, and 3.2%), solar radiation–based (1, 1.4, 1.9, and 3.1%), and mass transfer–based (1, 1.3, 1.6, and 2.8%) methods when compared to the original equations (Fig. 3). This indicates the great effect calibration has relative to other empirical equations on increasing the accuracy of temperature-based methods. This increase confirms the accuracy of calibrated empirical equations in estimating ETref in humid climate. At the same time, the accuracy of ETref estimation for all empirical equations decreased from very dry to humid climates, indicating that the process of ETref estimation in humid climate is more complex due to its greater dependence on multiple climatic parameters.

3.2 Bias error evaluation of empirical equations

Figure 4 shows a decrease in MBE for calibrated empirical equations compared to the original empirical equations. This error reduction is evident in all types of equations and in all climates. Figure 4(a) shows the overestimation of most empirical temperature-based methods in very dry, semidry, and humid climates. The highest accuracy of empirical equations for temperature-based and solar radiation–based methods for estimating ETref in dry climate (Fig. 4(a-2) and (b-2)) is obtained when the MBE in this climate for all empirical equations is less than 20 mm year-1. Figure 4(c) shows the overestimation of ETref values for mass transfer–based empirical equations in humid climate and underestimation in very dry, dry, and semidry climates. The highest accuracy of MBE is acquired by mass transfer–based methods in semidry climate.

Fig. 4.
figure 4

The MBE performance of original and calibrated empirical equations in different climates. (a) Temperature-based equations. (b) Solar radiation–based equations. (c) Mass transfer–based equations

For temperature-based equations, the negative MBE values are observed for calibrated TATA1 and BARO equations in very dry climate, the original BARO and AHOO1 equations and calibrated HASA, TRAJ, TATA1, DRAL1, DORJ, BARO, and AHOO1 equations in dry climate, calibrated TATA2 and AHOO1 equations in semidry climate and, finally, BERT, DORJ, BARO, and AHOO1 calibrated equations in humid climate, indicating that the tendency of these equations is to underestimate ETref (Fig. 4(a)). The MBE values for most original and calibrated equations were overestimated in very dry (Fig. 4(a-1)) and dry (Fig. 4(a-3)) climates. Also, it is noteworthy that the highest overestimation and underestimation were observed in humid climate, which was reported in MAHR (39.9 mm year-1) and BARO (−34.3 mm year-1) original equations, respectively (Fig. 4(a-4), 4(c-4)). The overestimation of the in original equations varied from 25.27 mm year-1 to 26.15 mm year-1 in very dry and semidry climates, respectively. The overestimation of the temperature-based equations was found by Trajkovic (2007) and Landeras et al. (2008). Furthermore, according to Temesgen et al. (2005), higher wind speed combined with lower humidity resulted in lower values of temperature-based equations compared to PM-FAO56, especially in drier climates.

For solar radiation–based methods, the negative MBE values are observed for PRTA, IRMA2, TATA3, and OUDI original equations and PRTA calibrated equation in very dry climate, and ABTE2 original and calibrated equations in humid climate. Also, most equations in very dry, dry, and semidry climates showed a good response to calibration. In humid climate, due to the influence of more factors on ETref, more fluctuations were observed. However, the calibrated MAKK, HARG, and ABTE1 equations showed better results in humid climate (Fig. 4(b-4)). Analyses by Trajkovic and Kolakovic (2009) showed the solar radiation–based equations had a tendency to overestimate ETref in humid climates of Serbia.

For mass transfer–based equations, most of the original and calibrated equations exhibited underestimation in very dry, dry, and semidry climates. According to the MBE values, the original and calibrated equations of ROHW in the very dry climate, PENM in the dry climate, DALT in the semidry climate, and ROMA in the humid climate showed better results (Fig. 4(c)). In general, the comparative results revealed that the mass transfer-based equations had the best performances among the ETref equations evaluated in very dry and semidry climates. Also, the temperature-based and solar radiation–based equations were the good equations for the dry climates (Fig. 4(a) and (b)).

3.3 Correlation between PM-FAO56 and empirical equations

Based on the results presented in Table 3, the highest accuracy of empirical equations for temperature-based, solar radiation–based, and mass transfer–based methods of estimating ETref in different climates is determined by the BARO, JEHA, and PENM equations, respectively. Figure 5 shows the values of D for the best equations of temperature-based, solar radiation–based, and mass transfer–based methods in different climates. A better fit between the estimated ETref and PM-FAO56 appears in the calibrated empirical equations when compared to the original empirical equations in all empirical equations and climates.

Fig. 5.
figure 5

The D performance of original and calibrated empirical equations in different climates. (a) Temperature-based equations. (b) Solar radiation–based equations. (c) Mass transfer–based equations

The best fit between the PM-FAO56 and estimated ETref values is related to the calibrated PENM equation in humid climate and is equal to 0.94 (Fig. 5(c-4)). Generally, the best and worst fit between the PM-FAO56 and estimated ETref values were related to the mass transfer–based (Fig. 5(c)) and the solar radiation–based methods (Fig. 5(b)).

Figure 6 shows PM-FAO56 and estimated ETref values for the best empirical equations in different climates during 1990 to 2019. The estimated ETref values using the empirical equations after calibration are very close to the calculated ETref values. This can especially be seen in the calibrated PENM equation in semidry climate (Fig. 6(c-3)).

Fig. 6
figure 6

The estimated ETref versus PM-FAO56 values in different climates. (a) Temperature-based equations. (b) Solar radiation–based equations. (c) Mass transfer–based equations

Figure 6 makes it clear that the calibrated techniques show better accuracy when compared to the original equations. For temperature-based equations, the best correlation was showed by calibrated BARO equation with R2 0.82, 0.73, 0.65, and 0.71 in very dry, dry, semidry, and humid climates, respectively. This equation resulted in the highest D index, ranging from 0.65 in very dry climate to 0.87 in dry, semidry, and humid climates (Fig. 5(a-1–4)). The results indicated a slight underestimation in ETref values for calibrated BARO equation in comparison to PM-FAO56 (Fig. 6(a-1–4)). The radiation-based equations overall performed better than the mass transfer equations, since a more important role is expected for Rs when estimating ETref in humid climates (Irmak et al. 2006). The calibrated JEHA equation exhibited the highest R2 and lowest RMSE (Table 3). Additionally, this equation presented the D index from 0.46 in very dry climate to 0.85 in semidry climate (Fig. 5(b-1–4)). Therefore, it can be concluded that the JEHA equation of this class can be suitable for estimating of ETref values in surveyed climates (Fig. 6(b-1–4)). These results for JEHA are in contrast to Tabari et al. (2013), who surveyed in humid climates. Among the 10 empirical equations from mass transfer–based methods, the calibrated PEMN equation showed accurate ETref equal to 2826, 2227, 1308, and 946 mm year-1 in very dry, dry, semidry, and humid climates, respectively (during 1990 to 2019). Simultaneously, it displayed the highest correlation coefficient (R2 = 0.85, 0.79, 0.71, and 0.77 in very dry, dry, semidry, and humid climates, respectively). On the other hand, this equation resulted in the highest D index ranging from 0.59 in very dry climate to 0.94 in humid climate (Fig. 5(c-1–4). Overall, the statistical indices of the mass transfer–based equations were reasonable in humid climate. Based on Valipour 2015), mass transfer–based equations cannot be suggested for use without calibration.

From the abovementioned, it can be concluded that the calibrated equations of BARO, JEHA, and PENM equations (as a good alternative for ETref estimations) have similar performance and they are recommended for use in different climates of Iran (Fig. 6).

3.4 SI map

Figure 7 shows that calibration at stations with very dry climate, such as Zabol, Zahedan, Bam, Iranshahr, and Chabahar stations, had a greater effect on the accuracy of ETref estimation based on the SI value in the excellent class (SI < 0.1). The highest amount of error in the SI index is related to stations with humid climates, such as Rasht and Nowshahr. This is due to the complexity of the ETref process in humid climate.

Fig. 7
figure 7

SI map of the best empirical equations in different climates

The SI map demonstrates that the original temperature, radiation, and mass transfer–based equations generally did not have excellent class (SI < 0.1), except BARO and PENM equations in Bam and Iranshahr stations, respectively. In general, the results of the present study confirmed that temperature-based equation had the more accuracy than solar radiation and mass transfer–based equations. Similar results were also reported by Farzanpour et al. (2019). Their results cleared that the temperature- and radiation-based equations generally have had similar SI values, giving more accurate simulations than the mass transfer–based equations. This might show the importance of temperature parameters on ETref estimating in the very dry and dry stations, as well as its superiority to the solar radiation-based. The impact of input climatic parameters seems to be very low as the mass transfer–based equations gave the undesirable results in these climates. However, this situation should be used with cautiousness because a comprehensive study should be conducted to determinate the portion of each parameter on the empirical ETref magnitudes, which is beyond the scope of this research.

4 Conclusion

The accurate estimation of ETref by empirical equations can be helpful for water resources management, crops’ water demand, and irrigation scheduling. This study attempted to investigate and calibrate 32 empirical equations classified in three categories (temperature-based, solar radiation–based, and mass transfer–based) in main climates (very dry, dry, semidry, and humid) of Iran. The results show that most of the calibrated empirical equations had good accuracy in estimating ETref in all studied climates. However, the accuracy of the ETref estimate before and after calibration depended on the classification of the equation in the type and number of input data and the type of climate under study. In other words, each climatic region has its own superior empirical equation. Also, with the complexity of climatic variables, the accuracy of various empirical equations is associated with change.

According to the results of the best values of R2 and RMSE in the temperature-based BARO equation for very dry (0.82 and 32.79), dry (0.73 and 42.12), semidry (0.65 and 30.34), and humid (0.71 and 35.29), climates were observed in the calibrated BARO equation. Also, the calibrated JEHA equation showed reliable estimates in very dry (R2 = 0.78 and RMSE = 32.39), dry (R2 = 0.69 and RMSE = 37.25), semidry (R2 = 0.60 and RMSE = 34.23), and humid (R2 = 0.68 and RMSE = 41.86) climates. Finally, the best values of R2 and RMSE were reported for the calibrated PENM equation in very dry (0.85 and 27.87), dry (0.79 and 31.83), semidry (0.71 and 29.38), and humid (0.77 and 31.71) climates, respectively. The results of APE, MBE, and D criteria show that the accuracy of the empirical equations after calibration increased significantly when compared to their original values. At the same time, the results of SI criterion and the effect of factors such as high relative humidity and the balance between air temperature and rainfall mean that the estimation of ETref is more complex. Considering the dependence of the ETref process on fewer meteorological parameters, we can conclude that in very dry climates the empirical equations before and after calibration are more accurate.

Considering the limitations associated with the availability and reliability of the climatological data, especially in developing countries, the good performance of empirical models must be emphasized, since they deal with a very simple equation. Further studies are needed in order to evaluate the performance of the calibrated equations in other areas in the world with different climates. Also, evaluation is needed for the performance of the empirical equations on a different time scale (daily). Therefore, more studies might use empirical equations and stations as well as other calibration scenarios for assessing these results in various climates.