Introduction

It is now accepted worldwide that the climate system is warming in recent decades with the increase in air and ocean temperatures, increase in sea levels and widespread snow melting (Intergovernmental Panel on Climate Change 2007). It is reported that in the late nineteenth century, the earth temperature has increased about 0.3 and 0.6 °C. Over the past 100 years (1906–2005), the land surface air temperature has increased up to 0.74 °C and expected to increase up to 1.1–6.4 °C by the year 2100 (Intergovernmental Panel on Climate Change 2007). Change in air temperature is directly or indirectly linked with the other climatic variables such as vapor pressure deficit, shortwave radiation, relative humidity, precipitation, wind speed and direction and dew-point temperature (Gao et al. 2006), which in turn affect the evaporation from the surface and transpiration from the plants and lead to sudden changes in evapotranspiration from coupled plant–soil interface. Since evapotranspiration is a nonlinear complex function of many climatic variables, any changes in the variables in conjunction with the air temperature lead to an abrupt increase or decrease in the magnitude and trends of evapotranspiration. Therefore, identifying the reasons of such changes in evapotranspiration is a crucial step to understand the hydrological cycle in the region. However, the quantitative analysis of dominant factors affecting evapotranspiration trends in the South Korea is not addressed so far, although some qualitative argument has been made about the relationship between evapotranspiration and climatic variables.

The amount of evaporation that would occur if a sufficient water source was available can be represented as potential evapotranspiration (PET) (Penman 1948; Thornthwaite 1948), and reference evapotranspiration (ETo) refers to the amount of evaporation and transpiration from a reference vegetation of grass (Hargreaves 1994; Allen et al. 1998; Droogers and Allen 2002). Both the PET and ETo methods are a complex biophysical process affected by different climatic and environmental parameters. In this study, ETo is estimated by the temperature-dependent Hargreaves method (Hargreaves 1994) and FAO Penman–Monteith (FAO-PM) method which is dependent on variety of different climatic variables (Allen et al. 1998). Less data-intensive methods such as Hargreaves or Thornthwaite methods are highly recommended in developing countries where limited meteorological stations are available (Singh and Pawar 2011). The Thornthwaite method has previously been recommended for the humid climatic condition such as east central of America and south Florida (Abtew 2007) and thus can successfully be applied in South Korean environment. A theoretical entity more than an operative one, the PET or different types of ETo is difficult to quantify directly, and so, a variety of different estimation methods have been developed because of the involvement of different climatic variables in each model. Therefore, comparing the performance of each predictive model of evapotranspiration leads to the use of complete set of climatic variables and can produce more convincing estimates of the dynamic of potential evaporation (Donohue et al. 2010).

Overall, the same ETo method may yield different results under different climatic conditions and geographical environments. The high altitude, changeable weather and complex terrain and even season of the year can be responsible for the changing results of ETo between the empirical equations, for example (Lang et al. 2017). Differences among methods often reach hundreds of millimeters per growing season (Federer et al. 1996), and accuracy of a given method depends heavily on the climatic conditions of the study site.

Furthermore, many studies have shown the comparison of different PET and ETo models. For example, Chen et al. (2005) performed comparison of the Thornthwaite method and pan data with the standard Penman–Monteith estimates of reference evapotranspiration in China; results showed that the Thornthwaite method overestimates the ETo in southeastern part of China and underestimates it in other parts of China in spring, summer and autumn. Hari (2016) compared the four different temperature- and radiation-based and combined parameter-based (FAO-56) modified Penman–Monteith methods; results showed that the highest sums were provided by the Blaney–Criddle, Penman–Monteith-FAO-56, modified Penman methods and Hargrove method, while the lowest amounts were provided by the modified Penman methods and Hargreaves methods. Zarei et al. (2015) showed that pan evaporation method, Hargreaves–Samani modified and Blaney–Criddle do not have a significant difference (p < 0.05 in ANOVA statistics) by Penman–Monteith-FAO-56. However, the Thornthwaite model has the most difference by Penman–Monteith-FAO-56. Ilesanmi et al. (2014) conducted analysis for Onne and Kano states in Nigeria and results showed that the Blaney–Morin–Nigeria method was the best to be used for estimating ETo as this method showed strong correlation with the Penman–Monteith method. Alexandris et al. (2008) performed the comparison of the ETo from the surface of rain-fed grass in central Serbia, using six empirical methods against the Penman–Monteith method and results showed that the Turc's and Makkink's methods underestimated results, while the Priestley–Taylor, Hargreaves–Samani and Copais methods performed well for the region and generated results closest to the Penman–Monteith method.

Many researchers have investigated the trends in ETo, and they yielded diverse results in different regions of the world. For example, in Pisa Italy, Moonen et al. (2002) analyzed the trends over the period of 122 years and found a decrease in minimum temperature, precipitation, maximum temperature and ETo trend. The ETo value was calculated based on temperature using the Hargreaves empirical equation (Hargreaves 1994). However, ETo may show the decreasing trend with the increase in temperature; for example, in northwest and southeast China, ETo has decreased during 1954–1993 in all the seasons (Thomas 2000). In addition to this, decreasing trends were also observed in Yangtze River basin, Yellow River basin, northern regions and Tibetan Plateau of China (Wang et al. 2007; Zhang et al. 2007, 2010). Decreasing trends in ETo were observed in India (Bandyopadhyay et al. 2009) and USA (Irmak et al. 2012). A decreasing trend in ETo shown in above studies is completely opposite to the general perception that the increase in air temperature lead to an increase in actual evaporation, ETo and PET.

On the other hand, many regions also showed an increasing trend in ETo. For example, Kaohsiung, south Taiwan, showed an increasing trend in ETo over the 48-year period (Yu et al. 2002). Northeast arid zone of Nigeria showed an increase in ETo because of the observed increasing trend in wind speed over the region (Hess 1998). Analysis of the trends in pan evaporation for the Canadian Prairies showed both increasing and decreasing trends during the period of 1971–2000 (Burn and Hesch 2007). They analyzed the influence of climatic parameters on pan evaporation and showed that the wind speed tends to show more decreasing trend and vapor pressure deficit tends to show more increasing trends. In addition to this, semiarid Iran also tend to show more increasing trend in ETo (Tabari 2010; Tabari and Marofi 2010). One of the major reasons of inconsistent findings in the trends of ETo is the fact that some climate change studies utilize only one kind of empirical equation which uses only few climatic variables (i.e., temperature or radiation) and thus is unable to consider the influence of other critical climatic variables (i.e., wind speed, sunshine hours, relative humidity, atmospheric pressure, etc.). Therefore, comparing the performance of different predictive models can increase the possibility of using more climatic variables and thus potentially more robust and true trends in ETo can be obtained.

Robustness of significance testing for climatic data is getting special concerns in trend analysis studies because of the difficulties in establishing a valid null hypothesis and the impact of long term persistence, as reported in previous literature (Clarke 2010; Serinaldi et al. 2018). These difficulties in trend analysis techniques are because of the reason that the hydro-climatic data are not able to meet the assumptions of statistical testing such as distribution, correlations and stationarity. Many researchers in South Korea have conducted the studies about trend analysis of hydro-climatic variables. For example, Jung et al. (2011) investigated the spatial and temporal variability of precipitation trends over Korea during the years 1973–2005 using the Mann–Kendall test (Mann 1945; Kendall 1975). Some trend analysis studies mainly focused on summer or seasonal precipitating trends (Kwon and Lee 2004; Chang and Kwon 2007; Baek et al. 2017); a very few studies focused on spatial and temporal variation in ETo trends in South Korea (Nam et al. 2015), which is an important component of hydrological cycle over South Korea. In most of the previous studies, trend analysis was performed using nonparametric techniques, especially Mann–Kendall (MK) test, without paying any attention to the existing serial correlation in the climatic data. However, it is well recognized in previous literature that the serial correlation in climate data adversely affects the power of the trend test (Yue et al. 2002). Therefore, a more comprehensive approach based on variance correction, the modified Mann–Kendall (ModMK) proposed by Hamed and Ramachandra Rao (1998), was adopted in this study to remove the effect of serial correlation.

Based on above discussion, the main objectives of this study were: (1) to estimate the monthly and annual ETo using FAO Penman–Monteith (FAO-PM), Hargreaves and Thornthwaite methods and to compare them at both spatial and temporal scales; (2) to detect the monotonic linear trends in the annual and monthly ETo time series considering the effect of serial correlation; (3) to estimate and compare the slopes of trend lines of ETo time series using Theil–Sen's estimator and linear regression (LR) method; and (4) to identify the most crucial and dominating climatic variables influencing the ETo time series over the region, using multiple (stepwise) regression analysis.

Materials and methods

Study area and data

South Korea is situated in East Asia and has a total area of 100,210 km2. Asian monsoon season heavily impacts the climatic patterns over South Korea. Winter is mainly comprised of extremely dry and cold air and summer is comprised of warm and moist air masses coming from Southeastern Asia. The spatial distribution and topographical characteristics of 54 meteorological stations selected in this study are presented in Fig. 1.

Fig. 1
figure 1

Spatial distribution and topographical characteristics of 54 meteorological stations (used in this study) in South Korea

Initially, monthly and annual meteorological data were collected from 70 meteorological stations administered by the Korea Meteorological Administration (KMA; Web site: web.kma.go.kr) over South Korea for the period of 1971–2017 (47 years). However, only 54 meteorological stations were selected because of the unavailability sufficient record lengths or missing values of meteorological data at other 16 stations. The climatic variables used in this study to calculate ETo consist of the following: latitude of the sites (Lat, in degrees), monthly total precipitation (Ppre, mm), monthly mean daily maximum temperature (Tmax, °C), monthly mean daily minimum temperature (Tmin, °C), monthly mean temperature (Tavg, °C), monthly mean daily wind speeds at 2 m height (U2, m/s), monthly mean daily bright sunshine hours (Tsun, hour), monthly mean dew-point temperature (Tdew, °C), monthly mean relative humidity (RH, %), monthly mean atmospheric pressure at surface (P, kPa), monthly mean atmospheric pressure at sea level (P0, kPa) and elevation of the sites (z, m). Atmospheric pressure was available in Pascal, and the necessary unit corrections were made to convert it to kPa, to confirm its application to evapotranspiration equations. Initially collected data were plotted to visually inspect whether there is any error in the data. If there were any outlier or unusual data points, we then relooked the same data for the nearby stations and corrected the errors accordingly. Furthermore, preliminary analysis was performed to check the quality of initially collected using the double-mass method and the run test to evaluate the homogeneity and randomness of the data, respectively(Adeloye and Montaseri 2003).

Methods for estimating ETo and PET

There are many methodologies proposed for the estimation of ETo and PET. However, their performance varies with the variation of environment because all the equations have different empirical backgrounds. Previous studies have recommended the use of FAO Penman–Monteith method (FAO-PM) for determining ETo (Xing et al. 2008); however, following special weather conditions, it might lead to errors (Widmoser 2009). At the places where the substantial inaccuracy existed in large climatic data set, FAO-PM method may lead to uncertainty in calculations. Therefore, alternative methods such as Hargreaves and Thornthwaite were also adopted to compare the performance of different predictive models. The mathematical formulation of above-stated methods of ETo and PET is as follows.

FAO Penman–Monteith (FAO-PM) method

The FAO-PM method is recommended by Allen et al. (1998) as the standard method for calculation of ETo. This method is able to incorporate a variety of different climatic variables which can be divided into physiological and aerodynamic parameters (Xu et al. 2006). Its accuracy and reliability under different climatic variables have been widely accepted in South Korea (Nam et al. 2015). Following the methodology proposed by Allen et al. (1998), the method was used to calculate the monthly values of ETo and annual ETo was estimated by accumulating the monthly ETo. Further details about the computation of climatic parameters, are available in Chapter 3 of the FAO paper 56 (Allen et al. 1998).

Hargreaves method

The original Hargreaves method (Hargreaves 1994) uses only the maximum and minimum temperatures to estimate the \({\text{ET}}{\text{o}}\), and the mathematical formulation can be expressed as follows:

$${\text{ET}}{\text{o}} = 0.0023 \times 0.408R_{\text{a}}^{{\prime }} \left( {T_{\hbox{max} } - T_{\hbox{min} } } \right)^{0.5} \left( {T + 17.8} \right)$$
(1)

where \(R_{\text{a}}^{{\prime }}\) is the extraterrestrial solar radiation measured in (MJm−2d−1) and \(T\) is defined as the average of \(T_{\hbox{max} }\) and \(T_{\hbox{min} }\). However, Droogers and Allen (2002) proposed the modified Hargreaves method which can give better results with the addition of precipitation factor in the formula, which corrects the \({\text{ET}}{\text{o}}\) by utilizing the amount of rain of each month as a proxy for insolation. In this formula, addition of precipitation data is based on the assumption that it can represent relative levels of humidity. The following mathematical equation can be derived (Droogers and Allen 2002):

$${\text{ET}}{\text{o}} = 0.0013 \times 0.408R_{\text{a}}^{\prime } \left( {\left( {T_{\hbox{max} } - T_{\hbox{min} } } \right) - 0.0123P} \right)^{0.76} \left( {T + 17.0} \right)$$
(2)

Here, PP indicates the precipitation in mm per month. Therefore, the modified Hargreaves method is adopted in this study because of availability of reliable precipitation in South Korea. Moreover, because of unavailability mean external radiation data, it is estimated using the latitude data and month of the year.

Thornthwaite method

The Thornthwaite method is proposed by Thornthwaite (1948), which consists of average temperature for a given period (T), normal annual temperature (Ta) and number of sunshine hours N. In this study, PET was calculated following the methodology proposed by Thornthwaite:

$${\text{PET}}_{J} = 16\left( {\frac{{10\bar{T}_{J} }}{I}} \right)^{{a_{\text{th}} }} \left( {\frac{{\overline{\text{hrday}} }}{12}} \right)\left( {\frac{\text{daymon}}{30}} \right)$$
(3)

where \({\text{PET}}\) indicates the monthly potential evapotranspiration in mm/month for month J (J = 1,…,12), \(\overline{\text{hrday}}\) indicates the mean daily daylight hours during month J, \({\text{daymon}}\) is the number of days in month J, \(\bar{T}_{J}\) is the mean monthly air temperature (°C) during month J and is the annual heat index. The annual heat index is estimated as the sum of the monthly indices:

$$I = \mathop \sum \limits_{j = 1}^{12} i_{J}$$
(4)

where \(i = \left( {\frac{{\bar{T}_{J} }}{5}} \right)^{1.514}\)

$$a_{\text{th}} = 0.4924 + 1.79 \times 10^{ - 2} I - 7.71 \times 10^{ - 5} I^{2} + 6.75 \times 10^{ - 7} I^{3}$$
(5)

Trend analysis techniques

Modified Mann–Kendall (ModMK) test

Trend analysis techniques are applied on the basis of assumption that the observed time series is serially independent. However, the data such as mean annual values of climatic variables may show significant correlation. In such situation when the significant autocorrelation is present in the time series, MK test tends to show the significant trends, when no trend exists in reality (von Storch 1995; Yue and Wang 2004). Therefore, the existence of serial correlation primes to an increasing probability of disproportionate rejection of the null hypothesis. To cope with this problem, Hamed and Ramachandra Rao (1998) considered all the significance serial correlation structure existed in the time series of climatic data. This method used the corrected value of the variance

$$V\left( S \right)^{*} = {\text{cf}}*V\left( S \right)$$
(6)

The positive (negative) value of \(S\) shows the upward (downward) trend. The \(S\) is assumed as normally distributed if \(N \ge 8\), and its mean and variance can be estimated as follows:

$$E\left[ S \right] = 0$$
(7)
$$\text{var} \left( S \right) = \frac{{\left[ {N\left( {N - 1} \right)\left( {2N + 5} \right) - \mathop \sum \nolimits_{i = 1}^{n} t_{i} i\left( {i - 1} \right)\left( {2i + 5} \right)} \right]}}{18}$$
(8)

where N is the number of observation and ti are the ties of the sample time series. \(V\left( S \right)\) depicts the variance value of the MK test statistic, while \(S\) shows the original time series and \({\text{cf}}\) is the correction factor as proposed in Hamed and Ramachandra Rao (1998) which can be expressed as follows:

$${\text{cf}} = 1 + \frac{2}{{N\left( {N - 1} \right)\left( {N - 2} \right)}}\mathop \sum \limits_{k = 1}^{N - 1} \left( {N - k} \right)\left( {N - k - 1} \right)\left( {N - k - 2} \right)r_{k}^{R}$$
(9)

\(r_{k}^{R}\) shows the value of serial correlation coefficient for lag-\(k\) of the observed climatic time-series data. If the value of Z is positive, then trend is called as increasing and vice versa. In this study, trends were tested at the significance level of \(\alpha = 0.05\;{\text{and}}\;0.01\).

Theil–Sen estimator

An approach proposed in Theil (1950) and Sen (1968) is utilized to estimate the magnitude of the trends in \({\text{ET}}{\text{o}}\) time series, which can be computed as follows:

$$\beta = {\text{Median}}\left[ {Y_{i} - Y_{j} /i - j} \right]\quad \forall j < i$$
(10)

where \(1 < j < i < n\). If N represents the all possible combination of record pairs for the entire climatic data set, then value of slope can be computed as \(n = N\left( {N - 1} \right)/2\), and \(\beta\) indicates the median of these n values.

Linear regression (LR) method

The slope value estimated by the Theil–Sen estimator is compared with the slope value estimated using LR method. The positive slope value calculated by LR shows an increasing trend, while a negative value shows the decreasing trend. The LR line can be estimated as follows:

$$y = a + bx$$
(11)

Here, \(x\) is an explanatory variable and y is a dependent variable, while \(b\) and \(a\) indicate the slope and intercept, respectively (Gocic and Trajkovic 2013).

Each regression model is tested for the potential existence of collinearity between independent variables by computing the variance inflation factor (VIF) (Marquaridt 1970). Collinearity problem exists when VIF is very large, such as 10 or more. The multicollinearity has been tested in previous studies conducted in Arizona (Wang et al. 2016) and also been used for monthly land-cover-specific evapotranspiration models derived from global eddy flux measurements and remote sensing data (Fang et al. 2016).

Results

Statistical summary of climatic variables

Basic statistical analysis of climatic variables which are used to estimate the value of \({\text{ET}}{\text{o}}\) is crucial because a small change in variables directly affects the value of \({\text{ET}}{\text{o}}\). Box plot of all climatic variables recorded for the period of 1971–2017 (47 years) at 54 meteorological stations across South Korea is presented in Fig. 2. The line drawn within the rectangle indicates the mean values, while rectangle width in the upper and lower portion shows the 75th and 25th percentile, respectively. The lower end of the line shows the minimum value, and the upper end shows the maximum. It can be seen from the statistical summary of 54 meteorological stations that during summer season (June, July and August) South Korea has the minimum values of average \(P\) and \(P_{0}\) (nearly 100.5 kPa) as compared to other months of the year (Fig. 2a, b). Apart from outliers, overall, the monthly variation of \(P\) and \(P_{0}\) is quite similar to each other. Similarly, in case of Tdew, the mean values started increasing from February (− 7 °C) and reached the peak in August (22 °C) and then started decreasing abruptly till December (Fig. 2c). The monthly variation of Tmax, Tmin and Tavg is quite similar throughout the year (as shown in Fig. 2d, e, j). An abrupt increase is observed in the ranges of the maximum, minimum, 75th and 25th percentiles of total monthly precipitation in summer (Jun, July, and August) season (Fig. 2f). The possible reason of this increase is the Asian Monsoon system, which brings the majority of summer precipitation in a relatively short period (Chung et al. 2004). In case of RH (Fig. 2g), after the slow decline from January to April, the abrupt increase was observed from April to July and started declining from July onward. In Fig. 2i, the highest (lowest) value of wind speed was observed in April (September), while in Fig. 2i, highest numbers of sunshine hours were recorded in May and lowest in July. Overall, most of the climatic variables showed the significant changes in temporal patterns during summer because of an increase in temperature and effect of Asian monsoon system.

Fig. 2
figure 2figure 2

Box plot showing the monthly variations of the major meteorological variables across South Korea. These variables include: a atmospheric pressure (kPa), b atmospheric pressure at sea level (kPa), c dew-point temperature (°C), d maximum temperature (°C), e minimum temperature (°C), f total monthly rainfall (mm), g relative humidity (%), h wind speed (m/s), i sunshine (hour), j mean temperature (°C)

Spatial and temporal distribution of ETo across South Korea

In terms of wet and dry conditions, there is no large climatic difference between the stations, because of the small area of South Korea (100,210 km2). Overall, South Korea falls under the category of humid continental/subtropical climate with dry winter. The temperate climate of South Korea contains very wet and humid conditions, especially during summer season due to the influence of the North Pacific high-pressure system and East Asian monsoon. So summer meets the well-watered condition to estimate the value of ETo. In case of winter, the South Korea is topographically influenced by expanding Siberian high-pressure zones which results in predominantly cold, dry northwesterly winds. So concerns might exist about the estimation of ETo in dry weather conditions in winter. In order to cope with dry and wet climatic conditions, in this study, three approaches were compared to check the sensitivity of evapotranspiration data at local conditions.

The contour maps to show spatial distribution of mean annual ETo and PET, during the period of 1971–2017, are presented in Fig. 3, which shows a combined effect of all climatological factors. It can be seen from Fig. 3 that all the three methods showed different values of ETo from one location to another because of different climatic environments. ETo value is relatively higher along the south and east coastal areas and relatively lower along the northwest side of South Korea. This is because of the reason that the yearly average, minimum and maximum temperature in South Korea increases from north to southward, and along the east and south coast. This increase in temperature can be reflected in the ETo and PET contour lines of FAO-PM, Hargreaves and Thornthwaite methods. For example, the highest mean annual values of FAO-PM (Fig. 3a) were estimated at the station Daegu (945 mm, located at southeast part), Mokpo (882 mm, located at southwest coast), Sokcho (871 mm, located at northeast coast) and Chupungnyeong (849 mm, located at mid-latitude portion). In case of Hargreaves (Fig. 3b), the annual average values of ETo are significantly higher at all stations as compared to FAO-PM and PET values of Thornthwaite method. This is because of dominant role of Tmax and Tmin in the empirical equation used in Hargreaves. The highest values of Hargreaves were estimated at the stations such as Uiseong (1183 mm), Miryang (1166 mm) and Namwon (1151 mm) located near the south side or coastal part of South Korea, where annual average temperature is relatively higher as compared to other regions. The lowest values of Hargreaves were estimated at Daegwallyeong (843 mm) and Incheon (899 mm) stations located at northern portion of South Korea. The possible reason is that the temperature of South Korea tends to be cooler as moving from south to northward. Thornthwaite ETo calculated based on Tavg also showed a similar distribution pattern as that of other two methods as shown in Fig. 3c. The highest mean annual values were estimated at the stations such as Daegu (796 mm), Busan (785 mm) and Pohang (780 mm) located at southern part, whereas the lowest value was estimated at Daegwallyeong (561 mm) located at northern part.

Fig. 3
figure 3

Spatial distribution of mean annual \({\text{ET}}{\text{o}}\) (mm) across South Korea using following methods: a FAO-PM, b Hargreaves, c Thornthwaite

The spatial distribution maps of ETo and PET give a valuable information to water resource engineering for accurate planning and management of water resources in the region, since the changes in annual and seasonal values of ETo and PET are an important driving force which directly affect the hydrological cycle. Furthermore, analyzing the influence of changes in meteorological variables to ETo in different seasons or areas will be likely to have a different effect on the ETo and PET and, in turn, affect the water availability in the region.

The monthly temporal distribution of ETo and PET estimated during 1971–2017 using FAO-PM, Hargreaves and Thornthwaite methods is presented in Fig. 4. It can be seen that, from winter to spring (December–May), the mean, 75th and 25th percentile values of the FAO-PM are higher than those of the Thornthwaite and lower than those of Hargreaves. However, during summer season (June–September) PET estimated by Thornthwaite is significantly higher than the ETo estimated by FAO-PM and lower than the ETo estimated by Hargreaves. During winter season (December–February), Thornthwaite method has the least values of the mean, 75th and 25th percentile as compared to other two methods because of its heavy dependence on the sole climatic variable (i.e., Tavg). Overall, Thornthwaite and Hargreaves methods are more sensitive to seasonal variation of temperature and lead to over prediction as compared to FAO-PM, which is able to consider the collective impact of different climatic variables. Therefore, FAO-PM method shows relatively smooth seasonal deviation, with the gradual increase from January to May and decrease from August onward.

Fig. 4
figure 4

Temporal distribution of monthly \({\text{ET}}{\text{o}}\) (mm), comparing all methods of evapotranspiration (Thornthwaite, Hargreaves and FAO-PM)

It can be noticed from the Thornthwaite values in Fig. 4, during winter season (December–February) the mean values are nearly zero. This is because below-freezing temperature in winter season is estimated as zero in the parameterization of Thornthwaite method (Van Der Schrier et al. 2011), whereas the parameterization of FAO-PM and Hargreaves methods still estimates some values of ETo.

Temporal trend analysis of monthly and annual ETo and PET across South Korea

The trends in the monthly and annual ETo and PET were analyzed by performing the ModMK test as mentioned in “Theil–Sen's estimator” section. Z statistics of ModMK are presented in Tables 1, 2 and 3 using Thornthwaite, Hargreaves and FAO-PM, respectively. The value of Z statistics is tested for the significance level of 95% (\(\alpha = 0.05\)) and 99% (\(\alpha = 0.01\)) as indicated in bold and asterisk, respectively. Summary graph of the trend analysis using three methods is presented in Fig. 5a–c. In case of Thornthwaite, most of the stations tend to show positive trends in monthly and annual time series; however, the number of stations with significant positive trend varies from one month to another. It can be clearly seen from Table 1 and Fig. 5a that a significant positive trend (p < 0.05 or p < 0.01) was observed in more than 50% of the stations during February (36), March (43), May (38), June (33), October (42) and November (31), whereas insignificant trends (no trends) were observed in January (50), April (40), July (51), August (44), September (28) and December (47), at 95% and 99% confidence levels. However, none of the station showed a significant negative trend (except one station in January and two stations in August) in monthly and annual time series.

Table 1 Z statistics of modified MK trend tests for the Thornthwaite reference crop evapotranspiration over South Korea
Table 2 Z statistics of modified MK trend tests for Hargreaves reference crop evapotranspiration over South Korea
Table 3 Z statistics of modified MK trend tests for FAO-PM reference crop evapotranspiration over South Korea
Fig. 5
figure 5

Summary of ETo and PET trend analysis at 54 meteorological stations across South Korea using three methods: a Thornthwaite, b Hargreaves, c FAO-PM

In case of Hargreaves, from January to December (except February) more than 70% of the stations did not show any significant trend (p > 0.05), as shown in Table 2 and Fig. 5b. In summer season (June–September), more than 50% of the stations showed a decreasing trend; however, only few stations showed a significant decreasing trend.

As compared to Thornthwaite and Hargreaves methods, the FAO-PM method showed more equally distributed significant increasing and decreasing trends, especially in summer season, as shown in Table 3 and Fig. 5c. However, still more than 50% of the stations fall under the category of insignificant trends (p > 0.05) throughout the year.

Thornthwaite method showed the highest number of significant increasing trends after the Hargreaves and FAO-PM, which can be correlated with the effect of parameterization between them. The comparative significant increasing trend of Thornthwaite method is related to the simplicity of its parameterization (Van Der Schrier et al. 2011). In addition to this, the reason of greater number of stations with significant increasing trends in case of Thornthwaite and Hargreaves methods is likely associated with the exclusion of cloud cover and vapor pressure deficit (which relates to the capacity of the air to take up water) in their parameterization, as both parameters tend to suppress high value of ETo estimated by Hargreaves and PET in atmosphere (Van Der Schrier et al. 2011).

The stations having highest value of Z statistics in case of Thornthwaite, Hargreaves and FAO-PM are the Suwon (8.67), Busan (6.57) and Sancheong (11.78) stations, respectively. Their corresponding annual time series were drawn to check and compare the linear regression slopes at each selected station, as shown in Fig. 6. It can be seen that the annual ETo estimated by Hargreaves method is significantly higher than that estimated by the FAO-PM and Thornthwaite method at all three stations. In case of Suwon station, Thornthwaite has the highest value of slope (2.72) and R2 (0.5947), following the FAO-PM (slope of 2.48 and R2 of 0.41) and Hargreaves (slope of 0.67 and R2 of 0.07) methods. Positive slope of LR observed in all three methods indicates the possibility of existing correlation between them at Suwon station.

Fig. 6
figure 6

LR-based trend in annual evapotranspiration at the stations having highest value of Z statistics: a Suwon station for Thornthwaite method, b Busan station for Hargreaves method, c Sancheong station for FAO-PM method

Similarly, the slope of LR and R2 values for Busan and Sancheong stations are shown in Fig. 6b, c, which confirms the increasing trend and presence of correlation between their time series. Furthermore, an abrupt increase in annual ETo and PET was observed in years 1974, 1994 and 2015 at all the 54 meteorological stations using three methods selected in this study. This increase in evaporative demand during 1974, 1994 and 2015 is also mentioned in previous studies (Nam et al. 2015; Azam et al. 2018).

Monthly temporal distribution of the magnitude of the trends detected in monthly \({\text{ET}}{\text{o}}\) (mm) throughout the year estimated by Thornthwaite, Hargreaves and FAO-PM methods is compared using the mean (symbol“×”), 25th, 75th percentiles of the slope b of LR and Sen's slope, as shown in Fig. 7. It can be seen that the overall seasonal changing patterns of both the LR and the Sen's slopes are quite similar; for example, an abrupt increase in the range of the maximum, minimum, 75th and 25th percentiles of summer \({\text{ET}}{\text{o}}\) (June, July and August) was observed for LR and the Sen's slope. However, the monthly magnitude of the trends varies significantly according to the method of estimation of \({\text{ET}}{\text{o}}\). For example, in case of LR (Fig. 7a), the Thornthwaite has the highest value of 75th percentile and lowest value of mean from May to August as compared to Hargreaves and FAO-PM methods. However, in case of Sen's slope method (Fig. 7b), the mean, 25th, 75th percentiles of Thornthwaite, Hargreaves and FAO-PM methods are quite comparable throughout the year.

Fig. 7
figure 7

Monthly temporal distribution of the magnitude of trends detected in \({\text{ET}}{\text{o}}\) (mm/year) using; a the slope b of LR, b Sen's slope, using three methods (i.e., Thornthwaite, Hargreaves and FAO-PM)

The increase in the magnitude of the trends in summer season can be correlated with the sudden increase in geopotential height at 700 hPa (\(\Phi 700\)) over mid-latitude Asia in the mid-1970s (Ho et al. 2003). An increase in geopotential height causes stronger northerly wind and produces more evaporation. Furthermore, the increase in evaporative demand during summer is linked with the intensification of the upper-level westerly Jet over Korea and the anomalously warm sea surface temperature (SST) in the western North Pacific region (Baek et al. 2017).

Relationship between Thornthwaite, Hargreaves and FAO-PM equations

Because of the difference between the parameterization of temperature-based equations (i.e., Thornthwaite, Hargreaves) and high data-intensive equation (i.e., FAO-PM), it is important to analyze the nature of correlation existed between their \({\text{ET}}{\text{o}}\) time series. Spatial distribution of monthly Pearson's correlations between Thornthwaite–Hargreaves (Thorn–harg), Thornthwaite–FAO-PM (Thorn–Pm) and Hargreaves–FAO-PM (Harg–Pm) and their corresponding relationship were assessed by drawing the linear fit on the scatter plot as shown in Fig. 8. All the three combinations of \({\text{ET}}{\text{o}}\) equations showed close linear correlation (p < 0.01). It can be seen that the corresponding R2 of the linear fit (on right) increases with the increase in the value of Pearson's correlation (left). The monthly values of \({\text{ET}}{\text{o}}\) time series tend to be more concentrated around the linear fit in case of Harg–Pm, with the highest values of R2 (0.8) and Pearson's r (0.84–0.98) as compared to Thorn–Pm (R2 0.65 and Pearson's r 0.70–0.92) and Thorn–harg (R2 0.76 and Pearson's r 0.70–0.92). This seems like linear fit tending to increase with the increase in the value of parameterization from low data-intensive equations to high data-intensive equations. Furthermore, the value of Pearson's correlations decreases with the increase in the value of yearly average values of ETo and PET from north to southward as shown in Figs. 3 and 8. The correlation is relatively low at high-temperature areas along the mid-latitude east coast and south coast and high at low-temperature areas along the north portion and along the west coast. Thus, areas of lower correlation can be associated with the dominate role of temperature in Thornthwaite and Hargreaves methods and the integrated effect of sunshine, wind, temperature and relative humidity, atmospheric pressure in FAO-PM method. This finding matches well with the previous literature (Beguería et al. 2014). Apart from temperature, precipitation is also an important climatic factor used in the Hargreaves (modified) method which results in a significant reduction in sunshine hours and wind speed as the available energy is the primary driver of ETo, and this fact is also recognized by Irmak et al. (2012). Thus, due to indirect linkage between the parameterization of the Hargreaves and FAO-PM methods, the value of correlation is highest among them.

Fig. 8
figure 8

Spatial distribution of Pearson's correlations (left) between monthly ETo and PET time series obtained by the Thornthwaite (Th), Hargreaves (Harg) and FAO-PM (Pm) equations at 54 meteorological stations in South Korea. Scatter plot (right) shows the corresponding relationship of the magnitude of change for different ETo and PET equations; the solid line represents the perfect agreement (1:1) and the dashed line represents a linear fit to the data

Apart from the Pearson's r correlation, LR regression equation is utilized to check the nature of linear trend existing between the ETo and PET, estimated by three different methods. It can be noticed from Fig. 8 (right) that all pairs of ETo and PET methods showed an increasing trend, with the Thorn–harg has the highest value of slope b (0.745) following the Harg–Pm (0.532) and Thorn–Pm (0.408). A significant increasing trend detected by the LR method in Fig. 8b matches well with the results of temporal trend analysis performed separately on the time series of Thornthwaite and Hargreaves methods using modMK test as shown in Tables 1 and 2 and Fig. 5a, b. Therefore, the trends detected by the Z statistics of modMK test are quite comparable with the trends detected by the LR method. The similarity between behavior of the modMK test statistics and LR equation is also observed in the trend analysis of precipitation and temperature (Asfaw et al. 2018) and drought time series (Azam et al. 2018).

In order to check the seasonal changes in correlation between the methods of ETo and PET, the monthly and annual values were drawn in the form of box plot as shown in Fig. 9. It can be clearly seen that Hargreaves–FAO-PM has the highest value of correlation throughout the year, which verifies the results of R2 value of 0.8, as shown in Fig. 8f. This increase in the correlation is mainly due to stations located near the western coast and near the North Korea (Fig. 8e).

Fig. 9
figure 9

Monthly and annual temporal variation of Pearson's r correlation at 54 meteorological stations across South Korea

However, smaller correlation in the eastern and southern coastal areas of South Korea is because of the reason that eastern and southern coasts are located in the path of East Korea warm current (EKWC). EKWC is a surface oceanic current located at the sea of Japan. It branches off from the Tsushima Current located at the eastern side of Korea Strait, and it passes along southeastern coast of Korean peninsula toward north (Olesen 2011). In addition to this, east coastal areas of South Korea were also affected by the North Korea cold current (NKCC), especially during winter season. Choi and Zhang (2005) investigated the association of sea and air temperatures in the eastern and southern coastal regions of South Korea, by checking the temporal variation of heat and momentum exchanges which in turn cause the heating and cooling in atmosphere of coastal areas. They concluded that wind speed, surface and air temperature over sea surface can affect the latent heat fluxes in the lower atmosphere of coastal areas.

Influence of climatic variables on ETo changes in South Korea

In case of Thornthwaite and Hargreaves, one can assume that increasing or decreasing are because of atmospheric temperature. This is because of the reason that the temperature is the most important climatic variable in their empirical equations. However, in case of FAO-PM it is difficult to identify the reason of the significant trends detected by the LR, Sen's slope and modMK Z statistics because of the involvement of various climatic variables. Therefore, in order to quantify the influence of each climatic variable which is responsible for the observed increasing trends in monthly values of ETo at various meteorological stations in South Korea, a multiple stepwise regression method was applied on the monthly time series of ETo obtained by FAO-PM. The multiple stepwise regression analysis was carried out using R programming package, namely "olsrr" (Hebbali 2017), in which monthly time series of ETo is considered as a dependent variable and eight climatic variables such as Tmax, Tmin, Tsun, Tdew, P, P0, U2 and RH are considered as independent variables. The results of multiple stepwise regression model for each month (January–December) are presented in Table 4. It can be noticed that the order of climatic variables is different from one month to another, sorted on the basis of dominance of climatic variable influencing the monthly values of ETo. The climatic variables having lowest value of p and highest value of R2 are selected in the first step of multiple stepwise regression and considered as the most dominant variable influencing the value of ETo. The results showed that during autumn, early winter (September–January) and early summer (May–June) wind speed is the most dominant variable influencing ETo at 55 stations across South Korea. The highest values of Pearson's correlation for the wind speed during autumn, early winter and early summer confirm its dominant effect on ETo (Table 5). On the other hand, during spring (February–April) and late summer (July–August), the Tmax followed by U2 is the most dominant climatic variable, using multiple stepwise regression model (Table 4) and Pearson's correlation method (Table 5). The R2 value varies from 0.754 to 0.901 which showed fairly good agreement of multiple stepwise regression model. Those climatic variables which do not have a significant impact (p > 0.05) on ETo were removed in multiple stepwise regression model and indicated in bold (Table 4). The corresponding values of beta, standard error and t statistics for each climatic variable are also shown in Table 4.

Table 4 Monthly climatic variables sorted according to value dominance and R2 in multiple stepwise regression model at 95% confidence level
Table 5 Pearson's correlation between the climatic variables and ETo from January to December

The sensitivity and impact of U2 and Tmax are higher on the value of ETo as compared to other climatic variables. The higher influence of U2 during autumn, early winter and early summer can be because of the fact that the energy associated with the wind speed leads to a rapid change in vapor pressure deficit conditions. Previous studies have shown the impact of wind speed on the magnitude and significance of ETo trends in Australia (Rayner 2007; Roderick et al. 2007), in Tibetan Plateau (Shenbin et al. 2006; Zhang et al. 2007) and in Canada (Burn and Hesch 2007). Furthermore, combination of U2 and Tsun also leads to affect the ETo (Gao et al. 2006; Xu et al. 2006). Irmak et al. (2006) showed that ETo is most sensitive to U2 at semiarid regions having strong wind. This is because of the reason that an increase in wind speed will lower the aerodynamic resistance which in turn increases the value of ETo. In general, when crops transpire water, the immediate surrounding environment of the crop canopy will be moist. In arid climates, the wind flow is most likely to replace this moist air with dry air and causes an increase in ETo.

They found that at semiarid regions and island locations ETo is more sensitive to Tmax than any other climatic variable in summer months, which match well with the results of this study. These findings indicate the importance of each climatic variable in terms of trends and magnitude of ETo and their value changes according to the regional and temporal characteristics. As a result, only one or two climatic variables are not responsible to the spatial or temporal changes in ETo; therefore, it needed to be collectively estimated by a combination of energy balance equation, rather than relaying only on temperature or radiation data.

Multicollinearity diagnosis is performed using the VIF to identify the collinearity problem. VIF values were tested for the ETo obtained by FAO-PM as shown in Table 6. Most of the climatic variables lie between the VIF values of 1 and 6 (VIF < 10), thereby indicating that there were no potential multicollinearity problems existed in our dataset.

Table 6 Seasonal collinearity statistics for the climatic variables in South Korea

Conclusions

Since ETo and PET determine the crop water demand and irrigation scheduling across the region, investigating the spatial and temporal trends is a crucial step. In this study, a spatial and temporal trend analysis of monthly and annual ETo was performed, while taking care of serial correlation of time series, using Thornthwaite, Hargreaves and FAO-PM methods. Their results were compared to find more robust trends at 55 meteorological stations throughout the region. A multiple stepwise regression method was used to identify the leading climatic variables influencing the trends in ETo. The following major conclusions can be drawn from this study:

  1. 1.

    Although there is the quantitative difference between the values of PET and ETo estimated by Thornthwaite, Hargreaves and FAO-PM methods, spatial changing pattern of PET and ETo according to the location of meteorological stations is same; for instance, ETo value is higher along the south and east coastal areas and lower along the northwest side of South Korea.

  2. 2.

    Trend analysis performed considering the effect of serial correlation using modMK Z statistics showed that the Thornthwaite method tends to show the highest number of stations (most of them located at east and south coast) with a significant increasing trend in monthly and annual time series (except July and August) as compared to Hargreaves and FAO-PM methods, because of its strong association with temperature. However, FAO-PM method showed the larger number of stations (mostly located on mid-latitude areas and south coastal areas) with significant decreasing trends as compared to Thornthwaite and Hargreaves methods, which can be correlated with the impact of the collective involvement of different climatic variables in addition to the atmospheric air temperature. Moreover, the magnitude of the trends detected by the LR and the Sen's slope method is quite comparable with the abrupt increase in summer.

  3. 3.

    Spatial and temporal correlation analysis showed the presence of association (p < 0.01) between Thornthwaite, Hargreaves and FAO-PM equations, while the strongest correlation existed between Hargreaves and FAO-PM as it ranged from 0.84 to 0.98 with R2 value of 0.8. Furthermore, the correlation is relatively lower at high-temperature areas along the mid-latitude east coast and south coast and high at low-temperature areas along the north portion and along the west coast.

  4. 4.

    Various climatic variables evaluated to check their influence on ETo using multiple stepwise regression model showed that the U2 is the most influencing climatic variable, especially in autumn, early winter and early summer, and Tmax during spring and late summer.

Although this study clearly stated the existence of increasing trends in monthly and annual ETo and identified the wind speed and maximum atmospheric air temperature as the forcing mechanism behind these trends, evaluation of the amount of increase in water demand with the increase in ETo, in various sectors such as agriculture and reservoirs operations is beyond the scope of this study. The results of this study can be used as a theoretical basis for future water resource planning and management in South Korea. Moreover, ETo changes based on location and time of the year emphasize the seasonal management of reservoirs and hydraulic structures across the country.