1 Introduction

Climate classification refers to the grouping of similar conditions through formulating a set of environmental factors. The process generates areas with distinct climatic patterns and provides the possibility of finding localities with similar climate conditions in different geographical areas. Climate classification provides several benefits to agricultural planning, urban planning, risk assessment by insurance companies and governance and obviously climate studies (Fallmann and Emeis 2020). Various models are adopted in climate classification, examples include aridity index, Alisov, Köppen, Berg, Holdridge life zones, Lauer, Strahler, Thornthwaite, Trewartha, Troll, and Vahl. Amongst, Köppen model is widely used across scientific and industry community. The method is a vegetation-based climate classification, which provides a basis to identify climate similarity and dissimilarity (Köppen 1936). This method initially relied on the botanical zones under the influence of climatic conditions, but it was then improved by taking into account meteorological data and presenting climate types through arbitrary indices. Köppen-Geiger (KG) (Geiger 1961) is the final version of Köppen model and the most commonly used method of climate zonation in any geographical location (Sohoulande 2024). All that it requires is temperature and precipitation ground datasets (Ascencio-Vásquez et al. 2019; Hobbi et al. 2022).

The potential of the KG model to quantify climate variation and change over various periods has been demonstrated in previous studies (Cui et al. 2021). Due to its relevance to ecological conditions, this method can serve to classify the various aspects of the environment objectively from practical and theoretical points of view, such as photovoltaic climate zonation (Ascencio-Vásquez et al. 2019; Mazzeo et al. 2020), development of diffuse irradiation models (Every et al. 2020), estimation evapotranspiration in relation to Köppen’s method (Akhavan et al. 2018), evaluation of outdoor thermal perception and comfort conditions (Canan et al. 2020; Salata et al. 2018), mapping of the last glacial maximum, mid-holocene, and present climate zones (Yoo and Rohli 2016), and identification of particulate matter clusters (Pražnikar 2017).

The best outcome of KG method relies on long-term and fine resolution ground-based climatic variables (i.e. air temperature and percipitation) (Ouatiki et al. 2023). The data is required for accurate climate zonation, good definition of transition areas, and detection of past and future changes in climate zones (Bonsoms and Ninyerola 2024). However, owing to high costs and accessibility constraints, weather stations are characterized with spatially poor coverage and nonhomogeneous data (Javanshiri et al. 2021). As a result, application of KG method on a massive scale (e.g. national scale) is challanged by data inconsistency and scarcity (Beck et al. 2018; Zou et al. 2021).

To fill the gaps across the ground-based (i.e. data inconsistency and scarcity), two techniques have gained significant attention: (1) integrating satellite-derived climate variables into supervised machine learning (e.g. multilinear regression analysis). This is a statistical model to fill the gaps across ground-based data by developing a best-fit line between alternative sources of data (e.g. satellite images) and available data from weather stations (Aksu et al. 2023; de Moraes and Gonçalves 2023); and (2) Geo-statistical interpolation methods utilize the spatial relationship/equation of the measured sample points to create a continuous map with point data (Atkinson and Lloyd 2014; Madenci et al. 2019). Generally, the principle of spatial prediction means that items that are closer to each other tend to be more similar and vice versa. Obviously, the use of these techniques to generate data for KG method requires considering a diverse range of parameters such as spatial and temporal resolution of data, selection of explanatory variables (herein temperature, elevation and precipitation), data quality of satellite images, and geographical distribution of weather stations (Mokhtari et al. 2013). As a result, robust comparison between these techniques rarely conducted and widely have been limited to small and medium-sized areas (Aboutalebi et al. 2018).

Although satellite data provides continuous information on the spatial data characteristics of features, comparisons indicated a noticeable deviation between satellite precipitation and ground-based measurements in daily, monthly and annual timescales and different spatial scales (Ma et al. 2021). In order to achieve good spatial variation in precipitation and better accuracy than the original image data the least squares linear relationship between remote sensing precipitation data and ground-observed precipitation has been suggested (yosefi kebriya et al. 2021). Therefore, in this study, first, the relationship between ground data and data from satellite sources was examined through MLR method. As a routine procedure, statistical methods based on interpolation techniques are applied for spatial gap-filling of weather data (Hammann and MacDonell 2022). Therefore, the method based on spatial interpolation of point data will be evaluated as an alternative in this study.

Accordingly, within the context of the entire country of Iran, the objectives of this study are threefold: (1) use satellite-derived climate variables within the framework of machine learning, particularly multilinear regression analysis (MLR) to generate air temperature and precipitation for KG method; (2) use spatial interpolation to fill the ground-based climate data for KG classification and (3) Compare KG maps from two previous methods to enhance detection of the climate class boundary and facilitate climate change monitoring.

This study is expected to overcome the shortcoming of generation climate classification maps within the context of an entire country due to unavailability or spatially spars distribution of weather stations. In addition, climate change is expected to be better monitored due to the availability of satellite data on various spatial scales and the advancement of remote sensing technology in the near future.

2 Materials and methods

2.1 Description of the study area

The study area covers the entire country of Iran located in the south-western Asia. It expanses from 25° to 40° North latitude and from 44° to 64° East longitude (Fig. 1). The altitude changes from about -25 m to about 5610 m above sea level significantly contributes to the climatic heterogeneity of this country. (Ghajarnia et al. 2022; Javanshiri et al. 2021; Razmi et al. 2017). Iran is mostly located in arid and semi-arid zones (Fallah et al. 2017). The annual rainfall varies from less than 50 mm in the uninhabitable eastern deserts to 1800 mm on the shores of the Caspian Sea and the Western highlands.

Fig. 1
figure 1

Map of the study area and the locations of the synoptic weather stations

The temperature variation has increased over the last decades, and the mean annual temperature is predicted to increase in the next decades from 1.5 to 4.5 degrees centigrade under the scenarios proposed by the Intergovernmental Panel on Climate Change (IPCC). Overall, Iran experiences hyper-arid, arid, semi-arid, Mediterranean, and wet climate (cold mountainous type) with the spatial coverage of 35.5%, 29.2%, 20.1%, 5% and 10%, respectively (Amiri and Eslamian 2010).

2.2 Data

2.2.1 Satellie Dataset

As shown in Table 1, the satellite dataset for this study consists of MODIS/Terra products, GPM IMERG V06 precipitation and Shuttle Radar Topography Mission (SRTM) datasets. The MODIS imageries include LST (MOD11A2) and NDVI (MOD13A3) images with 1000 m spatial resolution for a period from 1 April 2016 to 29 May 15, 2019 on a monthly basis. In total 171 MODIS images were downloaded for this study.

Table 1 Specification of the satellite imageries used in this research study

KG method requires monthly base data for climate classification. Therefore, daily, 8-days average data sources were aggregated to be used in KG method. On the other hand, for the climate studies, usually, the data indicating weather patterns at least over the above 30 years are required (Francisco 2014). Since this study evaluates the capability of the aimed methodology, in order to reduce the computational process, the above-mentioned data periods were selected. On the other hand, comparisons of three climate classification approaches based on KG method using the same short-term data, it is expected not to affect the research methodology. Therefore, in case of a virtuous accuracy of the result, the presented methodology can be applied for the data of long-term data to produce a real climate classification map over the study area or anywhere else.

The GPM dataset was obtained from the above period with the spatial resolution of 10,000 m. GPM has approximately ten constellation satellites and a core satellite that was launched on February 27, 2014, with the multi-channel GPM Microwave Imager (GMI). GMI uses a set of frequencies and the first Ku/Ka-band dual-frequency space-based precipitation radar as a reference standard to unify precipitation measurements from a constellation of operational satellites (Ma et al. 2021). Finally, SRTM data with the spatial resolution of 1000m, created on November 29, 2014, was obtained from the USGS official website.

2.2.2 Meteorological data from weather stations

Precipitation and temperature are critical inputs for KG method. Both data are recorded in weather stations and are more available than the other types of climate data anywhere on the earth. In this study, the ground-based temperature and precipitation data were derived from 332 synoptic weather stations located in the 32 provinces of Iran. Subsequently, the ground-based data were aggregated to produce seven meteorological variables including mean temperature of the coldest (Tmin) and hottest (Tmax) month, mean annual near-surface temperature (Tann), accumulated annual precipitation (Pann), precipitation of the driest month (Pmin), lowest and highest monthly precipitation values of summer (Psmin) and winter (Psmax). The temperature and precipitation data was recorded in Celsius degrees (ºC) and millimeters (mm), respectively.

It should be noted that through stratified sampling, the weather station dataset was split into a modeling dataset and a testing dataset (30%-40%), which is a common practice in research (Deyasi et al. 2021; McGibbon et al. 2023). Accordingly, 33% of weather station data (109 stations) was selected for the evaluation of maps and remaining dataset (223 stations) was used for MLR and spatial interpolation purposes.

2.3 Methods

As shown in Fig. 2, research methodology comprised four main courses (1) modelling air temperature and precipitation values using MLR models from MODIS and GPM satellite imageries; (2) generate air temperature and precipitation values through spatial interpolation (i.e. Krigging and Co-Krigging); (3) produce six KG classification maps. This includes a KG classification map per percipitation and air temperature from MLR model, spatial interpolation and unused meteorological data in MLR model and spatial interpolation technique; and (4) comparing the KG classification maps to determine the most effective and accurate technique.

Fig. 2
figure 2

Research procedure to produce KG classification maps within the context of the entire country of Iran

2.3.1 Developing MLR models using MODIS and GPM data

The first course of methodology focused on developing multiple MLR models between air temperature and precipitation from 223 weather stations (dependent variables) and explanatory variables from MODIS, GPM, and SRTM imageries. It is reported that the linear relationship between remote sensing precipitation products and gauge-measured precipitation could be used to correct and predict satellite precipitation data (Ma et al. 2021). Generally, the linear regression model is presented as follows:

$${Y}_{i}={\beta }_{0}+\sum\nolimits_{j=1}^{n}{\beta }_{i}{x}_{j}+{\varepsilon }_{i}$$
(1)

where \({Y}_{i}\) is \({i}^{th}\) of dependent variable, \({\beta }_{0}\) is line interception coefficient,\({\beta }_{i}\) is regression coefficients of \({i}^{th}\) independent variable, that are determined by fitting the equation to the data, \({X}_{i}\) through \({X}_{n}\) are the independent variable and \(\varepsilon\) presents the difference between observed and predicted value of dependent variable. The total variability of dependent variable accounted for by multiple linear regression is measured through coefficient of determination (Tabachnick and Fidell 2007):

$${R}^{2}=\frac{{ss}_{reg}}{{ss}_{y}}$$
(2)
$${ss}_y=\sum\nolimits{(y_{i}-\overline y)}^2,{ss}_{reg}=\sum\nolimits{(y_{i}^{\prime}-\overline y)}^2$$
(3)

where \({ss}_{y}\) is the total sum of squares difference between each observed value \(y\) and the mean of \(y\) over all cases and \({ss}_{reg}\) is the total sum of squared due to regression calculated from predicted value \(\left(y'\right)\) and the mean of \(y\) over all cases. The squared multiple correlation can also be calculated from the sum of the correlation between dependent and independent variable, \({r}_{yi}\), and standardized regression coefficient, \({\beta }_{i}\) of independent variable as follow:

$${R}^{2}=\sum\nolimits_{i=1}^{k}{r}_{yi}{\beta }_{i}$$
(4)

where \(r_{yi}\) is the correlation between dependent variable and \(i^{th}_{{}}\) independent variable and \(\beta\) is the standardized regression coefficient that is used to estimate standardized \(y'\).

The satellite-based explanatory variables include 20 variables (see Table 2) which can categorized under 4 land surface temperature (LST), normalized difference vegetation index (NDVI), digital elevation model (DEM) and temperature-vegetation dryness index (TVDI). Previous studies have documented these explanatory variables to be the most effective for conducting regression analysis between air temperature and precipitation using data from weather stations and satellites (Hooker et al. 2018; Ouma et al. 2021).

Table 2 List of satellite-based explanatory variables used in MLR models

In order to generate the above data (explanatory variables), initially, the satellite images underwent required reprocessing and processing (i.e. data rescaling and normalization) schemes and finally resampled to 1000 m spatial resolution to be used for further analysis. Subsequently, three types of LST products were generated from MOD11A2 satellite images. These imageries were delivered in an 8-day average; thus, all the image within a month were averaged to produce monthly LST images. This study also produced yearly LST data as follows:

$${\overline{MLST} }_{i}=\frac{\sum\nolimits_{j=1 }^{3}{\overline{{LST }_{i}}}_{j}}{3}$$
(5)

where \({\overline{MLST} }_{i}\) is the mean monthly LST image of the ith month of three years, and \(\overline{{LST }_{i}}\) is the mean LST image calculated from the three 8-day images acquired in the ith month of the jth year. This study used the data of three years; therefore, the term \(\sum\nolimits_{j = 1}^{3}{\overline{{LST }_{i}}}_{j}\) was divided by 3.

TVDI is a widely used method to derive the spatial variation of soil moisture based on the relationship between LST and normalized difference vegetation index (Fig. 3) calculated from satellite data (Moosavi et al. 2016; Sandholt et al. 2002; Wigmore et al. 2019). TVDI values were calculated from following equations:

$$TVDI=\frac{LST - {LST}_{min}}{{LST}_{max} - {LST}_{min}}$$
(6)

where \(LST\) is the observed surface temperature, \({LST}_{min}\) is the wet edge, and \({LST}_{max}\) is the dry edge that is calculated from Eq. 10.

$${LST}_{max}=a+b*NDVI$$
(7)

where a and b coefficients are obtained from the linear relationship between LST and vegetation cover, and NDVI is an image calculated from red (R) and near infrared (NIR) satellite data as follows:

Fig. 3
figure 3

LST-NDVI triangle space adapted from Sandholt, et al. (2002)

$$NDVI=\frac{NIR - R}{NIR + R}$$
(8)

TVDI was calculated from the MODIS LST product image (MOD11A2) and the NDVI image (MOD13A3). Like Eq. 2, the monthly NDVI image was calculated from Eq. 6:

$${\overline{MNDVI} }_{i}=\frac{\sum\nolimits_{j = 1}^{3}{{NDVI}_{i}}_{j}}{3}$$
(9)

where \({\overline{MNDVI} }_{i}\) is the mean monthly NDVI image of the ith month of three years, and \({NDVI}_{i}\) is the NDVI image of the ith month of the jth year.

In the case of GPM data, the completed half-hourly calibrated IMERG data provided by “precipitationCal “ with the “netCDF” file format were converted to the Tagged Image File Format (TIFF). These imageries then used to extract precipitation values for the entire country. Finally, As shown in Table 5 a total of three regression models were generated between Tmin,Tmax and Tann (dependent variables); and LSTmax, LSTmin, LSTann.

2.3.2 Spatial interpolation of guage-based percipiation and temperature data

In the second course, this study applied Ordinary kriging (OK) and Co-kriging (Co-K) to fill the gaps in meteorological data from Weather stations. These methods been widely used to interpolate temperature point data (Berndt and Haberlandt 2018; Bhunia et al. 2018; Hsu et al. 2017; Tan et al. 2021; Tobin et al. 2011) and precipitation point data (Tan et al. 2021; Zheng et al. 2017; Zou et al. 2021). In this study, data normality was applied to the data of the weather stations in the study area to map precipitation and temperature across the country.

The Kriging technique uses a semivariogram (half of a theoretical variogram) as a measure of variance to detect and depict the spatial correlation between observations. The mathematical functions for calculating semivariograms and describing the degree of spatial dependency are as follows (Cressie 1994; Fanchi 2018; Zimmerman 1994):

$$\gamma (h)= \frac{1}{2N(h)}\sum\nolimits_{i = 1}^{N(h)}{\left[Z\left({u}_{i}+h\right)-Z({u}_{i})\right]}^{2}$$
(10)

where \(\upgamma (\text{h})\) is an empirical semivariogram, N(h) is the number of paired observations with distance h, and Z(ui) and Z (ui + h) are the observed values in ui and ui + h locations, respectively.

After the empirical half-variance variogram is calculated, the most appropriate model is fitted to the data. The principle of prediction is based on a weighted average of observations in the vicinity of the estimated point values, i.e. Z (ui). The weight λi depends on the distance, and the spatial relationship between observations can be determined through different kriging techniques such as ordinary kriging and universal kriging (Zůvala et al. 2016).

Ordinary kriging as a spatial data interpolator finds the best linear unbiased estimate of a second-order stationary random field with an unknown constant mean as follows (Ali Akbar 2012):

$$\widehat{Z}({u}_{0})= \sum\nolimits_{i = 1}^{n}\lambda i Z\left({u}_{i}\right)$$
(11)

where \(\widehat{Z}({u}_{0})\) is the kriging estimate at location \({u}_{0}\), \(Z\left({u}_{i}\right)\) is the value observed at location \({u}_{i}\), and λi is the weighting factor for \(Z\left({u}_{i}\right)\).

The variance of estimation is:

$${\sigma }_{OK}^{2}\left(u\right)=-\gamma \left(u,u\right)+\sum\nolimits_{i =1}^{n}{\lambda }_{i}\gamma \left(u,{u}_{i}\right)+\mu$$
(12)

where \(\upgamma (u, u)\) is the mean of the semivariogram at the estimated location of \(u\),\(\gamma \left(u,{u}_{i}\right)\) is the mean of the semivariogram between location \(u\) and the ith location of observation, and μ is the lagrangian coefficient used to minimize the kriging variance. Estimation variance measures the uncertainty of the estimation at a desired point.

The estimation error variance by ordinary kriging is as follows:

$$\widehat{Z}\left({u}_{0}\right)- Z\left({u}_{0}\right)= R\left({u}_{i}\right)=\sum\nolimits_{i = 1}^{n}\uplambda Z\left({u}_{i}\right)-Z\left({u}_{0}\right)$$
(13)

where \(Z\left({u}_{0}\right)\) is an unknown true value at \({u}_{0}\), and \(R\left({u}_{i}\right)\) is the estimation error. An estimator is unbiased; therefore:

$$E\left[R\left({x}_{0}\right)\right]=\;0\;and\; \sum\nolimits_{i = 1}^{n}{\uplambda }_{i}=1$$
(14)

The co-kriging technique uses a secondary variable that is spatially interdependent on the main variable. Considering a secondary variable, a classic co-kriging semivariogram is defined as (Cahn et al. 1994):

$${\gamma }_{12}(h)= \frac{1}{2N(h)}\sum\nolimits_{i = 1}^{N(h)}(\left[{Z}_{1}\left({x}_{i}+h\right)-{Z}_{1}\left({x}_{i}\right)\right]\left[{Z}_{2}\left({x}_{i}+h\right)-{Z}_{2}\left({x}_{i}\right)\right])$$
(15)

where γ12 is a mutual semivariogram serving as the function of the distance (h), and N is the number of pairs of points Z1 (xi) and Z2 (xi) at a stepped distance (h + dh). The co-kriging estimate of the Zi attribute at position X0 is calculated by the following equation:

$${Z}_{i} ({x}_{0})= \sum\nolimits_{i = 1}^{N(h)}{\lambda }_{1i}{Z}_{1}({x}_{i})+\sum\nolimits_{i = 1}^{N(h)}{\lambda }_{2i}{Z}_{2}({x}_{i})$$
(16)

where λ1i is the weight associated with Z1 (xi), and λ1i is the weight of Z2 (xi).

It should be noted that the purpose of performing the regression method in this study was to estimate gauge observations from satellite measurement whereas, interpolation was performed to fill the data gaps between spatially known ground point data. However, in case spatial data is missing, both methods can also be used for data gap filling, and they are conceptually closely related. In most cases, regression methods are used to identify a generalized function defining the dependent variable from the independent variables at any given point, while interpolation methods used to estimate the values by using the known values at surrounding input points using a function that fairly describes the spatial/temporal distribution of a variable (Madenci et al. 2019).

As the results of the previous two tiers, this study developed four raster maps of air temperature and precipitation were developed for the entire Iran.

2.3.3 Implementation of KG classification scheme

As developed by (Kottek et al. 2006), the KG classification scheme was adopted under the criterion in Tables 2 and 3. Accordingly, five main climate groups are defined each presented with a letter as: (A) tropical climate, (B) arid climate, (C) warm temperate climate, (D) snow climate, and (E) polar climate. Then, each of the climates is divided into smaller climatic zones based on the seasonal variation of precipitation in an area. As a result, there are several classes to define, including dry summer (s), dry winter (w), and no dry season or completely humid climate (f) (Peel et al. 2007). Definitions of these main groups are based on the criterion presented in Table 3.

Table 3 The main types of climate in the KG climate classification, descriptions and identification criteria (Kottek et al. 2006)

As shown in Table 3, the first letter describes the climate in term of the temperature condition, and the second letter describes the climate in term of the precipitation conditions in an area. Also, Tmin, Tmax, Pann, Pmin, Psmin, Psmax, Pwmin, and Pwmax represent the monthly mean temperature of the coldest month, monthly mean temperature of the hottest month, accumulated annual precipitation, precipitation of the driest month, lowest monthly precipitation values for the summer, highest monthly precipitation values for the summer, lowest monthly precipitation values for the winter, and highest monthly precipitation values for the winter, respectively. The third letter in the KG method is defined based on the criteria shown in Table 3. Note that, for type (b), a threshold temperature value of + 10ºC is set for at least four months, as defined in the third column of Table 4 (Kottek et al. 2006).

Table 4 Determination of the third letter for the temperature conditions in the KG climate classification method

The KG method applies dryness threshold (Pth) values for class B climate based on the absolute measure of the annual mean temperature and the annual cycle of precipitation. The following equations show how this threshold is set.

$$P_{th}=2\times T_{ann}\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\text{if at least}\;2/3\;\text{of the annual precipitation occurs in winter} P_{th}=2\times T_{ann}+28\;\;\;\;\;\;\;\text{if at least}\;2/3\;\text{of the annual precipitation occurs in winter} P_{th}=2\times T_{ann}+14\;\;\;\;\;\;\;\mathrm{otherwise}$$
(17)

In total, this study produced six maps in a raster format with spatial resolution of 1000 m. The four maps resulted from MLR models and spatial interpolation per precipitation and temperature. These maps cover the entire country of Iran. This study also produced two further KG classification maps from unused ground-based meteorological data (109 stations) in MLR and spatial interpolation techniques. In line with four other maps, the ground-based KG maps were generated with spatial resolution of 1000 m. However, those only identify the KG classification of each weather observation station. The ground-based maps were used to assess the accuracy of the four other maps against the real meteorological data from weather observation stations.

2.4 Evaluation of model performance

Since the result of practicing the KG method was a nominal classified map, the final results were evaluated with overall accuracy and the Kappa coefficient (K). The arbitrary Kappa of 0.75 was considered excellent. It was calculated with Eq. 18 as follows (Fleiss et al. 2013):

$$K=\frac{\left[\%\;agreement\;obsereved\right]-\left[\%\;agreement\;expected\;by\;chance\right]}{100\%-\left[\%\;agreement\; expected\;by\;chance\right]}$$
(18)

The Kappa coefficient is a measure of the accuracy of maps and is calculated for each matrix using diagonal and marginal elements. It indicates how well the classification agrees with the real data.

3 Results and discussion

3.1 Satellie-based regression analysis

The strength of relationship between meteorological values and satellite-based data was inspected statistically with Pearson’s correlation test. Figure 4 shows the relationship between air temperature and three types of MODIS- LST data. The highest correlation was found between Tmin and LSTmin followed by the Tann-LSTann and Tmax-LSTmax pairs at a ground station. The correlations were significant based on Pearson’s correlation test at P-value < 0.01.

Fig. 4
figure 4

Relationship between Tann-LSTann, Tmax-LSTmax and Tmin-LSTmin pairs at a ground station

As shown in Table 5, there was a significant and positive linear correlation between Tmin and LST values at the significance level of 0.01. In addition, value of variance inflation factor (VIF) for all the linear models were less than the determined cut-off point value (VIF < 10). This indicates the non-collinearity in the models.

Table 5 Statistics for the regression model output based on the ground-based temperature data

In the case of statistical relationship between ground-based precipitation data and GPM data, correlation generally declined when precipitation decreased. As shown in Fig. 5, the highest correlation belonged to the Pwmax-GPMwmax pair, followed by Pann-GPMann, Pw-GPMw, Pwmin-GPMwmin, and Psmin-GPMsmin. This result also indicated overestimation of GPM values among all the developed regression models. Collectively, Pearson’s correlations were significant (2-tailed P value < 0.01) for all the pairs (Fig. 5).

Fig. 5
figure 5

Relationship between the pairs Pann-GPMann, Psmin-GPMsmin, Pw-GPMw, Pwmin-GPMwmin, and Pwmax-GPMwmax

Table 6 shows MLR models between ground-based participation (depended variable) and GPM data, TVDI data and DEM (independent variables). According to Table 2, with the increase of overall rainfall in a time scale, the correlations between ground data and satellite data are increased. For example, Psmin and Pann show the lowest and highest correlation with GPMsmin and GPMann respectively. However, the TVDI as an ancillary independent variable improves the correlations to some extent. Also, VIF values indicated in this table reject collinearities between independent variables.

Table 6 Statistics for MLR models output based on the ground-based precipitation data and GPM data, TVDI data and DEM

Figure 6 shows TVDI maps derived from MODIS LST and NDVI products. LST and NDVI on the same time scale were plotted against each other, and LSTmin and LSTmax parameters (Table 6) were derived from the triangulated graph presented in Fig. 2. The monthly scatter plots of April, June, August, and October as well as the equations used to calculate LSTmax and LSTmin values for mapping TVDI are presented in Fig. 7 and Table 7, respectively. As shown in Fig. 6, most wet areas on all the time scale maps are located in the north and west of the country, but they reduce from the northwest to the southeast. In the eastern and southern regions, the lack of rainfall and the increase in temperature lead to a decrease in soil moisture. In addition, TVDI accounts for the vegetation conditions influenced by precipitation (Hussain et al. 2023). Briefly, the effect of rainfall is first reflected as the soil moisture, which varies with climate change. Soil moisture, as a storage for precipitation, is one of the key climate variables owing to its significant role in the hydrological cycle (Gruber et al. 2019) as well as energy and biogeochemical cycles. Regarding the role of soil moisture in the climate system, it is known as a basic property for interpreting for monthly to seasonal climate variations and for climate change studies (Jiang et al. 2023). The large-scale alteration in soil moisture availability is a relatively fast response to the climate variation and a significant sign of regional climate change (Allan et al. 2020; Berg and Sheffield 2018). TVDI is a function of soil moisture and vegetation. As vegetation increases, the surface temperature decreases because the leaves absorb the heat for transpiration and cause a reverse process between the surface temperature and the vegetation. However, TVDI reproduces the variation in moisture on a finer scale than can be derived from ground-based data in the study area.

Fig. 6
figure 6

TVDI maps for different time scales: (A) TVDIann, (B) TVDIsmin, (C) TVDIwmin, (D) TVDIwmax, and (E) TVDIwmin

Fig. 7
figure 7

Scatter plots of LST and NDVI maps on different time scales (for a sample of four months including April, June, August and October)

Table 7 Equation derived from LST-NDVI space in the calculation of LSTmax and LSTmin values

Figure 8 shows modelled precipitation and temperature resulted from regression analysis. As shown, the hottest region of Iran is located mainly in the south and southeast, and the coldest region is in the Northwest part of the country. The mean air temperature is increased from the north to the south and from the west to the east. The increase of temperature in the West–East direction is due to the existence of mountains and the topographic conditions of western Iran. The increase of temperature in the north–south direction is due to the proximity to the equator, the increase in the angle of the sun, and the moisture richness of the atmosphere on the southern coasts.

Fig. 8
figure 8

Regression-based near-surface temperature and precipitation maps

3.2 Spatial interpolation

Additionally, the ground-based data of rainfall and air temperature were interpolated statistically on different time scales as the required parameters for the KG method. The interpolation was conducted after data normalization, which is a pre-requisite for interpolation (of kriging and Co-kriging types). Considering Root Square Mean Error ((RMSE) values, Tmax, Tmin, Tann, Pswin, Pwmax, Pann and Pwmin were interpolated through Co-Krigging method using DEM as an ancillary data. Whereas, ordinary kriging had better performance than Co-kriging method in interpolating Pw point data. Figure 9 shows the maps of geo-statistically interpolated ground-based precipitation and air temperature data.

Fig. 9
figure 9

Geo-statistically interpolated precipitation and air temperature maps based on the ground data of the study area

3.3 Demonstration of KG classification maps

Figure 10 shows KG classification maps generated from satellite-based data (KGs) and ground-based geo-statistically interpolated data (KGgs). Overall, this study identified three main groups and eight sub-groups of KG climate classes in Iran. Based on this classification, the three main types of climates in Iran during 2016–2019 were B (arid climate), C (warm temperate climate), and D (snow climate). Moreover, the Bwh subgroup (characterized by dry and very hot desert conditions) dominates the largest area of the country, encompassing most of the central parts, southeast, parts of the northeast, and the hillside of the Zagros mountains in the study region. The Bsk subgroup, representing semi-arid and cold desert climates, extends over the east and northeast, as well as parts of the south to southwest, including areas of the Zagros mountains. The Bwk subgroup, associated with dry and cold conditions, covers the northern and western edges of the central part of the country. Following in the hierarchy of Group B, the Bsh subgroup, characterized by dry and very hot semi-desert climates, spans from the center to the south and southwest regions. The Csa subgroup, the sole representative of Group C, signifies the Mediterranean climate with dry and very hot summers, spanning from the north to the northwest and southwest of the country. Additionally, the Dsa subgroup, belonging to the cold climate (D), covers the north and northwest regions. Figures 10 and 1 indicate that Dsa, Dsb, and Csb cover only a small portion of the country.

Fig. 10
figure 10

Main-group and sub-group areas on KGs and KGgs maps

Additionally, Fig. 11 shows a comparison between similarity and differences of the KG main group and sub-group between KGgs and KGs maps. Overall, the greatest output difference between the two methods belongs to C and D main groups. However, based on the results, the Bwh climate group covers a major part of the country.

Fig. 11
figure 11

Comparison of the percentages for different main-group and sub-group areas on KGs and KGgs maps

Considering KGgs (Figs. 10 and 11), in the main group, similar to KGs, climate B has the largest area covering above 90% of the country, and the predominant sub-climate of the three groups is Bwh, which covers about 55.73% of the country. Bsk, the second dominant climate which covers about 20.47% of the country, has the average annual temperature below 18ºC and is extended in the northeastern, northern, and western regions up to the Zagros Mountainous. In terms of area coverage, the third dominant climate of the country is Bwk followed by the sequence of Bsh, Csa, Dsa, Csb and Dsb.

In terms of area coverage, the two methods showed almost similar results. The maximum and minimum area differences between classes of KGgs and KGs were about 3% (Bwh) and 0.2% (Bsh), respectively. In general, the same trend of change can be seen in the areas on both KGgs and KGs maps.

3.4 Assessment of model performance

As indicated in Table 8, the overall accuracy values of 92.66% and 84.4% and the Kappa coefficients of 0.75% and 0.41% were obtained for KGs and KGgs, respectively. Although the kappa values are obviously different in the main groups, the values obtained by the two methods are closer to each other in the sub-groups. In addition to the kappa, disagreement indices including quantity disagreement (QD) and allocation disagreement (AD) were calculated in this study. The details of equations in calculating QD and QD have been presented by Pontius and Millones (2011). The inverse behavior of total disagreement (sum of QD and QD values) in contrast to the kappa and total accuracy can be seen in Table 8.

Table 8 Kappa statistics, the overall accuracy, QD and AD of KGs and KGgs maps based on the test data

If the point-wise climate classification map (derived from 223 synoptic stations) is overlaid with the GPM and LST images, KG (Tables 9 and 10) can be proposed for the study area

Table 9 Determination of the KG main group and criteria in case of using LST and GPM satellite data
Table 10 Determination of the third letter (temperature conditions) in the KG climate classification method with LST data
$${P}_{th}=2\times {LST}_{ann}\text{ if at least }3/4\text{ of the annual precipitation occurs in winter}$$
(19)
$${P}_{th}=2\times {LST}_{ann}+38\text{ otherwise}.$$
(20)

The Kappa accuracy was improved where the modified KG criteria were applied on LST and GPM data (Table 11). The accuracy assessment was performed using the same dataset as in the regression and interpolation methods.

Table 11 Kappa statistics for the overall accuracy of climate classification based on the modified KG criteria

4 Conclusion

This study provided a comprehensive assessment between regression analysis and spatial interpolation techniques in the use of developing Köppen-Geiger (KG) classification maps on a national scale. Both techniques are popular to overcome data inconsistency and scarcity. In the case of regression analysis, this research examined the relationship between MODIS-LST, MODIS-NDVI, MODIS-TVDI, GPM-precipitation and SRTM-DEM (explanatory variables) and ground-based precipitation and temperature (dependent variables). The spatial interpolation model (Krigging and Co-krigging) was developed from a weather observation station. Overall, this study identified three main groups and eight sub-groups of KG climate classes in Iran. Based on this classification, the three main types of climate in Iran during 2016–2019 were B (arid climate), C (warm temperate climate) and D (snow climate). Also, the results showed that the area coverage of the sub-groups (Bwh, Bsk, Bwk, Bsh, Csa, Dsa, Dsa, Dsb and Csb) in the maps obtained from the two studied methods follow each other to a large extent.

This research study thus obtained a Kappa of 66% for the sub-group on the KGgs map. The results also revealed that the KG map generated from satellite data was more accurate than the map generated from geostatistically interpolated ground data. Due to the operational applicability of remotely sensed data, this study claims to be of insight for climate classification based on remote sensing imageries. The spatial trends of climate change can also be detected with continuous pixel-based data. However, with the Kappa of at least 75%, this study showed that satellite data can be used for KG climate classification, although GPM data and point data are not homologous. The dissimilarity of satellite data and point data may be due to the high spatial variation of precipitation in arid and semi-arid regions, whereas satellite data estimate precipitation with 10-km spatial resolution. Satellite data helps to define a new threshold, modify the criteria, and offer a new approach to examining climate globally.

Continuous pixel-based climate maps created from the presented methodology are expected to reveal better climate transition zones than the point data. Moreover, overlying time series climate maps generated from long- term satellite data the maps of future climate zones can be generated and the hotspots of environmentally susceptible areas due to climate change and climate warming can be identified. Due to the global coverage of satellite data, a unique and globally high temporal and spatial resolution climate map can be generated, and significant regional and temporal climate change can be portrayed. However, this study recommends employing techniques such as artificial intelligence for data gap filling of climate data with the purpose of climate classification. This study also suggests the evaluation of the performance of the presented methodology in the area covering KG climate classes other than classes existing in the study area of this research.