Introduction

The relation between rainfall and runoff and the measurement of this relation in the water cycle, which are a critical part of our lives, are important. Due to its importance, the precipitation on fields and its runoff path have usually been a subject of interest for research and analyses. The conversion of precipitation to runoff, you have to be known some events and terminology. Rainwater accumulate from hallows, intercept on vegetation, and infiltrate into soil. Infiltration is the vertical penetration of rain into the soil through the soil surface, that is, absorption of water by the soil. While rain continues, soil infiltrates water as much as infiltration capacity of the soil. When soil exceeded this capacity, runoff occurred. Precipitation, runoff, and infiltration measurements have been made using different methods depending on the soil type, and hydrologic soil groups have been created. Infiltration and runoff are important data for hydrology (Te Chow et al. 1988). In order to obtain such data, it is necessary to have precipitation and runoff measurement stations. Therefore, to determine these values which cannot be measured in basins that do not include such measurement stations, the curve number (CN) method is used. Infiltration is important in hydrology, because the amount of infiltration indicates the feeding of groundwater sources. Therefore, data relating to the infiltration level of the soil are important. In order to determine the hydrologic soil groups, it is necessary to know the characteristics of the soil and the flora living in the soil. The structure, texture, organic matter content, and moisture of the soil affect the amount of infiltration (NRCS 2004b).

In Soil Conservation Service (SCS) CN calculation, different CN values are calculated for different antecedent moisture conditions (AMCs). While the AMC I condition corresponds to the dry time of year, the AMC III condition corresponds to the humid time and the AMC II condition to the normal time. The CN values in these conditions are called CNI, CNIII, and CNII, respectively. The CNI and CNIII values are calculated by utilizing the CNII value (Mishra et al. 2008; Mishra and Singh 2013). For CN calculation, the values which the soil will have depending on its geological structure, land use/land cover (LULC) characteristic, and inclination value have been tabulated by the SCS. In remote sensing, such data are obtained by utilizing satellite images and geographic information system (GIS) techniques. Then, the obtained maps are superimposed on each other, the curve number determined as per these characteristics is assigned, and thus, the curve numbers of the basins are acquired. With the CN obtained, the runoff of the catchment basin is calculated retrospectively.

The literature includes numerous works based on the SCS-CN method (Deshmukh et al. 2013; El-Hames 2012; Guzmán et al. 2011; Huang et al. 2008; Mishra and Singh 1999; Mishra et al. 2006; Moretti and Montanari 2007; Shi et al. 2007; Singh et al. 2008; Young and Carleton 2006; Zhan and Huang 2004). El-Hames used the SCS-CN method to calculate the peak discharge in basins located in arid and semi-arid regions but made no measurements (El-Hames 2012). In another application, the SCS-CN technique was used to determine the change in sediment amount (Tyagi et al. 2008). Kottegoda et al. (2000) prepared statistical models of the daily runoff values in three different river basins in Italy by using the CN technique. Lin et al. integrated the curve number method to determine the impact of land use change on environmental runoff (Lin et al. 2014). Isik et al., on the other hand, applied the curve number technique by combining it with artificial neural networks to determine the impact of land use change on runoff (Isik et al. 2013).

This study reveals through remote operate in the year 2005. Therefore, the Landsat Thematic Mapper (Landsaat TM) images from the years 2000, 2006, 2010, and 2011 were used to produce the land use change maps. In order to cope with the detailed steps of the process and routine calculations in the weighted curve number calculation, it was ensured, by developing a user interface with the Python software language, that the process steps can be completed easily and quickly. The obtained results can be used as methodological references for studies to be conducted on other catchment areas (potable water and non-potable water).

Methodology

As outlined in Fig. 1, we use remote sensing and GIS to analyze the effects of land use change on the hydrologic structure of the Yamula Basin in Kayseri metropolitan area.

Fig. 1
figure 1

The methodology of this study

Study area

Within the scope of the application, our aim was to determine the runoff values in the catchment area of the Yamula Dam located on the provincial border of Kayseri (Turkey) with help of the curve number calculation to be made with integration of the remote sensing and GIS. The study area has a total surface area of 5773 ha and lies between the latitudes of 38 52′–39 05′ N and the longitudes of 35 12′–35 2′ E. The Yamula Dam is a hydroelectric power plant located on the Kızılırmak River. It started to retain water in the year 2003 and to generate power in the year 2005. The catchment area map of the study area is shown in Fig. 2.

Fig. 2
figure 2

The location maps of the geographic basin area (Yamula Dam) used in this study

Data

In this study, the data of the land use/land cover, land use capability, hydrologic soil groups’ maps, and precipitation values were used to calculate the SCS-CN and the runoff associated therewith.

The precipitation values of the region were obtained from the rainfall observation stations of the Turkish State Meteorological Service, which are near the basin.

Land use/land cover map

The satellite images that were acquired in 2000, 2006, 2010, and 2011 (NASA 2016) were processed by the following three processes: pre-processing, classification, and post-classification, as shown in Fig. 1. In classification, a supervised image classification maximum likelihood method was used. First, Landsat satellite images were converted to the Universal Transverse Mercator (UTM) coordinate system. Next, Erdas Imagine 9.1 and ArcGIS 10.3 were selected for various analyses and image classification (Erdas 2014; ESRI 2014). The classified images were further smoothed with a majority filter with a 3 × 3 kernel to reduce the number of misclassified pixels (Erdas 1999). The classification categories in the study area are water, cropland, settlement, dryland, and grassland.

The Kappa statistic and overall accuracy were used to measure the agreement between two sets of categorizations of a dataset while correcting for chance agreements between the categories. Jenness’s kappa analysis in the ArcView 3.x extension was used to estimate the sample size required to achieve a confidence level and precision for statistical analysis (Jenness and Wynne 2005). The error matrix was produced after calculating the minimum sample size. Table 1 shows the overall classification accuracy and kappa results for the Yamula Basin. Our results show that the kappa coefficient and overall accuracy statistic are greater than 80 %. The high overall accuracy and high kappa coefficient suggest a good relationship between the classified image and the reference data (Congalton and Green 2008; Landis and Koch 1977).

Table 1 Results of the classification accuracies for Yamula Basin

Then, the classified raster images were converted to vector maps, and thematic land use maps were obtained (Fig. 3). The GIS database was created by eliminating topological errors in the land use data obtained. The geological and agricultural land use capability maps were overlaid with the created land use maps. Finally, the resulting GIS database was used as the input parameter for the SCS method which was developed by the Soil Conservation Service for the calculation of curve numbers.

Fig. 3
figure 3

Thematic land use maps: a year 2000, b year 2006, c year 2010, and d year 2011

Hydrologic soil groups

The HSG data is one of the types of data used in the SCS-CN method. The soil structure was divided into hydrologic groups by Musgrave in 1955 (Musgrave 1955). This classification is prepared by measuring the precipitation, runoff, and infiltration values. These four groups are named by letters from A to D. Among these groups, group A has low runoff potential, while group D has high runoff potential (Table 2).

Table 2 The definitions of the hydrological soil groups (Musgrave 1955)

For this classification, these groups were created by first performing the precipitation and runoff measurements on the soils with specific characteristics by using an instrument called an infiltrometer and then calculating the infiltration amount by using the difference between such measurements (NRCS 2004b; SCS 1993).

Land use capability classes

The map of the land use capability (LUC) classes was prepared by using the land inclinations. It has the values ranging between LUC I and LUC VII depending on whether the land is flat or steep. The value “I” indicates lands with a flat inclination, while the value “VII” refers to steep lands, with the inclination values increasing gradually between them (Table 3). The land use capability classes of the region were obtained from the Geographic Information Systems Department of the General Directorate of Agricultural Reform, Republic of Turkey Ministry of Food, Agriculture and Livestock.

Table 3 The values of slopes for different land use capability classes (Şen et al. 2010)

Method

Curve number method

In practice, the SCS-CN method (NRCS 2004a; USDA/NRCS 1986) has been used to estimate the runoff. The Natural Resources Conservation Service (previously SCS) developed the curve number method in 1964 with the intention of detecting the non-measurable runoffs in basins where no runoff measurement can be performed. The curve number method allows the calculation of non-measurable runoff values. The CN that gives the relation between the rainfall and the runoff empirically takes a value from 0 to 100. The CN method was developed initially for the USA. Then, it started to be applied globally. Although the basic concepts are the same for the entire world, additional criteria are introduced for different regions.

The mathematical relation between the rainfall and the runoff is expressed with the following equation:

$$ \frac{F}{S}=\frac{Q}{P-{I}_a} $$
(1)

where F refers to the actual retention, S to the potential maximum retention (mm), Q to the direct runoff (mm), P to the rainfall depth (mm), and I a to the initial abstraction (mm), that is, the amount of water absorbed by the soil at the first stage.

In fact, the actual amount retained in the soil is obtained by subtracting the runoff and the initial abstraction from the rainfall.

$$ F=P-{I}_a-Q $$
(2)

When the two equations are combined, the following equation is obtained:

$$ Q=\frac{{\left(P-{I}_a\right)}^2}{P-{I}_a+S} $$
(3)

The calculations made for the I a/S ratio by using the measurement values obtained from the sample basins revealed that it had values lower than 0.2 which is the default value (Baltas et al. 2007). This ratio had different values because the geological structures are different in different basins. In numerous studies, however, the 0.2 value is used for the I a/S ratio. If this value is included in Eq. 3, Eq. 4 will be obtained.

$$ Q=\frac{{\left(P-0.2S\right)}^2}{P+0.8S},P>0.2S $$
(4)

This rainfall-runoff relation equation is used in the CN method. The potential maximum retention value (S) is converted into the CN value. The relation is as shown in Eq. 5.

$$ CN=\frac{25400}{254+S} $$
(5)

Theoretically, the potential maximum amount of water retained by the soil (S) varies between 0 and infinity, while the CN value varies between 0 and 100. When the amount of water retained by the soil is 0, the curve number is 100, and the entire rainfall turns into runoff. When the amount of retained water is infinite, on the other hand, the CN value is 0, which indicates that the entire rainfall has infiltrated the soil, and no runoff at all is observed.

At the implementation stage of the CN method expressed mathematically in the equations above, the CN values are obtained by using the remote sensing and GIS methods in a combined manner. For this, the land use/land cover map, the hydrologic soil groups’ map, and the antecedent rainfall values of the study area must be available. The SCS has tabulated the curve number values in line with such data. The CN tables were created for the American continent. The resource (Şen et al. 2010) was used as a reference to prepare the CN tables in the study area. In the CN method used, the land use capability classes’ map created by considering the inclination classes was used practically, in addition to the land use/land cover map, the hydrologic soil groups’ map, and the antecedent rainfall values. The curve numbers of the catchment area were calculated by utilizing these maps.

The CN calculations were made with the AMC of the soil. The AMC values range from 1 to 3. The classes were determined by considering the 5-day antecedent rainfall.

The rainfall limits for the seasonal AMC classes are as shown in Table 4 in SCS-1972.

Table 4 Antecedent rainfall limits for different AMCs (US Department of Agriculture Soil Conservation Service)

The weighted curve numbers based on the sub-basin were calculated from the curve numbers calculated as described above by using Eq. 6.

$$ CN(II)=\frac{{\displaystyle {\sum}_{i=1}^n}\left(C{N}_i*{A}_i\right)}{{\displaystyle {\sum}_{i=1}^n}{A}_i} $$
(6)

The CNII value was used to calculate the curve number values of the arid and wet seasons. The calculation was made using Eqs. 7 and 8 (Özdemir 2007; Te Chow et al. 1988).

$$ CN(I)=\frac{4.2*CN(II)}{10-0.058CN(II)},{\displaystyle {\sum}_{i=1}^5{P}_i<13} $$
(7)
$$ \begin{array}{cc}\hfill CN(III)=\frac{23CN(II)}{\left(10+0,13CN(II)\right)},\hfill & \hfill {\displaystyle {\sum}_{i=1}^5{P}_i>23}\hfill \end{array} $$
(8)

Results

For the curve number method, the classification images obtained from satellite images were used as the land use/land cover map after having been converted into vector format. The study area was divided into 14 sub-basins (Fig. 4). In order to divide the basin into sub-basins, the river network map was prepared using the digital elevation map of the region. In the river network, every stream channel forms sub-basin area. Considering this river network, the Yamula Dam catchment basin was divided into sub-basins. The curve number values for these sub-basins were calculated separately for each year. The area distribution of the sub-basins is shown in Table 7. For different AMC values, the CNI, CNII, and CNIII values were calculated. According to the daily precipitation data obtained from the Turkish State Meteorological Service for Kayseri province, the months of March–April–May indicate the AMC III (rainy time), July–August–September indicate the AMC I (arid time), and October–November–December indicate the AMC II.

Fig. 4
figure 4

Sub-basin and stream network in study area

For the curve number calculation, the land use/land cover map (Fig. 3) for the years 2000, 2006, 2010, and 2011, the hydrologic soil groups’ map (Fig. 5), and the land use capability map (Fig. 6) were used. These maps were vector-formatted and had the same coordinate system. They were superimposed on each other in the ArcGIS program. These unified vector-formatted images were subjected to unify analysis (Fig. 7). In this way, four different images were obtained by making the same unify analysis for the years 2000, 2006, 2010, and 2011. The obtained images were clipped in compliance with the boundaries of each sub-basin area and saved in different files, thus determining 14 sub-basins for each year, and by applying the curve number method of these sub-basins, a curve number was assigned for each area. Then, considering the areas of the curve numbers, a weighted curve number was calculated for each sub-basin. Fifty-six different images were obtained, and 56 different weighted curve numbers were calculated for 14 sub-basins and four different years. In order to cope with the emerging heavy workload, a user interface developed in the Python language in the ArcGIS program was used. The results were obtained by operating the user interface programs developed by using the Python language in the ArcGIS software.

Fig. 5
figure 5

Hydrological soil groups’ map of Yamula Dam basin area

Fig. 6
figure 6

Land use capability classes map of study area

Fig. 7
figure 7

Unified image (land use/land cover in year 2010, land use capability, and hydrological soil groups’ maps) of basin area

It is easy to run the ArcGIS commands with the Python language. In this way, the ArcGIS commands used in cycles facilitate the processes. The ArcGIS library is imported with the help of the “import arcpy” command.

The script file draws, with the help of the Structured Query Language (SQL) query, the curve numbers in possible conditions from the reference table in the database. It assigns values to the maps we created. Immediately after this value assignment, it can also calculate the weighted curve numbers (Fig. 8 and Table 5). In such types of applications which involve detailed processes, the Python language assists users. Combined use of the Python language and SQL query accelerated the processes significantly.

Fig. 8
figure 8

Graphic illustrations of estimated curve number for sub-basin areas

Table 5 Pseudo code of weighted curve number calculation in Python language

The obtained results are shown in Table 6. Table 7 gives the area distribution of the sub-basins. When the change in the total weighted curve numbers between the years is examined, it is observed that the curve numbers have increased in the some sub-basins.

Table 6 Estimated curve numbers of sub-basin areas
Table 7 Area distributions of sub-basins

Conclusions

Runoff value is important data for hydrologic cycle. In this study, the remote sensing and GIS techniques and the SCS-CN technique were applied to the Yamula Dam catchment basin. By the analysis of meteorological, soil type, land inclination, and remotely sensed data, with the GIS techniques, changes of the region were examined. In the study area, data can be obtained quickly and cheaply with the integration of remote sensing satellite images and GIS techniques. By the use of Python language in GIS, data can be analyzed functionally and fast. SCS-CN method can estimate runoff outdated values belonging to ungauged stations in minimum time.

The results of this study show the important role of urban sprawl and significant change of land use regime in the increase of runoff rate within a catchment basin area. Therefore, the remotely sensed data with the aid of a GIS can provide valuable data for studies on land use changes. Also, the results of this study show that the incorporation of GIS and remote sensing data in hydrological models can effectively support decision making in the areas of risk and sustainable development.

As shown in Tables 6 and 7, when the change in the total weighted curve numbers between the years is examined, it is observed that the curve numbers have increased in the sub-basins numbered 7, 8, 9, 10, 11, 12, and 13 where the water body has increased greatly. This increase is a result of the filling of the land use/land cover areas, which are used differently with water. In terms of area, the water area in sub-basin 8 in particular rose from 128 to 1451 ha. This high rise in the water body raised the curve number from 65 to 84. This change indicates an increase in the runoff and a decrease in the amount of water retained by the soil. In other words, most of the rainfall turned into runoff.

The classification results indicated that the basin underwent vast land use changes between the years 2000 to 2011. In the sub-basins numbered 2, 4, 6, and 14, the higher increase in the agricultural fields in proportion to the rise in the total water area plus the decrease in the arid soil areas led to a decrease in the curve number between the years 2000 to 2010. The increase in cropland reduced the curve number, while the increase in the water areas and decrease in barren soil areas in the sub-basin increased the curve number value. By increasing the water retention capacity of the soil through changes in the land use/land cover, the loss of rainfall as surface runoff will be prevented, and it will be ensured that the underground waters are fed. The SCS-CN method is a convenient method to understand and display such relations. The method will provide a better understanding of the main reasons for the effects and will support city administrators in similar projects.