1 Introduction

Surface water resources are under increasing stress in view of severe regional imbalances in availability of water and drought conditions in India. Because of increasing demand on water supply from various sectors, timely and analytical information is needed by water managers to assess current and long-term water resource potential in the region. This information is critical to management of water supply and to avoid potential adverse affects on the hydrologic system.

Southern basins in Tamil Nadu are very much drought prone. The sustainable planning of water resources in such region requires hydrological information, the most crucial of which is the estimation of spatial and temporal variability of rainfall (Wilk et al. 2006; Goncalves and Alpuim 2006). This necessarily depends on rain gauge network in a given basin. Often the number of rain gauges in a basin is not adequate to arrive at average or mean precipitations used in any design study due to the large space and time variability of rainfall. Some authors (e.g., Tsintikidis et al. 2002) have attempted to identify possible sites for the placement of additional gauges in order to reduce the precipitation interpolation error using historical hourly data.

Conventionally, however, stochastic interpolation methods such as Kriging are widely recognized and used as standard methods for interpolation of hydrological variables such as groundwater level, precipitation, etc. For example, Vijayakumar and Remadevi (2006) used Kriging for the spatial analysis of groundwater levels and established its superiority over the inverse square distance. Jinfeng (2006) conducted a case study on the impacts of land use/cover change (LUCC) on the temporal and spatial variability of the groundwater level in an arid oasis in the Sangong River Watershed by using geographical information system (GIS), remote sensing (RS) and Kriging. Similarly, Shi et al. (2007) compared IDW, Ordinary Kriging and Simple Kriging interpolation of geostatistics analyst to interpolate rainfall data. Among other important conclusions, they observed that Exponential model of semivariogram has the highest precision than others. Yan-bing (2002) compared five different semivariogram models for monthly rainfall interpolation in their study area.

Although Kriging has been widely used, it is being recognized that traditional Kriging has a major limitation due to the need for a mathematical function for a semivariogram that might fit the surface to be interpolated. Consequently, some researchers have used and demonstrated the advantage of the universal function approximates such as artificial neural network (ANN) as a replacement to fitted authorized semivariogram model within ordinary Kriging (Teegavarapu 2007). Chowdhury et al. (2010) demonstrated that ANNs can also be used to map with noticeably higher accuracy than Kriging the complex and seemingly erratic spatial pattern of groundwater contamination provided that reasonable data preprocessing and exploratory data analysis are performed.

The present study aimed to investigate the efficacy of ANN to interpolate monthly average rainfall in non-sampling locations with the help of rainfall measured in the neighbouring locations. The idea of using only neighbouring rain gauge is derived from one of the commonly used non-parametric interpolation methods, the ‘nearest neighbour’ method, which is based on minimum Euclidian distance (Brath et al. 2002). Karlsson and Yakowitz (1987) used this method for studying rainfall runoff process. Szolgay et al. (2009) compared the performance of three interpolation methods namely nearest neighbour method, Kriging and inverse distance weighting (IDW) in producing maps of 2-year and 100-year daily precipitation as extreme rainfall indices. In this study, though directly nearest neighbour method is not used, an emphasis is laid in selection of appropriate neighbourhood input variables for training ANN architecture. The efficacy of the proposed approach is demonstrated through an application in improving the inflows to Ramanadhi reservoir in the Tamirabarani basin. The next section briefly introduces Kriging and ANN techniques. This is followed by a description of the study area. Subsequently, the proposed methodology is discussed in detail, results analysed and conclusions drawn.

2 Artificial neural network

The ANN is briefly introduced in this section. Multilayer feed forward network with Back Propagation learning algorithm is one of the most popular neural network architectures, which has been deeply studied and widely used in many fields. Typically a neural network consists of three layers: (1) an input layer; (2) an output layer and (3) an intermediate or hidden layer. The input vectors are D ∈ R n and D = (X 1, X 2,…, X n )T; the outputs of q neurons in the hidden layer, Z ∈ R q, Z = (Z 1, Z 2,…, Z q )T, and the outputs of the output layer are Y ∈ R m, Y = (Y 1, Y 2,…, Y m )T, assuming that the weight and the threshold between the input layer and the hidden layer are w ij and y j , respectively, and the weight and the threshold between the hidden layer and output layer are w jk and y k , respectively; the outputs of each neuron in a hidden layer and output layer are:

$$ Z_{j} = f\left( {\sum\limits_{i = 1}^{n} {w_{ji} X_{i} - \theta_{j} } } \right), $$
(1)
$$ Y_{k} = f\left( {\sum\limits_{j = 1}^{q} {w_{kj} Z- \theta_{k} } } \right), $$
(2)

where f() is a transfer function, which is the rule for mapping the neuron’s summed input to its output, and by a suitable choice, is a means of introducing a non-linearity into the network design, and θ is the threshold. One of the most commonly used functions is the sigmoid function, and it is monotonically increasing and ranges from 0 to 1. Details on ANN can be found in Karunanithi et al. (1994). In this study, NEUROSHELL software is used for implementing ANN (Neuroshell2).

3 Kriging interpolation technique

Kriging is essentially a method of estimation. The assumption here is that there is a local structure in the sampling domain, and the accuracy of the estimated values depends on how this is accounted for. Moreover, a weighted sum of the observations is assumed to provide the best estimate. Thus, Kriging is a method of determining the optimal weights. The most commonly applied form of Kriging uses a “semivariogram”, which is a measure of spatial correlation between pairs of points describing the variance over a distance or lag h. Weights change according to the spatial arrangement of samples. The linear combination of weights is as follows:

$$ Y_{i} = \sum {\lambda_{i} y_{i} } , $$
(3)

where Y i estimate of Kriging, y i the variables evaluated in the observation locations and λ i Kriging weights.

The semivariogram is formulated as follows:

$$ \gamma \left( h \right) = \frac{1}{2}E\left( {y_{\left( x \right)} - y_{{\left( {x + h} \right)}} } \right)^{2} , $$
(4)

where γ(h) semivariogram, dependent on distance h; (x, x + h) pair of points with distance vector h; y(x) regionalized variable y at point x; y(x) – y(x + h) difference of the variable at two points separated by h; E mathematical expectation.

Details on Kriging can be found in Kitanidis (1997) and Biau et al. (1999).

4 Study area

Tamirabarani River has its origin in Western Ghats near Senkottai district of Tamil Nadu State (India) with a catchment area of about 6,000 km2. There are 18 rain gauge stations in the catchment (refer Fig. 1). Six years of monthly rainfall records are collected from these stations for a time period from January 1979 to December 1984. Out of 18 stations, stations S3 (Cheranmadevi), S6 (Kadayanallur) and S12 (Sattankulam) are used for validation of rainfall interpolation.

Fig. 1
figure 1

Location of rain gauge stations in the basin

The location of Ramanadhi reservoir is also shown in the map. Monthly inflows to Ramanadhi reservoir are available from 1974 to 1986.

5 Interpolation of rainfall with ANN

The following four different models based on ANN for rainfall interpolation are considered in detail.

  1. (a)

    ANN with only three neighbouring stations (ANN_I): The methodology adopted for interpolation of rainfall is shown in Fig. 2. For the non-sampled location, rain gauge stations in the proximity and conforming to the same land use are selected (Table 1). The justification for use of rain gauges from the same land use can be understood from the fact that rainfall in the same land use (particularly for small- and medium-sized catchments) is expected to have same value. Figure 3 shows the general land use of the basin.

Fig. 2
figure 2

Flow chart showing methodology for rainfall interpolation

Table 1 Proximity stations using land use map
Fig. 3
figure 3

Land use of Tamirabarani basin

ANN_1:

$$ I = f\left( {R_{1} ,R_{2} ,R_{3} } \right), $$
(5)

where R 1 monthly rainfall of first rain gauge in the proximity, R 2 monthly rainfall of second rain gauge in the proximity and R 3 monthly rainfall of third rain gauge in the proximity.

The output consists of rainfall at the non-sampled location.

  1. (b)

    ANN with three neighbouring stations and their distances from origin as inputs (ANN_II): The easting and northing of rain gauge stations are determined from an arbitrary origin, and their distances from the origin are tabulated in Table 2. Since rainfall varies spatially, it is expected that incorporating distance to a particular station from a reference origin (as one of the input variables in ANN) may help in improving the prediction.

    Functionally, the model can be represented as:

    ANN_II:

    $$ I = f\left( {R_{1} ,D_{1} ,R_{2} ,D_{2} ,R_{3} ,D_{3} ,D_{\text{ns}} } \right), $$
    (6)

    where R 1 monthly rainfall of first rain gauge in the proximity, D 1 distance to the first rain gauge in the proximity from an assumed origin, R 2 monthly rainfall of second rain gauge in the proximity, D 2 distance to the second rain gauge in the proximity from an assumed origin, R 3 monthly rainfall of third rain gauge in the proximity, D 3 distance to the third rain gauge in the proximity from an assumed origin, D ns distance of non-sampled location in the study.

    The output consists of rainfall at the non-sampled location.

    Table 2 Co-ordinates information for 18 stations
  2. (c)

    ANN with all stations (ANN_III): This model is an extension of ANN_I wherein all the stations in the catchment form input vectors to the ANN model. Therefore, functionally, the model can be represented as:

    ANN_III:

    $$ I = f\left( {R_{1} ,R_{2} , \ldots R_{n} } \right), $$
    (7)

    where R 1 monthly rainfall of first rain gauge, R 2 monthly rainfall of second rain gauge, R n monthly rainfall of nth rain gauge and n number of stations in the catchment.

    The output consists of rainfall at the non-sampled location.

  3. (d)

    ANN with all stations and their distances from origin as inputs (ANN_IV): This model is an extension of ANN_II wherein the all rain gauges stations and their distances from an origin is given as input to the model. Functionally, the model can be represented as:

    ANN_IV:

    $$ I = f\left( {R_{1} ,D_{1} ,R_{2} ,D_{2} , \ldots R_{n} ,D_{n} ,D_{\text{ns}} } \right), $$
    (8)

    where R 1 monthly rainfall of first rain gauge, D 1 distance to the first rain gauge from an assumed origin, R 2 monthly rainfall of second rain gauge, D 2 distance to the second rain gauge from an assumed origin, R n monthly rainfall of nth rain gauge, D n distance to the nth rain gauge from an assumed origin, D ns distance of non-sampled location in the study and n number of stations in the catchment.

    The output consists of rainfall at the non-sampled location.

The back-propagation algorithm is used for training the ANN architecture consisting of one hidden layer. Out of 18 stations in the catchment, 10 are used for training, 5 for testing and 3 for validation. Now for each of these stations, input vectors are constructed for all the ANN models. The results are compared and tabulated for the months of January–March for the year 1984 in Table 3.

Table 3 ANN results compared with Kriging

As can be observed from the table, the RMSE for the month of January by ANN_II is only 9 mm as against 12.7 by ANN_I. On an average for all the three months, ANN_II estimates are better than ANN_I. It is interesting to note that by including the distance to stations as one of the inputs, the ability of ANN to map the complexity in variation of rainfall over the basin is considerably increased despite the fact that a very small sample of data is used for training. This is reflected in the difference in RMSE values in the two cases.

Further, the comparison of ANN_III and ANN_IV clearly shows that it is better to incorporate all the rain gauges stations in the catchment for better estimation of rainfall. However, if the ‘distance from origin’ variables is dropped (ANN_III), even with inclusion of all stations do not improve the interpolation.

6 Interpolation of rainfall with Kriging

Here, again two different models are considered, namely Kriging with only three immediate neighbourhood stations as used for ANN_I and ANN_II interpolation (Kriging_I) and Kriging with all the stations in the basin as done in conventional methods (Kriging_II). This is illustrated in Table 3, where the results of Kriging analysis are compared with ANN_I and ANN_II. As seen from the table, RMSE for Kriging is significantly more than ANN models. Further, it can be observed that inclusion of all the stations in the Kriging model only marginally improves the result when compared with Kriging_I.

7 Inflow prediction to Ramanadhi reservoir

Here, an attempt is made for improving the prediction of inflows to reservoir by estimating rainfall value in the vicinity of the reservoir using ANN_II (as discussed above). The methodology is illustrated in Fig. 4. For the inflow prediction, first the reservoir for which inflow prediction is required is identified along with a few more sites in the vicinity of the reservoir. In this study, up to five sites in the vicinity are considered, and it is found that best prediction comes for a neighbourhood network of three sites. For these three sites, monthly rainfall is estimated using ANN_II model, which will form a part of input variables for inflow prediction. The input vector for inflow prediction can be mathematically depicted as:

$$ I_{i + 1} = f\left( {I_{i - 1} ,I_{i} ,R_{1i} ,R_{2i} ,R_{3i} } \right), $$
(9)

where I i and I i–1 are antecedent inflows. R 1i , R 2i and R 3i are antecedent rainfalls in the neighbouring sites (as estimated using ANN_II).

Fig. 4
figure 4

Flow chart showing methodology for inflow prediction

The ANN is trained, and the results of inflow prediction for the year 1984 are tabulated in Table 4 and Fig. 5. As seen from the table, there is a drastic improvement in the RMSE value with rainfall incorporated as input variable. The RMSE value decreases from 4.1 to 1.6 cumec. This is clearly reflected in figure. Particularly, the peak value is very closely predicted. For example, the inflow for the period of February 1984 is predicted as 6.08 cumec as against 3.11 cumec for prediction without using rainfall as input. Similarly, the inflow for the period of March 1994 has been predicted as 13.36 cumec which is very close to the actual value of 14.17 cumec.

Table 4 Inflow predictions for Ramanadhi reservoir
Fig. 5
figure 5

Inflow predictions for Ramanadhi reservoir

This improvement in the prediction of inflow to the reservoir is very vital for system engineers/irrigation engineers for optimal planning of reservoir operation.

8 Conclusion

A novel method is proposed for interpolating rainfall values in a region with the help of available rain gauge stations. Further, the proposed method is applied for improving the inflow to a reservoir. The following conclusions can be arrived at from the study:

  1. (a)

    ANN seems to be a powerful alternative to the traditional Kriging method of spatial interpolation.

  2. (b)

    The effectiveness of ANN training crucially depends on not only the length of training records but also the quality of input variables.

  3. (c)

    Results are only marginally improved in both ANN and Kriging models when all the stations in the catchment are used when compared to stations in the proximity.

  4. (d)

    Inclusion of some kind of ‘distance’ variable in the input seems to play an important role in mapping the complexity of variability provided all the stations are included. Further studies should be carried out in both interpolations of rainfall as well as other hydrological variables such as ground water level, electrical conductivity variation, etc.

  5. (e)

    Inflows to a reservoir can be improved using antecedent rainfall in the neighbourhood.