A simple approach for improving spatial interpolation of rainfall using ANN

Sivapragasam, C.; Arun, V. M.; Giridhar, D.

doi:10.1007/s00703-010-0090-z

A simple approach for improving spatial interpolation of rainfall using ANN

Review Article
Published: 10 October 2010

Volume 109, pages 1–7, (2010)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

Meteorology and Atmospheric Physics Aims and scope Submit manuscript

A simple approach for improving spatial interpolation of rainfall using ANN

Download PDF

C. Sivapragasam¹,
V. M. Arun¹ &
D. Giridhar¹

608 Accesses
21 Citations
Explore all metrics

Abstract

Interpolation of hydrological variables such as rainfall, ground water level, etc. is necessary to arrive at many engineering decisions. This study suggests an innovative and yet simple idea to effectively interpolate the rainfall at non-sampling locations using artificial neural network (ANN). The method has been demonstrated in the Tamirabarani basin of Tamil Nadu State (India), where rain gauge information for 18 rain gauge stations is available. With the help of land use map of the basin and also the proximity of rain gauge stations to each other in the neighbourhood, the most appropriate input variables for ANN are designed and used for training ANN to estimate rainfall in the unknown stations. It is also interesting to note that with appropriate selection of input variables for training ANN, one of the major lacunae of ANN (to have sufficiently lengthy training records) can be suitably addressed. ANN results are compared with Kriging method. Further, the proposed method is applied to improve the prediction of inflow to Ramanadhi reservoir in Tamirabarani basin, and the results seem to be very encouraging.

ANN-based Rainfall-Runoff Model and Its Performance Evaluation of Sabarmati River Basin, Gujarat, India

Article 17 August 2022

Simulation of rainfall-runoff process using an artificial neural network (ANN) and field plots data

Article 17 October 2021

Comparison of Spatial Interpolation Methods for Mapping Rainfall in Indian Himalayas of Uttarakhand Region

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Surface water resources are under increasing stress in view of severe regional imbalances in availability of water and drought conditions in India. Because of increasing demand on water supply from various sectors, timely and analytical information is needed by water managers to assess current and long-term water resource potential in the region. This information is critical to management of water supply and to avoid potential adverse affects on the hydrologic system.

Southern basins in Tamil Nadu are very much drought prone. The sustainable planning of water resources in such region requires hydrological information, the most crucial of which is the estimation of spatial and temporal variability of rainfall (Wilk et al. 2006; Goncalves and Alpuim 2006). This necessarily depends on rain gauge network in a given basin. Often the number of rain gauges in a basin is not adequate to arrive at average or mean precipitations used in any design study due to the large space and time variability of rainfall. Some authors (e.g., Tsintikidis et al. 2002) have attempted to identify possible sites for the placement of additional gauges in order to reduce the precipitation interpolation error using historical hourly data.

Conventionally, however, stochastic interpolation methods such as Kriging are widely recognized and used as standard methods for interpolation of hydrological variables such as groundwater level, precipitation, etc. For example, Vijayakumar and Remadevi (2006) used Kriging for the spatial analysis of groundwater levels and established its superiority over the inverse square distance. Jinfeng (2006) conducted a case study on the impacts of land use/cover change (LUCC) on the temporal and spatial variability of the groundwater level in an arid oasis in the Sangong River Watershed by using geographical information system (GIS), remote sensing (RS) and Kriging. Similarly, Shi et al. (2007) compared IDW, Ordinary Kriging and Simple Kriging interpolation of geostatistics analyst to interpolate rainfall data. Among other important conclusions, they observed that Exponential model of semivariogram has the highest precision than others. Yan-bing (2002) compared five different semivariogram models for monthly rainfall interpolation in their study area.

Although Kriging has been widely used, it is being recognized that traditional Kriging has a major limitation due to the need for a mathematical function for a semivariogram that might fit the surface to be interpolated. Consequently, some researchers have used and demonstrated the advantage of the universal function approximates such as artificial neural network (ANN) as a replacement to fitted authorized semivariogram model within ordinary Kriging (Teegavarapu 2007). Chowdhury et al. (2010) demonstrated that ANNs can also be used to map with noticeably higher accuracy than Kriging the complex and seemingly erratic spatial pattern of groundwater contamination provided that reasonable data preprocessing and exploratory data analysis are performed.

The present study aimed to investigate the efficacy of ANN to interpolate monthly average rainfall in non-sampling locations with the help of rainfall measured in the neighbouring locations. The idea of using only neighbouring rain gauge is derived from one of the commonly used non-parametric interpolation methods, the ‘nearest neighbour’ method, which is based on minimum Euclidian distance (Brath et al. 2002). Karlsson and Yakowitz (1987) used this method for studying rainfall runoff process. Szolgay et al. (2009) compared the performance of three interpolation methods namely nearest neighbour method, Kriging and inverse distance weighting (IDW) in producing maps of 2-year and 100-year daily precipitation as extreme rainfall indices. In this study, though directly nearest neighbour method is not used, an emphasis is laid in selection of appropriate neighbourhood input variables for training ANN architecture. The efficacy of the proposed approach is demonstrated through an application in improving the inflows to Ramanadhi reservoir in the Tamirabarani basin. The next section briefly introduces Kriging and ANN techniques. This is followed by a description of the study area. Subsequently, the proposed methodology is discussed in detail, results analysed and conclusions drawn.

2 Artificial neural network

The ANN is briefly introduced in this section. Multilayer feed forward network with Back Propagation learning algorithm is one of the most popular neural network architectures, which has been deeply studied and widely used in many fields. Typically a neural network consists of three layers: (1) an input layer; (2) an output layer and (3) an intermediate or hidden layer. The input vectors are D ∈ R ⁿ and D = (X ₁, X ₂,…, X _n)^T; the outputs of q neurons in the hidden layer, Z ∈ R ^q, Z = (Z ₁, Z ₂,…, Z _q)^T, and the outputs of the output layer are Y ∈ R ^m, Y = (Y ₁, Y ₂,…, Y _m)^T, assuming that the weight and the threshold between the input layer and the hidden layer are w _ij and y _j, respectively, and the weight and the threshold between the hidden layer and output layer are w _jk and y _k, respectively; the outputs of each neuron in a hidden layer and output layer are:

$$ Z_{j} = f\left( {\sum\limits_{i = 1}^{n} {w_{ji} X_{i} - \theta_{j} } } \right), $$

(1)

$$ Y_{k} = f\left( {\sum\limits_{j = 1}^{q} {w_{kj} Z- \theta_{k} } } \right), $$

(2)

where f() is a transfer function, which is the rule for mapping the neuron’s summed input to its output, and by a suitable choice, is a means of introducing a non-linearity into the network design, and θ is the threshold. One of the most commonly used functions is the sigmoid function, and it is monotonically increasing and ranges from 0 to 1. Details on ANN can be found in Karunanithi et al. (1994). In this study, NEUROSHELL software is used for implementing ANN (Neuroshell2).

3 Kriging interpolation technique

Kriging is essentially a method of estimation. The assumption here is that there is a local structure in the sampling domain, and the accuracy of the estimated values depends on how this is accounted for. Moreover, a weighted sum of the observations is assumed to provide the best estimate. Thus, Kriging is a method of determining the optimal weights. The most commonly applied form of Kriging uses a “semivariogram”, which is a measure of spatial correlation between pairs of points describing the variance over a distance or lag h. Weights change according to the spatial arrangement of samples. The linear combination of weights is as follows:

$$ Y_{i} = \sum {\lambda_{i} y_{i} } , $$

(3)

where Y _i estimate of Kriging, y _i the variables evaluated in the observation locations and λ_i Kriging weights.

The semivariogram is formulated as follows:

$$ \gamma \left( h \right) = \frac{1}{2}E\left( {y_{\left( x \right)} - y_{{\left( {x + h} \right)}} } \right)^{2} , $$

(4)

where γ(h) semivariogram, dependent on distance h; (x, x + h) pair of points with distance vector h; y(x) regionalized variable y at point x; y(x) – y(x + h) difference of the variable at two points separated by h; E mathematical expectation.

Details on Kriging can be found in Kitanidis (1997) and Biau et al. (1999).

4 Study area

Tamirabarani River has its origin in Western Ghats near Senkottai district of Tamil Nadu State (India) with a catchment area of about 6,000 km². There are 18 rain gauge stations in the catchment (refer Fig. 1). Six years of monthly rainfall records are collected from these stations for a time period from January 1979 to December 1984. Out of 18 stations, stations S3 (Cheranmadevi), S6 (Kadayanallur) and S12 (Sattankulam) are used for validation of rainfall interpolation.

The location of Ramanadhi reservoir is also shown in the map. Monthly inflows to Ramanadhi reservoir are available from 1974 to 1986.

5 Interpolation of rainfall with ANN

The following four different models based on ANN for rainfall interpolation are considered in detail.

(a)
ANN with only three neighbouring stations (ANN_I): The methodology adopted for interpolation of rainfall is shown in Fig. 2. For the non-sampled location, rain gauge stations in the proximity and conforming to the same land use are selected (Table 1). The justification for use of rain gauges from the same land use can be understood from the fact that rainfall in the same land use (particularly for small- and medium-sized catchments) is expected to have same value. Figure 3 shows the general land use of the basin.

Table 1 Proximity stations using land use map

Full size table

ANN_1:

$$ I = f\left( {R_{1} ,R_{2} ,R_{3} } \right), $$

(5)

where R ₁ monthly rainfall of first rain gauge in the proximity, R ₂ monthly rainfall of second rain gauge in the proximity and R ₃ monthly rainfall of third rain gauge in the proximity.

The output consists of rainfall at the non-sampled location.

(b)
ANN with three neighbouring stations and their distances from origin as inputs (ANN_II): The easting and northing of rain gauge stations are determined from an arbitrary origin, and their distances from the origin are tabulated in Table 2. Since rainfall varies spatially, it is expected that incorporating distance to a particular station from a reference origin (as one of the input variables in ANN) may help in improving the prediction.

Functionally, the model can be represented as:

ANN_II:
$$ I = f\left( {R_{1} ,D_{1} ,R_{2} ,D_{2} ,R_{3} ,D_{3} ,D_{\text{ns}} } \right), $$
(6)
where R ₁ monthly rainfall of first rain gauge in the proximity, D ₁ distance to the first rain gauge in the proximity from an assumed origin, R ₂ monthly rainfall of second rain gauge in the proximity, D ₂ distance to the second rain gauge in the proximity from an assumed origin, R ₃ monthly rainfall of third rain gauge in the proximity, D ₃ distance to the third rain gauge in the proximity from an assumed origin, D _ns distance of non-sampled location in the study.

The output consists of rainfall at the non-sampled location.
Table 2 Co-ordinates information for 18 stations
Full size table
(c)
ANN with all stations (ANN_III): This model is an extension of ANN_I wherein all the stations in the catchment form input vectors to the ANN model. Therefore, functionally, the model can be represented as:

ANN_III:
$$ I = f\left( {R_{1} ,R_{2} , \ldots R_{n} } \right), $$
(7)
where R ₁ monthly rainfall of first rain gauge, R ₂ monthly rainfall of second rain gauge, R _n monthly rainfall of nth rain gauge and n number of stations in the catchment.

The output consists of rainfall at the non-sampled location.
(d)
ANN with all stations and their distances from origin as inputs (ANN_IV): This model is an extension of ANN_II wherein the all rain gauges stations and their distances from an origin is given as input to the model. Functionally, the model can be represented as:

ANN_IV:
$$ I = f\left( {R_{1} ,D_{1} ,R_{2} ,D_{2} , \ldots R_{n} ,D_{n} ,D_{\text{ns}} } \right), $$
(8)
where R ₁ monthly rainfall of first rain gauge, D ₁ distance to the first rain gauge from an assumed origin, R ₂ monthly rainfall of second rain gauge, D ₂ distance to the second rain gauge from an assumed origin, R _n monthly rainfall of nth rain gauge, D _n distance to the nth rain gauge from an assumed origin, D _ns distance of non-sampled location in the study and n number of stations in the catchment.

The output consists of rainfall at the non-sampled location.

The back-propagation algorithm is used for training the ANN architecture consisting of one hidden layer. Out of 18 stations in the catchment, 10 are used for training, 5 for testing and 3 for validation. Now for each of these stations, input vectors are constructed for all the ANN models. The results are compared and tabulated for the months of January–March for the year 1984 in Table 3.

Table 3 ANN results compared with Kriging

Full size table

As can be observed from the table, the RMSE for the month of January by ANN_II is only 9 mm as against 12.7 by ANN_I. On an average for all the three months, ANN_II estimates are better than ANN_I. It is interesting to note that by including the distance to stations as one of the inputs, the ability of ANN to map the complexity in variation of rainfall over the basin is considerably increased despite the fact that a very small sample of data is used for training. This is reflected in the difference in RMSE values in the two cases.

Further, the comparison of ANN_III and ANN_IV clearly shows that it is better to incorporate all the rain gauges stations in the catchment for better estimation of rainfall. However, if the ‘distance from origin’ variables is dropped (ANN_III), even with inclusion of all stations do not improve the interpolation.

6 Interpolation of rainfall with Kriging

Here, again two different models are considered, namely Kriging with only three immediate neighbourhood stations as used for ANN_I and ANN_II interpolation (Kriging_I) and Kriging with all the stations in the basin as done in conventional methods (Kriging_II). This is illustrated in Table 3, where the results of Kriging analysis are compared with ANN_I and ANN_II. As seen from the table, RMSE for Kriging is significantly more than ANN models. Further, it can be observed that inclusion of all the stations in the Kriging model only marginally improves the result when compared with Kriging_I.

7 Inflow prediction to Ramanadhi reservoir

Here, an attempt is made for improving the prediction of inflows to reservoir by estimating rainfall value in the vicinity of the reservoir using ANN_II (as discussed above). The methodology is illustrated in Fig. 4. For the inflow prediction, first the reservoir for which inflow prediction is required is identified along with a few more sites in the vicinity of the reservoir. In this study, up to five sites in the vicinity are considered, and it is found that best prediction comes for a neighbourhood network of three sites. For these three sites, monthly rainfall is estimated using ANN_II model, which will form a part of input variables for inflow prediction. The input vector for inflow prediction can be mathematically depicted as:

$$ I_{i + 1} = f\left( {I_{i - 1} ,I_{i} ,R_{1i} ,R_{2i} ,R_{3i} } \right), $$

(9)

where I _i and I _i–1 are antecedent inflows. R _1i, R _2i and R _3i are antecedent rainfalls in the neighbouring sites (as estimated using ANN_II).

The ANN is trained, and the results of inflow prediction for the year 1984 are tabulated in Table 4 and Fig. 5. As seen from the table, there is a drastic improvement in the RMSE value with rainfall incorporated as input variable. The RMSE value decreases from 4.1 to 1.6 cumec. This is clearly reflected in figure. Particularly, the peak value is very closely predicted. For example, the inflow for the period of February 1984 is predicted as 6.08 cumec as against 3.11 cumec for prediction without using rainfall as input. Similarly, the inflow for the period of March 1994 has been predicted as 13.36 cumec which is very close to the actual value of 14.17 cumec.

Table 4 Inflow predictions for Ramanadhi reservoir

Full size table

This improvement in the prediction of inflow to the reservoir is very vital for system engineers/irrigation engineers for optimal planning of reservoir operation.

8 Conclusion

A novel method is proposed for interpolating rainfall values in a region with the help of available rain gauge stations. Further, the proposed method is applied for improving the inflow to a reservoir. The following conclusions can be arrived at from the study:

(a)
ANN seems to be a powerful alternative to the traditional Kriging method of spatial interpolation.
(b)
The effectiveness of ANN training crucially depends on not only the length of training records but also the quality of input variables.
(c)
Results are only marginally improved in both ANN and Kriging models when all the stations in the catchment are used when compared to stations in the proximity.
(d)
Inclusion of some kind of ‘distance’ variable in the input seems to play an important role in mapping the complexity of variability provided all the stations are included. Further studies should be carried out in both interpolations of rainfall as well as other hydrological variables such as ground water level, electrical conductivity variation, etc.
(e)
Inflows to a reservoir can be improved using antecedent rainfall in the neighbourhood.

References

Biau G, Zorita E, Storch HV, Wackernagel H (1999) Estimation of precipitation by Kriging in the EOF space of the sea level pressure field. J Clim 12(4):1070–1085
Article Google Scholar
Brath A, Montanari A, Toth E (2002) Neural networks and non-parametric methods for improving realtime flood forecasting through conceptual hydrological models. Hydrol Earth Syst Sci 6(4):627–640
Article Google Scholar
Chowdhury M, Alouani A, Hossain F (2010) Comparison of ordinary Kriging and artificial neural network for spatial mapping of arsenic contamination of groundwater. Stoch Environ Res Risk Assess. doi:10.1007/s00477-008-0296-5
Goncalves AM, Alpuim T (2006) Precipitation measurement and the analysis of hydrological resources in a river basin. In: 7th international symposium on spatial accuracy assessment in natural resources and environmental sciences (Accuracy 2006), pp 851–860
Jinfeng Y, Chen X, Geping L, Quanjun G (2006) Temporal and spatial variability response of groundwater level to land use/land cover change in oases of arid areas. Chin Sci Bull 51(Suppl 1):51–59
Google Scholar
Karlsson M, Yakowitz S (1987) Nearest neighbor methods for nonparametric rainfall-runoff forecasting. Water Resour Res 23(7):1300–1308
Article Google Scholar
Karunanithi N, Grenney WJ, Whitley D, Bovee K (1994) Neural networks for river flow prediction. J Comput Civil Eng 8(2):201–220
Article Google Scholar
Kitanidis PK (1997) Introduction to geostatistics: applications to hydrogeology. Cambridge University Press, New York
Book Google Scholar
Neuroshell2 (Release 4.0).Ward systems Group Inc., 1993–1998
Shi Y, Li L, Zhang L (2007) Application and comparing of IDW and Kriging interpolation in spatial rainfall information. In: Proceedings of SPIE, Geoinformatics 2007, vol 6753
Szolgay J, Parajka J, Kohnová S, Hlavcová K (2009) Comparison of mapping approaches of design annual maximum daily precipitation. Atmos Res 92:289–307
Google Scholar
Teegavarapu RSV (2007) Use of universal function approximation in variance dependent surface interpolation method—an application in hydrology. J Hydrol 332:16–29
Article Google Scholar
Tsintikidis D, Georgakakos KP, Sperfslage JA, Smith DE, Carpenter TM (2002) Precipitation uncertainty and rain gauge network design within Folsom Lake watershed. J Hydrol Eng 7(2):175–184
Article Google Scholar
Vijayakumar V, Remadevi R (2006) Kriging of groundwater levels—a case study. J Spat Hydrol 6(1):81–94
Google Scholar
Wilk J, Kniveton D, Andersson L, Layberry R, Todd MC, Hughese D, Ringrose F, Vanderpost C (2006) Estimating rainfall and water balance over the Okavango River Basin for hydrological applications. J Hydrol 331(1–2):18–29
Article Google Scholar
Yan-bing T (2002) Comparison of semivariogram models for Kriging monthly rainfall in eastern China. J Zhejiomj Univ Sci 3(5):584–590
Article Google Scholar

Download references

Acknowledgments

This paper is a part of the research work funded by the Council of Scientific and Industrial Research (CSIR), Govt. of India under grant No. 24 (0305)/09/EMR-II. We wish to acknowledge the financial support rendered by the CSIR.

Author information

Authors and Affiliations

Department of Civil Engineering, Kalasalingam University, Krishnankoil, Srivilliputtur, Tamil Nadu, 626 190, India
C. Sivapragasam, V. M. Arun & D. Giridhar

Authors

C. Sivapragasam
View author publications
You can also search for this author in PubMed Google Scholar
V. M. Arun
View author publications
You can also search for this author in PubMed Google Scholar
D. Giridhar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to C. Sivapragasam.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sivapragasam, C., Arun, V.M. & Giridhar, D. A simple approach for improving spatial interpolation of rainfall using ANN. Meteorol Atmos Phys 109, 1–7 (2010). https://doi.org/10.1007/s00703-010-0090-z

Download citation

Received: 07 April 2009
Accepted: 28 September 2010
Published: 10 October 2010
Issue Date: November 2010
DOI: https://doi.org/10.1007/s00703-010-0090-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A simple approach for improving spatial interpolation of rainfall using ANN

Abstract

Similar content being viewed by others

ANN-based Rainfall-Runoff Model and Its Performance Evaluation of Sabarmati River Basin, Gujarat, India

Simulation of rainfall-runoff process using an artificial neural network (ANN) and field plots data

Comparison of Spatial Interpolation Methods for Mapping Rainfall in Indian Himalayas of Uttarakhand Region

1 Introduction

2 Artificial neural network

3 Kriging interpolation technique

4 Study area

5 Interpolation of rainfall with ANN

6 Interpolation of rainfall with Kriging

7 Inflow prediction to Ramanadhi reservoir

8 Conclusion

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A simple approach for improving spatial interpolation of rainfall using ANN

Abstract

Similar content being viewed by others

ANN-based Rainfall-Runoff Model and Its Performance Evaluation of Sabarmati River Basin, Gujarat, India

Simulation of rainfall-runoff process using an artificial neural network (ANN) and field plots data

Comparison of Spatial Interpolation Methods for Mapping Rainfall in Indian Himalayas of Uttarakhand Region

Explore related subjects

1 Introduction

2 Artificial neural network

3 Kriging interpolation technique

4 Study area

5 Interpolation of rainfall with ANN

6 Interpolation of rainfall with Kriging

7 Inflow prediction to Ramanadhi reservoir

8 Conclusion

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation