Introduction

Monitoring networks established in different river basins collects data on hydrometeorological parameters such as rainfall, streamflow, water quality, sediment, etc.; which are required for water resources planning, design and management. Efficient information gathering system should assess the variability in hydrologic processes with spatial and temporal scales and reduce the costs of data collection. Evaluation of network involves determining optimal number of monitoring stations and their respective locations by which hydrological information can be maximized [1]. Setting up and maintaining a rain gauge network (RGN) is an evolutionary process, wherein a minimum network is established early in the development of the geographical area; and the network reviewed and upgraded periodically to arrive at an optimum network. For network optimisation, approaches commonly used include statistical approaches, user-survey technique, hybrid method and sampling strategies. Statistical approaches for RGN optimisation range from clustering techniques, spatial regression methods (in generalized least square framework) and entropy-based methods.

Entropy method quantifies the relative information content for the RGN, and has the advantage that it needs only rainfall data for evaluation. The method facilitates network design by quantifying the marginal contribution of each data collection node to the overall information provided by the network using an index termed as marginal entropy. In recent past, entropy method is widely applied by various researchers for evaluation of different hydrometeorological networks such as streamflow, rainfall, groundwater and water quality monitoring etc., to derive optimum networks. The researchers have applied entropy based approach to design stream gauge network for Pembina river basin in southern Manitoba, Canada [2]. They have adopted a directional informational transfer index to identify the most priority station amongst seven stations in the network. Previously, the entropy technique has been applied for the assessment of water quality monitoring networks for Mississippi River in Louisiana [3]. They concluded that the concept of entropy is a promising method as it quantitatively measures the information produced by network, which will improve the efficiency and cost-effectiveness of current monitoring programs. The entropy and least square methods can be applied to evaluate the stream gauge network in the State of Illinois through an assessment of the transferring of information (transinformation) among gauging records for low, average, and high flow conditions [4]. They expressed that the hybrid combination of the entropy and GLS measures of regional value of stations would certainly provide more insight in assessment of stream gauging program and the station rankings based on the combined method can preserve correlation with rankings using both entropy and GLS.

The researchers have applied entropy theory to evaluate the stream flow network for Kizihrmak basin [5]. They compared the model uncertainty in stream flow network with transinformation index using normal, log-normal and gamma distributions. They also suggested that the ranking of log-normal distribution could be considered for selection of stream gauge network for Kizihrmak basin. The comparisons were made on the applications of mixed and continuous distributions in entropy theory to study about the rainfall variability and rainfall intermittency using transinformation index [6]. The study showed that the mixed distribution is suitable for evaluation of RGN for Choongju basin, Korea. Moreover, optimisation theory related to hydrological network studies iterated that the rain gauge station provided more than 50 % redundant information is considered to be insignificant and for possible discontinuation from the core network. The investigators have analyzed the application of a novel, process-oriented stream gauging method for generating rating curves to establish a stream gauging network in the Whitewater River basin in south central Kansas [7]. Entropy theory can be applied to the high density RGN of the urban area of Rome for evaluation of maximum information content achievable by a rainfall network for different sampling time intervals [8]. The scientists have applied entropy theory-based maximum information minimum redundancy criterion for evaluation of hydrometric network [9]. In the present study, normal distribution is used for computation of entropy values for the stations under consideration. Transinformation index measures the redundant or mutual information between variables, and is computed from marginal and conditional entropy indices to derive an optimum network. Kolmogorov–Smirnov (KS) test is used for checking the adequacy of fitting of normal distribution to the recorded data. The methodology adopted in computation of KS statistics values and evaluation of RGN using entropy method for upper Bhima basin up to Ujjani reservoir is briefly described in the ensuing sections.

Methodology

Normal Distribution

The probability density function (PDF) and cumulative distribution function (CDF) of two-parameter normal distribution is given by:

$$ {\text{f}}\left( {\text{X}} \right) = \frac{1}{{\sqrt {2\pi } \sigma }}{ \exp }\left( { - \frac{{\left( {{\text{X}} - \mu } \right)}}{{\sigma^{2} }}^{2} } \right),\; - \infty < {\text{X}} < \infty ,\;\sigma > 0 $$
(1)
$$ {\text{F}}\left( {\text{X}} \right) = \frac{1}{2}\left[ {1 + {\text{erf}}\left( {\frac{{\left( {{\text{X}} - \mu } \right)}}{\sigma \sqrt 2 }} \right)} \right] $$
(2)

where μ and σ the parameters of the distribution [10]. The parameters are determined from the mean and standard deviation of the recorded data. Here, ‘erf’ indicates the error function of normal distribution.

Kolmogorov–Smirnov Test

The adequacy of fitting of normal distribution to the recorded data is judged by KS test and the theoretical description of KS statistics is expressed by:

$$ {\text{KS}} = \mathop {\text{Max}}\limits_{{{\text{i}} = 1}}^{\text{N}} \left( {F_{\text{e}} \left( {{\text{X}}_{\text{i}} } \right) - {\text{F}}_{\text{D}} \left( {{\text{X}}_{\text{i}} } \right)} \right) $$
(3)

Here Fe(Xi) = (i − 0.44)/(N + 0.12) is the empirical CDF of Xi and FD(Xi) is the computed CDF of Xi. If the computed value of KS statistics given by normal distribution is less than the theoretical value at the desired significance level, then the distribution is considered to be suitable for fitting to the recorded rainfall data [11].

Concept of Entropy

A quantitative measure of the uncertainty associated with a probability distribution, or the information content of the distributions termed as, Shannon entropy, which can be mathematically expressed as:

$$ {\text{H}}({\text{X}}) = - {\text{k}}\sum {{\text{P}}_{\text{i}} } \;{ \ln }\left( {{\text{P}}_{\text{i}} } \right) $$
(4)

where, H(X) is the entropy corresponding to the random variable X; k is a constant that has value equal to one, when natural logarithm is taken; and Pi represents the probability of ith event of random variable X [12].

Marginal Entropy

Marginal entropy for the discrete random variable X is defined as:

$$ {\text{H}}({\text{X}}) = - {\text{k}}\sum\limits_{{{\text{i}} = 1}}^{\text{N}} {{\text{P}}\left( {{\text{X}}_{\text{i}} } \right)} \;{ \ln }\left[ {{\text{P}}\left( {{\text{X}}_{\text{i}} } \right)} \right] $$
(5)

where P(Xi) is the probability of occurrence of Xi, computed by normal distribution, and N is the number of observations [13]. The marginal entropy H(X) indicates the amount of information or uncertainty that X has. If the variables X and Y are considered as independent, then the joint entropy [H(X, Y)] is equal to the sum of their marginal entropies defined by:

$$ {\text{H}}\left( {{\text{X}},\;{\text{Y}}} \right) = {\text{H}}\left( {\text{X}} \right) + {\text{H}}\left( {\text{Y}} \right) $$
(6)

If the variables are stochastically dependent, then the joint entropy is less than its total entropy.

Conditional Entropy

Conditional entropy measures the entropy of a random variable Y, if one has already learned completely about the random variable X. The conditional entropy of Y on X is defined by:

$$ {\text{H}}\left( {{{\text{Y}} \mathord{\left/ {\vphantom {{\text{Y}} {\text{X}}}} \right. \kern-0pt} {\text{X}}}} \right) = {\text{H}}\left( {{\text{X}},\;{\text{Y}}} \right) - {\text{H}}\left( {\text{X}} \right) $$
(7)

Conditional entropy value becomes zero, if the value of one variable is completely determined by the value of other variable. If the variables are independent, then \( {\text{H}}\left( {{{\text{Y}} \mathord{\left/ {\vphantom {{\text{Y}} {\text{X}}}} \right. \kern-0pt} {\text{X}}}} \right) = {\text{H}}\left( {\text{Y}} \right) \).

Transinformation Index

Transinformation is the form of entropy that measures the redundant or mutual information between variables [14]. Transinformation represents the amount of information, which is common to two stochastically dependent variables X and Y. The transinformation between X and Y is defined as:

$$ {\text{T}}\left( {{\text{X}},{\text{Y}}} \right) = {\text{H}}\left( {\text{X}} \right) + {\text{H}}\left( {\text{Y}} \right) - {\text{H}}\left( {{\text{X}},{\text{Y}}} \right) = {\text{H}}\left( {\text{Y}} \right) - {\text{H }}\left( {{\text{Y}}|{\text{X}}} \right) $$
(8)

If the variables X and Y are dependent to each other, then the transinformation index \( {\text{T}}\left( {{\text{X}},{\text{Y}}} \right) = 0 \).

Steps Involved in Computation of Entropy Measures

In practice, the existing sampling sites of a RGN can be arranged in the order of information content. In the ordered list thus obtained, the first station is the one where the highest uncertainty about the variable occurs, and the subsequent stations serve to reduce the uncertainty further. The steps involved in selecting the best combination of stations using entropy method are as follows:

  1. (i)

    Let the data collection network under review, consists of M monitoring stations. The data series of the variable of interest at each station (X1, X2,…, XM) is represented by Xij, where ‘i’ denotes the station identification number (i = 1, 2,…, M) and ‘j’ is for time period (j = 1, 2,…, N). The data length at all stations is assumed to be equal to N. The best fitted multivariate joint PDF for the subset (X1, X2,…, XM) of M monitoring stations is selected.

  2. (ii)

    The marginal entropy of the variable H(Xi) (i = 1, 2,…, M) for each station is calculated. The station with the highest marginal entropy is denoted as the first priority station Pr(Xz1). This is the location, where the highest uncertainty occurs about the variable and hence information-gain will be highest from the observations recorded at this site.

  3. (iii)

    This station Pr(Xz1) is coupled with every other (M − 1) stations in the network to compute transinformation T(Xi, Pr(Xz1)) with Xi ≠ Pr(Xz1), i = 1, 2,… M; and to select that pair, which gives the least transinformation. The station that fulfils this condition is marked as the second priority location Pr (Xz2).

  4. (iv)

    The pair [Pr(Xz1), Pr(Xz2)] is coupled with every other (M − 2) station in the network to select a triplet with the least transinformation T[Xi; Pr(Xz1), Pr(Xz2)]. The same procedure is continued by successively considering combinations of three and more stations, and selecting the combination that produces the least transinformation. Finally, all M monitoring stations (X1, X2,…, XM) can be ranked in priority order to get [Pr(Xz1), Pr(Xz2),…, Pr(XzM)].

  5. (v)

    It is possible to terminate the above process early, before carrying out for all M stations by selecting a particular threshold transinformation value as the amount of redundant information to be permitted in the network, such that sampling of the variable may be stopped at the stations that exceed the threshold to get optimum number of stations, which is less than M.

In the above procedure, the benefits for each combination of sampling sites are measured in terms of least transinformation or the highest conditional entropy produced by that combination. The above procedure helps to assess network configurations with respect to the existing stations. If new stations are to be added to the system, their locations may be selected again on the basis of the entropy method by ensuring maximum gain of information. The correlation coefficient of each monitoring station can be computed by using Eq. (9) based on transinformation index.

$$ {\text{Transinformation }}\left( {\text{T}} \right) = - \frac{1}{2}{ \ln }\left( {1 - {\text{R}}^{2} } \right) $$
(9)

where R represents the multiple correlation coefficient of Xi on Pr(Xz1), Pr(Xz2),…, Pr(XzM).

Application

The methodology detailed above has been applied to evaluate the RGN for upper Bhima basin up to Ujjani reservoir. The basin is located in the western part of Maharashtra between 18°03′N–19°24′N latitude and 70°20′E–75°18′E longitude. The geographical area of the basin is 14,712 km2. Of the total geographical area under study, 25 % is hilly and/or highly dissected, 55 % plateau and 20 % is remaining plain area [15]. Figure 1 gives a location map of the study area. From scrutiny of historical rainfall data, it is noted that 25 rain gauge stations have concurrent data for the period 1971–2007; and considered in deriving the optimum RGN for the basin under study. The daily rainfall series was used to compute the annual rainfall series for each rain gauge station and delineated in Fig. 2a–e.

Fig. 1
figure 1

Location map of the study area

Fig. 2
figure 2figure 2

ae Plot of recorded annual rainfalls during 1971–2007 for 25 rain gauge stations of Upper Bhima basin

Results and Discussions

By applying the procedure detailed above, a computer program was developed and used to evaluate the RGN of upper Bhima basin. The program compute the KS statistics values, identify the first priority station based on marginal entropy, compute the conditional entropy with reference to first priority station and arrange the rain gauge stations in order of priority based on transinformation index. Table 1 gives the statistical parameters (μ, σ and CV), KS statistics values and marginal entropy values obtained from normal distribution for 25 rain gauge stations considered in the study.

Table 1 Statistical parameters of annual rainfall, computed values of KS statistics and marginal entropy values of 25 rain gauge stations given by normal distribution

From Table 1, it may be noted that the Khandala rain gauge station receives maximum average annual rainfall of about 4,313 mm when compared to the corresponding values of other rain gauge stations. Also, from Table 1, it may be noted that the CV values computed from average annual rainfall vary from about 23–54 %.

Based on GoF test results, it may be noticed that the computed values of KS statistics of all 25 rain gauge stations given by normal distribution are lesser than the theoretical value of 0.218 at 5 % significance level, and at this level, the normal distribution is considered to be acceptable for fitting annual rainfall data recorded at the rain gauge stations considered in the study. From Table 1, it transpires that the station is coupled with other 24 rain gauge stations individually to identify the next priority station in order, and to compute the transinformation index. Table 2 gives the transinformation index matrix obtained from normal distribution for the stations under study. Table 3 give the details on redundant information passed by each station based on transinformation index and its corresponding correlation coefficient values given by normal distribution.

Table 2 Transinformation index matrix based on marginal entropy given by normal distribution
Table 3 Details of redundant information passed by each station together with correlation coefficient based on normal distribution

From Table 3, it may be noted that Whiram station provides 100 % information, as indicated by the transinformation index value; and considered as the reference station for computation of redundant information for other stations. Also, from Table 3, it may be noted that the amount of redundant information from Kadus, Chaskman, Holkarpul, Ranjangaon, Askheda and Whiram stations are about 62, 65, 74, 80, 85 and 100 % respectively; and are proposed for possible discontinuation from the existing RGN. The study showed that the derived optimum network for the basin consists of 19 rain gauge stations with 774 km2 per gauging station; which satisfies the WMO [16] recommended value of 600–900 km2 per station for minimum density of RGN. The results are expected to be of assistance to stakeholders for decision making as regards RGN optimisation in the Upper Bhima basin.

Conclusions

This paper presented an entropy based procedure for evaluation of RGN of upper Bhima basin, consists of 25 rain gauge stations. The normal distribution was used to represent the data and then to compute the marginal and conditional entropy indices, and the transinformation index. The average annual rainfall recorded at the rain gauge stations varied from about 421–4,313 mm. The CV values based on average annual rainfall also varied from 23–54 %. The KS test results supported the use of normal distribution for fitting annual rainfall data recorded at the rain gauge stations considered in the study. The results showed that Khandala station is the first priority station with highest marginal entropy for upper Bhima basin. The results also showed that Whiram station provided 100 % redundant information and hence considered as the reference station for computation of redundant information passed by other stations. From the results of transinformation index given by normal distribution, it may be noted that the amount of redundant information given by Kadus, Chaskman, Holkarpul, Ranjangaon, Askheda and Whiram stations are about 62, 65, 74, 80, 85 and 100 % respectively; and are proposed for possible discontinuation from the core network while optimising RGN of upper Bhima basin up to Ujjani reservoir. Thus, the derived optimum network for Upper Bhima basin consists of 19 rain gauge stations, which satisfy WMO guidelines as regards minimum density for RGN. The results are expected to be of assistance to stakeholders for decision making as regards RGN optimisation in the river basin.