Developing the Regional Nonstationary IDF Curves Using NGN-ProNEVA Framework

Mahmoudi, Mohammad Reza; Tahanian, Moein; Gohari, Alireza; Eslamian, Saeid

doi:10.1007/s11269-023-03619-5

Developing the Regional Nonstationary IDF Curves Using NGN-ProNEVA Framework

Published: 03 October 2023

Volume 37, pages 5581–5599, (2023)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Water Resources Management Aims and scope Submit manuscript

Developing the Regional Nonstationary IDF Curves Using NGN-ProNEVA Framework

Download PDF

Mohammad Reza Mahmoudi ORCID: orcid.org/0000-0002-3272-9870¹,
Moein Tahanian¹,
Alireza Gohari¹ &
…
Saeid Eslamian¹

159 Accesses
Explore all metrics

Abstract

Intensity–Duration–Frequency (IDF) curves are known as practical tools in the construction of infrastructures. However, developing such curves is restricted in many regions due to sparse and insufficient record length of in-site rainfall observations. It is hence recommended to generate the regional curves. According to the hydro-climate variability and change during the recent decades, it is essential to consider the non-stationarity of hydro-climate variables. In this research, the Neural Gas network (NGN) coupled with ProNEVA have been applied to develop non-stationary regional IDFs. The l-moments approach was used to plot regional stationary IDF curves to compare the results of non-stationary IDFs for homogenous regions. The results showed that the regional nonstationary curves had overestimated rainfall intensity compared with the regional stationary curves which can attribute to the decreasing trend of rainfall over the study area. The average value of overestimation in the return period of 2 years was equal to 50 percent. This overestimation was more significant for lower return periods, which indicates that the nonstationary approach is more important for short-duration events. The return period of 100 years is equal to 25 percent in region two, and in region one, it is equal to 20 and 43 percent, respectively.

Regionalization of rainfall intensity–duration–frequency (IDF) curves with L-moments method using neural gas networks

Article 05 August 2022

Annual Daily Maximum Rainfall-Based IDF Curve Derivation Methodology

Article 04 October 2019

Framework for developing IDF curves using satellite precipitation: a case study using GPM-IMERG V6 data

Article 02 October 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Global warming is known as one of the most important threats for ecosystem services and human societies. There will be an increase in the unpredictability of water flows, linked to more frequent and extreme weather events and temporal and spatial shifts in rainfall patterns resulting in more intense floods and droughts, severe forest fires, rising sea levels, flooding, melting polar ice, catastrophic storms and declining biodiversity (Tsakiris and Loucks 2023). Precipitation is among major climatic variables that affects the hydrological regime with temporal and spatial variations. Extreme rainfall can lead to floods causing damage to buildings, farmlands, and infrastructures. Estimating future rainfall intensity is hence required in the optimal design of hydraulic structures as well as reducing the potential damage such disasters can cause (Hardy 2003).

The Intensity–Duration–Frequency (IDF) curves are widely used to capture rainfall characteristics.

The atlas of IDFs has been developed in developed countries like USA by the American National Weather Service created the National Oceanic and Atmospheric Administration (NOAA) (Perica et al. 2011). However, generating such curves in a desired area with no station is challenging and the application of regional curves instead of at-site has been suggested.

Several studies suggested that the application of regional techniques on extreme rainfalls can increasingly reduce the doubts about the estimations resulting from the at-site viewpoint (Lee and Maeng 2003; Alemaw and Chaoka 2016; Abdi et al. 2017; Mahmoudi et al. 2023). In addition, the IDF regionalization is useful in shortening the steps and required time for the calculations of the IDF curves. (Mahmoudi et al. 2023).

In the recent decades, extreme events have intensified due to global warming and human impacts (Tramblay et al. 2012; Cavanaugh et al. 2015; Xu et al. 2015). Numerous investigations indicated a significant increase in extreme precipitation over North America (Westra et al. 2014; Asadieh and Krakauer 2015), which has caused substantial structural and economic damages. Other studies reported increased extreme precipitation in Africa, Europe, and Asia (Lenderink and Van Meijgaard 2008; Singh et al. 2014; Jamali et al. 2022; Motamedi et al. 2023).

Given the importance of considering climate change and future trends in the design and construction of infrastructure, it is vital to consider the trends and nonstationary to improve the design parameters and minimize the anticipated damages to human societies (Li et al. 2018; Vasiliades et al. 2015). Therefore, a shift from stationary to nonstationary-based IDF curves is vital to capture non-monotonic trends in extreme precipitation (Veneziano and Yoon 2013; Cheng, and AghaKouchak 2014; Blanchet et al. 2016; Agilan and Umamahesh 2016).

Agilan and Umamahesh (2016) applied a multi-objective genetic algorithm (MOGA) to develop nonstationary GEV models by capturing nonlinear trends in the series. Using Bayesian inference, Ragno et al. (2019) presented a generalized framework for estimating nonstationary IDF curves and their uncertainties.

As in-site rainfall observations are sparse and have insufficient record length, the nonstationary IDFs should be regionalized to address these gaps in many regions around the world. Most of the earlier studies were focused on stationary regional IDF (Overeem and Syvitski 2010) and nonstationary IDF (Lima et al. 2016, Sarhadi and Soulis 2017), while investigations on developing regional nonstationary IDFs are generally lacking.

In this study, we developed regional nonstationary IDF curves by employing non-stationarity and clustering principles. The process-informed nonstationary extreme value analysis was applied to develop a newly-developed hybrid evolution Markov Chain Monte Carlo (MCMC) approach for numerical parameters estimation and uncertainty assessment. (Ragno et al. 2019). The present study applies a Neural Gas Network (NGN) method to cluster the hydrological data and ascertain the homogeneous regions. The NGN is one of the competitive neural networks and uses an unsupervised teaching method (Fritzke 1995). This study aims to develop regionalized nonstationary IDF curves by integrating NGN and ProNEVA as a new framework named NGN-ProNEVA. The proposed framework can describe the changes in annual maximum precipitation in response to CO₂ emissions in the atmosphere and annual maximum sea levels over time.

2 Study Area

Khuzestan province is located between 47°41´ to 50°39´ eastern longitude and 29°58´ to 33°04´ northern latitude. Despite having only 4% of the Iran's total area, this province owns more than 30% of the country's surface water. The annual rainfall of the study area is 284.3 mm. The highest rainfall is recorded in Izeh station (Northeast of the province), with 614.8 mm. Khuzestan province is surrounded from the north and east by the Zagros Mountain range. Moving towards the interior of the province, the height of these mountains decreases and gives its place to the Mahor hills. Khuzestan includes two mountainous and plain regions. Two-fifths of the area of this province is mountainous, and three-fifths is plain.

3 Dataset

This study used data from 18 rain gauge stations located in Khuzestan province, Iran (Fig. 1). The data was obtained from the Khuzestan Water and Electricity Organization (KWEO). The hourly rainfall data from 1978 to 2015 were obtained to quantify the changes in the selected stations. All the selected stations consider as automatic recording stations. In addition, geographical information of the selected stations including latitude, longitude, elevation from sea level were applied to determine the number of optimal clusters (Table 1).

Table 1 Characteristics of selected stations in this study

Full size table

4 Methodology

The main steps of this study include: (1) quantifying the trends in extreme precipitation of the selected stations; (2) determining the optimal number and clustering the stations; (3) defining GEV nonstationary models and select the best model; (4) developing Nonstationary IDF Curves for target clusters; (5) extracting regional stationary IDF curves. To extract the annual maximum rainfall time series for the region, the maximum rainfall for all duration and each year was extracted here to consider the same years in the stations belonging to each region,. Then, the model with the highest frequency among all duration was considered as the best one for the station.

4.1 Trend Analysis

The Mann Kendal (M–K) test (Mann 1945; Kendall 1962) is a nonparametric statistical test to detect trends in time series, that has a long tradition of application in hydrology and has been applied in the case of extremes (Villarini et al. 2009; Cheng and AghaKouchak 2014; Jamali et al. 2022; Gohari et al. 2022; Motamedi et al. 2023). The power of the M–K test depends on the pre-assigned significance level, the magnitude of the trend, the sample size, and the number of variations within the time series (Yue et al. 2002).

The M–K statistics (S) is computed as:

$$\mathrm{S}=\sum_{i=1}^{N-1}\sum_{j=i+1}^{N}sgn({x}_{j}-{x}_{i})$$

(1)

where, $n$ is the number of data points, $x$ is the data values in time series and the $sgn$ function is defined as:

$$\mathrm{Sgn}\left(\uptheta \right)=\mathrm{sg}n \left({x}_{j}-{x}_{i}\right)=\left\{\begin{array}{cc}-1& \theta <0\\ 0& \theta =0\\ 1& \theta >0\end{array}\right\}$$

(2)

The standard value of the test statistics ($Z$) can be calculated as:

$$\mathrm{Z}=\left\{\begin{array}{cc}\frac{s-1}{\sqrt{var(s)}}& s>0\\ 0& s=0\\ \frac{s+1}{\sqrt{var(s)}}& s<0\end{array}\right\}$$

(3)

where, the $var$ (variance) is computed as:

$$var(s)=\frac{n(n-1)(2n+5)-\sum_{i=1}^{p}{t}_{i}({t}_{i}-1)(2{t}_{i}+5)}{18}$$

(4)

where $p$ is the number of tied groups and ${t}_{i}$ is the number of data in the ${i}^{th}$ (tied) group. Negative Z values indicate the decreasing trends while positive values show the increasing trends. This study applied the M–K test at 95% and 90% confidence levels to detect trends in the intensity of annual maximum rainfall over the selected durations.

4.2 Clustering

4.2.1 The Number of Optimal Clusters

. Determining the number of optimal clusters is the first step in regionalization. Numerous methods, including Chou & Su CS (Chou et al. 2004), Silhouette (Rousseeuw 1987), and Caliński and Harabasz (1974) indices, have been widely used to determine the optimal number of clusters. The highest values in Silhouette and Calinski-Harabasz indices and the lowest one in CS indicate the optimal number of clusters.

The silhouette value calculates how similar an object is to its cluster (cohesion) compared to other clusters (separation). The silhouette ranges from − 1 to + 1, where a high value denotes that the object is well matched to its cluster and inadequately matched to neighboring clusters. The clustering configuration is appropriate if most objects have a high value. The silhouette can be calculated with any distance metric, such as the Euclidean or Manhattan distances (Rousseeuw 1987).

The Calinski-Harabasz ($CH$) index for $K$ number of clusters on a dataset $D =[ {d }_{1},{ d}_{2} ,{ d}_{3} , \dots {d}_{N} ]$ is defined as,

$$CH=\left[\frac{\sum_{k=1}^{K}{n}_{k}{\Vert {c}_{k}-c\Vert }^{2}}{K-1}\right]/\left[\frac{\sum_{k=1}^{K}{\sum }_{i=1}^{{n}_{k}}{\Vert {d}_{i}-{c}_{k}\Vert }^{2}}{N-K}\right]$$

(5)

where, $nk$ and $ck$ are the number of points and centroid of the k^th cluster respectively, $c$ is the global centroid, $N$ is the total number of data points.

The CS measure is then defined as

$$\mathrm{CS}= \frac{\frac{1}{k}\sum_{i=1}^{k}[ \frac{1}{{N}_{i}}\sum_{{x}_{i}\in {C}_{i} }\,{max}_{{x}_{q}\in {C}_{i}} \{d({x}_{i}.{x}_{q})\} ]}{\frac{1}{k}\sum_{i=1}^{k}[{min}_{ j\in k. j\ne i }\{d({m}_{i}.{m}_{j})\} ]}=\frac{\sum_{i=1}^{k}[\frac{1}{{N}_{i}}\sum_{{x}_{i}\in {C}_{i} }\,{max}_{{x}_{q}\in {C}_{i}} \{d({x}_{i}.{x}_{q})\} ]}{\sum_{i=1}^{k}[{min}_{j\in k. j\ne i }\{d({m}_{i}.{m}_{j})\} ]}$$

(6)

where $d\left({x}_{i}.{x}_{q}\right)$ is a function of the distance between two points in the $i$ cluster, $k$ is the number of clusters, ${N}_{i}$ is the number of members in each cluster, and ${m}_{i}$ is the center of each cluster.

4.2.2 Neural Gas Network (NGN)

In this study, the selected stations were divided into homogenous regions using NGN. The rule of learning in NGN is as follows:

$${w}_{i}^{new}={w}_{i}^{old}+{\alpha }_{i}(x-{w}_{i}^{old})$$

(7)

$${\alpha }_{i}=\varepsilon {e}^{\frac{{-k}_{i}}{\lambda }}$$

(8)

where ${w}_{i}$ is a gas molecule formed on data space (the number of these molecules is initially assumed as a value, and eventually, it is revised to have the logical and optimal function of the algorithm); ${\alpha }_{i}$ is a parameter that specifies the learning rate and depends on ${k}_{i}$, and $\lambda$; ${k}_{i}$ refers to the superior neuron to the neuron $i$; $\varepsilon$ is a constant number that controls the learning rate. If λ tends to infinity, learning of the whole neurons would be equal, and if it tends to zero, then the nearest neuron begins to learn. The extreme modes of λ are not suitable alone, and usually, a mode between them is chosen.

An edge function is defined to create a neighborhood between the first and second neurons in terms of proximity. For each neuron, there is ${c}_{i.j}\epsilon \left\{0.1\right\}$ showing that weather there exists an edge or not. The ${t}_{i.j}\epsilon \left\{0.1.2\dots .\right\}$ also indicates the age (time intervals) from the last meeting or re-edge, if it exceeds more than one size, the neighborhood will be broken. This approach helps the neural network to learn topology. Generally, the NG algorithm can be summarized as: (Step-1) creation of A random position (${w}_{i}$) in the data space; (Step-2) selection of an input (x) from the expected data; (Step-3) calculation of the distance between x, and the centers of ${w}_{i}$ and ${k}_{i}$ aging for each center; (Step-4) Adaption or learning step as Eq. (9):

$${w}_{i}^{new}={w}_{i}^{old}+\varepsilon {e}^{\frac{{-k}_{i}}{\lambda }}(x-{w}_{i}^{old})$$

(9)

As the algorithm progresses, the learning speed should be reduced during the training period; otherwise, the neural network will be repeated leading to an incorrect cycle. For this purpose, the amount of $\lambda$ and $\varepsilon$ should be decreased as learning progresses according to Eq. (10):

$$\mathrm{G}\left(\mathrm{t}\right)={G}_{i}{\left(\frac{{G}_{f}}{{G}_{i}}\right)}^{\frac{t}{{t}_{max}}}\quad\quad{\lambda }_{i}>{\lambda }_{f},{\varepsilon }_{i}>{\varepsilon }_{f},{T}_{i}<{T}_{f}$$

(10)

where $i$ and $f$ show the parameter values at the beginning and the end of the learning process, respectively.

Afterwards, an edge between the first two ranks in terms of proximity and age of this edge is considered equal to zero (create a neighborhood) as Step-5. By increasing the age of all edge (${t}_{i.j}\to {t}_{i.j}+1$), it is assumed that ${k}_{i}=0$ and for each j when ${t}_{i.j}>T$, it is considered as ${c}_{i.j}=0$.T should be increased during the learning period to reduce the degree of rigidity, which means the edges are allowed to last longer (Step-6). If the termination conditions (e.g., the maximum quantity of neurons or any amount of performance) are not met, Step-2 is repeated, Otherwise, algorithm would be finished.

4.2.3 L-Moments

L-moments were presented by Hosking (1990), and it has been found to have great application in many regional hydro-climatic analyses. The most crucial applications of L-moments include identifying the homogeneous regions, detecting the discordant sites, and selecting the appropriate distribution function over a larger range of distributions.. They also better display the outlier events (Rao and Hamed 1997). L- moments were used here to check regional homogeneity and discordant stations and determine appropriate regional distribution.

4.3 Nonstationary IDF Curves

The time series of extreme rainfall intensity in various durations were formed based on annual maxima throughout the record. According to the extreme value theory, the behavior of these annual maxima values can generally be described by one of the three extreme value distributions, including Gumbel, Fréchet, and Weibull. The generalized extreme value (GEV) distribution, a single parametric family, represents those mentioned above three extreme distributions (Agilan and Umamahesh 2016). The cumulative distribution function of the GEV is described (Hosking and Wallis 1993) as Eq. (11):

$${F}_{GEV}\left(x|\mu ,\sigma ,\xi \right)=exp\left[{-\left(1+\xi \left(\frac{x-\mu }{\sigma }\right)\right)}^{\frac{-1}{\xi }}\right]$$

(11)

$$F_{GEV(x)}\;\mathrm{is\;defined\;for}\;\{\mathrm{x}:\;1+\upxi(\mathrm{x}-\upmu)/\upsigma>0\},\;-\infty<\upmu<\infty,\upsigma>0,\;\mathrm{and}\;-\infty<\upxi<\infty$$

where: $\mu$, $\sigma$, and $\xi$ are the location, scale, and shape parameters, respectively (Smith 2001). Under stationary assumption, it is assumed that the GEV parameters, θ = {μ, σ, ξ}, are not changing over time. However, the parameters and characteristics of the GEV distribution become time-dependent under nonstationary conditions. To integrate the nonstationary effects, the nonstationary GEV model is defined by considering its parameters as a covariate of time (${x}_{t}$|${\mu }_{t}$, ${\sigma }_{t}$, ~GEV (${\mu }_{t}$, ${\sigma }_{t}$, ξ)):

$${f}_{{x}_{t}}\left({x}_{t}|{\mu }_{t},{\sigma }_{t},\xi \right)={\frac{1}{{\sigma }_{t}}\left[1+\xi \left(\frac{{x}_{t}-{\mu }_{t}}{{\sigma }_{t}}\right)\right]}^{-\frac{1}{\xi }-1}exp\left[{-\left(1+\xi \left(\frac{{x}_{t}-{\mu }_{t}}{{\sigma }_{t}}\right)\right)}^{-\frac{1}{\xi }}\right]$$

(12)

According to the Eq. (12), different nonstationary models were defined in this study to capture the non-stationarity in rainfall characteristics over the case study (Table 2).

Table 2 The defined nonstationary GEV models

Full size table

4.3.1 Parameter Estimation

In this study, the ProNEVA was applied to estimate the parameters of the extreme value distribution under both stationary and non-stationary assumptions using a Bayesian method, to address uncertainties derived from input errors and model selection.

A Bayesian Markov Chain Monte Carlo (BMCMC) approach was applied to characterized the uncertainty of time-varying IDFs. More information about BMCMC can be found in Gilks et al. (1995).

4.3.2 Model Diagnostics and Selection

We applied different metrics for Goodness Of Fit (GOF) assessment and model selection, including quantile and probability plots as a visual assessment along withthe Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC). The AIC (Akaike 1974; Aho et al. 2014) is formulated as Eq. (13):

$$AIC=2.\left(D-\widehat{L}\right)$$

(13)

where $D$ is the number of parameters in the statistical model and $\widehat{\text{L}}$ is the log-likelihood function evaluated at the vector of parameters. The BIC (Schwarz 1978) is also defined as:

$$BIC=D.\mathrm{ln}\left(N\right)-2.\widehat{L}$$

(14)

where $N$ is the length of records. The model associated with a lower AIC and BIC is considered as a better fit.

5 Results and Discussions

5.1 Trend Analysis

In order to investigate the trends in the study area, all 18 stations and 11 selected durations were studied using the M–K test. Due to the negative values of Z for most time series in different stations (Table 3), it can be declared that the rainfall trend in short durations is decreasing over the study area.

Table 3 The z value for detecting trends in different time series at 90% and 95% confidence levels

Full size table

Khuzestan Province is one of the most water-rich provinces in Iran, where large rivers flow. Therefore, its water resources planning and management should be updated according to the effects of climate change. According to the results, the trend of rainfall in the most of the time duration in this region has decreased, leading to a negative effect on available water resources. Jamali et al. (2022) reported that most stations throughout Iran showing drying features with higher temperatures, which is in agreement with the presented results. Generally, most regions over Iran implied a reduction in the annual precipitation.

Due to the detected tendency in the precipitation during the different durations, extracting the IDF curves based on the nonstationary assumption is essential. Although most of the durations do not have significant trends, the recent studies highlight the need to go beyond subjective criteria for significance analysis (Rosner et al. 2014), as most ground-based stations do not exhibit a statistically significant non-stationary behavior (Westra et al. 2014; Cheng and AghaKouchak 2014).

5.2 Clustering of Rainfall Stations

Figure 2 shows the number of optimal clusters in a range of clusters. The results indicated that the optimal number of homogeneous regions under all three methods was equal to 2.

In this study, rainfall station clustering was performed using NGN based on the optimal number of clusters. Figure 3 shows the results of clustering the study area into two separate regions. Region 1 occupied the eastern areas, and region 2 occupied the central and western parts of the province. The results revealed that the stations with the higher altitudes in the eastern cluster and the lower stations in the western cluster were divided into separate clusters, which confirms the proper functioning of NGN in terms of topographic detection of the data space. The results of our study.

5.3 Nonstationary Models

According to Table 4, the best regional nonstationary model was M2 in the case of using time covariate and M5 in the case of using CO₂ covariate for both homogenous regions. The scale parameter was variable in both selected models, and the location and shape parameters were constant.

Table 4 The best regional model in each region and Validation of the regional model in stations

Full size table

The uncertainty in the risk estimation of extreme precipitation in different durations because of replacing model M2 with M1 to capture non-stationarity for regions one and two are demonstrated in Fig. 4, respectively. The results showed that with an increasing return period, the values of uncertainty also increased, which is expected. As illustrated in Fig. 4, the uncertainty of ignoring M2 with M1 with a range of confidence level equal to 95 percent in region one and 85 to 95 percent in region two.

The uncertainty indicates that the M2 model, constructed by the scale parameter as a covariate in the GEV parameters, can capture non-stationarity effects. Gao et al. (2016) conducted a nonstationary extreme value analysis for annual maximum daily precipitation (AMP) at 631 meteorological stations over China for the period 1951–2013. They showed that the nonstationary GEV distributions performed better than their stationary equivalents. Likewise, Dixit and Jayakumar (2022) indicated that non-stationary analysis will be helpful in the accurate estimation of the drought characteristics under climate change.

To validate the results for each region, the selected model was examined for each station belonging to the homogenous regions and compared with the regional results. The results indicated that the best regional model for region one was considered the best-fitted model for all stations. However, the best regional model was M2, but the best model in some stations was considered M1 in region II (Table 4).

Comparing Fig. 4 with Table 4, it is apparent that the 95% confidence level for Region I and the average 90% confidence level for Region II are reasonable. Because in all Region I stations, the best model is M2, but for some stations in Region II, the best model is M1.

5.4 IDF Curves

The developed nonstationary and stationary IDF curves were extracted under different return periods and durations for both regions. It should be noted that the developed IDFs using time and CO₂ covariates were similar. This indicates that the effect of CO₂ concentration on rainfall intensity was not significant in the study area. However, it is crucial to study the effect of CO₂ on rainfall intensity in different climate characteristics. The results showed that the nonstationary IDF curves have consistently overestimated precipitation values compared to the stationary curves (Figs. 5 and 6). The overestimation of nonstationary curves could be attributed to the decreasing trend in rainfall over the study area. Since, this detected trend was not considered in the stationary curves. The overestimation of regional nonstationary curves compared to the corresponding at-site nonstationary curves could be also due to the difference in climate characteristics and topography between the regions.

In addition, the differences between nonstationary and stationary curves were notable in the lower return period, while this difference was reduced with increasing the return periods, indicating that the nonstationary approach is more important for short-duration events. Ganguli and Coulibaly (2017) investigated non-stationarity and trends in the short-duration precipitation extremes in selected urbanized locations in Southern Ontario, Canada. They reported that the stationary vs. nonstationary models do not exhibit any significant differences in the design storm intensity despite apparent signals of non-stationarity in precipitation extremes in all locations. Yang et al. (2020) reported that rainfall depth under non-stationarity was greater for return periods less than 10 years, however, the stationary rainfall depth was higher when the return period was over 20 years.

The regional nonstationary curves had overestimated rainfall intensity compared with the corresponding in-site nonstationary curves. The regional stationary curves, on the other hand, provided more accurate estimates for the study area, which indicates the importance of considering the local characteristics in developing IDF curves.

Although the regional stationary curves are, in some cases, underestimated the values of their corresponding stationary at-site curves, their differences were insignificant. In contrast, the difference between stationary and nonstationary IDF curves decreased over higher return periods. This could be because higher return periods represent rare events that are less affected by short-term climate variability and more influenced by long-term climate trends, which are better captured by stationary curves. The average values of overestimation (regional nonstationary and regional stationary) in the return period of 2 years is equal to 50 percent, in the return period of 100 years is equal to 25 percent in region two, and in region one is equal to 20 and 43 percent, respectively (Fig. 5). The difference between the two regions is not considerable. However, the existing difference can be caused by the lower uncertainty in region one and the climate characteristics and topography between the regions. The difference between stationary and nonstationary IDF curves decreased over higher return periods. This could be because higher return periods represent rare events less affected by short-term climate variability and more influenced by long-term climate trends, better captured by stationary curves. This exactly corresponds to the result of Mohan et al. (2023). They demonstrated higher levels of non-stationarity were observed in Coastal regions and Eastern parts of India compared to the central parts. Also, the comparison of spatial patterns of rainfall intensity estimates under stationary and nonstationary showed that about 23% of grids showed an overestimation of nonstationary rainfall over their stationary counterparts by at least 15%. The nonstationary curves still show higher estimates, which will increase confidence in applying these curves in hydrological studies.

6 Conclusions

The intensity–Duration–Frequency (IDF) curve is one of the most popular tools in water resources engineering, which can be utilized as an input in designing, planning, and exploiting water resources projects. One of the fundamental problems in many countries is the scattered or inadequate networks of the required meteorological stations such that their data are considered the main bases for IDF construction. This study applied a new model of NGN and nonstationary concepts to develop the regional IDF curves. For this purpose, taking into account the characteristics of longitude, latitude, average annual rainfall, altitude, and maximum 24-h annual rainfall for each station, and using three indicators and CS, Silhouette, and Calinski-Harabasz(CH), the Khuzestan Province was clustered into two separate and possibly homogenous regions.

Regional IDF curves have been proposed based on the stationary concept in the studies published. In this research, we confirmed that most of the stations across the Khuzestan Province exhibited a nonstationary condition, so extreme rainfalls with different durations were significantly affected. The results indicated that the nonstationary IDF curves have consistently overestimated precipitation values compared to the stationary curves due to to the decreasing trend in rainfall over the study area. The difference between stationary and nonstationary IDFs decreased over higher return periods. This could be because higher return periods represent rare events that are less affected by short-term climate variability and more influenced by long-term climate trends, which are better captured by stationary curves.

Availability of Data and Material

Not applicable.

Code Availability

Not applicable.

References

Abdi A, Hassanzadeh Y, Ouarda TB (2017) Regional frequency analysis using Growing Neural Gas network. J Hydrol 550:92–102
Article Google Scholar
Agilan V, Umamahesh NV (2016) Is the covariate based non-stationary rainfall IDF curve capable of encompassing future rainfall changes? J Hydrol 541:1441–1455
Article Google Scholar
Aho K, Derryberry D, Peterson T (2014) Model selection for ecologists: the worldviews of AIC and BIC. Ecology 95(3):631–636
Article Google Scholar
Akaike H (1974) A new look at the statistical model identification. IEEE Trans Autom 19(6):716–723
Article Google Scholar
Alemaw BF, Chaoka RT (2016) Regionalization of rainfall intensity-duration-frequency (IDF) curves in Botswana. J Water Resour Prot 8(12):1128
Article Google Scholar
Asadieh B, Krakauer NY (2015) Global trends in extreme precipitation: climate models versus observations. Hydrol Earth Syst Sci 19(2):877–891
Article Google Scholar
Blanchet J, Ceresetti D, Molinié G, Creutin JD (2016) A regional GEV scale-invariant framework for Intensity–Duration–Frequency analysis. J Hydrol 540:82–95
Article Google Scholar
Caliński T, Harabasz J (1974) A dendrite method for cluster analysis. Commun Stat-Theory Methods 3(1):1–27
Article Google Scholar
Cavanaugh NR, Gershunov A, Panorska AK, Kozubowski TJ (2015) The probability distribution of intense daily precipitation. Geophys Res Lett 42(5):1560–1567
Article Google Scholar
Cheng L, AghaKouchak A (2014) Nonstationary precipitation intensity-duration-frequency curves for infrastructure design in a changing climate. Sci Rep 4(1):1–6
Article Google Scholar
Chou CH, Su MC, Lai E (2004) A new cluster validity measure and its application to image compression. Pattern Anal Appl 7(2):205–220
Article Google Scholar
Dixit S, Jayakumar KV (2022) A non-stationary and probabilistic approach for drought characterization using trivariate and pairwise copula construction (PCC) model. Water Resour Manag 36(4):1217–1236
Article Google Scholar
Fritzke B (1995) A growing neural gas network learns topologies. Neur Inf Sys 625–632
Ganguli P, Coulibaly P (2017) Does nonstationarity in rainfall require nonstationary intensity–duration–frequency curves? Hydrol Earth Syst Sci 21(12):6461–6483
Article Google Scholar
Gao M, Mo D, Wu X (2016) Nonstationary modeling of extreme precipitation in China. Atmos Res 182:1–9
Article Google Scholar
Gilks WR, Richardson S, Spiegelhalter D (1995) Markov chain Monte Carlo in practice. CRC Press
Book Google Scholar
Gohari A, Shahrood AJ, Ghadimi S, Alborz M, Patro ER, Klöve B, Haghighi AT (2022) A century of variations in extreme flow across Finnish rivers. Environ Res Lett 17(12):124027
Article Google Scholar
Hardy JT (2003) Climate change: causes, effects, and solutions. John Wiley & Sons
Google Scholar
Hosking JR (1990) L-moments: Analysis and estimation of distributions using linear combinations of order statistics. J R Stat Soc: Series B (methodological) 52(1):105–124
Google Scholar
Hosking JRM, Wallis JR (1993) Some statistics useful in regional frequency analysis. Water Resour Res 29(2):271–281
Article Google Scholar
Jamali M, Gohari A, Motamedi A, Haghighi AT (2022) Spatiotemporal changes in air temperature and precipitation extremes over Iran. Water 14(21):3465
Article Google Scholar
Kendall M (1962) Rank CorrelationMethods, 3rd edn. Hafner PublishingCompany, New York
Google Scholar
Lee SH, Maeng SJ (2003) Frequency analysis of extreme rainfall using L-moment Irrigation and Drainage. The Journal of the International Commission on Irrigation and Drainage 52(3):219–230
Article Google Scholar
Lenderink G, Van Meijgaard E (2008) Increase in hourly precipitation extremes beyond expectations from temperature changes. Nat Geosci 1(8):511–514
Article Google Scholar
Li J, Lei Y, Tan S, Bell CD, Engel BA, Wang Y (2018) Nonstationary flood frequency analysis for annual flood peak and volume series in both univariate and bivariate domain. Water Resour Manage 32:4239–4252
Article Google Scholar
Lima CH, Kwon HH, Kim JY (2016) A Bayesian beta distribution model for estimating rainfall IDF curves in a changing climate. J Hydrol 540:744–756
Article Google Scholar
Mahmoudi MR, Eslamian S, Soltani S, Tahanian M (2023) Correction to: Regionalization of rainfall intensity–duration–frequency (IDF) curves with L-moments method using neural gas networks. Theoret Appl Climatol 51(1–2):1–11
Article Google Scholar
Mann HB (1945) Nonparametric tests against trend Econometrica. J Econom Soc 245–259.
Mohan MG, AR A, Krishnan A, Rajan A (2023) Hydrologic regionalization of non-stationary intensity–duration–frequency relationships for Indian mainland. H2Open J 6(2):223–241
Article Google Scholar
Motamedi A, Gohari A, Haghighi AT (2023) Three-decade assessment of dry and wet spells change across Iran, a fingerprint of climate change. Sci Rep 13(1):p2888
Article Google Scholar
Overeem I, Syvitski JP (2010) Shifting discharge peaks in Arctic rivers, 1977–2007. Geogr Ann Ser A Phys Geogr 92(2):285–296
Article Google Scholar
Perica S, Dietz S, Heim S, Hiner L, Maitaria K, Martin D, Pavlovic S, Roy I, Trypaluk C, Unruh D, Yan F (2011) Precipitation-Frequency Atlas of the United States Volume 6 Version 23, California
Ragno E, AghaKouchak A, Cheng L, Sadegh M (2019) A generalized framework for process-informed nonstationary extreme value analysis. Adv Water Resour 130:270–282
Article Google Scholar
Rao AR, Hamed KH (1997) Regional frequency analysis of Wabash River flood data by L-moments. J Hydrol Eng 2(4):169–179
Article Google Scholar
Rosner A, Vogel RM, Kirshen PH (2014) A risk-based approach to flood management decisions in a nonstationary world. Water Resour Res 50:1928–1942
Article Google Scholar
Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20:53–65
Article Google Scholar
Sarhadi A, Soulis ED (2017) Time-varying extreme rainfall intensity-duration-frequency curves in a changing climate. Geophys Res Lett 44(5):2454–2463
Article Google Scholar
Schwarz G (1978) Estimating the dimension of a model. Ann Stat 461–464
Singh D, Horton DE, Tsiang M, Haugen M, Ashfaq M, Mei R, Rastogi D, Johnson NC, Charland A, Rajaratnam B, Diffenbaugh NS (2014) Severe precipitation in Northern India in June 2013: Causes, historical context, and changes in probability Bull Am. Meteorol Soc 95(9):S58–S61
Google Scholar
Smith RL (2001) Extreme value statistics in meteorology and the environment. Environ Stat 8:300–357
Google Scholar
Tramblay Y, Neppel L, Carreau J, Sanchez-Gomez E (2012) Extreme value modelling of daily areal rainfall over Mediterranean catchments in a changing climate. Hydrol Process 26(25):3934–3944
Article Google Scholar
Tsakiris GP, Loucks DP (2023) Adaptive water resources management under climate change: An introduction. Water Resour Manag 1–13
Vasiliades L, Galiatsatou P, Loukas AJ (2015) Nonstationary frequency analysis of annual maximum rainfall using climate covariates. Water Resour Manag 29:339–358
Article Google Scholar
Veneziano D, Yoon S (2013) Rainfall extremes, excesses, and intensity-duration-frequency curves: A unified asymptotic framework and new nonasymptotic results based on multifractal measures. Water Resour Res 49(7):4320–4334
Article Google Scholar
Villarini G, Serinaldi F, Smith JA, Krajewski WF (2009) On the stationarity of annual flood peaks in the continental United States during the 20th century. Water Resour Res 45(8)
Westra S, Fowler HJ, Evans JP, Alexander LV, Berg P, Johnson F, Kendon EJ, Lenderink G, Roberts N (2014) Future changes to the intensity and frequency of short-duration extreme rainfall. Rev Geophys 52(3):522–555
Article Google Scholar
Xu L, Zhou H, Du L, Yao H, Wang H (2015) Precipitation trends and variability from 1950 to 2000 in arid lands of Central Asia. J Arid Land 7(4):514–526
Article Google Scholar
Yang L, Li J, Kang A, Li S, Feng P (2020) The effect of nonstationarity in rainfall on urban flooding based on coupling SWMM and MIKE21. Water Resour Manag 34:1535–1551
Article Google Scholar
Yue S, Pilon P, Cavadias G (2002) Power of the Mann-Kendall and Spearman’s rho tests for detecting monotonic trends in hydrological series. J Hydrol 259(1–4):254–271
Article Google Scholar

Download references

Acknowledgements

The authors appreciate the constructive comments of anonymous reviewers on this paper, which helped improve the final version of the paper.

Funding

This research received no external funding.

Author information

Authors and Affiliations

Department of Water Science and Engineering, College of Agriculture, Isfahan University of Technology, Isfahan, 8415683111, Iran
Mohammad Reza Mahmoudi, Moein Tahanian, Alireza Gohari & Saeid Eslamian

Authors

Mohammad Reza Mahmoudi
View author publications
You can also search for this author in PubMed Google Scholar
Moein Tahanian
View author publications
You can also search for this author in PubMed Google Scholar
Alireza Gohari
View author publications
You can also search for this author in PubMed Google Scholar
Saeid Eslamian
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors collaborated in the research presented in this publication by making the following contributions: research conceptualization, Mohammad Reza Mahmoudi (M.R.M.), Moein Tahanian (M.T.), Alireza Gohari (A.G.), Saeid Eslamian (S.E.); methodology, M.R.M., A.G., and S.E.; formal analysis, M.R.M, A.G., and S.E; writing—original draft preparation, M.R.M, M.T., and A.G.; writing—review and editing, M.R.M, M.T., and A.G.; supervision, A.G. and S.E.,

Corresponding author

Correspondence to Mohammad Reza Mahmoudi.

Ethics declarations

Ethics Approval

We confirm that this article is original research and has not been published or presented previously in any journal or conference in any language.

Consent to Participate

Not applicable.

Consent for Publication

All the authors consented to publish the paper.

Conflict of Interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Mahmoudi, M.R., Tahanian, M., Gohari, A. et al. Developing the Regional Nonstationary IDF Curves Using NGN-ProNEVA Framework. Water Resour Manage 37, 5581–5599 (2023). https://doi.org/10.1007/s11269-023-03619-5

Download citation

Received: 29 December 2022
Accepted: 13 September 2023
Published: 03 October 2023
Issue Date: November 2023
DOI: https://doi.org/10.1007/s11269-023-03619-5

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Developing the Regional Nonstationary IDF Curves Using NGN-ProNEVA Framework

Abstract

Similar content being viewed by others

Regionalization of rainfall intensity–duration–frequency (IDF) curves with L-moments method using neural gas networks

Annual Daily Maximum Rainfall-Based IDF Curve Derivation Methodology

Framework for developing IDF curves using satellite precipitation: a case study using GPM-IMERG V6 data

1 Introduction

2 Study Area

3 Dataset

4 Methodology

4.1 Trend Analysis

4.2 Clustering

4.2.1 The Number of Optimal Clusters

4.2.2 Neural Gas Network (NGN)

4.2.3 L-Moments

4.3 Nonstationary IDF Curves

4.3.1 Parameter Estimation

4.3.2 Model Diagnostics and Selection

5 Results and Discussions

5.1 Trend Analysis

5.2 Clustering of Rainfall Stations

5.3 Nonstationary Models

5.4 IDF Curves

6 Conclusions

Availability of Data and Material

Code Availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics Approval

Consent to Participate

Consent for Publication

Conflict of Interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation