1 Introduction

Intensity–duration–frequency (IDF) curve is one of the most common tools in water resources engineering, which can be used as an input in planning and designing, and exploitation of water resources projects. One of the common problems in many countries is the scattered or very weak networks of the required meteorological stations such that their data are considered the main bases for IDF construction. To this problem, a regional analysis of rainfall depth and building the IDF curves have been proposed.

The IDF concept refers to Bernard’s efforts in 1932 (Bernard 1932), and a lot of the studies focused on improving the statistical inference methods used in IDF (Bell 1969). One of the noticeable researches in this field is Hasking and Wallis’ study (1997) on developing a method for L-moments estimation, probability-weighted moments (PWM) (Greenwood et al. 1979), parametric formulation of IDF relations (Koutsoyiannis et al. 1998), and employing the regional methods like the Index-Flood method. Today, Atlas of IDFs has been built in developed countries. One of the works is the National Oceanic and Atmospheric Administration (NOAA) atlas 14, which was created by American National Weather Services (Perica et al. 2013).

The regional analysis uses the group statistics and characteristics from co-behavioral stations instead of using data only from one station. Several studies using regional methods on the extreme rainfalls suggest that these techniques increasingly reduce the doubts about the estimates resulting from the at-site view of point (Lee and Maeng 2003). One of the main problems to expand frequency analysis results from one or more stations to one region is the hydrological lack of homogeneity in the region. Despite the suitability of cluster analysis for grouping the hydrological features, the homogeneity of the regions is not completely achieved. So, it is recommended to examine and test the cluster analysis results with the other conventional methods (Rousseeuw 1987).

Soltani et al. (2017), using the characteristics of the rainfall time scale and three variables of average daily rainfall intensity, the standard deviation of daily rainfall intensity, and scale index, drew the regional IDF curves for Khuzestan province and the absolute error of estimates for this method, which was mainly below 25%, and confirmed the results were acceptable.

Using topographic and rainfall characteristics, Alemaw and Chaoka (2016) divided Botswana into three hydrologically homogeneous regions using the K-means clustering model. They also used both gamma and lognormal probability distribution to estimate the depth of rainfall at the return periods up to 100 years for the mentioned areas.

Amin and Shaaban (2004) used generalized extreme values distribution(GEV) and extreme values distribution (EV1) with the least square method in the estimation of distribution parameters for the IDF curves in Peninsular Malaysia.

The IDF regionalization is very beneficial in terms of shortening the steps and required time to perform the calculations as well as providing it for the area and not only for the station.

Identifying the homogeneous regions is usually the most crucial and difficult step and a prerequisite for the frequency analysis hypotheses between the hydrologic frequency analysis stages of the region. This study presents a method based on the neural gas and growing neural gas networks to cluster the hydrological data and determine the homogeneous regions. The neural gas network is one of the types of competitive neural networks and uses an unsupervised teaching method. The network was first introduced by Martinez and Schulten (1991). One of the features of this algorithm is learning the topology or distribution shape of the data space. One of the issues with this algorithm is that it starts working with several elements, which makes the algorithm too slow at first. This problem was solved 4 years later when the growing neural gas network was presented by Fritzke (1994). The number of neurons in the growing neural gas network increases during the learning process regardless of prior knowledge and the governing structure of the inputs.

Abdi et al. (2017) investigated the ability of the growing neural gas network to regionalize the drought index for 40 synoptic stations in Iran, and the results of the heterogeneity evaluation based on L-moments showed the success of this algorithm compared to the other methods in determining the homogeneous sub-regions.

The application of neural gas networks in clustering has also been considered in other scientific disciplines such as robotics (Carlevarino et al. 2000; Ferrer 2014), medicine (Cselényi 2005; Oliveira Martins et al. 2009; Angelopoulou et al. 2015), and economics (Decker 2005; Lisboa et al. 2000). Therefore, this algorithm can be used for clustering and image segmentation.

No studies have been conducted to regionalize the IDF curves using clustering based on neural gas networks. Also, in the field of hydrology and water resources, neural gas networks have not been used so far except for a few cases (Abdi et al. 2017).

After using neural gas networks and other models for clustering, it is required to investigate the formed regions and stations in each area in terms of homogeneity and discordancy. For this purpose, Husking and Wallis tests, which are based on L-moments, are known as the best method for regional analysis.

L-moments were presented by Hosking (1990), and it has been found the great importance and application in many regional analyses. The most crucial applications of L-moments include detecting the homogeneous regions, determining the discordant stations, selecting the appropriate distribution function, and estimating the parameters of distribution functions (Hosking and Wallis 1997). The main advantage of L-moments over ordinary moments is that they can describe a larger range of distributions, and the estimates have less bias. They also work better at displaying outlier events (Rao and Hamed 1997).

In this study, both models of neural gas networks have been used to regionalize the IDF curves in Khuzestan province. The results were then compared using a test based on L-moments combining the conventional clustering models like Ward, K-means, SOM,Footnote 1 and FCM.Footnote 2 The IDF regional curves were also extracted with the L-moment concepts.

Several studies with the L-moments method have analyzed the frequency of extreme rainfalls and extracted the appropriate distribution function for each region. For instance, Yang et al. (2010) divided the Pearl river basin in China into six homogeneous clusters, and they performed the regional frequency analysis for maximum annual 1, 3, 5, and 7 days of rainfall with L-moments. The goodness of fit tests showed that the distribution functions of PE3,Footnote 3 GLO,Footnote 4 and GEV fit well in the most areas in the homogeneous regions.

Kyselý et al. (2007), Jingyi and Hall (2004), and Kjeldsen et al. (2002) studies in the application of statistical methods have confirmed L-moments and showed that PWMs and L-moments are preferred to the classic estimation methods, especially for the regional studies.

Using L-moments and regionalizing the GEV distribution, Ariff et al. (2016) extracted the regional IDFs for the Malaysian Peninsular region. They evaluated the application of this method due to the associated simplicity as well as its efficiency in areas without appropriate stations.

Eslamian and Feizi (2007), using L-moments and GEV distribution, compared the regional frequency analysis of maximum monthly rainfall in the Isfahan region, Iran, with their single-site values. They described the L-moments method as an accurate and helpful tool for confirming the similarities or differences in regional rainfall frequency analysis.

2 Study area and data

This study provided data from 21 rain gauge stations located in Khuzestan province, Iran, by Province Water and Electricity Organization. The minimum record length related to the Payepol station was 16 years, and the maximum record length related to Ahwaz stations and the Shohada dam was 42 years (Table 1). The data used to determine the number of optimal clusters and also the input data to clustering models include the geographical latitude and longitude variables, height from sea level, maximum average precipitation (MAP), maximum daily precipitation (MDP), annual rainfall average for each station which has been shown in Table 1. Khuzestan province, which covers 4% of the country’s total area, is the largest in the western half of the country. This province is located between 47°41′ to 50°39′ east longitude and 29°58′ to 33°04′ north of the equator. Despite having only 4% of the country’s total area, this province owns more than 30% of the country’s surface water.

Table 1 Details of meteorological stations

3 Methodology

In this section, a suggested method to extract the IDF regional curves has been explained; the steps are as follows:

  1. 1-

    Determine the number of optimal clusters using hydrologic data provided in Table 1.

  2. 2-

    Implement of neural gas networks and other common models mentioned in the clustering.

  3. 3-

    Investigate the homogeneity of the formed regions and the discordance of the stations in each region.

  4. 4-

    Determine the appropriate statistical distribution for each region and estimate the distribution parameters at the required duration.

  5. 5-

    Investigate the quantiles or amounts of precipitation in duration and the required return periods.

  6. 6-

    Draw the regional IDF curves.

3.1 Neural gas (NG) network

The rule of learning in the neural gas network is as follows:

$${w}_{i}^{new}={w}_{i}^{old}+{\alpha }_{i}(x-{w}_{i}^{old})$$
(1)
$${\alpha }_{i}=\varepsilon {e}^{\frac{{-k}_{i}}{\lambda }}$$
(2)

where \({w}_{i}\) is a gas molecule formed in data space. The number of these molecules is initially assumed as a value, and eventually, it is revised to have the logical and optimal function of the algorithm. These elements also have been selected in the main data range. \({\alpha }_{i}\) is a parameter that specifies the learning rate and depends on \({k}_{i}\) and λ. As if λ tends to infinity, learning of the whole neurons would be equal and if it tends to zero, then the nearest neuron begins to learn. The extreme modes of λ are not suitable alone, and usually, a mode between them is chosen. \({k}_{i}\) refers to the superior neuron to the i neuron. Ɛ is also a constant number that controls the learning rate.

To create a neighborhood between the first and second neurons in terms of proximity, an edge is created. For each neuron, there is \({c}_{i.j}\epsilon \left\{0.1\right\}\) which shows that there exists an edge or neighborhood or does not exist and also \({t}_{i.j}\epsilon \left\{0.1.2\dots .\right\}\) which shows the time intervals (age) from the last meeting or re-edge, that if it exceeds more than one size, the neighborhood will be broken. This approach helps the neural network to learn topology.

NG algorithm can be summarized as follows:

  • Step 1: A random position of \({w}_{i}\) is created in the data space.

  • Step 2: An input named x is selected from the expected data.

  • Step 3: Aging, which includes computation of the distance between x, and the centers of \({w}_{i}\) and \({k}_{i}\) aging for each center.

  • Step 4: Adaption or learning.

    $${w}_{i}^{new}={w}_{i}^{old}+\varepsilon {e}^{\frac{{-k}_{i}}{\lambda }}(x-{w}_{i}^{old})$$
    (3)

The main point is that during the training period, as the algorithm progresses, the learning speed should be reduced; otherwise, the neural network will be repeated, and an incorrect cycle will be created. For this purpose, the amount of λ and \(\varepsilon\) should be decreased as learning progresses. So, the following function would be used.

$$\begin{array}{c}\mathrm{G}(\mathrm{t})={G}_{i}{(\frac{{G}_{f}}{{G}_{i}})}^{\frac{t}{{t}_{max}}}\\ {\lambda }_{i}>{\lambda }_{f} , {\varepsilon }_{i}>{\varepsilon }_{f} , {T}_{i}<{T}_{f}\end{array}$$
(4)

where i index shows the parameter value at the beginning of learning and f index shows the value of the parameter at the end of the learning process. For instance, if t = 0, so \(\lambda \left(t\right)={\lambda }_{i}\), and if\(t={t}_{\mathrm{max}}\), so\(\lambda \left(t\right)={\lambda }_{f}\).

  • Step 5: An edge between the first two ranks in terms of proximity and age of this edge is considered equal to zero (create a neighborhood).

  • Step 6: Age of all edges increases (\({t}_{i.j}\to {t}_{i.j}+1\))

  • Step 7: It is assumed that \({k}_{i}=0\), and for each j which is \({t}_{i.j}>T\), it is considered as \({c}_{i.j}=0\). At this step for the reasons mentioned in step 4, T should be increased during the learning period to reduce the degree of rigidity, which means the edges are allowed to last longer.

  • Step 8: If the termination conditions are not met (for example, the maximum quantity of neurons or any amount of performance), the step 2 is repeated. Otherwise, algorithm steps would be finished.

3.2 Growing neural gas (GNG) network

GNG algorithm, which is based on unsupervised artificial neural networks, was first introduced by Fritzke (1994). The GNG network is a clustering algorithm working step by step; the number of neurons increases without using previous knowledge about the structure of input patterns during the learning process (Fink et al. 2015). Unlike classical clustering algorithms, the GNG algorithm owns a compatible network structure which makes it suitable for learning the large data set topologies (Zaki and Yin 2008). The main idea of GNG is that it will continuously add the new nodes (neurons) to a small initial network in a growing structure. In the GNG network, the neurons compete to determine which one is most similar to the input data set (Morell et al. 2014).

GNG algorithm can be summarized as given below:

  • Step 1: Creating two random neurons at locations \({w}_{1}\) and \({w}_{2}\)

  • Step 2: Selecting vector input called x

  • Step 3: Finding the best neuron (\({s}_{1}\)) and second-best neuron (\({s}_{2}\))

  • Step 4: Increasing age of all edges connected to \({s}_{1}\)

    $${\forall }_{j} :{t}_{{s}_{1}j}\leftarrow {t}_{{s}_{1}j}+1$$
  • Step 5: Increasing the amount of accumulated error in \({s}_{1}\)

    $${E}_{{s}_{1}}={E}_{{s}_{1}}+{\Delta E}_{{s}_{1}}$$
    (5)
    $${\Delta E}_{{s}_{1}}={\Vert {w}_{{s}_{1}}-x\Vert }^{2}$$
    (6)
  • Step 6: Adaptation

    $${w}_{{s}_{1}}^{new}={w}_{{s}_{1}}^{old}+{\varepsilon }_{b}(x-{w}_{{s}_{1}}^{old})$$
    (7)
    $$\begin{array}{c}{w}_{n}^{new}={w}_{n}^{old}+{\varepsilon }_{n}\left(x-{w}_{n}^{old}\right)\\ {\varepsilon }_{b}>{\varepsilon }_{n}\end{array}$$
    (8)
  • Step 7: Creating an edge between \({s}_{1}\) and \({s}_{2}\) if there is not any.

    $${C}_{{s}_{1}{s}_{2}}=1. {t}_{{s}_{1}{s}_{2}}=0$$
  • Step 8: All edges that their age is more than T will be deleted.

    $${t}_{ij}>T\to {C}_{ij}=0$$
  • Step 9: If the number of inputs presented to the network is an integer multiplier of L, a new neuron is created. This neuron is created at the location of \({w}_{r}\).

    $$\begin{array}{c}w_r=\frac12\left(w_q+w_f\right)\\C_{fq}=0.\;C_{rf}=C_{rq}=1\end{array}$$
    (9)

    q is the neural index which has the most amount of accumulated error; f is the neighbor index of q which has the most errors.

\({E}_{f}\) and \({E}_{q}\) errors with α coefficient are declined:

$$E_f\leftarrow{\alpha E}_f\;\;\;.\;\;\;E_q\leftarrow\alpha E_q\;\;\;\alpha<1.$$

Consider error \({E}_{r}\) equals to \({E}_{q}\). \({E}_{r}={E}_{q}\)

  • Step 10: Decreasing the accumulated error of all neurons. \({E}_{i}\leftarrow {dE}_{i} d<1\)

  • Step 11: If the stop measurement (for example maximum number of neurons or any scale of performance) has not yet been met, step 1 would be repeated.

3.3 Discordancy and heterogeneity measures

An area containing N stations is considered so that the i station has the record length of ni and the ratio of L-moments \({t}^{(i)}.\) \({t}_{3}^{(i)}\) and \({t}_{4}^{(i)}\). In this case, the discordancy criterion \({D}_{i}\) would be calculated using the below relations.

$${u}_{i}={\left[{t}^{(i)}.{t}_{3}^{(i)}. {t}_{4}^{(i)}\right]}^{T}$$
(10)
$$\overline{u }=\frac{1}{N}\sum\nolimits_{i=1}^{N}{u}_{i}$$
(11)
$$s={\left(N-1\right)}^{-1}\sum\nolimits_{i=1}^{N}\left({u}_{i}-\overline{u }\right){\left({u}_{i}-\overline{u }\right)}^{T}$$
(12)
$${D}_{i}=\frac{1}{3}{\left({u}_{i}-\overline{u }\right)}^{T}\left({u}_{i}-\overline{u }\right){S}^{-1}$$
(13)

where \({u}_{i}={\left[{t}^{(i)}.{t}_{3}^{(i)}. {t}_{4}^{(i)}\right]}^{T}\) is the L-moment ratio matrix in station i, N is the number of stations, and S is the sample covariance matrix.

If \({D}_{i}\) is big, the location i is discordant. An appropriate criterion to determine if a station is discordant or not is that \({D}_{i}\) is bigger than 3 or equal to it.

To calculate the degree of heterogeneity, first \({V}_{1}\) would be obtained using Eq. (14) for the observed data.

$$V_1=\sum\nolimits_{i=1}^Nn_i\left(t^{\left(i\right)}-\overline t\right)^2/\sum\nolimits_{i=1}^Nn_i$$
(14)
$$\overline{t }=(\sum\nolimits_{i=1}^{N}{N}_{i}{t}^{(i)})/(\sum\nolimits_{i=1}^{N}{n}_{i})$$
(15)

where \({n}_{i}\) is the size of samples in the station i, \({t}^{\left(i\right)}\) is the sample L-moment (L-CV), \(\overline{t }\) is the point average of sample moment (L-CV).

For each simulated area, \({V }_{1}\) would be calculated. Also, from simulated data, average \({\mu }_{v}\) and standard deviation \({\sigma }_{v}\) and inhomogeneity criterion would be determined through relation 16.

$${H}_{i}=\frac{({V}_{i}-{\upmu }_{v})}{{\sigma }_{v}}$$
(16)

Hosking and Wallis (1997) suggested that an area can be an acceptable homogenous area if \({\mathrm{H}}_{\mathrm{i}}\) is smaller than 1, and it can be relatively heterogeneouss if \({H}_{i}\) is between 1 and 2, and it would be definitely heterogeneous if \({\mathrm{H}}_{\mathrm{i}}\) is bigger than 2. In practice, the H1 criterion is more appropriate (Rao and Srinivas 2006).

3.4 Selecting the appropriate distribution

Selecting an appropriate frequency distribution for homogeneous regions can be done by comparing the distribution moments with the average regional moment of the data. Also, to select the best distribution, a goodness of fit test will be performed for the distribution function. This test would be done through calculation statistics of \({Z}^{Dist}\). An appropriate distribution function is a function which is \(\left|{Z}^{Dist}\right|<1.64\).

$${Z}^{Dist}=({\tau }_{4}^{Dist}-{\overline{\tau }}_{4}+{\beta }_{4})/{\sigma }_{4}$$
(17)
$${\beta }_{4}={N}_{sim}^{-1} {\sum }_{m=1}^{{N}_{sim}}({\overline{\tau }}_{4m}-{\overline{\tau }}_{4})$$
(18)
$${\sigma }_{4}={\left\{{\left({N}_{sim}-1\right)}^{-1}\sum_{m=1}^{{N}_{sim}}{\left({\overline{\tau }}_{4m}-{\overline{\tau }}_{4}\right)}^{2}-{N}_{sim}{\beta }_{4}^{2} \right\}}^{1/2}$$
(19)

Here “dist” means distribution, \({\uptau }_{4}^{\mathrm{Dist}}\) is the size or distribution kurtosis criterion (LCK), \({\overline{\uptau } }_{4}\) is the areal average of L-moment sample kurtosis, \({\upbeta }_{4}\) the area bias value of the above moment, \({\sigma }_{4}\) is the regional deviation of the above moment, and \({N}_{\mathrm{sim}}\) is the number of simulated areas.

3.5 Regionalization of IDF curves

A set of desired quantiles for the station j which has a record length of \({n}_{\mathrm{j}}\) and located in a region with the \({\mathrm{N}}_{\mathrm{s}}\) stations are shown by \({\mathrm{Q}}_{\mathrm{j}}\). Rainfall observation data, \({\mathrm{x}}_{\mathrm{j}}\) for the station in specified quantiles, T would be calculable through Eq. (20). So, rainfall data sets in the station j can be calculated as follows:

$$\begin{array}{c}x_{j.k}=Q_j\left(T_k\right)\\k=1\dots.n_j\;\;.\;\;j=1\dots.N_s\end{array}$$
(20)

If the area is homogeneous, the set of quantiles for the station j will be as per Eq. (21).

$$T=\mathrm{2,5},\mathrm{10,20,50,100 }j=1,\dots ,n {Q}_{j}\left(T\right)={\mu }_{j} {X}_{T}$$
(21)

In Eq. (21), \({\mathrm{X}}_{\mathrm{T}}\) is a set of dimensionless regional quantile with a probability of not exceeding f which is called the regional growth curve. \({\upmu }_{\mathrm{j}}\) is the scale factor for station i, where parameters such as mean or median are considered to simplify the calculations.

The value of variation coefficient moment and ratios of L-moments for the station j using single site data, \({\mathrm{x}}_{\mathrm{j}}\), is equal to their amounts for regional data. As a result, it will be possible to estimate the regional quantiles X, by equating the first to fourth moments of the region with the mean, the coefficient of variation moments, and the L-moments ratios of the distribution function considered for the region.

By estimating the set of quantiles X, for the maximum annual series of rainfall intensities in each duration and the desired return period in a homogeneous region, along with estimating the scale factor \({\mu }_{j}\), for only one station in the region, different values (i, d, T) using Eqs. (20) and (21) will be computable. So, it is not needed to estimate the probability of distribution function for every single annual series in each station. Finally, using these values, a regional IDF curve will be drawn for each homogeneous region.

To investigate the differences between the regional IDF curves which are based on the regional distribution functions with the stationary IDF curves, three equations of the coefficient of variation of root mean square error (CVRMSE), mean percentage difference (Δ), and mean bias error (MBE) as per below were used. The lower the CVRMSE and Δ values, the more accurate the model used in clustering. Also, the negative MBE values indicate overestimation, and the positive values indicate an underestimation of the regional values than the at-station rainfall values.

$${CV}_{RMSE}=\frac{\sqrt{\frac{1}{{N}_{d}{N}_{T}}\sum_{d=1}^{{N}_{d}}\sum_{T=1}^{{N}_{T}}{({x}_{d.T}-{z}_{d.T})}^{2}}}{\frac{1}{{N}_{d}{N}_{T}}\sum_{d=1}^{{N}_{d}}\sum_{T=1}^{{N}_{T}}{x}_{d.T}}\times 100$$
(22)
$$\Delta =\frac{1}{{N}_{d}{N}_{T}}\sum\nolimits_{d=1}^{{N}_{d}}\sum\nolimits_{T=1}^{{N}_{T}}\frac{\left|{x}_{d.T}-{z}_{d.T}\right|}{{x}_{d.T}}\times 100$$
(23)
$$MBE=\frac{\sum_{d=1}^{{N}_{d}}\sum_{T=1}^{{N}_{T}}({x}_{d.T}-{z}_{d.T})}{{N}_{d}{N}_{T}}$$
(24)

In the above relations, \({\mathrm{x}}_{\mathrm{d}.\mathrm{T}}\) and \({\mathrm{z}}_{\mathrm{d}.\mathrm{T}}\) are respectively maximum rainfall intensity in duration d and the return period T in the specified station and the homogeneous area in which the station is located. Also, \({N}_{d}\) and \({N}_{d}\) are the number of durations and return periods.

4 Results

4.1 Cluster analysis

Used data sets should be normalized before entering the clustering models. This is due to data from the different types, such as geographical and precipitation data, which also have different units. Based on these normalized data, probabilistic homogeneous regions were determined using clustering models, including the new method of neural gas networks and the common models of Ward, K-means, self-organizing map, and fuzzy C-means.

CS (Chou et al. 2004), Silhouette (Rousseeuw 1987), and Calinski-Harabasz (Caliński and Harabasz 1974) indices were used to determine the optimal number of clusters. Figure 1 shows the number of optimal clusters in a range of clusters. Since the highest value in Silhouette and Calinski-Harabasz indices and the lowest value in CS indicate the number of optimal clusters, the number of region in all three models is equal to 2.

Fig. 1
figure 1

CS, Calinsky-Harabasz, and Sillhouette values for determining the optimal number of clusters

The output of clustering models based on two separate regions in the specified area, which demonstrates how stations are divided between these two regions, is illustrated in Fig. 2, as it is known that the result of clustering for some stations is different in several models which is due to the differences in the performance of each model. The Ward, SOM, and K-means models have shown the same performance. Also, region 1 occupies more eastern areas, and region 2 occupies the central and western areas of the province. According to the height index of each station, the results show that the stations with the higher altitudes in the eastern cluster and the lower stations in the western cluster are divided, which confirms the proper functioning of neural gas networks in terms of topographic detection of the data space.

Fig. 2
figure 2

The result of the location of stations in the clusters was identified by (a) GNG method, (b) NG method, (c) FCM method, and (d) SOM, K-means, and Ward methods

The constant parameters used in both neural gas and growing neural gas networks are presented in Table 2.

Table 2 Employed parameters in the NG and GNG

4.2 Regional homogeneity tests

The H and Di statistics, which are tests based on L-moments, were used to investigate the regional homogeneity and discord of the stations in each region. These values were determined for the maximum intensity of annual rainfall at different duration, as well as the various models used for clustering. These results are presented for 24-h rainfall in Table 3. Referring to the results, in none of the applied clustering models in different time durations, there was no discordant station. Except for a few cases, all of the regions formed in different models were homogeneous, which indicates the reasonable accuracy of the clustering models used.

Table 3 Results of heterogeneity and discordancy measures for 24-h rainfall duration

According to the goodness of fit test results, the generalized logistics distribution (GLOG) and GEV distribution are selected as the regional distribution function for regions 1 and 2, respectively. To estimate the required quantiles, it is necessary to calculate the parameters of regional distribution. For this purpose, in all of the used clustering models, the first to fourth moments of both generated regions are considered equal to the first to fourth moments of the distribution function considered for the region. The neural gas network model (NG) results are presented as an instance in Table 4.

Table 4 Estimated parameters of GEV and GLOG distributions as the regional probability distributions of annual maximum storm intensities for two regions in the NG model

To determine the best clustering model, the numerical values of regional curves with the same stationary curves were compared. As shown in Table 5, according to the calculated estimate error values, the neural gas network clustering model has the lowest error amount in both indicators, which shows the superiority of this method over the other methods. Also, the negative MBE index for this model indicates that the numerical values of the regional IDF curves obtained from this method are somewhat more significant than the at-station values (overestimation). Also, by considering all three indicators, the growing neural gas network can be considered the second suitable model. By considering the error values between the regional IDF with at-station IDF, which are presented in Table 6, it is concluded that in stations with a short record length, the estimated error amount has been increased, and if this station is removed, the better performance can be expected from the used clustering models. The comparison between the two types of regional and at-site curves in the four selected stations is illustrated in Fig. 3.

Table 5 Average values of goodness-of-fit indices of the difference between IDF curves based on regional and at-site probability distributions for the used clustering models
Table 6 Values for goodness-of-fit indices of the difference between IDF curves based on regional and at-site probability distributions at 21 rainfall stations for the NG clustering model
Fig. 3
figure 3

IDF curves for four rainfall stations in Khuzestan province (Note: The values on each curve are the return period (T), blue lines are at-site, and red lines are regional IDF curves)

5 Conclusions

In this study, two new models of neural gas and growing neural gas networks were presented to regionalize the IDF curves. For this purpose, taking into account the characteristics of longitude, latitude, average annual rainfall, altitude, and maximum 24-h annual rainfall for each station, and using three indicators and CS, Silhouette, and Calinski-Harabasz(CH), it has been determined that Khuzestan province has two separate and possibly homogenous regions. Then, using different clustering models, homogeneous regions have been formed. Clustering was one of the most important and main steps of this research due to the associated sensitivity and great impact on the final result. Therefore, clustering operations were performed using six different models, Ward, K-means, FCM, and Self-organizing map (SOM), which are among the most widely used methods. In addition to the four methods mentioned, two new models of the neural gas network (NG) and growing neural gas network (GNG) were used for clustering.

To investigate the homogeneity of the two regions, as well as the discordancy of the stations in each region, in all the six models in eleven durations, the regional homogeneity tests and discordancy tests based on L-moments were used. In most models and different durations, the regions created by clustering had a good homogeneity. After determining the position of each station in the dual regions and identifying both areas as homogeneous, the regional distribution function was determined, and then the regional IDFs were extracted using the L-moments method. The regional IDF curve obtained for each area was compared with the at-station IDF curves in the same area. The results showed that in all of the stations, the regional and stationary IDF curves are highly consistent and show the same trend. This research is the first one to evaluate the efficiency of neural gas networks in regionalizing the IDF curves. Among extracted regional IDFs, the curves obtained from the region composed of neural gas networks and growing neural gas network models had the highest accuracy and the most compliance with the at-station curves, which indicates the efficiency of these models in terms of regionalization. The quality of operation of neural gas networks can improve the various issues and problems related to water resources management and planning.