1 Introduction

Since as early as the middle of the last century, extreme hydrological events (i.e., high and low precipitation, flood and drought) have been an active research subject. Extreme event magnitude and frequency (return period) are prerequisites for accurately designing and managing various water infrastructure systems (including dam spillways, culverts, storm sewer systems, water supply systems, and flood control structures) and in climate change and flood risk assessment studies. In its fourth assessment report (IPCC 2007), the Intergovernmental Panel on Climate Change (IPCC) stated that, influenced by excessive greenhouse gas emissions, global temperatures are likely to increase during the twenty-first century. This will probably result in changing of the hydrological cycle. Therefore, this changing climate is very likely to affect the frequency and intensity of extreme events. In recent years, a number of worldwide research studies have reported an increase in extreme precipitation events in terms of frequency and magnitude in most parts of the world, even in regions where there has been a decreasing trend in total precipitation (Arnone et al. 2013; Babar and Ramesh 2014; Douglas and Fairbank 2011; Groisman et al. 2005; Lenderink and van Meijgaard 2008; Ma et al. 2015; O’Gorman 2015).

Frequency analysis is a statistical technique that relates the magnitude of extreme events to their frequency of occurrence using an appropriate probability distribution function (Chow et al. 1988). In this technique, a historical precipitation series is used to select an appropriate probability distribution and its associated parameters. In the literature, the analysis of extreme events involves the use of annual maximum (AM) series (Adamowski 2000; Guru and Jha 2015; Li et al. 2016; Saf 2009; She et al. 2013; Xia et al. 2012a, 2012b) and peaks over a threshold (POT) series (Adamowski 2000; Guru and Jha 2015; Li et al. 2014a, b, 2016; She et al. 2013; Xia et al. 2012a, 2012b), which are also known as partial duration (PD) series, as the most common extreme precipitation data series. While the AM series considers only the largest precipitation event in each year, the POT series involves more information about the extremes because the second or third largest precipitation amount may also cause extreme precipitation events in some years (Xia et al. 2012b). For data series with limited records, Madsen et al. (1997a) recommend using POT data series. Madsen et al. (1997b) and others (Zin and Jemain 2010) also recommend the use of POT data series since this approach could improve the parameter estimation of the probability distribution that represents the data.

Unfortunately, there is no unique probability distribution that can be implemented over other distributions for at-site or regional frequency analysis. Therefore, several possible theoretical probability distributions have been used or suggested to describe extreme precipitation amounts (e.g., extreme value type I, called the Gumbel; generalized extreme value (GEV); extreme value type III, called the Weibull; normal; lognormal; gamma; Pearson type 3; log Pearson type 3; exponential; generalized Pareto (GP); and Wakeby), of which the extreme value type I (Gumbel) distribution and log Pearson type 3 distribution have found widespread application in hydrological modeling (Haddad and Rahman 2011).

Al-houri et al. (2014) indicated that the lognormal distribution provides a better fit than a linear distribution to the annual daily maximum precipitation in the Amman Zara Basin (AZB), Jordan. Zhi Li et al. (2014a) found that the skewed normal distribution performs the best among six probability distributions (exponential, gamma, Weibull, skewed normal, mixed exponential, and hybrid exponential/generalized Pareto) in reproducing extreme precipitation events on the Loess Plateau, China. However, Li et al. (2015) showed that the GEV, Burr, and Weibull distributions provide the best fit to both annual and seasonal maximum precipitation based on daily precipitation data at 13 stations in the Heihe River and Shiyang River basins. Beskow et al. (2015) compared four probability distributions (kappa, GEV, Grumbel, and lognormal) to identify the most appropriate distribution for the modeling of extreme precipitation events in southern Brazil. The GEV, among four other probability distributions, namely, the generalized logistic (GLO), Weibull, gamma, and lognormal distributions, represents the distribution with the most accurate seasonal maximum precipitation estimates for southern Quebec, Canada (Benyahya et al. 2014). Vahid Rahmani et al. (2014) used the Weibull distribution to calculate the extreme precipitation frequency using daily precipitation data (1920–2009) from 24 stations in Kansas and 15 stations from adjacent states. Some studies have used the GEV (Du et al. 2014; Madsen et al. 1997a, 1997b; Xia et al. 2012b) and generalized GP (Du et al. 2014; Li et al. 2014a, b; Madsen et al. 1997a, 1997b; Xia et al. 2012b) distributions to analyze historical extreme precipitation events. Moreover, from these studies, the conclusion that the GEV is expected to fit AM series and that the GP is the expected distribution to better fit the POT series is supported. The appropriate probability distribution among candidate distributions is often selected by either statistical tests (the chi-square test, Kolmogorov-Smirnov (KS) test, Anderson-Darling test, Akaike information criterion or Bayesian information criterion) or graphical methods (the quantile-quantile plot or the L moment ratio diagram).

Since it was introduced by Hosking (1990), due to its theoretical advantages listed below in the data and methodology section, the L moment method has been increasingly used in hydrological frequency analysis for the estimation of probability distribution parameters rather than other standard methods, such as the moment method, maximum likelihood method, and least-squares method (Gubareva and Gartsman 2010; Li et al. 2014a, b; Rahman et al. 2013; She et al. 2013; Xia et al. 2012b; Yang et al. 2010; Zakaria et al. 2012). Additionally, this method has been widely used for regional flood (Adamowski 2000; Ellouze and Abida 2008; Noto and Loggia 2009; Saf 2009; Yang et al. 2010) and precipitation (Abolverdi and Khalili 2010; Hassan and Ping 2012; Parida and Moalafhi 2008; Zakaria et al. 2012) frequency analysis, for mapping extreme drought events (Núñez et al. 2011; She et al. 2013; Zin and Jemain 2010) and for low-flow studies (Chen et al. 2006).

Intensive storms often cause property damage and loss of life. The November 5, 2015 flash flood in the capital of Jordan, Amman, resulting from approximately 45 mm of precipitation over 40 min, caused four deaths and caused hundreds of vehicles to be washed away or left stranded. Additionally, as many as 20 families were left without houses. However, studies about precipitation frequency analysis and the selection of an appropriate probability distribution to fit available historical data have received very limited attention from researchers in Jordan. However, most of the precipitation data are not analyzed comprehensively. The focus in studies has been on the general aspects of precipitation (Dahamsheh and Aksoy 2007; Salahat and Al-qinna 2015) (pattern, trend, and variability) and climate change impact (Matouq et al. 2013; Salahat and Al-qinna 2015; Törnros and Menzel 2014) and adaptation scenarios. To the best of my knowledge, only the study performed by Al-houri et al. (2014) has presented the application of the normal and lognormal distributions to fit only the AM data series in the AZB, Jordan. Accordingly, there is a need for reliable estimates of extreme precipitation events. Thus, the major objective of the present study was to examine the applicability of generalized probability distributions (GEV, GP, generalized lognormal (GLN), and GLO) to reliably represent the AM and POT data series at each of a number of selected weather stations in AZB, Jordan. The method of L moments was applied for parameter estimation of each aforementioned distribution. The applicability of the probability distributions was measured by using the Kolmogorov-Smirnov goodness-of-fit test and the L moment ratio diagram. Special emphasis was placed on the investigation of extreme precipitation trends over time using the Mann-Kendall (M-K) test. A secondary objective was to determine the precipitation amount at each selected weather station for the AM and POT data series under return periods of 5, 10, 25, and 50 years based on the best-fitted distribution. The results of this research will provide valuable information and constitute the basis for future studies on extreme precipitation analysis and climate change assessments in the AZB.

2 Data and methodology

2.1 Study area and datasets

The Amman Zara Basin (AZB), located in the northwest part of Jordan between 213000 to 319000 east and 1140000 to 1220000 north (according to the Palestine grid), was selected as the study area. The total basin area is approximately 4150 km2, of which 3725 km2 is located in Jordan, and the remaining area is in Syria (Fig. 1). The population density in AZB is relatively high, including the two largest urban cities in Jordan: Amman (the capital) and Zara. The AZB also contains most of Jordan’s commercial and industrial activities.

Fig. 1
figure 1

Amman Zara Basin and the location of weather stations

The elevation of the AZB ranges from 350 m below mean sea level at the Jordan River to 1100 m above mean sea level near Sweilih. The eastern margin of the basin is part of the desert plateau. Toward the west, the basin changes to highland and then becomes progressively steeper until it reaches the Jordan valley.

Daily precipitation records from 24 weather stations spread across the AZB during a period that exceeds 50 years (1940–2015 for most stations), collected from the Ministry of Water and Irrigation in Jordan, were used in this study. The locations of these stations across the basin are shown in Fig. 1, with the reference station ID, station name, station coordinates, and statistical characteristics of each station given in Table 1. The selected stations have relatively complete records for the whole period.

Table 1 Selected weather stations basic information (ID, name, coordinates, and elevation) and their statistics based on annual precipitations

The climate in AZB is considered a semiarid type with uneven spatial and temporal distributions of precipitation amounts. The bulk of this amount is concentrated between October and May (i.e., in the winter season). Statistically, the largest precipitation frequencies occur in January or February, with the maximum precipitation usually occurring in December or January. With decreasing altitude, the precipitation amount decreases toward the east from approximately 500 mm average annual precipitation in the west (Sweilih station) to approximately 100 mm in the east (Sabha and Subhiyeh station) (Table 1).

The coefficient of variation (CV), a statistic measuring the relative deviation from the average, is calculated in Table 1 based on annual precipitation and is an indicator of the interannual precipitation variation (i.e., year-to-year variability) throughout the basin. The spatial distribution of the CV, with values ranging from 0.32 to 0.51, indicates high interannual variability, which becomes more variable in the middle and east parts of the basin as aridity increases.

In this study, both the annual maximum (AM) series and the peak over threshold (POT) series are considered at each station. The AM series consists of the annual maximum daily precipitation, while the POT series contains all the daily precipitation whose magnitude exceeds the predefined threshold value for all values of recorded daily precipitation. More details on the selection of the threshold value will be given below in the results and discussion section (i.e., Section 3.1).

2.2 The Mann-Kendall test

The Mann-Kendall (M-K) test (Kendall 1975; Mann 1945) is the most widely used nonparametric statistical test for temporal trend detection in meteorological and hydrological data series (Ahmad et al. 2015; Arnone et al. 2013; Atta-ur-Rahman and Dawood 2017; Deni et al. 2010; Douglas and Fairbank 2011; Fiala et al. 2010; Haktanir et al. 2013; Ouarda et al. 2014; Porto de Carvalho et al. 2013; Xia et al. 2012b), such as temperature, precipitation, and streamflow. The test is rank-based, powerful when data are skewed, allows missing values and does not require that the data conform to the normal probability distribution (Yue et al. 2002).

The M-K test involves testing of the hypothesis in which the null hypothesis, H0, assumes that the data are independent, identically distributed, and not correlated (i.e., no trend will be observed). The alternative hypothesis, Ha, is that the data follow a monotonic trend over time (i.e., there will be either a significant positive or a significant negative trend). The calculation of the M-K test statistics S and V(S) and standardized test statistics ZMK for a time series of n observation data points are defined as follows (Kendall 1975).

$$ S=\sum \limits_{i=1}^{n-1}\sum \limits_{j=i+1}^n\operatorname{sign}\left({x}_j-{x}_i\right) $$
(1)

where S computes the difference between a later-measured value and all earlier-measured values, (xj− xi), by assigning integer values for (xj− xi) as follows: (xj− xi) = {1 for (xj− xi) > 0; 0 for (xj− xi) = 0; and − 1 for (xj− xi) < 0}.

For large values of n, the statistic S is approximately normally distributed (Kendall 1975) with a mean, E(S) equal to zero, and the variance is given by \( V(S)=\frac{n\left(n-1\right)\left(2n+5\right)}{18} \). Therefore, the standardized M-K test statistic ZMK is given by \( {Z}^{\mathrm{MK}}=\frac{S-1}{\sqrt{V(S)}} \) for S > 0, ZMK = 0 for S = 0, or \( {Z}^{\mathrm{MK}}=\frac{S+1}{\sqrt{V(S)}} \) for S < 0 and follows a standard normal distribution.

The null hypothesis is rejected (i.e., a significant trend exists that is either positive or negative) if\( \left|{Z}^{\mathrm{MK}}\right|>{Z}_{1-\alpha /2}^{\mathrm{critical}} \)in a two-sided test. The critical value \( {Z}_{1-\alpha /2}^{\mathrm{critical}} \) determined from the standard normal table at a significance level α of 0.05 is 1.96. Thus, a positive ZMK value (ZMK > 1.96) denotes a significant increasing trend at the 5% significance level, and a negative value (ZMK < − 1.96) denotes a significant decreasing trend at the 5% significance level.

2.3 Extreme precipitation probability distribution models

In this research, four generalized probability distributions (namely, GEV, GP, GLN, and GLO) are used to fit the AM and POT data series. Table 2 presents the probability density functions (PDFs) and the cumulative density functions (CDFs) of these distributions with their domains. In these distributions, the random variable x represents an extreme precipitation amount, ξ is the location parameter, α is the scale parameter, and k is the shape parameter.

Table 2 The probability density functions (PDF), cumulative distribution function (CDF) and domain for selected extreme probability distribution in this study

Fitting each of the aforementioned probability distributions to the observed data set can be accomplished by estimating its parameters (i.e., shape, location, and scale parameters). In this study, parameters are estimated using the L moment method.

2.4 L moment method

The main role of L moments is in estimating parameters for probability distributions. Additionally, L moments are considered another way to describe the shape of a probability and data samples (Hosking 1990) (i.e., provide measures of location, dispersion, skewness, and kurtosis). L moments, as defined by Hosking (1990) and Hosking and Wallis (1997), appeared to be a modification of probability-weighted moments (PWMs) as proposed by Greenwood et al. (1979). Essentially, L moments are linear combinations of probability-weighted moments.

The main advantage of L moments over conventional moments is that the former are linear functions of the data (Hosking 1990). The conventional moment estimators involve squaring, cubing, and quadrupling the observed data, as in the sample variance, sample coefficient of skewness and sample coefficient of kurtosis, respectively, which gives greater weight to observed data far from the mean value, leading to high bias and variance. Thus, L moment estimators of the dimensionless coefficients of variation and skewness and kurtosis are almost unbiased (Hosking 1990; Stedinger et al. 1993). Additionally, L moments suffer less from the effects of sampling variability (Hosking 1990) and are therefore not as sensitive to extreme values (maximum and minimum), and L moment estimates from small samples are subject to small bias. Furthermore, the required computation is quite limited compared with other traditional techniques, such as maximum likelihood and least-squares (Pandey et al. 2001). L moments are capable of characterizing a wider range of distributions compared to conventional moments. A distribution may be specified by its L moments, even if some of its conventional moments do not exist (Hosking 1990). Accordingly, the method of L-moments has found wide application in the field of hydrology, especially in the case of parameter estimation of probability distributions in frequency analysis studies (Gubareva and Gartsman 2010; Li et al. 2014a, b; Rahman et al. 2013; She et al. 2013; Xia et al. 2012b; Yang et al. 2010; Zakaria et al. 2012; Hosking 2006).

In practice, L moments are estimated using the observation data x(i) from a finite sample of size n that have been arranged in ascending order. The first four L moments are given by \( {\lambda}_1={b}_0=\overline{X} \), λ2 = 2b1 − b0, λ3 = 6b2 − 6b1 + b0 and λ4 = 20b3 − 30b2 + 12b1 − b0, where b0, b1, b2, and b3 are sample unbiased estimators of the PWMs and defined by:

$$ {b}_0=\sum \limits_{i=1}^n\frac{1}{n}{x}_{(i)}\kern0.5em ,\kern0.75em {b}_1=\sum \limits_{i=2}^n\frac{\left(i-1\right)}{n\left(n-1\right)\ }\ {x}_{(i)}\kern0.75em ,\kern0.5em {b}_2=\sum \limits_{i=3}^n\frac{\left(i-1\right)\left(i-2\right)\ }{n\left(n-1\right)\left(n-2\right)\ }\ {x}_{(i)},\kern0.5em {b}_3=\sum \limits_{i=1}^n\frac{\left(i-1\right)\left(i-2\right)\left(i-3\right)}{n\left(n-1\right)\left(n-2\right)\left(n-3\right)\ }\ {x}_{(i)} $$

According to terminology in Hosking (1990), λ1 is the L location, a measure of central tendency, and represents the mean of the distribution. λ2 is the L scale, analogous to the standard deviation. Additionally, other dimensionless quantities called L moment ratios, defined as L skewness τ3 (τ3 = λ3/λ2) and L kurtosis τ4 (τ4 = λ4/λ2), are measures of skewness and kurtosis, respectively. Finally, the dimensionless ratio called the coefficient of L variation, defined as τ2 (τ2 = λ2/λ1), is analogous to the conventional CV. The estimates of the scale, location, and shape parameters of the GEV, GP, GLO, and GLN distributions by L moments are given in Table 3 as developed by Hosking (1990).

Table 3 The L moments estimates of parameters and predicted precipitation amount associated with return period T years for selected extreme probability distribution in this study

2.5 Return period

In hydrology, the return period (T), also known as the average recurrence interval, is a means to express exceedance probabilities. The return period is defined as the average length of time in years in which an event of a given magnitude is to be equaled or exceeded on average once. Mathematically, the return period of an event of a given magnitude is equal to the reciprocal of the probability of exceedance, expressed as a fraction, of that event; i.e., T = 1/p, where p denotes the exceedance probability P(Xx). In terms of F, the cumulative density function evaluated for a particular event is given by T = 1/(1 − F). Table 3 gives the precipitation amount (i.e., the quantile) of GEV, GP, and GLO for a specified return period of T years.

To accomplish the second objective, the best fitted probability distribution among GEC, GLO, GP, and GLN for a particular AM or POT data series was used to predict the precipitation amount at each station when the return period T was 5, 10, 25, and 50 years.

2.6 Assessment of goodness-of-fit

The goodness-of-fit of a probability distribution describes its consistency with an observed data set. Most of the procedures that test goodness-of-fit are presented as special implementations of a more general method of comparing data with a distribution. This kind of comparison can be framed in terms of a hypothesis test in which the null hypothesis (Ho) for the test is the hypothesized probability distribution that fits the data adequately at a particular significance level. In this study, nonparametric Kolmogorov-Smirnov (KS) goodness-of-fit tests at the 5% significance level were employed to identify the applicability of the aforementioned probability distributions to fit the AM and POT data series. Moreover, the probability distribution that best fit the data among the applicable distributions was the one with the minimum value of the KS test statistic. In addition, a graphical assessment known as the L moment ratio diagram was employed to further evaluate whether the AM and POT data series conformed to specific probability distributions.

2.6.1 KS goodness-of-fit test:

The KS goodness-of-fit test is a nonparametric test based on a comparison of the cumulative distribution functions of the hypothesized distribution F0(x) and the empirical stepwise distribution Fn(x) mathematically expressed in Eq. (2) after the data series is sorted in ascending order. The KS test statistic (Dn) is the maximum absolute difference between the hypothesized and empirical values of the cumulative distribution over the entire range of X, Dn = max |F0(x) − Fn(x)|. The null hypothesis is rejected if the calculated Dn value exceeds a tabulated critical value (Dn-critical) for the sample size n and 5% significance level.

$$ {F}_n(x)=\left\{\begin{array}{c}0\kern5.5em ,x<{x}_1\\ {}\frac{k}{n}\kern2.25em ,{x}_k<x<{x}_{k+1}\\ {}1\kern6em ,x>{x}_n\end{array}\right. $$
(2)

where x1, x2, … . . , xn are the values of the ordered extreme precipitation amount and k is the rank of the precipitation amount in the data series organized in ascending order.

2.6.2 L Moment ratio diagram

The L moment ratio diagram is a graphical method for the evaluation of the goodness-of-fit. This method was introduced by Hosking (1990) as a useful tool for the selection of a suitable distribution among alternative distribution functions to describe observed data. The L moment ratio diagram contains the L kurtosis value (τ4) plotted against the L skewness value (τ3). For various distributions, the theoretical relationship of τ4 as a function of τ3 is available in the form of polynomial approximation expressions, as suggested by Hosking and Wallis (1997). The L moment diagram is usually constructed in practical ranges of 0 < τ3 < 0.5 and 0 < τ4 < 0.4 (Hosking and Wallis 1997). For the considered distributions in this study, these expressions are given as follows:

$$ {\tau}_4^{\mathrm{GEV}}=0.10701+0.11090{\tau}_3+0.84838{\tau}_3^2-0.06669{\tau}_3^3+0.00567{\tau}_3^4-0.04208{\tau}_3^5+0.03763{\tau}_3^6 $$
(3)
$$ {\tau}_4^{\mathrm{GP}}=0.20196{\tau}_3+0.95924{\tau}_3^2-0.20096{\tau}_3^3+0.04061{\tau}_3^4 $$
(4)
$$ {\tau}_4^{\mathrm{GLO}}=0.16667+0.83333{\tau}_3^2 $$
(5)
$$ {\tau}_4^{\mathrm{GLN}}=0.12282+0.77518{\tau}_3^2+0.12279{\tau}_3^4-0.13638{\tau}_3^6+0.11368{\tau}_3^8 $$
(6)

For each station, the sample estimates of L skewness (t3) and L kurtosis (t4) for the AM and POT data series were calculated and represented as point coordinates (t3, t4) in the L moment diagram. The best-fitted distribution function can be selected by comparing the vertical distance from the point (t3, t4), representing the station, to various candidate distribution function curves. The nearest curve to the point (i.e., the minimum vertical distance) can be chosen as the best-fitted distribution function for that station (She et al. 2013; Xia et al. 2012b; Zin and Jemain 2010).

3 Results and discussion

3.1 POT series threshold selection

The POT series contains all the daily precipitation whose magnitude exceeds a predefined threshold value for all values of recorded daily precipitation. The selection of the most proper threshold value to be used for generating the POT series remains a crucial task. A low threshold value can violate the underlying assumption of the serial independence of POT series (Zin and Jemain 2010), while a high threshold value will generate few extreme data points, is inadequate for probability distribution parameter estimation (Li et al. 2014a, b) and leads to high variance (Coles 2001). Several methods have been introduced for the selection of an optimal threshold value, such as the percentile value method, mean excess plot method, absolute critical value method, and Hill plot method.

In this study, the threshold is determined by the percentile value method together with the annual average occurrence number (AAON) criterion. AAON is defined as the number of data points in the POT data set divided by the timespan of data (Li et al. 2014a, b). Four POT extreme data series were constructed based on the 90th, 95th, 97.5th, and 99th percentile values. The optimal threshold value is the percentile value corresponding to the POT data series with an AAON criterion between 1 and 2 (Li et al. 2014a, b; Liu et al. 2017). Table 4 shows the potential threshold values based on the percentile method, as well as AAON values at each weather station in AZB. As illustrated, the 95th percentile of the recorded daily precipitation can be considered the optimal precipitation threshold for all weather stations in AZB except for three weather stations, AL0010, AL0018, and AL0019. The 97.5th percentile is a more suitable optimal precipitation threshold for these three stations. These percentile values (95th and 97.5th) as optimal precipitation thresholds are not surprising results for a semiarid climate region with rare precipitation events, as in AZB. Additionally, as the AAON value increases, the threshold value decreases.

Table 4 The precipitation 90th, 95th, 97.5th, and 99th percentile values and AAON values, in curly brackets, at each weather station in AZB

3.2 The distribution and variation of AM and POT

Figure 2 illustrates the AM and POT series for the Sweilih (AL0017) and Sabha and Subhiyeh (AL0058) weather stations as an example since these series include the highest and lowest annual precipitation averages, respectively. In the POT series at the Sweilih weather station, the tenth largest precipitation in 1992 was included, and up to the fifth largest precipitation in some other years was also included (i.e., 1951, 1971, and 1991). Additionally, the maximum precipitation amount of 44.5 mm in 2006 is lower than the tenth largest precipitation amount in 1992 of 45.5 mm in the POT series at the Sweilih weather station. Moreover, 21 out of 50 extreme values in the AM series are not included in the POT series for the Sabha and Subhiyeh weather station. These observations confirm previous findings in the literature, for instance (Xia et al. 2012b), that the POT series must also be considered when precipitation extremes are studied since the POT series provides more information on extremes than the AM series.

Fig. 2
figure 2

AM and POT series of a Sweilih weather station (AL0017) and b Sabha and Subhiyeh weather station (AL0058)

Figure 3 depicts the spatial distribution of the mean and the CV of the AM and POT series in AZB. The means (Fig. 3a, b) for both the AM and POT series have similar distributions with east-to-west gradients in which the maximum values are almost observed in the western part and the minimum values are almost observed in the eastern part. From Fig. 3c, d, a similar spatial pattern can be observed for AM and POT for the CV values. Large CV values are almost observed in the northeastern part and western part of the basin, while minimum values are almost observed in the middle part. Additionally, the values of the CV are larger than 0.37 in the whole area for AM, while for POT, the values are smaller than 0.44 in most of the region. It is also apparent that the mean of the POT series is slightly higher than that of the AM series, which indicates that the POT series considers more information on extremes than the AM series. However, the CV of the AM series is higher than that of the POT series. This suggests that the majority of POT series values are near the threshold level at each station.

Fig. 3
figure 3

The spatial distribution of the mean and the coefficient of variation (CV) of AM and POT series in AZB

3.3 Trend analysis of extreme precipitation

Table 5 summarizes the results of the Mann-Kendall (M-K) test. The table shows the test statistic value ZMK obtained at each station for the annual maximum precipitation (AM) and the POT. The ZMK values indicate that a mix of increasing and decreasing trends is observed at different stations for both series. However, not all trends are significant. For the AM series, an increasing trend in extreme precipitation is detected at 14 stations, among which three are significant at the 5% significance level (i.e., the null hypothesis Ho for the M-K test is rejected since |ZMK| > 1.96), including AL0002, AL0045, and AL0047. The first station is located in the northern part of AZB, whereas the other two stations are located in the western part of the basin. A decreasing trend is detected at 10 stations. None of these decreasing trends is significant.

Table 5 The Mann-Kendall (M-K) test statistic value (ZMK) and the Kolmogorov-Smirnov (KS) test statistic value (Dn) of AM and POT series for each weather station

Since the POT series considers up to the fifth largest precipitation in some years, in contrast to the AM series, and at the same time skips the largest precipitation in the AM series in other years, the trend analysis results for the POT series differ slightly from those for the AM series. For the POT series, an increasing trend in extreme precipitation is detected at 16 stations, 4 of which are significant at the 5% significance level. These stations with significant increasing trends include one more station, AL0003, in addition to those stations found by trend analysis of the AM series reported earlier. A decreasing trend is detected at eight stations, among which one station (i.e., AL0054) indicates a significant decreasing trend. In contrast to the AM series, when the POT series is considered, four stations show increasing trends rather than decreasing trends (AL0012, AL0015, AL0019, and AL0053), and two stations show negative trends rather than positive trends (AL0004 and AL0036).

Figure 4 illustrates the results of the M-K test. It is worth noting that increasing-trend stations are mostly found in the western part of the basin, while decreasing trends are found at stations in the northwestern, middle and eastern parts of the basin.

Fig. 4
figure 4

The spatial distribution of the trends of AM and POT series in AZB

The occurrence date and amount of maximum precipitation for each station throughout the entire study period are presented in Fig. 5. The highest values are mostly located in the eastern part of AZB, and the smallest values are mostly located in the western part. This accompanies the decrease in elevation and transition from semiarid to arid climate zones. The month with the most frequent occurrence of the maximum precipitation amount at most stations is January, followed by December (10 stations and 8 stations, respectively); these stations are mostly located in the eastern and western parts of AZB. In the middle part of the basin, the occurrence months of maximum precipitation are distributed among December, January, February, and March. Figure 5 further reveals that maximum precipitation events at nine stations occurred after 2009. Eight of these nine stations show positive trend values (refer to the M-K test results in Table 5). Three stations experience a significant trend. This suggests that the number of extreme precipitation events increased slightly in the last decade.

Fig. 5
figure 5

Occurrence date for maximum precipitation (month/day/year) and its amount in millimeters (number in red color) for each station through entire study period in AZB

3.4 The best-fit probability distribution for extreme precipitation

In this study, both the AM and POT extreme precipitation series are fitted by GEV, GP, GLO, and GLN distributions. The shape, location, and scale parameters for each distribution are estimated by the L moment method. Table 5 summarizes the results of the goodness-of-fit test (Kolmogorov-Smirnov (KS) test). The table shows the test statistics value Dn obtained at each station for the AM and POT series by using the four distributions. The smaller the value of the test statistic Dn, the better the probability distribution fits the extreme precipitation data series. Table 5 presents the optimal distribution for each station in AZB. Bold values in Table 5 indicate that the probability distribution is not adequate to fit the data series at the 5% significance level (i.e., the null hypothesis Ho for the KS test is rejected) since the Dn values are greater than the related critical value (Dn-critical = 1.36/\( \sqrt{n} \) for a sample size n at the 5% significance level).

Based on the KS test (Table 5), for the AM series, out of 24 stations, 11 are best fitted by the GLN distribution, 7 are best fitted by the GLO distribution and 6 are best fitted by the GEV distribution. None of the stations are fitted by the GP distribution. Most distributions passed the KS test at the 5% significance level, indicating that the sample distribution follows the theoretical distribution, except for the GP distribution, which failed to pass the test at 13 stations. However, for the POT series, most stations (18 out of 24) follow the GP distribution. A small number of stations can be better fitted by the GLN, GEV, and GLO distribution (3, 2, and 1 stations, respectively). The four distributions passed the KS test at the 5% significance level.

The graphical method for the evaluation of the optimal distribution through L moment diagrams is illustrated in Fig. 6. In Fig. 6, the sample estimates and theoretical L skewness and L kurtosis are compared for both the AM and POT series. For the AM series, the weather station sample estimates of L skewness (t3) and L kurtosis (t4) points (t3, t4) are scattered widely around the theoretical curves of the four distributions. Based on the criterion that the theoretical distribution curve with the minimum vertical distance will be chosen as the best optimal distribution, the number of stations that can be better fitted by GEV, GLN, GLO, and GP is 9, 7, 7, and 1 out of 24, respectively. For the POT series, most station sample estimate points are concentrated near the GP distribution. There are 19 out of 24 stations that can be better fitted by the GP distribution, and the remaining stations are better fitted by the GLN distribution. No station is fitted by either the GEV distribution or GLO distribution.

Fig. 6
figure 6

Sample estimates and theoretical L skewness and L kurtosis are compared for a AM and b POT series in the AZB

The results of the goodness-of-fit tests (KS test and L moment diagram) suggest that among the GEV, GLN, and GLO distributions, there is no unique distribution that can be consistently ranked best for all stations using the AM series, while the POT series can be better fitted by the GP distribution in AZB. Figure 7 presents the spatial distribution of best-fitted probability distributions for the (a) AM series and (b) POT series according to the goodness-of-fit KS test results in AZB. There is no clear pattern for the best-fitted distribution connected with any spatial location for the AM series.

Fig. 7
figure 7

The spatial distribution of best-fitted probability distribution for a AM series and b POT series according to the goodness-of-fit Kolmogorov-Smirnov (KS) test results in the AZB

The result that the best-fitted distributions vary among the weather stations for the AM series could be due to the uneven spatial variation in AM series characteristics (i.e., mean, standard deviation, skewness and kurtosis) within AZB. These characteristics may influence the effectiveness of the probability distributions to represent precipitation extremes at the considered weather stations. For example, GEV mostly ranks the best for stations with low skewness (0.3 to 1.2) and low kurtosis (− 0.2 to 2.4), while GLO mostly ranks the best for stations with high skewness (1.4 to 2.2) and high kurtosis (2.7 to 7.3). However, the influence of the aforementioned data series characteristics together with the optimal distribution parameters and the size of the data series should be appropriately investigated in future studies to obtain definitive conclusions. Meanwhile, one can still depend on goodness-of-fit tests (statistical tests or graphical methods) to identify the best distribution among candidate distributions.

3.5 Extreme precipitation estimation under different return periods

After obtaining the optimal distribution at each station, the important question about the occurrence chance of precipitation events larger than a certain amount in any given year in the future can be answered. This is accomplished through return period (T) analysis, where the magnitude of an extreme precipitation event is inversely related to its occurrence frequency (i.e., the extreme precipitation value is defined as the maximum value for which the probability that the annual maximum exceeds this value is 1/T). Therefore, extreme precipitation values under return periods of 5, 10, 25, and 50 years for the AM and POT series using the optimal distribution at each station in the AZB were calculated (equations are given in Table 3) and are illustrated in Fig. 8. For all considered return periods, the spatial variations in extreme precipitation in both the AM and POT series show similar distribution patterns, with decreasing gradients from the west to the east, which is in agreement with the spatial distribution of the means of the AM and POT series given in Fig. 3a, b. Additionally, considering the same return period, the calculated extreme precipitation values of AM are up to 18% greater than those of POT.

Fig. 8
figure 8

The distribution of calculated extreme precipitations in millimeters under return periods of 5, 10, 25, and 50 years for AM (ad) and POT (eh) series using the optimal distribution at each station in the AZB

Figure 9 presents the spatial distribution of the observed extreme precipitation amount when the return period is 50 years. In this figure, the maximum observed precipitation amount for each station over the last 50 years is assumed to be the station-observed extreme precipitation value when the return period is 50 years. From a comparison of the calculated extreme precipitation values based on the optimal distribution for both the AM (Fig. 8d) and POT (Fig. 8h) series under a return period of 50 years with the corresponding observed extreme values (Fig. 9), the observed values are higher than the calculated extreme precipitation values of the AM and POT series.

Fig. 9
figure 9

The spatial distribution of observed extreme precipitation in millimeters when the return period is 50 years in the AZB

Figure 10 shows the changes in precipitation extremes (i.e., the quantiles or return values) with an increasing return period (T) for the Sweilih (AL0017) weather station as an example. In this figure, for both the AM and POT series, the calculated precipitation extremes based on an optimal distribution together with the observed precipitation extremes are represented by the solid line and the circles, respectively. The return periods (T) associated with observed precipitation extremes are calculated by the Weibull plotting position formula, \( T=\frac{1}{1-\frac{i}{n+1}} \), where i is the rank of the extreme precipitation amount in the data series organized in ascending order and n is the length of data series. From this figure, the calculated precipitation amount based on the optimal distribution provides a good match to those observed from the AM and POT series, even for high return periods. Thus, the GLN and GP distributions can fit the observed extremes from the AM and POT series well, respectively, at this particular station (the results of the KS test, Table 5, are used to determine the optimal distribution).

Fig. 10
figure 10

Calculated precipitation based on optimal distribution and observed precipitation amount for AM and POT series for AL0017 weather station

Moreover, the calculated extreme precipitation amount from both the AM and POT series based on the optimal distribution can be compared with the observed value under a 50-year return period in terms of relative error (i.e., the difference between the calculated AM or POT value and the observed value relative to the observed value for a specified return period). The relative error values are illustrated in Fig. 11. From Fig. 11, the relative errors for the AM series are less than those for the POT series at most stations, suggesting that the AM series and the corresponding optimal distributions at each station can better model extreme precipitation events in the AZB than the POT series. Furthermore, no correlation is found between the observed extreme precipitation amounts and the relative error value at each station.

Fig. 11
figure 11

Relative errors between the calculated and observed extreme precipitation from both AM and POT series at each weather station in AZB when return period is 50 years

To obtain another representation of extreme precipitation at each station, the calculated extreme precipitation based on the optimal-distribution-fitted AM series for return periods of 10, 25, 50, and 100 years together with the corresponding total exceedance numbers of precipitation extremes during the study period of 1940–2015 are presented in Table 6. From Table 6, most of the extreme precipitation from 1940 to 2015 at each station occurred within 1- to 10-year return periods (data for 1- and 2-year return periods are not shown in Table 6). Moreover, the highest occurrence frequency of 10-year precipitation extremes occurred during 2010–2015, with 19 occurrences out of 81. Additionally, as the return period increases, extreme precipitation becomes rare. A total of 32, 17, and 12 events occurred within 25, 50, and 100 years, respectively, of precipitation extremes at all stations. For 25-year precipitation extremes, the number of occurrences in the 1980s is quite comparable to that occurring in the 2000s and during 2010 to 2015, with 8, 6, and 7 out of 32 occurrences, respectively. Eight out of 12 events and 8 out of 17 events occurred after 1990 among 100- and 50-year precipitation extremes, respectively. The 17 events for the 50-year return period occurred only at 13 stations, while the 12 events for the 100-year return period occurred at 12 stations. However, for 50- and 100-year extreme precipitation events, each station had at least one extreme precipitation event, either a 50- or 100-year amount, during its recorded data history, except station AL0053. The 50- and 100-year extreme precipitation events have occurred more frequently in recent years (i.e., after 1990) than in previous years. Higher precipitation extremes of 106.6 to 173.8 mm for the 50-year return period are almost observed in the western part of AZB. High precipitation extremes of 127–194 mm for the 100-year return period are almost observed in the western (AL0017, AL0018, and AL0027) and northern (AL0002 and AL0004) parts of AZB. Stations AL0017 and AL0018 are located in the city of Amman, the capital of Jordan.

Table 6 Calculated extreme precipitations at different return periods and total exceedance numbers of precipitation extremes during the study period

4 Conclusions

In this paper, precipitation extremes are studied in the AZB, Jordan. Daily precipitation records from 24 weather stations spread across the basin during a period that exceeds 50 years (during 1940–2015 for most stations) are used. Two extreme precipitation series (AM and POT); four generalized probability distributions (GEV, GP, GLO, and GLN); and the L moment method for distribution parameter estimation are used. Based on the results, the following specific conclusions can be drawn:

The trend analysis for the AM and the POT over the time period 1940–2015 indicates that a mix of increasing and decreasing trends is observed at different stations. Since the POT series considers up to the fifth largest precipitation in some years, in contrast to the AM series, and skips the largest precipitation in the AM series in other years, the trend analysis results for the POT series differ slightly from those of the AM series. For the AM series, extreme precipitation events at 14 stations, which are mostly found in the western part, show an increasing trend. Among these 14 stations, three are significant at the 5% significance level. The remaining 10 stations, which are mostly found in the middle and eastern parts, show insignificant decreasing trends. For the POT series, an increasing trend in extreme precipitation is detected at 16 stations, 4 of which are significant at the 5% significance level. A decreasing trend is detected at eight stations, among which one station indicates a significant decreasing trend.

According to the Kolmogorov-Smirnov (KS) test and L moment diagram, the probability distributions GEV, GLN, and GLO can better fit the AM series with no unique distribution among them consistently ranking the best for all stations, while the POT series is better fit by the GP distribution.

The extreme precipitation amounts under return periods of 5, 10, 25 and 50 years for the AM and POT series using the optimal distribution at each station show similar spatial distribution patterns with a decreasing gradient from west to east. Additionally, by considering the same return period, the calculated extreme precipitation amounts of the AM series are up to 18% greater than those of the POT series. The AM series can better model extreme precipitation events than the POT series in AZB since the relative errors between observed values and calculated values for the AM series are less than those for the POT series at most stations.

From the return period analysis, most of the extreme precipitation from 1940 to 2015 at each station occurred within 1- to 10-year return periods. Additionally, as the return period increases, extreme precipitation becomes rare. However, 23 stations experienced at least one extreme precipitation event, either a 50- or 100-year event, during their recorded data history. The 50- and 100-year extreme precipitation events occurred more frequently in recent years. For the 100-year return period, high precipitation extremes of 127–194 mm are almost observed in the western and northern parts of AZB at five stations, two of which are located in the city of Amman, the capital of Jordan.

In terms of application, the findings of this study about the best probability distributions for each of the data series and the prediction of precipitation extremes amounts for several return periods provide important and valuable information for water resources planning and management tasks in AZB and nearby basins in Jordan. Furthermore, these findings can be used to provide a theoretical support to allow the water managers and the policymakers in Jordan for proper actions such as construction of a proper drainage system and construction of flood control structures to control and minimize the risks of large damages caused by frequent precipitation extremes occurred in recent years in AZB.