Introduction

In arid regions, knowledge of rainfall variability, intensities, shifts, and trends are important for variety of applications in water resources management under the global warming effects. This variability pattern may cause different impacts on human life and engineering structure designs (Maragatham 2011; Goyal 2014). According to the IPCC et al. (2007) report, until the end of the twenty-first century, the global temperature increase is expected to cause increase in the frequency and amount of different weather events leading to extraordinary hydrological phenomena as floods, droughts, water shortages, or stresses as well as on the agricultural productivity (Wentz et al. 2007; IPCC et al. 2012; Artlert et al. 2013).

Several studies are carried out on the analysis of rainfall variability and distribution in arid regions. For instance, Zaman et al. (2012) analyzed the regional flood frequency in an arid region (Australia) to develop growth curve to estimate design floods for the ungauged catchments with reasonable accuracy. They concluded that this may be adapted to other similar regions of the world along with the locally developed prediction equation for the scaling to derive preliminary design flood estimates. Gong et al. (2004) analyzed daily precipitation of 30 gauge stations in the semiarid area of northern China from 1956 to 2000. Their study showed that the number of rainy days is decreasing significantly at a rate of −1.56 day per 10 years, implying an increase in drought stress. They conclude that global warming and human-induced land cover changes are supposed to be frequently related, at least partly, to the changes in precipitation.

In the Kingdom of Saudi Arabia (KSA), rainfall can be described as being little, unpredictable, and irregular but very extensive during the local storms. Almazroui et al. (2012) studied the recent climate change in the Arabian Peninsula and the seasonal rainfall and temperature climatology of KSA for 1978–2009. Surface observations indicated that, irrespective of season, rainfall insignificantly increase in the first period (1978–1993) and then significantly decrease in the second period (1994–2009). Elfeki and Al-Amri (2010) used Markov chains for modeling monthly rainfall records in KSA; they developed two spreadsheet models to check the probability distribution function (PDF) of the rainfall data at some stations and the other model is for calculating the Markov chain parameters from the data series. Alhassoun (2011) developed empirical formulae to estimate rainfall intensity in the Riyadh region, where the formula was derived using three different frequency methods (Gumbel, log-Pearson Type III, and log normal). These intensity-duration-frequency (IDF) curves are obtained for different durations and frequency periods. He stated that the derived equation could predict rainfall intensity in the Riyadh region for any return period with a given storm duration and calibrated parameters obtained from IDF curves according to a given storm duration.

Elsebaie (2012) developed a rainfall-duration-frequency relationship for two regions at Najran and Hafr Albatin regions in the KSA. Gumbel and the log-Pearson Type III of PDF frequency analyses techniques are used to develop the IDF relationship from rainfall data of these regions. Rainfall intensities obtained from these two methods showed good agreement with results from previous studies on some parts of the study area.

The western part of KSA receives a moderate amount of rainfall with long memory of dry periods due to its geographic nature and location within the subtropical zone. Subyani and Al-Ahmadi (2011) adapted runoff model from the US Soil Conservation Service and applied for the catchments of five selected ungauged dry wadis in the Al-Madinah area. Data from 16 rain gauges that have been recording the annual maximum daily rainfall for over 30 years are analyzed for derivation of the Gumbel extreme value PDF for 25, 50, and 100-year return periods. Hydrographs for different return periods are drawn using the results from this analysis along with the morphometric parameters of the wadi catchments. Regional maps of maximum probable precipitation and probable maximum flood are also produced for the study area.

Subyani (2012) studied the flood vulnerability in the western part of the KSA which has extremely high spatial and temporal variability in rainfall occurrence. His study showed that the prediction of annual maximum 24-h duration rainfall along with prediction of 25-, 50-, 100-, and 200-year return periods for the best fit of Gumbel PDFs. In addition, the probable maximum precipitation is also estimated on the basis of the same PDF. He concluded that the study will aid decision-making for future project designs and flood protection studies.

Knowledge and prediction of wet and dry spell probability occurrence, rainfall variability, shifts, and trends are critical when considering the impact of the climate change and can be estimated with reliability from reasonably available records (Şen 1983; Alyamani and Şen 1992; Subyani 2004). Wet and dry sequences are treated by different authors (Lioubimtseva 2004; Şen 2009; Almazroui 2011).

Subyani and Hajjar (2014) present detailed features of dry and wet spell durations and rainfall intensity series available (1971–2012) on daily basis for the Jeddah area, in the western Saudi Arabia. Their study showed trend changes in dry and wet spell durations and rainfall amount on daily, monthly, and annual time series. Three rain seasons are proposed in this investigation as “high” rain, “low” rain, and “dry” seasons. It shows that the overall average dry spell durations is about 80 consecutive days while the average wet spell durations is 1.39 days on the basis of average rainfall intensity threshold of 8.2 mm/day. Annual and seasonal autorun analyses confirm that the rainy seasons are tending to have more intense rainfall; in general, the seasons are becoming drier.

Because of the tremendous development in Al-Madinah city and modern projects in the Prophet’s Masjed and the increasing number of residents and visitors (MRSC 2013), these development projects need administrative decisions based on not only classical approaches but also additionally on consideration of the climate change and rainfall variability conditions.

The main objectives of the present study are to identify the statistical characteristic of the rainfall on daily bases and using the reduction method to disaggregate some missing daily rainfall records to hourly time series. The average empirical formula will be developed from IDF curves for Al-Madinah city using and testing different suitable methods of 10, 20, 30, 60, 120, 180, 360, 720, and 1440-min rainfall depth for different return periods (5, 10, 25, 50, 100, and 200-year). Another objective is to generate of wet and dry spells and daily rainfall amounts for the next 43 years based on the most suitable PDF fit for these sequences with best goodness of fit. The proposed methodology will be applied to the observed daily rainfall records for 1972–2014 from Al-Madinah rainfall stations located in western region of the KSA.

Study location and rainfall data

The Al-Madinah area, located in the western part of KSA, is bounded between latitude 24° 30′ and 24° 45′ N and longitude 39° 30′ and 39° 45′ E (Fig. 1). Geologically, the Al-Madinah area is a part of the mid-western Precambrian Arabian Shield, which is bounded on the west by the Red Sea rift and on the east by the Paleozoic and younger sedimentary rocks. The area is classified under predominately arid but hot summer and cooler winter seasons on the average. The mean temperature ranges from 24 to 40 °C in the summer and from 15 to 25 °C in the winter. Rainfall is sporadic, characterized by moderate to high variations in space and time. Each year, the rainy season is from October to April. The average annual rainfall is about 50 mm with some variations outside the metropolitan areas. In addition, the extraordinary increases of rainfall in frequency, occurrence, and amount are critical when considering the impact of the climate change. Al-Madinah climate station is located in Al-Madinah city with longitude 39.58° E and latitude 24.52° N, and the elevation is about 600 m above sea level (Fig. 1).

Fig. 1
figure 1

Location map of the study area

For the purpose of IDF curves and rainfall generation, 10-, 20-, 30-, 60-, 120-, 180-, 360-, 720-, and 1440-min duration rainfall data for the period 1972 to 2014 (43 years) are obtained from the Ministry of Water and Electricity, Hydrology Division (Ministry of Water and Electricity 2014) for Al-Madinah weather station (M001). Rainfall intensity ranges from 3.6 to 74 mm/h and for durations of 10 to 1440 min. Table 1 shows the annual maximum daily rainfall for Al-Madinah station.

Table 1 Annual maximum daily rainfall depth (mm) with number of rainy days

Unfortunately, 50 % of the rainfall intensity records are for 24 h only. However, to fill this gap and complement the data, methods suggested by Bell (1969) and Rathnam et al. (2000) are used to disaggregate daily rainfall to hourly time series. The maximum values of 24 h are converted into shorter duration (1, 2, 3, 6, and 12-h) values using the reduction formula suggested by the Indian Meteorological Department (Rathnam et al. 2000), as

$$ {p}_t={p}_{24}\ {\left(t/24\right)}^{1/3} $$
(1)

where, P t is the required rainfall depth in millimeters for t-hour duration, P 24 is the daily rainfall in millimeters, and t is the duration of rainfall for which the rainfall depth is required in hours. This formula gives the best estimation of short duration rainfall (Chowdhury et al. 2007; Awadallah et al. 2011).

Rainfall intensity-duration-frequency curves

In arid areas, rainfall does not occur uniformly; variations in its intensity occur not only from the center to the peripheries of the storm rainfall but also temporally at a given point. In many design problems related to watershed management, it is necessary to know the rainfall intensities of different durations and return periods. One of the first steps in many hydrologic design projects is the determination of the rainfall event or event to be used for design purposes. Rainfall IDF curves are derived from the statistical analysis of single storm rainfall records over a period of time and they are used to capture important characteristics of point rainfall for any desired shorter durations. In other word, it is defined as the calculation of average design rainfall intensity for a given exceedance probability over a range of duration.

The IDF formula is in the form of empirical equation representing the relationship among maximum rainfall intensity, as dependent variable, and other parameters such as rainfall duration and frequency (as independent variables). The general form of IDF curves can be written as follows (Chow et al. 1988)

$$ {I}_{\mathrm{T}}=\frac{P_{\mathrm{T}}}{t_{\mathrm{d}}} $$
(2)

where \( {I}_{\mathrm{T}} \) is the design rainfall intensity (mm/h), \( {t}_{\mathrm{d}} \) is the duration (hours), and \( {P}_{\mathrm{T}} \) is return periods (years).

The typical technique for establishment of the IDF curves can be achieved after the execution of the following steps.

  1. (a)

    Processing and selection of reliable of maximum annual single storm rainfall.

  2. (b)

    Finding the best fit as the reliable PDF to each group of the data values for a set of specific durations.

  3. (c)

    From the fitted relationships the rainfall intensity for any duration and return period can be derived.

  4. (d)

    Finally, the IDF curves can be obtained in two different ways, either by plotting the intensity versus duration for different return periods or the rainfall intensity is related in functional relationship to the rainfall duration and the return period using regression analysis (Chow et al. 1988; Alhassoun 2011).

There are commonly used theoretical distribution functions that are applied in many hydrological studies. Two common distributions are used in this study, namely, (Gumbel and log-Pearson Type III PDFs).

Methodology

Gumbel’s method

This is one of the most widely used PDF for calculating extreme values in hydrological and meteorological studies for the prediction of extreme meteorological events such as flood peaks, maximum rainfalls, and maximum wind speed. According to this PDF, a flood is the largest of the 365 daily flows, and the annual series of flood amounts constitutes a series of flow values. Its PDF is given by,

$$ p=1-{e}^{-{e}^{-y}} $$
(3)

where p is the probability of a given flow being equal or exceeded and y is the reduced variate as a function of the convenient PDF from statistical tables. In order to obtain the frequency rainfall depth P T (in mm) for any rainfall duration t d (in hour) with specific return period T r (in years), it is necessary to employ the following relationship.

$$ {P}_{\mathrm{T}}=\overline{P}+{K}_T{\sigma}_{\mathrm{P}} $$
(4)

where \( \overline{P}\kern0.5em \mathrm{and}\kern0.5em {\sigma}_{\mathrm{P}} \) are the mean and standard deviation of rainfall in specific duration (mm), respectively. \( {K}_T \)is the Gumbel frequency factor of the t given by (Chow et al. 1988) as

$$ {K}_T=-\frac{\sqrt{6}}{\pi}\left\{0.5772+ \ln \left[ \ln \frac{T_{\mathrm{r}}}{T-{1}_{\mathrm{r}}}\right]\right\} $$
(5)

The use of Eq. 5 yields the values of K 5 = 0.72, K 10 = 1.3, K 25 = 2.04, K 50 = 2.59, K 100 = 3.14, and K 200 = 3.68.

Log-Pearson Type III

The log-Pearson Type III PDF is particularly useful for hydrological analysis because the skew parameter enables sample fitting where other PDFs fail. The log-Pearson Type III frequency curve is characterized by three parameters. The mean represents the average ordinate, the standard deviation is for the slope of the straight line on the probability paper, and the skew coefficient corresponds to represents the degree of curvature. However, this technique is mainly based on the use of log-transformed data. The following equations have been used extensively in the literature (Zaman et al. 2012; Elsebaie 2012).

$$ \overline{ \log x}=\frac{{\displaystyle \sum \log x}}{n} $$
(6)
$$ {\sigma}_{\log x}=\sqrt{{\displaystyle \sum {\left( \log x-\overline{ \log x}\right)}^2}}/\left(n-1\right) $$
(7)
$$ G=n{\displaystyle \sum {\left( \log x-\overline{ \log x}\right)}^3/\left(n-1\right)\left(n-2\right){\left(\sigma \log x\right)}^3} $$
(8)

The value of \( \mathbf{x} \) for any recurrence interval is calculated by the following expression:

$$ \log x=\overline{ \log x}+k{\sigma}_{\log x} $$
(9)

After obtaining rainfall values by one or two of the above methods for different return periods, chi-square, Anderson-Darling, and Kolmogorov-Smirnov (K-S) tests are carried out to assess the “goodness of fit” in addition to visual inspections for graphical comparisons.

IDF curve results

EasyFit software (EasyFit Professional 2009) is used in this work. Table 2 shows the model parameters and goodness of fit for both Gumbel and log-Pearson Type III PDFs. According to goodness of fit (K-S, Darling, and chi-square tests), it is observed that there is no big difference between them. P-P plot is also used to determine how well a specific distribution fits to the observed data. An example of P-P plot for empirical CDF values against theoretical cumulative distribution function (CDF) values for 10-min, 30-min, 3-h, and 24-h durations is shown in Fig. 2. Within all rainfall durations and high rainfall values, the matching curves for both Gumbel and LPT III are acceptable, where the high rainfall durations (e.g., 3 and 24 h) and low rainfall values, LPT III is a more reliable CDF. However, due to the climate change report (IPCC et al. 2007, 2012) it is better to take the most risky case and its corresponding numerical value in any design business. From this point of view, the best fit appears with log-Pearson Type III PDF.

Table 2 Parameter estimation of the best fitted distributions
Fig. 2
figure 2

P-P plot for empirical CDF values against theoretical CDF values with different durations. a Gumbel and b LPT III CDF

Figures 3 and 4 show the IDF curves on logarithmic scale for Gumbel and log-Pearson Type III methods, respectively. For any storm design, it is recommended to use LPT III method.

Fig. 3
figure 3

IDF for Al-Madinah (Gumbel method)

Fig. 4
figure 4

IDF for Al-Madinah (LPT III method)

Empirical IDF formula

From the previous results, empirical IDF formula for IDF for Al-Madinah area can be related using a derived equation in power law (Koutsoyiannis et al. 1998; Alhassoun 2011) as,

$$ {I}_{\mathrm{T}}\left(\mathrm{mm}/\mathrm{h}\right)=\frac{a{T}_{\mathrm{t}}^m}{t_{\mathrm{d}}^b} $$
(10)

Table 3 shows the results of the empirical parameters from Gumbel and LPT III methods and the average. Finally, the general average IDF formula in Al-Madinah city for any design storm can be derived in the following form,

$$ {I}_T\left(\mathrm{mm}/\mathrm{h}\right)=\frac{149{T}_t^{0.24}}{t_d^{0.66}} $$
(11)
Table 3 rainfall intensity parameters for Al-Madinah area

To compare between the results from IDF-derived equation (Eq. 11) and the models, Figs. 5 and 6 show an example of these results (T r = 5, 50, 200-year) for Gumbel and LPT III methods, respectively. In conclusion, the average formula (Eq. 11) could be accepted for storm design in Al-Madinah city.

Fig. 5
figure 5

Comparison of IDF Gumbel model and equation

Fig. 6
figure 6

Comparison of IDF LPT III model and equation

Daily rainfall generation

Daily rainfall data time series of Al-Madinah stations is presented in Fig. 7, which shows the high variations in rainfall amounts throughout the 43 years (1972–2014) as the normal situation in arid regions. Statistically, the daily rainfall mean of rainy days is 0.13 mm/day with a standard deviation of 1.66 mm/day. This general statistics may not be informative; a better statistics would be to separate the rainy days data from the dry ones. The expected mean of rainy days is 5.9 mm/day and standard deviation is 8.32 mm/day.

Fig. 7
figure 7

Daily rainfall series of Al-Madinah station (1972–2014)

Autocorrelation function (ACF) may give a better statistics of rainfall variability in this study. It is calculated for 10 years as shown in Fig. 8. This figure shows weak annual repeatability and no significant correlation values (<0.2).

Fig. 8
figure 8

Autocorrelation functions of daily rainfall

Autorun daily rainfall

To analyze daily rainfall, a mechanism for synthetic daily rainfall generation for Al-Madinah rainfall station needs to preserve the historical wet and dry spell properties. Such a generating model of synthetic sequences requires identification of a suitable PDF model (Şen 1985; Subyani and Hajjar 2014). In addition to the classical statistical parameters and serial correlation coefficients, the wet and dry period statistics are also considered as they are estimated from the available historical sequences.

Spell durations statistics

From the daily rainfall data, the dry and wet spell durations are extracted as well as the rainfall intensity of the rainy days. Data are truncated at zero rainfall level in this study. The general statistics are shown in Table 4. As can be seen, the overall average dry spell duration is 71 consecutive days, while the wet spell durations average is 1.58 days with rainfall intensity of 5.9 mm/day. From the maximum values for both wet and dry spells, one can find that it did not rain at all for 371 days (almost 1 year) at Al-Madinah station, while at most it rained consecutively for 8 days. The median values give better statistics when extreme values occur in the data. It can be seen that the dry and wet spell durations have medians of 6 days and 1 day, respectively.

Table 4 Statistics for dry and wet spell durations and rainfall intensities

When analyzing the dry and wet spell durations and the rainfall intensity over the years, the trend is observed at Al-Madinah station (Fig. 9). Table 5 lists the trends along with its Mann-Kendall p values for testing them. On average, the dry spells have positive trends towards longer days (0.04). For the wet spells, there is a negative trend, which means that the rainfall days with more rainfalls are expected in the future due to global warming and human activities. For the rainfall intensities, there are positive trends but they are not significant. In addition, these figures may support the effect of climate change.

Fig. 9
figure 9

Trends for rainfall intensities and dry and wet spells

Table 5 Trends for dry and wet spells and rainfall intensities with MK test

Spell duration distributions

For rainfall generation, the essential step is to find the best probability mass function (or density function) for the dry and wet spell durations as well as the rainfall intensities of the rainy days. Besides, the search for fitting suitable distributionsis an essential step in the any rainfall generation. Utilizing the professional statistical software package (EasyFit software EasyFit Professional 2009), which includes numerous distribution functions, exhaustive fittings are performed on the data. One can notice that the dry and wet spell durations are integer numbers with a minimum of 1 day, while the rainfall intensity is a continuous positive non-zero number.

Results of the best fittings adopting Kolmogorov-Smirnov (K-S) goodness of fit for the dry spells, wet spells, and the rainfall intensities are as follows.

  • Dry spell duration is best distributed as log-logistic with parameters α = 0.625, β = 6.96, and γ = 1.

  • Wet spell duration is best distributed as according to Poisson with λ = 1.584.

  • Rainfall intensities are best distributed as log-Pearson Type III with α = 265.5, β = −0.09, and γ = 24.8.

P-P plot is also used to determine how well a specific distribution fits to the observed data. Figures 10, 11, and 12 show the P-P plot for empirical CDF values against theoretical CDF values for dry spell duration, wet spell, and rainfall intensities, respectively. By combining these three distributions, the random number of daily rainfall is generated for the next 43 years (2015–2058) as shown in Fig. 13.

Fig. 10
figure 10

P-P plot for empirical CDF values against Poisson CDF values for wet spell

Fig. 11
figure 11

P-P plot for empirical CDF values against log-logistic CDF values for dry spell

Fig. 12
figure 12

P-P plot for empirical CDF values against LPT III CDF values for rainfall intensities

Fig. 13
figure 13

Daily rainfall generation (2015–2058)

Conclusion

Detailed characteristics of the observed daily rainfall series available in Al-Madinah area are investigated over the period of 1972–2014 (43 years). The main objective of the present study area is to develop a suitable formula for storm design in different rainfall intensities and different return periods; in addition, generating 43 years of wet and dry spells and daily rainfall amounts are adapted using the most suitable probability distribution function (PDF) model fit for these sequences with best goodness of fit. This study shows generated intensity-duration-frequency (IDF) curves using different rainfall intensity analyses to find an equation for describing storm design. Disaggregation of daily rainfall to hourly time series using reduction method is helpful to find suitable PDF for analysis. The results show no remarkable difference between Gumbel and LPT III methods. An empirical formula for storm design is developed for various durations and different return periods for Al-Madinah area. This formula is recommended for design duration for planning storm water structure in Al-Madinah area.

On the other hand, autorun analysis is implemented in this study to investigate the properties of the dry and wet durations and rainfall intensities. However, the study shows that the overall average dry spell durations is about 71 consecutive days while the average wet spell durations is 1.58 days with an average rainfall threshold intensity of 5.9 mm/day. From trend analysis, the dry spells have positive trends towards longer days. For the wet spells, there is a negative trend, which means that the rainfall days are bound to have more intense rainfalls in the future due to global warming and human activities.