1 Introduction

In many hydrological studies, rainfall analysis should be conducted to derive an Intensity–Duration–Frequency (IDF) relationship in order to derive the desired rainfall intensity from the corresponding rainfall duration and return period. To obtain IDF relationships with appropriate accuracy, long-term rainfall data (e.g., 30 consecutive years) with different sub-daily durations (i.e., 15, 30, 60, 120, 360, 720 and 1440 min) have to be available (Chow et al. 1988). Also, in agricultural studies especially in plant pathology, the meteorological data should be available at hourly time scale (Caubel et al. 2012).

In many rainfall stations, the rainfall is measured daily, and shorter-duration data are not available. Therefore, in order to derive an IDF relationship, first, the daily rainfall data should be converted to the desired shorter-durations. Several researchers have studied this issue, and proposed different analytical and experimental equations for different regions around the world. Although some of these equations have shown appropriate accuracy, the disadvantage is that either the calculation of short duration rainfall from them is very complicated, or the evaluation of the parameters involved is complex.

One of the earliest equations, was proposed by Hershfield (1961) for the USA, which was later modified and extended by Bell (1969) as the following:

$${P}_{T}^{t}=\left(0.52+0.21.\mathrm{ln}\left(T\right)\right)\left(0.54{t}^{0.25}-0.50\right){P}_{10}^{60}$$
(1)

where \({P}_{T}^{t}\) is the rainfall depth with a duration of t (minute) and return period of T (year). This equation which is still commonly used in water engineering projects, is only applicable for rainfalls with a duration in the range of 5 to 120 min and a frequency between 5 to 100 years. As it can be seen in Eq. (1), evaluation of \({P}_{10}^{60}\) when there is only daily rainfall available, is very complicated (Mauriño 2004). Many other researchers by modifying and calibrating the coefficients of Eq. (1), proposed different equations for various regions in the world, other than the USA (Ghahraman and Abkhezr 2004; Vaziri 1991).

In spite of using daily rainfall depths for the calculation of short-duration rainfall, several studies attempted to relate short-duration rainfall intensity to different parameters, including: near surface temperature, dew point (Shaw et al. 2011; Utsumi et al. 2011; Schroeer and Kirchengast 2018; Ali and Mishra 2018), sea level pressure, relative humidity, wind, and geospatial height (Hertig and Jacobeit 2013; Lepore et al. 2016). In these studies, the researchers were focused on implementing a multivariate regression between various parameters to achieve equations with minimum error, which is very time-consuming and difficult.

Many studies are available which used statistical methods to obtain short-duration rainfall depths. Some of these methods are based on stochastic Bartlett-Lewis or Newman-Scott rectangular pulse models (Pui et al. 2012; Cowpertwait et al. 1996) in which, five to seven parameters are included in the calculations. Moreover, using many other statistical methods, the short duration rainfall can be estimated by statistical distribution fitting (Garcia-Bartual and Schneider 2001; Smithers and Schulze 2001; Haddad et al. 2011), logarithmic graph fitting techniques (Al Mamun et al. 2018), multivariate regression method (Koutsoyiannis et al. 2003) or L-moment method (Galoie et al. 2013a). Although these methods have represented satisfactory results, heavy statistical efforts are involved in them.

The K-nearest neighbor resampling (KNNR) method is one of the popular methods which is based on the nonparametric disaggregation technique. Although the application of this method in converting daily rainfall to sub-daily is not easy, a method based on three-day rainfall patterns was used by Park and Chung (2020) in order to improve the ability of the method. Also, some researchers used the random cascade model and method of the fragment (MOF) for converting the daily to sub-daily rainfall (Bakhshi and Al Janabi 2019; Li et al. 2018). The results showed that the accuracy of the method for different rainfall resolutions (e.g. 4-h, 1-h and 10-min) were different so that for 10-min rainfall the result was overestimated (Bakhshi and Al Janabi 2019).

One of the simplest and easy to use methods is an empirical equation proposed by the Indian Meteorological Department (IMD). It has been successfully used with significant accuracy by a number of researchers in India (Palaka et al. 2016) and other regions of the world (Logah et al. 2013; Rashid et al. 2012; Chowdhury et al. 2007). This method is used for the conversion of maximum daily to hourly rainfall depths, as the following:

$${P}^{t}={P}^{24}{\left(\frac{t}{24}\right)}^{\left(\frac{1}{3}\right)}$$
(2)

where, P is the rainfall depth and t is the duration (hour). The IMD is a straightforward equation, but attempts to employ this method in some regions such as northern Graz in Austria showed that this equation with the “1/3” power gave low accuracy results (Galoie et al. 2013b). In that study, the researchers showed that by changing the power to “0.29”, the accuracy of the final results improved significantly.

According to the literature review, the simplest, yet accurate method for the estimation of sub-daily rainfall from daily data is the IMD method. In this study, the IMD equation is modified in a way that it can be calibrated for different locations worldwide, to obtain sub-daily rainfall depths with high accuracy.

2 Methodology

In this study, the IMD equation is modified to a single parameter equation, called Modified Indian Meteorological Department (MIMD), as the following:

$${P}^{t}={P}^{24}{\left(\frac{t}{24}\right)}^{n}$$
(3)

where, n is the regional coefficient which can be obtained by calibration. As it can be seen in this study, this coefficient in various regions can be varied even more than \(\pm 15\%\) in comparison to the fixed power of IMD formula (0.33).

Five stations located in different regions with different climatic conditions were selected (Table 1), in order to find the corresponding “n” value for each region. The location of the stations is also shown in Fig. 1. For each of the selected stations, daily and sub-daily rainfall depths were available (i.e., 1, 6 or 12 h) in order to compare observed data with final modeled data. The climate classification in Table 1 is based on the Koppen method which is the most widely used method for climate classification (Beck et al. 2018).

Table 1 Selected stations and related information
Fig. 1
figure 1

The location of the selected stations in the world (A, B, C, D, and E denote the station codes)

2.1 Statistical Analysis

According to Table 1, the sub-daily rainfall depths are available for all stations. The return period of maximum annual values can be calculated using the Weibull equation as the following (Raghunath 2006):

$$Tr=\frac{N+1}{m}$$
(4)

where, Tr is the return period (year); N is the number of data availability years; m is the rank of observed rainfall values when arranged in descending order. Using Eq. (4), the computed return periods are limited to the duration of data available. In order to estimate rainfall depths in longer return periods, the type I generalized extreme value (GEV-I) distribution (Gumbel) is used as the following (Gumbel 1954; Singh 2013):

$$x=\overline{x }+\frac{\sigma }{{\tilde{\sigma }}_{N}}\left(y-{\overline{y} }_{N}\right)$$
$$y=-\mathrm{ln}\left(-\mathrm{ln}\left(\frac{Tr}{Tr-1}\right)\right)$$
(5)

where, \(\overline{x }\) and \(\sigma\) are the mean and standard deviation, respectively; \({\overline{\sigma }}_{N}\) and \({\overline{y} }_{N}\) can be obtained from GEV type I tables, based on the number of sample data (Singh 2013).

Analysis of variance (ANOVA) in most cases is done when two or more than two sets of sample data are compared together as a test of hypotheses. In this analysis, the student’s t-test is commonly used for comparing the true difference between two groups with 95% confidence interval (Landau and Everitt 2004). The 95% limit is not mandatory and can be changed according to the test accuracy but in practical, the 95% is very commonly used. The t-test assumes that the two groups are normally distributed with equal variances. If the variances are unequal, the Welch’s t-test should be used (Welch 1947). However, if the two groups have the same sample sizes and variances, Student’s t-test and Welch’s t-test will give the same results. The t-test formula can be expressed as follow (Landau and Everitt 2004):

$$t=\frac{{\overline{X} }_{1}-{\overline{X} }_{2}}{\sqrt{\frac{{s}_{1}^{2}}{{n}_{1}}+\frac{{s}_{2}^{2}}{{n}_{2}}}}$$
(6)

where, \(\overline{X }\) is the sample mean, \(s\) is the sample variance, \(n\) is the sample size and subscripts 1 and 2 refer to the sample groups. The analysis can be performed manually or conveniently with statistical software (e.g. R, SPSS, Excel and etc.).

Also, it should be noted that correlation coefficient (r test) is one of the most famous tests for every researcher in order to evaluate the similarity or deviation between two sets of data. Therefore, in order to evaluate the accuracy of the estimated values, the Root Mean Square Error (RMSE) (Eq. 7), and the sample Pearson correlation coefficient (Eq. 8) were used. The RMSE is frequently used to measure the average magnitude of error, and the sample Pearson correlation coefficient is used to evaluate the degree of agreement between the observed (\({P}_{o}\)) and estimated (\({P}_{m}\)) values.

$$RMSE=\sqrt{\frac{\sum_{i=1}^{N}{\left({P}_{o}-{P}_{m}\right)}^{2}}{N}}$$
(7)
$${r}_{{P}_{o}{P}_{m}}=\frac{N\sum {P}_{{o}_{i}}{P}_{{m}_{i}}-\sum {P}_{{o}_{i}}\sum {P}_{{m}_{i}}}{\sqrt{N\sum {P}_{{o}_{i}}^{2}-{\left(\sum {P}_{{o}_{i}}\right)}^{2}}\times \sqrt{N\sum {P}_{{m}_{i}}^{2}-{\left(\sum {P}_{{m}_{i}}\right)}^{2}}}$$
(8)

where, \(N\) is the sample size, \({P}_{o}\) and \({P}_{m}\) are observed and estimated rainfall depths, respectively. The closer the value of RMSE is to 0, the better the model performance is. The value of r is always between –1 and +1. If r =  + 1 or r = -1, it means that there is a perfect linear correlation between \({P}_{o}\) and \({P}_{m}\). For r =  + 1, \({P}_{m}\) increases as \({P}_{o}\) increases and for r = –1, vice versa. Also, a value of 0 implies that there is no linear correlation between the variables (Holmes et al. 2017).

3 Results and Discussion

The MIMD equation (Eq. 3) was used for the selected stations, and regional coefficients (n) that resulted in the best correlation between observed and estimated values were obtained. Table 2 summarizes the final estimations of regional coefficients (n) and the corresponding RMSE and correlation coefficients.

Table 2 Regional coefficient (n) and corresponding RMSE and correlation coefficient for maximum 12-h rainfall depths

Moreover, Fig. 2 represents the observed and estimated maximum 12-h rainfall depths for each stations data.

Fig. 2
figure 2

Observed and estimated 12-h rainfall depths based on estimated regional coefficient (n) of the MIMD equation (the station codes are represented in each panel)

According to Table 2 and Fig. 2, for all stations, the correlation coefficient of MIMD method is between 0.8 and 1.0 which can be described as a very strong positive association. Also, the regional coefficient (n) is not equal to 0.33 for any of the stations, which is the power of the IMD equation (Eq. 3). Moreover, Table 2 represents that the regional coefficient (n) in regions with humid climate is smaller than regions with dry climate.

To make a comparison between IMD and MIMD method, the correlation coefficients are calculated for IMD method and they are represented in Table 2. As it can be seen, by modification of IMD method and employing only one parameter as regional coefficient in MIMD method, the results are improved considerably in comparison with IMD method.

Also, the hypothesis testing of means was performed using t-test for observed and modeled data in SPSS software (Table 3). The analysis in all modeled data showed that the p-value (in SPSS denotes as sig) is more than 0.05 which represents that there was no significant difference between the modeled and observed data (Cleophas and Zwinderman 2016).

Table 3 The t-test outputs for observed and modeled data in SPSS software

Since sub-daily data less than 12-h were not available for stations (D) and (E), the correlation coefficients for 1- and 6-h rainfall depths were calculated for stations (A) and (B) in which, both 1- and 6-h data were available. Also, the correlation coefficients for 6-h rainfall depth was calculated for stations (C) in which, 6-h data were available.

The analysis showed that the estimated regional coefficients (n) are still valid for a short duration but the correlation coefficients are less than the values obtained for 12-h rainfall depths. This means that as the rainfall duration decreases, the accuracy of MIMD decreases as well. Table 4 summarizes the final analysis of short-duration rainfall.

Table 4 Statistical analysis of 1-, 6- and 12-h rainfall depths

In order to show the accuracy of MIMD, this method is compared with observed data and the Bell method. Figure 3 represents the final results for the station (E). As can be seen in this figure, the MIMD method can estimate the intensity better than the Bell method. The correlation coefficient (r) of the Bell method is 0.71 which is much less than the MIMD method calculated as 0.88 for the station (E).

Fig. 3
figure 3

A comparison between MIMD and Bell methods (station E)

4 Conclusion

Rainfall analysis is one of the most common processes in most water engineering projects and hydrological studies. In many rainfall stations, the rainfall depths are only available as daily and in order to estimate an IDF relationship, the daily rainfall data should be converted to sub-daily, such as 1-, 2-, 6- and 12-h. Many different methods have been suggested by a number of researchers which can be used for conversion of daily to sub-daily rainfall depth but most of them need intensive efforts to complete. Indian Meteorological Department (IMD) method is one of the simplest methods for converting daily to sub-daily rainfall depth but accuracy of this method in some regions is not acceptable. This study aimed to propose a modified form of IMD method, say Modified Indian Meteorological Method (MIMD) in order to increase the accuracy of IMD method. Five stations were selected in different countries with different climates so that in each of which, sub-daily rainfall data were available. The annual maximum values of daily and corresponding sub-daily rainfall depths were extracted and various return periods were calculated using Weibull and Gumbel methods. In this study, the IMD method was modified to a single parameter equation and then the parameter was calibrated for each station. The final results were compared with observed data and it showed that the accuracy of MIMD method in estimation of sub-daily (especially 6- and 12-h) rainfall depths was considerably increased (mean correlation coefficients as 0.9) in comparison to IMD method (mean correlation coefficient as 0.73). Also, the t-test analysis which was performed by SPSS software between observed data and MIMD outputs showed that the p-value for all stations was more than 0.05 which indicated that there was no significant difference between the modeled and observed data. The regional coefficient of the MIMD method for these five stations ranged between 0.29 to 0.35 while the coefficient of the IMD method is fixed as 0.33. Also, the results showed that the accuracy of the MIMD method was decreased as rainfall duration was decreased (less than 6-h).

For future studies, it is suggested that, more than one parameter is applied in MIMD method as regional coefficients so that the method increases the accuracy of the results for converting the daily to 1-h or less duration rainfall.