Abstract
A number of probability distributions are used to estimate flood discharges with various return periods of extreme hydrologic events. Among others, Log-Normal, Gumbel, Pearson Type III and Log-Pearson Type-III distributions are used in this study. The parameters of these distributions are estimated from the given data by the method of moments. The objectives of this study are: to carry out the flood frequency analysis using the different distributions to selected rivers in Myanmar, namely the Chindwin and Yenwe Rivers and to identify the most appropriate probability distribution for the basins under study. Log-Pearson Type III distribution can be recommended for both Chindwin and Yenwe basins among other distributions under study. The estimated flood values can be used in the engineering design of hydraulic structures such as dams, bridges, culverts, levees and other structures located along rivers and streams and for the effective management of flood plains.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Estimation of the magnitude and frequency of occurrence of extreme hydrologic events, such as severe storms, floods, and droughts are important in water resources planning and management. Frequency analysis of hydrologic data is one of the widely used methods of estimating the return period associated with a flood of a given magnitude through the use of probability distributions [1–8].
A number of probability distributions are used to fit the probability of occurrence of flood series. Details of these distributions are available in the literature including [9–11] among others. Selection of the most appropriate distribution for annual maximum series has been paid attention worldwide. Log-Pearson Type III distribution is the recommended technique for flood frequency analysis by the U.S. Water Advisory Committee on Water Data, USGS [3]. It is used to estimate the flood discharges for the streams in Iowa, USA [4]. The Generalized Extreme Value distribution is the standard method for UK as reported in [2] and it is adopted in regional flood frequency analysis in Northern Iceland [5] and Nile equatorial basins [6]. Gumbel distribution is used in Nyanyadzi river in Zimbabwe [7]. A World Meteorological Organization survey of 54 agencies in 28 countries reveals that Log-Normal distribution with 3 parameters is not a standard in any country, Generalized Extreme Value is a standard in one country, and Log-Pearson Type III is a standard in seven countries [8].
Numerous goodness-of-fit procedures exist for comparing the fit of alternative probability distributions to streamflow sequences. Since the introduction of L-moments, numerous investigators have recommended them to assess the goodness-of-fit of various probability distributions to regional samples of streamflow [8, 12, 13] and others. Vogel and Wilson [8] constructed L-moments diagrams for annual maximum streamflow at more than 1,455 rivers in the United States and concluded that the Generalized Extreme Value, three-parameter Log-Normal and the Log-Pearson Type III distributions provide good approximations to the distribution of annual maximum flood series. Since Log-Pearson Type III has been selected as the base method in the U.S., they suggested the agencies and countries to reevaluate their standards with respect to the choice of a suitable model for flood frequency analysis. Stedinger and Griffis [14] also recommended that the regional skew map given in [3] be updated to use the additional 30 years of data now available, to appropriately adjust for low outliers identified in the samples used to estimate the regional skew, and to use new and powerful statistical estimation procedures developed to use such data set.
The objectives of the study are hence twofold: (1) to carry out the flood frequency analysis using Log-Normal, Gumbel (Extreme Value Type I), Pearson Type III and Log-Pearson Type III which are commonly used frequency distributions to selected rivers in Myanmar, namely the Chindwin and Yenwe rivers and (2) to identify the most appropriate probability distribution for the basins under study. The estimated flood values can be used in the engineering design of hydraulic structures such as dams, bridges, culverts, levees and other structures located along rivers and streams and for the effective management of flood plains.
2 Data Used in This Study
In this study, two rivers namely Chindwin and Yenwe rivers in Myanmar are selected. The Chindwin river has floods several times in a year from the onset of rainy season to the end of the monsoon. Annual maximum discharges are collected at Hkamti, Htamathi and Homalin stations for Chindwin river. Average annual rainfall at Hkamti, Htamathi and Homalin stations are 3,847, 3,262 and 2,314 mm respectively. The mean monthly temperature varies from 28 °C in May to 18 °C in January.
Myochaung station for Yenwe river which is a tributary of Sittoung is also selected in this study. It lies in a tropical monsoon area with an average annual rainfall of about 2,000 mm. The mean monthly temperature varies from 31 °C in April to 25 °C in January. The location and area of these stations are shown in Table 1. Series of annual maximum discharge of each basin is plotted in Fig. 1.
It can be seen from Fig. 1 that the variation of the flood values from the mean in Chindwin is small compared to the variation in Yenwe basin. Statistical characteristic of each series is given in Table 2. Annual maximum series of Chindwin basins have negative skews while Yenwe has a positive skew.
3 Methodology
3.1 Testing for Outliers
Reference [9] recommends that adjustments be made for the outliers of the data. If the station skew is greater than +0.4, tests for high outliers are considered first and if the station skew is less than −0.4, tests for low outliers are considered first. Where the skew is between ±0.4, tests for both high and low outliers should be applied before eliminating any outliers from the data set. The following equation is used to detect high outliers:
where
- yH :
-
high outlier threshold in log units
- \( \bar{y} \) :
-
mean logarithm of variate
- s:
-
standard deviation of y’s
- KN :
-
10 % significance level K values.
The following equation is used to detect low outliers:
where yL = low outlier threshold in log units and other terms are as defined earlier.
3.2 Frequency Analysis Using Frequency Factors
Chow [9] proposed the frequency factor equation shown in (3) which is applicable to many probability distributions used in hydrologic frequency analysis.
which may be approximated by
where
- xT :
-
value of the variate x of a random hydrologic series with a return period T,
- \( \bar{x} \) :
-
mean of the variate,
- s:
-
standard deviation of the variate,
- KT :
-
frequency factor which depends upon the return period T and assumed frequency distribution
In the event that the variable analyzed is y = log x, then the same method is applied to the statistics for the logarithms of the data using
and the required value of xT is found by taking the antilog of yT.
Table 3 summarizes the probability density function, the range of the variable, and equations for estimating the distribution’s parameters from sample moments for each distribution [9]. The theoretical KT relationships for the selected probability distributions: Log-Normal, Pearson Type III, Log-Pearson Type III and Gumbel distributions extracted from [9] are given below.
-
1.
Gumbel distribution
The frequency factor KT for different return period T is given in (6).
-
2.
Log-Pearson Type III distribution
The frequency factor depends on the return period T and the coefficient of skewness CS. When CS = 0, the frequency factor is equal to the standard normal variable z. The value of z corresponding to an exceedence probability of p (p = 1/T) can be calculated by finding the value of an immediate variable w:
then calculating z using the approximation
where k = Cs/6, and Cs = coefficient of skewness of logarithms of the series.
-
3.
Pearson Type III distribution
For this distribution, the same equation in Log-Pearson Type III applies except that Cs is skewness of the original series (without taking the logarithmic values).
-
4.
Log-Normal distribution
For this distribution, the same equation in Log-Pearson Type III applies using Cs = 0.
3.3 Statistical Analysis
Goodness of fit between the observed events and the fitted distribution is tested by the two most commonly used tests: Chi square and Kolmogorov–Smirnov (K–S) [15].
-
1.
Chi square test
The statistic is calculated by
where Oj is the observed number of events in the jth class interval and Ej is the number of events that would be expected from the theoretical distribution.
-
2.
Kolmogorov–Smirnov (K–S) test
The statistic Dn is evaluated by observing the deviation of the sample distribution function P(x) from the completely specified continuous hypothetical distribution function P0(x), such that
The test requires that the computed value Dn from the sample distribution be less than the tabulated value of Dn at the required significance levels of 0.05 and 0.1.
3.4 Reliability of Analysis
The reliability of the results of frequency analysis depends on how well the assumed probability model applies to a given set of hydrologic data. This can be done by calculating confidence limits which are upper and lower boundary values of the confidence interval. For estimating the event magnitude for return period T, the upper limit UT,α and lower limit LT,α may be specified by adjustment of the frequency factor equation given in [9]:
where α is a significance level and is obtained by \( \alpha = \frac{1 - \beta }{2} \), β = confidence interval.
\( {{\text{K}}^{\text{U}}}_{{{\text{T,}}\alpha}} \) and \( {\text{K}}^{\text{L}}_{{{\text{T,}}\alpha}} \) are upper and lower confidence factors, which can be determined for normally distributed data using the noncentral t distribution. Approximate values for these factors are given by the following equations:
in which \( a = 1 - \frac{{Z_{\alpha }^{2} }}{2(n - 1)} \) and \( b = K_{T}^{2} - \frac{{Z_{\alpha }^{2} }}{n} \).
The quantity Zα is the standard normal variable with exceedence probability α.
4 Results
4.1 Flood Discharges
Annual maximum discharge series of Chindwin River at Hkamti, Htamathi and Homalin stations and Yenwe at Myochaung station are checked whether outliers exist in the data series before using them. Based on the skewness coefficient of the data series used, outliers are calculated and given in Table 4. KN value based on the number of data in the series is obtained from the table for outlier test in [3].
It is observed that the annual maximum discharges in the year 1994 are smaller than the low outliers for Chindwin at all stations. However, useful historical information is not available to adjust for low outliers and therefore they are retained in the series. There is no high outlier observed in all series.
Flood discharges with recurrence intervals of 2, 5, 10, 20, 50 and 100 years are calculated using four probability distributions: Log-normal, Gumbel, Pearson Type III and Log Pearson type III distributions. The results are given in Tables 5, 6, 7, 8 that show large differences in results obtained by the different methods, particularly at the larger return periods. It can be seen that Gumbel distribution gives the highest estimated flood discharges for Chindwin and Log-Normal distribution gives the highest for Yenwe. The estimated discharges for high return period (50 and 100 years) using Log-Normal and Gumbel distributions yield about 10–20 % higher than the flood values estimated by Pearson Type III and Log-Pearson Type III distributions for all basins under study.
4.2 Statistical Analysis
The two most commonly used tests of goodness of fit namely Chi square and Kolmogorov–Smirnov tests are applied to check the fit of probability distributions used in this study. Tables 9 and 10 list the values of Chi square statistics (χ2) and Kolmogorov–Smirnov (Dn) respectively.
It can be observed from the Chi square values of four distributions for all basins given in Table 9 that the values are less than the critical value at 95 % confidence level. From Table 10, it can be seen that all the computed values of Dn are less than the critical value at 95 % confidence level. From both statistical tests, no distinction among all distributions can be made for the basins under study. All distributions are acceptable to fit to the annual maximum series of Chindwin and Yenwe basins at 95 % confidence interval.
If the goodness of fit is the criterion used to select a specified distribution, then the distribution with three parameters will usually provide a much better fit than two parameters [16]. In this study, Log-Pearson Type III is selected as the recommended distribution since it includes the skew coefficient as a variable and therefore is more flexible than the Log-Normal which has a skew of zero of the logarithms. Log-Normal is considered as the special case of Log-Pearson Type III distribution. Another reason is that Pearson Type III is capable of fitting frequency relations that may, for hydrologic reasons, be highly skewed [1].
4.3 Reliability Analysis
Reliability analysis for the basins under study is performed using log-Pearson Type III distribution. The confidence limits with 90 % confidence interval are calculated and shown in Tables 11 and 12.
Graphical presentations of estimated flood discharges using all probability distributions under study together with the confidence limits using Log-Pearson Type III for each basin are shown in Fig. 2.
It can be seen from Fig. 2 that the confidence interval is quite wide especially in Yenwe basin since the sample size of each series is small in this study.
5 Conclusions
Based on the analysis of statistical tests, Gumbel, Pearson Type III and Log-Pearson Type III and Log-Normal distributions are suitable to estimate flood discharges for Chindwin and Yenwe basins.
Log-Pearson Type III distribution can be recommended for Chindwin and Yenwe basins since it includes the skew variable and therefore is more flexible than other distributions.
A study on frequency analysis for other gauging stations of Chindwin basin as well as the other river basins needs to be performed to develop the regional frequency analysis. Future research can be carried out by using other parameter estimator such as L moments in frequency analysis for the river basins in Myanmar.
References
M.A. Benson, Uniform flood-frequency estimating methods for federal agencies. Water Resour. Res. 4(5), 891–908 (1968)
European Cooperation in Science and Technology, Review of applied-statistical methods for flood-frequency analysis in Europe, Centre for Ecology and hydrology, Natural Environmental Research Council, (2012)
Interagency Advisory Committee on Water Data, Guidelines for determining flood flow frequency: Hydrology Subcommittee Bulletin #17B, available from Office of Water data Coordination, U.S. Department of the Interior Geological Survey, Reston, VA, 1982
D.A. Eash, Techniques for estimating flood-frequency discharges for the streams in IOWA, US Geological Survey, Water-Resources Investigation Reports 00-4233, IOWA Department of Transportation and the IOWA Highway Research Board (Project HR-395A), 2001
P. Crochet, Evaluation of two delineation methods for regional flood frequency analysis in Northern Iceland, Report, Icelandic Meteorological Office, (2012)
A.O. Opere, S. Mkhandi, P. Willems, At site flood frequency analysis for the Nile equtorial basins. From http://www.unesco.org/fileadmin/MULTIMEDIA/FIELD/Cario/pdf/AT_SITE_FLOOD-FREQUENCY_ANALYSIS.pdf
N. Mujere, Flood frequency analysis using the Gumbel distribution. Int. J. Comput. Sci. Eng. 3(7), 2774–2778 (2011)
R.M. Vogel, I. Wilson, Probability distribution of annual maximum, mean, and minimum streamflows in the United States. J. Hydrologic Eng. 1, 69–76 (1996)
V.T. Chow, D.R. Maidment, L.W. Mays, Applied Hydrology (McGraw-Hill Inc, New York, 1988)
K.C. Patra, Hydrology and Water Resources Engineering, 2nd edn. (Alpha Science International Ltd, Oxford, 2008)
K. Subramanya, Engineering Hydrology, 3rd edn. (Tata McGraw Hill Publishing Company Limited, New Delhi, 2009)
O.S. Selaman, S. Said, F.J. Putuhena, Flood frequency analysis for Sarawak using Weibull, Gringorten and L-moments formula. J. Inst. Eng. Malaysia 68(1), 43–52 (2007)
U.N. Ahmad, A. Shabri, Z.A. Zakaria, Flood frequency analysis of annual maximum stream flows using L-Moments and TL-Moments. Appl. Math. Sci. 5(5), 243–253 (2011)
J.R. Stedinger, V.W. Griffis, Flood frequency analysis in the United States: Time to update. J. Hydrologic Eng. ASCE 13, 199–204 (2008)
A. Hayter, Probability and Statics for Engineers and Scientists, Thomson Higher Education, 3rd edn. (2007)
R.M. Vogel, D.E. McMartin, Probability plot goodness-of-fit and skewness estimation, procedure for the Pearson type 3 distribution. Water Resour. Res. 27(12), 3149–3158 (1991)
Acknowledgments
Thanks are due to staff officers from Hydrology section, Irrigation Department and Meteorology and Hydrology Department, Myanmar for their friendly collaboration in data gathering.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer Science+Business Media Singapore
About this paper
Cite this paper
Win, N.L., Win, K.M. (2014). Comparative Study of Flood Frequency Analysis on Selected Rivers in Myanmar. In: Hassan, R., Yusoff, M., Ismail, Z., Amin, N., Fadzil, M. (eds) InCIEC 2013. Springer, Singapore. https://doi.org/10.1007/978-981-4585-02-6_25
Download citation
DOI: https://doi.org/10.1007/978-981-4585-02-6_25
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-4585-01-9
Online ISBN: 978-981-4585-02-6
eBook Packages: EngineeringEngineering (R0)