Abstract
The statistical study of rainfall time series is one of the approaches for efficient hydrological system design. Identifying, and characterizing long-term rainfall time series could aid in improving hydrological systems forecasting. In the present study, eventual statistics was applied for the long-term (1851–2006) rainfall time series under seven meteorological regions of India. Linear trend analysis was carried out using Mann–Kendall test for the observed rainfall series. The observed trend using the above-mentioned approach has been ascertained using the innovative trend analysis method. Innovative trend analysis has been found to be a strong tool to detect the general trend of rainfall time series. Sequential Mann–Kendall test has also been carried out to examine nonlinear trends of the series. The partial sum of cumulative deviation test is also found to be suitable to detect the nonlinear trend. Innovative trend analysis, sequential Mann–Kendall test and partial cumulative deviation test have potential to detect the general as well as nonlinear trend for the rainfall time series. Annual rainfall analysis suggests that the maximum changes in mean rainfall is 11.53% for West Peninsular India, whereas the maximum fall in mean rainfall is 7.8% for the North Mountainous Indian region. The innovative trend analysis method is also capable of finding the number of change point available in the time series. Additionally, we have performed von Neumann ratio test and cumulative deviation test to estimate the departure from homogeneity. Singular spectrum analysis has been applied in this study to evaluate the order of departure from homogeneity in the rainfall time series. Monsoon season (JS) of North Mountainous India and West Peninsular India zones has higher departure from homogeneity and singular spectrum analysis shows the results to be in coherence with the same.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Forecasting of time series is an important mathematical aspects and it spans along many fields including water resources, economics and social demography (Granger and Newbold 2014; Salas 1980; Yaffee and McGee 2000). Understanding and analysis of the existing time series is required for future forecasting of the time series. The subject of long-term rainfall series analysis has received a great deal of attention recently, especially with regard to expected climate changes (Goyal 2014; Moyo et al. 2017; Murphy et al. 2017; Smit et al. 2000; Trenberth 1998). The study of long rainfall time series permits creating quantitative judgments of the potential boundaries of stationary hypothesis. The statistical studies of rainfall time series data were found to be useful in water resources planning and management and usually carried out at the spatio-temporal scales (Loucks et al. 2005; Vicente-Serrano 2006).
While examining any rainfall time series, one of the important tasks is to identify and summarize the time series data in classes by expressing their significant features (Hipel and McLeod 1994; Sethi et al. 2015; Tiwari et al. 2015). Trend analysis and homogeneity test are primary tasks of time series studies (Wilks 2011). Mann–Kendall analysis is the usually practised methodology for the trend detection for any time series (Sang et al. 2013; Yue et al. 2002). It suggests the polarity of trend (positive or negative) at different significance levels. Mann–Kendall test is non-parametric and does not consider any assumption on the distribution of the time series data. The Mann–Kendall test provides a single value for the one time series and is not able to extract the nonlinearity of the trend. Sequential Mann–Kendall Test was applied to detect nonlinear trend of the rainfall time series, i.e., temporal scale of the rainfall time series. Innovative trend analysis is one of the recently applied techniques to establish the general trend of the time series.
Homogeneity is a fundamental characteristic for the time series. von Neumann ratio test and cumulative deviation test are accepted techniques for homogeneity check. Singular spectrum analysis was suggested to observe the order of departure signature from homogeneity. Singular spectrum analysis is a strong tool to analyze the univariate time series in multidimensional spatial dimension.
Mann–Kendall and Sen’s tests were applied to long-term annual and seasonal rainfall series of India for trend analysis. West Peninsular India showed a positive trend for the annual rainfall series. Sequential Mann–Kendall test was applied to get the nonlinear trend of the rainfall time series and the result was validated using Mann–Kendall test. Partial cumulative deviation test was carried out in the study to get the nonlinear trend of the time series and was found to be in coherence with sequential Mann–Kendall test. Innovative trend analysis results could be useful to get trend in an easy way and to describe the number of temporal clusters available in the time series; therefore, it will be useful for soft computing analysis. Innovative trend analysis methodology was also able extract the monotonicity of the time series and hence useful to detect the number of available change points in the time series. In that way, it will be helpful to classify the rainfall series temporally. von Neumann ratio test and cumulative deviation test were applied to get the homogeneity of the long-term rainfall time series (1851–2006). Singular spectrum analysis was proposed and then successfully verified (from von Neumann ratio test and cumulative deviation test statistics) to detect the non-homogeneity character for the rainfall time series.
2 Study area
The country is situated north of the equator between 8°4′ and 37°6′ north latitude and 68°7′ and 97°25′ east longitude. It is the seventh largest country in the world, with a total area of 3,166,414 km2. India measures 3214 km from north to south and 2933 km from east to west. It has a land frontier of 15,200 km and a coastline of 7517 km.
For the study, seven regions in India were categorized, considering the seasonal and annual rainfall (Fig. 1). These regions are as follows: North Mountainous India (NMI), North Central India (NCI), Northwest India (NWI), East Peninsular India (EPI), West Peninsular India (WPI), South Peninsular India (SPI) and Northeast India (NEI) and are shown in Fig. 1 (Li et al. 2014; Sontakke et al. 2008). North Mountainous India (MNI) is the Great Himalayan Mountain of topography elevation more than 7000 m and mean annual rainfall about 1500 mm, of which nearly 72% is from the monsoon season.
Northwest India (NWI) region is an arid and semi-arid climate zone of India. In this region, the ‘Thar’ Desert is located. The mean annual rainfall of this region is 800 mm and 88% is contributed by the monsoon season. North Central India (NCI) is the Indo-Gangetic Plains, a humid sub-tropical climatic region. The mean annual rainfall is 1212 mm, with the monsoon contributing 85% of it. The mean annual rainfall for the WPI, EPI and SPI regions is 1103, 1162 and 1555 mm, with nearly 85, 73 and 60% falling in the monsoon season, respectively (Li et al. 2014; Parth Sarthi et al. 2015).
The presented case study comprises a very large area, of about 3.288 million km2. Available data consist of annual and monthly rainfall time series for 316 rain gauges categorized into seven homogeneous zones, from 1851 to 2006, analyzed to detect potential trends from different methodologies and their significance (Kumar et al. 2010). Long-term (1851–2006) instrumental area-averaged monthly rainfall series of seven homogeneous zones and the whole India were taken from the Indian Institute of Tropical Meteorology (http://www.tropmet.res.in/) for the study. The monthly data of rainfall series were prepared in five parts named as annual (AN), January–February (JF) March–April–May (MM), June–July–August–September (JS) and October–November–December (OD).
2.1 General statistical analysis
As a prelim step of time series analysis, statistics of the data were investigated. Some robust statistics like coefficient of variation, quartile skew coefficient and percentile coefficient of kurtosis were used in the present study including mean, maximum, minimum and standard deviation (Eqs. 1–3), where CV values of 0–15, 16–35 and > 36 indicate little, moderate and high variability, respectively. Italic values of Table 3 indicate the high variability of the rainfall series. Table 1 shows higher seasonal variation rather than annual variation for rainfall time series. Non-monsoon seasons show a higher variability than the monsoon season for all seven zones. Quartile skew coefficient was based on a comparison of distances of the third and the first quartile to the median:
where CV is the coefficient of variation, σ the standard deviation of the dataset, \(\bar{x}\) the mean of the dataset, qs the quartile skew coefficient, P10 the 10th percentile of the dataset, P25 the 25th percentile of the dataset, P50 the 50th percentile of the dataset, P75 the 75th percentile of the dataset, P90 the 90th percentile of the dataset and PCK the percentile coefficient of kurtosis.
3 Trend analysis
Identification of meteorological regimes of India is an important subject in stochastic hydrology under climate change conditions. Generally, parametric and non-parametric techniques have been employed to get the regime changes; the latter has been widely used mainly because of a fewer number of assumptions involved in their implementation (Khaliq et al. 2009). In any sign investigation study, mostly it is vital to estimate hydrological trends at local scales, that is, point estimates. In the current study, the MK and the SRC rank-based tests and CS test were used for identifying temporal changes in observational records. Most of these tests are based on the assumption of independent and identically distributed variables.
3.1 Mann–Kendall test
Mann (1945) presented a non-parametric test for randomness against time, which constitutes a particular application of Kendall’s test for correlation commonly known as the ‘Mann–Kendall’ test (Hamed 2008; Hamed and Rao 1998). Letting P1, P2, …, Pn be a sequence of rainfall measurements over time, Mann proposed to test the null hypothesis, H0, that the data come from a population where the random variables were independent and identically distributed. The alternative hypothesis, H1, is that the data follow a monotonic trend over time. Under H0, the Mann–Kendall test statistic (S) is (Eqs. 4– 7):
where
Under the hypothesis of independent and randomly distributed random variables, when n ≥ 8, the S statistic is approximately normally distributed, with zero mean and variance as follows:
where σ is the standard deviation of the dataset. Therefore, the standardized Z statistics follow a normal standardized distribution:
where S is the Mann–Kendall statistic.The hypothesis is that no trend is rejected when the Z value computed by the equation is greater in absolute value than the critical value Zα, at a chosen level of significance α.
3.2 Theil–Sen estimator (TS)
Theil–Sen estimator is a non-parametric statistics to calculate the median slope among all lines through pairs of two-dimensional sample points (Eq. 8). It is an unbiased estimator of the true slope in simple linear regression. Theil–Sen estimator is a widely used methodology after MK test to validate the results:
3.3 Changes in mean (%)
To calculate the percentage change in mean annual rainfall, mean trend applying Sen’s slope was used:
where β is Sen’s slope.
3.4 Sequential Mann–Kendall test
To detect the nonlinear trend with time, Sneyers (1990) introduced sequential or partial values, z(t), from the progressive analysis of the Mann–Kendall test. Herein, z(t) is a standardized variable that has zero mean and unit SD. The following steps are applied to calculate z(t) (Eqs. 10–13):
-
1.
The values of Pj mean time series (j = 1,…,n) are compared with Pi, (i = 1,…, j−1). At each comparison, the number of cases Pj > Pi is counted and denoted by nj.
-
2.
The test statistic t is then calculated by the equation
-
3.
The mean (E) and variance of the test statistics (Var) are
-
4.
The sequential values or partial values of the statistics z(t) are then calculated as
3.5 Partial cumulative deviation test (PS)
Cumulative deviations from the mean are rescaled using partial standard deviation of the series (Eqs. 14–16) and then it used to detect the nonlinear trend of the rainfall time series.
where k = 1, 2,……, n.
Pt is the rainfall sequence in time and n the total number of rainfall records.
3.6 Innovative trend analysis
Innovative trend analysis is a new technique proposed by Sen in 2011 (Şen 2011). The applicability of the analysis is very simple and applied to verify the results of MKT. The first half of the time series in the X-axis was plotted against the second half of the series in the Y-axis. It becomes obvious that the monotone increasing (decreasing) trend in the given time series fall above (below) the 1∶1 line. This idea was used to quantify the results of MKT. It is very much handy to use the ITA method to extract the trend information from the available rainfall time series.
4 Homogeneity test
4.1 von Neumann ratio test
The von Neumann ratio (N) is the commonly used test to check the absence or presence of homogeneity in the time series. It is directly associated with the first-order serial correlation coefficient. It is defined as follows:
where Pt is the rainfall sequence in time and n the total number of rainfall records.
4.2 Cumulative deviation test
Cumulative deviations from the mean are rescaled using standard deviation of the series (Eqs. 18–22) and then used to detect the nonlinear trend of the rainfall time series.
where k = 1, 2… n.
Pt is the rainfall sequence in time and n the total number of rainfall records.
5 Singular spectrum analysis
The singular spectrum analysis technique is a very new and powerful technique to analyze time series in hydrology (Sivapragasam et al. 2010). It has the capacity to perform multivariate component analysis from a single time series. In this method, singular spectrum analysis decomposes the original time series into different components of the series, and each series represents the trend, periodicity and noise of the series (Vitanov et al. 2008). This tool is very helpful in finding trends, smoothing of the time series, deriving the seasonality components, identifying periodicity at varying amplitudes and detecting shifting or change point in the series (Solow and Patwardhan 1996).
It is assumed that the any dynamic system satisfies for the set of k first-order differential equations for variables xi. The following sequential methodology was adopted to analyze the time series data using singular spectrum analysis.
-
1.
Conversion of univariate time series (xi) where “i” lies between 1 and n to multivariate characteristic matrix (X) using Eq. 23. Here, n represents the length of the time series and k is defined as the degree of freedom of the dynamic system. The degree of freedom k can vary from 2 to n − 1.
-
2.
Characteristic matrix (X) is transformed to decomposed vector (S) using Eq. 24. Basically, it is the multiplication of transpose characteristic matrix (X′) to characteristics matrix (X).
-
3.
Eigenvalues and eigenvector of the decomposed vector (S) is calculated. Eigenvalues of the decomposed time series are hereby used to derive the order of homogeneity for the rainfall time series.
6 Results and discussions
6.1 Rainfall trends
The results of the trend analysis using Mann–Kendall and Sen’s tests are presented in Tables 2 and 3. The rainfall data from seven zones were used, which had instrumental observed datasets with adequate record length (1851–2006). It was observed that only WPI had a positive trend under 95% significance level for the annual time series. Some seasonal variations were also observed for monsoon (NMI (negative trend) and WPI (positive trend)) and post-monsoon season (AI (positive trend) and NEI (positive trend)). Using same datasets, we adopted the Sen’s T test to identify the trend and observed that its results seemed to be quite coherent with the Mann–Kendall test. The results of the Sen’s T test seemed to be similar to those obtained from the Mann–Kendall test.
6.2 Changes in mean (%)
Using Eq. 9, long-term changes in mean value were estimated. It is noticed that the seasonal maximum positive and negative changes in the mean value are 27.49% (OD) and − 16.38 (JS) for the NWI and NMI rainfall zones, respectively, during 1851–2006. No significant changes were found in the mean values for the region NWI (JF), NCI (JF) and NEI (JF), corresponding to the season. In the annual analysis for different zones of India, it was found that the maximum change in the mean rainfall was 11.53% for WPI, whereas the maximum fall in the mean rainfall was 7.8% for the NMI region (Fig. 2).
6.3 Sequential Mann–Kendall test
The above described tests are the total estimators of the time series. They are unable to extract any idea about the local temporal changes. To get these temporal changes, sequential MK was carried out. The sequential MK values were computed for the rainfall time series from the beginning to end of the study period (1851–2006). The sequential MK values are presented as line graphs, and when the line crosses the upper or lower confidence limits it is an indication that there is a significant trend because the calculated MK value is greater than the absolute value of the normal standard Z value (at the 5% significance level). The intention of conducting the sequential MK tests and charting the results is to check how the trends fluctuated over the whole study period (1851–2006). Figure 3 shows that the sequential MK test method was able to detect nonlinear trend for the rainfall time series, whether it is annual or seasonal.
The applications of the partial cumulative deviation test were also presented for different types of rainfall series recorded at various zones and seasons (Fig. 4). It was found to be similar to the SQMKT test results and the nonlinear trend of the rainfall series could be observed.
The applications of the innovative trend analysis methodology were also presented for different types of rainfall series (for positive trend—Fig. 5, for negative trend—Fig. 6 and for no trend—Fig. 7) recorded at various zones and seasons. It was found to be in coherence with and Sen’s T test results.
6.4 Homogeneity test
von Neumann ratio and cumulative deviation test were performed for the rainfall time series (Tables 4 and 5). The expected value of the von Neumann ratio is 2. However, it tends to be greater or less than 2 for the non-homogenous time series. Cumulative deviation test statistics was calculated from partial cumulative deviation and rescaled with standard deviation for the whole time series.
Rainfall time series for seven zones were decomposed by applying Eqs. 23 and 24. Univariate matrix was converted to multivariate matrix with variable degree of freedom. The eigenvalues of the decomposed matrix for monsoon time series were plotted against the degree of freedom (Fig. 8). Figure 8 presents the order of non-homogeneity existing in the time series. Singular spectrum analysis of univariate rainfall series successfully extracts the homogeneous character of the time series. The monsoon season (JS) of the NMI and WPI zones showed higher departure from homogeneity (Tables 4 and 5) and singular spectrum analysis showed the results to be in coherence with the same (Fig. 8).
7 Conclusions
Eight statistical methods such as Mann–Kendall test, Theil–Sen slope estimation, sequential Mann–Kendall test, partial cumulative deviation test, innovative trend analysis, von Neumann ratio test and singular spectrum analysis were used to characterize the long-term rainfall time series of India. Mann–Kendall test and Theil–Sen slope estimation were used here to estimate the global trend of the time series. From the above tests, one cannot determine the local trend of the long-term rainfall time series. To address the problem stated above, the authors have utilized sequential Mann–Kendall test and partial cumulative deviation test for the same rainfall series. For example, annual time series for the WPI zone has the highest increasing trend as per MKT statistics (2.71). The same value did not represent any local changes happening during 1851–2006. Figure 3 addresses the problem of global trend of the time series and also represents the local trend. Innovative trend analysis is a unique graphical representation to find the trend of time series. The benefit of using this method is that it classifies the trend analysis at the minima or maxima level, which cannot be predicted using the first four methods. This will be helpful for planning purposes of watershed management, where the trend of lower range of rainfall is important for the drought-prone region. Subsequently, higher range rainfall statistics iss important for flood-affected zones. von Neumann ratio test and singular spectrum analysis were used to estimate the order of non-homogeneity in the time series.
References
Goyal MK (2014) Statistical analysis of long term trends of rainfall during 1901–2002 at Assam, India. Water Resour Manag 28(6):1501–1515
Granger CWJ, Newbold P (2014) Forecasting economic time series. Academic Press, Cambridge
Hamed KH (2008) Trend detection in hydrologic data: the Mann–Kendall trend test under the scaling hypothesis. J Hydrol 349:350–363
Hamed KH, Rao AR (1998) A modified Mann–Kendall trend test for autocorrelated data. J Hydrol 204:182–196
Hipel KW, McLeod AI (1994) Time series modelling of water resources and environmental systems. Elsevier, Amsterdam
Khaliq MN, Ouarda TB, Gachon P, Sushama L, St-Hilaire A (2009) Identification of hydrological trends in the presence of serial and cross correlations: A review of selected methods and their application to annual flow regimes of Canadian rivers. J Hydrol 368(1–4):117–130
Kumar V, Jain SK, Singh Y (2010) Analysis of long-term rainfall trends in India. Hydrol Sci J–Journal des Sciences Hydrologiques 55(4):484–496
Li L, Xu C-Y, Zhang Z, Jain S (2014) Validation of a new meteorological forcing data in analysis of spatial and temporal variability of rainfall in India. Stoch Environ Res Risk Assess 28:239–252. https://doi.org/10.1007/s00477-013-0745-7
Loucks DP, Van Beek E, Stedinger JR, Dijkman JP, Villars MT (2005) Water resources systems planning and management: an introduction to methods, models and applications. UNESCO, Paris
Mann HB (1945) Nonparametric tests against trend. Econometrica J Econometric Soc 13(3):245–259
Moyo M, Dorward P, Craufurd P (2017) Characterizing long term rainfall data for estimating climate risk in semi-arid Zimbabwe. In climate change adaptation in Africa (pp 661–675). Springer International Publishing
Murphy C, Burt TP, Broderick C, Duffy C, Macdonald N, Matthews T, McCarthy MP, et al. (2017) A 305 year monthly rainfall series for the Island of Ireland (1711–2016). In EGU General Assembly Conference Abstracts, vol 19, p 5615
Parth Sarthi P, Ghosh S, Kumar P (2015) Possible future projection of Indian summer monsoon rainfall (ISMR) with the evaluation of model performance in coupled model inter-comparison project phase 5 (CMIP5). Glob Planet Change 129:92–106. https://doi.org/10.1016/j.gloplacha.2015.03.005
Salas JD (1980) Applied modeling of hydrologic time series. Water Resources Publication, Littleton
Sang YF, Wang Z, Liu C (2013) Discrete wavelet-based trend identification in hydrologic time series. Hydrol Process 27:2021–2031
Şen Z (2011) Innovative Trend Analysis methodology. J Hydrol Eng 17:1042–1046
Sethi R, Pandey BK, Krishan R, Khare D, Nayak P (2015) Performance evaluation and hydrological trend detection of a reservoir under climate change condition. Model Earth Syst Environ 1:1–10
Sivapragasam C, Arun VM, Giridhar D (2010) A simple approach for improving spatial interpolation of rainfall using ANN. Meteorol Atmos Phys 109:1–7. https://doi.org/10.1007/s00703-010-0090-z
Smit B, Burton I, Klein RJ, Wandel J (2000) An anatomy of adaptation to climate change and variability. Clim Change 45:223–251
Sneyers R, Vandiepenbeeck M, Vanilierde R, Demarée GR (1990) Climatic changes in Belgium as appearing from the homogenized series of observations made in Brussels-Uccle (1933–1988) In: Schietecat GD (ed) Contributions à l’etude des changements de climat, vol 124. Bruxelles: Institut Royal Meteorologique de Belgique, Publications Série, pp 17–20.
Solow AR, Patwardhan A (1996) Extracting a smooth trend from a time series: A modification of singular spectrum analysis. J Clim 9:2163–2166
Sontakke NA, Singh N, Singh HN (2008) Instrumental period rainfall series of the Indian region (AD 1813–2005): revised reconstruction, update and analysis. Holocene 18:1055–1066. https://doi.org/10.1177/0959683608095576
Tiwari H, Rai SP, Sharma N, Kumar D (2015) Computational approaches for annual maximum river flow series. Ain Shams Eng J 23:24
Trenberth KE (1998) Atmospheric moisture residence times and cycling: implications for rainfall rates and climate change. Clim Change 39:667–694
Vicente-Serrano SM (2006) Differences in spatial patterns of drought on different time scales: an analysis of the Iberian Peninsula. Water Resour Manag 20:37–60
Vitanov NK, Sakai K, Dimitrova ZI (2008) SSA, PCA, TDPSC, ACFA: useful combination of methods for analysis of short and nonstationary time series. Chaos Solitons Fractals 37:187–202
Wilks DS (2011) Statistical methods in the atmospheric sciences, vol 100. Academic press, Cambridge
Yaffee RA, McGee M (2000) An introduction to time series analysis and forecasting: with applications of SAS® and SPSS®. Academic Press, Cambridge
Yue S, Pilon P, Cavadias G (2002) Power of the Mann–Kendall and Spearman’s rho tests for detecting monotonic trends in hydrological series. J Hydrol 259:254–271
Acknowledgements
The authors are grateful to the Department of Water Resources Development and Management, Indian Institute of Technology Roorkee, India. The authors are also thankful to the central library, Indian Institute of Technology Roorkee, India, for providing access to all research papers (http://mgcl.iitr.ac.in/). They are also thankful to the Ministry of Human Resource Development, Government of India, for their regular fellowship to conduct research. The authors are also thankful to all the anonymous reviewers.
Author information
Authors and Affiliations
Corresponding author
Additional information
Responsible Editor: A.-P. Dimri.
Rights and permissions
About this article
Cite this article
Tiwari, H., Pandey, B.K. Non-parametric characterization of long-term rainfall time series. Meteorol Atmos Phys 131, 627–637 (2019). https://doi.org/10.1007/s00703-018-0592-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00703-018-0592-7