1 Introduction

Hydrologic forecasting is to reveal the future hydrologic regimes, and further provide useful information for practical water activities (Tiwari and Chatterjee 2010; Dutta et al. 2012). In nature, hydrologic processes show highly complex characteristics due to the influences of many and always interrelated physical factors (Labat et al. 2000). Moreover, climatic change (Hanson et al. 2004) and human activities (Carsten et al. 2008) add the complexities of hydrologic processes by changing land surface conditions. Although great efforts were devoted by researchers, enough understanding of hydrologic processes has not been gained presently. The models used currently cannot always meet practical needs enough due to their limited applicable ranges and defects, causing the difficulty in accurately hydrologic forecasting (Tiwari and Chatterjee 2010).

Generally, the models used for hydrologic forecasting can be divided into two types: mathematical–physical models and black-box models (BBMs). The former mainly use a series of mathematical equations to describe physical hydrologic processes, so they usually need a large amount of data for calibration and validation purposes, and are computationally extensive (Arora 2002). However, the need of enough data cannot be always met, especially in developing countries where recorded data lengths are usually short. Comparatively, BBMs are effective alternatives in many practical situations because of having two advantages as low quantitative data demands and simple formulation (Jain and Kumar 2007). Hydrologic forecasting by black-box models is usually called hydrologic time series forecasting, and it is mainly considered in this study.

For overcoming the black-box defects of BBMs, many methods are often combined with BBMs to conduct hydrologic series forecasting. Present studies and practical applications (reviewed in Section 2) indicate the better performance of hybrid models rather than single model. Among the former the combination of wavelet analysis with black-box models has become a prevalent approach to forecast monthly precipitation (Nourani et al. 2009), monthly runoff (Kisi 2008, 2009b), shallow groundwater level and daily discharge (Wang and Ding 2003), rainfall-runoff (Coulibaly and Burn 2004), droughts (Kim and Valdes 2003), monthly lake level (Kisi 2009a), and other hydrologic variables. Wavelet analysis can elaborate the localized characteristics of a series both in temporal and frequency domains (Percival and Walden 2000), so it can handle series’ non-stationary characteristics and then guide hydrologic time series forecasting (Nourani et al. 2008).

However, the effectiveness of wavelet models is impacted by several key but presently unsolved issues. The most critical one is accurate wavelet decomposition of series, as the basis of wavelet aided hydrologic time series forecasting. Previous studies usually chose the decomposition level for wavelet decomposition as log 10 n (n is series’ length) (Aussem et al. 1998; Wang and Ding 2003). It may be unreasonable because of just depending on series’ length but cannot considering series’ composition (Nourani et al. 2011). Wavelet choice and wavelet threshold de-noising also impact the accuracy of wavelet decomposition (Sang et al. 2009a). Another key issue is uncertainty evaluation. Uncertainty is the objective existence in hydrologic forecasting (Krzysztofowicz 2001; Ghosh and Katkar 2012); previous studies of wavelet models mainly focused on improving the accuracy of forecasting result, but there are lacks of studies on uncertainty.

The main objective of this study is to propose an improved wavelet modeling framework for hydrologic time series forecasting. The methods for wavelet decomposition of series were first improved by discussing several key factors, by which different deterministic components and noise in hydrologic series can be separated. Based on wavelet decomposition of series, the improved wavelet modeling framework (WMF) was proposed. Finally, the improved WMF was verified by analyzing four cases. The conclusions are presented in the final section.

2 Review on Black-Box Models

Black-box models use certain mathematical tools to reveal the correlations of hydrologic variables, but do not consider physical hydrologic processes. Serial-analysis techniques (SAT) are the typical black-box models and develop extensively, but they cannot accurately describe hydrologic series with nonlinear and non-stationary characteristics. Another typical kind of black-box models is the neural models which also have gained increasing applications in hydrology. Neural models are supposed to possess the ability of reproducing the nonlinear relationships between input explanatory variables and output forecasted variables (Jain and Kumar 2007). Many studies have proved that they can model the hydrologic processes by a proper three-layer neural network with input-, hidden- and output-layer, and can perform better than SATs (Luk et al. 2000). By summarizing previous studies and applications of black-box models in hydrology, some personal understandings and conclusions are gained as follows:

  1. (1)

    Series decomposition should precede forecasting. An important understanding is accurate result of hydrologic time series forecasting usually cannot be gained if directly analyzing raw hydrologic series by black-box models (Jain and Kumar 2007). It is mainly due to the multi-components of hydrologic series (Labat 2005; Sang et al. 2012). Therefore, a good knowledge of hydrologic series’ composition should be first gained, or else the forecasting process would lack reliable hydrologic basis. However, most of black-box models used presently are not based on the composition of hydrologic series (Nourani et al. 2009). To improve hydrologic time series forecasting, accurate decomposition of series should precede the forecasting processes.

  2. (2)

    Deterministic components and noise should be forecasted respectively. Another major understanding is black-box models cannot satisfactorily forecast hydrologic extrema (Zhang and Govindaraju 2000). In the theories of stochastic hydrology (Yevjevich 1972), hydrologic series is basically composed of deterministic components and noise. The former are generated by certain deterministic physical mechanisms, and show deterministic characteristics as periods and trend; noise is generated by many random and uncertain factors, and shows pure random characters. Noise contaminates deterministic components and is closely relevant to hydrologic extrema (Kuczera 1992). Therefore, hydrologic extrema usually show pure random characters and cannot be accurately forecasted by deterministic models. To improve the result of hydrologic time series forecasting, deterministic components and noise should be first separated and then forecasted respectively.

  3. (3)

    Uncertainty evaluation should be studied. Previous studies of hydrologic time series forecasting mainly focused on improving the accuracy of forecasting result, but there is a lack of study on uncertainty evaluation. Hydrologic processes like any other natural processes have uncertainty (Arabi et al. 2007), so the forecasting result with a single optimal value is not convincing and “honest” (Sang et al. 2010). Uncertainty evaluation should be studied to obtain more reasonable forecasting results.

  4. (4)

    Hybrid model performs better than single model. Accurate result of hydrologic time series forecasting usually cannot be obtained by any model alone. To overcome the defects of single model, many new techniques have been applied in hydrologic time series forecasting, such as information theory (Bagtzoglou and Hossain 2009), fuzzy theory (Gao and Er 2005; Jeong et al. 2012) and wavelet theory (Coulibaly and Burn 2004). Various analyses have demonstrated the better performances of hybrid models compared with single model, and it is mainly due to the complementary effects among different methods.

  5. (5)

    Wavelet models should be developed. Wavelet models is one kind of the popular and widely used hybrid models, since wavelet analysis can remove noise and identify different deterministic components in hydrologic series, based on which the process of hydrologic time series forecasting is more guidable. However, several key issues impact the effectiveness of wavelet models, including (a) accurate wavelet decomposition of series is impacted by many key factors (Sang 2012); (b) hydrologic extrema still cannot be accurately forecasted if directly analyzing raw series by wavelet models, as clearly shown in various cases (Kisi 2009b; Nourani et al. 2009; etc.); (c) uncertainty evaluation is still an open issue when using wavelet models. These issues will be studied in the following sections, mainly to improve wavelet aided hydrologic time series forecasting.

3 Wavelet Decomposition of Series

The issue of wavelet decomposition of series was studied by Sang (2012). He first established the reference energy function for discrete wavelet analysis by operating Monte-Carlo simulation; then, by comparing energy function of hydrologic series with the reference energy function, he proposed the methods for wavelet choice, decomposition level choice, wavelet threshold de-noising and significance testing of DWT; finally, he gave a step-by-step guide to discrete wavelet decomposition of hydrologic series.

The methods were briefly described as follows: (1) for the analyzed series f(t) with the length of n, choose proper wavelet and compute the decomposition level of log 2 n, then apply dyadic DWT to it; (2) remove noise f n (t) in f(t) by wavelet threshold de-noising (WTD): \( f(t)={f_d}(t)+{f_n}(t) \); (3) determine the energy function of the de-noised series f d (t) by computing the energy \( {E_j}=\sum\limits_{t=1}^n {{{{({f_{dj }}(t))}}^2}} \) of its sub-signal f dj (t) under each level j; (4) determine the energy function of noise and estimate proper confidence interval (e.g. 95 %) by operating Monte-Carlo simulation; (5) use the energy of the sub-signal f d1 (t) under the first level to rescale both the energy function of noise and confidence interval, and take the result as the reference energy function; (6) compare the energy function of f d (t) with the reference energy function, and certain f dj (t) with the energy exceeding the confidence interval is taken as deterministic component; and (7) all deterministic components can be identified, and accurate wavelet decomposition of series f(t) is obtained finally. The method was used in this study for accurate wavelet decomposition of series, as the basis of wavelet aided hydrologic time series forecasting.

4 Improved Wavelet Modeling Framework Proposed

Hydrologic time series forecasting can be improved based on wavelet decomposition of series. As before stated, deterministic components and noise in original hydrologic series show obviously different characteristics, and they need to be analyzed by proper models respectively.

Deterministic components are analyzed by proper black-box model (such as SAT, neural model) to obtain the deterministic forecasting results. Three cases would be met in practice. First, forecasting of certain hydrologic variable by SAT models using its previous values; in this case, each deterministic components identified in original series need to be forecasted respectively. Second, forecasting of certain hydrologic variable by neural models using its previous values; in this case, deterministic components are used as the input data of neural network, and the number of deterministic components identified equals the input layer’s neurons number. Third, forecasting of certain hydrologic variable by neural models using other explanatory variables, such as rainfall-runoff forecasting; in this case, each explanatory series need to be decomposed, and their total number equals the input layer’s neurons number, and then the forecasted series need to be separated into deterministic component and noise, and the former is forecasted by the decomposed explanatory series.

Noise is the inventible part of hydrologic data (Yevjevich 1972). They reflect the intrinsic uncertainty of hydrologic processes and generate hydrologic extrema. In hydrologic time series forecasting, it can be an effective approach to evaluate uncertainty: quantitatively describe the random characters of the noise separated from the forecasted series by proper statistical method, such as normal hypothesis testing, hydrologic frequency analysis, or the principle of maximum entropy (POME) theory (Singh 1998), and estimate the forecasting result of hydrologic extrema with proper guarantee rate (or confidence level). Specifically, the separated noise following skew distribution can be analyzed by hydrologic frequency analysis using proper probability density function, and the separated noise following normal distribution can be analyzed by normal hypothesis testing.

By summarizing the above results, an improved wavelet modeling framework, WMF for short, is proposed for hydrologic time series forecasting (Fig. 1). It includes four main steps: decomposition of series to separate noise and identify deterministic components; selection of appropriate model to forecast deterministic components; analysis of noise to evaluate uncertainty; and summarization of them to obtain the final forecasting result.

Fig. 1
figure 1

Steps of hydrologic time series forecasting by the wavelet modeling framework (WMF) proposed. In Figure 1, “DWT” is discrete wavelet transform, and “WTD” is wavelet threshold de-noising

The specific steps of hydrologic time series forecasting by the improved WMF are explained as follows:

  1. (1)

    Choose proper wavelet and decomposition level, and then apply dyadic DWT to the analyzed series f(t), by which wavelet coefficients of the series f(t) under each level are obtained;

  2. (2)

    Remove noise in f(t) by WTD. The result is expressed as:

    $$ f(t)={f_d}(t)+{f_n}(t) $$
    (1)

    where f d (t) and f n (t) are the de-noised series and noise respectively;

  3. (3)

    Identify and separate deterministic components in series f d (t) by conducting significance testing of DWT. The result is expressed as:

    $$ f(t)={f_d}(t)+{f_n}(t)=\sum\limits_{i=1}^l {{f_{di }}(t)+} {f_n}(t) $$
    (2)

    where f di (t) is the ith deterministic component identified with the total number of l.

  4. (4)

    If the hydrologic series is forecasted using its previous values, it is decomposed according to the steps (1)–(3). If the hydrologic series is forecasted by other explanatory series, the latter are decomposed according to the steps (1) and (3), and the forecasted series is de-noised according to the steps (1) and (2).

  5. (5)

    Select proper black-box model to forecast deterministic components in the forecasted series using the results in step (4), and obtain the deterministic forecasting result \( {{\widehat{f}}_d}\left( {t+\tau } \right) \) with the forecasting period τ.

  6. (6)

    Use proper statistical method to describe the random characters of noise data f n (t), and estimate the value \( {{\widehat{f}}_n}(t) \) with proper guarantee rate (or confidence level).

  7. (7)

    Add up the two results, and obtain the final forecasting result \( \widehat{f}\left( {t+\tau } \right) \) with proper guarantee rate:

    $$ \widehat{f}\left( {t+\tau } \right)={{\widehat{f}}_d}\left( {t+\tau } \right)+{{\widehat{f}}_n}(t) $$
    (3)

5 Case Study

5.1 Data

Four cases were analyzed for verification purpose. These data were measured in different sampling rates and from different basins (Table 1). The case-I presents 20-year monthly runoff data measured at the Dashankou station in Northwest China, and it has two dominant periods of 6 and 12 months (Sang et al. 2009b). The case-II presents 41-year monthly precipitation data measured at the Nanjing weather station in East China, and it has a dominant period of 12 months due to the annual variability of hydrologic processes. The case-III presents 41-year annual precipitation data measured at the Nantong weather station at the outlet of the Changjiang River. The case-IV presents 125-day mine discharge data measured at the Hanqiao Coal Mine in the mid-east of China, and it is mainly determined by the local rainfall of PS4 series (Sang et al. 2009c).

Table 1 Observed hydrologic data used in this paper

5.2 Wavelet Decomposition Results

The wavelets used for analyzing these series were chosen according to de-noising results. The statistical characters of original series, de-noised series and separated noise, including mean \( \left( {\overline{x}} \right) \), coefficient of variation (Cv), coefficient of skewness (Cs), lag-1 (r1) and lag-2 (r2) autocorrelation coefficients, were computed. Table 2 indicates that by using the chosen wavelets mean values of de-noised and original series are similar; but de-noised series’ C v values are smaller and r 1 values are bigger than those of originals series; moreover, five separated noise series mainly show random characters by doing statistic hypothesis testing. Therefore, it is thought that de-noising results of these series by the chosen wavelets are reasonable.

Table 2 Statistical characters of the de-noising results of five series

Energy functions of these series are depicted in Fig. 2, and wavelet decomposition results are displayed in Fig. 3. RS1 series includes three deterministic components: the sub-signal under level 2 has the period of 6 months, the sum of sub-signals under levels 3–5 has the period of 12 months, and the residue is trend. RS2 series includes two deterministic components: the sub-signal under level 3 is the periodic component, and the residue is trend. RS3 series includes two deterministic components: the sub-signal under level 2 is the periodic component, and the residue is trend. For RS4 series, its sub-signals under the first two levels are composed of noise, but the residue is trend. PS4 series includes two deterministic components: the sum of sub-signals under the first six levels is the periodic component, and the residue is trend.

Fig. 2
figure 2

Energy functions of the series used in this paper, and the reference energy function with 95 % confidence interval

Fig. 3
figure 3

Wavelet decomposition results of the series used in this paper

5.3 Forecasting Results by Different Models

Six factors were considered when operating forecasting: (1) data. Each data of forecasted series was divided into two parts (Table 3) for calibration and verification respectively; (2) edge effect. The point-symmetric extension method was used to handle double edges of series (Kharitonenko et al. 2002); (3) model. Five models, denoted as M1, M2, M3, M4 and M5, were used. Since back propagation neural network (BPNN) performs better than SAT models, all five models were based on the former. For any series, it is directly forecasted when using M1 model; de-noised but un-decomposed, and then forecasted when using M2 model; decomposed by the decomposition level of log 10 n but not de-noised, and then forecasted when using M3 model; decomposed by significance testing of DWT but not de-noised, and then forecasted when using M4 model; de-noised and decomposed, and then forecasted when using M5 model (i.e., the proposed WMF); (4) model structure. The number of input layer’s neurons was determined according to series’ correlations when using M1 and M2 models, but determined according to series’ decomposition results when using M3, M4, and M5 models. The number of hidden layer’s neurons was selected by the trial-error procedure; (5) forecasting period. Both 1-step and 6-step ahead forecasting were conducted; (6) results evaluation. Three indexes, AARE (average absolute relative error), TS x (threshold statistics) and R 2 (coefficient of determination), were used to evaluate the results:

Table 3 Data used in the forecasting processes
$$ \begin{array}{*{20}c} {AARE=\frac{1}{N}\sum\limits_{i=1}^N {\left| {\frac{{f\prime (i)-f(i)}}{f(i) }} \right|} \times 100\%} \\ {T{S_x}=\frac{{{n_x}}}{N}\times 100\%} \\ {{R^2}=\frac{{\sum\limits_{i=1}^N {{{{\left( {f(i)-f\prime {(i)^{\tilde{\mkern6mu} }}} \right)}}^2}} }}{{\sum\limits_{i=1}^N {{{{\left( {f(i)-f\prime {(i)^{\tilde{\mkern6mu} }}} \right)}}^2}} +\sum\limits_{i=1}^N {{{{\left( {f\prime (i)-{{{\overline{{f\prime (i)}}}}^{\tilde{\mkern6mu} }}} \right)}}^2}} }}} \\ \end{array} $$
(4)

where f(i) is observed data and f’(i) is forecasted data with the total number of N, n x is the number of data points with the AARE values being smaller than x%; \( \overline{{f\prime (i)}} \) is the mean of f’(i).

The 1-step and 6-step ahead forecasting results of each case obtained by five models were evaluated, and the results are shown in Tables 4, and 5, in which the structures of BPNN are also presented.

Table 4 Evaluation of the 1-step ahead forecasting results of four cases obtained by different models
Table 5 Evaluation of the 6-step ahead forecasting results of four cases obtained by different models

The noise separated from RS1, RS2 and RS4 series have the C s values bigger than 0.7 (Table 2), and they do not follow normal distribution by dong statistical hypothesis testing. Therefore, these noise series were described by the Pearson-III (P-III) distribution and then analyzed by hydrologic frequency analysis according to China’s general practices. The noise in RS3 series with the C s value of 0.08 follows normal distribution accosting to the statistical testing results, and it was analyzed by normal hypothesis testing. The estimated noise values with various guarantee rates are shown in Table 6.

Table 6 Estimated values of noise with various guarantee rates by statistical methods*

The final 1-step and 6-step ahead forecasting results of four cases by the proposed WMF are depicted in Fig. 4. More details were discussed in the following section.

Fig. 4
figure 4

1-step (left) and 6-step (right) ahead forecasting results of four cases by different models. In Figure 4, M1 is a single BPNN; M2 is a wavelet de-noising-based BPNN, M3 and M4 are wavelet decomposition-based BPNNs, whereas M5 is a wavelet de-noising and decomposition-based BPNN. “GR” is the guarantee rate

6 Results Discussion

This study improved the understanding about the relationship between hydrologic series’ composition and hydrologic time series forecasting, and proposed an improved wavelet modeling framework, by which deterministic forecasting result can be gained, and uncertainty can be evaluated conveniently. By comparing the forecasting results of four cases using various models, it can be found that:

  1. (1)

    Hydrologic time series forecasting can be improved by first removing noise in the forecasted series. Forecasting results of four cases are not good enough when using M1, M3 and M4 models, and it is due to the noise impacts. However, when first removing noise from original series, forecasting results can be improved by M2 and M5 models.

  2. (2)

    Accurate series decomposition can improve hydrologic time series forecasting. Among the five models used, M3, M4 and M5 models are based on wavelet decomposition of series, and their results are much better than those by M1 model. The results demonstrate that deterministic components in hydrologic series show different characteristics, so they should be first separated and then forecasted respectively. In addition, M3 model performs worse than M4 and M5 models due to the unreasonable wavelet decomposition result using the decomposition level of log 10 n.

  3. (3)

    Uncertainty can be evaluated by the proposed WMF. By estimating the noise values with certain guarantee rate (Table 6), uncertainty of forecasting results can be quantitatively evaluated. Figure 4 indicates that most of the observed data fall within the range estimated by the forecasting results with 5 % and 95 % guarantee rates (95 % confidence interval for case-III) using M5 model, thus the uncertainty of forecasting result was effectively evaluated. It is concluded that de-noising does not impact the accuracy of deterministic forecasting results, but provides a feasible approach for uncertainty evaluation.

  4. (4)

    Forecasting results become worse with forecasting period increasing. The 6-step ahead forecasting results by any models are worse than 1-step ahead forecasting results (Tables 4, and 5), but this worse is not so perceptible when using M5 model. Therefore, it is thought that hydrologic forecasting, especially those with longer forecasting periods, can be improved by the proposed WMF.

  5. (5)

    Forecasting results of four cases show difference. Among the four cases, results of case-IV are the best, those of case-I and case-III followed, but the results of case-II are the worst. Forecasting results of any cases by M5 model are more accurate than those by other four models. Thereby, it is thought the proposed WMF has wide applicability in hydrologic time series forecasting.

  6. (6)

    Performances of five models have big differences. Tables 4, and 5 clearly show the different performances of five models. Among them, the M1 model performs the worst, especially when the forecasted series is greatly impacted by noise and longer forecasting period is needed. The performances of M2, M3, M4 and M5 models are different due to their different wavelet results; the results by M5 and M2 model as a whole are better than those by M4 model, while the results by M3 model are the worst. On the whole, the performance of five models is M5, M2, M4, M3, and M1 from good to bad.

  7. (7)

    The proposed WMF performs better compared with other four models. Forecasting process by the proposed WMF is based on hydrologic series’ composition and so has reliable hydrologic basis; moreover, the eligible rates of forecasting result are higher, and accurate forecasting result with longer forecasting period can also be obtained; besides, uncertainty can be estimated quantitatively, which can make the final forecasting result more reasonable.

7 Conclusions

In this paper, by first discussing several key issues on wavelet decomposition of series, an improved wavelet modeling framework was proposed for hydrologic time series forecasting. Analyses of four different cases verified its performance. By summarizing the study, it can be found that identification and separation of multi-components (including deterministic components and noise) of hydrologic series is an effective approach to improve hydrologic series forecasting. Compared with conventional black-box models, the WMF proposed shows three advantages: accurate decomposition of series that makes the forecasting process more guidable, higher effectiveness in accuracy and eligible rate and longer forecasting period, and the ability of uncertainty evaluation.

Nonetheless, more attention should still be paid to three issues in practice. First, accurate de-noising result is the prerequisite when using the proposed WMF, so noise should be further studied and more effective de-noising methods are needed. Second, conventional methods should be improved. Since black-box models are used to forecast deterministic components in original series when using the proposed WMF, forecasting result is influenced by their defects, such as determination of neural models’ structures and parameters estimation; to improve hydrologic time series forecasting, defects of conventional black-box models should be further solved. In addition, for wavelet transform with down-sampling we could get quite different wavelet coefficients even if only shifting the analyzed series a few sample points, so the bad behavior of down-sampling should be carefully considered in discrete wavelet analysis of series, and the use of orthonormal multi-wavelet shell can avoid this problem (Chen et al. 2003). Third, when using the proposed WMF, specific model should be established based on both WMF and the hydrologic activity studied, and several detailed issues, such as wavelet decomposition of series and selection of proper black-box models, should also be carefully analyzed.