1 Introduction

Water demand prediction (WDP) provides a basis for water allocation(Chen and Boccelli 2014; Romano and Kapelan 2014). Understanding the effects of administrative intervention and socioeconomic development on water demand is necessary to develop effective policies on water supply management. In recent years, research on WDP has received increased attention under a changing environment (Wada et al. 2013; Babel et al. 2014; Elliott et al. 2014; Wang et al. 2016; Mohtar and Daher 2016).

According to the prediction horizon and periodicity, WDP can be mainly categorized as long-term (i.e., spanning more than 2 years) (Rinaudo 2015), medium-term WDP (i.e., spanning from 3 months to 2 years) (Mombeni et al. 2013), and short-term (i.e. spanning less than 3 months) (Zhou et al. 2000; Bai et al. 2014; Mouatadid and Adamowski 2016). Various WDP models and methods can be broadly classified into time-series and multiple factor correlation analysis (Adamowski and Karapataki 2010; Campisi-Pinto et al. 2012; Zhai et al. 2012), or linear and nonlinear (Zhang et al. 2012; Chen 2011; Adamowski et al. 2012). Time-series methods can indicate the relationship between WD and time series by building the time series prediction model, which is preferable for short-term WDP; however, time-series techniques cannot reflect the inherent mechanism of WD and cannot easily present an objective illustration of the complex water demand system. Multiple factor correlation analysis methods can usually provide more reasonable WDP results by establishing the relationship between WD (as output) and its variables (as input), but they cannot easily provide an accurate forecast of each variable as an input. Linear methods (e.g., exponential smoothing models, autoregressive integrated moving average models, linear regression models, etc.) have been widely used because they are simple to understand and interpret. However, linear methods cannot handle the varying degrees of nonlinearity in WD data, which can be improved by nonlinear methods (e.g., ANNs, fuzzy logic, model trees, etc.). The performances of the models can be evaluated according to statistical criteria such as, average absolute relative error (AARE), normalized root mean square error (NRMSE) and threshold statistic (Ts). Each of the typical statistical models has its own characteristic, for example, the ANNs has the ability to learn from input and output and adapt to it in an interactive manner (Al-Zahrani and Abo-Monasar 2015); the FISs, as an inference mechanism, is used to establish a relationship between a series of input and output using fuzzy sets theory (Firat et al. 2009). The aforementioned WDP methods were demonstrated as useful tools in providing foundation for adjusting industrial structures, controlling pump constructions, scheduling urban water facilities, and making water supply management decisions by providing different prediction horizons of water demand, largely based on historical water use data.

These WDP methods typically focus on the performance of applied mathematical simulation models, which largely depend on the high quality of observed water use data or significant variables of WD as inputs. In fact, some constraints as inputs are considered in many developed statistical models, these constraints are usually parts of the model itself and have different functions. For example, the constrained orthogonal least-squares method is used to identify the structure and the parameters of the TSK fuzzy models (Mastorocostas et al. 2001); the linear constraints are used to incorporate weak supervision into the learning procedure and improve the performance of this procedure (Pathak et al. 2015); the use of inequality constraints in the linear regression models is to guarantee the mathematical coherence between the predicted values (Neto et al., de A Lima Neto et al. 2005). However, as to WDP models, several constraints, including available water resources, ecological water demand, and constrained total water use in countries (e.g., China) that largely contribute to water resource exploitation, are not properly considered, particularly for long-term WDP (Qin et al. 2015). Moreover, subregions in a basin are usually bundled with economy–population interactions, leading to water use competition (Liu and Wang 2003). Sustainable water exploitation has to be kept at the basin scale by treating each subregion as an individual stakeholder of water use. Expansion of WD for each subregion should be inhibited to improve water use efficiency and maintain a healthy ecosystem under constrained water use. Meanwhile, the role of each subregion, determined by its population, economy, location, and even political status, should be differentiated in WDP, which has received relatively less attention.

A two-stage approach to WDP is developed to support the management of WD under different frequencies at the basin scale and is applied in the Dongjiang River (DjR) basin in South China for a case study. Stage 1 aims to determine the total amount of water in a whole basin, with the consideration of available water resources under variable precipitation frequencies, constrained total water use, and ecological water demand, among others. Stage 2 aims to provide WDP results for each subregion and its industries by using the calculated weight based on the selected indicators and the WD capacity of each industry from the observed water use data, according to game theory. The main advantage of the proposed WDP approach over previous methods is its ability to reflect change in available water resources and the impact of government policy (e.g., the frequency of inflow, constrained total water use). Government policy is important because water use conflict—particularly between the upper-middle and downstream regions of a river basin—may occur, owing to increasing water demand intensified by fast industrialization and urbanization (Li et al. 2015). The second advantage relates to the consideration of the economic and political role of each subregion in the proposed WDP by selecting the indicators covering regional socioeconomic development and water use; this provides a basis for subregional WD in the basin under a changing environment. The third advantage includes its simple calculation and less dependence on the high quality of observed water use data.

This paper is organized as follows. The new proposed WDP approach will be described in the next section. The case study of the DjR basin is explained in section 3, followed by the application of the proposed approach in section 4. In section 5, we compare the proposed approach with typical WDP methods to clarify its characteristic. Finally, the conclusions are drawn in Section 6.

2 Methodology

The proposed WDP approach for a river basin in this study consists of two stages: constrained total WD of the whole basin (T w ) (stage 1) and total WD (T s ) and industrial WD for each subregion (stage 2). Detailed descriptions of the fundamental assumption and the two stages of the proposed WDP approach are presented in Fig. 1 and are explained in the following paragraphs.

Fig. 1
figure 1

Diagram of the proposed two-stage WDP approach

2.1 Fundamental Assumptions of the Two-Stage WDP Approach

The proposed WDP is based on two assumptions: 1) Given the constrained total WD of a whole basin, each subregion has fully grasped the WD information of other subregions, and 2) for each subregion, industrial WD is chosen to maximize its efficiency within the capacity of industrial water use of the subregion—that is, T s and the industrial WD of each subregion in a basin can be considered in a multi-stage complete information static game model. In the game model, each subregion and industry is taken as an individual stakeholder, who is allocated WD in a cooperative way under the constrained total WD of a whole basin; Specifically, assumption 1 illustrates that for a given constrained total WD of the whole basin, T s varies and is predicted by its socioeconomic development level and location in the cooperative game model, which provides a basis for industrial WD in a subregion. Water use efficiency for each industry is optimal when a Nash equilibrium of WD exists for all subregions and industries in the whole basin. If the water use efficiency of industry A is lower than that of others, the WD of industry A would be restricted and partly moved to an industry with higher water use efficiency. This effect prompts industry A to improve its water use efficiency. Finally, a new Nash equilibrium of WD for each industry is achieved. The cooperative game model in the study is applied just because it fits the situation of basin-scale WD conflict more easily and is convenient in practical use.

2.2 Constrained Total WD of a Whole Basin-Stage 1

For any basin, we assume that limited water resources are available for human and ecosystem development in a certain space–time range. In this study, the limited water resources are referred to as the constrained total WD of the whole basin, which is affected by both natural (e.g., variation of available water resources, climate change, and ecosystem change, etc.) and socioeconomic factors (e.g., national or regional policy, regional development plan, etc.). The water demand of a basin consists of off-stream WD for socioeconomic development and instream WD mainly for ecological conservation. Ecological WD is essential in maintaining a sustainable ecosystem for a basin, which should be deducted from the total water resources. Thus, the total available water ( T a ) is calculated as

$$ {\mathrm{T}}_a={\mathrm{T}}_g-{\mathrm{WD}}_e $$
(1)

Where T g is the total water resources of the whole basin, WD e is the ecological WD of the basin.

Given the constrained factors of WD for a basin and based on T a , its constrained total WD should be the minimum value of the constraints and is expressed as

$$ {\mathrm{T}}_w=\min \left(\ {\mathrm{T}}_a,\mathrm{cons}\ 1,\mathrm{cons}\ 2,\dots, \mathrm{cons}\ \mathrm{n}\right) $$
(2)

Where T w is the constrained total WD of the whole basin; cons 1 , cons 2 ,  …  , cons n are the amounts of water resources corresponding to the n constrained factors.

2.3 T s and industrial WD of each Subregion -Stage 2

In this stage, indicators covering socioeconomic development, water use, and water use efficiency are first selected to calculate the weight of each subregion used for allocating the constrained total WD of the whole basin (T w ). Industrial WD is then determined by the capacity of observed industrial water use and subregional total WD.

The indicator selection is mainly based on the three principles as follows:

  1. (1)

    To response well to the changing environment

The indicators should fully reflect the changing environment, such as climate change, human activity, and government policy and so on.

  1. (2)

    A comprehensive and representative combination

The water demand system is complicated and the indicators should fully cover the influencing factors of the system. Meanwhile, the indicators should also be representative due to the difficult collection and great amount of calculation. In case of data redundancy, key indicators should be selected and focus on the aspects of socioeconomic development, water use, and water use efficiency.

  1. (3)

    A greater operability

Data of the indicators should be easily accessible and available, and the calculation of the indicators should be simple and clear.

In case of data redundancy, the final indicators can be chosen by using analytic hierarchy process (AHP) or principal components analysis (PCA) method from the selected indicators.

The indicator matrix A is listed as:

$$ \mathrm{A}=\left[\begin{array}{ccc}\hfill {I}_{11}\hfill & \hfill \cdots \hfill & \hfill {I}_{1m}\hfill \\ {}\hfill \vdots \hfill & \hfill {I}_{ij}\hfill & \hfill \vdots \hfill \\ {}\hfill {I}_{n1}\hfill & \hfill \cdots \hfill & \hfill {I}_{nm}\hfill \end{array}\right] $$
(3)

Where n is the number of subregions, m is the number of the chosen indicator, I ij is the value of the j th indicator in the i th subregion.

The values of the chosen indicators have to be standardized because their dimensions are inconsistent. For each column in the above matrix, the difference between the value of each variable and the minimum value in the column is divided by the difference between the maximum and the minimum value of the column, which is the standardized series. The standardized indicator matrix B is listed as:

$$ \mathrm{B}=\left[\begin{array}{ccc}\hfill {i}_{11}\hfill & \hfill \cdots \hfill & \hfill {i}_{1m}\hfill \\ {}\hfill \vdots \hfill & \hfill {i}_{ij}\hfill & \hfill \vdots \hfill \\ {}\hfill {i}_{n1}\hfill & \hfill \cdots \hfill & \hfill {i}_{nm}\hfill \end{array}\right] $$
(4)

For instance, i ij in the standardized series of the j th column is calculated as

$$ {i}_{ij}=\frac{I_{ij-}{I}_{\mathit{\min},j}}{I_{\mathit{\max},j}-{I}_{\mathit{\min},j}} $$
(5)

Where I min , j and I max , j are the minimum and the maximum value of the j th  indicator, iij is the standardized value of the jth indicator in the ith subregion.

From Eq. (5), i ij  ∈ [0, 1]. The weight of the i th subregion is equal to the sum of all elements in the i th row of the standardized indicator matrix. The weight matrix K is given as follow:

$$ K=\left[\begin{array}{c}\hfill {k}_1\hfill \\ {}\hfill \begin{array}{c}\hfill \vdots \hfill \\ {}\hfill {k}_i\hfill \\ {}\hfill \vdots \hfill \end{array}\hfill \\ {}\hfill {k}_n\hfill \end{array}\right]=\left[\begin{array}{ccc}\hfill {i}_{11}+{i}_{12}+\hfill & \hfill \cdots \hfill & \hfill +{i}_{1m}\hfill \\ {}\hfill \begin{array}{c}\hfill \vdots \hfill \\ {}\hfill {i}_{i1}+{i}_{i2}\hfill \\ {}\hfill \vdots \hfill \end{array}+\hfill & \hfill \ddots \hfill & \hfill \begin{array}{c}\hfill \vdots \hfill \\ {}\hfill +{i}_{im}\hfill \\ {}\hfill \vdots \hfill \end{array}\hfill \\ {}\hfill {i}_{n1}+{i}_{n2}+\hfill & \hfill \cdots \hfill & \hfill +{i}_{nm}\hfill \end{array}\right] $$
(6)

Where k i is the weight of the i th subregion.

According to game theory, the total WD of each subregion can be allocated by using its weight from T w .

$$ {\mathrm{T}}_i={\mathrm{T}}_{w\times}\frac{k_i}{\sum {k}_i} $$
(7)

where T i is the total WD of the i th subregion.

Industrial WD in a subregion can be calculated as follows:

$$ \mathrm{industrial}\ {\mathrm{WD}}_{it}={\mathrm{T}}_{i\times}\frac{C_{it}}{\sum_{t=1}^l{C}_{it}} $$
(8)

Where industrial WD it is the WD of the t th industry in the i th subregion, l is the number of industries, and C it is the capacity of the t th observed industrial water use in the i th subregion.

3 Case Study

Originating from Xunwu County in Jiangxi Province, flowing southwest to Guangdong Province and running into the Pearl River estuary, the Dongjiang River (DjR) is one of the three branches of the Pearl River in southern China (Fig. 2). Its mainstream has a total length of 562 km, and the total basin has a drainage area of 35,340 km2 (He et al. 2015). The annual rainfall over the basin varies between 1500 mm in the dry season from October to March and 2400 mm in the wet or monsoon season from April to September (He et al. 2013). The DjR flows across Meizhou, Shaoguan, and Heyuan in the upstream region, Huizhou and Guangzhou in the midstream region, and Dongguan and Shenzhen in the downstream region. The river is the major water source for Hong Kong outside the basin.

Fig. 2
figure 2

Sketch map of the DjR basin in South China

The population and economy in the DjR basin have kept a rapid and persistent development because of China’s reform and opening-up policy. Up to 2015, total population and gross domestic product (GDP) in the whole basin increased to $27.8 million and $222 billion, respectively, which prompts large amounts of off-stream water demand. Meanwhile, the competition for water among subregions and water use sectors in the basin rise because of the unbalanced socioeconomic development in the upstream and downstream regions, as well as the uneven spatiotemporal distribution of water resources. The minimum, suitable, and maximum ecological instream flows of the Boluo station, which is the control hydrological station of the whole basin, are 187.4, 678.5, and 2091.5 m3/s, respectively (Zhang et al. 2012). Ecological instream flow is often occupied, particularly in extreme dry years (Lin et al. 2014a, b). Thus, the proposed two-stage WPD approach was applied in the prediction of T w and the industrial WD of the entire DjR basin and its seven subregions (Table 1).

Table 1 Seven subregions of the DjR basin

In the current study, total water resources as well as ecological instream water demand under variable precipitation frequencies (50%, 90%, and 95%) were obtained from the daily streamflow data of the Boluo station, covering the 1959–2010 period (Fig. 3). Hydrological data were released by the Guangdong Hydrological Bureau of China, with no missing data. The domestic, primary, secondary, tertiary and environmental (off stream) water use data of the seven subregions for the 1980–2015 period are summarized in Fig. 4. The socioeconomic development data of the subregions (e.g., population, GDP, water consumption) were obtained from Guangdong Water Resource Bulletin and Guangdong Statistical Yearbook. Since 2011, the Chinese government has put forward a water resource management policy concerning total water use, water use efficiency, and water quality, commonly referred to as “three red lines”; a specified control target for total water use has been set. For the subregions in the DjR, the control targets for total water use are 8.38, 8.75, and 8.29 billion m3 under the frequencies of 50%, 90%, and 95%, respectively (Guangdong Provincial Water Resources Department 2012).

Fig. 3
figure 3

Total water resources, ecological water demand, constrained total water use, and WD at frequencies of 50%, 90%, and 95% in the DjR basin in South China

Fig. 4
figure 4

Industrial WD of each subregion from 1980 to 2015 in the DjR basin in South China

4 Results

4.1 Constrained Total WD of the DjR Basin

The total water resources and ecological WD of the DjR basin under the frequencies of 50%, 90%, and 95% were calculated based on the observed daily streamflow data of the Boluo station. The calculated ecological WD is also compared with the ecological instream flow of the Boluo station proposed by Zhang et al. (2012). In the present study, the constrained total water use quota released by the local government was used to calculate T w (stage 1, Fig. 1). According to Eqs. (1)–(2), the constrained total WDs of the DjR basin under the frequencies of 50%, 90%, and 95% are 7.92, 7.3, and 5.96 billion m3, respectively (Fig. 3). With extreme dry years as an example, T w should be restricted to 5.96 billion m3, which not only ensures the ecological WD but also complies with the policy of constrained total water use. In an extreme dry year, T w is determined by the available water in the whole basin.

4.2 T s and Industrial WD of each Subregion in the DjR Basin

T s and the industrial WD of each subregion in the DjR basin were calculated in stage 2. First, eleven indicators (Table 2) covering regional water use amounts, the socioeconomic development and water use efficiency of the DjR basin were selected according to the three principles described in section 2.3, and the weight of each subregion for determining its total WD were calculated based on Eqs. (4)–(6). The final weight matrix of the seven subregions (Table 1) from the upstream to the downstream is K = [0.126, 0.04,0.063, 0.16, 0.151, 0.225, 0.234 ]. Second, T s was calculated in accordance with Eq. (7) on the basis of T w and the weight of each subregion. Last, the industrial WD of each subregion was calculated based on Eq. (8), considering the capacity of observed industrial water use in the 1985–2015 period (Fig. 4).

Table 2 Selected indicators for calculating the weight of each subregion in the DjR basin

Subregions such as Shenzhen and Dongguan in the downstream region, which exhibit high levels of socioeconomic development and water use efficiency, have larger weights for being allocated subregional total WD (0.225 for Dongguan and 0.234 for Shenzhen); meanwhile, Meizhou and Shaoguan in the upstream region, have weights equal to 0.04 and 0.06, respectively. The weights of the subregions in the midstream region, including Heyuan, Huizhou and Zengcheng, are smaller than those in the downstream region but larger than those upstream regions. This difference can be largely attributed to the observed water use, local water resources, socioeconomic development, and water use efficiency of each subregion in the basin (Fig. 5). For instance, Shenzhen in the downstream region has the largest population and GDP and exhibits the highest water use efficiency; it requires larger amounts of water, particularly for domestic water use, and should be allocated the highest weight. Huizhou and Heyuan in the midstream region have a smaller population and GDP; in addition, they have lower irrigation rates of agricultural water but have more effective irrigated areas (0.109 and 0.075 billion hectares in Huizhou and Heyuan, respectively) than Shenzhen (0.003 billion hectares), which accounts for the most agriculture water use and should be allocated with a higher weight of subregional total WD.

Fig. 5
figure 5

Standardized value of 11 selected indicators covering average annual observed water use, socioeconomic development, and water use efficiency of each subregion from 1980 to 2015 in the DjR basin in South China

T s and the industrial WD of each subregion in the DjR basin under the frequencies of 50%, 90%, and 95% are presented in Fig. 6. Shenzhen, Dongguan, Huizhou, Zengcheng, Heyuan, Shaoguan, and Meizhou have T s rates of 23%, 22%, 16%, 15%, 13%, 6%, and 4% of T w , respectively; each rate corresponds to the calculated weight of each subregion. As to the whole basin, the WD of the domestic, primary, secondary, tertiary, and urban environment are 1.08, 2.26, 2.02, 0.44, and 0.16 billion m3, respectively, in extreme dry years. Regarding the industrial WD of each subregion, Shenzhen has the most balanced WD structure; the secondary WDs of Dongguan and Zengcheng account for the most, and the primary WDs of Heyuan and Huizhou account for 60% and 64% of its total subregional WD, respectively, which is determined by the industrial structure of the subregions. From the upstream to the downstream regions of the DjR basin, the proportion of primary WD decreases, whereas that of secondary WD increases until a balanced WD structure is achieved in the most developed region.

Fig. 6
figure 6

T s and industrial WD for each subregion at precipitation frequencies of 50%, 90%, and 95% in the DjR basin in South China

The observed water use in the three typical years (2000, 2002, and 2004) correspond to the frequencies of 50%, 90%, and 95% are chosen and compared to verify T s and the industrial WD of each subregion in the DjR basin (Fig. 7). Overall, the structure of total WD is similar to that of the observed water use in the whole basin and its subregions in the upstream and midstream regions under the three precipitation frequencies. Moreover, for each subregion, the proportion of T s in T w is in accordance with the proportion of its total observed water use in the whole basin; a greater difference between T s and the total observed water use is observed. This difference may be partly attributed to uncertainties in calculating T w (Eqs. (1) to (2)), including the total available water, ecological WD, and constraints. One of these constrained factors is the total water use quota released by the government, which leads to increased differences in the total amount; another reason is the capacity of each industrial water use (from the series of observed data) when calculating industrial WD (Eq. (8)), which results in a similar structure of WD and observed water use for each subregion. An increased significant difference is observed between the structures of WD and observed water use in Shenzhen, the most developed region in the downstream region. For example, WD of the primary is larger than that of observed water use, which leads to a smaller WD of other industries, compared with that of the observed water use. The proposed method performs well in allocating T w to its subregions, but more actions should be taken to reduce the uncertainty in calculating T w .

Fig. 7
figure 7

Comparison of the observed water use and the predicted total and industrial WD of the entire DjR basin and its seven subregions at precipitation frequencies of 50%, 90%, and 95%. WDP represents water demand prediction, Obs represents observed water demand

5 Discussion

The two-stage WDP approach proposed in this study differs from previous WDP methods (e.g., artificial neural networks, time-series models, auto-regressive models) in that first, it intends to predict the constrained total and industrial WD of a basin and its subregions under variable frequencies of the total water resources and has no specific predict horizon, which is different from the hourly, daily, weekly, monthly, or annual WDP of an urban city or just a subregion by using time-series, statistical, or artificial models. Second, the proposed method gives T w considering the available water resources, ecological WD, and policy regulation (e.g., constrained total water use quota), and based on T w , T s and the industrial WD of each subregion are allocated using the weight calculated by socioeconomic development and water use indicators, according to game theory; meanwhile, in previous WDP methods, WD with different prediction horizons are simulated using the time-series model or the artificial model based on the available water consumption and climatic data, and the WDP models are built largely relying on the identification of major variables negatively or positively affecting WD (Ghimire et al. 2016), which requires large amounts of high-quality observed water use data.

WDP in the current study is conducted by using the two-stage WDP approach under the constraints of available water resources, ecological WD, and constrained total water use quota and so on, according to the game theory. Therefore, the proposed approach is simple and has a greater operability for the WDP of a river basin and its involved subregions, compared with the previous typical statistical models. In addition, the constraints for WD vary according to climate change, land use change, population growth, and government policy and so on. Hence, the proposed method can respond better to the changing environment. For example, in stage 1, the constrained total water use quota released by the government and the variability of water resources due to climate change can be reflected in the determination of T w (Eq. (1)), which renders the WDP result close to actual results.

In the current study, only the constrained total water use quota released by the government was considered to reflect the impact of the policy on T w ; however, factors affecting the WD of a basin are complex and dynamic (Yang et al. 2016). This finding suggests that more constrained factors, such as stakeholder interests, water environmental capacity, and water quality should be incorporated into stage 1 when determining the constrained total WD of a whole basin under the changing environment. Overall, the proposed approach helps allocate water for subregions along the river under ecological and environmental protection, reduces water use conflicts in the upstream and downstream regions, and suggests adaptive measures for sustainable water resources management under the changing environment.

6 Conclusion

This study proposed a two-stage approach to predict the total and industrial WD of a basin and its subregions under variable frequencies. The DjR basin was chosen for a case study. Available water resources, ecological WD, and constrained total water use quota were considered in stage 1 to determine the constrained total WD of the DjR basin. Game theory was applied to allocate T w to its seven subregions by calculating the weight of each subregion and to predict industrial WD by using the water use capacity of each industry. Eleven indicators covering historical water use and socioeconomic development were chosen to calculate the weight of each subregion. The observed and predicted T w and industrial WD of the whole basin and its seven subregions were compared. The following conclusions were drawn:

  1. (1)

    The structure of the total WD predicted by the proposed method in this study is similar to that of the observed water use in the whole basin and its subregions, mainly in the upstream and midstream regions.

  2. (2)

    The proportion of T s in T w is in accordance with the proportion of the total observed water use for each subregion. Meanwhile, the difference between T s and total observed water use is larger because of some uncertainties in calculating T w .

  3. (3)

    The proposed method is simple and performs well in allocating T w to the subregions and predicting industrial WD at the basin scale.

Further studies should focus on incorporating additional necessary constraints into stage 1 and reducing the uncertainty in determining T w under a changing environment. Adaptive measures should also be taken to cope with the predicted T s in the basin.