Introduction

Approximately 54% of the world population lived in cities in 2014, which is expected to rise to two-thirds by 2050, generating 55% of additional water demand worldwide (Marchal et al. 2011). Future water conditions will become harder to manage unless current water challenges, namely, demand management, water security, conservation, sustainable consumption, and water efficiency, can be addressed globally (Arfanuzzaman and Rahman 2017). In addition, in the coming years, there will be limited ways to increase water supply and a detailed approach to water use management will be required (Haddad and Lindner 2001). Sustainable water demand management guarantees efficient water usage to maintain economic growth, household consumption, food production, industry, and energy of a country (WWAP 2015). This study takes up the challenges by analyzing water demand at Kenyir Lake meticulously using two techniques: fuzzy inference system (FIS) and multiple linear regression (MLR).

The analysis of water demand is becoming a challenging step because of the changes caused by climatic conditions, land use patterns, human activities, and changes in technology. To know thoroughly the water demand of an area, its data collection and techniques are reviewed, defined, identified, and executed carefully. A number of techniques have been proposed, but they revolved in academics only. This is because of their complexity and nature, the quality of the available data, and the number of variables (Alamanos et al. 2019). FIS is a method that conveys a significant relationship between variables, and MLR gives the direction of each variable. Both methods have their ways and contributions in analyzing current water demand to be compared in this study. There are other techniques used by other researchers, for example, the logistic model (Ren et al. 2019), system dynamics model (Huang and Yin 2017), and water evaluation and planning (Ospina-Noreña et al. 2009), in which their analyses revolve on the prediction of future water demand.

In the problem of demand and supply of water at Kenyir Lake, FIS is performed by defining the fuzzy rules of system input and output variables of the study. Three steps are involved in the fuzzy process: regulate the rule conclusion into a suitable degree of sample and rule premise; all rule conclusions are clustered with results spread from potential output values; and lastly, the defuzzification process involves choosing a value among a set of potential output values (Dubois and Prade 1996). Through the fuzzy process, water demand values are defined, and the difference between the values is portrayed. MLR, conversely, classifies the data according to dependent and independent variables. Instead, of using simple linear regression, MLR is chosen because the study adopted several independent variables to discuss the variations in the water demand (Uriel 2013). In this study, the strategies involved in MLR analysis are a selection of input data, regression analysis, and assessment of model performance. MLR also produced numerical values of water demand that can be compared with those generated via FIS analysis and actual water consumption.

The proposed FIS method creates a flowchart (Fig. 2) on how the input variables go through a fuzzy process (fuzzification, rule evaluation, and defuzzification) to provide results associated with the membership functions of each variable. The model of the fuzzy system essentially includes defining the assumptions and implications of the system, separating variables into subsequent sets, and determining linguistic terms for identified output for each set (Moorthi et al. 2018). FIS analysis can be done easily and is user-friendly especially to policy makers, as it provides both linguistic and numerical results. As for the comparison method, that is, MLR, the results give the direction and size of the effect of each variable relative to a dependent variable (Neuman 2011). The effect is precisely measured in value. For instance, when the value gets higher, the effect is larger on a variable that predicts the dependent variable.

Input variables of the number of people, income generated, and water consumption at Kenyir Lake are applied in FIS to produce an output of the level of water demand numerically relative to the 27 rules confirmed by experts. For MLR analysis, the independent variables are the number of people and income generated, and the dependent variable is water consumption at Kenyir Lake. The datasets of the population of people and income generation for all premises at Kenyir Lake are collected thoroughly from on-site investigation and interviews. Water consumption at Kenyir Lake for each premise is estimated and used in these analyses. Both result analyses are evaluated using the statistical parameters of mean absolute deviation (MAD), mean square error (MSE), root mean square error (RMSE), and mean absolute percentage error (MAPE) with the actual water consumption at Kenyir Lake to determine their accuracy and performance.

Materials and methods

Case study

Kenyir Lake is the largest man-made lake in Southeast Asia surrounded by the greenery of diverse flora and fauna. This man-made lake is located in the district of Hulu Terengganu of the state of Terengganu on the east coast of Malaysia (Fig. 1). Its geographical coordinates are the latitudes of between 4° 40′ N and 5° 15′ N and the longitudes of between 102° 32′ E and 102° 55′ E (Rouf et al. 2010). The lake covers an area of 260,000 hectares (Yusof et al. 2009) with a water catchment area of 38,000 hectares (Lola et al. 2017). Its mean depth is 37 m, with a top water level of 145 m above mean sea level (Rouf et al. 2010).

Fig. 1
figure 1

The location of the study area: a the map of Malaysia, b Kenyir Lake

Kenyir Lake was created in the year 1985 by the damming of two main rivers, Terengganu and Terenggan Rivers, to generate hydroelectricity in Malaysia (Suratman et al. 2017). Underwater current from the inflow of river water and the generation of hydroelectric power stations causes a combination of top and bottom water columns (Rosle et al. 2018). Water in the lake replenishes from nine major rivers diverting to the lake, namely, Lasir, Belimbing, Tembat, Ketiar, Leper, Pertang, Kerbat, Terenggan, and Kenyir Rivers. The water level of the lake varies as it works similar to a reservoir. As reported by Rauf et al. (2015), water level is high during monsoon and low during dry season.

The beauty of Kenyir Lake has caught the attention of tourists and has become an ecotourism destination. Several ecotourism activities, namely, fishing, camping, jungle trekking, water sports, staying in houseboats, and exploring the attraction parks, can be done there. It is also known as the haven of sport-fishing and intensive cage aquaculture activities and has a high market value of fish stocks (Sharip and Zakaria 2008). These advantages of Kenyir Lake have contributed to an increase in water demand since there has been an increase in the number of tourists.

Data description

To analyze the water demand at Kenyir Lake, important data, including water consumption values, the number of people, and the income generated from all premises at the lake, should be collected. Information on the respective data were gathered during the year 2018 from the on-site investigation and interviews of respective authorities and local communities. Tables 1, 2, and 3 respectively show the data of average daily water consumption, average daily number of people, and average income generated from the premises at Kenyir Lake.

Table 1 Average daily water consumption at Kenyir Lake
Table 2 Average daily number of people at Kenyir Lake
Table 3 Average daily income generated at Kenyir Lake

Table 1 establishes the average actual water consumption of every premise at Kenyir Lake. Of all the premises, the highest and lowest water consumptions recorded are 662 m3 for houseboats and 1.9 m3 for camping sites, respectively. Houseboats are one of the famous attractions at Kenyir Lake for which locals and tourists are willing to pay for a superior package offered by houseboat operators (Muhamad et al. 2014). The lowest water consumption is from the camping sites because people visited the premise mostly during public and school holidays only.

Table 2 displays the average daily number of people who visited the premises in Kenyir Lake. Houseboats received the highest number of people daily, whereas camping sites received the lowest. The data collected are the data on actual water consumption. Hence, the average number of people responds directly proportional to the average actual water consumption at Kenyir Lake. As years go by, the population will continue to grow and expand, which will be coupled with increasing per capita consumption, leading to a continuous increase in water consumption (Yuanzheng et al. 2012).

The premises at Kenyir Lake generate income mainly from tourism activities. The higher the number of visitors, the more income the premises generate. Table 3 shows that some of the premises, including Herb Park, Tropical Park, KETENGAH and Tourist Information Center, public toilets at Pengkalan Gawi, food stalls at Pengkalan Gawi, and prayer rooms at Pengkalan Gawi, do not generate any income. Both Herb Park and Tropical Park offer free-of-charge entrance as tourists already pay for boating service to go to these premises. Conversely, other premises are public amenities that can be used and enjoyed for free. The level of income generated has a high correlation with the number of people and water consumption. Makki et al. (2015) found that higher water consumption comes most likely from premises with higher income levels and a higher number of visitors.

Model description

FIS and MLR are the two techniques used for water demand analyses. These methods involved variables or factors that influence water demand, that is, water consumption, the number of visitors, and income generated; however, the application of the variables is different between the techniques.

FIS

The fundamental fuzzy logic was initiated by Zadeh (1965). Fuzzy logic is a form of logic that processes and gives reasoning in the rough estimation of linguistic terms (Wu 2015). Some studies of a fuzzy logic approach focusing on the water resource domain include those of Shrestha et al. (1996), who implemented a fuzzy rule-based on reservoir operation, and Faye et al. (2000), who used a fuzzy modeling approach for the long-term management of water resources systems. Fuzzy logic comprises fuzzy sets, fuzzy rules, fuzzy reasoning, and FISs (Jang et al. 1997).

The basis of fuzzy logic is to consider the system states in subsets or fuzzy set form with an example of each labeled with “low,” “medium,” and “high.” The fuzzy set is constituted of elements of varying degrees of membership in the set. A fuzzy set permits the user to find vulnerabilities in data. In this study, the fuzzy sets comprise the input data of water consumption, number of people, and income generated and the output data of the level of water demand linguistically and numerically. There are 27 rules evaluated relatively with their input data using a fuzzy associative matrix (FAM). Figure 2 illustrates a clear overview of the FIS process.

Fig. 2
figure 2

Flowchart of FIS analysis

Figure 2 describes the layout of the FIS process to analyze water demand in the form of values and words. Three input variables in the system inputs are water consumption (0–662 m3), the number of people who consumed water (0–2700 people), and the income generated in each premise (RM 0–RM 24,000). First, the input variables are fuzzified based on their membership functions, and then an evaluation is conducted following the fuzzy inference rules. Then, they will be defuzzified based on their output membership functions, resulting in the volume of water demand (0–670 m3) as an FIS output.

An FAM is developed to ensure that every possible outcome from the FIS analysis is considered (Zolkepli et al. 2014). Figure 3 presents the FAM for the FIS analysis results with three inputs of water consumption, number of people, and income generated. FAM is ideal to present a rule editor in a matrix form by including all possible outputs that correspond to the inputs used in the system. It also relies on the number of fuzzy inputs and their membership functions (Baldovino and Dadios 2016). There are 27 possible rules established from the FAM of this study.

Fig. 3
figure 3

Fuzzy associative matrix. LOW, low; MED, medium; HIGH, high

Upon the completion of fuzzy rule evaluation, water demand analysis was carried on with defuzzification, with the conversion of fuzzy outputs into numerical values, as a numerical output of FIS. The numerical output of FIS was then converted into literature means based on their level of water demand, that is, low, medium, or high. The defuzzification strategy used in this study is the centroid method, which is the most common and preferable among all defuzzification methods (Takagi and Sugeno 1985). The algebraic expression used (Eq. 1) to obtain the volume of water demand is shown below:

$$ {y}^{\ast }=\frac{\int f\left(x,y\right).y\ dy}{\int f\left(x,y\right) dy}, $$
(1)

where y* is the defuzzified output of volume water demand produced in FIS analyses.

Multiple linear regression

Linear regression is a linear modeling that portrays the connection between a dependent variable and an independent variable. For cases with more than one independent variable, the linear regression is known as MLR. MLR analysis is the most efficient tool in utilizing the relationship between the explanatory variable and the outcome (Sharma et al. 2020). The development of MLR model includes the selection of input data, regression analysis, and evaluation criteria on model performance. The relationship in the MLR model is between two or more independent variables and a dependent variable (Abba et al. 2017). MLR equation with both dependent and independent variables is defined in Eq. 2 (Chen and Liu 2015). The usefulness of MLR is meant for a small number of variables, not significantly collinear, and has a strong relationship of variables in the system (Hanrahan et al. 2004).

$$ Y={B}_0+{B}_1{x}_1+{B}_2{x}_2 $$
(2)

where Y is the dependent variable (predicted water demand), B0 is the y-intercept or constant (value of dependent variable “y” when x1 and x2 = 0), B1 and B2 are multiple regression coefficients, and x1 and x2 are independent variables (water consumption, number of people, and income generated).

The effectiveness of the developed MLR model was evaluated using the coefficient of determination (R2), adjusted R2, and p value test. A fraction shows the proportion of the total variation of the selected dependent variable and the explanatory variable (Sahoo and Jha 2013). Adjusted R2 is defined similarly as R2 except for the fact that the number of degrees of freedom is considered. The probability of error involved in approving the validity of observed results is defined as the p value. A p value of 0.05, which is 5% of error and 95% confidence interval, is commonly known as a “border-line acceptable” error level (Haan 2002). The computations of R2 and adjusted R2 are presented in Eqs. 3 and 4, respectively.

$$ {R}^2={\left(\frac{\sum_{i=1}^n\left({x}_i-\overline{x}\right)\left({y}_i-\overline{y}\right)}{N\ {\sigma}_x{\sigma}_y}\right)}^2, $$
(3)

where R2 is the coefficient of determination, xi and yi are ith variables, \( \overline{x} \) and \( \overline{y} \) are mean; σx and σy are standard deviation of the x and y values of variables, respectively; and N is the total sample size.

$$ \mathrm{Adjusted}\ {R}^2=1-\left(\frac{\left(1-{R}^2\right)\left(N-1\right)}{N-p-1}\right), $$
(4)

where R2 is the coefficient of determination and p is the number of variables.

In the MLR model, it is clear that as the number of variables increases, the accuracy of the model increases (Swain and Patel 2017). A good model with several variables has a large percentage of R2 (> 70%) (Neuman 2011). However, high values of R2 due to many variables can be misleading. To overcome this problem, adjusted R2 is computed because the difference between the value of R2 and adjusted R2 determines the accuracy of the model too. Yu et al. (2015) mentioned that a good model has a value of adjusted R2 lesser than that of R2 and the difference between the values of R2 and adjusted R2 is very small. In addition, the p value test was also examined for every regression factor as part of multiple regression analysis. The p value test is used to explore the chance of mistakenly rejecting a null hypothesis, in which the value can be between 0 and 1 and commonly expressed as a percentage (Lawens and Mutsvangwa 2018). A p value of < 5% indicates that the resulting factor is significant.

Statistical evaluation of analysis methods

Statistical evaluation plays an essential role in choosing the right analysis methods. From the evaluation, the forecasting error, which is the difference between the actual value and forecasted values at a given time, was defined (Hanke and Wichern 2014). The common indicators applied in this study to evaluate the accuracy of the analysis are MAD, MSE, RMSE, and MAPE. Regardless of the measure being applied, the most accurate analysis method resulted in the lowest value of the statistical evaluation (Ryu and Sanchez 2003).

MAD measured the overall forecast error of the analysis. The value is computed by dividing the sum of the absolute value of single forecast errors by the sample size (Heizer and Render 2011). Next, MSE is a typically accepted method for the evaluation of exponential smoothing and other techniques. It measures error as the sum of squares of differences between actual and forecast values divided by the number of samples. RMSE is the square root of MSE. Ryu and Sanchez (2003) stated that RMSE is the computed error in terms of units that are equal to the original values. Another statistical evaluation chosen in this study is MAPE, which is usually used in quantitative techniques of analysis (Makridakis et al. 1997). MAPE is the average of the sum of all the percentage errors for the sample dataset, disregarding the signs. Equations 5, 6, 7, and 8 present the MAD, MSE, RMSE, and MAPE equations, respectively:

$$ MAD=\frac{1}{n}{\sum}_{i=0}^n\left|{Y}_i-{F}_i\right|, $$
(5)
$$ MSE=\frac{1}{n}{\sum}_{i=0}^n{\left({Y}_i-{F}_i\right)}^2, $$
(6)
$$ RMSE=\frac{1}{n}{\sum}_{i=0}^n\left|\sqrt{\left({Y}_i-{F}_i\right)}\right|, $$
(7)
$$ MAPE=\frac{1}{n}{\sum}_{i=0}^n\frac{\left|{Y}_i-{F}_i\right|}{Y_i}, $$
(8)

where Yi is the actual value of water consumption, Fi is the forecast value of water consumption using FIS and MLR, and n is the number of premises.

Results and discussion

FIS

Water demand at Kenyir Lake was analyzed using FIS with the information on the input, output, and rules selected accordingly. Each process analyzed in FIS was executed using Matlab. Figures 4, 5, 6, and 7 depict the inputs and the outputs used in the water demand analysis.

Fig. 4
figure 4

Input water consumption

Fig. 5
figure 5

Input number of people

Fig. 6
figure 6

Input income generated

Fig. 7
figure 7

Output levels of water demand

The ranges of values for input water consumption shown in Fig. 4 are 0–8 m3 (low), 8–25 m3 (medium), and 25–662 m3 (high). The water consumption values ranging from 0 to 662 m3 are used as an input for the FIS process.

Figure 5 establishes the number of people ranging from 0 to 2700 as a second input in the FIS process. There are three levels for the number of people, namely, low, medium, and high, with ranges of 0–60, 60–300, and 300–2700 people, respectively.

The third input used is the income generated, which ranged from RM 0 to RM 24,000 as depicted in Fig. 6. The levels of income generated used are low (RM 0–RM 300), medium (RM 300–RM 6000), and high (RM 6000–RM 24,000).

Figure 7 illustrates the ranges for the output levels of water demand for this FIS analysis. The values of the low, medium, and high levels of water demand are 0–8, 8–25, and 25–670 m3, respectively. Figure 8 shows the entire process of the FIS for this water demand analysis. Three left columns and the most-right column in the figure refer to the inputs utilized and the output inferred by FIS, respectively. The process undergoes linguistic variable fuzzification through defuzzification of the aggregate output to evaluate the numerical values of the expected water demand analyzed.

Fig. 8
figure 8

FIS process of water demand analysis with 27 IF–THEN rules

From Fig. 8, the yellow area in each set shows which degree of the input value is determined as a member of the associated fuzzy set. The blue area depicts which degree of a related fuzzy set is chosen based on the input data used, whereas the blue area at the most bottom represents the aggregated output of the fuzzy set. The red line at the middle of the input variables of the number of people, income generated, and water consumption shows their numerical values. Nevertheless, the red line on the output column from Fig. 8 depicts the numerical values of the water demand inferred by FIS through the entire process. One example of the FIS process related to Fig. 8 is as follows: the input number of 1400 people, RM 12,500 income generated, and 340 m3 water consumption that resulted in 345 m3 of estimated water demand.

The three-dimensional curved surface shown in Figs. 9, 10, and 11 illustrates the output variables of water demand analysis at different input variables. Specific values of water demand correspond to each operating condition. The summary that can be drawn from the figures is water demand reacts to the number of people and water consumption rather than to income, since as the number of people and water consumption increase, water demand increases. Admission to some of the premises at Kenyir Lake is free of charge; hence, no income is generated from them. Thus, income is less influential than the number of people and water consumption. Table 4 presents the full results of forecasted water consumption at Kenyir Lake using FIS analysis.

Fig. 9
figure 9

Three-dimensional curved surface for the input number of people and income generated

Fig. 10
figure 10

Three-dimensional curved surface for the input number of people and water consumption

Fig. 11
figure 11

Three-dimensional curved surface for the input water consumption and income generated

Table 4 Actual and forecasted water consumption at Kenyir Lake

MLR

The MLR model used in this study comprises two independent variables, namely, the number of people and the income generated. The regression function constructed from MLR model (Eq. 2) is given as:

$$ Y=-22.66854+0.15743\ {x}_1+0.00916\ {x}_2. $$
(9)

The constant B0 of − 22.66854 represents water demand when the number of tourists and income generated are 0. In this study, historical data showed an increase of tourists (the number of people) in the study area yearly, which corresponds perpendicularly to the income growth anticipated yearly.

From the MLR model, the results show that the number of people is more influential to water demand than to income generated. This can be proven by the fact that premises with a low number of people have negative water demand values. Those premises include offices of KETENGAH and Tourist Information Center, camping sites, National Park, Kelah Sanctuary Park, Tropical Park, Orchid Park, Lawit Lodge, and Kenyir Eco Resort. As explained by Neuman (2011), the positive and negative signs of the resulting values only show the direction and effect of the variables to the dependent variable. In this study, Table 4 establishes the forecasted water consumption for every premise at Kenyir Lake using MLR analysis.

The MLR results are also being validated with the coefficient of determination (R2), adjusted R2, and p value tests. The computed values of R2 and adjusted R2 are found to be 0.9684 and 0.9653, respectively. R2 and adjusted R2 values are also expressed as 97% of the variation in water demand values estimated to be relative to the variation in the independent variables (the number of people and the income generated), which shows an extremely satisfactory agreement. In addition, the resulting R2 value in this study has a good linear approximation of the regression function because only 3% error was found in the model. Differences between R2 and adjusted R2 are very small and sufficient to justify the predictability of the model.

The p values for variable x1 (the number of people) and x2 (the income generated) are found to be 0.01008 and 0.00132, respectively. Thus, the p values for both variables are significant (< 5%) in presenting the target variable, and the null hypothesis can be rejected. The validation from R2 and adjusted R2 of the model and the p value of the independent variables show that the final model and variables used can be considered notable.

Summary

Water demand values for all premises at Kenyir Lake have been analyzed using FIS and MLR. Table 4 presents the results for both analyses and the actual water consumption at Kenyir Lake. The results of water demand analyses via FIS and MLR range from 0.47 to 660.76 m3. The values are compared with the actual water consumption at Kenyir Lake to determine the accuracy and performance of each analysis. FIS analysis relies on membership functions, range of membership functions, and fuzzy rules to estimate water demand. Conversely, the MLR technique accesses both independent and dependent variables to produce water demand values.

As shown in Table 4, there is a large difference between the water demand values from FIS analysis and the actual water consumption. However, this is not applicable to all premises being analyzed because of the free-of-charge entrance for some premises. This also led to a low connection between the membership function and the variable. Premises that have a large gap between the actual water consumption are public toilets at Pengkalan Gawi, Kenyir Water Park, houseboats, Rumah Rehat Persekutuan, Musang Kenyir Resort, and Lake Land Resort. One of the premises that have a slight gap between FIS result and actual water consumption is Tropical Park at 1.00 m3 and 2.00 m3, respectively. Although this study has developed a FIS model to analyze water demand, the results obtained are responsive to the type of membership function utilized. Thus, FIS has dimensionality problems that come from the increased number of fuzzy sets (Moorthi et al. 2018).

The outcome of water demand analysis at Kenyir Lake using MLR is more reliable than using FIS, since most of the water demand values from MLR are almost similar to the actual water consumption, as shown in Table 4. For instance, the water demand value using MLR and the actual water consumption for houseboats are very close, that is, 660.76 m3 and 662.00 m3, respectively. Total water demand values for all premises at Kenyir Lake for FIS and MLR analyses are 2314.38 m3 and 1358.22 m3, respectively. Also, the total actual water consumption is 1249.98 m3, which is the least difference with water demand value using MLR. Lawens and Mutsvangwa (2018) agreed that water demand forecasts using statistical analysis (MLR) have great benefits, since the significant driver can be identified. In addition, the MLR analysis used is valuable for future research on water modeling (Sundari et al. 2013).

Forecasting errors for FIS and MLR analysis were evaluated using statistical indices. Table 5 presents the statistical results for both analyses. The values of MAD, MSE, RMSE, and MAPE for 23 premises using FIS analysis are 76.83, 23103.41, 152.00, and 262.83%, respectively. Conversely, the MAD, MSE, RMSE, and MAPE of MLR analysis are found to be 8.96, 244.96, 15.65, and 108.87%, respectively. Given the statistical parameter, MLR analysis predicts water consumption more accurately than FIS analysis because the forecast error values of MLR analysis are smaller than those of FIS analysis. MLR is the most appropriate analysis method in terms of accuracy (statistically) and simplicity. Similar to other studies (Karmaker et al. 2017; Mehri 2013; Ryu and Sanchez 2003), the best forecasting method is chosen based on the smallest forecasted errors of MAD, MSE, RMSE, and MAPE.

Table 5 Statistical evaluations for FIS analysis and MLR analysis

The methods of FIS and MLR are valid analysis tools and great visualization for the relationships between the variables. Researchers, decision makers, and the community can have a better understanding of results via FIS because this method provides verbal terms. Factors that influence water demand can also be analyzed using MLR. Water demand analysis can be extended using other techniques by gathering more variable datasets from the study area. A longer duration of research study provides an opportunity for the data to be widely explored, hence rendering meaningful effects on the study.

The application of the analysis techniques to determine the supply and demand at Kenyir Lake, Terengganu, contributes to sustainable water demand management. Water is closely related to most paths of sustainable development, but whether water is used or abused depends on water managers and communities (Cosgrove and Loucks 2015). Hence, the participation of communities in improving water demand management is very crucial as they can develop an environmentally friendly behavior and reconstruct their intention toward it. A study by Aliabadi et al. (2020) proved that rural people focus more on the advantages and benefits of sustainable water management because they know the importance of water resources, human health, and environmental impacts.

Conclusions

The values of water demand analysis at Kenyir Lake, as revealed by the experimental results of FIS and MLR, are 2297 m3 and 1471 m3, respectively. It can be shown that MLR analysis converges closer to the actual water consumption of 1249.8 m3 than does FIS analysis. Water demand analysis by MLR is found to be valid and highly applicable as the method also can be proven by its R2 of 0.9684 and adjusted R2 of 0.9653.

The techniques proposed in this study, namely, FIS and MLR analyses, have their distinctive benefits in analyzing the water demand. FIS analysis interpreted the input variables and produced linguistic and numerical terms by considering the fuzzy rules confirmed by the experts. Experts’ knowledge contributes to the great reliability of the results. Conversely, MLR results provide verifications from R2, adjusted R2, and the numerical value of the effect on each variable. From this, the resulting water demand can be verified easily. MLR is a very direct and quick technique that requires only a few variables (Luna et al. 2017). The statistical evaluations also showed that MLR analysis outperformed FIS analysis by smaller values of forecasted errors. The MAD, MSE, RMSE, and MAPE values of MLR analysis are 8.96, 244.96, 15.6, and 108.87%, respectively.