1 Introduction

Frequent pipe breaks in water distribution networks (WDNs) are one of the major challenges facing water utilities worldwide (Ghorbanian et al. 2015). This issue can cause high direct and indirect economic, environmental and social costs and remains a significant concern due to some adverse consequences, including water and energy losses, substantial repair cost, service level interruptions, traffic delays, and water contamination in distribution networks. Besides, the majority of water utilities are operating under limited budgets and scarce capital resources. Therefore, as substantial investment is required to renew and replace pipe networks, pipe breaks should be avoided as great an extent as possible (Martínez-Codina et al. 2015).

For all these reasons, Several predictive break models have been developed to assist water utilities for future planning, prioritizing and decision-making of maintenance, rehabilitation, and replacement (M/R/R) activities (Kabir et al. 2015, 2016; Kakoudakis et al. 2017). These models can be classified into physical and statistical models (Kleiner and Rajani 2001; Nishiyama and Filion 2013). Physical models attempt to model the physical mechanisms of pipe breaks which are often very complex to be completely understood (Rajani and Kleiner 2001).

The statistical models have been developed to identify patterns of pipe breaks and failure behaviors based on historical data of pipe breaks. These models can be applied with various levels of input data, even when few data is available (Kleiner and Rajani 2001; Nishiyama and Filion 2013; Scheidegger et al. 2015). Besides the statistical models, a number of newer methods have been employed based on data mining techniques due to the complexity of water networks (Berardi et al. 2008; Xu et al. 2011). These methods include techniques like artificial neural networks (Jafar et al. 2010; Shirzad et al. 2014; Kutyłowska 2015), genetic programming (Babovic et al. 2002; Xu et al. 2011) and evolutionary polynomial regression (Giustolisi and Savic 2006, 2009; Berardi et al. 2008; Kakoudakis et al. 2017).

Input variables in the statistical models may be referred to as effective factors that contribute to unintentional pipe breaks and affect breakage rates. These factors can be classified into three main categories: physical, operational, and environmental factors (Kleiner and Rajani 2001; Al-Barqawi and Zayed 2006). They include pipe age, pipe material, pipe diameter, soil type, traffic loading, water pressure, water quality, and climatic conditions such as temperature, rainfall, and groundwater condition. It should be noted that pipe break data are often needed to be segregated to homogenous groups with similar properties (operational, environmental and physical) to better identify breakage patterns and to obtain statistical significance (Kleiner and Rajani 2001; Xu et al. 2011).

Water pressure is one of the main operational factors contributing to the break rates of pipes in water networks and there is a direct link between the occurrence of pipe break and operating pressures. Several researchers have emphasized that high pressure has a substantial influence on pipe break rates (Pearson et al. 2005; Thornton and Lambert 2005, 2007; Martínez-Codina et al. 2015). It has been consistently reported in the literature that the frequency of pipe breaks significantly reduced after implementation of pressure management (PM) in many WDNs throughout the world (Lambert 2002; Thornton and Lambert 2006, 2007; Girard and Stewart 2007; Palau et al. 2012). PM is usually associated with maximum pressure control using pressure-regulating valves (PRVs) in order to reduce excessive pressure at certain times of the day. In addition, the effectiveness of implementing PM schemes mainly depends on the conditions of the distribution network (Thornton and Lambert 2007; Vicente et al. 2016). Therefore, the analysis of the relationship between pressure and pipe breaks is necessary for implementing effective PM schemes and prioritizing M/R/R activities.

The majority of statistical models have been focused primarily on describing pipe break rates based on a combination of effective factors contributing to pipe breaks (Berardi et al. 2008; Wang et al. 2009; Kabir et al. 2010; Xu et al. 2011). The most frequent factors have been considered to predict pipe break rates include physical and environmental factors like pipe age, pipe material, pipe length, pipe diameter, previous pipe breaks, temperature and soil corrosivity (Kleiner and Rajani 2001; Ghorbanian et al. 2015; Kabir et al. 2015). Little effort in previous studies has been made to investigate the relationship between pressure and occurrence of pipe breaks, since it is difficult to establish such a relationship (Lambert and Fantozzi 2010; Martínez-Codina et al. 2015). Moreover, because of the scarcity of filed data, pressure in some predictive break models is often estimated by hydraulic simulation models. The estimated values of pressure may be included a degree of uncertainty, especially in large WDNs, associated with the input parameters such as water demands and hydraulic conductivity of pipes (Ghorbanian et al. 2015). This may be lead to provide imprecise results by predictive pipe break models.

The International water Association task force conducted a number of studies to demonstrate the effects of PM on reducing pipe breaks. They pointed out that pipe break frequency similar to leakage can be related to a power component of pressure (Lambert 2001; Farley and Trow 2003). Thus, the N2 exponent was proposed to predict changes in new pipe break occurrences (B0 to B1) as pressure varies (P0 to P1) following the exponential relationship: (\( \raisebox{1ex}{${B}_1$}\!\left/ \!\raisebox{-1ex}{${B}_0$}\right.={\left(\raisebox{1ex}{${P}_1$}\!\left/ \!\raisebox{-1ex}{${P}_0$}\right.\right)}^{N_2} \)). Following studies showed N2 values can widely vary between 0.2 and 12 for water mains and service connections, and thus, the N2 approach for analysis of pressure-pipe break relationships was inappropriate (Pearson et al. 2005; Thornton and Lambert 2005; Lambert et al. 2013). Thornton and Lambert (2007) demonstrated that the reduction in new pipe breaks depends on the break frequency before implementing PM. They introduced a conceptual approach to provide an explanation of the variability of results between different zones in response to pressure changes (Thornton and Lambert 2007). This concept explains that the break frequency of a specified zone has a relatively low value until some particular pressure is exceeded, and then small increases in pressure may cause break frequency increases rapidly. However, they did not explain how this particular pressure ranges may be identified. A general form of equation through the use of N2 exponent was developed to explain the conceptual approach based on the several sets of field data (Lambert and Thornton 2012).

Martínez-Codina et al. (2015) proposed a new methodology based on a Bayesian approach for identifying maximum pressure ranges that is most likely to reduce pipe breaks. The results confirmed that the probability of pipe breaks increases for specific pressure ranges. Thus, they suggested that maximum operating pressure should have an upper limit for the reduction of the probability of breaks. However, the recorded pipe breaks for all materials were used to determine pressure ranges in their study, as the number of pipe breaks for each type of pipe material was too small for statistical inference. This may produce statistical noise in data and causing imprecise results in the estimation of the probability of pipe breaks, as different pipe materials are likely to respond differently to changes in pressure (Lambert and Thornton 2011).

The main objective of this paper is to investigate the relationship between operating pressures and pipe break frequency in WDNs. A field data-based methodology is proposed to describe pressure-pipe break relationships by determining break rate functions (BRFs). Moreover, when the BRFs are established, the Bayes’ theorem is applied to identify particular pressure thresholds on the BRF curves in which the pipe break rates increase rapidly for small increases in pressure. The estimated pressure ranges can assist water utilities in the implementation of effective PM for reducing pipe breaks while providing adequate service pressure. To demonstrate the applicability of the proposed methodology, it was applied to a large portion of the WDN of Tehran, Iran. This study provides a more comprehensive understanding of pressure-pipe breaks relationship in WDNs. The developed BRFs can be applied to estimate expected pipe breaks based on the probability distribution of operating pressures. The results can also be incorporated into the development of economic models to determine financial benefits of PM.

2 Methodology

It is well established that maximum operating pressure is a prime or contributory factor to the occurrences of pipe breaks in WDNs. This operational factor may cause an increase in pipe breaks coupled with the physical and environmental factors. Therefore, it is essential to establish a relationship between operating pressure and pipe breaks. The proposed methodology uses field data of pressure and recorded pipe breaks to develop such a relationship by determining BRFs. The methodology is summarized in Fig. 1, comprising three main steps. The first step includes collection and data filtering of pressure and pipe breaks. In the second step, the BRFs for different pipe materials of water mains are determined to describe the relationship between pressure and pipe break rates. Finally, the Bayes’ theorem is applied in order to determine the maximum pressure thresholds on the BRF curves that could have a substantial influence on the reduction of pipe breaks.

Fig. 1
figure 1

The proposed methodology for developing BRFs in WDNs

2.1 Data Collection and Filtering

To precisely describe the relationship between pressure and pipe breaks based on BRFs, the collecting adequate field data of operating pressure and pipe breaks is important. Operating pressure in an individual zone of a WDN is characterized by means of a pressure indicator, which is defined as the calculated statistics from the time series of pressure head over a specific time window (Martínez-Codina et al. 2015). Here, instantaneous pressure values for the calculation of the pressure heads are measured at the Average Zone Point (AZP) of the study area where the pressure variations can be considered to be representative of the zone average (ILMSS 2013; Renaud et al. 2015). Subsequently, the Maximum Pressure Indicator (MPI) is calculated as the maximum value of the time series of the pressure head over the time window.

A systematic approach is applied to identify the location of the AZP at an individual zone (WSAA 2009; ILMSS 2013; Moslehi et al. 2020). First, the geographical information system (GIS) tool is exploited to calculate the weighted average ground level (WAGL) of the zone. Then, a physical point in the center of the zone is selected as AZP for pressure measurements with the same calculated WAGL. The instantaneous pressure values are measured at the AZP with regular time step (e.g., every10-min. Finally, the time series of the hourly average pressure heads are calculated from the measured instantaneous pressure values and the set of MPI values is calculated from the hourly average pressure time series over a specific time window.

Pipe breaks data is usually collected and recorded annually by the water utilities in a database. The major information about each pipe break in a database may often be included the type of repairs, causes of breaks (i.e. internal or external causes), failure modes, the location of a pipe break, the time when a pipe break is first reported and pipe characteristics such as pipe type (i.e., water main or service connection), pipe diameter and pipe material. Data filtering of recorded pipe breaks data into homogenous groups with similar properties is necessary to use the proposed methodology, which can be based on pipe characteristics (e.g., type and material), failure modes and causes of breaks.

2.2 Development of BRFs

In the second step of the methodology, the relationship between pressure and pipe breaks is established by developing BRFs where an MPI is associated with an average level of pipe break rates (Filion et al. 2007; Ghorbanian et al. 2015). The BRF in an individual zone (as depicted in Fig. 2) can be calculated using the following equation (Lambert and Thornton 2012):

$$ BRF={BF}_{npd}+{BF}_{pd}={BF}_{npd}+A\times {MPI}_{AZP}^{N_2} $$
(1)

where BFnpd = non-pressure dependent break rate (not influenced by pressure); BFpd = pressure-dependent break rate (varies with pressure). MPIAZP = the MPI value at the AZP; N2 = the exponent of pressure-pipe break relationship. The “A” = a zone parameter influencing the curve slope in pressure-dependent part. Moreover, the following relationship can be used to assess the effect of PM on pipe break rates (Lambert and Thornton 2012):

$$ R=\left(1-{C}_b\right)\left(1-{\left(\raisebox{1ex}{${MPI}_{AZP_1}$}\!\left/ \!\raisebox{-1ex}{${MPI}_{AZP_0}$}\right.\right)}^{N_2}\right) $$
(2)

where R = reduction of pipe breaks due to implementing PM; Cb = the ratio of BFnpd part to initial break rate before PM implementation (BF0) that varies between [0 1]; MPIAZP0 and MPIAZP1 represent the MPI values at AZP before and after PM implementation, respectively.

Fig. 2
figure 2

The hypothetical break rate function

The BFnpd for an individual zone may be affected by some influential factors such as pipe age, ground movement, traffic loading, low temperatures and quality of installation (Lambert and Thornton 2012; Ghorbanian et al. 2015). An Initial estimate of BFnpd can be considered as the lower boundary of the data points in a plot that depicting break rates against average zone night pressure (Lambert et al. 2013). Conversely, the BFpd is a function of MPIAZP to the N2 exponent. Lambert and Thornton (2012) recommended the value of N2 exponent is recommended to be close to 3 (Lambert and Thornton 2012). As different pipe materials may respond differently to pressure variations, the value of N2 exponent is estimated for various materials.

The developed BRFs is evaluated by means of model performance indicators, including the coefficient of determination (R2) and the root mean square error (RMSE). Their mathematical relationships are defined as follows:

$$ {R}^2=\frac{{\left({\sum}_{i=1}^n\left({y}_{p,i}-{\overline{y}}_p\right)\left({y}_{o,i}-{\overline{y}}_o\right)\right)}^2}{\sum_{i=1}^n{\left({y}_{p,i}-{\overline{y}}_p\right)}^2{\sum}_{i=1}^n{\left({y}_{o,i}-{\overline{y}}_o\right)}^2} $$
(3)
$$ RMSE=\sqrt{\frac{\sum_{i=1}^n{\left({y}_p-{y}_o\right)}^2}{n}} $$
(4)

where yp= predicted values; yo= the observed value; \( {\overline{y}}_p \)= the mean of predicted value and \( {\overline{y}}_o \)= the mean of the observed value, and n is the total number of data points.

It should be noted that two simplifications have been considered to develop a BRF. Firstly, each level of the maximum pressure indicator is associated with an average break rate in determining the BRF. Secondly, the maximum pressure is considered to be the only factor controlling pipe break rates (Filion et al. 2007; Ghorbanian et al. 2015). In essence, regarding the break causes, this study focused on the pressure-related pipe breaks. Moreover, the developed BRF may be valid for an individual zone with a group of homogenous pipes, including water mains or service connections with predominant pipe materials under similar environmental conditions (Lambert and Thornton 2012; Ghorbanian et al. 2015). Thus, no effort is made to derive a general relationship between the maximum pressure and pipe break rates for a WDN as a whole in the proposed methodology.

2.3 Identify the MPI Threshold on the BRF

As shown in Fig. 2, there may be an MPI threshold on the developed BRF where for a small increase in threshold may cause a large increase in pipe break rate. The pressure threshold can be identified by a probabilistic approach based on the Bayes theorem. The approach uses pipe breaks data for different pipe materials and pressure heads at the AZP to estimate cumulative distribution functions (CDFs) of the MPI values under normal operation of pressure regime, and when they are coincident with a reported pipe break. These two estimated CDFs are referred to as the unconditional and the break-conditioned CDFs of the MPI, respectively (Martínez-Codina et al. 2015). Subsequently, the estimated CDFs are compared to calculate a probability ratio (PR), which allows the identification of the pressure indicator threshold on the developed BRF.

The hourly average pressure time series at AZP with a moving time window is analyzed to calculate the set of unconditional MPI values. The unconditional CDF of the MPI is then computed from the calculated set. Furthermore, the set of MPI values conditioned to breaks is obtained using recorded pressure heads before each reported pipe break over a specific time window. Thus, the number of pipe breaks is equal to the number of MPI conditioned to breaks. Eventually, the empirical break-conditioned CDF is estimated by the MPI values conditioned to breaks (Martínez-Codina et al. 2015). It should be noted that the time window to derive the maximum pressure indicator conditioned to breaks include awareness time, which has been reported in the literature between 1 and 3 days (Fanner and Thornton 2005; AWWA 2009; European Commission 2015).Moreover, the selected time window is the same window width used to calculate the set of break-conditioned and unconditional MPI values.

The number of recorded pipe breaks for different pipe materials is often not enough during the time period of data records to estimate the break-conditioned CDF precisely. This may be lead to instabilities in the estimation of the probability of pipe breaks (Martínez-Codina et al. 2015). Therefore, this study proposes that the empirical break-conditioned CDF is fitted to a non-parametric break-conditioned CDF, which is estimated by means of the kernel distribution function. The Chi-squared test is then applied to assess whether the non-parametric break-conditioned CDF is followed by the sample of break- conditioned values of the MPI. In this application, the Chi-squared test assesses whether a sample comes from a specified probability distribution function (null hypothesis). The null hypothesis, H0, and the alternative hypothesis, HA, may be written as follows:

$$ {H}_0=X\to {F}_0(x)={F}_1(x) $$
(5)
$$ {H}_A=X\to {F}_0(x)\ne {F}_1(x) $$
(6)

where x = sample values of the MPI conditioned to breaks; F0(x) = the empirical break-conditioned CDF and F1(x) = the non-parametric break conditioned CDF. The null hypothesis (H0) should be rejected at any significance greater than the p value.

The relationship between the estimated CDFs is established by Bayes theorem, which the probability of having a break in a specific time window when the MPI takes a value in the interval [Iα, Iβ] can be calculated as follows:

$$ P\left(B|I\right)=\frac{P\left(I|B\right).P(B)}{P(I)} $$
(7)

where P(B) = the system-wide probability of pipe breaks, P(I) = the probability of the MPI when it is in the interval [Iα, Iβ], P(I| B) = The conditional probability of the MPI when takes a value in the interval [Iα, Iβ], if there is a break in a specific time window. Statistically, if the maximum pressure is not influential on pipe break rates, then the conditional probability,P(I| B), and unconditional probability, P(I), are similar and may be independent events, P(I| B) = P(I). Conversely, if these two events are dependent, it may be concluded that maximum pressure may cause a decrease or increase of pipe break rates andP(I| B) ≠ P(I). The ratio between them is referred to the probability ratio (PR), which allows the identification of a pressure indicator threshold on the BRF curve and calculated as follows:

$$ PR=\frac{P\left(I|B\right)}{P(I)} $$
(8)

The conditional probability (P(I| B)) and unconditional probability (P(I)) can be calculated using break-conditioned and unconditional CDFs as follows:

$$ P(I)=P\left({I}_{\alpha}\le {I}_i\le {I}_{\beta}\right)={F}_I\left({I}_{\beta}\right)-{F}_I\left({I}_{\alpha}\right) $$
(9)
$$ P\left(I|B\right)=P\left(\left[{I}_{\alpha}\le {I}_i\le {I}_{\beta}\right]|B\right)={F}_{I_B}\left({I}_{\beta}\right)-{F}_{I_B}\left({I}_{\alpha}\right) $$
(10)

where FI= the unconditional CDFs of the MPI and \( {F}_{I_B} \)= the break-conditioned CDFs of the MPI.

3 Case Study

The proposed methodology is applied to a large zone of the WDN of Tehran, Iran. Figure 3 shows a GIS map of the zone. The network in the zone supplied by gravity and consist of 516 Km length of mains with a total of 9967 individual pipes. The general characteristics of the zone are summarized in Table 1. The mixed main materials of the zone comprise of ductile iron (DI), steel, asbestos cement (AC), polyethylene (PE) and polyvinyl chloride (PVC). As shown in Table 1, the predominant pipe materials are 76% DI and 17% PE, as more than 93% of the mains are made from these materials. Therefore, this work is conducted based on the collected data from DI and PE pipe materials.

Fig. 3
figure 3

Representation of the zone considered in this study

Table 1 General characteristics of the study area

Pipe breaks data of the studied zone for a period of 9 years (Mar-2008 to Mar-2017) were collected and filtered from the database. Table 2 shows the number of pipe breaks and annual failure rates (number of breaks/100 km/year) for the selected time periods and pipe materials. The periods of time are considered two consecutive years both before and after implementing PM, as it may assist in predicting more precisely the BRFs. It should be mentioned that there was not detailed information about modes of failure for different water mains in the database. Therefore, no attempts were made to associate a failure mode to a recorded pipe break in order to identify and remove those pipe breaks that were not caused by pressure. Moreover, only unintentional pipe breaks were considered in the present study. It should be noted that intentional pipe breaks occur mainly due to the improper execution of construction work, either directly through excavators or indirectly through vibrating construction machines or heavy vehicles.

Table 2 pipe break rates before and after pressure management for PE and DI water mains

The GIS tools was exploited to identify the ground level of every service connection within the zone. Subsequently, the WAGL of all service connections was calculated and an appropriate point with ground level close to the calculated WAGL and near the center of the zone was selected as the AZP. Instantaneous pressure values were measured at the AZP of the zone every 15 min. The hourly average pressure heads were then calculated from the average of the four recorded pressure values over each hour. Because of the cyclical behavior of water pressure, the time window was selected 24 h or 1 day (Martínez-Codina et al. 2016). The set of MPI values were obtained from the maximum of hourly average pressure values for each individual day. Subsequently, the maximum and average of the set of the MPI values over 1 year were calculated to develop the BRFs for different pipe materials in the zone. It should be noted that the elevation difference between WAGL and the AZP was added to the obtained values to adjust the MPI values. The WAGL and the AZP elevation were found to be 1545.2 and 1553.1 m, respectively. Table 3 shows the annual maximum and average of the set of MPI values as well as the time period of data records, which is 2 years before and after implementation of PM.

Table 3 The calculated MPI values for the studied zone before and after PM

4 Results and Discussion

Following the proposed methodology, the filtered data were used to establish the BRFs for different pipe materials that best fit the data. The BFnpd values were estimated using a quick method proposed by Lambert et al. (2013). The estimated values of the BFnpd were found to be 23.8 and 1.28 breaks/ 100 km/year for PE and DI, respectively (Table 4). Table 5 shows the developed BRFs and the calculated model performance indicators for PE and DI pipe materials. The graphical plots of the developed BRFs are illustrated in Figs. 4 and 5.

Table 4 Estimation of BFnpd for PE and DI water mains
Table 5 the developed BRFs for the selected pipe materials in the study area
Fig. 4
figure 4

Break rate function for PE material using a) Annual maximum of the MPI values; b) Annual average of the MPI values

Fig. 5
figure 5

Break rate function for DI material using a) Annual maximum of the MPI values; b) Annual average of the MPI values

The results provided in Table 5 demonstrate that an improvement for the two performance indicators (R2 and RMSE) is achieved when the annual average of the MPIs was used to develop the BRFs. More specifically, the comparison of the annual average with the annual maximum of the MPI shows that the improvement for PE and DI pipe materials is 9%, 17% for R2, and 38.5%, 82% for RMSE, respectively. The improvement achieved can be linked to the fact that the annual average of the MPIs can better represent the failure behavior of water mains. This is because the studied zone is operated under normal pressure regime and often not subject to surges or pressure transients due to supplying the network by gravity with continuous water supply in the time period of the analysis. Moreover, after implementing PM, the reduction of surges and large variations in pressure may cause the MPIs do not interact to the same extent with the other contributing factors to increase the frequency of pipe breaks Thus, the calculated annual average of the MPIs may be close to those MPI values contributing pipe breaks.

Consequently, the calculated annual average of the MPIs may be an adequate indicator to represent the influence of maximum pressure on the occurrence of pipe breaks. It should be noted that this indicator is not in agreement with the indicator suggested by Lambert and Thornton (2011), which is the maximum static pressure at the AZP, plus any pressure transients and surges over time period (i.e., the annual maximum of the MPIs).

In Table 5, the estimated N2 values are also given for PE and DI pipe materials. As can be seen, the estimated values of N2 exponent are significantly different for the two MPIs. The N2 values for the annual average of the MPIs were found to be higher than by average 60% of the annual maximum of the MPIs. Additionally, the estimated values of N2 exponent for the annual average of the MPIs are near the suggested value for the N2 exponent (i.e., close to 3) in other studies (Lambert and Thornton 2012; Lambert et al. 2013). Furthermore, the N2 values for the selected pipe materials should be noticed. As shown in Table 5, the estimated values of N2 exponent for PE are higher than the DI material. One conclusion derived from the results is that maximum pressure is more influential on pressure-dependent pipe break rates for PE pipe materials. As a consequence, N2 value would be dependent on pipe material, and may be different for each material. Therefore, if there is no scarcity of pipe breaks data, the value of N2 exponent should be estimated for each pipe material separately in an individual zone.

Over period of years, the slope of the developed BRF curves in the pressure-dependent part may increase because of influential factors contributing to pipe breaks such as age, corrosion, operating characteristics, climate and soil conditions (Thornton and Lambert 2007). Moreover, some local conditions of the zone, including ground movement, traffic loading, quality of installation, and seasonal influences, may change the non-pressure dependent pipe break rates (Creaco and Walski 2017). Therefore, a unique BRF should be provided under similar operational and environmental conditions in a zone for each type and pipe material when sufficient field data is available to obtain a robust statistical analysis. In addition, the determination of the BRFs should be periodically updated to incorporate the changing conditions of the network.

The predicted reduction of pipe break rates (R) for PE, and DI pipe materials are presented in Table 6. It should be mentioned that the Cb values were calculated using the estimated BFnpd, and the actual pipe breaks before PM (BF0). The obtained Cb values were found to be 0.2 and 0.13 for PE and DI materials, which can be explained in terms of pipe ages. The Cb usually has higher values in new networks because the more numbers of pipe breaks in new WDNs would not be directly related to pressure and may be associated with BFnpd. Conversely, in old WDNs, the more numbers of pipe breaks are usually associated with pressure-related failures, and thus, Cb has lower values (Lambert et al. 2013; Creaco and Walski 2017). The estimated reduction of pipe breaks (R) was found to be 63.31%, 61.8% for PE, DI pipe materials, respectively (Table 6). The estimated values for pipe breaks reductions can be explained in terms of Cb values and the reduction of the maximum operating pressures, which can cause significant reductions of pipe breaks in the zone Table 6 shows that the predicted reductions of pipe breaks will reach values around 60% if the MPI is decreased around 40%. These results suggest that the reduction of maximum pressure is significantly influential on the reductions of pipe break rates in the study area.

Table 6 The reduction of pipe breaks for PE and DI materials

In the last part of the methodology, after the BRFs were developed, the maximum pressure thresholds on the BRFs are identified where pipe break rate increases rapidly for small pressure increases. For this reason, the Bayes theorem was applied to determine a pressure ratio that compares the probability distribution of unconditional and break-conditioned of the MPI.

The set of the unconditional MPI values as a random variable was used to derive the unconditional CDF. Moreover, the empirical break-conditioned CDF was estimated by the MPI values conditioned to breaks. It should be noted that the time window used to determine the MPI values were selected 1 day. As the number of pipe breaks for the selected pipe materials is not long enough to estimate the break-conditioned CDF precisely, the empirical break-conditioned CDF was fitted to a non-parametric kernel distribution to determine the probability distribution of the MPI conditioned to breaks. In order to assess whether the break-conditioned distribution follows the same non-parametric kernel distribution function in the fitting, the Chi-squared test was applied. Table 7 presents the test results at 95% confidence level and the non-parametric CDF parameters for the zone.

Table 7 Test results (p value) of the Chi-squared test for the empirical break-conditioned CDF of the MPI at the 95% confidence level for the zone

As shown in Table 7, the results of the Chi-squared test demonstrate that p values is above 0.05 for PE and DI materials. This is inferred from the result that the probability distribution functions of the MPIs conditioned to breaks are reproduced quite well by the non-parametric distribution function, at a confidence level of 95%. Figure 6 graphically represents the unconditional CDF, the empirical break-conditioned CDF, and the non-parametric break-conditioned CDF of the MPI values for the PE and DI pipe materials. In order to compute the unconditional and break-conditioned probabilities of the MPI, the total range of the MPI on the developed BRFs was divided into 17 pressure intervals with equal length. The probability of the MPI for a specific pressure interval [Iα, Iβ] was then calculated using the unconditional and the non-parametric break-conditioned CDFs. Subsequently, the probability ratio (PR) was determined using the ratio between the computed probability of the unconditional and break-conditioned MPI in each pressure interval. Figure 7 shows the calculated PR values for PE and DI materials in the zone. The unit PR is distinguished by the horizontal solid line. This is inferred from the result that the maximum pressure clearly influences the probability of the occurrence of pipe breaks and increases the pipe failure rates, as the probability ratio is greater than 1 for certain maximum pressure ranges. At a practical level, these ranges should be avoided to reduce pipe break rates and implement effective PM in the zone. Conversely, the reduction of maximum pressure has will not necessarily result in decreasing pipe breaks when the probability ratio is smaller than 1. However, the residual working life of pipes can be extended, as it is explained in other studies (Thornton and Lambert 2007; Lambert and Thornton 2011; Lambert et al. 2013). As can be seen in Fig. 7, all PRs do not follow a definite or regular trend, although it is expected an increment trend with increasing maximum pressure values. This may be due to the fact that pipe breaks data is not long enough that the probability distribution of pipe breaks can be estimated precisely.

Fig. 6
figure 6

a The estimated CDFs of the MPI values for PE pipe material; b The estimated CDFs of the MPI values for DI pipe material. Unconditional CDF (dots), empirical break-conditioned CDF (open circle) and non- parametric break conditioned CDF (solid line)

Fig. 7
figure 7

The calculated PR values on the maximum pressure range a for PE material; b for DI material

A pressure indicator threshold could be identified in Fig. 7, which corresponds to the first maximum pressure range that crosses the unit PR. These thresholds were found 31 and 36 m pressure head for PE and DI materials, respectively, in the study area. As can be expected, the larger value of thresholds is related to DI materials. This can be related to the fact that DI pipe material may provide more resistance against maximum pressure causing pipe breaks. Moreover, the differences between the calculated pressure thresholds can be explained in terms of the concept of failure envelopes, which is a function of pipe materials (Pearson et al. 2005). As mentioned previously, over the period of years, the BRFs may vary because of changing in zone conditions, operating pressure, and combination of influential factors contributing to pipe breaks like pipe deterioration, low temperatures, and traffic loading. Furthermore, the break- conditioned CDFs of MPIs need to be periodically estimated, as each pipe material may respond differently to changes in maximum pressures. For all these reasons, the maximum pressure thresholds should be identified for the BRFs with homogenous pipes (type and material) under similar environmental and operational conditions.

There are some limitations to apply the proposed methodology. First, no attempt was made to derive a general BRF for a whole WDN. On the other hand, the objective of this study is to determine the BRFs for an individual zone of a WDN and for different materials in order to assess the response of each zone to the reduction of maximum pressure. Second, this study is unable to consider pipe breaks due to short term hydraulic transients, as pressure data were collected at 15-min intervals. Third, there was no segregated pipe breaks data based on failure modes, as such data were not available in the database. Therefore, BFnpd was estimated based on all data of pipe breaks, and no attempt was made to remove the pressure-related pipe breaks. Additionally, for the same reason, all pipe breaks data was used to determine the BRFs. This may increase the statistical noise in data, which can be a source of error in determining BRFs and can prevent a robust statistical inference. Forth, the maximum pressure is considered to be a key control parameter in occurrences of pipe breaks. Thus, this parameter coupled with other physical and environmental factors, can increase pipe break rates. Fifth, The BRFs and the pressure thresholds on them may be valid for homogenous pipes (type and material) under similar operational and environmental conditions. Therefore, when these conditions change, the analysis should be repeated.

5 Conclusions

A practical field data-based methodology was presented to determine the BRFs with the objective of exploring the relationship between pressure and pipe breaks in WDNs. The methodology was applied to a large zone of the WDN of Teheran (Iran). The AZP was determined based on a four-step systematic approach, and the MPI values were calculated using the measured pressure heads at the AZP over a specific time window. The BRFs were determined through the analysis of the MPI values, and the filtered pipe breaks data for each pipe material. The results showed that the annual average of the MPI could better represent the failure behavior of water mains. Both the performance indicators (R2 and RMSE) were improved when this indicator was used to determine the BRFs. The estimated N2 exponent for PE and DI materials were found 3.11 and 2.41, confirming that N2 exponent would be dependent to pipe materials. Thus, if sufficient field data is available, the BRFs and also N2 exponent should be estimated separately for each pipe of material. The results also demonstrated that, on average, the % reduction in pipe breaks was 1.5 times the % reduction in maximum pressure in the study area. The determined BRFs can be used to evaluate the associated benefits of PM for the development of economic models.

The pressure threshold on the obtained BRFs were identified based on a Bayes theorem which corresponds to the first maximum pressure ranges that crosses the unit PR. The probability distribution functions of unconditional and break-conditioned of the MPI values were computed to identify the PR values. The first one was used to derive the unconditional CDF of the MPI. The empirical break-conditioned CDF was also estimated by the second probability distribution. It was then fitted to a non-parametric CDF estimated by means of the kernel distribution function, as the number of pipe breaks in not long enough to estimate the probability of the MPI conditioned to breaks precisely. The results of Chi-squared test indicated that the probability distribution of the MPI conditioned to breaks were reproduced quite well by the non-parametric CDF, at a confidence level of 95% for PE and DI pipe materials.

The results of this paper pointed out that the pipe break rates may increase for specific maximum pressure ranges above the maximum pressure thresholds where the PR values are greater than 1. Conversely, when the PR smaller than 1, the reduction of maximum pressure does not necessarily affect on decreasing of pipe break rates, however, the residual life of pipes would be extended. The maximum pressure thresholds on the BRFs were found 31 and 36 m for PE and DI materials. The differences between these thresholds can be explained in terms of the concept of the failure envelope, which is the function of pipe materials. At a practical level, the MPI values should be limited to the obtained thresholds to implement effective PM and reduce pipe break rates as far as possible in the zone. The results of this paper suggest that the BRFs and pressure thresholds on them should be determined for homogenous pipes (type and material) under similar physical and operational conditions. Moreover, they should be updated periodically as the conditions of the zone, and operating pressure can change over the period of years. In the methodology presented here, the usage of commercial packages like HYTRAN for evaluating hydraulic transients causing pipe breaks may be explored. Furthermore, other pressure indicators related to pipe breaks like pressure range, pressure variability, and pressure variation rate may be investigated and tested to improve the results.