1 Introduction

The functioning of water supply infrastructure is crucial for the safety and well-being of communities, but it is threatened by an increasing number of both human actions and natural disasters, which are often unusual, unforeseeable and unavoidable (Meng et al. 2018; Pagano et al. 2018a). Consolidated risk management tools are often of limited use as they are unable to address unpredictable threats. A shift from risk to resilience management is therefore emerging, since a resilient system is capable of coping with unexpected, not-forecasted hazards (for instance, extreme weather events) (Meng et al. 2018).

Resilience can be defined as “the degree to which the system minimizes level of service failure magnitude and duration over its design life when subject to exceptional conditions” (Butler et al. 2016). A comprehensive resilience assessment of a water distribution network (WDN), therefore, requires knowledge of the level of service failure magnitude and duration when faced with threats. Available approaches for assessing WDN resilience (e.g. Shin et al. 2018) can be broadly classified as either ‘property-based’ or ‘performance-based’.

‘Property-based’ approaches investigate the susceptibility of WDNs to failure, focusing on the link between system performance and inherent structural properties such as robustness, diversity, connectivity and redundancy (Yazdani and Jeffrey 2012; Butler et al. 2016). One such approach considers the WDN as a set of multiple interconnected and interacting nodes (e.g. demand points, tanks and reservoirs) and edges (e.g. pipes, pumps and valves) and uses Graph Theory (GT) to explicitly analyze key properties, thus providing an intuitively robust and quantitative evaluation (Yazdani and Jeffrey 2012). The use of GT-based metrics found wide and early acceptance in WDN research applications (e.g. Jacobs and Goulter 1989; Walski 1993). Many researchers employed such methodologies for reliability analysis (e.g. Ostfeld 2005; Yazdani and Jeffrey 2012) and to investigate failure conditions due to several phenomena (e.g. random failures, deterioration, catastrophic events, targeted attacks). GT may also provide simplified information on system resilience by enabling identification of structural vulnerabilities and points of failure (Yazdani et al. 2013) and analyzing the disruption caused by the failure of individual components (Yazdani and Jeffrey 2012; Meng et al. 2018). Both ‘network-level’ and ‘local’ GT metrics are used for such purposes (Yazdani et al. 2011; Yazdani and Jeffrey 2012; Pagano et al. 2018a). The former employs simple graph metrics to analyze global network features. The latter is based on the removal of components (either random or targeted) to assess different failure scenarios (Yazdani et al. 2013). Although some studies have identified a direct correlation between GT metrics and network performance (e.g. Meng et al. 2018), an explicit focus on the role and potentiality of GT representations of pipe networks for WDN hydraulic performance analysis is still lacking (Torres et al. 2016). In fact, whilst specific properties may provide resilient performance, this cannot be guaranteed (Butler et al. 2016). Particularly, the increasing level of complexity and interconnection in water systems is a challenge since any change in the network characteristics has consequences on hydraulic function (Yazdani et al. 2013).

‘Performance-based’ approaches require modelling of performance (i.e. the ability of a network to maintain supply under failure conditions) under multiple system failure scenarios, using hydraulic models. Both single component failure analysis and global resilience analysis (GRA) can be used (Diao et al. 2016). GRA focuses on level of service provision under any possible magnitude of a given system failure mode, irrespective of the threat that may cause this failure (Diao et al. 2016). For example, in a WDN, the effects of any pipe failure magnitude (e.g. number of pipes failed at the same time) on supply could be captured using GRA. This method overcomes the challenge faced in conventional top-down approaches of identifying all the possible threats (e.g. the causes of pipe failure), and instead focuses on the system failure modes as these are easier to identify and characterize (Butler et al. 2016). GRA results in the generation of response curves (system performance in terms of both supply failure magnitude and duration as a function of system failure magnitude), the area under which provides an indication of how resilient the level of service provision is to a given system failure mode. A reduction in the area under the response curve, therefore, represents an increase in resilience. This is a highly flexible approach and has been applied previously to water distribution systems in several case studies (e.g. Diao et al. 2016). These studies have demonstrated that, in addition to providing a performance-based measure of resilience, GRA can be used to identify scenarios that result in the greatest loss of service, therefore acting as a diagnostic framework and aiding the development of interventions (Diao et al. 2016).

Table 1 provides a comparative summary of the main characteristics of GRA and GT.

Table 1 Comparison of the key features of GRA and GT

The relationship between property- and performance-based measures of resilience has received limited attention to date. No comprehensive studies relate resilience with topological attributes of WDNs, and the appropriateness of topological metrics for resilience assessment is unknown (Meng et al. 2018). These knowledge gaps need to be addressed in order to define effective frameworks for assessing and enhancing resilience in practice.

This research aims to provide a greater understanding of the extent to which an assessment framework based only on the characterization of topological and connectivity properties may be a surrogate for more detailed simulation-based models. Three different WDNs (detailed in Section 2) are analyzed for this purpose: L’Aquila (a real network), D-Town (a benchmark) and EXNET (a benchmark). Based on the available literature, a set of network-level GT-based measures is first used to perform a preliminary classification of the structure of each network (Section 3.1); second, a specific local GT-measure is introduced to evaluate and rank the impact of single pipe failures (Section 3.2). Levels of service failure magnitude and duration resulting from any single pipe failure are then obtained as part of a GRA (Section 3.3) and the pipe rankings compared with those derived from the developed GT-based measure (Section 4). Comparison of the results obtained for three highly different WDNs supports the understanding of specific potentialities and limitations of the use of both approaches, and also provides suggestions for future research (Section 5).

2 Case Study Networks

In order to provide a detailed comparative analysis of the performances of GRA and GT in different conditions, three networks (illustrated in Fig. 1) are analyzed: L’Aquila (a real network), EXNET and D-Town (benchmark networks). Basic hydraulic information, both under ordinary conditions and under ‘failure’ was derived from EPANET models. Key characteristics of the case study networks are provided below.

Fig. 1
figure 1

Case study networks

2.1 L’Aquila

L’Aquila city (central Italy) was struck by a severe earthquake (6.3 magnitude) on April 6th 2009 and the WDN was significantly impacted. Effective resilience assessment methodologies would be of great support to the WDN reconstruction process, since the previous network showed serious limitations in adaptive capacity (e.g. Pagano et al. 2017; Pagano et al. 2018b). This analysis focuses on the new WDN underlying the historical city center, presented in Fig. 1a, which consists of 539 junctions, 808 pipes and a single tank (798 m above sea level, 2000 m3 capacity). As the reconstruction process is still ongoing, significant uncertainties exist over hydraulic operation, and investigation into the potential use of property-based analyses is highly relevant.

2.2 D-Town

D-Town (Fig. 1b) is a benchmark WDN consisting of five district metered areas. In total, it contains 399 junctions, 7 storage tanks, 443 pipes, 11 pumps, 5 valves and a single reservoir. D-Town is highly relevant to this study since it is characterized by complex hydraulic operation, despite the limited number of elements.

2.3 EXNET

EXNET (Fig. 1c) has been set up by the University of Exeter as a realistic and challenging problem. The network consists of relatively small pipes and few transmission mains, with a large head-loss range at the extremities of the system, making it highly sensitive to demand increases. EXNET contains 1893 junctions (5 of which receive water from adjacent systems), 2462 pipes, 8 valves and 2 reservoirs. This benchmark is of particular interest since it is characterized by complex hydraulic operation and by a high number of elements.

3 Methodology

3.1 Network-Level GT Metrics

A graph G = G(n, m) consists of n nodes and m edges. A WDN can be specifically modelled as a graph with nodes/vertices connected by links/edges, and a set of data attributed to them (e.g. nodal demand, edge capacity, flow direction, energy losses) (Meng et al. 2018). The key characteristic of a WDN is that every node should be connected, by at least one path, to one or more source node(s) (e.g. a tank). The structure of a graph could be expressed, mathematically, as an adjacency matrix A, i.e. a 0–1 matrix representing the pairwise relations between nodes (Aij = Aji = 1 if there is a link connecting node i and node j). The adjacency matrix is the basis for the calculation of topological metrics.

Table 2 identifies a set of network-level GT metrics that are widely used for WDN analysis. They are mathematical attributes related to the main topological properties of networks, which can be related to system resilience (Meng et al. 2018). Using a multi-metric approach, based on multiple attributes, helps identify and compare relevant network properties (Hwang and Lansey 2017).

Table 2 Set of network-level GT metrics used

Physical and operational attributes of nodes and edges can be used to compute network-level GT metrics in a weighted and directed form (Yazdani and Jeffrey 2012; Porse and Lund 2016). For the purposes of this study, network-level metrics are all computed as undirected and unweighted, since WDN operating conditions (e.g. flow direction) may change under failure conditions.

3.2 Local GT Measures for Pipe Ranking

The proposed local GT-based analysis framework aims to identify and rank the most crucial elements for system operation in case of failures, relying on topological features only. Specifically, the methodology focuses on potential changes in connection between demand nodes and supply sources caused by single-pipe failures. Since multiple connections often exist between a source and a node, identifying all routes would be computationally expensive. However, limiting the analysis to the ‘shortest path’ is an appropriate assumption (e.g. Yazdani et al. 2013).

Following Herrera et al. (2016), a surrogate measure of the energy losses is a hydraulically relevant – and easy to quantify – measure of how well a node is connected to the available source(s). Using purely topological characteristics, the energy losses on the edges are proportional to f ∙ L/D, where f is the friction factor [−] and L and D are the length [m] and diameter [m] of the edge respectively. The shortest path is thus the one with the lowest value of total energy loss.

The first step of the edge ranking procedure requires identification of the source nodes (S), and computation of the shortest path (SP) from each source to all other nodes of the network (s) under ordinary conditions (SPs, i, 0). The Dijkstra shortest-paths algorithm is used for this purpose (Dijkstra 1959). Each SP is characterized through a sequence of K-1 nodes and K edges and weighted according to the total energy loss (\( weight={\sum}_{k=1}^Kf(k)\bullet {L}_k/{D}_k \)). The second step of the analysis consists of the iterative removal of every edge (j) and subsequent re-computation of all the weighted shortest paths (SPs, i, j). Comparison between SPs, i, 0 and SPs, i, j then allows the change in connectivity between nodes and sources as a consequence of the failure of the edge j, to be assessed. Potential scenarios are: 1) the shortest paths do not change; 2) the SPs, i, j returns infinity, meaning that the demand nodes on that path become disconnected from the source s, and that the edge j is a bridge; 3) the shortest paths between the source s and one or more nodes increase. These three cases are represented graphically in Fig. 2.

Fig. 2
figure 2

Graphical representation of the potential impact of edge removal on WDN connectivity: case 1) no changes occurred in the SP between the source s1 and the node 1 after the removal of edge 1; case 2) the removal of the edge 2 results in the disconnection between the source s1 and the node 2; case 3) the removal of the edge 3 results in the increase of the shortest path between the source s1 and the node 3

In case (1), the role of edge j in the global operation of the WDN can be considered negligible. In case (2), the total nodal demand that becomes disconnected from all sources once edge j is removed (DDj [l/s]) is computed according to Eq. 10:

$$ D{D}_j=\sum \limits_{s=1}^S\sum \limits_{k=1}^{K-1}{Q}_k $$
(10)

Edges with a positive DDj value are thus ranked accordingly. Particularly in simple networks with a single source, this analysis may identify specific parts of the network completely cut off from the water supply.

In case (3), the impact of edge failure is estimated by computing the shortest path change (SPCs, i, j, Eq. 11) between all n nodes and S sources, and the cumulate value (Eq. 12):

$$ { SP C}_{s,i,j}=\left({SP}_{s,i,j}-{SP}_{s,i,0}\right) $$
(11)
$$ SP{C}_j={\sum}_{s=1}^S{\sum}_{i=1}^n SP{C}_{s,i,j} $$
(12)

Edges with a positive value of SPCj can be thus ranked accordingly. Two subsets of edges may be therefore identified, and particular attention should be given to those having the highest values of either DDj or SPCj.

3.3 Global Resilience Analysis

A detailed description of the GRA methodology is provided by Diao et al. (2016). Key steps are as follows:

  1. 1.

    Identify the system failure mode(s) for analysis and an appropriate measure of magnitude.

  2. 2.

    Identify the required level(s) of service and appropriate measure(s).

  3. 3.

    Calculate the required level of service measure(s) under every system failure magnitude. Where multiple scenarios are possible for each system failure magnitude (e.g. for failure of 1% of pipes in the system), sampling and targeted scenario development are used, as detailed by Diao et al. (2016).

  4. 4.

    Plot each level of service measure as a function of system failure magnitude

Further information on the system failure mode, level of services measures and network simulations in this study are given in the following sections.

3.3.1 System Failure Mode

Multiple system failure modes exist; to enable comparison of GRA and GT, this study considers pipe failure. The percentage of pipes in the network failed represents the system failure magnitude and values in the range 0–100% are evaluated (note that ‘magnitude’ is used here to address the quantity of pipes failed, not the frequency of pipe failure). Pipe failures are modelled in EPANET by setting the corresponding pipe statuses to ‘closed’. They are applied at 10 A.M. so as to capture the effects of peak demand, and maintained to the end of a 24 h simulation.

Random pipe failure samples are generated at every failure magnitude and, additionally, pipe failure combinations resulting in the minimum and maximum response at each pipe failure magnitude are carried forward for targeted failure scenario development, as described by Diao et al. (2016). This approach is found to provide a good estimation of the minimum, mean and maximum response curves whilst maintaining a manageable computation time.

3.3.2 Level of Service

Chosen measures of level of service failure are: a) Supply failure duration, and b) Supply failure magnitude. Given that EPANET is demand driven and supply is not directly calculated, supply at each time step and node is estimated using Eq. 13:

$$ {\displaystyle \begin{array}{c} if\;{P}_{j,i}\le 0:{S}_{j,i}=0\\ {} if\kern0.37em 0<{P}_{j,i}<{P}_{lim}:{S}_{j,i}={D}_{j,i}\cdot \sqrt{P_{j,i}/{P}_{lim}}\\ {} if\;{P}_{j,i}\ge {P}_{lim}:{S}_{j,i}={D}_{j,i}\end{array}} $$
(13)

Where: Pj,i = pressure at node j at time I [m]; Sj,i = supply at node j at time I [l/s]; Plim = required minimum pressure, set to 15 m [m]; Dj,i = demand at node j at time I [l/s].

Supply failure duration is calculated using Eq. 14; this gives a (unitless) value normalized with respect to the system (pipe) failure duration.

$$ Supply\ failure\ duration=\frac{\sum \limits_0^{i=T}\left({F}_i\bullet {t}_i\right)}{F_P} $$
(14)

Where: Fi = System supply failure state at time step i (0 if Si = Di, 1 if Si < Di) [−]; FP = total pipe failure duration [hr].

Supply failure magnitude is calculated using Eq. 15, which gives the fraction of network demand not supplied during the pressure failure period.

$$ Supply\ failure\ magnitude=\frac{\sum \limits_0^{i=T}\left(\sum \limits_0^{j=n}\left({D}_{j,i}-{S}_{j,i}\right)\right)}{\sum \limits_0^{i=T}\left(\sum \limits_0^{j=n}{D}_{j,i}\right)} $$
(15)

Where: n = number of nodes; T = number of time steps; ti = Duration of time step i.

4 Results

4.1 WDN Characterization Based on Network-Level GT Metrics

The values of metrics described in the Table 2 are summarized in Table 3 and their significance with respect to network performance is discussed in the present section. As stated previously, these network-level metrics are computed in unweighted and undirected form.

Table 3 Network-level GT metrics for the case study WDNs

The average node degree (k) and link density (q) are key structural measures, representative of network connectivity. Higher k and q values suggest higher network connection, and thus a better resistance to failures (e.g. Zeng et al. 2017).

Higher values of the central point dominance (Cb) suggest that D-Town and L’Aquila are more centralized networks than EXNET, i.e. they tend to a ‘star’ topology with a significant concentration around central locations. The values of the clustering coefficient Cc suggest that L’Aquila and EXNET are more tightly connected and have better performance in terms of network efficiency and redundancy. The meshedness coefficient Rm confirms the higher redundancy of L’Aquila network. The density of bridges Dbr denotes the different presence of elements whose removal may isolate parts of the network.

Among the investigated networks, D-Town is the smallest (considering n and m), but has the highest diameter D(G), which suggests a higher spread. This result is also confirmed by the high value of the lT, which provides a view of network reachability and efficiency in water transport: shorter paths indicate more efficient networks, and systems with shorter water travel time.

The critical breakdown ratio fc indicates topology robustness. Larger values, as for D-Town, might indicate higher resistance to random failures of components and lower vulnerability.

Referring to the spectral properties, higher values of the normalized spectral gap Δλ (as for L’Aquila) indicate a better optimized connectivity layout and a better robustness. The algebraic connectivity λ2 of L’Aquila is also significantly higher, suggesting enhanced fault tolerance and robustness against efforts to bisect the network, and thus to isolate its parts (Zeng et al. 2017). D-Town has the lowest values for both parameters.

4.2 Global Resilience Analysis

The supply failure magnitude and duration response to pipe failure magnitudes of up to 100% in the case study networks are shown in Fig. 3. Figure 3a shows the maximum (solid line), the mean (dashed line) and the minimum (dotted line) pressure failure duration in L’Aquila. The mean supply failure duration increases rapidly as the number of pipes failed increases. When considering the system as a whole, failure of 18.1% of pipes (equivalent to 146 pipes) will typically result in supply failure during the entire pipe failure period. Supply failure duration responses are only plotted for L’Aquila, as pressures below 15 m (and hence supply failures) are present in D-Town and EXNET at all times, irrespective of the number of pipe failures. Note that, due to the large number of pipes in EXNET and the high computational demand of GRA, it is not feasible to evaluate all pipe failure magnitudes for this system. Figures 3b-d, therefore, show the response to up to 300 simultaneous pipe failures (12.2%) in this system. This number of pipe failures results in mean and maximum supply failure magnitudes of 94% and 100% respectively, and thus consideration of higher pipe failure magnitudes would yield little further information of interest. Based on the multiple random and targeted pipe failure scenarios evaluated at each pipe failure magnitude, three sets of curves are shown: the minimum (Fig. 3b), the mean (Fig. 3c) and the maximum (Fig. 3d) response.

Fig. 3
figure 3

Service failure duration and magnitude response to pipe failure

Analysis of the minimum response curves in Fig. 3a and b show that the L’Aquila network is capable of maintaining full supply with up to 41.1% pipe failure (331 pipes). However, the mean supply failure magnitude for this number of pipe failures is 97.5%. On average, at least 99% of global network demand will be met with up to 0.7% of pipes failed (i.e. fewer than 7 pipe failures). On average, 7 pipe failures result in a 50.6% supply failure in D-Town and a 73% supply failure in EXNET, as these networks have a significant volume of demand affected by unsatisfactory pressure even when no pipe failures are present. The minimum response curve for EXNET, however, does show an initial drop under small pipe failure magnitudes, indicating that there are one or more pipes which, if closed, actually reduce the presence of unsatisfactory pressure in the network.

The maximum supply failure magnitude response curves show that complete loss of supply may occur in L’Aquila with failure of just 4 pipes (0.5% of pipes); however, this can occur in D-Town and EXNET with a single pipe failure. A summary of the mean and maximum supply failure magnitude responses to up to 4 simultaneous pipe failures in the three case study WDNs is given in Table 4. These results suggest that L’Aquila is the most resilient of the three case studies (with respect to pipe failure). To aid identification of critical components and reveal potential focus areas for further improvement, specific pipe failures which result in the maximum supply losses are identified.

Table 4 Summary of supply failure magnitudes (percentage of network demand during pressure failure period not supplied) resulting from up to four simultaneous pipe failures and identification of pipes resulting in maximum supply failure magnitude

4.3 Comparison between GRA and Local GT-Based Measures Based on Pipe Rankings

This section compares the supply failure magnitude response to single pipe failure scenarios, derived as part of the GRA process, with rankings obtained from the proposed local GT-based measure. The aim is to provide an understanding of the extent to which WDN resilience to pipe failure can be estimated based on topological and connectivity characteristics only, given the assumption that EPANET provides an accurate measure of hydraulic performance under single pipe failure conditions in the GRA. In order to limit the analysis to the most relevant elements, only the top-ranked pipes are taken into account.

L’Aquila represents the simplest network in terms of hydraulic operation and identification of bridges is crucial for this kind of network, since their failure could cause the disconnection of wide areas from the water source. The results of single pipe failure scenarios in the GRA suggest that, individually, only 28 (out of 808) pipes have an impact on network performance if they fail individually. The GT analysis suggests that all these pipes are bridges for the WDN. Table S1 in the Supplemental Data lists the highest ranked pipes, based on DDj value and supply failure magnitude resulting from their individual failure (derived from GRA).

The results summarized in Table S1 show that the top 28 pipes as identified by GT are all ranked in the top 28 in the performance-based analysis as well. Only one minor difference is present in the ranking, which is a remarkably good result. The performance assessment in case of failure is instead conditioned by the hydraulic regime, which is explicitly included in GRA. The relevance of topology and connectivity to the performance of L’Aquila WDN is particularly high due to the simple structure of the network (i.e. it is supplied by a single source and characterized by a regular, grid-like structure). Therefore, for this network, the GT-based analysis is highly representative of the actual network operation.

Both D-Town and EXNET have a complex structure characterized by multiple sources which affect hydraulic operation. EXNET, in particular, is a highly complex WDN and provides a demanding test for the proposed GT-based approach. In both cases, the consistency of results obtained from the two methodologies was assessed by focusing on pipes ranked in the top 10% according to GRA, and checking how many edges were also identified in the top 10% when ranked by the local GT-based measures.

Focusing on D-Town, the methodology provides remarkably good results, as shown in the Table S2 of the Supplementary Material. 95% of the pipes ranked in the top 10% according to GRA (38 out of 40), for example, are also in the top 10% according to the GT-based measures. More specifically, the top 10 ranked pipes according to the values of both DDj and SPCj (highlighted in grey in the Table S2) fall within the top 40 as identified by GRA.

In EXNET, agreement between the methodologies reduces to 62%, since only 151 out of the top 245 high-ranked pipes according to GRA are in the top 10% of pipes according to the GT-based rank. Going further into details, only 50% of the top 10 ranked pipes according to the local GT-based measures (in grey in the Table S3) fall within the top 10% of pipes according to GRA ranking. Full results for all WDNs, are provided in the Supplementary Information.

In order to understand the rationale behind such discrepancies, two EXNET pipe failure scenarios with the greatest difference between their rankings were identified and their hydraulic behavior analyzed. These pipes are physically close to each other and connected at one node: Pipe 3048 (D = 1073 mm) and Pipe 3474 (D = 900 mm). The location of these pipes is shown in Fig. 4. Both pipes are high-ranked according to the GT approach (ranks 12 and 15 respectively based on SPC), whereas only one (pipe 3474) is high-ranked according to GRA and the other (pipe 3048) is among the lowest ranked (ranks 3 and 2462 respectively). The hydraulic operation of the system was investigated considering the impact of single pipe failure on system operation, as shown in Fig. 4, which includes: a) identification of the nodes with unsatisfactory pressure due to the failure of pipe 3474; b) identification of the nodes with unsatisfactory pressure due to the failure of pipe 3048; c) the flow rate in pipes connected to 3474 after its failure; d) the flow rate in pipes connected to 3048 after its failure.

Fig. 4
figure 4

Comparative analysis of the impact of single pipe failure of edges 3474 and 3048

Based on the pressures and flow rates shown in Fig. 4, it can be seen that the impact of pipe 3474 or 3048 failing individually is highly different, mainly due to the role of pipe 3367: analysis of ordinary operation and failure conditions suggests that when pipe 3048 fails, pipe 3367 is subjected to a change in the flow direction which supports the operation of pipe 3474. This means that the impact of pipe failure can be partially absorbed by the system, which is resilient enough to adapt to a change in hydraulic conditions. When pipe 3474 fails, pipe 3367 does not support system adaptation, and this results in a wider area of the WDN with pressure below an acceptable value.

5 Discussion and Conclusions

Both performance- and property- based approaches are used for investigating the behavior of WDNs and supporting resilience assessment, but no comprehensive comparative analysis has previously been performed. In particular, additional efforts are needed in order to support a deeper understanding of their limits and potential, thus facilitating the selection of the most suitable one, considering both the purpose of the analysis to be performed and the WDN characteristics (Shin et al. 2018). This paper presents a critical comparison between two different methodologies belonging to the aforementioned categories, i.e. GRA and GT-based metrics. GRA can be used as comprehensive diagnostic framework linking system attributes (e.g. connectivity and capacity) to performance (e.g. level of service), and can be adopted to illustrate the complex dynamic responses of systems to various failure modes. GT-based approaches are highly relevant with a twofold perspective: (1) to propose a ‘network-level’ classification of different WDNs and provide a better understanding of the influence of key properties (e.g. connectivity, robustness, redundancy) on system resilience, with a relatively fast and inexpensive computation; (2) with the implementation of specific ‘local’ measures, to determine a pipe ranking defining the impact of single pipe failure on system connectivity.

The comparison of results based on the local GT-based measures and GRA for three highly different WDNs enables conclusions to be drawn regarding the potential and applicability of these methodologies for resilience assessment also in other networks. Firstly, network-level topological and connectivity aspects are certainly useful to characterize a WDN, since the interconnectedness of the system is relevant for its operation both in ordinary conditions and under failure. Particularly, the selection of a set of network-level GT-based metrics could be highly useful in order to describe network-level system structure and characteristics. Network-level topological properties can be useful as surrogate measures of global system resilience. Secondly, a deeper understanding and modeling of WDN response to stress requires the development and computation of local GT metrics, which explicitly account for the connectivity of the system with water sources, along with the role that single pipes might have on system operation. Indeed, a comprehensive metric of the impact of pipe failure in terms of network connectivity should take into account both the possibility of isolation for specific parts of the network, and the increase of the shortest paths between source(s) and demand nodes, which may cause a substantial reduction of pressure. Nevertheless, the appropriateness and effectiveness of such methods may vary significantly with network complexity and according to the specific operating conditions, and additional research is needed in this direction. The analyses summarized in the present paper suggest that the effectiveness and reliability of GT-based metrics is significantly higher for WDNs with a basic structure (e.g. single source, regular structure, limited size) and simple operating conditions. In such cases (e.g. L’Aquila), GT could support the effective preliminary identification of the most critical pipes, thus helping to avoid the computational effort associated with other methods. For more complex networks (e.g. EXNET), the topology of the network is only partially representative of system operation, since the hydraulic conditions may significantly change as a consequence of pipe failure and result in local effects which are hard to predict without hydraulic modelling. Additional efforts are needed to support a more effective and reliable implementation of property-based approaches in case of failure (Hwang and Lansey 2017; Shin et al. 2018). As the GRA implementation suggests, evaluation of hydraulic system performance is essential for comprehensive resilience analysis of complex WDNs and can provide additional information with respect to the effects of multiple pipe failures. Future research activities should be also oriented towards a comprehensive comparison of multiple different resilience assessment measures in a wider set of WDNs.