Keywords

1 Introduction

It is necessary to track resiliency metrics to be able to determine that, in the operation of the power system under low-probability high-impact events, which goals have been achieved and which one not been achieved. Resiliency metrics are used at different levels for different intentions. Some of the purposes are relevant to the national or regional macro policies and some others to a local or tools aspect. As an example, what is the effect of the resiliency on the economic damages caused by natural disasters at the national or regional level? For a power plant operator, it can be essential to know how many and what types of spare parts are available. Considering each purpose of the system needs a unique set of metrics. Because one set of metrics does not support all the goals of the system. Then this chapter first reviews the existing metrics for measuring resiliency of electrical systems, then a strategy will be developed that can be used to determine the appropriate set of resiliency metrics according to the goals pursued from the system. In the event that national or regional macro intentions are considered, greater focus will be on strategic aspects of the metrics set. In this case, matters like budget, availability of equipment, number of generators and operators, speed and accuracy of response teams, schedules, existing technologies such as smart grids , etc. will be considered. But if local aims are taken into account, the operational aspects of the electrical system are used to define the resiliency. In this case, matters like timely detection of the outages, fast recovery after disasters, convenient repairs, system efficiency, reliability indices , system hardening, improving social welfare, etc. are considered.

The necessity of quantifying resilience metrics is an important challenge, which mostly depends on how to define the resiliency.

Resiliency may sometime be considered as the time for recovery of power system after a disaster. In a complete definition, in addition to the time required for recovery, the capability of the system to withstand malicious events, system adaptability, and desirable extensibility can be considered as principal resiliency characteristics. Resiliency can be calculated mathematically as the area under system’s performance curve.

Resilience metrics must have some basic features. These features are essential for the development of a comprehensive metric. In other words, a general metric must [1]:

  • Be useful. A comprehensive metric must be helpful for decision making incorporate system planning, real-time actions, and policy determinations.

  • Provide a comparable structure. Calculation of this metric for different systems should provide comparable information.

  • It must be usable in operational and planning contexts. Operational contexts such as pre-configuration the system before a disaster and planning contexts such as implanting of electric conductors.

  • Be comprehensive and extensible. The appropriate index should be extensible over time and must be calculated with the advancement of technology and equipment in complex computational methods.

  • Be quantitative. The appropriate metric should be quantitatively quantifiable.

  • Consider uncertainties. It is very important that the resilience metric should reflect the system’s uncertainties.

  • Consider the recovery/restoration time of the system after a disaster. An appropriate metric of resiliency should somehow take into account the duration of outages.

2 Resiliency Metrics, Different Definitions

The resilience metric may have terms of a threat or a set of threats. In fact, this criterion answers the question of “resilient to what?”. Usually, resiliency is considered against natural disasters such as earthquakes, storms, floods, etc., but these studies can be generalized to sudden human-caused events such as accident and war. It can be observed that the natural disasters tend to follow cycles. The time interval between the onset of an event up to the occurrence of another can be classified into four phases [2]:

  • Phase 1 (During the event): The length of this phase (Δt1) can be a few minutes to a few days. In this situation, the main purpose is to reduce the damages and loss of services.

  • Phase 2 (Immediate aftermath): This stage takes a few days to several weeks. The main goal of this period is to start recovery and repair actions. This phase lasts Δt2 and ends when these activities are almost completed.

  • Phase 3 (Intermediate aftermath): This phase usually lasts from a few weeks to several months and sometimes interference with phase three. In this phase, the main objective is to investigate the disaster’s effect on a specific part of the power system by calculating the system efficiency indices and assessing the extent of the damage.

  • Phase 4 (Long-term aftermath): This stage may take a few months to several years. In this phase, the main goal is to prepare for the occurrence of the next disaster using the results obtained in phase three. These preparations include corrective actions, modification of the operational strategies, and the strengthening of infrastructure . this phase ends with the onset of the next event.

Figure 4.1 shows the different phases of the time horizon immediate following a disaster occurrence until the next disaster.

Fig. 4.1
figure 1

Representation of different phases of the extreme event [2]

Although the resilience measures are used to evaluate the consequence of a disaster, it must also be used to assess the ability of power grid in cover its objectives. This means that the performance of the system affects the resiliency measures directly. For example, the area under S(t) in Fig. 4.1, which is a measure of the loads supplied by the power grid during and after the disaster, is a performance-based metric for resiliency [2]. Equation (4.1) describes this metric mathematically.

$$ R_{1} = \int\limits_{t} {S(t)dt} $$
(4.1)

Another measure based on the quality of the power network service described by (4.2), which is the number of events that, as a result of their occurrence, the network voltage falls outside of the standard range.

$$ R_{2} = \sum n $$
(4.2)

where n is an event which caused the voltage level of the power grid to violate the standard ranges.

According to U.S. Presidential Policy Directive 21 [3] (PPD-21), the resiliency is defined as: “the ability to prepare for and adapt to changing conditions and withstand and recover rapidly from disruptions.” In this definition, resilience includes “the ability to withstand and recover from deliberate attacks , accidents, or naturally occurring threats or incidents.” Therefore, based on the four main characteristics stated in the definition of the resilience, i.e. withstanding capability, recovery speed, planning capacity, and adaption capability [4], a quantitative metric for the resiliency of a single load in a period of time (T = tup+ tdown) can be mathematically modeled by (4.3).

$$ r_{l} = \frac{{t_{up} }}{{(t_{up} + t_{down} )}} $$
(4.3)

In (4.3), downtime (tdown), which is related to the hardware aspects of the power system and human-related processes, shows the system’s recovery speed. The ability of the power grid to withstand the disaster is related directly to its hardware and equipment characteristics, which tup shows this index. It should be noted that several references have proposed similar relationships to measure resiliency in other systems, such as communication sites [5], supply networks [6], and urban infrastructure systems [7]. According to [2], it is possible to define the resiliency of the power generation resources for N loads as:

$$ R_{L} = \frac{{\sum\nolimits_{i = 1}^{N} {t_{up,i} } }}{{\sum\nolimits_{i = 1}^{N} {(t_{up,i} + t_{down,i} )} }} $$
(4.4)

Equations (4.3) and (4.4) are similar to the equation of availability in reliability theory, but an infinite number of repair and failure sequences are used for calculating of the availability measurement where the measures of the resiliency of (4.3) and (4.4) can be based on a single sequence in duration T.

Suppose that n0 number of the total customers (N) in a given region under study at the time interval T experienced an outage. In this case, the outage index is calculated by (4.5) [2].

$$ \theta = \frac{{n_{0} }}{N} $$
(4.5)

This is the equivalent to the SAIFI in IEEE Standard 1366 that is widely used to assess the outages of the power systems.

In Ref. [2], the recovery speed (vr) for the N number of customers is defined as (4.6).

$$ v_{r} (t) = \frac{d\theta }{{dt_{r} }},\quad t_{r} = t - t_{{\left| {\theta = \theta_{\hbox{max} } } \right.}} $$
(4.6)

For one customer N = 1 and thus n0 is 1 or 0. Assume the customer has experienced an outage (n0 = 1). In this case, since all the customers experienced the outage, dtr can be taken equal to Tdown, as a result:

$$ v_{r,i} (t) = \frac{1}{{t_{down} }} $$
(4.7)

In a similar manner disruption speed for a group of customers and a single one can be calculated as (4.8) and (4.9) [2], respectively.

$$ v_{d} (t) = \frac{d\theta }{dt},\quad {\text{for}}\;\;t < t_{{\left| {\theta = \theta_{\hbox{max} } } \right.}} $$
(4.8)
$$ v_{d,i} (t) = \frac{1}{{t_{up} }} $$
(4.9)

It is clear that the (4.7) and (4.9) are analogous to the concepts of repair rate (µ) and failure rate (λ) in reliability theory, respectively.

The power network’s ability to withstand a destructive event can be measured by a metric called resistance, which is defined for a single customer by using (4.10) [2].

$$ \varphi_{I} = \frac{{t_{1,u} }}{{\Delta t_{1} }}\sigma $$
(4.10)

where t1,u is Δt1 in Fig. 4.1, which the customer still receives power before the outage. σ can be specified as a function which represents the severity of the destruction of the extreme event and can be defined for different types of hazards such storm, flood, earthquakes, etc. [7]. It has to be noted that σ > 0. In order to better understand the time intervals, Fig. 4.2 shows the details of the time periods of the first two phases of a disaster (phases 1 and 2 shown in Fig. 4.1).

Fig. 4.2
figure 2

Details of the time periods of the first two phases of a disaster

Also, the resistance of φ for N loads is defined by (4.11) [2].

$$ \varphi = \frac{{\sum\nolimits_{i = 1}^{N} {t_{1,u,i} } }}{{\theta_{\hbox{max} } N\Delta t_{1} }}\sigma $$
(4.11)

In this state, in addition to the importance of the time which loads can receive power under the disaster conditions (t1,u in Fig. 4.2), the maximum amount of lost power of the customers experienced the outage is also very essential.

Brittleness is the amount of damage which the power system receives from a disruptive event and is calculated for N loads using (4.12).

$$ B = \frac{{\theta_{\hbox{max} } }}{D} \times 100 $$
(4.12)

where D is highly related to the characteristics of the infrastructures .

The dependency of one infrastructure to the other ones is defined as [8] “a linkage or connection between two infrastructures, through which the state of one infrastructure influences or is correlated to the state of the other”. In accordance with Refs. [2, 8], it is possible to quantitatively measure the dependence of the loads to the power grid by resilience-oriented adjusting the amount of energy storage resources. The level of dependency of a load from the power grid may be calculated based on rl of (4.3) as [2, 8]:

$$ R_{L} = 1 - (1 - r_{l} )e^{{ - \mu T_{s} }} $$
(4.13)

According to (4.7), µ is equal to the inverse of tdown.

Reference [2] represents the intrinsic relation between dependence with the concept of resilience and how energy storages may or may not lead to a loss of power for customers during an outage as follows:

$$ \frac{1}{\mu }\frac{{dR_{L} }}{{dT_{s} }} = (1 - r_{l} )\text{e}^{{ - \mu T_{s} }} $$
(4.14)

Hence,

$$ R_{L} = 1 - \frac{1}{\mu }\frac{{dR_{L} }}{{dT_{s} }} $$
(4.15)

As a result,

$$ \frac{1}{\mu }\frac{{dR_{L} }}{{dT_{s} }} = - R_{L} + 1 $$
(4.16)

where tdown= 1/µ, shows how much restrictions there is locally for a shift in local resilience by attaching new energy storage systems near the load or it is indicating in order to obtain the same local system resiliency, how much more or less energy storage devices have to be available.

Reference [9] defined a multiple-component resiliency metric for power distribution system based on the network topology as:

$$ \Re_{\tau } = \sum\limits_{j = 1}^{\eta } {V_{j} \lambda^{\prime}(i,j)} $$
(4.17)

where η is the number of metric components. V is equal to:

$$ V = [\begin{array}{*{20}c} {A_{fc} } & {B_{D} } & {C_{{C_{B} }} } & {D_{lG} } & {E_{{C_{n} }} } & {F_{{\Lambda_{2} }} } \\ \end{array} ]^{T} $$
(4.18)

In (4.18) A, B, C, … , F are the obtained weights to indicate the importance of its corresponding measure. λ(i,j) is an element of \( \vec{\Re }_{\tau } {\vec{\Re }_{\tau }^{T}} \) and given by:

$$ \lambda^{\prime}(i,j) = \frac{{\lambda (i,j) - \min_{i = 1}^{\eta } (\lambda (i,j))}}{{\max_{i = 1}^{\eta } (\lambda (i,j)) - \min_{i = 1}^{\eta } (\lambda (i,j))}} $$
(4.19)
$$ \vec{\Re }_{\tau } \vec{\Re }_{\tau }^{T} = \begin{array}{*{20}c} {}\\ {f_{c} } \\ D \\ {C_{B} } \\ {l_{G} } \\ {C_{n} } \\ {\Lambda_{2} } \\ \end{array} \begin{array}{*{20}c} \begin{aligned} \begin{array}{*{20}c} \quad{f_{c} } & \; D & \quad{C_{B} } & \;\;{l_{G} } &\;\; {C_{n} } & {\Lambda_{2} } \\ \end{array} \hfill \\ \left( {\begin{array}{*{20}c} 1 & a & b & c & d & e \\ {1/a} & 1 & f & g & h & i \\ {1/b} & {1/f} & 1 & j & k & l \\ {1/c} & {1/g} & {1/j} & 1 & m & n \\ {1/d} & {1/h} & {1/k} & {1/m} & 1 & o \\ {1/e} & {1/i} & {1/l} & {1/n} & {1/o} & 1 \\ \end{array} } \right) \hfill \\ \end{aligned} \\ \end{array} $$
(4.20)

where a, b, c, … , o are weight coefficients in the interval (0, 1] [4].

$$ \vec{\Re }_{\tau } = [\begin{array}{*{20}c} {f_{c} } & {D_{G} } & {l_{G} } & {C_{B} } & {C_{n} } & {\Lambda_{2} } \\ \end{array} ] $$
(4.21)

Assume the power distribution system demonstrated by a graph H = (M, S, V) comprising of M nodes, a set of section (edges) S with each element connected from node x to node y with a corresponding weight V.

In (4.21) DG (the optimal (shortest) path between the farthest nodes) calculated as:

$$ D_{G} = \frac{2E}{|N|(|N| - 1)} $$
(4.22)

lG represents the length of the graph and obtained by (4.23).

$$ l_{G} = \frac{{\sum\nolimits_{i \ne j} {d(n_{i} ,n_{j} )} }}{N(N - 1)} $$
(4.23)

CB is the betweenness centrality of the graph and calculated as:

$$ C_{B} (i) = \sum\limits_{{n_{i} \ne n \ne n_{l} }} {\frac{{n_{k} \to n_{l} ,n_{i} }}{{n_{k} \to n_{l} }}} $$
(4.24)

where \( n_{k} \to n_{l} ,n_{i} \) is 1 if the optimal path between the node nk to nl passes through ni and 0 if nk to nl does not pass through ni. The phrase \( n_{k} \to n_{l} \) is to show the optimal (shortest) path between the nodes nk and nl [9].

Cn in (4.21) shows the clustering factor of the power distribution system and calculated by (4.25) [9].

$$ C_{n} = \frac{{2e_{n} }}{{k_{n} (k_{n} - 1)}} $$
(4.25)

Algebraic connectivity of the power distribution network is indicated as \( \Lambda_{2} \) and is calculated by (4.26).

$$ \Lambda_{2} = {\text{eig }}L_{(i,j)}^{2} $$
(4.26)

where Laplacian Matrix is obtained as [9]:

$$ L_{(i,j)} = \left\{ {\begin{array}{*{20}c} {{\text{deg}} (n_{i} )} & {{\text{if}}\,i = j} \\ { - 1} & {{\text{if}}\,i \ne j{\text{ and }}n_{i} {\text{ is adjacent to }}n_{j} } \\ 0 & {\text{otherwise}} \\ \end{array} } \right. $$
(4.27)

There are more metrics which can be used for defining resiliency of a power system and (4.21) is only one combination.

In Ref. [10] six metrics that can measure the operational resiliency of microgrids have been identified based on graph theory and Choquet integral. This definition is based on three main assumptions:

  • The number of paths between supply and load nodes affects the resiliency.

  • Increasing the ratio of power supply resources to system loads improves (increases) the system resiliency.

  • The increase in the number of switches in the system will increase the system resiliency, while the increase in the number of switching actions required to connect critical loads to the power supply will reduce the system resiliency.

For the six resiliency metrics defined in Ref. [10] it is assumed the power distribution system is equivalent to a graph that has n nodes and their nodes are connected to one another by e branches. In this equalization, the buses and lines of the power distribution system are demonstrated with nodes and branches, respectively.

Branch Number Impact (BNI)

This measure is equal to the ratio of the total number of joined branches for each RIWL in a PN to the number of all CLs [10].

$$ BNI_{q} = \frac{{\sum\nolimits_{k = 1}^{N_{q}} {\frac{{{\text{Nodes in }}RIWL{\text{ for }}k{\text{th }}PN}}{{{\text{Number of }}CL{\text{s in }}k{\text{th }}PN}}} }}{N_{q}} $$
(4.28)

Overlapping Branches (OB)

This metric is equal to the total number of joint branches in each PCWL in a PN [10].

$$ OB_{q} = \frac{{\sum\nolimits_{k = 1}^{N_{q}} {{\text{common branches in }}k{\text{th }}PN} }}{N_{q}} $$
(4.29)

Switching Actions (SA)

This measure represents the total number of switching operations (change in the state of switches, i.e. closed to open and vice versa) needed to connect all the CLs to sources through different FNs [10].

The Number of Resources (NoR)

It is equal to the ratio of the total number of possible resources utilized to supply all CLs to the number of all CLs in each PN [10].

$$ NoR_{q} = \frac{{\sum\nolimits_{k = 1}^{N_{q}} {\frac{{{\text{Resurces supplying all }}CL{\text{s in }}k{\text{th }}PN}}{{{\text{Number of }}CL{\text{s in }}k{\text{th }}PN}}} }}{N_{q}} $$
(4.30)

Route Abundance (RA)

This is the ratio of the total number of routes that is possible for all CLs joining to all resources to the total number of CLs in each FN [10].

$$ RA_{q} = \frac{{{\text{Routes joning all }}CL{\text{s to all resources in }}q{\text{th }}FN}}{{{\text{Number of }}CL{\text{s in }}q{\text{th }}FN}} $$
(4.31)

The Probability of Accessibility and Penalty Factor (PoA & PF)

This metric is based on two factors: the probability of availability of the source, and the losses in distribution or penalty factor PoA & PF for a FN is calculated by (4.32) [10].

$$ PoA\;{{\& }}\;PF_{q} = \frac{{\sum\nolimits_{k = 1}^{{N_{q} }} {PoA \times PF{\text{ for }}k{\text{th }}PN} }}{{N_{q} }} $$
(4.32)

Figure 4.3 shows the framework that is used in Ref. [10] for quantifying resiliency in a power distribution system using graph theory and Choquet integral computation.

Fig. 4.3
figure 3

Flowchart of quantifying resiliency in power distribution system [10]

Reference [11] is introduced four metrics as (4.33) to measure the resiliency of power grid under extreme events .

$$ \Im = \{ K,LOLP,EDNS,G\} $$
(4.33)

In (4.33) K demonstrates the expected number of lines are on outage due to the inordinate event and is calculated as:

$$ K = \int_{0}^{\infty } {kf(k)dk} $$
(4.34)
$$ f = P_{d} (k|\varUpsilon ) $$
(4.35)

where, Pd refers to the conditional probability of outage of k lines in ϒ [11].

LOLP and EDNS which are known reliability indices [12] are modified in Ref. [11] and defined as survivability following extreme events.

$$ LOLP = \sum\limits_{{e_{i} \in \, S_{e} }} {P_{{e_{i} }} } $$
(4.36)
$$ EDNS = \sum\limits_{{e_{i} \in \, S_{e} }} {P_{{e_{i} }} C_{{e_{i} }} } $$
(4.37)

where \( C_{{e_{i} }} \) is obtained using optimal power flow (OPF) [11].

Parameter G in (4.33) measures the complexity of grid restoration . It must be mentioned, the power system restoration process after an extreme event , depending on the kind and amount of intensity of the disaster and the extent of damage to the critical infrastructures of the system, may take several hours to several days. The grid recovery index is expressed as (4.38) [11].

$$ \begin{aligned} G = \sum\limits_{i = 1}^{5} {w_{i} \eta_{i} } \hfill \\ {\text{where }}\sum\limits_{i = 1}^{5} {w_{i} } = 1 \hfill \\ \end{aligned} $$
(4.38)

where, wi and ηi are described in Fig. 4.4.

Fig. 4.4
figure 4

Grid recovery index factors [11]

In accordance with Refs. [13, 14], the resilience level of the power system under a natural disaster can be plotted as in Fig. 4.5 in five-time intervals. The first stage, which covers the time interval [to tes], shows the system’s resilience level before the horrible event. The devastating event starts at time tes and continues until the time tee. During this period, the level of resilience of the system gradually decreases from the initial value Ro to its minimum value, i.e. Rp (the gradient depends on the structure and capabilities of the power network). The preparation is then followed to start the recovery process at the fastest possible time interval, i.e. [tee trs]. With the onset of restoration process at time trs, the level of resilience of the power system gradually returns to its original state (the desired value before the disaster, Ro). The time after tre is the state of resilience after the completion system recovery process.

Fig. 4.5
figure 5

The resilience level of the power system under a natural disaster

Reference [14] has considered a set of network performance indices as a benchmark for measuring the resilience level of the power system against extreme events and called it as ΦΛEΠ.

In this definition, Φ represents the number of lines that are tripped per hours (during the extreme event occurrence) and calculated by (4.39):

$$ {\Phi } = \frac{{R_{p} - R_{o} }}{{t_{ee} - t_{es} }} $$
(4.39)

The parameter Λ refers to the amount of power system resilience level reduction due to the occurrence of a malicious event (number of lines tripped) and is equal to:

$$ {\Lambda} = R_{p} - R_{o} $$
(4.40)

The time duration that it takes to start the restoration/recovery process after the occurrence of an extreme event represented by E and is equal to:

$$ E = t_{rs} - t_{ee} $$
(4.41)

After the start of the recovery /restoration process, the number of lines that are retrieved per hour is shown using Π and is equal to:

$$ {\Pi} = \frac{{R_{o} - R_{P} }}{{t_{re} - t_{rs} }} $$
(4.42)

In addition to the ΦΛEΠ metric, for calculating the lines that were in service from the beginning of the disaster to the end of the recovery/restoration process (the lines that have not experienced the outage), a criterion called the Area is defined and, in accordance with Fig. 4.5, is equal to [14]:

$$ Area = \int\limits_{{t_{es} }}^{{t_{re} }} {R(t)dt} = \frac{{\Lambda \times (t_{ee} - t_{es} )}}{2} + (R_{p} \times (t_{rs} - t_{es} )) + \frac{{\Lambda \times (t_{rs} - t_{re} )}}{2} $$
(4.43)

Similarly, by plotting the variations of resilience level of the critical infrastructures under a natural disaster, we can also calculate the ΦΛEΠ and Area metrics for them [14].

Reference [15] has improved the power system resiliency based on the grid reconfiguration. In this regard, three metrics have been suggested for quantitative evaluation of power system resiliency.

$$ {{\Psi}} = [ {\Psi}_{i,n,d,t}^{\lambda } , {\Psi}_{i,n,d,t}^{\mu } , {\Psi}_{i,n,d,t}^{\partial } ] $$
(4.44)

In (4.44) when the ith plan of the network reconfiguration is considered at time t, \( \Psi_{i,n,d,t}^{\lambda } \) calculated as:

$$ {\Psi}_{i,n,d,t}^{\lambda } = \frac{{\sum\nolimits_{i \in I} {\sum\nolimits_{n \in N} {P_{{d_{n} ,i}}^{t|\varepsilon } } } }}{{P_{d}^{T} }} $$
(4.45)

The term \( {\Psi}_{i,n,d,t}^{\mu } \) in (4.44) is equal to:

$$ {\Psi}_{i,n,d,t}^{\mu } = \sum\limits_{i \in I} {\sum\limits_{n \in N} {C_{dn} (P_{{d_{n} ,i + 1}}^{t|\varepsilon } - } } P_{{d_{n} ,i}}^{t|\varepsilon } ) $$
(4.46)

The last parameter of \( \Psi \) in (4.44) is \( {\Psi}_{i,n,d,t}^{\partial } \) which is calculated as:

$$ {\Psi}_{i,n,d,t}^{\partial } = \sum\limits_{i \in I} {\sum\limits_{n \in N} {\frac{{P_{{d_{n} ,i}}^{t|\varepsilon } - P_{{d_{n} }}^{{t_{d} |\varepsilon }} }}{{P_{d}^{T} - P_{{d_{n} }}^{{t_{d} |\varepsilon }} }} \times 100} } $$
(4.47)

3 Conclusion

In this chapter, quantitative metrics that were proposed in the literature to assess the resilience of power systems were explained. Researchers have proposed different metrics for the resiliency of power grid in various viewpoints such customer perspective and power distribution level. Physical structure and network topology, severity and type of the threat , system performance under malicious event, restoration/recovery time after the disaster, network reliability indices , number of critical infrastructures such as transformers, storage resources, distributed energy resources etc., are effective in the assessment of the power network resiliency. In a general viewpoint, resilience metrics may be classified in three categories such simulation-based methods (whose are based on the performance of the system), analytical methods (whose are based on the probability and reliability indices) , and statistical analysis of historic outage data. It should be noted that the power system planner can use one or a several numbers of the metrics for an accurate measurement of the resilience of the power system for a specific event with a known severity considering the purpose of system planning.