1 Introduction

1.1 Motivation

Managers of service systems, as well as supply chains for goods, are continuously looking for ways to reduce total costs while also improving the performance of their systems in order to stay competitive in today’s business landscape. At the same time, many companies face disruptions and other unexpected events throughout their supply chains, and these can lead to disastrous financial losses. Combined with the worldwide economic downturn in recent years, this risk necessitates the use of proactive strategies for mitigating the effects of disruptions.

We present a model that combines facility location and network design decisions under the risk of disruptions. Thus, our model optimizes strategic decisions that account simultaneously for the need for efficiency (i.e., low costs) and for reliability (i.e., disruption resilience). (Other papers discuss operational approaches for mitigating disruptions; see, e.g., Tomlin (2006), or see Snyder et al. (2010) for a review.) Our model minimizes the total transportation cost assuming no disruptions take place while imposing a budget constraint on the fixed cost of building facilities and links in the network. It also ensures the reliability of the resulting network by enforcing an upper bound on the total cost that may result when disruptions occur.

Facility location problems choose the locations of facilities and, often, the allocation of customers to them, in order to optimize some objective function, such as minimizing the operating cost or maximizing the demands covered. Based on their objective functions and constraints, facility location problems are categorized into several problem classes, such as the p-median and p-center problems (Hakimi 1964), the uncapacitated facility location problem (Kuehn and Hamburger 1963), the maximum covering location problem (Church and ReVelle 1974) and the set covering location problem (Toregas et al. 1971). In contrast, in network design problems, the basic goal is to optimally construct a network that enables some kind of flow, and possibly that satisfies some additional constraints. The nodes usually are given and the problem must make decisions about which links (edges) to choose from among a set of potential edges.

Our work combines aspects of facility location and network design by making open/close decisions on nodes, as well as on the edges used to travel among the nodes. Often these two problems (facility location and network design) are solved independently, but we would argue that it is more realistic and effective to model and solve them simultaneously. All of the aforementioned classical facility location models locate facilities on a predetermined network. However, the topology of the underlying network may profoundly affect the optimal facility locations. Joint facility location/network design problems have many applications in industries and services, and some studies clearly illustrate the value of solving them simultaneously; see for example, Melkote (1996), Melkote and Daskin (2001), whose facility location/network design models underlie our own.

Another significant aspect that can affect facility location and network design is the reliability of the system. Assuming that facilities are always available and never disrupted is typical in classical studies. Although most companies would like to assume that disruptions rarely happen, and that even if they occur, their supply chains will be reliable enough, in practice, some unexpected disruptions happen and some companies are vulnerable and therefore easily disrupted. The terrorist attacks of 9/11, the catastrophic devastation caused by Hurricane Katrina (Barrionuevo and Deutsch 2005; Latour 2001; Mouawad 2005), the huge finespaid by the Boeing company in compensation for postponing the delivery of the Dreamliner 787 (Bathgate and Hayashi 2008; Peng et al. 2011) andthetragic earthquake and subsequent tsunamiin Japan in 2011 (Bathgate and Hayashi 2008; Clark and Takahashi 2011) are among the most obvious examples of these kind of disruptions.

It can be difficult for companies to remove (or even reduce) the causes of disruptions, since often the causes—such as equipment failures, natural disasters, industrial accidents, power outages, labor strikes, and terrorism—are out of companies’ control and cannot effectively be avoided by precautionary actions. Although some disruptions are short-lived, they can still cause serious long-term negative financial and operational outcomes. Some studies have quantified these negative effects of disruptions empirically; for example, the abnormal stock returns of firms that have been affected by disruptions can reach approximately 40 % (Hendricks and Singhal 2005). Similar findings are described by (Peng et al. 2011; Hicks 2002).

When facility disruptions occur, customers may have to be reassigned from their original facilities to the other available facilities, in which case the transportation costs will surely increase. Moreover, the facility locations that are chosen when the disruption risks are ignored may not be good locations to respond to disruptions; therefore, it is important to incorporate the risk of disruptions when making facility location and network design decisions. That is the primary focus of our study.

1.2 Literature Review

The initial model for the facility location/network design problem (FLNDP) was introduced by Daskin et al. (1993). They demonstrated the importance of optimizing facility locations at the same time asnetwork design and developed a mathematical model to do so. Subsequently, Melkote (1996) developed three models for the FLNDP in his doctoral thesis, including uncapacitated and capacitated versions (UFLNDP and CFLNDP, respectively) and the maximum covering location-network design problem (MCLNDP). These models were also described by Melkote and Daskin (2001). In another doctoral thesis, some efficient approaches were developed to solve the static budget-constrained FLNDP by Cocking (2008). Also, Cocking (2008) developed some useful algorithms to find good upper and lower bounds on the optimal solution. The main heuristics that were proposed by Cocking are simple greedy heuristics, a local search heuristic, meta-heuristics such as simulated annealing (SA) and variable neighborhood search (VNS), and a custom heuristic based on the problem-specific structure of FLND. In addition, a branch-and-cut algorithm using heuristic solutions as upper bounds and cutting planes to improve the lower bound of the problem were developed. The method reduced the number of nodes which were needed to approach optimality.

Drezner and Wesolowsky (2003) proposed a new network design problem with potential links, each of which can be either constructed at a given cost or not. Also, each constructed link can be constructed as either a one-way or two-way link. Bigotte et al.(2010) studied a version of the FLNDP in which multiple levels of urban centers and multiple levels of network links were considered simultaneously in developing a mixed integer mathematical model. Their model determines the best transfers of urban centers and network links to a new level of hierarchy in order to improve the accessibility of all types of facilities. Jabalameli and Mortezaei (2011) proposed a bi-objective mixed integer programming formulation as an extension of the CFLNDP in which the capacity of each link for transferring the demands is limited. Contreras and Fernandez (2012) reviewed the relevant modeling aspects, alternative formulations and several algorithmic strategies for the FLNDP. They studied general network design problems in which design decisions to locate facilities and to select links on an underlying network are combined with operational allocation and routing decisions to satisfy demands. Contreras et al. (2012) presented a combined FLND problem to minimize the maximum customer-facility travel time. They developed and compared two mixed integer programming formulations by generalizing the classical p-center problem so that the models consider the location of facilities and the design of the underlying network simultaneously. Rahmaniani and Ghaderi (2013) developed a mixed-integer model in order to optimize the location of facilities and related transportation network simultaneously to minimize the total transportation and operating costs. They assumed that there are several types of transportation links in which their capacity, transportation and construction costs are different. Also, Ghaderi and Jabalameli (2013) proposed a mathematical model for the dynamic version of uncapacitated facility location–network design problem with a constraint on investment budget for opening the facilities and constructing arcs at each time period during the planning horizon. This model determine the optimal locations of facilities and also the design of the underlying network simultaneously. Table 1 presents an overview of the literature on the FLNDproblem.

Table 1 An overview on the literature of the FLND problem

The literature related to system reliability in facility location problems demonstrates that, in light of the huge investment required for facility location, the attention paid to system failures in facility location has increased in recent years (Qi and Shen 2007; Qi et al. 2010).Drezner (1987) was one of the first researchers who proposed mathematical models for facility location with unreliable suppliers. He studied the unreliable p-median and (p, q)-center location problems, in which a facility has a given probability of becoming inactive. In subsequent research, Snyder (2003), 2005, 2007) proposed several mathematical programming formulations for the reliable p-median and fixed charge problems based on level assignments, in which the candidate sites are subject to random disruptions with equal probability. Berman et al. (2007) formulated a p-median problem with disruptions that relaxes the equal-probability assumption made by Snyder and Daskin (2005). Their model is highly non-linear, and they focus on structural properties and special cases. Shen et al. (2009) also relaxed the assumption of uniform failure probabilities, formulated the stochastic fixed-charged facility location problem as a nonlinear mixed integer program, and proposed several heuristic solution algorithms, as well as an approximation algorithm for the equal-probability case. Lim et al. (2009) proposed a reliability continuum approximation (CA) approach for facility location problems with uniform customer density in which facilities can be protected with additional investments. They demonstrated the impact of misestimating the disruption probability in facility location problems in the presence of random facility disruptions.

Matisziw et al. (2010) studied the effects of disruptions on network services and proposed a multi-objective optimization approach for network restoration during disaster recovery. The model evaluates a tradeoff between two objectives, minimization of system cost and maximization of system flow over a planning horizon within budgetary restrictions and repair scheduling prioritization issues. Berman and Krass (2011) studied the effect of line segment failure on then-facility median problem. They assumed that customers have some information about the status of each facility ahead of time and thus travel directly to the closest operating facility. They proposed a linear combination of deterministic median problems for minimizing the expected travel distance. The problem can be solved for any number of facilities and also valid when the failures are correlated. In another study, Berman et al. (2013) assumed that the facilities are not perfectly reliable and failures may be correlated. Moreover, customer information regarding the state of facilities may be incomplete. Therefore, they studied the impact of failure probabilities, correlated failures, availability of information, and the median and center objectives on the optimal location patterns for location and reliability problems on a line line. They derive explicit expressions for facility routes as functions of the model parameters in order to obtain managerial insights.

Hanley and Church (2011) developed a facility location–interdiction covering model for finding a robust arrangement of facilities that has a suitable efficiency under worst-case facility losses. They formulated a MIP model in which all possible interdiction patterns are considered, and a second, more compact bi-level model in which the optimal interdiction pattern is implicitly defined in terms of the chosen facility locations. Peng et al. (2011) studied the effect of considering reliability in logistic networks design problems with facility disruptions and illustrated that applying a reliable network design is often possible with small increases in total location and allocation costs. They considered open/close decisions on nodes but not on arcs of the commodity production/delivery system. By applying the p-robustness criterion (which bounds the cost in disruption scenarios), they simultaneously minimize the nominal cost (the cost when no disruptions occur) and reduce the disruption risk. Recently, Liberatore et al. (2012) proposed a tri-level mathematical model for the problem of optimizing for tification plans in capacitated median distribution systems with limited protective resources in the face of disruptions that involve large regions. They illustrated empirically that considering correlation effects in a system plays an important role in reducing the suboptimal protection plans and subsequently decreasing the non essential growth in the system cost when disruptions happen. Jabbarzadeh et al. (2012) studied a supply chain design problem in which distribution centers may have partial or complete disruptions. The problem was formulated as a mixed-integer nonlinear program which maximizes the total profit of the system while taking into account different disruption scenarios at facilities. Alcaraz et al. (2012) proposed two hybrid meta-heuristics to solve the Reliability p-median problem, a genetic algorithm and a scatter search approach. They proposed several modifications to improve these algorithms in order to solve the problem. Kim and O’Kelly (2009) and Lei (2013) considered reliable hub-location problems, which have some similarity to our model in the sense that arcs connect hubs (facilities) to each other and not only to demand nodes, though these models do not make open/close decisions about arcs, as our model does. Also, in another study, Shishebori et al. (2013) considered the reliable facility location/network design problem considering system reliability (the same problem we consider in this paper). They present a mixed integer non-linear programming (MINLP) formulation for the problem and present a relatively small case study but do not propose a solution method. In contrast, we formulate the problem as a mixed-integer linear programming (MILP) problem andpropose an efficient heuristic to solve it. Moreover, some other recent studies were done at the facility location problems area regarding to system disruptions (e.g., Yushimito et al. (2012), Chen et al. (2013), Aydin et al. (2013), Maheshwari et al. (2014), Correia (2014), Berglund and Kwon (2014)).

We note that the literature on facility location with disruptions is related to, but distinct from, the literature on backup coverage models (e.g., Hogan and ReVelle (1986), Daskin et al. (1988)). Backup coverage models aim to cover each customer multiple times (that is, to have multiple facilities within a certain radius of each customer) in order to account for congestion at the facilities. However, there is no explicit assignment of customers to “primary” and “backup” facilities, as there is in our model and other disruption models.

Table 2 provides an overview of the literature of facility location problems with respect to system reliability.

Table 2 An overview of the literature on facility location problems considering system reliability

1.3 Model Overview

It is evident from the preceding literature review that the existing studies have not considered both network design and system reliability together with facility location. In fact, the literature review illustrates that there is a research gap in facility location regarding more realistic factors such as network design and system reliability to manage practical facility location problems. However, there are numerous examples of practical problems in which simultaneously considering facility location, network design, and system reliability is critical in improving the efficiency, usefulness, and security of the system. These examples include pipelines for gas and water, infrastructure for airline and railroad networks, and systems for delivering services such as health care and education. (In the latter example, link construction may represent establishing routes for medical transport vehicles or school buses, or may represent the construction of new roads to access the facilities, e.g., in under developed regions.) Moreover, our model includes a budget constraint on the fixed cost of locating nodes and links in the network, which reflects a practical constraint faced by many of these systems.

We will refer to our model as the reliable budget-constrained facility location/network design problem (RBFLNDP). The main contributions that differentiate this paper from existing ones in the related literature can be summarized as follows: (1) We introduce a new optimization model to consider simultaneously facility location and allocation, network design, system reliability and a budget constraint as a mixed-integer, linear programming (MILP) problem. Our model integrates tactical and strategic decision making, such as determining the optimum locations of new facilities, optimum construction of transportation links, and optimum allocation of demand nodes to located facilities so that total costs as well as system reliability are optimized. (2) Our new mathematical formulation not only takes into account facility location costs, link construction costs, and transportation costs, but also constrains the maximum allowable disruption cost of the system, as well as the investment in facility location and transportation link construction. (3) We develop a new hybrid heuristic solution approach for the RBFLNDP that, to the best of our knowledge, has not previously been proposed for solving facility location problems.

The remainder of the paper is organized as follows: In Section 2, the mathematical formulation of the RBFLNDP is developed. In Section 3, the hybrid LP relaxation heuristic solution approach is proposed and described. Then, in Section 4, a numerical example that illustrates the application of the heuristic is demonstrated and, based on it, a sensitivity analysis of the model parameters is reported. Computational results are presented in Section 5 and finally, conclusions and future work are discussed in Section 6.

2 Mathematical Formulation of the RBFLNDP

2.1 Problem Description

We assume that we are given a set of demand nodes, as well as a set of potential transportation links among them. Each of the demand nodes is also a candidate facility node, and our goal is to choose facility nodes and transportation links, thereby constructing a transportation network to meet all of the demand. Fixed costs are incurred for constructing nodes and links, and transportation costs are incurred for each unit of demand that flows along the links. (The preceding two assumptions are made without loss of generality, since if some demand nodes are not eligible to become facilities, we can set their fixed costs to infinity, and if some facilities or links already exist, we can set their fixed costs to zero.) The total investment cost (for locating facilities and constructing links) has a predetermined upper bound, represented in our model by a budget constraint. An additional upper bound is imposed on the investment by requiring that no more than P facilities are constructed. (By including both fixed costs and a limit of P facilities, our model combines aspects of both the uncapacitated fixed cost location problem and the p-median problem.)

Moreover, we assume that the facilities are not reliable and, due to unexpected events such as poor weather or sabotage, they occasionally fail and become unavailable. Accordingly, the demand nodes that were served by the disrupted facility must be reassigned to the nearest active facility. Of course, the re-assigned flowsto the backup facilities are not optimal, leading to increased transportation costs, as well as increased link construction costs to accommodate the rerouted flows. The increased cost that is incurred during a disruption is known as the “failure cost” (Snyder 2003; JabalAmeli and Mortezaei 2011, Contreras and Fernandez 2012), and its upper bound may be called the “maximum allowable failure cost”. Our goal is to bound the failure cost that occurs for any disruption, regardless of how likely the disruption is. Accordingly, we do not consider either the probability or the duration of disruptions. (For similar approaches in the literature, see Snyder (2003) or Peng et al. (2011).)

We assume that the system functions as a customer-to-server system in which customers themselves travel to the facilities in order to receive service. Thus, when we speak of a “flow” on a given link, we are speaking of the flow of customers traveling on the link toward the facility they patronize. In contrast, many facility location or network design models consider the flows of goods, andin the opposite direction, from facilities toward customers. The customer-to-server assumption is common for service systems and is consistent with the models proposed by Melkote (1996). However, this assumption is not critical for our model, which could be easily adapted to accommodate flows in the opposite direction through appropriate modifications to the parameters, decision variables, and constraints.

In our model, when we choose the location of a facility, we also choose which facility will serve as its backup when the facility is disrupted. Note that this differs from the notion of “backup facility” as used by Snyder and Daskin (2005) and other authors, in which backups are assigned at the customer level, not the facility level. That is, in other models, two customers assigned to the same facility may have different backup facilities, whereas in our model, they have the same backup facility.

Suppose that node kis chosen as the backup facility for a facility at node i. We assume that additional links must be constructed beforehand to accommodate the new flows into node k in case a disruption occurs at node i. Thus, to accommodate rerouting when i is disrupted, a link (j, k) must be constructed for every link (j, i) that was constructed for normal flows, and a link from i to k must also be constructed (for the demands originating at i).

To summarize, our problem is to determine: (1) the optimum locations of facilities; (2) the primary facilityof every demand node; (3) the backup facility of every facility; (4) the transportation links that should be constructed in advance for both normal and disrupted conditions; and (5) the amount of demand that should be transported on each transportation link. The objective function minimizes the transportation cost, while constraints bound the investment cost and the failure cost.

2.2 Additional Assumptions

In addition to the assumptions described in the preceding section, we assume the following:

  • The facilities and network links are uncapacitated.

  • Facilities can only be located on the nodes of the network, not on links.

  • All travel costs are symmetric.

  • All network links are reliable; that is, disruptions occur at the nodes only.

  • At most one facility fails at a time.

  • Demands may be routed through intermediate nodes (other customer nodes) in both normal and disrupted conditions.

2.3 Notation

N :

set of nodes in the network; each is both a demand node and a potential facility location

S :

set of candidate links in the network

d i :

demand at node iN

D :

∑  i ∈ N d i total demand

B :

investment budget for facility location and link construction

f i :

fixed cost of locating a facility at node iN

c ij :

cost of constructing link(i, j)

P :

number of facilities to open(P ≥2)

FC :

maximum allowable failure cost

t ij 0 :

transportation cost of a unit of flow on link (i,j)

t ij l :

transportation cost forall of the demand of node lto flow on link (i,j), t ij l = t ij 0 d l

M :

large number

Note that t ij 0 and t ij l represent link-specific transportation costs, not origin–destination transportation costs. Since customers traverse routes that consist of multiple links (unlike in classical location problems but similar to network design problems), we must model the flows link-by-link and therefore use link-specific transportation costs.

We assume that the maximum allowable failure cost, FC, is the same for all facilities, for the sake of simplicity. The model can easily be modified to allow facility-dependent maximum allowable failure costs by appending a subscript to FC and modifying the appropriate constraint in the formulation below. Determining a suitable value for FC in practice may be challenging, because firms and service organizations may find it difficult to quantify the specific maximum failure cost they could withstand. However, our problem can be solved iteratively with different values of FC to obtain a tradeoff curve from which decision makers may choose a solution that strikes an appropriate balance between operating cost and failure cost, based on their preference. The method for generating this tradeoff curve is discussed in Section 4.

Z ik :

1 if a facility is located at node i and the facility located at node k is node i’s backup facility, 0 otherwise

X ij :

1 if link (i,j) is constructed, 0 otherwise

Y ij l :

fraction of demand of node lthat flows on link (i,j)∈S

W i l :

fraction of demand of node lthat is served by a facility at node iN

Since a backup facility is required for each open facility, a facility is located at node i if and only if Σ k Z ik  = 1. (It is allowable for Z ik and Z ki both to equal 1. Each facility may serve as both a “primary” facility and as a backup for another facility.)

The transportation (flow) variables work as follows. Y ij l represents the fraction of node l’s demand that flows on link (i, j). (Recall that “flow” refers to the flow of customers toward their facilities.) No flow is allowed on (i, j) unless that link is constructed, i.e., unless X ij  = 1. Moreover, if link (i, j) is constructed, then we assume that the demand of node i flows on it. (This is optimal if the link construction and transportation costs satisfy the triangle inequality.) Thus, we define Y ij i = X ij . A second set of flow variables, W i l, indicates which facilities (i) serve which customers (l), ignoring which links that flow actually travels on. We assume that if i is selected as a facility, then that facility serves the demand at i itself; hence, W i i = Σ k Z ik .

2.4 Formulation

With respect to the aboveassumptions and notations, the mathematical formulation of the RBFLNDP is shown below:

$$ {}^{\left(\mathrm{RBFLNDP}\right)}\kern1em Min\kern0.24em C={\displaystyle \sum_{\left(i,j\right)\in S}{\displaystyle \sum_{l\in N}\;{t}_{ij}^l\;{Y}_{ij}^l}} $$
(1)
$$ \begin{array}{l}s.t.\hfill \\ {}{\displaystyle \sum_{i\in N}{\displaystyle \sum_{k\in N}{f}_i{Z}_{ik}}+{\displaystyle \sum_{\left(i,j\right)\in S}{c}_{ij}{X}_{ij}}\le B}\hfill \end{array} $$
(2)
$$ \begin{array}{l}\left[{\displaystyle \sum_{m\in N}{\displaystyle \sum_{\kern0.36em n\in N}{f}_m\;{Z}_{mn}}+{\displaystyle \sum_{\left(m,j\right)\in S}{c}_{mj}\;{X}_{mj}}+{\displaystyle \sum_{\left(m,j\right)\in S}{t}_{mj}^m\;{X}_{mj}}+{\displaystyle \sum_{\left(m,j\right)\in S\;}{\displaystyle \sum_{l\in N:l\ne m}\;{t}_{mj}^l\;{Y}_{mj}^l}}}\right]\;\\ {}+\left[{c}_{ik}+{\displaystyle \sum_{j\in N}\kern0.48em {c}_{jk}\;{X}_{ji}}+{\displaystyle \sum_{j\in N}\left({t}_{jk}^j-{t}_{ji}^j\right)\;{X}_{ji}}+{\displaystyle \sum_{j\in N}{\displaystyle \sum_{l\in N:l\ne j}\;\left({t}_{jk}^l-{t}_{ji}^l\right)\;{Y}_{ji}^l}}\right]\le FC+M\left(1-{Z}_{ik}\right)\kern0.72em \forall i,\kern0.24em k\in N\end{array} $$
(3)
$$ {X}_{li}+{\displaystyle \sum_{j\in N:j\ne l}\kern0.36em {Y}_{ji}^l}=\kern0.36em {\displaystyle \sum_{j\in N}\kern0.36em {Y}_{ij}^l}+{W}_i^l\kern2.64em \forall i,\;l\in N\;:\;i\ne l, \kern0.24em \left(l, \kern0.24em i\right)\in S $$
(4)
$$ {\displaystyle \sum_{j\in N:j\ne l}\kern0.36em {Y}_{ji}^l}=\kern0.36em {\displaystyle \sum_{j\in N}\kern0.36em {Y}_{ij}^l}+{W}_i^l\kern2.64em \forall i,\;l\in N\;:\;i\ne l, \kern0.36em \left(l, \kern0.24em i\right)\notin S $$
(5)
$$ {\displaystyle \sum_{k\in N}\;{Z}_{ik}}+{\displaystyle \sum_{j\in N}\;{X}_{ij}}=1\kern2.76em \forall i\in N $$
(6)
$$ {\displaystyle \sum_{k\in N}\kern0.36em {Z}_{ik}}+{\displaystyle \sum_{j\in N:j\ne i}\kern0.36em {W}_j^i}=1\kern2.52em \forall i\in N $$
(7)
$$ {Y}_{ij}^l\le {X}_{ij}\kern4.92em \forall \left(i,\;j\right)\in S\kern0.24em ,\kern0.48em \forall l\in N\;:\;i\ne l, $$
(8)
$$ {W}_i^l\kern0.48em \le {\displaystyle \sum_{k\in N}\kern0.36em {Z}_{ik}}\kern1.92em \forall i, \kern0.24em l\in N\;:\;i\ne l $$
(9)
$$ {\displaystyle \sum_{i\in N}{\displaystyle \sum_{k\in N}\;{Z}_{ik}}}=P $$
(10)
$$ {\displaystyle \sum_{k\in N}\kern0.36em {Z}_{ik}}\le 1\kern1.8em \forall i\in N $$
(11)
$$ {\displaystyle \sum_{i\in N}\kern0.36em {Z}_{ik}}\le {\displaystyle \sum_{m\in N}\kern0.36em {Z}_{km}}\kern1.56em \forall k\in N $$
(12)
$$ {Z}_{ii}=0\kern2.52em \forall i\in N $$
(13)
$$ {Y}_{ij}^l\ge 0\kern2.28em \forall i,\;j, l\in N $$
(14)
$$ {W}_i^l\ge 0\kern2.28em \forall i, l\in N $$
(15)
$$ {X}_{ij}\in \left\{0,\;1\right\}\kern2.16em \forall \left(i,\;j\right)\in S\kern0.24em ,\forall l\in N:\kern0.24em l\ne i $$
(16)
$$ {Z}_{ik}\in \left\{0,\;1\right\}\kern2.16em \forall i,\;k\in N:\kern0.24em l\ne i $$
(17)

The objective function (1) includes the transportation costs on all transportation links. The term of the function represents any additional links that node i’s demand travels along. Constraint (2) stipulates that the investment in facility location and link construction cannot exceed B. Constraints (3) are the reliability constraints. If a facility is opened at i and has backup facility k (i.e., Z ik  = 1), then the total failure cost may not be greater than FC. The first bracket calculates the “nominal” cost (the location and transportation cost if no disruptions occur), while the second bracket calculates the increase in cost when facility i fails. In particular, when facility i fails we must construct a new link to k from each node j for which a link (j, i) exists for normal conditions, as well as a new link from i to k to accommodate i’s demand; these costs are represented by the first two terms in the second bracket. We must also re-route the flows on these new links, and the additional transportation cost from this re-routing is represented by the second two terms inside the second bracket. Note that if Z ik  = 0, then the constraint is non-binding since the right-hand side is large.

Constraints (4) and (5) are flow-conservation constraints requiring that, for each pair of nodes l and i, the flow of node l’s demand into node i equals the flow of the same out of node i plus the demand served by a facility at node i, if any. Note that the customer-to-server assumption means that we treat facility nodes as “sinks” for the demand. The two constraints differ by the term X li , which is included in the inbound flow if (l, i) is a potential link (in which case it equals Y li l).

Constraints (6) ensure that, for each node i, either there is a facility at i or some link is constructed out of i. Constraints (7) require the demand of node i to find a destination, whether it is satisfied by node i itself (Z ik  = 1 for some k) or by some other node j (W j i = 1). Constraints (8) and (9) guarantee that potential links and facilities are not used if they are not constructed. Constraint (10) restricts the total number of newly located facilities toP. Constraints (11) stipulate that at most one facility k may be chosen as the backup for i. Constraints (12) prevent facility k from being used as a backup for facility i if k has not been opened. Constraints (13) say that a facility cannot be selected as its own backup facility. Constraints (14)–(17) are standard non-negativity and integrality constraints.

Note that, although we define Y and W as continuous variables, there exist optimal solutions in which they are binary, as in many other uncapacitated facility location models. Thus, we can treat the demand of each node as though it is completely assigned to the closest single facility rather than split among multiple facilities.

2.5 Complexity

Property 1 establishes that the RBFLND problem is NP-hard, since it has thep-median problem, which is itself NP-hard, as a special case.

Property 1

The RBFLND problem is NP-hard.

Proof

The RBFLNDP reduces to the classical p-median problem if we set f ij  = c ij  = 0 and B = FC = ∞. Since the p-median problem is NP-hard (Kariv and Hakimi 1979), so is the RBFLNDP. (Note that the RBFLNDP allows travel through intermediate nodes, whereas the classical p-median assumes direct travel between customers and facilities. However, the model in Kariv and Hakimi (1979) assumes distances are calculated as shortest paths in an underlying network, and therefore the two interpretations are identical.) □.

3 Solution Procedure: LP Relaxation Heuristic Approach

We coded the formulation for the RBFLNDP proposed in Section 2–5 in GAMS 23.3 and solved it using CPLEX 12. We found that CPLEX can find an optimal solution for the RBFLNDP quickly for small-scale instances but that the run time increases sharply as the problem size grows, as suggested by Property 1. Thus, it is desirable to have an efficient heuristic solution procedure to solve larger-scale instances of the RBFLNDP. From the literature review in Section 1–2, one can conclude that customized heuristics based on problem-specific structure have played an important role in solving both the FLND problem and reliable facility location problems, and that they can often obtain near-optimal solutions to these problems in a reasonable computation time.

Motivated by this, we propose a new LP relaxation-based heuristic to solve the RBFLNDP. The basic idea is to first solve the LP relaxation of (RBFLNDP), then to round the resulting location variable matrix (Z LP) to integers, and finally to solve the original (RBFLNDP) model but with the location variables fixed to these new integer values. The rounding is performed based not on the individual elements Z ij LP of Z LP, but on the sums Z ij LP + Z ji LP, since this sum provides more information about whether i and j represent a good pair of facilities. The heuristic proceeds as follows:

  1. 1.

    Solve the LP relaxation of (RBFLNDP). Let Z LP denote the n × n matrix of location variables, and let Z ij LP be its (i,j) th element.

  2. 2.

    Improve Z LP heuristically to obtain a binary matrix Z imp as follows:

    1. a.

      Let Z LP2 be a new upper-triangular matrix in which Z ij LP2 = Z ij LP + Z ji LP if i < j and Z ij LP2 = 0 if i > j. (Recall that Z ij LP2 = 0 if i = j.)

    2. b.

      Let Z imp be an n × n matrix consisting of all 0 s, Ψ = {Ø} andθ = 0. (Note that θ represents the number of elements of Ψ.)

    3. c.

      While θ ≤ P do:

      1. i.

        Let (i,j) be the largest element of Z LP2. If there are multiple elements with the same maximum value, select the one whose column has the largest sum.

      2. ii.

        Set the values ofZ ij LP2andZ ij impbased on the “If-Then” conditions described in Fig. 1.

        Fig. 1
        figure 1

        “if-then” conditions flowchart of heuristic algorithm

  3. 3.

    Solve (RBFLNDP), treating the decision variables Z as fixed parameters equal to the corresponding values in Z imp.

We consider a small numerical example to illustrate the heuristic. Suppose that Z LP, the output of the LP relaxation of (RBFLNDP), is as given below. By adding the corresponding values as in step 2(a), we obtain the matrix Z LP2.

$$ \begin{array}{l}\kern0.36em {Z}^{LP}=\left[\begin{array}{cccccc}\hfill 0\hfill & \hfill 0.341\hfill & \hfill 0.318\hfill & \hfill 0.341\hfill & \hfill 0.318\hfill & \hfill 0.341\hfill \\ {}\hfill 0.341\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0.318\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0.341\hfill & \hfill 0.113\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0.341\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \end{array}\right]\kern1.08em \Rightarrow {Z}^{LP2}=\kern0.6em \left[\begin{array}{cccccc}\hfill 0\hfill & \hfill 0.682\hfill & \hfill 0.318\hfill & \hfill 0.682\hfill & \hfill 0.318\hfill & \hfill 0.341\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0.318\hfill & \hfill 0.113\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0.341\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \end{array}\right]\hfill \\ {} Final\kern0.36em {Z}^{LP2} = \left[\begin{array}{cccccc}\hfill 0\hfill & \hfill 0\hfill & \hfill 0.318\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0.341\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0.318\hfill & \hfill 0.113\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0.341\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \end{array}\right]\hfill \end{array} $$

The maximum value of Z LP2 is 0.682, but elements (1,2) and (1,4) both attain this maximum. Since column 2 sums to 0.682 and column 4 sums to 0.795, we select element (1,4), and we set Z 1,4 imp = 1 and Z 1,4 LP2 = 0. We repeat this process until Ψ contains P elements. If P = 4, the final matrixZ imphas elements (1,4), (4,1), (2,1), and (5,1) equal to 1. In other words, facilities are opened at nodes 1, 2, 4, and 5, and the variables Z 14, Z 41, Z 21, and Z 51 are fixed to 1 when we solve (RBFLNDP).

4 Numerical Example

We now provide a numerical example illustrating the application of the model and heuristic to a 21-node example used by several authors (Melkote 1996; Daskin 1987, Hodgson and Rosing 1992, Simchi-Levi and Berman 1988). The network has 38 potential links that may be selected for construction. Other data, such as the facility location cost, the demand of each node and the transportation cost of each link are described in Melkote (1996). Also, a fixed coefficient u is defined as the cost of constructing one unit length of a link, so that each link construction cost is calculated as c ij = ut ij . We assume that P = 4, u = 30 and the budget B = $25,000.

We begin by ignoring the disruptions (setting FC = ∞). The optimal solution for this problem, obtained by solving (RBFLNDP) optimally using CPLEX, is depicted in Fig. 2. The optimal facilities are 2, 10, 12, and 18, and the optimal links are displayed in the figure. Note that some demands (e.g., nodes 4 and 11) are served directly by a facility, while others (e.g., nodes 5 and 13) are served via intermediate nodes and links.

Fig. 2
figure 2

The optimal solution of the numerical example without considering disruptions (FC = ∞)

The total cost of this solution, which ignores disruptions, is $28,410. However, suppose that the worst-case disruption occurs; this happens to be facility 18, which has the maximum failure cost (i.e., the largest left-hand side of constraints (3)). In this case, the demand nodes that are served by facility 18 must be served by facility 18’s best (or nearest) backup facility, facility 10. (Transportation links that are constructed for failure conditions are indicated as dashed lines in the figures.) For example, in Fig. 2, demand node 19 is served by facility 18 normally (and routed 19-17-18), but if facility 18 is disrupted, it is served by node 10 (and routed 19-17-10). This results in a failure cost of $168,666. (Recall that the objective function includes transportation costs only, whereas the failure cost includes all costs.)

Now suppose, instead, that the decision makers want the maximum value of the failure cost to be not more than $159,000 (FC = $159,000). In this case, the optimal design, shown in Fig. 3, has a cost of $32,310 when there is no disruption. But, in this case, if there is a disruption at the facility located at node 10 (which now has the maximum failure cost), the demand nodes that are served by node 10 are fulfilled by facility 10’s backup facility, facility 2 (according to the solution obtained for the RBFLNDP model), and the failure cost is $158,256. (Note that node 2 may not be the best (nearest) backup facility for node 10. However, it is good enough to ensure that the failure cost is less than the upper bound of $159,000. In general the model does not differentiate among these “good enough” backup facilities, and any of them may be chosen.) Clearly, the new solution causes a small increase in transportation cost versus the original solution ($32,310 versus $28,410), but the maximum failure cost is significantly reduced ($158,000versus $168,666). This reduction can be critical, especially in emergency conditions.

Fig. 3
figure 3

The optimal solution of the numerical example considering disruptions (FC = $159,000)

To provide insights into the behavior of the objective function of (RBFLNDP) in response to changes inFC and B, Fig. 4 and Fig. 5 present trade off curves for the RBFLNDP for the problem instance from Melkote (1996). Figure 4 shows how the objective function changes with FC. The optimal FLNDP solution (FC = ∞) is the left-most point on the curve, and subsequent points represent solutions obtained by choosing other values of FC. Evidently, the objective function improves as the maximum failure cost increases; there is a tradeoff between the two. This relationship is logical because, in order to increase the reliability of the network, additional facility location costs, link construction costs, and transportation costs must be paid. Fortunately, the left part of the tradeoff curve is steep, indicating that large improvements in reliability may be attained with small increases in FLNDP cost. For example, the fourth point on the curve has a 6 % larger value ofC* versus the optimal solution to the FLNDP but a 20 % reduction in the maximum failure cost. The smooth right-most portion of the curve is of less interest, because it shows a largeincrease in the total cost compared with a very small decrease in the maximum failure cost.

Fig. 4
figure 4

The changes in optimal value of objective function for different values of FC

Fig. 5
figure 5

The changes in optimal value of C* for different values of B

Another important factor affecting the value of the objective function is the investment budget B. Figure 5 illustrates the changes in the optimal value of the objective function (C*) for different values of B. From Fig. 5, it is clear that the optimal value of C* will increase considerably as the value of B decreases. This relationship is also logical, because as the investment budget decreases, fewer facilities and links can be constructed, and therefore travel costs will increase.

5 Computational Results

A series of numerical experiments to evaluate the performance of the proposed heuristic approach were performed. The algorithm was coded in MATLAB R2011b and GAMS 23.3.3 and executed on a computer with an AMD Opteron 2.0 GHz (×16) processor and 32GB RAM, operating under Linux.

5.1 Experimental Design

In order to verify the performance of the proposed heuristic approach, we solved 30 problems ofvarying sizes. These problems were generated randomly in a manner similar to that described in the literature (Melkote 1996, 2001). In particular: The transportation cost for each link was randomly drawnfrom a discrete uniform distribution on [30, 100]. The construction costs of new links were calculatedby multiplying the transportation cost by a coefficient u, where u has a discrete uniform distribution on [15, 30] and is drawn once for each instance and used for all links. The demand at each node and the fixed cost of opening each facility were sampled uniformly from [10, 150] and [1,200, 3,000], respectively, and then rounded to the nearest integer. Our instances contain between N = 5 and 60 nodes and P varies from 2 to 9.

We set CPLEX’s optimality tolerance to 10 % and its time limit to 2,500 s for both the exact and heuristic methods. That is, CPLEX terminated when either of these criteria were reached, both when solving the problem optimally and when solving the IP in the final step of the heuristic.

5.2 Algorithm Performance

Table 3 summarizes the performance of the proposed heuristic algorithm with that of CPLEX. For each algorithm, the tablerepresents the run time (“Time”) and objective value (“Cost”). The run time for the heuristic includes the time required to solve the LP relaxation in step 1, which is then used as an input for the main step of the algorithm. The table reports the lower bound from CPLEX (“CLB”) and from the heuristic (“HLB”), where the latter represents the objective value of the LP relaxation solved in step 1 of the heuristic, and the percentage gap between the objective value of the best solution found and the corresponding lower bound:

Table 3 Comparison of performance of heuristic vs. CPLEX
$$ Ga{p}_{CPLEX}\;\left(\%\right)=\frac{Cos{t}_{CPLEX}- CLB}{Cos{t}_{CPLEX}}\times 100\kern.5em ;\kern.5em Ga{p}_{Heuristic}\;\left(\%\right)=\frac{Cos{t}_{Heuristic}- HLB}{Cos{t}_{Heuristic}}\times 100 $$

Finally, the last two columns give the ratio between the computation times (solution costs, respectively) of the two methods:

$$ Rati{o}_{Tim e}\;\left(\%\right)=\frac{Tim{e}_{Heuristic}}{Tim{e}_{CPLEX}}\times 100\kern.5em ;\kern.5em Rati{o}_{Cos t}\;\left(\%\right)=\frac{Cos{t}_{Heuristic}}{Cos{t}_{CPLEX}}\times 100 $$

Values less than 100 % in the “Time (%)” and “Cost (%)” columns indicate that our heuristic outperformed CPLEX with respect to CPU time and solution cost, respectively. (CPLEX may find sub-optimal solutions because of our termination settings, as described above.) Our heuristic was faster than CPLEX for all instances. In addition, the notation “n/a” in the table indicates that no feasible solution could be obtained for that instance in the allowed time (2,500 s).

The solutions returned by our heuristic were, on average, 1.4 % more expensive than those returned by CPLEX (across all instances), and it was able to find the same or better solutions than CPLEX for 25 of the 36 test problems (69.4 %). The heuristic is also much faster: it required only 64.6 % of CPLEX’s time, on average, and failed to find solutions within the time limit for only one of the instances, whereas CPLEX failed to do so for 9 of the 36.

Figure 6 illustrates the run times of the proposed heuristic algorithm and CPLEX graphically. Each data point represents the average of the three instances (each with a different value of P) for each value of N. From Fig. 6, it can be concluded that the CPU time of CPLEX increases sharply as the number of nodes increases; moreover, CPLEX cannot solve the instances with more than 50 nodes to 10 % optimality. In contrast, the proposed heuristic can obtain the same or better solutions in a reasonable time compared with CPLEX.

Fig. 6
figure 6

Run times of proposed heuristic algorithm vs. CPLEX

In Fig. 7, we plot the average cost ratio vs. the number of nodes. From the figure one can conclude that the proposed heuristic can obtain the same or better solutions most of the time compared with CPLEX. We note that CPLEX cannot find any feasible solution for test problems with more than 50 nodes in the time limit; therefore no gaps are plotted for those values of N.

Fig. 7
figure 7

Performance of the proposed heuristic algorithm vs. CPLEX with respect to the value of objective function (Ratio Cost (%))

In Fig. 8, the percentage gap (between the objective value of the best solution found and the corresponding lower bound) of CPLEX and the heuristic are compared graphically. From the figure, it can be concluded that the proposed heuristic has smaller gaps, and also more stability, compared with CPLEX. This provides further evidence of the efficiency of our heuristic.

Fig. 8
figure 8

Performance of the proposed heuristic vs. CPLEX with respect to the percentage gap between the objective value of the best solution found and the corresponding lower bound

6 Conclusions and Future Research

In this paper, we considered the combined facility location/network design problem considering two additional aspects not previously included in studies of this problem, namely, system reliability and budget constraints. Our problem, called the reliable budget-constrained facility location/network design problem (RBFLNDP), was formulated as a mixed integer linear programming model. The basic principle in the proposed formulation is the concept of “backup” assignments, which indicate the backup facilities to which clients are assigned when closer facilities have failed and are not available. The tradeoff between the nominal cost and system reliability emphasized that significant improvements in system reliability are often possible with slight increases in the total cost. Moreover, a sensitivity analysis was done to provide insight into the behavior of the proposed model in response to changes in the reliability limit and the investment budget. The sensitivity analysis for the maximum allowable failure cost indicates that large improvements in reliability may be attained with small increases in cost, while that for the investment budget showed that the optimal value of the objective function increases considerably as the budget decreases. This effect is logical, because of the limitation the investment budget places on new facility location and link construction. We proposed an efficient heuristic based on LP relaxation to solve the proposed mathematical model. Numerical tests showed that the proposed heuristic consistently outperforms CPLEX in terms of solution speed, while still maintaining excellent solution quality.

Our findings raise some questions for future research. First, our heuristic may still require an unacceptably long run time for considerably larger problem instances, so it would be desirable to develop an alternate heuristic capable of solving larger instances in reasonable time. One possible avenue is the development of meta-heuristics such as tabu search (TS) and particle swarm optimization (PSO). Second, we considered only a single objective function in this paper; however, considering the RBFLNDP as a multi-objective problem, such as minimizing the operating costs while also maximizing the reliability of system, may find practical application in industries and services. Third, we studied only disruptions at facilities (nodes). However, disruptions of links (arcs) can be considered simultaneously as another kind of system disruption.