1 Introduction

Sustainable logistics is one of the novel concepts in supply chain management. In recent years, researchers’ attentions are attracted to apply green and satiability in their researches. Some Governments and non-Government organizations oblige logistic companies to address environmental issues in their strategic decisions. Earlier works in Supply Chain Management (SCM) concentrated on economical issues more than other issues, but the trend of researches shows that environmental issues have been considered further in recent years (Sbihi and Eglese 2007).

Sustainability can be defined as the ability of enhancing the quality of human life with simultaneous consideration of activities related to protection of environment (Sbihi and Eglese 2007). According to the world commission on environment and development (termed Brundtland) definition, sustainability is “the potency to meet the present needs of human without disruption and damage to capability of next generations to meet their needs” (Brundtland et al. 1987). Sustainable supply logistic is the logistic system in which economical, social, and environmental objectives are considered simultaneously. In this research, social objectives including minimization of transportation risk and minimization of undesirable sites risks are addressed in addition to the economical costs. Moreover, environmental issues are embedded in the one of the objectives. These terms of objective function aim to reduce destructive impacts of fuel consumption on environment. So, the present paper which considers economical, social, and environmental aspects of logistics management, simultaneously are categorized as a sustainable logistics management.

Green logistics is all activities corresponding to the production and distribution in a way that environmental issues are taken into consideration. The significance of green logistics is derived by the fact that current logistics strategies applied in logistics companies are not sustainable in the long term. Lin et al. (2014) showed that a new branch in vehicle routing problem (VRP) which is one of the typical problems in operations research (OR) has emerged and they called it green vehicle routing problem. They classified this field into the three major categories, i.e., Green-VRP, pollution routing problem, and VRP in reverse logistics. Green-VRP deals with the problems in which the optimization of energy consumption is addressed (Salimifard et al. 2012; Erdoğan and Miller-Hooks 2012; Küçükoglu et al. 2015). The pollution routing problem (PRP) introduced by Bektas and Laporte (2011) is concerned with reduction of greenhouse gases (GHG) in particular CO2 emissions. These gases have unfavorable effects on ecosystems and humans’ health and efforts to reduce of these gases has increased in recent years (Fagerholt et al. 2010; Demir et al. 2014; Kramer et al. 2015). The third type of green VRP is VRP in reverse logistics (VRPRL). Dekker et al. (2013) proposed the definition for reverse logistics: “The process of planning, implementing and controlling backward flows of raw materials, in process inventory, packaging and finished goods, from a manufacturing, distribution or use point, to a point of recovery or point of proper disposal”. According to the Lin et al. (2014), VRPRL is divided into three categories that waste collection management is considered as one of these categories. Both waste collection management and reduction of CO2 emission which are two main features of green logistics problems are investigated in this paper. In other words, reduction of CO2 emission by means of decrease in fuel consumption and applying of the proposed model in waste collection management are green features of the presented model in this paper.

As aforementioned, waste collection problem is categorized as one of the subsets of sustainability (Beamon 2008) and green logistics (Lin et al. 2014). Waste collection problem is one of the most important and crucial applications of vehicle routing problems in real world for the reason that wastes and in particular hazardous wastes influence on human health and have destructive effect on environment. Waste collection management consists of all processes which are related to collect, reuse, dispose and recycle of wastes. These processes are key factors in protecting an environment and conserving resources. Alumur and Kara (2007) presented a new mathematical model for the hazardous waste location routing problem. This paper aimed to minimize total cost and transportation risk. This model was used in real case in central Anatolian region of Turkey. Samanlioglu (2013) extended the collection network presented in Alumur and Kara (2007) and proposed a new network for waste management. This network includes generation, treatment, recycling, disposal nodes and the arcs between them. This network is the most complete network considered in the literature. We improve this network in the paper by adding depots of vehicles to previous network. In addition, the relationship between generation nodes which was neglected in previous network is considered in our research.

Finding an appropriate location for undesirable waste sites and determining of rational routes for collection of wastes from generation nodes are strategic and tactical decisions which should be made with respect to defined goals. Therefore, many of papers in the literature have applied location routing problem (LRP) or VRP models to describe their waste collection network (Alumur and Kara 2007; Samanlioglu 2013; Nambiar et al. 1981; Zhao and Zhao 2010; Zografros and Samara 1989). Martínez-Salazar et al. (2014) introduced a new version of LRP model which combined transportation problem with LRP and called it transportation location routing problem (TLRP). They applied this model in distribution network and two metaheuristic algorithms were used to tackle the problem. Utilizing of TLRP in waste collection network can be interesting subject, so in this paper, waste collection problem is modeled in the format of TLRP.

Metaheuristic algorithms are very common approach for solving optimization problems (Azadeh and Farrokhi-Asl 2017). In recent years, researchers’ attention to multi objective problems (MOPs) with Pareto approaches grows more rapidly. Population-based metaheuristics like Non-dominated Sorting Genetic Algorithm (NSGA-II) is an appropriate approach to solve MOPs, because they deal with a set of solutions that allow decision makers to reach several efficient solutions in a single run of the algorithm. In addition, Pareto population-based metaheuristics methods like NSGA-II are less sensitive to the convexity of the Pareto front. For reason of NP-hard nature of the presented problem (Alumur and Kara 2007), exact methods which can find Pareto optimal set are unable to tackle the large sized problems. In this regard, we present a hybrid metaheuristic algorithm to solve the problem presented in this paper. Moreover, the literature of this field are full of papers which have applied metaheuristics to solve the waste management problem and reached to rational solutions. Especially in recent years researchers have used these approaches (Farrokhi-Asl et al. 2017; Rabbani et al. 2018; Mahmoudsoltani et al. 2018; Gatica 2018).

The paper is structured as follows. In Sect. 2, the problem and its attribute is defined and then, the manner in which the fuel consumption and CO2 emission is evaluated is described. Also, the mathematical model is presented in this section. The methodologies for solving a problem is introduced in Sect. 3. In Sect. 4, experimental results for evaluating the proposed algorithms are conducted. Finally, conclusion remarks and future research are provided in Sect. 5.

2 Problem description

The problem considered in this paper is about collection of hazardous wastes from generation nodes and transmitting them to the relevant sites such as treatment, recycle, and disposal sites. The graph network of the problem comprises some nodes and arcs between these nodes which includes potential locations for establishing depots, generation nodes, and potential locations for recycling, treatment and disposal facilities. Potential locations for establishing facilities are the locations that a facility can be opened there, but there is no obligation to establish facility at each potential location. There are some vehicles in each opened depot and these depots are capacitated and the capacity of each depot is different from other depots; that is, amount of wastes collected by vehicles of one depot must not trespass from capacity of the depot. There are certain fixed costs to establish a depot in each potential location and this cost is another attribute of each depot which effects on our decisions. It should be noted that the vehicles are heterogeneous and multi-compartment means that they have specific capacity for each type of waste. Also, the method for fuel consumption and CO2 emission evaluation are described in Fuel consumption rate and CO2 emission estimation subsections.

Collected wastes from generation nodes are transmitted toward treatment facilities, in order to treat of hazardous wastes and reduce risk level. The locations of these treatment facilities are unknown and should be chosen from potential locations for treatment facilities. The capacity of each established treatment facility is limited. Another assumption addressed in this paper is the limitation about compatibility between wastes and applied technology of treatment facility. Each type of waste can only treated at treatment center which has compatible technology. According to Alumur and Kara (2007), two types of technology (i.e. incineration and chemical treatment) and three types of waste (i.e. compatible wastes with incineration, compatible wastes with chemical treatment, and compatible wastes with both of them) are considered in this problem. The vehicles move between treatment facilities until they unload all wastes.

After treatment process and reduction of wastes risks, waste residues are divided into two groups. First group is the waste residues which are directly transmitted to the disposal facilities. The second group which are recyclable after treatment process are transmitted to recycling centers. According to Samanlioglu (2013), the average amount of recyclable waste residues after treatment process for chemical technology are 30% and for incineration technology are zero. It should be considered that the volume reduction for chemical treatment facilities are 20% and for incineration treatment facilities are 80%. Transportation cost per unit of waste between each pair of nodes is specified and the location of these undesirable sites should be chosen from potential locations. Also, the capacity of each site is limited and are different from each other. Waste residues which are the result of recycling process are transmitted to disposal centers. A percentage of these waste residues are considered as a 5% of input wastes to recycling centers.

As shown in Fig. 1, the problem of this paper can be seen as an extension to the TLRP which has composed from three stages. At the first stage, wastes are collected from generation nodes and these wastes are transmitted to compatible treatment centers. Finally, the vehicles should return to the depots. The second stage includes transportation of waste residues from treatment facilities to recycling and disposal centers. Eventually, the last stage is related to the transportation of waste residues from recycling centers to disposal centers.

Fig. 1
figure 1

Waste collection network

This problem includes three objective functions. The first one is economical cost which comprises routes cost, fuel consumption cost, CO2 emission cost, sites establishing cost, and transportation costs. The second objective calculates transportation risk in which the population suffered from wastes along routes are minimized. Site risk objective function (i.e., the third objective function) minimizes the total population who lives around undesirable facilities in the presented collection network. The schematic figure for the problem is depicted in Fig. 2 in which opened facilities are shown with solid shapes.

Fig. 2
figure 2

The schematic figure for waste collection problem

2.1 Fuel consumption rate

Xiao et al. (2012) introduced a factor fuel consumption rate (FCR) which is a function of vehicle’s load and vehicle’s weight. In their method for evaluating FCR, the statistical data published by Japan Government (http://www.mlit.go.jp/common/000037099.pdf/) is used. Analysis performed by Xiao et al. (2012) showed that linear regression can be applied with high R-Squared level to represent the relationship between fuel consumption rate and gross weight of vehicles. Given this assumption, the FCR can be calculated by:

$$\uprho\left( {F_{1} } \right) = a\left( {F_{0} + F_{1} } \right) + b$$
(1)

where \(F_{1}\) represents the weight of vehicle’s load, \(F_{0}\) is the weight of empty vehicle, and \(\uprho\left( {F_{1} } \right)\) is the fuel consumption of vehicle with the load weight of \(F_{1}\) per unit of distance. If the vehicle is full, the formulation is transformed to:

$$\rho^{*} = a\left( {F_{0} + Q} \right) + b$$
(2)

where \(\rho^{*}\) shows the FCR a vehicle when the vehicle is full. The capacity of the vehicle is notated by Q. If the vehicle is empty, the FCR calculated as follow:

$$\rho_{0} = a\left( {F_{0} + 0} \right) + b$$
(3)

The FCR in no load condition is represented by \(\rho_{0}\).With respect to the Eqs. (13) the Eq. (1) can be rewritten as follows:

$$\uprho\left( {F_{1} } \right) = \rho_{0} + \frac{{\rho^{*} - \rho_{0} }}{Q}F_{1}$$
(4)

According to Eq. (4), fuel consumption rate can be evaluated for each amount of vehicle load by knowing fuel consumption rate at the conditions where the vehicle is full of load or the vehicle is empty. The website of “goodyear” company (http://www.goodyear.com/truck/pdf/commercialtiresystems/FuelEcon.pdf/) is used for estimating fuel consumption rate in full or empty modes of vehicles.

The load of vehicles are different in arcs between nodes, so FCR for each arc should be evaluated separately with respect to the load transmitted in each arc. For calculation of fuel consumption in each arc the Eq. (5) is applied.

$$C_{fuel}^{ij} = f\rho_{ij} dis_{ij}$$
(5)

where \(C_{fuel}^{ij}\) denotes fuel consumption cost in arc (i, j), f is unit cost of fuel, and \(dis_{ij}\) denotes the distance between node i and node j.

2.2 CO2 emission estimation

The cost imposed by carbon dioxide to environment can be estimated by knowing the amount of fuel which is consumed by vehicles. Fuel consumption is one of factors associated with CO2 emission rate and some of researches have used this factor to estimate the amount of CO2 emissions (Bektaş and Laporte 2011). The cost of CO2 is described per gram of it.

Estimation of this cost is very hard task, but some researches have tried to estimate the social cost of CO2 emission (Tseng and Hung 2014). For example, Forkenbrock (2001) estimated that this cost is between 10$ and 20$. Tol (2005) compiled 133 distinguish cost estimation for CO2 emission from 28 papers. He developed a density function to estimate this cost. This probability is dependent on different situations, but an important deduction was that the mean of this cost is almost $93/t of CO2. The Department of Environment, Food and Rural Affairs of United Kingdom (DEFRA) applied a different method named the shadow price of CO2 and suggested to set it at £27/t of CO2 for year 2010, and this estimation increases by 2% for each successive year. According to this fact we can set this cost at $46/t for year 2015. Also, the amount of CO2 emitted per liter of gasoline is considered as 2.3 kg (Bektaş and Laporte 2011).

2.3 Mathematical formulation

Assumption:

  • All facilities have limited capacity.

  • Vehicles are multi-compartment; that is, they have different parts for each type of waste.

  • The vehicles and facilities capacity is set in the manner that the problem has a feasible solution.

  • The wastes of each generation node must be collected by only one vehicle.

  • Three types of waste and two types of treatment technology are considered.

  • Parameters are deterministic.

Sets and indices:

D = {1, 2, 3,…,d}:

Set of potential locations for depots

N = {1, 2, 3,…,n}:

Set of generation nodes

T = {1, 2, 3,…,t}:

Set of potential locations for treatment facilities

R = {1, 2, 3,…,r}:

Set of potential locations for recycling centers

P = {1, 2, 3,…,p}:

Set of potential locations for disposal centers

W = {1, 2, 3,…,w}:

Set of waste types

Q = {1, 2, 3,…,q}:

Set of technologies in treatment facilities

K = {1, 2, 3,…,k}:

Set of vehicles

I = {1, 2,…,i}:

Set of all nodes

d:

Index for depots

n, n1:

Index for generation nodes

t:

Index for treatment centers

r:

Index for recycling center

p:

Index for disposal centers

w:

Index for waste type

q:

Index for technology

k:

Index for vehicles

i, j:

Index for all nodes

Parameters:

\(O_{d}\):

Fixed cost of establishing a depot at node d

\(ot_{tq}\):

Fixed cost of establishing a treatment facility with technology q at node t

\(or_{r}\):

Fixed cost of establishing a recycling facility at node r

\(od_{p}\):

Fixed cost of establishing a disposal center at node p

\(cd_{d}\):

Capacity of depot d

\(ct_{tq}\):

Capacity of treatment facility t with technology q

\(cr_{r}\):

Capacity of recycling facility r

\(cp_{p}\):

Capacity of disposal center p

\(cv_{wk}\):

Capacity of vehicle k for waste type w

f:

One unit of fuel’s cost

e:

Cost of CO2 emission per consumption of one liter fuel

\(T_{k}\):

Maximum allowable route time for vehicle k

\(ti_{ij}\):

Traveling time between node i and node j

\(dis_{ij}\):

Distance between node i and node j

\(popt_{ij}\):

Number of people along the arc between node i and j

\(pops_{i}\):

Population size around the facility i

\(gen_{nw}\):

Amount of generated wastes type w at node n

\(fv_{k}\):

Fixed cost of using a vehicle k

\(fd_{k}\):

Wage of vehicle k driver per unit of time

\(com_{wq}\):

1 if waste type w is compatible with technology q; 0 otherwise

\(\beta_{wq}\):

Recycle percent of hazardous waste type w treated with technology q

\(re_{wq}\):

Mass reduction of waste type w treated with technology q

\(tr_{tr}\):

Transportation cost of one unit waste residues between treatment facility t and recycling center r

\(td_{td}\):

Transportation cost of one unit waste residues between treatment facility t and disposal center p

\(rd_{rp}\):

Transportation cost of one unit waste residues between recycling facility r and disposal center p

\(\gamma_{r}\):

Recycle percent of waste residue recycled in recycling center r

\(\rho_{0k}\):

Fuel consumption rate for empty vehicle k

\(\rho_{k}^{*}\):

Fuel consumption rate for full loaded vehicle k

\(Q_{k}\):

Maximum allowable load for vehicle k

Decision variables:

\(y_{d}\):

1 if a depot is opened in potential location d; 0 otherwise

\(b_{tq}\):

1 if a treatment facility with technology q is established in potential location t; 0 otherwise

\(z_{r}\):

1 if a recycling center is opened in potential location r; 0 otherwise

\(g_{p}\):

1 if a disposal center is opened in potential location p; 0 otherwise

\(ki_{tr}\):

Amount of waste residues transmitted between treatment facility t and recycling center r

\(a_{tp}\):

Amount of waste residues transmitted between treatment facility t and disposal center p

\(v_{rp}\):

Amount of waste residues transmitted between recycling center r and disposal center p

\(\theta_{tqwk}\):

Amount of waste type w unloaded at treatment facility t with technology q by means of vehicle k

\(\omega_{ijk}\):

Amount of waste type w transmitted between node i and j

\(U_{iwk}\):

Auxiliary variable that represents the amount of waste type w in vehicle k just after leaving node \(i \in N\) or the amount of unloaded waste type w by vehicle k just after leaving node \(i \in T\)

\(S_{nd}\):

1 if generation node n is assigned to depot d; 0 otherwise

\(e_{ik}\):

Traveling time of vehicle k just after leaving node i

\(x_{0nk}^{d}\):

1 if generation node n is the first node in the route of vehicle k starting from depot d; 0 otherwise

\(x_{ijk}^{d}\):

1 if generation node j is visited just after node i in the route of vehicle k starting from depot d; 0 otherwise

\(x_{n0k}^{d}\):

1 if generation node n is the last node in the route of vehicle k starting from depot d; 0 otherwise

\(at_{wqt}\):

Amount of wastes type w treated in treatment facility t with technology q

\(ar_{r}\):

Amount of waste residues recycled in recycling center r

\(ad_{p}\):

Amount of waste residues disposed in disposal center p

Mathematical model:

$$\begin{aligned} {\text{Min}}\;Z1 & = \mathop \sum \limits_{d \in D} O_{d} y_{d} + \mathop \sum \limits_{k \in K} fd_{k} \mathop \sum \limits_{d \in D} \mathop \sum \limits_{n \in N} ti_{dn} x_{0nk}^{d} \\ & \quad \quad + \,\mathop \sum \limits_{k \in K} fd_{k} \mathop \sum \limits_{t \in T} \mathop \sum \limits_{d \in D} ti_{td} x_{t0k}^{d} + \mathop \sum \limits_{k \in K} fd_{k} \mathop \sum \limits_{d \in D} \mathop \sum \limits_{i \in N \cup T} \mathop \sum \limits_{j \in N \cup T} ti_{ij} x_{ijk}^{d} \\ & \quad \quad + \,\sum\limits_{q \in Q} {\mathop \sum \limits_{t \in T} ot_{tq} b_{tq} + \mathop \sum \limits_{r \in R} or_{r} z_{r} } \\ & \quad \quad + \,\mathop \sum \limits_{p \in P} od_{p} g_{p} + \sum\limits_{t \in T} {\mathop \sum \limits_{r \in R} tr_{tr} ki_{tr} } \\ & \quad \quad + \,\sum\limits_{t \in T} {\mathop \sum \limits_{p \in P} td_{tp} a_{tp} + \mathop \sum \limits_{r \in R} \mathop \sum \limits_{p \in P} rd_{rp} v_{rp} + \mathop \sum \limits_{d \in D} \mathop \sum \limits_{n \in N} \mathop \sum \limits_{k \in K} fv_{k} x_{0nk}^{d} } \\ & \quad \quad + \,\mathop \sum \limits_{k \in K} \mathop \sum \limits_{d \in D} \mathop \sum \limits_{i \in N \cup T} \mathop \sum \limits_{j \in N \cup T} \left( {f + e} \right)dis_{ij} \left( {\rho_{0k} x_{ijk}^{d} + \frac{{\rho_{k}^{*} - \rho_{0k} }}{{Q_{k} }}\omega_{ijk} } \right) \\ & \quad \quad + \,\mathop \sum \limits_{d \in D} \mathop \sum \limits_{n \in N} \mathop \sum \limits_{k \in K} \left( {f + e} \right)dis_{dn} x_{0nk}^{d} \rho_{0k} + \mathop \sum \limits_{k \in K} \mathop \sum \limits_{t \in T} \mathop \sum \limits_{d \in D} \left( {f + e} \right)dis_{td} x_{t0k}^{d} \rho_{0k} \\ \end{aligned}$$
(6)
$${\text{Min}}\;Z2 = \mathop \sum \limits_{d \in D} \mathop \sum \limits_{i \in N \cup T} \mathop \sum \limits_{j \in N \cup T} \mathop \sum \limits_{k \in K} popt_{ij} x_{ijk}^{d}$$
(7)
$$\begin{aligned} & Min\;Z3 = \mathop \sum \limits_{t \in T} \mathop \sum \limits_{q \in Q} pops_{t} b_{tq} + \mathop \sum \limits_{r \in R} pops_{r} z_{r} + \mathop \sum \limits_{p \in P} pops_{p} g_{p} \\ & {\text{s}} . {\text{t}} .\\ \end{aligned}$$
(8)
$$\mathop \sum \limits_{n \in N} \mathop \sum \limits_{k \in K} x_{0nk}^{d} \le cd_{d} y_{d} \quad \forall \;d \in D$$
(9)
$$\mathop \sum \limits_{n \in N} x_{0nk}^{d} = \mathop \sum \limits_{t \in T} x_{t0k}^{d} \quad \forall \;d \in D, \;k \in K$$
(10)
$$\mathop \sum \limits_{n \in N} \mathop \sum \limits_{d \in D} S_{nd} = 1$$
(11)
$$\mathop \sum \limits_{i \in N} \mathop \sum \limits_{k \in K} x_{ijk}^{d} + \mathop \sum \limits_{k \in K} x_{0nk}^{d} = S_{nd} \quad \forall \;j \in N, \;d \in D, \;j = n$$
(12)
$$\mathop \sum \limits_{j \in N \cup T} \mathop \sum \limits_{k \in K} x_{ijk}^{d} = S_{nd} \quad \forall \;i \in N, \;d \in D, \;i = n$$
(13)
$$\mathop \sum \limits_{k \in K} \mathop \sum \limits_{j \in N} \omega_{ijk} = 0\quad \forall \;i \in D$$
(14)
$$\mathop \sum \limits_{k \in K} \mathop \sum \limits_{j \in T \cup N} \omega_{ijk} - \mathop \sum \limits_{j \in N \cup D} \mathop \sum \limits_{k \in K} \omega_{jik} = \mathop \sum \limits_{w \in W} gen_{nw} \quad \forall \;i \in N, \;n = i$$
(15)
$$\mathop \sum \limits_{j \in N \cup T} \omega_{jik} - \mathop \sum \limits_{j \in T \cup D} \omega_{ijk} = \mathop \sum \limits_{w \in W} \mathop \sum \limits_{k \in K} \theta_{iqwk} \quad \forall \;t \in T, \;k \in K, \;i = t$$
(16)
$$\omega_{ijk} \le \left( {\mathop \sum \limits_{d \in D} x_{ijk}^{d} } \right)\left( {\mathop \sum \limits_{w \in W} cv_{wk} } \right)\quad \forall \;i,\;j \in N \cup T, \;k \in K$$
(17)
$$\omega_{ijk} = \mathop \sum \limits_{t \in T} \mathop \sum \limits_{q \in Q} \mathop \sum \limits_{w \in W} \theta_{tqwk} \quad \forall \;n \in N,\; t \in T, \;k \in K,\; i = n, \;j = t$$
(18)
$$\mathop \sum \limits_{w \in W} U_{iwk} = \mathop \sum \limits_{j \in N \cup T} \omega_{ijk} \quad \forall \;k \in K,\; i \in N$$
(19)
$$\mathop \sum \limits_{w \in W} U_{iwk} = \mathop \sum \limits_{j \in N \cup T} \omega_{jik} \quad \forall \;k \in K, \;i \in T$$
(20)
$$U_{iwk} - U_{jwk} + cv_{wk} \mathop \sum \limits_{d \in D} x_{ijk}^{d} \le cv_{wk} - gen_{nw} \quad \forall \;w \in W,\; i \in N,\;j \in N, \;k \in K, \;j = n$$
(21)
$$gen_{nw} \le U_{iwk} \le cv_{wk} \quad \forall \;i \in N, \;w \in W, \;k \in K, \;n = i$$
(22)
$$U_{jwk} - U_{iwk} + cv_{wk} \mathop \sum \limits_{d \in D} x_{ijk}^{d} \le cv_{wk} - \mathop \sum \limits_{q \in Q} \theta_{tqwk} \quad \forall \;i,\;j \in T,\;w \in W, \;k \in K, \;i = t$$
(23)
$$gen_{nw} \le U_{iwk} \le cv_{wk} - \left( {cv_{wk} + gen_{nw} } \right)\mathop \sum \limits_{d \in D} x_{0n}^{d} \quad \forall \;i \in N, \;w \in W, \;k \in K, \;i = n$$
(24)
$$e_{ik} - e_{jk} + \left( {T_{k} + ti_{ij} } \right)\mathop \sum \limits_{d \in D} x_{ijk}^{d} + \left( {T_{k} - ti_{ji} } \right)\mathop \sum \limits_{d \in D} x_{ji}^{d} \le T_{k} \quad \forall \;k \in K,\; j,\;i \in N \cup T \cup D$$
(25)
$$\mathop \sum \limits_{d \in D} ti_{ji} x_{0nk}^{d} \le e_{ik} \le T_{k} + \mathop \sum \limits_{d \in D} (ti_{ij} - T_{k} )x_{0n}^{d} \quad \forall \;k,\;\forall \;i \in N \cup T, \;i = n$$
(26)
$$e_{ik} \le T_{k} - \mathop \sum \limits_{d \in D} ti_{ij} x_{tok}^{d} \quad \forall \;i \in T, \;k \in K,\; i = t, \;j = d$$
(27)
$$\mathop \sum \limits_{i \in D \cup N} \mathop \sum \limits_{k \in K} \mathop \sum \limits_{d \in D} x_{ijk}^{d} = 1\quad \forall \;j \in N$$
(28)
$$\mathop \sum \limits_{j \in T \cup N} \mathop \sum \limits_{k \in K} \mathop \sum \limits_{d \in D} x_{ijk}^{d} = 1\quad \forall \;i \in N$$
(29)
$$at_{wqt} = \mathop \sum \limits_{k \in K} \theta_{tqwk} \quad \forall \;w \in W, \;q \in Q,\; t \in T$$
(30)
$$at_{wqt} \le ct_{tq} com_{wq} \quad \forall \;w \in W, \;q \in Q, \;t \in T$$
(31)
$$\mathop \sum \limits_{w \in W} at_{wqt} \le ct_{tq} b_{tq} \quad \forall \;q \in Q, \;t \in T$$
(32)
$$\mathop \sum \limits_{q \in Q} \mathop \sum \limits_{w \in W} at_{wqt} \left( {1 - r_{wq} } \right)\beta_{wq} = \mathop \sum \limits_{r \in R} ki_{tr} \quad \forall \;t \in T$$
(33)
$$\mathop \sum \limits_{t \in T} ki_{tr} = ar_{r} \quad \forall \;r \in R$$
(34)
$$ar_{r} \left( {1 - \gamma_{r} } \right) = \mathop \sum \limits_{p \in P} v_{rp} \quad \forall \;r \in R$$
(35)
$$\mathop \sum \limits_{t \in T} v_{tp} + \mathop \sum \limits_{r \in R} a_{rp} = ad_{p} \quad \forall \;p \in P$$
(36)
$$ar_{r} \le cr_{r} z_{r} \quad \forall \;r \in R$$
(37)
$$ad_{p} \le cp_{p} g_{p} \quad \forall \;p \in P$$
(38)
$$e_{ik} = 0\quad \forall \;i \in D, \;k \in K$$
(39)
$$\mathop \sum \limits_{i \in N \cup T} \mathop \sum \limits_{d \in D} x_{ijk}^{d} \le 1\quad \forall \;j \in T, \;k \in K$$
(40)
$$\mathop \sum \limits_{i \in N \cup T} \mathop \sum \limits_{d \in D} x_{ijk}^{d} = \mathop \sum \limits_{d \in D} \mathop \sum \limits_{s \in T} x_{jsk}^{d} + \mathop \sum \limits_{d \in D} x_{t0k}^{d} \quad \forall \;j \in T, \;k \in K,\;j = t$$
(41)
$$y_{d} ,b_{tq} ,z_{r} ,g_{p} , S_{nd} ,x_{0nk}^{d} ,x_{ijk}^{d} ,x_{t0k}^{d} = \left\{ {0,1} \right\}\quad \forall \;d \in D,\;t \in T,\;q \in Q,\;r \in R,\;p \in P,\;n \in N,\;k \in K, \;i,\;j \in D \cup N \cup T$$
(42)
$$ki_{tr} ,a_{tp} ,v_{rp} ,\theta_{tqwk} ,\omega_{iwk} ,e_{ik} ,at_{wqt} ,ar_{r} ,ad_{p} \ge 0\quad \forall \;t \in T,\;q \in Q,\;r \in R,\;p \in P,\;k \in K,\; i \in D \cup N \cup T,\;w \in W$$
(43)

The first objective function includes 14 terms. The first Term is related to depots opening cost. Wages of drivers are calculated in terms 2–4. Cost of establishing treatment facilities, recycling centers, and disposal centers are reflected by term 5–7, respectively. Terms 8 and 9 are related to transportation of waste residues from treatment facilities to recycling and disposal centers and term 10 evaluates transportation cost from recycling centers to disposal ones. Fixed cost of applying vehicles are addressed in term 11. The last three terms evaluate fuel and CO2 emission cost. Objective function two associates with transportation risk in which the number of peoples along the routes are minimized. Also, the third objective function minimizes the sites risk; that is, number of people which are around the undesirable sites is minimized.

Constraint (9) associates with the capacity of each depot, means that a number of vehicles which departs from an opened depot should not trespass from the capacity of the depot. Constraint (10) guarantees that all vehicles return to the depots. Constraint (11) ensures that each generation node is assigned to only one depot. Continuity in each route is guaranteed by constraints (12) and (13). Constraints (14)–(17) specify the load of each vehicle after leaving a node. Constraint (18) determines the relation between two variables and guarantees that all loads of each vehicle should be unloaded at treatment facilities. A relation between two variables in generation nodes and treatment facilities nodes are specified in constraints (19) and (20). Constraints (21)–(24) are the modification of the well-known sub tour elimination constraint called Miller–Tucker–Zemlin (MTZ) for the presented problem (Desrochers and Laporte 1991). These constraints also satisfy the vehicles capacity constraint. Traveling distance limitation is satisfied by means of constraints (25)–(27). Each generation node should be serviced by only one vehicle. This restriction is addressed in constraints (28) and (29). Constraint (30) evaluates amount of wastes treated in each treatment facility. Constraint (31) associates with compatibility limitation between waste type and technology applied in treatment facility. The next constraint is related to capacity of treatment facility. Constraint (33) specifies the amount of carried waste residues from each treatment facility to recycling center. The amount of recycled waste residues in each recycling center is determined by constraint (34). Constraint (35) calculates the carried waste residues from recycling center to disposal centers. The waste residues which are processed in each disposal center are evaluated in constraint (36). Two subsequent constraints consider the capacity of recycling and disposal centers. The length of each route in the depots’ place is set zero by constraint (39). Constraint (40) declares that each vehicle can enter to treatment facility at most once. Constraint (41) guarantees that each vehicle which enters to treatment facility should depart this facility. Constraints (42)–(43) specify the range of decision variables.

3 Methodology

Combination of metaheuristic algorithms can always enhance the performance of each individual algorithm to solve single objective problems. Generally, we can say that hybrid algorithms including other metaheuristics are capable to use advantageous of each constituent algorithm to provide broad space to search (Rabbani et al. 2016). These facts motivate us to introduce a new hybrid algorithm to tackle multi objective problems. The new hybrid multi-objective meta-heuristic is introduced in this section. This algorithm uses the attributes of genetic algorithm (GA) and cultural algorithm (CA) which are the algorithms inspired from biologic and social evolution respectively. This algorithm enhances its ability to reach efficient solutions by combination of methodologies used in GA and CA. This algorithm is population based, so the manner in which the solutions are generated is important. Solution representation is described in next subsection and the steps and operators of this algorithm are explained later. Also, four other well-known metaheuristic algorithms which are applied in this paper to verify the results of proposed metaheuristic algorithm are described briefly.

3.1 Multi-objective optimization

The single objective optimization problems aims to reach the optimum or near optimum value. In this problems comparison and sorting of the solutions existing in solution space is easy; because one objective is available and solutions are sorted regarding the objective function they have. For instance, in minimization problem if \(f\left( x \right) < f\left( y \right)\) we can say that solution x is better than solution y. Nevertheless, in multi objective problems finding a single solution which optimizes all objective functions is difficult and uncommon. In this type of problems, objective function is a vector, so multi-objective optimization is named multi criteria optimization or vector optimization. Hwang and Masud (2012) classified methods for solving multi-objective problems into three categories based on the role of decision maker in the process of solving the problem: The priori methods, the interactive methods and the posteriori or methods. In posteriori methods (the approach applied in this paper) the efficient solutions of the problem are produced and then the decision maker is asked to specify most preferred solution among them. The multi objective problems aim to find a feasible decision variables that satisfies constraints and optimizes the vector function whose elements represent the objective functions value. These elements of objective function are usually in conflict with each other. Therefore, the term of optimization in this type of optimization problems means that finding the solutions which can satisfy all constraints and give the values of all objective elements acceptable. We can define it mathematically as follows:

If \(X = [x_{1} , \ldots , x_{N} ]^{T}\) is the vector of decision variables, finding the vector such as \(X^{*} = \left[ {x_{1}^{*} , \ldots ,x_{N}^{*} } \right]^{T}\) which satisfies all constraints and optimizes the vector of objective function:

$$\begin{aligned} & F = \left[ {F_{1} \left( x \right), \ldots ,F_{m} \left( x \right)} \right]^{T} \\ & g_{i} \left( x \right) \ge 0,\quad i = 1,2, \ldots ,p \\ & h_{j} \left( x \right) = 0,\quad j = 1,2, \ldots ,q \\ \end{aligned}$$
(44)

So we want to reach the solutions which satisfy (44) and yield to the optimum values of all objective functions. But, as aforementioned, finding the solution which optimizes all objectives and satisfies all constraints is usually impossible; hence, we define the concept of efficient solutions. Let, a general multi objective maximization problem with n decision variable and m objectives (\(m > 1\)):

$$\begin{aligned} {\text{Maximize}}\;\;F = \left[ {F_{1} \left( x \right), \ldots ,F_{m} \left( x \right)} \right]^{T} \hfill \\ \quad {\text{where}}\;x \in R^{n} \;{\text{and}}\;f\left( X \right) \in R^{m} \hfill \\ \end{aligned}$$
(45)

We define that solution x dominates the solution y for maximization problem if:

$$\begin{aligned} & f_{i} \left( X \right) \ge f_{i} \left( Y \right),\quad \forall \,i \in \left\{ {1,2, \ldots ,m} \right\} \\ & f_{i} \left( X \right) > f_{i} \left( Y \right),\quad \exists \,i \in \left\{ {1,2, \ldots ,m} \right\} \\ \end{aligned}$$
(46)

If there is any solution that no solution dominates it, this solution is called non-dominated solution. The solutions that are not dominated by any other solution in the feasible space are called Pareto optimal. Their corresponding curve in the objective space are called Pareto frontier.

3.2 Solution representation

The approach in which the solutions are represented has a great effect in reaching to solutions with good quality in acceptable computational time. For this purpose, we use order based representation which has been applied for programming a combinatorial optimization problems (Martínez-Salazar et al. 2014). For coding solutions, five strings of real or integer numbers are defined that each of these strings specifies some attributes of the solution. These five strings determine assignments of generation nodes to depots and routes, order of treatment facilities, order of recycling centers, order of disposal centers, and technologies developed in treatment facilities respectively. So, the solution representation are composed of following strings:

  • Assignment string

  • Order of treatment facilities string

  • Order of recycling centers string

  • Order of disposal centers string

  • Technology string

The structure for each of strings can be different with respect to algorithm in which the solution representation is applied. The structure of last four strings are same for all algorithms and composed of the real numbers in the range of [0–1]. Unlike these four strings, the first string’s structure is different in proposed algorithms with respect to structure of each algorithm. This string includes real numbers in the range [0–1] for multi-objective particle swarm optimization (MOPSO), multi objective hybrid cultural and genetic algorithm (MOHCGA) and discrete integer numbers for non-dominated sorting genetic algorithm (NSGA-II), strength Pareto evolutionary algorithm (SPEA-II), multi-objective simulated annealing (MOSA).

The problem consists of several stages. The first string is applied for tackling first stage which is location routing problem (LRP). In this stage the assignment of generation nodes to depots, the routes between these nodes, and established depots are specified. First string includes \(n + d + k - 1\) integer numbers where n is number of generation nodes, d denotes the number of potential locations for depot, and k is the number of vehicles. Numbers between 1 and n associate with generation nodes, numbers between \(n + 1\) and \(n + k\) represent vehicles, and numbers between \(n + k + 1\) and \(n + d + k - 1\) are considered as a delimiter. The generation nodes and vehicles are initially assigned to depots. For this purpose, the locations of numbers associated with depots are marked in the first string. Numbers from first component in the string to the location of the first signed component are allocated to the first depot. Numbers from first signed component to the second one are assigned to the second depot and so forth. If there are not any numbers between two signed components in the string, this means that corresponding depot is not open. Figure 3 illustrates this manner more clearly. In this example, six generation nodes, four vehicles, four potential location for depots are considered. Numbers 11–13 are delimiters and the location of these numbers are marked. Numbers 1, 4, and 5 are assigned to the first depot. Since there is no number between 12 and 11, the second depot will be close. Numbers 3, 6, 7, 8, and 9 are assigned to the third depot and number 2 and 10 are assigned to the forth one. At this point the allocation of vehicles and nodes to depots are specified.

Fig. 3
figure 3

Example for assignments of generation nodes and vehicles to depots

The next step is routes formation in each depot. Numbers in each separated section until the location of component which is associated with vehicle are assigned to this vehicle. The numbers between first vehicle location and the second one are formed the second route and so on. If there is no number corresponding to one vehicle in assigned numbers to the specific depot (, i.e. all numbers are between 1 and n) all numbers are assigned to the one free vehicle.

It should be noted that in this solution representation procedure constraints which are related to capacity limitation and route length limitation are not considered. For this reason and simplifying the representation of solutions a virtual objective function called penalty function is considered. This function‘s value are proportional to the amount of exceeding from these constraints. When this virtual objective function gets value means that the solution is infeasible.

As an aforementioned, for the reason of continuous nature of operators in MOHCGA and MOPSO algorithms, we use the real numbers between zero and one for first string in these two algorithms and then we transform these numbers into discrete integer numbers. The mapping procedure from real numbers to integer ones are performed by the approach shown in Fig. 4. Some studies have used this strategy for applying metaheuristic algorithms and reached to acceptable results (Martínez-Salazar et al. 2014; Rabbani et al. 2017). One integer number is labeled for each real generated number. Then, this real numbers are sorted decently. By relocation of these numbers the attached labels are also relocated and the permutation of numbers are gained.

Fig. 4
figure 4

Example of transforming continuous numbers into permutation of integer numbers

Thus, the routes and established depots are specified. The next stage is related to unloading the wastes in compatible treatment facilities. Strings 2–5 are real continuous numbers between zero and one. Length of these strength are equal to t, r, p, and t, respectively. We explained that how these numbers can be transformed into the permutation of integer numbers. In each route after servicing to the last generation nodes the vehicle moves to the first treatment facility in the second string which has two necessary conditions: (1) Treatment facility has enough capacity. (2) The loads of vehicle should be compatible with technology of treatment facility. For specifying technology applied in each treatment facility, we multiply corresponding component in fifth string by three and then we round it up. The obtained value specifies the technology implemented in corresponding treatment facility. The vehicle unloads the wastes which are compatible with the treatment facility. Then, if the vehicle is empty and unloads all of its wastes, the vehicle is free and should come back to the depot; otherwise, the vehicle moves toward the second treatment facility in the second string which has necessary conditions until it depletes all of its load. In this way all wastes of vehicles are unloaded at appropriate treatment facility. It should be noted that the facilities which wastes have not been sent to it do not establish.

The next stage associates with solving a transportation problem between treatment facilities and recycling and disposal centers. Recycle percent and mass reduction percent is determined at each treatment facility. So, the amount of waste residues which should be transported from each treatment facility to recycling and disposal centers can be calculated. We start with the first opened treatment facility specified by second string and send its recyclable waste residues to the first recycling centers specified by third string. The waste residues are transported to first recycling center until it is filled. Then, the second recycling center is selected and waste residues are sent to it until its capacity is filled and so forth. For the transportation stage between treatment facilities and disposal centers and also between recycling and disposal center we perform like above-mentioned with respect to third and fourth strings. By performing all of these steps we can get a solution from five strings.

3.3 Proposed algorithm

Cultural algorithm (CA) which was introduced for the first time by Reynolds (1994) can be seen as an extension for genetic algorithm (GA). Cultural algorithm for single objective problems addresses interactions between individuals in the population, but these interactions are not taken into account by GA. Each solution in the population communicate with other solutions to generate a belief space (culture) and belief space records knowledge about the problem and utilizes this knowledge to make better generated solutions. Figure 5 depicts the mechanism of cultural algorithm. Experiences of individuals in the way of problem solving are selected regarding some criteria to produce knowledge about the problem and this knowledge impacts on the belief space. The belief space manipulates the knowledge to effect on the evolution and improvement of the population space.

Fig. 5
figure 5

Single objective CA template

Best et al. (2010) modified CA framework to handle multi-objective problems. The experimental results conducted in the paper showed that Multi-objective Cultural Algorithm (MOCA) can be applied independently or as a supplementary to other multi-objective algorithms. Another version of cultural algorithm which is used in the current paper is the hybrid version of CA and GA. In the algorithm, knowledge is used to record and transmit knowledge from one generation to the next one. Hybrid Cultural and Genetic Algorithm (HCGA) includes two main spaces: population space and belief space. The population space associates with an extended GA in which crossover and mutation operators are used to improve generations.

Belief space consists of several main components. In CA, knowledge sources includes but not limited to situational knowledge, topographical knowledge, normative knowledge, and historical knowledge. According to the attributes of GA operators in population space, we address only two main components of the algorithm; that is, normative knowledge and situational knowledge. The steps of proposed Multi-objective Hybrid Cultural and Genetic Algorithm (MOHCG) are described as follows:

  • Step 1 The algorithm is initialized by means of solutions represented according to the previous subsection.

  • Step 2 The solutions are evaluated with respect to their objective functions and a rank is assigned to each solution regarding the non-domination concept. Level one is the best level, 2 is the next best level, and so forth. For the solutions in the same rank, a crowding distance is calculated as a second criterion to sort solutions. By this criterion, the distribution of solutions goes toward a uniformly spread Pareto front at different iterations of the algorithm. Among two solutions at the same front, the solution located in a low density region is preferred.

  • Step 3 Specific Operators are used to adjust belief space. The first operator sets the situational knowledge and the second one upgrades the normative knowledge.

  • Step 4 Cultural space influence on population space. In other words, knowledge about normative solutions and situational solutions impacts on direction of individuals in the population. Population is upgraded with respect to belief space and new generated population is available to keep on the algorithm.

  • Step 5 Genetic operators including crossover and mutation operators are exerted to generate the offsprings from parents.

  • Step 6 The new generated solutions (offsprings) and the old population are merged with each other.

  • Step 7 The merged populations are sorted with respect to the rank of solutions and crowding distance.

  • Step 8 The solutions in the first rank are recorded as Pareto solution.

  • Step 9 The stop criterion in this algorithm is considered as maximum number of iterations. If the criterion is satisfied the algorithm is stopped; otherwise the steps 3–9 are repeated.

The main structure of proposed MOHCG algorithm is shown in Fig. 6.

Fig. 6
figure 6

Flowchart of proposed MOHCG algorithm

3.3.1 Acceptance criterion

It is common that in the society, people who are brilliant and have a specific feature can influence other people or transform the conditions or influence the culture and beliefs. CA which is inspired from this cultural evolution in societies specifies the members of population which are eligible to upgrade belief space. In single objective version of CA, the solutions are sortable regarding their objective functions value. The solutions which have better rank in are accepted to change the belief space. In the proposed algorithm, solutions are sorted with respect to their rank as the first criterion and crowding distance as the second criterion. Therefore, if two solutions have same rank, we refer to crowding distance to determine which one is better.

After sorting the solutions in population space, specific number of solutions is accepted to effect on belief space. In this algorithm, the number of solutions influencing on belief space is dynamic; that is, this number is reduced iteration by iteration. The number of effective members in population is calculated as follow:

$$n_{B} \left( t \right) = \left\lceil {\frac{n\gamma }{t}} \right\rceil$$
(47)

where \(n_{B} \left( t \right)\) is the number of effective members in iteration t, \(n\) is the population size and \(\gamma\) is the real number value between 0 and 1. In this equation, the number of effective members is decreased gradually.

3.3.2 Adjustment of belief space

Situational knowledge considers the records of the population and good solutions in each iteration are capable to transform the situational knowledge. The operation of this knowledge is similar to the concept of leader in particle swarm optimization (PSO) introduced by Kennedy and Eberhart (1997). The proposed algorithm selects one solution from Pareto solutions randomly and then the selected solution is compared with the solution which is the current situational knowledge. If the selected solution dominates representative solution for situational knowledge, the situational knowledge will be upgraded. If the current solution dominates the selected solution, we do not perform any thing. Finally, if two solutions do not dominate each other, on solution is selected randomly to be situational knowledge. These actions are summarized by following formulation in which SK(t) denotes situational knowledge at iteration t and CS represents selected solution.

$$SK(t + 1) = \left\{ {\begin{array}{*{20}l} {SK(t)} \hfill & \quad {{\text{SK(t)}} \prec {\text{CS}}} \hfill \\ {CS} \hfill & \quad {{\text{CS}} \prec {\text{SK(t)}}} \hfill \\ \end{array} } \right.$$
(48)

Normative knowledge provides a set of promising ranges for variables of solutions. Let \(x = \left( {x_{1} , x_{2} , \ldots ,x_{n} } \right)\) shows the decision variables of each solution. One range for each variable of solutions are specified by normative knowledge. Each variable’s range (\(I_{i}\)) has minimum and maximum value denoted by \(x_{i}^{min}\) and \(x_{i}^{max}\). These minimum and maximum values are taken from solutions and Corresponding objective functions of these solutions are used to specify lower bound of objective function (\(L_{i}\)) and upper bound of objective function (\(U_{i}\)). Therefore, each solution is shown by three factors as follows:

$$\begin{aligned} x_{i} & = \left( {I_{i} , L_{i} ,U_{i} } \right) \\ I_{i} & = \left( {x_{i}^{min} ,x_{i}^{max} } \right) \\ L_{i} & = \left( {L_{i}^{1} ,L_{i}^{2} , \ldots ,L_{i}^{m} } \right) \\ U_{i} & = \left( {U_{i}^{1} ,U_{i}^{2} , \ldots ,U_{i}^{m} } \right) \\ \end{aligned}$$
(49)

For changing the normative knowledge in each iteration, we use the following formulation:

$$x_{j}^{min} = \left\{ {\begin{array}{*{20}l} {x_{lj} \left( t \right),\quad \quad x_{lj} \left( t \right) < x_{j}^{min} \left( t \right)\; or} \hfill \\ { \quad \quad \quad \;\quad f\left( {x_{l} } \right) \prec \left( {L_{j}^{1} , \ldots ,L_{j}^{m} } \right)} \hfill \\ {x_{j}^{min} \left( t \right),\quad \,{\text{Otherwise}};} \hfill \\ \end{array} } \right.$$
(50)

where \(x_{lj} \left( t \right)\) is the j-th component of the selected solution from Pareto solutions. If the corresponding \(x_{j}^{min}\) value is updated, \(L_{j}^{i} ,\quad \forall i = 1,2, \ldots ,m\) will be updated, and new lower bounds replace with older ones. The similar actions are done on the \(x_{j}^{max}\) and \(U_{j}^{i} , \quad \forall i = 1,2, \ldots ,m\).

$$x_{j}^{max} = \left\{ {\begin{array}{*{20}l} {x_{lj} \left( t \right),\quad \quad x_{lj} \left( t \right) > x_{j}^{max} \left( t \right) or} \hfill \\ {\quad \quad \quad \quad \;f\left( {x_{l} } \right) \prec \left( {L_{j}^{1} , \ldots ,L_{j}^{m} } \right)} \hfill \\ {x_{j}^{max} \left( t \right),\quad \,{\text{Otherwise}};} \hfill \\ \end{array} } \right.$$
(51)

If the corresponding \(x_{j}^{max}\) values is updated, \(U_{j}^{i} , \quad \forall i = 1,2, \ldots ,m\) will be updated, and new upper bounds replace with older ones.

3.3.3 Population space change

After updating the situational and normative knowledge, these changes should influence on the population space and the solutions in the population space should conform themselves with the new belief space. For changing the population space, the solutions and their variables attempt to move into the normative ranges and become more close to the situational knowledge. In this proposed algorithm, four types of local searches to change a current solution and to search in neighborhoods are considered. In each iteration, one integer random number in the range of [1–4] is generated and one of these methods is selected randomly according to the generated number. These local searches are described as follows:

  1. 1.

    In this neighbor search method, only normative knowledge is used to specify the direction of searching in solution space. The following formulations are used to generate new solutions:

    $$\begin{aligned} x_{ij}^{\prime } (t) & = x_{ij} (t) + \sigma_{j} (t) \times N(0,1) \\ \sigma_{j} (t) & = \left[ {x_{j}^{\hbox{max} } (t) - x_{j}^{\hbox{min} } (t)} \right] \times \alpha \quad 0\le \alpha \le 1 \\ \end{aligned}$$
    (52)
  2. 2.

    In the second local search, we apply just situational knowledge for upgrading the solutions. The search aims to approach the solutions to the components of situational knowledge.

    $$x_{ij}^{\prime } \left( t \right) = \left\{ {\begin{array}{*{20}l} {x_{ij} \left( t \right) + \sigma_{ij} \left| {N\left( {0,1} \right)} \right|,} \hfill & \quad {x_{ij} \left( t \right) < y_{j} \left( t \right)} \hfill \\ {x_{ij} \left( t \right) - \sigma_{ij} \left| {N\left( {0,1} \right)} \right|,} \hfill & \quad {x_{ij} \left( t \right) > y_{j} \left( t \right)} \hfill \\ {x_{ij} \left( t \right) + \sigma_{ij} N\left( {0,1} \right),} \hfill & \quad {x_{ij} \left( t \right) = y_{j} \left( t \right)} \hfill \\ \end{array} } \right.$$
    (53)
    $$\sigma_{ij} \left( t \right) = \alpha \times \left[ {x_{ij} \left( t \right) - y_{j} \left( t \right)} \right]\quad 0 \le \alpha \le 1$$
    (54)

    where \(y_{j} \left( t \right)\) is the j-th component of situational knowledge in iteration t.

  3. 3.

    In the third type of local search, situational knowledge is used to specify the direction of search and normative knowledge for the length of this movement. The formulation is the same with the above formulations, but for specifying the length of movement following formulation is used:

    $$\sigma_{ij} (t) = \alpha \times \left[ {x_{j}^{\hbox{max} } - x_{j}^{\hbox{min} } } \right]$$
    (55)
  4. 4.

    In the fourth type of local search, we use only normative knowledge, but not as the same way in the first method. \(\beta\) denotes one random parameters between 0 and 1. The schematic Figure for this method is shown in Fig. 7.

    Fig. 7
    figure 7

    Schematic Figure of the fourth type of local search

    $$x_{ij}^{\prime } \left( t \right) = \left\{ {\begin{array}{*{20}l} {x_{ij} \left( t \right) + \sigma_{j} \left| {N\left( {0,1} \right)} \right|,} \hfill & \quad {x_{ij} \left( t \right) < x_{j}^{min} \left( t \right)} \hfill \\ {x_{ij} \left( t \right) - \sigma_{ij} \left| {N\left( {0,1} \right)} \right|,} \hfill & \quad {x_{ij} \left( t \right) > x_{j}^{max} \left( t \right)} \hfill \\ {x_{ij} \left( t \right) +\upbeta\sigma_{ij} N\left( {0,1} \right),} \hfill & \quad {Otherwise;} \hfill \\ \end{array} } \right.$$
    (56)

3.4 Crossover and mutation operators

The efficiency of the algorithms applying these two operators are dependent upon the performance of these two operators. These operators help us to search extensively in the space of solutions. There are various types of typical mutation and crossover operators in the literature and we use some of them in this paper according to the structure of strings defined here. If the string is composed of discrete numbers the following steps are performed. Initially two parents are selected using roulette wheel procedure which gives more chance to the better solutions. The order crossover (OX) is selected in this paper to apply on discrete based strings (Oliver et al. 1987). OX crossover choses two components in the string randomly. The components between two selected components in the first parent are inherited to the first offspring. Then, these components are omitted from the second parent and the rest of components are transmitted to the first offspring in the order of appearance in the second parent. Similarly, these steps are performed for generating the second offspring. Figure 8 illustrates the methodology of OX crossover.

Fig. 8
figure 8

Illustrative example for OX crossover

For mutation operator and creation of neighborhoods for current solution in permutation based strings, three methods including swap, insertion and reversion are selected (Rabbani et al. 2016). Swap operator selects two components and swaps their locations in the string. Reversion operator selects two components in the string and reverses the components between these two selected components. Insertion operators also selects two components and inserts the first selected component after the second one.

In continuous based strings, the manner is completely different and the following formulation is applied to generate offsprings. \(y_{i}^{1}\) and \(y_{i}^{2}\) denote i-th component of first and second offsprings respectively. Also, \(x_{i}^{1}\) and \(x_{i}^{1}\) represent i-th component of first and second parents.

$$\left\{ {\begin{array}{*{20}l} {y_{i}^{1} = \alpha x_{i}^{1} + \left( {1 - \alpha } \right)x_{i}^{2} } \hfill \\ {y_{i}^{2} = \alpha x_{i}^{2} + \left( {1 - \alpha } \right)x_{i}^{1} } \hfill \\ \end{array} } \right.\quad 0 < \alpha < 1$$
(57)

4 Experimental results

Some experiments are conducted in this section and the results obtained from proposed algorithm are compared to four well-known multi-objective metaheuristic algorithms including NSGA-II (Deb et al. 2000), SPEA-II (Laumanns 2001), MOSA (Nam and Park 2000), and MOPSO (Coello and Lechuga 2002). The investigation of these algorithms are not the goal of this paper, so the interested readers can refer to (Coello et al. 2002) which is one of the most complete books on multi-objective evolutionary algorithms. At first the parameters of algorithms are set by means of response surface methodology (RSM), then the comparison metrics will be introduced and finally the numerical results are provided.

4.1 Parameter tuning

Parameters of metaheuristic algorithms have important role in the performance of these algorithms. So, as above-mentioned the RSM method is used to set the parameters. RSM method is applicable in tuning of effective parameters in performance of processes. In this method, regression analysis is used to analyze different levels of parameters. Two common design are beneficial in fitting an appropriate regression model including central composite design (CSD) and Box-Behnken design (BBD). The difference of these two designs are related to the range in which the parameters are defined. The central composite design is used in this paper to tune parameters. Initially, the parameters which are effective on the performance of each algorithm are specified. For each parameter two levels are considered. Lowest level of parameter associates with the − 1 and highest level of parameter associates with + 1. The manner in which the level of parameters are formulated for CSD method is shown as follows:

$$X_{i} = \frac{{r_{i} - \left( {\frac{h + l}{2}} \right)}}{{\left( {\frac{h - l}{2}} \right)}}$$
(58)

where h and i are the highest and lowest level of corresponding parameters, respectively. \(r_{i}\) denotes the value of parameter i and \(X_{i}\) specifies the coded value of this parameter (i.e., all parameters are transformed into the real numbers between zero and one). Expert Design software is used to analyze the outputs of algorithms with respect to designed experiments. Figure 9 shows the output of the software for MOHCGA. The results are summarized as follows:

Fig. 9
figure 9

Outputs of software for parameters tuning of MOHCGA

  • MOHCGA parameters:

    • Maximum number of iteration is 150.

    • Population size is 200.

    • Crossover rate and mutation rate are 0.3 and 0.22 respectively.

    • Acceptance rate is 0.697.

    • \(\alpha\) and \(\beta\) are considered equal to 0.4 and 0.716 respectively.

    • Roulette wheel coefficient is 0.0243.

4.2 Comparison metric

Since the comparison between multi-objective algorithms are different with single objective problems, some comparison metrics are introduced and the obtained results are compared with each other with respect these metrics.

4.2.1 Number of Pareto solutions (NPS)

This metric evaluates number of Pareto solutions obtained by means of each algorithm. The great number for this metrics is desirable for us.

4.2.2 Spacing metric (SM)

This metric shows the uniformity of the gained Pareto solutions in phenotype space. The algorithm which has lowest value of spacing metric has best performance. The following formulation is used to calculate spacing metric:

$$SM = \sqrt {\frac{1}{N - 1} \times \sum\limits_{i = 1}^{n} {(d_{i} - \bar{d})^{2} } }$$
(59)

where \(d_{i}\) is the Euclidean distance between solution i and the nearest solution belonged to Pareto sets of solutions. \(\bar{d}\) is the average value of all \(d_{i}\). Also, N denotes the number of Pareto solutions.

4.2.3 Mean ideal distance (MID)

This metric specifies the distance between the Pareto solutions and ideal solution. According to the NP-hard nature of proposed problem, each component of ideal point is the best one found by all metaheuristic algorithms. This metric can be calculated by:

$$MID = \frac{{\mathop \sum \nolimits_{i = 1}^{n} \mathop \sum \nolimits_{j = 1}^{4} \sqrt {\left( {\frac{{f_{ij} - f_{j}^{best} }}{{f_{j}^{max} - f_{j}^{min} }}} \right)^{2} } }}{n}$$
(60)

n represents the number of Pareto solutions, \(f_{j}^{max}\) and \(f_{j}^{min}\) denote the maximum and mi minimum value for the j-th objective function obtained by all metaheuristics, and \(f_{ij}\) is the j-th objective function of i-th solution in Pareto front. The lower value of this metric is better value.

4.2.4 Coverage metric (CM)

The quality of solutions obtained by each algorithm is evaluated by means of this metric. If \(x^{\prime}\) and \(x^{\prime\prime}\) are the two sets of Pareto solutions obtained by two different algorithm, the CM function evaluates the quality of these two sets and the output of this function is the number in the range [0–1].

$$CM\left( {x^{\prime},x^{\prime\prime}} \right) = \frac{{\left| {\left\{ {a^{\prime\prime} \in x^{\prime\prime}; \;\exists \, a^{\prime} \in x^{\prime}: a^{\prime}\,{ \preccurlyeq }\,a^{\prime\prime}} \right\}} \right|}}{{\left| {x^{\prime\prime}} \right|}}$$
(61)

4.2.5 K-th distance metric

In this metric, distance between points and their k-th neighbor are considered. In this paper the k is set three. Higher value for this metric is desirable.

4.3 Numerical results

Some instances with different sizes are conducted in this section. These instances are divided into two sets. The first set is related to small scaled problem and the second one is large scaled problems. The scale of each problem is specified by attributes of the problem. Characteristics of each problem are shown by d#n#t#r#p#k which represent number of potential locations for depots, generation nodes, potential locations for treatment facility, potential location for recycling centers, potential location for disposal centers, and types of vehicles, respectively. Tables 1, 2 show the attribute of each problem in small and large scales. The generated problems can be downloaded at the below link:

Table 1 Characteristics of small size problems
Table 2 Characteristics of large size problems

https://www.dropbox.com/sh/7yov1ars5bkq3n0/AACgSyjWX-LvQOGnTQILiwp6a?dl=0.

Since components and the investigated problem in the current study is different from the studies in the literature, we cannot find similar data set, so we generate a new data set to perform experiments to evaluate performance of presented algorithms. According to the above link, this data set is generated randomly and all values for the parameters in different sizes can be found in this link.

Initially, the NPS and computational time of algorithms are investigated. In each metric student’s t-test is applied to specify the best algorithm. Unfamiliar readers with this test can refer to Haynes (2013). In some cases, for hypothesis test for the difference between two population means, we deal with the data collected from two populations which have correlation between them. Student’s t-test is appropriate statistical test in this condition. Tables 3, 4 and Fig. 10 summarizes the performance of all algorithms in the NPS and computational metrics. According to the obtained results, we conclude that the SPEA-II have better performance in NPS metric and NSGA-II outperforms other algorithms in computational time. Table 5, 6 perform one sided t-test to evaluate the accuracy of this hypothesis.

Table 3 Results for NPS and CPU time in small size problems
Table 4 Results for NPS and CPU time in large size problems
Fig. 10
figure 10

Comparison between computational times of all algorithms for all problems

Table 5 Pairwise comparison between SPEA-II and other algorithms in NPS metric
Table 6 Pairwise comparison between NSGA-II and other algorithms in CPU time

It can be concluded that our hypothesis about SPEA-II and NSGA-II are correct. Now, we investigate the SM and the obtained results are summarized in Tables 7, 8. We conclude that SPEA-II achieve better solution in SM with respect to obtained results. The Table 9 shows that our assertion can be true with 95% confidence.

Table 7 Results of SM in small size problems
Table 8 Results of SM in large size problems
Table 9 Pairwise comparison between SPEA-II and other algorithms in SM

The results of algorithm are compared with each other in Tables 10, 11, 12, 13 with respect to MID and k-th distance metric. This results show that MOHCGA outperform other algorithms in MID; that is, the solutions obtained by this algorithm has less distance from ideal point. In k-th distance metric also MOHCGA has better results that other algorithms. Therefore, it can be concluded that MOHCGA generates widely speared solutions in phenotype space.

Table 10 Comparison between algorithms with respect to MID and k-th distance metric in small size problems
Table 11 Comparison between algorithms with respect to MID and k-th distance metric in large size problems
Table 12 Pairwise comparison between MOHCGA and other algorithms in MID
Table 13 Pairwise comparison between MOHCGA and other algorithms in k-th distance metric

In the following, the comparison between algorithms with respect to CM is provided. The results are summarized in Tables 14, 15. These tables show that SPEA-II have better performance that other algorithms in CM. Figures 11, 12 are presented in order to give graphical perception about the performance of algorithms in small and large size problems.

Table 14 Average of pairwise comparison for CM in small size problems
Table 15 Average of pairwise comparison for CM in large size problems
Fig. 11
figure 11

Comparison between algorithms in small size problems

Fig. 12
figure 12

Comparison between algorithms in large size problems

5 Conclusion

In this work, green concepts and systematic collection of wastes in new presented network were considered, simultaneously. For designing a rational network, some location decisions in design phase of the network and waste collection decisions in operational phase were made in this paper. The problem included activities related to collection, treatment, recycling, and disposal of hazardous wastes in multi stage network. Fuel consumption and CO2 emission along the collection routes were investigated in this paper for the first time. On other hands, sustainability and reducing social effects of waste were emphasized in this paper. The network presented in this research composed of depots, generation nodes, treatment facilities, recycling centers, and disposal centers, so this makes the networks most complete network investigated in this field. According to general framework for designing a waste management network presented here, it can be applicable in different real collection networks. Also, some risks associated with facilities were considered in this paper. A new mathematical formulation with three objective functions was proposed in which each objective corresponds to one component of sustainability. Five multi-objective metaheuristic algorithms were applied to tackle the problem which one of these algorithms called MOHCGA was proposed in this paper. To the best of our knowledge, MOHCGA and MOSA were applied for the first time in the field of location routing problem. Some experiments were conducted to show the efficiency of proposed algorithm. Also, experiments attribute were provided in an Internet link and interested reads can refer to it. Since the comparison between multi-objective algorithms are different with single objective problems, some comparison metrics were introduced and the obtained results are compared with each other with respect these metrics. According to the obtained results, performance of MOHCGA was competitive with other algorithms performance, specifically in MID and k-th distance metric MOHCGA outperformed other algorithms; that is, MOHCGA can find solutions which are closer to the ideal solution. On the other hand, SPEA-II algorithm can generate more Pareto solutions which are distributed uniformly in Pareto front.

We suggest for the interested researcher in this field to consider the velocity of vehicles as an effective variable on fuel consumption. Also, time windows restriction can be considered in generation nodes or facilities’ locations. Using some methods such as epsilon constraint, goal programming, or goal attainment to achieve the exact Pareto frontier and comparison the results of approximate algorithms with these methods could be attractive. Since the model presented in this paper is a general framework for designing a waste management network, applying this model in a real case situation and investigating results are recommended for future researches.