1 Introduction

The rapid development of science and technology has accelerated the speed of product updates but also has caused a large number of discarded products such as computers, cell phones, LCDs, and vehicles. These end-of-life products contain a lot of resources such as metals, glass, and some toxic substances. It is essential to recycled these discarded products properly, not only to save resources but also to protect the environment. With the improvement of environmental protection awareness, there is an increasing interest in recycling, reuse, and remanufacturing technologies. And the recycling, reusing, remanufacturing of end-of-life products have attracted widespread concern [1]. Disassembly refers to a systematic process of extracting each part from a product for recycling and reuse and is a critical part of product recovery and sustainable manufacturing [2].

Disassembly can be performed in a disassembly cell, a single workstation, or a disassembly line. The disassembly line is a flow line designed to disassemble parts from a product automatically for subsequent recovery processes and remanufacturing, and the conventional disassembly line configuration is shown in Fig. 1a. It has many advantages compared with the first two disassembly modes. For example, it is the most suitable setting for disassembly of large products or small products in large quantities; the disassembly line offers the highest disassembly efficiency, and it is widely valued as the best way to automatically disassemble parts in remanufacturing especially in industrial mass production [3, 4].

Fig. 1
figure 1

The layout of two-sided disassembly

To improve the disassembly system and optimize the use of resources, the researchers proposed the disassembly line balancing problem (DLBP), that is, to reasonably assign disassembly tasks to each workstation, evenly distribute tasks, and achieve a balancing state while satisfying the constraint and precedence relationship among parts [5]. Recently, DLBP has attracted attention in industry and academy.

DLBP is a complicated combinatorial optimization problem. Different types of disassembly lines are different in the disassembly processing method, and the mathematical models of evaluation indicators are established based on it [6]. In previous studies, such as disassembly efficiency, the number of workstations, cost, hazard degree, demand index are frequently selected indicators [7, 8]. To reduce construction costs and improve disassembly efficiency, the number of workstations and the idle time are the preferred optimization objectives. Considering the meaning of DLBP aforementioned, it is necessary to take the smoothness of the workload distribution of each workstation into account, namely, the smoothness index [4]. In addition, it is vital to minimize the number of direction changes and tool changes (called auxiliary operation indicator) to simplify auxiliary operations. Therefore, the following four objectives are considered in this paper: (1) minimize the number of mated-workstations; (2) minimize the idle time; (3) minimize the smoothness index; (4) minimize the auxiliary indicator.

In order to address DLBP, many researchers developed some heuristic methods to get optimal or near-optimal results of DLBP. For instance, Ren et al. [9] used a 2-opt algorithm; Ilgin [10] constructed a DEMATEL-based heuristic. Some researchers have adopted mathematical programming methods in DLBP, such as a mixed-integer linear programming model [11,12,13]. DLBP is an NP-complete problem, so the amount of computation increases exponentially as the structure of a product becomes complex [14]. Although mathematical programming is suitable for solving simple examples of DLBP, due to its combinatorial nature, it will soon fail to solve large-sized products [15]. On the contrary, the meta-heuristic methods can get satisfactory results within a reasonable time. It has been widely used and has become the mainstream method in DLBP, such as particle swarm optimization (PSO) [16], ant colony optimization (ACO) [17, 18], genetic algorithm (GA) [13, 14, 19], gravitational search algorithm [20], discrete bee algorithm (DBA) [21], fruit-fly optimization algorithm (FOA) [22]. When dealing with various objectives, some researchers consider each objective lexicographically, i.e., optimize each objective one by one [14,15,16,17]. But this is essentially a single-objective perspective, which cannot fully consider each objective [23]. Besides, some studies integrated different objectives into one objective by weighting [13, 21, 24, 25]. Since the different objective units, it is difficult to remove the unit among different objective functions and set weight coefficients reasonably. Besides, scoring will bring subjectivity in the optimization process. Once the importance of each objective changes, the weights need to be changed and re-optimized. In addition, since the objectives may conflict with each other, it is hard for each objective to achieve the best value at the same time. On the contrary, the Pareto nondominated relationship can be used to measure each objective comprehensively and is widely used in multi-objective optimization, and it is used in this paper.

All work mentioned above has made great achievements in the conventional DLBP which is mainly based on the straight disassembly line. The conventional straight disassembly line follows a simple linear process mode where workers can only disassembly one part at a time, and only one side is utilized, and the majority of current DLBP researches are based on it [7]. But this layout is less productive. There are different types of disassembly lines. Except for the straight disassembly line, there are also line layouts such as two-sided disassembly lines. In a two-sided disassembly line, different disassembly tasks are performed on the same product in parallel at both sides of the line, and a pair of two facing workstations crossing the line is called a “mated-workstation” (see Fig. 1b). A two-sided production line has some advantages, such as higher disassembly efficiency, shorter disassembly line length, and lower construction and maintenance cost [26]. Similarly, the two-sided disassembly line needs to balance various resource utilization, and the DLBP also exists in the two-sided disassembly line, namely two-sided DLBP (TDLBP), regarded as an extension of conventional DLBP. However, due to the constraint relationship among parts, it is a bit difficult to coordinate asynchronous disassembly operations at both sides. This is the main challenge of two-sided disassembly scheme planning. Therefore, it is meaningful to research the TDLBP. In terms of optimization algorithm, Karaboga and Basturk [27] introduced an optimization algorithm based on bee colony behavior known as the artificial bee colony (ABC) algorithm. Due to the fewer control parameters, it is easier to implement and has an excellent optimization performance [28]. It has already been applied to conventional DLBP, and it stands out from other algorithms [15, 29]. In recent years, TDLBP has started to be studied. Wang et al. [30] first introduced the concept of TDLBP and used a multi-objective flower pollination algorithm for optimization. Recently, Kucukkoc [13] used an improved genetic algorithm for the optimization of TDLBP. To date, there are few studies on the TDLBP. This paper proposed a model including a novel encoding, decoding method, and an improved multi-objective artificial bee colony (IMOABC) algorithm to solve TDLBP.

Compared with the current DLBP researches, the novelty and main contributions of this paper can be summarized as follows.

  1. 1.

    This paper describes the product constraint and connect relationships based on the hybrid graph and defines the two matrices to digitize the constraint relationships for programming.

  2. 2.

    A two-sided line disassembly sequence model and a set of coding and decoding methods are proposed.

  3. 3.

    A multi-objective two-sided disassembly line model is established to optimize the number of mated-workstations, idle time, smoothness, and auxiliary processes, and the ABC algorithm is adapted and applied to the current model.

Compared with Kucukkoc’s work [13], we use a hybrid graph model instead of the AND/OR graph model. In the optimization process, we based on the Pareto non-dominated relationship instead of weighting, which can avoid the conflict of each objective. Once the demand changes, we can simply reselect a solution from the obtained Pareto set, whereas Kucukkoc’s method requires reconfiguring coefficients of the objective function and re-optimizing it.

The rest of the paper is as follows. In Sect. 2, a hybrid graph is used to present precedence and connection of product structure description, as well as a disassembly scheme planning method, including the disassembly sequence encoding and decoding method. The multi-objective optimization function and the IMOABC algorithm are represented in Sect. 3. The performance of the IMOABC and the practicality of the proposed model are verified in Sect. 4. Section 5 introduces conclusions and future works.

2 Precedence Model and Disassembly Scheme Generation

2.1 Constraints and Precedence Relationship Illustration

To illustrate the product structure and determine the order of disassembly tasks, some information models are used such as the directed graph, AND/OR graph, and Petri net [31, 32]. These graph models only represent the disassembly priority relationship among parts, but the actual connection relation that helps designers to understand complicated structures among parts cannot be illustrated [33]. The hybrid graph model can compensate for it and express direct and indirect precedence relation intuitively [24]. So, the hybrid model is proposed for product structure description. A bearing model and its hybrid graph are represented in Fig. 2.

Fig. 2
figure 2

The exploded view and hybrid graph of a certain sliding bearing

Then, the disassembly sequence can be generated by analyzing these information models, which is important for DLBP. To facilitate programming, a connection matrix Mc = [ci,j]n×n and a precedence matrix Mp = [pi,j]n×n are introduced to describe the connection and precedence relationship of part i and j of a product, and the meanings of ci,j and pi,j are shown in Eqs. (1) and (2).

$$c_{i,j} = \left\{ \begin{gathered} 1 \, if \, part \, i \, and \, j \, connect \, each \, other \hfill \\ 0 \, otherwise \hfill \\ \end{gathered} \right.,$$
(1)
$$p_{i,j} = \left\{ \begin{gathered} 1 \, if \, part \, i \, constrains \, part \, j \hfill \\ 0 \, otherwise \hfill \\ \end{gathered} \right..$$
(2)

The Mc and Mp of the bearing in Fig. 2 are as follows:

$${\varvec{M}}_{{\varvec{c}}} = \left[ {\begin{array}{*{20}l} 0 & 1 & 0 & 0 & 0 & 0 & 1 & 0 \\ 1 & 0 & 1 & 0 & 1 & 0 & 0 & 1 \\ 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 0 & 1 & 0 \\ 1 & 0 & 0 & 0 & 0 & 1 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 \\ \end{array} } \right],$$
$${\varvec{M}}_{{\varvec{p}}}\, { = }\,\left[ {\begin{array}{*{20}l} 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 1 & 0 & 0 & 0 & 0 & 1 & 1 & 0 \\ 1 & 1 & 0 & 0 & 0 & 1 & 1 & 0 \\ 1 & 0 & 0 & 0 & 0 & 1 & 1 & 0 \\ 1 & 0 & 0 & 1 & 0 & 1 & 1 & 0 \\ 1 & 0 & 0 & 0 & 0 & 0 & 1 & 0 \\ 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 1 & 1 & 0 & 0 & 0 & 1 & 1 & 0 \\ \end{array} } \right].$$

Similarly, the Mc and Mp of any product can be obtained by this method of analysis, which possesses generality.

Assumptions

The disassembly task assignment method and following multi-objective models are based on these assumptions for problem simplification [14, 34]: (a) The disassembly time of each part is known in advance; (b) Completely disassemble the product and disassemble a single product on the disassembly line; (c) The supply of end-of-life product is infinite; (d) Each task can only be assigned to one station; (e) The total disassembly time of tasks assigned to one side of mated-workstation shall not exceed cycle time CT; (f) All the parts in a product are available and recyclable, without deletions, additions, modifications, physical or functional defects.

2.2 Disassembly Sequence Generation of Two-sided Disassembly Line

In order to disassemble a part (such as part g) from a product, the following conditions (i.e. constraint condition) should be observed: part g is unrestricted by other parts and part g connects at most one part. As shown in Eq. (3), this constraint condition can be denoted by the elements of Mc and Mp:

$$\sum\limits_{i = 1}^{n} {c{}_{i,g}} \le 1,\sum\limits_{i = 1}^{n} {p{}_{i,g}} = 0.$$
(3)

The connection and constraint relationship of a product changed once a part is disassembled. If the part g is disassembled, the g-th row of Mp, and the g-th row and g-th column elements of Mc must be cleared, i.e. update Mc and Mp. Iterate this procedure, a disassembly sequence can be obtained, and it can be used in conventional DLBP. Due to the constraints and precedence among parts and the asynchronous process nature of the two-sided disassembly, this conventional disassembly sequence cannot contain enough information for disassembly scheme planning. So, a modified disassembly sequence is proposed, which is expressed as X = [X1,X2].

X1 = [p1,…,pn] is the same as the conventional disassembly sequence, where pi (pi ∈ P, P is the parts set) corresponds to a specific part or task. X2 = [s1,…,sn] is a sub-vector of X which indicates the disassembly side assignment result of each disassembly task. For example, if si = R (or L), it means pi will be disassembled at the right side (or left side) of the two-sided line. A feasible disassembly sequence generation method is described as follows:

Step 1: Construct the Mc and Mp, and two empty vectors X1 and X2;

Step 2: Based on Eq. (3), find all the parts that can be disassembled in this stage, and randomly select one;

Step 3: Insert the selected part into the corresponding empty position of X1;

Step 4: Update Mc and Mp; If all the parts have been disassembled, go to Step 5, else go to Step 2;

Step 5: Fill X2 up with character ‘R’ or ‘L’ randomly;

Step 6: End.

2.3 Decoding of Two-sided Disassembly Scheme

The decoding refers to plan a disassembly scheme based on the type of the disassembly line and disassembly sequence, i.e. assigning each disassembly task to a certain workstation and determining its start and end time. It is different from the conventional one-sided disassembly line, because the disassembly scheme is an asynchronous process of a two-sided disassembly line, and the constraint relationship must be considered for operation coordination, which complicates the calculation of work time ST. In the job-shop scheduling problem, a job is assigned to a specific workstation depending on its start and end time [35]. This idea is applied to TDLBP for disassembly scheme planning in this paper, and three basic equations are proposed for decoding as shown in Eqs. (4) to (6).

$$t_{e,i} = t_{s,i} + t_{d,j} ,j \in P,i = \left\{ \begin{gathered} R \, right \, side \hfill \\ L \, left \, side \hfill \\ \end{gathered} \right.,$$
(4)
$$t_{e} = \mathop {\max }\limits_{i = ^{\prime}L^{\prime},^{\prime}R^{\prime}} (t_{e,i} ),t_{e} \le CT,$$
(5)
$$ST_{i,j} = \sum\limits_{v = 1}^{c} {t_{d,v}^{i,j} ,i = \left\{ \begin{gathered} R \, right \, side \hfill \\ L \, left \, side \hfill \\ \end{gathered} \right.} ,$$
(6)

where, ts,i, and te,i respectively represent the start and end time of side i, and te is the end time judging whether to open a new workstation, CT is the cycle time (the maximum disassembly time).td,j is the disassembly time of part j. Equation (4) and (5) are used to determine whether the part can be disassembled in the current mated-workstation. Equation (6) is for computing the actual disassembly time of a mated-workstation, where c is the number of parts disassembled at side i of mated-workstation j, ti,j d,v is the part v’s disassembly time assigned to side i of mated-workstation j. STi,j is the sum disassembled time of side i of mated-workstation j. Note that i = R (right station) or L (left station). Then, the actual work time of a mated-workstation can be calculated, and the detailed decoding method is proposed in Fig. 3. Where Pm refers to the set of parts disassembled at mated-workstation m; pm,L and pm,R refer to the last parts disassembled at the left and right side of mated-workstation, respectively; m is the number of mated-workstations; k is the iteration counter; the symbol “ → ” refers to the constraint, e.g. if p1 constrains p2, it is denoted as “p1 → p2”. If p1 does not constrain p2, it is expressed as “”.

Fig. 3
figure 3

The flow chart of the decoding process

Take a feasible solution X of Fig. 2s , X = [X1 = [1,2,3,4,5,6,7,8],X2 = [R,L,L,R,R,L,R,L]], set cycle time CT = 9 s, and disassembly time of each part is known (see Fig. 2b). Figure 4a is the Gantt chart to illustrate this scheme, and the two numbers in each block represent the part number and disassembly time. Figure 4b shows the task assignment and arrangement of this scheme example.

Fig. 4
figure 4

The disassembly assignment scheme and the layout of the two-sided disassembly line

According to Eqs. (4) and (5) and the decoding method, the disassembly scheme can be planned. First, assign part 3 to the right side of mated-workstation 1. Since part 8 is not constrained by part 3, part 8 can be disassembled at the left side meanwhile. As part 3 and 8 both constrain part 2, part 2 can be dismantled after part 8 and 3 are removed (in the 4th second). The disassembly plans for part 5 and 4 are similar to the previous planning method. However, the mate-workstation 1 cannot have enough capacity for part 6’s disassembly, so a new mated-workstation will open. As part 6 constrains part 7, part 7 will be disassembled after part 6’s disassembly, i.e., in the 15th second. Since ts,2 + tR,2 d,6 > 2 × CT, part 7 must be disassembled at the right side of mated-workstation 3. No worker at this side because the right side of mated-workstation 2 is empty. According to the proposed decoding method, the tasks can be assigned to both sides considering constraints but the objective function is not determined, and the quality of the scheme cannot be measured.

3 Multi-objective Functions and Optimization Method

This section introduces a multi-objective model in Sect. 3.1, and Sect. 3.3 to 3.5 introduces an improved multi-objective artificial bee colony (IMOABC) algorithm for solving TDLBP. In the ABC algorithm, each food source is a feasible solution associated with an employed bee. Also, three kinds of bees are employed to search for food sources, namely, employed bee, onlooker bee, and scout bee. The employed bees search for new food sources (new solutions) and share information with onlooker bees. One onlooker bee evaluates the quality of food sources obtained by employed bees and select one for further exploration. If a food source is depleted, it will be abandoned and introduced into the scout bee, i.e., a new solution will be randomly generated.

3.1 Multi-objective Functions

According to the available disassembly scheme, the balancing state of a disassembly line can be evaluated by the objective functions, including the number of mated-workstations (f1), idle time (f2), auxiliary operation indicator (f3), smoothness index (f4). Converted these objectives into minimization objectives for optimization, as shown in Eq. (7).

$$\min F = [f_{1} ,f_{2} ,f_{3} ,f_{4} ].$$
(7)

(1) Number of mated-workstations. The number of workstations is an important indicator for evaluating the performance of the disassembly line. In a two-sided disassembly line, it is called “mated-workstation”. Fewer mated-workstations can reduce construction costs, workers, equipment, and space occupation. This indicator is expressed in Eq. (8).

$$f_{1} = m,$$
(8)

where m is the number of mated-workstation of a two-sided disassembly line, which is determined by the decoding process in Sect. 2.4.

(2) Idle time. In order to improve disassembly efficiency, the wasted time, i.e., idle time should be considered. The less the idle time, the higher the utilization rate of the disassembly system.

$$f_{2} = \sqrt {\sum\limits_{i = L}^{^{\prime}R^{\prime}} {\sum\limits_{j = 1}^{m} {(CT - ST_{i,j} )^{2} } } } \, i = \left\{ \begin{gathered} L \, Left \, side \hfill \\ R \, Right \, side \hfill \\ \end{gathered} \right.,$$
(9)

where STi,j is the disassembly time of mated-workstation j at side i, m is the number of mated-workstations.

(3) Auxiliary indicator. Direction and tool change indicator is developed to measure the disassembly scheme’s performance. Too many tool and direction changes will result in invalid operation and affect disassembly efficiency. The expression of the auxiliary indicator is shown in Eqs. (10) and (11).

$$f_{3} = \sum\limits_{i = 1}^{r - 1} {(\alpha (d_{i} ) + \beta (dt_{i} )) + \sum\limits_{i = 1}^{l - 1} {(\alpha (d_{i} ) + \beta (dt_{i} ))} } ,$$
(10)
$$\begin{gathered} \alpha (d_{i} ) = \left\{ \begin{gathered} 0 \, no \, direction \, change \hfill \\ 1 \, if \, 90^{ \circ } \, change \, is \, required \hfill \\ 2 \, if \, 180^{ \circ } \, change \, is \, required \hfill \\ \end{gathered} \right. \hfill \\ \beta (dt_{i} ) = \left\{ \begin{gathered} 0 \, no \, tool \, change \hfill \\ 1 \, if \, tool \, change \, is \, required \hfill \\ \end{gathered} \right., \hfill \\ \end{gathered}$$
(11)

where l is the number of tasks assigned to the left stations, and r is the number of tasks assigned to the right stations. di is the disassembly direction of part i, dti is the disassembly tool of part i. α and β are direction change operator and tool change operator respectively, and their meanings are shown in Eq. (11).

(4) Smoothness index. The smoothness index is used to measure the balance of workload distribution, which can improve disassembly efficiency and reduce the difference in workload assignment. Its prototype is proposed in [36] and modified to a two-sided line version based on the model proposed in this paper, as shown in Eq. (12).

$$f_{4} = - \ln (2 \cdot m)/\left( {\sum\limits_{j = 1}^{m} {\sum\limits_{i = ^{\prime}R^{\prime}}^{^{\prime}L^{\prime}} {\left( {\frac{{(CT - ST_{i,j} )^{2} }}{{\sum\limits_{i = 1}^{m} {\sum\limits_{j = ^{\prime}R^{\prime}}^{^{\prime}L^{\prime}} {(CT - ST_{i,j} )}^{2} } }} \cdot \ln \left( {\frac{{(CT - ST_{i,j} )^{2} }}{{\sum\limits_{i = 1}^{m} {\sum\limits_{j = ^{\prime}R^{\prime}}^{^{\prime}L^{\prime}} {(CT - ST_{i,j} )} ^{2}} }}} \right)} \right)} } } \right),\,i = \left\{ \begin{gathered} L \, Left \, side \hfill \\ R \, Right \, side \hfill \\ \end{gathered} \right..$$
(12)

(5) Constraints.

$$m_{\min } = \left\lceil {0.5 \times \left\lceil {\frac{{\sum\limits_{j = 1}^{n} {t_{d,j} } }}{CT}} \right\rceil } \right\rceil \le m \le n,$$
(13)
$$ST_{i,j} \le CT,i \in \{ 1,2\} ,j \in \{ 1,\ldots,m\} ,$$
(14)
$$\begin{gathered} \sqrt {\sum\limits_{i = ^{\prime}R^{\prime}}^{^{\prime}L^{\prime}} {\sum\limits_{j = 1}^{{m_{\min } }} {(CT - ST_{i,j} )^{2} } } } \hfill \\ \le f_{2} \le \sqrt {\sum\limits_{i = ^{\prime}R^{\prime}}^{^{\prime}L^{\prime}} {\sum\limits_{j = 1}^{{m_{\max } }} {(CT - ST_{i,j} )^{2} } } } , \hfill \\ j \in \{ 1,\ldots,m\} ,i = \left\{ \begin{gathered} L \, Left \, side \hfill \\ R \, Right \, side \hfill \\ \end{gathered} \right.. \hfill \\ \end{gathered}$$
(15)

Equation (13) is a constraint implying the maximum and the minimum number of mated-workstations. According to the assumptions in Sect. 2.2, Eq. (14) ensures the maximum workload of each workstation. Equation (15) describes the minimum and maximum idle time of a mated-workstation.

3.2 Multi-objective Optimizer and Hypervolume Indicator

The multi-objective optimization refers to finding a solution vector x in decision space S without violating predetermined constraints so that the objective function vector F(Z) = [f1(Z), f2(Z),…,fu(Z)] can reach the optimal solution. The objective vector and constraints are shown in Eq. (16).

$$\begin{gathered} \min F({\varvec{Z}}) = [f_{1} ({\varvec{Z}}),f_{2} ({\varvec{Z}}),\ldots,f_{u} ({\varvec{Z}})] \hfill \\ s.t\left\{ \begin{gathered} g_{i} ({\varvec{Z}}) \le 0,i = 1,2,\ldots,q \hfill \\ h_{j} ({\varvec{Z}}) = 0,j = 1,2,\ldots,r \hfill \\ {\varvec{Z}} \in S \hfill \\ \end{gathered} \right., \hfill \\ \end{gathered}$$
(16)

where, Z = [z1,…,zn] is a solution vector, S is the feasible space of Z, F is the multi-objective vector; u is the number of objective functions, q is the number of inequality constraints, and r is the number of equality constraints.

In multi-objective optimization, the Pareto dominance relationship is often used for solution comparison. For example, Z1 and Z2 are two solution vectors. If they satisfy Eqs. (17) and (18), Z1 dominates Z2, denoted as Z1Z2. By Pareto dominance relationship, all objectives can be considered, and each goal can be balanced in consideration of complicated conflicts.

$$\begin{gathered} f_{i} ({\varvec{Z}}_{1} ) \le f_{i} ({\varvec{Z}}_{2} ),\forall i \in \{ 1,\ldots,u\} , \hfill \\ \, {\varvec{Z}}_{1} \in S,{\varvec{Z}}_{2} \in S, \hfill \\ \end{gathered}$$
(17)
$$\begin{gathered} f_{i} ({\varvec{Z}}_{1} ) < f_{i} ({\varvec{Z}}_{2} ),\exists i \in \{ 1,\ldots,u\} , \hfill \\ \, {\varvec{Z}}_{1} \in S,{\varvec{Z}}_{2} \in S. \hfill \\ \end{gathered}$$
(18)

Deb et al. [37] combined GA with Pareto grade (PG) sorting and crowding distance (CD) in selection. A set of solutions can be sorted into different PGs by Eqs. (19) and (20). The CD of Zp can be calculated by Eq. (19), where k is the number of non-dominated solutions. The CD values of the non-dominated set boundary are denoted as CD(Z1) = CD(Zk) = ∞. if Z1 and Z2 satisfy Eq. (20) or (21), Z1 dominates Z2. This mechanism is the core of the tournament selection based on the Pareto relationship.

$$CD({\varvec{Z}}_{p} ) = \sum\limits_{i = 1}^{u} {\frac{{f_{i}^{p + 1} - f_{i}^{p - 1} }}{{f_{i}^{\max } - f_{i}^{\min } }}} ,p \in \{ 2,\ldots,k - 1\} ,$$
(19)
$$PG({\varvec{Z}}_{1} ) < PG({\varvec{Z}}_{2} ),$$
(20)
$$PG({\varvec{Z}}_{1} ) = PG({\varvec{Z}}_{2} ),CD({\varvec{Z}}_{1} ) > CD({\varvec{Z}}_{2} ).$$
(21)

But this mechanism cannot compare or evaluate the quality of Pareto non-dominated front. The hypervolume indicator is used to measure the quality of a non-dominated set, which measures how much of the objective space is dominated by an approximation set [38]. This indicator does not require information about the real Pareto-front, and it can be denoted as [39]:

$$HV(NDS,R_{0} ) = Leb\left(\bigcup\limits_{{Z_{i} \in NDS}} {[f_{1} ({\varvec{Z}}_{i} ),f_{1} (R_{0} )] \times } \cdots \times [f_{u} ({\varvec{Z}}_{i} ),f_{u} (R_{0} )]\right),R_{0} \in S,{\varvec{Z}}_{i} \prec R_{0} ,$$
(22)
$$f_{j} (R_{0} ) > f_{j} ({\varvec{Z}}_{i} ),\forall j \in \{ 1,\ldots,u\} ,{\varvec{Z}}_{i} \in NDS,$$
(23)

where HV(NDS, R0) is the size of the space covered by non-dominated set NDS, R0 is a reference point, Leb is Lebesgue measure, and Eq. (23) is the constraint of Eq. (22). Note that R0 must be the same when comparing the optimization performance of different algorithms. Otherwise, the HV is not comparable.

3.3 Employed Bee Operator

In ABC, each employed bee (a feasible solution) shares information, and Kalayci and Gupta [40] applied two neighborhood search operators to DLBP, i.e., swap and insert operator. Similarly, in their subsequent studies, they proposed some operators for neighborhood search, such as swap operator, insert operator, one point left operator, one point right operator [40]. We adopted it and applied it to the modified disassembly sequence. As shown in Fig. 5, a feasible solution of Fig. 2 is taken as an example to introduce the mechanism of these operators. To explore solution space, a local search loop mechanism combining these four operators is proposed as follows for information sharing, where the determination of dominated relationship is based on Eqs. (17) to (21).

Fig. 5
figure 5

The mechanism of these neighbor search operators

Step 1. Set the counter variable i = 0 and initialize local iteration limit tlim;

Step 2. Randomly select one of these four search operators above;

Step 3. Perform the selected operator on solution X and get new solution X’;

Step 4. If i ≤ tlim and X dominates X’, set i = (i + 1) and go to Step 3; else go to Step 4;

Step 5. Set X = X’;

Step 6. Place X and X’ in employed bee set and new employed bee set respectively. If XX’, place X’ in inferior set Q.

3.4 Onlooker Bee Operator

In traditional ABC, an onlooker bee selects a food source depending on the percent of nectar amount of each food source among the total nectar amount. Here, the tournament selection method is adopted, i.e., choose two solutions and compare their PG and CD values, and select a better one according to the method introduced in Sect. 3.2. Then, use the local search loop mechanism to diversify the population. The onlooker bee colonies before and after neighbor search are noted as onlook bees set and new onlooker bees set respectively. If a solution is dominated by the solution before its neighbor search, put this solution into inferior set Q.

3.5 Scout Bee Operator

In the neighborhood search, the new solution X’ may be dominated by the original solution X, and collect all the X’, namely inferior set Q. In this phrase, the two-point crossover of GA is adopted to explore solution space and perform a two-point crossover on the inferior set. Take two available food sources of the bearing of Fig. 2 as an example, and Fig. 6 shows the basic procedure of a two-point crossover. X1 = [X11 = [1,2,3,4,5,6,7,8],X12 = [R,L,L,R,R,R,L,L]],X2 = [X21 = [1,2,3,4,5,6,7,8],X22 = [L,L,R,L,R,R,R,L]] are taken as an example. In Fig. 6, two points in X11 are tasks 4 and 2, keeping sequence fraction [5] and [1, 6, 7] unchanged, [2,3,4, 8] becomes [2,3,4, 8] by X21 mapping, and final X31 = [1,2,3,4,5,6,7,8]. Similarly, X32 is obtained via this process. Of course, repeat this procedure until finding a feasible solution.

Fig. 6
figure 6

Two-point crossover operator

Finally, integrate employed bee colony, new employed bee colony, onlooker bee colony, new onlooker bee colony, and inferior set Q together, and rank them with non-dominated sorting. According to the ranking result, select a designated number of bees as the initial group for the next iteration until reach the iteration limit. Table 1 shows the pseudo-code of an improved multi-objective artificial bee colony (IMOABC) algorithm to solve TDLBP.

Table 1 The frame and procedure of IMOABC

4 Case Study

In this section, in order to verify the practicality of the proposed two-sided model, including sequence coding, decoding method, and an optimization method, an automotive engine composed of 32 parts is taken as a case.

Some parts are simplified for simplification, e.g., belts, bearings, and pins are removed, and a group of fasters is treated as one part. Some information on the engine parts and their attributes, including name, quantity, disassembly tool, estimated disassembly time, and direction are shown in Table 2. The hybrid graph is shown in Figs. 7, and 8 is the explosive view of an engine. The connection matrix Mc and precedence matrix Mp are shown in Figs. 9 and 10. Also, Table 3 listed the essential disassembly tools. The two-sided disassembly schemes can be obtained and each objective is evaluated by the method and model introduced in Sects. 2 and 3.

Table 2 The name of automobile engine parts and properties
Fig. 7
figure 7

Hybrid graph of the engine

Fig. 8
figure 8

The exploded view of an automotive engine

Fig. 9
figure 9

Connection matrix Mc of the engine

Fig. 10
figure 10

The precedence matrix Mp of the engine

Table 3 Disassembly tools for the automobile engine

4.1 Numerical Result and Algorithm Comparison

To test the optimization performance of IMOABC, three multi-objective algorithms are introduced for comparison, i.e., the famous NSGA-II [37], HABC [15] (an ABC recently applied to DLBP), and the multi-objective flower pollination algorithm (MOFPA) [30] (a recent TDLBP optimization method).

The parameters of IMOABC are set as follows: the population size PopSize = 50, iteration limit IteLim = 100, crossover possibility pc = 0.8, local iteration limit tlim = 5. The PopSize of NSGA-II, HABC, and MOFPA is the same as IMOABC. The population size of IMOABC, NSGA-II, and MOFPA is 50. According to the method introduced in Sect. 2, generate the disassembly sequence and two-sided disassembly scheme, and compute the value of each objective function in Sect. 3.1. In order to evaluate the solution quality obtained by each algorithm, the hypervolume indicator is used, and reference point R0 is set as (10,500,50,2).

Each algorithm runs 30 times, and Fig. 11 shows the hypervolume values of each run. It can be observed that the maximum hypervolume values of these four methods are 3991.7, 12,408.6, 11,988.7, 14,396.9, respectively, and IMOABC produces the best hypervolume result. To compare the stability of each algorithm, the hypervolume value box plot of 30 results for four algorithms is shown in Fig. 12. After removing the abnormal solution denoted in Fig. 12, the maximum hypervolume values of HABC, NSGA-II, MOFPA, IMOABC are 3279.8, 12,408.6, 11,988.7, 13,594.6, respectively. The best-known hypervolume result is obtained by IMOABC. From a comparison of Figs. 11 and 12, IMOABC can not only generate better solutions but also has better stability. Among these four algorithms, its overall performance is the best.

Fig. 11
figure 11

The hypervolume values of listed methods for 30 trials

Fig. 12
figure 12

Box plot of hypervolume value of four methods

Table 4 and Fig. 13 show the Pareto non-dominated frontiers of the best hypervolume in each method. But due to the random assignment of ‘L’ and ‘R’ in X2, it contains many different schemes with the same multi-objective value. The third column of Table 4 is the number of solutions with the same multi-objective value, and it is marked in Fig. 13. Using the hypervolume indicator and box plot to evaluate the optimization results, whether in terms of the stability of the solution set or the quality of the solution set, IMOABC is the best.

Table 4 The Pareto frontier of the four algorithms
Fig. 13
figure 13

The Pareto front of four methods

In addition, it is necessary to analyze each objective function optimization result of the optimal Pareto front. As shown in Table 4 and Fig. 13, HABC, NSGA-II, MOFPA, and IMOABC can reach the best-known value of 6 on the objective number of mated-workstations, and also the objective idle time of four algorithms is 178.835, 145.353, 144.874 and 145.291. In terms of mated-workstation, compared with MOFPA and IMOABC, all methods can get the 6 mated-workstations, while HABC and NSGA-II have fewer solutions of six mated-workstations. The fewer mated-workstations and lower idle time mean less construction cost and higher work efficiency, and IMOABC performs well in these two objectives optimization.

Table 5 shows the comparison results of the best Pareto frontiers of these four methods in Fig. 13, it can be found that the number of mated-workstations of IMOABC is 6. But some schemes generated by other methods require 7, 8, or more mated-workstations. In terms of average idle time, compared with IMOABC, the disassembly schemes obtained by HABC, NSGA-II, and MOPFA increased by 90.85%, 31.54%, 10.75% respectively. It means that the optimization results of the IMOABC’s Pareto solution on the idle time objective is significantly better than the other three methods, which improves work efficiency significantly. Moreover, a similar situation also occurs in the auxiliary indicator, which simplifies disassembly operation. However, its average smooth index is inferior to other methods except for HABC. Nevertheless, it is almost impossible to get the best solution for all the objectives, because the improvement of one objective may worsen other objectives, and it is necessary to make a compromise on each objective.

Table 5 Comparison of the result of Fig. 13

Anyway, whether it is the quality of the Pareto solution set measured by the hypervolume indicator or the overall optimization effect of each objective, IMOABC performs better than the other three algorithms. In this case, the IMOABC has a better performance on two-sided disassembly line scheme optimization compared with these three methods.

4.2 Analysis of Two-sided Disassembly Schemes Selection

Take one disassembly sequence from each of the three Pareto solutions obtained by IMOABC, as shown in Table 6. After decoding, the Gantt charts of these three schemes are shown in Fig. 14 to illustrate the disassembly scheme of the two-sided disassembly line in detail. The parts disassembled from “ + z”, “-x” and “ + x” directions are represented by yellow, green and cyan color blocks on the Gantt chart respectively, and the block length represents the specific disassembly time of the parts.

Table 6 Three two-sided disassembly sequence
Fig. 14
figure 14

Three disassembly schemes of engine two-sided disassembly line

Note that the part number and tool number are marked on each block. It can be observed that the parts with the same direction and tool are concentrated, which reduces the auxiliary operations significantly.

In the Pareto front, each solution does not dominate each other. The Pareto-based optimization process doesn’t need to consider which objective is the most important in optimization, but only the dominant relationship between the objective vectors. The process is not mixed with any subjective factors, and the constraints and conflicts between objectives and be avoided. After obtaining the Pareto set, the importance of each objective depends on the subjective demand of decision-makers, and the solution that best meets the requirements can be selected from it. Analytical hierarchical process (AHP) is a critical multicriteria decision-making method that balances the interactions among decision criteria and widely used in decision making, and it is chosen for scheme selection [41]. The main steps of AHP: (1) remove the dimension of each objective; (2) compare the importance of each objective and scoring to get a judgment matrix; (3) transform multi-objectives into a single objective [42]. Equation (24) denotes the dimensionless equation, where maxfi and minfi are the maximum and minimum values of fi components in Pareto optimal set, ωi is a weighting index according to the importance value shown in Table 7.

$$J = \sum\limits_{i = 1}^{4} {\omega_{i} \times \frac{{f_{i} - \min f_{i} }}{{\max f_{i} - \min f_{i} }}} .$$
(24)
Table 7 The scale of scoring and description

By comparison and scoring, the judgment matrix is as follows:

$$\left[ {\begin{array}{*{20}l} 1 \hfill & 2 \hfill & 3 \hfill & 3 \hfill \\ {1/2} \hfill & 1 \hfill & 3 \hfill & 3 \hfill \\ {1/3} \hfill & {1/3} \hfill & 1 \hfill & 1 \hfill \\ {1/3} \hfill & {1/3} \hfill & 1 \hfill & 1 \hfill \\ \end{array} } \right].$$

Finally, the weighting value ωi can be obtained from the judgment matrix by \(\omega_{i} = \frac{1}{4} \times \sum\limits_{j = 1}^{4} {(a_{i,j} /\sum\limits_{i = 1}^{4} {a_{i,j} } )}\), and ω1 = 0.439, ω2 = 0.313, ω3 = 0.124, and ω4 = 0.124. Thus, the optimal solution in the Pareto front of IMOABC is the first type in Table 6, and the Gantt chart of the best solution (solution 1) is shown in Eq. (14), and Fig. 15 illustrates this task distribution result.

Fig. 15
figure 15

The assignment result of the selected disassembly schemes

Through the above analysis, the proposed methodology can balance workload distribution, improve disassembly efficiency, reduce the number of workstations for cost reduction. For this case study, three kinds of solutions generated by IMOABC are provided for choosing. Compared with considering each objective lexicographically, Pareto non-dominated relationship can consider each goal, the results obtained by IMOABC is better than that of IMOABC. All solutions consider minimizing the number of mated-workstations, improving efficiency, rationally distributing workload, and simplifying the auxiliary operations. Also, decision-makers can choose a specific disassembly scheme based on their actual demands and the AHP method.

5 Conclusion and Future Work

The majority of published DLBP works are mainly based on a straight one-sided disassembly line, only one part can be removed at a time, and this linear production mode leads to a long work time. This work studied DLBP based on a two-sided disassembly line for disassembly efficiency improvement and construction costs reduction. The main research contributions and novelties are as follows:

  1. 1.

    In order to describe the constraints among parts, a hybrid graph is used, and a connection matrix and a precedence matrix are constructed to express the relationship among parts in detail. On this basis, an improved disassembly sequence encoding method is customized for a two-sided disassembly line.

  2. 2.

    In response to the challenge in a two-sided disassembly line scheme planning, a decoding method is established for task assignment.

  3. 3.

    Aiming to reduce the number of mated-workstations, idle time, auxiliary indicator, and smooth index, a multi-objective two-sided disassembly line balancing model is established, and an improved multi-objective artificial bee colony algorithm which based on Pareto non-dominated relationship, crossover and mutation operators of GA is proposed for disassembly scheme optimization.

To verify the practicality and superiority of the proposed method and model, a case of an automotive engine is studied and compared with three typical algorithms. The results show that the proposed model and methodology can be applied to two-sided disassembly scheme planning and improve the scheme optimization performance of a two-sided disassembly line by the analysis in Sect. 4.1. Besides, the result indicates that the developed methodology can guide the disassembly of end-of-life products on a two-sided disassembly line.

Although the effectiveness of the proposed methodology has been verified, some limitations still exist. In this paper, the disassembly line is only designed for a single product. It is necessary to introduce a mixed-model for a two-sided disassembly line, which can handle various products by one line. Also, more uncertainties need to be considered, such as disassembly time might be changed, and some parts in the end-of-life sample product may be lost. Some objectives should be taken into consideration, such as profit and energy consumption.