1 Introduction

In the background of economic globalization [11], the demand for individualization of orders has greatly increased, which has led to a diversification of products [12, 27, 33]. Enterprises are developing in the direction of mass customization. It is becoming increasingly difficult for traditional manufacturing and management patterns to meet the development requirements of modern enterprises. Therefore, the manufacturing mode has begun to evolve from the traditional single workshop to multiple distributed workshops [25, 28]. Distributed manufacturing has gradually become an important trend and has been successfully applied in many important industries, such as aircraft, aerospace, and electronic products [9, 26, 40]. Focusing on distributed manufacturing systems, distributed shop scheduling studies plans for distributing jobs among enterprises, factories, and shops and the production schemes within each enterprise, factory, or shop to optimize production indices [34].. Therefore, it is critical to generate production schemes based on the concrete conditions of different factories. By rationally allocating production tasks across factories, managers can better hedge the risks associated with single production and reduce the corresponding production costs and product delivery cycles [25, 31].

Distributed manufacturing patterns have become a typical phenomenon in industry [24]. Current research related to distributed manufacturing focuses on distributed flow shop problems (DFSPs) or distributed job shop problems (DJSPs). The structures of factories are mostly homogeneous [19]. However, in fact, due to the different production capabilities of different factories, they can provide completely different alternative process plans for the same part. Treating each factory as a simple homogeneous job shop environment constitutes neglect and waste of flexible manufacturing capability. Therefore, considering heterogeneous process flexibility in distributed manufacturing, i.e., distributed integrated process planning and scheduling (DIPPS) problems, is more suitable for actual production environments. Compared with the DJSP or distributed FJSP (DFJSP), the DIPPS problem is more general and has a more complex solution space structure, which also means that it is more relevant to actual production requirements and more valuable for research.

By taking full advantage of the complementarity of process planning and shop scheduling, their integration can greatly facilitate the development of an intelligent manufacturing system [1]. In the last decade, the integrated process planning and scheduling (IPPS) problem has become a research hotspot in the manufacturing system area. It is also an NP-hard problem and is more complicated than the job shop scheduling problem [29]. Process planning has a large number of manufacturing flexibilities that can provide more choices for production but makes it difficult to obtain the optimal process plan [14]. These complex constraints and flexibilities bring great difficulty for modeling and solving this problem. Traditionally, process planning and shop scheduling are carried out sequentially. Scheduling the jobs whose process routes are predetermined means that the process planning system cannot provide flexible support for the scheduling system. This can lead to some production problems, such as unbalanced machine loads, bottlenecked resources, and conflicting optimization objectives in the distributed manufacturing system [20]. Therefore, it is essential to carry out research on IPPS in distributed manufacturing systems.

There are few existing publications on DIPPS, and their innovations have mainly concerned the design of chromosome representation methods for genetic algorithms (GAs) [13, 35, 36]. These studies were very creative, with novel ideas for subsequent DIPPS research. However, the above research did not greatly improve the overall framework of the algorithm. Their encoding methods were carried out by encoding the process routes generated from the network graph rather than directly encoding the process network itself. This indirect intermediate method can allow the algorithms to miss some potential optimal solutions. In addition, the work of generating process routes from various networks with complex topological relationships can consume many human and computational resources and reduce the efficiency of production management. Most importantly, the above research assumed that the number of process routes was enumerable and limited, but it was not. The jobs used as examples in these publications have only 3–5 alternative process routes [13], while the actual process planning (PP) problem has been proven to be NP-hard in the literature [17]. Therefore, it cannot be simply represented by a limited number (3–5) of routes, and such simplification is inappropriate. In this paper, a mathematical model of DIPPS based on a network graph is developed, and a new encoding and decoding approach is designed. This approach directly codifies the selection state of the OR-nodes in the chromosomes, which can enable more kinds of representations for process routes and the effective reconstruction of the solution space.

Most production problems are multiobjective, while all the current studies of DIPPS problems are single-objective [13, 35, 36, 38]. Aiming to solve multiobjective problems in actual production and to provide high-quality alternative schemes for production management decision-makers, this paper investigates the multiobjective DIPPS (MOIPPS) problem to minimize makespan, total machine workload, and maximum machine workload. Common handling methods for multiobjective problems include the weight-based method and the Pareto-based method. Because it is difficult to trade off weight coefficients between objectives, the Pareto-based method has become mainstream in current multiobjective optimization research. As a typical representative of many meta-heuristics, the GA has always been one of the main methods of solving scheduling problems. However, the traditional GA is not as powerful in local search as it is in global search. The memetic algorithm (MA) is a combination of global search based on population and local heuristic search based on individuals; thus, it has a good balance of exploration ability and exploitation ability. MA has been successfully applied in solving IPPS problems and has achieved a certain degree of success. To improve the performance of traditional MA, a new encoding method for chromosome representation is proposed and integrated into the MA framework, and a multiobjective MA (MOMA) is further proposed to solve MODIPPS problems. For the local search component of the MOMA, a simulated annealing (SA) mechanism is introduced to receive relatively worse individuals with a certain probability in the neighborhood of the current solution, which can prevent the algorithm from falling into a local optimum prematurely and can balance the global search capability and local search capability. For fairness of comparison, numerical experiments are conducted on open benchmarks. The main contributions of this paper are as follows:

  1. 1.

    In this paper, the MODIPPS problem is studied for the first time, and a mixed-integer linear programming (MILP) mathematical model for the multiobjective problem is established based on heterogeneous process network graphs.

  2. 2.

    The SA mechanism is introduced to prevent the MOMA from falling into a local optimum by receiving relatively worse individuals with a certain probability, balancing the global search ability and local search ability of the algorithm.

  3. 3.

    The proposed MOMA can obtain a better Pareto solution set. Comparison experiments on the benchmarks show sufficient effectiveness and superiority in solving MODIPPS problems compared with existing classic multiobjective optimization algorithms.

The structure of the paper is arranged as follows: Sect. 2 reviews the relevant literature. Section 3 introduces the MODIPPS problem and the mathematical model. Section 4 presents the MOMA and the SA mechanism. Section 5 describes the numerical experiments. Section 6 concludes the research.

2 Literature review

2.1 Distributed shop scheduling problem

Jia et al. [5] proposed distributed manufacturing to reduce product cost and risk and proposed an improved genetic algorithm to solve the DJSP. Naderi et al. [23] studied the DFSP and established six different MILP models and then proved that the distributed permutation flow shop scheduling problem (DPFSP) is an NP-hard problem. To meet the requirements of different manufacturing modes, Ying et al. [34] established a MILP model for the distributed hybrid flow shop scheduling problem (DHFSP) for the first time and designed an iterative greedy (IG) algorithm. As an extension of the DFSP, optional flexible constraints were added to the DHFSP. Zhao et al. [40] proposed an improved differential evolution algorithm for the DFSP with blocking to minimize the maximum completion time. Giovanni et al. [3] researched the DFJSP, which can be regarded as an extension of the DJSP. Based on the above work, Wu et al. [33] investigated the distributed assembly flexible job shop scheduling problem and proposed a MILP model. To minimize tardiness and cost, a hybrid multiobjective algorithm of the differential evolution (DE) algorithm and SA was designed to solve the problem. Li et al. [7] first studied the distributed heterogeneous no-wait flow shop scheduling problem and proposed a discrete artificial bee colony algorithm (DABC) to solve this problem. For the distributed heterogeneous permutation flow shop scheduling problem, to minimize the maximum completion time, negative social impact, and energy consumption, Lu et al. [18] proposed a knowledge-based multiobjective memetic optimization algorithm (KMMOA).

In summary, the current distributed scheduling research has mainly focused on the basic flow shop scheduling problem (FSP), permutation flow shop scheduling problem, and JSP. Moreover, the environment of most factories is homogeneous. To better serve practical production, it is necessary to study distributed shop scheduling problems with heterogeneous process flexibilities.

2.2 Multi-objective integrated process planning and scheduling problem

In the IPPS problem, process planning is often considered based on a classical scheduling problem. A process planning problem can be viewed as a constrained traveling salesman problem (TSP), which is NP-hard. Similarly, the JSP has also been proved to be NP-hard. As a problem integrating the above two, the IPPS problem is much more difficult to solve. Li et al. [8] proposed a game theory-based approach to investigate the multiobjective IPPS problem, and the Nash equilibrium in the game theory-based approach has been used to address multiple objectives. Mohammadi et al. [20] proposed a hybrid multiobjective simulated annealing algorithm (MOHSA) by fully utilizing the capability of exploration search and fast convergence. Considering IPPS as a multiobjective optimization problem in reconfigurable manufacturing settings, Mohapatra et al. [21] applied the nondominated sorting genetic algorithm-II (NSGA-II) to take into account the computational intractability of the problem. Li et al. [10] modified the solution representation with a multilayer structure and applied NSGA-II to solve MOIPPS problems. Jin et al. [6] proposed a multiobjective memetic algorithm to solve the MOIPPS problem. Considering the makespan, critical machine workload, and machine total workload as the objective functions, Shokouhi et al. [29] tried to solve the IPPS problem with the GA in a weighted-sum form. Zhao et al. [39] proposed a two-generation Pareto ant colony algorithm for multiobjective job shop scheduling problems (MOJSP) with alternative process plans and unrelated parallel machines. Aiming at saving energy and reducing carbon emissions, Zhang et al. [37] took advantage of the integration between process planning and scheduling to achieve energy savings by using a hierarchical multistrategy genetic algorithm based on nondominated sorting. For a real-world case from a battery packaging machinery workshop, Wen et al. [32] proposed a two-stage solution method based on NSGA-II for green MOIPPS problems. The current status of research related to MOIPPS is arranged chronologically in Table 1. The research objectives focus on the makespan, cost, workload, energy consumption, etc. Numerous algorithms have been proposed by researchers to solve this problem, with notable results. In this paper, the framework is also based on an evolutionary algorithm, the memetic algorithm, which incorporates local search to balance the search ability of the algorithm.

Table 1 Publications on the MOIPPS problem

3 Problem formulation for MODIPPS

3.1 DIPPS problem and the network graph

In the DIPPS problem, each distributed factory can provide a process network graph for a job i, and the concrete process route of job i needs to be summarized from the network graph. As shown in Fig. 1, the network graph is composed of five types of nodes: the start node, which is virtual and represents the start of the job’s production process; the end node, which is also virtual and indicates the end of the production process; intermediate nodes, implying operations; and OR-nodes, combined with JOIN-nodes, representing process flexibilities. An intermediate node contains three types of information: the operation number in a solid circle, the optional machine number in \(\{ \}\), and the corresponding processing time in \([]\). The nodes in the network are connected by arrows indicating the precedence relationship constraints between operations. Only one of the process paths between an OR-node and the corresponding JOIN-node is chosen. The operations on the chosen paths with other operations compose a feasible operation combination for the part. In addition, only one machine is selected as the platform to process the corresponding operation: 1(3)–6(7)–4(5)–7(13)–5(8)–9(3)–10(13). The numbers outside the parentheses indicate operations, and those inside are the machines. Since different jobs have different process network graphs, the number of alternative process routes may be great.

Fig. 1
figure 1

An example of a network graph

3.2 MILP mathematical model

For preliminary knowledge of the mathematical model, the details can be found in our previous work [15]. This MOIPPS problem is established based on the precedence relationship modeling idea, and the process network graph and topology analysis were discussed in detail in the previous publication [14].

The model is established based on the following assumptions:

(1) A job can only be assigned to one factory;

(2) A machine can only process one operation at a time, and the processing procedure cannot be interrupted;

(3) At the beginning, all jobs and machines are already available;

(4) The transportation time of the jobs from each factory to the destination varies.

Sets, subscripts, and notation

 F

The set of factories

 f

Factories, 1 ≤ f ≤|F|

 N

The job set

 \(J^{f}_{i}\)

The operation set of job i if it is assigned to factory f

 \(R^{f}_{i}\)

The OR-node set of job i’s process network in factory f

 Mf

The machine set in factory f

 \(O^{f}_{ij}\)

Operation j of job i if it is assigned to factory f

 i, i

Jobs, 1 ≤ i ≤|N|

 j, j

Operations, 1 ≤ j ≤ \(|J^{f}_{i}|\)

 k, k

Machines, 1 ≤ k ≤ |Mf|

 r, r

OR-nodes, 1 ≤ r ≤ \(|R^{f}_{i}|\)

 l, l

Links

Parameters

 \(U^{f}_{ijj^{\prime}}\)

1 if \(O^{f}_{ij}\) should be processed before \(O^{f}_{ij}\), according to the precedence relationship represented by job i’s network provided by factory f; 0 otherwise

 \(P^{f}_{ijk}\)

The processing time of \(O^{f}_{ij}\) on the kth machine of set Mf

 \(W^{f}_{ijrl}\)

1 if \(O^{f}_{ij}\) is located at the rth OR-node’s lth link in the network provided by factory f for job i; 0 otherwise

 \(E^{f}_{i}\)

The transportation time of job i from factory f to its destination

Variables

 \(D^{f}_{i}\)

1 if job i is delivered to factory f; 0 otherwise

 \(R^{f}_{irl}\)

1 if the lth link of the rth OR-node of job i is selected; 0 otherwise

 \(X^{f}_{ij}\)

1 if \(O^{f}_{ij}\) is selected; 0 otherwise

 \(Z^{f}_{ijk}\)

1 if \(O^{f}_{ij}\) is processed on machine k; 0 otherwise

 \(Y^{f}_{iji^{\prime}j^{\prime}}\)

1 if \(O^{f}_{ij}\) is processed before \(O^{fi^{\prime}}_{j^{\prime}}\); 0 otherwise

 \(S^{f}_{ij}\)

The starting time of \(O^{f}_{ij}\)

 Ci

The completion time of job i

 Cmax

Makespan

 MMW

The maximal machine workload

 TWM

The total workload of the machines

Objectives:

(1) Minimize the maximum completion time:

$$ \min f_{1} = C_{\max } $$
(1)

(2) Minimize the maximal machine workload:

$$ \min f_{2} = MMW $$
(2)

(3) Minimize the total workload of the machines:

$$ \min f_{3} = TWM $$
(3)

Constraints:

Each job can only be assigned to one factory:

$$ \sum\limits_{f} {D_{i}^{f} } = 1,\;\;\;\;\;\;\forall i $$
(4)

Only one link of an OR-node can be selected. If the job is not assigned to factory f, the network with the OR-nodes provided by factory f for job i will not be considered:

$$ \sum\limits_{l} {R_{irl}^{f} } = D_{i}^{f} ,\;\;\;\;\;\;\forall f,i,r $$
(5)
$$ \sum\limits_{j} {X_{ij}^{f} } = D_{i}^{f} ,\;\;\;\;\;\;\forall f,i $$
(6)

If the link where operation \(O^{f}_{ij}\) is located is not traversed, \(O^{f}_{ij}\) will not be selected.

$$ X_{ij}^{f} \le M \cdot (1 - W_{ijrl}^{f} ) + M \cdot R_{irl}^{f} ,\;\;\;\;\;\;\forall f,i,j,r,l $$
(7)

After analyzing the premises of selecting the operations, it can be concluded that if an operation is not controlled by any OR-node, it will certainly be selected, and it will also be selected if all its controlling links are traversed. Therefore, this paper formulates the OR-node function constraint as follows:

$$ X_{ij}^{f} \ge 1 - M \cdot \sum\limits_{r} {\sum\limits_{l} {W_{ijrl}^{f} \cdot (1 - R_{irl}^{f} )} ,\;\;\;\;\;\;\forall f,i,j} $$
(8)

\(O^{f}_{ij}\) will be assigned to one machine selected from its alternative machine set.

$$ \sum\limits_{k} {Z_{ijk}^{f} } = X_{ij}^{f} ,\;\;\;\;\;\;\forall f,i,j $$
(9)

The precedence relationship 0–1 variable \(Y^{f} _{iji^{\prime}j^{\prime}} \) should be constrained according to the precedence constraints expressed by the networks.

$$ \sum\limits_{i^{\prime}} {\sum\limits_{j} {\sum\limits_{j^{\prime}} {Y_{iji^{\prime}j^{\prime}}^{f} } } } \le M \cdot D_{i}^{f} ,\;\;\;\;\;\;\forall f,i $$
(10)
$$ \sum\limits_{i} {\sum\limits_{j} {\sum\limits_{j^{\prime}} {Y_{iji^{\prime}j^{\prime}}^{f} } } } \le M \cdot D_{i^{\prime}}^{f} ,\;\;\;\;\;\;\forall f,i^{\prime} $$
(11)
$$ M \cdot (2 - X_{ij}^{f} - X_{ij^{\prime}}^{f} ) + Y_{ijij^{\prime}}^{f} \ge U_{ijj^{\prime}}^{f} ,\;\;\;\;\;\;\forall f,i,j,j^{\prime} $$
(12)

The operations belonging to the same job should be processed sequentially according to the precedence relationships.

$$ S_{ij^{\prime}}^{f} \ge S_{ij}^{f} + \sum\limits_{k} {P_{ijk}^{f} \cdot Z_{ijk}^{f} } - M \cdot (3 - X_{ij}^{f} - X_{ij^{\prime}}^{f} - Y_{ijij^{\prime}}^{f} ),\;\;\forall f,i,j,j^{\prime} $$
(13)
$$ S_{ij}^{f} \ge S_{ij^{\prime}}^{f} + \sum\limits_{k} {P_{ij^{\prime}k}^{f} \cdot Z_{ij^{\prime}k}^{f} } - M \cdot (2 - X_{ij}^{f} - X_{ij^{\prime}}^{f} + Y_{ijij^{\prime}}^{f} ),\;\;\forall f,i,j,j^{\prime} $$
(14)

The operations assigned to the same machine should also obey the precedence relationships.

$$\begin{aligned} S_{i^{\prime}j^{\prime}}^{f} &\ge S_{ij}^{f} + P_{ijk}^{f} - M\\ &\quad \cdot (3 - Z_{ijk}^{f} - Z_{i^{\prime}j^{\prime}k^{\prime}}^{f} - Y_{iji^{\prime}j^{\prime}}^{f} ),\;\;\forall f,i,i^{\prime},j,j^{\prime},k\\ \end{aligned} $$
(15)
$$ S_{ij}^{f} \ge S_{i^{\prime}j^{\prime}}^{f} + P_{i^{\prime}j^{\prime}k}^{f} - M \cdot (2 - Z_{ijk}^{f} - Z_{i^{\prime}j^{\prime}k^{\prime}}^{f} + Y_{iji^{\prime}j^{\prime}}^{f} ),\;\;\forall f,i,i^{\prime},j,j^{\prime},k $$
(16)

If job i is not processed in factory f, \(S_{ij}^{f}\) will not be considered and is set to 0:

$$ \sum\limits_{j} {S_{ij}^{f} } \le M \cdot D_{i}^{f} ,\;\;\;\;\;\;\forall f,i $$
(17)

From the above, the completion time can be obtained as:

$$ C_{\max } \ge S_{ij}^{f} + \sum\limits_{k} {P_{ijk}^{f} \cdot Z_{ijk}^{f} } - M \cdot (1 - X_{ij}^{f} ) + E_{i}^{f} \cdot D_{i}^{f} ,\;\;\;\;\;\;\forall f,i,j $$
(18)

The MMW value can be obtained as:

$$ MMW = \mathop {\max }\limits_{\forall f,k} \left( {\sum\limits_{i} {\sum\limits_{j} {P_{ijk}^{f} \cdot Z_{ijk}^{f} } } } \right) $$
(19)

The TWM value can be obtained as:

$$ TWM{ = }\sum\limits_{f} {\sum\limits_{k} {\sum\limits_{i} {\sum\limits_{j} {P_{ijk}^{f} \cdot Z_{ijk}^{f} } } } } $$
(20)

4 The proposed MOMA

4.1 Multiobjective memetic algorithm

In contrast to the GA, which simulates biological evolutionary procedures, Moscato et al. proposed the MA [4], which simulates cultural evolutionary procedures. The MA uses local heuristic search to simulate mutation and is supported by a large amount of expertise, so the MA can be considered a combination of population-based global search and individual-based local heuristic search (Table 2). This combination mechanism of the MA makes its search efficiency greater than that of the traditional GA in some optimization problems, and it can be applied to a wide range of problem areas with satisfactory results. The crossover operator and mutation operators for individuals are designed according to the encoding method. In addition, the MOMA deals with multiple objectives in a Pareto-based way to find the optimal Pareto frontier. The Pareto frontier and Pareto dominating relationships have been widely introduced in multiobjective optimization publications and will not be repeated in this paper. The flow chart of the MOMA is given in Fig. 2.

Table 2 The MODIPPS example problem
Fig. 2
figure 2

The flow chart of the proposed MOMA

The concrete steps of the proposed MOMA are shown below:

Step 1: Initialization. Set the parameters and generate the population randomly. The generation number is Gen = 1.

Step 2: Select individuals through tournament selection, and perform crossover with probability PC and mutation with probability PM.

Step 3: For each individual Xcur, perform an SA search.

Step 4: Evaluate the population and generate the Pareto frontier. Determine whether the termination criteria are satisfied; if so, go to step 5, and otherwise, go to step 3.

Step 5: Output the solutions of the current Pareto frontier. End.

4.2 Coding methods

The encoding method proposed in this paper includes four types of chromosomes: factory chromosomes, OR-node chromosomes, operation chromosomes, and machine chromosomes (Fig. 3). Except for the factory chromosome, all types of chromosomes have subchromosomes corresponding to factories, as shown in Fig. 4. The factory chromosome is used to represent the factory selected for the job. The OR-node chromosome represents the selection state of the OR-nodes in the network. ‘1’ and ‘2’ represent the left and right links, respectively. For the operation chromosome, each gene contains both a job number and an operation number. The decoding order is from left to right. The machine chromosome, arranged in the order of the job numbers, represents the machine assigned by the operation at each position. In conclusion, the complete production scheme is decoded from the above example: factory 1 is responsible for processing jobs 1 and 3, while factory 2 is responsible for job 2. The production order of the operations in factory 1 is 1–1(1) → 3–2(2) → 1–2(4) → 1–3(2) → 3–3(1) → 3–7(5)  → 3–5(4) → 3–6(4), and that in factory 2 is 2–3(2) → 2–1(1) → 2–4(4) → 2–6(3). The pair of numbers outside the parentheses represents a job and its operation number, and the number inside the parentheses is the assigned machine number. According to the decoding method described in the previous publication [16], the objective makespan can be obtained easily.

Fig. 3
figure 3

Example process network graphs

Fig. 4
figure 4

The coding method for a MOIPPS scheme

4.3 Initialization

Each individual in the population is randomly initialized according to the above encoding rules, and each individual represents a solution. All individuals should obey the precedence constraints to ensure the smooth operation of the MOMA. Without infeasible individuals in the population, in other words, there is no need to adjust the operation chromosome of the infeasible individuals after each iteration. This saves adjustment time for the algorithm to a certain extent and can accelerate the solving speed of the algorithm.

4.4 Crossover

A new individual is obtained by the crossover operator between two parent individuals, which are selected by the tournament selection method. The crossover method is shown in Fig. 5. Two points are randomly selected, and the genes outside the points are transferred to the new individual at the same positions, while the sequence of genes inside the points is obtained by mapping. This mapping operation can generate an offspring chromosome with the same legality from two feasible operation chromosomes to ensure continuous legitimacy in the evolutionary process of the MOMA. For factory, OR-node, and machine chromosomes that are not constrained by precedence, their offspring chromosomes can be obtained by swapping the subsequences in the corresponding positions. This operation is relatively simple, so it will not be described here.

Fig. 5
figure 5

Crossover for the operation chromosome

4.5 Mutation

This paper designs a single-point insertion operation as the mutation operator, as shown in Fig. 6. A point is randomly selected to determine the range where it can be inserted, i.e., its predecessor operation and successor operation. The selected operation point can be inserted at any position in the range. The factory chromosome, OR-node chromosome, and machine chromosome are not constrained by the precedence constraints, so the mutation can be performed through single-point switching to randomly select a point and switch the selection state to one of the other states. The switching operation is shown in Fig. 7.

Fig. 6
figure 6

Mutation for operation chromosome

Fig. 7
figure 7

Mutation for the three chromosomes

4.6 Simulated annealing mechanism

The proposed algorithm has a powerful global search capability, but it easily falls into a local optimum, so the SA mechanism for local search is introduced here to address the shortcomings of the algorithm. In the SA step, new individuals are generated by the above two genetic operations. The concrete steps of SA are shown below.

For each individual Xcur, perform SA search:

Step 1: Randomly select an individual to carry out the crossover operation with the current individual Xcur, and then carry out mutation on the current individual to generate a new individual Xnew.

Step 2: Determine whether Xnew dominates Xcur. If so, set Xcur = Xnew and go to step 5; otherwise, go to step 3.

Step 3: Determine whether Xcur dominates Xnew. If so, go to step 4; otherwise, set Xcur = Xnew or Xnew = Xcur and go to step 5.

Step 4: Calculate PSA and generate a [0, 1] random value Rand. Determine whether Rand < PSA. If so, set Xcur = Xnew; otherwise, go to step 5.

Step 5: Determine whether SA has terminated. If so, go to step 6; otherwise, go to step 1.

Step 6: Output the current solution Xcur. End.

The PSA of the single-objective optimization problem is calculated as follows, where fnew and fcur denote the fitness values of the new individual and the current individual:

$$ P_{SA} = \left\{ \begin{gathered} 1,\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;f^{{{\text{new}}}} < f^{{{\text{cur}}}} \hfill \\ \exp \left( - \frac{{f^{{{\text{new}}}} - f^{{{\text{cur}}}} }}{T}\right),\;\;\;\;\;\;\;\;f^{{{\text{new}}}} \ge f^{{{\text{cur}}}} \hfill \\ \end{gathered} \right. $$
(21)

Since this paper studies multiobjective optimization, the above PSA formula for single-objective problems cannot be applied directly to this multiobjective problem. The PSA formula for the multiobjective problem is shown below, and the subscript m denotes the m th optimization objective in the multiobjective optimization problem.

$$ P_{SA} = \left\{ \begin{gathered} \;1,\;\;\;\;\;\;\;\;\; X_{new} \;{\text{dominates}}\;X_{cur} \hfill \\ \prod\limits_{m = 1}^{M} {\exp \left( - \frac{{f_{m} }}{T}\right)} ,\;\;\;\;otherwise \hfill \\ \end{gathered} \right. $$
(22)

The value T in Eqs. (21) and (22), which is the current temperature, and the intermediate variable fm can be calculated as follows:

$$ f_{m} = \left\{ \begin{gathered} \;0,\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;if\;\;f_{m}^{{{\text{new}}}} < f_{m}^{{{\text{cur}}}} \hfill \\ f_{m}^{{{\text{new}}}} - f_{m}^{{{\text{cur}}}} ,\;\;\;\;otherwise \hfill \\ \end{gathered} \right. $$
(23)
$$ T = \alpha^{k} \cdot T_{0} $$
(24)

where α is the cooling rate; k is the number of cooling times, which is set to the current iteration number of the MOMA; and T0 is the initial temperature.

5 Experimental studies and discussion

To test the proposed MOMA, two groups of numerical experiments are designed. The data of these numerical experiments are openly available in existing publications, so the experimental results can objectively illustrate the superiority of the proposed algorithm. The cases are divided into two groups according to the size of the experimental case: the first group is from the literature [35, 36], and the second is adopted from [13]. The case data can be downloaded from the relevant website to facilitate comparative studies by future researchers. The proposed MOMA is implemented via C++ programming and is run on a PC with an i7-8700 CPU and 16 GB memory. The classic NSGA-II and MOEA/D algorithms are employed for comparison with the proposed MOMA. In addition, a version of the MOMA without an SA mechanism, named MOMA_NSA, is employed to prove the effectiveness of SA in the MOMA. Each algorithm is run 10 times independently with a fixed running time of 300 s. The three performance indicators for the multiobjective algorithms are the set coverage (CS), generational distance (GD), and inverse generational distance (IGD). CS was proposed by Zitzler et al. [41] to compare the coverage rates of two solution sets. A larger CS value means that there are more solutions in this solution set that dominate those in the other set, which also means that this set is closer to the optimal Pareto frontier. The second evaluation indicator is GD [30], which is used to describe the distance between the current Pareto frontier and the optimal Pareto frontier. The IGD indicator is a comprehensive evaluation index that can reflect the convergence and distribution degrees [2]. The smaller the IGD value is, the better the comprehensive performance of the MOEA is.

(1) CS. Suppose both sets A and B are approximations of the Pareto frontier. C (A, B) > C (B, A) means that set A is a better approximation of the optimal Pareto frontier. Then, C (A, B) is defined by Eq. (25).

$$ C(A,B) = \frac{{\left| {\left\{ {b \in B\left| {\exists a \in A:a\;{\text{dominates}}\;{\text{b}}\;} \right.} \right\}} \right|}}{\left| B \right|} $$
(25)

(2) GD. By calculating the distance between a point on the current Pareto frontier and the optimal Pareto frontier, the solution set obtained by the algorithm is evaluated. The smaller the GD value is, the better the current Pareto frontier. GD is defined as follows:

$$ GD = \frac{{\sqrt {\sum\nolimits_{i = 1}^{n} {d_{i}^{2} } } }}{n}, $$
(26)

where di is the Euclidean distance from the ith point of the current Pareto frontier (PF) to the nearest point of the optimal Pareto frontier (PF*) and n is the number of solutions in PF. Since the optimal frontier cannot be obtained directly, PF* here is the optimal Pareto frontier composed of all the solutions of various multiobjective evolutionary algorithms after each independent run.

(3) IGD. The shortest distance from a point on PF* to PF is calculated to evaluate the algorithms. IGD is defined as follows:

$$ IGD = \frac{{\sqrt {\sum\nolimits_{i \in PF*} {d_{i}^{*2} } } }}{n}, $$
(27)

where di* is the Euclidean distance from the ith point of PF* to the nearest point of PF and n is the number of solutions in PF*. Since the optimal frontier cannot be obtained directly, PF* here is the optimal Pareto frontier composed of all the solutions of various multiobjective evolutionary algorithms.

The parameter configuration affects the performance of the MOMA. The parameters are the population size PopSize, the crossover probability Pc, the mutation probability Pm, the cooling rate α, and the initial temperature T0. A Taguchi design-of-experiment (DOE) approach is adopted to determine the most appropriate parameter configuration. The level of each parameter is as follows: PopSize = \(\{\)100, 200, 300, 400\(\}\), Pm = \(\{\)0.2, 0.4, 0.6, 0.8\(\}\), Pc = \(\{\)0.2, 0.4, 0.6, 0.8\(\}\), α = \(\{\)0.96, 0.97, 0.98, 0.99\(\}\), and T0 = \(\{\)200, 300, 400, 500\(\}\). An orthogonal array L16(45) is used, as shown in Table 3. The comprehensive index IGD is selected to evaluate the parameter combinations. According to Fig. 8, the MOMA performs better with the parameter combination PopSize = 400, Pm = 0.2, Pc = 0.2, α = 0.97, and T0 = 500.

Table 3 Orthogonal array and average values
Fig. 8
figure 8

Main effects plot for MOMA

5.1 Experiment 1

Cases P1 and P2 in this group of experiments are small-scale DIPPS problems, which are derived from the literature [35, 36]. By comparing the indicators GD, IGD, and CS, as shown in Tables 4, 5, the proposed MOMA can obtain a better Pareto frontier with smaller GD and IGD values than those of the other comparison algorithms, indicating that the frontier of the MOMA is closer to the optimal one. According to the value of the indicator CS, the number of dominating solutions in the MOMA solution set is greater than the number of dominated solutions when compared with the other two algorithms. Therefore, the proposed MOMA performed better than NSGA-II, MOMA_NSA and MOEA/D.

Table 4 The comparison based on GD
Table 5 The comparison based on IGD

5.2 Experiment 2

This group of experiments contains 11 cases. The numbers of factories, jobs, and machines in these examples are shown in the Tables 6, 7, and the cases that belong to the large-scale DIPPS problem are indicated. The comparison results for NSGA-II, MOMA_NSA and MOEA/D are shown in Tables 8, 9, 10, 11. The MOMA has smaller GD and IGD values and is superior to the comparison algorithms in terms of the indicator CS. Taking case E11 as an example, its Gantt chart and the Pareto front graphs of the three algorithms are as shown in Figs. 9 and 10.

Table 6 The comparison based on CS (part 1)
Table 7 The comparison based on CS (part 2)
Table 8 The comparison based on GD
Table 9 The comparison based on IGD
Table 10 The comparison based on coverage (part 1)
Table 11 The comparison based on coverage (part 2)
Fig. 9
figure 9

Pareto frontier graph of three algorithms of case E11

Fig. 10
figure 10

The Gantt chart for case E11

5.3 Discussions

From the above experimental results and the three comparative indicators, it can be concluded that the MOMA proposed in this paper outperforms the comparative algorithms in the different testing instances. Compared with MOEAD and NSGA2, the MOMA has not only a strong global search ability but also a better local search ability. By introducing the SA mechanism for local search in a specific neighborhood, the MOMA should accept relatively poor solutions with a certain probability so that it converges at a certain speed and ensures the diversity of the population. Therefore, the algorithm can avoid falling into a local optimum prematurely and can guarantee the solution quality. The balance of exploration and exploitation abilities is the MOMA’s important characteristic, and it is also the key to obtaining better solutions.

6 Conclusions

This paper describes a new manufacturing mode in a distributed manufacturing system, which integrates the process planning subsystem with the scheduling subsystem. Owing to the complementarity of the two subsystems, the production efficiency and the quality can be significantly improved. This method could provide more potential choices for production managers to save costs and keep production steady. Furthermore, it is difficult for operators to make decisions on multiobjective problems in actual production, and actual problems are always multiobjective problems. For the MODIPPS problem, a novel network-based MILP model is established for the first time, and the MOMA is designed by taking advantage of the network and the OR-node logic. The encoding and decoding method is newly designed for the distributed manufacturing mode to integrate the process planning and scheduling steps. Owing to the introduction of the SA mechanism to enhance the exploration and exploitation ability, the proposed MOMA outperforms the other two classic multiobjective algorithms.

The MOMA proposed in this paper can provide managers with a near-optimal Pareto frontier and greatly reduces the difficulty of decision-making for multiobjective production problems. The integration mode can accelerate the intelligentization of modern manufacturing systems and can greatly improve the production management capacity of enterprises. This successful application should inspire managers to consider and deal with production problems from a global perspective and explore their solutions from the perspective of the essential problem characteristics.