Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

The traveling salesman problem (TSP) is one of the most studied problems in combinatorial optimization and has got applications in many different areas. The TSP consists of finding a shortest tour in a complete graph whose edges (i,j) have cost (distance) \(c_{ij}\). A comprehensive treatment of the traveling salesman problem can be found in [3].

In this paper we do not assume that the cost matrix is symmetric. However, our figures will refer to symmetric instances. We consider a dual relaxation of the original problem—the assignment problem \(A\) based on the same cost matrix. The result is a dual relaxation, possibly with subtours, as shown in Fig. 1. This problem can also be solved by any code for the assignment problem, e.g. [1].

$$\begin{aligned} A: \qquad \qquad \mathrm{min }\sum _{i=1}^n \sum _{j=1}^n c_{ij} x_{ij}&\end{aligned}$$
(1)
$$\begin{aligned} \mathrm{s.t.}&\nonumber \\ \sum _{i=1}^n x_{ij} = 1 \qquad 1\le j \le n&\end{aligned}$$
(2)
$$\begin{aligned} \sum _{j=1}^n x_{ij} = 1 \qquad 1\le i \le n&\end{aligned}$$
(3)
$$\begin{aligned} 0 \le x_{ij} \le 1 \qquad 1 \le i,j \le n&\end{aligned}$$
(4)
Fig. 1
figure 1

Dual relaxation with subtours

This optimal solution can be transformed into a tour as shown in the figure below. The value of the objective function of the solution in this example is 2,744 and it is an upper bound for the optimal solution (Fig. 2). By using the Lin-Kernighan heuristic [4] we can obtain an even better upper bound 2,726. The optimal value of the dual heuristic is 2,426. Hence, the length of an optimal tour is between these two values.

Fig. 2
figure 2

Transformation into a primal feasible solution

2 Computing an Upper Bound

In this paper we introduce a new relaxation \(A'\) of the TSP. Due to inequality 6 and 7 every node must have at least one adjacent edge and at most two adjacent edges. Equation 8 assures that there are exactly \(n-1\) edges.

$$\begin{aligned} A': \qquad \qquad \mathrm{min} \sum _{i=1}^n \sum _{j=1}^n c_{ij} x_{ij}&\end{aligned}$$
(5)
$$\begin{aligned} s.t.&\nonumber \\ \sum _{i=1}^n x_{ij} + \sum _{i=1}^n x_{ji} \le 2 \qquad 1\le j \le n&\end{aligned}$$
(6)
$$\begin{aligned} \sum _{i=1}^n x_{ij} + \sum _{i=1}^n x_{ji} \ge 1 \qquad 1\le j \le n&\end{aligned}$$
(7)
$$\begin{aligned} \sum _{i=1}^n \sum _{j=1}^n x_{ij} = n - 1&\end{aligned}$$
(8)
$$\begin{aligned} M, \text {a set of valid TSP constraints}&\end{aligned}$$
(9)
$$\begin{aligned} 0 \le x_{ij} \le 1 \qquad 1 \le i,j \le n&\end{aligned}$$
(10)

The set M may consist of some valid TSP constraints which do not contradict constraint 8. For example, M could be chosen as a set of subtour elimination constraints. We have tested our approach with \( M = \{x: x_{ij} + x_{ji} \le 1, 1 \le i,j \le n\}\) to avoid 2-cycles. An optimal solution (objective value 2,624) for this problem is shown in Fig. 3. If we delete the constraints of type 9 then the resulting problem \(A^{*}\) is comparable to an assignment problem where only \(n-1\) nodes are assigned. In [2] the first author analyzed the bipartite weighted matching problem with respect to slightly changed problems of the original problem. In one type of problem two nodes are deleted in the bipartite graph (one at each partition). The solution is of course a complete matching (an assignment) with \(n-1\) edges and therefore also a solution for \(A^{*}\) which can be computed in \(O(n^3)\).

Fig. 3
figure 3

Relaxation with \(n-1\) edges

Let \(f(A')\) be the objective value of the above problem \(A'\). OPT denotes the optimal solution of the original TSP and \(P'\) is any primal feasible solution. Then, of course we have

$$\begin{aligned} f(A') \le { OPT} \le f(P') \end{aligned}$$
(11)

Theorem 1

\(f(P') - f(A')\) is an upper bound for the longest edge in an optimal solution of the TSP.

Proof

For any primal feasible solution P with objective value \(f(P) \le f(P') \) we claim:

If \((i,j)\) is the longest edge in P then \(c_{ij} \le f(P') - f(A')\).

Suppose \(f(P') - c_{ij} < f(A') \) then \(P \setminus \{(i,j)\}\) is a feasible solution for problem \(A'\) with objective value \(f(P) - c_{ij}\). Hence, \(f(P) - c_{ij} \le f(P') - c_{ij} < f(A') \) by our assumption. However, \(f(A')\) was optimal and therefore we have a contradiction. This means that all edges longer than \(f(P') - f(A')\) can not be in a better solution than \(P'\), in particular all these edges can not be in an optimal solution. \(\square \)

In our example our best primal solution was 2,726 and the objective value of \(A'\) is 2,624. Therefore the difference 102 of these values is an upper bound for the longest edge in an optimal solution. This improves the value 2,352 computed via \(A^{*}\). In Fig. 4 the edge \((a,b)\) has length 104 and therefore this edge can not be in an optimal solution. All in all 3,542 edges (or 83 %) are longer than the computed bound and can be deleted.

Fig. 4
figure 4

Example for an edge to be deleted

Remark 1

There are TSP instances where the longest edge of the problem is in an optimal solution.

If all cities are on a semicircular then the longest edge (the diameter of the circular) is of course in the optimal solution. In this case our bound is useless.

3 Computational Results

We have analyzed our approach with several instances in TSPLIB [5] where our set \(M\) was chosen to be \(M = \{x: x_{ij} + x_{ji} \le 1, 1 \le i,j \le n\}\). The first two columns denote the name and size of the problem.

Table 1 Upper bounds for longest edge

The entries of the column “length” are the computed upper bounds for the respective instances. In the last two columns the number of edges longer than this bound and their percentage is given. This means for example for the drilling problem in instance a280 that 42  % of all edges are too long to be in an optimal solution. In all instances, computing the euclidian distances from the problem data takes more time than the computation of the LP-solution of \(A'\). All primal feasible solutions were produced by the Lin-Kernighan heuristic [4].

Our computed bounds may be helpful computationally as they lead to potentially much sparser graphs to be considered in various algorithms.