Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 5.1 Introduction

Consider the following decision problem.

Distance Geometry Problem (DGP). Given a weighted undirected graph G = (V, E, d), where \(d : E \rightarrow \mathbb{F}\), and a positive integer K, establish whether there exists an embedding \(x : V \rightarrow {\mathbb{R}}^{K}\) such that

$$\forall \{i,j\} \in E\quad \|{x}_{i} - {x}_{j}\| = {d}_{\mathit{ij}},$$
(5.1)

where F is a set of nonnegative numbers, which, for the purposes of this chapter, we assume to be either integers \(\mathbb{N}\) or rationals \({\mathbb{Q}}_{+}\). We denote explicit dependence of the DGP on K by DGP\({}_{K}\).

The DGP is NP-hard, but even when \(\mathbb{F} = \mathbb{N}\) it is not known, whenever K > 1, whether it is in NP or not. Trying to prove that the DGP is in NP would involve finding a polynomial size representation for the solutions of a polynomial system of equations of degree two. Disproving the statement would probably be much more difficult. This chapter relates a possible proof technique for showing that DGP \(\in \) NP and the corresponding failure, in the hope of enticing new efforts on this topic.

1.1 5.1.1 Applications

In the Molecular Distance Geometry Problem (MDGP), G is a molecule graph where the E is the set of known interatomic distances and K = 3. Since the function of molecules depends strongly on their spatial configuration, finding an embedding of V in \({\mathbb{R}}^{3}\) is of practical interest [11, 13]. A distinguishing property is that because of the experimental techniques involved, most distances are bounded above by 6Å, so that the resulting graph is 3D generalization of a Unit Disk Graph (UDG) [1].

Wireless Sensor Network Localization (WSNL) aims to embed a wireless sensor network in \({\mathbb{R}}^{2}\) (so K = 2). Pairs of sensors can estimate their distance by measuring the power used for a two-way communication. Since sensor networks always include a wired backbone (allowing the link between the sensor network and the external world) and the position of the wired backbone components is usually known, the distinguishing mathematical property of the WSNL is that a partial embedding \(x{^\prime} : U \rightarrow {\mathbb{Q}}^{2}\) is known in advance, where \(U \subseteq V\) is the set of wired backbone components, called anchors in the WSNL literature [4, 21]. Again, because wireless communication can only occur below a certain distance threshold, the resulting graph is a UDG.

Lines of forces acting on static physical structure (such as a building) define a graph. If the forces sum to zero, then the structure stands. Starting from such basic definitions, a theory of bar-and-joint structures has been developed ever since the XVIII century [3, 10, 15, 18, 24]. This involves embeddings of the graph where joints are vertices and bars (with their lengths) are weighted edges; the zero sum force requirement holds if a given embedding is an isolated point in embedding space. More recently, the interest was shifted towards graphs whose topology itself guarantees that all (or almost all) embeddings are isolated points. Such graphs are termed rigid [6, 20].

Graph Drawing (GD) is a discipline studying algorithms for drawing graphs. The embedding might be defined for any \(K \geq 1\), but of course only projections in 2D and 3D are actually represented visually. See http://www.graphdrawing.org for more information.

1.2 5.1.2 Complexity

Saxe [19] proved in 1979 that DGP\({}_{1}\) with \(\mathbb{F} = \mathbb{N}\) is NP-complete by means of a reduction from Subset-Sum [5]. It is in NP because a given embedding x can be verified to satisfy (5.1) in polynomial time. Furthermore, an instance \(\{{a}_{1},\ldots ,{a}_{n}\} \in {\mathbb{Z}}^{n}\) of Subset-Sum can be suitably reduced to the instance G = (V, E, d) with \(V =\{ {v}_{0},\ldots ,{v}_{n-1}\}\), \(E =\{\{ {v}_{i},{v}_{i+1\!\!\kern 18pt {\rm mod}\,\,n}\}\;\vert \;i < n\}\), \({d}_{i,i+1\!\!\kern 18pt {\rm mod}\,\,n} = {a}_{i}\) for all i < n.

For what concerns K > 1, in [19], Sect. 5, Saxe writes,

  • NP-Completeness is defined for language recognition problems on Turing Machines, which inherently can deal only with integers and not with arbitrary reals.

  • Given a “random” embedding of an unweighted graph into a Euclidean space, any two of the edge weights induced by the embedding will be incommensurable with probability 1. Moreover, if the graph is overconstrained and the dimension of the space is at least two, then rounding the induced edge-weights to multiples of some small distance will almost always produce a weighted graph that is not embeddable in space.

The DGP contains the DGP\({}_{1}\), which is NP-complete, but as remarked by Saxe, the DGP itself might not be in NP. Thus, it is commonly stated in the literature that the DGP (and in particular the MDGP and the WSNL) is NP-hard (see, e.g., [4, 9]). By definition [5], a problem is NP-hard when every problem in NP can be reduced to it, independently of whether the problem itself is in NP or not.

In order to show that a decision problem is in NP, we have to perform the following steps:

  1. 1.

    Encoding certificates of YES instances

  2. 2.

    Showing that such certificates can be verified in time which is polynomial in the size of the instance

In the case of the DGP, the certificates are solutions of the system 5.1. Squaring every equation of the system yields

$$\forall \{i,j\} \in E\quad \|{x}_{i} - {x{}_{j}\|}^{2} = {d}_{ ij}^{2}.$$
(5.2)

System 5.2 has the same set of solutions as Eq. 5.1, since d always takes nonnegative values. Notice, however, that Eq. 5.2 is a polynomial system: as such, its solutions \(x = (({x}_{11},\ldots ,{x}_{1K},),\ldots ,({x}_{n1},\ldots ,{x}_{nK}))\) always have algebraic components.

2 5.2 Representations of Algebraic Numbers

It is well known that some algebraic numbers over \(\mathbb{Q}\) can be written as mathematical expressions involving integers and elementary operators such as sum, subtraction, product, fraction, and k-root. Let us call \(\mathcal{O}\) the set of operator symbols \(+,\times ,\div ,\root{k}\of{}\). The statement \(\mbox{ DGP} \in \mathbf{<Emphasis Type="Bold">\text{ NP}</Emphasis>}\) is equivalent to stating that all components of an embedding solving the instance can always be written as meaningful strings of symbols in \(\mathbb{Z}\) and \(\mathcal{O}\), the size of which is bound by a polynomial in the instance size. Not all algebraic numbers, however, can be written this way: specifically, this is the case if and only if the Galois group of the minimal polynomial of the algebraic number in question is soluble [22]. What about those algebraic numbers that do not satisfy this requirement?

If \(\alpha \) is a root of a polynomial \(p(x)\) over \(\mathbb{Q}\) whose Galois group is not soluble, then it cannot be expressed using symbols in \(\mathbb{Z} \cup \mathcal{O}\) alone. What one can do, however, is to adjoin other algebraic numbers in \(B =\{ {\beta }_{1},\ldots ,{\beta }_{h}\}\) to \(\mathbb{Q}\), obtaining other fields \(F = \mathbb{Q}[{\beta }_{1},\ldots ,{\beta }_{h}]\), until the minimal polynomial of \(\alpha \) over F has a soluble Galois group. This process terminates: it suffices to adjoin all the roots of p(x). Symbolic algebra packages such as Maple [14] attempt to find smallest h such that the Galois group of p(x) over F is soluble. Then \(\alpha \) can be expressed by meaningful strings of symbols in \(\mathbb{Z} \cup B \cup \mathcal{O}\).

Example 5.1.

Asking Maple to solve

$$\begin{array}{rcl} {x}^{5} + y + 1& =& 0 \\ {y}^{2} + y - x& =& \end{array}$$
(0)

yields the solution \(x = {\alpha }^{2} + \alpha \), \(y = \alpha \), where \(\alpha \) is a root of the polynomial \((x + 1)({x}^{8} + 3{x}^{7} + 3{x}^{6} + {x}^{5} + 1)\). The Galois group of \({x}^{8} + 3{x}^{7} + 3{x}^{6} + {x}^{5} + 1\) is S 8, the full symmetric group over 8 elements, and S 8 is not soluble.

2.1 5.2.1 Polynomial System Representation

Each algebraic number \(\alpha \in \mathbb{A}\) can be associated with a polynomial \({p}_{\alpha } \in \mathbb{Q}[x]\) such that \(p(\alpha ) = 0\) and a rational \(\bar{\alpha } \in \mathbb{Q}\) which is closest to \(\alpha \) than to the other roots of \({p}_{\alpha }\).

Example 5.2.

For \(\alpha = \root{3}\of{\frac{1} {2} + \sqrt{[4]3}}\) we might choose its minimal polynomial over \(\mathbb{Q}\), \({p}_{\alpha }(x) = {x}^{12} - 2{x}^{9} + \frac{3} {2}{x}^{6} -\frac{1} {2}{x}^{3} -\frac{47} {16}\), and set \(\bar{\alpha } = 2\), which is closest to \(\alpha \) than to the other real root of \({p}_{\alpha }\).

As mentioned above, embeddings can be seen as sequences of algebraic numbers. Any sequence S of \(\mathcal{l}\) algebraic numbers can be associated with a multivariate polynomial system \({\mathbf{p}}_{S} \in \mathbb{Z}[{x}_{1},\ldots ,{x}_{\mathcal{l}}]\) such that \({\mathbf{p}}_{S}(S) = 0\), together with a rational vector \(q \in {\mathbb{Q}}^{\mathcal{l}}\) such that \(\|S - {q\|}_{2}\) is smallest.

2.2 5.2.2 Formal Grammar Representation

The “meaningful strings” mentioned above, used to represent algebraic numbers in a field F = Q[B] where \(B =\{ {\beta }_{1},\ldots ,{\beta }_{h}\}\), are generated by the formal grammar:

$$\mathbb{A}\rightarrow (\mathbb{A} + \mathbb{A}) \vee (\mathbb{A} \times \mathbb{A}) \vee (\mathbb{A} \div \mathbb{A}) \vee (\root{\mathbb{Z}}\of{\mathbb{A}}) \vee (\mathbb{Z}) \vee (B)$$

where, with a slight abuse of notation, we use \(\mathbb{A}, \mathbb{Z}\) to denote the type of algebraic and integer numbers. Given a string consisting of symbols in \(\mathbb{Z} \cup B \cup \mathcal{O}\), the string is meaningful if it matches the pattern given by the grammar. The algorithm that matches strings to grammars [12] is recursive in nature and yields a grammar derivation trees [16]. Each algebraic number in \(\mathbb{A}\) can be represented with respect to B by its corresponding derivation tree.

Example 5.3.

The algebraic number \(\alpha = \root{3}\of{\frac{1} {2} + \root{4}\of{3}}\) yields the grammar derivation tree shown in Fig. 5.1.

Fig. 5.1
figure 1

The derivation tree for \(\alpha \) according to the algebraic grammar

3 5.3 The Gröbner Bases Strategy

We restrict our attention to K = 2 and propose to pursue a line of argument showing that DGP2 ∈ { NP}. It is well known that any multivariate polynomial system of equations such as Eq. 5.2 can be reduced to a “triangular form” by employing Gröbner bases and the Buchberger algorithm [2] (a clear and short introduction to these concepts can be found in [8]). We represent an embedding \(x : V \rightarrow {\mathbb{R}}^{2}\) solving a DGP2 instance as the sequence \(({x}_{11},{x}_{12},{x}_{21},{x}_{22},\ldots ,{x}_{n1},{x}_{n2})\).

Example 5.4.

Consider the right-angled triangle with smallest possible integer side lengths (3, 4, 5) in \({\mathbb{R}}^{2}\) delimited by x 1 = (0, 0), \({x}_{2} = (3,0)\), x 3 = (0, 4). System 5.2 is:

$$\begin{array}{rcl}{ ({x}_{11} - {x}_{21})}^{2} + {({x}_{ 12} - {x}_{22})}^{2}& =& 9 \\ {({x}_{11} - {x}_{31})}^{2} + {({x}_{ 12} - {x}_{32})}^{2}& =& 16 \\ {({x}_{21} - {x}_{31})}^{2} + {({x}_{ 22} - {x}_{32})}^{2}& =& \end{array}$$
(25.)

The above system describes all (3, 4, 5)-sided triangles in \({\mathbb{R}}^{2}\). We can fix \({x}_{11} = {x}_{12} = 0\) and \({x}_{21} = 3\) to eliminate rotations and translations. This reduces the system to

$$\begin{array}{rcl} {3}^{2} + {x}_{ 22}^{2}& =& 9 \\ {x}_{31}^{2} + {x}_{ 32}^{2}& =& 16 \\ {(3 - {x}_{31})}^{2} + {({x}_{ 22} - {x}_{32})}^{2}& =& \end{array}$$
(25.)

A Gröbner basis of the above system (provided by Maple 9.5 [14] with the pure lexicographic term ordering) is given by

$$\begin{array}{rcl} {x}_{31}^{2}& =& 0 \\ {x}_{32}^{2}& =& 16 \\ 16{x}_{22} + 3{x}_{31}{x}_{32}& =& \end{array}$$
(0.)

It is clear that the Gröbner system has two real solutions given by \({x}_{22} = {x}_{31} = 0\) and \({x}_{32} = \pm 4\), which correspond to two congruent conformations reflected along the 1st coordinate, as shown in Fig. 5.2.

Fig. 5.2
figure 2

The two configurations given by the Gröbner system in Example 5.4

Let the system 5.2 have solution set X, and let x ∈ X. According to Sect. 5.2.1 we can represent x by Eq. 5.2 and a rational vector q ∈  2n which is closest to x than any other x′ ∈ X. Because of Gröbner basis theory, it follows that the very same embedding can be represented by any Gröbner basis system derived from Eq. 5.2 and q. The advantage in reducing the original system 5.2 to triangular form is that, by a form of back substitution, we can easily derive the set B referred to in Sect. 5.2, together with the string that describes the components of x.

Fig. 5.3
figure 3

A triangle chain of size 10 embedded in \({\mathbb{R}}^{2}\)

Showing that the size of a Gröbner basis is bounded by a polynomial in the instance size would be a (substantial) first step toward proving that \({\mbox{ DGP}}_{2} \in \mathbf{<Emphasis Type="Bold">\text{ NP}</Emphasis>}\). Unfortunately, this is false in general: the size of a Gröbner basis grows doubly exponentially. The polynomial system 5.2, however, has a very special structure, which—one might hope—could provide an exception. The rest of this section will introduce an infinite class of DGP instances which provide empirical evidence to the contrary. This is, of course, not a conclusive statement.

3.1 5.3.1 The Empirical Evidence Against

In this section we construct an infinite class of graphs embedded in 2 which have a Gröbner basis whose size, obtained computationally for a few cases, indicates an exponential growth in the instance size. The graph class consists of a chain of triangles sharing a side: \(V =\{ 1,\ldots ,n\}\) (with n ≥ 3), and \(E =\{\{ v - 2,v\},\{v - 1,v\}\;\vert \;v > 2\}\). The weight function \(d : E \rightarrow {\mathbb{Q}}_{+}\) is such that \({d}_{uv} = \frac{1} {u}\) for all {u, v} such that u < v. Examples with n = 10 and n = 20 are given, respectively, in Figs. 5.3 and 5.4.

These triangle chains embedded in \({\mathbb{R}}^{2}\) provide rigid frameworks [7] and are examples of of Henneberg type I graphs [23] and of Discretizable DGP (DDGP) instances [17]. Using Maple [14], we were able to show that the dependency of the Gröbner basis size in terms of the instance size looks exponential over a set of triangle chains with n vertices with \(n \in \{ 3,\ldots ,11\}\). More precisely, the number of equations in the Gröbner basis and the size of each equation both seem to grow exponentially (or worse), whereas the degree seems to grow linearly, as shown in Fig. 5.5.

Fig. 5.4 Fig. 5.5
figure 4figure 4

A triangle chain of size 20 embedded in \({\mathbb{R}}^{2}\)

The growth pattern of the number of equations (left) in the Gröbner basis of triangle chains, the size of each equation (center), and the degree of the system (right)