Abstract
Given a weighted undirected graph \(G = (V,E,d)\) with \(d : E \rightarrow {\mathbb{Q}}_{+}\) and a positive integer K, the distance geometry problem (DGP) asks to find an embedding \(x : V \rightarrow {\mathbb{R}}^{K}\) of G such that for each edge \(\{i,j\}\) we have \(\|{x}_{i} - {x}_{j}\| = {d}_{ij}\). Saxe proved in 1979 that the DGP is NP-complete with K = 1 and doubted the applicability of the Turing machine model to the case with K > 1, because the certificates for YES instances might involve real numbers. This chapter is an account of an unfortunately failed attempt to prove that the DGP is in NP for K = 2. We hope that our failure will motivate further work on the question.
Access provided by Autonomous University of Puebla. Download chapter PDF
Similar content being viewed by others
Keywords
- Distance Geometry Problem (DGP)
- Molecular Distance Geometry Problem (MDGP)
- Wireless Sensor Network Localization (WSNL)
- Meaningful Strings
- Polynomial System Representation
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
1 5.1 Introduction
Consider the following decision problem.
Distance Geometry Problem (DGP). Given a weighted undirected graph G = (V, E, d), where \(d : E \rightarrow \mathbb{F}\), and a positive integer K, establish whether there exists an embedding \(x : V \rightarrow {\mathbb{R}}^{K}\) such that
where F is a set of nonnegative numbers, which, for the purposes of this chapter, we assume to be either integers \(\mathbb{N}\) or rationals \({\mathbb{Q}}_{+}\). We denote explicit dependence of the DGP on K by DGP\({}_{K}\).
The DGP is NP-hard, but even when \(\mathbb{F} = \mathbb{N}\) it is not known, whenever K > 1, whether it is in NP or not. Trying to prove that the DGP is in NP would involve finding a polynomial size representation for the solutions of a polynomial system of equations of degree two. Disproving the statement would probably be much more difficult. This chapter relates a possible proof technique for showing that DGP \(\in \) NP and the corresponding failure, in the hope of enticing new efforts on this topic.
1.1 5.1.1 Applications
In the Molecular Distance Geometry Problem (MDGP), G is a molecule graph where the E is the set of known interatomic distances and K = 3. Since the function of molecules depends strongly on their spatial configuration, finding an embedding of V in \({\mathbb{R}}^{3}\) is of practical interest [11, 13]. A distinguishing property is that because of the experimental techniques involved, most distances are bounded above by 6Å, so that the resulting graph is 3D generalization of a Unit Disk Graph (UDG) [1].
Wireless Sensor Network Localization (WSNL) aims to embed a wireless sensor network in \({\mathbb{R}}^{2}\) (so K = 2). Pairs of sensors can estimate their distance by measuring the power used for a two-way communication. Since sensor networks always include a wired backbone (allowing the link between the sensor network and the external world) and the position of the wired backbone components is usually known, the distinguishing mathematical property of the WSNL is that a partial embedding \(x{^\prime} : U \rightarrow {\mathbb{Q}}^{2}\) is known in advance, where \(U \subseteq V\) is the set of wired backbone components, called anchors in the WSNL literature [4, 21]. Again, because wireless communication can only occur below a certain distance threshold, the resulting graph is a UDG.
Lines of forces acting on static physical structure (such as a building) define a graph. If the forces sum to zero, then the structure stands. Starting from such basic definitions, a theory of bar-and-joint structures has been developed ever since the XVIII century [3, 10, 15, 18, 24]. This involves embeddings of the graph where joints are vertices and bars (with their lengths) are weighted edges; the zero sum force requirement holds if a given embedding is an isolated point in embedding space. More recently, the interest was shifted towards graphs whose topology itself guarantees that all (or almost all) embeddings are isolated points. Such graphs are termed rigid [6, 20].
Graph Drawing (GD) is a discipline studying algorithms for drawing graphs. The embedding might be defined for any \(K \geq 1\), but of course only projections in 2D and 3D are actually represented visually. See http://www.graphdrawing.org for more information.
1.2 5.1.2 Complexity
Saxe [19] proved in 1979 that DGP\({}_{1}\) with \(\mathbb{F} = \mathbb{N}\) is NP-complete by means of a reduction from Subset-Sum [5]. It is in NP because a given embedding x can be verified to satisfy (5.1) in polynomial time. Furthermore, an instance \(\{{a}_{1},\ldots ,{a}_{n}\} \in {\mathbb{Z}}^{n}\) of Subset-Sum can be suitably reduced to the instance G = (V, E, d) with \(V =\{ {v}_{0},\ldots ,{v}_{n-1}\}\), \(E =\{\{ {v}_{i},{v}_{i+1\!\!\kern 18pt {\rm mod}\,\,n}\}\;\vert \;i < n\}\), \({d}_{i,i+1\!\!\kern 18pt {\rm mod}\,\,n} = {a}_{i}\) for all i < n.
For what concerns K > 1, in [19], Sect. 5, Saxe writes,
NP-Completeness is defined for language recognition problems on Turing Machines, which inherently can deal only with integers and not with arbitrary reals.
Given a “random” embedding of an unweighted graph into a Euclidean space, any two of the edge weights induced by the embedding will be incommensurable with probability 1. Moreover, if the graph is overconstrained and the dimension of the space is at least two, then rounding the induced edge-weights to multiples of some small distance will almost always produce a weighted graph that is not embeddable in space.
The DGP contains the DGP\({}_{1}\), which is NP-complete, but as remarked by Saxe, the DGP itself might not be in NP. Thus, it is commonly stated in the literature that the DGP (and in particular the MDGP and the WSNL) is NP-hard (see, e.g., [4, 9]). By definition [5], a problem is NP-hard when every problem in NP can be reduced to it, independently of whether the problem itself is in NP or not.
In order to show that a decision problem is in NP, we have to perform the following steps:
-
1.
Encoding certificates of YES instances
-
2.
Showing that such certificates can be verified in time which is polynomial in the size of the instance
In the case of the DGP, the certificates are solutions of the system 5.1. Squaring every equation of the system yields
System 5.2 has the same set of solutions as Eq. 5.1, since d always takes nonnegative values. Notice, however, that Eq. 5.2 is a polynomial system: as such, its solutions \(x = (({x}_{11},\ldots ,{x}_{1K},),\ldots ,({x}_{n1},\ldots ,{x}_{nK}))\) always have algebraic components.
2 5.2 Representations of Algebraic Numbers
It is well known that some algebraic numbers over \(\mathbb{Q}\) can be written as mathematical expressions involving integers and elementary operators such as sum, subtraction, product, fraction, and k-root. Let us call \(\mathcal{O}\) the set of operator symbols \(+,\times ,\div ,\root{k}\of{}\). The statement \(\mbox{ DGP} \in \mathbf{<Emphasis Type="Bold">\text{ NP}</Emphasis>}\) is equivalent to stating that all components of an embedding solving the instance can always be written as meaningful strings of symbols in \(\mathbb{Z}\) and \(\mathcal{O}\), the size of which is bound by a polynomial in the instance size. Not all algebraic numbers, however, can be written this way: specifically, this is the case if and only if the Galois group of the minimal polynomial of the algebraic number in question is soluble [22]. What about those algebraic numbers that do not satisfy this requirement?
If \(\alpha \) is a root of a polynomial \(p(x)\) over \(\mathbb{Q}\) whose Galois group is not soluble, then it cannot be expressed using symbols in \(\mathbb{Z} \cup \mathcal{O}\) alone. What one can do, however, is to adjoin other algebraic numbers in \(B =\{ {\beta }_{1},\ldots ,{\beta }_{h}\}\) to \(\mathbb{Q}\), obtaining other fields \(F = \mathbb{Q}[{\beta }_{1},\ldots ,{\beta }_{h}]\), until the minimal polynomial of \(\alpha \) over F has a soluble Galois group. This process terminates: it suffices to adjoin all the roots of p(x). Symbolic algebra packages such as Maple [14] attempt to find smallest h such that the Galois group of p(x) over F is soluble. Then \(\alpha \) can be expressed by meaningful strings of symbols in \(\mathbb{Z} \cup B \cup \mathcal{O}\).
Example 5.1.
Asking Maple to solve
yields the solution \(x = {\alpha }^{2} + \alpha \), \(y = \alpha \), where \(\alpha \) is a root of the polynomial \((x + 1)({x}^{8} + 3{x}^{7} + 3{x}^{6} + {x}^{5} + 1)\). The Galois group of \({x}^{8} + 3{x}^{7} + 3{x}^{6} + {x}^{5} + 1\) is S 8, the full symmetric group over 8 elements, and S 8 is not soluble.
2.1 5.2.1 Polynomial System Representation
Each algebraic number \(\alpha \in \mathbb{A}\) can be associated with a polynomial \({p}_{\alpha } \in \mathbb{Q}[x]\) such that \(p(\alpha ) = 0\) and a rational \(\bar{\alpha } \in \mathbb{Q}\) which is closest to \(\alpha \) than to the other roots of \({p}_{\alpha }\).
Example 5.2.
For \(\alpha = \root{3}\of{\frac{1} {2} + \sqrt{[4]3}}\) we might choose its minimal polynomial over \(\mathbb{Q}\), \({p}_{\alpha }(x) = {x}^{12} - 2{x}^{9} + \frac{3} {2}{x}^{6} -\frac{1} {2}{x}^{3} -\frac{47} {16}\), and set \(\bar{\alpha } = 2\), which is closest to \(\alpha \) than to the other real root of \({p}_{\alpha }\).
As mentioned above, embeddings can be seen as sequences of algebraic numbers. Any sequence S of \(\mathcal{l}\) algebraic numbers can be associated with a multivariate polynomial system \({\mathbf{p}}_{S} \in \mathbb{Z}[{x}_{1},\ldots ,{x}_{\mathcal{l}}]\) such that \({\mathbf{p}}_{S}(S) = 0\), together with a rational vector \(q \in {\mathbb{Q}}^{\mathcal{l}}\) such that \(\|S - {q\|}_{2}\) is smallest.
2.2 5.2.2 Formal Grammar Representation
The “meaningful strings” mentioned above, used to represent algebraic numbers in a field F = Q[B] where \(B =\{ {\beta }_{1},\ldots ,{\beta }_{h}\}\), are generated by the formal grammar:
where, with a slight abuse of notation, we use \(\mathbb{A}, \mathbb{Z}\) to denote the type of algebraic and integer numbers. Given a string consisting of symbols in \(\mathbb{Z} \cup B \cup \mathcal{O}\), the string is meaningful if it matches the pattern given by the grammar. The algorithm that matches strings to grammars [12] is recursive in nature and yields a grammar derivation trees [16]. Each algebraic number in \(\mathbb{A}\) can be represented with respect to B by its corresponding derivation tree.
Example 5.3.
The algebraic number \(\alpha = \root{3}\of{\frac{1} {2} + \root{4}\of{3}}\) yields the grammar derivation tree shown in Fig. 5.1.
3 5.3 The Gröbner Bases Strategy
We restrict our attention to K = 2 and propose to pursue a line of argument showing that DGP2 ∈ { NP}. It is well known that any multivariate polynomial system of equations such as Eq. 5.2 can be reduced to a “triangular form” by employing Gröbner bases and the Buchberger algorithm [2] (a clear and short introduction to these concepts can be found in [8]). We represent an embedding \(x : V \rightarrow {\mathbb{R}}^{2}\) solving a DGP2 instance as the sequence \(({x}_{11},{x}_{12},{x}_{21},{x}_{22},\ldots ,{x}_{n1},{x}_{n2})\).
Example 5.4.
Consider the right-angled triangle with smallest possible integer side lengths (3, 4, 5) in \({\mathbb{R}}^{2}\) delimited by x 1 = (0, 0), \({x}_{2} = (3,0)\), x 3 = (0, 4). System 5.2 is:
The above system describes all (3, 4, 5)-sided triangles in \({\mathbb{R}}^{2}\). We can fix \({x}_{11} = {x}_{12} = 0\) and \({x}_{21} = 3\) to eliminate rotations and translations. This reduces the system to
A Gröbner basis of the above system (provided by Maple 9.5 [14] with the pure lexicographic term ordering) is given by
It is clear that the Gröbner system has two real solutions given by \({x}_{22} = {x}_{31} = 0\) and \({x}_{32} = \pm 4\), which correspond to two congruent conformations reflected along the 1st coordinate, as shown in Fig. 5.2.
Let the system 5.2 have solution set X, and let x ∈ X. According to Sect. 5.2.1 we can represent x by Eq. 5.2 and a rational vector q ∈ ℚ 2n which is closest to x than any other x′ ∈ X. Because of Gröbner basis theory, it follows that the very same embedding can be represented by any Gröbner basis system derived from Eq. 5.2 and q. The advantage in reducing the original system 5.2 to triangular form is that, by a form of back substitution, we can easily derive the set B referred to in Sect. 5.2, together with the string that describes the components of x.
Showing that the size of a Gröbner basis is bounded by a polynomial in the instance size would be a (substantial) first step toward proving that \({\mbox{ DGP}}_{2} \in \mathbf{<Emphasis Type="Bold">\text{ NP}</Emphasis>}\). Unfortunately, this is false in general: the size of a Gröbner basis grows doubly exponentially. The polynomial system 5.2, however, has a very special structure, which—one might hope—could provide an exception. The rest of this section will introduce an infinite class of DGP instances which provide empirical evidence to the contrary. This is, of course, not a conclusive statement.
3.1 5.3.1 The Empirical Evidence Against
In this section we construct an infinite class of graphs embedded in ℝ 2 which have a Gröbner basis whose size, obtained computationally for a few cases, indicates an exponential growth in the instance size. The graph class consists of a chain of triangles sharing a side: \(V =\{ 1,\ldots ,n\}\) (with n ≥ 3), and \(E =\{\{ v - 2,v\},\{v - 1,v\}\;\vert \;v > 2\}\). The weight function \(d : E \rightarrow {\mathbb{Q}}_{+}\) is such that \({d}_{uv} = \frac{1} {u}\) for all {u, v} such that u < v. Examples with n = 10 and n = 20 are given, respectively, in Figs. 5.3 and 5.4.
These triangle chains embedded in \({\mathbb{R}}^{2}\) provide rigid frameworks [7] and are examples of of Henneberg type I graphs [23] and of Discretizable DGP (DDGP) instances [17]. Using Maple [14], we were able to show that the dependency of the Gröbner basis size in terms of the instance size looks exponential over a set of triangle chains with n vertices with \(n \in \{ 3,\ldots ,11\}\). More precisely, the number of equations in the Gröbner basis and the size of each equation both seem to grow exponentially (or worse), whereas the degree seems to grow linearly, as shown in Fig. 5.5.
References
Clark, B., Colburn, C., Johnson, D.: Unit disk graph. Discrete Math. 86, 165–177 (1990)
Cox, D., Little, J., O’Shea, D.: Ideals, Varieties and Algorithms, 2nd edn. Springer, Berlin (1997)
Cremona, L.: Le figure reciproche nella statica grafica. In: Bernardoni, G., Milano (1872)
Eren, T., Goldenberg, D., Whiteley, W., Yang, Y., Morse, A., Anderson, B., Belhumeur, P.: Rigidity, computation, and randomization in network localization. In: IEEE Infocom Proceedings, 2673–2684 (2004)
Garey, M., Johnson, D.: Computers and Intractability: A Guide to the Theory of NP-Completeness. Freeman and Company, New York (1979)
Graver, J.: Rigidity matroids. SIAM J. Discrete Math. 4, 355–368 (1991)
Graver, J., Servatius, B., Servatius, H.: Combinatorial rigidity. Am. Math. Soc. (1993) http://books.google.com.pe/books/about/Combinatorial_Rigidity.html?id=0XwvY1GVNN4C
Hägglöf, K., Lindberg, P., Svensson, L.: Computing global minima to polynomial optimization problems using gröbner bases. J. Global Optim. 7(2), 115–125 (1995)
Hendrickson, B.: The molecule problem: exploiting structure in global optimization. SIAM J. Optim. 5, 835–857 (1995)
Henneberg, L.: Die Graphische Statik der starren Systeme. Teubner, Leipzig (1911)
Lavor, C., Liberti, L., Maculan, N., Mucherino, A.: Recent advances on the discretizable molecular distance geometry problem. Eur. J. Oper. Res. 219, 698–706 (2012)
Levine, R., Mason, T., Brown, D.: Lex and Yacc, 2nd edn. O’Reilly, Cambridge (1995)
Liberti, L., Lavor, C., Mucherino, A., Maculan, N.: Molecular distance geometry methods: from continuous to discrete. Int. Trans. Oper. Res.18, 33–51 (2010)
Maplesoft, Inc.: Maple 9 Getting Started Guide. Maplesoft, Waterloo (2003) http://www.maplesoft.com/products/maple/manuals/GettingStartedGuide.pdf
Maxwell, J.: On the calculation of the equilibrium and stiffness of frames. Phil. Mag. 27(182), 294–299 (1864)
Mosses, P.: Denotational semantics, In: van Leeuwen, J. (ed.) Handbook of Theoretical Computer Science B: Formal Models and Semantics, pp. 575–631. Elsevier, Amsterdam (1990)
Mucherino, A., Lavor, C., Liberti, L.: The discretizable distance geometry problem. Optimization Letters, Springer: 6(8), 1671–1686 (2012)
Saviotti, C.: Nouvelles méthodes pour le calcul des travures réticulaires In: Appendix to Cremona, L., “Les figures réciproques en statique graphique”, pp. 37–100. Gauthier-Villars, Paris (1885)
Saxe, J.: Embeddability of weighted graphs in k-space is strongly NP-hard. In: Proceedings of 17th Allerton Conference in Communications, Control and Computing, pp. 480–489 (1979)
Servatius, B., Servatius, H.: Generic and abstract rigidity, In: Thorpe, M., Duxbury, P. (eds.) Rigidity Theory and Applications, Fundamental Materials Research, pp. 1–19. Springer, New York (2002) DOI: 10.1007/0-306-47089-6_1
So, A.M.C., Ye, Y.: Theory of semidefinite programming for sensor network localization. Math. Program. B 109, 367–384 (2007)
Stewart, I.: Galois Theory, 2nd edn. Chapman and Hall, London (1989)
Tay, T.S., Whiteley, W.: Generating isostatic frameworks. Structural Topology 11, 21–69 (1985)
Varignon, P.: Nouvelle Mecanique. Claude Jombert, Paris (1725)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer Science+Business Media New York
About this chapter
Cite this chapter
Beeker, N., Gaubert, S., Glusa, C., Liberti, L. (2013). Is the Distance Geometry Problem in NP?. In: Mucherino, A., Lavor, C., Liberti, L., Maculan, N. (eds) Distance Geometry. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-5128-0_5
Download citation
DOI: https://doi.org/10.1007/978-1-4614-5128-0_5
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-5127-3
Online ISBN: 978-1-4614-5128-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)