Is the Distance Geometry Problem in NP?

Beeker, Nathanael; Gaubert, Stéphane; Glusa, Christian; Liberti, Leo

doi:10.1007/978-1-4614-5128-0_5

Nathanael Beeker⁵,
Stéphane Gaubert⁶,
Christian Glusa⁵ &
…
Leo Liberti⁷

2019 Accesses
2 Citations

Abstract

Given a weighted undirected graph $G = (V,E,d)$ with $d : E \rightarrow {\mathbb{Q}}_{+}$ and a positive integer K, the distance geometry problem (DGP) asks to find an embedding $x : V \rightarrow {\mathbb{R}}^{K}$ of G such that for each edge $\{i,j\}$ we have $\|{x}_{i} - {x}_{j}\| = {d}_{ij}$. Saxe proved in 1979 that the DGP is NP-complete with K = 1 and doubted the applicability of the Turing machine model to the case with K > 1, because the certificates for YES instances might involve real numbers. This chapter is an account of an unfortunately failed attempt to prove that the DGP is in NP for K = 2. We hope that our failure will motivate further work on the question.

Access provided by Autonomous University of Puebla. Download chapter PDF

Computing the Metric Dimension by Decomposing Graphs into Extended Biconnected Components

Optimal Discretization Orders for Distance Geometry: A Theoretical Standpoint

The Fractional k-truncated Metric Dimension of Graphs

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 5.1 Introduction

Consider the following decision problem.

Distance Geometry Problem (DGP). Given a weighted undirected graph G = (V, E, d), where $d : E \rightarrow \mathbb{F}$, and a positive integer K, establish whether there exists an embedding $x : V \rightarrow {\mathbb{R}}^{K}$ such that

$$\forall \{i,j\} \in E\quad \|{x}_{i} - {x}_{j}\| = {d}_{\mathit{ij}},$$

(5.1)

where F is a set of nonnegative numbers, which, for the purposes of this chapter, we assume to be either integers $\mathbb{N}$ or rationals ${\mathbb{Q}}_{+}$. We denote explicit dependence of the DGP on K by DGP${}_{K}$.

The DGP is NP-hard, but even when $\mathbb{F} = \mathbb{N}$ it is not known, whenever K > 1, whether it is in NP or not. Trying to prove that the DGP is in NP would involve finding a polynomial size representation for the solutions of a polynomial system of equations of degree two. Disproving the statement would probably be much more difficult. This chapter relates a possible proof technique for showing that DGP $\in $ NP and the corresponding failure, in the hope of enticing new efforts on this topic.

1.1 5.1.1 Applications

In the Molecular Distance Geometry Problem (MDGP), G is a molecule graph where the E is the set of known interatomic distances and K = 3. Since the function of molecules depends strongly on their spatial configuration, finding an embedding of V in ${\mathbb{R}}^{3}$ is of practical interest [11, 13]. A distinguishing property is that because of the experimental techniques involved, most distances are bounded above by 6Å, so that the resulting graph is 3D generalization of a Unit Disk Graph (UDG) [1].

Wireless Sensor Network Localization (WSNL) aims to embed a wireless sensor network in ${\mathbb{R}}^{2}$ (so K = 2). Pairs of sensors can estimate their distance by measuring the power used for a two-way communication. Since sensor networks always include a wired backbone (allowing the link between the sensor network and the external world) and the position of the wired backbone components is usually known, the distinguishing mathematical property of the WSNL is that a partial embedding $x{^\prime} : U \rightarrow {\mathbb{Q}}^{2}$ is known in advance, where $U \subseteq V$ is the set of wired backbone components, called anchors in the WSNL literature [4, 21]. Again, because wireless communication can only occur below a certain distance threshold, the resulting graph is a UDG.

Lines of forces acting on static physical structure (such as a building) define a graph. If the forces sum to zero, then the structure stands. Starting from such basic definitions, a theory of bar-and-joint structures has been developed ever since the XVIII century [3, 10, 15, 18, 24]. This involves embeddings of the graph where joints are vertices and bars (with their lengths) are weighted edges; the zero sum force requirement holds if a given embedding is an isolated point in embedding space. More recently, the interest was shifted towards graphs whose topology itself guarantees that all (or almost all) embeddings are isolated points. Such graphs are termed rigid [6, 20].

Graph Drawing (GD) is a discipline studying algorithms for drawing graphs. The embedding might be defined for any $K \geq 1$, but of course only projections in 2D and 3D are actually represented visually. See http://www.graphdrawing.org for more information.

1.2 5.1.2 Complexity

Saxe [19] proved in 1979 that DGP${}_{1}$ with $\mathbb{F} = \mathbb{N}$ is NP-complete by means of a reduction from Subset-Sum [5]. It is in NP because a given embedding x can be verified to satisfy (5.1) in polynomial time. Furthermore, an instance $\{{a}_{1},\ldots ,{a}_{n}\} \in {\mathbb{Z}}^{n}$ of Subset-Sum can be suitably reduced to the instance G = (V, E, d) with $V =\{ {v}_{0},\ldots ,{v}_{n-1}\}$, $E =\{\{ {v}_{i},{v}_{i+1\!\!\kern 18pt {\rm mod}\,\,n}\}\;\vert \;i < n\}$, ${d}_{i,i+1\!\!\kern 18pt {\rm mod}\,\,n} = {a}_{i}$ for all i < n.

For what concerns K > 1, in [19], Sect. 5, Saxe writes,

NP-Completeness is defined for language recognition problems on Turing Machines, which inherently can deal only with integers and not with arbitrary reals.

Given a “random” embedding of an unweighted graph into a Euclidean space, any two of the edge weights induced by the embedding will be incommensurable with probability 1. Moreover, if the graph is overconstrained and the dimension of the space is at least two, then rounding the induced edge-weights to multiples of some small distance will almost always produce a weighted graph that is not embeddable in space.

The DGP contains the DGP${}_{1}$, which is NP-complete, but as remarked by Saxe, the DGP itself might not be in NP. Thus, it is commonly stated in the literature that the DGP (and in particular the MDGP and the WSNL) is NP-hard (see, e.g., [4, 9]). By definition [5], a problem is NP-hard when every problem in NP can be reduced to it, independently of whether the problem itself is in NP or not.

In order to show that a decision problem is in NP, we have to perform the following steps:

1.
Encoding certificates of YES instances
2.
Showing that such certificates can be verified in time which is polynomial in the size of the instance

In the case of the DGP, the certificates are solutions of the system 5.1. Squaring every equation of the system yields

$$\forall \{i,j\} \in E\quad \|{x}_{i} - {x{}_{j}\|}^{2} = {d}_{ ij}^{2}.$$

(5.2)

System 5.2 has the same set of solutions as Eq. 5.1, since d always takes nonnegative values. Notice, however, that Eq. 5.2 is a polynomial system: as such, its solutions $x = (({x}_{11},\ldots ,{x}_{1K},),\ldots ,({x}_{n1},\ldots ,{x}_{nK}))$ always have algebraic components.

2 5.2 Representations of Algebraic Numbers

It is well known that some algebraic numbers over $\mathbb{Q}$ can be written as mathematical expressions involving integers and elementary operators such as sum, subtraction, product, fraction, and k-root. Let us call $\mathcal{O}$ the set of operator symbols $+,\times ,\div ,\root{k}\of{}$. The statement $\mbox{ DGP} \in \mathbf{<Emphasis Type="Bold">\text{ NP}</Emphasis>}$ is equivalent to stating that all components of an embedding solving the instance can always be written as meaningful strings of symbols in $\mathbb{Z}$ and $\mathcal{O}$, the size of which is bound by a polynomial in the instance size. Not all algebraic numbers, however, can be written this way: specifically, this is the case if and only if the Galois group of the minimal polynomial of the algebraic number in question is soluble [22]. What about those algebraic numbers that do not satisfy this requirement?

If $\alpha $ is a root of a polynomial $p(x)$ over $\mathbb{Q}$ whose Galois group is not soluble, then it cannot be expressed using symbols in $\mathbb{Z} \cup \mathcal{O}$ alone. What one can do, however, is to adjoin other algebraic numbers in $B =\{ {\beta }_{1},\ldots ,{\beta }_{h}\}$ to $\mathbb{Q}$, obtaining other fields $F = \mathbb{Q}[{\beta }_{1},\ldots ,{\beta }_{h}]$, until the minimal polynomial of $\alpha $ over F has a soluble Galois group. This process terminates: it suffices to adjoin all the roots of p(x). Symbolic algebra packages such as Maple [14] attempt to find smallest h such that the Galois group of p(x) over F is soluble. Then $\alpha $ can be expressed by meaningful strings of symbols in $\mathbb{Z} \cup B \cup \mathcal{O}$.

Example 5.1.

Asking Maple to solve

$$\begin{array}{rcl} {x}^{5} + y + 1& =& 0 \\ {y}^{2} + y - x& =& \end{array}$$

(0)

yields the solution $x = {\alpha }^{2} + \alpha $, $y = \alpha $, where $\alpha $ is a root of the polynomial $(x + 1)({x}^{8} + 3{x}^{7} + 3{x}^{6} + {x}^{5} + 1)$. The Galois group of ${x}^{8} + 3{x}^{7} + 3{x}^{6} + {x}^{5} + 1$ is S ₈, the full symmetric group over 8 elements, and S ₈ is not soluble.

2.1 5.2.1 Polynomial System Representation

Each algebraic number $\alpha \in \mathbb{A}$ can be associated with a polynomial ${p}_{\alpha } \in \mathbb{Q}[x]$ such that $p(\alpha ) = 0$ and a rational $\bar{\alpha } \in \mathbb{Q}$ which is closest to $\alpha $ than to the other roots of ${p}_{\alpha }$.

Example 5.2.

For $\alpha = \root{3}\of{\frac{1} {2} + \sqrt{[4]3}}$ we might choose its minimal polynomial over $\mathbb{Q}$, ${p}_{\alpha }(x) = {x}^{12} - 2{x}^{9} + \frac{3} {2}{x}^{6} -\frac{1} {2}{x}^{3} -\frac{47} {16}$, and set $\bar{\alpha } = 2$, which is closest to $\alpha $ than to the other real root of ${p}_{\alpha }$.

As mentioned above, embeddings can be seen as sequences of algebraic numbers. Any sequence S of $\mathcal{l}$ algebraic numbers can be associated with a multivariate polynomial system ${\mathbf{p}}_{S} \in \mathbb{Z}[{x}_{1},\ldots ,{x}_{\mathcal{l}}]$ such that ${\mathbf{p}}_{S}(S) = 0$, together with a rational vector $q \in {\mathbb{Q}}^{\mathcal{l}}$ such that $\|S - {q\|}_{2}$ is smallest.

2.2 5.2.2 Formal Grammar Representation

The “meaningful strings” mentioned above, used to represent algebraic numbers in a field F = Q[B] where $B =\{ {\beta }_{1},\ldots ,{\beta }_{h}\}$, are generated by the formal grammar:

$$\mathbb{A}\rightarrow (\mathbb{A} + \mathbb{A}) \vee (\mathbb{A} \times \mathbb{A}) \vee (\mathbb{A} \div \mathbb{A}) \vee (\root{\mathbb{Z}}\of{\mathbb{A}}) \vee (\mathbb{Z}) \vee (B)$$

where, with a slight abuse of notation, we use $\mathbb{A}, \mathbb{Z}$ to denote the type of algebraic and integer numbers. Given a string consisting of symbols in $\mathbb{Z} \cup B \cup \mathcal{O}$, the string is meaningful if it matches the pattern given by the grammar. The algorithm that matches strings to grammars [12] is recursive in nature and yields a grammar derivation trees [16]. Each algebraic number in $\mathbb{A}$ can be represented with respect to B by its corresponding derivation tree.

Example 5.3.

The algebraic number $\alpha = \root{3}\of{\frac{1} {2} + \root{4}\of{3}}$ yields the grammar derivation tree shown in Fig. 5.1.

3 5.3 The Gröbner Bases Strategy

We restrict our attention to K = 2 and propose to pursue a line of argument showing that DGP₂ ∈ { NP}. It is well known that any multivariate polynomial system of equations such as Eq. 5.2 can be reduced to a “triangular form” by employing Gröbner bases and the Buchberger algorithm [2] (a clear and short introduction to these concepts can be found in [8]). We represent an embedding $x : V \rightarrow {\mathbb{R}}^{2}$ solving a DGP₂ instance as the sequence $({x}_{11},{x}_{12},{x}_{21},{x}_{22},\ldots ,{x}_{n1},{x}_{n2})$.

Example 5.4.

Consider the right-angled triangle with smallest possible integer side lengths (3, 4, 5) in ${\mathbb{R}}^{2}$ delimited by x ₁ = (0, 0), ${x}_{2} = (3,0)$, x ₃ = (0, 4). System 5.2 is:

$$\begin{array}{rcl}{ ({x}_{11} - {x}_{21})}^{2} + {({x}_{ 12} - {x}_{22})}^{2}& =& 9 \\ {({x}_{11} - {x}_{31})}^{2} + {({x}_{ 12} - {x}_{32})}^{2}& =& 16 \\ {({x}_{21} - {x}_{31})}^{2} + {({x}_{ 22} - {x}_{32})}^{2}& =& \end{array}$$

(25.)

The above system describes all (3, 4, 5)-sided triangles in ${\mathbb{R}}^{2}$. We can fix ${x}_{11} = {x}_{12} = 0$ and ${x}_{21} = 3$ to eliminate rotations and translations. This reduces the system to

$$\begin{array}{rcl} {3}^{2} + {x}_{ 22}^{2}& =& 9 \\ {x}_{31}^{2} + {x}_{ 32}^{2}& =& 16 \\ {(3 - {x}_{31})}^{2} + {({x}_{ 22} - {x}_{32})}^{2}& =& \end{array}$$

(25.)

A Gröbner basis of the above system (provided by Maple 9.5 [14] with the pure lexicographic term ordering) is given by

$$\begin{array}{rcl} {x}_{31}^{2}& =& 0 \\ {x}_{32}^{2}& =& 16 \\ 16{x}_{22} + 3{x}_{31}{x}_{32}& =& \end{array}$$

(0.)

It is clear that the Gröbner system has two real solutions given by ${x}_{22} = {x}_{31} = 0$ and ${x}_{32} = \pm 4$, which correspond to two congruent conformations reflected along the 1st coordinate, as shown in Fig. 5.2.

Let the system 5.2 have solution set X, and let x ∈ X. According to Sect. 5.2.1 we can represent x by Eq. 5.2 and a rational vector q ∈ ℚ ²ⁿ which is closest to x than any other x′ ∈ X. Because of Gröbner basis theory, it follows that the very same embedding can be represented by any Gröbner basis system derived from Eq. 5.2 and q. The advantage in reducing the original system 5.2 to triangular form is that, by a form of back substitution, we can easily derive the set B referred to in Sect. 5.2, together with the string that describes the components of x.

Showing that the size of a Gröbner basis is bounded by a polynomial in the instance size would be a (substantial) first step toward proving that ${\mbox{ DGP}}_{2} \in \mathbf{<Emphasis Type="Bold">\text{ NP}</Emphasis>}$. Unfortunately, this is false in general: the size of a Gröbner basis grows doubly exponentially. The polynomial system 5.2, however, has a very special structure, which—one might hope—could provide an exception. The rest of this section will introduce an infinite class of DGP instances which provide empirical evidence to the contrary. This is, of course, not a conclusive statement.

3.1 5.3.1 The Empirical Evidence Against

In this section we construct an infinite class of graphs embedded in ℝ ² which have a Gröbner basis whose size, obtained computationally for a few cases, indicates an exponential growth in the instance size. The graph class consists of a chain of triangles sharing a side: $V =\{ 1,\ldots ,n\}$ (with n ≥ 3), and $E =\{\{ v - 2,v\},\{v - 1,v\}\;\vert \;v > 2\}$. The weight function $d : E \rightarrow {\mathbb{Q}}_{+}$ is such that ${d}_{uv} = \frac{1} {u}$ for all {u, v} such that u < v. Examples with n = 10 and n = 20 are given, respectively, in Figs. 5.3 and 5.4.

These triangle chains embedded in ${\mathbb{R}}^{2}$ provide rigid frameworks [7] and are examples of of Henneberg type I graphs [23] and of Discretizable DGP (DDGP) instances [17]. Using Maple [14], we were able to show that the dependency of the Gröbner basis size in terms of the instance size looks exponential over a set of triangle chains with n vertices with $n \in \{ 3,\ldots ,11\}$. More precisely, the number of equations in the Gröbner basis and the size of each equation both seem to grow exponentially (or worse), whereas the degree seems to grow linearly, as shown in Fig. 5.5.

References

Clark, B., Colburn, C., Johnson, D.: Unit disk graph. Discrete Math. 86, 165–177 (1990)
Google Scholar
Cox, D., Little, J., O’Shea, D.: Ideals, Varieties and Algorithms, 2nd edn. Springer, Berlin (1997)
Google Scholar
Cremona, L.: Le figure reciproche nella statica grafica. In: Bernardoni, G., Milano (1872)
Google Scholar
Eren, T., Goldenberg, D., Whiteley, W., Yang, Y., Morse, A., Anderson, B., Belhumeur, P.: Rigidity, computation, and randomization in network localization. In: IEEE Infocom Proceedings, 2673–2684 (2004)
Google Scholar
Garey, M., Johnson, D.: Computers and Intractability: A Guide to the Theory of NP-Completeness. Freeman and Company, New York (1979)
Google Scholar
Graver, J.: Rigidity matroids. SIAM J. Discrete Math. 4, 355–368 (1991)
Google Scholar
Graver, J., Servatius, B., Servatius, H.: Combinatorial rigidity. Am. Math. Soc. (1993) http://books.google.com.pe/books/about/Combinatorial_Rigidity.html?id=0XwvY1GVNN4C
Hägglöf, K., Lindberg, P., Svensson, L.: Computing global minima to polynomial optimization problems using gröbner bases. J. Global Optim. 7(2), 115–125 (1995)
Google Scholar
Hendrickson, B.: The molecule problem: exploiting structure in global optimization. SIAM J. Optim. 5, 835–857 (1995)
Google Scholar
Henneberg, L.: Die Graphische Statik der starren Systeme. Teubner, Leipzig (1911)
Google Scholar
Lavor, C., Liberti, L., Maculan, N., Mucherino, A.: Recent advances on the discretizable molecular distance geometry problem. Eur. J. Oper. Res. 219, 698–706 (2012)
Google Scholar
Levine, R., Mason, T., Brown, D.: Lex and Yacc, 2nd edn. O’Reilly, Cambridge (1995)
Google Scholar
Liberti, L., Lavor, C., Mucherino, A., Maculan, N.: Molecular distance geometry methods: from continuous to discrete. Int. Trans. Oper. Res.18, 33–51 (2010)
Google Scholar
Maplesoft, Inc.: Maple 9 Getting Started Guide. Maplesoft, Waterloo (2003) http://www.maplesoft.com/products/maple/manuals/GettingStartedGuide.pdf
Maxwell, J.: On the calculation of the equilibrium and stiffness of frames. Phil. Mag. 27(182), 294–299 (1864)
Google Scholar
Mosses, P.: Denotational semantics, In: van Leeuwen, J. (ed.) Handbook of Theoretical Computer Science B: Formal Models and Semantics, pp. 575–631. Elsevier, Amsterdam (1990)
Google Scholar
Mucherino, A., Lavor, C., Liberti, L.: The discretizable distance geometry problem. Optimization Letters, Springer: 6(8), 1671–1686 (2012)
Google Scholar
Saviotti, C.: Nouvelles méthodes pour le calcul des travures réticulaires In: Appendix to Cremona, L., “Les figures réciproques en statique graphique”, pp. 37–100. Gauthier-Villars, Paris (1885)
Google Scholar
Saxe, J.: Embeddability of weighted graphs in k-space is strongly NP-hard. In: Proceedings of 17th Allerton Conference in Communications, Control and Computing, pp. 480–489 (1979)
Google Scholar
Servatius, B., Servatius, H.: Generic and abstract rigidity, In: Thorpe, M., Duxbury, P. (eds.) Rigidity Theory and Applications, Fundamental Materials Research, pp. 1–19. Springer, New York (2002) DOI: 10.1007/0-306-47089-6_1
Google Scholar
So, A.M.C., Ye, Y.: Theory of semidefinite programming for sensor network localization. Math. Program. B 109, 367–384 (2007)
Google Scholar
Stewart, I.: Galois Theory, 2nd edn. Chapman and Hall, London (1989)
Google Scholar
Tay, T.S., Whiteley, W.: Generating isostatic frameworks. Structural Topology 11, 21–69 (1985)
Google Scholar
Varignon, P.: Nouvelle Mecanique. Claude Jombert, Paris (1725)
Google Scholar

Download references

Author information

Authors and Affiliations

CMAP, École Polytechnique, 91128, Palaiseau, France
Nathanael Beeker & Christian Glusa
INRIA Rocquencourt, Cedex, France
Stéphane Gaubert
LIX, École Polytechnique, 91128, Palaiseau, France
Leo Liberti

Authors

Nathanael Beeker
View author publications
You can also search for this author in PubMed Google Scholar
Stéphane Gaubert
View author publications
You can also search for this author in PubMed Google Scholar
Christian Glusa
View author publications
You can also search for this author in PubMed Google Scholar
Leo Liberti
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Christian Glusa .

Editor information

Editors and Affiliations

IRISA, University of Rennes 1, avenue du General Leclerc, Rennes, 35042, France
Antonio Mucherino
, Dept of Applied Maths (IMECC-UNICAMP), State University of Campinas, Campinas, 13081, Brazil
Carlile Lavor
, LIX, Ecole Polytechnique, Palaiseau, 91128, France
Leo Liberti
Instituto Alberto Luiz Coimbra de, Pos-Graduacao e Pesquisa de, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Rio de Janeiro, Brazil
Nelson Maculan

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Beeker, N., Gaubert, S., Glusa, C., Liberti, L. (2013). Is the Distance Geometry Problem in NP?. In: Mucherino, A., Lavor, C., Liberti, L., Maculan, N. (eds) Distance Geometry. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-5128-0_5

Download citation

DOI: https://doi.org/10.1007/978-1-4614-5128-0_5
Published: 03 November 2012
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-5127-3
Online ISBN: 978-1-4614-5128-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us