Abstract
Oriented Conformal Geometric Algebra was recently applied to Molecular Distance Geometry, where we want to determine 3D protein structures using distance information provided by Nuclear Magnetic Resonance experiments. We present new results that simplify the associated calculations.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
1 Introduction
Distance Geometry (DG) deals with calculation of points using distances between some of them [30, 31]. One of the most important applications of DG is related to the determination of 3D protein structures using distance data from Nuclear Magnetic Resonance (NMR) experiments [11, 27, 34, 40]. For other applications and more information about DG, see [4, 5, 32, 33, 36].
Given a graph \(G=(V,E,d)\), where V represents the set of atoms of a molecule and E is the set of atom pairs for which a distance is available (defined by \(d:E\rightarrow (0,\infty )\)), the Molecular Distance Geometry Problem (MDGP) is to find a function \(x:V\rightarrow \mathbb {R}^{3} \) that associates each element of V with a point in \(\mathbb {R}^{3}\) in such a way that the Euclidean distances between the points correspond to the values given by d [30].
Information from protein geometry and NMR data allow us to solve the MDGP using a combinatorial method, called Branch-and-Prune (BP) [7, 29]. The discrete MDGP is called the Discretizable MDGP (DMDGP), which is based on a vertex order \( v_{1},...,v_{n}\in V\), also given as input of the problem [8, 15, 25, 26, 35]. Formally, the DMDGP is defined as follows [19, 20] (we denote \(x_{i}\) instead of \( x(v_{i})\) and \(d_{i,j}\) instead of \(d(v_{i},v_{j})\)):
Definition 1
Given a simple undirected graph \(G=(V,E,d)\), whose edges are weighted by \( d:E\rightarrow (0,\infty )\), and a vertex order \(v_{1},...,v_{n}\in V\), such that
-
1.
For \(v_{1},v_{2},v_{3}\in V\), there exist \(x_{1},x_{2},x_{3}\in \mathbb {R}^{3}\) satisfying the given distances;
-
2.
For \(i>3\),
$$\begin{aligned} \{\{v_{i-3},v_{i}\},\{v_{i-2},v_{i}\},\{v_{i-1},v_{i}\}\}\subset E \end{aligned}$$(1)
and
find a function \(x:V\rightarrow \mathbb {R}^{3}\) satisfying
Property 1 avoids solutions modulo rotations and translations and Property 2 allows us to calculate the two possible positions for \(v_{4}\). For each position for \(v_{4}\), we have other two for \(v_{5}\), and so on, implying that the DMDGP search space is finite [20].
For \(i>3\), we may also have \(\{v_{j},v_{i}\}\in E\), \(j<i-3\), adding another equation (\(||x_{i}-x_{j}||=d_{j,i}\)) to the system related to \(v_{i}\):
We obtain a unique solution \(x_{i}^{*}\) for \(v_{i}\), supposing \( ||x_{i}^{*}-x_{j}||=d_{j,i}\) and that the points \( x_{i-1},x_{i-2,}x_{i-3},x_{j}\in \mathbb {R}^{3}\) are not in the same plane. If both possible positions for \(v_{i}\) are unfeasible with respect to additional distances \(d_{j,i}\), \(j<i-3\), it is necessary to consider the other possible position for \(v_{i-1}\) and repeat the procedure. Essentially, this is what the BP algorithm does [20].
Since distances \(d_{i-1,i}\) and \(d_{i-2,i}\) are related to bond lengths and bond angles of a protein, they can be considered precise values. This may not happen to distances \(d_{j,i}\), \(j\le i-3\), since they may be provided by NMR experiments [28, 40]. In [21], distances \(d_{j,i}\) are represented as interval distances \([\underline{d}_{j,i}, \overline{d}_{j,i}]\), \(\underline{d}_{j,i}\le d_{j,i}\le \overline{d}_{j,i} \), and an extension of the BP algorithm, called iBP, is proposed. The idea is to sample values from intervals \([\underline{d}_{i-3,i}, \overline{d}_{i-3,i}]\) in order to solve the associated system to calculate positions to \(v_{i}\), which implies that, choosing many values, the search space increases exponentially, and for small samples, a solution may not be found [1, 9, 10, 16, 37, 39].
Geometrically, even considering interval distances, solving systems like (4) is to intersect spheres and spherical shells centered at the positions for vertices \(v_{i-1},v_{i-2},v_{i-3},v_{j}\) (\(j<i-3\)) with radius \(d_{i-1,i},d_{i-2,i},[\underline{d}_{i-3,i},\overline{d}_{i-3,i}],[ \underline{d}_{j,i},\overline{d}_{j,i}]\), respectively.
In [2, 3], Conformal Geometric Algebra (CGA) was used to model uncertainties in the DMDGP, for the branching phase of iBP (intersection of two spheres and a spherical shell), and, in [24], the Oriented CGA (OCGA) [6] was applied for the pruning phase of iBP (when additional spherical shells must be considered in the intersection). Another CGA approach is discussed in [13].
We present new results that simplify the calculations in the pruning phase of iBP algorithm.
Next section explains how CGA and OCGA replace the classical approach for solving the DMDGP and Sect. 3 provides our contribution.
2 Conformal Geometric Algebra (CGA) and Oriented CGA (OCGA) for the iBP Algorithm
2.1 CGA for Branching
Replacing \(d_{i-3,i}\in \mathbb {R}\) by interval distance \([\underline{d} _{i-3,i},\overline{d}_{i-3,i}]\) in (4), we have to intersect two spheres with one spherical shell resulting in two arcs, instead of two points in \(\mathbb {R}^{3}\) (see Fig. 1).
The points \(\underline{P_{i}^{0}},\underline{P_{i}^{1}}\) and \(\overline{P_{i}^{0}},\overline{P_{i}^{1}}\), from the intersection of spheres centered at the positions for \(v_{i-1},v_{i-2},v_{i-3}\) with radius \( d_{i-1,i},d_{i-2,i},\underline{d}_{i-3,i}\) and \(d_{i-1,i},d_{i-2,i}, \overline{d}_{i-3,i}\), respectively (see Fig. 1), can be obtained from the point pairs generated by
where underline and overline indicate the use of \(\underline{d} _{i-3,i}\) and \(\overline{d}_{i-3,i}\), respectively [2] (in the conformal model, \(S_{i,j}\) is the sphere centered at the position for vertex \(v_{i}\) with radius \(d_{i,j}\)).
With the starting and the ending point of an arc, we can define a rotor, for \(i\ge 4\), given by
where \(\lambda _{i}\in [0,\; \phi _{i}]\) is the rotation angle corresponding to the arcs \(\underline{P_{i}^{0}}\overline{P_{i}^{0}}\) and \(\underline{P_{i}^{1}} \overline{P_{i}^{1}}\) (see Fig. 1), \(z_{i}^{*}\) is the dual of \(z_{i}\), and
since the rotation axis of \(R_{i}\) is defined by \(X_{i-2}\) and \(X_{i-1}\) (the positions for \(v_{i-2}\) and \(v_{i-1}\) in the conformal model). Note that \(\phi _{i}\), \(i\ge 4\), can be computed apriori based on the given intervals \([\underline{d}_{i-3,i},\overline{d}_{i-3,i}]\) and the DMDGP definition [2].
In order to consider the effect of changing points in the arcs (to avoid the sampling process), the rotation axes of the rotors must be replaced by (see [3])
implying that
for \(i\ge 4\). The values \(b\in \{0,1\}\) are defined when iBP chooses one of the branches in the search tree [3].
Note that, when all distances \(d_{i-3,i}\) are precise values, i.e. \( \phi _{i}=0\) for \(i\ge 4\),
2.2 OCGA for Pruning
During the pruning phase of iBP, when additional spherical shells must be considered, the arc orientation is even more important. In [24], this was done using the Oriented Conformal Geometric Algebra (OCGA) [6], which is an extension of the Oriented Projective Geometry [38].
First, we define an orientation for the circle obtained from the intersection \({S_{i-1,i}\wedge S_{i-2,i}}\) (Fig. 2), given by
Since \(C_{i}\) is a trivector in the conformal space, its dual \(C_{i}^{*}\) is a bivector orthogonal to the plane that contains the circle \(C_{i}\), which implies that the line
is oriented according to \(C_{i}\).
Using the normalized bivector dual to the rotation axis \(C_{i}^{*}\wedge e_{\infty }\), we can define \(z_i^{*}\) in a different way, that carries the orientation of \(C_i\). The associated rotor \(R_{i}\) is now defined by:
where
Supposing that for \(v_{i}\), \(i>4\), there is a pruning edge \( \{v_{j},v_{i}\}\in E\), \(j<i-3\), with interval distance \([\underline{d}_{j,i}, \overline{d}_{j,i}]\), and denoting by \(\underline{P_{j}^{0}}\overline{ P_{j}^{0}}\) and \(\underline{P_{j}^{1}}\overline{P_{j}^{1}}\) the arcs obtained from the intersections \(S_{i-1,i}\wedge S_{i-2,i}\wedge \underline{S}_{j,i}\) and \(S_{i-1,i}\wedge S_{i-2,i}\wedge \overline{S}_{j,i}\), we compute the new starting and ending points of the associated rotors doing the following calculations [24] (all the possible cases are illustrated in Fig. 3):
and
where
The same procedure must be done for \(\underline{P_{j}^{1}}\) and \(\overline{P_{j}^{1}}\).
3 New Approach for iBP Prunning Phase
Considering a circle C, obtained from the intersection of spheres \(S_{1}\) and \(S_{2}\), we obtain
Since C can also be defined by three points \(P_{1},P_{2},\,P_{3}\in C\), let us suppose that
For other three points \(Q_{1},Q_{2},\,Q_{3}\in C\), but with opposite orientation, we have
Thus, for any three distinct points in C, given by \(S_{1}\wedge S_{2}\), the expression \(\left( \frac{C}{||C||}\right) ^{*}\) is constant (up to a sign ±). This implies that to distinguish the orientation of two trivectors that define C, it is enough to check the signs of some fixed coordinate (\(\ne 0\)) of the trivectors.
For example, for points \(P_{1},P_{2},\,P_{3}\in C\), with
and choosing the coordinate of
of the trivector \(P_{1}\wedge P_{2}\wedge P_{3}\), we have to calcultate
From Sect. 2, in order to know the position of \(\underline{P_{j}^{0}}\) (obtained from the intersection \(S_{i-1,i}\wedge S_{i-2,i}\wedge \underline{S}_{j,i}\)), in terms of arc \(\underline{P_{i}^{0}}\overline{P_{i}^{0}}\), we have to calculate
Using the new idea, in addition to avoid two trivector products, we just calculate (and compare the signs) expressions like (6): one for \( \underline{P_{j}^{0}}\wedge \overline{P_{i}^{0}}\wedge \overline{P_{i}^{1}}\) and another for \(\underline{P_{i}^{0}}\wedge \underline{P_{j}^{0}}\wedge \overline{P_{i}^{1}}\).
Without loss of generality, let us suppose that the sign of expression (6), associated to the points \(\underline{P_{j}^{0}},\overline{P_{i}^{0}}, \overline{P_{i}^{1}}\), is positive, denoted by
So, all the cases illustrated in Fig. 3 are given, respectively, by
-
Case 1:
$$\begin{aligned} \left[ \underline{P_{j}^{0}},\overline{P_{i}^{0}},\overline{P_{i}^{1}}\right]> & {} 0 \\ \left[ \underline{P_{i}^{0}},\underline{P_{j}^{0}},\overline{P_{i}^{1}} \right]< & {} 0 \\ \left[ \overline{P_{j}^{0}},\overline{P_{i}^{0}},\overline{P_{i}^{1}}\right]> & {} 0 \\ \left[ \underline{P_{i}^{0}},\overline{P_{j}^{0}},\overline{P_{i}^{1}}\right]> & {} 0 \end{aligned}$$ -
Case 2:
$$\begin{aligned} \left[ \underline{P_{j}^{0}},\overline{P_{i}^{0}},\overline{P_{i}^{1}}\right]> & {} 0 \\ \left[ \underline{P_{i}^{0}},\underline{P_{j}^{0}},\overline{P_{i}^{1}} \right]> & {} 0 \\ \left[ \overline{P_{j}^{0}},\overline{P_{i}^{0}},\overline{P_{i}^{1}}\right]< & {} 0 \\ \left[ \underline{P_{i}^{0}},\overline{P_{j}^{0}},\overline{P_{i}^{1}}\right]> & {} 0 \end{aligned}$$ -
Case 3:
$$\begin{aligned} \left[ \underline{P_{j}^{0}},\overline{P_{i}^{0}},\overline{P_{i}^{1}}\right]> & {} 0 \\ \left[ \underline{P_{i}^{0}},\underline{P_{j}^{0}},\overline{P_{i}^{1}} \right]> & {} 0 \\ \left[ \overline{P_{j}^{0}},\overline{P_{i}^{0}},\overline{P_{i}^{1}}\right]> & {} 0 \\ \left[ \underline{P_{i}^{0}},\overline{P_{j}^{0}},\overline{P_{i}^{1}}\right]> & {} 0 \end{aligned}$$ -
Case 4:
$$\begin{aligned} \left[ \underline{P_{j}^{0}},\overline{P_{i}^{0}},\overline{P_{i}^{1}}\right]< & {} 0 \\ \left[ \underline{P_{i}^{0}},\underline{P_{j}^{0}},\overline{P_{i}^{1}} \right]> & {} 0 \\ \left[ \overline{P_{j}^{0}},\overline{P_{i}^{0}},\overline{P_{i}^{1}}\right]< & {} 0 \\ \left[ \underline{P_{i}^{0}},\overline{P_{j}^{0}},\overline{P_{i}^{1}}\right]> & {} 0 \end{aligned}$$
3.1 Example
Let us consider the same example given in [24], that is, a DMDGP instance with the following data (all the calculations were done using GAALOP [18]):
Since \(d_{1,4}\) is also a precise value, we can fix the first four points, given by
From the intersections of spheres \(\underline{S}_{2,5}\wedge S_{3,5}\wedge S_{4,5}\) and \(\overline{S}_{2,5}\wedge S_{3,5}\wedge S_{4,5}\), we obtain the arcs \(\underline{P_{5}^{0}}\overline{P_{5}^{0}}\) and \(\underline{P_{5}^{1}} \overline{P_{5}^{1}}\), defined by the points
Using the interval distance \(d_{1,5}\), we calculate \(\underline{S} _{1,5}\wedge S_{3,5}\wedge S_{4,5}\) and \(\overline{S}_{1,5}\wedge S_{3,5}\wedge S_{4,5}\), giving the points
With the orientation of \(C_{5}=S_{3,5}\wedge S_{4,5}\) defined by
the calculations necessary to test if \(\underline{A_{5}^{0}}\in \underline{P_{5}^{0}}\overline{P_{5}^{0}}\), using the original strategy, are the following (\(e_{ij}=e_{i}\wedge e_{j}\) and \(E=e_{\infty }\wedge e_{0}\)):
and
However, using the proposed approach, we just calculate the coordinates of \({ e_{12}\wedge e_{0}}\), using (6):
Since \(\left[ \underline{A_{5}^{0}},\overline{P_{5}^{0}},\overline{P_{5}^{1}} \right] >0\) and \(\left[ \underline{P_{5}^{0}},\underline{A_{5}^{0}}, \overline{P_{5}^{1}}\right] >0\), we conclude that
4 Conclusion and Acknowledgements
The first mathematical relationship between Distance Geometry and Geometric Algebra was established by Dress and Havel, in 1993 [14]. Since 2015 [22], the connection between the DMDGP and Geometric Algebra has been studied, becoming part of the relevant and challenging applications of Conformal Geometric Algebra [12, 17, 23].
We would like to thank Sebastià Xambó for the organization of the minisymposium “Systems, Patterns and Data Engineering with Geometric Calculi”, at the ICIAM 2019, and to encourage us to write this work. We are also grateful to the Brazilian research agencies CNPq and FAPESP for the support and to the reviewer’s comments.
References
Agra, A., Figueiredo, R., Lavor, C., Maculan, N., Pereira, A., Requejo, C.: Feasibility check for the distance geometry problem: an application to molecular conformations. Int. Trans. Oper. Res. 24, 1023–1040 (2017)
Alves, R., Lavor, C.: Geometric algebra to model uncertainties in the discretizable molecular distance geometry problem. Adv. Appl. Clifford Algebra 27, 439–452 (2017)
Alves, R., Lavor, C., Souza, C., Souza, M.: Clifford algebra and discretizable distance geometry. Math. Methods Appl. Sci. 41, 3999–4346 (2018)
Billinge, S.J., Duxbury, P.M., Gon calves, D.S., Lavor, C., Mucherino, A.: Assigned and unassigned distance geometry: applications to biological molecules and nanostructures. 4OR 14, 337–376 (2016)
Billinge, S., Duxbury, P., Gonçalves, D., Lavor, C., Mucherino, A.: Recent results on assigned and unassigned distance geometry with applications to protein molecules and nanostructures. Ann. Oper. Res. 271, 161–203 (2018)
Cameron, J., Lasenby, J.: Oriented conformal geometric algebra. Adv. Appl. Clifford Algebra 18, 523–538 (2008)
Carvalho, R., Lavor, C., Protti, F.: Extending the geometric build-up algorithm for the molecular distance geometry problem. Inf. Process. Lett. 108, 234–237 (2008)
Cassioli, A., Gunluk, O., Lavor, C., Liberti, L.: Discretization vertex orders in distance geometry. Disc. Appl. Math. 197, 27–41 (2015)
Cassioli, A., Bardiaux, B., Bouvier, G., Mucherino, A., Alves, R., Liberti, L., Nilges, M., Lavor, C., Malliavin, T.E: An algorithm to enumerate all possible protein conformations verifying a set of distance constraints. BMC Bioinformatics 16, 16–23 (2015)
Costa, T., Bouwmeester, H., Lodwick, W., Lavor, C.: Calculating the possible conformations arising from uncertainty in the molecular distance geometry problem using constraint interval analysis. Inf. Sci. 415–416, 41–52 (2017)
Crippen, G., Havel, T.: Distance Geometry and Molecular Conformation. Wiley (1988)
Dorst, L., Fontijne, D., Mann, S.: Geometric Algebra for Computer Science: An Object-Oriented Approach to Geometry. Morgan Kaufman (2007)
Dorst, L.: Boolean combination of circular arcs using orthogonal spheres. Adv. Appl. Clifford Algebra 29, 1–21 (2019)
Dress, A., Havel, T.: Distance geometry and geometric algebra. Found. Phys. 23, 1357–1374 (1993)
Gonçalves, D., Mucherino, A.: Discretization orders and efficient computation of Cartesian coordinates for distance geometry. Optim. Lett. 8, 2111–2125 (2014)
Gonçalves, D., Mucherino, A., Lavor, C., Liberti, L.: Recent advances on the interval distance geometry problem. J. Global Optim. 69, 525–545 (2017)
Hestenes, D.: Old wine in new bottles: a new algebraic framework for computational geometry. In: Advances in Geometric Algebra with Applications in Science and Engineering, Corrochano, E., Sobczyk, G. (eds.), Birkhäuser, pp. 1–14 (2001)
Hildenbrand, D.: Foundations of Geometric Algebra Computing. Springer (2012)
Lavor, C., Liberti, L., Maculan, N., Mucherino, A.: Recent advances on the discretizable molecular distance geometry problem. Eur. J. Oper. Res. 219, 698–706 (2012)
Lavor, C., Liberti, L., Maculan, N., Mucherino, A.: The discretizable molecular distance geometry problem. Comput. Optim. Appl. 52, 115–146 (2012)
Lavor, C., Liberti, L., Mucherino, A.: The interval BP algorithm for the discretizable molecular distance geometry problem with interval data. J. Global Optim. 56, 855–871 (2013)
Lavor, C., Alves, R., Figueiredo, W., Petraglia, A., Maculan, N.: Clifford algebra and the discretizable molecular distance geometry problem. Adv. Appl. Clifford Algebra 25, 925–942 (2015)
Lavor, C., Xambó-Descamps, S., Zaplana, I.: A Geometric Algebra Invitation to Space-Time Physics Robotics and Molecular Geometry. SpringerBriefs in Mathematics, Springer (2018)
Lavor, C., Alves, R.: Oriented conformal geometric algebra and the molecular distance geometry problem. Adv. Appl. Clifford Algebra 29, 1–19 (2019)
Lavor, C., Liberti, L., Donald, B., Worley, B., Bardiaux, B., Malliavin, T., Nilges, M.: Minimal NMR distance information for rigidity of protein graphs. Discrete Appl. Math. 256, 91–104 (2019)
Lavor, C., Souza, M., Mariano, L., Liberti, L.: On the polinomiality of finding \(^{K}\)DMDGP re-orders. Discrete Appl. Math. 267, 190–194 (2019)
Lavor, C., Alves, R., Souza, M., Aragon, J.: NMR protein structure calculation and sphere intersections. Comput. Math. Biophys. 8, 89–101 (2020)
Lavor, C., Souza, M., Carvalho, L., Gonçalves, D., Mucherino, A.: Improving the sampling process in the interval branch-and-prune algorithm for the discretizable molecular distance geometry problem. Appl. Math. Comput. 389, 125586–125597 (2021)
Liberti, L., Lavor, C., Maculan, N.: A branch-and-prune algorithm for the molecular distance geometry problem. Int. Trans. Oper. Res. 15, 1–17 (2008)
Liberti, L., Lavor, C., Maculan, N., Mucherino, A.: Euclidean distance geometry and applications. SIAM Rev. 56, 3–69 (2014)
Liberti, L., Lavor, C.: Six mathematical gems from the history of distance geometry. Int. Trans. Oper. Res. 23, 897–920 (2016)
Liberti, L., Lavor, C.: Euclidean Distance Geometry: An Introduction. Springer (2017)
Liberti, L., Lavor, C.: Open research areas in distance geometry. In: Open Problems in Optimization and Data Analysis, Migalas, A., Pardalos, P. (eds.), Springer, pp. 183–223 (2018)
Malliavin, T., Mucherino, A., Lavor, C., Liberti, L.: Systematic exploration of protein conformational space using a distance geometry approach. J. Chem. Inf. Model. 59, 4486–4503 (2019)
Mucherino, A., Lavor, C., Liberti, L.: The discretizable distance geometry problem. Optimization Lett. 6, 1671–1686 (2012)
Mucherino, A., Lavor, C., Liberti, L., Maculan, N.: Distance Geometry: Theory, Methods, and Applications. Springer (2013)
Souza, M., Lavor, C., Muritiba, A., Maculan, N.: Solving the molecular distance geometry problem with inaccurate distance data. BMC Bioinformatics 14, S71–S76 (2013)
Stolfi, J.: Oriented Projective Geometry - A Framework for Geometric Computations. Academic Press (1991)
Worley, B., Delhommel, F., Cordier, F., Malliavin, T., Bardiaux, B., Wolff, N., Nilges, M., Lavor, C., Liberti, L.: Tuning interval branch-and-prune for protein structure determination. J. Global Optim. 72, 109–127 (2018)
Wütrich, K.: Protein structure determination in solution by nuclear magnetic resonance spectroscopy. Science 243, 45–50 (1989)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Lavor, C., Alves, R. (2021). Recent Advances on Oriented Conformal Geometric Algebra Applied to Molecular Distance Geometry. In: Xambó-Descamps, S. (eds) Systems, Patterns and Data Engineering with Geometric Calculi. SEMA SIMAI Springer Series(), vol 13. Springer, Cham. https://doi.org/10.1007/978-3-030-74486-1_2
Download citation
DOI: https://doi.org/10.1007/978-3-030-74486-1_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-74485-4
Online ISBN: 978-3-030-74486-1
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)