Comparing the Spatial Structure of Molecules by Minimizing a Comparison Function

Laneev, E. B.; Chernikova, N. Yu.

doi:10.1134/S0965542519010135


Published: 14 May 2019

Volume 59, pages 128–135, (2019)

Computational Mathematics and Mathematical Physics


Abstract

A method for the quantitative comparison of the spatial geometric structure of two molecules is proposed. It is based on the minimization of a comparison function using the rotation of molecules when their centers of mass are brought into coincidence. The minimizing angles are found using the Rosenbrock method.


1 INTRODUCTION

This paper is devoted to the theoretical and numerical investigation of a mathematical model for comparing two molecules in a problem of structural chemistry. The model reduces to comparing two objects, each consisting of $N$ ordered points of fixed geometry, that behave as rigid bodies in ${{R}^{3}}$. The comparison principle is based on optimizing the superposition of the two objects by translations and rotations. The optimal superposition is achieved by finding the translations and rotations that minimize a function comparing the geometry of the two objects. This function is the weighted sum of squared distances between the points of the two objects with identical indexes. To minimize the comparison function (see [1, 2]), the zero-order Rosenbrock method is used (see [3, 4]). The results are used to compare the geometry of real-life molecules.

The molecules of many substances can exist in the form of conformers, which have the same structural formula but different spatial configurations (a conformation is the spatial arrangement of atoms in a molecule; there are various manifestations of this phenomenon in chemistry [5]). Hence, the problem of comparing the geometry of conformers in space arises because the conventional characteristics, such as bond lengths (interatomic distances) and valence angles, do not always reveal the differences in molecular geometry. Moreover, there is a need to compare fragments of chemically different molecules, the close neighborhood of atoms (coordination polyhedra), or other large or small complexes of atoms. For this purpose, a method for the quantitative comparison of molecular geometry based on the minimization of a comparison function by translating and rotating molecules is proposed. It is proved that the minimum with respect to translations is achieved by bringing certain characteristic points, which are conventionally called the molecular centers of mass, into coincidence. Minimization with respect to rotation angles is performed using the zero-order Rosenbrock method.

2 DESCRIPTION OF THE MATHEMATICAL MODEL

In this section and in the next one, we present some definitions and theorems.

Definition 1. The geometric structure (or briefly structure) is a rigid geometric construct consisting of $N$ ordered points in ${{R}^{3}}$ with the coordinates $({{x}_{i}},{{y}_{i}},{{z}_{i}})$ ($i = 1,...,N$) that moves in ${{R}^{3}}$ as a rigid body.

We assume that each $i$th point of this structure is assigned a weighting coefficient ${{w}_{i}} \geqslant 0$ such that $\sum\nolimits_{i = 1}^N \,{{w}_{i}} = W > 0$. Let ${{i}_{k}}$, $k = 1,\; \ldots ,\;K$, be the indexes of nonzero weighting coefficients ${{w}_{i}}$. Then,

$$W = \sum\limits_{i = 1}^N \,{{w}_{i}} = \sum\limits_{k = 1}^K \,{{w}_{{{{i}_{k}}}}} > 0.$$

(1)

Let two geometric structures consisting of $N$ points each with the coordinates $({{x}_{{1,i}}},{{y}_{{1,i}}},{{z}_{{1,i}}})$ and $({{x}_{{2,i}}},{{y}_{{2,i}}},{{z}_{{2,i}}})$ ($i = 1,...,N$) be given. For each fixed index $i$, the point with the coordinates $({{x}_{{1,i}}},{{y}_{{1,i}}},{{z}_{{1,i}}})$ of the first structure corresponds to the point $({{x}_{{2,i}}},{{y}_{{2,i}}},{{z}_{{2,i}}})$ of the second structure.

Let the geometries of both structures be identical. Consider two arbitrary points specified by the vectors ${{{\mathbf{r}}}_{{1,0}}}$ and ${{{\mathbf{r}}}_{{2,0}}}$ that are identically positioned relative to the corresponding geometric structure (in particular, we may choose two points of these structures with the same indexes as ${{{\mathbf{r}}}_{{1,0}}}$ and ${{{\mathbf{r}}}_{{2,0}}}$). Then, by rotating one structure relative to the other, we can bring all points with identical indexes into coincidence thus achieving the complete coincidence of the two structures. If the two structures have different geometries, then it is natural to pose the problem of bringing them into optimal “coincidence” by various translations and rotations of one structure relative to the other considered as rigid bodies.

The optimality criterion of coincidence of two geometric structures is formulated in terms of the minimum of the comparison function

$$U({{{\mathbf{r}}}_{{1,0}}},{{{\mathbf{r}}}_{{2,0}}},\varphi ,\theta ,\psi ) = \sum\limits_{i = 1}^N \,{{w}_{i}}{{\left| {{{{\mathbf{r}}}_{{1,i}}} - {{{\mathbf{r}}}_{{1,0}}} - Q({{{\mathbf{r}}}_{{2,i}}} - {{{\mathbf{r}}}_{{2,0}}})} \right|}^{2}},$$

(2)

with respect to translations and rotation (Euler) angles (see [6]), where the vectors ${{{\mathbf{r}}}_{{1,i}}}$ and ${{{\mathbf{r}}}_{{2,i}}}$ determine the position of points in the first and the second structures, respectively; the vectors ${{{\mathbf{r}}}_{{1,0}}}$ and ${{{\mathbf{r}}}_{{2,0}}}$ determine the displacement of the first and the second structures to the corresponding points; and $Q = Q(\varphi ,\theta ,\psi )$ is the rotation matrix expressed through the Euler angles ($\psi $ is the precession angle, $\theta $ is the nutation angle, and $\varphi $ is the intrinsic rotation angle):

$$Q = \left( {\begin{array}{*{20}{c}} {\cos \psi \cos \varphi - \sin \psi \sin \varphi \cos \theta }&{ - {\kern 1pt} \cos \psi \sin \varphi - \sin \psi \cos \varphi \cos \theta }&{\sin \psi \sin \theta } \\ {\sin \psi \cos \varphi + \cos \psi \sin \varphi \cos \theta }&{ - {\kern 1pt} \sin \psi \sin \varphi + \cos \psi \cos \varphi \cos \theta }&{ - {\kern 1pt} \cos \psi \sin \theta } \\ {\sin \varphi \sin \theta }&{\cos \varphi \sin \theta }&{\cos \theta } \end{array}} \right).$$

(3)

Thus, the function $U$ defined by (2) is the weighted sum, with the weights ${{w}_{i}}$, of the squared distances between the corresponding points of the two geometric structures after bringing the points determined by the vectors ${{{\mathbf{r}}}_{{1,0}}}$ and ${{{\mathbf{r}}}_{{2,0}}}$ into coincidence and rotating the second structure relative to the first one. Let us examine the problems that emerge in minimizing the comparison function (2).
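As an illustration, the matrix (3) and the comparison function (2) can be written down directly. The following is a minimal Python sketch using only the standard library; the function names `rotation_matrix`, `apply`, and `comparison_function` and the toy coordinates are assumptions made for this example and do not come from the authors' program.

```python
import math

def rotation_matrix(phi, theta, psi):
    """Rotation matrix Q(phi, theta, psi) of form (3), as a list of rows."""
    cf, sf = math.cos(phi), math.sin(phi)
    ct, st = math.cos(theta), math.sin(theta)
    cp, sp = math.cos(psi), math.sin(psi)
    return [
        [cp * cf - sp * sf * ct, -cp * sf - sp * cf * ct,  sp * st],
        [sp * cf + cp * sf * ct, -sp * sf + cp * cf * ct, -cp * st],
        [sf * st,                 cf * st,                 ct],
    ]

def apply(Q, v):
    """Matrix-vector product Q v for a 3x3 matrix and a 3-vector."""
    return [sum(Q[r][c] * v[c] for c in range(3)) for r in range(3)]

def comparison_function(r1, r2, w, r10, r20, Q):
    """U of form (2): weighted sum of squared distances between paired points."""
    total = 0.0
    for p1, p2, wi in zip(r1, r2, w):
        rotated = apply(Q, [p2[k] - r20[k] for k in range(3)])
        total += wi * sum((p1[k] - r10[k] - rotated[k]) ** 2 for k in range(3))
    return total

# Toy data (hypothetical, not taken from the paper).
points = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 3.0]]
origin = [0.0, 0.0, 0.0]
identity = rotation_matrix(0.0, 0.0, 0.0)
print(comparison_function(points, points, [1, 1, 1], origin, origin, identity))
```

With identical structures, zero displacements, and zero angles, every term of the sum vanishes, so the printed value is zero.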

3 MINIMIZING THE COMPARISON FUNCTION OF TWO GEOMETRIC STRUCTURES

We show that, by bringing certain characteristic points of two geometric structures into coincidence, the comparison function minimization problem with respect to the entire set of variables can be reduced to the minimization of the function $U$ with respect to the rotation angles $\varphi $, $\theta $, and $\psi $.

In this paper, we give a more complete and rigorous formulation and proof of the idea proposed in [7].

Theorem 1. For any fixed rotation angles, the minimum of the function $U$ with respect to the translations ${{{\mathbf{r}}}_{{1,0}}}$ and ${{{\mathbf{r}}}_{{2,0}}}$ is attained at the “centers of mass” of the two geometric structures, determined by the vectors

$${{{\mathbf{r}}}_{{j,0}}} = \frac{1}{W}\sum\limits_{i = 1}^N \,{{w}_{i}}{{{\mathbf{r}}}_{{j,i}}},\quad j = 1,2.$$

(4)

Proof. In function (2), we fix the angles $\varphi $, $\theta $, and $\psi $ and, therefore, the matrix $Q$. Hence, we will consider (2) as a function of the variables ${{{\mathbf{r}}}_{{1,0}}}$ and ${{{\mathbf{r}}}_{{2,0}}}$. This function is convex and quadratic with respect to these two variables; therefore, its minimum with respect to ${{{\mathbf{r}}}_{{1,0}}}$ and ${{{\mathbf{r}}}_{{2,0}}}$ is attained at the points at which the derivative with respect to these variables vanishes. Thus, we obtain the equations

$$\frac{{\partial U}}{{\partial {{{\mathbf{r}}}_{{1,0}}}}} = 0,\quad \frac{{\partial U}}{{\partial {{{\mathbf{r}}}_{{2,0}}}}} = 0.$$

(5)

By differentiating, we obtain

$$ - 2\sum\limits_{i = 1}^N \,{{w}_{i}}\left( {{{{\mathbf{r}}}_{{1,i}}} - {{{\mathbf{r}}}_{{1,0}}} - Q({{{\mathbf{r}}}_{{2,i}}} - {{{\mathbf{r}}}_{{2,0}}})} \right) = 0,$$

(6)

$$2{{Q}^{T}}\sum\limits_{i = 1}^N \,{{w}_{i}}\left( {{{{\mathbf{r}}}_{{1,i}}} - {{{\mathbf{r}}}_{{1,0}}} - Q({{{\mathbf{r}}}_{{2,i}}} - {{{\mathbf{r}}}_{{2,0}}})} \right) = 0.$$

(7)

Since the matrix ${{Q}^{T}}$ is nonsingular, Eq. (7) is equivalent to Eq. (6). By solving Eq. (6) for ${{{\mathbf{r}}}_{{1,0}}}$, we obtain

$${{{\mathbf{r}}}_{{1,0}}} = \frac{1}{W}\sum\limits_{i = 1}^N \,{{w}_{i}}{{{\mathbf{r}}}_{{1,i}}} - Q\left( {\frac{1}{W}\sum\limits_{i = 1}^N \,{{w}_{i}}{{{\mathbf{r}}}_{{2,i}}} - {{{\mathbf{r}}}_{{2,0}}}} \right).$$

(8)

To make the expression in parentheses in (8) equal to zero, we choose ${{{\mathbf{r}}}_{{2,0}}}$ from the condition

$${{{\mathbf{r}}}_{{2,0}}} = \frac{1}{W}\sum\limits_{i = 1}^N \,{{w}_{i}}{{{\mathbf{r}}}_{{2,i}}}.$$

(9)

Then, (8) implies

$${{{\mathbf{r}}}_{{1,0}}} = \frac{1}{W}\sum\limits_{i = 1}^N \,{{w}_{i}}{{{\mathbf{r}}}_{{1,i}}}.$$

(10)

We have already mentioned above that Eqs. (6) and (7) are equivalent; therefore, points (9) and (10) satisfy Eqs. (5). Hence, the vectors ${{{\mathbf{r}}}_{{1,0}}}$ and ${{{\mathbf{r}}}_{{2,0}}}$ provide the minimum to function (2) at fixed rotation angles $\varphi $, $\theta $, and $\psi $. This completes the proof of Theorem 1.
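The translation step of the proof can also be read off from a direct decomposition of $U$. The following is a sketch, not part of the original derivation; the symbols ${{{\mathbf{c}}}_{j}}$, ${{{\mathbf{\tilde {r}}}}_{{j,i}}}$, and ${\mathbf{d}}$ are notation introduced here for illustration. Let ${{{\mathbf{c}}}_{j}} = \frac{1}{W}\sum\nolimits_{i = 1}^N \,{{w}_{i}}{{{\mathbf{r}}}_{{j,i}}}$ be the weighted centers from (4), let ${{{\mathbf{\tilde {r}}}}_{{j,i}}} = {{{\mathbf{r}}}_{{j,i}}} - {{{\mathbf{c}}}_{j}}$, and let ${\mathbf{d}} = {{{\mathbf{c}}}_{1}} - {{{\mathbf{r}}}_{{1,0}}} - Q({{{\mathbf{c}}}_{2}} - {{{\mathbf{r}}}_{{2,0}}})$. Each term of (2) then satisfies ${{{\mathbf{r}}}_{{1,i}}} - {{{\mathbf{r}}}_{{1,0}}} - Q({{{\mathbf{r}}}_{{2,i}}} - {{{\mathbf{r}}}_{{2,0}}}) = {{{\mathbf{\tilde {r}}}}_{{1,i}}} - Q{{{\mathbf{\tilde {r}}}}_{{2,i}}} + {\mathbf{d}}$, and because $\sum\nolimits_{i = 1}^N \,{{w}_{i}}{{{\mathbf{\tilde {r}}}}_{{j,i}}} = 0$, the cross term vanishes:

$$U = \sum\limits_{i = 1}^N \,{{w}_{i}}{{\left| {{{{\mathbf{\tilde {r}}}}_{{1,i}}} - Q{{{\mathbf{\tilde {r}}}}_{{2,i}}}} \right|}^{2}} + W{{\left| {\mathbf{d}} \right|}^{2}}.$$

The first term does not depend on the translations, so for a fixed $Q$ the minimum is attained exactly when ${\mathbf{d}} = 0$. The choice (9), (10) satisfies this condition, as does any other pair of vectors related by (8).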

Thus, the minimum of function (2) corresponds to translating the centers of mass of the geometric structures determined by formulas (10) and (9) to the origin. Now, function (2) can be considered as a function of the rotation angles:

$$U(\varphi ,\theta ,\psi ) = \sum\limits_{i = 1}^N \,{{w}_{i}}{{\left| {{{{\mathbf{r}}}_{{1,i}}} - {{{\mathbf{r}}}_{{1,0}}} - Q(\varphi ,\theta ,\psi )({{{\mathbf{r}}}_{{2,i}}} - {{{\mathbf{r}}}_{{2,0}}})} \right|}^{2}},$$

(11)

where ${{{\mathbf{r}}}_{{1,i}}}{\text{ and }}{{{\mathbf{r}}}_{{2,i}}}$ are the given coordinates of the geometric structures and the centers of mass ${{{\mathbf{r}}}_{{1,0}}}$ and ${{{\mathbf{r}}}_{{2,0}}}$ are found by formulas (10) and (9). Function (11) is minimized with respect to the angles $\varphi $, $\theta $, and $\psi $.

Suppose that the minimum of the function $U = U(\varphi ,\theta ,\psi )$ defined by (11) is attained at the point $({{\varphi }_{0}},{{\theta }_{0}},{{\psi }_{0}})$. The proximity measure between two geometric structures is defined by

$$s = {{\left( {\frac{{U({{\varphi }_{0}},{{\theta }_{0}},{{\psi }_{0}})}}{W}} \right)}^{{1/2}}}.$$

(12)

This quantity can be considered as a quantitative characteristic of the proximity of two geometric structures because it is the weighted root-mean-square distance between the points with identical indexes in the two structures after they have been brought into “coincidence.”

Two structures are said to be approximately equal if

$$s = {{\left( {\frac{1}{W}\mathop {\min}\limits_{\varphi ,\theta ,\psi } U(\varphi ,\theta ,\psi )} \right)}^{{1/2}}} = {{\left( {\frac{{U({{\varphi }_{0}},{{\theta }_{0}},{{\psi }_{0}})}}{W}} \right)}^{{1/2}}} \leqslant {{s}_{0}},$$

(13)

where ${{s}_{0}}$ is a given quantity (in applications, it is determined by the specific practical situations). The inequality $s \leqslant {{s}_{0}}$ is called the proximity criterion of two structures. In numerical computations, $s$ can have a computational error.

Note that if the minimizer is not unique and there exists a point $({{\varphi }_{1}},{{\theta }_{1}},{{\psi }_{1}})$ such that $U({{\varphi }_{1}},{{\theta }_{1}},{{\psi }_{1}}) = U({{\varphi }_{0}},{{\theta }_{0}},{{\psi }_{0}})$, then the quantity $s$ remains unchanged and the nonuniqueness of the minimizer does not affect the proximity criterion of geometric structures.

Now, the test for comparing the geometry of two geometric structures can be subdivided into three phases: (1) translate the center of mass of each geometric structure to the origin; (2) minimize function (11) with respect to the Euler angles; (3) calculate the quantity $s$ by formula (13) and draw a conclusion on the proximity of these structures.
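The three phases above can be sketched in Python as follows. This is only an illustration under stated assumptions: the names `rotation_matrix`, `rotate`, `centered`, `U`, and `compare` and the toy coordinates are hypothetical, and the angle search is a coarse grid followed by a simple compass (coordinate) search, used here as a plain stand-in for the Rosenbrock method rather than an implementation of it.

```python
import math

def rotation_matrix(phi, theta, psi):
    """Rotation matrix Q(phi, theta, psi) of form (3)."""
    cf, sf = math.cos(phi), math.sin(phi)
    ct, st = math.cos(theta), math.sin(theta)
    cp, sp = math.cos(psi), math.sin(psi)
    return [[cp * cf - sp * sf * ct, -cp * sf - sp * cf * ct,  sp * st],
            [sp * cf + cp * sf * ct, -sp * sf + cp * cf * ct, -cp * st],
            [sf * st,                 cf * st,                 ct]]

def rotate(Q, v):
    return [Q[r][0] * v[0] + Q[r][1] * v[1] + Q[r][2] * v[2] for r in range(3)]

def centered(points, w):
    """Phase 1: move the weighted center of mass (4) to the origin."""
    W = sum(w)
    c = [sum(wi * p[k] for p, wi in zip(points, w)) / W for k in range(3)]
    return [[p[k] - c[k] for k in range(3)] for p in points]

def U(angles, a, b, w):
    """Comparison function (11) for already centered structures a and b."""
    Q = rotation_matrix(*angles)
    total = 0.0
    for p, v, wi in zip(a, b, w):
        q = rotate(Q, v)
        total += wi * sum((p[k] - q[k]) ** 2 for k in range(3))
    return total

def compare(r1, r2, w, tol=1e-10):
    a, b = centered(r1, w), centered(r2, w)
    f = lambda x: U(x, a, b, w)
    # Phase 2 (stand-in optimizer): coarse grid start, then compass search.
    grid = [(2 * math.pi * i / 12, math.pi * (j + 0.5) / 6, 2 * math.pi * k / 12)
            for i in range(12) for j in range(6) for k in range(12)]
    x = list(min(grid, key=f))
    fx, step = f(x), 0.25
    while step > tol:
        improved = False
        for i in range(3):
            for sign in (1.0, -1.0):
                y = list(x)
                y[i] += sign * step
                fy = f(y)
                if fy < fx:
                    x, fx, improved = y, fy, True
        if not improved:
            step *= 0.5          # no coordinate move helps: refine the step
    # Phase 3: proximity measure (12).
    return math.sqrt(fx / sum(w)), x

# Toy data (hypothetical): the second structure is a rotated, shifted copy.
R = [[0.0, 0.0, 0.0], [1.5, 0.0, 0.0], [0.0, 1.2, 0.0], [0.0, 0.0, 1.0], [1.0, 1.0, 1.0]]
Q0 = rotation_matrix(math.pi / 3, math.pi / 6, math.pi / 2)
r1 = [[p[0] + 1.0, p[1] - 2.0, p[2] + 0.5] for p in R]
r2 = [[c + d for c, d in zip(rotate(Q0, p), (3.0, 0.0, -1.0))] for p in R]
s, angles = compare(r1, r2, [1.0] * 5)
print(s, [math.degrees(t) for t in angles])
```

The grid start is a design choice for this sketch: the function (11) is not convex in the angles, and a local search started from an arbitrary point is not guaranteed to reach the global minimizer.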

Two geometric structures are called equal if their points with identical indexes can be brought into coincidence by moving these structures as rigid bodies. It is well known (see [8]) that such a movement can be made by a translation and an orthogonal rotation.

Test (1)–(3) for comparing two geometric structures is valid if two equal structures with different coordinates ${{{\mathbf{r}}}_{{1,i}}}$ and ${{{\mathbf{r}}}_{{2,i}}}$ are brought into coincidence at the minimizer of function (11), i.e., if the proximity measure between two equal structures is $s = 0$.

Theorem 2. The minimum of the nonnegative function (11) is zero if and only if two geometric structures are equal.

Proof. Let two equal structures be given. This means that they can be brought into coincidence at the points with identical indexes and their centers of mass can be moved to the origin. In this case, the points of these structures with identical indexes have identical coordinates determined by the vectors ${{{\mathbf{R}}}_{i}}$, $i = 1,\; \ldots ,\;N$. Equal structures in which the points with identical indexes have different coordinates can be obtained by moving these structures apart, i.e., by rotating the second structure relative to the first one and displacing each structure. More precisely, rotate the second structure relative to the first one through the angles $\bar {\varphi }$, $\bar {\theta }$, and $\bar {\psi }$; and then move the center of mass of the first structure to the point determined by the vector ${{{\mathbf{R}}}_{{1,0}}}$ and the center of mass of the second structure to the point determined by the vector ${{{\mathbf{R}}}_{{2,0}}}$. Then, the coordinates of the points of the first and the second structures will be

$${{{\mathbf{r}}}_{{1,i}}} = {{{\mathbf{R}}}_{i}} + {{{\mathbf{R}}}_{{1,0}}},\quad {{{\mathbf{r}}}_{{2,i}}} = {{Q}_{0}}(\bar {\varphi },\bar {\theta },\bar {\psi }){{{\mathbf{R}}}_{i}} + {{{\mathbf{R}}}_{{2,0}}},\quad i = 1, \ldots ,N,$$

(14)

where ${{Q}_{0}}(\bar {\varphi },\bar {\theta },\bar {\psi })$ is a matrix of form (3). It is clear that the coordinates ${{{\mathbf{r}}}_{{1,i}}}$, ${{{\mathbf{r}}}_{{2,i}}}$ in (14) can specify an arbitrary position of two equal structures in ${{R}^{3}}$.

Now assume that the coordinates of two equal structures with, generally, different coordinates (14) of points with identical indexes are given, i.e., $\sum\nolimits_{i = 1}^N \,{{w}_{i}}{{\left| {{{{\mathbf{r}}}_{{1,i}}} - {{{\mathbf{r}}}_{{2,i}}}} \right|}^{2}} \geqslant 0$. Taking into account that the coordinates of two equal structures can be represented in form (14), we apply to these two structures algorithm (1)–(3).

(1) Calculate the centers of mass of two equal structures. According to (10) and (14), we have for the center of mass of the first structure

$${{{\mathbf{r}}}_{{1,0}}} = \frac{1}{W}\sum\limits_{i = 1}^N \,{{w}_{i}}{{{\mathbf{r}}}_{{1,i}}} = \frac{1}{W}\sum\limits_{i = 1}^N \,{{w}_{i}}({{{\mathbf{R}}}_{i}} + {{{\mathbf{R}}}_{{1,0}}}) = {{{\mathbf{R}}}_{{1,0}}},$$

(15)

because ${{{\mathbf{R}}}_{i}}$ are the coordinates of the structure points relative to the center of mass and, therefore, $\sum\nolimits_{i = 1}^N \,{{w}_{i}}{{{\mathbf{R}}}_{i}} = 0$. Similarly, for the center of mass of the second structure, we obtain from (9) and (14) that

$${{{\mathbf{r}}}_{{2,0}}} = \frac{1}{W}\sum\limits_{i = 1}^N \,{{w}_{i}}{{{\mathbf{r}}}_{{2,i}}} = \frac{1}{W}\sum\limits_{i = 1}^N \,{{w}_{i}}({{Q}_{0}}{{{\mathbf{R}}}_{i}} + {{{\mathbf{R}}}_{{2,0}}}) = \frac{1}{W}\left( {{{Q}_{0}}\sum\limits_{i = 1}^N \,{{w}_{i}}{{{\mathbf{R}}}_{i}} + \left( {\sum\limits_{i = 1}^N \,{{w}_{i}}} \right){{{\mathbf{R}}}_{{2,0}}}} \right) = {{{\mathbf{R}}}_{{2,0}}}.$$

(16)

(2) Taking into account (14), (15), and (16), the comparison function (11) takes the form

$$\begin{gathered} U(\varphi ,\theta ,\psi ) = \sum\limits_{i = 1}^N \,{{w}_{i}}{{\left| {{{{\mathbf{r}}}_{{1,i}}} - {{{\mathbf{r}}}_{{1,0}}} - Q(\varphi ,\theta ,\psi )({{{\mathbf{r}}}_{{2,i}}} - {{{\mathbf{r}}}_{{2,0}}})} \right|}^{2}} \\ = \sum\limits_{i = 1}^N \,{{w}_{i}}{{\left| {{{{\mathbf{R}}}_{i}} + {{{\mathbf{R}}}_{{1,0}}} - {{{\mathbf{R}}}_{{1,0}}} - Q({{Q}_{0}}{{{\mathbf{R}}}_{i}} + {{{\mathbf{R}}}_{{2,0}}} - {{{\mathbf{R}}}_{{2,0}}})} \right|}^{2}} = \sum\limits_{i = 1}^N \,{{w}_{i}}{{\left| {{{{\mathbf{R}}}_{i}} - Q{{Q}_{0}}{{{\mathbf{R}}}_{i}}} \right|}^{2}}{\kern 1pt} . \\ \end{gathered} $$

(17)

It is clear that the minimum of the function $U,$ which is equal to zero, is attained, according to (1), at

$${{{\mathbf{R}}}_{{{{i}_{k}}}}} - Q(\varphi ,\theta ,\psi ){{Q}_{0}}(\bar {\varphi },\bar {\theta },\bar {\psi }){{{\mathbf{R}}}_{{{{i}_{k}}}}} = 0,\quad k = 1, \ldots ,K.$$

This implies that $Q{{Q}_{0}} = E$ and $Q = Q_{0}^{{ - 1}}$. Since the rotation matrix ${{Q}_{0}}$ is orthogonal, its inverse coincides with its transpose, $Q_{0}^{{ - 1}} = Q_{0}^{T}$; hence, the rotation matrix $Q$ that delivers the minimum of the function $U$ has the form $Q(\varphi ,\theta ,\psi ) = Q_{0}^{T}(\bar {\varphi },\bar {\theta },\bar {\psi })$. Since the matrices $Q$ and ${{Q}_{0}}$ can be represented in form (3), it is easy to verify that $Q_{0}^{T}(\bar {\varphi },\bar {\theta },\bar {\psi }) = Q(\pi - \bar {\psi },\bar {\theta },\pi - \bar {\varphi })$ and, therefore,

$$Q(\varphi ,\theta ,\psi ) = Q_{0}^{T}(\bar {\varphi },\bar {\theta },\bar {\psi }) = Q(\pi - \bar {\psi },\bar {\theta },\pi - \bar {\varphi }).$$

(18)

This implies that, for two equal structures, the minimum of the function $U$, which equals zero, is attained at the point $(\varphi ,\theta ,\psi ) = (\pi - \bar {\psi },\bar {\theta },\pi - \bar {\varphi })$. These structures can be brought into coincidence using translations (15), (16), and rotation (18).
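Identity (18) can be checked numerically for particular angles. The short Python sketch below does this for one choice of $(\bar {\varphi },\bar {\theta },\bar {\psi })$; the helper names `rotation_matrix`, `transpose`, and `max_abs_diff` are assumptions made for this example.

```python
import math

def rotation_matrix(phi, theta, psi):
    """Rotation matrix Q(phi, theta, psi) of form (3)."""
    cf, sf = math.cos(phi), math.sin(phi)
    ct, st = math.cos(theta), math.sin(theta)
    cp, sp = math.cos(psi), math.sin(psi)
    return [[cp * cf - sp * sf * ct, -cp * sf - sp * cf * ct,  sp * st],
            [sp * cf + cp * sf * ct, -sp * sf + cp * cf * ct, -cp * st],
            [sf * st,                 cf * st,                 ct]]

def transpose(Q):
    return [[Q[c][r] for c in range(3)] for r in range(3)]

def max_abs_diff(A, B):
    return max(abs(A[r][c] - B[r][c]) for r in range(3) for c in range(3))

# Angles of the rotation that moved the structures apart (example values).
phi_b, theta_b, psi_b = math.pi / 3, math.pi / 6, math.pi / 2

lhs = transpose(rotation_matrix(phi_b, theta_b, psi_b))           # Q0^T
rhs = rotation_matrix(math.pi - psi_b, theta_b, math.pi - phi_b)  # right side of (18)
print(max_abs_diff(lhs, rhs))   # difference at the level of rounding error

# Minimizing angles predicted by (18), in degrees.
print([round(math.degrees(t), 2) for t in (math.pi - psi_b, theta_b, math.pi - phi_b)])
# prints [90.0, 30.0, 120.0]
```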

The converse assertion of the theorem is obvious because $U = 0$ implies that the coordinates of each point $i$ of the first structure with the weight ${{w}_{i}} \ne 0$ in (11) coincide with the coordinates of the second structure; i.e., the points and the structures as a whole are brought into coincidence by displacing the first structure by ${{{\mathbf{r}}}_{{1,0}}}$, the second structure by ${{{\mathbf{r}}}_{{2,0}}}$, and the rotation of the second structure through the angles $\varphi ,\;\theta ,\;{\text{and}}\;\psi $ determined by the matrix $Q$. This completes the proof of Theorem 2.

Thus, according to Theorem 2, the comparison algorithm (1)–(3), which minimizes the comparison function, identifies equal structures and, therefore, provides an objective estimate of the difference of geometry of two “close” structures.

While applying algorithm (1)–(3) for comparing two structures, we will numerically minimize the comparison function $U(\varphi ,\theta ,\psi )$ with respect to the Euler angles after bringing the centers of mass into coincidence.

4 NUMERICAL SOLUTION OF THE OPTIMIZATION PROBLEM

In this section, we present numerical results of comparing geometric structures by minimizing the comparison function (11). Since (11) is not a convex function, we used the zero-order Rosenbrock method [3, 4], which proved to be effective in structural chemistry [7]. In the computer program implementing algorithm (1)–(3) designed for the minimization of the comparison function (11) using the Rosenbrock method, we used the optimization library [9].

We demonstrate the effectiveness of algorithm (1)–(3) described and justified above using structural chemistry applications as examples. We will consider molecules with ordered arrangement of the point atoms as geometric structures. When comparing molecules with the same structural formula, we are interested in the difference in their spatial structure.

In the examples discussed below, we present the results of comparing lactide molecules C₆H₈O₄ studied in [10]; these molecules consist of $N = 10$ main atoms (four oxygen atoms and six carbon atoms). The coordinates of the hydrogen atoms were not involved in the computations because the accuracy of their determination is lower than that of other atoms, and they are irrelevant for the problem under examination. The geometry of these molecules is shown in Fig. 1. The coordinates of the molecules were obtained by X-ray structural analysis. In our experiments, the atoms to be brought into coincidence were assigned the weights ${{w}_{i}} = 1$.

Example 1. To check the validity of the comparison algorithm (1)–(3) and the effectiveness of the program, we consider the example of bringing two identical lactide molecules into coincidence. The initial coordinates ${{{\mathbf{r}}}_{{1,i}}}$, ${{{\mathbf{r}}}_{{2,i}}}$ $(i = 1,...,10)$ of the atoms of two molecules were obtained using formulas (14) and (3). The coordinates of the first molecule atoms were obtained by displacing the coordinates of the original molecule atoms; the coordinates of the second molecule atoms were obtained from the coordinates of the original molecule atoms by displacing them and rotating through the angles $(\bar {\varphi },\bar {\theta },\bar {\psi }) = (\pi {\text{/}}3,\pi {\text{/}}6,\pi {\text{/}}2)$. Table 1 shows the values of coordinates of the first and the second molecules in ångströms. The minimizer of the function $U$ is found by the Rosenbrock method to be $({{\varphi }_{0}},{{\theta }_{0}},{{\psi }_{0}}) = (90.00^\circ ,30.00^\circ ,120.00^\circ )$, which agrees with (18). The measure of proximity (which is called proximity characteristic in chemistry) calculated by formula (12) was $s = 8.19 \times {{10}^{{ - 8}}}$ Å. The ratio of $s$ to the minimum distance ${{R}_{{{\text{min}}}}}$ between the atoms in the molecules

$${{R}_{{{\text{min}}}}} = \mathop {\min}\limits_{i > j} \left| {{{{\mathbf{r}}}_{{1,i}}} - {{{\mathbf{r}}}_{{1,j}}}} \right| = 1.20~\;{{{\AA}}}$$

is $s{\text{/}}{{R}_{{{\text{min}}}}} = 6.83 \times {{10}^{{ - 8}}}$, which is practically equal to zero. Thus, algorithm (1)–(3) perfectly brings identical molecules into coincidence.

Table 1. Coordinates of atoms (in Å) of two identical lactide molecules


Example 2. Comparison of three lactide molecules. Table 2 shows the coordinates of the oxygen ${{O}_{j}}$, $j = 1,\; \ldots ,\;4$ and carbon ${{C}_{j}}$, $j = 1,\; \ldots ,\;6$ atoms in ångströms in three symmetrically independent (within the same crystal) lactide molecules C₆H₈O₄, one of which is shown in Fig. 1.

Table 2. Coordinates of atoms (in Å) of three lactide molecules C₆H₈O₄


Application of algorithm (1)–(3) for comparing molecules 1 and 2, 1 and 3, and 2 and 3 gives the results presented in Table 3. The minimizing angles produced by these computations are also presented in Table 3. Practical experience of examining conformation of molecules based on results of comparing a large number of structures [11] yielded the following arbitrary classification: $s \leqslant {{s}_{0}} = 0.1$ Å indicates that the molecules are almost identical, $0.1\;{\AA} < s \leqslant 0.2$ Å indicates that the molecules are close to each other, and $s > 0.2$ Å indicates that the molecules are different.
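The classification just described is easy to express as a small helper. In this Python sketch the thresholds 0.1 Å and 0.2 Å are the ones quoted in the text, while the function name `classify` and its parameter names are assumptions made for illustration.

```python
def classify(s, s0=0.1, s1=0.2):
    """Classify a proximity measure s (in angstroms) by the thresholds in the text."""
    if s <= s0:
        return "almost identical"
    if s <= s1:
        return "close"
    return "different"

# Values of s reported in Examples 2 and 3 of the text.
print(classify(0.11))    # prints "close"
print(classify(0.009))   # prints "almost identical"
```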

Table 3. The residuals Δr_i, the characteristic s, and the Euler angles φ, θ, ψ for the minimizer of the comparison function


The analysis of residuals $\Delta {{r}_{i}} = \left| {{{{\mathbf{r}}}_{{1,i}}} - {{{\mathbf{r}}}_{{2,i}}}} \right|,\;i = 1, \ldots ,10$, i.e., the distances between the atoms with identical indexes after the molecules are brought into coincidence (at the minimizer of $U$) and of the quantity $s$ for ${{s}_{0}} = 0.1$ Å suggests that the first and the third molecules, as well as the second and the third molecules have almost identical geometries because in both cases $s < {{s}_{0}}$. The greatest differences are observed between the first and the second molecules $(s = 0.11\;{\AA} > {{s}_{0}})$; these molecules can be considered close to each other. It is seen from Table 3 that the maximum residuals are characteristic of substituent atoms (atoms outside the ring). To demonstrate these differences more clearly, we performed an additional computation by assigning the zero weight ${{w}_{i}} = 0$ to the substituents. In this case, we obtained $s < 0.04$ Å. This indicates that the rings in the molecules are practically identical. The detected differences in the position of the atoms in the first and the second molecules when the rings are brought into coincidence are seen in Fig. 2, which was produced using the coordinates of molecules at the minimizer of the comparison function $U$.
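The role of the weights in this analysis can be illustrated on coordinates that are already superposed. The Python sketch below computes the residuals $\Delta {{r}_{i}}$ and the weighted quantity $s$ for hypothetical toy points (not the lactide data); the names `residuals` and `proximity` are assumptions. Note that in the full algorithm the weights also enter the centers of mass and the minimization itself, which this sketch leaves out.

```python
import math

def residuals(a, b):
    """Distances between points with identical indexes."""
    return [math.dist(p, q) for p, q in zip(a, b)]

def proximity(a, b, w):
    """Weighted root-mean-square residual, as in formula (12)."""
    W = sum(w)
    return math.sqrt(sum(wi * d * d for wi, d in zip(w, residuals(a, b))) / W)

# Toy data: two "ring" points coincide, one "substituent" point deviates.
a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
b = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.3)]

print(residuals(a, b))
print(proximity(a, b, [1, 1, 1]))   # all points counted
print(proximity(a, b, [1, 1, 0]))   # substituent given zero weight
```

With the zero weight, the deviating point no longer contributes to $s$, which mirrors the additional computation described above.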

Example 3. Checking the intrinsic symmetry of a molecule.

Using the comparison algorithm (1)–(3), we checked the intrinsic symmetry of the lactide molecule depicted in Fig. 1. The assumed second-order symmetry axis passes vertically through the ring center. To apply algorithm (1)–(3), we formed the “second” molecule by changing the indexing of the atoms. It is seen in Fig. 1 that, due to the assumed symmetry, atom 1 in the second molecule must correspond to atom 2 in the first molecule, atom 5 must correspond to atom 7, and so on. The reindexing of atoms in the second molecule is shown in Table 4. The results of comparing the “two” molecules are also shown in Table 4. The comparison characteristic in this case is $s = 0.009\;{\AA} < {{s}_{0}} = 0.1$ Å; hence, the molecules can be considered identical and, therefore, the original molecule has second-order symmetry with a high degree of accuracy.
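The reindexing idea can be illustrated on a toy structure with an exact twofold axis. In the Python sketch below, the coordinates, the permutation `perm`, and the function names are hypothetical and are not the lactide data or the reindexing of Table 4; the half-turn about the $z$ axis is applied by hand instead of being found by minimization.

```python
import math

def centered(points):
    """Move the center of mass (unit weights) to the origin."""
    n = len(points)
    c = [sum(p[k] for p in points) / n for k in range(3)]
    return [[p[k] - c[k] for k in range(3)] for p in points]

def proximity(a, b):
    """s of form (12) with unit weights for already superposed points."""
    return math.sqrt(sum(math.dist(p, q) ** 2 for p, q in zip(a, b)) / len(a))

# Hypothetical toy "molecule" with a twofold axis along z.
mol = [[1.0, 0.2, 0.3], [-1.0, -0.2, 0.3], [0.5, -0.8, -0.1], [-0.5, 0.8, -0.1]]
perm = [1, 0, 3, 2]                    # assumed reindexing: 1<->2, 3<->4
second = [mol[j] for j in perm]        # the "second" molecule

a, b = centered(mol), centered(second)
half_turn = [[-x, -y, z] for x, y, z in b]   # rotation by 180 degrees about z
s = proximity(a, half_turn)
print(s)
```

A value of $s$ near zero indicates that the reindexed structure can be superposed on the original one, which is the symmetry test described above.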

Table 4. The residuals Δr_i, the characteristic s, and the Euler angles φ, θ, ψ for the minimizer of the comparison function when checking the intrinsic symmetry of molecule 1


The computations described above were performed using the computer program COMPARISON, written in C# by N.V. Bakhtadze, to whom the authors are grateful.

REFERENCES

1. F. P. Vasil’ev, Optimization Methods (Faktorial, Moscow, 2002) [in Russian].
2. A. F. Izmailov and M. V. Solodov, Numerical Optimization Methods (Fizmatlit, Moscow, 2005) [in Russian].
3. D. Himmelblau, Applied Nonlinear Programming (McGraw-Hill, New York, 1971).
4. B. Yu. Lemeshko, Optimization Methods (Novosibirsk State Univ, Novosibirsk, 2009) [in Russian].
5. N. Yu. Chernikova, “Manifestation of conformational differences of molecules,” Vestn. Tambov Gos. Univ., Ser. Estestv. Techn. Nauki 18, 1252–1254 (2013).
6. A. I. Lurie, Analytical Mechanics (Springer, Berlin, 2002).
7. A. E. Razumaeva and P. M. Zorkii, “Computer programs for chemical analysis of crystal structures containing symmetrically independent molecules,” Vestn. Mos. Gos. Univ, Ser. Khim. 21 (1), 27–30 (1980).
8. S. P. Novikov and I. A. Taimanov, Modern Geometric Structures and Fields (Mosc. Tsentr Nepreryvnogo Mat. Obrazovaniya, Moscow, 2014) [in Russian].
9. K. Kniaz, Optimization.NET. http://www.kniaz.net/software/rosnm.aspx
10. B. G. Belen’kaya, V. K. Bel’skii, A. I. Dement’ev, V. I. Sakharova, and N. Yu. Chernikova, “Crystal and molecular structures of glycolide and lactide: Association by hydrogen bonds CH…O,” Crystallography 42, 449–452 (1997).
11. E. E. Lavut, P. M. Zorkii, and N. Yu. Chernikova, “Contact conformers in crystals of coordination compounds,” J. Struct. Chem. 22, 715–718 (1981).

ACKNOWLEDGMENTS

This work was supported by the Ministry for Science and Education of Russian Federation, project no. 1.962.2017/4.6, and by the Russian Foundation for Basic Research, project no. 18-01-00590.

Author information

Authors and Affiliations

Peoples’ Friendship University of Russia, 117198, Moscow, Russia
E. B. Laneev & N. Yu. Chernikova


Corresponding authors

Correspondence to E. B. Laneev or N. Yu. Chernikova.

Additional information

Translated by A. Klimontovich


About this article

Cite this article

Laneev, E.B., Chernikova, N.Y. Comparing the Spatial Structure of Molecules by Minimizing a Comparison Function. Comput. Math. and Math. Phys. 59, 128–135 (2019). https://doi.org/10.1134/S0965542519010135


Received: 24 August 2018
Published: 14 May 2019
Issue Date: January 2019
DOI: https://doi.org/10.1134/S0965542519010135
