On the accuracy of population analyses based on fitted densities

de la Lande, Aurélien; Clavaguéra, Carine; Köster, Andreas

doi:10.1007/s00894-017-3264-5

Original Paper
Published: 02 March 2017

Volume 23, article number 99, (2017)

Journal of Molecular Modeling

Abstract

Population analyses are part of the theoretical chemist’s toolbox. They provide means to extract information about the distribution of the electronic density within molecules or solids. The values of atomic multipoles in a molecule can shed light on its electrostatic properties and may help, for instance, to predict how different molecules interact or to rationalize chemical reactivity. Because they are not physical observables to which a quantum mechanical operator can be associated, atomic charges and higher order atomic multipoles cannot be defined unambiguously in a molecule, and several population schemes (PS) have therefore been devised over the last decades. In the context of density functional theory (DFT), PS based on the electron density seem to be the best grounded. In particular, several groups have proposed iterative schemes whose outcomes are very encouraging. Modern implementations of DFT, for example those based on density fitting techniques, permit the investigation of molecular systems comprising hundreds of atoms. However, population analyses following iterative schemes may become very CPU time consuming for such large systems. In this article, we investigate whether the computationally less expensive analysis of the variationally fitted electronic density can safely replace the analysis of the Kohn-Sham (KS) density. It is shown that as long as flexible auxiliary function sets including f and g functions are used, the multipoles extracted from the fitted densities are extremely close to those obtained from the KS density. We further assess whether the multipoles obtained through Hirshfeld’s approach, in its standard or iterative form, are useful for calculating interaction energies in non-covalent complexes. Relative energies computed with the AMOEBA polarizable force field combined with iterative Hirshfeld multipoles are encouraging.

Introduction

Electronic population analyses (PA) represent a powerful means to connect the outputs of quantum chemistry computations to chemical knowledge. Atomic charges may be useful to rationalize chemical reactivity, for example to investigate substituent effects on an energy profile. Atomic charges and higher order multipoles are also key quantities for molecular mechanics force fields. The possibility of extracting reliable multipoles from quantum chemistry computations is a topic of high interest for the development of second or third generation force fields and hybrid QM/MM schemes [1–3]. Various quantum chemistry methodologies also rely on atomic charges. For the sake of illustration, let us mention constrained density functional theory (DFT), whereby atomic charges are imposed on some molecular fragments in order to define diabatic states at the DFT level [4, 5]. Such diabatic states have proved useful for modeling electron transfer processes or charge transfers within non-covalent complexes [6].

It is well known that there is no unique way to define an atom in a polyatomic molecule, and hence to define atomic charges. This is because atomic charges are not physical observables. Despite this ambiguity, atom centered multipole expansions are rather reliable if the problem of atomic multipole invariance is properly addressed. A convenient way to do so is the definition of cumulative atomic multipole moments [7, 8]. These moments are built up from atomic charges, which are invariant to coordinate transformation. Because the convergence of the atom centered multipole expansion depends on the quality of the underlying atomic charges [9], population analyses are important ingredients for the development of atom centered multipole expansions with only a limited number of expansion terms. A critical point in any population scheme is how to divide the real or function space in order to distribute the electron density over the atoms. It is customary to classify population schemes into several categories. The Mulliken [10, 11] and Löwdin [12] approaches, as well as the more elaborate natural population analysis [13] or natural bond orbital (NBO) [14] analysis, define atomic charges by division of the function space spanned by the atomic or molecular orbitals. Another family encompasses the Becke [15], Hirshfeld [16], and Voronoi deformation density [17] approaches. These schemes are based on a real space partitioning of the electron density itself. Several groups are working to improve the quality of the charges produced by these approaches [18–20]. For example, iterative schemes have been proposed, such as the Hirshfeld-I [21], Hirshfeld-λI [22], fractional occupation Hirshfeld-I [23] or Stockholder-I [24] methods. We also mention the methods in which the charges are obtained through the integration of the electronic density over topological basins of well-suited functions [25]. These functions may be the electronic density (Bader’s atoms-in-molecules theory [26]) or the electron localization function [25]. For the sake of completeness, we finally mention methods like the Merz-Singh-Kollman scheme, which consists in fitting the atomic charges so as to reproduce the electrostatic potential created by the molecule [27–29]. These methods have been widely employed to develop molecular mechanics force fields like AMBER [30] or CHARMM [31].

In this paper, we focus on methods based on the real space integration of the density in the context of DFT. DFT calculations have become the most widely used quantum chemistry approaches because they provide an excellent cost/quality ratio. Indeed, systems containing up to a few hundred atoms are nowadays accessible to DFT. This performance has become possible thanks to the development of ingenious algorithms for solving the Kohn-Sham equations. In particular, methods relying on fitted densities in addition to the Kohn-Sham density (e.g., variational density fitting [32, 33] and Cholesky decomposition of the density matrix [34]) eliminate the cumbersome evaluation of four-center electron repulsion integrals.

Performing population analyses on optimized electronic densities is usually not a CPU demanding task compared to the self-consistent-field (SCF) procedure. However, if iterative schemes are employed, this task can become a computational bottleneck. This limitation can become critical for systems comprising tens or hundreds of atoms, as commonly investigated with present-day DFT approaches. One possible strategy to overcome these limitations is to develop more efficient parallelization and/or grid techniques. An alternative, although not exclusive, approach is to perform analyses based on auxiliary function densities instead of Kohn-Sham densities. Even though this procedure seems rather straightforward, it must be kept in mind that auxiliary densities, as obtained in density fitting approaches, are not designed to mimic the Kohn-Sham orbital density, but to provide a density from which a mathematically simpler electron-electron repulsion energy term can be calculated, avoiding explicit four-center integrals. There is therefore no guarantee that auxiliary densities can be used in lieu of Kohn-Sham densities in population analyses.

The objectives of this paper are twofold. First, we wish to assess whether auxiliary densities are suited for population analyses. By this, we mean extracting not only monopoles (atomic charges) but also atomic dipoles and quadrupoles. We have considered large sets of organic molecules in our tests. Second, we wish to investigate the capabilities of various population schemes, including some of the most advanced ones, to produce electrostatic multipoles that may be used for interaction energy calculations. Reliable schemes would then be valuable for second or third generation force fields or for accurate hybrid DFT/MM (molecular mechanics) approaches. As an example, relative energies of different structures of a complex formed by a tryptamine molecule, a water molecule and a sodium cation are computed with the AMOEBA [35, 36] force field using multipoles extracted from DFT population analyses.

The article is organized in three parts. We first present the new population schemes we have introduced in deMon2k. We then report extensive benchmark calculations on sets of organic molecules. We finally report the performances of the population schemes to produce atomic multipoles that can be used in the AMOEBA force field to reproduce structures and non-covalent energies.

Population schemes

The population schemes described below have been implemented in a new version of the program deMon2k [37]. They can be classified into two categories. The first category refers to population analyses that define atomic charges from the number of electrons belonging to each atom and its nuclear charge. The charge on atom A reads:

$$ {Q}_A={Z}_A-{N}_A $$

(1)

where $N_A$ represents the number of electrons on atom A and $Z_A$ its nuclear charge. In the second category, the charge is defined from a deformation density, i.e., the difference between the converged SCF density and a reference density.

$$ {Q}_A={N}_A^{ref}-{N}_A $$

(2)

Typically, $N_A$ and $N_A^{ref}$ refer to the number of electrons of atom A obtained from the SCF electronic density and from the so-called promolecular density, which is the superposition of non-interacting atomic densities, respectively. The latter scheme will be called deformation density analysis. In both approaches, $N_A$ is obtained by numerical integration of the electronic density over a grid of points:

$$ {N}_A={\displaystyle \sum_i}\rho \left({r}_i\right){\omega}_q\left({r}_i\right){\omega}_A\left({r}_i\right) $$

(3)

where the index i loops over grid points. We have chosen Lebedev grids for the angular integration in combination with an Euler-Maclaurin radial quadrature scheme. The grids used here are identical to the default fixed grids in deMon2k for coarse, medium, and fine integration accuracy. In the above formula, $\omega_q$ collects all quadrature weights, angular and radial ones, whereas $\omega_A$ is an atomic weight function for the real space partition into atomic cells. Five variants have been implemented in deMon2k. These are the Voronoi (V), Becke (B) [15], Hirshfeld (H) [16], iterative Hirshfeld (IH) [21] and, finally, the iterative Hirshfeld with fractional occupation numbers (IHFO) [23] partition schemes. The Voronoi cell of atom A is defined by all grid points that are closer to nucleus A than to any other nucleus. Therefore, the $\omega_A$ function takes the form:

$$ {\omega}_A^V\left({r}_i\right)=1\kern0.75em \mathrm{if}\kern0.5em \left|{r}_i-{r}_A\right|<\left|{r}_i-{r}_X\right|\ \forall\ X\ne A $$

(4)

$$ {\omega}_A^V\left({r}_i\right)=0\kern0.75em \mathrm{if}\kern0.5em \left|{r}_i-{r}_A\right|>\left|{r}_i-{r}_X\right|\ \mathrm{for}\ \mathrm{some}\ X\ne A $$

(5)

$r_i$, $r_A$ and $r_X$ are the positions of grid point i and of nuclei A and X, respectively. The Voronoi scheme divides space into non-overlapping polyhedra. The Becke atomic cells are defined from the Voronoi cells by making them slightly overlapping [15]. This is achieved by introducing a smoothing function to define fuzzy cell borders:

$$ {\omega}_A^B\left({r}_i\right)=\frac{P_A\left({r}_i\right)}{{\displaystyle {\sum}_X}{P}_X\left({r}_i\right)} $$

(6)

The cell functions $P_A(r_i)$ and $P_X(r_i)$ are defined by:

$$ {P}_A\left({r}_i\right)={\displaystyle \prod_{B\ne A}} s\left({\mu}_{A B}\right) $$

(7)


The “soft” step function $s(\mu_{AB})$ is obtained by a threefold iteration of the polynomial $ p\left({\mu}_{AB}\right)=\frac{3}{2}{\mu}_{AB}-\frac{1}{2}{\mu}_{AB}^3 $. The elliptic coordinate appearing here,

$$ {\mu}_{A B}=\frac{r_A-{r}_B}{R_{A B}}, $$

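is defined in the local coordinate system of the atom pair A and B, as depicted in Fig. 1. Here $r_A$ and $r_B$ denote the distances from the grid point to nuclei A and B, and $R_{AB}$ is the internuclear distance.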
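As an illustration of Eqs. (4) to (7), the following Python sketch evaluates Voronoi and Becke weights on arbitrary grid points. It is a minimal sketch under stated assumptions, not the deMon2k implementation: the function names are hypothetical, the mapping s = (1 - f)/2 from the iterated polynomial f to the step function follows Becke's original scheme [15], and atomic-size adjustments for heteronuclear pairs are omitted.

```python
import numpy as np

def becke_step(mu):
    """Soft step s(mu) built from three iterations of p(mu) = 1.5*mu - 0.5*mu**3."""
    f = mu
    for _ in range(3):
        f = 1.5 * f - 0.5 * f**3
    # Assumption: s = (1 - f) / 2 as in Becke's original scheme.
    return 0.5 * (1.0 - f)

def _distances(points, nuclei):
    """dist[i, A] = |r_i - r_A| for grid points (n, 3) and nuclei (m, 3)."""
    return np.linalg.norm(points[:, None, :] - nuclei[None, :, :], axis=2)

def becke_weights(points, nuclei):
    """Becke weights, Eqs. (6)-(7); returns an (n_points, n_atoms) array."""
    dist = _distances(points, nuclei)
    n_atoms = len(nuclei)
    cell = np.ones_like(dist)
    for a in range(n_atoms):
        for b in range(n_atoms):
            if a == b:
                continue
            r_ab = np.linalg.norm(nuclei[a] - nuclei[b])
            mu = (dist[:, a] - dist[:, b]) / r_ab  # elliptic coordinate
            cell[:, a] *= becke_step(mu)
    return cell / cell.sum(axis=1, keepdims=True)

def voronoi_weights(points, nuclei):
    """Voronoi weights, Eqs. (4)-(5): 1 for the nearest nucleus, 0 otherwise."""
    dist = _distances(points, nuclei)
    weights = np.zeros_like(dist)
    weights[np.arange(len(points)), dist.argmin(axis=1)] = 1.0
    return weights
```

For a homonuclear diatomic, a point at the bond midpoint receives Becke weights of 0.5 for each atom, while a point located on one nucleus is assigned entirely to that atom. Points exactly equidistant from two nuclei are not covered by Eqs. (4) and (5); this sketch assigns them to the first nearest nucleus found.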

For atom A, $\omega_A^B$ equals unity close to its nucleus but rapidly drops to zero when approaching the border of the Voronoi cell of the atom. Both the Voronoi and Becke schemes are based on geometrical considerations only. The chemical nature of the atoms composing the molecule of interest never enters into the definition of the atomic cells, and hence into the definition of the atomic charges. As a consequence, these population schemes may produce atomic charges that are not satisfactory from a chemical point of view. For example, charges on hydrogen atoms typically take values around -0.5, simply because the Voronoi/Becke cells of hydrogen atoms extend to half the length of the bonds in which they are engaged. The H, IH, and IHFO schemes constitute improvements in that regard. For these three schemes, the integration weights are functions of atomic reference densities, $\rho_A^{ref}$:

$$ {\omega}_A^H\left({r}_i\right)=\frac{\rho_A^{r ef}\left({r}_i\right)}{{\displaystyle {\sum}_X}{\rho}_X^{r ef}\left({r}_i\right)} $$

(8)

The denominator in Eq. (8), $ {\displaystyle {\sum}_X}{\rho}_X^{ref} $, defines the so-called promolecular density. There is some freedom in defining the $\rho_X^{ref}$ functions. In the standard Hirshfeld scheme, the $\rho_X^{ref}$ are the densities of neutral atoms. Other choices are acceptable; for example, the $\rho_X^{ref}$ may be the densities of isolated ions. This non-uniqueness of the reference density is a drawback of the standard Hirshfeld scheme. In deMon2k, the $\rho_X^{ref}$ are obtained by performing SCF calculations on spherically averaged neutral atoms. The weight $\omega_A^H$ is close to unity near atom A and progressively decays to zero when approaching other nuclei. Another drawback of the standard Hirshfeld partition is that atomic charges are generally close to zero.
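The Hirshfeld weights of Eq. (8) and the resulting charges of Eqs. (1) and (3) can be sketched in a few lines of Python. This is an illustrative sketch only: the function names and array layout are assumptions, and the reference densities are taken as precomputed arrays on the grid rather than obtained from atomic SCF calculations as in deMon2k.

```python
import numpy as np

def hirshfeld_weights(ref_densities):
    """Hirshfeld weights, Eq. (8).

    ref_densities: (n_atoms, n_points) array of rho_A^ref(r_i).
    Returns an array of the same shape.
    """
    promolecule = ref_densities.sum(axis=0)
    # Guard against division by zero where all reference densities vanish.
    safe = np.where(promolecule > 0.0, promolecule, 1.0)
    return ref_densities / safe

def atomic_charges(rho, quad_w, atom_w, nuclear_charges):
    """Charges Q_A = Z_A - N_A, with N_A from the grid sum of Eq. (3)."""
    n_a = (atom_w * rho * quad_w).sum(axis=1)
    return np.asarray(nuclear_charges, dtype=float) - n_a
```

If the integrated density equals the promolecular density itself, each atom recovers exactly the electron count of its reference atom, which is why standard Hirshfeld charges built on neutral references tend to stay small.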

To alleviate these shortcomings of the standard scheme, iterative variants have been proposed [21]. In the IH scheme, one chooses $\rho_X^{ref}$ to be the density of an isolated atom X holding the same number of electrons $N_X$ as the atom in the molecule (hereafter denoted $ {\rho}_X^{ref,{N}_X} $). In other words, the $\omega_A^{IH}$ function, and hence the Hirshfeld cell of atom A, is adjusted iteratively so that the reference atom and the corresponding atom in the molecule have the same number of electrons. This procedure has been shown to minimize the loss of information when defining an atom in a molecule according to Shannon’s information theory [21]. In the original IH article, the authors proposed to define $ {\rho}_X^{ref,{N}_X} $ by interpolation between electronic densities of isolated ions whose electron numbers bracket $N_X$:

$$ {\rho}_X^{ref,{N}_X}={\rho}_X^{fint\left({N}_X\right)}\left[ cint\left({N}_X\right)-{N}_X\right]+{\rho}_X^{cint\left({N}_X\right)}\left[{N}_X- fint\left({N}_X\right)\right] $$

(9)

In this expression, $fint(N_X)$ (resp. $cint(N_X)$) is the largest (resp. smallest) integer less (resp. greater) than or equal to $N_X$. Alternatively, $ {\rho}_X^{ref,{N}_X} $ can be obtained by running an SCF calculation for an ion holding $N_X$ electrons. Note that $N_X$ is usually a non-integer number. Both variants have been tested in deMon2k and were shown to give very similar atomic charges. We finally kept only the second variant, based on atomic SCF calculations with non-integer electron numbers, because of its simple and straightforward definition.
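The interpolation of Eq. (9) and the self-consistency loop of the IH scheme can be sketched as follows. This is a hedged illustration, not the deMon2k code: `interpolated_reference`, `iterative_hirshfeld` and the callable `ref_density(atom_index, n_electrons)` are hypothetical names, the ionic densities are assumed to be tabulated on the grid, and the convergence test on the root-mean-square change of the populations is an assumption. Note that for an integer electron number both brackets in Eq. (9) vanish, so that case is handled explicitly.

```python
import math
import numpy as np

def interpolated_reference(n_x, ion_densities):
    """Reference density for n_x electrons by linear interpolation, Eq. (9).

    ion_densities: dict mapping an integer electron count to a density array.
    """
    lo, hi = math.floor(n_x), math.ceil(n_x)
    if lo == hi:  # integer population: use the tabulated ion directly
        return np.array(ion_densities[lo], dtype=float)
    return (np.asarray(ion_densities[lo]) * (hi - n_x)
            + np.asarray(ion_densities[hi]) * (n_x - lo))

def iterative_hirshfeld(rho, quad_w, n_start, ref_density, tol=1e-5, max_iter=500):
    """Iterate populations until reference atoms and atoms in the molecule match.

    ref_density(a, n) must return the density of atom a holding n electrons
    on the same grid as rho (hypothetical callable).
    """
    n = np.asarray(n_start, dtype=float)
    for _ in range(max_iter):
        ref = np.array([ref_density(a, n_a) for a, n_a in enumerate(n)])
        total = ref.sum(axis=0)
        weights = ref / np.where(total > 0.0, total, 1.0)
        n_new = (weights * rho * quad_w).sum(axis=1)
        converged = np.sqrt(np.mean((n_new - n) ** 2)) < tol
        n = n_new
        if converged:
            break
    return n
```

Either `interpolated_reference` or a routine wrapping atomic SCF calculations with non-integer electron numbers could play the role of `ref_density`; the loop itself does not depend on that choice.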

The IHFO scheme represents an extension of the IH scheme in which the alpha ($\rho^{\alpha}$) and beta ($\rho^{\beta}$) densities are integrated separately [23]. Accordingly, $N_A^{\sigma}$, $\omega_A^{\sigma}$ and $ {\rho}_A^{ref,{N}_A^{\sigma}} $ become spin-specific. The reference ionic densities are obtained as for the IH scheme, by running SCF calculations in which the numbers of both alpha and beta electrons are imposed. The reference atom and the corresponding atom in the molecule then have the same charges and spin charges. For closed-shell molecules, the IH and IHFO schemes are obviously identical, but they should produce different charges for open-shell systems. Another alternative for defining $\rho_A^{ref}$ in Eq. (8) is the IHDO-D scheme, in which the atomic dipoles are further imposed in the iterative procedure. We leave the introduction of the IHDO-D scheme in deMon2k for future work. Finally, we mention that $\rho_A^{ref}$ may also be built from the densities of reference molecular fragments, as shown for example in [17]. We have already described such an implementation in deMon2k [6].

Once atomic cells have been defined according to any of the partition schemes defined above, atomic charges are easily computed with Eqs. (1) or (2). Now, higher order moments can be defined based on the atomic cells. For example, the components of the "intrinsic" atom dipoles (μ) and quadrupoles (Θ) can be calculated by:

$$ {\mu}_{\alpha}^A={\displaystyle \sum_i}\left({r}_{i,\alpha}-{r}_{A,\alpha}\right)\rho \left({r}_i\right){\omega}_q\left({r}_i\right){\omega}_A\left({r}_i\right) $$

(10)

$$ {\Theta}_{\alpha \beta}^A={\displaystyle \sum_i}\left({r}_{i,\alpha}-{r}_{A,\alpha}\right)\left({r}_{i,\beta}-{r}_{A,\beta}\right)\rho \left({r}_i\right){\omega}_q\left({r}_i\right){\omega}_A\left({r}_i\right) $$

(11)

where $r_{A,\alpha}$ are the components of the position vector of nucleus A.
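Equations (3), (10) and (11) share the same weighted grid sum, which the following Python sketch makes explicit. The function name and argument layout are assumptions made for illustration; the moments are returned exactly as written in the equations, i.e., as Cartesian electronic moments without any sign convention for the electron charge and without removal of the trace.

```python
import numpy as np

def atomic_multipoles(points, rho, quad_w, atom_w, center):
    """Electron count, intrinsic dipole and quadrupole of one atomic cell.

    points: (n, 3) grid coordinates; rho, quad_w, atom_w: (n,) arrays;
    center: (3,) position of nucleus A.
    """
    d = points - center                  # r_i - r_A
    f = rho * quad_w * atom_w            # common weighted density factor
    n_a = f.sum()                        # Eq. (3)
    dipole = d.T @ f                     # Eq. (10)
    quadrupole = np.einsum("ia,ib,i->ab", d, d, f)  # Eq. (11)
    return n_a, dipole, quadrupole
```

Because only the factor `rho` changes between the BASIS and AUXIS analyses, the same routine can be called with either the Kohn-Sham or the fitted density evaluated on the grid.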

All the electronic densities that have been introduced above are obtained from the Kohn-Sham molecular orbitals (MOs). In deMon2k, these MOs are expanded within the LCGTO approximation (linear combination of Gaussian-type orbitals). The corresponding density is given as:

$$ \rho (r)={\displaystyle \sum_{\mu, \nu}}{P}_{\mu \nu}\mu (r)\nu (r) $$

(12)

where $P_{\mu\nu}$ is an element of the density matrix and μ, ν represent GTOs. Greek letters are used both as indices and as labels of the GTOs. In deMon2k, auxiliary densities (denoted by $ \overset{\sim }{\rho} $) are also introduced to reduce the scaling of the calculation of the Coulomb interaction. The auxiliary density, $ \overset{\sim }{\rho} $, is expanded as a linear combination of auxiliary functions $ \overline{k} $:

$$ \overset{\sim }{\rho}(r)={\displaystyle \sum_{\overline{k}}}{x}_{\overline{k}}\overline{k}(r) $$

(13)

where $ {x}_{\overline{k}} $ are the so-called Coulomb fitting coefficients. In deMon2k, the $ \overline{k} $ are primitive Hermite GTOs [38]. The coefficients $ {x}_{\overline{k}} $ are obtained from the variational fitting of the Coulomb potential as proposed by Dunlap [32, 33]. Because such auxiliary densities are at hand in DFT calculations with deMon2k, we may expect them to be valuable for performing population analyses in lieu of the Kohn-Sham density. The fitted density may be used to calculate the number of electrons of each atom by replacing ρ with $ \overset{\sim }{\rho} $ in Eq. (3). It can also be used to calculate the integration weights involved in the Hirshfeld schemes. In both cases, one should expect an important saving of computer time, since the Kohn-Sham density is expressed as a sum of products of atomic orbitals whereas the fitted density is a simple linear combination of auxiliary functions: the number of atomic orbital products greatly exceeds the number of auxiliary functions, which is typically 3 to 5 times the number of basis functions. Approximating the Hirshfeld weights using the fitted density is certainly a less drastic approximation than integrating this density itself instead of the Kohn-Sham density. In our implementation, the Hirshfeld weights are always calculated with $ \overset{\sim }{\rho} $, while the user is free to integrate either the Kohn-Sham, ρ, or fitted, $ \overset{\sim }{\rho} $, density. To conclude this section, we stress that for deformation density analyses, although the fitted reference densities, $ {\overset{\sim }{\rho}}_A^{ref} $, are used to calculate the integration weights, the Kohn-Sham reference densities, $\rho_A^{ref}$, are used to calculate $N_A^{ref}$.
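The difference in cost between Eqs. (12) and (13) can be seen from a direct transcription into Python. This is only a sketch: the function names are hypothetical, the basis and auxiliary functions are assumed to be already tabulated on the grid, and no screening of negligible products is attempted, whereas a production code would exploit it.

```python
import numpy as np

def ks_density(basis_values, density_matrix):
    """Kohn-Sham density on the grid, Eq. (12).

    basis_values: (n_points, n_basis) values of the GTOs at each grid point.
    density_matrix: (n_basis, n_basis) matrix P.
    """
    return np.einsum("im,mn,in->i", basis_values, density_matrix, basis_values)

def fitted_density(aux_values, fit_coeffs):
    """Fitted density on the grid, Eq. (13).

    aux_values: (n_points, n_aux) values of the auxiliary functions.
    fit_coeffs: (n_aux,) Coulomb fitting coefficients x.
    """
    return aux_values @ fit_coeffs
```

Without screening, the first routine performs a number of operations per grid point that grows quadratically with the number of basis functions, whereas the second grows linearly with the number of auxiliary functions.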

Accuracy of population analyses

In this section, we assess the accuracy of population analyses performed on the auxiliary function density within the Becke, Hirshfeld (standard and iterative variants), and Voronoi deformation density (VDD) schemes. To this end we consider two sets of molecules. The first one is an ensemble of 66 organic molecules relevant to biological structures (hereafter referred to as S66). It contains C, H, N, and O atoms. The S66 set has been reported recently by Řezáč et al. in the context of benchmarking computations of interaction energies by quantum chemistry methodologies [39, 40]. Although the present paper is not devoted to this topic, the S66 set still provides a valuable ensemble of organic molecules to test our population analysis implementation. The second set contains 40 halogenated organic molecules, also provided by Řezáč and Hobza [41]. In total 105 organic molecules are considered. These test sets encompass 987 H, 584 C, 70 N, 96 O, 2 S, and 74 halogen (X) atoms. We used the DZVP-GGA (double zeta with valence polarization functions, calibrated for generalized-gradient-approximation functionals) [42] basis set and the PBE exchange-correlation (XC) functional [43]. The XC energy and potential have been integrated numerically on an adaptive grid of medium accuracy [44]. The auxiliary density has been used to compute both the classical Coulomb and XC potentials, following the so-called auxiliary DFT (ADFT) framework [45]. Various auxiliary function sets have been considered. They are generated by an automatic procedure implemented in deMon2k that depends on the atomic orbital basis set. The GEN-An auxiliary function sets contain groups of auxiliary functions with s and spd angular momenta. The index n determines the number of such groups, i.e., their number increases with increasing n [42]. We have considered the GEN-A1, GEN-A2, and GEN-A3 auxiliary function sets, as well as the GEN-A2* set, which is supplemented by f and g auxiliary functions. Numerical integrations involved in the population analyses have been carried out with fixed grids of medium accuracy. For the iterative schemes, the iterations were pursued until the root-mean-square error was below 10⁻⁵.

We first consider atomic charges obtained by four population schemes. We report the mean unsigned error (MUE) and the maximum error (MAXERR) between atomic charges obtained by analyzing the Kohn-Sham and the auxiliary density in Figs. 2 and 3, respectively. For simplicity, we will refer to them as the KS (BASIS) and auxiliary (AUXIS) charges. The calculations are repeated for four sets of auxiliary functions. With the Hirshfeld schemes, the differences between the KS and auxiliary charges decrease when going from GEN-A1 to GEN-A2. Passing to GEN-A3 does not guarantee better agreement. For the Becke and VDD schemes, none of the GEN-A1, -A2 or -A3 auxiliary function sets reproduces the atomic charges obtained by integration of the KS density. Similar conclusions can be drawn for the maximum errors (Fig. 3). Note that the maximum errors with the GEN-A1 auxiliary function set can be quite large (0.3 e⁻). For all four population schemes investigated here, it is the addition of angular flexibility to the auxiliary function set (i.e., GEN-A2*) that enables a significant decrease of the MUE and of the maximum error. In conclusion, with the GEN-A2* auxiliary function set the fitted density analysis agrees with the KS density analysis to within about 0.01 e⁻. On the other hand, auxiliary function sets comprising only s and spd sets (GEN-An, n = 1, 2 or 3) should be used in population analyses of the fitted electronic densities only if qualitative results are sought.
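For concreteness, the two error measures used above can be computed as in the following Python sketch. The function name and the input values in the usage note are purely illustrative and are not taken from the benchmark data.

```python
import numpy as np

def charge_errors(q_basis, q_auxis):
    """Mean unsigned error and maximum error between two sets of atomic charges."""
    diff = np.abs(np.asarray(q_auxis, dtype=float) - np.asarray(q_basis, dtype=float))
    return diff.mean(), diff.max()
```

Calling `charge_errors([0.10, -0.20, 0.10], [0.12, -0.20, 0.07])` gives an MUE of about 0.0167 and a MAXERR of 0.03 for these made-up charges.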

We now turn to the analysis of the intrinsic dipole moments. In Fig. 4, we report the root-mean-square deviation (RMSD) between the norms of the intrinsic dipole moments obtained with the AUXIS and BASIS approaches. In Fig. 5, we report the angles between the dipoles obtained with the two approaches. The atomic dipoles obtained with GEN-A1 are clearly not reliable. In particular, the orientations of the dipoles obtained from the integration of ρ or of $ \overset{\sim }{\rho} $ can be extremely different (see Fig. 5). The situation is much improved with GEN-A2 or GEN-A3, in terms of both the norms and the orientations of the dipole moments. When using GEN-A2*, the comparison is, as for atomic charges, much more satisfactory. In most cases the RMSD between the dipoles obtained with both approaches is below 0.01 D, while the AUXIS dipoles deviate by less than 1° from the BASIS dipoles.
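The two dipole measures discussed here, the RMSD between norms and the angle between vectors, can be sketched in Python as follows. The function name is hypothetical, and the sketch assumes that no atomic dipole has zero norm, since the angle would then be undefined.

```python
import numpy as np

def dipole_deviations(mu_basis, mu_auxis):
    """RMSD between dipole norms and per-atom angles (degrees) between dipoles.

    mu_basis, mu_auxis: (n_atoms, 3) arrays of intrinsic atomic dipoles.
    """
    mu_basis = np.asarray(mu_basis, dtype=float)
    mu_auxis = np.asarray(mu_auxis, dtype=float)
    norm_b = np.linalg.norm(mu_basis, axis=1)
    norm_a = np.linalg.norm(mu_auxis, axis=1)
    rmsd = np.sqrt(np.mean((norm_a - norm_b) ** 2))
    cos_angle = np.sum(mu_basis * mu_auxis, axis=1) / (norm_b * norm_a)
    # Clip to [-1, 1] to protect arccos against rounding errors.
    angles = np.degrees(np.arccos(np.clip(cos_angle, -1.0, 1.0)))
    return rmsd, angles
```

Reporting the norm deviation and the angle separately, as done in Figs. 4 and 5, distinguishes errors in the magnitude of the dipoles from errors in their orientation.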

Computational performances

In this section, we report the efficiency of our iterative Hirshfeld population analysis implementation, which is the most time-consuming partition scheme discussed here. To this end, we optimized the insulin molecule at the PBE/DZVP/GEN-A2 level of theory employing ADFT. This molecule contains 784 atoms (H, C, N, O, and S) and a total of 3078 electrons. The optimized geometry is depicted in Fig. 6.

For the optimized insulin structure, we performed IH analyses of the KS (BASIS) and fitted (AUXIS) densities employing a varying number of compute cores. The same level of theory as for the structure optimization was used, i.e., PBE/DZVP/GEN-A2. The resulting timings are depicted in Fig. 7 as a function of the number of cores. All calculations were performed on Intel® Xeon™ E5-2650v2 (2.6 GHz) 8-core CPUs with 4 GB RAM per core. To guide the eye, the individual data points in Fig. 7 are connected. As expected, the analysis of the fitted density is always significantly faster (by a factor of around 2) than that of the KS density. The scaling with respect to the number of cores is rather satisfactory. As Fig. 7 shows, computational savings are still gained when passing from 96 to 128 cores.

To put the timings in Fig. 7 into perspective, we note that the structure optimization of insulin took around 1200 optimization steps. This optimization required 3 weeks on 32 of the above-specified Xeon™ cores. Thus, the timings reported here for the iterative Hirshfeld analysis represent a small overhead relative to the structure optimization. Because they scale similarly to the SCF and geometry optimization, this relation also holds for larger numbers of cores, e.g., the 128 cores shown above. In fact, in the case of the insulin molecule, the CPU time needed for the iterative Hirshfeld analysis is only 2 to 3 times larger than that of a single-point ADFT energy calculation. Therefore, the population analysis implementation discussed here can be used for larger systems that are of interest to biological chemistry or materials science.

Electrostatic interaction calculations with AMOEBA

In this section we wish to determine whether the multipole distributions obtained from the population schemes described above are suitable for calculating electrostatic interaction energies within supramolecular complexes. Previous work showed that iterative approaches give atomic charges that reproduce the DFT electrostatic potential better than non-iterative approaches [46]. We use the Na⁺(Tryp)(H₂O) complex to assess the ability of the various population schemes to produce multipole sets for computing non-covalent interactions with the AMOEBA polarizable force field. The tryptamine molecule (Tryp), classified as a neurotransmitter, is derived from the tryptophan amino acid by replacing the carboxylic acid group with a hydrogen atom. Four conformers have been selected from the previous work of Nicely and Lisy (see Fig. 1 in ref. [47] and Fig. 8). In structures A and B, the water molecule interacts with both the sodium cation and the amino group, making a hydrogen bond with the amino nitrogen. They differ mainly in the orientation of the ethylamine side chain. In structure C, the sodium cation is “sandwiched” between the water and the amino group, whereas in structure D the water molecule accepts a hydrogen bond from the indole N-H.

This complex provides interesting structural situations with various electrostatic interactions, such as charge-dipole and dipole-dipole interactions and polarization effects. The AMOEBA force field was chosen for its high-level treatment of electrostatic interactions, which uses a multipolar expansion up to quadrupoles on each atom and an explicit iterative polarization term. The multipoles have been computed with both the IH and H schemes for the isolated tryptamine and then defined in their local atomic frames using the Orient program [48] for use within AMOEBA. The relative energies, taking structure A as reference and computed at the M06-2X/TZVP [49] level, are compared with those obtained with the different atomic multipole sets, and the energetic errors are reported in Fig. 9. When multipoles are extracted from the KS density (BASIS), the error is found to be small, in the range of the error of quantum chemistry calculations ("IH GEN-A2*/A3* (BASIS)" histograms). If the multipoles are extracted from the auxiliary density (AUXIS), the error is still very small as long as GEN-A2* is used ("IH GEN-A2* (AUXIS)" histograms). The errors associated with multipoles obtained from the auxiliary density may become large when using the GEN-A2 function set ("IH GEN-A2 (AUXIS)" histograms). Finally, we find that the standard Hirshfeld scheme reproduces such relative energies less accurately than the iterative version in both the BASIS and AUXIS approaches.

Because different interactions are involved in the different structures, the errors in the relative energies should originate from specific energetic contributions. The graphs in Fig. 10 represent the values of the main components of the electrostatic energy and the total electrostatic energy for structures B, C, and D as a function of the error on the total energy relative to structure A. For the three structures, the largest error comes from the underestimation of the Na⁺-N(amino) interaction, whereas the energy of the Na⁺-O interaction remains relatively constant. The water-N(amino) interaction in structure B and the water-N(indole) interaction in structure D also play a role, although to a lesser extent. Furthermore, this effect can easily be correlated with the charge and the x-component of the dipole moment of the nitrogen atom of the amino group when they are reported as a function of the energetic error for C. The large error for C in the IH scheme using the GEN-A2 auxiliary function set can be explained by a cumulative error on the charge and dipole moment on N and by the small value of the polarization energy (Fig. 10).

The polarization energies of the structures can likewise be compared for the different multipole sets (Fig. 11). Even if the effect is smaller than for the electrostatic component, this term contributes to the non-covalent interactions. Let us focus on the IH scheme first. When the GEN-A2 auxiliary function set is used, the AUXIS and BASIS approaches give polarization energies that may differ by several kJ mol⁻¹. This is especially noticeable for structure C. On the other hand, we obtain very similar polarization energies for each structure when the GEN-A2* auxiliary function set is used (compare the "IH GEN-A2* (BASIS)" vs. "IH GEN-A2* (AUXIS)" data points). With the standard Hirshfeld scheme, we find that polarization energies are significantly larger. For example, for structure C with the BASIS approach (using GEN-A2*), the polarization energy goes from 65 kJ mol⁻¹ to 85 kJ mol⁻¹.

Conclusions

In this work we investigated the accuracy of population analyses based on fitted densities in the context of DFT. The main conclusions of our study can be summarized as follows. Fitted densities can indeed be used instead of the Kohn-Sham density to extract electrostatic multipoles from DFT calculations. However, the quality of the auxiliary function set used to expand the fitted density has a great impact on the results and should thus be considered with care in applications. With standard sets comprising s and spd angular momentum functions, only qualitative agreement between the BASIS and AUXIS atomic charges is found. Conversely, when using GEN-A2* (or GEN-A3*), the agreement between both approaches is excellent for both atomic charges and higher order multipoles. As seen in the case of insulin, the AUXIS approach offers a significant reduction of computational cost compared to the BASIS one. Finally, we tested whether the multipoles extracted for the tryptamine system provide sufficiently accurate interaction energies with the AMOEBA force field. In that regard, the iterative Hirshfeld scheme represents a clear improvement over the traditional Hirshfeld scheme. Good results have been obtained with the IH scheme and the GEN-A2* or GEN-A3* auxiliary function sets. Overall, these results encourage us to pursue our ongoing efforts on the implementation of advanced QM/MM schemes that include second and third generation force fields in deMon2k [50, 51].

References

Misquitta AJ, Stone AJ, Fazeli F (2014) Distributed multipoles from a robust basis-space implementation of the iterated stockholder atoms procedure. J Chem Theory Comput 10(12):5405–5418. doi:10.1021/ct5008444
Mei Y, Simmonett AC, Pickard FC, DiStasio RA, Brooks BR, Shao Y (2015) Numerical study on the partitioning of the molecular polarizability into fluctuating charge and induced atomic dipole contributions. J Phys Chem A 119(22):5865–5882. doi:10.1021/acs.jpca.5b03159
Verstraelen T, Vandenbrande S, Heidar-Zadeh F, Vanduyfhuys L, Van Speybroeck V, Waroquier M, Ayers PW (2016) Minimal basis iterative stockholder: atoms in molecules for force-field development. J Chem Theory Comput 12(8):3894–3912. doi:10.1021/acs.jctc.6b00456
Kaduk B, Kowalczyk T, Van Voorhis T (2012) Constrained density functional theory. Chem Rev 112(1):321–370. doi:10.1021/cr200148b
Van Voorhis T, Kowalczyk T, Kaduk B, Wang L-P, Cheng C-L, Wu Q (2010) The diabatic picture of electron transfer, reaction barriers, and molecular dynamics. Annu Rev Phys Chem 61(1):149–170. doi:10.1146/annurev.physchem.012809.103324
Řezáč J, de la Lande A (2015) Robust, basis-set independent method for the evaluation of charge-transfer energy in noncovalent complexes. J Chem Theory Comput 11(2):528–537. doi:10.1021/ct501115m
Sokalski WA, Poirier RA (1983) Cumulative atomic multipole representation of the molecular charge distribution and its basis set dependence. Chem Phys Lett 98(1):86–92. doi:10.1016/0009-2614(83)80208-5
Sokalski WA, Sawaryn A (1987) Correlated molecular and cumulative atomic multipole moments. J Chem Phys 87(1):526–534. doi:10.1063/1.453600
Köster AM, Kölle C, Jug K (1993) Approximation of molecular electrostatic potentials. J Chem Phys 99(2):1224–1229. doi:10.1063/1.465366
Mulliken RS (1955) Electronic population analysis on LCAO–MO molecular wave functions. I. J Chem Phys 23(10):1833–1840. doi:10.1063/1.1740588
Carbó-Dorca R, Bultinck P (2004) Quantum mechanical basis for Mulliken population analysis. J Math Chem 36(3):231–239. doi:10.1023/b:jomc.0000044221.23647.20
Löwdin P-O (1970) On the nonorthogonality problem. In: Löwdin P-O (ed) Advances in quantum chemistry, vol 5. Academic, Cambridge, pp 185–199. doi:10.1016/S0065-3276(08)60339-1
Reed AE, Weinstock RB, Weinhold F (1985) Natural population analysis. J Chem Phys 83(2):735–746. doi:10.1063/1.449486
Reed AE, Curtiss LA, Weinhold F (1988) Intermolecular interactions from a natural bond orbital, donor-acceptor viewpoint. Chem Rev 88(6):899–926. doi:10.1021/cr00088a005
Becke AD (1988) A multicenter numerical integration scheme for polyatomic molecules. J Chem Phys 88(4):2547–2553. doi:10.1063/1.454033
Hirshfeld FL (1977) Bonded-atom fragments for describing molecular charge densities. Theoret Chim Acta 44(2):129–138. doi:10.1007/bf00549096
Fonseca Guerra C, Handgraaf J-W, Baerends EJ, Bickelhaupt FM (2004) Voronoi deformation density (VDD) charges: Assessment of the Mulliken, Bader, Hirshfeld, Weinhold, and VDD methods for charge analysis. J Comput Chem 25(2):189–210. doi:10.1002/jcc.10351
Heyndrickx W, Salvador P, Bultinck P, Solà M, Matito E (2011) Performance of 3D-space-based atoms-in-molecules methods for electronic delocalization aromaticity indices. J Comput Chem 32(3):386–395. doi:10.1002/jcc.21621
Matito E, Solà M, Salvador P, Duran M (2007) Electron sharing indexes at the correlated level. Application to aromaticity calculations. Faraday Discuss 135:325–345. doi:10.1039/b605086g
Salvador P, Ramos-Cordoba E (2013) Communication: An approximation to Bader’s topological atom. J Chem Phys 139(7):071103. doi:10.1063/1.4818751
Bultinck P, Van Alsenoy C, Ayers PW, Carbó-Dorca R (2007) Critical analysis and extension of the Hirshfeld atoms in molecules. J Chem Phys 126(14):144111. doi:10.1063/1.2715563
Ghillemijn D, Bultinck P, Van Neck D, Ayers PW (2011) A self-consistent Hirshfeld method for the atom in the molecule based on minimization of information loss. J Comput Chem 32(8):1561–1567. doi:10.1002/jcc.21734
Geldof D, Krishtal A, Blockhuys F, Van Alsenoy C (2011) An extension of the Hirshfeld method to open shell systems using fractional occupations. J Chem Theory Comput 7(5):1328–1335. doi:10.1021/ct100743h
Lillestolen TC, Wheatley RJ (2009) Atomic charge densities generated using an iterative stockholder procedure. J Chem Phys 131(14):144101. doi:10.1063/1.3243863
Piquemal JP, Pilmé J, Parisel O, Gérard H, Fourré I, Bergès J, Gourlaouen C, De La Lande A, Van Severen MC, Silvi B (2008) What can be learnt on biologically relevant systems from the topological analysis of the electron localization function? Int J Quantum Chem 108(11):1951–1969. doi:10.1002/qua.21711
Bader RFW, Nguyen-Dang TT, Tal Y (1981) A topological theory of molecular structure. Rep Prog Phys 44(8):893
Singh UC, Kollman PA (1984) An approach to computing electrostatic charges for molecules. J Comput Chem 5(2):129–145. doi:10.1002/jcc.540050204
Besler BH, Merz KM, Kollman PA (1990) Atomic charges derived from semiempirical methods. J Comput Chem 11(4):431–439. doi:10.1002/jcc.540110404
Chirlian LE, Francl MM (1987) Atomic charges derived from electrostatic potentials: A detailed study. J Comput Chem 8(6):894–905. doi:10.1002/jcc.540080616
Wang J, Wolf RM, Caldwell JW, Kollman PA, Case DA (2004) Development and testing of a general amber force field. J Comput Chem 25(9):1157–1174. doi:10.1002/jcc.20035
Vanommeslaeghe K, Hatcher E, Acharya C, Kundu S, Zhong S, Shim J, Darian E, Guvench O, Lopes P, Vorobyov I, Mackerell AD (2010) CHARMM general force field: a force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fields. J Comput Chem 31(4):671–690. doi:10.1002/jcc.21367
Mintmire JW, Dunlap BI (1982) Fitting the Coulomb potential variationally in linear-combination-of-atomic-orbitals density-functional calculations. Phys Rev A 25(1):88–95
Dunlap BI, Rösch N, Trickey SB (2010) Variational fitting methods for electronic structure calculations. Mol Phys 108(21-23):3167–3180. doi:10.1080/00268976.2010.518982
Aquilante F, Pedersen TB, Lindh R (2007) Low-cost evaluation of the exchange Fock matrix from Cholesky and density fitting representations of the electron repulsion integrals. J Chem Phys 126(19):194106. doi:10.1063/1.2736701
Ren P, Ponder JW (2003) Polarizable atomic multipole water model for molecular mechanics simulation. J Phys Chem B 107(24):5933–5947. doi:10.1021/jp027815+
Ponder JW, Wu C, Ren P, Pande VS, Chodera JD, Schnieders MJ, Haque I, Mobley DL, Lambrecht DS, DiStasio RA, Head-Gordon M, Clark GNI, Johnson ME, Head-Gordon T (2010) Current status of the AMOEBA polarizable force field. J Phys Chem B 114(8):2549–2564. doi:10.1021/jp910674d
Köster AM, Geudtner G, Alvarez-Ibarra A, Calaminici P, Casida ME, Carmona-Espindola J, Dominguez V, Flores-Moreno R, Gamboa GU, Goursot A, Heine T, Ipatov A, de la Lande A, Janetzko F, del Campo J-M, Mejia-Rodriguez D, Reveles J, Vasquez-Perez J, Vela A, Zuniga-Gutierrez B, Salahub DR (2016) deMon2k Version 5. Mexico City
Köster AM (2003) Hermite Gaussian auxiliary functions for the variational fitting of the Coulomb potential in density functional methods. J Chem Phys 118(22):9943–9951. doi:10.1063/1.1571519
Řezáč J, Riley KE, Hobza P (2011) S66: a well-balanced database of benchmark interaction energies relevant to biomolecular structures. J Chem Theory Comput 7(8):2427–2438. doi:10.1021/ct2002946
Řezáč J, Riley KE, Hobza P (2011) Extensions of the S66 data set: more accurate interaction energies and angular-displaced nonequilibrium geometries. J Chem Theory Comput 7(11):3466–3470. doi:10.1021/ct200523a
Řezáč J, Riley KE, Hobza P (2012) Benchmark calculations of noncovalent interactions of halogenated molecules. J Chem Theory Comput 8(11):4285–4292. doi:10.1021/ct300647k
Calaminici P, Janetzko F, Köster AM, Mejia-Olvera R, Zuniga-Gutierrez B (2007) Density functional theory optimized basis sets for gradient corrected functionals: 3d transition metal systems. J Chem Phys 126(4):044108. doi:10.1063/1.2431643
Perdew JP, Burke K, Ernzerhof M (1996) Generalized gradient approximation made simple. Phys Rev Lett 77(18):3865–3868
Köster AM, Flores-Moreno R, Reveles JU (2004) Efficient and reliable numerical integration of exchange-correlation energies and potentials. J Chem Phys 121(2):681–690. doi:10.1063/1.1759323
Köster AM, Reveles JU, del Campo JM (2004) Calculation of exchange-correlation potentials with auxiliary function densities. J Chem Phys 121(8):3417–3424. doi:10.1063/1.1771638
Van Damme S, Bultinck P, Fias S (2009) Electrostatic potentials from self-consistent Hirshfeld atomic charges. J Chem Theory Comput 5(2):334–340. doi:10.1021/ct800394q
Nicely AL, Lisy JM (2011) Charge and temperature effects on hydrated tryptamine cluster ions. J Phys Chem A 115(13):2669–2678. doi:10.1021/jp1059648
Stone AJ, Dullweber A, Engkvist O, Fraschini E, Hodges MP, Meredith AW, Nutt DR, Popelier PLA, Wales DJ (2002) Orient: a program for studying interactions between molecules. version 4.5 edn. University of Cambridge
Mejía-Rodríguez D, Huang X, del Campo JM, Köster AM (2015) Hybrid functionals with variationally fitted exact exchange. Adv Quantum Chem 71:41–67
Alvarez-Ibarra A, Köster AM, Zhang R, Salahub DR (2012) Asymptotic expansion for electrostatic embedding integrals in QM/MM calculations. J Chem Theory Comput 8(11):4232–4238. doi:10.1021/ct300609z
Salahub D, Noskov S, Lev B, Zhang R, Ngo V, Goursot A, Calaminici P, Köster A, Alvarez-Ibarra A, Mejía-Rodríguez D, Řezáč J, Cailliez F, de la Lande A (2015) QM/MM calculations with deMon2k. Molecules 20(3):4780

Acknowledgement

AMK gratefully acknowledges support from CONACYT through the grant CB-179409.

Author information

Authors and Affiliations

Laboratoire de Chimie Physique, UMR 8000 CNRS/Univ. Paris-Sud, Univ. Paris-Saclay, 91405, Orsay, France
Aurélien de la Lande & Carine Clavaguéra
Departamento de Química, CINVESTAV, Avenida Instituto Politécnico Nacional 2508, México, D.F., Mexico
Andreas Köster

Authors

Aurélien de la Lande
Carine Clavaguéra
Andreas Köster

Corresponding authors

Correspondence to Aurélien de la Lande or Andreas Köster.

Additional information

^#Dedicated to Henry Chermette for his contributions to Density Functional Theory

This paper belongs to Topical Collection Festschrift in Honor of Henry Chermette

About this article

Cite this article

de la Lande, A., Clavaguéra, C. & Köster, A. On the accuracy of population analyses based on fitted densities^#. J Mol Model 23, 99 (2017). https://doi.org/10.1007/s00894-017-3264-5

Received: 18 September 2016
Accepted: 30 January 2017
Published: 02 March 2017
DOI: https://doi.org/10.1007/s00894-017-3264-5
