A new method to improve validity range of Lie canonical perturbation theory: with a central focus on a concept of non-blow-up region

Teramoto, Hiroshi; Toda, Mikito; Komatsuzaki, Tamiki

doi:10.1007/s00214-014-1571-9


Regular Article
Published: 13 September 2014

Volume 133, article number 1571, (2014)

Theoretical Chemistry Accounts


Hiroshi Teramoto¹,
Mikito Toda² &
Tamiki Komatsuzaki¹


Abstract

Validity ranges of Lie canonical perturbation theory (LCPT) are investigated in terms of non-blow-up regions. We investigate how the validity ranges depend on the perturbation order in two systems, one of which is a simple Hamiltonian system with one degree of freedom and the other is a HCN molecule. Our analysis of the former system indicates that non-blow-up regions become reduced in size as the perturbation order increases. In case of LCPT by Dragt and Finn and that by Deprit, the non-blow-up regions enclose the region inside the separatrix of the Hamiltonian, but it may not be the case for LCPT by Hori. We also analyze how well the actions constructed by these LCPTs approximate the true action of the Hamiltonian in the non-blow-up regions and find that the conventional truncated LCPT does not work over the whole region inside the separatrix, whereas LCPT by Dragt and Finn without truncation does. Our analysis of the latter system indicates that non-blow-up regions do not necessarily cover the whole regions inside the HCN well. We propose a new perturbation method to improve non-blow-up regions and validity ranges inside them. Our method is free from blowing up and retains the same normal form as the conventional LCPT. We demonstrate our method in the two systems and show that the actions constructed by our method have larger validity ranges than those by the conventional and our previous methods proposed in Teramoto and Komatsuzaki (J Chem Phys 129:094302, 2008; Phys Rev E 78:017202, 2008).


1 Introduction

Canonical transformations are coordinate transformations in the phase space of Hamiltonian systems that preserve the symplectic two-form, i.e., that preserve the form of Hamilton's equations of motion. Canonical perturbation theory (CPT) is one of the fundamental approaches to solving nonlinear dynamical problems; it proceeds by perturbation from an integrable system through a canonical transformation. CPT has often been applied to seek integrals of motion, adiabatic invariants and a better and simpler description of the system [3]. The traditional canonical transformation is defined by a mixed-variable generating function of the old and new canonical variables. The most traditional Poincaré-von Zeipel CPT [3, 4], which is based on the mixed-variable generating function approach, however, imposes a major impediment to implementing higher order perturbations. Among CPTs, Lie canonical perturbation theory (LCPT), originally developed by Hori [5, 6] and Deprit [7] and later by Dragt and Finn [8], is very powerful in that the canonical transformation is carried out by a series of Poisson bracket operations. This avoids the cumbersome generating function of mixed variables and makes the complete inversion between the old and the new canonical variables rather straightforward. The mutual relation of these formulations and their computational efficiency have also been investigated [4, 9–14]. These different formats result in the same normal form Hamiltonian but can result in different normal form transformations. The convergence or divergence of the normal form and of the normal form transformation at infinite order has been investigated in previous studies [15–27] under various conditions. Under some of these conditions, the normal form and the normal form transformation converge globally, but, since generic Hamiltonians are non-integrable [28], in most cases there is no hope of finding (nontrivial) global integrals of motion without any symmetries. 
Under such circumstances, the best one can do is to look for a better and simpler local description of the system in question. LCPT has been applied to seek such local descriptions in a perturbative manner from integrable solutions and has been shown to be versatile for various types of Hamiltonians in research fields such as celestial mechanics [29, 30], atomic physics [31, 32] and cluster physics [33–39]. For example, in the context of chemical reaction dynamics, LCPT has been applied to seek a (locally) no-return transition state and the associated reaction coordinate buried in the phase space of Hamiltonian systems with many degrees of freedom, such as intramolecular proton transfer in malonaldehyde [40, 41], argon cluster isomerization [33–39], ${\hbox {O}}({}^1D)+{\hbox {N}}_2{\hbox {O}} \rightarrow {\hbox {NO}}+{\hbox {NO}}$ [42], a hydrogen atom in crossed electric and magnetic fields [32, 43], HCN isomerization [1, 2, 44–46] and so forth. LCPT was generalized to dissipative systems such as the multidimensional (generalized) Langevin formulation to describe reactions under thermal fluctuation, in which a no-return transition state can be obtained by incorporating the nonlinearity of the system and its interactions with the heat bath [47–54]. Pioneering studies on a semiclassical analog of LCPT were also carried out in the late 1980s for multidimensional resonant, nonresonant and nearly resonant systems [55–57]: they presented a method for deriving corrections in powers of Planck’s constant that reflect the underlying (near) divergence properties of classical chaos, which was found to be effective even at low-order corrections in improving the accuracy of the energy eigenvalues. 
Recently, their semiclassical studies were extended to the analyses of reaction dynamics over a rank-one saddle under a time-dependent external field (optimally controlled laser pulse), and it was found that optimally controlled laser pulse corresponds to modulating the boundary of the reaction in the phase space so as to catch the system excited in the reactant well and then to release it into the product [58]. This method provides a new protocol to design the laser field facilitated by the classical phase space picture [59].

However, in most cases, the convergence radii of these LCPTs are limited even for finite order of perturbations [60], and the convergence radii shrink to zero as the perturbation order increases. In a context of chemical reaction dynamics, molecules exhibit larger amplitude motions as their energies increase. For these molecules to surmount the reaction barriers, they must have large enough energies. Therefore, to describe and understand the chemical reaction dynamics, it is vital to develop a perturbation method that is valid not only in the very vicinity of their equilibrium structures but also in regions far from them. In a broader context, if we succeed in obtaining a better estimation for approximate invariants of motion, we would be able to analyze dynamics not only for near-integrable systems but also for systems with mixed phase space, i.e., those systems which exhibit both chaotic and regular behavior.

In the study of systems with mixed phase space, one of the crucial problems is to find boundaries between chaotic and regular behavior. For systems of more than two degrees of freedom, it is well known that the KAM tori do not divide the equi-energy surface into two separate regions. In fact, Arnold showed, for a specific model Hamiltonian, that trajectories detour around KAM tori, thereby leading to the motion along the resonances [61]. Such motions are now called the Arnold diffusion [62–64]. Moreover, it is known that the resonances constitute a network called the Arnold web [3, 31, 65–70], where the motion across the resonances gives rise to faster diffusion especially around resonance junctions [71, 72]. Thus, in the analysis of the dynamics on the Arnold web, it is crucial to find those regions which trap trajectories for a finite but longer duration [73], since distribution of resonances plays a key role for statistical features of the reaction dynamics [74, 75]. Then, better approximate invariants would offer a clue to find how chaotic and regular regions are distributed in the phase space.

Another important issue in systems of mixed phase space is the transport in the phase space [76–78], i.e., to understand how different regions of the phase space are dynamically connected. In chemistry, reaction processes are nothing but the transport from a potential well to another one via a saddle region. Thus, we face the problem of what kind of phase space structures connect dynamics in a well to that in another one [79]. In such studies, we need to construct better action variables, if any, in different regions of the phase space so that validity ranges of different sets of variables overlap with each other. Then, we could investigate the connection based on the transformation between different sets of the approximate invariants corresponding to different regions of the phase space.

Teramoto et al. [1, 2] proposed a method that makes LCPT valid in wider regions than those in the previous method and demonstrated it in a highly excited HCN molecule. The crux is to calculate the canonical transformation at each order of LCPT without any truncation errors. However, the validity ranges of their method are also limited by non-blow-up regions. The validity range of an LCPT is a subset of phase space where the resulting normal form is valid. For example, if the purpose of the normal form is to construct slowly varying actions, center manifolds or stable and unstable manifolds, the validity range of the LCPT is the region where the resulting normal form describes these objects within the accuracy needed to describe the system. The non-blow-up region of an LCPT is the subset of initial conditions in the phase space for which the results of the perturbation are finite. The non-blow-up region limits the validity range of the LCPT because the results must at least be finite to be valid. To improve their method further, it is important to understand these concepts. Section 2 is devoted to an illustration of these concepts in a simple one-degree-of-freedom Hamiltonian system and to their elucidation along with numerical demonstrations. In Sect. 3, we propose a new perturbation method to avoid blowing up while retaining the normal forms and demonstrate it in Sect. 4. Section 5 is devoted to conclusions and discussions.

2 Non-blow-up regions of Lie canonical perturbation theory, LCPT

2.1 An illustration of a validity range of LCPT for a one-dimensional Hamiltonian system

To illustrate non-blow-up regions of LCPT, let us investigate a simple Hamiltonian system. Let $(q, p)$ be a coordinate and its conjugate momentum with a Hamiltonian represented by

$$\begin{aligned} H (q, p) = \frac{1}{2} \left( p^2 + q^2 \right) + \left( 2 p^2 q - q^3 \right) . \end{aligned}$$

(1)

LCPT seeks a canonical transformation $(q, p) \mapsto (Q, P)$ so that the Hamiltonian [Eq. (1)] in terms of the new coordinates $(Q, P)$ becomes simple in a certain sense. There exist several conventions for the simplicity [12] and normal forms that attain them.^{Footnote 1} In this specific example, the leading order of the normal form is

$$\begin{aligned} \bar{H} \left( Q, P \right) = \frac{1}{2} \left( P^2 + Q^2 \right) + O \left( 4 \right) , \end{aligned}$$

(2)

[$O(4)$ means a collection of terms of order quartic and those of higher than quartic with respect to $P$ and $Q$.] such that it has the same quadratic terms but does not have terms of order cubic. To obtain the normal form, LCPT seeks for a canonical transformation generated by a generating function $F(q, p)$, i.e.,

$$\begin{aligned} Q (q, p)&= e^{ - \left\{ F (q, p), \cdot \right\} } q , \end{aligned}$$

(3)

$$\begin{aligned} P (q, p)&= e^{ - \left\{ F (q, p), \cdot \right\} } p , \end{aligned}$$

(4)

where $\{\cdot , \cdot \}$ is Poisson bracket defined as

$$\begin{aligned} \left\{ A (q, p), B (q, p) \right\} = \frac{\partial A (q, p)}{\partial q} \frac{\partial B \left( q, p \right) }{\partial p} - \frac{\partial A (q, p)}{\partial p} \frac{\partial B \left( q, p \right) }{\partial q}. \end{aligned}$$

(5)

A benefit of the Lie canonical transformation is that the inverse transformation $(Q, P) \mapsto (q, p)$ can be easily written as

$$\begin{aligned} q \left( Q, P \right)&= e^{\left\{ F \left( Q, P \right) , \cdot \right\} } Q, \end{aligned}$$

(6)

$$\begin{aligned} p \left( Q, P \right)&= e^{\left\{ F \left( Q, P \right) , \cdot \right\} } P. \end{aligned}$$

(7)

This can be evaluated using the following relation,

$$\begin{aligned} \left\{ A{^\prime} \left( Q, P \right) , B{^\prime} \left( Q, P \right) \right\}&= \frac{\partial A{^\prime} \left( Q (q, p), P (q, p) \right) }{\partial q} \frac{\partial B{^\prime} \left( Q (q, p), P (q, p) \right) }{\partial p} \nonumber \\&\quad- \frac{\partial A{^\prime} \left( Q (q, p), P (q, p) \right) }{\partial p} \frac{\partial B{^\prime} \left( Q (q, p), P (q, p) \right) }{\partial q},\nonumber \\&= \frac{\partial A{^\prime} \left( Q, P \right) }{\partial Q} \frac{\partial B{^\prime} \left( Q, P \right) }{\partial P} - \frac{\partial A{^\prime} \left( Q, P \right) }{\partial P} \frac{\partial B{^\prime} \left( Q, P \right) }{\partial Q}, \end{aligned}$$

(8)

which holds for arbitrary differentiable functions $A{^\prime} (Q, P)$ and $B{^\prime}(Q, P)$ if the transformation $(q, p) \mapsto (Q, P)$ is a canonical transformation [80]. Using Eq. (8), the leading order expression of Eqs. (6) and (7) can be written as

$$\begin{aligned} q \left( Q, P \right)&= Q - \frac{\partial F \left( Q, P \right) }{\partial P} + \cdots , \end{aligned}$$

(9)

$$\begin{aligned} p \left( Q, P \right)&= P + \frac{\partial F \left( Q, P \right) }{\partial Q} + \cdots . \end{aligned}$$

(10)

By plugging the leading order expression in Eq. (1), we get

$$\begin{aligned} \bar{H} \left( Q, P \right)&= H \left( q, p \right) \nonumber \\&= \frac{1}{2} \left( Q^2+P^2 \right) + \left( 2 P^2 Q - Q^3 \right) - Q \frac{\partial F \left( Q, P \right) }{\partial P} + P \frac{\partial F \left( Q, P \right) }{\partial Q} + \cdots . \end{aligned}$$

(11)

To eliminate the cubic term of Eq. (11), the generating function $F(Q, P)$ should satisfy

$$\begin{aligned} \left( 2 P^2 Q - Q^3 \right) - Q \frac{\partial F \left( Q, P \right) }{\partial P} + P \frac{\partial F \left( Q, P \right) }{\partial Q} = 0. \end{aligned}$$

(12)

Equation (12) can have multiple solutions but, in this specific case, the conventional semi-simple normal form requires

1. $F (Q, P)$ is of order cubic.
2. $F (Q, P) \in \text {Im} (- Q \frac{\partial }{\partial P} + P \frac{\partial }{\partial Q}) = \text {Im} \{ H_2 (Q, P), \cdot \},$ where $H_2 (Q, P) = \frac{1}{2} (P^2+Q^2)$ is the quadratic term of Eq. (2) and $\text {Im} \, A$ is the image of the operator $A$, i.e., $\text {Im} \, A = \{ f | \exists g, \, f = A g \}$.

Under these requirements, Eq. (12) has a unique solution, $F(Q, P) = - P Q^2$. By plugging this into Eq. (11), the $\cdots$ terms in Eq. (11) become of quartic or higher order, and thus, the canonical transformation generated by the generating function $F(Q, P)$ is indeed the one we sought. In this case, we can calculate the canonical transformation [Eqs. (3) and (4)] exactly and we get

$$\begin{aligned} Q (q, p)&= \frac{q}{1+q}, \end{aligned}$$

(13)

$$\begin{aligned} P (q, p)&= p \left( q+1 \right) ^2, \end{aligned}$$

(14)

using the fact that the canonical transformations can be calculated by integrating the following differential equations up to $\epsilon = 1$,

$$\begin{aligned} \frac{dq (\epsilon )}{d \epsilon }&= \frac{\partial F \left( q (\epsilon ), p (\epsilon ) \right) }{\partial p}, \end{aligned}$$

(15)

$$\begin{aligned} \frac{dp (\epsilon )}{d \epsilon }&= -\frac{\partial F \left( q (\epsilon ), p (\epsilon ) \right) }{\partial q}, \end{aligned}$$

(16)

starting from the initial condition $(q (0), p(0)) = (q, p)$ at $\epsilon = 0$. Then, $Q (q, p)$ and $P (q, p)$ can be obtained as $(Q (q, p), P (q, p)) = (q (1), p (1))$.
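The two claims above, that $F = -PQ^2$ solves Eq. (12) and that integrating Eqs. (15) and (16) up to $\epsilon = 1$ reproduces Eqs. (13) and (14), can be checked with a short script. The following is a minimal sketch using only the Python standard library; the dictionary representation of polynomials, the helper names and the test point $(q, p) = (0.5, 0.3)$ are illustrative assumptions, not part of the original work.

```python
from fractions import Fraction

# A polynomial in (Q, P) is stored as {(i, j): coeff}, meaning coeff * Q**i * P**j.
def d_dQ(f):
    return {(i - 1, j): c * i for (i, j), c in f.items() if i > 0}

def d_dP(f):
    return {(i, j - 1): c * j for (i, j), c in f.items() if j > 0}

def mul(f, g):
    out = {}
    for (a, b), c in f.items():
        for (i, j), d in g.items():
            out[(a + i, b + j)] = out.get((a + i, b + j), 0) + c * d
    return out

def add(f, g, sign=1):
    out = dict(f)
    for key, c in g.items():
        out[key] = out.get(key, 0) + sign * c
    return {key: c for key, c in out.items() if c != 0}

def poisson(a, b):
    """{A, B} = dA/dQ * dB/dP - dA/dP * dB/dQ, as in Eq. (5)."""
    return add(mul(d_dQ(a), d_dP(b)), mul(d_dP(a), d_dQ(b)), sign=-1)

H2 = {(2, 0): Fraction(1, 2), (0, 2): Fraction(1, 2)}  # (P^2 + Q^2) / 2
H3 = {(1, 2): 2, (3, 0): -1}                           # 2 P^2 Q - Q^3
F = {(2, 1): -1}                                       # F = -P Q^2

# Left-hand side of Eq. (12): H3 - Q dF/dP + P dF/dQ = H3 + {F, H2}.
residual = add(H3, poisson(F, H2))

def flow_to_one(q, p, steps=1000):
    """Fixed-step RK4 for Eqs. (15)-(16) with F = -p q^2, from eps = 0 to 1."""
    def rhs(q, p):
        return -q * q, 2.0 * p * q  # (dF/dp, -dF/dq)
    h = 1.0 / steps
    for _ in range(steps):
        k1 = rhs(q, p)
        k2 = rhs(q + 0.5 * h * k1[0], p + 0.5 * h * k1[1])
        k3 = rhs(q + 0.5 * h * k2[0], p + 0.5 * h * k2[1])
        k4 = rhs(q + h * k3[0], p + h * k3[1])
        q += h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6
        p += h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6
    return q, p

q0, p0 = 0.5, 0.3                                 # arbitrary point with q0 > -1
Q_num, P_num = flow_to_one(q0, p0)
Q_exact, P_exact = q0 / (1 + q0), p0 * (1 + q0) ** 2  # Eqs. (13) and (14)
print(residual, Q_num - Q_exact, P_num - P_exact)
```

An empty `residual` dictionary means every cubic coefficient cancels, and the two printed differences should be at the level of the integrator's truncation error.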

Note that the canonical transformation has a set of singular points at $q = -1$. Therefore, the maximally connected component containing the origin and where the canonical transformation is well-defined is $\text {Dom}_F = \{(q, p) | -1 < q\}.$ We call $\text {Dom}_F$ the non-blow-up region of the canonical transformation generated by $F$. As long as one uses the formal power series of the canonical transformation, its domain of convergence cannot go beyond this region, and thus, this region limits the validity range of Lie canonical perturbation theory. To illustrate this, let us consider the power series expansion of the canonical transformation [Eqs. (13) and (14)],

$$\begin{aligned} Q (q, p)&= \sum _{l=0}^{\infty } \left( -1 \right) ^l q^{l+1}, \end{aligned}$$

(17)

$$\begin{aligned} P (q, p)&= p q^2 + 2 p q + p. \end{aligned}$$

(18)

The region where this expansion converges is $\{(q, p) | -1 < q < 1\}$, which is strictly smaller than $\text {Dom}_F$. Roughly speaking, the convergence radius of a canonical transformation is determined by the shortest distance between the expansion origin and the singularity of the canonical transformation, and the expansion converges only within an isotropic circle of the radius.^{Footnote 2} However, if its non-blow-up region extends anisotropically in the phase space, like the current example, the non-blow-up region can be much larger than the region where the expansion converges.
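The gap between the non-blow-up region and the domain of convergence can be made concrete numerically. The sketch below compares partial sums of Eq. (17) with the closed form of Eq. (13) at one point inside the convergence domain and one point that is outside it but still inside $\text {Dom}_F$; the sample points $q = 0.5$ and $q = 1.5$ and the function names are illustrative choices.

```python
def Q_closed(q):
    """Exact transformation of Eq. (13), defined for q > -1."""
    return q / (1.0 + q)

def Q_partial(q, n_terms):
    """Partial sum of the power series in Eq. (17) with n_terms terms."""
    return sum((-1) ** l * q ** (l + 1) for l in range(n_terms))

orders = (5, 10, 20, 40)
# q = 0.5 lies inside |q| < 1: the truncation error decreases with the order.
err_inside = [abs(Q_partial(0.5, n) - Q_closed(0.5)) for n in orders]
# q = 1.5 lies in Dom_F but outside |q| < 1: the truncation error grows.
err_outside = [abs(Q_partial(1.5, n) - Q_closed(1.5)) for n in orders]

for n, a, b in zip(orders, err_inside, err_outside):
    print(n, a, b)
```

At $q = 1.5$ the exact map is perfectly finite ($Q = 0.6$), yet every additional term of the truncated series makes the approximation worse.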

2.2 Non-blow-up regions of LCPT for $n$-dimensional Hamiltonian systems

Let ${\bf{q}} = (q_1, \ldots , q_n)$ be coordinates in an $n$-dimensional Hamiltonian system and ${\bf{p}} = (p_1,\ldots , p_n)$ be their conjugate momenta with a Hamiltonian $H ({\bf{q}}, {\bf{p}})$, which is analytic in a neighborhood of the origin $({\bf{q}}, {\bf{p}}) = {\bf{0}}$. In addition, let the Hamiltonian have a stationary point at the origin $( {\bf{q}}, {\bf{p}}) = {\bf{0}}$, i.e., $(\frac{\partial H ({\bf{q}}, {\bf{p}})}{\partial {\bf{p}}}, -\frac{\partial H ({\bf{q}}, {\bf{p}})}{\partial {\bf{q}}}) |_{({\bf{q}}, {\bf{p}}) = {\bf{0}}} = {\bf{0}}$. Without loss of generality, the value of the Hamiltonian at the origin can be set to zero, i.e., $H ({\bf{0}}, {\bf{0}}) = 0$. Under these settings, in a neighborhood of the origin, the Hamiltonian can be written as

$$\begin{aligned} H \left( {\bf{q}}, {\bf{p}} \right) = \sum _{k=2}^{\infty } H_k \left( {\bf{q}}, {\bf{p}} \right) \end{aligned}$$

(19)

where $H_k ({\bf{q}}, {\bf{p}})$ is a homogeneous polynomial of order $k$ with respect to $({\bf{q}}, {\bf{p}})$. Depending on the form of $H_2 ({\bf{q}}, {\bf{p}})$, several types of normal forms have been proposed, such as the semi-simple normal form and the inner product normal form [12]. There also exist several types of normalization procedures to realize the normal forms [12]. Here, we use the normalization procedure due to Dragt and Finn [82], which is classified as format 2a in [12].^{Footnote 3} However, our method also works for the other procedures classified into format 2 in [12]. The procedure of Dragt and Finn aims at normalizing the Hamiltonian [Eq. (19)] by the following consecutive Lie canonical transformations

$$\begin{aligned} {\bf{Q}}^{(m)} \left( {\bf{q}}, {\bf{p}} \right)&= e^{- \{F_{{m}}, \cdot \}} e^{- \left\{ F_{{m-1}}, \cdot \right\} } \cdots e^{- \{ F_{{3}}, \cdot \}} {\bf{q}}, \end{aligned}$$

(20)

$$\begin{aligned} {\bf{P}}^{(m)} \left( {\bf{q}}, {\bf{p}} \right)&= e^{- \{F_m, \cdot \}} e^{- \left\{ F_{m-1}, \cdot \right\} } \cdots e^{- \{ F_3, \cdot \}} {\bf{p}}, \end{aligned}$$

(21)

generated by the generating functions $F_m, F_{m-1}, \ldots , F_3$ where $F_k \, (3 \le k \le m)$ is a homogeneous polynomial of order $k$ with respect to $({\bf{q}}, {\bf{p}})$. The non-blow-up region $U_m$ of the LCPT is

$$\begin{aligned} U_m&= \text {Dom}_{F_3} \cap \bigcap _{k=4}^m e^{\{ F_3, \cdot \}} \ldots e^{\left\{ F_{k-1}, \cdot \right\} }\text {Dom}_{F_k}, \nonumber \\&= \text {Dom}_{F_3} \cap e^{\{ F_3, \cdot \}} \text {Dom}_{F_4} \cap \cdots \cap e^{\{ F_3, \cdot \}} \ldots e^{\left\{ F_{m-1}, \cdot \right\} } \text {Dom}_{F_m}, \end{aligned}$$

(22)

that is, an intersection between $\text {Dom}_{F_{3}}$ and $e^{\{ F_3, \cdot \}} \ldots e^{\left\{ F_{k-1}, \cdot \right\} } \text {Dom}_{F_k} \, ( k=4, \ldots , m )$, which is the non-blow-up region $\text {Dom}_{F_k}$ pulled back to the space spanned by the original phase space variables, ${\bf{p}}$ and ${\bf{q}}$.
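The intersection in Eq. (22) suggests a simple numerical membership test: push an initial condition through the time-one flows stage by stage and reject it as soon as one stage fails to stay finite. The sketch below is a toy illustration under strong assumptions. It reads Eq. (22) as applying the stages in the order $F_3, F_4, \ldots$; it uses a plain fixed-step RK4 integrator with a size threshold rather than the blow-up technique of the Appendix; and while `field_F3` comes from $F = -pq^2$ of Sect. 2.1, the quartic generator in `field_F4_toy` is hypothetical and is not the $F_4$ produced by the actual normalization.

```python
import math

def time_one_map(field, x, steps=2000, bound=1.0e6):
    """RK4 from eps = 0 to 1; returns None once the state is non-finite or exceeds bound."""
    h = 1.0 / steps
    for _ in range(steps):
        k1 = field(x)
        k2 = field([xi + 0.5 * h * ki for xi, ki in zip(x, k1)])
        k3 = field([xi + 0.5 * h * ki for xi, ki in zip(x, k2)])
        k4 = field([xi + h * ki for xi, ki in zip(x, k3)])
        x = [xi + h * (a + 2 * b + 2 * c + d) / 6
             for xi, a, b, c, d in zip(x, k1, k2, k3, k4)]
        if not all(math.isfinite(xi) and abs(xi) <= bound for xi in x):
            return None
    return x

def field_F3(x):
    """Hamiltonian vector field of F3 = -p q^2 (from Sect. 2.1)."""
    q, p = x
    return [-q * q, 2.0 * p * q]

def field_F4_toy(x):
    """Vector field of a HYPOTHETICAL F4 = p q^3, used only for illustration."""
    q, p = x
    return [q * q * q, -3.0 * p * q * q]

def in_nonblowup_region(x0, fields):
    """True if every stage of the composed time-one flows stays bounded."""
    x = list(x0)
    for field in fields:
        x = time_one_map(field, x)
        if x is None:
            return False
    return True

for q0 in (0.5, -0.6, 3.0, -1.5):
    print(q0,
          in_nonblowup_region((q0, 0.3), [field_F3]),
          in_nonblowup_region((q0, 0.3), [field_F3, field_F4_toy]))
```

In this toy setting, $q_0 = -0.6$ and $q_0 = 3$ survive the cubic stage but not the second one, which mirrors how each additional factor in Eq. (22) can only remove points from the region.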

In general, we have $U_{m_1} \subseteq U_{m_2}$ for $m_1 \ge m_2,$ and thus, the non-blow-up region shrinks as the perturbation order $m \, (m > 3)$ increases. How the non-blow-up region shrinks depends on the specific forms of the generating functions but, in general, if $k \, (k \ge 3)$ is odd and if 0 is an isolated critical point of $F_k ({\bf{q}}, {\bf{p}})$,^{Footnote 4} the differential equation induced by $F_k ({\bf{q}}, {\bf{p}})$,

$$\begin{aligned} \frac{d {\bf{q}}}{d \epsilon }&= \frac{\partial F_k \left( {\bf{q}}, {\bf{p}} \right) }{\partial {\bf{p}}}, \end{aligned}$$

(23)

$$\begin{aligned} \frac{d {\bf{p}}}{d \epsilon }&= -\frac{\partial F_k \left( {\bf{q}}, {\bf{p}} \right) }{\partial {\bf{q}}}, \end{aligned}$$

(24)

is unbounded, i.e., there is at least one unbounded solution [83].^{Footnote 5} If the unbounded solution blows up in a finite time, it can be shown that non-blow-up region of the canonical transformation generated by $F_k$ is not equal to the whole phase space. The reason is the following. Let $k$ be an odd integer that is larger than 2 and $({\bf{q}} (\epsilon ), {\bf{p}} (\epsilon ))$ be one of the solutions of the differential equation that blows up at $\epsilon ^*$. Then,

$$\begin{aligned} \left( {\bf{q}}' (\epsilon ), {\bf{p}}' (\epsilon ) \right) = \left( \epsilon ^* \right) ^{\frac{1}{k-2}} \left( {\bf{q}} \left( \epsilon ^* \epsilon \right) , {\bf{p}} \left( \epsilon ^* \epsilon \right) \right) \end{aligned}$$

(25)

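is also a solution of the differential equation, and it blows up at $\epsilon = 1$. Its initial condition $({\bf{q}}' (0), {\bf{p}}' (0))$ therefore lies outside $\text {Dom}_{F_k}$.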
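As a worked illustration of the rescaling in Eq. (25), consider the cubic generating function $F = -p q^2$ of Sect. 2.1 ($k = 3$). Its critical points are not isolated, so it is used here only to illustrate the scaling step, not the existence statement of [83]. Solving Eqs. (15) and (16) from the initial condition $(q_0, p_0)$ gives

$$\begin{aligned} q (\epsilon ) = \frac{q_0}{1 + \epsilon q_0}, \quad p (\epsilon ) = p_0 \left( 1 + \epsilon q_0 \right) ^2, \end{aligned}$$

which reduces to Eqs. (13) and (14) at $\epsilon = 1$. For $q_0 = -\frac{1}{2}$, this solution blows up at $\epsilon ^* = 2$, so that $(\epsilon ^*)^{\frac{1}{k-2}} = 2$ and Eq. (25) yields

$$\begin{aligned} q' (\epsilon ) = 2 q \left( 2 \epsilon \right) = - \frac{1}{1 - \epsilon }, \quad p' (\epsilon ) = 2 p \left( 2 \epsilon \right) = 2 p_0 \left( 1 - \epsilon \right) ^2. \end{aligned}$$

One can check directly that $\frac{dq'}{d \epsilon } = - (q')^2$ and $\frac{dp'}{d \epsilon } = 2 p' q'$, so the rescaled curve is again a solution, and it blows up at $\epsilon = 1$. Its initial condition $(q' (0), p' (0)) = (-1, 2 p_0)$ lies on the line $q = -1$, i.e., on the boundary of $\text {Dom}_F$ found in Sect. 2.1.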

2.3 A demonstration of how the non-blow-up region $U_m$ depends on the perturbation order $m$

In this section, we provide two examples of how the non-blow-up region $U_m$ shrinks as the perturbation order $m$ increases. First, we evaluate non-blow-up regions of LCPT by Dragt and Finn in a Hamiltonian [Eq. (1)] in Sect. 2.3.1 and compare them with those of LCPTs by Hori and by Deprit. The relation between these LCPTs has been discussed in [4, 9–14], and comparison has been made in terms of the computational complexity [4, 14], generalizability to non-autonomous Hamiltonian systems [4] and to an abstract setting of graded Lie algebras [12, 13]. Here, these LCPTs are compared in terms of their non-blow-up regions and validity ranges. In Sect. 2.3.2, we evaluate non-blow-up regions in a HCN molecule. In both the examples, we use a blow-up technique to integrate the differential equation in Eqs. (23) and (24) shown in Sect. 6.1 in Appendix. In the calculation of generating functions, we used an algorithm of Broer [14] for LCPT by Dragt and Finn and the triangle algorithms [10, 12] for LCPTs by Hori and Deprit.

2.3.1 Non-blow-up regions in a Hamiltonian [Eq. (1)]

In this section, we investigate how the non-blow-up region $U_m$ depends on the perturbation order $m$. To relate the non-blow-up regions to the phase space topology of the Hamiltonian [Eq. (1)], we plot contour lines of the Hamiltonian in Fig. 1a in the energy range $[-0.1, 0.4]$. This Hamiltonian has four fixed points, one of which is elliptic and the other three are hyperbolic. The elliptic fixed point is located at the origin $(q, p) = {\bf{0}}$ and the other three are located at $( \frac{1}{3},0) ,\,(-\frac{1}{4}, \sqrt{\frac{7}{32}})$ and $(-\frac{1}{4}, -\sqrt{\frac{7}{32}})$, respectively. The one located at $(\frac{1}{3}, 0)$ has an energy $\frac{1}{54}$, which is smaller than the energy $\frac{3}{64}$ of the other two hyperbolic fixed points, and thus, the separatrix closest to the origin is made up of the stable and unstable manifolds of the hyperbolic fixed point $(\frac{1}{3}, 0)$. In Fig. 1b, we plot non-blow-up regions $U_m \, (m=5,10,15,20)$ of LCPT by Dragt and Finn for the Hamiltonian [Eq. (1)] along with the separatrix of the Hamiltonian. This figure shows that $U_m$ shrinks as $m$ increases and converges to the region inside the separatrix. To see whether similar behavior occurs in other types of perturbation theory, we compare with the non-blow-up regions of LCPT by Hori and of that by Deprit, which are classified as formats 2b and 2c in [12], respectively. The former seeks a canonical transformation of the form

$$\begin{aligned} \tilde{Q}^{(m)} (q, p)&= e^{- \sum _{k=3}^m \left\{ \tilde{F}_k, \cdot \right\} } q, \end{aligned}$$

(26)

$$\begin{aligned} \tilde{P}^{(m)} (q, p)&= e^{- \sum _{k=3}^m \left\{ \tilde{F}_k, \cdot \right\} } p, \end{aligned}$$

(27)

where $\tilde{F}_k (q, p) \, (k=3, \ldots , m)$ is a homogeneous polynomial of order $k$ with respect to $q$ and $p$. The generating function $\tilde{F}_k (q, p) \, (k=3, \ldots , m)$ is determined by the conventional manner (see [12]). $\tilde{Q}^{(m)} (q, p)$ and $\tilde{P}^{(m)} (q, p)$ can be obtained by integrating the differential equations

$$\begin{aligned} \frac{d q (\epsilon )}{d \epsilon }&= \frac{\partial \sum _{k=3}^m \tilde{F}_k \left( q (\epsilon ), p (\epsilon ) \right) }{\partial p}, \end{aligned}$$

(28)

$$\begin{aligned} \frac{d p (\epsilon )}{d \epsilon }&= -\frac{\partial \sum _{k=3}^m \tilde{F}_k \left( q (\epsilon ), p (\epsilon ) \right) }{\partial q}, \end{aligned}$$

(29)

up to $\epsilon = 1$ starting from $(q (0), p (0)) = (q, p)$ at $\epsilon = 0$ and by putting

$$\begin{aligned} \left( \tilde{Q}^{(m)} (q, p), \tilde{P}^{(m)} (q, p) \right) = \left( q (1), p (1) \right) . \end{aligned}$$

(30)

The latter one seeks for a canonical transformation generated by the generating function $W^{( m)} (\epsilon , q, p) = \sum _{k=3}^m \frac{\epsilon ^{k-3}}{(k-3)!} \hat{F}_k (q, p)$, where $\hat{F}_k (q, p)$ is a homogeneous polynomial of order $k$. In this case, the new variables $( \hat{Q}^{(m)} (q, p), \hat{P}^{(m)} (q, p) )$ can be obtained by integrating the differential equation,

$$\begin{aligned} \frac{d q (\epsilon )}{d \epsilon }&= \frac{\partial W^{(m)} \left( \epsilon , q (\epsilon ), p (\epsilon ) \right) }{\partial p}, \end{aligned}$$

(31)

$$\begin{aligned} \frac{d p (\epsilon )}{d \epsilon }&= -\frac{\partial W^{(m)} \left( \epsilon , q (\epsilon ), p (\epsilon ) \right) }{\partial q}, \end{aligned}$$

(32)

until $\epsilon = 1$, starting from the initial condition $(q (0), p (0)) = (q, p)$ at $\epsilon = 0$. Then, $(\hat{Q}^{(m)}(q, p), \hat{P}^{(m)} (q, p))$ can be obtained as $(\hat{Q}^{(m)} (q, p), \hat{P}^{(m)} (q, p)) = (q (1), p (1)).$ In these cases, we also define the non-blow-up regions of the LCPT of order $m,\, \tilde{U}_m$ (Hori) and $\hat{U}_m$ (Deprit), as the sets of initial conditions for which the solutions of the canonical transformations generated by these generating functions are bounded. In Fig. 1c, d, we plot $\tilde{U}_m$ and $\hat{U}_m$ for $m = 5,10,15$ and $20$. For Hori’s LCPT, the non-blow-up regions $\tilde{U}_{15}$ and $\tilde{U}_{20}$ do not cover the whole region inside the separatrix, whereas those of Deprit’s LCPT cover relatively wide regions in the phase space. In this specific example, LCPT by Dragt and Finn and that by Deprit have wider non-blow-up regions than that by Hori up to perturbation order 20. A more systematic study is needed to determine which of the possible formats of LCPT (some of which are listed in [12]) leads to the widest non-blow-up region.

To investigate validity ranges of the LCPTs, we compare the action variables constructed using the LCPTs with the true action inside the separatrix. Here, the true action is defined as

$$\begin{aligned} I = \frac{1}{2 \pi } \int _{\left\{ (q, p) | H (q, p) \le E, \text {inside the separatrix} \right\} } dq dp, \end{aligned}$$

(33)

[80] while the actions constructed by LCPTs are denoted as $I^{(20)} = \frac{1}{2} ((p^{(20)})^2 + (q^{(20)})^2)$ (Dragt and Finn), $\tilde{I}^{(20)} = \frac{1}{2} ((\tilde{p}^{(20)})^2 + (\tilde{q}^{(20)})^2)$ (Hori), and $\hat{I}^{(20)} = \frac{1}{2} ((\hat{p}^{(20)})^2 + (\hat{q}^{(20)})^2)$ (Deprit) with $I^{(20)}_{\text {trunc}} = \frac{1}{2} ((p^{(20)}_{\text {trunc}})^2 + (q^{(20)}_{\text {trunc}})^2)$ (Dragt and Finn, truncated), respectively, where $p^{(20)}_{\text {trunc}}$ and $q^{(20)}_{\text {trunc}}$ are constructed as follows. First, expand the canonical transformation Eqs. (20) and (21) with respect to ${\bf{q}}$ and ${\bf{p}}$, and then truncate it at the 21st order, which is the conventional prescription used in [32]. These actions agree with each other to within $O ( E^{\frac{21}{2}} )$. This is because the Hamiltonian (1) can be written as $H (q, p) = H_{\text {int}} ( I^{(20)} ) + O \left( 21\right)$ [this symbol $O$ is the same as that defined in Eq. (2)] in terms of these actions, and, thus, the following equation

$$\begin{aligned} I&= \frac{1}{2 \pi } \int _{\left\{ (q, p) | H_{\text {int}} \left( I^{(20)} \right) \le E, \text {inside the separatrix} \right\} } dq dp + O \left( E^{\frac{21}{2}} \right) ,\nonumber \\&= \frac{1}{2 \pi } \int _{\left\{ (q, p) | H_{\text {int}} \left( I^{(20)} \right) \le E, \text {inside the separatrix} \right\} } dI^{(20)} d\varTheta ^{(20)} + O \left( E^{\frac{21}{2}} \right) ,\nonumber \\&= \int _{\left\{ I' | H_{\text {int}} \left( I' \right) \le E \right\} } dI' + O \left( E^{\frac{21}{2}} \right) , \nonumber \\&= I^{(20)} + O \left( E^{\frac{21}{2}} \right) , \end{aligned}$$

(34)

holds, where $\varTheta ^{(20)} = \arctan \frac{q^{(20)}}{p^{(20)}}$ is an angle variable that is conjugate to $I^{(20)}$. To derive the last equality, we use the fact that $H_{\text {int}} (I')$ is monotonically increasing with respect to $I'$. Therefore, the difference between the action $I^{(20)}$ and the true action is $O (E^{\frac{21}{2}})$. The same is true for the other actions. Note that, at energies above that of the separatrix, contour lines of the Hamiltonian do not enclose finite regions, and, thus, the true action is defined only inside the separatrix. However, the actions constructed using the LCPTs are well-defined inside their non-blow-up regions and we call them actions in what follows. In Fig. 2a–d, we show their relative errors from the true action $I$, defined as (a) $\frac{| I^{(20)} - I|}{I}$ (Dragt and Finn), (b) $\frac{|\tilde{I}^{(20)} - I|}{I}$ (Hori), (c) $\frac{| \hat{I}^{(20)} - I|}{I}$ (Deprit), and (d) $\frac{| I^{(20)}_{\text {trunc}} - I |}{I}$ (Dragt and Finn, truncated), respectively. This comparison shows that the truncated action $I^{(20)}_{\text {trunc}}$ cannot describe the true action properly in the region close to the separatrix (the relative error exceeds 100 %), whereas $I^{(20)}$ describes the action inside the separatrix within 1 % error. This tendency does not change even if the perturbation order is increased further. In addition, $\hat{I}^{(20)}$ (Deprit) has larger errors than $I^{(20)}$ (Dragt and Finn), whereas $\tilde{I}^{(20)}$ (Hori) has errors comparable to those of $I^{(20)}$ (Dragt and Finn) inside the non-blow-up region $\tilde{U}_{20}$. A more systematic study is needed, but, in this specific example, the LCPT by Dragt and Finn leads to the best result of all, regarding both the width of the non-blow-up region and the accuracy inside it. Therefore, we use the LCPT by Dragt and Finn in what follows.
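For reference, the true action of Eq. (33) can be evaluated by direct quadrature, since on a contour $H = E$ inside the separatrix Eq. (1) gives $p^2 = 2 (E - \frac{1}{2} q^2 + q^3)/(1 + 4 q)$. The following is a minimal sketch, not the procedure used to produce Fig. 2: the bisection brackets, the substitution $q = c + h \sin \theta$ that removes the square-root endpoint behavior, and the midpoint rule are all illustrative choices.

```python
import math

def H(q, p):
    """Hamiltonian of Eq. (1)."""
    return 0.5 * (p * p + q * q) + 2.0 * p * p * q - q ** 3

E_SEP = H(1.0 / 3.0, 0.0)  # energy of the hyperbolic point (1/3, 0), i.e. 1/54

def turning_point(E, lo, hi, iters=200):
    """Bisection root of g(q) = E - q^2/2 + q^3 on [lo, hi], where p = 0 on the contour."""
    def g(q):
        return E - 0.5 * q * q + q ** 3
    sign_lo = g(lo) > 0
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if (g(mid) > 0) == sign_lo:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def true_action(E, n=4000):
    """I = (enclosed area) / (2 pi) for the contour H = E inside the separatrix."""
    if not 0.0 < E < E_SEP:
        raise ValueError("E must lie between 0 and the separatrix energy")
    q_min = turning_point(E, -0.25, 0.0)
    q_max = turning_point(E, 0.0, 1.0 / 3.0)
    c, h = 0.5 * (q_max + q_min), 0.5 * (q_max - q_min)
    d_theta = math.pi / n
    area = 0.0
    for i in range(n):
        theta = -0.5 * math.pi + (i + 0.5) * d_theta
        q = c + h * math.sin(theta)
        p_sq = 2.0 * (E - 0.5 * q * q + q ** 3) / (1.0 + 4.0 * q)
        # Factor 2 accounts for the upper and lower branches p = +/- sqrt(p_sq).
        area += 2.0 * math.sqrt(max(p_sq, 0.0)) * h * math.cos(theta) * d_theta
    return area / (2.0 * math.pi)

for E in (0.001, 0.005, 0.01, 0.018):
    print(E, true_action(E))
```

At small $E$ the result approaches the harmonic value $I \approx E$, which gives a quick sanity check before comparing against $I^{(20)}$ or its truncated counterpart.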

2.3.2 Non-blow-up regions in a HCN molecule

A schematic of this molecule is shown in Fig. 3. The molecule consists of three atoms, H, C and N. Restricting to zero total angular momentum, the Hamiltonian can be described by the following three degrees of freedom (dofs) in Jacobi coordinates: $r$ (distance between the C and N atoms), $R$ (distance between H and the center of mass of C and N) and $\gamma$ (angle between H and C as seen from the center of mass of C and N). The corresponding Hamiltonian is

$$\begin{aligned} H = \frac{1}{2 \mu } p_r^2 + \frac{1}{2m} p_R^2 + \frac{1}{2} \left( \frac{1}{\mu r^2} + \frac{1}{m R^2} \right) p_{\gamma }^2 + V \left( r, R, \gamma \right) \end{aligned}$$

(35)

where $\mu = (m_C m_N)/(m_C+m_N)$ is the reduced mass of the CN diatom, $m = (m_H \left( m_C + m_N \right) )/(\left( m_H+m_C+m_N \right) )$ is the reduced mass of the full system, and the potential $V \left( r, R, \gamma \right)$ is taken from Murrell et al. [84]. This molecule has two minima with collinear configurations, one called HCN and the other CNH. The potential energy of the saddle located between the HCN and CNH wells is $-0.444$ kcal/mol. The HCN well, the CNH well and the saddle point between them correspond to $\gamma \approx 0$, $\gamma \approx \pi$ and $\gamma \approx \pm 1.168$ rad, respectively. In Fig. 4, we show the intersections between the non-blow-up regions $U_m \, (m = 4, 8, 12, 16)$ and the section $p_r = p_{\gamma }, p_R = 0, H = -0.430$ kcal/mol, projected onto the coordinate space $( r, R, \gamma )$ and rendered with ParaView [85], version 4.10. The boundary of the energetically accessible region is plotted as a transparent surface. This surface looks like a bottle, and its neck corresponds to the saddle region $\gamma \approx 1.168$ (rad.). The figure indicates that the non-blow-up regions disappear at the saddle region and that, at perturbation order 16, $U_{16}$ cannot cover the whole region inside the HCN basin, $\gamma = -1.168 \sim 1.168$ (rad.).
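The structure of Eq. (35) can be made concrete with a short sketch. The potential of Murrell et al. [84] is not reproduced here, so it is passed in as a user-supplied callable; the function names and argument order are illustrative assumptions, and the caller is responsible for using consistent units.

```python
def reduced_masses(m_h, m_c, m_n):
    """Reduced masses appearing in Eq. (35)."""
    mu = m_c * m_n / (m_c + m_n)                # CN diatom
    m = m_h * (m_c + m_n) / (m_h + m_c + m_n)   # H relative to the CN centre of mass
    return mu, m


def hcn_hamiltonian(r, R, gamma, p_r, p_R, p_gamma, potential, masses):
    """Zero-angular-momentum Hamiltonian of Eq. (35) in Jacobi coordinates.

    `potential` is any callable V(r, R, gamma); `masses` is (m_H, m_C, m_N).
    """
    mu, m = reduced_masses(*masses)
    kinetic = (
        p_r ** 2 / (2.0 * mu)
        + p_R ** 2 / (2.0 * m)
        + 0.5 * (1.0 / (mu * r ** 2) + 1.0 / (m * R ** 2)) * p_gamma ** 2
    )
    return kinetic + potential(r, R, gamma)


if __name__ == "__main__":
    # Placeholder potential (identically zero), for illustration only.
    free = lambda r, R, gamma: 0.0
    print(hcn_hamiltonian(1.0, 1.0, 0.3, 1.0, 1.0, 1.0, free, (1.0, 1.0, 1.0)))
```

Keeping the potential as an argument means that the same kinetic-energy code can be reused with any fitted surface.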

3 A method of how to improve non-blow-up regions

In this section, we propose a method to improve the validity ranges of Lie canonical perturbation theory. Here and in what follows, we assume that $H_2 ( {\bf{q}}, {\bf{p}} )$ can be written as $\frac{1}{2} \sum _{i=1}^n \omega _i (q_i^2 + p_i^2)$, where $\omega _i \, ( \omega _i > 0 )$ is the linear frequency of the $i$th mode; this holds if the origin $( {\bf{q}}, {\bf{p}} ) = {\bf{0}}$ is an elliptic fixed point. However, it is straightforward to generalize the method to other types of fixed points. We propose generating functions of the form $\check{F}_k ( {\bf{q}}, {\bf{p}} ) = ( 1 - \exp ( - \frac{\alpha _k}{H_2^l} ) ) F_k ( {\bf{q}}, {\bf{p}} )$, where $F_k ( {\bf{q}}, {\bf{p}} ) \, ( k=3, \ldots , m )$ are the generating functions of the LCPT by Dragt and Finn and $l$ and $\alpha _k \, ( k=3, \ldots , m )$ are positive real numbers.

First, note that the new generating function $\check{F}_k ( {\bf{q}}, {\bf{p}} )$ has the same Taylor coefficients as $F_k ( {\bf{q}}, {\bf{p}} )$, and thus, the resulting Hamiltonian $\check{H}^{(m)} = e^{- \{ \check{F}_m, \cdot \}} \ldots e^{- \{ \check{F}_3, \cdot \}} H$ has the same normal form as $H^{(m)} = e^{- \{F_m, \cdot \}} \ldots e^{- \{ F_3, \cdot \}} H$ up to the order $m$.
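One way to see why the Taylor coefficients coincide is the following sketch, written with the shorthand $r_{\omega } = \sqrt{2 H_2}$ (introduced here only for this remark). The two generating functions differ by an exponentially small factor times $F_k$,

$$\begin{aligned} \check{F}_k - F_k = - \exp \left( - \frac{2^l \alpha _k}{r_{\omega }^{2l}} \right) F_k, \qquad \lim _{r_{\omega } \rightarrow 0} r_{\omega }^{-N} \exp \left( - \frac{2^l \alpha _k}{r_{\omega }^{2l}} \right) = 0 \quad ( N = 0, 1, 2, \ldots ), \end{aligned}$$

so the difference vanishes faster than any power of the distance from the origin and therefore contributes nothing to the Taylor expansion at any finite order.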

Second, due to the factor $( 1 - \exp ( - \frac{\alpha _k}{H_2^l} ) )$ in front of the generating function, the canonical transformation generated by the new generating function is free from blowing up. To show this, it is sufficient to show that all the solutions of the following differential equations $( 3 \le k \le m )$,

$$\begin{aligned} \frac{d {\bf{q}}}{d \epsilon }&= \frac{\partial \check{F}_k \left( {\bf{q}}, {\bf{p}} \right) }{\partial {\bf{p}}}, \end{aligned}$$

(36)

$$\begin{aligned} \frac{d {\bf{p}}}{d \epsilon }&= -\frac{\partial \check{F}_k \left( {\bf{q}}, {\bf{p}} \right) }{\partial {\bf{q}}}, \end{aligned}$$

(37)

do not blow up for $\epsilon \in \left[ 0, 1 \right]$. This is because the results of LCPT are finite if the solutions do not blow up for all $k=3, \ldots , m$. For this, it is sufficient to show that $r_{\omega } = \left\| \left( {\bf{q}}, {\bf{p}} \right) \right\| _{\omega }$ remains bounded over the unit time interval under the time evolution of Eqs. (36) and (37), where $\left\| \cdot \right\| _{\omega }$ is the norm induced by the weighted inner product

$$\begin{aligned} \langle \left( {\bf{q}}', {\bf{p}}' \right) , \left( {\bf{q}}, {\bf{p}} \right) \rangle = \sum _{i=1}^n \omega _i \left( q_i' q_i + p_i' p_i \right) . \end{aligned}$$

(38)

It can be evaluated as

$$\begin{aligned} \left| \frac{d \log r_{\omega }}{d \epsilon } \right| \le C_k r^{k-2}_{\omega } \left( 1 - \exp \left( - \frac{2^l \alpha _k}{r^{2l}_{\omega }} \right) \right) , \end{aligned}$$

(39)

where we set

$$\begin{aligned} C_k = \max _{\left\| {\bf{e}} \right\| _{\omega } = 1} \left\| \nabla F_k \left( {\bf{e}} \right) \right\| _{\omega }, \end{aligned}$$

(40)

which is a finite number. Under the condition $l \ge \frac{k-2}{2}$, the right-hand side of Eq. (39) is bounded, and thus, $r_{\omega }$ has a finite growth rate during the unit time interval. The detailed derivation of Eq. (39) is shown in Sect. 6.2 in Appendix. The condition $l \ge \frac{k-2}{2}$ is a sufficient condition for non-blow-up because if $l \ge \frac{k-2}{2}$ holds, the right-hand side of Eq. (39) has a finite limit $\lim _{r_{\omega } \rightarrow \infty } 2^l \alpha _k C_k r^{k-2-2l}_{\omega }$, and, thus, it has a finite maximum value in $[ 0, \infty ]$.
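The effect of the damping factor can be illustrated numerically. The sketch below is not the implementation used in this work: it assumes one degree of freedom with $\omega = 1$, a hypothetical cubic generating function $F_3 = q^2 p$, hypothetical parameters $\alpha _3 = 1$ and $l = 1$, and a fixed-step fourth-order Runge–Kutta integrator chosen for simplicity. It integrates Eqs. (36) and (37) over $\epsilon \in [0, 1]$ with and without the factor, and also evaluates the right-hand side of Eq. (39).

```python
import math

ALPHA, L_EXP = 1.0, 1.0  # hypothetical alpha_3 and l (l >= (k - 2)/2 = 1/2 for k = 3)


def damping(q, p):
    """g = 1 - exp(-alpha / H2^l) and dg/dH2, with H2 = (q^2 + p^2)/2."""
    h2 = 0.5 * (q * q + p * p)
    if h2 < 1e-12:  # exp(-alpha / H2^l) underflows to zero here
        return 1.0, 0.0
    e = math.exp(-ALPHA / h2 ** L_EXP)
    return 1.0 - e, -e * ALPHA * L_EXP / h2 ** (L_EXP + 1.0)


def field(q, p, damped):
    """Right-hand sides of Eqs. (36)-(37) for the assumed F_3 = q^2 p."""
    f, f_q, f_p = q * q * p, 2.0 * q * p, q * q
    g, dg = damping(q, p) if damped else (1.0, 0.0)
    # Product rule, using dH2/dq = q and dH2/dp = p.
    return g * f_p + dg * p * f, -(g * f_q + dg * q * f)


def flow(q, p, damped, steps=20000, r_max=1e6):
    """Integrate from epsilon = 0 to 1; returns (q, p, finished)."""
    h = 1.0 / steps
    for _ in range(steps):
        k1 = field(q, p, damped)
        k2 = field(q + 0.5 * h * k1[0], p + 0.5 * h * k1[1], damped)
        k3 = field(q + 0.5 * h * k2[0], p + 0.5 * h * k2[1], damped)
        k4 = field(q + h * k3[0], p + h * k3[1], damped)
        q += h * (k1[0] + 2.0 * k2[0] + 2.0 * k3[0] + k4[0]) / 6.0
        p += h * (k1[1] + 2.0 * k2[1] + 2.0 * k3[1] + k4[1]) / 6.0
        if math.hypot(q, p) > r_max:  # treated as numerical blow-up
            return q, p, False
    return q, p, True


def bound_rhs(r, k, l, alpha, c_k=1.0):
    """Right-hand side of Eq. (39); c_k = 1 is a placeholder constant."""
    x = (2.0 ** l) * alpha / r ** (2.0 * l)
    return c_k * r ** (k - 2) * (-math.expm1(-x))


if __name__ == "__main__":
    print(flow(2.0, 0.0, damped=False))  # exact solution diverges at epsilon = 1/2
    print(flow(2.0, 0.0, damped=True))
```

For the initial condition $(q, p) = (2, 0)$ the undamped equation reduces to $dq/d\epsilon = q^2$, whose exact solution diverges at $\epsilon = 1/2$, whereas the damped growth rate $q^2 ( 1 - \exp ( -2 \alpha _3 / q^2 ) )$ never exceeds $2 \alpha _3$. Evaluating `bound_rhs` with $l < \frac{k-2}{2}$ shows the right-hand side of Eq. (39) growing with $r_{\omega }$ instead of staying bounded.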

Note that the canonical transformation generated by $\check{F}_k$ is no longer analytic. This is due to the fact that the factor $( 1 - \exp ( - \frac{\alpha _k}{H_2^l} ) )$ is not analytic at the origin $( {\bf{q}}, {\bf{p}} ) = {\bf{0}}$. In general, the normal form of the Hamiltonian is merely a formal power series, and, at best, it is an asymptotic power series with respect to the normalized action-angle variables. This indicates the non-existence of an analytic canonical transformation that leads to the desired normal form, because its existence would imply that the original Hamiltonian depends analytically on the normalized action-angle variables. In contrast, by the Borel-Ritt theorem [12], for every formal power series there exists a $C^{\infty }$ function (which is not necessarily analytic) whose Taylor coefficients are the same as those of the formal power series. Therefore, there may be a canonical transformation of class $C^{\infty }$ that leads to the desired normal form. This is one of the reasons why we seek a non-analytic canonical transformation. A method for determining $l$ and $\alpha _k \, ( k=3, \ldots , m )$ is shown in Sect. 6.3 in Appendix.

4 Demonstration of our method to improve the validity range

In this section, we demonstrate how our method works for the two systems.

4.1 Demonstration of our method in the Hamiltonian system [Eq. (1)]

In Fig. 5a, b, we show the two actions $I^{(20)}$ and $\check{I}^{(20)}$ along with the true action $I$, where $\check{I}^{(20)}$ is defined as

$$\begin{aligned} \check{I}^{(20)} = \frac{1}{2} \left( \left( \check{p}^{(20)} \right) ^{2} + \left( \check{q}^{(20)} \right) ^{2} \right) . \end{aligned}$$

(41)

This figure indicates that the action $\check{I}^{(20)}$ extends smoothly to the outside of the non-blow-up region $U_{20},$ whereas $I^{(20)}$ has some spurious peaks indicated by the circles in Fig. 5a′. To investigate how the action $\check{I}^{(20)}$ describes the dynamics for the outside region of the separatrix, we superpose the contour surface of $\check{I}^{(20)}$ with the contour lines of the Hamiltonian in Fig. 6a. The separatrix is indicated by the pink dotted line in this figure. This figure indicates that the two contour lines are roughly parallel with each other. To quantify it, we plot $\left| \frac{\left\{ \check{I}^{(20)}, H \right\} }{\check{I}^{(20)}} \right|$ and $\left| \frac{\left\{ I^{(20)}_{\text {trunc}}, H \right\} }{I^{(20)}_{\text {trunc}}} \right|$ in Fig. 6b, c, respectively. If the contour lines of the actions and the Hamiltonian are parallel with each other, this quantity should be zero. This figure indicates that $\left| \frac{\left\{ \check{I}^{(20)}, H \right\} }{\check{I}^{(20)}} \right|$ is smaller than $\left| \frac{\left\{ I^{(20)}_{\text {trunc}}, H \right\} }{I^{(20)}_{\text {trunc}}} \right|$ by more than 100 times for the outside of the separatrix, whereas $\left| \frac{\left\{ \check{I}^{(20)}, H \right\} }{\check{I}^{(20)}} \right|$ is $<0.1$ in the plotted region. Here, we use $I^{(20)}_{\text {trunc}}$ as a reference to compare because the non-blow-up region $U_{20}$ is almost the same as the region inside the separatrix (see Fig. 1b), and, thus, it cannot be used to compare with $\check{I}^{(20)}$ outside of the separatrix. Again, note that the true action defined as Eq. (33) does not exist outside of the separatrix but the action $\check{I}^{(20)}$ is well defined even outside of the separatrix and serves as an approximate integral of motion, i.e., $\left| \left\{ \check{I}^{(20)}, H \right\} \right| \le 0.1 \times \check{I}^{(20)}$.
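The diagnostic $\left| \{ I, H \} / I \right|$ used above can be evaluated numerically for any pair of phase-space functions. The following minimal sketch does so for one degree of freedom with central finite differences; the step size, the function names and the cubic test Hamiltonian are illustrative assumptions, not the Hamiltonian of Eq. (1) or the actions constructed in this work.

```python
def poisson_bracket(f, g, q, p, h=1e-5):
    """{f, g} = df/dq * dg/dp - df/dp * dg/dq via central differences.

    Only the absolute value is used below, so the overall sign convention
    of the bracket does not affect the diagnostic.
    """
    f_q = (f(q + h, p) - f(q - h, p)) / (2.0 * h)
    f_p = (f(q, p + h) - f(q, p - h)) / (2.0 * h)
    g_q = (g(q + h, p) - g(q - h, p)) / (2.0 * h)
    g_p = (g(q, p + h) - g(q, p - h)) / (2.0 * h)
    return f_q * g_p - f_p * g_q


def drift_measure(action, hamiltonian, q, p):
    """|{I, H} / I| at a phase-space point; zero for an exact invariant."""
    return abs(poisson_bracket(action, hamiltonian, q, p) / action(q, p))


if __name__ == "__main__":
    harmonic_action = lambda q, p: 0.5 * (q * q + p * p)
    # Hypothetical test Hamiltonian with a cubic term.
    test_hamiltonian = lambda q, p: 0.5 * (q * q + p * p) + q ** 3 / 3.0
    print(drift_measure(harmonic_action, test_hamiltonian, 0.5, 0.4))
```

For the harmonic action and this test Hamiltonian the bracket equals $-p q^2$ analytically, which provides a direct check of the finite-difference evaluation.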

4.2 Demonstration of our method in the HCN molecule

In this section, we apply our method to the HCN molecule to demonstrate how it improves the behavior of the action variables. To this end, we calculate the actions $I^{\left( 7 \right) }_i \, \left( i=1,2,3 \right) ,\, \check{I}^{\left( 7 \right) }_i \, \left( i=1,2,3 \right)$ and $I^{\left( 7 \right) }_{\text {trunc},i} \, \left( i=1,2,3 \right)$ along a trajectory at energy $-0.430$ kcal/mol, which is above the potential energy of the saddle located between HCN and CNH. Roughly speaking, the third mode ($i=3$) corresponds to the $\gamma$ direction, which leads to structural transitions between HCN and CNH, and the other modes $i=1,2$ are bath modes that couple weakly to the third mode. Perturbation order 7 has been shown to be sufficient to obtain converged actions [1, 2] at this energy. In Fig. 7a, b, we show a typical trajectory of (a) $r, \,R$ and (b) $\gamma$, respectively. The phase space region $-1.168 \le \gamma \le 1.168 \, ( \text {mod} \, 2 \pi )$ corresponds to the HCN well, and this trajectory shows two structural transitions between HCN and CNH at the time instances indicated by the arrows in Fig. 7b, i.e., $t = 8.671$ (fs) and $t = 6.697 \times 10^1$ (fs). For the part of the trajectory in the HCN well, we show how the actions evolve in time in Fig. 7c ($I^{\left( 7 \right) }_i \, \left( i=1,2,3 \right) , \, \check{I}^{\left( 7 \right) }_i \, \left( i=1,2,3 \right)$) and Fig. 7d ($I^{\left( 7 \right) }_{\text {trunc},i} \, \left( i=1,2,3 \right)$). The actions $I^{\left( 7 \right) }_{\text {trunc},i} \, \left( i=1,2,3 \right)$ change abruptly in time, and it is very difficult to extract any insight from them, whereas the actions shown in Fig. 7c indicate the existence of slowly varying actions. However, the actions $I^{\left( 7 \right) }_i \, \left( i=1,2,3 \right)$ have spurious peaks, as indicated by the circles in Fig. 7c. These peaks appear when the trajectory comes very close to the edge of the HCN well ($\gamma \approx 1.168$ or $\gamma \approx 2 \pi -1.168$). In contrast, the actions $\check{I}^{\left( 7 \right) }_i \, \left( i=1,2,3 \right)$ are free from these spurious peaks. Further study is needed to quantify the difference between the two, but this demonstration indicates the potential of our method to suppress these spurious peaks at the edge of non-blow-up regions.

5 Conclusions and discussions

Validity ranges of LCPT have been investigated in terms of non-blow-up regions. The non-blow-up region of an LCPT is the subset of initial conditions in phase space for which the results of the perturbation are finite. It limits the validity range of the LCPT, because the results must at least be finite to be valid. We have investigated how the validity ranges depend on the perturbation order in two systems, one of which is a simple Hamiltonian system with one degree of freedom and the other is a HCN molecule. Our analysis of the former system indicates that non-blow-up regions become reduced in size as the perturbation order increases. In the case of the LCPT by Dragt and Finn and that by Deprit, the non-blow-up regions enclose the region inside the separatrix of the Hamiltonian, but this may not be the case for the LCPT by Hori. We have also analyzed how well the actions constructed by these LCPTs approximate the true action of the Hamiltonian in the non-blow-up regions and have found that the conventional truncated LCPT does not work over the whole region inside the separatrix, whereas the LCPT by Dragt and Finn without truncation does. In addition, the LCPT by Dragt and Finn leads to smaller errors than that by Deprit. Regarding the width of the non-blow-up region and the accuracy inside it, the LCPT by Dragt and Finn leads to the best results among the three. Our analysis of the latter system indicates that non-blow-up regions do not necessarily cover the whole region inside the HCN well.

We have proposed a new perturbation method to improve non-blow-up regions and validity ranges inside them. Our method is free from blowing up and retains the same normal form as the conventional LCPT. We demonstrated our method in the two systems and showed that the actions constructed by our method have larger validity ranges than those of the conventional method and of our previous method proposed in [1, 2]. Previously, Padé approximations have also been used to improve validity ranges of LCPT [86–91]. Empirically, these approximations work well, and poles of the Padé approximation tend to clump together in the regions where chaotic motion is observed, such as separatrices or other chaotic regions [86, 89]. However, even for an entire function, i.e., one that is analytic in the whole complex plane, its Padé approximation can diverge everywhere [92], and thus, it may not be a reliable method to investigate the phase space geometry. In addition, Teramoto et al. [1, 2] demonstrated that Padé approximation does not work for a highly excited HCN molecule. In contrast, our method is free from such spurious diverging behavior and works even for such highly excited molecules. Other possible methods to improve validity ranges are to use different styles of normalization [12, 93] and to use the Kolmogorov normal form [94–97]. Both methods can be used in combination with our method.

Our method can be applied to various subjects in dynamical reaction theory. For example, it would enable us to estimate the time evolution of action variables more precisely than the existing methods, since the action variables constructed by our method are free from blowing up. Thus, it provides us with a new methodology to visualize the Arnold web, leading to a better understanding of the dynamical mechanism of intramolecular vibrational-energy redistribution (IVR) [98]. Moreover, the method can be used to investigate how the region around the potential saddle and the well are connected dynamically, since the actions thus constructed offer a better approximation of the real dynamics locally even beyond the separatrix. Therefore, we could evaluate what the stable/unstable manifolds emanating from the normally hyperbolic invariant manifold (NHIM) around the saddle look like in the well, even when the energy of the reactive mode is larger than that of the saddle. This would make it possible to understand how the reactive mode obtains energy to go over the saddle from the well and how it loses energy to end up in the well. Results of these studies will be published in the near future in separate papers.

Notes

In the context of chemistry, one is often interested in extracting slowly varying action variables, i.e., adiabatic invariants. This is because these variables determine the slowest time scale of intramolecular vibrational relaxation and of the chemical reaction triggered by it. For this purpose, the desired normal form would be the one that maximally decouples these action variables from those of the other degrees of freedom.
Note that if the generating function $F$ is real analytic, $Q (q, p)$ and $P (q, p)$ are also real analytic in $\text {Dom}_F$ [81].
In this book, the existing types of perturbation theory are classified into five formats (1a, 1b, 2a, 2b and 2c), depending on whether they use a generating function and on whether they are iterative or recursive. The LCPTs by Dragt and Finn, Hori and Deprit are classified into formats 2a, 2b and 2c, respectively. For details of the classification, see Sect. 3.2 of [12].
0 is an isolated critical point of $F_k ({\bf{q}}, {\bf{p}})$ if $F_k ({\bf{q}}, {\bf{p}})$ has a critical (stationary) point at 0 and there exists an open neighborhood of 0 within which there is no critical point other than 0.
See the Corollary on p. 1921 of [83]. Note that $k$ in this manuscript corresponds to $k+1$ in their notation.

References

Teramoto H, Komatsuzaki T (2008) Exploring remnant of invariants buried in a deep potential well in chemical reactions. J Chem Phys 129:094302
Article Google Scholar
Teramoto H, Komatsuzaki T (2008) Probing remnants of invariant s to mediate energy exchange in highly-chaotic many-dimensional systems. Phys Rev E 78:017202
Article Google Scholar
Lichtenberg AJ, Lieberman MA (1991) Regular and chaotic dynamics, 2nd edn. Springer, New York
Google Scholar
Cary JR (1981) Lie transform perturbation theory for Hamiltonian systems. Phys Rev 79:129
Google Scholar
Hori G (1966) Theory of general perturbations with unspecified canonical variables. Publ Astron Soc Jpn 18:287
Google Scholar
Hori G (1967) Non-linear coupling of two harmonic oscillations. Publ Astron Soc Jpn 19:229
Google Scholar
Deprit A (1969) Canonical transformations depending on a small parameter. Celest Mech 1:12
Article Google Scholar
Dragt AJ, Finn JM (1976) Lie series and invariant functions for analytic symplectic maps. J Math Phys 17:2215
Article Google Scholar
Campbell JA, Jefferys WH (1970) Equivalence of the perturbation theories of Hori and Deprit. Celest Mech 2:467
Article Google Scholar
Marsman WA (1970) A new algorithm for the Lie transformation. Celest Mech 3:81
Article Google Scholar
Koseleff PV (1994) Comparison between Deprit and Dragt-Finn perturbation methods. Celest Mech Dyn Astron 58:17
Article Google Scholar
Murdock J (2003) Normal forms and unfoldings for local dynamical systems. Springer monographs in mathematics, 1st edn. Springer, New York
Book Google Scholar
Sanders JA, Verhulst F, Murdock J (2007) Averaging methods in nonlinear dynamical systems. Applied mathematical sciences, 2nd edn. Springer, New York
Google Scholar
Broer H, Hoveijn I, Lunter G, Vegter G (2003) Bifurcations in Hamiltonian Systems. Lecture notes in mathematics, vol 1806. Springer, Berlin
Siegel CL (1941) On the integrals of canonical systems. Ann Math 42:806
Article Google Scholar
Bryuno AD (1975) Normal form of real differential equations. Math Notes 18:722
Article Google Scholar
Bryuno AD (1982) Divergence of a real normalizing transformation. Math Notes 31:207
Article Google Scholar
Ito H (1989) Convergence of Birkhoff normal forms for integrable systems. Comment Math Helv 64:412
Article Google Scholar
Ito H (1992) Integrability of Hamiltonian systems and Birkhoff normal forms in the simple resonance case. Math Ann 292:411
Article Google Scholar
Bruno AD, Walcher S (1994) Symmetries and convergence of normalizing transformations. J Math Anal Appl 183:571
Article Google Scholar
Cicogna G (1996) On the convergence of normalizing transformations in the presence of symmetries. J Math Anal Appl 199:243
Article Google Scholar
Kappeler T, Kodama Y, Némethi A (1998) On the Birkhoff normal form of a completely integrable Hamiltonian system near a fixed point with resonance. Ann Scuola Norm Sup Pisa Cl Sci XXVI:623
Google Scholar
Walcher S (2000) On convergent normal form transformations in presence of symmetries. J Math Anal Appl 244:17
Article Google Scholar
Pérez-Marco P (2001) Total convergence or general divergence in small divisors. Commun Math Phys 223:451
Article Google Scholar
Cicogna G, Walcher S (2002) Convergence of normal form transformations: the role of symmetries. Acta Appl Math 70:95
Article Google Scholar
Zung NT (2005) Convergence versus integrability in Birkhoff normal form. Ann Math 161:141
Article Google Scholar
Chiba H (2009) Extension and unification of singular perturbation methods for ODEs based on the renormalization group method. SIAM J Appl Dyn Syst 8:1066
Article Google Scholar
Markus L, Meyer KR (1974) Generic hamiltonian dynamical systems are neither integrable nor ergodic. Mem Am Math Soc 144
Koon WS, Lo MW, Marsden JE, Ross SD (2000) Heteroclinic connections between periodic orbits and resonance transitions in celestial mechanics. Chaos 10:427
Article Google Scholar
Jaffe C, Ross SD, Lo MW, Marsden J, Farrelly D, Uzer T (2002) Statistical theory of asteroid escape rates. Phys Rev Lett 89:011101
Article Google Scholar
von Milczewski J, Diercksen GHF, Uzer T (1996) Computation of the Arnol’d Web for the hydrogen atom in crossed electric and magnetic fields. Phys Rev Lett 76:2890
Article Google Scholar
Uzer T, Jaffé C, Palacián J, Yanguas P, Wiggins S (2002) The geometry of reaction dynamics. Nonlinearity 15:957
Article Google Scholar
Komatsuzaki T, Berry RS (1999) Regularity in chaotic reaction paths. $\text{ I }.\, \text{ Ar }_6$. J Chem Phys 110:9160–9173
Article CAS Google Scholar
Komatsuzaki T, Berry RS (1999) Regularity in chaotic reaction path $\text{ II }:\, \text{ Ar }_6$—energy dependence and visualization of the reaction bottleneck. Phys Chem Chem Phy. 1:1387
Article CAS Google Scholar
Komatsuzaki T, Berry RS (2000) Local regularity and non-recrossing path in transition states—a new strategy in chemical reaction theories. J Mol Struct (Theochem) 506:55
Article CAS Google Scholar
Komatsuzaki T, Berry RS (2001) Regularity in chaotic reaction paths. III: local invariances at the reaction bottleneck. J Chem Phys 115:4105
Article CAS Google Scholar
Komatsuzaki T, Berry RS (2001) Dynamical hierarchy in transition states: why and how does a system climb over the mountain? Proc Natl Acad Sci USA 98:7666
Article CAS Google Scholar
Komatsuzaki T, Berry RS (2002) A dynamical propensity rule of transitions in chemical reactions. J Phys Chem A 106:10945
Article CAS Google Scholar
Komatsuzaki T, Berry RS (2002) Chemical reaction dynamics: many-body chaos and regularity. Adv Chem Phys 123:79
CAS Google Scholar
Komatsuzaki T, Nagaoka M (1996) Study on “regularity” of the barrier recrossing motion. J Chem Phys 105:10838
Article CAS Google Scholar
Komatsuzaki T, Nagaoka M (1997) A dividing surface free from a barrier recrossing motion in many-body systems. Chem Phys Lett 265:91
Article CAS Google Scholar
Kawai S, Fujimura Y, Kajimoto O, Yamashita T, Li C-B, Komatsuzaki T, Toda M (2007) Dimension reduction for extracting geometrical structure of multidimensional phase space: application to fast energy exchange in the reaction $\text{ O }(^1{{\rm D}})+\text{ N }_2{{\rm O}}\rightarrow \text{ NO }+\text{ NO }$. Phys Rev A 75:022714
Article Google Scholar
Kawai S, Komatsuzaki T (2010) Robust existence of a reaction boundary to separate the fate of a chemical reaction. Phys Rev Lett 105:048304
Article Google Scholar
Jaffé C, Kawai S, Palacián J, Yanguas P, Uzer T (2005) A new look at the transition state: Wigner’s dynamical perspective revisited. Adv Chem Phys 130:171
Google Scholar
Li C-B, Matsunaga Y, Toda M, Komatsuzaki T (2005) Phase space reaction network on a multisaddle energy landscape: Hcn isomerization. J Chem Phys 123:184301
Article Google Scholar
Waalkens H, Burbanks A, Wiggins S (2004) Phase space conduits for reaction in multidimensional systems, HCN isomerization in three dimensions. J Chem Phys 121:6207
Article CAS Google Scholar
Bartsch T, Hernandez R, Uzer T (2005) Transition state in a noisy environment. Phys Rev Lett 95:058301
Article Google Scholar
Bartsch T, Uzer T, Hernandez R (2005) Stochastic transition states: reaction geometry amidst noise. J Chem Phys 123:204102
Article Google Scholar
Bartsch T, Uzer T, Moix JM, Hernandez R (2006) Identifying reactive trajectories using a moving transition state. J Chem Phys 124:244310
Article Google Scholar
Kawai S, Komatsuzaki T (2009) Dynamical reaction coordinate buried in thermal fluctuation i: time-dependent normal form theory for multidimensional underdamped langevin equation. J Chem Phys 131:224505
Article Google Scholar
Kawai S, Komatsuzaki T (2009) Dynamical reaction coordinate buried in thermal fluctuation ii: numerical examples. J Chem Phys 131:224506
Article Google Scholar
Kawai S, Komatsuzaki T (2010) Hierarchy of reaction dynamics in a thermally fluctuating environment. Phys Chem Chem Phys 12:7626–7635
Article CAS Google Scholar
Kawai S, Komatsuzaki T (2010) Nonlinear dynamical effects on reaction rate constants in thermally fluctuating environments. Phys Chem Chem Phys 12:7636–7647
Article CAS Google Scholar
Kawai S, Komatsuzaki T (2010) Dynamical reaction coordinate in thermally fluctuating environment in the framework of multidimensional generalized langevin equations. Phys Chem Chem Phys 12:15382–15391
Article CAS Google Scholar
Fried LE, Ezra GS (1987) Semiclassical quantization using perturbation theory: algebraic quantization of multidimensional systems. J Chem Phys 86:6270
Article CAS Google Scholar
Fried LE, Ezra GS (1988) Perturb: a special-purpose algebraic manipulation program for classical perturbation theory. Comput Phys Commun 51:103
Article CAS Google Scholar
Fried LE, Ezra GS (1988) Semiclassical quantization of polyatomic molecules: some recent developments. J Phys Chem 92:3144
Article CAS Google Scholar
Kawai S, Komatsuzaki T (2011) Quantum reaction boundary to mediate reactions in laser fields. J Chem Phys 134:024317
Article Google Scholar
Kawai S, Komatsuzaki T (2012) Laser control of chemical reactions by phase space structures. Bull Chem Soc Jpn 85:854–861
Article CAS Google Scholar
Giorgilli A, Galgani L (1985) Rigorous estimates for the series expansions of hamiltonian perturbation theory. Celest Mech 37:95
Article Google Scholar
Arnold V (1964) Instabilities in dynamical systems with several degrees of freedom. Sov Math Dokl 5:581
Google Scholar
Chirikov BV (1979) A universal instability of many-dimensional oscillator systems. Phys Rep 52:263
Article Google Scholar
Guzzo M, Lega E, Froeschlé C (2009) A numerical study of the topology of normally hyperbolic invariant manifolds supporting arnold diffusion in quasi-integrable systems. Phys D 238:1797
Article CAS Google Scholar
Cincottaa PM, Efthymiopoulosb C, Giordanoa CM, Mestrea MF (2014) Chirikov and nekhoroshev diffusion estimates: bridging the two sides of the river. Phys D 266:49
Article Google Scholar
Martens CC, Davis MJ, Ezra GS (1987) Local frequency analysis of chaotic motion in multidimensional systems: energy transport and bottlenecks in planar OCS. Chem Phys Lett 142:519
Article CAS Google Scholar
Atkins KM, Logan DE (1992) Intersecting resonances as a route to chaos: classical and quantum studies of a three-oscillator model. Phys Lett A 162:255
Article Google Scholar
Froeschlé C, Guzzo M, Lega E (2000) Graphical evolution of the arnold web: from order to chaos. Science 289:2108
Article Google Scholar
Chandre C, Wiggins S, Uzer T (2003) Time-frequency analysis of chaotic systems. Phys D 181:171
Article CAS Google Scholar
Shojiguchi A, Li C-B, Komastuzaki T, Toda M (2006) Wavelet analysis and Arnold web picture for detecting energy transfer in a Hamiltonian dynamical system. Laser Phys 17:1097
Article Google Scholar
Arnold VI, Kozlov VV, Neishtadt AI (2006) Mathematical aspects of classical and celestial mechanics. Encyclopedia of mathematical sciences, 3rd edn. Springer, Berlin
Google Scholar
Laskar J (1993) Frequency analysis for multi-dimensional systems. Global dynamics and diffusion. Phys D 67:257
Article Google Scholar
Honjo S, Kaneko K (2003) Structure of resonances and transport in multidimensional Hamiltonian dynamical systems. Adv Chem Phys 130B:437
Google Scholar
Semparithi A, Keshavamurthy S (2006) Intramolecular vibrational energy redistribution as state space diffusion: classical-quantum correspondence. J Chem Phys 125:141101
Article Google Scholar
Shojiguchi A, Li C-B, Komastuzaki T, Toda M (2007) Fractional behavior in nonergodic reaction processes of isomerization. Phys Rev E75:035204(R)
Google Scholar
Shojiguchi A, Li C-B, Komastuzaki T, Toda M (2007) Fractional behavior in multidimensional Hamiltonian systems describing reactions. Phys Rev E76:056205
Google Scholar
Wiggins S (1990) On the geometry of transport in phase space I. Transport in k-degree-of-freedom Hamiltonian systems, $2 \le k \le \infty$. Phys D 44:471
Article Google Scholar
Gillilan RE, Ezra GS (1991) Transport and turnstiles in multidimensional hamiltonian mappings for unimolecular fragmentation: application to van der Waals predissociation. J Chem Phys 94:2648
Article CAS Google Scholar
Toda M (1995) Crisis in chaotic scattering of a highly excited van der waals complex. Phys Rev Lett 74:2670
Article CAS Google Scholar
Shojiguchi A, Li C-B, Komastuzaki T, Toda M (2008) Dynamical foundation and limitation of statistical reaction theory. Commun Nonlinear Sci Numer Simul 13:857
Article Google Scholar
Goldstein H, Poole CP Jr, Safko JL (2001) Classical mechanics, 3rd edn. Addison-Wesley, Boston
Google Scholar
Coddington EE (1984) Theory of ordinary differential equations. Krieger Pub Co, Huntington
Google Scholar
Dragt AJ, Finn JM (1979) Normal form mirror machine hamiltonians. J Math Phys 20:2649
Article Google Scholar
Coleman CS (1984) Boundedness and unboundedness in polynomial differential systems. Nonlinear Anal Theory Methods Appl 8:1287
Article Google Scholar
Murrell JN, Carter S, Halonen LO (1982) Frequency optimized potential energy functions for the ground-state surfaces of hcn and hcp. J Mol Spectrosc 93:307
Article CAS Google Scholar
Ahrens J, Geveci B, Law C (2005) ParaView: an end-user tool for large data visualization. In: Hansen C, Johnson C (eds) The visualization handbook. Academic Press, London, p 717
Chapter Google Scholar
Shirts RB, Reinhardt WP (1982) Approximate constants of motion for classically chaotic vibrational dynamics: vague tori, semiclassical quantization, and classical intramolecular energy flow. J Chem Phys 77:5204
Article CAS Google Scholar
Ali MK, Wood WR, Devitt JS (1986) On the summation of the Birkhoff–Gustavson normal form of an anharmonic oscillator. J Math Phys 27:1806
Article Google Scholar
Ali MK, Wood WR (1987) The Birkhoff–Gustavson normal form of Double-Well anharmonic oscillators. Prog Theor Phys 78:766
Article Google Scholar
Robnik M (1993) On the Padé approximations to the Birkhoff–Gustavson normal form. J Phys A Math Gen 26:7427
Article Google Scholar
Li CB, Shojiguchi A, Toda M, Komatsuzaki T (2006) Definability of no-return transition states in high energy regime above threshold. Phys Rev Lett 97:028302
Article Google Scholar
Teramoto H, Takatsuka K (2007) Local integrals and their globally connected invariant structure in phase space giving rise to a promoting mode of chemical reaction. J Chem Phys 126:124110
Article Google Scholar
Baker GA Jr, Graves-Morris P (1996) Padé approximants, 2nd edn. Cambridge University Press, Cambridge
Book Google Scholar
Kaluža M, Robnik M (1992) Improved accuracy of the Birkhoff–Gustavson normal form and its convergence properties. J Phys A Math Gen 25:5311
Article Google Scholar
Arnold VI (1963) Proof of a theorem of A. N. Kolmogorov on the invariance of quasi-periodic motions under small perturbations of the Hamiltonian. Russ Math Surv 18:9
Article Google Scholar
Howland RA (1977) An accelerated eliminations technique for the solution of perturbed Hamiltonian systems. Celest Mech 15:327
Article Google Scholar
Howland RA, Richardson DL (1984) The Hamiltonian transformation in quadratic Lie transforms. Celest Mech 32:99
Gabern F, Jorba À, Locatelli U (2005) On the construction of the Kolmogorov normal form for the Trojan asteroids. Nonlinearity 18:1705
Uzer T (1991) Theories of intramolecular vibrational energy transfer. Phys Rep 199:73
Press WH, Teukolsky SA, Vetterling WT, Flannery BP (2007) Numerical recipes, the art of scientific computing. International series of monographs on chemistry, 3rd edn. Cambridge University Press, Cambridge
Stroustrup B (2008) Programming: principles and practice using C++, 3rd edn. Addison-Wesley Professional, Boston

Acknowledgments

We would like to dedicate this article to the continuous, stimulating and pioneering work of Professor Greg Ezra in the research field of classical and semiclassical chemical dynamics. HT would like to thank Professor Kazuyuki Yagasaki and Zin Arai for their valuable comments on the definition of the action in the Hamiltonian with one degree of freedom and Professor Turgay Uzer for his comment on the Padé approximation as an alternative method for improving the validity range. This work has been supported by JSPS, the Cooperative Research Program of “Network Joint Research Center for Materials and Devices”, Research Center for Computational Science, Okazaki, Japan, and Grant-in-Aid for challenging Exploratory Research (to TK), and Grant-in-Aid for Scientific Research (B) (to TK) from the Ministry of Education, Culture, Sports, Science and Technology, and Nara Women’s University Intramural Grant for Project Research (to MT), Grant-in-Aid for challenging Exploratory Research (to MT), and Grant-in-Aid for Scientific Research (C) (to MT) from the Ministry of Education, Culture, Sports, Science and Technology.

Author information

Authors and Affiliations

Molecule and Life Nonlinear Sciences Laboratory, Research Institute for Electronic Science, Hokkaido University, Kita 20 Nishi 10, Kita-ku, Sapporo, 001-0020, Japan
Hiroshi Teramoto & Tamiki Komatsuzaki
Nonequilibrium Dynamics Laboratory, Research group of Physics, Division of Natural Science, Nara Women’s University, Nara, 630-8506, Japan
Mikito Toda

Authors

Hiroshi Teramoto
Mikito Toda
Tamiki Komatsuzaki

Corresponding author

Correspondence to Tamiki Komatsuzaki.

Additional information

Dedicated to Professor Greg Ezra and published as part of the special collection of articles celebrating his 60th birthday.

Appendix

1.1 A blow-up method to solve Eqs. (23) and (24)

It is difficult to solve Eqs. (23) and (24) directly because their right-hand sides are of order $k-1$ with respect to ${\bf{q}}$ and ${\bf{p}}$ and therefore grow rapidly as ${\bf{q}}$ and ${\bf{p}}$ increase. To solve these differential equations, we introduce the blow-up coordinates [83] $r$ and ${\bf{e}}$ such that $\left( {\bf{q}}, {\bf{p}} \right) = r {\bf{e}}$ and ${\bf{e}} \cdot {\bf{e}} = 1$. In addition, we introduce a scaled virtual time $s$, defined by $d \epsilon = \frac{1}{r^{\left( k-2 \right) }} d s$, so that $\epsilon$ increases more slowly as the solution approaches infinity. In terms of the blow-up coordinates and the scaled virtual time, Eqs. (23) and (24) can be written as

$$\begin{aligned} \frac{d \log r}{ds}&= \varTheta _k \left( {\bf{e}} \right) , \end{aligned}$$

(42)

$$\begin{aligned} \frac{d {\bf{e}}}{ds}&= \left( \left. \frac{\partial F_k \left( {\bf{q}}, {\bf{p}} \right) }{\partial {\bf{p}}} \right| _{\left( {\bf{q}}, {\bf{p}} \right) = {\bf{e}}}, \left. - \frac{\partial F_k \left( {\bf{q}}, {\bf{p}} \right) }{\partial {\bf{q}}} \right| _{\left( {\bf{q}}, {\bf{p}} \right) = {\bf{e}}} \right) - \varTheta _k \left( {\bf{e}} \right) {\bf{e}}, \end{aligned}$$

(43)

$$\begin{aligned} \frac{d \epsilon }{ds}&= \frac{1}{r^{\left( k-2 \right) }}, \end{aligned}$$

(44)

where $\varTheta _k \left( {\bf{e}} \right)$ is defined as

$$\begin{aligned} \varTheta _k \left( {\bf{e}} \right) = {\bf{e}} \cdot \left( \left. \frac{\partial F_k \left( {\bf{q}}, {\bf{p}} \right) }{\partial {\bf{p}}} \right| _{\left( {\bf{q}}, {\bf{p}} \right) = {\bf{e}}}, \left. - \frac{\partial F_k \left( {\bf{q}}, {\bf{p}} \right) }{\partial {\bf{q}}} \right| _{\left( {\bf{q}}, {\bf{p}} \right) = {\bf{e}}} \right) . \end{aligned}$$

(45)
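
The intermediate step behind Eqs. (42) and (43) can be sketched as follows, assuming (as the second line of Eq. (46) suggests) that Eqs. (23) and (24) are Hamilton's equations $d{\bf{q}}/d\epsilon = \partial F_k / \partial {\bf{p}}$ and $d{\bf{p}}/d\epsilon = - \partial F_k / \partial {\bf{q}}$ for a homogeneous polynomial $F_k$ of degree $k$. The shorthand ${\bf{X}}_k = \left( \partial F_k / \partial {\bf{p}}, - \partial F_k / \partial {\bf{q}} \right)$ is introduced here only for brevity. Homogeneity gives ${\bf{X}}_k \left( r {\bf{e}} \right) = r^{k-1} {\bf{X}}_k \left( {\bf{e}} \right)$, and therefore

$$\begin{aligned} \frac{d r}{d\epsilon }&= {\bf{e}} \cdot \frac{d \left( r {\bf{e}} \right) }{d\epsilon } = r^{k-1} \varTheta _k \left( {\bf{e}} \right) , \\ \frac{d {\bf{e}}}{d\epsilon }&= \frac{1}{r} \left( \frac{d \left( r {\bf{e}} \right) }{d\epsilon } - \frac{d r}{d\epsilon } {\bf{e}} \right) = r^{k-2} \left( {\bf{X}}_k \left( {\bf{e}} \right) - \varTheta _k \left( {\bf{e}} \right) {\bf{e}} \right) . \end{aligned}$$

Multiplying both lines by $d\epsilon / ds = r^{-\left( k-2 \right) }$ removes the factor $r^{k-2}$ and yields Eqs. (42) and (43).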

Equations (42), (43) and (44) can be solved stably because the norms of their right-hand sides are bounded by a finite value, i.e., $\max _{{\bf{e}} \cdot {\bf{e}} = 1} \left\| \nabla F_k \right\|$, where $\left\| \cdot \right\|$ is the Euclidean norm. In this paper, we integrate these differential equations using StepperDopr853 [99], which is an 8th-order Runge-Kutta method with step size control, under the constraint ${\bf{e}} \cdot {\bf{e}} = 1$ until $\epsilon$ reaches 1. We use double precision to integrate them, and if the value of $\log r$ exceeds the logarithm of the maximum value of double defined in the standard C++ library [100],

$$\begin{aligned} \tt{std::numeric\_limits<double>::max()} \end{aligned}$$

which is 1.79769e+308 in our current environment, we regard the solution as one that blows up.
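
As a rough illustration of the procedure just described, the following Python sketch integrates Eqs. (42), (43) and (44) for one degree of freedom. It is not the authors' C++ implementation. The generating function $F_3 = q^2 p$ is a hypothetical example chosen because its flow is known in closed form, a fixed-step classical Runge-Kutta (RK4) scheme stands in for the adaptive 8th-order stepper, and the constraint ${\bf{e}} \cdot {\bf{e}} = 1$ is re-imposed by renormalizing ${\bf{e}}$ after every step. All function names are illustrative, and the initial point is assumed to be away from the origin.

```python
import math
import sys

# Logarithm of the largest finite double (about 709.78); used as blow-up threshold.
LOG_MAX = math.log(sys.float_info.max)


def grad_F3(q, p):
    """Hypothetical generating function F_3 = q^2 p; returns (dF/dq, dF/dp)."""
    return 2.0 * q * p, q * q


def rhs(state, k=3):
    """Right-hand sides of Eqs. (42)-(44); state = (log r, e_q, e_p, eps)."""
    log_r, eq, ep, _ = state
    dFdq, dFdp = grad_F3(eq, ep)
    vq, vp = dFdp, -dFdq                  # Hamiltonian vector field evaluated at e
    theta = eq * vq + ep * vp             # Eq. (45)
    return (theta, vq - theta * eq, vp - theta * ep,
            math.exp(-(k - 2) * log_r))   # d eps / ds = r^{-(k-2)}


def rk4_step(state, h):
    """One classical RK4 step in the scaled time s, then renormalize e."""
    def shifted(base, slope, c):
        return tuple(b + c * s for b, s in zip(base, slope))
    k1 = rhs(state)
    k2 = rhs(shifted(state, k1, h / 2))
    k3 = rhs(shifted(state, k2, h / 2))
    k4 = rhs(shifted(state, k3, h))
    new = [x + h / 6 * (a + 2 * b + 2 * c + d)
           for x, a, b, c, d in zip(state, k1, k2, k3, k4)]
    norm = math.hypot(new[1], new[2])     # re-impose e . e = 1
    new[1] /= norm
    new[2] /= norm
    return tuple(new)


def flow_to_eps_one(q0, p0, h=0.01, tol=1e-12, max_steps=200_000):
    """Return (q, p) at eps = 1, or None if log r exceeds LOG_MAX first."""
    r0 = math.hypot(q0, p0)
    state = (math.log(r0), q0 / r0, p0 / r0, 0.0)
    for _ in range(max_steps):
        if state[3] >= 1.0 - tol:
            r = math.exp(state[0])
            return r * state[1], r * state[2]
        trial = rk4_step(state, h)
        if trial[0] > LOG_MAX:
            return None                   # regarded as a solution that blows up
        if trial[3] > 1.0 + tol:
            # Overshot eps = 1: shrink the step by linear interpolation and retry.
            h *= (1.0 - state[3]) / (trial[3] - state[3])
            continue
        state = trial
    raise RuntimeError("step budget exhausted before eps reached 1")


if __name__ == "__main__":
    print(flow_to_eps_one(0.5, 0.5))
    print(flow_to_eps_one(2.0, 0.0))
```

For this particular $F_3$ the exact flow is $q ( \epsilon ) = q_0 / ( 1 - q_0 \epsilon )$ and $p ( \epsilon ) = p_0 ( 1 - q_0 \epsilon )^2$, so the first call should return values close to $(1, 0.125)$, while the second returns `None` because the exact solution diverges at $\epsilon = 0.5$. In the blow-up coordinates the diverging trajectory simply has $\log r$ growing linearly in $s$ while $\epsilon$ saturates below 1, which is why the integration remains well behaved.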

This method can also be used to solve differential equations induced by generating functions in LCPTs by Hori [Eqs. (28) and (29)] and Deprit [Eqs. (31) and (32)]. However, those generating functions are not homogeneous polynomials, and thus, this method needs to be adapted for them. To solve them accurately, we decompose the phase space into two regions: one is a region where the highest order terms in the generating function dominate, and the other is its complement. In the former region, by introducing the blow-up coordinates and the scaled virtual time, the differential equations can be written as Eqs. (42), (43) and (44) plus some correction terms of order $1/r$. They can be integrated in the same manner as above. In the latter region, since the lower order terms still dominate, we can directly integrate the differential equations.
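
To see where corrections of order $1/r$ come from, suppose, purely as an illustrative assumption, that the generating function is a sum $\sum _{j=3}^{k} G_j$ of homogeneous polynomials $G_j$ of degree $j$ (whose coefficients may depend on $\epsilon$), and let $\varTheta ^{(j)}$ denote Eq. (45) with $F_k$ replaced by $G_j$. Both symbols are introduced here and are not part of the original derivation. With the same blow-up coordinates and $d \epsilon = r^{-\left( k-2 \right) } ds$, the radial equation becomes

$$\begin{aligned} \frac{d \log r}{ds}&= \sum _{j=3}^{k} \frac{1}{r^{k-j}} \varTheta ^{(j)} \left( {\bf{e}} \right) = \varTheta ^{(k)} \left( {\bf{e}} \right) + \frac{1}{r} \varTheta ^{(k-1)} \left( {\bf{e}} \right) + O \left( \frac{1}{r^2} \right) . \end{aligned}$$

The lower-degree terms are therefore small corrections where $r$ is large and become the leading contribution where $r$ is small, which is consistent with treating the two regions separately.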

1.2 A derivation of Eq. (39)

The $\epsilon$-derivative of $r_{\omega }$ can be calculated as follows,

$$\begin{aligned} \frac{d r_{\omega }}{d\epsilon }&= \frac{1}{r_{\omega }} \sum _{i=1}^n \omega _i \left( q_i \frac{dq_i}{d\epsilon } + p_i \frac{dp_i}{d\epsilon } \right) , \nonumber \\&= \frac{1}{r_{\omega }} \sum _{i=1}^n \omega _i \left( q_i \frac{\partial \check{F}_k}{\partial p_i} - p_i \frac{\partial \check{F}_k}{\partial q_i} \right) ,\nonumber \\&= \frac{1}{r_{\omega }} \sum _{i=1}^n \omega _i \left( q_i \frac{\partial F_k}{\partial p_i} - p_i \frac{\partial F_k}{\partial q_i} \right) \left( 1 - \exp \left( -\frac{2^l \alpha _k}{r_{\omega }^{2l}} \right) \right) . \end{aligned}$$

(46)

Define ${\bf{e}}$ by $\left( {\bf{q}}, {\bf{p}} \right) = r_{\omega } {\bf{e}}$; then $\left\| {\bf{e}} \right\| _{\omega } = 1$ holds. Using this and the homogeneity of $F_k$, we get

$$\begin{aligned} \frac{d r_{\omega }}{d\epsilon }&= \sum _{i=1}^n \omega _i \left( e_i \frac{\partial F_k}{\partial p_i} - e_{i+n} \frac{\partial F_k}{\partial q_i} \right) \left( 1 - \exp \left( -\frac{2^l \alpha _k}{r_{\omega }^{2l}} \right) \right) , \\&= r_{\omega }^{k-1} \sum _{i=1}^n \omega _i \left( e_i \left. \frac{\partial F_k}{\partial p_i} \right| _{\left( {\bf{q}}, {\bf{p}} \right) = {\bf{e}}} - e_{i+n} \left. \frac{\partial F_k}{\partial q_i} \right| _{\left( {\bf{q}}, {\bf{p}} \right) = {\bf{e}}} \right) \left( 1 - \exp \left( -\frac{2^l \alpha _k}{r_{\omega }^{2l}} \right) \right) . \end{aligned}$$

Finally, we get

$$\begin{aligned} \left| \frac{d \log r_{\omega }}{d\epsilon }\right| \le r_{\omega }^{k-2} \left( 1 - \exp \left( -\frac{2^l \alpha _k}{r_{\omega }^{2l}} \right) \right) \max _{\Vert {\bf{e}} \Vert _{\omega } = 1} \Vert \nabla F_k \left( {\bf{e}} \right) \Vert _{\omega }, \end{aligned}$$

(47)

which follows from the inequality

$$\begin{aligned}&- \max _{\left\| {\bf{e}} \right\| _{\omega } = 1} \left\| \nabla F_k \left( {\bf{e}} \right) \right\| _{\omega } \nonumber \\&\quad \le \sum _{i=1}^n \omega _i \left( e_i \left. \frac{\partial F_k}{\partial p_i} \right| _{\left( {\bf{q}}, {\bf{p}} \right) = {\bf{e}}} - e_{i+n} \left. \frac{\partial F_k}{\partial q_i} \right| _{\left( {\bf{q}}, {\bf{p}} \right) = {\bf{e}}} \right) \nonumber \\&\quad \le \max _{\left\| {\bf{e}} \right\| _{\omega } = 1} \left\| \nabla F_k \left( {\bf{e}} \right) \right\| _{\omega }. \end{aligned}$$

(48)

1.3 A method to determine $l$ and $\alpha _k \, ( k=3, \ldots , m )$ in Sect. 3

As pointed out in Sect. 3, the condition $l \ge \frac{k-2}{2}$ is sufficient for the right-hand side of Eq. (39) to be bounded. Here, we choose $l = k-2$ for simplicity, although this may not be the best choice; further study is needed to find an optimal power $l$. Under this choice, the maximum value of the right-hand side of Eq. (39) can be evaluated as $2^{\frac{k-2}{2}} C_k \alpha _k^{\frac{1}{2}} f^*$ and this maximum is attained at $r_{\omega } = \sqrt{2 \alpha _k^{\frac{1}{k-2}}} r^*$, where $f^* \, ( \approx 6.38173 \times 10^{-1} )$ and $r^* \, ( \approx 8.92135 \times 10^{-1} )$ are, respectively, the maximum and the argument that attains the maximum of the function $f ( r ) = r ( 1 - \exp ( -\frac{1}{r^2} ) ) \, ( r \ge 0 )$.
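
The two constants can be checked numerically. The sketch below takes $f ( r ) = r ( 1 - \exp ( -1/r^2 ) )$, the form for which the quoted values are reproduced. Setting $f' ( r ) = 0$ and writing $u = 1/r^2$ gives the stationarity condition $\exp ( u ) = 1 + 2u$, which is solved here by bisection. The function names and the bracket $[1, 2]$ are choices made for this illustration only.

```python
import math


def stationarity(u):
    """f'(r) = 0 for f(r) = r (1 - exp(-1/r^2)) is equivalent to
    exp(u) - 1 - 2u = 0 with u = 1/r^2."""
    return math.exp(u) - 1.0 - 2.0 * u


def maximize_f(lo=1.0, hi=2.0, iterations=200):
    """Bisection on [lo, hi], where the stationarity function changes sign."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if stationarity(lo) * stationarity(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    u = 0.5 * (lo + hi)
    r_star = 1.0 / math.sqrt(u)
    f_star = r_star * (1.0 - math.exp(-u))
    return r_star, f_star


if __name__ == "__main__":
    r_star, f_star = maximize_f()
    print(f"r* = {r_star:.6f}, f* = {f_star:.6f}")
```

Since $f ( r ) \rightarrow 0$ both as $r \rightarrow 0$ and as $r \rightarrow \infty$, the single positive root of the stationarity condition corresponds to the global maximum on $r \ge 0$.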

For the right-hand side of Eq. (39) to be of order 1, i.e., $2^{\frac{k-2}{2}} C_k \alpha _k^{\frac{1}{2}} f^* \sim 1$, $\alpha _k$ should satisfy $\alpha _k \sim \frac{1}{2^{k-2}} ( \frac{1}{f^* C_k} ) ^2$. This is how we determine $\alpha _k \, ( k = 3, \ldots , m )$. If $\alpha _k$ is chosen in this way,
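
The rule for $\alpha _k$ is simple enough to state as a few lines of Python. This is only a sketch: the value of $f^*$ is the one quoted above, the values of $C_k$ passed in are hypothetical inputs (in practice they come from Eq. (39)), and the helper `bound`, which evaluates the right-hand side of Eq. (47) with $l = k-2$ and the maximum of the gradient norm replaced by $C_k$, is introduced here for checking purposes.

```python
import math

F_STAR = 0.638173  # value of f* quoted in the text


def alpha_k(k, C_k, f_star=F_STAR):
    """alpha_k = 2^{-(k-2)} * (1 / (f* C_k))^2."""
    return 1.0 / (2.0 ** (k - 2) * (f_star * C_k) ** 2)


def max_rhs(k, C_k, alpha, f_star=F_STAR):
    """Quoted maximum 2^{(k-2)/2} C_k alpha^{1/2} f* of the bound."""
    return 2.0 ** ((k - 2) / 2.0) * C_k * math.sqrt(alpha) * f_star


def bound(r, k, C_k, alpha):
    """C_k r^{k-2} (1 - exp(-2^l alpha / r^{2l})) with l = k - 2."""
    l = k - 2
    return C_k * r ** (k - 2) * (1.0 - math.exp(-(2.0 ** l) * alpha / r ** (2 * l)))


if __name__ == "__main__":
    for k, C_k in [(3, 1.0), (4, 2.0), (5, 4.0)]:   # hypothetical C_k values
        a = alpha_k(k, C_k)
        print(k, C_k, a, max_rhs(k, C_k, a))
```

By construction `max_rhs` evaluates to 1 for any positive $C_k$, and doubling $C_k$ reduces $\alpha _k$ by a factor of four, which makes the sensitivity to the growth of $F_k$ explicit.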

$$\begin{aligned} \left| \frac{r_{\omega }^{\left( k+1 \right) } - r_{\omega }^{\left( k \right) }}{r_{\omega }^{\left( k \right) }} \right| \lesssim 1 \end{aligned}$$

(49)

holds for $k = 3, \ldots , m-1$. This should hold if the $( k+1 )$th-order perturbation is to act as a correction to the result up to the $k$th-order perturbation.

Note that, if $\alpha _k$ becomes smaller, the deviation between $F_k$ and $\check{F}_k$ becomes larger. Therefore, it is favorable if $\alpha _k$ is chosen as large as possible. Since $\alpha _k$ is inversely proportional to $C_k^2$, it may also be important to suppress the growth of $F_k$ without normalizing near-resonant terms in LCPT.

About this article

Cite this article

Teramoto, H., Toda, M. & Komatsuzaki, T. A new method to improve validity range of Lie canonical perturbation theory: with a central focus on a concept of non-blow-up region. Theor Chem Acc 133, 1571 (2014). https://doi.org/10.1007/s00214-014-1571-9

Received: 12 May 2014
Accepted: 21 August 2014
Published: 13 September 2014
DOI: https://doi.org/10.1007/s00214-014-1571-9
