Abstract:
Following [18], we develop an approach to the Hamiltonian theory of normal forms based on continuous averaging. We concentrate on the case of normal forms near an elliptic singular point, but unlike [18] we do not assume that frequences of the linearized system are non-resonant. We study analytic properties of the normalization procedure. In particular, we show that in the case of a codimension one resonance an analytic Hamiltonian function may be reduced to a normal form up to an exponentially small reminder with explicit estimates of the reminder and the analyticity domain.
Keywords:
Hamiltonian normal forms, Hamiltonian perturbation theory.
Consider a Hamiltonian system with $n$ degrees of freedom in a neighbourhood of an elliptic singular point. In the linear approximation, the dynamics is determined by the Hamiltonian equations
$$
\begin{equation*}
\begin{gathered} \, \dot x = \partial_y H_2, \quad \dot y = - \partial_x H_2, \qquad x = (x_1,\dots,x_n), \quad y = (y_1,\dots,y_n), \\ H_2 = \sum_{j=1}^n \frac{\omega_j}{2} (y_j^2 + x_j^2). \end{gathered}
\end{equation*}
\notag
$$
The real numbers $\omega_1,\dots,\omega_n$ form the frequency vector $\omega\in\mathbb{R}^n$.
According to Birkhoff [1], it is convenient to use the complex coordinates
Below, all the phase variables including $x$ and $y$ may be complex. Hence the overbar does not mean necessarily complex conjugation. In the other words, the variables $z$ and $\overline z$ are assumed to be independent.
The variables $(z,\overline z)$ are canonical, that is, for any two smooth functions $F = F(z,\overline z), G = G(z,\overline z)$, the Poisson bracket has the form
$$
\begin{equation*}
\{F,G\}=i\sum_{j=1}^n \bigl(\partial_{\overline z_j} F \, \partial_{z_j} G - \partial_{z_j} F\, \partial_{\overline z_j} G \bigr).
\end{equation*}
\notag
$$
The same equation determines the bracket $\{\,,\}$ on the space of formal power series in $z$ and $\overline z$. The function $H_2$ takes the form
Here, $k,\overline k\in\mathbb{Z}_+^n$, $\mathbb{Z}_+ :=\{0,1,\dots\}$ are multiindices and $|k| = |k_1|+\dots+|k_n|$ is the $l^1$-norm. Below, we use the shorter notation
More generally, $\dot F = \{\widehat H,F\}$ for any function (or formal power series) $F = F(\mathbf{z})$.
According to the theory of normal forms, the function $\widehat H$ may be simplified by passage to another coordinate system, [1].
The monomial $\widehat H_\mathbf{k}\mathbf{z}^\mathbf{k}$ in expansion (1.1) is called resonant if $\langle\omega,\overline k - k\rangle = 0$, where $\langle\,,\rangle$ is the standard inner product in $\mathbb{R}^n$. The integer number $|\overline k - k|$ is called the order of the resonance. Any integer vector $\mathbf{k}=(k,\overline k)$ which determines a resonant monomial $\mathbf{z}^\mathbf{k}$ satisfies $\overline k - k \in\mathbb{L}_\omega$, where
If $\langle\omega,q\rangle = 0$, $q\in\mathbb{Z}^n$ implies $q=0$, the frequency vector $\omega$ is said to be non-resonant.
By using an appropriate coordinate change, one gets rid of any finite set of non-resonant monomials. To eliminate all non-resonant monomials, we have to use coordinate changes in the form of, in general, divergent power series. The Hamiltonian function $N$ obtained as a result of this (formal) normalization is called the normal form of the original Hamiltonian $H_2+\widehat H$. The normal form is unique as a formal series.
The problem of convergence/divergence of the normalizing transformation under the assumption of analyticity of $\widehat H$ is central in the theory. H. Eliasson attracted attention of specialists to another (harder) problem: convergence/divergence of the normal form. If the normalization converges and the lattice $\mathcal{L}_\omega$ is at most 1-dimensional, then the system is locally completely integrable. Various versions of the inverse statement are proved in [19], [7], [4], [8].
Another corollary from convergence of the normalization is Lyapunov stability of the equilibrium position in the case of a non-resonant frequency vector.1[x]1Under some simple explicit additional conditions. The papers [5], [9] contain examples of real-analytic Hamiltonians $H_2 + \widehat H$ such that the origin is Lyapunov unstable in system (1.2) although in the linear approximation the system is obviously stable.
Convergence of the normal form does not imply convergence of the normalization. But it has interesting dynamical consequences: the measure of the set covered by KAM-tori turns out to be noticeably bigger than in the case when the normal form diverges [10].
Convergence of the normalizing transformation is an exceptional phenomenon in any reasonable sense. At the moment, this exceptionality is known in terms of Baire category [14] and $\Gamma$-capacity [12], [10]. Explicit examples of real-analytic systems with divergent normal form can be found in [6], [20], [5].
The case of the “trivial” normal form $N = H_2$ is special. According to the Bruno–Rüssmann theorem [2], [13], see also [16], if $N = H_2$ and $\omega$ satisfies some (rather weak) Diophantine conditions,2[x]2Usually called the Bruno conditions. then the normalization converges.
The normalizing change of variables is traditionally constructed as a composition of an infinite sequence of coordinate changes which normalize the Hamiltonian function up to a remainder of a higher and higher degree [1]. In further works (see for example [15]), this change of coordinates is represented as a formal series in $z$ and $\overline z$. We propose another approach to normalization.
Let $\mathcal{F}$ be the space of all power series in the variables $z$ and $\overline z$. Sometimes, we refer to elements of $\mathcal{F}$ as functions although they are only formal power series. Let $\mathcal{F}_\diamond\subset\mathcal{F}$ be the subspace of series which start from terms of degree at least 3. In this paper, assuming that $H_0$ is fixed, we study various normalization flows $\phi_\xi^\delta$ on the space $\mathcal{F}_\diamond$, $\delta\in [0,+\infty)$. Any shift
is a transformation of the Hamiltonian function $H_2 + \widehat H$ according to a certain (depending on $\delta$ and on the initial Hamiltonian $H_2 + \widehat H$) canonical change of variables.
Any flow $\phi_\xi^\delta$ is determined by a certain ODE in $\mathcal{F}_\diamond$,
$$
\begin{equation}
\partial_\delta H = - \{\xi H, H_2 + H\}, \qquad H|_{\delta=0} = \widehat H.
\end{equation}
\tag{1.4}
$$
Here, $\xi$ is a linear operator on $\mathcal{F}_\diamond$. In fact, (1.4) is an initial value problem (IVP) for a differential equation presented in the form of Lax L-A pair. Such systems are usually considered in the theory of integrable systems. However, integrability of system (1.4) whatever it means seems to be of no use to us.
Let $\mathcal{N}_\diamond\subset\mathcal{F}_\diamond$ be the subspace which consists of series which contain only resonant monomials. We will choose operators $\xi$ such that the space $\mathcal{N}_\diamond$ is invariant with respect to $\phi^\delta_\xi$ and any point of $\mathcal{N}_\diamond$ is fixed. We proposed such an approach in [18]. In this paper, we present further results. In particular, here we do not assume that $\omega$ is a non-resonant vector.
In § 3.1, we define the operator $\xi=\xi_*$. We represent system (1.4) in the form of an infinite ODE system (3.9) for the coefficients $H_\mathbf{k}$. A more convenient equivalent form of it is system (3.11). Because of a special “nilpotent” structure of system (3.11), the existence and uniqueness of a solution for the corresponding IVP for any initial condition turns out to be a simple fact (§ 4.1). In the product topology, the solution tends to the normal form as $\delta\to +\infty$.
We are particularly interested in the restriction of $\phi^\delta_{\xi_*}$ to the subspace $\mathcal{A}\subset\mathcal{F}$ of analytic Hamiltonian functions. In § 4.2, we prove that, for any $\widehat H\in\mathcal{A}$, the solution $H=H(\mathbf{z},\delta)$ also lies in $\mathcal{A}$ for any $\delta\geqslant 0$. However, the polydisc of analyticity generically shrinks when $\delta$ grows. A rough lower estimate for its (poly)radius gives a quantity of order $1/(1+\delta)$ (Theorem 4.2).
In § 5, we assume that the normal form $H_2 + N_\diamond$ of the Hamiltonian $H_2\,{+}\,\widehat H$ satisfies the equation $N_\diamond = O_r(\mathbf{z})$, $r\geqslant 3$. For example, generically3[x]3If the frequency vector $\omega$ does not admit resonances of order 3. $r=4$. The case $r>4$ is not generic but may happen for some values of parameters in multiparametric families of Hamiltonians. Assuming that $\widehat H\in\mathcal{A}$, $\omega$ satisfies the Bruno Condition, and $N_\diamond=O_r(\mathbf{z})$, Theorem 5.1 says that $H(\,{\cdot}\,,\delta)$ stays in $\mathcal{A}$ and the polydisc of analyticity has radius of order (at least) $(1+\delta)^{-1/(r-2)}$, $\delta>0$. In particular, generically this radius $\sim (1+\delta)^{-1/2}$. This improves the corresponding estimate $(1+\delta)^{-1}$ in [18].
If $\omega$ is collinear to a vector with rational components (codimention one resonance) then Theorem 5.1 implies Corollary 5.1. This corollary says that there exists a change of variables which reduces in a polydisc of radius $\sim (1+\delta)^{-1/(r-2)}$ the Hamiltonian to the form $H_2 + G^0 + G^*$, where $G^0\in\mathcal{N}_\diamond$ and $|G^*|\sim e^{-c\delta}$, $c>0$.
Theorem 6.1 from § 6 is an auxiliary statement used in the proof of Theorem 5.1. However, it probably has an independent value. It has the same assumptions: $\widehat H\in \mathcal{A}$, $\omega$ satisfies the Bruno Condition, $N_\diamond = O_r(\mathbf{z})$. Then there exists a change of the variables which reduces the Hamiltonian function to the form $H_2\,{+}\, G$, $G = O_r(\mathbf{z})$. Theorem 6.1 presents upper estimates for Taylor coefficients of the function $G$ in terms of analytic properties of $\widehat H$. As a corollary, we obtain the Bruno–Sigel theorem about convergence of the normalization in the case of the trivial normal form (Corollary 6.1).
Some technical results concerning majorants, Bruno sequences, etc., are given in § 7.
§ 2. Basic construction
For any $q = (q_1,\dots,q_n)\in\mathbb{Z}^n$, let $|q|=|q_1|+\dots+|q_n|$ be its $l^1$-norm. For any $k,\overline k\in\mathbb{Z}_+^n$ we put
Series (2.1) are assumed to be formal, that is, there is no restriction on the values of the coefficients $H_{\mathbf{k}}$. So, $\mathcal{F}$ coincides with the ring $\mathbb{C}[[z_1,\dots,z_n,\overline z_1,\dots,\overline z_n]]$. The Poisson bracket defines on $\mathcal{F}$ the structure of a Lie algebra. Below, we use on $\mathcal{F}$ the product topology, that is, a sequence $H^{(1)},H^{(2)},\ldots\in\mathcal{F}$ is said to be convergent if, for any $\mathbf{k}\in\mathbb{Z}_+^{2n}$, the sequence of coefficients $H^{(1)}_{\mathbf{k}},H^{(2)}_{\mathbf{k}},\dots$ converges.
For any $H$ satisfying (2.1), we define $p_{\mathbf{k}}(H) = H_{\mathbf{k}}$. So $p_{\mathbf{k}} \colon \mathcal{F}\to\mathbb{C}$ is a canonical projection corresponding to $\mathbf{k}\in\mathbb{Z}_+^{2n}$. Suppose $F\in\mathcal{F}$ depends on a parameter $\delta\in I$, where $I\subset\mathbb{R}$ is an interval. In other words, we consider a map
We say that $F$ is smooth in $\delta$ if all the maps $p_\mathbf{k}\circ f$ are smooth.
Let $\mathcal{F}_r\subset\mathcal{F}$ be the space of “real” series,
$$
\begin{equation*}
\mathcal{F}_r = \{H\in\mathcal{F} \colon \overline H_{\mathbf{k}} = H_{\mathbf{k}^*} \text{ for any } \mathbf{k}\in\mathbb{Z}_+^n\}.
\end{equation*}
\notag
$$
We define $\mathcal{A}\subset\mathcal{F}$ as the space of analytic series:
$$
\begin{equation*}
\mathcal{A} = \{H\in\mathcal{F} \colon \text{there exist }c, a\text{ such that }|H_{\mathbf{k}}| \leqslant c e^{a|\mathbf{k}|}\text{ for any }\mathbf{k}\in\mathbb{Z}_+^{2n}\}.
\end{equation*}
\notag
$$
The product topology may be restricted from $\mathcal{F}$ to $\mathcal{A}$, but, being a scale of Banach spaces, $\mathcal{A}$ may be endowed with a more natural topology. We have $\mathcal{A} = \bigcup_{\rho>0}\mathcal{A}^\rho$, where $\mathcal{A}^\rho$ is a Banach space with the norm $\|\,{\cdot}\,\|_\rho$,
This implies $|H_\mathbf{k}|\leqslant c\rho_0^{-|\mathbf{k}|}$. Since $\rho_0<\rho$ is arbitrary, we obtain (2.3). The lemma is proved.
Corollary 2.1. The topology on $\mathcal{A}^\rho$ by the norm $\|\,{\cdot}\,\|_\rho$ is stronger than the product topology induced from $\mathcal{F}$, that is, if a sequence $\{H^{(j)}\}$ converges in $(\mathcal{A}^\rho, \|\,{\cdot}\,\|_\rho)$, then it converges in $\mathcal{F}$.
We define the resonant sublattice (1.3). The number of generators in the group $(\mathbb{L}_\omega,+)$ is said to be the rank of $\mathbb{L}_\omega$. In particular, if $\operatorname{rank}\mathbb{L}_\omega=0$ (equivalently, $\mathbb{L}_\omega=\{0\}$), then the frequency vector is called non-resonant. If $\operatorname{rank}\mathbb{L}_\omega=1$, we have a simple resonance, if $\operatorname{rank}\mathbb{L}_\omega>1$, the resonance is multiple, if $\operatorname{rank}\mathbb{L}_\omega=n-1$, then we have a codimension one resonance. In the latter case, all trajectories of the linearized system
that is $H=O_3(\mathbf{z})$ for any $H\in\mathcal{F}_\diamond$. By definition, we also put $\mathcal{N}_\diamond = \mathcal{N}\cap\mathcal{F}_\diamond$.
§ 3. Normalization flow
Consider the change of variables in the form of a shift
along solutions of the Hamiltonian system4[x]4The so-called Lie method. with Hamiltonian $F=F(\mathbf{z},\delta)=O_3(\mathbf{z})$ and independent variable $\delta$,
Differentiating equation (3.3) with respect to $\delta$, we obtain
$$
\begin{equation*}
\partial_\delta H = -\{F,H_2+H\}, \qquad H|_{\delta=0} = \widehat H.
\end{equation*}
\notag
$$
The main idea of the continuous averaging is to take $F$ in the form $\xi H$, where $\xi$ is a certain operator5[x]5This idea is similar to that of the Moser homotopy method [11]. depending on the problem we deal with. We now obtain an initial value problem in $\mathcal{F}_\diamond$,
$$
\begin{equation}
\partial_\delta H = - \{\xi H,H_2 + H\}, \qquad H|_{\delta=0} = \widehat H.
\end{equation}
\tag{3.4}
$$
3.1. Operator $\xi_*$
To explain the idea, we start with the “simplest” operator $\xi=\xi_*$. We put
Informal explanation for the choice (3.6) of the operator $\xi=\xi_*$ is as follows. Removing the non-linear terms $v_1+v_2$ in (3.7), we obtain the equation $\partial_\delta H = v_0(H)$ or, in more detail
So, the solution $H$ of the truncated problem (3.8) tends to $H^0\in\mathcal{N}_\diamond$ as $\delta\to +\infty$.
Let $e_j=(0,\dots,1,\dots,0)$ be the $j$th unit vector in $\mathbb{Z}_+^n$ and let $\mathbf{e}_j=(e_j,e_j)\in\mathbb{Z}_+^{2n}$. Equation (3.7) written for each Taylor coefficient $H_{\mathbf{k}}$ has the form
Remark 3.1. (1) The functions $v_{1,\mathbf{k}}$ and $v_{2,\mathbf{k}}$ are quadratic polynomials in the variables $H_{\mathbf{k}}$. The functions $\mathbf{v}_{2,\mathbf{k}}$ are quadratic polynomials in $\mathcal{H}_{\mathbf{k}}$ with coefficients depending on $\delta$.
(2) The polynomial $v_{1,\mathbf{k}}$ vanishes if $\mathbf{k}'\in\mathbb{L}_\omega$.
(3) The polynomials $v_{1,\mathbf{k}}, v_{2,\mathbf{k}}$ and $\mathbf{v}_{2,\mathbf{k}}$ do not depend on the variables $H_\mathbf{s},\mathcal{H}_{\mathbf{s}}$, for any $\mathbf{s}\in\mathbb{Z}_\diamond^{2n}$ such that $|\mathbf{s}| > |\mathbf{k}|-2$.
(4) The polynomials $v_{2,\mathbf{k}}$ and $\mathbf{v}_{2,\mathbf{k}}$ do not depend on the variables $H_\mathbf{s},\mathcal{H}_{\mathbf{s}}$ for any $\mathbf{s}\in\mathbb{Z}_\diamond^{2n}$ such that $0\leqslant\sigma_{\mathbf{k}'} \langle\omega,\mathbf{s}'\rangle\leqslant\omega_{\mathbf{k}'}$.
(5) For any $\sigma_{\mathbf{l}'} < 0 < \sigma_{\mathbf{m}'}$, the quantity $\omega_{\mathbf{l}',\mathbf{m}'}$ is positive,
Let $I\subset\mathbb{R}$ be an interval containing the point $0$. We say that the path $\gamma\colon I\to\mathcal{F}_\diamond$ is a solution of system (3.4) on the interval $I$ if $\gamma(0)=\widehat H$ and the coefficients $H_{\mathbf{k}}(\delta) = p_{\mathbf{k}}\gamma(\delta)$ satisfy (3.9).
If the solution $\gamma \colon I\to\mathcal{F}_\diamond$ exists and is unique, then, for any $\delta\in I$, we put $\gamma(\delta)=\phi^\delta_{\xi_*}(\widehat H)$.
Definition 4.1. The ordinary differential equation
$$
\begin{equation*}
F' = \Phi(F), \qquad F = \sum_{|\mathbf{k}|\geqslant 3} F_\mathbf{k} \mathbf{z}^\mathbf{k}
\end{equation*}
\notag
$$
on $\mathcal{F}$ has a nilpotent form if $F'_\mathbf{k}=\Phi_\mathbf{k}$ for any $\mathbf{k}\in\mathbb{Z}_\diamond^{2n}$, where $\Phi_\mathbf{k} = p_\mathbf{k}\circ\Phi$ is a function of the variables $F_\mathbf{m}$, $|\mathbf{m}|<|\mathbf{k}|$.
According to Remark 3.1, (3), system (3.11) has a nilpotent form. In particular, $\partial_\delta\mathcal{H}_{\mathbf{k}} = 0$ for $|\mathbf{k}|=3$.
Theorem 4.1. For any $\widehat H\,{\in}\,\mathcal{F}_\diamond$ and any $\delta\,{\in}\,\mathbb{R}$, the element $H(\,{\cdot}\,,\delta)\,{=}\,\phi^\delta_{\xi_*}(\widehat H)\,{\in}\,\mathcal{F}$ is well-defined. For any $\mathbf{k}\in\mathbb{Z}_\diamond^{2n}$, the function $\mathcal{H}_\mathbf{k}(\delta) = H_\mathbf{k}(\delta) e^{\omega_{\mathbf{k}'}\delta}$ equals
where $P_\mathbf{k}$ is a polynomial in $\widehat H_\mathbf{m}$, $|\mathbf{m}|<|\mathbf{k}|$ with coefficients in the form of (finite) linear combinations of terms $\delta^s e^{-\nu\delta}$, $s\in\mathbb{Z}_+$. Here, $\nu\geqslant 0$, and, moreover, if the term $\delta^s e^{-\nu\delta}$ with $\nu=0$ appears in a coefficient of $P_\mathbf{k}$, $\mathbf{k}'\in\mathbb{L}_\omega$, then $s=0$ in this term.
Proof. For $|\mathbf{k}|=3$, we have $\mathcal{H}_\mathbf{k}(\delta)=\widehat H_\mathbf{k}$. Suppose equations (4.1) hold for all vectors $\mathbf{k}\in\mathbb{Z}_\diamond^{2n}$ with $|\mathbf{k}|<K$. We take any $\mathbf{k}\in\mathbb{Z}^{2n}_\diamond$ with $|\mathbf{k}|=K$. By (3.11)
By using (3.10), (3.12), and the induction assumption, we obtain (4.1).
If $\mathbf{k}'\in\mathbb{L}_\omega$, then $v_{1,\mathbf{k}}=0$, while each term in $\mathbf{v}_{2,\mathbf{k}}$ has the form $\lambda^s e^{-\lambda\delta}$ with $\lambda>0$. This implies the last statement of Theorem 4.1. The theorem is proved.
Corollary 4.1. The limit $\lim_{\delta\to +\infty} \phi^\delta(\widehat H)$ exists in the product topology on $\mathcal{F}_\diamond$ and lies in $\mathcal{N}_\diamond$.
Indeed, by Theorem 4.1 for any $\mathbf{k}\in\mathbb{Z}_\diamond^{2n}$, the limit
exists. The convergence is exponential and $H_{\mathbf{k}}(+\infty)$ vanishes if $\mathbf{k}'\not\in\mathbb{L}_\omega$. This is equivalent to the existence of $\lim_{\delta\to +\infty}\phi^\delta_{\xi_*}(\widehat H)=\phi^{+\infty}_{\xi_*}(\widehat H)$ together with the fact that $\phi^{+\infty}_{\xi_*}(\widehat H)\in\mathcal{N}_\diamond$.
Corollary 4.2. Suppose $\widehat H\in\mathcal{F}_r\cap\mathcal{F}_\diamond$. Then $\phi^\delta_{\xi_*}(\widehat H)\in\mathcal{F}_r\cap\mathcal{F}_\diamond$ for any $\delta\geqslant 0$.
Indeed, the required identities $\overline H_{\mathbf{k}}(\delta) = H_{\mathbf{k}^*}(\delta)$, $\delta\geqslant 0$, can be proved by induction on $|\mathbf{k}|$ by using equation (3.11).
4.2. Analytic aspect
Theorem 4.2. Suppose $\widehat H\in\mathcal{A}^\rho\cap\mathcal{F}_\diamond$, $\|\widehat H\|_\rho=h\rho^3$. Then, for any $\delta\geqslant 0$,
To obtain equation (4.4), we have replaced in $v_{1,\mathbf{k}}$ and $\mathbf{v}_{2,\mathbf{k}}$ the minuses by pluses, dropped the exponential multipliers, and added some new positive (for positive $\mathbf{H}_{\mathbf{l}}$ and $\mathbf{H}_{\mathbf{m}}$) terms. The main property of system (4.3), (4.4) is such that if, $|\widehat H_{\mathbf{k}}|\leqslant\widehat{\mathbf{H}}_{\mathbf{k}}$ for any $\mathbf{k}\in\mathbb{Z}_\diamond^{2n}$, then, for any $\delta\geqslant 0$, the inequality $|v_{1,\mathbf{k}} + \mathbf{v}_{2,\mathbf{k}}| \leqslant \mathbf{V}_\mathbf{k}$ holds, that is, conditions (a) and (b) from Definition 7.2 hold.
System (4.3), (4.4) has a nilpotent form. Therefore, by Theorem 7.1 we may use the majorant principle: if $\mathbf{H}$ is a solution of (4.3), (4.4), then (3.11) has a solution $\mathcal{H}$ and $\mathcal{H}\ll\mathbf{H}$ for any $\delta\geqslant 0$.
System (4.3), (4.4) can be written in a shorter form as
Since the initial condition $\mathbf{H}|_{\delta=0}$ depends on $z$ and $\overline z$ only through the variable $\zeta$, we may look for a solution of (4.5) in the form $\mathbf{H}(z,\overline z,\delta)=F(\zeta,\delta)$. The function $F$ satisfies the equation
Corollary 4.3. Suppose $\widehat H\in\mathcal{A}\cap\mathcal{F}_r$. Then $\phi^\delta_{\xi_*}(\widehat H)\in\mathcal{A}\cap\mathcal{F}_r$ for any $\delta\geqslant 0$.
Note that, for a typical $\widehat H\in\mathcal{A}\cap\mathcal{F}_\diamond$, one should expect that $\phi^{+\infty}_{\xi_*}(\widehat H)\in\mathcal{N}_\diamond$ does not belong to $\mathcal{A}$ [10].
§ 5. Degenerate normal form
5.1. Normalization theorem
Suppose the normal form $H_2 + N_\diamond$ is such that $N_\diamond := \lim_{\delta\to +\infty} \phi_{\xi_*}^\delta (\widehat H) \in \mathcal{N}$ vanishes in several first orders in $\mathbf{z}$. In this section, we obtain more precise estimates (in comparison with those from Theorem 4.2) for the analyticity domain of the Hamiltonian function $\mathcal{H}(\,{\cdot}\,,\delta)$ with increasing $\delta$.
Obviously, if $\{a_j\}$ and $\{b_j\}$ are two Bruno sequences, then the product $\{a_j b_j\}$ is also a Bruno sequence.
Definition 5.2. We say that the vector $\omega\in\mathbb{R}^n$ satisfies the Bruno condition if $\{a_j\}$, $a_j=\max\{1,\Omega_{2^j+2}\}$ is a Bruno sequence.
Theorem 5.1. Suppose $\widehat H\in\mathcal{A}^\rho\cap\mathcal{F}_\diamond$, $\omega$ satisfies the Bruno condition, and $N_\diamond = O_r(\mathbf{z})$. Then, for any $\delta\geqslant 0$, the solution $\mathcal{H}(\mathbf{z},\delta) = \sum \mathcal{H}_\mathbf{k}(\delta) \mathbf{z}^\mathbf{k}$ of (3.11) belongs to the space $\mathcal{A}^{g(\delta)}$, where
for some constant $C$ depending on $\rho,\|\widehat H\|_\rho,\{\Omega_j\}$, and $n$. More precisely, for some $C_H = C_H(\rho,\|\widehat H\|_\rho,\{\Omega_j\},n)$ and $C_z = C_z(\rho,\|\widehat H\|_\rho,\{\Omega_j\},n)$,
Suppose that $\operatorname{rank}\mathbb{L}_\omega = n-1$. Then there exists a unique collection of integers $q_1,\dots,q_n,p$, and $\lambda>0$ such that
Hence in (3.12) we have $\omega_{\mathbf{l}',\mathbf{m}'} \geqslant \lambda / p$.
If the normal form of $H_2+\widehat H$ equals $H_2 + O_r(\mathbf{z})$, we may use Theorem 5.1 to obtain the following corollary.
Corollary 5.1. Suppose $\operatorname{rank}\mathbb{L}_\omega = n-1$, $\widehat H\in\mathcal{A}^\rho\cap\mathcal{F}_\diamond$, and the normal form of $H_2+\widehat H$ is of order $O_r(\mathbf{z})$.
Then there exists a canonical change of variables $\mathbf{w}\mapsto\mathbf{z}$ which transforms the Hamiltonian to $H_2(\mathbf{w})+G(\mathbf{w})$, $G=O_r(\mathbf{w})$, where
Estimates (5.5) mean that the non-resonant part $G^*$ may be reduced to an exponentially small quantity of order $e^{-\lambda\delta/p}$. The price is a little size ($\sim \delta^{-1/(r-2)}$) of the domain in which the Hamiltonian function $G$ is defined.
To obtain estimates (5.5) in the domain (5.4), we split the function $\mathcal{H}(\mathbf{z},\delta)$, obtained in Theorem 5.1 into the resonant, $\mathcal{H}^0$, and non-resonant, $\mathcal{H}^*$, parts
Estimates (5.5) follow from (5.3) and the inequality $\omega_{\mathbf{k}'}\geqslant\lambda/p$ for any $\mathbf{k}'\in\mathbb{Z}^n\setminus\mathbb{L}_\omega$.
Definition. The sequence $\{b_j\}_{j\in\mathbb{N}}$ is said to be sublinear if $\lim_{j\to+\infty} b_j / j \,{=}\, 0$. The sequence $\{b_j\}_{j\in\mathbb{N}}$ is said to be convex if $b_{j-1} - 2b_j + b_{j+1} \geqslant 0$ for all $j\geqslant 2$.
Lemma 6.1. Suppose $\widehat H\in\mathcal{A}\cap\mathcal{F}_\diamond$ is such that
(2) the sequence $\{b_j\}$ is non-increasing, sublinear and convex.6[x]6We will use sequences $\{b_j\}$ with $b_j < 0$.
Then an analytic canonical transformation $\mathbf{w}\mapsto\mathbf{z}=\vartheta(\mathbf{w})$ of coordinates reduces the Hamiltonian $H_2 + \widehat H$ to the form $H_2\circ\vartheta+\widehat H\circ\vartheta = H_2 + G^0 + G$, where
Proof. We construct the transformation $\vartheta$ by using continuous averaging with the following operator $\xi_s$. Suppose $H_2(\mathbf{z}_\delta) + H(\mathbf{z}_\delta,\delta)$,
Here, $\omega_{\mathbf{k}'}$ is defined by (3.5), and $\{\,,\}_\mathbf{k}$ denotes the Taylor coefficient with the number $\mathbf{k}$ in the expansion of the bracket. We solve the IVP (6.7)–(6.9) on the interval $\delta\in [0,+\infty)$. The required change of variables is obtained with $\delta=+\infty$.
Equations (6.7) can be easily solved: $H_\mathbf{k} = e^{-\omega_{\mathbf{k}'}\delta} \widehat H_\mathbf{k}$ for $\mathbf{k}\in\mathcal{Z}_s$. As a result, we get (6.1).
Now we associate with system (6.7), (6.8) a majorant system for the functions
For any $|\mathbf{k}|\geqslant 2s-2$, we put $\mathbf{H}_\mathbf{k} = c\, e^{b_{|\mathbf{k}|} + \alpha |\mathbf{k}|} \mathbf{h}_\mathbf{k}$, $\mathbf{h}_\mathbf{k}(\delta) \geqslant \mathbf{h}_\mathbf{k}(0) = 1$.
Lemma 6.2. Suppose the sequence $\{b_j\}_{j\in\mathbb{N}}$ is non-increasing and convex. Then, for any $\mathbf{k}\in\mathbb{Z}_+^{2n}$, $|\mathbf{k}|\geqslant 2s-2$,
Since the multiplier $c e^{-\delta / \Omega_{2s-2}} \Lambda$ does not depend on $\mathbf{k}$, we conclude7[x]7For example, by induction on $|\mathbf{k}|$. that $\chi_{|\mathbf{k}|} = \mathbf{h}_\mathbf{k}$ and we may take
In particular, $\mathbf{h}_\mathbf{k} = \mathbf{h}_\mathbf{k}(\delta)$ is an increasing function of $\delta\in [0,+\infty)$. Taking $\delta = +\infty$, we obtain
We estimate $\rho'$ such that, for any $\mathbf{z}\in D_{\rho'}$ and $\delta\in [0,+\infty)$, the point $\mathbf{z}(\delta) = \vartheta^{-1}_\delta(\mathbf{z})$ does not leave $D_\rho$. We have $\mathbf{z}(\delta)=(z_1(\delta),\dots,\overline z_n) \in D_\rho$, $\rho e^\alpha \leqslant 1$, and so
where $j$ is any number from the set $\{1,\dots,n\}$. Here, we have used that the sequence $b_s$ does not increase, $s\leqslant |\mathbf{k}|\leqslant 2s$, and the sum contains less than $(2s)^{2n-1}$ terms. Estimate (6.12) implies
The differences $\overline z_j(\delta) - \overline z_j(0)$ can be estimated in the same way. This implies (6.4). Lemma 6.1 is proved.
Proof of Lemma 6.2. For any $\mathbf{l},\mathbf{m},\mathbf{k}\in\mathbb{Z}_+^{2n}$, the equation $\mathbf{l}+\mathbf{m}=\mathbf{k}+\mathbf{e}_j$ implies $|\mathbf{l}|+|\mathbf{m}|=|\mathbf{k}|+2$. This implies the estimate
In this section, assuming that the normal form of $H_2 + \widehat H$ is $H_2+N_\diamond$, $N_\diamond=O_r(\mathbf{z})$, we apply Lemma 6.1 inductively to reduce the Hamiltonian to the form $H_2 + G$, $G=O_r$. Assuming $\widehat H\in\mathcal{A}$, we estimate $G$ in terms of $\widehat H$. As a by-product, we obtain another proof of the Bruno–Rüssmann theorem on convergence of the normalization in the case of zero normal form.
Motivated by conditions of Theorem 5.1 we will assume that
(1) $\omega\in\mathbb{R}^n$ satisfies the Bruno condition,
Remark 6.1. The normalization from the proof of Theorem 6.1 deals with a finite number of “small divisors” $\langle\omega,\mathbf{k}'\rangle$. Hence the assumption that $\omega$ satisfies the Bruno condition looks too strong.8[x]8For $|\mathbf{k}|>r$ we do not need any conditions on small divisors. We use this assumption to obtain uniform with respect to $r$ estimates (6.18), (6.19) for $\rho_*$ and $G_\mathbf{k}$.
Corollary 6.1 (the Bruno–Rüssmann theorem). Suppose that
(a) $\omega\in\mathbb{R}^n$ satisfies the Bruno condition,
(b) $\widehat H \in\mathcal{A}\cap\mathcal{F}_\diamond$,
(c) the normal form of the Hamiltonian $H_2 + \widehat H$ equals $H_2$.
Then an analytic canonical transformation of coordinates $\mathbf{w}\mapsto\mathbf{z}=\vartheta(\mathbf{w})$ reduces the Hamiltonian $H_2+\widehat H$ to the normal form $H_2\circ\vartheta + \widehat H\circ\vartheta = H_2$.
To prove Corollary 6.1, it suffices to make $r\to +\infty$ in Theorem 6.1.
We obtain the sequences $\alpha_0,\alpha_1,\dots,\alpha_N$, $\varepsilon_1,\dots,\varepsilon_N$, $\Lambda_1,\dots,\Lambda_N$, which estimate the Hamiltonian on the $m$th step,
Now we prove by induction that we $\varepsilon_m\leqslant 2^{-m-2}$ for $c_0 e^{2\alpha_0}\leqslant 1/16$ (see (6.17)). Indeed, for $m=0$, we have the estimate $\varepsilon_0 = 2c_0^2 e^{2\alpha_0} \leqslant 1/8$ by (6.22). Suppose this inequality is valid for $m\leqslant m_0$. Then
Putting $\beta=\alpha_N$, we obtain $\beta - \alpha_0 \leqslant \sum_{m=1}^\infty \varepsilon_m \leqslant 1/2$. This implies (6.19).
To estimate $\rho_*$, we define the sequence $\vartheta_1,\dots,\vartheta_N$, where $\vartheta_m \colon D_{\rho_{m-1}}\to D_{\rho_m}$ is the map from Lemma 6.1 on the $m$th step. We also define
Now we show that $\rho_m$, $m\geqslant 1$, determined by the equation $\rho_m e^{\alpha_m}=1$, satisfies (6.24). Indeed, $\rho_0 e^{\alpha_0}=1$ by (6.23). Assuming $\rho_m e^{\alpha_m}=1$ for some $m\geqslant0$, we have
Let $\{b_s\}$ be a sublinear convex negative sequence satisfying (6.14) with Bruno sequence (6.15). By Theorem 6.1 there exists an analytic canonical transformation of coordinates, which reduces the Hamiltonian $H_2 + \widehat H$ to the form $H_2 + G$, where the function $G$ satisfies (6.19). Since $b_{|\mathbf{k}|}$ are negative, we have
We now start another continuous averaging procedure with the operator $\xi=\xi_*$. We repeat the proof of Theorem 4.2 until equation (4.7). In our case $f'=O_{r-1}(\mathbf{z})$. By Lemma 7.3,
To estimate the function $G=G(\zeta)$, we have to use a quantitative version of the implicit function theorem (Lemma 6.3 below). First, we introduce the function $\widetilde G(\zeta) = \zeta + \tau G(\zeta)$. It satisfies the equation
Lemma 6.3 (see [3]). Let $x=y+\varphi(y)$ in the complex ball $\{y\in\mathbb{C} \colon |y|\leqslant 6\rho\}$, where the function $\varphi$ is analytic and $|\varphi|\leqslant\rho/2$. Then there is a unique analytic function $\psi$ defined for $|x|\leqslant\rho$ such that
for some constant $C$. The function $\int_0^\zeta G(\widehat\zeta)\, d\widehat\zeta$ is a majorant for $|\mathcal{H}|$. This implies $|\mathcal{H}| \leqslant |\rho G| \leqslant C_H / (1 +\delta^{1+2/(r-2)})$. Theorem 5.1 is proved.
Generically $\omega$ does not admit resonances of order less than $4$ (this includes the case of non-resonant $\omega$). Then $r = 4$ and diameter of the analyticity domain for $\mathcal{H}$ is of order $\delta^{-1/2}$. This estimate is better than $\delta^{-1}$ declared in [18].
§ 7. Technical part
7.1. Majorants
Lemma 7.1. Suppose $F = \sum_{\mathbf{k}\in\mathbb{Z}_+^{2n}} F_\mathbf{k} \mathbf{z}^\mathbf{k} \in\mathcal{A}^\rho$, and the sequence $\{b_j\}_{j\in\mathbb{Z}_+}$ is sublinear. Then, for any $\alpha > - \ln\rho$, there exists $c>0$ such that
For any $\alpha > - \ln\rho$, the function $q\mapsto e^{-b_q - (\alpha + \ln\rho) q}$, $q\in\mathbb{Z}_+$, is bounded because $b_q$ is sublinear. This proves the lemma.
For any $F,\mathbf{F}\in\mathcal{F}$ we say that $F\ll\mathbf{F}$ if and only if $|F_\mathbf{k}| \leqslant \mathbf{F}_\mathbf{k}$, $\mathbf{k}\in\mathbb{Z}_+^{2n}$, for their Taylor coefficients. In this case, we say that $\mathbf{F}$ is a majorant for $F$.
Lemma 7.2. Suppose $F\ll\mathbf{F}$ and $\widehat F\ll\widehat{\mathbf{F}}$. Then:
We use the majorant method to obtain estimates for solutions of initial value problems (IVP) in $\mathcal{F}$.
As an example, consider the IVP
$$
\begin{equation}
\partial_\delta F = \Phi(F,\delta), \qquad F|_{\delta=0} = \widehat F.
\end{equation}
\tag{7.4}
$$
Here, $F\in\mathcal{F}$ depends on the parameter $\delta$, and $\Phi$ is a map from $\mathcal{F}\times\mathbb{R}_+$ to $\mathcal{F}$.
Definition dfn:pr. The IVP (7.4) is said to be power regular if, for any $\widehat F\in\mathcal{F}$, equation (7.4) has a unique solution $F = F(\mathbf{z},\delta)\in\mathcal{F}$ for all $\delta>0$.
We associate with (7.4) the so-called majorant system
We put $\Phi_\mathbf{k} = p_\mathbf{k}\circ\Phi$ and $\Psi_\mathbf{k} = p_\mathbf{k}\circ\Psi$.
Definition 7.2. The IVP (7.5) is said to be a majorant IVP for (7.4) if:
(a) $\widehat F\ll \widehat{\mathbf{F}}$;
(b) for any $F\ll\mathbf{F}$, $\mathbf{k}\in\mathbb{Z}_+^{2n}$, and $\delta \geqslant 0$, we have $\Phi_\mathbf{k}(F,\delta) \ll \Psi_\mathbf{k}(\mathbf{F},\delta)$.
Majorant principle. Suppose the IVP (7.4) is power regular. Suppose also that there exists a solution $\mathbf{F} = \mathbf{F}(\,{\cdot}\,,\delta)\in\mathcal{A}$ of (7.5) on the interval $\delta\in [0,\delta_0]$. Then (7.4) has a unique analytic solution $F$ on $[0,\delta_0]$ and $F(\,{\cdot}\,,\delta) \ll \mathbf{F}(\,{\cdot}\,,\delta)$.
Remark 7.1. Definitions dfn:pr and 7.2 as well as the majorant principle obviously extend to systems of equations, where $F,\widehat F\in\mathcal{F}^m$ and $\Phi\colon \mathcal{F}^m\times\mathbb{R}_+\to\mathcal{F}$.
Remark 7.2. One may replace the first equation (7.5) by the inequality $\partial_\delta \mathbf{F} \gg \Psi(\mathbf{F},\delta)$.
The majorant principle presented here differs from the majorant argument used since Cauchy times. Traditionally, the evolution variable (in our case $\delta$) is regarded complex as well and Taylor expansions in it are used. In our approach, this variable is a real parameter in both exact solution and a majorant. Due to this we are able to obtain majorant estimates for solutions of (7.4) on large (even infinite) intervals of $\delta$.
Theorem 7.1. Suppose both systems (7.4) and (7.5) have nilpotent structure (see Definition 4.1). Then the majorant principle holds true.
We expect that majorant principle is valid in a much wider generality. But in this paper we are only interested in the case of systems having nilpotent form.
Proof of Theorem 7.1. Let $\mathbf{k}^0$ be an index with minimal possible degree $|\mathbf{k}^0|$. For example, in system (3.11), $|\mathbf{k}^0|=3$. Nilpotent form of (7.4) implies that
Hence $F_{\mathbf{k}^0}(\delta)\ll\mathbf{F}_{\mathbf{k}^0}(\delta)$ for $\delta\geqslant 0$.
We proceed by induction on $|\mathbf{k}|$. Suppose $F_{\mathbf{k}}(\delta)\ll\mathbf{F}_{\mathbf{k}}(\delta)$, $\delta\geqslant 0$, provided $|\mathbf{k}|< K$. For any $\mathbf{k}$ such that $|\mathbf{k}|=K$ we have by induction assumption and Definition 7.2, (b),
Proof. (1) First, consider the case $l-k=1$. If $k-m=1$ then (7.6) coincides with definition of convexity. We use induction on $k-m$. Suppose (7.6) holds for $k-m=q$. Then inequality (7.6) is obtained if we add the inequalities
which implies (6.14). To have sublinearity, we need $\lim_{J\to +\infty} 2^{-J} b_{2^J+2} = 0$. Hence, we take $b_3 = - \frac12 \sum_{j=1}^\infty 2^{-j} \ln a_j$. Such a choice of $b_3$ implies that $b_{2^J+2} < 0$, $J=0,1,\dots$ .
By (6.14), the left-hand side of (7.8) equals $2^{J-1}(\ln a_J - \ln a_{J-1})$. For any $J\geqslant 1$, this expression is non-negative because $\{a_j\}$ is non-decreasing.
If $b_s$, $s\ne 2^j+2$, are determined by linear interpolation, the convexity, negativity, and monotonicity remain true for all the sequence $\{b_s\}_{s\in\mathbb{N}}$. Lemma 7.6 is proved.
Bibliography
1.
G. D. Birkhoff, Dynamical systems, Amer. Math. Soc. Colloq. Publ., 9, Amer. Math. Soc., New York, 1927
2.
A. D. Brjuno, “Analytical form of differential equations”, Trans. Moscow Math. Soc., 25 (1973), 131–288; II, 26 (1974), 199–239
3.
J. Dieudonné, Foundations of modern analysis, Treatise of analysis. I, Pure Appl. Math., 10-1, 2nd rev. ed., Academic Press, New York–London, 1969
4.
L. H. Eliasson, “Normal forms for Hamiltonian systems with Poisson commuting integrals — elliptic case”, Comment. Math. Helv., 65:1 (1990), 4–35
5.
B. Fayad, “Lyapunov unstable elliptic equilibria”, J. Amer. Math. Soc., 36:1 (2023), 81–106
6.
Xianghong Gong, “Existence of divergent Birkhoff normal forms of Hamiltonian functions”, Illinois J. Math., 56:1 (2012), 85–94
7.
H. Ito, “Convergence of Birkhoff normal forms for integrable systems”, Comment. Math. Helv., 64:3 (1989), 412–461
8.
T. Kappeler, Y. Kodama, and A. Némethi, “On the Birkhoff normal form of a completely integrable Hamiltonian system near a fixed point with resonance”, Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4), 26:4 (1998), 623–661
9.
V. V. Kozlov, “Formal stability, stability for most initial conditions and diffusion in analytic systems of differential equations”, Regul. Chaotic Dyn., 28:3 (2023), 251–264
10.
R. Krikorian, “On the divergence of Birkhoff normal forms”, Publ. Math. Inst. Hautes Études Sci., 135 (2022), 1–181
11.
J. Moser, “On the volume elements on a manifold”, Trans. Amer. Math. Soc., 120 (1965), 286–294
12.
R. Pérez-Marco, “Convergence or generic divergence of the Birkhoff normal form”, Ann. of Math. (2), 157:2 (2003), 557–574
13.
H. Rüssmann, “Über die Normalform analytischer Hamiltonscher Differentialgleichungen in der Nähe einer Gleichgewichtslösung”, Math. Ann., 169 (1967), 55–72
14.
C. L. Siegel, “Über die Existenz einer Normalform analytischer Hamiltonscher Differentialgleichungen in der Nähe einer Gleichgewichtslösung”, Math. Ann., 128 (1954), 144–170
15.
C. L. Siegel and J. K. Moser, Lectures on celestial mechanics, Transl. from the the German, Grundlehren Math. Wiss., 187, Springer-Verlag, New York–Heidelberg, 1971