Abstract:
We consider the problem of simultaneous approximation of real numbers θ1,…,θn by
rationals and the dual problem of approximating zero by
the values of the linear form x0+θ1x1+⋯+θnxn at
integer points. In this setting we analyse two transference inequalities
obtained by Schmidt and Summerer. We present a rather simple geometric
observation
which proves their result. We also derive several
previously unknown corollaries. In particular,
we show that, together with German's
inequalities for uniform exponents, Schmidt and Summerer's inequalities imply
the inequalities by Bugeaud and Laurent and “one half” of the inequalities
by Marnat and Moshchevitin. Moreover,
we show that our main construction
provides a rather simple proof of Nesterenko's linear independence
criterion.
Keywords:
Diophantine approximation, Diophantine exponents, transference inequalities,
linear independence criterion.
This research was supported by the Russian Science Foundation
(grant no. 22-41-05001),
https://rscf.ru/en/project/22-41-05001/.
The first author is a winner of the “Junior Leader” contest conducted by the Foundation for the Advancement of Theoretical Physics and Mathematics Advancement Foundation “BASIS” and would like to thank its sponsors and jury.
Dedicated to Yu. V. Nesterenkoon the occasion of his 75th birthday
§ 1. Introduction
This paper originated as a result of studying the prominent paper [1] by Nesterenko, where he proves his linear independence criterion.
Given an integer n⩾2, we fix an arbitrary n-tuple θ=(θ1,…,θn)∈Rn. It is well known that the problem of approximating θ1,…,θn simultaneously by rational numbers with equal denominators is related to the problem of approximating zero by the values of the linear form x0+θ1x1+⋯+θnxn at non-zero integer points. This relation is performed by the so-called transference principle discovered by Khintchine [2]. He formulated it in terms of Diophantine exponents.
Definition 1. The (regular) Diophantine exponentλ=λ(θ) is defined as the supremum of real numbers γ such that the system of inequalities
max
admits solutions in \mathbf x=(x_0,\dots,x_n)\in\mathbb{Z}^{n+1} for some arbitrarily large values of t.
The respective uniform Diophantine exponent\widehat\lambda=\widehat \lambda(\boldsymbol\theta) is defined as the supremum of real numbers \gamma such that (1.1) admits solutions in \mathbf x=(x_0,\dots,x_n)\in \mathbb{Z}^{n+1} for every t large enough.
Definition 2. The (regular) Diophantine exponent\omega=\omega(\boldsymbol \theta) is defined as the supremum of real numbers \gamma such that the system of inequalities
admits solutions in \mathbf x=(x_0,\dots,x_n)\in\mathbb{Z}^{n+1} for some arbitrarily large values of t.
The respective uniform Diophantine exponent\widehat\omega=\widehat \omega(\boldsymbol\theta) is defined as the supremum of real numbers \gamma such that (1.2) admits solutions in \mathbf x=(x_0,\dots,x_n)\in \mathbb{Z}^{n+1} for every t large enough.
It follows immediately from Dirichlet’s approximation theorem (or from Minkowski’s convex body theorem) that the Diophantine exponents satisfy the trivial relations \lambda\geqslant \widehat\lambda\geqslant 1/n and \omega\geqslant \widehat\omega\geqslant n. It is also known (see [3]) that, unless all the \theta_i are rational, we have a slightly less trivial inequality \widehat\lambda\leqslant 1.
Let us give a brief account on the existing non-trivial relations.
The aforementioned Khintchine’s transference principle was published in 1926. It can be formulated as follows:
provided that 1,\theta_1,\dots,\theta_n are linearly independent over \mathbb{Q}. It can be easily verified that (1.4) implies (1.3), as \widehat\omega^{-1}\leqslant 1/n\leqslant \widehat\lambda\leqslant 1.
As for the uniform exponents, Jarník [3] proved in 1938 that the following remarkable identity holds if n= 2 and 1,\theta_1,\theta_2 are linearly independent over \mathbb{Q}:
provided that 1,\theta_1,\dots,\theta_n are linearly independent over \mathbb{Q}. An alternative proof of their result can be found in [9]. Clearly, (1.6) and (1.7) imply Bugeaud and Laurent’s inequalities (1.4).
Furthermore, in 1950–1954 Jarník [10], [11] proved for n= 2 that
It is interesting to note that the right-hand sides in (1.8) coincide for n= 2, by (1.5). They are both equal to \widehat\omega\widehat\lambda as well as to (1-\widehat\omega^{-1})/(1-\widehat\lambda). We shall mention this fact in Section 3.1.
The inequalities (1.8) were recently generalized to the case of arbitrary n\geqslant 2 by Marnat and Moshchevitin [12]. They showed that
respectively. An alternative proof of the second inequality in (1.9) can be found in [13].
Summing up, we can say that all non-trivial relations currently known follow from (1.6), (1.7), and (1.9). The main purpose of this paper is to present a geometric observation, a rather simple one, which proves (1.7) almost immediately. It appears that this observation also provides a quite simple proof of Nesterenko’s linear independence criterion.
The rest of the paper is organized as follows. In Section 2 we formulate and prove our main result and apply it to prove (1.7). In Section 3 we derive some corollaries of (1.7) concerning lower bounds for the ratios \omega/\widehat\omega, \lambda/\widehat\lambda and show that (1.7) yields the weakest of the inequalities (1.9). Moreover, we analyse how (1.6) and (1.7) split Bugeaud and Laurent’s inequalities (1.4), and compare some of our corollaries with a recent result by Schleischitz [14]. Finally, Section 4 is devoted to Nesterenko’s linear independence criterion. We present a rather simple proof of his theorem, which is based on our main geometric observation described in Section 2. We also show that in order to prove the linear independence criterion itself, it suffices to use the first step of Nesterenko’s induction.
§ 2. Empty cylinder lemma
Let us introduce some notation. We write \ell for the one-dimensional subspace generated by (1,\theta_1,\dots,\theta_n) and let \ell^\perp be its orthogonal complement. Given any \mathbf x\in\mathbb{R}^{n+1}, we denote the Euclidean distance from \mathbf x to \ell (resp. from \mathbf x to \ell^\perp) by r(\mathbf x) (resp. h(\mathbf x)). We also write \langle\,\cdot\,,\cdot\,\rangle for the inner product in \mathbb{R}^{n+1}.
Our main geometric observation is described by the following statement.
Lemma 1. Let t, \alpha, \beta be positive real numbers such that t^\beta>2t^\alpha. Suppose that \mathbf v\in\mathbb{Z}^{n+1} satisfies
Proof. Consider an arbitrary \mathbf x\in\mathcal{C}. Let us show that 0< \langle \mathbf v,\mathbf x\rangle< 1. To do this, we write \mathbf y for the orthogonal projection of \mathbf x to the two-dimensional subspace \pi spanned by \ell and \mathbf v. Then r(\mathbf y) and h(\mathbf y) can be interpreted as the absolute values of the coordinates of \mathbf y in \pi with respect to the coordinate axes \ell and \ell^\bot\cap \pi (see Fig. 1). Hence
Now we can prove Schmidt and Summerer’s inequalities (1.7).
Proof of (1.7). We may assume that \lambda>1/n and \omega> n. Indeed, by Khintchine’s inequalities (1.3) the equalities \lambda=1/n and \omega=n are equivalent, and if they hold, trivial inequalities yield that \widehat\lambda=1/n, \widehat\omega=n. With these values of the exponents both inequalities (1.7) are obviously true. So, let us suppose that \lambda>1/n and \omega> n.
By the definition of \lambda and \omega we can choose a point \mathbf v \in\mathbb{Z}^{n+1} and a positive real number \gamma to satisfy any of the following two collections of conditions:
1) h(\mathbf v) is arbitrarily large, r(\mathbf v)=h(\mathbf v)^{-\gamma}, \gamma>1/n, \gamma is arbitrarily close to \lambda;
2) h(\mathbf v) is arbitrarily small, r(\mathbf v)=h(\mathbf v)^{-\gamma}, \gamma<1/n, \gamma is arbitrarily close to 1/\omega.
We shall refer to the first choice as Case 1, and to the second one as Case 2.
In either case the main part of the argument is basically the same. So, let us choose \mathbf v and \gamma according to one of the two cases. Set t=t(\mathbf v,\gamma) to be the smallest positive real number such that the cylinder
Then r(\mathbf v)=h(\mathbf v)^{-\gamma}=t^{-\alpha\gamma}=t^{\alpha-1-\beta} and t\cdot h(\mathbf v)^{-1-\gamma}=t^{-\beta}. Hence \mathbf v satisfies (2.1) and
In order to fulfill the hypothesis of Lemma 1, it remains to prove that t^\beta> 2t^\alpha. We recall that in Case 1 h(\mathbf v) is arbitrarily large and \gamma>1/n, and in Case 2 h(\mathbf v) is arbitrarily small and \gamma<1/n. In both cases by Minkowski’s convex body theorem the volume of \mathcal{C}_{\mathbf v} is bounded (by 2^{n+1}), and so the product t^{n+1}h(\mathbf v)^{-1-\gamma}=h(\mathbf v)^{(n+1)/\alpha-(1+\gamma)} is also bounded. Hence in Case 1 we have \alpha\gamma\geqslant (n+1)\*\gamma/(1+ \gamma)> 1 and, therefore, \beta=\alpha+\alpha\gamma-1>\alpha, whereas in Case 2 we have \alpha\gamma\leqslant (n+1)\gamma/(1+\gamma)< 1 and, therefore, \beta=\alpha+\alpha\gamma-1<\alpha. Thus, in both cases we have
Having the hypothesis of Lemma 1 fulfilled, we conclude that there are no integer points in the cylinder \mathcal{C}=\mathcal{C}(t,\alpha,\beta) defined by (2.2). At the same time there are some non-zero integer points in \mathcal{C}_{\mathbf v}, but all of them lie on the boundary of \mathcal{C}_{\mathbf v}. Moreover, every such point satisfies the condition r(\mathbf x)=t since
giving thus new lower bounds in the problem of estimating the ratio between the regular and uniform exponents.
Note that, as in Jarník’s inequalities (1.8) for n= 2, the right-hand sides in (3.1) coincide. However, the lower estimate for \omega/\widehat\omega and \lambda/\widehat\lambda provided by (3.1) is a function of \omega and \lambda, whereas it has been more traditional to think in this context of functions of \widehat\omega and \widehat \lambda. The right-hand side (1+\lambda)/(1+\omega^{-1}) can be weakened to (1-\widehat\omega^{-1})/(1-\widehat\lambda), thus generalizing Jarník’s inequalities (1.8) to arbitrary dimension.
Proposition 1. Suppose that 1,\theta_1,\dots,\theta_n are linearly independent over \mathbb{Q}. Then
Proof. Set B=(1+\lambda)/(1+\omega^{-1}). Then (3.1) reads as \omega\geqslant \widehat\omega B and \lambda\geqslant \widehat\lambda B. Inserting these inequalities into the definition of B, we get
The inequalities (3.3) coincide with Jarník’s inequalities (1.8) for n= 2. Another generalization of (1.8) to arbitrary dimension is provided by (1.9). An important difference between (1.9) and (3.3) is that the lower bounds G_{\mathrm{lin}}(\widehat \omega) and G_{\mathrm{sim}}(\widehat\lambda) in (1.9) are functions of only one exponent, whereas (3.3) requires both.
If G_{\mathrm{lin}}(\widehat\omega)=G_{\mathrm{sim}}(\widehat\lambda), then (1.9) is equivalent to (3.3), but if G_{\mathrm{lin}}(\widehat \omega)\neq G_{\mathrm{sim}}(\widehat\lambda), then (3.3) is stronger than the weakest of (1.9) and weaker than the strongest.
Proposition 2. Suppose that 1,\theta_1,\dots,\theta_n are linearly independent over \mathbb{Q}. Then either
Since \widehat\omega\geqslant n and \widehat\lambda\geqslant 1/n, both (\widehat \omega\widehat\lambda)^{1/(n-1)}\geqslant 1 and (1-\widehat\omega^{-1})/(1-\widehat \lambda)\geqslant 1. By simple substitution and regrouping it is easily verified that
However, this identity holds if and only if G_{\mathrm{lin}}(\widehat \omega)=G_{\mathrm{sim}}(\widehat\lambda), which happens quite rarely. Nevertheless, if a strict inequality holds, then the sign of this inequality determines uniquely which of the quantities G_{\mathrm{lin}}(\widehat \omega), G_{\mathrm{sim}}(\widehat\lambda) is larger.
3.2. The inequalities by Bugeaud and Laurent
The inequalities (1.4) obviously follow from (1.7) and (1.6):
Bugeaud and Laurent proved (1.4) with the help of the so-called intermediate Diophantine exponents by splitting Khintchine’s inequalities (1.3) into “going up” and “going down” chains of inequalities between consecutive intermediate exponents and improving on the first inequality in each chain with the respective uniform exponent. An analogous splitting of German’s inequalities (1.6) can be found in [6]. It would be interesting to establish a suitable splitting of Schmidt and Summerer’s inequalities (1.7). Such a splitting, combined with the existing splitting of (1.6), would provide a splitting of (1.4), alternative to the one of Bugeaud and Laurent.
As for (3.7) itself, the way it splits (1.4) with the help of the respective fourth exponent gives much additional information in the case when one of the relations (1.4) is an equality. In this case the respective pair of the inequalities (3.7) becomes a pair of equalities and we get a whole range of equalities for the intermediate exponents mentioned above. It is worth mentioning in this context that there is an explicit description (obtained recently by Schleischitz [15]) of triples of exponents for which either the first pair of the inequalities (3.7), or the second one, is a pair of equalities.
3.3. The inequalities by Schleischitz
Analysis of the quantity (\widehat\omega\widehat\lambda)^{1/(n{-}1)} appearing in Proposition 2 provides the following observation.
Proposition 3. Suppose that 1,\theta_1,\dots,\theta_n are linearly independent over \mathbb{Q}. Then
where f(x) is as in (1.10). Since f\bigl(\omega/\widehat\omega \bigr)\geqslant 0 by (1.9), we get the second inequality in (3.8). The proposition is proved.
In [14], Schleischitz proves two inequalities that very much resemble (3.8). They are of the form
with some non-negative \psi(\lambda,\widehat\lambda) and \chi(\lambda,\widehat \lambda) turning into zero only if \lambda/\widehat\lambda=G_{\mathrm{sim}}(\widehat \lambda). One might wonder whether \psi(\lambda,\widehat\lambda) or \chi(\lambda,\widehat\lambda) can be removed from (3.9), so that it would resemble (3.8) closer. However, the answer is negative since it follows from the results of Kleinbock, Moshchevitin, and Weiss [16] that there exist many \boldsymbol\theta with small \lambda and infinite \widehat\omega. Their argument can be modified to show that, preserving \lambda small, we can make \widehat\omega finite, but however large. Thus, it is impossible to substitute the extra summands with zero in the general case.
§ 4. Nesterenko’s linear independence criterion
In 1985 Nesterenko published his famous linear independence criterion. His original proof was rather involved. A simpler argument can be found in [17] and [18]; see also [19]. Our Lemma 1 provides an even simpler proof.
As before, let us fix \boldsymbol\theta=(\theta_1,\dots,\theta_n)\in \mathbb{R}^n. Let \ell, \ell^\perp, r(\,\cdot\,), h(\,\cdot\,) be defined as at the beginning of Section 2.
Given any subspace \mathcal{L} of \mathbb{R}^{n+1}, we write \varphi(\mathcal{L}) for the tangent of the angle between \ell and \mathcal{L}. If \mathcal{L} is defined over \mathbb{Q}, we write H(\mathcal{L}) for its height, that is, the covolume of the lattice \mathcal{L}\cap \mathbb{Z}^{n+1}.
Theorem 1 (Nesterenko, 1985). Let \alpha, \beta, c_1, c_2, \varepsilon be positive real numbers, \beta\geqslant \alpha. Let (t_k)_{k\in\mathbb{N}} be an increasing sequence of positive real numbers such that
Then, for every d-dimensional subspace \mathcal{L} of \mathbb{R}^{n+1} defined over \mathbb{Q}, there is a positive number c_3=c_3(d,\varepsilon) such that
Let us derive Theorem 1 from Lemma 1. We split our argument into three steps. Suppose that the hypothesis of Theorem 1 holds.
Step 1: Adjusting the hypothesis. Let \varepsilon' be a positive real number sufficiently small with respect to \varepsilon. Set \alpha'=(1-\varepsilon')\alpha, \beta'=(1+\varepsilon')\beta. Let us show that for every real t large enough there exists \mathbf x\in \mathbb{Z}^{n+1} such that
For every large t choose k so that t_k<t\leqslant t_{k+1}. It follows from (4.1) that t_{k+1}<t_k^{1+\varepsilon'/2} if k is large enough. Hence for \mathbf x\in\mathbb{Z}^{n+1} satisfying (4.2) we have r(\mathbf x)\leqslant t_k< t and
Step 2: Proof for d= 1. Let \mathcal{L} be the one-dimensional subspace generated by a primitive vector \mathbf v\in\mathbb{Z}^{n+1} (primitive means that the coordinates of \mathbf v are coprime). Then \varphi(\mathcal{L})=r(\mathbf v)/h(\mathbf v) and H(\mathcal{L})=\sqrt{r(\mathbf v)^2+h(\mathbf v)^2}. Let us suppose that h(\mathbf v) is large and that
Set t=h(\mathbf v)^{1/\alpha'}. Then h(\mathbf v)=t^{\alpha'} and r(\mathbf v)=t^{\alpha'-1-\beta''} with some \beta''>\beta'. By Lemma 1 there are no integer points \mathbf x such that
Given a d-dimensional subspace \mathcal{L} defined over \mathbb{Q}, let \ell' be the one-dimensional subspace generated by the point in \mathcal{L} closest to the point (1,\theta_1,\dots,\theta_n). Then the angle between \ell and \mathcal{L} equals the angle between \ell and \ell'. Furthermore, given any \mathbf x\in\mathcal{L}, we write r'(\mathbf x) and h'(\mathbf x) for the Euclidean distances from \mathbf x to \ell' and \mathcal{L}\cap\ell^\perp respectively.
It follows from (4.6) that t_0/H(\mathcal{L})\to \infty and r'(\mathbf v)\to 0 as H(\mathcal{L})\to \infty. We may also assume that \varphi(\mathcal{L})\to 0 as H(\mathcal{L})\to \infty, neglecting most of subspaces. Then h'(\mathbf v)\to \infty as H(\mathcal{L})\to \infty and
Having thus chosen \mathbf v and assuming H(\mathcal{L}) to be large enough, we apply (4.5), (4.6), (4.8)–(4.10) and estimate \varphi(\mathcal{L}) as follows:
However, this estimate easily follows from (4.3) with d= 1, that is, we may actually avoid Step 3 if (4.11) is our aim. Indeed, if \ell is contained in a d-dimensional subspace \mathcal{L} defined over \mathbb{Q}, then Dirichlet’s approximation theorem guarantees that there are infinitely many integer points \mathbf x in \mathcal{L} such that
with some positive c(d,H(\mathcal{L})). But if d<(1+\beta')/(1+\beta'-\alpha'), we have \delta'<d/(d-1), where \alpha', \beta', \delta' are as in Section 4.1, and Step 2 tells us that the opposite holds for every integer point \mathbf x with h(\mathbf x) large enough:
Thus, there is no such \mathcal{L}, and we get (4.11).
4.3. A slight refinement in the case of linear independence
Theorem 1 is mainly applied to prove linear independence of the numbers 1,\theta_1,\dots,\theta_n over \mathbb{Q}. But if it is already known that they are linearly independent, the estimate (4.3) in Theorem 1 can be improved for d=n. In this case, the restriction on \alpha and \beta in Theorem 1 implies that \alpha>n-1. For \alpha> n the following statement provides a stronger inequality than (4.3). It also provides a lower bound for \omega=\omega(\boldsymbol\theta).
Proposition 4. Under the hypothesis of Theorem 1, suppose additionally that 1,\theta_1,\dots,\theta_n are linearly independent over \mathbb{Q} and that
Here we have used the fact that (4.12) implies positiveness of the denominators in both fractions above. This proves (4.13).
Furthermore, let \mathbf w be the integer normal of \mathcal{L}, that is, the unique (up to sign) non-zero primitive integer vector orthogonal to \mathcal{L}. Bounds for \varphi(\mathcal{L}) and \omega are obviously related, as H(\mathcal{L})=\sqrt{h(\mathbf w)^2+r(\mathbf w)^2} and the angle between \mathcal{L} and \ell is equal to the angle between \mathbf w and \ell^\perp. Hence
with some positive c depending only on n and \varepsilon. Thus, (4.14) is also proved. The proposition is proved.
Acknowledgements
The authors are grateful to Johannes Schleischitz for a series of useful comments, especially, for drawing the authors’ attention to Schmidt and Summerer’s paper [8] once again.
The first author is a winner of the “Junior Leader” contest conducted by the Foundation for the Advancement of Theoretical Physics and Mathematics “BASIS” and would like to thank its sponsors and jury.
Bibliography
1.
Yu. V. Nesterenko, “On the linear independence of numbers”, Vestnik Moskov. Univ. Ser. 1. Mat. Mekh., 1985, no. 1, 46–49; English transl. Moscow Univ. Math. Bull., 40:1 (1985), 69–74
2.
A. Khintchine, “Über eine Klasse linearer Diophantischer Approximationen”, Rend. Circ. Mat. Palermo, 50 (1926), 170–195
Y. Bugeaud and M. Laurent, “Exponents of Diophantine approximation”, Diophantine geometry, CRM Series, 4, Ed. Norm., Pisa, 2007, 101–121
5.
Y. Bugeaud and M. Laurent, “On transfer inequalities in Diophantine approximation. II”, Math. Z., 265:2 (2010), 249–262
6.
O. N. German, “Intermediate Diophantine exponents and parametric geometry of numbers”, Acta Arith., 154:1 (2012), 79–101
7.
O. N. German, “On Diophantine exponents and Khintchine's transference principle”, Mosc. J. Comb. Number Theory, 2:2 (2012), 22–51
8.
W. M. Schmidt and L. Summerer, “Diophantine approximation and parametric geometry of numbers”, Monatsh. Math., 169:1 (2013), 51–104
9.
O. N. German and N. G. Moshchevitin, “A simple proof of Schmidt–Summerer's inequality”, Monatsh. Math., 170:3-4 (2013), 361–370
10.
V. Jarník, “Une remarque sur les approximations diophantiennes linéaires”, Acta Sci. Math. (Szeged), 12 B (1950), 82–86
11.
V. Jarník, “Contribution à la théorie des approximations diophantiennes
linéaires et homogènes”, Czechoslovak Math. J., 4:79 (1954), 330–353
12.
A. Marnat and N. G. Moshchevitin, “An optimal bound for the ratio between ordinary and uniform exponents
of Diophantine approximation”, Mathematika, 66:3 (2020), 818–854
13.
Ngoc Ai Van Nguyen, A. Poëls, and D. Roy, “A transference principle for simultaneous rational approximation”, J. Théor. Nombres Bordeaux, 32:2 (2020), 387–402
14.
J. Schleischitz, “On geometry of numbers and uniform rational approximation to the Veronese curve”, Period. Math. Hung., 83:2 (2021), 233–249; arXiv: 1911.03215
15.
J. Schleischitz, “Optimality of two inequalities for exponents of Diophantine approximation”, J. Number Theory, 244 (2023), 169–203; arXiv: 2102.03154
16.
D. Kleinbock, N. Moshchevitin, and B. Weiss, “Singular vectors on manifolds and fractals”, Israel J. Math., 245:2 (2021), 589–613; arXiv: 1912.13070
17.
S. Fischler and W. Zudilin, “A refinement of Nesterenko's linear independence criterion with
applications to zeta values”, Math. Ann., 347:4 (2010), 739–763
18.
A. Chantanasiri, “On the criteria for linear independence of Nesterenko, Fischler and Zudilin”, Chamchuri J. Math., 2:1 (2010), 31–46
19.
S. Fischler and T. Rivoal, “Irrationality exponent and rational approximations with prescribed growth”, Proc. Amer. Math. Soc., 138:3 (2010), 799–808
Citation:
O. N. German, N. G. Moshchevitin, “On the transference principle and Nesterenko's linear independence criterion”, Izv. Math., 87:2 (2023), 252–264