Abstract:
The known non-linear kinetic equations (in particular, the wave kinetic equation and the quantum Nordheim–Uehling–Uhlenbeck equations) are considered as a natural generalization of the classical spatially homogeneous Boltzmann equation. To this end we introduce the general Boltzmann-type kinetic equation that depends on a function of four real variables $F(x,y;v,w)$. The function $F$ is assumed to satisfy certain commutation relations. The general properties of this equation are studied. It is shown that the kinetic equations mentioned above correspond to different forms of the function (polynomial) $F$. Then the problem of discretization of the general Boltzmann-type kinetic equation is considered on the basis of ideas which are similar to those used for the construction of discrete models of the Boltzmann equation. The main attention is paid to discrete models of the wave kinetic equation. It is shown that such models have a monotone functional similar to the Boltzmann $H$-function. The existence and uniqueness theorem for global in time solution of the Cauchy problem for these models is proved. Moreover, it is proved that the solution converges to the equilibrium solution when time goes to infinity. The properties of the equilibrium solution and the connection with solutions of the wave kinetic equation are discussed. The problem of the approximation of the Boltzmann-type equation by its discrete models is also discussed. The paper contains a concise introduction to the Boltzmann equation and its main properties. In principle, it allows one to read the paper without any preliminary knowledge in kinetic theory.
Bibliography: 61 titles.
Keywords:Boltzmann-type equations, wave kinetic equation, Lyapunov functions, $H$-theorem, distribution functions, discrete kinetic models, non-linear integral operators, dynamical systems.
The classical Boltzmann equation, which is the main mathematical tool for the description of rarefied gases, occupies a very specific place in mathematical physics. Speaking about equations of mathematical physics, we normally have in mind linear or non-linear equations in partial derivatives. Non-linear kinetic equations of Boltzmann type belong to a slightly different kind of equations, though they contain partial derivatives together with multiple integrals. These equations look too cumbersome at first glance. Perhaps, partly for this reason they are not included in the standard university courses for mathematicians. Nevertheless, the interest of mathematicians all over the world to this part of mathematical physics was growing fast in recent decades. It is very typical for the history of physics and mathematics that some parts of physics, which looked not very clear (even for physicists) after first fundamental discoveries, become rather a part of mathematics with time. This can be a slow process. Of course, this does not mean that their importance for physics has disappeared. The famous examples are Newton’s mechanics and Maxwell’s electrodynamics. Something like that happened and is still happening with the kinetic theory of gases founded by Maxwell [47], Boltzmann [16], and their predecessors approximately 150 years ago.
The Boltzmann equation was first published in 1872 [16]. This equation has a very interesting history; we mention only one important point. Formally, this equation was supposed to describe more accurately — than the equations of gas dynamics known at that time — the behaviour of a rarefied gas of particles interacting in accordance with the laws of classical mechanics. The first conclusion made by Boltzmann in his paper cited above was his $H$-theorem, which formally proves the existence of a functional that is monotone decreasing it time on any solution of his equation. In fact, this was the discovery of the mechanical meaning of entropy, a result which is probably more important than the Boltzmann equation itself. The immediate consequence was that the solution of the Boltzmann equation cannot be invertible in time in contrast to the equations of Newtonian mechanics. This caused certain doubts in this equation, especially among mathematicians. The famous remark by Zermelo [61] based on Poincaré’s recurrence theorem was made at the end of the 19th century. This question and some others were clarified since then, and the validity of the Boltzmann equation has been justified rigorously, at least for short time intervals. We describe briefly below the progress in mathematical results for the Boltzmann equation. Only a few such results were obtained before the 1960s. These were, in particular, works by Hilbert [36], Carleman [19], Morgenstern [48], and Grad [34]. Of course, the great contribution of Bogolyubov [15], who proposed in 1946 his famous method of derivation of the kinetic equations from dynamics, should be mentioned. Some of the works cited above were written at the formal level of mathematical rigour, but this is inevitable at the early stage of the development of any mathematical theory related to physics. The Chapman–Enskog method [22], [27], invented by physicists for the transition to hydrodynamics from the Boltzmann equation, became a standard tool for the study of dynamical systems with small parameter.
The number of rigorous mathematical results in kinetic theory began to grow faster in the 1960s–1970s. In particular, we mention:
(i) the mathematically rigorous theory of the Cauchy problem for the linearized Boltzmann equation constructed in works by Arsen’ev [8] (for short-range intermolecular potentials) and Ellis and Pinsky [26] (for power-like potentials with Grad’s angular cut-off);
(ii) the complete theory of the existence and uniqueness of solutions to the Cauchy problem for the spatially homogeneous Boltzmann equation for a wide class of potentials developed by Arkeryd [2];
(iii) the first global in time existence and uniqueness theorems for the Cauchy problem for the spatially inhomogeneous Boltzmann equation by Maslova and Firsov [46] and Ukai [55] under the assumption that the initial conditions are sufficiently close to an equilibrium;
(iv) the proof by Lanford [42] of the first validation theorem for the Boltzmann equation in the Boltzmann–Grad limit for hard spheres on a time interval of an order of the particle mean free path.
For the sake of brevity we do not discuss the related results obtained for other two famous classical non-linear kinetic equations introduced by Landau [40] and Vlasov [59], respectively. The kinetic theory and, in particular, the theory of the Boltzmann equation is gradually becoming a more-or-less regular part of mathematical physics. Mathematical conferences on this subject became more and more frequent in Europe, USA, and Japan during the last two decades of the 20th century. There was a sort of competition of pure mathematicians from different countries to prove a global existence theorem for the Boltzmann equation with initial data far from equilibrium. Finally, the result was obtained by DiPerna and Lions in 1989 [23]. The Fields Medal obtained by P.-L. Lions apparently attracted a great number of good young mathematicians to kinetic theory, especially in France. One of them, Villani, obtained another Fields Medal in 2010 for his works in this field of mathematical physics. He also wrote an excellent survey [58] of the mathematical results on the Boltzmann equation. This survey, together with its renewed on-line version and the book [21], contain most of the important mathematical results on the Boltzmann equation obtained before 2005. A review of some more recent results can be found, for example, in [10]. The development of kinetic theory in the last decades was connected with applications of its ideas to various unusual objects, which are far from traditional rarefied gases. It is also related to the rapid development of computers and numerical methods. Kinetic equations are now used for modelling traffic flows, the distribution of ‘active particles’ (viruses and so on) in biology, and socio-economic processes. Of course, many of these equations are perhaps ‘too young’ to become a subject of a rigorous mathematical study. On the other hand there are some classes of relatively ‘old’ kinetic equations, which have actively been used by physicists since the 1960s (see, for example, [30] and [60]). These are the so-called ‘wave kinetic equations’ used in the theory of weak turbulence. There are some mathematical results on these equations (see, for example, [28], [25], and the references there), but still many questions, like the long-time asymptotic behaviour of solutions, remain unclear. One goal of the present paper is to study these and similar equations (for example, the quantum Nordheim–Uehling–Uhlenbeck equation [49], [54]) from a unified point of view as particular cases of the general class of Boltzmann-type equations [12]. Our approach to this class of equations is, to a certain extent, close to the approach of Arkeryd in [5]. To this class of equations we apply some methods used for the classical Boltzmann equation. In particular, the properties of discrete models of Boltzmann-type equations are discussed. The main attention is paid to the wave kinetic equation (WKE): it is proved that solutions of any normal discrete model of the WKE tend to the equilibrium distribution as time tends to infinity. The consequences of this fact for solutions of the WKE are discussed in detail. It should be noted that we do not discuss in this paper important questions relating to the validation of various Boltzmann-type equations, that is, their derivation from some more general mathematical models. This is not an easy question. For example, it took more than a hundred years to find a mathematical proof (Lanford’s theorem [42]) of the connection between the classical Boltzmann equation and the system of $N$ hard spheres as $N$ tends to infinity. A recent mathematical result in [25] on the derivation of the WKE from the non-linear Schrödinger equation (in the presence of a random force field) looks very promising in this sense.
The paper is organized as follows. Section 2 presents a concise introduction to key ideas and technique that lead from Hamiltonian mechanics to the classical Boltzmann equation. In principle, it allows one to read the paper without preliminary knowledge in kinetic theory. On the other hand important notation and some technical tools are introduced there. We begin with the $N$-particle system described by the Newtonian equations in the Hamiltonian form. Then we introduce the notion of an $N$-particle distribution function and Liouville’s equation. Following Grad’s version of the BBGKY-hierarchy, we finally present a formal derivation of the Boltzmann equation for hard spheres. A formal generalization of this equation to the case of any reasonable intermolecular potential is also performed. Then we study all general properties of the Boltzmann equation, including conservation laws and the $H$-theorem.
The general Boltzmann-type kinetic equation is introduced at the beginning of § 3 as a natural generalization of the classical Boltzmann equation from § 2. This new equation depends on an arbitrary function $F(x_1,x_2;x_3,x_4)$ of four variables. For the Boltzmann equation we have $F=x_3 x_4-x_1 x_2$. Other kinetic equations mentioned above correspond to different polynomial forms of $F$. We show that all these equations can be studied from a unified point of view. Different forms of the general kinetic equation are constructed for the 3D-case (Proposition 3.1). The generalization to the $d$-dimensional case for $d \geqslant 2$ is also considered. The weak form of the general kinetic equation (the equation for average values) and conservation laws are also discussed. We define a class of functions $F$ that can lead to an analogue of Boltzmann’s $H$-theorem (Proposition 3.2). Then we introduce discrete kinetic models of the general Boltzmann-type kinetic equation by using the formal analogy with discrete velocity models of the Boltzmann equation. We define similarly the notion of a normal discrete model. Then we prove Theorem 3.1 on the main properties of normal discrete models which possess an analogue of the $H$-theorem.
Section 4 is devoted to some properties of solutions to normal discrete kinetic models of WKE. The main result of this section is the proof of the convergence of any positive solution of this model to a unique equilibrium solution. This result is formulated at the beginning of § 4 (Theorem 4.1). The proof is given in the rest of that section. First we construct the solution for any positive initial data and prove its global in time existence and uniqueness (Lemma 4.1). Then we construct a positive stationary solution of the model and prove its uniqueness under fixed invariants (mass and energy) in Lemma 4.2. Then we improve some estimates for strictly positive initial data and complete the proof of convergence to an equilibrium by more or less standard methods of the theory of ODEs.
At the end of § 4 we discuss a general problem of the approximation of the Boltzmann- type kinetic equation by a sequence of discrete models as the order of the model, that is, the number of its discrete points, tends to infinity.
The results of the paper and related open problems are briefly discussed in § 5.
2. From particle dynamics to the Boltzmann equation
2.1. $N$-particle dynamics and modelling of rarefied gases
We consider $N \geqslant 1$ identical particles with mass $m=1$. This system is characterized by a $6N$-dimensional phase vector $Z_N=\{z_1,\dots,z_N\}$ with components $z_i=(x_i,v_i)$, where the $x_i \in \mathbb{R}^3$ and $v_i \in \mathbb{R}^3$ denote, respectively, the position and velocity of the $i\operatorname{th}$ particle, $i=1,\dots,N$. Usually, we assume below that particles interact via a given pair potential $\Phi(r)$, where $r>0$ denotes the distance between two interacting particles. We also assume that $\Phi(r) \to 0$ as $r \to\infty$. The equations of motion of the system have the following Hamiltonian form (see any textbook in classical mechanics, for example, [41]):
Thus the temporal evolution of the system can be understood as the motion of the phase point $Z_N=\{z_1,\dots,z_N\}$ in the phase space $\mathbb{R}^{6N}$. This motion obeys the conservation laws of energy $E_N \in \mathbb{R}$ and momentum $P_N\in \mathbb{R}^3$,
respectively. The laws (2.2) follow directly from equations (2.1).
In principle, the $N$-particle system described above can be used to model real gases or fluids. Then the main problem is that the number of particles $N$ is of order of $10^{23}$ (Avogadro’s number). It is intuitively clear that the case of a rarefied gas is easier for description than the case of a dense gas. Indeed, in the low density limit we obtain a free molecular flow, that is, each particle moves independently with constant velocity. Consequently our main assumption will be as follows: a typical distance between particles is much greater than the effective diameter $d$ of the potential. It is known that the typical ‘size’ $d$ of a molecule of the air is roughly equal to $3.7 \cdot 10^{-8}$ cm, whereas the number density $\bar{n}$ of the air is about $2.5 \cdot 10^{19}$ cm$^{-3}$ under normal conditions. The inequality
is the well-known criterion for an ideal gas. Note that $\delta \approx 10^{-3}$ for air under normal conditions on the surface of the Earth. This parameter $\delta$ is decreasing with height. Therefore, kinetic equations (in particular, the classical Boltzmann equation for rarefied gases) are important for applications in space science and technology.
The simplest model intermolecular potential $\Phi(r)$ corresponds to particles interacting like hard spheres of diameter $d$. Then we formally obtain
including the Coulomb case $n=1$ for any sign of $\alpha$.
In the next subsection we discuss some probabilistic aspects of the kinetic theory of gases.
2.2. Distribution functions and Liouville’s equation
We introduce an important notion of one-particle distribution function $f(x,v,t)$ (the words ‘one- particle’ are usually omitted below for the sake of brevity). The physical meaning of this function is as follows: the average number of particles in any measurable set $\Delta \subseteq \mathbb{R}^3 \times \mathbb{R}^3$ is given by the equality
are fixed. How can we find the distribution function $f(x,v,t)$ for $t>0$? This is, in a sense, the main problem of kinetic theory. It can be shown that for some special physical systems, like rarefied gases, the temporal evolution of distribution function $f(x,v,t)$ is described by a so-called ‘kinetic equation’
where $A(f)$ is a non-linear operator acting on $f$. We usually assume that the initial value problem (2.7), (2.8) has a unique solution $f(x,v,t)$ on some time interval $0 \leqslant t \leqslant T$.
Let us consider the simplest kinetic equation connected with the system (2.1). Omitting the index $i=1$ we obtain the equations of free motion
Note that the motion of one particle can also be described by the distribution function $f(x,v,t)$, which has in this case the meaning of a probability density, provided that
This is the simplest kinetic equation. Note that the kinetic equations (2.12) are exactly equivalent to the dynamical equations (2.9). The probabilistic description is caused only by an uncertainty in the initial conditions.
Let us now extend these arguments to the case of $N$ non-interacting particles. We consider equations (2.1) for $\Phi(r)\equiv 0$ and obtain
Remark 2.1. Here and below we use notation like $F_N(z_1,\dots,z_N)$ (with upper- case $F$) for various multi-particle distribution functions which have the meaning of a probability density. These functions are always normalized by $1$ in the whole phase space. The notation like $f_N(z_1,\dots,z_N)$ will be used for a slightly different class of functions related to equality (2.6). The difference disappears in the trivial case $N=1$.
This property is known as the propagation of chaos [39]. It is self-evident for non-interacting particles, but it can also be proved as an asymptotic property of more complex multi-particle systems.
What changes if we consider the Hamiltonian system (2.1) for a non-zero potential $\Phi(r)\ne 0$? Then we can still use the $N$-particle distribution function $F_N=\{z_1,\dots,z_N;t)$. We shall see below that the equation for $F_N$ reads
which is Liouville’s equation (2.18) in slightly different notation. The sum in (2.21) is usually called the Poisson bracket $\{F,H\}$ (only for particles with unit mass $m= 1$); see [41].
Liouville’s equation is very important because it allows one (at least formally) to build a bridge between $N$-particle dynamics, kinetic theory, and hydrodynamics.
2.3. The BBGKY-hierarchy
In § 2.2 the function $f_N(z_1,\dots,z_N;t)$ was assumed to be integrable over the whole phase space ${\mathbb{R}}^{6N}$. This assumption is sometimes too strong. For example, it does not allow one to consider translationally invariant (in the physical space ${\mathbb{R}}^3$) systems with initial data like $f_2^{(0)}(x_1-x_2;v_1,v_2)$. Therefore, it is more convenient to consider an $N$-particle system confined to a bounded domain $\Omega \subset {\mathbb{R}^3}$, say, a box or a sphere with ‘large’ diameter $L$. The volume of $\Omega$ is denoted by $|\Omega|$. Then the phase state of the $i$th particle is described by the point
We can assume for simplicity that the walls of $\Omega$ are specularly reflecting. Another possibility is to assume that $\Omega$ is a periodic box. What is important is to keep unchanged the total number $N$ of particles inside $\Omega$.
For the sake of brevity we use below the symbolic notation
The function $F_N(1,\dots,N;t)$ is assumed to be invariant under permutations of its arguments $(1,\dots,N)$ because all $N$ particles are identical. This property is preserved by Liouville’s equation, provided that it is valid at $t=0$.
We introduce $k$-particle probability distributions by the equalities
Here and below the argument $t$ is omitted in such cases when this does not cause any confusion. The next step is to obtain a set of evolution equations for $F_k^{\kern1pt(N)}(1,\dots,k;t)$. We take any $1\leqslant k \leqslant N-1$ and integrate equation (2.25) with respect to $d(k+1)\cdots\,dN$. The result reads
If we take another value of $j$ in the sum for $\Gamma_{ik}^{\kern1pt(N)}$, then the result is the same. This follows from the symmetry of $F_N(1,\dots,N)$ with respect to permutations. Moreover, the operator $B_{i,k+1}$ from (2.25) can be replaced by
The set of equations (2.31), (2.32) is called (provided that we ignore its trivial modifications) the BBGKY-hierarchy. The acronym BBGKY stands for Bogolyubov, Born, Green, Kirkwood, and Yvon, the names of the physicists who introduced independently this system of equations (see, for example, [9] and [15]). Equation (2.31) for $k=N$ also makes sense if we set $F_N^{\kern1pt(N)}=F_N$ and $F_N^{(N+1)}=0$. Then it becomes Liouville’s equation for $F_N$. Note that equations (2.31) (the BBGKY- hierarchy for $N$-particle dynamical system (2.1) in the box $\Omega$) can formally be considered as exact. The only relevant assumption is equality (2.29), related to interactions with boundaries of the box $\Omega$. This equality is fulfilled, in particular, for a periodic box or a box with specularly reflecting walls.
The BBGKY-hierarchy is a starting point for all classical kinetic equations. For example, assume that the interaction between particles is weak and replace $\Phi(r)$ by $\varepsilon\bar{\Phi}(r)$ in equations (2.31) in the following form:
and to assume that $F_{k}^{\kern1pt(N)} \to F_k$ in that limit for all $k=1,2,\dots$ . Then we formally obtain the following infinite set of limiting equations for $F_{k}$:
The most important applications of Vlasov’s equations are related to the Coulomb potential $\Phi(r)=\alpha/r$ (gravitational or electrostatic forces).
On the other hand, in the important case of potentials $\Phi(r)$ with compact support (strong interaction at short distances) it is more convenient to use a different approach. This approach is considered in the next subsection.
2.4. Hard spheres and the Boltzmann–Grad limit
We begin with the case of $N$ hard spheres of diameter $d$. Then the Liouville equation (2.18) cannot be used directly because the potential $\Phi_{\rm HS}(r)$ (2.4) is too singular. The function $F_N(x_1,v_1;\dots;x_N,v_N;t)$ is defined in this case by the free flow equation
We add to this equation the boundary conditions on each of $N(N-1)/2$ boundary surfaces $|x_i-x_j|=d$ ($i,j=1,\dots,N$, $i\ne j$) in $\mathbb{R}^{3N}$. Taking, for example, $i=1$ and $j=2$, and assuming the specular reflection law we obtain
$$
\begin{equation}
\begin{gathered} \, x=x_1-x_2= d\,n, \quad n \in S^2, \qquad u=v_1-v_2, \\ v_1'=v_1-n (u \cdot n),\quad v_2'=v_2+n (u \cdot n). \end{gathered}
\end{equation}
\tag{2.36}
$$
The surfaces that correspond to multiple collisions of $k \geqslant 3$ particles are described by at least $k-1 \geqslant 2$ equalities. For example, for $k=3$ we need to satisfy simultaneously two conditions like $|x_1-x_2|=d$ and $|x_2-x_3|=d$. These surfaces have a zero measure as compared with the case $k=2$ of pair collisions. Therefore, we ignore multiple collisions.
if $|x_i-x_j| < d$ for at least one pair of indices $1 \leqslant i<j \leqslant N$. Here and below we use the symbolic notation (2.23) from § 2.3 when it does not cause any confusion. We also assume that $F_N \geqslant 0$ is the probability density in $G_N$ with usual normalization condition (2.24). Our aim is to construct the equation for the one-particle probability density $F_1^{\kern1pt(N)}(1)$ defined in (2.26) for $k=1$. Note that this function can equivalently be defined by the equality
We separated the term with $k=1$ in (2.40) because all other terms in the sum are equal to $I_2$. Indeed, we always assume that all particles are identical and therefore $F_N(1,\dots,N)$ is symmetric with respect to permutations of the arguments $(2,\dots,N)$. Hence $I_k=I_2$ for any $3 \leqslant k \leqslant N$, and therefore
We use the notation $\widetilde{F}_2^{\kern1pt(N)}$ because this function coincides formally with $F_2^{\kern1pt(N)}$ in (2.26) only for $d=0$. We apply Gauss’s theorem to the integral (2.43) and obtain after simple transformations
where $\widetilde{F}_2^{\kern1pt(N)}(x_1,v_1;x_2,v_2)$ was defined in (2.44). We can split the integral over $S^2$ into two parts in the following way:
$$
\begin{equation*}
\begin{gathered} \, \int_{S^2} dn\,(u \cdot n)\Psi(n)=\int_{S_+^2} dn\,|u \cdot n|\Psi(n)- \int_{S_-^2} dn\,|u \cdot n| \Psi(n)\,, \\ S_+^2=\{n \in S^2\colon u \cdot n >0\},\qquad S_-^2=\{n \in S^2\colon u \cdot n <0\}, \end{gathered}
\end{equation*}
\notag
$$
where $u=v_1-v_2$ and $\Psi(n)$ is an arbitrary integrable function. It is clear from (2.35) and (2.36) that the function $\Psi(n)=\widetilde{F}_2^{\kern1pt(N)}(x_1,v_1; x_1-dn,v_2)$ in the integral over $S_+^2$ can be expressed in terms of $\Psi(n)$ in the integral over $S^2_-$. Then we obtain
Note that equations (2.45) and (2.46) are formally exact for hard spheres. To our knowledge, they were published for the first time by Grad no later than 1957 [34]. These equations are very important as a starting point for the mathematically rigorous derivation of the Boltzmann equation.
For our goals it is sufficient to introduce the ‘chaotic’ initial data
Then from (2.45) and (2.46) we formally obtain the Boltzmann equation for hard spheres. In the simplified notation $x=x_1$, $v=v_1$, $w=v_2$ this equation reads
(the domain of integration $S_+^2$ in (2.46) is extended to the whole unit sphere $S^2$ in an obvious way). The limiting initial conditions follows formally from (2.47):
We consider in this section the case of the whole space $G=\mathbb{R}^3 \times \mathbb{R}^3$ in order to simplify formal calculations. In fact, the same equations (2.50)–(2.52) can be derived rigorously from $N$-particle dynamics in the case when all particles are confined to a bounded domain $\Omega$ with reflecting walls (see [21] and the references there for details).
In our formal derivation of the Boltzmann equation in this section we have followed Grad’s scheme from [34]. It should be pointed out that the first mathematically rigorous proof of the validity of the Boltzmann equation for hard spheres was given by Lanford [42] in 1975. The deep and rigorous presentation of the validation problem for the Boltzmann equation can be found in the book [21].
2.5. The Boltzmann equation for hard spheres and other potentials
The classical Boltzmann equation is usually considered not for the probability density $F(x,v,t)$, but for the distribution function
where the irrelevant arguments $(x,t)$ of $f(x,v,t)$ have been omitted in the so-called Boltzmann collision integral $Q(f,f)$. Note that $Q(f,f)$ is an operator quadratic with respect to $f$ and acting only on the variable $v \in \mathbb{R}^3$. The advantage of equation (2.54) is that it does not contain the number of particles $N$. The Boltzmann collision integral can be presented in different forms. In particular, the following form of $Q(f,f)$ is very useful:
Proof. We evaluate the integral $I(F)$ in the spherical coordinates with polar axis directed along $u \in \mathbb{R}^3$. Thus we set $k=rn$, $n \in S^2$, in (2.56) and obtain
$$
\begin{equation*}
\begin{aligned} \, I(F)&=\int_{S^2}\,dn\int_{0}^{\infty}\,dr\, r\delta \biggl[\frac{1}{2}(2 n \cdot u+r)\biggr]F(rn) \\ &=4\int_{S^2_-}\,dn \,|n \cdot u|F[-2(u \cdot n)n],\quad S^2_-=\{n \in S^2\colon u \cdot n < 0\}. \end{aligned}
\end{equation*}
\notag
$$
The integrand is an even function of $n \in S^2$, and thus the first equality (2.57) follows. The second equality is based on the change of variables $k=\widetilde{k}-u$ in the integral (2.56). Then we obtain
in the notation of (2.59). It is easy to check that this formula for $Q(f,f)$ coincides with (2.55). Hence the equivalence of (2.54) and (2.55) is proved. Note also that the same identity (2.57) leads to the third useful representation of the collision integral for hard spheres:
and $\sigma_{\rm diff}=d^2/4$ denotes the so-called differential scattering cross-section for hard spheres with diameter $d$. A bar in (2.64) and (2.65) means actually averaging over the random impact parameters. The physical meaning of equation (2.62) becomes clear if we write it as
On the other hand the average change of $\psi$ in the collision of particles with velocities $v$ and $w$ is equal to the average difference $[\overline{\psi(v')-\psi(v)}]$ given in (2.65). Hence the right-hand side of (2.66) defines correctly (at the intuitive level) the rate of change of $\langle f,\psi \rangle$ due to collisions.
These considerations allow us to generalize formulas (2.64)–(2.67) to the case of general (repulsive) potential $\Phi(r)$ with finite radius of action $R_{\max}=d$. In that case we have the same total cross-section $\sigma_{\rm tot}=\pi d^2$ as for hard spheres with diameter $d$. However, the differential cross-section of scattering $(v,w) \to (v',w')$, such that
which is discussed in detail in textbooks on classical mechanics (see [41]). The connection of $\sigma_{\rm diff}(|u|,\mu)$ with the potential $\Phi(x)$ is briefly discussed below in § 2.8. If we fix the intermolecular potential $\Phi(r)$ and the corresponding differential scattering cross-section $\sigma(|u|,\mu)$, then we obtain the same equation (2.64), where
Finally, we note that the total cross-section $\sigma_{\rm tot}$ disappears after the substitution of (2.65) into (2.64). In addition, the differential cross-section $\sigma(|u|,\mu)$ is always multiplied by $|u|$ in the integrand in (2.64). Therefore, it is more convenient to introduce the new function
Note that $\psi(v)$ is an arbitrary test function. Therefore, this expression is sometimes called ‘the weak form of the Boltzmann collision integral’. The corresponding strong form of $Q(f,f)$ is
in the notation of (2.70) and (2.72). We shall usually consider $g(|u|,\mu)$ as a given function.
2.6. Basic properties of the Boltzmann equation
In applications to rarefied gas dynamics we are mainly interested not in the distribution function $f(x,v,t)$ itself, but in the (macroscopic) characteristics of the gas averaged over the velocity space. In accordance with the physical meaning of $f(x,v,t)$, the density of the gas or, equivalently, the number of particles per unit volume is defined by the equality
$$
\begin{equation}
\rho(x,t)=\langle f,1 \rangle=\int_{\mathbb{R}^3}dv\,f(x,v,t),\qquad x \in\mathbb{R}^3,\quad t \geqslant 0.
\end{equation}
\tag{2.74}
$$
Other important macroscopic characteristics of the gas are the bulk or mean velocity $u(x,t)$ (not to be confused with the notation $u$ for the relative velocity in the collision integral (2.72)) and the absolute temperature $T(x,t)$. These functions are defined by the equalities
in the notation of (2.55). Here $m$ is the molecular mass, while $T$ is expressed in energy units. Usually, we assume in this paper that $m=1$ unless a mixture of different gases is considered.
For given values of $\rho(x,t)$, $u(x,t)$, and $T(x,t)$ the following distribution function is called the local Maxwell distribution (or Maxwellian):
where it is assumed that $m=1$ in (2.75). The same function is called the ‘absolute Maxwellian’ if the parameters $\rho$, $u$, and $T$ are independent of $x$ and $t$.
Returning to the Boltzmann equation (2.73) we can easily understand the importance of the Maxwellian distribution (2.76). Indeed, it follows from (2.72) that
Another important property of the Boltzmann equation is connected with the conservation laws for the mass, momentum, and energy. We consider identity (2.70) for a fixed test function $\psi(v)$ and transform the integral by interchanging the variables $v$ and $w$. Then we easily obtain
for arbitrary constant parameters $a$, $b$, and $c$. This identity leads to conservation laws for the mass, momentum, and energy. Indeed, consider equation (2.62) for $\psi=1$, $\psi=v$, and $\psi=|v|^2$, respectively. Then we obtain
which is valid for any function $\Psi(v_1,v_2;v_3,v_4)$ such that the integral is convergent. For the proof it is sufficient to pass to the variables $U$ and $u$ from (2.63) in the integrand and to repeat the considerations used for the proof of identity (2.81). For brevity we omit these straightforward calculations.
To complete the proof of (2.85) we consider (2.81) and set
This inequality is known as Boltzmann’s $H$-theorem. Its importance can easily be understood in the spatially homogeneous case, considered in the next section.
2.7. The spatially homogeneous problem
The Boltzmann equation (2.73) admits a class of spatially homogeneous solutions $f(v,t)$. Usually, we consider the initial value problem
We also note that the operator $Q(f,f)$ is invariant under the shifts $v \to v+v_0$, $v_0 \in \mathbb{R}^3$, in the $v$-space. Therefore, if $f(v,t)$ is a solution of the equation in (2.90), then $f(v+v_0,t)$ is also a solution for any $v_0 \in \mathbb{R}^3$. Hence we can always reduce problem (2.90) to the case
Moreover, if $f(v,t)$ is a solution of the spatially homogeneous Boltzmann equation, then so is the function $\widetilde{f}(v,t)=\alpha f(v,\alpha t)$ for any $\alpha >0$. This transformation allows one to reduce the general problem (2.90) to the case
for almost all values $(v,w, \omega) \in \mathbb{R}^3\times \mathbb{R}^3\times S^2$, provided the kernel $g(|u|,\mu)$ in (2.72) is positive almost everywhere. This functional equation was studied in various classes of functions by many authors, beginning with Boltzmann (see [21] and the references there). They proved the uniqueness of its well-known solution (2.78). On the other hand the only function (2.78) which satisfies the conservation laws discussed above is the Maxwellian $f_{\rm M}$ (2.94). Hence we can conclude at the formal level that $H(f)$ decreases monotonically in time unless $f=f_{\rm M}$. This conclusion can be confirmed by the general inequality
where $G_1(t)=\log t+t^{-1}-1 \geqslant 0$. We set $z=f(v)$, $y=f_{\rm M}(v)$ and integrate the inequality $G(f,f_{\rm M}) \geqslant 0$ over $v \in \mathbb{R}^3$. This completes the proof of (2.98). Inequality (2.98) shows that the Maxwellian (2.74)–(2.76) is the minimizer of the $H$- functional $H(f)=\langle f,\log f \rangle$ in the class of distribution functions with fixed lower moments $(\rho,u,T)$.
Returning to the initial value problem (2.90) and assuming conditions (2.92) and (2.93), we know that there is a unique positive stationary solution $f_{\rm M}$, given in (2.94) and (2.95), such that all conservation laws are satisfied. This stationary solution $f_{\rm M}$ minimizes the $H$-functional, and therefore we expect that the solution $f(v,t)$ converges (in some precise sense) to $f_{\rm M}$ for large values of time.
This is the qualitative behaviour of solutions of problem (2.90) that we expect on the basis of the above formal considerations. The corresponding physical process is called ‘relaxation to equilibrium’.
The rigorous mathematical theory of problem (2.90) is not simple. First steps in its development were made by Carleman in the 1930s (see [19]) for the model of hard spheres. Then a more general and detailed theory was presented by Arkeryd [2] in the early 1970s (see also [3]). To understand these and more recent results in this area we need to introduce a sort of classification of collisional kernels $g(|u|,\mu)$ in the Boltzmann integral (2.72). This is done in the next subsection.
2.8. Collisional kernels
Recall that the kernel $g(|u|,\mu)$ is equal to $|u|$ times the differential cross-section $\sigma(|u|,\mu)$ expressed as a function of $\mu=\cos \theta$. The scattering angle $\theta \in [0,\pi]$ is given in the form (see [41])
where $b$ is the impact parameter, $\Phi(r)$ is the intermolecular potential, and $m$ is the reduced mass of colliding particles ($m=1/2$ for particles with unit mass). To find $\sigma(|u|,\mu)$ we need to construct the inverse function $b=b(|u|,\theta)$. Then we express this function as $b=\widetilde{ b}(|u|,\cos \theta)$ and finally obtain (see [41])
Generally speaking, this is a rather complicated calculation. Fortunately it leads to a simple explicit formula $\sigma=d^2/4$ in the important case, when particles are hard spheres with diameter $d$. It also can be shown that for power-like potentials $\Phi(r)=\alpha r^{-n}$, $\alpha > 0$, we obtain
where a function of $\theta$ is expressed as the function $\widetilde{A}_n$ of $\mu$. Hence in the case of power-like potentials $\Phi(r)=\alpha r^{-n}$ the collisional kernel in (2.72) reads
We can use the same formula for hard spheres by assuming that $n=\infty$ and $g_\infty=d^2/4$. There is, however, an important difference between hard spheres and power-like potentials. We consider the collision integral (2.72) again and split it formally into two parts:
It was already discussed in § 2.5 that $\sigma_{\rm tot}=\pi R^2_{\max}$, where $R_{\max}$ denotes the radius of action of the potential. In the case of hard spheres of diameter $d$ or any potential with $R_{\max}=d$ we obtain the universal formula for the collision frequency:
However, if we consider the power-like potential for any $n > 0$, then $R_{\max}=\infty$, and therefore the integral $\nu(v)$ diverges. Hence the splitting (2.102) is impossible, though the ‘full’ collision integral (2.72) can be convergent. The matter is that the kernel $g(|u|,\mu)$ has a non-integrable singularity at $\mu=1$, that is, $\theta=0$, for long-range potentials with $R_{\max}=\infty$. At the same time $v'=v$ and $w'=w$ if $\mu=1$. Therefore, the second factor in the integrand is equal to zero at that point. It is easy to see that the integral (2.72) is convergent for a large class of functions $f(v)$, provided that
in the case of power-like potentials. It can be shown that this condition is satisfied for all $n > 1$. The Coulomb case $n=1$ is always considered separately.
There also were many publications in the last two decades that were related to the Boltzmann equation with long-range potentials, but we do not consider related problems below (see, for example, [1] and the references there). Our main goal is to discuss a general class of Boltzmann-type equations, which are used in applications by physicists. This class of equations is introduced in the next section.
3. Boltzmann-type kinetic equations and their discrete models
3.1. A generalization of the Boltzmann equation
In this section we introduce a general class of kinetic equations that includes the spatially homogeneous Boltzmann equation (2.90) as a particular case. To this end we choose a function $F(x_1,x_2;x_3,x_4)$ of four real (or complex) variables and assume that
We also introduce a non-negative function $R(u_1,u_2)$ of two vectors $u_1,u_2 \in \mathbb{R}^3$. It is usually assumed that $R(u_1,u_2)$ is invariant under rotations of $\mathbb{R}^3$. Then it is known that
that is, such a function can be reduced to a function of three scalar variables. This property holds for $d$-dimensional vectors for any $d \geqslant 2$. The function $R(u_1,u_2)$ plays below the role of the kernel of a certain integral operator.
Then we define the general Boltzmann-type kinetic equation for a function $f(v,t)$, where $v \in \mathbb{R}^3$ and $t \in \mathbb{R}_+$, by the equality
Our closest goal is to simplify this integral for arbitrary $F(x_1,x_2;x_3,x_4)$ and, in particular, to show that $K[f]$ coincides with the Boltzmann collision integral if
This formula for $K[f](v)$ is valid for any kernel $R(u_1,u_2)$. In the case of an isotropic kernel (invariant under rotations) we can use (3.2) and set
where the notation is as in (2.72). If, in addition, we assume that $F$ is given by (3.5), then $K[f](v)$ coincides with the Boltzmann collision integral (2.72). Note that the simplest case of a constant kernel $R(u_1,u_2)=\mathrm{const}$ in (3.4) corresponds, under condition (3.5), to the case of hard spheres for the Boltzmann equation. Thus the following statement is proved.
Proposition 3.1. The equation (3.3), (3.4) with isotropic kernel (3.2) can be reduced by formal transformations to the Boltzmann-type equation (3.3), (3.11). The connection between the kernels of the corresponding integral operators is described by equality (3.10). If $F(x_1,x_2;x_3,x_4)$ in the operator $K$ in (3.4) is given by formula (3.5), then equation (3.3), (3.4) coincides with the spatially homogeneous Boltzmann equation (2.90), (2.72).
Some authors (see, for example, [25]) consider a $d$-dimensional version of the integral (3.4), where $\mathbb{R}^3$ is replaced by $\mathbb{R}^d$, $d \geqslant 2$. Then all above transformations can be repeated for the integral $K[f](v)$, $v \in \mathbb{R}^d$, with minimal changes [12]. The $d$-dimensional analogue of equality (3.8) reads
in the notation of (3.9), where $v \in \mathbb{R}^d$, $w \in \mathbb{R}^d$, and $\omega \in S^{d-1}$. If we assume that the kernel $R(u_1,u_2)$ is invariant under rotations of $\mathbb{R}^d$, then we can use the same identity (3.2) and set
The substitution of formula (3.13) into (3.12) leads to the $d$-dimensional ‘collision integral’ (3.11), where the domain of integration $\mathbb{R}^3 \times S^2$ is replaced by $\mathbb{R}^d \times S^{d-1}$. Hence Proposition 3.1 can formally be generalized to an arbitrary dimension $d \geqslant 2$. For the simplicity of presentation we mainly consider below the case $d=3$.
We present without derivation two other useful forms of the integral (3.4) [12]. The first reads
It is clear that the integral over $k \in \mathbb{R}^3$ can be reduced to an integral over the plane orthogonal to $u \in \mathbb{R}^3$. For the Boltzmann equation a transformation of this kind was originally used by Carleman [19]. The second form of $K[f](v)$ reads
Note that the notation for $v'$ and $w'$ in this formula differs from similar notation in (3.9). We do not use below representation (3.15), (3.16) for $K[f](v)$, but it should be mentioned because in the case (3.5) of the Boltzmann equation it is the most conventional form of the collision integral (see, for example, the books [20], [21], and [43]). If the kernel $R(u_1,u_2)$ is isotropic, then we can use equality (3.13) and obtain
in the notation of (3.9) one can easily check that the two scalar functions $h_1=1$ and $h_3= |v|^2$, and also the vector-function $h_2=v \in {\mathbb{R}^3}$ are linearly independent solutions of this equation. The uniqueness of these solutions in different classes of functions has been proved by many authors (see the discussion of equation (3.5) in [21]).
Hence we have the following conservation laws for equation (3.3), (3.4):
provided that conditions (3.1) are fulfilled for $F$ in (3.4). The corresponding integrals in (3.23) in the case (3.5) of the Boltzmann equation have the physical meaning of the total number of particles (gas molecules), total momentum, and total kinetic energy, respectively. These properties of the Boltzmann equation were discussed in § 2.6 above.
Let us assume that there exists a function $p(x)$ such that
for almost all $x_i \geqslant 0$, $i=1,2,3,4$. Then we can formally introduce a generalized $H$-functional (see the end of § 2.6) on the set of non-negative solutions $f(v,t)$ of equation (3.3), (3.4) by the formula
We always assume that $F$ in (3.4) satisfies conditions (3.1). Therefore, we can apply identity (3.20) and conclude from inequality (3.24) that $\widehat{H}[f(\,\cdot\,,t)]$ cannot increase with time. In the case (3.5) of the Boltzmann equation inequality (3.24) holds for $p(x)=\log x$, and we obtain
Note that $\langle f,1\rangle=\mathrm{const}$ because of the conservation laws (2.1). Therefore, the functional $\widehat{H}(f)$ is basically the same as the classical Boltzmann $H$-functional $H(f)=\langle f,\log f\rangle$ considered in § 2.6.
The results of this subsection can be formulated as follows.
Proposition 3.2. Equation (3.3), (3.11), where $F(x_1,x_2;x_3,x_4)$ satisfies conditions (3.1), has the same conservation laws (3.23) as the Boltzmann equation. If the function $F$ also satisfies inequality (3.24) for some function $p(x)$, $x \in \mathbb{R}_+$, then (at least formally) the functional $\widehat{H}[f(\,\cdot\,,t)]$ (3.25) cannot increase with time $t \geqslant 0$ on any solution $f(v,t)$ of equation (3.3), (3.11).
Of course, all our considerations in §§ 3.1 and 3.2 were made at the formal level of mathematical rigour, since we did not specify the function $F(x_1,x_2;x_3,x_4)$ in (3.4). In the next subsection we consider some specific cases, which are different from the Boltzmann case (3.5), but are also interesting for applications.
3.3. The Nordheim–Uehling–Uhlenbeck equation and wave kinetic equation
It is clear that specific operators $K$ in (3.4) can have different functions $F(x_1,x_2;x_3,x_4)$. In all interesting applications the function $F$ can be represented as a difference of two functions:
There are at least three cases of kinetic equations (3.3), (3.4) of interest to physics, for which $F$ has the structure (3.4) with different functions $P$. These are the following cases:
because $\theta^2=1$ and therefore terms of the fourth order in $F_{\rm NUU}$ vanish. A review of mathematical results for the NUU-equation can be found in [4] and [5] (see also [6] and [7]). An interesting formal generalization of this equation to the case of so-called anyons (quasi-particles with any fractional spin between 0 and 1) was also considered in these papers. This model corresponds to (3.27), with
The limiting values $\alpha=0,1$ correspond to the NUU-equation for bosons ($\alpha =0$) and fermions ($\alpha=1$). The existence of global solutions in $L^1\cap L^\infty$ of the Cauchy problem for equation (3.3), (3.4), where $F$ is given in (3.27) and (3.31), was proved in [4] under some restriction on the initial conditions and the kernel $R$ of the operator (3.4).
We note that in all above cases (A)–(D) it is possible to find a function $p(x)$ that satisfies inequality (3.24). Indeed the function $F$ for cases (A), (B), and (D) can be written as
where $\Phi(x)=1$ in case (A), $\Phi(x)=(1+\theta x)^{-1}$ in case (B), and $\Phi(x)$ is given in (3.31) in case (D). Then it is easy to see that the function
satisfies (3.24), provided that $\Phi(x) > 0$. The positivity condition for $\Phi(x)$ is fulfilled for all $x \geqslant 0$ in cases (A) and (B) with $\theta =1$. It is also fulfilled for $0 \leqslant x < 1$, for $\theta= -1$ in case (B) and for $0 \leqslant x < 1/\alpha$ in case (D). The known results on the existence of solutions to kinetic equations (3.3) with the corresponding operators (3.4) show that it is sufficient to satisfy these restrictions only at $t= 0$ [4]. Thus the kinetic equation (3.3) has in cases (A), (B), and (D) the monotone decreasing functionals
Inequality (3.24) in cases (A), (B), and (D) allows one to answer an important question about stationary solutions of equation (3.3). If $K[f^{\rm st}](v)=0$, then we can integrate this equality against any ‘nice’ function $h(v)$ and obtain the identity (with respect to $h(v)$) $\langle K(f^{\rm st}),h\rangle=0$ in the notation of (3.18). Then we take $h(v)=\log \Psi[f^{\rm st}]$ and use the transformation (3.20) and inequality (3.24). Since the resulting integral of a non-negative function over the set $\mathbb{R}^3 \times \mathbb{R}^3 \times S^2$ must be equal to zero, we conclude that this function is equal to zero almost everywhere in this set. This leads to the equation
in the notation of (3.9) and (3.32). This equation holds almost everywhere in $\mathbb{R}^3 \times \mathbb{R}^3 \times S^2$. Then we obtain (see comments to equation (3.22) above)
where $\alpha \in \mathbb{R}$, $\gamma \in \mathbb{R}$, and $\beta \in \mathbb{R}^3$ are arbitrary constant parameters. For brevity we do not discuss these known stationary solutions; see, for example, [5] for details. We stress that the above considerations just repeat the usual arguments in the proof of the uniqueness of the Maxwellian stationary solution to the Boltzmann equation; see, for example, [20]. Because of many similarities with the Boltzmann case (A), one can expect a similar behaviour of solutions to equation (3.3) in cases (B) and (D), in particular, convergence to stationary solutions discussed above for large values of time.
The situation looks more complex in case (C) of the WKE. In this case we can also satisfy inequality (3.24) by choosing $p(x)=-1/x$. Then this inequality for $F$ from (3.27) and (3.30) reads
However, an attempt to construct the $\widehat{H}$-functional (3.25) leads to divergent integral $I(x)$. It looks reasonable to replace this integral in (3.25) by $I(x)=-\log x$; then we formally obtain
This integral is divergent for large $|v|$, because we always assume that $f(v,t) \to 0$ as $|v| \to \infty$. Below we will try to clarify the situation with the WKE by using discrete kinetic models introduced in the next subsection.
3.4. Discrete kinetic models
The idea of using discrete velocity models for qualitative description of solutions to the Boltzmann equation seems to be very natural. Implicitly, it was already used by Boltzmann in the first publication [16] of his famous equation. We also mention the first two toy models with a few velocities introduced by Carleman [19] and Broadwell [17], respectively. An important role in the development of this idea was played by Cabannes [18] and Gatignol [31] in the 1970 –1980s. Moreover, it was proved in the 1990s (see [13], [50], and the references there) that the Boltzmann equation can be approximated by its discrete velocity models as the number of velocities tends to infinity. These results show that discrete models can not only be used for a qualitative, but also for a quantitative description of solutions to the Boltzmann equation.
A similar scheme of the construction of discrete models can be applied to the general kinetic equation (3.3), (3.4). We introduce the velocity space $V \subset \mathbb{R}^d$ that contains $n \geqslant 4$ points and replace the function $f(v,t)$ by a vector $f(t) \in \mathbb{R}^n$, where
It is implicitly assumed here that $f_i(t)$ approximates for large $n$ the function $f(v,t)$ at the point $v=v_i \in \mathbb{R}^d$, $i=1,\dots,n$. Speaking about discrete models it is convenient to use an arbitrary dimension $d \geqslant 2$, as we will see below. The simplest and most transparent case is, of course, the planar case $d=2$. The kinetic equation (3.3), (3.4) in the $d$-dimensional case (3.12) is replaced by the following set of ordinary differential equations:
$$
\begin{equation}
\frac{df_i}{dt}=\sum_{j,k,l=1}^{n}\Gamma_{ij}^{kl} F_w(f_i,f_j ; f_k,f_l),\qquad \Gamma_{ij}^{kl}=\Gamma_{ji}^{kl}=\Gamma^{ij}_{kl},\quad 1 \leqslant i \leqslant n,
\end{equation}
\tag{3.35}
$$
where the constant (for the given set $V$) parameters $\Gamma_{kl}^{ij}$ depend only on $|v_i-v_j|=|v_k-v_l|$ and $(v_i-v_j) \cdot (v_k-v_l)$ for any integer values of the indices $1 \leqslant i,j,k,l \leqslant n$. The strict inequality $\Gamma^{kl}_{ij} > 0$ is possible only for
Note that equations (3.35) have a universal form for any dimension $d \geqslant 2$, though the coefficients $\Gamma^{kl}_{ij}$ can depend on $d$. The equalities (3.36) have a simple geometric meaning: the points $\{v_i,v_j,v_k,v_l\}$ form a rectangle, where the two pairs of points $\{v_i,v_j\}$ and $\{v_k,v_l\}$ belong to two different diagonals.
Obviously, this geometrical meaning does not depend on the dimension. The simplest non-trivial example of the set $V \subset \mathbb{R}^2$ in (3.34) has just four ‘velocities’
where $\Gamma^{34}_{12} > 0$ is a constant. In the Boltzmann case (3.25) this equation can easily be reduced to a linear equation. The case (3.27), (3.30) of the WKE is a bit more complicated; it was discussed in [12] in more detail.
In order to construct a discrete model (3.34), (3.35), which has all the relevant properties of the initial kinetic equation (3.3), (3.4), we need to impose some restrictions on the set $V$ and the coefficients of equations (3.35).
Definition 3.1. The model (3.34), (3.35) is called normal if it satisfies the following conditions on the set $V$:
(a) all of its $n$ elements are pairwise different and do not lie in a linear subspace of dimension $d' \leqslant d-1$ or on a sphere in $\mathbb{R}^d$;
(b) the set $V$ does not have isolated points, that is, for any $1 \leqslant i \leqslant n$ in (3.35) there exist $1 \leqslant j,k,l \leqslant n$ such that $\Gamma_{ij}^{kl} > 0$;
is fulfilled for all indices $(i,j;k,l)$ for which $\Gamma_{ij}^{kl} > 0$, then there exist constants $\alpha,\gamma \in \mathbb{R}$ and $\beta \in \mathbb{R}^d$ such that $h(v)=\alpha+\beta \cdot v+\gamma |v|^2$.
Methods of construction of normal models are well developed (see, in particular, [11], [14], [56], and [57]). It is easy to show that $n \geqslant 6$ for them. Most models that we consider below are assumed to be normal. The condition of normality is very important for the large-time asymptotic behaviour of solutions, as we will see below.
3.5. The properties of discrete models
The discrete kinetic models (3.34), (3.35) of the kinetic equation (3.3), (3.4) is uniquely defined by
(a) the function $F(x_1,x_2;x_3,x_4)$ satisfying conditions (3.1);
(b) the phase $V=\{v_1,\dots,v_n\} \subset \mathbb{R}^d$;
(c) the set of coefficients $\Gamma=\{ \Gamma_{ij}^{kl} \geqslant 0, 1 \leqslant i, j, k, l \leqslant n\}$, where $\Gamma_{ij}^{kl}$ can depend on $|v_i-v_j |=|v_k-v_l |$ and $(v_i-v_j) \cdot (v_k-v_l)$.
Recall that the inequality $\Gamma_{ij}^{kl} > 0$ is possible only under conditions (3.36). Moreover, the symmetry conditions from (3.35) are fulfilled for all elements of the set $\Gamma$.
By using these symmetry conditions for $\Gamma_{ij}^{kl}$ and related conditions (3.1) for $F$ it is easy to derive from (3.35) the following identity:
where $h_1,\dots,h_n$ are constant numbers. Obviously, this is a discrete analogue of identity (3.20) for the kinetic equation (3.3), (3.4). Then conditions (3.36) lead to the following conservation laws:
which are similar to integrals (3.23) for the kinetic equation. Note that both identity (3.40) and the conservation laws (3.41) are valid for any discrete kinetic model, not only for normal models which cannot have other linear conservation laws than the ones listed in (3.41) or linear combinations of these. However, this is not true for all normal models, but at least this holds for normal models with function $F$ satisfying inequality (3.24) for some function $p(x)$, $x > 0$. We can prove the following statement.
Theorem 3.1. Assume that the model (3.34), (3.35) is normal and there exists a function $p(x)$ such that inequality (3.24) is satisfied for all $x_i > 0$, $i=1,2,3,4$. Also assume that
for any solution $\{f_i(t) > 0,\, i=1,\dots,n\}$ of (3.35), then $h_i=\alpha+\beta \cdot v_i+\gamma |v_i|^2$ in the notation of (3.34), where $\alpha \in \mathbb{R}$, $\gamma \in \mathbb{R}$, and $\beta \in \mathbb{R}^d$ are some constant parameters;
which is supposed to be valid for any $f_i > 0$, $i=1,\dots,n$. Assume that (b) is wrong and that in this identity $h=(h_1,\dots,h_n)$ is not a linear combination of the vectors $\varphi_1,\dots,\varphi_{d+2}$, which correspond to the conservation laws (3.41). Without loss of generality we can assume that
where $\lambda$ and $\mu$ are arbitrary real numbers. We can always choose these numbers in such a way that conditions (3.48) for $\widetilde{h}=(\widetilde{h}_1,\dots,\widetilde{h}_n)$ are fulfilled. Tildes are omitted below. The numbers $\alpha$ and $\beta$ in (3.48) are chosen in the following way. It follows from assumption (1) of the theorem that the function $p(x)$ maps the interval $[a,b]$ to another interval, say, $[\alpha,\beta]$. Moreover, there is an inverse function $x(p)$, which maps any point $p \in [\alpha,\beta]$ to $x(p) \in [a,b]$. Then we can easily construct a counterexample to our assumption by substituting $f_i=x(h_i)$, $i=1,\dots,n$, into identity (3.47). We obtain a sum of non-negative terms and conclude that each term vanishes, that is,
for any $1 \leqslant i,j,k,l \leqslant n$ such that $\Gamma_{ij}^{kl}>0$. Then we use assumption (2) of the theorem and Definition 3.1. This proves (3.1-b).
In order to prove (3.1-c) we consider identity (3.47) for a stationary solution $f^{\rm st}$. Then the left-hand side of (3.47) is equal to zero for an arbitrary vector $h=(h_1,\dots,h_n)$. We substitute $f_i=f^{\rm st}_i$ and $h_i=p(f^{\rm st}_i)$, $i=1,\dots,n$, into (3.47) and obtain again a sum of non-negative terms. The same considerations as above lead to the equalities
for all $1 \leqslant i,j,k,l \leqslant n$ such that $\Gamma_{ij}^{kl} > 0$. Then we use Definition 3.1 again and prove (3.1-c). This completes the proof.
3.6. Some transformations of equations and initial data
We return for a moment to the kinetic equation (3.3), (3.4) and note that this equation is invariant under rotations and translations of variable $v \in \mathbb{R}^3$ (or $v \in \mathbb{R}^d$, $d \geqslant 2$, in the general case; see (3.12)). Usually, we consider initial data $f(v,0) \geqslant 0$ for (3.3) such that
The invariance of equation (3.3), (3.4) under translations means that if $f(v,t)$ is a solution of this equation, then so is $f_a(v,t)=f(v+a,t)$ for any $a \in \mathbb{R}^3$. Then
in the notation of (3.18). It is always assumed that $\langle f(v,0),1 \rangle \ne 0$. Therefore, we can always choose $a \in \mathbb{R}^3$ (or $a \in \mathbb{R}^d$ in the general case) in such a way that $\langle f_a(v,0),v \rangle=0$. This almost trivial observation allows us to consider only initial conditions for (3.3), (3.4) such that $\langle f(v,0),v \rangle=0$.
This transition from the general kinetic equation (3.3), (3.4) to its discrete model (3.34), (3.35) preserves the translational symmetry of equations (3.35). Indeed, a shift $v \to v+a$ of the $v$-variables means the following transformation of the set $V$ in (3.34) for the discrete model:
Equations (3.35) of the model are connected with the set $V \subset \mathbb{R}^d$ only through the coefficients $\Gamma_{ij}^{kl}$. However, these coefficients depend (for fixed indices $1\leqslant i,j,k,l \ n$) only on differences $v_i-v_j$ and $v_k-v_l$. Therefore, each coefficient $\Gamma_{ij}^{kl}$ is invariant under translations (3.51) of the whole of $V$. It is also easy to check that Definition 3.1 of a normal discrete model is invariant under translations. In other words, if the pair $(V,\Gamma)$, where $\Gamma=\{\Gamma_{ij}^{kl}, \, 1 \leqslant i,j,k,l\}$, defines a normal model, then so does any pair $(V_a,\Gamma)$ in the notation of (3.51).
4. Convergence to equilibrium for discrete models of wave kinetic equations
4.1. The statement of the problem and the formulation of results
We consider below the discrete models (3.34), (3.35), where
The last restriction does not lead to a loss of generality, as explained in § 3.6. A strict positivity condition is needed in order to avoid some ‘non-typical’ solutions and simplify the proofs. For example, if $\Gamma_{12}^{34} > 0$ and $f_i^{(0)}=0$ for all $i \geqslant 5$, then $f_5(t)=\dots=f_n(t)=0$ for $t > 0$. In this case we obtain from (3.35) a simple set of four equations (3.38), which should be considered separately (see, for example, [12]). Neglecting such special cases does not look very important for the general qualitative behaviour of solutions of the model.
The main result of Section 4 can be formulated in the following way.
Theorem 4.1. Assume that the discrete model (4.2)–(4.4) is normal, that is, the set $V$ in (4.4) and the coefficients $\Gamma_{ij}^{kl}$, $1 \leqslant i,j,k,l \leqslant n$, satisfy Definition 3.1. Also assume that
(2) if $v_i \in V$, then $(-v_i) \in V$ for all $i=1,\dots,n$.
Then the Cauchy problem for equations (4.2) with the initial conditions (4.5) has a unique solution $f(t)=\{f_1(t),\dots,f_n(t)\}$ for all $t > 0$. Moreover,
It is easy to see that the function $T_0(b)$ defined by equality (4.8) decreases monotonically on the interval $-M^{-1} \leqslant b < \infty$ from its maximum value $T_0(-M^{-1})=M$ to zero as $b \to \infty$. Therefore, the root $b(T_0)$ defined in part (b) of the theorem is unique.
The proof of Theorem 4.1 is given in §§ 4.2–4.5. It is based on the simple estimates (4.6), conservation laws, and on the fact that equations (4.2) have a Lyapunov function which decreases monotonically on positive solutions of (4.2).
4.2. The existence and uniqueness of global non-negative solutions
The third equality in (4.10) follows from (4.5); it is not used in this subsection.
Note that the existence and uniqueness of local in time solutions to problem (4.2), (4.5) follow from the general theory of ODEs. However, we need to construct a global solution for the positive initial data (4.5). To do this we use a simple trick, which is more or less standard for the Boltzmann equation (see, for example, [21]). Namely, we modify equation (4.2) in the following way:
in the notation of (4.10) and (4.11). Note that $\lambda$ and $g$ are positive constants. It is easy to see that $A_i(\varphi) \geqslant Q^+_i(\varphi) \geqslant 0$ because
The terms $Q^{+}_{i}(\varphi)$ are also polynomials with non-negative coefficients for each $1 \leqslant i \leqslant n$. Therefore, for any two vectors
The sequence $\{\rho^{(k)}(t),\, k=0,1,\dots\}$ is obviously monotone increasing and bounded. Taking its limit and the limit of equations (4.20) as $k \to \infty$, we can easily show that
The transition to the limit under the integral sign here and below is justified by Lebesgues’s dominated convergence theorem. For brevity we ignore sets of zero measure. On the other hand it follows from (2.15), (2.16) that $0 \leqslant \varphi^{(k)}_{i} \leqslant \rho_{0}$ for all $k \geqslant 0$ and $i=1,\dots,n$. Therefore,
The uniqueness of its solution follows from standard theorems for autonomous ODEs with polynomial right-hand side. Note that we have not used any connection of the coefficients $\Gamma_{ij}^{kl}$ in equation (4.3) with the set in (4.4).
Thus the following lemma is almost proved.
Lemma 4.1. Consider equations (4.2), (4.3). Then for any non-negative data
Proof. To finish the proof it is sufficient to note that the problem (4.22) for $\varphi(t)$ coincides with the problem (4.2), (4.3), (4.23) for $f(t)$. Therefore, we just need to set $f(t)=\varphi(t)$, where $\varphi(t)$ was already constructed above. Inequalities (4.24) follow from (4.17) and (4.11). This completes the proof.
It is clear that Lemma 4.1 proves the first part of Theorem 4.1 (without statement (b)) under much weaker conditions, independent of the set $V$ in (4.4) and the specific properties of normal discrete model of WKE. The role of these stronger conditions will be clear in the next subsection.
4.3. The existence of a unique stationary solution
Our goal in this section is to prove that the stationary equation (4.2), that is, the equation
in the notation of (4.4). This fact follows from Theorem 3.1 (c) in § 3.5: we just need to check that all conditions of that theorem are satisfied. We note that the model (4.2)–(4.4) is normal by the assumptions of Theorem 4.1. Moreover, the function $p(x)=-1/x$ satisfies inequality (3.24) for $F(x_1,x_2;x_3,x_4)$ given in (4.1), as it was shown in (3.33). The function $p(x)=-1/x$ also satisfies assumptions (1) and (2) of Theorem 3.1 in § 3.5. Hence we can use part (c) of that theorem and conclude that any solution of (4.25) reads
where $\alpha \in \mathbb{R}$, $\gamma \in \mathbb{R}$, and $\beta \in \mathbb{R}^d$ are free parameters. We obtain these parameters from conditions (4.26) and the assumptions of Theorem 4.1. Note that $v_1=0$, $v_i \ne v_j$ if $i \ne j$, and the set $V$ in (4.4) is invariant (perhaps with changed numbering) under the transformation $v_i \to (-v_i)$, $1 \leqslant i \leqslant n$, as it follows from these assumptions. Hence $\alpha \ne 0$ in (4.27), and we can change the notation of (4.27) to
$$
\begin{equation}
f_i=a(1+b|v_i|^2+\beta \cdot v_i)^{-1}, \qquad 1 \leqslant i \leqslant n.
\end{equation}
\tag{4.28}
$$
Then we apply the third condition from (4.26) and obtain
We are interested in bounded non-negative functions $f_i$ in (4.28). Therefore,
$$
\begin{equation*}
(1+b|v_i|^2 )^2 > (\beta \cdot v_i)^2, \qquad 1 \leqslant i \leqslant n.
\end{equation*}
\notag
$$
Then we consider the scalar product of equality (4.29) and the constant vector $\beta \in \mathbb{R}^d$ and obtain a sum of non-negative numbers on the right-hand side. Hence $\beta=0$, since the equality $\beta \cdot v_i=0$ for all $1 \leqslant i \leqslant n$ is impossible for any normal model (see Definition 3.1).
Then it follows from the algebraic equations (4.26) that
The existence of such a root follows from simple considerations. If we consider (4.31) as the definition of the function $T(b)$, then we can compute its derivative $T'(b)$:
to zero for $b \to \infty$. Hence the inverse function $b(T)$ satisfying (4.31) is uniquely defined for all $0 < T < M$. The limiting value $b(M)=-M^{-1}$ means that the solution (4.28) becomes singular. This limit is irrelevant for Theorem 4.1 because we always have $T_0 < M$ for the initial data (4.5).
The result of this subsection can be formulated as follows.
Lemma 4.2. Assume that conditions of Theorem 4.1 are satisfied for equations (4.2), (4.3) and the set (4.4). Then the stationary equation (4.25) has a unique non-negative solution
satisfying conditions (4.26) for any $\rho > 0$ and $0 < T < \max\{|v_i|^2,\,i=1,\dots,n\}$. The parameters $a$ and $b$ are defined uniquely by the algebraic equations (4.30), (4.31).
The proof of Lemma 4.2 has already been given above. We use the notation $f^{\rm st}$ in (4.32) in order to distinguish the stationary solution from the time-dependent solution $f(t)$ of equations (4.2), (4.3).
In the next subsection we study some properties of the stationary solution (4.32).
4.4. The properties of the stationary solution
It was shown in the beginning of the previous subsection that all conditions of Theorem 3.1 are satisfied for the function $p(x)=-1/x$ under the assumptions of Theorem 4.1. In particular, inequality (3.24) for $F(x_1,x_2;x_3,x_4)$ from (4.1) and $p(x)=-1/x$ is presented in (3.33). Hence we can apply part (a) of Theorem 3.1 from § 3.5 and equations (3.45), (3.46) from its proof. Obviously, we can choose
where the set $V=\{v_1,\dots,v_n\}$ in (4.4) satisfies conditions of Theorem 4.1. It is straightforward to see that the standard method of Lagrange multipliers shows that the point $f^{\rm st} \in \Omega$ in Lemma 4.1 is a unique point of extremum in $\Omega$. In fact, it is a point of minimum because
in the notation of Theorem 4.1. The point $f =f^{\rm st} \in \Omega$ is the only point in $\Omega$ where $U(f)=0$. The function $U(f)$ is used in the next subsection for the proof of Theorem 4.1.
in the notation of (4.2)–(4.5) and assume that all conditions of Theorem 4.1 are fulfilled. The unique solution of problem (4.37) was constructed in § 4.2. This solution obviously satisfies the conservation laws
in the notation of (4.4). Moreover, $f_i(t) > 0$ for all $1 \leqslant i \leqslant n$ and all $t > 0$ because of the lower estimates in (4.24). Hence for each $t \geqslant 0$,
in the notation of (4.35). Note that $\Omega \subset \mathbb{R}^n$ is a bounded domain for any values of the parameters $\rho >0$ and $0 < T < \max\{|v_i|^{2},\, i=1,\dots,n\}$. In particular, $\Omega \subset B(\rho \sqrt{d}\,)$, where $B(\rho \sqrt{d}\,)$ is the ball of radius $\rho \sqrt{d}$ centred at the origin.
We have constructed the Lyapunov function $U(f)$ (see (4.36)) for equation (4.37) such that $U(f) \geqslant 0$ for any $f \in \Omega$. It was also shown that there is a unique point $f^{\rm st} \in \Omega$ such that $U(f^{\rm st})=0$. Note that the derivative
It is clear that under conditions of Theorem 4.1 the equation $ W(f)=0$ has a unique solution $f=f^{\rm st}$ (4.32) in $\Omega$. This fact was actually used in the proof of Lemma 4.2. Hence, similarly to $U(f)$ the function $W(f)$ has the following properties:
It follows from Lemma 4.3 that $f(t) \in \Omega_1$, $t \geqslant 0$, where $f(t)$ is the solution of problem (4.37). Note that the definition of $\Omega_1$ depends on the initial data not only through the sums (4.38), but also through the product of components of $f^{(0)}$.
We recall some known applications of Lyapunov functions (see, for example, the textbook on ODEs [53]). With some abuse of notation, we consider the vector ODE (like (4.2))
It is assumed for simplicity that the components $Q_i(f)$, $1 \leqslant i \leqslant n$, are polynomials in the components of $f=(f_1,\dots,f_n)$. Of course, these polynomials can differ from those shown in (4.3). Let $D \subset \mathbb{R}^n$ be a closed bounded domain, and assume that there exists a solution $f(t)$ of equation (4.45) such that $f(t) \subset D$ for all $t \geqslant 0$. The following theorem is a simple modification of Theorem 5.3 in [53]. In fact, the first version of this theorem was proved by Lyapunov in 1892, in his thesis [45].
Theorem 4.2. Assume that equation (4.45) has a Lyapunov function $U(f)$ such that
(a) $U(f) \geqslant 0$ for all $f \in D$, where $D \subset \mathbb{R}^n$ is closed bounded domain;
(b) $W(f)=-\operatorname{grad}_f U \cdot Q(f) \leqslant 0$ for all $f \in D$;
(c) both the functions $U(f)$ and $W(f)$ are continuous in $D$;
(d) there exists a unique vector (a point in $\mathbb{R}^n$) $f^{\rm st} \in D$ such that
Proof. The proof is simply a repetition of the proof of Theorem 5.3 in [53]. Therefore, we just outline the scheme of the proof. The first step is to prove that $U[f(t)] \to 0$ as $t \to \infty$. Assuming the opposite we obtain $U[f(t)] \geqslant \alpha > 0$, $t \geqslant 0$. Then we can prove by contradiction that $|f(t)-f^{\rm st}| \geqslant \beta$ and $W[f(t)] \geqslant \gamma$ for all $t \geqslant 0$. Hence we obtain that
This obviously contradicts the assumption that $f(t) \in D$, and therefore $U[f(t)] \geqslant 0$ for all $t \geqslant 0$. Hence $U[f(t)] \to 0$ as $t \to \infty$. The second step is to prove the limiting equality (4.47). Again, we assume the opposite. Then there exist $\varepsilon > 0$ and a sequence $\{t_k,k \geqslant 1\}$ such that $t_n \to \infty$ as $n \to \infty$ but
for all $k \geqslant 1$. The sequence $\{f_k=f(t_k), k=1,2,\dots\} \subset D$ is bounded and therefore contains a convergent subsequence, which converges to a point $\overline{f} \in D$ (here we need the domain $D$ to be closed). The equality $\overline{f}=f^{\rm st}$ is impossible because of inequalities (4.48). If we substitute this convergent subsequence into the continuous Lyapunov function $U(f)$, then the corresponding sequence converges to $U(\overline{f}) \ne 0$. Hence we obtain a contradiction, and this completes the proof.
The end of the proof of Theorem4.1. It remains to apply Theorem 4.2 to the case of equations (4.2), (4.3) with initial data (4.5). We consider the solution $f \in \Omega_1$ constructed in Lemma 2.1. It is clear that all conditions of Theorem 4.1 are fulfilled. Indeed, we choose the domain $D=\Omega_1$ in the notation of (4.44) and the functions $U(f)$ and $W(f)$ in the notation of (4.27) and (4.41), respectively. The stationary solution $f^{\rm st} \in \Omega_1$ was constructed in Lemma 4.2 in the explicit form (4.32) under the assumptions of Theorem 4.1. Equalities (4.46) follow from formulae (4.36) for $U(f)$ and (4.41) for $W(f)$, under the same assumption. The uniqueness in $\Omega$ (and therefore in $\Omega_1 \subset \Omega$) of the root $f=f^{\rm st}$ of the equations $U(f)=W(f)=0$ was proved above: see (4.47) and the comments after (4.36). Hence the assumptions of Theorem 4.1 allow us to use Theorem 4.2 and prove the limiting equality (4.7). This completes the proof of Theorem 4.1.
In the next subsection we discuss in greater detail the connection of Boltzmann-type kinetic equations with their discrete models.
4.6. On the approximation of Boltzmann-type equations by discrete kinetic models
We consider below the Boltzmann-type equation (3.1) for the distribution function $f(v,t)$, $v \in \mathbb{R}^d$, where $K[f]$ is written in the form (3.12). Omitting the irrelevant constant factor in (3.12) we obtain
Suppose that we want to approximate the integral $K[f](v)$ by a quadrature formula on some discrete lattice in the $v$-space. To this end we introduce a regular grid in $\mathbb{R}^d$:
where $\widetilde{K}_i(\widetilde{f})$ denotes a quadrature formula for $K[f](v_i)$. In principle, we can use any such formula, but there are obvious advantages associated with approximations of the form
Tildes are omitted here and below. It is assumed that the constant coefficients $\Gamma_{ij}^{kl}$ are non-negative and, moreover, $\Gamma_{ij}^{kl} > 0$ only if
It was shown there that solutions of system (4.53) of ODEs inherit the properties of solutions of the kinetic equation (4.49), namely, the conservation laws and $H$-theorem (provided that the $H$-theorem is valid for equation (4.49)).
Our goal is to explain some methods for constructing such approximations. Our presentation can be considered as a generalization of the papers [13] and [50], devoted to similar problems for the classical Boltzmann equation and its formal approximation by discrete velocity models proposed in [32].
Let $(k_1 h,\dots,k_d h)$ be our discrete set of points in $\mathbb{R}^d$, where $h$ is any positive number (mesh step) and $(k_1,\dots,k_d)$ are integers. We identify this grid with $\mathbb{Z}^d$ and call its points ‘integer points’. We also use below the term ‘even’ points if all the $n_i$ are even integers.
A natural first step is to use the simplest rectangular formula
in the notation of (4.49)–(4.52). Here $v_i$ is a given point of the lattice and the sum is taken over all points $v_j$ such that $u_{ij}=v_i-v_j$ (this choice of $v_j$ is explained below).
Then the next step is to choose an approximation of the integrals over the unit sphere $S^{d-1}$
If two integer vectors $v_i$ and $v_j$ are chosen in the way explained above, then $u_{ij}=v_i-v_j$ is an even vector and $U_{ij}=(v_i+v_j)/2$ is an integer vector. We introduce the shorthand notation
where $v_i \in h \mathbb{Z}^d$, $v_j \in \mathbb{Z}^d$, and $u_{ij}=v_i-v_j \in 2h\mathbb{Z}^d$ by construction. It is implicitly assumed that the function $\varphi(|u_{ij}|^2/4,\omega)$ is known only at points $\omega_k$ where $v_i'$ and $v_j'$ in (4.55) are integer vectors. Let us assume that
in the notation of (4.58). In our case a natural idea of the approximation of the average value (4.57) of the function $\varphi(h^2 m,\omega)$ over the unit sphere $S^{d-1}$ is to replace the integral (4.57) by the mean value over the ‘integer’ points (in the above sense) on the unit sphere. In other words, we assume that for large $m$,
where $r_d(m)$ is the number of integer solutions of equation (4.59), that is, the cardinality of $V_d(m)$. The first argument $h^2m$ of $\varphi(h^2 m,\omega)$ is not important. Normally, in the general approximate formula (4.54) we consider the limit as $h \to 0$, $m \to \infty$, and $h^2 m=\mathrm{const}$.
Thus, the approximate formula (4.60) is closely connected with the classical number-theoretical problem of finding integer solutions of equation (4.59). There is a large literature on this problem (see, for example, the books [44] and [35]). It is intuitively clear that the approximate equality (4.60) becomes exact for a continuous function of $\omega \in S^{d-1}$ in the limit as $m \to \infty$, provided that
(1) the number $r(m) $ of solutions of (4.59) tends to infinity as $m \to \infty$, and
(2) these solutions tend to be equidistributed on the sphere.
This is true for $d \geqslant 4$; see [44] for details. However, the situation is less simple in the more practically interesting cases of $d=2$ and $d=3$.
We discuss briefly these two cases below. All details can be found in [13] and [50] for $d=3$ and in [29] for $d=2$. We begin with the case $d=3$. The first difficulty is that equation (4.59) for $d=3$ does not have integer solutions if $m=4^{a}(8k+7)$, where $a$ and $k$ are integers. Fortunately this is irrelevant for our problem, since it is known from the equality for $m$ in (4.58) that there exists at least one integer solution of (4.59).
Still, there are ‘bad’ sequences of the form $m_{a}=4^{a}m_0$, where $a\to \infty$, because $r(m_{a})=r(m_0)$ for any $a=0,1,\dots$ . It follows from elementary observations that $x_1$, $x_2$, and $x_3$ in (4.59) are even if and only if $m=0\pmod4$. If $m=4^{a}m_1$, where $m_1\ne 0\pmod4$ and $m_1 \ne 7\pmod8$, then $r_3(m) \to \infty$ as $m_{1} \to \infty$. Of course, this is not enough for a rigorous justification of the approximate formula (4.60). The more difficult problem of the equidistribution of solutions of (4.59) on the sphere of radius $\sqrt{m}$ in $\mathbb{R}^3$ was solved in [13] and [50] on the basis of rather complicated number-theoretical results due to Iwaniec [38] (see also [33], [24], and [52]). Without going into details, we cite here one of the results in [50] (see Corollary 4 on p. 1873 there).
Proposition 4.1. Let $\varphi(\omega)$ be a continuous function on $S^2$. Then
Then we substitute the left-hand side of the approximate formula (4.60) in the notation of (4.56) into the approximate equality (4.54), and for $d \geqslant 3$ we obtain
Here the delta functions of integer vectors and squares of such vectors denote simply the corresponding Kronecker symbols. The kernel $\widetilde{R}(u,u')$ is defined by the equality
This is the result of Theorem 11 in [50]. It can easily be generalized to the case $d \geqslant 3$ for $\|f\|$ from (4.65) and to the whole class of Boltzmann-type operators $K[f](v)$ for a function $F(f_i(x_1),f_j(x_2);f_k(x_3),f_l(x_4))$ satisfying the inequality
for all $0 \leqslant x_i \leqslant R$, $1 \leqslant i \leqslant 4$, and for any $R > 0$. We do not discuss here various estimates of the rate of convergence of the quadrature formula (4.62) as $h \to 0$, obtained for the Boltzmann equation in [50].
Instead we give some comments on the difficult case $d=2$. In fact, the quadrature formula (4.62) was also proved for the Boltzmann equation under some smoothness assumptions on the functions under the integral sign in (4.49). Roughly speaking, it was proved in [29] that the lattice points on circles are equidistributed on the average in the following sense: the exponential sums
where $\theta_{u'} \in [0,2\pi]$ is the angular coordinate of $u' \in \mathbb{Z}^2$, converge to zero as $m$ tends to infinity. This is a key point in the proof in [29] of the quadrature formula (4.62) in the planar case $d=2$.
There is also another way of approximation of the Boltzmann-type integrals (4.59) by infinite sums. The idea is to use Carleman’s form of the integral (see formula (3.14) in Section 3.1) with inner integral over a plane. Discrete models of the Boltzmann equation based on this idea were constructed and discussed in [51]. Independently, the convergence of certain infinite sums to the integral $K[f]$ for the WKE written in a similar form was discussed in [25]. For the sake of brevity we do not discuss here the problem of the approximation of infinite sums by finite sums. This is always possible to do for functions $f$ and $F$ satisfying conditions (4.65) and (4.66), respectively.
5. Conclusions
A large class of non-linear kinetic equations of Boltzmann type was considered in §§ 2–4 from a unified point of view. This class includes, in particular, such well-known equations as (a) the classical Boltzmann equation, (b) the quantum Nordheim–Uehling–Uhlenbeck equation, (c) the wave kinetic equation used in the theory of weak turbulence.
It was shown that all these equations can naturally by considered as different forms of the general Boltzmann-type equation introduced in § 3. The general properties (conservation laws and monotone functionals) of this equation were also studied there. By analogy with discrete velocity models of the Boltzmann equation the class of discrete models of the general kinetic equation was introduced and the properties of models were studied.
The long-time behaviour of solutions to discrete models of the WKE was investigated in detail in § 4. First we proved the existence of a unique global in time solution of the corresponding set of ODEs for any non-negative initial conditions. The Lyapunov function was constructed for any positive solution of the model and then used for the proof of convergence to equilibrium at the end of § 4. This result was established for so-called normal models, which do not have any spurious conservation law. Perhaps, similar results can also be proved for discrete models of the NUU-equation for fermions, but the case of WKE looks more interesting for some reasons.
The matter is that it is natural to expect that the time-evolution of solutions to normal discrete kinetic models imitates, to a certain extent, the behaviour of the corresponding solutions to the kinetic equations. In principle, we can approximate with any prescribed accuracy the kinetic equation by a sequence of discrete models with sufficiently large number of discrete phase points, as it was shown in § 4.6. These arguments work very well in the case of the Boltzmann equation, for which the discrete models predict that a solution $f(v,t)$, $v \in \mathbb{R}^d$, with finite moments up to the second order tends, as $t \to \infty$, to a Maxwellian distribution of the form $M(v)=a\exp(-b|v|^2)$, under some irrelevant extra conditions. The positive parameters $a$ and $b$ are determined by conservation laws. This prediction is absolutely correct for the Boltzmann equation. On the contrary, the attractor for solutions to discrete models of the WKE has the form $f^{\rm st}(v)=a(1+b|v|^2)^{-1}$. This function is obviously not integrable in $\mathbb{R}^d$, $d \geqslant 2$, for any positive $a$ and $b$. Therefore, it cannot be an attractor for integrable solutions of the WKE. Hence a straightforward prediction of similar long-time behaviour is impossible in that case. At the same time the information about the long-time behaviour of solutions to discrete models of WKE can still be useful. We hope to return to this question in subsequent publications.
Acknowledgements
I thank S. B. Kuksin for important discussions. I am also grateful to I. F. Potapenko for her help in the preparation of the manuscript.
Bibliography
1.
R. Alexandre, Y. Morimoto, S. Ukai, Chao-Jiang Xu, and Tong Yang, “Smoothing effect of weak solutions for the spatially homogeneous Boltzmann equation without angular cutoff”, Kyoto J. Math., 52:3 (2012), 433–463
2.
L. Arkeryd, “On the Boltzmann equation. I. Existence”, Arch. Ration. Mech. Anal., 45 (1972), 1–16; “II. The full initial value problem”, 17–34
3.
L. Arkeryd, “$L^\infty$ estimates for the space-homogeneous Boltzmann equation”, J. Stat. Phys., 31:2 (1983), 347–361
4.
L. Arkeryd, “A quantum Boltzmann equation for Haldane statistics and hard forces; the space-homogeneous initial value problem”, Comm. Math. Phys., 298:2 (2010), 573–583
5.
L. Arkeryd, “On low temperature kinetic theory: spin diffusion, Bose–Einstein condensates, anyons”, J. Stat. Phys., 150:6 (2013), 1063–1079
6.
L. Arkeryd and A. Nouri, “Bose condensates in interaction with excitations: a kinetic model”, Comm. Math. Phys., 310:3 (2012), 765–788
7.
L. Arkeryd and A. Nouri, “A Milne problem from a Bose condensate with excitations”, Kinet. Relat. Models, 6:4 (2013), 671–686
8.
A. A. Arsen'ev, “The Cauchy problem for the linearized Boltzmann equation”, U.S.S.R. Comput. Math. Math. Phys., 5:5 (1975), 110–136
9.
R. Balescu, Statistical mechanics of charged particles, Monographs in Statistical Physics and Thermodynamics, 4, Intersci. Publ. John Wiley & Sons, Ltd., London–New York–Sydney, 1963, xii+477 pp.
10.
A. V. Bobylev, Kinetic equations, v. 1, De Gruyter Ser. Appl. Numer. Math., 5/1, Boltzmann equation, Maxwell models, and hydrodynamics beyond Navier–Stokes, De Gruyter, Berlin, 2020, xiii+244 pp.
11.
A. V. Bobylev and C. Cercignani, “Discrete velocity models without nonphysical invariants”, J. Stat. Phys., 97:3-4 (1999), 677–686
12.
A. V. Bobylev and S. B. Kuksin, “Boltzmann equation nd wave kinetic equations”, Keldysh Institute Preprints, 2023, 031, 20 pp. (Russian)
13.
A. V. Bobylev, A. Palczewski, and J. Schneider, “On approximation of the Boltzmann equation by discrete velocity models”, C. R. Acad. Sci. Paris Sér. I Math., 320:5 (1995), 639–644
14.
A. V. Bobylev and M. C. Vinerean, “Construction of discrete kinetic models with given invariants”, J. Stat. Phys., 132:1 (2008), 153–170
15.
N. N. Bogoliubov, “Problems of a dynamical theory in statistical physics”, Studies of statistical mechanics, v. I, Series in Physics, North-Holland Publishing Co., Amsterdam, 1962, 1–118
16.
L. Boltzmann, “Weitere Studien über das Wärmegleichgewicht unter Gasmolekulen”, Wien. Ber., 66 (1872), 275–370
17.
J. E. Broadwell, “Study of rarefied shear flow by the discrete velocity method”, J. Fluid Mech., 19:3 (1964), 401–414
18.
H. Cabannes, The discrete Boltzmann equation (theory and applications), Lecture notes given at the University of California, Univ. of California, Berkeley, 1980, viii+55 pp.
19.
T. Carleman, Problèmes mathématiques dans la théorie cinétique des gaz, Publ. Sci. Inst. Mittag-Leffler, 2, Almqvist & Wiksells, Uppsala, 1957, 112 pp.
20.
C. Cercignani, The Boltzmann equation and its applications, Appl. Math. Sci., 67, Springer-Verlag, New York, 1988, xii+455 pp.
21.
C. Cercignani, R. Illner, and M. Pulvirenti, The mathematical theory of dilute gases, Appl. Math. Sci., 106, Springer-Verlag, New York, 1994, viii+347 pp.
22.
S. Chapman, “On the law of distribution of molecular velocities, and on the theory of viscosity and thermal conduction, in a non-uniform simple monatomic gas”, Philos. Trans. Roy. Soc. London Ser. A, 216:538-548 (1916), 279–348
23.
R. J. DiPerna and P. L. Lions, “On the Cauchy problem for Boltzmann equations: global existence and weak stability”, Ann. of Math. (2), 130:2 (1989), 312–366
24.
W. Duke, “Hyperbolic distribution problems and half-integral weight Maass forms”, Invent. Math., 92:1 (1988), 73–90
25.
A. Dymov and S. Kuksin, “Formal expansions in stochastic model for wave turbulence 1: kinetic limit”, Comm. Math. Phys., 382:2 (2021), 951–1014
26.
R. S. Ellis and M. A. Pinsky, “The first and second fluid approximations to the linearized Boltzmann equation”, J. Math. Pures Appl. (9), 54 (1975), 125–156
27.
D. Enskog, Kinetische Theorie der Vorgänge in mässig verdünnten Gasen, Almqvist & Wiksells, Uppsala, 1917, vi+160 pp.
28.
M. Escobedo and J. J. L. Velázquez, On the theory of weak turbulence for the nonlinear Schrödinger equation, Mem. Amer. Math. Soc., 238, no. 1124, Amer. Math. Soc., Providence, RI, 2015, v+107 pp.
29.
L. Fainsilberg, P. Kurlberg, and B. Wennberg, “Lattice points on circles and discrete velocity models for the Boltzmann equation”, SIAM J. Math. Anal., 37:6 (2006), 1903–1922
30.
A. A. Galeev and V. I. Karpman, “Turbulence theory of a weakly non-equilibrium low-density plasma and structure of shock waves”, Soviet Physics JETP, 17:2 (1963), 403–409
31.
R. Gatignol, Théorie cinétique des gaz à répartition discrète de vitesses, Lecture Notes in Phys., 36, Springer-Verlag, Berlin–New York, 1975, ii+219 pp.
32.
D. Goldstein, B. Sturtevant, and J. E. Broadwell, “Investigations of the motion of discrete-velocity gases”, Rared gas dynamics: theoretical and computational techniques, Progr. Astronaut. Aeronaut., 118, AIAA, Washington, DC, 1989, 100–117
33.
E. P. Golubeva and O. M. Fomenko, “Asymptotic distribution of integral points on the three-dimensional sphere”, J. Soviet Math., 52:3 (1990), 3036–3048
34.
H. Grad, “Principles of the kinetic theory of gases”, Thermodynamik der Gase, Handbuch Phys., 12, Springer-Verlag, Berlin–Göttingen–Heidelberg, 1958, 205–294
35.
E. Grosswald, Representations of integers as sums of squares, Springer-Verlag, New York, 1985, xi+251 pp.
36.
D. Hilbert, “Begründung der kinetischen Gastheorie”, Math. Anal., 72:4 (1912), 562–577
37.
R. Illner and T. Platkowski, “Discrete velocity models of the Boltzmann equation: a survey on the mathematical aspects of the theory”, SIAM Rev., 30:2 (1988), 213–255
38.
H. Iwaniec, “Fourier coefficients of modular forms of half-integral weight”, Invent. Math., 87:2 (1987), 385–401
39.
M. Kac, Probability and related topics in physical sciences (Boulder, CO 1957), Lectures in Appl. Math., Intersci. Publ., London–New York, 1959, xiii+266 pp.
40.
L. D. Landau, “Kinetic equation for the case of Coulomb interaction”, Phys. Z. Sowjetunion, 10 (1936), 154–164
41.
L. D. Landau and E. M. Lifshitz, Course of theoretical physics, v. 1, Pergamon Internat. Library Sci. Tech. Engrg. Social Stud., Mechanics, 3rd ed., Pergamon Press, Oxford–Elmsford, NY, 1981, xxvii+169 pp.
42.
O. E. Lanford III, “Time evolution of large classical systems”, Dynamical systems, theory and applications (Rencontres, Battelle Res. Inst., Seattle, WA 1974), Lecture Notes in Phys., 38, Springer-Verlag, Berlin–New York, 1975, 1–111
43.
E. M. Lifshitz and L. P. Pitaevskii, Course of theoretical physics, v. 10, Pergamon Internat. Library Sci. Tech. Engrg. Social Stud., Physical kinetics, Pergamon Press, Oxford–Elmsford, NY, 1981, xi+452 pp.
44.
Yu. V. Linnik, Ergodic properties of algebraic fields, Ergeb. Math. Grenzgeb., 45, Springer-Verlag New York Inc., New York, 1968, ix+192 pp.
45.
A. M. Lyapunov, The general problem of the stability of motion, Taylor & Francis Group, London, 1992, x+270 pp.
46.
N. B. Maslova and A. N. Firsov, “Solution of the Cauchy problem for the Bolzmann equation. I”, Vestn. Leningrad Univ., 1975, no. 19, 83–88 (Russian)
47.
J. C. Maxwell, “On the dynamical theory of gases”, Philos. Trans. Roy. Soc. London, 157 (1867), 49–88
48.
D. Morgenstern, “General existence and uniqueness proof for spatially homogeneous solutions of the Maxwell–Boltzmann equation in the case of Maxwellian molecules”, Proc. Natl. Acad. Sci. U.S.A., 40:8 (1954), 719–721
49.
L. W. Nordheim, “On the kinetic method in the new statistics and application in the electron theory of conductivity”, Proc. Roy. Soc. London Ser. A, 119:783 (1928), 689–698
50.
A. Palczewski, J. Schneider, and A. V. Bobylev, “A consistency result for a discrete-velocity model of the Boltzmann equation”, SIAM J. Numer. Anal., 34:5 (1997), 1865–1883
51.
V. A. Panferov and
A. G. Heintz, “A new consistent discrete-velocity model for the Boltzmann equation”, Math. Methods Appl. Sci., 25:7 (2002), 571–593
52.
P. Sarnak, Some applications of modular forms, Cambridge Tracts in Math., 99, Cambridge Univ. Press, Cambridge, 1990, x+111 pp.
53.
A. N. Tikhonov, A. B. Vasil'eva, and A. G. Sveshnikov, Differential equations, Springer Ser. Soviet Math., Springer-Verlag, Berlin, 1985, viii+238 pp.
54.
E. A. Uehling and G. E. Uhlenbeck, “Transport phenomena in Einstein–Bose and Fermi–Dirac gases. I”, Phys. Rev. (2), 43:7 (1933), 552–561
55.
S. Ukai, “On the existence of global solutions of mixed problem for non-linear Boltzmann equation”, Proc. Japan Acad., 50:3 (1974), 179–184
56.
V. V. Vedenyapin, “Velocity inductive construction for mixtures”, Transport Theory Statist. Phys., 28:7 (1999), 727–742
57.
V. V. Vedenyapin and Yu. N. Orlov, “Conservation laws for polynomial Hamiltonians and for discrete models of the Boltzmann equation”, Theoret. and Math. Phys., 121:2 (1999), 1516–1523
58.
C. Villani, “A review of mathematical topics in collisional kinetic theory”, Handbook of mathematical fluid dynamics, v. 1, Noth-Holland, Amsterdam, 2002, 71–305
59.
A. A. Vlasov, “Vibration properties of an electron gas”, Zh. Experiment. Teor. Fiz., 8:3 (1938), 291–318 (Russian)
60.
V. E. Zakharov, “A solvable model of weak turbulence”, Prikl. Mech. Tech. Fiz. (USSR), 1965, no. 1, 14–20 (Russian)
61.
E. Zermelo, “Ueber einen Satz der Dynamik und die mechanische Wärmetheorie”, Ann. Phys., 293:3 (1896), 485–494
Citation:
A. V. Bobylev, “Boltzmann-type kinetic equations and discrete models”, Russian Math. Surveys, 79:3 (2024), 459–513