Abstract:
Recently, in a number of papers it was understood that results on sampling discretization and universal sampling discretization can successfully be used in the problem of sampling recovery. Moreover, it turns out that it is sufficient to only have a one-sided discretization inequality for some of these applications. This motivated us to write the present paper as a survey, which includes new results, with the focus on the one-sided discretization inequalities and their applications to sampling recovery. In this sense the paper complements the two existing survey papers on sampling discretization (Russian Math. Surveys, 74:4 (2019), 579–630 and J. Complexity, 71 (2022), 101653, 55 pp.).
Bibliography: 50 titles.
This research was carried out at Lomonosov Moscow State University and supported by the Russian Science Foundation under grant no. 23-71-30001, https://rscf.ru/en/project/23-71-30001/.
There has been a significant progress in the study of the sampling discretization of integral norms for both a fixed finite-dimensional function space and a finite collection of such function spaces (universal discretization). Sampling discretization results turn out to be very useful in various applications, particularly in sampling recovery. For recent results on sampling recovery we refer the reader to [4], [27], [28], [34], [45], [3], [18], [48], [30], [17], [14], [9]–[11], [1], [25], [26], [46], [24], [47], and [40].
The systematic study of sampling discretization of the $L_p$-norms of functions in finite-dimensional subspaces begun in 2017 in [42] and [43]. First results in this direction were obtained in the 1930s and go back to Bernstein, Marcinkiewicz, and Zygmund. At the moment, this area has developed into a vast and actively growing field of research with deep connections with other important areas of research. There are two survey papers [7] and [20] on the topic. The paper [7] covers results on exact weighted discretization, discretization of the uniform norm of multivariate trigonometric polynomials, and some results on universal discretization. The paper [20] covers recent results on sampling discretization and discusses in detail their connections with other areas of research, in particular, with the theory of moments of random vectors, the theory of submatrices with good properties, embedding of finite-dimensional subspaces, sparse approximation, learning theory, and sampling recovery. Recently, in a number of papers it was understood that results on sampling discretization and universal sampling discretization can be used successfully in the problem of sampling recovery. Moreover, it turns out that it is sufficient to only have a one-sided discretization inequality for some of these applications. This motivated us to write the present paper on one-sided discretization inequalities and their applications to sampling recovery. In this sense our paper complements the survey papers [7] and [20].
Let $(\Omega,\mu)$ be a probability space. We consider measurable functions that are defined at each point of $\Omega$, and we do not identify functions which are different on a set of zero measure (see [20], § 2, for a justification). By the $L_p$-norm, $1\leqslant p< \infty$, we understand
and with some abuse of notation we write occasionally $L_\infty(\Omega)$ for the space $\mathcal{B}(\Omega)$ of bounded functions on $\Omega$. In §§ 5 and 6 it is convenient for us to assume that $\Omega$ is a compact subset of $\mathbb{R}^d$ and to consider the space $\mathcal{C}(\Omega)$ of continuous functions instead.
We also consider the discrete space $L_p^m$ of vectors $\mathbf{x}=(x_1,\dots,x_m)\in\mathbb{R}^m$ (or $\mathbb C^m$) with the norm
The previous papers on discretization were focused on two-sided inequalities, which establish that the discrete norm of a sample vector is bounded below and above by the integral $L_p$-norm of the function (multiplied by some constants). Inequalities of this type are also known under the name of Marcinkiewicz–Zygmund inequalities. Let us describe them in more detail.
The Marcinkiewicz-type discretization problem
Let $(\Omega,\mu)$ be a probability space. We say that a linear subspace $X_N$ (here the index $N$ usually denotes the dimension of $X_N$) of $L_p(\Omega,\mu)$, $1\leqslant p < \infty$, admits the Marcinkiewicz-type discretization theorem with parameters $m\in \mathbb{N}$ and $p$ and positive constants $C_1\leqslant C_2$ if there exists a set $\{\xi^j\}_{j=1}^m\subset\Omega$ such that for any $f\in X_N$ we have
Recently it was established that results on sampling discretization and on universal sampling discretization of the square norm are useful in sampling recovery (see [45]) and in sparse sampling recovery (see [9] and [11]) with error measured in the square norm. It was discovered that in some cases it is sufficient to impose a one-sided sampling discretization assumption to prove the corresponding sampling recovery results.
Inequality (1.3) consists of two one-sided inequalities: the left discretization inequality (LDI) and the right discretization inequality (RDI). In this paper we concentrate separately on LDI and on RDI. Clearly, the Bernstein-type discretization problem is an example of a one-sided discretization inequality, namely, an LDI. The RDI in (1.4) is trivial. Note that other names for one-sided inequalities are also used in the literature. For instance, Lubinsky calls RDI forward inequalities and LDI converse inequalities (see [31], for example).
In this paper we focus on more general one-sided discretization inequalities between the integral $L_p$-norm of a function $f$ and the discrete $L_q^m$-norm (or weighted $\ell_q$-norm) of the vector $S(f,\xi)$. We stress that the parameters $p$ and $q$ can be different.
We consider the following settings.
One-sided discretization problems
Given a probability space $(\Omega,\mu)$, a set $X$ of functions defined on $\Omega$, and parameters $1\leqslant p,q\leqslant \infty$, $D>0$, $m\in\mathbb{N}$, we are interested in the following properties.
for some sequence of points $\xi^1,\dots,\xi^m\in\Omega$. We denote this property by $X \in \mathcal{RD}(m,p,q,D)$.
In the case $q=\infty$ the discrete norm in (1.5) or (1.6) is understood as usual; see (1.1).
Note that in the definitions of LDI and RDI we allow some of the points $\xi^1,\dots,\xi^m$ to coincide. By some abuse of notation we will write $\xi = \{\xi^j\}_{j=1}^m$ bearing in mind that some elements of this set can coincide.
The problem we consider here is to find the smallest integer $m\in\mathbb{N}$ such that LDI/RDI hold true for a given $N$-dimensional subspace $X_N$ or for a collection of subspaces $X_N$.
One can formulate LDI/RDI in terms of the sampling operator $S(\,\cdot\,,\xi)$ acting from $L_p$ to $L_q^m$ (see (1.2)). The RDI is just the norm bound $\|S f\|_q \leqslant D\|f\|_p$ for all $f\in X$ and the LDI is the bound $\|f\|_p \leqslant D\|S f\|_q$ for all $f\in X$.
We can write $\mathrm{RDI}(p,q)$ or $\mathrm{LDI}(p,q)$ if we want to emphasize the dependence on parameters. For the sake of brevity in the case $p=q$ we often use just one parameter, for example, $\mathcal{LD}(m,p,D)$ or $\mathrm{RDI}(p)$.
The above inequalities (1.5) and (1.6) are formulated in the case of discretization with equal weights $1/m$ for each sample point $\xi^j$, $j=1,\dots,m$. In the case of general weights (weight $\lambda_j$ for the sample point $\xi^j$, $j=1,\dots,m$) we obtain analogues of LDI and RDI for weighted discretization, which we call WLDI:
The $\mathcal{LD}$ and $\mathcal{RD}$ properties are almost monotone in the number of sampling points in the following sense. If $X\in\mathcal{LD}(m,p,q,D)$, then $X\in\mathcal{LD}(km,p,q,D)$ (each point is repeated $k$ times) for each $k\in\mathbb N$; moreover, $X\in\mathcal{LD}(m',p,q,2^{1/q}D)$ for all $m'>m$. In fact, if $m'\leqslant 2m$, then consider $m$ points ensuring the membership relation $X\in\mathcal{LD}(m,p,q,D)$ and repeat $m'-m$ of them. Then the $l_q^m$-norms of functions do not decrease and the normalizing factor $1/m^{1/q}$ reduces with coefficient $(m'/m)^{1/q}\leqslant 2^{1/q}$; we obtain $X\in\mathcal{LD}(m',p,q,2^{1/q}D)$. The general case reduces to this in view of the first observation.
The same holds for RDI. Hence if we do not care about multiplicative absolute constants, then there is no difference between, say, the property
(in the last expression we even allow the noninteger first parameter $C_1N\log N$ and assume that the number of points is $[C_1N\log N]$).
The structure of the paper
Now we present a brief description of our results. In § 2 we mostly study the right discretization inequality. We focus on lower bounds for $m$ such that $\mathcal{RD}(m,p, q, D)$ is possible. In Proposition 2.1 we discuss the case when $2<p<\infty$ and $1\leqslant q<\infty$ and obtain a lower bound on $m$ for an RDI to hold under certain assumptions on the subspace $X_N$. As a corollary of this lower bound (see Corollary 2.1), we establish that for a subspace $\mathcal{T}(\Lambda_N)$ spanned by a lacunary trigonometric system of size $N$ to have the RDI in the case $2<q<\infty$ it is necessary to use at least $N^{q/2}$ points in the sense of order. We show (see Remark 2.2) that for an LDI to hold for the subspace $\mathcal{T}(\Lambda_N)$ it is sufficient to use points in amount of order $N$. Also, we prove (see Proposition 2.3) that if the sampling operator is injective, then there is no universal bound on $m$ in terms of $N$, which guarantees RDI for all $N$-dimensional subspaces.
In § 3 we discuss LDI. We show that a weighted LDI with non-negative weights implies an LDI (see Proposition 3.1). As a consequence, we obtain (see Proposition 3.2) that for any $N$-dimensional subspace the $\mathrm{LDI}(2)$ holds for the number $m$ of points being of order $N$. We also formulate an open problem.
In § 4 we discuss a connection between sampling discretization problems and known settings on finding submatrices of small size with certain good properties. In particular, we explain that the $\mathrm{RDI}(2)$ with $m=N$ points is a corollary of Lunin’s result on matrices.
In § 5 we discuss the problem of sampling recovery. It is mostly a survey of known results with simple comments on their improvements. Here the case $p\ne q$ is important. For instance, it turns out that some of the sampling recovery results which were proved using the $\mathrm{LDI}(p,p)$ can also be proved under the weaker assumption $\mathrm{LDI}(p,\infty)$, namely,
where $p\in[1,\infty)$ (see Theorem 5.2). Also, we show in § 5 how known results on the discretization of the uniform norm can be combined with a result due to Kiefer and Wolfowitz [22] to obtain results on LDI.
In § 6 we present a brief survey of the very recent results from [9]–[11]. In addition, in the same way as in § 5 we demonstrate that some of those results can be improved by replacing the assumption of an LDI with parameter $p$ by the weaker assumption of an $\mathrm{LDI}(p,\infty)$, $p\geqslant1$; see (1.7).
In § 7 we discuss sampling discretization of the uniform norm. By its nature, sampling discretization of the uniform norm is an LDI problem because the corresponding RDI is trivial in this case. In § 7 we focus on lower bounds on $m$ for the sampling discretization result to hold. Theorem 7.1 is a new result, which extends the previously known result for the trigonometric system to the case of more general systems. Then we discuss an application of Theorem 7.1 to the case when the system in question is a Sidon system.
We often use the concept of Nikol’skii’s inequality. For the reader’s convenience we formulate it here. Other definitions and notations are introduced in the text below.
Nikol’skii-type inequalities
Let $1\leqslant p\leqslant q\leqslant\infty$ and $X_N\subset L_q(\Omega)$. The inequality
is called Nikol’skii’s inequality for the pair $(p,q)$ with constant $M$. We will also use a brief form of this relation, $X_N \in \mathrm{NI}(p,q,M)$. Typically, $M$ depends on $N$, for instance, $M$ can be of order $N^{1/p-1/q}$.
2. Right discretization inequalities (RDI)
In this section we mostly discuss necessary conditions on $m$ for RDI and WRDI to hold. We begin with weighted discretization.
Lemma 2.1. Let $2<p<\infty$ and $1\leqslant q<\infty$, and let $X_N\in\mathrm{NI}(2,p,M)$ for some constant $M$. Assume that $\{\xi^j\}_{j=1}^m\subset \Omega$ and non-negative weights $\{\lambda_j\}_{j=1}^m$ are such that for any $f\in X_N$
Remark 2.1. In Lemma 2.1 we assume that all weights are non-negative. But discretization with negative weights also makes sense. For instance, it was noted in [29] that even for $p=2$ there exist subspaces such that for the minimum number $m$ of points providing for each $f\in X_N$ the equality
(exact discretization) we need some $\lambda_j$, $j=1,\dots, m$, to be negative.
The following proposition is a direct corollary of Lemma 2.1.
Proposition 2.1. Let $2<p<\infty$ and $1\leqslant q<\infty$, and let $X_N\in\mathrm{NI}(2,p,M)$ for some constant $M$. Suppose that $X_N$ has an orthonormal basis $\{u_i\}_{i=1}^N$ with the property
$$
\begin{equation*}
N^{q/2} \leqslant m C(p,q,b) D^q.
\end{equation*}
\notag
$$
Remark 2.2. In contrast to the RDI case, for lacunary polynomials an LDI with $2<p<\infty$ and $2<q<\infty$ can be satisfied by using $m=O(N)$ points. Indeed, by Theorem 1.1 from [42] there are $m \leqslant CN$ points $\xi^1,\dots,\xi^m$ that provide $L_2$-discretization and this immediately implies an LDI:
Proposition 2.2. Let $2<p<\infty$, and let $X_N\in\mathrm{NI}(2,p,M)$ for some constant $M$. Assume that $\{\xi^j\}_{j=1}^m\subset \Omega$, and let $\{\lambda_j\}_{j=1}^m$ be non-negative weights such that for each $f\in X_N$
Now we make a simple observation that in the case of an injective sampling operator $f\mapsto S(f,\xi)$ there is no a priori upper bound for the number of points $m$ that provide an RDI.
Given $a\in(0,1/2]$, we define a function $f_a\in C[0,1]$ by
Proposition 2.3. Let $p,q\in[1,\infty)$, and let $X_N$ be a linear subspace of $C[0,1]$ that contains the functions $f_a$ and $f_{a/2}$ for some $a\in(0,1/2]$. Suppose that a set $\{\xi^j\}_{j=1}^m\subset\Omega$ provides the property $X_N\in\mathcal{RD}(m,p,q,D)$ with injective sampling operator, that is,
Proof. We apply the injectivity of the sampling operator to the function $f_{a/2}$ and obtain that at least one point among the $\xi^j$, $j=1,\dots, m$, lies in the interval $[0,a]$. Therefore, the $\mathrm{RDI}(p,q)$ for $f_a$ implies that
We can take $a$ in (2.5) to be arbitrarily small and obtain arbitrarily large lower bounds for $m$. Hence there is also no upper bound for the number of points $m$ that provide Marcinkiewicz-type discretization (1.3). Note that Proposition 2.3 gives the solution to Open Problem 1 in [20]. In terms of [20] this means that for each $1\leqslant p<\infty$ and any positive constants $C_1$ and $C_2$ we have $sd(N,p,C_1,C_2) =\infty$.
Note that each subspace $X_N$ admits the $\mathrm{RDI}(2,2)$ with $m=N$ points (see Proposition 4.1). This means that the injectivity condition for RDI is essential. The $\mathrm{LDI}(2,2)$ also holds with $m=O(N)$ points for each $X_N$ (see Proposition 3.2). However, we cannot guarantee both an LDI and an RDI with a common set of points of any size bounded by $m(N)$. Otherwise we would obtain an RDI with injective sampling operator, and this would contradict Proposition 2.3.
Proposition 3.2 from [15] implies lower bounds on the number of points for RDI for subspaces with Nikol’skii’s inequality. We refer the reader to [15] for the construction and for the result in the case when $X_N\in \mathrm{NI}(1,\infty, (1+\varepsilon)N)$.
Theorem 2.A ([15]). For any $1\leqslant q<2$ and $N\in\mathbb{N}$ there exists an $N$-dimensional subspace $X_N\subset L_1([0,1])$ with the following properties:
(b) if $m\in\mathbb{N}$ and $t_j, j=1,\dots, m$, are such that $\sum_{j=1}^m|f(t_j)|^q>0$ for all $f\in X_N\setminus \{0\}$, then there exists $f\in X_N$ such that
In particular, it follows from (2.6) that the $\mathrm{RDI}(q)$ with injective sampling operator is not true for $m=o(N^{2-q/2})$.
A comment on WRDI
Let $2\leqslant r<p<\infty$, $X_N\in\mathrm{NI}(2,p,M)$, and let the points $\xi^1,\dots,\xi^m\in\Omega$ and weights $\lambda_1,\dots,\lambda_m\in\mathbb{R}_+$, $\lambda_1+\dots+\lambda_m=1$, be such that
Thus, the $\mathrm{NI}(2,p, M)$ and $\mathrm{WRDI}(p)$ with a constant $D$ imply the $\mathrm{WRDI}(r)$ with the constant $DM$.
3. Left discretization inequalities (LDI)
We will prove that a WLDI with non-negative weights implies an LDI if the sum of weights is bounded above. This extra property of weights is true, for example, when we have the two-sided weighted discretization. We begin with a slightly more general observation than the one in [45].
Remark 3.1. Let $\mathcal F$ be a collection of finite-dimensional subspaces such that if $X_N\in \mathcal F$, then
Suppose that for some $1\leqslant p_1\leqslant p_2<\infty$ each $N$-dimensional subspace $X_N\in\mathcal F$, $N \in \mathbb N$, admits the discretization
with $m\leqslant m(N)$ points and some nonnegative weights $\{\lambda_j\}_{j=1}^m$. Then all $X_N\in \mathcal F$ satisfy (3.1) with $m'\leqslant m(N+1)$ points and non-negative weights $\{\lambda'_j\}_{j=1}^{m'}$ such that $\sum \lambda_j' \leqslant D_2^q$.
Proposition 3.1. Let $X$ be a set of functions in $L_p(\Omega,\mu)$, and let the points $\xi^1,\dots,\xi^m \in\Omega$ and non-negative weights $\lambda_1,\dots,\lambda_m$ be such that $\lambda_1+\dots+\lambda_m\leqslant C$ and
We construct a new set of points. We repeat each point $\xi^j$, $j=1,\dots, m$, $[\lambda_jCm]+1$ times and denote all these points by $\xi_0^j$, $j=1,\dots, m_0$. Then
Let us apply this technique to some known theorems on discretization.
Proposition 3.2. Let $1\leqslant p<\infty$. Then there exist positive constants $C_1(p)$ and $C_2(p)$ such that each $N$-dimensional subspace $X_N \subset L_p(\Omega,\mu)$ satisfies the property $X_N\in\mathcal{LD}(m, p, C_2)$, that is, there is a set of points $\{\xi^j\}_{j=1}^m$ such that
Proof. In the case $p=2$ we use the following result (see [6], Theorem 6.4): there exist three absolute constants $C_1'$, $c_0$, $C_0$, a set of $m\leqslant C_1'N$ points $\xi^1,\dots, \xi^m\in\Omega$, and a set of non-negative weights $\lambda_j$, $j=1,\dots, m$, such that
This WLDI, Remark 3.1, and Proposition 3.1 imply the required LDI.
Other cases are also covered by known discretization results: see [20], D.20. An upper bound, for $p>2$ and [5], Corollaries 1.1 and 1.2, for $1 \leqslant p < 2$. $\Box$
Remark 3.2. Note that in the papers cited discretization results are often formulated for subspaces of continuous functions $X_N\subset C(\Omega)$. However, it is always possible to discretize properly any subspace $X_N\subset L_p$ by using an appropriate amount of points (see, for example, Proposition 4.A) and pass to a subspace of functions defined on a discrete set. These functions are continuous by definition. So, the results we have cited work in the general setting.
We have discovered that Proposition 3.2 for $p=2$, that is, the $\mathrm{LDI}(2,2)$
was also obtained independently in the recent paper [1], Theorem 6.2, by using a different method (see also [26], Lemma 11).
Open Problem 3.1. Under what condition on $m$ does each $N$-dimensional subspace $X_N \subset \mathcal{C}(\Omega)$ have the property $X_N\in\mathcal{LD}(m,p,\infty,D)$, where $2<p<\infty$ and $D=D(p)$:
$$
\begin{equation*}
\|f\|_p \leqslant D \max_{1\leqslant j \leqslant m}|f(\xi^j)|\quad \text{for all }f\in X_N
\end{equation*}
\notag
$$
with an appropriate set of points $\{\xi^j\}_{j=1}^m$?
Comment 3.1. Theorem 1.3 in [19] on the discretization of the uniform norm implies that in Open Problem 3.1 we can always take $m=9^N$, which means an exponential growth in $N$. Theorem 1.2 in [19] shows that for the subspace $\mathcal{T}(\Lambda_N)$ (see Corollary 2.1) we need $(N/e)e^{CN/D^2}$ points for the $\mathrm{LDI}(\infty)$ to hold. Proposition 3.2 for $p=2$ guarantees that we can take $m$ of order $N$ in the case $p\leqslant 2$. From Proposition 3.2 it follows that we can take $m\leqslant C N^{p/2}\log{N}$ for $p>2$ and $m\leqslant CN^{p/2}$ for even $p$.
The following theorem from [26] is the $\mathrm{LDI}(p,2)$.
Theorem 3.1 ([26], Theorem 13). Let $2\leqslant p\leqslant \infty$. Then there exist positive constants $C_1=4$ and $C_2=83$ such that each $N$-dimensional subspace $X_N \subset \mathcal{C}(\Omega)$ satisfies the property $X_N\in\mathcal{LD}(C_1N,p,2,C_2 N^{1/2-1/p})$, that is, there is a set of $m\leqslant C_1N$ points $\{\xi^j\}_{j=1}^m$ such that
Also see some results on LDI in § 5: Theorems 5.C, 5.F, 5.5, and 5.G.
4. Matrix setting
The following proposition from [15] guarantees that for any subspace $X_N$ we can find a sufficiently large number $m_0\in\mathbb{N}$ such that $X_N\in\mathcal{LD}(m_0,p,K_1)$ and $X_N\in\mathcal{RD}(m_0,p,K_2)$.
Proposition 4.A ([15], Proposition 6.1). Let $1\leqslant p<\infty$, and let $X_N\subset L_p(\Omega,\mu)$ be an $N$-dimensional subspace, where $(\Omega, \mu)$ is a probability space. Then for each $\varepsilon>0$ there exists $m_0\in\mathbb{N}$ such that if $M\geqslant m_0$ and $\{\xi^j\}_{j=1}^M\subset\Omega$ are independent random samples, then with probability at least $1-\varepsilon$
By Proposition 4.A the problem of finding the smallest $m$ for which $X_N\in \mathcal{LD}(m,p,D)$ or $X_N\in\mathcal{RD}(m,p,D)$ can be reformulated in the matrix setting. Note that results similar to Proposition 4.A with extra conditions on $X_N$ were obtained in [6], Theorem 2.2.
then an RDI for the space $\widetilde{X}_N$ provides an RDI for the original space.
Therefore, it is enough to study the LDI and RDI problems for discrete systems, which can be formulated in the matrix setting.
It was pointed out in the survey [20], M.1, that in that the right-hand inequality in the Marcinkiewicz-type discretization theorem (which is an RDI) for an arbitrary finite-dimensional subspace of $L_2$ follows from the paper [32] by Lunin. We show below (see Proposition 4.1) how to prove this. Recall that for spaces satisfying Nikol’skii’s inequality (1.8) for $p=2$ and $q=\infty$ both an RDI and an LDI were obtained in [30] by using some deep results from [33].
It is convenient to write the space of sampling vectors in the form
with an appropriate $m_0\times N$ matrix $A$. There is a natural way to find such a matrix. Assume that a system $\{u_i\}_{i=1}^N$ forms a basis of $X_N$. Let $A$ be the matrix with columns $(u_i(\xi^1), \dots, u_i(\xi^{m_0}))^\top$, $i=1,\dots,N$. Then for $f=x_1u_1+\dots+x_Nu_N$, $\mathbf{x}=(x_1,\dots,x_N)\in\mathbb{R}^N$, we have $S(f,\xi) = A\mathbf{x}$.
Let $A_1$ be an $m\times N$ submatrix of $A$ that consists of the rows with some indices $i_1,\dots,i_m$ (here we assume that the $i_j$ are distinct). Then (4.2) takes the following form:
Thus, to obtain an LDI for a discrete system it is sufficient to find a row- submatrix $A_1$ that satisfies inequality (4.4), called a pointwise estimate (see [20], § 3). The only difference between the discretization problem (4.2) and the matrix pointwise estimate (4.4) is that submatrices consist of distinct rows $i_j$ and in discretization we allow the repetition of $i_j$.
Similarly, an RDI for a discrete system is essentially equivalent to the problem of finding a row-submatrix $A_1$ of the smallest possible size for which the following pointwise estimate holds
Proposition 4.1. Let $X_N$ be an $N$-dimensional subspace of $L_2(\Omega,\mu)$, then $X_N \in \mathcal{RD}(N,2,C)$ for an absolute positive constant $C$.
Proof. By Proposition 4.A inequality (4.3) holds for $p=2$, for some $m_0$ and $K_2=2$. Without loss of generality we may assume that $\dim\widetilde{X}_N=N$: see (4.1) (if the dimension is less than $N$, then we obtain an even stronger RDI with fewer points). Let $\widetilde{X}_N = \{A\mathbf{x}\colon \mathbf{x}\in\mathbb{R}^N\}$. One can take a matrix $A$ with columns orthonormal in $L_2^{m_0}$. In order to obtain an RDI we prove (4.5). By Theorem 2 in [32] there is an $N\times N$ submatrix $A_1$ of the matrix $A$ such that
The first results of that type were obtained by Kashin in 1980 for the operator $(2,q)$-norms of matrices in the case $1\leqslant q\leqslant 2$ (see [20], § 3, for details). If we know that (4.6) fails for each $m\times N$ submatrix $A_1$, then (4.5) does not hold. So there is a chance that there will be some negative results on the norms of submatrices that could help one to establish lower bounds on $m$ in discretization.
5. An application to sampling recovery
We begin with a general problem of the recovery of functions in some class and describe some characteristics of optimal recovery.
Recall the notion of the sampling operator (1.2). Given a fixed $m$ and a set of points $\xi^1,\dots,\xi^m\in\Omega$, we associate with a function $f\in\mathcal{C}(\Omega)$ the vector
This recovery procedure is linear; the following modification of it is also of interest. We allow any mapping $\Phi\colon \mathbb{C}^m \to X_m \subset L_p(\Omega,\mu)$, where $X_m$ is a linear subspace of dimension $m$, and set
In both cases above we build an approximant, which comes from a linear subspace of dimension at most $m$. It is natural to compare the quantities $\varrho_m(\mathbf{F},L_p)$ and $\varrho_m^*(\mathbf{F},L_p)$ with Kolmogorov widths of ${\mathbf F}$ in $L_p$:
In the definition of Kolmogorov widths, given $f\in \mathbf{F}$, we take an approximating element of the $m$-dimensional subspace $X_m$ that is an element of best approximation to $f$. This means that, in general (that is, for $p\ne 2$), this method of approximation is nonlinear.
The characteristics $\varrho_m$, $\varrho_m^*$, and their variants are well studied for many particular classes of functions. For an exposition of the known results we refer to the books [49], [35], [13], [44], [36]–[38], and the references there. The characteristics $\varrho_m^*$ and $\varrho_m$ are inspired by the concepts of Kolmogorov width and linear width. Probably, $\varrho_m^*$ was introduced in [12], $\varrho_m$ in [41], and a variant of $\varrho_m^*$ without the condition of mapping to a linear subspace was introduced in [49].
We turn to concrete methods of recovery. Throughout the rest of the section $\Omega$ is a compact subset of $\mathbb{R}^d$; we recover continuous functions $f\in \mathcal{C}(\Omega)$ and let $X_N$ denote an $N$-dimensional subspace of ${\mathcal C}(\Omega)$.
Theorem 5.A below was proved in [45] under the following assumptions.
Condition A1 (discretization). Let $1\leqslant p\leqslant \infty$. Suppose that $\xi:=\{\xi^j\}_{j=1}^m\subset \Omega$ provides the WLDI, that is, for any $u\in X_N$ in the case $p<\infty$ we have
$$
\begin{equation*}
\|u\|_p \leqslant D \|S(u,\xi)\|_{p,{\mathbf w}},
\end{equation*}
\notag
$$
and in the case $p=\infty$ we have
$$
\begin{equation*}
\|u\|_\infty \leqslant D \|S(u,\xi)\|_{\infty}
\end{equation*}
\notag
$$
for some positive constant $D$.
Condition A2 (weights). Suppose that there exists a positive constant $W$ such that $\sum_{j=1}^m w_j \leqslant W$.
Consider the following well-known recovery operator (algorithm):
(see, for instance, [4]). Note that the above algorithm $\ell p\mathbf{w}(\xi)$ only uses the values $f(\xi^j)$, $j=1,\dots,m$, of the function. In the case $p=2$ it is a linear algorithm, that is, an orthogonal projection with respect to the seminorm $\|\,{\cdot}\,\|_{2,\mathbf{w}}$. Therefore, in the case $p=2$ the error of approximation by the algorithm $\ell\,2\mathbf{w}(\xi)$ gives us an upper bound for the recovery characteristic $\varrho_m(\,\cdot\,, L_2)$. In the case $p\ne 2$ the error of approximation by the algorithm $\ell p\mathbf{w}(\xi)$ gives us an upper bound for the recovery characteristic $\varrho_m^*(\,\cdot\,, L_p)$.
Theorem 5.A (see [45], Theorem 2.1). Under Conditions A1 and A2, for any $f\in \mathcal{C}(\Omega)$ and $1\leqslant p<\infty$
We now prove a version of Theorem 5.A for the error $\|f-\ell p\mathbf{w}(\xi)(f)\|_{\infty}$ under the extra condition of the validity of Nikol’skii’s inequality.
Theorem 5.1. Let $1\leqslant p<\infty$. Under Conditions A1 and A2 and the extra assumption $X_N\in \mathrm{NI}(p,\infty,M)$, for any $f\in \mathcal{C}(\Omega)$
Proof. The proof is simple and goes along the lines of the proof of Theorem 5.A. Let $u:= \ell p\mathbf{w}(\xi)(f)$. For an arbitrary $g \in X_N$ we have the following chain of inequalities
Here we prove the following analogue of Theorem 5.A under the weaker assumption (5.1) instead of A1.
Theorem 5.2. Let $p\in [1,\infty)$. Assume that a subspace $X_N\subset \mathcal{C}(\Omega)$ has the property $X_N\in\mathcal{LD}(m,p,\infty,D)$ provided by a set $\xi=\{\xi^j\}_{j=1}^m$: for each $u\in X_N$
Theorems 5.A and 5.2 provide the following inequalities for any compact subset $\mathbf{F} \subset \mathcal{C}(\Omega)$ and any probability measure $\mu$ on $\Omega$:
Here $m$ is such that, in the case of Theorem 5.A, Conditions A1 and A2 and, in the case of Theorem 5.2, condition (5.1) are satisfied for any $N$-dimensional subspace of $\mathcal{C}(\Omega)$. For $p=2$ the following inequality is known (see [45]): there exist two positive absolute constants $b$ and $B$ such that for any compact subset $\Omega$ of $\mathbb{R}^d$, any probability measure $\mu$ on it, and any compact subset $\mathbf{F}$ of $\mathcal{C}(\Omega)$
It is known (see [20], D.20. A lower bound) and it follows from Corollary 2.1 that for the discretization of the $L_p$-norm of functions in an $N$-dimensional subspace, $2<p<\infty$, we need at least $m> CN^{p/2}$ points. Thus, we can expect that using an argument similar to Theorem 5.A will not allow us to obtain $m$ better than of order $N^{p/2}$ in inequality (5.6) for $2<p<\infty$. Condition (5.1) is weaker than the combination of Conditions A1 and A2. Therefore, there is a hope that we can obtain $m$ better than of order $N^{p/2}$ in inequality (5.6) by using Theorem 5.2 if we solve Open Problem 3.1.
Open Problem 5.1. Let $2<p<\infty$. What is the minimal growth of $m(N)$ on $N$ that guarantees the following property: for any compact subset $\mathbf{F} \subset \mathcal{C}(\Omega)$ and any probability measure $\mu$ on $\Omega$
Open Problem 5.2. Let $2<p<\infty$. What is the minimal growth of $m(N)$ on $N$ that guarantees the following property: for any compact subset $\mathbf{F} \subset \mathcal{C}(\Omega)$ and any probability measure $\mu$ on $\Omega$
Note that it is known from [45] (see (5.7)) that in the case $p=2$ (and therefore for all $1\leqslant p\leqslant 2$) one can take $m(N)\leqslant CN$.
Theorem 5.2 and Proposition 3.2 for $p=2$ imply the following unconditional result.
Theorem 5.3. There exist two absolute positive constants $C_1$ and $C_2$ such that for any $N$-dimensional subspace $X_N \subset \mathcal{C}(\Omega)$ there exists a set of points $\{\xi^j\}_{j=1}^m$, $m\leqslant C_1N$, with the following property: for any $f\in \mathcal{C}(\Omega)$
Note that Proposition 3.2 and Theorem 5.A for $p=2$ imply the following result.
Theorem 5.4. There exist absolute positive constants $C_1$ and $C_2$ such that for each $N$-dimensional subspace $X_N \subset \mathcal{C}(\Omega)$ there exists a set of points $\{\xi^j\}_{j=1}^m$, $m\leqslant C_1N$, with the following property: for any $f\in \mathcal{C}(\Omega)$
A comment on sampling recovery in the uniform norm
It is well known that results on sampling discretization in the $L_2$-norm imply some results on sampling discretization in the $L_\infty$-norm. We illustrate this phenomenon using some known examples. Probably, the first example of this type is multivariate trigonometric polynomials. Let $Q$ denote a finite subset of $\mathbb{Z}^d$ and $|Q|$ denote the cardinality of $Q$. Let
Theorem 5.B ([42], Theorem 1.1). There exist three absolute positive constants $C_1$, $C_2$, and $C_3$ with the following properties: for any $d\in \mathbb{N}$ and each $Q\subset \mathbb{Z}^d$ there exists a set of $m \leqslant C_1|Q| $ points $\xi^j\in \mathbb{T}^d$, $j=1,\dots,m$, such that for $f\in \mathcal{T}(Q)$
In [7] it was shown how Theorem 5.B implies a result on sampling discretization of the $L_\infty$-norm. Namely, the following theorem was proved.
Theorem 5.C (see [7], Theorem 2.9). Let two positive absolute constants $C_1$ and $C_2$ be as in Theorem 5.B. Then for any $d\in \mathbb{N}$ and each $Q\subset \mathbb{Z}^d$ there exists a set $\xi$ of $m \leqslant C_1|Q|$ points $\xi^j\in \mathbb{T}^d$, $j=1,\dots,m$, such that for any $f\in \mathcal{T}(Q)$
Proof. For the reader’s convenience we present here the one-line proof from [7]. We use the set of points provided by Theorem 5.B. Then $m\leqslant C_1|Q|$, and for any $f\in\mathcal{T}(Q)$ we have
We point out that in the above proof, in addition to the sampling discretization result — Theorem 5.B — we used Nikol’skii’s inequality $\|f\|_\infty \leqslant |Q|^{1/2}\|f\|_2$.
The following two sampling discretization results show that Nikol’skii’s inequality guarantees good discretization inequalities for general subspaces.
Theorem 5.D (see [30], Theorem 1.1). Let $\Omega\subset \mathbb{R}^d$ be a non-empty set with probability measure $\mu$. Let $X_N$ be a subspace in $\mathrm{NI}(2,\infty, tN^{1/2})$. Then there is an absolute constant $C_1$ such that there exists a set $\{\xi^j\}_{j=1}^m\subset \Omega$ of $m \leqslant C_1 t^2 N$ points with the following property: for any $f\in X_N$
where $C_2$ and $C_3$ are absolute positive constants.
Theorem 5.E (see [30], Theorem 1.2). If $X_N$ is an $N$-dimensional subspace of the complex space $L_2(\Omega,\mu)$, then there exist three absolute positive constants $C_1'$, $c_0'$, and $C_0'$, a set of $m\leqslant C_1'N$ points $\xi^1,\dots, \xi^m\in\Omega$, and a set of non-negative weights $\lambda_j$, $j=1,\dots, m$, such that
Note that Theorem 5.E is a generalization to the complex case of an earlier result from [6] established for the real case. In [19] it was shown how Theorem 5.E implies a result on the sampling discretization of the $L_\infty$-norm. Namely, the following theorem was proved there.
Theorem 5.F ([19], Theorem 6.6). There exist two absolute constants $C_1$ and $C_2$ such that for any subspace $X_N \in \mathrm{NI}(2,\infty, M)$ there exists a set of $m\leqslant C_1N$ points $\xi^1,\dots, \xi^m\in\Omega$ with the following property: for any $f\in X_N$
Note that in the special case when $M=tN^{1/2}$ the proof of Theorem 5.F given in [19] shows that Theorem 5.D implies the following result on simultaneous sampling discretization of the $L_\infty$- and $L_2$-norms.
Theorem 5.5. Let $\Omega\subset \mathbb{R}^d$ be a non-empty set with probability measure $\mu$. Assume that $X_N \in \mathrm{NI}(2,\infty, tN^{1/2})$. Then there exist three absolute positive constants $C_1$, $C_2$, and $C_3$ (which are from Theorem 5.D) such that there exists a set $\{\xi^j\}_{j=1}^m\subset \Omega$ of cardinality $m \leqslant C_1 t^2 N$ with the following property: for any $f\in X_N$
Theorems 5.F and 5.5 are conditional results: they hold under the assumption of the validity of Nikol’skii’s inequality. Recently it was understood how those conditional results can be converted into unconditional ones (see [26]). This progress was based on a result of Kiefer and Wolfowitz [22] which guarantees that for any finite-dimensional subspace $X_N$ of $\mathcal{C}(\Omega)$ there exists a probability measure $\mu$ on $\Omega$ such that for all $f\in X_N$ we have
In other words, for any subspace $X_N$ of $\mathcal{C}(\Omega)$ we have $X_N \in\mathrm{NI}(2,\infty, N^{1/2})$ for some probability measure $\mu$.
It was observed in [26] (see Remark 6 there) that the “Kiefer–Wolfowitz result presents itself as a missing piece” in the known results proved under the assumption of Nikol’skii’s inequality $X_N \in \mathrm{NI}(2,\infty, CN^{1/2})$. The following result was proved in [26].
Theorem 5.G ([26], Theorem 2). There is an absolute constant $C$ such that for any subspace $X_N$ of $\mathcal{C}(\Omega)$ there exists a set $\{\xi^j\}_{j=1}^m\subset \Omega$ of $m \leqslant 2 N$ points with the following property: for any $f\in X_N$
The authors of [26] proved (5.10) using the Kiefer–Wolfowitz result (5.9) (they also proved a novel version of it which is actually the $\mathrm{WLDI}(\infty,2)$) and the $\mathrm{LDI}(2,2)$ (see (3.2)) with $m=2N$ points. They also mention that the $\mathrm{LDI}(\infty,2)$ property (5.10) implies the following $\mathrm{LDI}(\infty,\infty)$:
Note that inequality (5.11) was presented in [23] without proof. It is a direct corollary of Theorem 5.F and the Kiefer–Wolfowitz result.
Remark 5.1. The Kiefer–Wolfowitz result (5.9) makes the first half of Theorem 5.5 (inequalities (5.8)) for $t=1$ unconditional (valid for any $X_N\subset \mathcal{C}(\Omega)$) because (5.8) does not depend on the measure $\mu$. In addition, Theorem 5.5 (for $t=1$) provides the following useful discretization of the $L_2$-norm for $\mu$ satisfying $X_N \in \mathrm{NI}(2,\infty, N^{1/2})$ (in particular, for $\mu$ from (5.9))
Let us now discuss the sampling recovery in the uniform norm.
Inequality (5.11) provides Condition A1 of discretization in the case $p=\infty$ for $D = C_2N^{1/2}$. Therefore, Theorem 5.A implies that for each subspace $X_N$ of $\mathcal{C}(\Omega)$, for any $f\in \mathcal{C}(\Omega)$ we have
Now we apply Theorem 5.1 to $p=2$. We use Theorem 5.E and Remark 3.1 to satisfy Conditions A1 and A2 and use the Kiefer–Wolfowitz result (5.9) to satisfy the assumption of the validity of Nikol’skii’s inequality. As a result, instead of (5.12) we obtain its linear version
Note that inequality (5.13) was formulated in [26]. We point out that inequalities (5.12) and (5.13) are direct corollaries of some known results which are based on the theorems in [30] on the sampling discretization of the $L_2$-norm, and of the Kiefer–Wolfowitz result (5.9).
Remark 5.2. It was pointed out in [23], § 6, that Theorem 5.E and Theorem 5.F hold for $m\leqslant bN$, where $b\in (1,2]$ is arbitrary and $c_0'$, $C_0'$, and $C_2$ are allowed to depend on $b$. This implies that inequalities (5.11), (5.12), and (5.13) hold for each $C_1\in (1,2]$ and $b\in (1,2]$, where $C_2$ is allowed to depend on $C_1$ and $B$ to depend on $b$.
6. Universal discretization and nonlinear recovery
In this section we obtain some results related directly to the very recent results from [9]–[11].
Recall the definition of the concept of $v$-term approximation with respect to a given system $\mathcal{D}=\{g_i\}_{i=1}^\infty$ in a Banach space $X$. Given an integer $v\in\mathbb{N}$, we denote by $\mathcal{X}_v(\mathcal{D})$ the collection of all linear spaces spanned by the $g_j$, $j\in J$, where $J\subset \mathbb{N}$ and $|J|=v$, and we denote by $\Sigma_v(\mathcal{D})$ the set of all $v$-term approximants with respect to $\mathcal{D}$:
$$
\begin{equation*}
\Sigma_v({\mathcal D}):=\bigcup_{V\in\mathcal X_v({\mathcal D})} V.
\end{equation*}
\notag
$$
In this paper we only consider finite systems $\mathcal{D}_N=\{g_i\}_{i=1}^N$. Now we describe results obtained in [10]. The following two algorithms were studied in [10]. Set
Here index $\mathrm{s}$ stands for sample to stress that this algorithm only uses the sample vector $S(f,\xi)$.
Clearly, $\ell p^{\rm s}(\xi,\mathcal{X}_v(\mathcal{D}_N))(f)$ is the best $v$-term approximation of $f$ with respect to $\mathcal{D}_N$ in the space $L_p(\xi)$ with the norm $\|S(f,\xi)\|_p$.
Note that $\ell p^{\rm s}(\xi,\mathcal{X}_v(\mathcal{D}_N))(f)$ may not be unique.
In this paper we discuss Algorithm $\ell p^{\rm s}$ and the following version of Algorithm $\ell p$.
Algorithm $\ell(p,\infty)$. Given a system $\mathcal{D}_N$ and a set of points $\xi:=\{\xi^j\}_{j=1}^m\subset \Omega$, we define the following algorithm:
In [10] the following one-sided universal discretization condition on a family of subspaces was used.
Definition 6.1 (see [10]). Let $1\leqslant p <\infty$. We say that a set $\xi:= \{\xi^j\}_{j=1}^m \subset \Omega$ provides one-sided $L_p$-universal discretization with parameter $D\geqslant 1$ for a collection $\mathcal{X}:= \{X(n)\}_{n=1}^k$ of finite-dimensional linear subspaces $X(n)$ if we have
and the points $\{\xi^j\}_{j=1}^m$ provide the property in question. So we say that the set $\xi$ provides the universal $\mathrm{LDI}(p)$.
In this paper we use the following slightly weaker condition.
Definition 6.2. Let $1\leqslant p <\infty$. We say that a set $\xi:= \{\xi^j\}_{j=1}^m \subset \Omega $ provides the universal $\mathrm{LDI}(p,\infty)$ with parameter $D\geqslant 1$ for a collection $\mathcal{X}:= \{X(n)\}_{n=1}^k$ of finite-dimensional linear subspaces $X(n)$ if $\xi$ provides the property $\bigcup_{n=1}^k X(n)\in \mathcal{LD}(m,p,\infty,D)$, that is,
The following two Lebesgue-type inequalities were proved in [10] for Algorithms $\ell p$ and $\ell p^{\rm s}$. We only formulate those parts of these theorems from [10] which are related to our results.
Theorem 6.A. Let $m$, $v$, and $N$ be natural numbers such that $v\leqslant N$. Let $\mathcal{D}_N\subset \mathcal{C}(\Omega)$ be a system of $N$ elements. Assume that there exists a set $\xi:= \{\xi^j\}_{j=1}^m \subset \Omega$ which provides the universal $\mathrm{LDI}(p)$-property (6.5) for $1\leqslant p<\infty$ for the collection $\mathcal{X}_v(\mathcal{D}_N)$. Then for any function $f \in \mathcal{C}(\Omega)$
Theorem 6.B. Let $m$, $v$, and $N$ be natural numbers such that $2v\leqslant N$. Let $\mathcal{D}_N\subset \mathcal{C}(\Omega)$ be a system of $N$ elements. Assume that there exists a set $\xi:= \{\xi^j\}_{j=1}^m \subset \Omega $ which provides the universal $\mathrm{LDI}(p)$-property (6.5) for $1\leqslant p<\infty$ for the collection $\mathcal{X}_{2v}(\mathcal{D}_N)$. Then for any function $ f \in \mathcal{C}(\Omega)$
We prove two conditional theorems here: Theorem 6.1 for Algorithm $\ell(p,\infty)$ and Theorem 6.2 for Algorithm $\ell\infty^s$.
Theorem 6.1. Let $m$, $v$, and $N$ be natural numbers such that $v\leqslant N$. Let $\mathcal{D}_N\subset \mathcal{C}(\Omega)$ be a system of $N$ elements. Assume that there exists a set $\xi:= \{\xi^j\}_{j=1}^m \subset \Omega$, which provides the universal $\mathrm{LDI}(p,\infty)$ property (6.6) with $1\leqslant p<\infty$ for the collection $\mathcal{X}_v(\mathcal{D}_N)$. Then for any function $f \in \mathcal{C}(\Omega)$ we have
Proof. Suppose that a set $\xi:= \{\xi^j\}_{j=1}^m \subset \Omega$ provides the universal $\mathrm{LDI}(p,\infty)$ for the collection $\mathcal{X}_v(\mathcal{D}_N)$. Then condition (5.1) for the same $D$ holds for all the $X(n)$ in the family $\mathcal{X}_v(\mathcal{D}_N)$. Thus, we can apply Theorem 5.2 with the same set of points $\xi$ to each subspace $X(n)$. This yields the inequality
We prove the following analogue of Theorem 6.1 for Algorithm $\ell\infty^s$ that only uses the values of the function at the points $\xi^1,\dots,\xi^m$.
Theorem 6.2. Let $m$, $v$ and $N$ be natural numbers such that $2v\leqslant N$. Let $\mathcal{D}_N\subset \mathcal{C}(\Omega)$ be a system of $N$ elements. Assume that there exists a set $\xi:= \{\xi^j\}_{j=1}^m \subset \Omega$, that provides the universal $\mathrm{LDI}(p,\infty)$ property (6.6) for $1\leqslant p<\infty$ for the collection $\mathcal{X}_{2v}(\mathcal{D}_N)$. Then for any function $f \in \mathcal{C}(\Omega)$
For brevity set $u:=\ell \infty^{\rm s}(\xi,\mathcal X_v({\mathcal D}_N))(f)$, and let $h$ be the best $L_\infty$-approximation to $f$ from $\Sigma_v(\mathcal{D}_N)$. Then (6.3) implies that
We now discuss an application of Theorem 6.2 to optimal sampling recovery. We allow any mapping $\Psi\colon \mathbb{C}^m \to L_p(\Omega,\mu)$, and for a class of functions $\mathbf{F}\subset \mathcal{C}(\Omega)$, set
Here the index ${\rm o}$ stands for optimal. The following theorem is a direct corollary of Theorem 6.2.
Theorem 6.3. Let $m$, $v$, and $N$ be natural numbers such that $2v\leqslant N$. Let $1\leqslant p<\infty$. Assume that a system $\mathcal{D}_N \subset \mathcal{C}(\Omega)$ is such that there exists a set $\xi:= \{\xi^j\}_{j=1}^m \subset \Omega$ that provides the universal $\mathrm{LDI}(p,\infty)$, (6.6), for the collection $\mathcal{X}_{2v}(\mathcal{D}_N)$. Then for any compact subset $\mathbf{F}$ of $\mathcal{C}(\Omega)$
7. Some remarks on the discretization of the uniform norm
Discretization of the uniform norm is an actively developing area of research; the reader can find recent results in the papers [19], [23], [46], and [26].
Let $\Phi:=\{\varphi_i\}_{i=1}^N$ be a real linearly independent system satisfying the condition
Proof. We use the following result of Gluskin [16]. Here $\langle\mathbf{x}^1,\mathbf{x}^2\rangle=\sum x^1_ix^2_i$ is the usual scalar product and $|\mathbf{x}|=|\langle\mathbf{x},\mathbf{x}\rangle|^{1/2}$ is the Euclidean norm.
Theorem 7.A. Let $Y=\{\mathbf{y}^1,\dots,\mathbf{y}^S\} \subset \mathbb{R}^N$, $|\mathbf{y}^j|=1$, $j=1,\dots,S$, $S\geqslant N$, and
Corollary 7.1. Assume that the system $\Phi$ satisfies (7.6) and $X_N\in\mathcal{LD}(m,\infty,D)$. Then there exists an absolute positive constant $c $ such that
for an absolute constant $C_1$. Applying Theorem 7.1 for $t=B$, $K=C_1\beta^{-1}N^{-1/2}$, and $D$ we complete the proof. $\Box$
The Sidon property is connected with other interesting properties of functional systems. This provides more examples of systems with poor $L_\infty$-discretization.
Example 7.1. For any uniformly bounded orthonormal sub-Gaussian system $\{\varphi_i\}_{i=1}^N$ the space $X_N := \operatorname{span}\{\varphi_i\}_{i=1}^N$ requires an exponential number of points for $L_\infty$-discretization.
Indeed, it was proved in [2], Theorem 1.7, that such a system has a Sidon subsystem of size $cN$.
Recall some probabilistic notation: $\mathsf{E}$ means expectation and $\mathcal N(0,1)$ is the standard normal distribution.
Example 7.2. For any uniformly bounded orthonormal system $\{\varphi_i\}_{i=1}^N$ that is randomly Sidon:
the space $X_N := \operatorname{span}\{\varphi_i\}_{i=1}^N$ requires an exponential number of points for $L_\infty$- discretization.
Indeed, it was proved in [39] that the random Sidon property for $\Phi:=\{\varphi_i\}_{i=1}^N$ implies the sidonicity of the system $\Phi^{(4)} := \{\varphi_i(x^1)\cdots\varphi_i(x^4)\}_{i=1}^N$. If some points $\{\xi^j\}_{j=1}^m$ discretize $\operatorname{span}\Phi$, then the points $\{(\xi^{j_1},\xi^{j_2})\}$ discretize $\operatorname{span}\Phi^{(2)}$:
Further, the points $\{(\xi^{j_1},\xi^{j_2},\xi^{j_3},\xi^{j_4})\}$ discretize the Sidon system $\operatorname{span}\Phi^{(4)}$, so the number of points is exponential in $N$.
The third-named author is grateful to D. Krieg for bringing the paper [22] to his attention.
Bibliography
1.
F. Bartel, M. Schäfer, and T. Ullrich, “Constructive subsampling of finite frames with applications in optimal function recovery”, Appl. Comput. Harmon. Anal., 65 (2023), 209–248
2.
J. Bourgain and M. Lewko, “Sidonicity and variants of Kaczmarz's problem”, Ann. Inst. Fourier (Grenoble), 67:3 (2017), 1321–1352
3.
M. Dolbeault and A. Cohen, “Optimal pointwise sampling for $L^2$ approximation”, J. Complexity, 68 (2022), 101602, 12 pp.
4.
A. Cohen and G. Migliorati, “Optimal weighted least-squares methods”, SMAI J. Comput. Math., 3 (2017), 181–203
5.
F. Dai, E. Kosov, and V. Temlyakov, “Some improved bounds in sampling discretization of integral norms”, J. Funct. Anal., 285:4 (2023), 109951, 40 pp. ; (2022 (v2 – 2023)), 46 pp., arXiv: 2208.09762v1
6.
F. Dai, A. Prymak, A. Shadrin, V. Temlyakov, and S. Tikhonov, “Entropy numbers and Marcinkiewicz-type discretization”, J. Funct. Anal., 281:6 (2021), 109090, 25 pp. ; Entropy numbers and Marcinkiewicz-type discretization theorem, 2020, 27 pp., arXiv: 2001.10636v1
7.
F. Dai, A. Prymak, V. N. Temlyakov, and S. Yu. Tikhonov, “Integral norm discretization and related problems”, Russian Math. Surveys, 74:4 (2019), 579–630; (2018 (v3 – 2019)), 51 pp., arXiv: 1807.01353v1
8.
F. Dai and V. N. Temlyakov, “Sampling discretization of integral norms and its application”, Proc. Steklov Inst. Math., 319 (2022), 97–109
9.
F. Dai and V. Temlyakov, Universal discretization and sparse sampling recovery, 2023, 40 pp., arXiv: 2301.05962
10.
F. Dai and V. Temlyakov, Lebesgue-type inequalities in sparse sampling recovery, 2023, 25 pp., arXiv: 2307.04161v1
11.
F. Dai and V. Temlyakov, “Random points are good for universal discretization”, J. Math. Anal. Appl., 529:1 (2024), 127570, 28 pp. ; (2023), 35 pp., arXiv: 2301.12536v2
12.
Dinh Düng, “Reconstruction and one-sided approximation of periodic functions of several variables”, Soviet Math. Dokl., 42:1 (1991), 68–71
13.
Dinh Dũng, V. Temlyakov, and T. Ullrich, Hyperbolic cross approximation, Adv. Courses Math. CRM Barcelona, Birkhäuser/Springer, Cham, 2018, xi+218 pp. ; 2016 (v3 – 2017), 182 pp., arXiv: 1601.03978v2
14.
M. Dolbeault, D. Krieg, and M. Ullrich, “A sharp upper bound for sampling numbers in $L_2$”, Appl. Comput. Harmon. Anal., 63 (2023), 113–134
15.
D. Freeman and D. Ghoreishi, “Discretizing $L_p$ norms and frame theory”, J. Math. Anal. Appl., 519:2 (2023), 126846, 17 pp.
16.
E. D. Gluskin, “Extremal properties of orthogonal parallelepipeds and their applications to the geometry of Banach spaces”, Math. USSR-Sb., 64:1 (1989), 85–96
17.
T. Jahn, T. Ullrich, and F. Voigtlaender, “Sampling numbers of smoothness classes via $\ell^1$-minimization”, J. Complexity, 79 (2023), 101786, 35 pp. ; (2022 (v3 – 2023)), 26 pp., arXiv: 2212.00445v1
18.
L. Kämmerer, T. Ullrich, and T. Volkmer, “Worst-case recovery guarantees for least squares approximation using random samples”, Constr. Approx., 54:2 (2021), 295–352
19.
B. Kashin, S. Konyagin, and V. Temlyakov, “Sampling discretization of the uniform norm”, Constr. Approx., 57:2 (2023), 663–694
20.
B. Kashin, E. Kosov, I. Limonova, and V. Temlyakov, “Sampling discretization and related problems”, J. Complexity, 71 (2022), 101653, 55 pp.
21.
B. S. Kashin and V. N. Temlyakov, “The volume estimates and their applications”, East J. Approx., 9:4 (2003), 469–485
22.
J. Kiefer and J. Wolfowitz, “The equivalence of two extremum problems”, Canadian J. Math., 12 (1960), 363–366
23.
E. Kosov and V. Temlyakov, “Sampling discretization of the uniform norm and applications”, J. Math. Anal. Appl., 538:2 (2024), 128431, 25 pp. ; (2024 (v1 – 2023)), 36 pp., arXiv: 2306.14207
24.
E. D. Kosov and V. N. Temlyakov, Bounds for the sampling discretization error and
their applications to universal sampling discretization, 2024 (v1 – 2023), 36 pp., arXiv: 2312.05670v2
25.
D. Krieg, K. Pozharska, M. Ullrich, and T. Ullrich, Sampling recovery in $L_2$ and other norms, 2023, 32 pp., arXiv: 2305.07539v3
26.
D. Krieg, K. Pozharska, M. Ullrich, and T. Ullrich, Sampling projections in the uniform norm, 2024, 18 pp., arXiv: 2401.02220
27.
D. Krieg and M. Ullrich, “Function values are enough for $L_2$-approximation”, Found. Comput. Math., 21:4 (2021), 1141–1151
28.
D. Krieg and M. Ullrich, “Function values are enough for $L_2$-approximation: Part II”, J. Complexity, 66 (2021), 101569, 14 pp.
29.
I. V. Limonova, “Exact discretization of the $L_2$-norm with negative weight”, Math. Notes, 110:3 (2021), 458–462
30.
I. Limonova and V. Temlyakov, “On sampling discretization in $L_2$”, J. Math. Anal. Appl., 515:2 (2022), 126457, 14 pp.
31.
D. S. Lubinsky, “Marcinkiewicz–Zygmund inequalities: methods and results”, Recent progress in inequalities, Math. Appl., 430, Kluwer Acad. Publ., Dordrecht, 1998, 213–240
32.
A. A. Lunin, “Operator norms of submatrices”, Math. Notes, 45:3 (1989), 248–252
33.
A. W. Marcus, D. A. Spielman, and N. Srivastava, “Interlacing families II: Mixed characteristic polynomials and the Kadison–Singer problem”, Ann. of Math. (2), 182:1 (2015), 327–350
34.
N. Nagel, M. Schäfer, and T. Ullrich, “A new upper bound for sampling numbers”, Found. Comput. Math., 22:2 (2022), 445–468
35.
E. Novak, Deterministic and stochastic error bounds in numerical analysis, Lecture Notes in Math., 1349, Springer-Verlag, Berlin, 1988, vi+113 pp.
36.
E. Novak and H. Woźniakowski, Tractability of multivariate problems, v. I, EMS Tracts Math., 6, Linear information, Eur. Math. Soc. (EMS), Zürich, 2008, xii+384 pp.
37.
E. Novak and H. Woźniakowski, Tractability of multivariate problems, v. II, EMS Tracts Math., 12, Standard information for functionals, Eur. Math. Soc. (EMS), Zürich, 2010, xviii+657 pp.
38.
E. Novak and H. Woźniakowski, Tractability of multivariate problems, v. III, EMS Tracts Math., 18, Standard information for operators, Eur. Math. Soc. (EMS), Zürich, 2012, xviii+586 pp.
K. Pozharska and T. Ullrich, “A note on sampling recovery of multivariate functions in the uniform norm”, SIAM J. Numer. Anal., 60:3 (2022), 1363–1384
41.
V. N. Temlyakov, “On approximate recovery of functions with bounded mixed derivative”, J. Complexity, 9:1 (1993), 41–59
42.
V. N. Temlyakov, “The Marcinkiewicz-type discretization theorems for the hyperbolic cross polynomials”, Jaen J. Approx., 9:1 (2017), 37–63
43.
V. N. Temlyakov, “The Marcinkiewicz-type discretization theorems”, Constr. Approx., 48:2 (2018), 337–369
44.
V. Temlyakov, Multivariate approximation, Cambridge Monogr. Appl. Comput. Math., 32, Cambridge Univ. Press, Cambridge, 2018, xvi+534 pp.
45.
V. Temlyakov, “On optimal recovery in $L_2$”, J. Complexity, 65 (2021), 101545, 11 pp.
46.
V. N. Temlyakov, “On universal sampling recovery in the uniform norm”, Proc. Steklov Inst. Math., 323 (2023), 206–216
47.
V. Temlyakov, Sparse sampling recovery in integral norms on some function classes, 2024, 25 pp., arXiv: 2401.14670
48.
V. Temlyakov and T. Ullrich, “Bounds on Kolmogorov widths and sampling recovery for classes with small mixed smoothness”, J. Complexity, 67 (2021), 101575, 19 pp.
49.
J. F. Traub, G. W. Wasilkowski, and H. Woźniakowski, Information-based complexity, Comput. Sci. Sci. Comput., Academic Press, Inc., Boston, MA, 1988, xiv+523 pp.
50.
A. Zygmund, Trigonometric series, v. I, 2nd ed., Cambridge Univ. Press, New York, 1959, xii+383 pp.
Citation:
I. V. Limonova, Yu. V. Malykhin, V. N. Temlyakov, “One-sided discretization inequalities and sampling recovery”, Russian Math. Surveys, 79:3 (2024), 515–545