Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

A uniformly continuous function f on an n-dimensional space X of finite volume tends to concentrate near a single value as n approaches infinity, in the sense that the \(\varepsilon\)-extension of some level set has nearly full measure. This phenomenon, which is called the concentration of measure in high dimension, is frequently related to a transitive group of symmetries acting on X. The prototypical example is the case of a 1-Lipschitz function on the unit sphere S n, see [3, 4, 8].

One of the most important consequences of the concentration of measure is the emergence of spectrum, as was discovered in the 1970s by the third named author, see [57]. The idea is that not only does the distinguished level set have a large \(\varepsilon\)-extension in a sense of measure, but one may actually find structured subsets on which the function is nearly constant. When we have a group G acting transitively on X, this structured subset belongs to the orbit \(\{gM_{0}\,;g \in G\}\) where \(M_{0} \subseteq X\) is a fixed subspace. The third named author also noted some connections with Ramsey theory, which were developed in two different directions: by Gromov [2] in the direction of metric geometry, and by Pestov [9, 10] in the unexpected direction of dynamical systems.

The phenomenon of spectrum thus follows from concentration, and it comes as no surprise that most of the results in Analysis which establish spectrum, have appeared as a consequence of concentration. In this note we demonstrate an instance where no concentration of measure is available, but nevertheless a geometrically structured level set arises.

To state our result, consider the standard flat torus \(\mathbb{T}^{n} = \mathbb{R}^{n}/\mathbb{Z}^{n} = (\mathbb{R}/\mathbb{Z})^{n}\), which inherits its Riemannian structure from \(\mathbb{R}^{n}\). We say that \(M \subseteq \mathbb{T}^{n}\) is a coordinate subtorus of dimension k if it is the collection of all n-tuples \((\theta _{j})_{j=1}^{n} \in \mathbb{T}^{n}\) with fixed nk coordinates. Given a manifold X and \(f: X \rightarrow \mathbb{R}\) we denote the oscillation of f along X by

$$\displaystyle{\mbox{ Osc}(f;X) =\sup _{X}f -\inf _{X}f.}$$

Theorem 1.

There is a universal constant c > 0, such that for any \(n \geq 1,0 <\varepsilon \leq 1\) and a function \(f: \mathbb{T}^{n} \rightarrow \mathbb{R}\) which is 1-Lipschitz, there exists a k-dimensional coordinate subtorus \(M \subseteq \mathbb{T}^{n}\) with \(k\,=\,\left \lfloor c \frac{\log n} {\log \log (5n)\,+\,\log \vert \varepsilon \vert }\right \rfloor\) , such that \(\mbox{ Osc}(f;M)\,\leq \,\varepsilon\) .

Note that the collection of all coordinate subtori equals the orbit {gM 0 ; g ∈ G} where \(M_{0} \subseteq \mathbb{T}^{n}\) is any fixed k-dimensional coordinate subtorus, and the group \(G = \mathbb{R}^{n} \rtimes S_{n}\) acts on \(\mathbb{T}^{n}\) by translations and permutations of the coordinates. Theorem 1 is a manifestation of spectrum, yet its proof below is inspired by proofs of the Morrey embedding theorem, and the argument does not follow the usual concentration paradigm. We think that the spectrum phenomenon should be much more widespread, perhaps even more than the concentration phenomenon, and we hope that this note will be a small step towards its recognition.

2 Proof of the Theorem

We write | ⋅ | for the standard Euclidean norm in \(\mathbb{R}^{n}\) and we write log for the natural logarithm. The standard vector fields \(\partial /\partial x_{1},\ldots,\partial /\partial x_{n}\) on \(\mathbb{R}^{n}\) are well-defined also on the quotient \(\mathbb{T}^{n} = \mathbb{R}^{n}/\mathbb{Z}^{n}\). These n vector fields are the coordinate directions on the unit torus \(\mathbb{T}^{n}\). Thus, the partial derivatives \(\partial _{1}f,\ldots,\partial _{n}f\) are well-defined for any smooth function \(f: \mathbb{T}^{n} \rightarrow \mathbb{R}\), and we have \(\vert \nabla f\vert ^{2} =\sum _{ i=1}^{n}(\partial _{i}f)^{2}\). A k-dimensional subspace \(E \subseteq T_{x}\mathbb{T}^{n}\) is a coordinate subspace if it is spanned by k coordinate directions. For \(f: \mathbb{T}^{n} \rightarrow \mathbb{R}\) and \(M \subseteq \mathbb{T}^{n}\) a submanifold, we write ∇ M f for the gradient of the restriction \(f\vert _{M}: M \rightarrow \mathbb{R}\).

Throughout the proof, c, C will always denote universal constants, not necessarily having the same value at each appearance. Since the Riemannian volume of \(\mathbb{T}^{n}\) equals one, Theorem 1 follows from the case α = 1 of the following:

Theorem 2.

There is a universal constant c > 0 with the following property: Let \(n \geq 1,0 <\varepsilon \leq 1,0 <\alpha \leq 1\) and \(1 \leq k \leq c \frac{\log n} {\log \log (5n)+\vert \log \varepsilon \vert +\vert \log \alpha \vert }\) . Let \(f: \mathbb{T}^{n} \rightarrow \mathbb{R}\) be a locally-Lipschitz function such that, for \(p = k(1+\alpha )\) ,

$$\displaystyle{ \int _{\mathbb{T}^{n}}\vert \nabla f\vert ^{p} \leq 1. }$$
(1)

Then there exists a k-dimensional coordinate subtorus \(M \subseteq \mathbb{T}^{n}\) with \(\mbox{ Osc}(f;M) \leq \varepsilon\) .

The essence of the proof is as follows. First, for some large k we find a k-dimensional coordinate subtorus M where the derivative is small on average, in the sense that \(\Big(\int _{M}\vert \nabla _{M}f\vert ^{p}\Big)^{1/p}\) is small. The existence of such a subtorus is a consequence of the observation that at every point, most of the partial derivatives in the coordinate directions are small. We then restrict our attention to this subtorus and take any two points \(\tilde{x},\tilde{y} \in M\). Our goal is to show that \(f(\tilde{x}) - f(\tilde{y}) <\varepsilon\).

To this end we construct a polygonal line from \(\tilde{x}\) to \(\tilde{y}\) which consists of intervals of length 1∕2. For every such interval [x, y] we randomly select a point Z in a (k − 1)-dimensional ball which is orthogonal to the interval [x, y] and is centered at its midpoint. We then show that | f(x) − f(Z) | and | f(y) − f(Z) | are typically small, since | ∇ M f | is small on average along the intervals [x, Z] and [y, Z].

We proceed with a formal proof of Theorem 2, beginning with the following computation:

Lemma 3.

For any \(n \geq 1,0 <\varepsilon \leq 1,0 <\alpha \leq 1\) and \(1 \leq k \leq c \frac{\log n} {\log \log (5n)+\vert \log \varepsilon \vert +\vert \log \alpha \vert }\) , we have that k ≤ n∕2 and

$$\displaystyle{ \Bigg(\frac{2k} {\delta ^{2}n}\Bigg)^{1/p} \leq \sqrt{k}\cdot \delta }$$
(2)

where \(p = (1+\alpha )k\) and

$$\displaystyle{ \delta = \frac{\alpha } {16(1+\alpha )} \cdot \frac{\varepsilon } {k^{3/2}}. }$$
(3)

Proof.

Take \(c = 1/200\). The desired conclusion (2) is equivalent to \(4k^{2-p} \leq \delta ^{2p+4}n^{2}\), which in turn is equivalent to

$$\displaystyle{ 2^{8p+18} \cdot \left (\frac{\alpha +1} {\alpha } \right )^{2p+4} \cdot k^{2p+8} \leq \varepsilon ^{2p+4}n^{2}. }$$
(4)

Since c ≤ 1∕12 and α ≤ 1 we have that \(6p \leq 12k \leq \log n/\vert \log \varepsilon \vert\) and hence \(\varepsilon ^{2p+4}n^{2} \geq \varepsilon ^{6p}n^{2} \geq n\). Since α + 1 ≤ 2 then in order to obtain (4) it suffices to prove

$$\displaystyle{ \left (\frac{32} {\alpha } \cdot k\right )^{2p+8} \leq n. }$$
(5)

Since c ≤ 1∕200 and k ≤ clogn∕(loglog(5n)) then 24klogk ≤ logn. Since \(k \leq c \frac{\log n} {\vert \log \alpha \vert +\log (\log 5)}\) then \(24k\log \left (\frac{32} {\alpha } \right ) \leq \log n\). We conclude that \(12k\log \left (\frac{32} {\alpha } \cdot k\right ) \leq \log n\), and hence

$$\displaystyle{ \left (\frac{32} {\alpha } \cdot k\right )^{12k} \leq n. }$$
(6)

However, \(p = (1+\alpha )k\) and hence 2p + 8 ≤ 12k. Therefore the desired bound (5) follows from (6). Since \(k \leq \frac{1} {2}\log n \leq n/2\), the lemma is proven. □ 

Our standing assumptions for the remainder of the proof of Theorem 2 are that \(n \geq 1,0 <\varepsilon \leq 1,0 <\alpha \leq 1\) and that

$$\displaystyle{ 1 \leq k \leq c \frac{\log n} {\log \log (5n) + \vert \log \varepsilon \vert + \vert \log \alpha \vert } }$$
(7)

where c > 0 is the constant from Lemma 3. We also denote

$$\displaystyle{ p = (1+\alpha )k }$$
(8)

and we write \(e_{1},\ldots,e_{n}\) for the standard n unit vectors in \(\mathbb{R}^{n}\).

Lemma 4.

Let \(v \in \mathbb{R}^{n}\) and let \(J \subseteq \{ 1,\ldots,n\}\) be a random subset of size k, selected uniformly from the collection of all \(\left (\!\!\begin{array}{c} n\\ k \end{array} \!\!\right )\) subsets. Consider the k-dimensional subspace \(E \subseteq \mathbb{R}^{n}\) spanned by {e j ;j ∈ J} and let P E be the orthogonal projection operator onto E in \(\mathbb{R}^{n}\) . Then,

$$\displaystyle{\big(\mathbb{E}\vert P_{E}v\vert ^{p}\big)^{1/p} \leq \frac{\alpha } {8(1+\alpha )} \cdot \frac{\varepsilon } {k} \cdot \vert v\vert.}$$

Proof.

We may assume that \(v = (v_{1},\ldots,v_{n}) \in \mathbb{R}^{n}\) satisfies | v | = 1. Let δ > 0 be defined as in (3). Denote \(I =\{ i;\vert v_{i}\vert \geq \delta \}\). Since | v | = 1, we must have | I | ≤ 1∕δ 2. We claim that

$$\displaystyle{ \mathbb{P}(I \cap J = \varnothing ) \geq 1 -\frac{2k} {\delta ^{2}n}. }$$
(9)

Indeed, if \(\frac{2k} {\delta ^{2}n} \geq 1\) then (9) is obvious. Otherwise, \(\vert I\vert \leq \delta ^{-2} \leq n/2 \leq n - k\) and

$$\displaystyle{\mathbb{P}(I \cap J = \varnothing ) =\prod _{ j=0}^{k-1}\frac{n -\vert I\vert - j} {n - j} \geq \left (1 - \frac{\vert I\vert } {n - k + 1}\right )^{k} \geq \left (1 - \frac{2} {\delta ^{2}n}\right )^{k} \geq 1-\frac{2k} {\delta ^{2}n}.}$$

Thus (9) is proven. Consequently,

$$\displaystyle{\mathbb{E}\vert P_{E}v\vert ^{p} = \mathbb{E}\left (\sum _{ j\in J}v_{j}^{2}\right )^{p/2} \leq \frac{2k} {\delta ^{2}n} + \mathbb{E}\left [1_{\{I\cap J=\varnothing \}}\cdot \left (\sum _{j\in J}v_{j}^{2}\right )^{p/2}\right ] \leq \frac{2k} {\delta ^{2}n} +\Bigg (k \cdot \delta ^{2}\Bigg)^{p/2},}$$

where 1 A equals one if the event A holds true and it vanishes otherwise. By using the inequality \((a + b)^{1/p} \leq a^{1/p} + b^{1/p}\) we obtain

$$\displaystyle{\big(\mathbb{E}\vert P_{E}v\vert ^{p}\big)^{1/p} \leq \Bigg (\frac{2k} {\delta ^{2}n}\Bigg)^{1/p} + \sqrt{k}\cdot \delta \leq 2\sqrt{k}\cdot \delta = \frac{\alpha } {8(1+\alpha )} \cdot \frac{\varepsilon } {k},}$$

where we utilized (3) and Lemma 3. □ 

Corollary 5.

Let \(f: \mathbb{T}^{n} \rightarrow \mathbb{R}\) be a locally-Lipschitz function with \(\int _{\mathbb{T}^{n}}\vert \nabla f\vert ^{p} \leq 1\) . Then there exists a k-dimensional coordinate subtorus \(M \subseteq \mathbb{T}^{n}\) such that

$$\displaystyle{ \left (\int _{M}\vert \nabla _{M}f\vert ^{p}\right )^{1/p} \leq \frac{\alpha } {8(1+\alpha )} \cdot \frac{\varepsilon } {k}. }$$
(10)

Proof.

The set of all coordinate k-dimensional subtori admits a unique probability measure, invariant under translations and coordinate permutations. Let M be a random coordinate k-subtorus, chosen with respect to the uniform distribution. All the tangent spaces \(T_{x}\mathbb{T}^{n}\) are canonically identified with \(\mathbb{R}^{n}\), and we let \(E \subseteq \mathbb{R}^{n}\) denote a random, uniformly chosen k-dimensional coordinate subspace. Then we may write

$$\displaystyle{\mathbb{E}_{M}\int _{M}\vert \nabla _{M}f\vert ^{p} =\int _{ \mathbb{T}^{n}}\mathbb{E}_{E}\vert P_{E}\nabla f\vert ^{p} \leq A^{p}\int _{ \mathbb{T}^{n}}\vert \nabla f\vert ^{p} \leq A^{p},}$$

where \(A = \frac{\alpha } {8(1+\alpha )} \cdot \frac{\varepsilon } {k}\) and we used Lemma 4. It follows that there exists a subtorus M which satisfies (10). □ 

The following lemma is essentially Morrey’s inequality (see [1, Sect. 4.5]).

Lemma 6.

Consider the k-dimensional Euclidean ball \(B(0,R) =\{ x \in \mathbb{R}^{k}\,;\,\vert x\vert \leq R\}\) . Let \(f: B(0,R) \rightarrow \mathbb{R}\) be a locally-Lipschitz function, and let x,y ∈ B(0,R) satisfy \(\vert x - y\vert = 2R\) . Recall that \(p = (1+\alpha )k\) . Then,

$$\displaystyle{ \vert f(x) - f(y)\vert \leq 4\frac{1+\alpha } {\alpha } \cdot k^{ \frac{1} {2(1+\alpha )} } \cdot R^{1-\frac{k} {p} }\left (\int _{B(0,R)}\vert \nabla f(x)\vert ^{p}\mathit{dx}\right )^{1/p}. }$$
(11)

Proof.

We may reduce matters to the case R = 1 by replacing f(x) by f(Rx); note that the right-hand side of (11) is invariant under such replacement. Thus x is a unit vector, and \(y = -x\). Let Z be a random point, distributed uniformly in the (k − 1)-dimensional unit ball

$$\displaystyle{B(0,1) \cap x^{\perp } =\{ v \in \mathbb{R}^{k}\,;\,\vert v\vert \leq 1,\,v \cdot x = 0\},}$$

where v ⋅ x is the standard scalar product of \(x,v \in \mathbb{R}^{k}\). Let us write

$$\displaystyle\begin{array}{rcl} \mathbb{E}\vert f(x) - f(Z)\vert & \leq & \mathbb{E}\vert x - Z\vert \int _{0}^{1}\left \vert \nabla f((1 - t)x + tZ)\right \vert \mathit{dt} \\ & \leq & 2\mathbb{E}\vert \nabla f((1 - T)x + \mathit{TZ})\vert = 2\int _{B(0,1)}\vert \nabla f(z)\vert \rho (z)\mathit{dz},{}\end{array}$$
(12)

where T is a random variable uniformly distributed in [0, 1], independent of Z, and where ρ is the probability density of the random variable \((1 - T)x + \mathit{TZ}\). Then,

$$\displaystyle{\rho ((1 - r)x + \mathit{rz}) = \frac{c_{k}} {r^{k-1}}}$$

when \(z \in B(0,1) \cap x^{\perp },0 <r <1\). We may compute c k as follows:

$$\displaystyle{1 = c_{k}\int _{0}^{1} \frac{1} {r^{k-1}}V _{k-1}(r)\mathit{dr} = c_{k}V _{k-1}(1) = c_{k} \frac{\pi ^{k-1}} {\Gamma \left (\frac{k+1} {2} \right )},}$$

where V k−1(r) is the (k − 1)-dimensional volume of a (k − 1)-dimensional Euclidean ball of radius r. Denote \(q = p/(p - 1)\). Then,

$$\displaystyle{\int _{B(0,1)}\rho ^{q}\,=\,\int _{ 0}^{1}\left ( \frac{c_{k}} {r^{k-1}}\right )^{q}V _{ k-1}(r)\mathit{dr}\,=\, \frac{c_{k}^{q}V _{ k-1}(1)} {(k - 1)(1 - q) + 1}\,=\,\frac{p - 1} {p - k}\left (\frac{\Gamma \left (\frac{k+1} {2} \right )} {\pi ^{k-1}} \right )^{q-1},}$$

and hence

$$\displaystyle\begin{array}{rcl} \left (\int _{B(0,1)}\rho ^{q}\right )^{1/q}& =& \left (\frac{p - 1} {p - k}\right )^{1/q}\left (\frac{\Gamma \left (\frac{k+1} {2} \right )} {\pi ^{k-1}} \right )^{1/p} \\ & \leq & \left (\frac{1+\alpha } {\alpha } \right )^{1/q}\left (\frac{k^{k/2}} {\pi ^{k-1}} \right )^{1/p} \leq \frac{1+\alpha } {\alpha } \cdot k^{ \frac{1} {2(1+\alpha )} }.{}\end{array}$$
(13)

Denote \(C_{\alpha,k} = \frac{1+\alpha } {\alpha } \cdot k^{ \frac{1} {2(1+\alpha )} }\). From (12), (13) and the Hölder inequality,

$$\displaystyle{ \mathbb{E}\vert f(x) - f(Z)\vert \leq 2\left (\int _{B(0,1)}\vert \nabla f\vert ^{p}\right )^{\frac{1} {p} }\left (\int _{B(0,1)}\rho ^{q}\right )^{\frac{1} {q} } \leq 2C_{\alpha,k}\left (\int _{B(0,1)}\vert \nabla f\vert ^{p}\right )^{\frac{1} {p} }. }$$
(14)

A bound similar to (14) also holds for \(\mathbb{E}\vert f(y) - f(Z)\vert\), since \(y = -x\). By the triangle inequality,

$$\displaystyle{ \vert f(x) - f(y)\vert \leq \mathbb{E}\vert f(y) - f(Z)\vert + \mathbb{E}\vert f(Z) - f(x)\vert \leq 4C_{\alpha,k}\left (\int _{B(0,1)}\vert \nabla f\vert ^{p}\right )^{1/p}. }$$

 □ 

Proof of Theorem 2.

According to Corollary 5 we may select a coordinate subtorus \(M = \mathbb{T}^{k}\) so that

$$\displaystyle{ \left (\int _{M}\vert \nabla _{M}f\vert ^{p}\right )^{1/p} \leq \frac{\alpha } {8(1+\alpha )} \cdot \frac{\varepsilon } {k}. }$$
(15)

Given any two points x, y ∈ M, let us show that

$$\displaystyle{ \vert f(x) - f(y)\vert \leq \varepsilon. }$$
(16)

The distance between x and y is at most \(\sqrt{k}/2\). Let us construct a curve, in fact a polygonal line, starting at x and ending at \(y\) which consists of at most \(\sqrt{k} + 1\) intervals of length 1∕2. For instance, we may take all but the last two intervals to be intervals of length 1∕2 lying on a minimizing geodesic between x to y. The last two intervals need to connect two points whose distance is at most 1∕2, and this is easy to do by drawing an isosceles triangle whose base is the segment between these two points.

Let \([x_{j},x_{j+1}]\) be any of the intervals appearing in the polygonal line constructed above. Let \(B \subset \mathbb{T}^{k} = M\) be a geodesic ball of radius \(R = 1/4\) centered at the midpoint of \([x_{j},x_{j+1}]\). This geodesic ball on the torus is isometric to a Euclidean ball of radius \(R = 1/4\) in \(\mathbb{R}^{k}\). Lemma 6 applies, and implies that

$$\displaystyle\begin{array}{rcl} \vert f(x_{j}) - f(x_{j+1})\vert & \leq & 4\frac{1+\alpha } {\alpha } \cdot k^{ \frac{1} {2(1+\alpha )} }\left (\frac{1} {4}\right )^{1-\frac{k} {p} }\left (\int _{B}\vert \nabla _{M}f\vert ^{p}\right )^{\frac{1} {p} } {}\\ & \leq & 4\frac{1+\alpha } {\alpha } \cdot \sqrt{k}\left (\int _{M}\vert \nabla _{M}f\vert ^{p}\right )^{\frac{1} {p} }. {}\\ \end{array}$$

Since the number of intervals in the polygonal line is at most \(\sqrt{ k} + 1 \leq 2\sqrt{k}\), then

$$\displaystyle{\vert f(x) - f(y)\vert \leq \sum _{j}\vert f(x_{j}) - f(x_{j+1})\vert \leq 8\frac{1+\alpha } {\alpha } \cdot k\left (\int _{M}\vert \nabla _{M}f\vert ^{p}\right )^{1/p} \leq \varepsilon,}$$

where we used (15) in the last passage. The points x, y ∈ M were arbitrary, and hence \(\mbox{ Osc}(f;M) \leq \varepsilon\). □ 

Remarks.

  1. 1.

    It is evident from the proof of Theorem 2 that the subtorus M is selected randomly and uniformly over the collection of all k-dimensional coordinate subtori. It is easy to obtain that with probability at least 9∕10, we have that \(Osc(M;f) \leq \varepsilon\).

  2. 2.

    The assumption that f is locally-Lipschitz in Theorem 2 is only used to justify the use of the fundamental theorem of calculus in (12). It is possible to significantly weaken this assumption; it suffices to know that f admits weak derivatives \(\partial _{1}f,\ldots,\partial _{n}f\) and that (1) holds true, see [1, Chap. 4] for more information.

    It is quite surprising that the conclusion of the theorem also holds for non-continuous, unbounded functions, with many singular points, as long as (1) is satisfied in the sense of weak derivatives. The singularities are necessarily of a rather mild type, and a variant of our proof yields a subtorus M on which the function f is necessarily continuous with \(\mbox{ Osc}(f;M) \leq \varepsilon\).

  3. 3.

    Another possible approach to the problem would be along the lines of the proof of the classical concentration theorems—namely, finding an \(\varepsilon\)-net of points in a subtorus, where all the coordinate partial derivatives of the function are small. However, this approach requires some additional a-priori data about the function, such as a uniform bound on the Hessian.

  4. 4.

    We do not know whether the dependence on the dimension in Theorem 1 is optimal. Better estimates may be obtained if the subtorus \(M \subseteq \mathbb{T}^{n}\) is permitted to be an arbitrary k-dimensional rational subtorus, which is not necessarily a coordinate subtorus. Recall that a rational torus is a quotient of \(\mathbb{R}^{n}\) by a lattice which is spanned by n vectors with rational coordinates.