1 Introduction

1.1 Badly approximable vectors.

Given a positive integer n, a vector \({\mathbf{r}} = (r_1 , \dots , r_n)\) is called an n-dimensional weight if \(r_i \ge 0\) for \(i=1,\dots , n\) and

$$\begin{aligned} r_1 + \cdots + r_n = 1. \end{aligned}$$

The weighted version of Dirichlet’s approximation theorem says the following:

Theorem 1.1

(Dirichlet’s Theorem, 1842). For any n-dimensional weight \({\mathbf{r}} = (r_1,\dots , r_n)\), the following statement holds. For any vector \({\mathbf{x}} = (x_1, \dots , x_n) \in {\mathbb {R}}^n\) and any \(N > 1\), there exists an integer vector \({\mathbf{p}} = (p_1, \dots , p_n , q) \in {\mathbb {Z}}^{n+1}\) such that \(0 < |q| \le N\) and

$$\begin{aligned} |q x_i + p_i| \le N^{-r_i}, \quad \text {for } i =1,\dots , n. \end{aligned}$$

This theorem is the starting point of simultaneous Diophantine approximation. Using this theorem, one can easily show the following:

Corollary 1.2

For any n-dimensional weight \({\mathbf{r}} = (r_1, \dots , r_n)\) and any vector \(\mathbf {x}= (x_1, \dots , x_n) \in {\mathbb {R}}^n\), there are infinitely many integer vectors \({\mathbf{p}} = (p_1,\dots , p_n , q)\in {\mathbb {Z}}^{n+1}\) with \(q \ne 0\) satisfying the following:

$$\begin{aligned} |q|^{r_i} |q x_i + p_i| \le 1\quad \text {for} \, i = 1 ,\dots , n. \end{aligned}$$
(1.1)

For almost every vector \(\mathbf {x}\in {\mathbb {R}}^n\), the above corollary remains true if we replace the 1 on the right-hand side of (1.1) with any smaller constant \(c>0\); see [DS70] and [KW08]. The exceptional vectors are called \({\mathbf{r}}\)-weighted badly approximable vectors. We give the formal definition as follows:

Definition 1.3

Given an n-dimensional weight \({\mathbf{r}} = (r_1, \dots , r_n)\), a vector \(\mathbf {x}\in {\mathbb {R}}^n\) is called \({\mathbf{r}}\)-weighted badly approximable if there exists a constant \(c >0\) such that for any \({\mathbf{p}} =(p_1, \dots , p_n, q) \in {\mathbb {Z}}^{n+1}\) with \(q \ne 0\),

$$\begin{aligned} \max _{1 \le i \le n} |q|^{r_i} |q x_i + p_i| \ge c . \end{aligned}$$

For an n-dimensional weight \({\mathbf{r}}\), let us denote the set of \({\mathbf{r}}\)-weighted badly approximable vectors in \({\mathbb {R}}^n\) by \({\mathbf {Bad}}({\mathbf{r}})\). In particular, \({\mathbf {Bad}}(1)\) denotes the set of badly approximable numbers.
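
To make Definition 1.3 concrete, the following minimal Python sketch approximates the quantity \(\inf \max _{1 \le i \le n} |q|^{r_i} |q x_i + p_i|\) with the range of denominators truncated to \(0 < q \le Q\); the function name and the cutoff Q are ours, and such a finite search can only suggest, never certify, membership in \({\mathbf {Bad}}({\mathbf{r}})\).

import math

def weighted_bad_constant(x, r, Q=10**4):
    # Approximate inf over 0 < q <= Q of max_i q**r_i * |q x_i + p_i|, where p_i is the
    # best integer choice, i.e. |q x_i + p_i| equals the distance from q x_i to Z.
    best = float("inf")
    for q in range(1, Q + 1):
        val = max(q ** ri * abs(q * xi - round(q * xi)) for xi, ri in zip(x, r))
        best = min(best, val)
    return best

# The golden ratio is badly approximable for n = 1 (weight (1,)): the value stays bounded
# away from 0, while a rational point gives 0 as soon as q reaches its denominator.
phi = (1 + math.sqrt(5)) / 2
print(weighted_bad_constant((phi,), (1.0,)))
print(weighted_bad_constant((0.5, 0.25), (0.5, 0.5)))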

\({\mathbf {Bad}}({\mathbf{r}})\) is a fundamental object in metric Diophantine approximation. The study of its properties has a long history and has attracted researchers from both number theory and homogeneous dynamics. In view of [KW08], we know that the Lebesgue measure of \({\mathbf {Bad}}({\mathbf{r}})\) is zero. However, it turns out that every \({\mathbf {Bad}}({\mathbf{r}})\) has full Hausdorff dimension, cf. [Jar29], [Sch66], [PV02], [KW10]. The intersections of the sets \({\mathbf {Bad}}({\mathbf{r}})\) for different weights \({\mathbf{r}}\) have been of major interest for several decades. In particular, Wolfgang M. Schmidt conjectured the following:

Conjecture 1.4

(Schmidt’s Conjecture, see [Sch83]). For \(n =2\),

$$\begin{aligned} {\mathbf {Bad}}( 1/3, 2/3) \cap {\mathbf {Bad}}( 2/3, 1/3) \ne \emptyset . \end{aligned}$$

In 2011, Badziahin, Pollington and Velani [BPV11] settled this conjecture by showing the following: for any countable collection of 2-dimensional weights \(\{(i_t, j_t): t \in {\mathbb {N}}\}\), if \(\liminf _{t \rightarrow \infty } \min \{i_t, j_t\} >0\), then

$$\begin{aligned} \dim _H\left( \bigcap _{t =1}^{\infty } {\mathbf {Bad}}( i_t, j_t )\right) = 2, \end{aligned}$$

where \(\dim _H(\cdot )\) denotes the Hausdorff dimension of a set. An ([An13], [An16]) later strengthens their result by removing the condition on the weights. In fact, in [An16], An proves the following much stronger result: for any 2-dimensional weight \((r_1, r_2)\), \({\mathbf {Bad}}(r_1, r_2)\) is \((24\sqrt{2})^{-1}\)-winning. Here a set is called \(\alpha \)-winning if it is a winning set for Schmidt’s \((\alpha , \beta )\)-game for every \(\beta \in (0,1)\). Since countable intersections of \(\alpha \)-winning sets are again \(\alpha \)-winning, this implies that any countable intersection of sets of weighted badly approximable vectors in the plane is \((24\sqrt{2})^{-1}\)-winning. Nesharim and Simmons [NS14] further show that every \({\mathbf {Bad}}(r_1, r_2)\) is hyperplane absolute winning. The reader is referred to [Sch66] for more details on Schmidt’s game and to [BFK+12] for details about hyperplane absolute winning sets.

Badly approximable vectors lying on planar curves are studied by An, Beresnevich and Velani [ABV18]. They prove that for any non-degenerate planar curve \({\mathcal {C}}\) and any weight \((r_1, r_2)\), \({\mathbf {Bad}}(r_1, r_2)\cap {\mathcal {C}}\) is \(\frac{1}{2}\)-winning.

For \(n \ge 3\), the problem turns out to be substantially more difficult. Beresnevich [Ber15] makes the first breakthrough:

Theorem 1.5

(see [Ber15, Corollary 1]). Let \(n \ge 2\) be an integer and \({\mathcal {U}} \subset {\mathbb {R}}^n\) be an analytic and non-degenerate submanifold in \({\mathbb {R}}^n\). Let W be a finite or countable set of n-dimensional weights such that \(\inf _{{\mathbf{r}} \in W}\{ \tau ({\mathbf{r}})\} >0\) where \(\tau (r_1, \dots , r_n) := \min \{ r_i : r_i >0\}\) for an n-dimensional weight \((r_1, \dots , r_n)\). Then

$$\begin{aligned} \dim _H \left( \bigcap _{{\mathbf{r}} \in W} {\mathbf {Bad}}({\mathbf{r}}) \cap {\mathcal {U}} \right) = \dim {\mathcal {U}}. \end{aligned}$$

Remark 1.6

Here a submanifold is called non-degenerate if, at each point, the partial derivatives of a parametrization up to sufficiently high order span the whole space \({\mathbb {R}}^n\). In the setting of analytic submanifolds, this is equivalent to the submanifold not being contained in any affine hyperplane of \({\mathbb {R}}^n\).

1.2 Notation.

In this paper, we will fix the following notation.

For a set \({\mathcal {S}}\), let \(\sharp {\mathcal {S}}\) denote the cardinality of \({\mathcal {S}}\). For a measurable subset \(E \subset {\mathbb {R}}\), let m(E) denote its Lebesgue measure.

For a matrix M, let \(M^{\mathrm {T}}\) denote its transpose. For integer \(k >0\), let \(\mathrm {I}_k\) denote the k-dimensional identity matrix.

Let \(\Vert \cdot \Vert \) denote the supremum norm on \({\mathbb {R}}^n\) and \({\mathbb {R}}^{n+1}\). Let \(\Vert \cdot \Vert _2\) denote the Euclidean norm on \({\mathbb {R}}^n\) and \({\mathbb {R}}^{n+1}\). For \({\mathbf{x}} \in {\mathbb {R}}^{n+1}\) (or \(\in {\mathbb {R}}^n\)) and \(r >0\), let \(B({\mathbf{x}}, r)\) denote the closed ball in \({\mathbb {R}}^{n+1}\) (or \({\mathbb {R}}^n\)) centered at \({\mathbf{x}}\) of radius r, with respect to \(\Vert \cdot \Vert \). For every \(i = 1, \dots , n+1\), there is a natural supremum norm on \(\bigwedge ^i {\mathbb {R}}^{n+1}\). Let us denote it by \(\Vert \cdot \Vert \).

Throughout this paper, when we say that C (or c) is a constant, we always mean a constant depending only on the dimension n. For quantities A and B, let us use \(A \ll B\) to mean that there is a constant \(C>0\) such that \(A \le C B\). Let \(A \asymp B\) mean that \(A \ll B\) and \(B \ll A\). For a quantity A, let O(A) denote a quantity which is \(\ll A\) or a vector whose norm is \(\ll A\).

1.3 Main results.

In this paper, we will strengthen Theorem 1.5 by removing the condition on the weights and by weakening the analyticity assumption on the submanifold to a differentiability assumption.

To simplify the exposition, in this paper, we will focus on the case of curves:

Theorem 1.7

Let \(\varvec{{\varphi }}: I = [a,b] \rightarrow {\mathbb {R}}^n\) be a \(C^n\) differentiable and non-degenerate curve in \({\mathbb {R}}^n\). Let W be a finite or countable set of n-dimensional weights. Then

$$\begin{aligned} \dim _H \left( \bigcap _{{\mathbf{r}} \in W} {\mathbf {Bad}}({\mathbf{r}}) \cap \varvec{{\varphi }}(I) \right) = 1. \end{aligned}$$

The proof for curves applies directly to any \(C^n\) non-degenerate manifold; see Sect. 5.5 for a detailed explanation. Therefore, Theorem 1.7 holds for any \(C^n\) non-degenerate manifold. In Theorem 1.5, the analyticity condition comes from a fiber lemma (cf. [Ber15, “Appendix C”]) which reduces the general case to the case of curves.

In fact, we can prove the following stronger statement:

Theorem 1.8

Let W be a finite or countable set of n-dimensional weights and \({\mathcal {F}}_n(B)\) be a finite family of \(C^n\) differentiable non-degenerate maps \(\varvec{{\varphi }}: [0,1] \rightarrow {\mathbb {R}}^n\). Then

$$\begin{aligned} \dim _H \left( \bigcap _{\varvec{{\varphi }}\in {\mathcal {F}}_n(B)} \bigcap _{{\mathbf{r}} \in W} \varvec{{\varphi }}^{-1}({\mathbf {Bad}}({\mathbf{r}}))\right) = 1. \end{aligned}$$

For the same reason as above, this statement holds when [0, 1] is replaced by an m-dimensional ball \(B \subset {\mathbb {R}}^m\) for any \(m \le n\).

Compared with [Ber15], in this paper, we study this problem through homogeneous dynamics and prove Theorems 1.7 and 1.8 using the linearization technique.

1.4 Bounded orbits in homogeneous spaces.

Let us briefly recall the correspondence between Diophantine approximation and homogeneous dynamics. The reader may see [Dan84], [KM98], [KW08] for more details.

Let \(G = \mathrm {SL}(n+1, {\mathbb {R}})\), and \(\Gamma = \mathrm {SL}(n+1, {\mathbb {Z}})\). The homogeneous space \(X = G/\Gamma \) can be identified with the space of unimodular lattices in \({\mathbb {R}}^{n+1}\). For any \(g \in \mathrm {SL}(n+1, {\mathbb {R}})\), the point \(g\Gamma \) is identified with the lattice \(g{\mathbb {Z}}^{n+1}\). For \(\epsilon >0\), let us define

$$\begin{aligned} K_{\epsilon }:=\left\{ \Lambda \in X: \Lambda \cap B({\mathbf{0}}, \epsilon ) = \{{\mathbf{0}}\}\right\} . \end{aligned}$$
(1.2)

By Mahler’s compactness criterion [Mah46], every \(K_{\epsilon }\) is a compact subset of X and every compact subset of X is contained in some \(K_{\epsilon }\).
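
The following small Python sketch tests the defining condition of \(K_{\epsilon }\) in (1.2) for a lattice \(g{\mathbb {Z}}^{n+1}\) by brute force; the search over integer coefficient vectors is truncated to a finite box (a parameter of ours), so a positive answer is only reliable up to that box.

import itertools
import numpy as np

def in_K_eps(g, eps, box=5):
    # Decide (within the box) whether g Z^{n+1} has no nonzero vector of sup norm <= eps.
    dim = g.shape[0]
    for a in itertools.product(range(-box, box + 1), repeat=dim):
        if any(a) and np.max(np.abs(g @ np.array(a, dtype=float))) <= eps:
            return False
    return True

g = np.diag([4.0, 0.5, 0.5])            # unimodular: the determinant is 1
print(in_K_eps(g, eps=0.6))             # False: the vector (0, 1, 0) maps to a vector of norm 0.5
print(in_K_eps(np.eye(3), eps=0.6))     # True: the shortest nonzero vector has norm 1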

For a weight \({\mathbf{r}}=(r_1, \dots , r_n)\), let us define the diagonal subgroup \(A_{{\mathbf{r}}} \subset G\) as follows:

$$\begin{aligned} A_{{\mathbf{r}}} := \left\{ a_{{\mathbf{r}}}(t) := \begin{bmatrix} e^{r_1 t}&~&~&~ \\ ~&\ddots&~&~ \\ ~&~&e^{r_n t}&~ \\ ~&~&~&e^{-t}\end{bmatrix}: t \in {\mathbb {R}}\right\} . \end{aligned}$$

For \(\mathbf {x}\in {\mathbb {R}}^n\), let us denote

$$\begin{aligned} V(\mathbf {x}) := \begin{bmatrix} \mathrm {I}_n&\mathbf {x}\\ ~&1 \end{bmatrix}. \end{aligned}$$

Proposition 1.9

([Kle98, Theorem 1.5]). \(\mathbf {x}\in {\mathbf {Bad}}({\mathbf{r}})\) if and only if \(\{a_{{\mathbf{r}}}(t)V(\mathbf {x}){\mathbb {Z}}^{n+1}: t >0\}\) is bounded.

Therefore our main theorem is equivalent to saying that for any \(C^n\) non-degenerate submanifold \({\mathcal {U}} \subset {\mathbb {R}}^n\) and any countable collection of one-parameter diagonal subgroups \(\{A_{{\mathbf{r}}_s}: s \in {\mathbb {N}}\}\), the set of \(\mathbf {x}\in {\mathcal {U}}\) such that

$$\begin{aligned} \{a_{{\mathbf{r}}_s}(t)V(\mathbf {x}){\mathbb {Z}}^{n+1}: t>0\} \end{aligned}$$

is bounded for all \(s \in {\mathbb {N}}\) has full Hausdorff dimension.
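
The following Python sketch illustrates this correspondence numerically for \(n = 2\): it follows the lattice \(a_{{\mathbf{r}}}(t)V(\mathbf {x}){\mathbb {Z}}^{n+1}\) and records the length of its shortest nonzero vector, approximated by a brute-force search over integer vectors in a box. The sample points, box size and time range are our choices, and the reported values are only upper bounds for the true shortest length, so this is an illustration rather than a test.

import numpy as np

def orbit_profile(x, r, times, box=40):
    n = len(x)
    V = np.eye(n + 1)
    V[:n, n] = x                                   # the matrix V(x) defined above
    rng = np.arange(-box, box + 1)
    grids = np.meshgrid(*([rng] * (n + 1)), indexing="ij")
    pts = np.stack([grid.ravel() for grid in grids], axis=1).astype(float)
    pts = pts[np.any(pts != 0, axis=1)]            # nonzero integer vectors in the box
    profile = []
    for t in times:
        a_t = np.diag([np.exp(ri * t) for ri in r] + [np.exp(-t)])
        images = pts @ (a_t @ V).T
        profile.append(np.min(np.max(np.abs(images), axis=1)))   # sup norm
    return profile

times = np.linspace(0.0, 4.0, 9)
# (2^{1/3}, 4^{1/3}) is a classical algebraic candidate expected to stay bounded away
# from 0, while (1/2, 1/4) produces a lattice vector decaying like e^{-t}.
print(orbit_profile([2 ** (1 / 3), 4 ** (1 / 3)], (0.5, 0.5), times))
print(orbit_profile([0.5, 0.25], (0.5, 0.5), times))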

The study of bounded trajectories for the action of diagonal subgroups on homogeneous spaces is a fundamental topic in homogeneous dynamics and has been active for decades. The basic setup of this type of problem is the following. Let G be a Lie group and \(\Gamma \subset G\) a nonuniform lattice in G. Then \(X = G/\Gamma \) is a noncompact homogeneous space. Let \(A = \{ a(t) : t \in {\mathbb {R}}\}\) be a one-parameter diagonalizable subgroup and let \({\mathbf {Bd}}(A)\) be the set of \(x \in X\) such that \(A^{+}x\) is bounded in X, where \(A^+ := \{a(t): t >0\}\). One can then ask whether \({\mathbf {Bd}}(A)\) has full Hausdorff dimension. For a submanifold \({\mathcal {U}} \subset X \), one can also ask whether \({\mathbf {Bd}}(A)\cap {\mathcal {U}}\) has Hausdorff dimension \(\dim {\mathcal {U}}\).

In 1986, Dani [Dan86] studies the case where G is a semisimple Lie group with \({\mathbb {R}}\)-rank one. In this case, he proves that for any non-quasi-unipotent one-parameter subgroup \(A \subset G\), \({\mathbf {Bd}}(A)\) has full Hausdorff dimension. His proof relies on Schmidt’s game. In 1996, Kleinbock and Margulis [KM96] study the case where G is a semisimple Lie group and \(\Gamma \) is an irreducible lattice in G. In this case, they prove that \({\mathbf {Bd}}(A)\) has full Hausdorff dimension for any non-quasi-unipotent subgroup A. Their proof is based on the mixing property of the action of A on X. Recently, An, Guan and Kleinbock study the case where \(G = \mathrm {SL}(3,{\mathbb {R}})\) and \(\Gamma = \mathrm {SL}(3,{\mathbb {Z}})\). They prove that for any countable collection of diagonalizable one-parameter subgroups \(\{F_s : s \in {\mathbb {N}}\}\), the intersection \(\bigcap _{s =1 }^{\infty } {\mathbf {Bd}}(F_s)\) has full Hausdorff dimension. Their proof closely follows the argument in the work of An [An16] and uses a variant of Schmidt’s game.

1.5 The linearization technique.

In [Ber15], the proof relies on the geometry of numbers. In this paper, we study the problem through homogeneous dynamics and tackle the technical difficulties using the linearization technique. It turns out that in order to get full Hausdorff dimension, it is crucial to study the distribution of long pieces of unipotent orbits in the homogeneous space \(G/\Gamma \). To be specific, for a particular long piece C of a unipotent orbit, we need to estimate the length of the part of C staying outside a large compact subset K of \(G/\Gamma \). In homogeneous dynamics, the standard tool for this type of problem is the linearization technique: it transforms a problem about dynamical systems into a problem about linear representations, which can then be studied using tools and results from representation theory.

Let us briefly describe the technical difficulty that arises when we apply the linearization technique. Let \({\mathcal {V}}\) be a finite dimensional linear representation of \(\mathrm {SL}(n+1,{\mathbb {R}})\) with a norm \(\Vert \cdot \Vert \) and let \(\Gamma ({\mathcal {V}}) \subset {\mathcal {V}}\) be a fixed discrete subset of \({\mathcal {V}}\). Let \(U = \{u(r): r \in {\mathbb {R}}\}\) be a one-parameter unipotent subgroup of G. Given a large number \( T > 1\), we want to estimate the measure of the set of \(r \in [ - T, T]\) for which there exists \(v \in \Gamma ({\mathcal {V}})\) with \(\Vert u(r)v\Vert \le \epsilon \), where \(\epsilon >0\) is a small number. By the Dani-Margulis non-divergence theorem (see [DM92]), this measure is very small compared with T, provided that for any such \(v \in \Gamma ({\mathcal {V}})\)

$$\begin{aligned} \max \{ \Vert u(r)v\Vert : r \in [-T, T] \} \ge \rho \end{aligned}$$

where \(\rho >0\) is some fixed number. The difficulty is to handle the case where there exists some \(v \in \Gamma ({\mathcal {V}})\), such that

$$\begin{aligned} \max \{ \Vert u(r)v\Vert : r \in [-T, T] \} < \rho . \end{aligned}$$

Let us call such intervals T-bad intervals. In this paper, we will use representation theory to study properties of such v’s. We then use these properties to show that in a longer interval, say \([-T^2, T^2]\), the number of T-bad intervals is \(\ll T^{1-\mu }\) for some constant \(\mu >0\). This result is sufficient to prove Theorem 1.7.
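
The mechanism behind such non-divergence estimates is that the coordinate functions \(r \mapsto u(r)v\) are polynomials of bounded degree, and a polynomial which becomes large somewhere on an interval can be small only on a short subset. The following toy Python computation (our own example; the constant in the bound is chosen generously and is not the sharp one from [DM92]) illustrates this phenomenon for a single cubic polynomial.

import numpy as np

T, eps = 10.0, 0.01
grid = np.linspace(-T, T, 2_000_001)
p = (grid - 1.0) * (grid + 2.0) * (grid - 5.0)       # a cubic, standing in for a matrix coefficient
measure = np.mean(np.abs(p) <= eps) * 2 * T           # Riemann estimate of m{r : |p(r)| <= eps}
bound = 3 * 2 * T * (eps / np.max(np.abs(p))) ** (1 / 3)
print(measure, bound, measure <= bound)               # the sublevel set has measure much smaller than T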

In this paper, \({\mathcal {V}}\) is the canonical representation of \(\mathrm {SL}(n+1, {\mathbb {R}})\) on \(\bigwedge ^i {\mathbb {R}}^{n+1}\) and \(\Gamma ({\mathcal {V}}) = \bigwedge ^i {\mathbb {Z}}^{n+1} \setminus \{{\mathbf{0}}\}\) where \(i = 1, \dots , n\).

The main technical results in this paper are proved in Sects. 4, 5.3 and 5.4.

We refer the reader to [Rat91], [MT94], [MS95], [Sha09b], [Sha09a], [LM14] for more applications of the linearization technique.

1.6 The organization of the paper.

The paper is organized as follows:

  • In Sect. 2, we will recall some basic facts on Diophantine approximation, linear representations and lattices in \({\mathbb {R}}^{n+1}\).

  • In Sect. 3, we will recall a theorem on computing the Hausdorff dimension of Cantor-like sets. We will also construct a Cantor-like covering of the set of weighted badly approximable points.

  • In Sect. 4, we will prove two technical results on counting lattice points. Proposition 4.1 is one of the main technical contributions of this paper. Its proof relies on the linearization technique and \(\mathrm {SL}(n+1, {\mathbb {R}})\) representations.

  • In Sect. 5, we will give the proof of Proposition 3.7, which implies Theorems 3.5, 1.7 and 1.8. We split the proof into three parts: the generic case, the dangerous case and the extremely dangerous case. Section 5.2 handles the generic case. The proof relies on the Dani-Margulis non-divergence theorem (Theorem 5.1). Section 5.3 handles the dangerous case. The proof relies on Proposition 4.1 proved in Sect. 4 and the linearization technique. Section 5.4 handles the extremely dangerous case. The proof relies on Proposition 4.2 proved in Sect. 4 and the linearization technique. Finally, we will explain how to adapt the proof to handle general \(C^n\) non-degenerate manifolds.

2 Preliminaries

2.1 Dual form of approximation.

We first recall the following equivalent definition of \({\mathbf {Bad}}({\mathbf{r}})\):

Lemma 2.1

(see [Ber15, Lemma 1]). Let \({\mathbf{r}} = (r_1, \dots , r_n) \in {\mathbb {R}}^n\) be a weight and \(\mathbf {x}\in {\mathbb {R}}^n\). The following statements are equivalent:

  1. (1)

    \(\mathbf {x}\in {\mathbf {Bad}}({\mathbf{r}})\).

  2. (2)

    There exists \(c >0\) such that for any integer vector \((p_1, \dots , p_n ,q)\) such that \(q \ne 0\), we have that

    $$\begin{aligned} \max _{1 \le i \le n} |q|^{r_i}|q x_i + p_i| \ge c. \end{aligned}$$
  3. (3)

    There exists \(c >0\) such that for any \(N \ge 1\), the only integer solution \((a_0, a_1, \dots , a_n)\) to the system

    $$\begin{aligned} |a_0 + a_1 x_1 + \cdots + a_n x_n|< c N^{-1}, \qquad |a_i| < N^{r_i} \text { for all } 1 \le i \le n \end{aligned}$$

    is \(a_0 = a_1 = \cdots = a_n =0\).

Proof

The reader is referred to [Mah39], [BPV11, “Appendix”] and [Ber15, “Appendix A”] for the proof. \(\square \)

Later in this paper we will use the third statement as the definition of \({\mathbf {Bad}}({\mathbf{r}})\).
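
To make condition (3) concrete, the following brute-force Python sketch searches for a nonzero integer solution \((a_0, a_1, \dots , a_n)\) for given \(\mathbf {x}\), \({\mathbf{r}}\), c and N; the function name and the parameter values in the example are ours, and the point \((2^{1/3}, 4^{1/3})\) is a classical algebraic example which is expected to admit no solution at this scale.

import itertools

def dual_solution_exists(x, r, c, N):
    # Search for a nonzero integer vector (a_0, a_1, ..., a_n) with
    # |a_0 + a_1 x_1 + ... + a_n x_n| < c/N and |a_i| < N**r_i for 1 <= i <= n.
    bounds = [int(N ** ri) for ri in r]
    for a in itertools.product(*[range(-bb, bb + 1) for bb in bounds]):
        s = sum(ai * xi for ai, xi in zip(a, x))
        a0 = -round(s)                 # the only integer that can make the linear form small
        if (a0, *a) != (0,) * (len(x) + 1) and abs(a0 + s) < c / N:
            return True
    return False

print(dual_solution_exists((2 ** (1 / 3), 4 ** (1 / 3)), (0.5, 0.5), c=0.01, N=50))   # False
print(dual_solution_exists((0.5, 0.25), (0.5, 0.5), c=0.01, N=50))                    # True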

Given a weight \({\mathbf{r}}=(r_1, \dots , r_n)\), let us define

$$\begin{aligned} D_{{\mathbf{r}}} := \left\{ d_{{\mathbf{r}}}(t) := \begin{bmatrix} e^{t}&~&~&~ \\ ~&e^{-r_1 t}&~&~ \\ ~&~&\ddots&~ \\ ~&~&~&e^{-r_n t} \end{bmatrix}: t \in {\mathbb {R}}\right\} . \end{aligned}$$

For \(\mathbf {x}\in {\mathbb {R}}^n\), let us define

$$\begin{aligned} U(\mathbf {x}) := \begin{bmatrix} 1&\mathbf {x}^{\mathrm {T}} \\ ~&\mathrm {I}_n \end{bmatrix}. \end{aligned}$$

If we use the third statement in Lemma 2.1 as the definition of \({\mathbf {Bad}}({\mathbf{r}})\), then in view of [Kle98, Theorem 1.5] we have that \(\mathbf {x}\in {\mathbf {Bad}}({\mathbf{r}})\) if and only if \(U(\mathbf {x}){\mathbb {Z}}^{n+1} \in {\mathbf {Bd}}(D_{{\mathbf{r}}})\).

2.2 The canonical representation.

Let \(V = {\mathbb {R}}^{n+1}\). Let us consider the canonical representation of \(G = \mathrm {SL}(n+1, {\mathbb {R}})\) on V: \(g \in G\) acts on \(v \in V\) by left matrix multiplication. It induces a canonical representation of G on \(\bigwedge ^i V\) for every \(i=1,2,\dots , n\). For \(g \in G\) and

$$\begin{aligned} {\mathbf{v}} = {\mathbf{v}}_1 \wedge \cdots \wedge {\mathbf{v}}_i \in \bigwedge \nolimits ^i V, \end{aligned}$$

\(g {\mathbf{v}} = (g {\mathbf{v}}_1) \wedge \cdots \wedge (g {\mathbf{v}}_i).\)
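
Concretely, if we identify \(\bigwedge ^i V\) with \({\mathbb {R}}^{\binom{n+1}{i}}\) via Plücker coordinates (the \(i \times i\) minors of the matrix whose columns are \({\mathbf{v}}_1, \dots , {\mathbf{v}}_i\)), then the induced action of g is given by the matrix of \(i \times i\) minors of g; this is the Cauchy-Binet formula. A short numerical sketch in Python (the helper names and the random example are ours):

import itertools
import numpy as np

def wedge_coords(vectors):
    # Pluecker coordinates of v_1 ^ ... ^ v_i: the i x i minors of [v_1 ... v_i],
    # indexed by increasing i-element subsets of the rows.
    M = np.column_stack(vectors)
    d, i = M.shape
    return np.array([np.linalg.det(M[list(rows), :])
                     for rows in itertools.combinations(range(d), i)])

def induced_matrix(g, i):
    # The matrix through which g acts on the i-th wedge power (Cauchy-Binet).
    subsets = list(itertools.combinations(range(g.shape[0]), i))
    return np.array([[np.linalg.det(g[np.ix_(I, J)]) for J in subsets] for I in subsets])

rng = np.random.default_rng(0)
g = rng.standard_normal((4, 4))
if np.linalg.det(g) < 0:
    g[0] *= -1                              # make the determinant positive
g /= np.linalg.det(g) ** 0.25               # normalize into SL(4, R)
v1, v2 = rng.standard_normal(4), rng.standard_normal(4)
print(np.allclose(wedge_coords([g @ v1, g @ v2]),
                  induced_matrix(g, 2) @ wedge_coords([v1, v2])))   # True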

For \(i =1 , \dots , n\), let \({\mathbf{e}}_i \in {\mathbb {R}}^n\) denote the vector with 1 in the ith component and 0 in other components.

Let us fix a basis for V as follows. Let \({\mathbf{w}}_{+} := ( 1, 0, \dots , 0)\). For \(i = 1, \dots , n\), let \({\mathbf{w}}_i := (0, \dots , 1, \dots , 0)\) with 1 in the \((i+1)\)st component and 0 in other components. Then \(\{{\mathbf{w}}_+, {\mathbf{w}}_1, \dots , {\mathbf{w}}_n\}\) is a basis for V. Let W denote the subspace of V spanned by \(\{{\mathbf{w}}_1, \dots , {\mathbf{w}}_n\}\). For \(j= 2, \dots , n\), let \(W_j\) denote the subspace of W spanned by \(\{{\mathbf{w}}_j, \dots , {\mathbf{w}}_n\}\).

Let us define

$$\begin{aligned} Z : = \left\{ z(\mathfrak {k}) := \begin{bmatrix} 1&~ \\ ~&\mathfrak {k} \end{bmatrix}: \mathfrak {k} \in \mathrm {SO}(n) \right\} . \end{aligned}$$
(2.1)

Let us consider the canonical action of \(\mathrm {SO}(n)\) on \({\mathbb {R}}^n\). For \(\mathfrak {k} \in \mathrm {SO}(n)\) and \({\mathbf{x}} \in {\mathbb {R}}^n\), let us denote by \(\mathfrak {k} \cdot {\mathbf{x}}\) the canonical action of \(\mathfrak {k}\) on \({\mathbf{x}}\). It is straightforward to check that for \( \mathfrak {k} \in \mathrm {SO}(n)\) and \(\mathbf {x}\in {\mathbb {R}}^n\),

$$\begin{aligned} z(\mathfrak {k}) U(\mathbf {x}) z^{-1}(\mathfrak {k}) = U(\mathfrak {k}\cdot \mathbf {x}). \end{aligned}$$
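
A quick numerical check of this identity in Python (with \(n = 3\); the random rotation and the helper names are ours):

import numpy as np

n = 3
rng = np.random.default_rng(1)
k, _ = np.linalg.qr(rng.standard_normal((n, n)))
if np.linalg.det(k) < 0:
    k[:, 0] *= -1                          # force k into SO(n)

def U(x):
    M = np.eye(n + 1)
    M[0, 1:] = x
    return M

z = np.eye(n + 1)
z[1:, 1:] = k                              # the embedding z(k) from (2.1)
x = rng.standard_normal(n)
print(np.allclose(z @ U(x) @ z.T, U(k @ x)))   # True; z(k)^{-1} = z(k)^T since k is orthogonal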

For any \(\mathbf {x}\in {\mathbb {R}}^n\), let us define a subgroup \(\mathrm {SL}(2, \mathbf {x})\) of G containing \(U(\mathbf {x})\) as follows. For \(\mathbf {x}= {\mathbf{e}}_1\), let us define

$$\begin{aligned} \mathrm {SL}(2, {\mathbf{e}}_1) := \left\{ \begin{bmatrix} h&~ \\ ~&\mathrm {I}_{n-1} \end{bmatrix}: h \in \mathrm {SL}(2, {\mathbb {R}}) \right\} . \end{aligned}$$

For general \(\mathbf {x}\in {\mathbb {R}}^n\), let us choose \(\mathfrak {k}\in \mathrm {SO}(n)\) such that \(\Vert \mathbf {x}\Vert _2\mathfrak {k}\cdot {\mathbf{e}}_1 = \mathbf {x}\) and define

$$\begin{aligned} \mathrm {SL}(2, \mathbf {x}) := z(\mathfrak {k}) \mathrm {SL}(2, {\mathbf{e}}_1) z^{-1}(\mathfrak {k}). \end{aligned}$$

It is easy to see that \(\mathrm {SL}(2, \mathbf {x})\) is isomorphic to \(\mathrm {SL}(2, {\mathbb {R}})\) and \(U(\mathbf {x}) \in \mathrm {SL}(2, \mathbf {x})\) corresponds to

$$\begin{aligned} \begin{bmatrix} 1&\Vert \mathbf {x}\Vert _2 \\ ~&1 \end{bmatrix} \in \mathrm {SL}(2,{\mathbb {R}}). \end{aligned}$$

For \(r >0\), let \(\xi _{{\mathbf{e}}_1}(r) \in \mathrm {SL}(2, {\mathbf{e}}_1)\) denote the element

$$\begin{aligned} \begin{bmatrix} r&0&~ \\ 0&r^{-1}&~\\ ~&~&\mathrm {I}_{n-1} \end{bmatrix} \end{aligned}$$

and \(\xi _{\mathbf {x}} (r) \in \mathrm {SL}(2 , \mathbf {x})\) denote \( z(\mathfrak {k}) \xi _{{\mathbf{e}}_1}(r) z^{-1}(\mathfrak {k})\). Then \(\xi _{\mathbf {x}}(r)\) corresponds to \(\begin{bmatrix} r&~\\ ~&r^{-1} \end{bmatrix}\) in \(\mathrm {SL}(2, {\mathbb {R}})\).

Let us study the action of \(\mathrm {SL}(2, \mathbf {x})\) on V.

Let us first consider the case \(\mathbf {x}= {\mathbf{e}}_1\). For \(r \in {\mathbb {R}}\), let us denote

$$\begin{aligned} u_1 (r) := U(r {\mathbf{e}}_1) , \end{aligned}$$

and

$$\begin{aligned} U_1 := \{ u_1 (r ): r \in {\mathbb {R}}\}. \end{aligned}$$

Let us denote

$$\begin{aligned} \Xi _1 : = \{ \xi _1 (r) : = \mathrm {diag}\{r , r^{-1},1, \dots , 1 \}: r > 0 \}. \end{aligned}$$

It is easy to see that \(\xi _1 (r) {\mathbf{w}}_+ = r {\mathbf{w}}_+\), \(u_1 (r) {\mathbf{w}}_+ = {\mathbf{w}}_+\), \(\xi _1 (r) {\mathbf{w}}_1 = r^{-1}{\mathbf{w}}_1\), \(u_1(r){\mathbf{w}}_1 = {\mathbf{w}}_1 + r {\mathbf{w}}_+\), and for any \({\mathbf{w}} \in W_2\), \({\mathbf{w}}\) is fixed by \(\mathrm {SL}(2, {\mathbf{e}}_1)\).

For \(\mathbf {x}\in {\mathbb {R}}^n\), we have \(\mathbf {x}= \Vert \mathbf {x}\Vert _2 \mathfrak {k} \cdot {\mathbf{e}}_1\) for some \(\mathfrak {k} \in \mathrm {SO}(n)\) and

$$\begin{aligned} \mathrm {SL}(2,\mathbf {x}) = z(\mathfrak {k}) \mathrm {SL}(2, {\mathbf{e}}_1) z^{-1}(\mathfrak {k}). \end{aligned}$$

In particular, we have that

$$\begin{aligned} U(\mathbf {x}) = z(\mathfrak {k}) u_1(\Vert \mathbf {x}\Vert _2)z^{-1}(\mathfrak {k}) \end{aligned}$$

and \(\xi _{\mathbf {x}}(r) = z(\mathfrak {k}) \xi _1(r) z^{-1}(\mathfrak {k})\). Since \(z(\mathfrak {k}) {\mathbf{w}}_+ = {\mathbf{w}}_+\) and \(z(\mathfrak {k}) W = W\), we have that \(\xi _{\mathbf {x}}(r) {\mathbf{w}}_+ = r {\mathbf{w}}_+\), \(U(\mathbf {x}) {\mathbf{w}}_+ = {\mathbf{w}}_+\), \(\xi _{\mathbf {x}}(r) z(\mathfrak {k}) {\mathbf{w}}_1 = r^{-1}z(\mathfrak {k}) {\mathbf{w}}_1\), \(U(\mathbf {x}) z(\mathfrak {k}) {\mathbf{w}}_1 = z(\mathfrak {k}) {\mathbf{w}}_1 + \Vert \mathbf {x}\Vert _2 {\mathbf{w}}_+\) and for any \({\mathbf{w}} \in z(\mathfrak {k})W_2\), \({\mathbf{w}}\) is fixed by \(\mathrm {SL}(2, \mathbf {x})\).

Let us consider the action of \(\mathrm {SL}(2,\mathbf {x})\) on \(\bigwedge ^i V\) for \(i = 2,\dots , n\). Let us denote \(\mathbf {x}= \Vert \mathbf {x}\Vert _2 \mathfrak {k} \cdot {\mathbf{e}}_1\) as above. For any \({\mathbf{w}} \in \bigwedge ^{i-1} z(\mathfrak {k})W_2\), we have that

$$\begin{aligned} \xi _{\mathbf {x}}(r) ((z(\mathfrak {k}){\mathbf{w}}_1)\wedge {\mathbf{w}})= & {} r^{-1}((z(\mathfrak {k}){\mathbf{w}}_1)\wedge {\mathbf{w}}),\\ U(\mathbf {x})((z(\mathfrak {k}){\mathbf{w}}_1)\wedge {\mathbf{w}})= & {} (z(\mathfrak {k}){\mathbf{w}}_1)\wedge {\mathbf{w}} + \Vert \mathbf {x}\Vert _2 ({\mathbf{w}}_+ \wedge {\mathbf{w}}),\\ \xi _{\mathbf {x}}(r) ({\mathbf{w}}_+ \wedge {\mathbf{w}})= & {} r ({\mathbf{w}}_+ \wedge {\mathbf{w}}) \end{aligned}$$

and

$$\begin{aligned} U(\mathbf {x})({\mathbf{w}}_+ \wedge {\mathbf{w}}) = {\mathbf{w}}_+ \wedge {\mathbf{w}}. \end{aligned}$$

For any \({\mathbf{w}} \in \bigwedge ^{i} z(\mathfrak {k})W_2\) and any \({\mathbf{w}}' \in \bigwedge ^{i-2} z(\mathfrak {k})W_2\), we have that \({\mathbf{w}}\) and \({\mathbf{w}}_+ \wedge (z(\mathfrak {k}){\mathbf{w}}_1) \wedge {\mathbf{w}}'\) are fixed by \(\mathrm {SL}(2,\mathbf {x})\).

2.3 Lattices in \({\mathbb {R}}^{n+1}\).

In this subsection let us recall some basic facts on lattices and sublattices in \({\mathbb {R}}^{n+1}\).

For a discrete subgroup \(\Delta \) of \({\mathbb {R}}^{n+1}\), let \(\mathrm {Span}_{{\mathbb {R}}}(\Delta )\) denote the \({\mathbb {R}}\)-span of \(\Delta \).

Let \(\Lambda \in X = G/\Gamma \) be a unimodular lattice in \({\mathbb {R}}^{n+1}\). For \(i=1, \dots , n+1\), let \({\mathcal {L}}_i( \Lambda )\) denote the collection of i-dimensional sublattices of \(\Lambda \). Given \(\Lambda ' \in {\mathcal {L}}_i( \Lambda )\), let us choose a basis \(\{{\mathbf{v}}_1, \dots , {\mathbf{v}}_i\}\) of \(\Lambda '\) and define

$$\begin{aligned} {\mathcal {W}}(\Lambda ') := {\mathbf{v}}_1 \wedge \cdots \wedge {\mathbf{v}}_i \in \bigwedge ^i V. \end{aligned}$$
(2.2)

\({\mathcal {W}}(\Lambda ')\) is well defined modulo \(\pm 1\). Thus \({\mathcal {W}}\) defines a map from \({\mathcal {L}}_i( \Lambda )\) to \(\bigwedge ^i V/\pm \) for each \(i =1, \dots , n+1\). Let us denote \(d(\Lambda ') := \Vert {\mathcal {W}}(\Lambda ')\Vert \). We say that \(\Lambda '\) is primitive relative to \(\Lambda \) if \({\mathcal {W}}(\Lambda ')\) cannot be written as \(m {\mathcal {W}}(\tilde{\Lambda })\) where \(|m| >1\) is an integer and \(\tilde{\Lambda } \in {\mathcal {L}}_i(\Lambda )\) (see [Cas57]).

For \( j = 1, \dots , i\), let

$$\begin{aligned} \lambda _j(\Lambda ') := \inf \{r\ge 0: B({\mathbf{0}}, r) \text { contains at least } j \text { linearly independent vectors of } \Lambda '\}. \end{aligned}$$

By Minkowski’s theorem on successive minima (see [Cas57]), we have the following:

$$\begin{aligned} \lambda _1(\Lambda ')\cdots \lambda _i(\Lambda ') \asymp d(\Lambda '). \end{aligned}$$
(2.3)

Moreover, there exists a basis of \(\Lambda '\) (called a Minkowski reduced basis), \(\{{\mathbf{v}}_j: j = 1, \dots , i\}\), such that \(\Vert {\mathbf{v}}_j\Vert \asymp \lambda _j(\Lambda ')\) for every \(j = 1, \dots , i\).
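
The following toy Python sketch illustrates these facts for a rank-2 sublattice of a lattice in \({\mathbb {R}}^3\): it computes \(d(\Lambda ')\) as the sup norm of the wedge coordinates of a basis and compares it with \(\lambda _1(\Lambda ')\lambda _2(\Lambda ')\), as in (2.3). The example basis and the brute-force search box are ours, and the successive minima are only approximated within that box.

import itertools
import numpy as np

def d_sublattice(basis):
    # d(Lambda') = sup norm of W(Lambda'), i.e. of the vector of i x i minors of a basis matrix.
    M = np.column_stack(basis)
    d, i = M.shape
    return max(abs(np.linalg.det(M[list(rows), :]))
               for rows in itertools.combinations(range(d), i))

def successive_minima_rank2(basis, box=25):
    pairs = [(a, b) for a in range(-box, box + 1) for b in range(-box, box + 1) if (a, b) != (0, 0)]
    norm = lambda ab: np.max(np.abs(ab[0] * basis[0] + ab[1] * basis[1]))   # sup norm
    a0, b0 = min(pairs, key=norm)                        # a shortest nonzero vector
    lam1 = norm((a0, b0))
    lam2 = min(norm(ab) for ab in pairs if ab[0] * b0 - ab[1] * a0 != 0)    # shortest one independent of it
    return lam1, lam2

basis = [np.array([3.0, 0.1, 0.0]), np.array([0.2, 0.25, 0.5])]
lam1, lam2 = successive_minima_rank2(basis)
print(lam1 * lam2, d_sublattice(basis))      # comparable quantities, as predicted by (2.3)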

For \(\rho >0\) and \(i =1, \dots , n+1\), let \({\mathcal {C}}_i(\Lambda , \rho )\) denote the collection of i-dimensional primitive sublattices \(\Lambda '\) of \(\Lambda \) with \(d(\Lambda ') < \rho \). We will need the following result on counting sublattices:

Proposition 2.2

There exists a constant \(N >1\) such that the following statement holds. For any \(0<\epsilon < 1 \) and any \(i= 1, \dots , n\), let \(\Lambda \in K_{\epsilon }\) where \(K_{\epsilon }\) is defined in (1.2). Then we have that

$$\begin{aligned} \sharp {\mathcal {C}}_i(\Lambda , 1) \le \epsilon ^{-N} . \end{aligned}$$

Proof

First note that there exists a constant \(N_1 >1\) such that for any \(i=1,\dots , n\) and \(\rho >0\),

$$\begin{aligned} \sharp {\mathcal {C}}_i({\mathbb {Z}}^{n+1}, \rho ) \le \rho ^{N_1}. \end{aligned}$$

We also note that there exists a constant \(N_2 >1\) such that for any \(\Lambda \in K_{\epsilon }\), there exists \(g \in \mathrm {SL}(n+1, {\mathbb {R}})\) with \(\Vert g^{-1}\Vert < \epsilon ^{-N_2}\) such that \(\Lambda = g {\mathbb {Z}}^{n+1}\). Indeed, this is easily seen by choosing g in a Siegel set (see [EW17, Proposition 10.56]). Let us fix \(i \in \{1, \dots , n\}\). Then for any \(\Lambda ' \in {\mathcal {C}}_i(\Lambda , 1)\), we have that \(g^{-1}\Lambda ' \subset {\mathbb {Z}}^{n+1}\) and

$$\begin{aligned} d(g^{-1} \Lambda ') \le \Vert g^{-1}\Vert ^i d(\Lambda ') \le \epsilon ^{-(n+1) N_2} . \end{aligned}$$

Therefore, we have that

$$\begin{aligned} \sharp {\mathcal {C}}_i(\Lambda , 1) \le \sharp {\mathcal {C}}_i({\mathbb {Z}}^{n+1}, \epsilon ^{-(n+1) N_2}) \le \epsilon ^{-N} \end{aligned}$$

where \(N = N_1 N_2 (n+1)\).

This completes the proof. \(\square \)

3 A Cantor-like Construction

In this section, we will introduce a Cantor-like construction which will help us to compute Hausdorff dimension.

Since we focus on the case of curves, we may assume that \({\mathcal {U}}\) is given by

$$\begin{aligned} \varvec{{\varphi }}= (\varphi _1 , \dots , \varphi _n): [0,1] \rightarrow {\mathbb {R}}^n \end{aligned}$$

where every \(\varphi _i(s)\) is a \(C^n\) differentiable function.

Definition 3.1

(See [Ber15, Sect. 5]). For an integer \(R >0\) and a closed interval \(J \subset [0,1]\), let us denote by \({\mathbf {Par}}_{R}(J)\) the collection of closed intervals obtained by dividing J into R closed intervals of the same size. For a collection \({\mathcal {I}}\) of closed intervals, let us denote

$$\begin{aligned} {\mathbf {Par}}_{R}({\mathcal {I}}) := \bigcup _{I \in {\mathcal {I}}} {\mathbf {Par}}_R(I). \end{aligned}$$

A sequence \(\{{\mathcal {I}}_q\}_{q \in {\mathbb {N}}}\) of collections of closed intervals is called an R-sequence if for every \(q \ge 1\), \({\mathcal {I}}_q \subset {\mathbf {Par}}_R({\mathcal {I}}_{q-1})\). For an R-sequence \(\{{\mathcal {I}}_q\}_{q \in {\mathbb {N}}}\) and \(q \ge 1\), let us define \(\hat{{\mathcal {I}}}_q := {\mathbf {Par}}_R({\mathcal {I}}_{q-1})\setminus {\mathcal {I}}_q\) and

$$\begin{aligned} {\mathcal {K}}(\{{\mathcal {I}}_q: q \in {\mathbb {N}}\}) := \bigcap _{q \in {\mathbb {N}}} \bigcup _{I_q \in {\mathcal {I}}_q} I_q . \end{aligned}$$

Then every R-sequence \(\{{\mathcal {I}}_q\}_{q \in {\mathbb {N}}}\) gives a Cantor-like subset \({\mathcal {K}}(\{{\mathcal {I}}_q\}_{q \in {\mathbb {N}}})\) of [0, 1].
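
As a toy illustration of this definition, the following Python sketch (the interval representation and the selection rule are ours) builds an R-sequence with \(R = 3\) by always discarding the middle child of every interval, which recovers the classical middle-thirds Cantor set.

def par_R(intervals, R):
    # Par_R: divide each closed interval into R closed subintervals of equal length.
    out = []
    for (a, b) in intervals:
        h = (b - a) / R
        out.extend((a + j * h, a + (j + 1) * h) for j in range(R))
    return out

def cantor_sequence(R, keep, depth):
    # An R-sequence: at every step keep a sub-collection of Par_R of the previous collection.
    collections = [[(0.0, 1.0)]]
    for _ in range(depth):
        collections.append([I for I in par_R(collections[-1], R) if keep(I)])
    return collections

def middle_third_keep(interval):
    a, b = interval
    return round(a / (b - a)) % 3 != 1       # drop the middle child of every parent interval

seq = cantor_sequence(3, middle_third_keep, depth=4)
print([len(c) for c in seq])                 # [1, 2, 4, 8, 16]
print(sum(b - a for (a, b) in seq[-1]))      # total length (2/3)^4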

For \(q \ge 1\) and a partition \(\{\hat{{\mathcal {I}}}_{q, p}\}_{0 \le p \le q-1}\) of \(\hat{{\mathcal {I}}}_q\), let us define

$$\begin{aligned} d_q(\{\hat{{\mathcal {I}}}_{q, p}\}_{0\le p \le q-1}):= \sum _{p=0}^{q-1} \left( \frac{4}{R}\right) ^{q-p} \max _{I_p \in {\mathcal {I}}_p} F( \hat{{\mathcal {I}}}_{q,p}, I_p), \end{aligned}$$

where \(F( \hat{{\mathcal {I}}}_{q,p}, I_p) := \sharp \{I_q \in \hat{{\mathcal {I}}}_{q,p}: I_q \subset I_p\}\). Let us define

$$\begin{aligned} d_q({\mathcal {I}}_q) := \min _{\{\hat{{\mathcal {I}}}_{q, p}\}_{0\le p \le q-1}} d_q(\{\hat{{\mathcal {I}}}_{q, p}\}_{0\le p \le q-1}), \end{aligned}$$

where \(\{\hat{{\mathcal {I}}}_{q, p}\}_{0\le p \le q-1}\) runs over all possible partitions of \(\hat{{\mathcal {I}}}_q\). Let us define

$$\begin{aligned} d(\{{\mathcal {I}}_q\}_{q \in {\mathbb {N}}}) := \max _{q \in {\mathbb {N}}} d_q({\mathcal {I}}_q). \end{aligned}$$

Definition 3.2

(See [Ber15, Sect. 5]). For \(R >1\) and a compact subset \(X \subset [0,1]\), we say that X is R-Cantor rich if for any \(\epsilon >0\), there exists an R-sequence \(\{{\mathcal {I}}_q\}_{q \in {\mathbb {N}}}\) such that

$$\begin{aligned} {\mathcal {K}}(\{{\mathcal {I}}_q\}_{q \in {\mathbb {N}}}) \subset X \end{aligned}$$

and \(d(\{{\mathcal {I}}_q\}_{q \in {\mathbb {N}}}) \le \epsilon \).

Our proof relies on the following two theorems:

Theorem 3.3

(See [Ber15, Theorem 6]). Any R-Cantor rich set X has full Hausdorff dimension.

Theorem 3.4

(See [Ber15, Theorem 7]). Any countable intersection of R-Cantor rich sets in [0, 1] is R-Cantor rich.

To show Theorems 1.7 and 1.8, it suffices to find a constant \(R >1\) and show that for any weight \({\mathbf{r}}\), \(\varvec{{\varphi }}^{-1}({\mathbf {Bad}}({\mathbf{r}})\cap \varvec{{\varphi }}([0,1]))\) is R-Cantor rich. We will determine \(R>1\) later.

Theorem 3.5

There exists a constant \(R > 1\) such that for any weight \({\mathbf{r}}\), \(\varvec{{\varphi }}^{-1}({\mathbf {Bad}}({\mathbf{r}})\cap \varvec{{\varphi }}([0,1]))\) is R-Cantor rich.

Our main task is to prove Theorem 3.5.

Let us fix R. We will show that for any \(\epsilon >0\), we can construct an R-sequence \(\{{\mathcal {I}}_q\}_{q \in {\mathbb {N}}}\) such that \({\mathcal {K}}(\{{\mathcal {I}}_q\}_{q \in {\mathbb {N}}}) \subset \varvec{{\varphi }}^{-1}({\mathbf {Bad}}({\mathbf{r}}))\) and \(d(\{{\mathcal {I}}_q\}_{q \in {\mathbb {N}}}) < \epsilon \).

Standing Assumption 3.6

Let us make some assumptions to simplify the proof.

A.1 :

Without loss of generality, we may assume that \(r_1 \ge r_2 \ge \cdots \ge r_n\). We may also assume that \(r_n >0\). By [Ber15], if \(r_n = 0\), we can reduce the problem to the \((n-1)\)-dimensional case.

A.2 :

Since

$$\begin{aligned} \varvec{{\varphi }}=(\varphi _1, \dots , \varphi _n): [0,1] \rightarrow {\mathbb {R}}^{n} \end{aligned}$$

is \(C^n\) differentiable and non-degenerate, we may assume that for any \(s \in [0,1]\) and any \(i = 1 ,\dots , n\), \(\varphi '_i(s) \ne 0\). If this is not the case, we can replace [0, 1] with a smaller closed interval \(I \subset [0,1]\), cf. [Ber15, Property F]. Then, since [0, 1] is compact, there exist constants \(C_1> c_1 >0\) such that for any \(s \in [0,1]\) and any \(i=1,\dots , n\), \(c_1 \le |\varphi '_i(s)| \le C_1\).

Let us fix some notation. Let \( \kappa >0\) be a small parameter which we will determine later. Let \(b >0\) be such that \(b^{1+r_1}=R\). For \(t >0\), let us denote

$$\begin{aligned} g_{{\mathbf{r}}} (t) := \begin{bmatrix} b^{t} ~&~&~&~ \\ ~&b^{-r_1 t}&~&~ \\ ~&~&\ddots&~ \\ ~&~&~&b^{-r_n t} \end{bmatrix}. \end{aligned}$$

For \(i =1 , \dots , n\), let \(\lambda _i = \frac{1+r_i}{1+r_1}\). Then we have that \(1 = \lambda _1 \ge \lambda _2 \ge \cdots \ge \lambda _n\). Let \(m(\cdot )\) denote the Lebesgue measure on [0, 1].

Let us give the R-sequence as follows. Let \({\mathcal {I}}_0 = \{ [0,1] \}\). Suppose that we have defined \({\mathcal {I}}_{q-1}\) for \(q \ge 1\) and every \(I_{q-1} \in {\mathcal {I}}_{q-1}\) is a closed interval of size \( R^{-q+1}\). Let us define \({\mathcal {I}}_{q}\subset {\mathbf {Par}}_R({\mathcal {I}}_{q-1})\) as follows. For any \(I_{q} \in {\mathbf {Par}}_R({\mathcal {I}}_{q-1})\), \(I_{q} \in \hat{{\mathcal {I}}}_{q}\) if and only if there exists \(s \in I_{q}\) such that \(g_{{\mathbf{r}}} (q) U(\varvec{{\varphi }}(s)) {\mathbb {Z}}^{n+1} \notin K_{\kappa }\). That is to say, there exists \({\mathbf{a}} \in {\mathbb {Z}}^{n+1}\setminus \{{\mathbf{0}}\}\) such that \(\Vert g_{{\mathbf{r}}} (q) U(\varvec{{\varphi }}(s)){\mathbf{a}}\Vert \le \kappa \). Let us define \({\mathcal {I}}_q = {\mathbf {Par}}_R({\mathcal {I}}_{q-1})\setminus \hat{{\mathcal {I}}}_q\). This finishes the construction of \(\{{\mathcal {I}}_q\}_{q\in {\mathbb {N}}}\). It is easy to see that

$$\begin{aligned} {\mathcal {K}}(\{{\mathcal {I}}_q\}_{q \in {\mathbb {N}}}) \subset \varvec{{\varphi }}^{-1}({\mathbf {Bad}}({\mathbf{r}})). \end{aligned}$$
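
To make this construction concrete, here is a schematic Python sketch of a single subdivision step. It is only a simplification: the condition over all \(s \in I_q\) is tested at finitely many sample points, and the search for short lattice vectors is truncated to a finite box of integer coefficients, so it only approximates the actual rule; the curve, weight and parameter values in the example are ours as well.

import numpy as np

def U(x):                                          # the matrix U(x) from Section 2.1
    M = np.eye(len(x) + 1)
    M[0, 1:] = x
    return M

def g_r(t, r, b):                                  # the matrix g_r(t) defined above
    return np.diag([b ** t] + [b ** (-ri * t) for ri in r])

def has_short_vector(g, kappa, box=20):
    # Brute-force test (within the box) of whether g Z^{n+1} meets B(0, kappa) nontrivially.
    rng = np.arange(-box, box + 1)
    grids = np.meshgrid(*([rng] * g.shape[0]), indexing="ij")
    pts = np.stack([grid.ravel() for grid in grids], axis=1).astype(float)
    pts = pts[np.any(pts != 0, axis=1)]
    return bool(np.min(np.max(np.abs(pts @ g.T), axis=1)) <= kappa)

def subdivision_step(intervals, q, R, r, b, phi, kappa, samples=3):
    kept = []
    for (c, d) in intervals:
        for j in range(R):
            cc, dd = c + j * (d - c) / R, c + (j + 1) * (d - c) / R
            if not any(has_short_vector(g_r(q, r, b) @ U(phi(s)), kappa)
                       for s in np.linspace(cc, dd, samples)):
                kept.append((cc, dd))
    return kept

# Example with n = 2, the curve phi(s) = (s, s^2), weight (1/2, 1/2), R = 8 and b = R^{2/3}:
R, r = 8, (0.5, 0.5)
b = R ** (1 / (1 + r[0]))
I_1 = subdivision_step([(0.0, 1.0)], q=1, R=R, r=r, b=b, phi=lambda s: (s, s * s), kappa=0.5)
print(len(I_1), "of", R, "subintervals survive the first step")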

We need to prove the following:

Proposition 3.7

For any \(\epsilon >0\), there exists \(\kappa >0\) such that the R-sequence \(\{{\mathcal {I}}_q\}_{q \in {\mathbb {N}}}\) constructed as above with \(\kappa \) satisfies that

$$\begin{aligned} d(\{{\mathcal {I}}_q\}_{q\in {\mathbb {N}}}) \le \epsilon . \end{aligned}$$
(3.1)

Let \(N >1\) be the constant from Proposition 2.2 and \(k>0\) be such that \(\kappa = R^{-k}\). We can choose \(\kappa \) so that k is an integer. Let us give a partition \(\{\hat{{\mathcal {I}}}_{q,p}\}_{0\le p \le q-1}\) of \(\hat{{\mathcal {I}}}_q\) for each \(q \in {\mathbb {N}}\) which shows that Proposition 3.7 holds.

Definition 3.8

Let us fix a small constant \(0< \rho < 1 \). We will modify the choice of \(\rho \) later in this paper according to the constants arising from our technical results. For \(q \le 10^6 n^4 N k\), let us define \(\hat{{\mathcal {I}}}_{q, 0} := \hat{{\mathcal {I}}}_q\) and \(\hat{{\mathcal {I}}}_{q,p} = \emptyset \) for other p’s.

For \(q > 10^6 n^4 N k\) and \(l = 2000 n^2 N k\), let \( p = q - 2l\). Let us define \(\hat{{\mathcal {I}}}_{q,p'} := \emptyset \) for \(p<p' \le q-1\). Let us define \(\hat{{\mathcal {I}}}_{q,p}\) to be the collection of \(I_q \in \hat{{\mathcal {I}}}_q \) with the following property: there exists \(s \in I_q\) such that for any \(j=1,\dots , n\) and any \({\mathbf{w}} = {\mathbf{w}}_1 \wedge \cdots \wedge {\mathbf{w}}_j \in \bigwedge ^j {\mathbb {Z}}^{n+1} \setminus \{{\mathbf{0}}\}\),

$$\begin{aligned} \max \{\Vert g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s')) {\mathbf{w}}\Vert : s' \in [s-R^{-q+l}, s+R^{-q+l}] \} \ge \rho ^j. \end{aligned}$$

Let \(\eta = \frac{1}{100n^2}\) and \(\eta ' = \frac{\eta }{1+r_1}\). For \(q > 10^6 n^4 N k\) and \( 2000 n^2 N k < l \le 2\eta ' q\), let \(p = q - 2l\). Let us define \(\hat{{\mathcal {I}}}_{q,p+1} := \emptyset \). For \(j = 1, \dots , n\), let us define \(\hat{{\mathcal {I}}}_{q,p}(j)\) to be the collection of \(I_q \in \hat{{\mathcal {I}}}_q\setminus \left( \bigcup _{p' < p} \hat{{\mathcal {I}}}_{q,p'} \right) \) such that there exists \(s \in I_q\) and \({\mathbf{v}} = {\mathbf{v}}_1 \wedge \cdots \wedge {\mathbf{v}}_j \in \bigwedge ^j {\mathbb {Z}}^{n+1} \setminus \{{\mathbf{0}}\}\) such that

$$\begin{aligned} \Vert g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s')){\mathbf{v}}\Vert < \rho ^j, \end{aligned}$$

for any \(s' \in [s - R^{-q + l}, s + R^{-q +l}]\) and for any \(j' = 1, \dots , n\) and any \({\mathbf{w}} = {\mathbf{w}}_1 \wedge \cdots \wedge {\mathbf{w}}_{j'} \in \bigwedge ^{j'} {\mathbb {Z}}^{n+1} \setminus \{{\mathbf{0}}\}\),

$$\begin{aligned} \max \left\{ \Vert g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s')) {\mathbf{w}}\Vert :s' \in [s - R^{-q+l+1}, s + R^{-q +l+1} ]\right\} \ge \rho ^{j'}. \end{aligned}$$

Let us define \(\hat{{\mathcal {I}}}_{q,p} = \bigcup _{j=1}^{n} \hat{{\mathcal {I}}}_{q,p}(j)\).

For \(j = 1, \dots , n\), let us define \(\hat{{\mathcal {I}}}_{q,0}(j)\) to be the collection of \(I_q \in \hat{{\mathcal {I}}}_q\setminus \left( \bigcup _{p' \le q- 4\eta ' q} \hat{{\mathcal {I}}}_{q,p'} \right) \) such that there exists \(s \in I_q\) and \({\mathbf{v}}= {\mathbf{v}}_1 \wedge \cdots \wedge {\mathbf{v}}_j \in \bigwedge ^j {\mathbb {Z}}^{n+1}\setminus \{{\mathbf{0}}\}\) such that

$$\begin{aligned} \max \left\{ \Vert g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s')) {\mathbf{v}}\Vert :s' \in [s - R^{-q (1-2\eta ')}, s+ R^{-q(1-2\eta ')} ]\right\} < \rho ^j. \end{aligned}$$

Let us define \(\hat{{\mathcal {I}}}_{q,0} = \bigcup _{j=1}^{n} \hat{{\mathcal {I}}}_{q,0}(j)\).

Let us define \(\hat{{\mathcal {I}}}_{q,p} := \emptyset \) for other p’s. It is easy to see that \(\{\hat{{\mathcal {I}}}_{q,p}\}_{0 \le p \le q-1}\) is a partition of \(\hat{{\mathcal {I}}}_q\).

Besides the definition of \(\{\hat{{\mathcal {I}}}_{q,p}\}_{ 0 \le p \le q-1 }\), let us also introduce the notion of dangerous interval and extremely dangerous interval:

Definition 3.9

For \(q > 10^6 n^4 N k\), \( 1000 n^2 N k \le l \le \eta ' q \), and \({\mathbf{a}} \in {\mathbb {Z}}^{n+1}\setminus \{{\mathbf{0}}\}\), the (q, l)-dangerous interval associated with \({\mathbf{a}}\), which is denoted by \(\Delta _{q,l} ({\mathbf{a}})\), is a closed interval of the form \(\Delta _{q,l}({\mathbf{a}}) = [ s - R^{-q + l }, s + R^{-q +l}] \subset [0,1]\) such that \(I_q \subset \Delta _{q,l}({\mathbf{a}})\) for some \(I_q \in \hat{{\mathcal {I}}}_q\),

$$\begin{aligned} \max \{ \Vert g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s')) {\mathbf{a}}\Vert : s' \in \Delta _{q,l}({\mathbf{a}}) \} < \rho \end{aligned}$$

and

$$\begin{aligned} \max \{ \Vert g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s')) {\mathbf{a}}\Vert : s' \in [s - R^{-q+l+1}, s + R^{-q+l+1}] \} \ge \rho . \end{aligned}$$

The center s of \(\Delta _{q,l}({\mathbf{a}})\) is chosen such that the first coordinate of \(U(\varvec{{\varphi }}(s)) {\mathbf{a}}\) is zero.

For \(q \ge 10^6 n^4 N k \) and \({\mathbf{a}} \in {\mathbb {Z}}^{n+1}\setminus \{{\mathbf{0}}\}\), the q-extremely dangerous interval associated with \({\mathbf{a}}\), which is denoted by \(\Delta _q ({\mathbf{a}})\), is a closed interval of the form \(\Delta _q({\mathbf{a}}) = [s - R^{-q + l'}, s + R^{-q + l'}]\) with \(l' > \eta ' q\) such that \(I_q \subset \Delta _q({\mathbf{a}})\) for some \(I_q \in \hat{{\mathcal {I}}}_q\),

$$\begin{aligned} \max \{ \Vert g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s')) {\mathbf{a}}\Vert : s' \in \Delta _{q}({\mathbf{a}}) = [s - R^{-q+l'}, s + R^{-q+l'}]\} < \rho \end{aligned}$$

and

$$\begin{aligned} \max \{ \Vert g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s')) {\mathbf{a}}\Vert : s' \in [s - R^{-q+l'+1}, s + R^{-q+l'+1}] \} \ge \rho . \end{aligned}$$

Remark 3.10

Note that for any \(q \ge 10^6 n^4 N k\), there are only finitely many \({\mathbf{a}}\)’s such that \(\Delta _{q, l} ({\mathbf{a}})\) or \(\Delta _q ({\mathbf{a}})\) exist.

4 Counting Dangerous Intervals

In this section we will count dangerous intervals and extremely dangerous intervals.

Proposition 4.1

Let \(q \ge 10^6 n^4 N k\), \( 1000 n^2 N k \le l \le \eta ' q\) and \(p = q - 2l \). For \(I_p \in {\mathcal {I}}_p\), let \({\mathcal {D}}_{q, l} (I_p)\) denote the collection of (q, l)-dangerous intervals which intersect \(I_p\). Then for any \(I_p \in {\mathcal {I}}_p\),

$$\begin{aligned} \sharp {\mathcal {D}}_{q,l}(I_p) \ll R^{\left( 1- \frac{1}{10n}\right) l}. \end{aligned}$$

Proposition 4.2

Let \(q \ge 10^6 n^4 N k\). Let \(D_q \subset [0,1]\) denote the union of q-extremely dangerous intervals contained in [0, 1]. Then \(D_q\) can be covered by a collection of \(N_q\) closed intervals of length \(\delta _q\) and

$$\begin{aligned} N_q \le \frac{K_0 (\rho ^{n+1} b^{-\eta q})^{\alpha }}{\delta _q} \end{aligned}$$

where \(\delta _q = R^{-q(1-\eta ')}\), \(K_0>0\) is a constant, and \(\alpha = \frac{1}{(n+1)(2n-1)}\).

In fact, Proposition 4.2 is a rephrasing of the following theorem due to Bernik, Kleinbock and Margulis:

Theorem 4.3

(See [Ber15, Proposition 2] and [BKM01, Theorem 1.4]). Let \(q > 10^6 n^4 N k\). Let us define \(E_q \subset [0,1]\) to be the set of \(s \in [0,1]\) such that there exists \({\mathbf{a}} =(a_0, a_1, \dots , a_n) \in {\mathbb {Z}}^{n+1}\setminus \{{\mathbf{0}}\}\) such that \(|a_i| < \rho b^{r_i q}\) for \(i=1,\dots ,n\), \(|f(s)| < \rho b^{-q}\) and \(| f'(s)| < b^{(r_1 - \eta )q}\) where

$$\begin{aligned} f(s) = a_0 + a_1 \varphi _1(s) + \cdots + a_n \varphi _n(s). \end{aligned}$$
(4.1)

Then \(E_q\) can be covered by a collection \({\mathcal {E}}_{q}\) of intervals such that

$$\begin{aligned} m(\Delta ) \le \delta _q \text { for all } \Delta \in {\mathcal {E}}_{q}, \end{aligned}$$

and

$$\begin{aligned} |{\mathcal {E}}_{q}| \le \frac{K_0 (\rho ^{n+1} b^{-\eta q})^{\alpha }}{\delta _q}, \end{aligned}$$

where \(\delta _q = R^{-q(1 - \eta ')}\), \(K_0 >0\) is a constant, and \(\alpha = \frac{1}{(n+1)(2n-1)}\).

The theorem above is a simplified version of [BKM01, Theorem 1.4]. The original version is more general.

Proof of Proposition 4.2

For every q-extremely dangerous interval \(\Delta _q({\mathbf{a}}) = [s - R^{-q+l'}, s + R^{-q +l'}]\) where \(l' \ge \eta ' q\) and \({\mathbf{a}} = (a_0, a_1, \dots , a_n)\), we have that

$$\begin{aligned} \Vert g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s')){\mathbf{a}}\Vert < \rho \end{aligned}$$
(4.2)

for every \(s' \in \Delta _q ({\mathbf{a}})\). By direct computation, we have that

$$\begin{aligned} g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s')){\mathbf{a}} = (v_0(s'), v_1(s'), \dots , v_n(s')) \end{aligned}$$

where

$$\begin{aligned} v_0(s') = b^q (a_0 + a_1 \varphi _1 (s') + \cdots + a_n \varphi _n (s')), \end{aligned}$$

and \(v_i (s') = b^{-r_i q} a_i\) for \(i =1, \dots , n\). Then (4.2) implies that \(|a_i| < \rho b^{r_i q}\) for \(i=1, \dots , n\), and \(|f(s)| < \rho b^{-q}\), where f is as in (4.1). Since \(l' \ge \eta ' q\), we have that

$$\begin{aligned} |f(s')| < \rho b^{-q} \end{aligned}$$

for any \(s' \in [s - R^{-q(1-\eta ')} , s + R^{-q(1-\eta ')}]\). Let us write \(s' = s + r R^{-q(1-\eta ')}\) for some \(r \in [ -1, 1]\). Then

$$\begin{aligned} f(s') = f(s) + f'(s)r R^{-q(1-\eta ')} + O(R^{-2q(1-\eta ')}). \end{aligned}$$

Therefore, we have that for any \(r \in [ -1,1]\),

$$\begin{aligned} | f'(s) r R^{-q(1-\eta ')}|&= |f(s') - f(s) - O(R^{-2q(1-\eta ')})| \\&\le |f(s')| + |f(s)| + O(R^{-2q(1-\eta ')}) \\&< \rho b^{-q} + \rho b^{-q} + \rho b^{-q} < b^{-q}. \end{aligned}$$

This implies that

$$\begin{aligned} | f'(s)| < R^{q(1-\eta ')} b^{-q} = b^{q(r_1-\eta )}. \end{aligned}$$

The last equality above holds because \(b^{1+r_1} = R\) and \(\eta ' = \frac{\eta }{1+r_1}\). This shows that \(x \in E_q\) for any \(x \in \Delta _q({\mathbf{a}})\), i.e., \(\Delta _q({\mathbf{a}}) \subset E_q\). Therefore, we have that \(D_q \subset E_q\). Then the conclusion follows from Theorem 4.3. \(\square \)

The rest of the section is devoted to the proof of Proposition 4.1. This is one of the main technical results of this paper.

Proof of Proposition 4.1

Let us fix \(I_p \in {\mathcal {I}}_p\). Let us write \(I_p = [s - R^{-q + 2l}, s+ R^{-q + 2l}]\). We claim that we can approximate \(\varvec{{\varphi }}(I_p)\) by its linear part. In fact, for any \(s' \in I_p\), let us write \(s' = s + r R^{-q + 2l}\) for some \(r \in [-1, 1]\). By Taylor’s expansion, we have that

$$\begin{aligned} g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s'))&= g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s) + R^{-q + 2l} r\varvec{{\varphi }}'(s) + O(R^{-2q + 4l})) \\&= g_{{\mathbf{r}}}(q) U(O(R^{-2q + 4l})) g_{{\mathbf{r}}}(-q) g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s) + R^{-q + 2l} r\varvec{{\varphi }}'(s) ) \\&= U(O(R^{-q + 4l})) g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s) + R^{-q + 2l}r \varvec{{\varphi }}'(s)). \end{aligned}$$

Since \(l \le \eta ' q\), we have that \(O(R^{-q + 4l})\) is exponentially small and thus can be ignored. Therefore, we can approximate \(\varvec{{\varphi }}(s')\) by \(\varvec{{\varphi }}(s) + \varvec{{\varphi }}'(s)(s'-s)\) for any \(s' \in I_p\).

Let us take a (q, l)-dangerous interval \(\Delta _{q,l}({\mathbf{a}})\) that intersects \(I_p\). Without loss of generality, we may assume that \(\Delta _{q,l}({\mathbf{a}}) \subset I_p\). If this is not the case, we can replace \(I_p\) with a slightly larger interval \(I'_p\) such that \(\Delta _{q,l}({\mathbf{a}}) \subset I'_p\) and \(m(I'_p) < 2 m(I_p)\) and proceed with the same argument. Let us write \(\Delta _{q,l}({\mathbf{a}}) = [s'-R^{-q+l}, s'+ R^{-q +l}]\) where \({\mathbf{a}} = (a_0, a_1, \dots , a_n) \in {\mathbb {Z}}^{n+1}\setminus \{{\mathbf{0}}\}\). For every \(s_0 \in \Delta _{q,l}({\mathbf{a}})\), let us denote

$$\begin{aligned} g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s_0)){\mathbf{a}} = {\mathbf{v}}(s_0) = (v_0(s_0), v_1(s_0), \dots , v_n(s_0) ). \end{aligned}$$

Then we have that

$$\begin{aligned} \max \{ \Vert {\mathbf{v}}(s_0)\Vert : s_0 \in \Delta _{q,l}({\mathbf{a}}) \} < \rho \end{aligned}$$
(4.3)

and

$$\begin{aligned} \max \{ \Vert {\mathbf{v}}(s_0)\Vert : s_0 \in [s'- R^{-q+l+1}, s'+R^{-q+l+1}] \} \ge \rho . \end{aligned}$$
(4.4)

Recall that for \(j =1, \dots , n\), \(\lambda _j = \frac{1+ r_j}{1+ r_1}\). Let \(1 \le n' \le n \) be the largest index j such that \((1 -\lambda _j)q \le l\).

For \(s_0 \in [s'-R^{-q+l} , s'+ R^{-q + l}]\), let us write \(s_0 = s' + r R^{-q + l}\) for \(r \in [-1, 1]\). As we explained before, we can approximate \(\varvec{{\varphi }}(s_0)\) by \(\varvec{{\varphi }}(s') + R^{-q + l} r\varvec{{\varphi }}'(s')\). By our standing assumption on \(\varvec{{\varphi }}\) (Standing Assumption A.2), we have that \(c_1 \le |\varphi '_j(s_0)| \le C_1\) for \(j = 1, \dots , n\). By direct calculation, we have that

$$\begin{aligned} g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s_0)){\mathbf{a}}&= g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s_0) - \varvec{{\varphi }}(s')) g_{{\mathbf{r}}}(-q) g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s')){\mathbf{a}} \\&= g_{{\mathbf{r}}}(q)U(r R^{-q + l} \varvec{{\varphi }}'(s')) g_{{\mathbf{r}}}(-q) {\mathbf{v}}(s'). \end{aligned}$$

Recall that \({\mathbf{e}}_i \in {\mathbb {R}}^n\) denote the vector with ith coordinate equal to 1 and other coordinates equal to zero. By direct calculation, we have that

$$\begin{aligned} g_{{\mathbf{r}}}(q)U(r R^{-q +l} \varvec{{\varphi }}'(s')) g_{{\mathbf{r}}}(-q) = U\left( r R^l \sum _{i=1}^n R^{-(1 - \lambda _i)q} \varphi '_i(s') {\mathbf{e}}_i \right) . \end{aligned}$$

Therefore, we have

$$\begin{aligned} {\mathbf{v}}(s_0) = U\left( r R^l \sum _{i=1}^n R^{-(1 - \lambda _i)q} \varphi '_i(s') {\mathbf{e}}_i \right) {\mathbf{v}}(s'). \end{aligned}$$
(4.5)
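
As a quick sanity check of the conjugation rule behind the preceding computation, namely \(g_{{\mathbf{r}}}(q) U({\mathbf{y}}) g_{{\mathbf{r}}}(-q) = U\big ((R^{\lambda _i q} y_i)_{i}\big )\), from which (4.5) follows by taking \({\mathbf{y}} = r R^{-q+l}\varvec{{\varphi }}'(s')\), here is a small numerical Python sketch (the sample weight and parameters are ours, and we use the relations \(b^{1+r_1} = R\) and \(\lambda _i = \frac{1+r_i}{1+r_1}\) fixed in Sect. 3).

import numpy as np

n, R = 2, 8.0
r = np.array([0.6, 0.4])
b = R ** (1 / (1 + r[0]))                  # b^{1 + r_1} = R
lam = (1 + r) / (1 + r[0])                 # lambda_i = (1 + r_i) / (1 + r_1)

def U(y):
    M = np.eye(n + 1)
    M[0, 1:] = y
    return M

def g(t):
    return np.diag(np.concatenate(([b ** t], b ** (-r * t))))

q, y = 3.0, np.array([0.2, -0.7])
lhs = g(q) @ U(y) @ g(-q)
rhs = U(R ** (lam * q) * y)                # each entry y_i is scaled by b^{(1+r_i) q} = R^{lambda_i q}
print(np.allclose(lhs, rhs))               # True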

For the case \(n' < n\), let us estimate

$$\begin{aligned} U\left( - r R^l \sum _{i=n'+1}^n R^{-(1 - \lambda _i)q} \varphi '_i(s') {\mathbf{e}}_i \right) {\mathbf{v}}(s_0). \end{aligned}$$

By our assumption, for \(i \ge n' +1\), we have that \(|r R^l R^{-(1-\lambda _i)q}| \le 1\). Therefore, if we write

$$\begin{aligned} U\left( - r R^l \sum _{i=n'+1}^n R^{-(1 - \lambda _i)q} \varphi '_i(s') {\mathbf{e}}_i \right) {\mathbf{v}}(s_0) = \tilde{{\mathbf{v}}}(s_0) = (\tilde{v}_0(s_0), \tilde{v}_1(s_0), \dots , \tilde{v}_n(s_0)), \end{aligned}$$
(4.6)

where \(\tilde{v}_0(s_0) = v_0(s_0) -r \sum _{i= n' +1}^n R^l R^{-(1-\lambda _i)q} \varphi '_i(s') v_i(s_0)\) and \(\tilde{v}_i(s_0) = v_i(s_0)\) for \(i=1,\dots , n\), then \(|\tilde{v}_0(s_0)| < C = (n+1)C_1 \rho \), and \(|\tilde{v}_i(s_0)| < \rho \) for \(i = 1, \dots , n\). Let

$$\begin{aligned} {\mathbf{h}} = \sum _{i=1}^{n'} R^{-(1 - \lambda _i)q } \varphi '_i(s') {\mathbf{e}}_i \end{aligned}$$

and

$$\begin{aligned} {\mathbf{h}}_W = \sum _{i=1}^{n'} R^{-(1 - \lambda _i)q } \varphi '_i(s') {\mathbf{w}}_i \in W. \end{aligned}$$

Then \(\Vert {\mathbf{h}}\Vert _2 = \Vert {\mathbf{h}}_W\Vert _2 \asymp 1\). Combining (4.5) and (4.6), we have

$$\begin{aligned} U(r R^l {\mathbf{h}} ) {\mathbf{v}}(s') = ( \tilde{v}_0(s_0), \tilde{v}_1(s_0), \dots , \tilde{v}_n(s_0)), \end{aligned}$$
(4.7)

where \(|\tilde{v}_0(s_0)| < C \), and \(|\tilde{v}_i(s_0)| < \rho \) for \(i=1,\dots , n\). Let \(E_{n'}\) be the subspace of \({\mathbb {R}}^n\) spanned by \(\{{\mathbf{e}}_1, \dots , {\mathbf{e}}_{n'}\}\) and \(W'_{n'}\) be the subspace of W spanned by \(\{{\mathbf{w}}_1, \dots , {\mathbf{w}}_{n'}\}\). Then \({\mathbf{h}} \in E_{n'}\). Let \(\mathfrak {k} \in \mathrm {SO}(n)\) be an element such that \(\mathfrak {k}\cdot {\mathbf{e}}_1 = {\mathbf{h}}\), \(\mathfrak {k}\cdot E_{n'} = E_{n'}\), and \(\mathfrak {k} \cdot {\mathbf{e}}_i = {\mathbf{e}}_i\) for \(i = n' +1 , \dots , n\). Let \(z(\mathfrak {k}) = \begin{bmatrix} 1&~ \\ ~&\mathfrak {k} \end{bmatrix} \in Z\). It is easy to see that \(z(\mathfrak {k}) {\mathbf{w}}_+ = {\mathbf{w}}_+\), \(z(\mathfrak {k}) {\mathbf{w}}_1 = {\mathbf{h}}_W\), \(z(\mathfrak {k}) W'_{n'} = W'_{n'}\), and \(z(\mathfrak {k}) {\mathbf{w}}_i = {\mathbf{w}}_i\) for \(i = n' +1, \dots , n\). By the definition of \(z(\mathfrak {k})\) and our discussion in Sect. 2.2, we have that \(U({\mathbf{h}}) = z(\mathfrak {k}) U(\Vert {\mathbf{h}}\Vert _2{\mathbf{e}}_1)z^{-1}(\mathfrak {k})\). Therefore, we have that \(U({\mathbf{h}}) {\mathbf{h}}_W = {\mathbf{h}}_W + \Vert {\mathbf{h}}\Vert _2 {\mathbf{w}}_{+}\). Moreover, we have that \(U({\mathbf{h}}) {\mathbf{w}}_+ = {\mathbf{w}}_+\); for \(i = 2, \dots , n'\), \(U({\mathbf{h}}) z(\mathfrak {k}) {\mathbf{w}}_i = z(\mathfrak {k}){\mathbf{w}}_i\); and for \(i = n'+1, \dots , n\), \(U({\mathbf{h}}) {\mathbf{w}}_i = {\mathbf{w}}_i\). Let us write

$$\begin{aligned} {\mathbf{v}}(s') = a_+(s') {\mathbf{w}}_+ + \sum _{i=1}^{n'} a_i(s') z(\mathfrak {k}){\mathbf{w}}_i + \sum _{i= n' +1}^n a_i(s') {\mathbf{w}}_i. \end{aligned}$$

Then the above discussion shows that

$$\begin{aligned} U(r R^l {\mathbf{h}}){\mathbf{v}}(s') = (a_+(s') + r R^l a_1(s')) {\mathbf{w}}_+ + \sum _{i=1}^{n'} a_i(s') z(\mathfrak {k}){\mathbf{w}}_i + \sum _{i= n' +1}^n a_i(s') {\mathbf{w}}_i. \end{aligned}$$

By (4.3), (4.6) and (4.7), we have that there exists a constant \(C >0\) such that \(|a_i(s')| < C\) for \(i = 1, \dots , n\) and \(|a_+(s') + r R^l a_1(s')| < C \) for any \(r \in [ -1, 1]\). This implies that \(|a_+(s')| < C \), and \(|a_1(s')| < C R^{-l} \). Therefore, we have that \({\mathbf{v}}(s') \in z(\mathfrak {k}) ([-C, C] \times [-C R^{-l}, C R^{-l}] \times [-C, C]^{n-1})\).

Now let us estimate \(\sharp {\mathcal {D}}_{q,l}(I_p)\).

Suppose that \({\mathcal {D}}_{q,l}(I_p) = \{\Delta _{q,l}({\mathbf{a}}_u) : 1 \le u \le L\}\). For each \(u = 1, \dots , L\), let us take \(s_u \in \Delta _{q,l}({\mathbf{a}}_u)\cap I_p\) such that \(s_u \in I_{q-1, u}\) for some \(I_{q-1,u} \in {\mathcal {I}}_{q-1}\). Let us denote

$$\begin{aligned} {\mathbf{v}}_u = g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s_u)){\mathbf{a}}_u. \end{aligned}$$

Then by our previous argument, we have that

$$\begin{aligned} {\mathbf{v}}_u = a_{u, +} {\mathbf{w}}_+ + \sum _{i=1}^{n'} a_{u, i} z(\mathfrak {k}){\mathbf{w}}_i + \sum _{i=n'+1}^n a_{u,i} {\mathbf{w}}_i, \end{aligned}$$
(4.8)

where \(|a_{u, +}| < C \), \(|a_{u,1}| < C R^{-l}\), and \( |a_{u, i} | < C \) for \( i = 2, \dots , n\).

Now let us consider \(g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s_1)){\mathbf{a}}_u\). Let us write \(s_u = s_1 - r R^{-q + 2l}\) for some \(r \in [-1,1]\). As we explained at the beginning of the proof, we can approximate \(\varvec{{\varphi }}(I_p)\) by its linear part. Then we have that

$$\begin{aligned} g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s_1)){\mathbf{a}}_u&= g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s_1) - \varvec{{\varphi }}(s_u)) g_{{\mathbf{r}}}(-q) g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s_u)){\mathbf{a}}_u \\&= g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s_1) - \varvec{{\varphi }}(s_u)) g_{{\mathbf{r}}}(-q) {\mathbf{v}}_u \\&= g_{{\mathbf{r}}}(q) U(r R^{-q + 2l} \varvec{{\varphi }}'(s)) g_{{\mathbf{r}}}(-q) {\mathbf{v}}_u \\&= U\left( r R^{2l} \sum _{i= 1}^n R^{-(1-\lambda _i)q} \varphi '_i (s) {\mathbf{e}}_i \right) {\mathbf{v}}_u. \end{aligned}$$

Let us denote \({\mathbf{h}} = \sum _{i=1}^{n'} R^{-(1-\lambda _i)q} \varphi '_i (s') {\mathbf{e}}_i\) as before. Then by (4.8), we have that

$$\begin{aligned} g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s_1)){\mathbf{a}}_u&= U(r R^{2l} {\mathbf{h}} + r R^{2l}\sum _{i = n' +1}^n R^{-(1-\lambda _i)q} \varvec{{\varphi }}'_i(s) {\mathbf{e}}_i){\mathbf{v}}_u \\&= \left( a_{u, +} + r R^{2l} a_{u, 1} + r R^{2l}\sum _{i= n' +1}^n R^{-(1-\lambda _i)q} \varphi '_i(s) a_{u, i} \right) {\mathbf{w}}_{+} \\&\quad + \sum _{i=1}^{n'} a_{u, i} z(\mathfrak {k}){\mathbf{w}}_i + \sum _{i=n'+1}^n a_{u,i} {\mathbf{w}}_i. \end{aligned}$$

Since \(|a_{u,1}| \le C R^{-l}\), and since for \(i = n'+1 , \dots , n\), \((1-\lambda _i) q > l\), \(|a_{u, i}| <C\), and \(|\varphi '_i(s)|\le C_1\), we have that

$$\begin{aligned}&\left| a_{u, +} + r R^{2l} a_{u, 1} + r R^{2l}\sum _{i= n' +1}^n R^{-(1-\lambda _i)q} \varphi '_i(s) a_{u, i} \right| \\&\quad \le |a_{u, +}| + |r| R^{2l } |a_{u,1}| + |r| R^{2l } \sum _{i=n'+1}^n R^{-(1-\lambda _i)q} | \varphi '_i(s)| |a_{u,i}| \\&\quad \le C + R^{2l } C R^{-l} + R^{2l } \sum _{i = n' + 1}^n R^{-l} C_1 C \\&\quad \le C + R^{2l } C R^{-l } + R^{2l } n R^{-l} C_1 C \\&\quad \le C_2 R^{l} \end{aligned}$$

where \( C_2 = 2C + n C_1 C >0\). This implies that for any \(u = 1, \dots , L\), we have that

$$\begin{aligned} g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s_1)){\mathbf{a}}_u \in z(\mathfrak {k})([-C_2 R^l, C_2 R^l] \times [-C R^{-l}, C R^{-l}] \times [-C, C]^{n-1}). \end{aligned}$$

Let us consider the range of \(g_{{\mathbf{r}}}(q -l) U(\varvec{{\varphi }}(s_1)){\mathbf{a}}_u= g_{{\mathbf{r}}}(-l) g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s_1)){\mathbf{a}}_u\). Let us write \(g_{{\mathbf{r}}}(-l) = d_2 (l) d_1(l)\) where

$$\begin{aligned} d_1(l) = \begin{bmatrix} b^{-l}&~&~&~&~ \\ ~&b^{r_1 l} \mathrm {I}_{ n'}&~&~&~ \\ ~&~&b^{r_{n' + 1} l}&~&~ \\ ~&~&~&\ddots&~ \\ ~&~&~&~&b^{r_n l}\end{bmatrix}, \end{aligned}$$

and

$$\begin{aligned} d_2(l) = \begin{bmatrix} 1&~&~&~&~&~ \\ ~&1&~&~&~&~ \\ ~&~&b^{-( r_1 - r_2)l}&~&~&~ \\ ~&~&~&\ddots&~&~ \\ ~&~&~&~&b^{-(r_1 - r_{n'})l}&~ \\ ~&~&~&~&~&\mathrm {I}_{n - n'} \end{bmatrix}. \end{aligned}$$

Then we have that

$$\begin{aligned} g_{{\mathbf{r}}}(q -l) U(\varvec{{\varphi }}(s_1)){\mathbf{a}}_u \in d_2(l) d_1(l) z(\mathfrak {k})([-C_2 R^l, C_2 R^l] \times [-C R^{-l}, C R^{-l}] \times [-C, C]^{n-1}). \end{aligned}$$

By the definition of \(z(\mathfrak {k})\), we have that \(d_1(l) z(\mathfrak {k}) = z(\mathfrak {k}) d_1(l)\). Therefore, we have that

$$\begin{aligned}&d_1(l) z(\mathfrak {k})([-C_2 R^l, C_2 R^l] \times [-C R^{-l}, C R^{-l}] \times [-C, C]^{n-1}) \\&\quad = z(\mathfrak {k}) d_1 (l) ([-C_2 R^l, C_2 R^l] \times [-C R^{-l}, C R^{-l}] \times [-C, C]^{n-1}) \\&\quad = z(\mathfrak {k}) \left( [-C_2 b^{r_1 l} , C_2 b^{r_1 l}] \times [-C b^{-l} , C b^{-l} ]\times [-C b^{r_1 l} , C b^{r_1 l}]^{n' -1} \times \prod _{i=n'+1}^n [-C b^{r_i l} , C b^{r_i l}]\right) \\&\quad \subset z(\mathfrak {k})\left( [-C_2 b^{r_1 l} , C_2 b^{r_1 l}] \times [-1 , 1 ]\times [-C b^{r_1 l} , C b^{r_1 l}]^{n' -1}\times \prod _{i=n'+1}^n [-C b^{r_i l} , C b^{r_i l}]\right) . \end{aligned}$$
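The conversion between powers of R and powers of b above is just the relation \(b = R^{\frac{1}{1+r_1}}\), i.e. \(R = b^{1+r_1}\); for instance,

$$\begin{aligned} R^l b^{-l} = b^{(1+r_1)l} b^{-l} = b^{r_1 l}, \qquad R^{-l} b^{r_1 l} = b^{-(1+r_1)l} b^{r_1 l} = b^{-l}. \end{aligned}$$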

It is easy to see that

$$\begin{aligned} z(\mathfrak {k})\left( [-C_2 b^{r_1 l} , C_2 b^{r_1 l}] \times [-1 , 1 ]\times [-C b^{r_1 l} , C b^{r_1 l}]^{n' -1} \times \prod _{i=n'+1}^n [-C b^{r_i l} , C b^{r_i l}]\right) \end{aligned}$$

can be covered by a collection \({\mathcal {B}}\) of \( O(b^{\lambda l})\) balls of radius 1 where \(\lambda = n' r_1 + \sum _{i = n' +1}^n r_i\). Then we have that

$$\begin{aligned} g_{{\mathbf{r}}}(q-l)U(\varvec{{\varphi }}(s_1)){\mathbf{a}}_u&\in d_2(l)\bigcup _{B \in {\mathcal {B}}} B \\&= \bigcup _{B \in {\mathcal {B}}} d_2(l)B. \end{aligned}$$
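Let us briefly justify the covering count \(\sharp {\mathcal {B}} = O(b^{\lambda l})\) used above (a sketch). The rotation \(z(\mathfrak {k})\) does not affect covering numbers, and the remaining set is a box whose side lengths are \(2C_2 b^{r_1 l}\), 2, \(2C b^{r_1 l}\) (repeated \(n'-1\) times) and \(2C b^{r_i l}\) for \(i = n'+1, \dots , n\). A box with side lengths \(L_1, \dots , L_{n+1}\) can be covered by \(O\left( \prod _j (1+L_j)\right) \) balls of radius 1, and here

$$\begin{aligned} \prod _j (1+L_j) \ll b^{r_1 l} \cdot \left( b^{r_1 l}\right) ^{n'-1} \cdot \prod _{i=n'+1}^n b^{r_i l} = b^{\left( n' r_1 + \sum _{i=n'+1}^n r_i\right) l} = b^{\lambda l}. \end{aligned}$$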

Since \(d_2(l)\) is a contracting map, for every \(B \in {\mathcal {B}}\), there exists a ball \(B'\) of radius C such that \(d_2(l) B \subset B'\). Let \({\mathcal {B}}'\) denote the collection of all such \(B'\)’s. Then we have that

$$\begin{aligned} g_{{\mathbf{r}}}(q-l)U(\varvec{{\varphi }}(s_1)){\mathbf{a}}_u \in \bigcup _{B' \in {\mathcal {B}}'} B'. \end{aligned}$$

Since \(g_{{\mathbf{r}}}(q-l)U(\varvec{{\varphi }}(s_1)){\mathbf{a}}_u \in g_{{\mathbf{r}}}(q-l)U(\varvec{{\varphi }}(s_1)){\mathbb {Z}}^{n+1}\), we have that

$$\begin{aligned} g_{{\mathbf{r}}}(q-l)U(\varvec{{\varphi }}(s_1)){\mathbf{a}}_u \in \bigcup _{B' \in {\mathcal {B}}'} B' \cap \Lambda , \end{aligned}$$

where \(\Lambda = g_{{\mathbf{r}}}(q-l)U(\varvec{{\varphi }}(s_1)){\mathbb {Z}}^{n+1}\). By our assumption, \(s_1 \in I_{q-1, 1}\) for some \(I_{q-1, 1} \in {\mathcal {I}}_{q-1}\). This implies that \(s_1 \in I_{q-l}\) for some \(I_{q-l} \in {\mathcal {I}}_{q-l}\). Therefore, \(\Lambda = g_{{\mathbf{r}}}(q-l) U(\varvec{{\varphi }}(s_1)){\mathbb {Z}}^{n+1} \in K_{\kappa }\), i.e., \(\Lambda \) does not contain any nonzero vectors with norm \(\le \kappa \). Therefore, there exists a constant \(C_4\) such that every ball of radius 1 contains at most \(C_4 \kappa ^{-n-1} = C_4 R^{(n+1)k}\) points in \(\Lambda \). Thus, we have that

$$\begin{aligned} \sharp {\mathcal {D}}_{q,l}(I_p)&= \sharp \{g_{{\mathbf{r}}}(q-l)U(\varvec{{\varphi }}(s_1)){\mathbf{a}}_u : 1 \le u \le L\} \le \sum _{B' \in {\mathcal {B}}'} \sharp (B'\cap \Lambda ) \\&\le \sum _{B' \in {\mathcal {B}}'} C_4 R^{(n+1)k} \\&\le C_5 b^{\lambda l + 4n k} \le C_5 b^{(\lambda + \frac{1}{200n})l}, \end{aligned}$$

where \(C_5 = C_3 C_4\) and \(\lambda = n' r_1 + \sum _{i= n' +1}^n r_i\). Here we used that \(R^{(n+1)k} = b^{(1+r_1)(n+1)k} \le b^{4nk}\) and that \(4nk \le \frac{l}{200n}\), which holds since \(l > 2000 n^2 N k\). Now let us estimate \(\lambda \). In fact,

$$\begin{aligned} \lambda&= \sum _{i=1}^n r_i + \sum _{i=1}^{n'} (r_1 - r_i)\\&= 1 + \sum _{i=1}^{n'} (r_1 - r_i) . \end{aligned}$$

By our assumption, for \(i=1, \dots , n'\), we have that \(r_1 - r_i \le \frac{l}{q} \le \frac{1}{100 n^2}\). Therefore, we have that

$$\begin{aligned} \lambda \le 1 + n \frac{1}{100n^2} = 1+ \frac{1}{100 n}. \end{aligned}$$

Thus, we have that

$$\begin{aligned} \sharp {\mathcal {D}}_{q,l}(I_p) \le C_5 b^{\left( 1+ \frac{1}{100n} + \frac{1}{200n}\right) l} \le C_5 R^{\left( 1- \frac{1}{10n}\right) l}. \end{aligned}$$

The last inequality above holds because \(b = R^{\frac{1}{1+r_1}} \le R^{\frac{n}{n+1}}\).
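For completeness, here is the exponent comparison behind the last inequality. Recall that \(r_1 \ge r_i\) for all i (this is implicit, e.g., in the fact that \(d_2(l)\) is a contraction), so \(r_1 \ge \frac{1}{n}\) and hence \(b = R^{\frac{1}{1+r_1}} \le R^{\frac{n}{n+1}}\); moreover

$$\begin{aligned} \frac{n}{n+1}\left( 1+ \frac{1}{100n} + \frac{1}{200n}\right) = \left( 1 - \frac{1}{n+1}\right) \left( 1 + \frac{3}{200n}\right) \le 1 + \frac{3}{200n} - \frac{1}{2n} = 1 - \frac{97}{200n} \le 1 - \frac{1}{10n}, \end{aligned}$$

so that \(b^{\left( 1+ \frac{1}{100n} + \frac{1}{200n}\right) l} \le R^{\left( 1- \frac{1}{10n}\right) l}\).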

This completes the proof. \(\square \)

5 Proof of the Main Result

In this section we will finish the proof of Proposition 3.7. By our discussion in Sects. 1 and 3, Proposition 3.7 implies Theorem 3.5, and thus Theorems 1.7 and 1.8.

The structure of the section is as follows. In the first subsection, we will prove Proposition 3.7 for the case \(q \le 10^6 n^4 N k\). The second, third and fourth subsections are devoted to the proof for the case \(q > 10^6 n^4 N k\). The key point is to estimate \(F(\hat{{\mathcal {I}}}_{q, p}, I_p)\) for \(I_p \in {\mathcal {I}}_p\). The second subsection deals with the case \(p = q - 4000 n^2 N k\). The third subsection deals with the case \(p = q - 2l\) where \( 2000 n^2 N k< l < 2 \eta ' q\). The fourth subsection deals with the case \(p = 0\).

The third and fourth subsections contain some technical results on the canonical representation of \(\mathrm {SL}(n+1, {\mathbb {R}})\) on \(\bigwedge ^i V\) for \(i = 2, \dots , n\). They are also among the main technical contributions of this paper.

Our basic tool is the following non-divergence theorem due to Kleinbock:

Theorem 5.1

(see [Kle08, Theorem 2.2]). There exist constants \(C, \alpha >0\) such that the following holds: For any \(g \in \mathrm {SL}(n+1,{\mathbb {R}})\), any one-parameter unipotent subgroup \(U = \{u(r) : r \in {\mathbb {R}}\} \subset \mathrm {SL}(n+1, {\mathbb {R}})\), any \(R >0\) and any \(\rho > 0\), if for any \(i= 1,2,\dots , n\) and any \({\mathbf{v}} = {\mathbf{v}}_1 \wedge \cdots \wedge {\mathbf{v}}_i \in \bigwedge ^i {\mathbb {Z}}^{n+1} \setminus \{{\mathbf{0}}\}\),

$$\begin{aligned} \max \{ \Vert u(r)g{\mathbf{v}}\Vert : r \in [-R, R] \} \ge \rho ^i, \end{aligned}$$

then for any \(0< \epsilon <\rho \),

$$\begin{aligned} m \left( \{ r \in [-R,R] : u(r) g {\mathbb {Z}}^{n+1} \notin K_{\epsilon } \} \right) \le C \left( \frac{\epsilon }{\rho } \right) ^{\alpha } R. \end{aligned}$$

We will also need the following important result due to Kleinbock and Margulis [KM98].

Theorem 5.2

(see [KM98, Proposition 2.3]). Let \(\varvec{{\varphi }}: [0,1] \rightarrow {\mathbb {R}}^n\) be a \(C^n\) non-degenerate curve. Then there exists a constant \(\alpha >0\) such that for any \(s \in [0,1]\) there exists an interval J centered at s and positive constants D and \(\rho \) such that for any \(t \ge 0\) and \(0< \epsilon < \rho \) one has

$$\begin{aligned} m\left( \{ s' \in J: g_{{\mathbf{r}}}(t) u(\varvec{{\varphi }}(s')){\mathbb {Z}}^{n+1} \not \in K_{\epsilon } \} \right) \le D \left( \frac{\epsilon }{\rho }\right) ^{\alpha } m(J). \end{aligned}$$

Remark 5.3

The exact statement in [KM98, Proposition 2.3] is more general than the above theorem. For example, the statement holds for any \(C^n\) differentiable non-degenerate submanifold.

From Theorem 5.2, one can easily deduce the following corollary:

Corollary 5.4

Let \(\varvec{{\varphi }}: [0,1] \rightarrow {\mathbb {R}}^n\) be a \(C^n\) non-degenerate curve. Then there exist constants \(C>0\), \(\alpha >0\) and \(0<\rho _1<1\) such that for any \(t \ge 0\) and \(0< \epsilon < \rho _1\) one has

$$\begin{aligned} m\left( \{ s \in [0,1]: g_{{\mathbf{r}}}(t) u(\varvec{{\varphi }}(s)){\mathbb {Z}}^{n+1} \not \in K_{\epsilon } \}\right) \le C \left( \frac{\epsilon }{\rho _1}\right) ^{\alpha }. \end{aligned}$$

Proof

For any \(s \in [0,1]\), one can find the corresponding interval \(J = J(s)\), constants \(D(s) >0\) and \(\rho (s) >0\) arising from Theorem 5.2. Then \(\{J(s): s \in [0,1]\}\) is an open covering of [0, 1]. Since [0, 1] is compact, there is a finite covering \(\{J(s_i): i = 1, 2, \dots , M\}\). Without loss of generality, we may assume that \(m(J(s_i)) \le 2\). Let us choose \(\rho _1 := \min \{ \rho (s_i): i = 1,2, \dots , M\}\) and \(C := 2 M \max \{D(s_i): i=1,2,\dots , M\} \). Then for any \(t \ge 0\) and \(0<\epsilon < \rho _1\), we have that

$$\begin{aligned} E_{t, \epsilon } \subset \bigcup _{i=1}^M E_{t, \epsilon }\cap J(s_i) \end{aligned}$$

where \(E_{t, \epsilon } := \{ s \in [0,1]: g_{{\mathbf{r}}}(t) u(\varvec{{\varphi }}(s)){\mathbb {Z}}^{n+1} \not \in K_{\epsilon } \}\). By Theorem 5.2, for any \(i=1,2,\dots , M\), we have that

$$\begin{aligned} m(E_{t, \epsilon }\cap J(s_i)) \le D(s_i)\left( \frac{\epsilon }{\rho (s_i)}\right) ^{\alpha } m(J(s_i)) \le D(s_i) \left( \frac{\epsilon }{\rho _1}\right) ^{\alpha } \cdot 2 . \end{aligned}$$

Therefore, we have that

$$\begin{aligned} m(E_{t, \epsilon }) \le \sum _{i=1}^M 2 D(s_i) \left( \frac{\epsilon }{\rho _1}\right) ^{\alpha } \le C \left( \frac{\epsilon }{\rho _1}\right) ^{\alpha }. \end{aligned}$$

This completes the proof. \(\square \)

Later in this paper, we will choose \(0< \rho < 1\) such that \(C\left( \frac{2 \rho }{\rho _1}\right) ^{\alpha } < \frac{1}{1000}\).
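Such a choice of \(\rho \) is always possible. For instance (a sketch, with C, \(\alpha \) and \(\rho _1\) the constants from Corollary 5.4), one may take

$$\begin{aligned} \rho = \frac{\rho _1}{4} \min \left\{ 1, (1000\, C)^{-1/\alpha }\right\} , \end{aligned}$$

for which \(C \left( \frac{2 \rho }{\rho _1}\right) ^{\alpha } = \frac{C}{2^{\alpha }} \min \left\{ 1, (1000\, C)^{-1}\right\} < \frac{1}{1000}\) and \(0< \rho< \rho _1 < 1\).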

5.1 The case where q is small.

In this subsection, let us assume that \(q \le 10^6 n^4 N k\). Then \(\hat{{\mathcal {I}}}_{q,0} = \hat{{\mathcal {I}}}_q\) and \(\hat{{\mathcal {I}}}_{q,p} = \emptyset \) for all other p.

Proposition 5.5

$$\begin{aligned} F(\hat{{\mathcal {I}}}_{q, 0}, I) \ll R^{q - \alpha k}. \end{aligned}$$

Proof

By Corollary 5.4, we have that for \(\kappa = R^{-k} >0\) with \(2\kappa < \rho \), the following holds:

$$\begin{aligned} m(\{s \in [0,1]: g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s)){\mathbb {Z}}^{n+1} \notin K_{2\kappa } \}) \le C \left( \frac{2\kappa }{\rho }\right) ^{\alpha }. \end{aligned}$$

On the other hand, by the definition of \(\hat{{\mathcal {I}}}_q\), for any \(I_q \in \hat{{\mathcal {I}}}_q\), there exists \(s \in I_q\) such that

$$\begin{aligned} g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s)) {\mathbb {Z}}^{n+1} \in X \setminus K_{ \kappa }. \end{aligned}$$

Since \(g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(I_q)) {\mathbb {Z}}^{n+1}\) is contained in the 1-neighborhood of \(g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s)) {\mathbb {Z}}^{n+1}\), we have

$$\begin{aligned} g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(I_q)) {\mathbb {Z}}^{n+1} \subset X \setminus K_{2 \kappa }. \end{aligned}$$

Therefore, we have that

$$\begin{aligned} F(\hat{{\mathcal {I}}}_{q,0}, I) R^{-q}&= m\left( \bigcup _{I_q \in \hat{{\mathcal {I}}}_{q,0}} I_q \right) \\&\le m(\{s \in I: g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s)){\mathbb {Z}}^{n+1} \notin K_{2\kappa } \}) \le C_6 \kappa ^{\alpha } = C_6 R^{-\alpha k} \end{aligned}$$

where \(C_6 = C \left( \frac{2}{\rho } \right) ^{\alpha }\). This finishes the proof. \(\square \)

Let us choose \(R >1\) such that \(R^{\alpha } > 1000^{10^6 n^4 N}\).

Proof of Proposition 3.7 for \(q \le 10^6 n^4 N k\). It suffices to show that

$$\begin{aligned} \left( \frac{4}{R} \right) ^q F(\hat{{\mathcal {I}}}_{q,0}, I) \end{aligned}$$

can be arbitrarily small. In fact, by Proposition 5.5, we have that

$$\begin{aligned} \left( \frac{4}{R} \right) ^q F(\hat{{\mathcal {I}}}_{q,0}, I)&= \left( \frac{4}{R} \right) ^q O(R^{q - \alpha k}) \\&= O\left( \frac{4^q}{ R^{\alpha k}}\right) = O\left( \frac{4^{10^6 n^4 N k}}{R^{\alpha k}}\right) = O\left( \left( \frac{4}{1000}\right) ^{10^6 n^4 N k}\right) . \end{aligned}$$

Then it is easy to see that \(\left( \frac{4}{R} \right) ^q F(\hat{{\mathcal {I}}}_{q,0}, I) \rightarrow 0\) as \(k \rightarrow \infty \).

This completes the proof for \(q \le 10^6 n^4 N k\). \(\square \)

5.2 The generic case.

The rest of the section is devoted to the proof of Proposition 3.7 for \(q > 10^6 n^4 N k\). In the following subsections, we will estimate \(F(\hat{{\mathcal {I}}}_{q,p}, I_p)\) for different p’s. In this subsection we will estimate \(F(\hat{{\mathcal {I}}}_{q,p}, I_p)\) for \(p = q - 4000 n^2 N k\). We call it the generic case.

Proposition 5.6

Let \(q > 10^6 n^4 N k\) and \(p = q- 4000 n^2 N k\). Then for any \(I_p \in {\mathcal {I}}_p\), we have that

$$\begin{aligned} F(\hat{{\mathcal {I}}}_{q,p}, I_p) \ll R^{q-p - \alpha k} . \end{aligned}$$

Proof

Let us fix \(I_p \in {\mathcal {I}}_p\). If \(F(\hat{{\mathcal {I}}}_{q,p}, I_p) = 0\), then the statement trivially holds.

Suppose \(F(\hat{{\mathcal {I}}}_{q,p}, I_p) >0\). Let us take \(I_q \in \hat{{\mathcal {I}}}_{q,p}\) and \(s \in I_q\cap I_p\). Without loss of generality, we may assume that \([s- R^{-q+ 2000 n^2 N k}, s + R^{-q + 2000 n^2 N k}] \subset I_p\). If this is not the case, we can replace \(I_p\) with a slightly larger interval \(I'_p \supset I_p\) such that \([s- R^{-q+ 2000 n^2 N k}, s + R^{-q + 2000 n^2 N k}] \subset I'_p\) and \(m(I'_p) < 2 m(I_p)\), and proceed with the same argument. Then for any \(i = 1, \dots , n\) and \({\mathbf{v}} = {\mathbf{v}}_1 \wedge \cdots \wedge {\mathbf{v}}_i \in \bigwedge ^i {\mathbb {Z}}^{n+1}\setminus \{{\mathbf{0}}\}\), we have that

$$\begin{aligned} \max \{\Vert g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s')) {\mathbf{v}}\Vert : s' \in [s- R^{-q+ 2000 n^2 N k}, s + R^{-q + 2000 n^2 N k}]\} \ge \rho ^i. \end{aligned}$$

Therefore, we have that

$$\begin{aligned} \max \{ \Vert g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s')) {\mathbf{v}}\Vert : s' \in I_p \} \ge \rho ^i. \end{aligned}$$

On the other hand, as we explained in the proof of Proposition 4.1, we can approximate \(\varvec{{\varphi }}(I_p)\) by its linear part, that is to say, for any \(s' \in I_p\), we approximate \(\varvec{{\varphi }}(s')\) by \( \varvec{{\varphi }}(s) + (s'-s)\varvec{{\varphi }}'(s)\). For \(s' \in I_p\), let us write \(s' = s + r R^{-q + 4000 n^2 N k}\) where \(r \in [-1,1]\) and denote \(g = g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s))\). Then

$$\begin{aligned} g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s'))&= g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s') - \varvec{{\varphi }}(s)) g_{{\mathbf{r}}}(-q) g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s)) \\&= g_{{\mathbf{r}}}(q) U(r R^{-q + 4000 n^2 N k} \varvec{{\varphi }}'(s)) g_{{\mathbf{r}}}(-q) g \\&= U(r R^{4000 n^2 N k} {\mathbf{h}}) g, \end{aligned}$$

where \({\mathbf{h}} = \varphi '_1(s) {\mathbf{e}}_1 + \sum _{i=2}^n R^{-(1-\lambda _i)q} \varphi '_i(s) {\mathbf{e}}_i\). Recall that \(\lambda _i = \frac{1+r_i}{1+ r_1}\). Since \(\{U(rR^{4000 n^2 N k} {\mathbf{h}} ): r \in {\mathbb {R}}\}\) is a one parameter unipotent subgroup, by Theorem 5.1, we have that

$$\begin{aligned} m(\{ r \in [-1,1]: U(r R^{4000 n^2 N k} {\mathbf{h}}) g {\mathbb {Z}}^{n+1} \notin K_{2\kappa }\}) \le 2C \left( \frac{2\kappa }{\rho }\right) ^{\alpha } . \end{aligned}$$

This implies that

$$\begin{aligned} m(\{ s \in I_p: g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s)){\mathbb {Z}}^{n+1} \notin K_{2\kappa }\}) \le 2C \left( \frac{2\kappa }{\rho }\right) ^{\alpha } m(I_p). \end{aligned}$$

On the other hand, it is easy to see that \(g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(I_q)){\mathbb {Z}}^{n+1} \subset X \setminus K_{2\kappa }\) for any \(I_q \in \hat{{\mathcal {I}}}_q\). Therefore we have that

$$\begin{aligned} F(\hat{{\mathcal {I}}}_{q, p}, I_p) R^{-q}&\le m(\{ s \in I_p: g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s)){\mathbb {Z}}^{n+1} \notin K_{2\kappa }\}) \\&\le 2C \left( \frac{2\kappa }{\rho } \right) ^{\alpha } m(I_p) \\&= 2 C \left( \frac{2}{\rho }\right) ^{\alpha } \kappa ^{\alpha } R^{-p} = C_7 R^{-p - \alpha k} \end{aligned}$$

where \(C_7 = 2 C \left( \frac{2}{\rho }\right) ^{\alpha }\). This proves the statement. \(\square \)

By Proposition 5.6, we have that for \(p = q - 4000 n^2 N k\) and any \(I_p \in {\mathcal {I}}_p\), the following holds:

$$\begin{aligned} \left( \frac{4}{R}\right) ^{q-p} F(\hat{{\mathcal {I}}}_{q,p}, I_p) \ll \left( \frac{4}{R}\right) ^{q-p} R^{q-p - \alpha k} = \frac{4^{4000 n^2 N k}}{R^{\alpha k}} \le \left( \frac{4}{1000}\right) ^{4000 n^2 N k}. \end{aligned}$$
(5.1)

Then it is easy to see that \(\left( \frac{4}{R}\right) ^{q-p} F(\hat{{\mathcal {I}}}_{q,p}, I_p) \rightarrow 0\) as \( k \rightarrow \infty \).

5.3 Dangerous case.

In this subsection, we will consider the case where \(2000 n^2 N k< l < 2\eta ' q\) and \(p = q - 2l\). We call this case the \((q,l)\)-dangerous case.

Proposition 5.7

For any \(I_p \in {\mathcal {I}}_p\), we have that

$$\begin{aligned} F(\hat{{\mathcal {I}}}_{q,p}, I_p) \ll R^{ q - p -\frac{l}{20n} }. \end{aligned}$$

Let us recall that for \(1000 n^2 N k< l' < \eta ' q\), a \((q,l')\)-dangerous interval \(\Delta _{q,l'}({\mathbf{a}})\) associated with a nonzero integer vector \({\mathbf{a}} \in {\mathbb {Z}}^{n+1}\) is a closed interval of the form

$$\begin{aligned} \Delta _{q,l'}({\mathbf{a}}) = [ s - R^{-q + l'}, s + R^{-q + l'}] \end{aligned}$$

such that \(I_q \subset \Delta _{q,l'}({\mathbf{a}})\) for some \(I_q \in \hat{{\mathcal {I}}}_q\),

$$\begin{aligned} \max \{ \Vert g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s')){\mathbf{a}}\Vert : s' \in \Delta _{q,l'}({\mathbf{a}}) \} < \rho \end{aligned}$$

and

$$\begin{aligned} \max \{ \Vert g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s')){\mathbf{a}}\Vert : s' \in [ s - R^{-q + l'+1}, s + R^{-q + l'+1}] \} \ge \rho . \end{aligned}$$

The following lemma is crucial to prove Proposition 5.7 and is one of the main technical contributions of this paper:

Lemma 5.8

For any \(i = 1, \dots , n\) and \(I_q \in \hat{{\mathcal {I}}}_{q,p}(i)\) intersecting \(I_p\), one of the following two cases holds:

Case 1:

there exists a \((q,l')\)-dangerous interval \(\Delta _{q, l'} ({\mathbf{a}})\) containing \(I_q\) for some \( l/2 \le l' \le l\);

Case 2:

there exist \(s \in I_q\) and

$$\begin{aligned} {\mathbf{v}} = {\mathbf{v}}_1 \wedge \cdots \wedge {\mathbf{v}}_i \in \bigwedge \nolimits ^i {\mathbb {Z}}^{n+1}\setminus \{{\mathbf{0}}\} \end{aligned}$$

such that if we write

$$\begin{aligned} g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s)){\mathbf{v}} = {\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)} + {\mathbf{w}}^{(i)} \end{aligned}$$

where \({\mathbf{w}}^{(i-1)} \in \bigwedge ^{i-1} W\) and \({\mathbf{w}}^{(i)} \in \bigwedge ^i W\), then we have that \(\Vert {\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)}\Vert = \Vert {\mathbf{w}}^{(i-1)} \Vert < \rho ^i\) and \(\Vert {\mathbf{w}}^{(i)}\Vert \le \rho ^i R^{-l/2}\).

Proof

If \(i = 1\), then the first case holds. We may assume that \(i \ge 2\).

By the definition of \(\hat{{\mathcal {I}}}_{q,p}(i)\), there exists \({\mathbf{v}} = {\mathbf{v}}_1 \wedge \cdots \wedge {\mathbf{v}}_i \in \bigwedge ^i {\mathbb {Z}}^{n+1}\setminus \{{\mathbf{0}}\}\) such that for any \(s \in I_q\),

$$\begin{aligned} \max \{ \Vert g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s')) {\mathbf{v}}\Vert : s' \in [s- R^{-q+l}, s+ R^{-q +l} ] \} < \rho ^i \end{aligned}$$

and

$$\begin{aligned} \max \{ \Vert g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s')) {\mathbf{v}}\Vert : s' \in [s- R^{-q+l+1}, s+ R^{-q +l+1} ] \} \ge \rho ^i. \end{aligned}$$

Without loss of generality, we may assume that the sublattice \(L_i\) generated by \(\{{\mathbf{v}}_1, \dots , {\mathbf{v}}_i \}\) is a primitive i-dimensional sublattice of \({\mathbb {Z}}^{n+1}\). Then \(\Lambda _i = g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s)) L_i\) is a primitive i-dimensional sublattice of \(\Lambda = g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s)){\mathbb {Z}}^{n+1}\). For simplicity, let us denote \(g = g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s))\). Let us choose a Minkowski reduced basis \(\{g{\mathbf{v}}'_1, \dots , g{\mathbf{v}}'_i\}\) of \(\Lambda _i\). Since

$$\begin{aligned} d(\Lambda _i) = \Vert g{\mathbf{v}}\Vert < \rho ^i, \end{aligned}$$

we have that \(\Vert g{\mathbf{v}}'_1\Vert < \rho \) by the Minkowski Theorem.

Let us repeat the argument in the proof of Proposition 4.1. Recall that for \(j = 1,\dots , n \), \(\lambda _j = \frac{1+ r_j}{1+ r_1}\). Let \(1 \le n' \le n\) be the largest index j such that \((1- \lambda _j) q \le l\). By Standing Assumption A.2, we have that \(c_1 \le |\varphi '_i (s)| \le C_1\) for any \(i = 1, \dots , n\) and \(s \in [0,1]\). Fix any \(s \in I_q\) and let \({\mathbf{h}} = \sum _{i=1}^{n'} R^{-(1-\lambda _i)q} \varphi '_i(s){\mathbf{e}}_i\). For any \(s' \in [ s - R^{-q + l}, s + R^{-q + l}]\), let us write \(s' = s + r R^{-q + l}\) where \( r \in [-1, 1]\). By the same argument as in the proof of Proposition 4.1, we have that

$$\begin{aligned} g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s')) = U(O(1)) U(r R^l {\mathbf{h}}) g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s)) = U(O(1)) U(r R^l {\mathbf{h}}) g. \end{aligned}$$

Therefore, we have that

$$\begin{aligned} \Vert U(r R^l {\mathbf{h}}) g{\mathbf{v}} \Vert < \rho ^i \end{aligned}$$

for any \(r \in [-1,1]\).

Following the notation in the proof of Proposition 4.1, let us denote \({\mathbf{h}} = \mathfrak {k} \cdot {\mathbf{e}}_1\) for \(\mathfrak {k} \in \mathrm {SO}(n)\) and

$$\begin{aligned} z(\mathfrak {k}) = \begin{bmatrix} 1&~ \\ ~&\mathfrak {k} \end{bmatrix} \in Z. \end{aligned}$$

For \( j = 1, \dots , i\), let us write

$$\begin{aligned} g {\mathbf{v}}'_j = a_+ (j) {\mathbf{w}}_+ + a_1(j) z(\mathfrak {k}){\mathbf{w}}_1 + {\mathbf{w}}'(j) \end{aligned}$$

where \({\mathbf{w}}' (j) \in z(\mathfrak {k}) W_2\). Then

$$\begin{aligned} g {\mathbf{v}}&= (g{\mathbf{v}}'_1)\wedge \cdots \wedge (g{\mathbf{v}}'_i) \\&= \bigwedge _{j=1}^i (a_+ (j) {\mathbf{w}}_+ + a_1(j) z(\mathfrak {k}){\mathbf{w}}_1 + {\mathbf{w}}'(j) ) \\&= {\mathbf{w}}_+ \wedge (z(\mathfrak {k}){\mathbf{w}}_1) \wedge \left( \sum _{j <j'} \epsilon _{+,1}(j,j') a_+(j) a_1(j') \bigwedge _{k \ne j,j'} {\mathbf{w}}'(k) \right) \\&\quad +\, {\mathbf{w}}_+ \wedge \left( \sum _{j=1}^i \epsilon _+(j) a_+(j) \bigwedge _{k \ne j} {\mathbf{w}}'(k) \right) + (z(\mathfrak {k}){\mathbf{w}}_1)\wedge \left( \sum _{j=1}^i \epsilon _1(j) a_1(j) \bigwedge _{k \ne j} {\mathbf{w}}'(k) \right) \\&\quad +\, \bigwedge _{j=1}^i {\mathbf{w}}'(j) \end{aligned}$$

where \(\epsilon _{+,1}(j,j'), \epsilon _+(j), \epsilon _1(j) \in \{\pm 1\}\) for every \(j,j' \in \{1, \dots , i\}\). By our discussion in Sect. 2.2 on the representation of \(\mathrm {SL}(2, {\mathbb {R}})\) on \(\bigwedge ^i V\), we have that

$$\begin{aligned} U( r R^l {\mathbf{h}})g {\mathbf{v}}&= {\mathbf{w}}_+ \wedge (z(\mathfrak {k}){\mathbf{w}}_1) \wedge \left( \sum _{j <j'} \epsilon _{+,1}(j,j') a_+(j) a_1(j') \bigwedge _{k \ne j,j'} {\mathbf{w}}'(k) \right) \\&\quad +\, {\mathbf{w}}_+ \wedge \left( \sum _{j=1}^i \epsilon _+(j) a_+(j) \bigwedge _{k \ne j} {\mathbf{w}}'(k) \right) \\&\quad +\, r R^l {\mathbf{w}}_+ \wedge \left( \sum _{j=1}^i \epsilon _1(j) a_1(j) \bigwedge _{k \ne j} {\mathbf{w}}'(k) \right) \\&\quad +\, (z(\mathfrak {k}){\mathbf{w}}_1)\wedge \left( \sum _{j=1}^i \epsilon _1(j) a_1(j) \bigwedge _{k \ne j} {\mathbf{w}}'(k) \right) + \bigwedge _{j=1}^i {\mathbf{w}}'(j) . \end{aligned}$$

Since \(\Vert U(r R^l {\mathbf{h}}) g {\mathbf{v}}\Vert < \rho ^i\) for any \(r \in [-1,1]\), we have that

$$\begin{aligned} \left\| \sum _{j=1}^i \epsilon _1(j) a_1(j) \bigwedge _{k \ne j} {\mathbf{w}}'(k)\right\| \le \rho ^i R^{-l}. \end{aligned}$$
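To see this, note that in the expansion of \(U(rR^l{\mathbf{h}})g{\mathbf{v}}\) above, only the third term depends on r. Comparing the values at \(r = 1\) and \(r = -1\), and using that \(\Vert {\mathbf{w}}_+ \wedge {\mathbf{x}}\Vert = \Vert {\mathbf{x}}\Vert \) for \({\mathbf{x}} \in \bigwedge ^{i-1} W\) (as in the statement of the lemma), we get

$$\begin{aligned} 2 R^l \left\| \sum _{j=1}^i \epsilon _1(j) a_1(j) \bigwedge _{k \ne j} {\mathbf{w}}'(k)\right\| = \left\| U(R^l{\mathbf{h}})g{\mathbf{v}} - U(-R^l{\mathbf{h}})g{\mathbf{v}}\right\| < 2\rho ^i. \end{aligned}$$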

Let us consider the following two cases:

  (1) \(|a_1(1)| \le R^{-l/2}\).

  (2) \(|a_1(1)| > R^{-l/2}\).

Let us first suppose \(|a_1(1)| \le R^{-l/2}\). Note that \(\Vert g{\mathbf{v}}'_1\Vert < \rho \). Then by repeating the calculation in the proof of Proposition 4.1, we conclude that

$$\begin{aligned} \max \{ \Vert g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s')) {\mathbf{v}}'_1\Vert : s' \in [ s- R^{-q + l/2}, s + R^{-q + l/2} ] \} < \rho . \end{aligned}$$

On the other hand, by the definition of \(\hat{{\mathcal {I}}}_{q, p}(i)\), we have that

$$\begin{aligned} \max \{ \Vert g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s')){\mathbf{v}}'_1\Vert : s' \in [ s - R^{-q+l+1}, s + R^{-q+l+1}] \} \ge \rho . \end{aligned}$$

This implies that \(I_q \subset \Delta _{q,l'}({\mathbf{v}}'_1)\) for some \(l/2 \le l' \le l\). This proves the first part of the statement.

Now let us suppose \(|a_1(1)| > R^{-l/2}\). Then we have that

$$\begin{aligned} \epsilon _1(1) a_1(1) \bigwedge _{j=1}^i {\mathbf{w}}'(j)&= {\mathbf{w}}'(1)\wedge \left( \epsilon _1(1) a_1(1) \bigwedge _{k\ne 1} {\mathbf{w}}'(k) \right) \\&= {\mathbf{w}}'(1)\wedge \left( \sum _{j=1}^i \epsilon _1(j) a_1(j) \bigwedge _{k \ne j} {\mathbf{w}}'(k) \right) . \end{aligned}$$

Therefore, we have that

$$\begin{aligned} |a_1 (1)| \left\| \bigwedge _{j=1}^i {\mathbf{w}}'(j)\right\|&= \left\| {\mathbf{w}}'(1)\wedge \left( \sum _{j=1}^i \epsilon _1(j) a_1(j) \bigwedge _{k \ne j} {\mathbf{w}}'(k) \right) \right\| \\&\le \Vert {\mathbf{w}}'(1)\Vert \left\| \sum _{j=1}^i \epsilon _1(j) a_1(j) \bigwedge _{k \ne j} {\mathbf{w}}'(k) \right\| \\&\le \rho \cdot \rho ^i R^{-l} = \rho ^{i+1} R^{-l}. \end{aligned}$$

Since \(|a_1(1)| > R^{-l/2}\) and \(\rho < 1\), we have that

$$\begin{aligned} \left\| \bigwedge _{j=1}^i {\mathbf{w}}'(j)\right\| \le \rho ^{i} R^{-l/2}. \end{aligned}$$

If we write

$$\begin{aligned} g {\mathbf{v}} = {\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)} + {\mathbf{w}}^{(i)} \end{aligned}$$

where \({\mathbf{w}}^{(i-1)} \in \bigwedge ^{i-1} W\) and \({\mathbf{w}}^{(i)} \in \bigwedge ^i W\), then

$$\begin{aligned} {\mathbf{w}}^{(i)} = (z(\mathfrak {k}){\mathbf{w}}_1)\wedge \left( \sum _{j=1}^i \epsilon _1(j) a_1(j) \bigwedge _{k \ne j} {\mathbf{w}}'(k) \right) + \bigwedge _{j=1}^i {\mathbf{w}}'(j). \end{aligned}$$

By our previous argument, we have that

$$\begin{aligned} \Vert {\mathbf{w}}^{(i)}\Vert \le \rho ^i R^{-l/2}. \end{aligned}$$

This proves the second part of the statement. \(\square \)

The following lemma takes care of the second case of Lemma 5.8.

Lemma 5.9

Let \(i \in \{2,\dots , n \}\). Let \({\mathcal {D}}_{q,p}(I_p, i)\) denote the collection of \(I_q \in \hat{{\mathcal {I}}}_{q,p}(i)\) intersecting \(I_p\) and not contained in any \((q,l')\)-dangerous interval with \( l/2 \le l' \le l\). Let

$$\begin{aligned} D_{q,p}(I_p, i) := \bigcup _{I_q \in {\mathcal {D}}_{q,p}(I_p, i)} I_q. \end{aligned}$$

Then for any closed subinterval \(J \subset I_p\) of length \(R^{-q + (1+ \frac{1}{2n})l}\), we have that

$$\begin{aligned} m(D_{q,p}(I_p, i) \cap J) \ll R^{-\frac{l}{20n}} m(J). \end{aligned}$$

Proof

Let us fix a closed subinterval \(J \subset I_p\) of length \(R^{-q + (1+ \frac{1}{2n})l}\).

For any \(s \in I_q \in {\mathcal {D}}_{q,p}(I_p, i)\), there exists \({\mathbf{v}} = {\mathbf{v}}_1 \wedge \cdots \wedge {\mathbf{v}}_i \in \bigwedge ^i {\mathbb {Z}}^{n+1}\setminus \{{\mathbf{0}}\}\) such that

$$\begin{aligned} \max \{ \Vert g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s')){\mathbf{v}}\Vert : s' \in [ s - R^{-q+l}, s + R^{-q + l}] \} < \rho ^i. \end{aligned}$$

Let us denote the interval \([s - R^{-q+l}, s + R^{-q+l}]\) by \(\Delta _{q,l}({\mathbf{v}}, i)\). Then every \(I_q \in {\mathcal {D}}_{q,p}(I_p,i)\) is contained in some \(\Delta _{q,l}({\mathbf{v}},i)\) and every \(\Delta _{q,l}({\mathbf{v}}, i)\) contains at most \(O(R^l)\) different \(I_q \in {\mathcal {D}}_{q,p}(I_p, i)\).
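The counting claim is elementary: each \(\Delta _{q,l}({\mathbf{v}}, i)\) is an interval of length \(2R^{-q+l}\), while (recalling the construction) the intervals of \({\mathcal {I}}_q\) have length \(R^{-q}\) and pairwise disjoint interiors, so the number of \(I_q \in {\mathcal {D}}_{q,p}(I_p, i)\) meeting a fixed \(\Delta _{q,l}({\mathbf{v}}, i)\) is at most

$$\begin{aligned} \frac{2R^{-q+l}}{R^{-q}} + 2 = 2R^l + 2 = O(R^l). \end{aligned}$$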

We will follow the notation used in the proof of Lemma 5.8. Let \(g = g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s))\), \({\mathbf{h}} = \mathfrak {k} \cdot {\mathbf{e}}_1\) and

$$\begin{aligned} z(\mathfrak {k}) = \begin{bmatrix} 1&~ \\ ~&\mathfrak {k} \end{bmatrix} \in Z \end{aligned}$$

be as in the proof of Lemma 5.8. For \(j = 1 , \dots , i\), let us write

$$\begin{aligned} g{\mathbf{v}}_j&= a_+(j){\mathbf{w}}_+ + a_1(j) z(\mathfrak {k}){\mathbf{w}}_1 + {\mathbf{w}}'(j) \\&= a_+(j){\mathbf{w}}_+ + {\mathbf{w}}(j) \end{aligned}$$

where \({\mathbf{w}}'(j) \in z(\mathfrak {k})W_2\) and \({\mathbf{w}}(j) = a_1(j) z(\mathfrak {k}){\mathbf{w}}_1 + {\mathbf{w}}'(j) \in W\). Then

$$\begin{aligned} g {\mathbf{v}}&= {\mathbf{w}}_+ \wedge (z(\mathfrak {k}){\mathbf{w}}_1) \wedge \left( \sum _{j <j'} \epsilon _{+,1}(j,j') a_+(j) a_1(j') \bigwedge _{k \ne j,j'} {\mathbf{w}}'(k) \right) \\&\quad +\, {\mathbf{w}}_+ \wedge \left( \sum _{j=1}^i \epsilon _+(j) a_+(j) \bigwedge _{k \ne j} {\mathbf{w}}'(k) \right) \\&\quad +\, (z(\mathfrak {k}){\mathbf{w}}_1)\wedge \left( \sum _{j=1}^i \epsilon _1(j) a_1(j) \bigwedge _{k \ne j} {\mathbf{w}}'(k) \right) + \bigwedge _{j=1}^i {\mathbf{w}}'(j) . \end{aligned}$$

By Lemma 5.8, we have that

$$\begin{aligned} \left\| (z(\mathfrak {k}){\mathbf{w}}_1)\wedge \left( \sum _{j=1}^i \epsilon _1(j) a_1(j) \bigwedge _{k \ne j} {\mathbf{w}}'(k) \right) \right\| \le \rho ^i R^{-l} \end{aligned}$$

and

$$\begin{aligned} \left\| \bigwedge _{j=1}^i {\mathbf{w}}'(j) \right\| \le \rho ^i R^{-l/2}. \end{aligned}$$

Let us take the collection of all possible \(\Delta _{q,l}({\mathbf{v}}, i)\)’s intersecting J, say

$$\begin{aligned} \{ \Delta _{q,l}({\mathbf{v}}(M), i) = [ s(M) - R^{-q +l}, s(M) + R^{-q + l}] : M = 1, \dots , L \}. \end{aligned}$$

For simplicity, let us denote \(g(M) = g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s(M)))\) for \(M = 1,\dots , L\). Since \(\varvec{{\varphi }}(J)\) can be approximated by its linear part, we have that the corresponding \({\mathbf{h}}\) and \(\mathfrak {k}\) for \(s(M)\) are the same for all \(M = 1, \dots , L\). Then

$$\begin{aligned} g(M) {\mathbf{v}}(M) = {\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)}(M) + (z(\mathfrak {k}){\mathbf{w}}_1) \wedge ({\mathbf{w}}')^{ (i-1)}(M) + {\mathbf{w}}^{(i)}(M) \end{aligned}$$

where \({\mathbf{w}}^{(i-1)}(M) \in \bigwedge ^{i-1}W\), \(({\mathbf{w}}')^{ (i-1)}(M) \in \bigwedge ^{i-1} z(\mathfrak {k})W_2\) and \({\mathbf{w}}^{(i)}(M) \in \bigwedge ^{i} z(\mathfrak {k})W_2\). By our previous discussion, we have that

$$\begin{aligned}&\left\| {\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)}(M) \right\| < \rho ^i, \\&\Vert ({\mathbf{w}}')^{ (i-1)}(M) \Vert = \left\| (z(\mathfrak {k}){\mathbf{w}}_1) \wedge ({\mathbf{w}}')^{ (i-1)}(M) \right\| \le \rho ^i R^{-l}, \end{aligned}$$

and

$$\begin{aligned} \left\| {\mathbf{w}}^{(i)}(M) \right\| \le \rho ^i R^{-l/2}. \end{aligned}$$

Now let us consider \(g(1) {\mathbf{v}}(M)\). Let us write \(s(1) - s(M) = r R^{-q + (1 + \frac{1}{2n})l}\) where \(r \in [ -1, 1]\). By our previous discussion, we have that

$$\begin{aligned} g(1)&= g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s(1))) = U(O(1)) U\left( r R^{\left( 1+ \frac{1}{2n}\right) l} {\mathbf{h}}\right) g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s(M))) \\&= U(O(1)) U\left( r R^{\left( 1+ \frac{1}{2n}\right) l} {\mathbf{h}}\right) g(M). \end{aligned}$$

Therefore, we have that

$$\begin{aligned} g(1){\mathbf{v}}(M) = U(O(1)) U\left( r R^{\left( 1+ \frac{1}{2n}\right) l} {\mathbf{h}}\right) g(M) {\mathbf{v}}(M). \end{aligned}$$

It is easy to see that we can ignore the contribution of U(O(1)) and identify \(g(1){\mathbf{v}}(M)\) with \(U(r R^{(1+ \frac{1}{2n})l} {\mathbf{h}}) g(M) {\mathbf{v}}(M)\). Then we have that

$$\begin{aligned} g(1){\mathbf{v}}(M)&= U\left( r R^{\left( 1+ \frac{1}{2n}\right) l} {\mathbf{h}}\right) g(M) {\mathbf{v}}(M) \\&= {\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)}(M) + r R^{\left( 1+\frac{1}{2n}\right) l} {\mathbf{w}}_+ \wedge ({\mathbf{w}}')^{ (i-1)}(M) \\&\quad +\, (z(\mathfrak {k}){\mathbf{w}}_1) \wedge ({\mathbf{w}}')^{ (i-1)}(M) + {\mathbf{w}}^{(i)}(M) . \end{aligned}$$

Now let us look at the range of

$$\begin{aligned} g_{{\mathbf{r}}}(-l/2) g(1) {\mathbf{v}}(M) = g_{{\mathbf{r}}}(q-l/2) U(\varvec{{\varphi }}(s(1))) {\mathbf{v}}(M). \end{aligned}$$

It is easy to see that \(g_{{\mathbf{r}}}(-l/2){\mathbf{w}}_+ = b^{-l/2} {\mathbf{w}}_+\), \(\Vert g_{{\mathbf{r}}}(-l/2)z(\mathfrak {k}){\mathbf{w}}_1\Vert \le b^{r_1 l/2} \Vert z(\mathfrak {k}){\mathbf{w}}_1\Vert \),

$$\begin{aligned} \Vert g_{{\mathbf{r}}}(-l/2){\mathbf{w}}^{(i-1)}(M)\Vert&\le b^{l/2} \Vert {\mathbf{w}}^{(i-1)}(M)\Vert ,\\ \Vert g_{{\mathbf{r}}}(-l/2)({\mathbf{w}}')^{ (i-1)}(M) \Vert&\le b^{(1-r_1)l/2} \Vert ({\mathbf{w}}')^{ (i-1)}(M) \Vert , \end{aligned}$$

and

$$\begin{aligned} \Vert g_{{\mathbf{r}}}(-l/2) {\mathbf{w}}^{(i)}(M)\Vert \le b^{l/2} \Vert {\mathbf{w}}^{(i)}(M)\Vert . \end{aligned}$$

Since

$$\begin{aligned} g_{{\mathbf{r}}}(-l/2) g(1) {\mathbf{v}}(M)&= b^{-l/2} {\mathbf{w}}_+ \wedge (g_{{\mathbf{r}}}(-l/2){\mathbf{w}}^{(i-1)}(M)) \\&\quad + r R^{\left( 1+\frac{1}{2n}\right) l} b^{-l/2} {\mathbf{w}}_+ \wedge (g_{{\mathbf{r}}}(-l/2) ({\mathbf{w}}')^{ (i-1)}(M) ) \\&\quad + (g_{{\mathbf{r}}}(-l/2)z(\mathfrak {k}){\mathbf{w}}_1) \wedge (g_{{\mathbf{r}}}(-l/2)({\mathbf{w}}')^{ (i-1)}(M)) \\&\quad + g_{{\mathbf{r}}}(-l/2) {\mathbf{w}}^{(i)}(M), \end{aligned}$$

we have that

$$\begin{aligned} \Vert g_{{\mathbf{r}}}(-l/2) g(1) {\mathbf{v}}(M)\Vert&\le b^{-l/2} \Vert {\mathbf{w}}_+ \wedge (g_{{\mathbf{r}}}(-l/2){\mathbf{w}}^{(i-1)}(M))\Vert \\&\quad +\, R^{\left( 1+\frac{1}{2n}\right) l} b^{-l/2} \Vert {\mathbf{w}}_+ \wedge (g_{{\mathbf{r}}}(-l/2) ({\mathbf{w}}')^{ (i-1)}(M) )\Vert \\&\quad +\, \Vert g_{{\mathbf{r}}}(-l/2)z(\mathfrak {k}){\mathbf{w}}_1\Vert \cdot \Vert g_{{\mathbf{r}}}(-l/2)({\mathbf{w}}')^{ (i-1)}(M)\Vert \\&\quad +\, \Vert g_{{\mathbf{r}}}(-l/2) {\mathbf{w}}^{(i)}(M)\Vert \\&\le b^{-l/2} b^{l/2} \Vert {\mathbf{w}}^{(i-1)}(M)\Vert + R^{\left( 1+\frac{1}{2n}\right) l} b^{-l/2}b^{(1-r_1)l/2} \Vert ({\mathbf{w}}')^{ (i-1)}(M) \Vert \\&\quad +\, b^{r_1 l/2} \Vert z(\mathfrak {k}){\mathbf{w}}_1\Vert \cdot b^{(1-r_1)l/2} \Vert ({\mathbf{w}}')^{ (i-1)}(M) \Vert + b^{l/2}\Vert {\mathbf{w}}^{(i)}(M)\Vert \\&\le b^{-l/2} b^{l/2} \rho ^i + R^{\left( 1+\frac{1}{2n}\right) l} b^{-l/2}b^{(1-r_1)l/2} \rho ^i R^{-l} \\&\quad +\, b^{r_1 l/2} b^{(1-r_1)l/2} \rho ^i R^{-l} + b^{l/2} \rho ^i R^{-l/2} \\&\le \rho ^i + \rho ^i + \rho ^i R^{-l/2} + \rho ^i \le 1. \end{aligned}$$

For \(M = 1, \dots , L\), let \(\Lambda _i({\mathbf{v}}(M))\) denote the i-dimensional primitive sublattice of \({\mathbb {Z}}^{n+1}\) corresponding to \({\mathbf{v}}(M)\). We will apply Proposition 2.2 to estimate L. Thus, let us keep the notation used there. By the inequality above, we have that \(g_{{\mathbf{r}}}(-l/2)g(1)\Lambda _i({\mathbf{v}}(M)) \in {\mathcal {C}}_i(g_{{\mathbf{r}}}(-l/2)g(1){\mathbb {Z}}^{n+1}, 1)\) for every \(M = 1, \dots , L\). On the other hand, since \(s(1) \in I_q \in \hat{{\mathcal {I}}}_q\), we have that

$$\begin{aligned} g_{{\mathbf{r}}}(-l/2)g(1){\mathbb {Z}}^{n+1} = g_{{\mathbf{r}}}(q - l/2) U(\varvec{{\varphi }}(s(1))){\mathbb {Z}}^{n+1} \in K_{\kappa }. \end{aligned}$$

By Proposition 2.2, we have that

$$\begin{aligned} L \le \sharp {\mathcal {C}}_i(g_{{\mathbf{r}}}(-l/2)g(1){\mathbb {Z}}^{n+1}, 1) \le \kappa ^{-N} = R^{Nk}. \end{aligned}$$

Therefore, we have that

$$\begin{aligned} m(D_{q,p}(I_p, i) \cap J)&\le L R^{-q+ l} \le R^{-q + l + N k} \\&\le R^{-q + l + \frac{l}{100 n}} \le R^{- \frac{l}{20n}} R^{-q + \left( 1 + \frac{1}{2n}\right) l} = R^{- \frac{l}{20n}} m(J). \end{aligned}$$
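The exponent bookkeeping in the last two inequalities only uses the standing assumption \(l > 2000 n^2 N k\) of this subsection:

$$\begin{aligned} Nk< \frac{l}{2000 n^2} \le \frac{l}{100 n}, \qquad \frac{1}{100n} + \frac{1}{20 n} = \frac{6}{100 n} \le \frac{1}{2n}, \end{aligned}$$

so that \(-q + l + Nk \le -q + l + \frac{l}{100n}\) and \(-q + l + \frac{l}{100n} \le -q + \left( 1+ \frac{1}{2n}\right) l - \frac{l}{20n}\).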

This completes the proof. \(\square \)

Lemma 5.9 easily implies the following:

Corollary 5.10

Let us keep the notation as above. Then

$$\begin{aligned} m(D_{q, p} (I_p, i)) \ll R^{-\frac{l}{20n}} m(I_p). \end{aligned}$$

Proof

The statement follows from Lemma 5.9 by dividing \(I_p\) into subintervals of length \(R^{-q + (1 + \frac{1}{2n})l}\). \(\square \)

Now we are ready to prove Proposition 5.7.

Proof of Proposition 5.7

Let us fix \(I_p \in {\mathcal {I}}_p\). For every \(l/2 \le l' \le l\), let \(D_{q,l'}(I_p)\) denote the union of \((q, l')\)-dangerous intervals intersecting \(I_p\). By Proposition 4.1, we have that \(m(D_{q,l'}(I_p)) = O\left( R^{-\frac{l'}{10n}}\right) m(I_p)\). Therefore, we have that

$$\begin{aligned} m\left( \bigcup _{l/2 \le l' \le l} D_{q,l'}(I_p) \right)&\le \sum _{l/2 \le l' \le l} m( D_{q,l'}(I_p)) \\&\ll \sum _{l/2 \le l' \le l} R^{-\frac{l'}{10n}} m(I_p) \\&\ll R^{-\frac{l}{20n}} m(I_p). \end{aligned}$$

By Corollary 5.10, we have that

$$\begin{aligned} m \left( \bigcup _{i = 2}^{n} D_{q, p} (I_p, i)\right)&\le \sum _{i = 2}^{n} m(D_{q, p} (I_p, i)) \\&\ll \sum _{i=2}^{n} R^{-\frac{l}{20n}} m(I_p) \ll R^{-\frac{l}{20n}} m(I_p) \end{aligned}$$

By Lemma 5.8, we have that

$$\begin{aligned} I_q \subset \bigcup _{l/2 \le l' \le l} D_{q,l'}(I_p)\cup \bigcup _{i = 2}^{n} D_{q, p} (I_p, i) \end{aligned}$$

for any \(I_q \in \hat{{\mathcal {I}}}_{q,p}\). Therefore, we have that

$$\begin{aligned} F(\hat{{\mathcal {I}}}_{q,p}, I_p) R^{-q}&\le m\left( \bigcup _{l/2 \le l' \le l} D_{q,l'}(I_p)\cup \bigcup _{i = 2}^{n} D_{q, p} (I_p, i) \right) \\&\le m\left( \bigcup _{l/2 \le l' \le l} D_{q,l'}(I_p) \right) + m \left( \bigcup _{i = 2}^{n} D_{q, p} (I_p, i)\right) \\&\ll R^{-\frac{l}{20n}} m(I_p) = R^{-p - \frac{l}{20n}}. \end{aligned}$$

This proves that

$$\begin{aligned} F(\hat{{\mathcal {I}}}_{q,p}, I_p) \ll R^{q-p - \frac{l}{20n}}. \end{aligned}$$

\(\square \)

By Proposition 5.7, we have that

$$\begin{aligned}&\sum _{l = 2000 n^2 N k}^{2\eta ' q} \left( \frac{4}{R}\right) ^{2l} \max _{I_{q- 2l} \in {\mathcal {I}}_{q- 2l}} F(\hat{{\mathcal {I}}}_{q, q - 2l}, I_{q - 2l}) \nonumber \\&\quad \ll \sum _{l = 2000 n^2 N k}^{2\eta ' q} \left( \frac{4}{R}\right) ^{2l} R^{2l - \frac{l}{20n}} \end{aligned}$$
(5.2)
$$\begin{aligned}&\quad \le \sum _{l = 2000 n^2 N k}^{2\eta ' q} \left( \frac{16}{1000}\right) ^l \ll \left( \frac{16}{1000}\right) ^{2000 n^2 N k}. \end{aligned}$$
(5.3)

From this it is easy to see that

$$\begin{aligned} \sum _{l = 2000 n^2 N k}^{2\eta ' q} \left( \frac{4}{R}\right) ^{2l} \max _{I_{q- 2l} \in {\mathcal {I}}_{q- 2l}} F(\hat{{\mathcal {I}}}_{q, q - 2l}, I_{q - 2l}) \rightarrow 0 \end{aligned}$$
(5.4)

as \(k \rightarrow \infty \).

5.4 Extremely dangerous case.

In this subsection we will estimate \(F(\hat{{\mathcal {I}}}_{q,0}, I)\). We call this case the extremely dangerous case.

Proposition 5.11

There exists a constant \(\nu >0\) such that for any \(q > 10^6 n^4 N k\), we have that

$$\begin{aligned} F(\hat{{\mathcal {I}}}_{q,0}, I) \ll R^{(1-\nu ) q}. \end{aligned}$$

Similarly to Lemma 5.8, we have the following:

Lemma 5.12

For any \(i = 1, \dots , n\) and \(I_q \in \hat{{\mathcal {I}}}_{q,0}(i)\), one of the following two cases holds:

Case 1:

there exists a q-extremely dangerous interval \(\Delta _q({\mathbf{a}})\) such that \(I_q \subset \Delta _q({\mathbf{a}})\);

Case 2:

there exists \({\mathbf{v}} = {\mathbf{v}}_1 \wedge \cdots \wedge {\mathbf{v}}_i \in \bigwedge ^i {\mathbb {Z}}^{n+1} \setminus \{{\mathbf{0}}\}\) such that the following holds: for any \(s \in I_q\), if we write

$$\begin{aligned} g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s)){\mathbf{v}} = {\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)} + {\mathbf{w}}^{(i)} \end{aligned}$$

where \({\mathbf{w}}^{(i-1)} \in \bigwedge ^{i-1} W\) and \({\mathbf{w}}^{(i)} \in \bigwedge ^i W\), then \(\Vert {\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)}\Vert \le \rho ^i\) and \(\Vert {\mathbf{w}}^{(i)}\Vert \le \rho ^i R^{-\eta ' q}\).

Proof

The proof is the same as the proof of Lemma 5.8. In fact, the argument in the proof of Lemma 5.8 applies with \(l= 2\eta ' q\) and yields the statement. \(\square \)

Definition 5.13

For \( i = 2, \dots , n\), let \({\mathcal {D}}_q(i) \) denote the collection of \(I_q \in \hat{{\mathcal {I}}}_{q,0}(i)\) such that the second case in Lemma 5.12 holds and let

$$\begin{aligned} D_q(i) := \bigcup _{I_q \in {\mathcal {D}}_q(i)} I_q . \end{aligned}$$

Moreover, for \(I_q \in {\mathcal {D}}_q(i)\), let \({\mathbf{v}} ={\mathbf{v}}_1 \wedge \cdots \wedge {\mathbf{v}}_i \in \bigwedge ^i {\mathbb {Z}}^{n+1} \setminus \{{\mathbf{0}}\}\) be the vector given in the second case of Lemma 5.12. Then for \(s \in I_q\), we can write

$$\begin{aligned} g_{{\mathbf{r}}}(q)U(\varvec{{\varphi }}(s)){\mathbf{v}} = {\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)} + {\mathbf{w}}^{(i)} \end{aligned}$$

as in the second case of Lemma 5.12. For \(l \ge \eta ' q\), let \({\mathcal {D}}'_{q,l}(i)\) denote the collection of \(I_q \in {\mathcal {D}}_q(i)\) such that

$$\begin{aligned} \rho ^i R^{-(l+1)} \le \Vert {\mathbf{w}}^{(i)}\Vert \le \rho ^i R^{-l}, \end{aligned}$$

and let

$$\begin{aligned} D'_{q,l}(i) : = \bigcup _{I_q \in {\mathcal {D}}'_{q,l}(i)} I_q . \end{aligned}$$

Lemma 5.14

There exists a constant \(\nu >0\) such that for any \(q > 10^6 n^4 N k\) and any \(i= 2, \dots , n\), we have that

$$\begin{aligned} m(D_q(i)) \ll R^{- \nu q}. \end{aligned}$$

Proof

For any \( \eta ' q \le l \le 2\eta ' q \), using the same argument as in the proof of Lemma 5.9, we can prove that

$$\begin{aligned} m(D'_{q,l}(i)) \ll R^{-\frac{l}{20n}}. \end{aligned}$$

Therefore, we have that

$$\begin{aligned} m\left( \bigcup _{l = \eta ' q}^{2\eta ' q} D'_{q,l}(i) \right)&\le \sum _{l = \eta ' q}^{2\eta ' q} m(D'_{q,l}(i)) \\&\ll \sum _{l = \eta ' q}^{2\eta ' q} R^{-\frac{l}{20n}} \ll R^{-\frac{\eta ' q}{ 20 n}}. \end{aligned}$$

Let us denote

$$\begin{aligned} {\mathcal {D}}'_q(i) := \bigcup _{l > 2\eta ' q} {\mathcal {D}}'_{q,l}(i) \end{aligned}$$

and

$$\begin{aligned} D'_q(i) : = \bigcup _{I_q \in {\mathcal {D}}'_q(i)} I_q. \end{aligned}$$

Then it is enough to show that

$$\begin{aligned} m(D'_q(i)) \ll R^{-\nu q}. \end{aligned}$$

For any \(I_q \in {\mathcal {D}}'_q(i)\) and \(s \in I_q\), there exists \({\mathbf{v}} = {\mathbf{v}}_1 \wedge \cdots \wedge {\mathbf{v}}_i \in \bigwedge ^i {\mathbb {Z}}^{n+1} \setminus \{{\mathbf{0}}\}\) such that if we write

$$\begin{aligned} g_{{\mathbf{r}}}(q) U(\varvec{{\varphi }}(s)){\mathbf{v}} = {\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)} + {\mathbf{w}}^{(i)} \end{aligned}$$

where \({\mathbf{w}}^{(i-1)} \in \bigwedge ^{i-1} W\) and \({\mathbf{w}}^{(i)} \in \bigwedge ^i W\), then we have that \(\Vert {\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)}\Vert \le \rho ^i\) and \(\Vert {\mathbf{w}}^{(i)}\Vert \le \rho ^i R^{-2\eta ' q}\).

Recall that \(\eta = (1+r_1)\eta '\). Let us deal with the following two cases separately:

  (1) \(r_n \ge \frac{\eta }{n}\).

  (2) There exists \( 1 < n_1 \le n \) such that \(r_i \ge \frac{\eta }{n} \) for \(1 \le i < n_1\) and \(r_i < \frac{\eta }{n}\) for \( n_1 \le i \le n\).

Let us first deal with the first case. For this case, let us define

$$\begin{aligned} g^{\eta }(t) := \begin{bmatrix} b^{-\eta t}&~ \\ ~&b^{\eta t/n}\mathrm {I}_n \end{bmatrix} \in \mathrm {SL}(n+1, {\mathbb {R}}) \end{aligned}$$

and \(g_{{\mathbf{r}}, \eta }(t) := g^{\eta }(t) g_{{\mathbf{r}}}(t)\). Recalling that \(b = R^{\frac{1}{1+r_1}}\) and \(\eta = (1+r_1)\eta '\), so that \(b^{\eta t} = R^{\eta ' t}\), it is easy to see that

$$\begin{aligned} g^{\eta }(t) {\mathbf{w}}_+ = b^{-\eta t} {\mathbf{w}}_+ = R^{-\eta ' t} {\mathbf{w}}_+, \end{aligned}$$

and

$$\begin{aligned} g^{\eta }(t) {\mathbf{w}} = b^{\eta t/n} {\mathbf{w}} = R^{\eta ' t/n} {\mathbf{w}} \end{aligned}$$

for any \({\mathbf{w}} \in W\).

Then we have that

$$\begin{aligned} \Vert g_{{\mathbf{r}}, \eta }(q)U(\varvec{{\varphi }}(s)){\mathbf{v}}\Vert&= \Vert g^{\eta }(q)({\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)} + {\mathbf{w}}^{(i)})\Vert \\&\le \Vert g^{\eta }(q)({\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)})\Vert + \Vert g^{\eta }(q) {\mathbf{w}}^{(i)}\Vert \\&= b^{-\eta q \left( 1- \frac{i-1}{n}\right) } \Vert {\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)}\Vert + b^{\frac{\eta q i}{n}} \Vert {\mathbf{w}}^{(i)}\Vert \\&\le b^{-\frac{\eta q}{n}} \rho ^i + b^{\eta q} R^{-2\eta ' q} \rho ^i \le R^{-\frac{\eta ' q}{n}} \rho ^i. \end{aligned}$$

By the Minkowski Theorem, the above inequality implies that the lattice \(g_{{\mathbf{r}}, \eta }(q)U(\varvec{{\varphi }}(s)){\mathbb {Z}}^{n+1}\) contains a nonzero vector with norm \(\le R^{-\frac{\eta ' q}{n^2}} \rho \). Therefore, for any \(I_q \in {\mathcal {D}}'_q(i)\) we have that

$$\begin{aligned} g_{{\mathbf{r}}, \eta }(q)U(\varvec{{\varphi }}(I_q)){\mathbb {Z}}^{n+1} \not \in K_{\sigma } \end{aligned}$$

where \(\sigma = R^{-\frac{\eta ' q}{n^2}} \rho \). Then by Corollary 5.4, we have that

$$\begin{aligned} m\left( \{ s \in I: g_{{\mathbf{r}},\eta }(q) U(\varvec{{\varphi }}(s)){\mathbb {Z}}^{n+1} \not \in K_{\sigma } \} \right) \ll \sigma ^{\alpha } = R^{-\frac{\alpha \eta ' q}{n^2}}. \end{aligned}$$

This proves that

$$\begin{aligned} m(D'_q (i)) \ll R^{-\frac{\alpha \eta ' q}{n^2}} . \end{aligned}$$

This finishes the proof for the first case.

Now let us take care of the second case. Let us denote

$$\begin{aligned} \xi (t) := \begin{bmatrix} b^{-\beta t}&~&~&~&~&~&~ \\ ~&1&~&~&~&~&~ \\ ~&~&\ddots&~&~&~&~ \\ ~&~&~&1&~&~&~ \\ ~&~&~&~&b^{r_{n_1} t}&~&~ \\ ~&~&~&~&~&\ddots&~ \\ ~&~&~&~&~&~&b^{r_n t} \end{bmatrix} \in \mathrm {SL}(n+1, {\mathbb {R}}) \end{aligned}$$

where \(\beta = \sum _{j= n_1}^n r_j < \eta \) and

$$\begin{aligned} g' (t) := \xi (t) g_{{\mathbf{r}}}(t) = \begin{bmatrix} b^{\chi t}&~&~&~&~&~&~ \\ ~&b^{-r_1 t}&~&~&~&~&~ \\ ~&~&\ddots&~&~&~&~ \\ ~&~&~&b^{- r_{n_1 - 1} t}&~&~&~ \\ ~&~&~&~&1&~&~ \\ ~&~&~&~&~&\ddots&~ \\ ~&~&~&~&~&~&1 \end{bmatrix} \end{aligned}$$

where \(\chi = \sum _{j=1}^{n_1 - 1} r_j\). Then it is easy to see that

$$\begin{aligned} \xi (t) {\mathbf{w}}_+&= b^{-\beta t} {\mathbf{w}}_+ , \\ \xi (t) {\mathbf{w}}_j&= {\mathbf{w}}_j \end{aligned}$$

for \(j = 1, \dots , n_1 - 1\), and

$$\begin{aligned} \xi (t) {\mathbf{w}}_j = b^{r_j t} {\mathbf{w}}_j \end{aligned}$$

for \(j = n_1, \dots , n\). Then we have that

$$\begin{aligned} \Vert g'(q) U(\varvec{{\varphi }}(s)) {\mathbf{v}}\Vert&= \Vert \xi (q) ({\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)} + {\mathbf{w}}^{(i)}) \Vert \\&\le \Vert \xi (q)({\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)})\Vert + \Vert \xi (q) {\mathbf{w}}^{(i)}\Vert \\&\le \Vert {\mathbf{w}}_+ \wedge {\mathbf{w}}^{(i-1)} \Vert + b^{\beta q} \Vert {\mathbf{w}}^{(i)}\Vert \\&\le \rho ^i + b^{\beta q} R^{-2\eta ' q} \rho ^i \\&\le \rho ^i + b^{\eta q} R^{-2\eta ' q} \rho ^i \le \rho ^i + R^{-\eta ' q} \rho ^i < (2\rho )^i. \end{aligned}$$
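In the last two lines we used \(\beta < \eta \) together with the relation between b and R: since \(\eta = (1+r_1)\eta '\) and \(b = R^{\frac{1}{1+r_1}}\),

$$\begin{aligned} b^{\eta q} = R^{\frac{\eta q}{1+r_1}} = R^{\eta ' q}, \qquad \text {so} \qquad b^{\beta q} R^{-2\eta ' q} \le b^{\eta q} R^{-2\eta ' q} = R^{-\eta ' q}. \end{aligned}$$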

Moreover, for any \(s' \in \Delta (s):= [ s - R^{-q(1-2\eta ')} , s + R^{-q(1-2\eta ')}]\), we also have that

$$\begin{aligned} \Vert g'(q) U(\varvec{{\varphi }}(s')) {\mathbf{v}}\Vert < (2\rho )^i. \end{aligned}$$

Let \(C > 0\) and \(\alpha >0\) be the constants given in Theorem 5.1. Then by the Minkowski Theorem, the inequality above implies that for any \(s' \in \Delta (s)\), the lattice \(g'(q) U(\varvec{{\varphi }}(s')){\mathbb {Z}}^{n+1}\) contains a nonzero vector of length \(< 2\rho \). Let \({\mathbf{v}}_{s'} \in {\mathbb {Z}}^{n+1}\setminus \{{\mathbf{0}}\}\) be the vector such that \(\Vert g'(q)U(\varvec{{\varphi }}(s')){\mathbf{v}}_{s'}\Vert < 2 \rho \). Let us write

$$\begin{aligned} {\mathbf{v}}_{s'} = (v_{s'}(0), v_{s'}(1),\dots , v_{s'}(n)). \end{aligned}$$

Then for \(j = n_1 , \dots , n\), we have that \(|v_{s'}(j)| < 2\rho \). Since \(v_{s'}(j)\) is an integer and \(2\rho < 1\), it follows that \(v_{s'}(j) = 0\) for any \(j = n_1 , \dots , n\). In other words, \({\mathbf{v}}_{s'}\) is contained in the subspace spanned by \(\{{\mathbf{w}}_+ , {\mathbf{w}}_1, \dots , {\mathbf{w}}_{n_1 - 1}\}\). For notational simplicity, let us denote this subspace by \({\mathbb {R}}^{n_1}\) and denote the set of integer points contained in the subspace by \({\mathbb {Z}}^{n_1}\). Accordingly, let us denote by \(\mathrm {SL}(n_1, {\mathbb {R}})\) the subgroup

$$\begin{aligned} \left\{ \begin{bmatrix} X&~ \\ ~&\mathrm {I}_{n+1 - n_1} \end{bmatrix}: X \in \mathrm {SL}(n_1, {\mathbb {R}}) \right\} \subset \mathrm {SL}(n+1, {\mathbb {R}}) \end{aligned}$$

and denote by \(\mathrm {SL}(n_1, {\mathbb {Z}})\) the subgroup of integer points in \(\mathrm {SL}(n_1,{\mathbb {R}})\). Note that \(g'(q) \in \mathrm {SL}(n_1, {\mathbb {R}})\). \(U(\varvec{{\varphi }}(s'))\) can also be considered as an element in \(\mathrm {SL}(n_1,{\mathbb {R}})\) since it preserves \({\mathbb {R}}^{n_1}\). Then \(\Vert g'(q)U(\varvec{{\varphi }}(s')){\mathbf{v}}_{s'}\Vert < 2\rho \) implies that for any \(s' \in \Delta (s)\), the lattice \(g'(q)U(\varvec{{\varphi }}(s')){\mathbb {Z}}^{n_1}\) contains a nonzero vector of length \(< 2\rho \). Let \(K_{2\rho }(n_1) \subset X(n_1) = \mathrm {SL}(n_1, {\mathbb {R}})/\mathrm {SL}(n_1, {\mathbb {Z}})\) denote the set of unimodular lattices in \({\mathbb {R}}^{n_1}\) which do not contain any nonzero vector of length \(< 2 \rho \). Then the claim above implies that

$$\begin{aligned} m(\{s' \in \Delta (s): g'(q) U(\varvec{{\varphi }}(s')) {\mathbb {Z}}^{n_1} \not \in K_{2\rho }(n_1) \} ) = m(\Delta (s)). \end{aligned}$$

By Theorem 5.1, there exist \(j \in \{1, \dots , n_1 -1\}\) and \({\mathbf{v}}' = {\mathbf{v}}'_1 \wedge \cdots \wedge {\mathbf{v}}'_j \in \bigwedge ^j {\mathbb {Z}}^{n_1} \setminus \{{\mathbf{0}}\}\) such that

$$\begin{aligned} \max \{ \Vert g'(q)U(\varvec{{\varphi }}(s')){\mathbf{v}}'\Vert : s' \in [s- R^{-q(1-2\eta ')}, s+ R^{-q(1-2\eta ')}] \} < \rho _1^j \end{aligned}$$
(5.5)

since otherwise we would have that

$$\begin{aligned} m(\{s' \in \Delta (s): g'(q) U(\varvec{{\varphi }}(s')) {\mathbb {Z}}^{n_1} \not \in K_{2\rho }(n_1) \} ) \le C \left( \frac{2 \rho }{\rho _1}\right) ^{\alpha } m(\Delta (s)) < \frac{1}{1000} m(\Delta (s)). \end{aligned}$$

Now we have (5.5) in dimension \(n_1\) and every weight of \(g'(q)\) is at least \(\eta /n\). Then we can repeat the argument for the first case with \(n+1\) replaced by \(n_1\) to complete the proof. \(\square \)

Now we are ready to prove Proposition 5.11.

Proof of Proposition 5.11

Recall that in Proposition 4.2, we denote by \(E_q\) the union of all q-extremely dangerous intervals. By Lemma 5.12, for any \(I_q \in \hat{{\mathcal {I}}}_{q,0}\), we have that

$$\begin{aligned} I_q \subset E_q \cup \bigcup _{i=2}^n D_q(i). \end{aligned}$$

By Proposition 4.2 we have that

$$\begin{aligned} m(E_q) \ll R^{-\nu q} \end{aligned}$$

for some constant \(\nu >0\). On the other hand, by Lemma 5.14, we have that

$$\begin{aligned} m(D_q(i)) \ll R^{-\nu q} \end{aligned}$$

for any \(i = 2, \dots , n\). Therefore, we have that

$$\begin{aligned} F(\hat{{\mathcal {I}}}_{q,0}, I) R^{-q}&= m\left( \bigcup _{I_q \in \hat{{\mathcal {I}}}_{q,0}} I_q\right) \\&\le m\left( E_q \cup \bigcup _{i=2}^n D_q(i) \right) \le m(E_q) + \sum _{i=2}^n m(D_q(i)) \ll R^{-\nu q}. \end{aligned}$$

This completes the proof. \(\square \)

Now we are ready to prove Proposition 3.7 for \(q > 10^6 n^4 N k\).

Proof of Proposition 3.7 for \(q > 10^6 n^4 N k\). We can choose R such that, in addition, \(R^{\nu } > 1000\). By Proposition 5.11, we have that

$$\begin{aligned} \left( \frac{4}{R} \right) ^q F(\hat{{\mathcal {I}}}_{q,0}, I) \ll \left( \frac{4}{R} \right) ^q R^{(1-\nu ) q} = \left( \frac{4}{R^{\nu }} \right) ^q <\left( \frac{4}{1000} \right) ^q. \end{aligned}$$
(5.6)

Combining (5.1), (5.2) and (5.6), we have that

$$\begin{aligned} \sum _{p = 0}^{q-1} \left( \frac{4}{R}\right) ^{q-p} \max _{I_p \in {\mathcal {I}}_p} F(\hat{{\mathcal {I}}}_{q,p}, I_p) \rightarrow 0 \end{aligned}$$

as \(k \rightarrow \infty \). This proves the statement. \(\square \)

Remark 5.15

In [BHNS18], the Cantor winning property is introduced. It is equivalent to being Cantor rich over \({\mathbb {R}}\) and is also defined in higher dimensions.

Proof of Theorem 3.5

By Definition 3.2, Theorem 3.5 follows from Proposition 3.7.

\(\square \)

Theorems 3.3, 3.4 and 3.5 together imply Theorems 1.7 and 1.8.

5.5 General case.

Finally, let us explain how to adapt the proof for curves to handle general \(C^n\) non-degenerate submanifolds.

Let \(\varvec{{\varphi }}= \varvec{{\varphi }}(x_1, \dots , x_m): [0,1]^m \rightarrow {\mathbb {R}}^n\) be the \(C^n\) differentiable map defining \({\mathcal {U}}\), where \(m = \dim {\mathcal {U}}\). Then Definitions 3.1 and 3.2 will change according to the dimension. Intervals will be replaced by m-dimensional regular boxes. It is easy to see that higher dimensional versions of Theorems 3.3 and 3.4 still hold. Therefore, to prove Theorem 1.7 for higher dimensional manifolds, it suffices to prove higher dimensional versions of Proposition 3.7.

Following the argument for curves, we split the proof into four parts: the case where q is small, the generic case, the dangerous case and the extremely dangerous case. When q is small, we can repeat the same argument since Theorem 5.2 holds in any dimension. In the generic case, we can repeat the same argument since Theorem 5.1 holds in any dimension. In the dangerous case, we can consider \(\frac{\partial \varvec{{\varphi }}}{\partial x_j}\) for \(j = 1, \dots , m\) instead of \(\varvec{{\varphi }}'(x)\) to prove higher dimensional versions of Proposition 4.1 and Lemma 5.8. Then the argument goes through. In the extremely dangerous case, we can consider partial derivatives as in the dangerous case to prove a higher dimensional version of Lemma 5.12. Then we can repeat the same argument since higher dimensional versions of Proposition 4.2 and Theorem 5.2 still hold.

Combining the four cases above, we can deduce Theorem 1.7 for higher dimensional \(C^n\) non-degenerate submanifolds.