1 Introduction

Discrete Radon averaging operators are our focus. Let P be a polynomial of one variable mapping the integers to the integers and of degree d. Set an average and a fractional integral operator to be

$$\begin{aligned} A _{N} f(x)&:=\frac{1}{N}\sum _{k=1}^N f(x+P(k)) , \nonumber \\ I _{\lambda } f (x)&:= \sum _{k=1} ^{\infty } \frac{f (x+ P (k))}{k ^{\lambda }}, \qquad 0< \lambda < 1. \end{aligned}$$
(1.1)

Throughout, functions f can be assumed to be finitely supported. We write \(A\lesssim B\) if there exists an absolute constant C such that \(A\le CB\). If the constant depends on parameters \(\lambda ,\mu ,\ldots \) we denote that with a subscript, such as \(A\lesssim _{\lambda ,\mu ,\ldots }B\). The inequalities of interest are

$$\begin{aligned} \Vert A_N f\Vert _{\ell ^q(\mathbb {Z})}&\lesssim _{P,p,q} N^{-d(\frac{1}{p}-\frac{1}{q})} \Vert f\Vert _{\ell ^p(\mathbb {Z})}, \qquad N \in \mathbb N , \ 1<p< q < \infty . \end{aligned}$$
(1.2)
$$\begin{aligned} \Vert I _{\lambda } f\Vert _{\ell ^q(\mathbb {Z})}&\lesssim _{P,\lambda ,p,q} \Vert f\Vert _{\ell ^p(\mathbb {Z})}, \qquad 1< p< q < \infty . \end{aligned}$$
(1.3)

The inequality in (1.2) should hold uniformly in\(N \in {\mathbb {N}}\). Using the notation \(\lesssim _{P,p,q}\) we emphasize that the estimate has to be uniform in N and f, but the implicit constant is allowed to depend on P, p, and q. One should similarly interpret the notation \(\lesssim _{P,\lambda ,p,q}\) in (1.3). The exponent in (1.2), \(-d(1/p-1/q)\), is the best possible one, as is trivially seen by taking \(f={\mathbb {1}}_{\{1,2,3,\ldots ,2P(N)\}}\) and letting \(N\rightarrow \infty \).

Set \( {\mathcal {A}} (P)\) to be the set of all (pq) for which (1.2) holds. Set \( {\mathcal {I}} (P)\) to be the set of all \( (p,q, \lambda )\) for which (1.3) holds. These two classes are related through

Proposition 1.4

These two relations between \( {\mathcal {A}} (P)\) and \( {\mathcal {I}} (P)\) hold, where d is the degree of P.

  1. (1)

    If \( (p,q) \in {\mathcal {A}} (P)\), then \( (p,q, \lambda ) \in {\mathcal {I}} (P)\) for \( 0< 1-\lambda < \min \{1, d (1/p - 1/q) \} \).

  2. (2)

    If \( (p,q, 1 - d (1/p - 1/q) ) \in {\mathcal {I}} (P)\), then \( (p,q) \in {\mathcal {A}} (P)\).

We will define two closely related concepts in Sect. 7, and phrase some conjectures about them.

Concerning the collection \( {\mathcal {I}} (P)\), the relevant conjecture [18, pg. 597] is

Conjecture 1.5

The inequality (1.3) holds if and only if \( 1- \lambda \le d (1/p - 1/q)\), \( 1/q < \lambda \), and \( 1- \lambda < 1/p\).

Discrete fractional integrals were studied by Stein and Wainger [20, 21], with further contributions by Oberlin [17], and Ionescu and Wainger [9]. In particular, the case of quadratic \( P (x) = x ^2 \) is completely resolved. Our first main theorem provides a sharp, up to the endpoint, bound for most quadratic polynomials. Note that, when studying (1.2), interpolation with the trivial estimates for \(q=p\) allows us to additionally assume \(q=p'\) and \(p<2\).

Theorem 1.6

For a quadratic polynomial \(P(x)=ax^2+bx+c\) with non-negative integer coefficients and \( N\in \mathbb N \), the inequality (1.2) holds in the range

$$\begin{aligned} \left\{ (p,q) \,:\, {\tfrac{1}{q}\le \tfrac{1}{p}},\ {\tfrac{2}{q}>\tfrac{1}{p}},\ {\tfrac{1}{q}>\tfrac{2}{p}-1} \right\} . \end{aligned}$$

More precisely, for every \(3/2< p\le 2\) and \(N\in \mathbb N\) one has

$$\begin{aligned} \Vert A_{N}f\Vert _{\ell ^{p'}(\mathbb {Z})} \lesssim _p \left( 2a+\frac{b}{N}\right) (2aN+b)^{-2(\frac{1}{p} -\frac{1}{p'})}\Vert f\Vert _{\ell ^p(\mathbb {Z})}. \end{aligned}$$
(1.7)

It seems reasonable to conjecture that in (1.2) the bounds, provided they hold, depend upon the polynomial only through its degree d. Right now, we do not know that this is true even in the quadratic case.

Pierce [18, 19] also studied the fractional integrals. In particular [18] points to the relationship to the (at that point unresolved) Vinogradov Mean Value Conjecture. It reveals itself through the need for bounds on the exponential sums

$$\begin{aligned} S_N(t_1,t_2,\ldots ,t_d) := \frac{1}{N}\sum _{k=1}^{N} e^{2\pi i (k t_1 + k^2 t_2 + \cdots + k^d t_d)}; \quad t_1,t_2,\ldots ,t_d\in \mathbb {T}=\mathbb {R}/\mathbb {Z}. \end{aligned}$$
(1.8)

This theme was further elaborated on by Kim [15]. Using the work of Bourgain, Guth, and Demeter [1], we establish

Theorem 1.9

Let P be an arbitrary polynomial of degree \(d\ge 3\) mapping the set of integers back into itself. Averages (1.1) satisfy estimate (1.2) for exponents pq in the triangular range

$$\begin{aligned} \big \{ (p,q) \,:\, {\textstyle \frac{1}{q}\le \frac{1}{p}},\ {\textstyle \frac{d^2+d+1}{q}>\frac{d^2+d-1}{p}},\ {\textstyle \frac{d^2+d-1}{q}>\frac{d^2+d+1}{p}-2} \big \}. \end{aligned}$$
(1.10)

Specializing \(q=p'\) the range (1.10) reduces to

$$\begin{aligned} 2-{\textstyle \frac{2}{d^2+d+1}}<p\le 2. \end{aligned}$$
(1.11)

We will regard averages (1.1) as “projections” of the following higher-dimensional polynomial averages. Writing \( (x_1,x_2,\ldots ,x_d)\in \mathbb {Z}^d\), consider

$$\begin{aligned} {{\widetilde{A}}_{N}} f(x_1,x_2,\ldots ,x_d) := \frac{1}{N}\sum _{k=1}^{N} f(x_1+k,x_2+k^2,\ldots ,x_d+k^d) . \end{aligned}$$
(1.12)

This time we want to prove \(\ell ^p\)-improving estimates of the form

$$\begin{aligned} \Vert {{\widetilde{A}}_{N}} f\Vert _{\ell ^q(\mathbb {Z}^d)} \lesssim _{d,p,q} N^{-\frac{d(d+1)}{2}(\frac{1}{p}-\frac{1}{q})} \Vert f\Vert _{\ell ^p(\mathbb {Z}^d)}. \end{aligned}$$
(1.13)

The exponent \(-\frac{d(d+1)}{2}(\frac{1}{p}-\frac{1}{q})\) is the most improvement in N that one can expect, as in easily seen by taking f to be the indicator function of \(\{1,2,\ldots ,2N\}\times \cdots \times \{1,2,\ldots ,2N^d\}\).

Theorem 1.14

If \(d\ge 3\), then averages (1.12) satisfy estimate (1.13) for exponents pq in the same range (1.10), as in Theorem 1.9.

As we explain in Sect. 6, the bounds above are very close to optimal. It would be interesting to find the optimal open ranges of exponents p and q in Theorems 1.9 and 1.14.

Let us also remark that continuous-parameter results similar to Theorem 1.14 have already been present in the literature for a while. For example, Christ [2] has essentially settled all \(L ^p\)-improving properties of the convolution operator associated with the continuous moment curve \([-1,1]\rightarrow \mathbb {R}^d\), \(t\mapsto (t,t^2,\ldots ,t^d)\). Proofs of such results do not rely on number theory and sometimes not even on the Fourier analysis (as was the case with [2]), but rather on geometrical considerations and enumerative combinatorics.

Our proof of Theorem 1.6 is done by using the essentially sharp results about discrete fractional integrals due to Stein and Wainger. The argument can be reversed. We use our higher degree results on \(\ell ^{p}\)-improving to deduce results about discrete fractional integrals. To indicate the range of results that can be proved, we define

$$\begin{aligned} {\widetilde{I}}_{d, \lambda } f (x_1 ,\dotsc , x_d) := \sum _{k=1 } ^{\infty } k ^{- \lambda } f (x_1+k ,\dotsc , x_d+k^d), \qquad 0< \lambda <1 \end{aligned}$$

for finitely supported functions f. The following corollary extends works by Kim [15, Corollaries 2.2, 2.3], and is very nearly sharp.

Corollary 1.15

Suppose that the indices (pq) belong to the range (1.10).

  1. (1)

    Let P be an arbitrary polynomial of degree \(d\ge 3\) mapping the set of integers back into itself. We have the inequality (1.3) provided that \( 0< 1- \lambda < d (1/p - 1/q) \).

  2. (2)

    Take \(d\ge 3\). We have the inequality \( \Vert \widetilde{I} _{d, \lambda } f\Vert _{\ell ^{q}(\mathbb {Z}^d)} \lesssim _{d, \lambda , p, q } \Vert f\Vert _{\ell ^{p}(\mathbb {Z}^d)} \) provided that \( 0< 1- \lambda < (1/2)d(d+1) (1/p - 1/q) \).

The study of improving estimates for averages has been studied for decades in the Euclidean setting. It was recently recognized that some of these inequalities can be further extended to so-called sparse bounds. The latter imply the strongest known weighted estimates; see [3, 7, 16]. The study of the discrete variants has a much shorter history. Qualitative results were established in [4, 13] for discrete singular Radon transforms. On the other hand, discrete spherical averages admit a robust variant of their continuous analogs [8, 10,11,12, 14].

Improving inequalities and sparse bounds are closely related, but the connection is far more delicate in the discrete case. In particular, the sparse bounds proved in [5, 6] for averages along the prime and square integers, respectively, rely upon the Hardy–Littlewood Circle method. We do not know another way to prove those bounds; see the conjectures in Sect. 7.

2 Relating Averages to Fractional Integrals and Vice Versa

Proof of Proposition 1.4

Assume that \( (p,q) \in {\mathcal {A}} (P)\), that is (1.2) holds. For \( 0< 1-\lambda < \min \{1, d (1/p - 1/q) \} \) we have

$$\begin{aligned} \Vert I _{\lambda } f \Vert _{\ell ^q(\mathbb {Z})}&\lesssim _{\lambda } \sum _{j =1} ^{\infty } 2 ^{(1- \lambda ) j }\Vert A _{2 ^{j}} f \Vert _{\ell ^q(\mathbb {Z})} \\&\lesssim _{P,p,q} \Vert f\Vert _{\ell ^p(\mathbb {Z})} \sum _{j =1} ^{\infty } 2 ^{ ( 1- \lambda - d (1/p - 1/q) )j } \lesssim _{\lambda } \Vert f\Vert _{\ell ^p(\mathbb {Z})}. \end{aligned}$$

In the reverse direction, observe that \(A_N f \le N ^{-1+ \lambda } I _{\lambda } f\). So if \( (p,q, 1 - d (1/p - 1/q) ) \in {\mathcal {I}} (P)\), the result follows. \(\square \)

Proof of Corollary 1.15 assuming Theorems 1.9 and 1.14

The first part is a direct consequence of Theorem 1.9 and Proposition 1.4. For the second part we again write

$$\begin{aligned} \Vert \widetilde{I} _{d,\lambda } f \Vert _{\ell ^q(\mathbb {Z}^d)} \lesssim _{\lambda } \sum _{j =1}^{\infty } 2 ^{(1- \lambda ) j }\Vert \widetilde{A} _{2 ^{j}} f \Vert _{\ell ^q(\mathbb {Z}^d)} \end{aligned}$$

and then apply Theorem 1.14 in exactly the same way.

3 Quadratic Polynomials

Case\(d=2\), \(P(x)=x^2\). For this particular choice of the polynomial, Conjecture 1.5 has been verified by Stein and Wainger [20, 21]; also see [9, Cor 1.3]. Note that for \( 3/2< p < 2\), \( q=p'\), and \( 1- \lambda = 2 (1/p-1/q)\), we have \( (p, q, \lambda ) \in {\mathcal {I}} (x^2)\), so the result follows.

3.1 General Quadratic Polynomials

Let us turn to the proof of (1.7). In order to avoid confusion we will write the relevant polynomial P in the superscript of \(A_N^P\).

We define \(g:\mathbb Z\rightarrow \mathbb {R}\) by

$$\begin{aligned} g(4am)=f(m) \ \text {for all} \ m\in \mathbb Z \ \text {and} \ g(n)=0 \ \text {if}\ 4a\not \mid n. \end{aligned}$$

Since f is supported in \([-(aN^2+bN+\frac{b^2}{4a}),aN^2+bN+\frac{b^2}{4a}]\), we have that g is supported in \([-(2aN+b)^2,(2aN+b)^2]\). Therefore

$$\begin{aligned} A_{N}^{P}f(x)= & {} \frac{1}{N}\sum _{n\le N}f(x+an^2+bn+c)\\= & {} \frac{1}{N}\sum _{n\le N}g(4ax+4a^2n^2+4abn+4ac)\\= & {} \frac{1}{N}\sum _{n\le N}g(4a(x+c)-b^2+(2an+b)^2)\\\le & {} \frac{(2aN+b)}{N}\frac{1}{2aN+b}\sum _{k\le 2aN+b}g(4a(x+c)-b^2+k^2)\\= & {} \left( 2a+\frac{b}{N}\right) A_{2aN+b}^{x^2}g(4a(x+c)-b^2). \end{aligned}$$

Using this calculation and the previously established case of Theorem 1.6 we obtain

$$\begin{aligned} \Vert A_{N}^{P}f\Vert _{\ell ^{p'}(\mathbb {Z})}\le & {} \left( 2a+\frac{b}{N}\right) \Vert A_{2aN+b}^{x^2}g\Vert _{\ell ^{p'}(\mathbb {Z})}\\\le & {} \left( 2a+\frac{b}{N}\right) (2aN+b)^{2/p'-2/p}C_p\Vert g\Vert _{\ell ^{p} (\mathbb {Z})}\\\le & {} \left( 2a+\frac{b}{N}\right) (2aN+b)^{2/p'-2/p}C_p\Vert f\Vert _{\ell ^{p} (\mathbb {Z})}, \end{aligned}$$

for every \(3/2<p\le 2\).

4 Reduction to Vinogradov’s Mean Value Theorem

Proof of Theorem 1.14

Let \(\ell ^2(\mathbb {Z}^d)\rightarrow L ^2(\mathbb {T}^d)\), \(f\mapsto \widehat{f}\) denote the Fourier transform on the group \(\mathbb {Z}^d\). For finitely supported f this is simply the formation of the multiple Fourier series with coefficients f(m); \(m\in \mathbb {Z}^d\).

We apply the Hausdorff–Young inequality twice to reduce the problem to bounds for the exponential sums. To do so, we write

$$\begin{aligned}&\widetilde{A}_N f(x_1,x_2,\ldots ,x_d) \\&\quad = \frac{1}{N}\sum _{k=1}^{N} \int _{\mathbb {T}^d} \widehat{f}(t_1,t_2,\ldots ,t_d) e^{2\pi i ( (x_1+k)t_1 + (x_2+k^2)t_2 + \cdots + (x_d+k^d) t_d) } dt_1 dt_2 \cdots dt_d \\&\quad = \int _{\mathbb {T}^d} \widehat{f}(t_1,t_2,\ldots ,t_d) S_N(t_1,t_2,\ldots ,t_d) e^{2\pi i ( x_1 t_1 + x_2 t_2 + \cdots + x_d t_d)} dt_1 dt_2 \cdots dt_d, \end{aligned}$$

where \(S_N\) are the normalized exponential sums given by (1.8). We recognize \(\widetilde{A}_N f\) as the Fourier transform of the function \(\widehat{f} \cdot S_N\) on the group \(\mathbb {T}^d\). Applying the Hausdorff–Young inequality on \(\mathbb {T}^d\), then Hölder’s inequality on \(\mathbb {T}^d\), and finally the Hausdorff–Young inequality on \(\mathbb {Z}^d\), we get

$$\begin{aligned} \Vert \widetilde{A}_N f\Vert _{\ell ^{p'}(\mathbb {Z}^d)} \le \Vert \widehat{f} \cdot S_N\Vert _{L ^{p}(\mathbb {T}^d)} \le \Vert \widehat{f}\Vert _{L ^{p'}(\mathbb {T}^d)} \Vert S_N\Vert _{L ^{s}(\mathbb {T}^d)} \le \Vert f\Vert _{\ell ^{p}(\mathbb {Z}^d)} \Vert S_N\Vert _{L ^{s}(\mathbb {T}^d)}, \end{aligned}$$

where \(1/s=1/p-1/p'=2/p-1\). Thus, the \(\ell ^p\)-improving inequality depends on the \(L ^{s}\)-norm of the sums \(S_N\). Vinogradov’s mean value theorem, as established by Bourgain, Demeter, and Guth [1], claims precisely the bound

$$\begin{aligned} \Vert S_N\Vert _{L ^s(\mathbb {T}^d)} \lesssim _{d,s,\varepsilon } N^{-\frac{d(d+1)}{2s}+\varepsilon } \end{aligned}$$

for any \(s>d(d+1)\) and for any fixed \(\varepsilon >0\). Note that in the typical formulation of Vinogradov’s mean value theorem number s needs to be an even integer, but the analytic proof from [1] does not require that. Moreover, for \(d\ge 3\) one can even remove the \(\varepsilon \) by performing the Hardy–Littlewood circle method, as in [22, Sect. 7] or [1, Sect. 5]. Therefore, we actually have

$$\begin{aligned} \Vert S_N\Vert _{L ^s(\mathbb {T}^d)} \lesssim _{d,s} N^{-\frac{d(d+1)}{2s}}. \end{aligned}$$
(4.1)

Combining (4.1) with the previous computation we get exactly the \(\ell ^p\)-improving estimate (1.13) for \(2-2/(d^2+d+1)<p\le 2\) and \(q=p'\). Interpolation with the trivial estimates for \(q=p\) settles the whole claimed range of (pq). \(\square \)

5 Projection of Higher-Dimensional Averages to One-Dimensional Ones

Proof of Theorem 1.9

Take an arbitrary polynomial function \(P:\mathbb {R}\rightarrow \mathbb {R}\) of degree \(d\ge 3\) mapping \(\mathbb {Z}\) back to \(\mathbb {Z}\). Let us write it as

$$\begin{aligned} P(x) = a_0 + a_1 x + a_2 x^2 + \cdots + a_{d-1} x^{d-1} + a_d x^d, \end{aligned}$$

where \(a_0,a_1,\ldots ,a_d\in \mathbb {R}\) and, without loss of generality, \(a_d>0\). By solving the Vandermonde linear system in the coefficients of P the conditions \(P(0),P(1),\ldots ,P(d)\in \mathbb {N}\) imply that \(a_0,a_1,\ldots ,a_d\) are rational numbers. Moreover, let \(v\in \mathbb {N}\) be the least common denominator of \(a_1,\ldots ,a_d\), so that we can write \(a_j=b_j u/v\) for \(j=1,\ldots ,d\), where \(u\in \mathbb {N}\) and \(b_1,\ldots ,b_d\in \mathbb {Z}\) do not have a common multiple greater than 1. By the formula for the Vandermonde determinant we also know that v divides \(\prod _{0\le i<j\le d}(j-i)\), so it has an upper bound depending only on d. Finally, since the free coefficient of P simply translates the averages (1.1), it is safe to assume that \(a_0=0\).

For any given function \(g:\mathbb {Z}\rightarrow \mathbb {C}\) and a fixed number \(r\in \{0,1,\ldots ,u-1\}\) define \(f:\mathbb {Z}^d\rightarrow \mathbb {C}\) by

$$\begin{aligned} f(x_1,x_2,\ldots ,x_{d-1},x_d)&= {\mathbb {1}}_{\{1,2,\ldots ,2N\}}(x_1) {\mathbb {1}}_{\{1,2,\ldots ,2N^2\}}(x_2) \cdots {\mathbb {1}}_{\{1,2,\ldots ,2N^{d-1}\}}(x_{d-1}) \\&\quad g(a_1 x_1 + a_2 x_2 + \cdots + a_{d-1} x_{d-1} + a_d x_d + r) \\&\quad {\mathbb {1}}_{\mathbb {Z}}(a_1 x_1 + a_2 x_2 + \cdots + a_{d-1} x_{d-1} + a_d x_d). \end{aligned}$$

This way

$$\begin{aligned} \Vert f\Vert _{\ell ^p(\mathbb {Z}^d)} \lesssim _{d,p} N^{\frac{(d-1)d}{2p}} \Vert g\Vert _{\ell ^p(\mathbb {Z})}. \end{aligned}$$
(5.1)

For any d-tuple

$$\begin{aligned}&(x_1,x_2,\ldots ,x_{d-1},x_d) \in \{1,2,\ldots ,N\}\times \{1,2,\ldots ,N^2\}\times \cdots \\&\quad \times \{1,2,\ldots ,N^{d-1}\}\times \mathbb {Z} \end{aligned}$$

satisfying \(a_1 x_1 + a_2 x_2 + \cdots + a_d x_d\in \mathbb {Z}\) we have

$$\begin{aligned} \widetilde{A}_N f(x_1,x_2,\ldots ,x_d) = A_N g(a_1 x_1 + a_2 x_2 + \cdots + a_d x_d + r). \end{aligned}$$

From the theory of linear Diophantine equations we know that \(b_1\mathbb {Z} + b_2\mathbb {Z} + \cdots + b_d\mathbb {Z} = \mathbb {Z}\). Hence, for sufficiently large \(N\in \mathbb {N}\),

$$\begin{aligned} \Vert \widetilde{A}_N f\Vert _{\ell ^q (\mathbb {Z}^d)}^q&\ge \sum _{\begin{array}{c} x_1\in v\mathbb {Z}\\ 1\le x_1\le N \end{array}} \cdots \sum _{\begin{array}{c} x_{d-1}\in v\mathbb {Z}\\ 1\le x_{d-1}\le N^{d-1} \end{array}} \sum _{x_d\in v\mathbb {Z}} \big |(A_N g)(a_1x_1+\cdots +a_d x_d+r)\big |^q \nonumber \\&\gtrsim \;_ P N^{\frac{(d-1)d}{2}} \sum _{n\in r+u\mathbb {Z}} |(A_N g)(n)|^q. \end{aligned}$$
(5.2)

Applying (1.13) and combining it with (5.1) and (5.2) we obtain

$$\begin{aligned} N^{\frac{(d-1)d}{2q}} \Big (\sum _{n\in r+u\mathbb {Z}}|(A_N g)(n)|^q\Big )^{1/q} \lesssim _{P,p,q} N^{-\frac{d(d+1)}{2}(\frac{1}{p}-\frac{1}{q})} N^{\frac{(d-1)d}{2p}} \Vert g\Vert _{\ell ^p(\mathbb {Z})}. \end{aligned}$$

Finally, summing in \(r=0,1,\ldots ,u-1\) gives estimate (1.2) for the function g. \(\square \)

6 Examples

We formulate the examples that show certain sharpness in the \(\ell ^{p}\)-improving inequalities. These are essentially known, and we include them for completeness.

Counterexamples similar to the ones in [5] show that (1.2) cannot hold outside the range

$$\begin{aligned} \left\{ (p,q) \,:\, {\tfrac{1}{q}\le \tfrac{1}{p}},\ {\tfrac{d}{q}\ge \tfrac{d-1}{p}},\ {\tfrac{d-1}{q}\ge \tfrac{d}{p}-1} \right\} . \end{aligned}$$
(6.1)

Indeed, by taking \(f={\mathbb {1}}_{\{P(1),P(2),\ldots ,P(N)\}}\) we conclude

$$\begin{aligned} \frac{\Vert A_N f\Vert _{\ell ^q(\mathbb {Z})}}{\Vert f\Vert _{\ell ^p(\mathbb {Z})}} \ge \frac{|(A_N f)(0)|}{\Vert f\Vert _{\ell ^p(\mathbb {Z})}} = \frac{1}{N^{1/p}}, \end{aligned}$$

so that \(d/q\ge (d-1)/p\). Similarly, by taking \(f={\mathbb {1}}_{\{0\}}\) we get

$$\begin{aligned} \frac{\Vert A_N f\Vert _{\ell ^q(\mathbb {Z})}}{\Vert f\Vert _{\ell ^p(\mathbb {Z})}} \ge \frac{\big (\sum _{m=1}^{N}|(A_N f)(-P(m))|^q\big )^{1/q}}{\Vert f\Vert _{\ell ^p(\mathbb {Z})}} = \frac{N^{1/q-1}}{1}, \end{aligned}$$

which forces \((d-1)/q\ge d/p-1\). If one only cares about the case \(q=p'\), then (6.1) is simply the range

$$\begin{aligned} 2-{\textstyle \frac{1}{d}}\le p\le 2. \end{aligned}$$
(6.2)

Theorem 1.9 leaves a gap between the ranges (1.11) and (6.2) for large d.

Next, it is easy to see that (1.13) cannot hold outside the range

$$\begin{aligned} \big \{ (p,q) \,:\, {\textstyle \frac{1}{q}\le \frac{1}{p}},\ {\textstyle \frac{d^2+d}{q}\ge \frac{d^2+d-2}{p}},\ {\textstyle \frac{d^2+d-2}{q}\ge \frac{d^2+d}{p}-2} \big \}. \end{aligned}$$

Indeed, by taking

$$\begin{aligned} f = {\mathbb {1}}_{\{(1^1,1^2,\ldots ,1^d), (2^1,2^2,\ldots ,2^d), \ldots , (N^1,N^2,\ldots ,N^d)\}} \end{aligned}$$

we conclude

$$\begin{aligned} N^{-\frac{d(d+1)}{2}(\frac{1}{p}-\frac{1}{q})} \gtrsim \,_ {d,p,q} \frac{\Vert \widetilde{A}_N f\Vert _{\ell ^q(\mathbb {Z}^d)}}{\Vert f\Vert _{\ell ^p(\mathbb {Z}^d)}} \ge \frac{|(\widetilde{A}_N f)(0,0,\ldots ,0)|}{\Vert f\Vert _{\ell ^p(\mathbb {Z}^d)}} = \frac{1}{N^{1/p}}, \end{aligned}$$

so multiplying by \(N^{1/p}\) and letting \(N\rightarrow \infty \) give \((d^2+d)/q\ge (d^2+d-2)/p\). Similarly, by taking \(f={\mathbb {1}}_{\{(0,0,\ldots ,0)\}}\) we get

$$\begin{aligned} N^{-\frac{d(d+1)}{2}(\frac{1}{p}-\frac{1}{q})}&\gtrsim \,_ {d,p,q} \frac{\Vert \widetilde{A}_N f\Vert _{\ell ^q(\mathbb {Z}^d)}}{\Vert f\Vert _{\ell ^p(\mathbb {Z}^d)}} \\&\ge \frac{\big ( \sum _{m=1}^{N} |(\widetilde{A}_N f)(-m^1,-m^2,\ldots ,-m^d)|^q \big )^{1/q}}{\Vert f\Vert _{\ell ^p(\mathbb {Z})}} = \frac{N^{1/q-1}}{1}. \end{aligned}$$

This forces \((d^2+d-2)/q\ge (d^2+d)/p-2\). If one only cares about the cases \(q=p'\), then we are talking about the range

$$\begin{aligned} 2-{\textstyle \frac{2}{d^2+d}}\le p\le 2. \end{aligned}$$

Comparing it to (1.11) we see that Theorem 1.14 is “asymptotically optimal” as \(d\rightarrow \infty \).

7 Conjectured Sparse Bounds

Recall that a collection of intervals \( {\mathcal {S}}\) is said to be sparse if for each interval I there is a subset \( E_I\) so that \( |E_I|> |I|/4 \) and \( \{E_I : I\in {\mathcal {S}}\}\) are pairwise disjoint. For \( 0\le \lambda < 1\), and a sparse collection \( {\mathcal {S}}\), set

$$\begin{aligned} \Lambda _{p,q,\lambda } (f,g)&:= \sum _{I\in {\mathcal {S}}} \langle f \rangle _{p, I} \langle g \rangle _{I,q} |I|^{1- \lambda } , \\ \text {where} \quad \langle \phi \rangle _{I,r}&:= \Bigl [ |I|^{-1} \sum _{n \in I\cap \mathbb Z } |\phi (n)|^{r} \Bigr ] ^{1/r}. \end{aligned}$$

Concerning the maximal function \( A _{*} f = \sup _N A_N f \), where \( A_N\) is defined as in (1.1), the main conjecture would be that if \( (p,q) \in {\mathcal {A}} (P)\), that is when (1.2) holds, one has

$$\begin{aligned} \langle A _{*} f,g \rangle \lesssim \sup _{{\mathcal {S}}} \Lambda _{p,q',0} (f,g). \end{aligned}$$

Notice that we are using the conjugate index \( q' = q/(q-1)\) above. In fact, the main result of [5] is that for the quadratic polynomial \( P (x) = x ^2 \), this is true, except possibly at the boundary of \( {\mathcal {A}} (P)\). Nothing close to this is known for any other polynomial, as far as we know.

Turning to the fractional integral operator, the main conjecture would be that if \( (p,q, \lambda ) \in {\mathcal {I}} (P)\), that is when (1.3) holds, one has

$$\begin{aligned} \langle I _{\lambda } f,g \rangle \lesssim \sup _{{\mathcal {S}}} \Lambda _{p,q', \lambda } (f,g). \end{aligned}$$

No such bound is known, even in the quadratic case.

The interest in sparse bounds comes in part as they immediately imply a range of weighted inequalities and vector valued inequalities. The main results of [5, 6] concern sparse bounds for averages over the square integers and the primes. These results seem to be much more difficult than the improving or fractional integral inequalities.