1 Introduction

The Ramsey number r(F; q) of a k-uniform hypergraph F is the smallest natural number n such that every q-coloring of the edges of K n (k), the complete k-uniform hypergraph on n vertices, contains a monochromatic copy of F. In the particular case when q = 2, we simply write r(F). The existence of r(F; q) was established by Ramsey in his foundational paper [17] and there is now a large body of work studying the Ramsey numbers of graphs and hypergraphs. For a recent survey, we refer the interested reader to [5].

In this paper, we will be concerned with a well-known refinement of Ramsey’s theorem, the induced Ramsey theorem. We say that a k-uniform hypergraph F is an induced subgraph of another k-uniform hypergraph G if V (F) ⊂ V (G) and any k vertices in F form an edge if and only if they also form an edge in G. The induced Ramsey number r ind(F; q) of a k-uniform hypergraph F is then the smallest natural number n for which there exists a k-uniform hypergraph G on n vertices such that every q-coloring of the edges of G contains an induced monochromatic copy of F. Again, in the particular case when q = 2, we simply write r ind(F).

For graphs, the existence of induced Ramsey numbers was established independently by Deuber [6], Erdős, Hajnal, and Pósa [9], and Rödl [18], while for k-uniform hypergraphs with k ≥ 3 their existence was shown independently by Nešetřil and Rödl [16] and Abramson and Harrington [1]. The bounds that these original proofs gave on r ind(F; q) were enormous. However, at that time it was noted by Rödl (unpublished) that for bipartite graphs F the induced Ramsey numbers are exponential in the number of vertices. Moreover, it was conjectured by Erdős [7] that there exists a constant c such that every graph F with t vertices satisfies r ind(F) ≤ 2ct. If true, the complete graph would show that this is best possible up to the constant c. A result of Conlon, Fox, and Sudakov [3], building on earlier work by Kohayakawa, Prömel, and Rödl [13], comes close to establishing this conjecture, showing that

$$\displaystyle{r_{\mathrm{ind}}(F) \leq 2^{ct\log t}.}$$

However, the method used to prove this estimate only works in the 2-color case. For q ≥ 3, the best known bound, due to Fox and Sudakov [11], is \(r_{\mathrm{ind}}(F;q) \leq 2^{ct^{3} }\), where c depends only on q.

In this note, we study the analogous question for hypergraphs, showing that the induced Ramsey number is never significantly larger than the usual Ramsey number. Our main result is the following.

Theorem 1

Let F be a k-uniform hypergraph with t vertices and ℓ edges. Then there are positive constants c 1, c 2, and c 3 such that

$$\displaystyle{r_{\mathrm{ind}}(F;q) \leq 2^{c_{1}k\ell^{3}\log (qt\ell) }R^{c_{2}k\ell^{2}+c_{ 3}t\ell},}$$

where R = r(F; q) is the classical q-color Ramsey number of F.

Define the tower function t k (x) by t 1(x) = x and, for i ≥ 1, \(t_{i+1}(x) = 2^{t_{i}(x)}\). A seminal result of Erdős and Rado [8] says that

$$\displaystyle{r(K_{t}^{(k)};q) \leq t_{ k}(ct),}$$

where c depends only on k and q. This yields the following immediate corollary of Theorem 1.

Corollary 1

For any natural numbers k ≥ 3 and q ≥ 2, there exists a constant c such that if F is a k-uniform hypergraph with t vertices, then

$$\displaystyle{r_{\mathrm{ind}}(F;q) \leq t_{k}(ct).}$$

A result of Erdős and Hajnal (see, for example, Chapter 4.7 in [12] and [4]) says that

$$\displaystyle{r(K_{t}^{(k)};4) \geq t_{ k}(c^{{\prime}}t),}$$

where c depends only on k. Therefore, the Erdős–Rado bound is sharp up to the constant c for q ≥ 4. By taking F = K t (k), this also implies that Corollary 1 is tight up to the constant c for q ≥ 4. Whether it is also sharp for q = 2 and 3 depends on whether r(K t (k)) ≥ t k (c t), though determining if this is the case is a famous, and seemingly difficult, open problem.

The proof of Theorem 1 relies on an application of the hypergraph container method of Saxton and Thomason [20] and Balogh, Morris, and Samotij [2]. In Ramsey theory, the use of this method was pioneered by Nenadov and Steger [14] and developed further by Rödl, Ruciński, and Schacht [19] in order to give an exponential-type upper bound for Folkman numbers. Our modest results are simply another manifestation of the power of this beautiful method.

2 Proof of Theorem 1

In order to state the container theorem we first need some definitions. Recall that the degree d(σ) of a set of vertices σ in a hypergraph H is the number of edges of H containing σ, while the average degree is the average of d(v): = d({v}) over all vertices v.

Definition 2

Let H be an -uniform hypergraph of order N with average degree d. Let τ > 0. Given vV (H) and 2 ≤ j, let

$$\displaystyle{ d^{(\,j)}(v) =\max \big\{ d(\sigma )\::\: v \in \sigma \subset V (H),\vert \sigma \vert = j\big\}. }$$

If d > 0, define δ j by the equation

$$\displaystyle{ \delta _{j}\tau ^{j-1}Nd =\sum _{ v}d^{(\,j)}(v). }$$

The codegree function δ(H, τ) is then defined by

$$\displaystyle{ \delta (H,\tau ) = 2^{\binom{\ell}{2}-1}\sum _{ j=2}^{\ell}2^{-\binom{j-1}{2}}\delta _{ j}. }$$

If d = 0, define δ(H, τ) = 0.

The precise lemma we will need is a slight variant of Corollary 3.6 from Saxton and Thomason’s paper [20]. A similar version was already used in the work of Rödl, Ruciński, and Schacht [19] and we refer the interested reader to that paper for a thorough discussion.

Lemma 3

Let H be an ℓ-uniform hypergraph on N vertices with average degree d. Let 0 < ɛ < 1∕2. Suppose that τ satisfies δ(H, τ) ≤ ɛ∕12! and τ ≤ 1∕144! 2 ℓ. Then there exists a collection \(\mathcal{C}\) of subsets of V (H) such that

  1. (i)

    for every set IV (H) such that e(H[I]) ≤ ɛτ e(H), there is \(C \in \mathcal{C}\) with IC,

  2. (ii)

    e(H[C]) ≤ ɛe(H) for all \(C \in \mathcal{C}\) ,

  3. (iii)

    \(\log \vert \mathcal{C}\vert \leq 1000\ell!^{3}\ell\log (1/\varepsilon )N\tau \log (1/\tau )\) .

Before we give the proof of Theorem 1, we first describe the -uniform hypergraph H to which we will apply Lemma 3.

Construction 4

Given a k-uniform hypergraph F with edges, we construct an auxiliary hypergraph H by taking

$$\displaystyle{V (H) = \binom{[n]}{k}\qquad \text{and}\qquad E(H) =\Bigg\{ E \in \binom{V (H)}{\ell}: E\cong F\Bigg\}.}$$

In other words, the vertices of H are the k-tuples of [n] and the edges of H are copies of F in \(\binom{[n]}{k}\).

Proof of Theorem 1

Recall that R = r(F; q), the q-color Ramsey number of F, and suppose that F has t vertices and edges. Let us fix the following numbers:

$$\displaystyle{ \begin{array}{ll} \tau = n^{-\frac{1} {2\ell} }, &p = 1000R^{k}q\alpha,\qquad \alpha = n^{-\frac{1} {2\ell} + \frac{1} {4\ell(\ell+1)} }, \\ \varepsilon = 1/(2qR^{t}),&n =\ell ^{40\ell^{2}(\ell+1) }(1000q)^{8\ell(\ell+1)}R^{4k\ell(\ell+1)+4t\ell}\binom{t}{k}^{4\ell}. \end{array} }$$
(1)

Remark 5

Note that n is bounded above by an expression of the form

$$\displaystyle{2^{c_{1}k\ell^{3}\log (qt\ell) }R^{c_{2}k\ell^{2}+c_{ 3}t\ell},}$$

as required.

Obviously, Rt and one can check that p and n satisfy the following conditions, which we will make use of during the course of the proof:

$$\displaystyle\begin{array}{rcl} p \leq 1,& &{}\end{array}$$
(2)
$$\displaystyle\begin{array}{rcl} n \geq (24 \cdot 2^{\binom{\ell}{2}}t^{t}q\ell!R^{t})^{2},& &{}\end{array}$$
(3)
$$\displaystyle\begin{array}{rcl} n> (144\ell!^{2}\ell)^{2\ell},& &{}\end{array}$$
(4)
$$\displaystyle\begin{array}{rcl} n>\ell ^{40\ell^{2}(\ell+1) },& &{}\end{array}$$
(5)
$$\displaystyle\begin{array}{rcl} n> (1000q)^{8\ell(\ell+1)}R^{4k\ell(\ell+1)+4t\ell}\binom{t}{k}^{4\ell}.& &{}\end{array}$$
(6)

We will show that, with positive probability, a random hypergraph \(G \in \mathbb{G}^{(k)}(n,p)\) has the property that every q-coloring of its edges contains an induced monochromatic copy of F. The proof proceeds in two stages. First, we use Lemma 3 to show that, with probability 1 − o(1), G has the property that any q-coloring of its edges yields many monochromatic copies of F. Then we show that some of these monochromatic copies must be induced.

More formally, let X be the event that there is a q-coloring of the edges of G which contains at most

$$\displaystyle{ M:= \frac{\varepsilon \tau ^{\ell}(n)_{t}} {\mathop{\mathrm{aut}}\nolimits (F)} }$$

monochromatic copies of F in each color, and let Y be the event that G contains at least M noninduced copies of F. Note that if \(\overline{X} \cap \overline{Y }\) happens, then, in any q-coloring, there are more monochromatic copies of F in one of the q colors than there are noninduced copies of F in G. Hence, that color class must contain an induced copy of F.

We now proceed to show that the probability P(X) tends to zero as n tends to infinity. In order to apply Lemma 3, we need to check that τ and ɛ satisfy the requisite assumptions with respect to the -uniform hypergraph H defined in Construction 4. Let σV (H) be arbitrary and define

$$\displaystyle{V _{\sigma } =\bigcup _{v\in \sigma }v \subset [n].}$$

For an arbitrary set W ⊂ [n]∖V σ with | W | = t − | V σ |, let emb  F (σ, W) denote the number of copies \(\widetilde{F}\) of F with \(V (\widetilde{F}) = W \cup V _{\sigma }\) and \(\sigma \subset E(\widetilde{F})\). Observe that this number does not actually depend on the choice of W, so we will simply use emb  F (σ) from now on.

Since there are clearly \(\binom{n -\vert V _{\sigma }\vert }{t -\vert V _{\sigma }\vert }\) choices for the set W, we arrive at the following claim.

Claim 1

For any σV (H),

$$\displaystyle{d(\sigma ) = \binom{n -\vert V _{\sigma }\vert }{t -\vert V _{\sigma }\vert }\mathop{\mathrm{emb}}\nolimits _{F}(\sigma ).}$$

Let us denote by t j the minimum number of vertices of F which span j edges. From Claim 1, it follows that for any σV (H) with | σ | = j, we have

$$\displaystyle{d(\sigma ) = \binom{n -\vert V _{\sigma }\vert }{t -\vert V _{\sigma }\vert }\mathop{\mathrm{emb}}\nolimits _{F}(\sigma ) \leq \binom{n - t_{j}}{t - t_{j}}\mathop{ \mathrm{emb}}\nolimits _{F}(\sigma ).}$$

On the other hand, for a singleton σ 1V (H), we have \(\vert V _{\sigma _{1}}\vert = k\) and therefore d = d(σ 1) is such that

$$\displaystyle{\frac{d(\sigma )} {d} \leq \frac{\binom{n - t_{j}}{t - t_{j}}} {\binom{n - k}{t - k}} \frac{\mathop{\mathrm{emb}}\nolimits _{F}(\sigma )} {\mathop{\mathrm{emb}}\nolimits _{F}(\sigma _{1})} \leq \frac{\binom{n - t_{j}}{t - t_{j}}} {\binom{n - k}{t - k}} <\left (\frac{n} {t} \right )^{k-t_{j} }.}$$

It then follows from Definition 2 and (1) that

$$\displaystyle{ \delta _{j} <\frac{(n/t)^{k-t_{j}}} {\tau ^{j-1}} <t^{t}n^{k-t_{j}+(\,j-1)/(2\ell)}. }$$
(7)

Since t j is increasing with respect to j, t 2k + 1, and j, we have \(k - t_{j} + \frac{j-1} {2\ell} \leq -1/2\). Thus, in view of (7), we have

$$\displaystyle{ \delta _{j} <t^{t}n^{k-t_{j}+(\,j-1)/(2\ell)} \leq t^{t}n^{-1/2} }$$
(8)

for all 2 ≤ j.

Using Definition 2 and inequality (8), we can now bound the codegree function δ(H, τ) by

$$\displaystyle{ \delta (H,\tau ) = 2^{\binom{\ell}{2}-1}\sum _{ j=2}^{\ell}2^{-\binom{j-1}{2}}\delta _{ j} \leq 2^{\binom{\ell}{2}-1}t^{t}n^{-1/2}\sum _{ j=2}^{\ell}2^{-\binom{j-1}{2}} \leq 2^{\binom{\ell}{2}}t^{t}n^{-1/2}. }$$
(9)

Since n satisfies (3), inequality (9) implies that

$$\displaystyle{\delta (H,\tau ) \leq 2^{\binom{\ell}{2}}t^{t}n^{-1/2} \leq \frac{\varepsilon } {12\ell!}.}$$

That is, δ(H, τ) satisfies the condition in Lemma 3.

Finally, (4) implies that τ satisfies the condition

$$\displaystyle{\tau = n^{-1/(2\ell)} <\frac{1} {144\ell!^{2}\ell}.}$$

Therefore, the assumptions of Lemma 3 are met and we can let \(\mathcal{C}\) be the collection of subsets from V (H) obtained from applying Lemma 3. Denote the elements of \(\mathcal{C}\) by \(C_{1},C_{2},\ldots,C_{\vert \mathcal{C}\vert }\).

For every choice of \(1 \leq a_{1},\ldots,a_{q} \leq \vert \mathcal{C}\vert\) (not necessarily distinct), let \(E_{a_{1},\ldots,a_{q}}\) be the event that \(G \subseteq C_{a_{1}} \cup \ldots \cup C_{a_{q}}\). Next we will show the following claim.

Claim 2

$$\displaystyle{ \mathbf{P}(X) \leq \mathbf{P}\bigg(\bigvee _{a_{1},\ldots,a_{q}}E_{a_{1},\ldots,a_{q}}\bigg) \leq \sum _{a_{1},\ldots,a_{q}}\mathbf{P}(E_{a_{1},\ldots,a_{q}}). }$$
(10)

Proof

Suppose thatGX. By definition, there exists a q-coloring of the edges of G, say with colors 1, 2, , q, which contains at most M copies of F in each color. For any color class j, let I j denote the set of vertices of H which correspond to edges of color j in G. Since each edge in H[I j ] corresponds to a copy of F in color j, we have e(H[I j ]) ≤ M. Note that

$$\displaystyle{M =\varepsilon \tau ^{\ell}e(H),}$$

which means that each I j satisfies the condition (i) of Lemma 3. Therefore, for each color class j, there must be a set \(C_{a_{j}} \in \mathcal{C}\) such that \(C_{a_{j}} \supset I_{j}\). Since \(G =\bigcup _{j}I_{j}\), this implies that \(G \in E_{a_{1},\ldots,a_{q}}\). Since GX was arbitrary, the bound (10) follows and the claim is proved. □

Owing to Claim 2, we now bound \(\mathbf{P}(E_{a_{1},\ldots,a_{q}})\). Recalling the definition of the event \(E_{a_{1},\ldots,a_{q}}\), we note that

$$\displaystyle{ \mathbf{P}(E_{a_{1},\ldots,a_{q}}) = (1 - p)^{\vert V (H)\setminus (C_{a_{1}}\cup \ldots \cup C_{a_{q}})\vert }. }$$
(11)

Hence, we shall estimate \(\vert V (H)\setminus (C_{a_{1}} \cup \ldots \cup C_{a_{q}})\vert\) to derive a bound for P(X) by (10).

Claim 3

For all choices \(1 \leq a_{1},\ldots,a_{q} \leq \vert \mathcal{C}\vert\) we have

$$\displaystyle{ \vert V (H)\setminus (C_{a_{1}} \cup \ldots \cup C_{a_{q}})\vert \geq \frac{1} {2}\Big( \frac{n} {R}\Big)^{k}. }$$

Proof

Leta 1, , a q be fixed and set

$$\displaystyle{ \mathcal{A} =\bigg\{ A \in \binom{[n]}{R}\::\: \binom{A}{k} \subset C_{a_{1}} \cup \ldots \cup C_{a_{q}}\bigg\}. }$$
(12)

By the definition of R = r(F; q), for each set \(A \in \mathcal{A}\) there is an index j = j(A) ∈ [q] such that \(C_{a_{j}}\) contains a copy of F with vertices from A. The element \(e \in E(C_{a_{j}})\) that corresponds to this copy of F satisfies \(e \subset \binom{A}{k}\) and, thus, \(\bigcup _{x\in e}x \subset A\). We now give an upper bound for \(\vert \mathcal{A}\vert\) by counting the number of pairs in

$$\displaystyle{\mathcal{P} =\bigg\{ (e,A) \in \bigcup _{i=1}^{q}E\big(C_{ a_{i}}\big) \times \mathcal{A}\text{ with }\bigcup _{x\in e}x \subset A\bigg\}.}$$

On the one hand, we have already established that \(\vert \mathcal{P}\vert \geq \vert \mathcal{A}\vert\). On the other hand, for any fixed eE(H), we have \(\vert \bigcup _{x\in e}x\vert = \vert V (F)\vert = t\) and, therefore, there are at most \(\binom{n - t}{R - t}\) sets \(A \supset \bigcup _{x\in e}x\). It follows that

$$\displaystyle{ \begin{array}{ll} \vert \mathcal{A}\vert \leq \vert \mathcal{P}\vert &\leq \bigg\vert \bigcup _{i=1}^{q}E\big(C_{a_{i}}\big)\bigg\vert \binom{n - t}{R - t}\mathop{ \leq }\limits^{ (\textit{ii})}q\varepsilon e(H)\binom{n - t}{R - t} \\ &\mathop{ =}\limits^{(1)}\frac{e(H)} {2R^{t}} \binom{n - t}{R - t} \leq \frac{(n)_{t}} {2R^{t}} \binom{n - t}{R - t} \leq \frac{1} {2}\binom{n}{R}. \end{array} }$$
(13)

By definition, each \(A \in \binom{[n]}{R}\setminus \mathcal{A}\) satisfies \(\binom{A}{k}\not\!\!\subset C_{a_{1}} \cup \ldots \cup C_{a_{q}}\). Hence, \(V (H)\setminus (C_{a_{1}} \cup \ldots \cup C_{a_{q}})\) intersects \(\binom{A}{k}\). Since an element of V (H) can appear in at most \(\binom{n - k}{R - k}\) sets A, it follows from (13) that there are at least

$$\displaystyle{\frac{1} {2}\binom{n}{R}\bigg/\binom{n - k}{R - k} \geq \frac{1} {2}\Big( \frac{n} {R}\Big)^{k}}$$

elements in \(V (H)\setminus (C_{a_{1}} \cup \ldots \cup C_{a_{q}})\), as required. □

In view of Claim 3, our choice of p = 1000R k , where α = n −1∕2+1∕4(+1), and (11), we have, for any \(C_{a_{1}},\ldots,C_{a_{q}} \in \mathcal{C}\),

$$\displaystyle{ \begin{array}{lll} \mathbf{P}(E_{a_{1},\ldots,a_{q}})& \leq &(1 - p)^{(n/R)^{k}/2 } \\ & \leq &\exp \big(-pn^{k}/2R^{k}\big) =\exp \big (-(1000R^{k}q\alpha )n^{k}/2R^{k}\big) \\ & =&e^{-500q\alpha n^{k} } \leq e^{-1000q\alpha N}, \end{array} }$$
(14)

where, in the last step, we used \(N = \binom{n}{k} \leq \frac{n^{k}} {2}\). Therefore, (10) and (14) together with the bound on \(\vert \mathcal{C}\vert\) given by Lemma 3(iii) imply that

$$\displaystyle{\begin{array}{lll} \mathbf{P}(X)& \leq &\sum _{C_{a_{ 1}},\ldots,C_{a_{q}}\in \mathcal{C}}\mathbf{P}(E_{a_{1},\ldots,a_{q}})\ \leq \vert \mathcal{C}\vert ^{q}e^{-1000q\alpha N} \\ & \leq &\exp \big(1000q\ell!^{3}\ell\log (1/\varepsilon )N\tau \log (1/\tau ) - 1000q\alpha N\big) \\ & =&\exp \big(1000qN\tau (\ell!^{3}\ell\log (1/\varepsilon )\log (1/\tau ) -\alpha /\tau )\big) \\ & \leq &\exp \big(1000qN\tau (\ell!^{3}\log ^{2}n - n^{1/(4\ell(\ell+1))})\big) \leq 1/4,\\ \end{array} }$$

where we used that n satisfies (5).

Now, by Markov’s inequality, with probability at least 1∕2, the number of noninduced copies of F in G will be at most twice the expected number of copies, which is fewer than

$$\displaystyle{\begin{array}{lll} 2p^{\ell+1} \frac{(n)_{t}} {\mathop{\mathrm{aut}}\nolimits (F)}\binom{t}{k}& =&2(1000q)^{\ell+1}R^{k(\ell+1)}n^{-1/2-1/(4\ell)} \frac{(n)_{t}} {\mathop{\mathrm{aut}}\nolimits (F)}\binom{t}{k} \\ & <& \frac{1} {2qR^{t}}(n^{-1/(2\ell)})^{\ell} \frac{(n)_{t}} {\mathop{\mathrm{aut}}\nolimits (F)} =\varepsilon \tau ^{\ell} \frac{(n)_{t}} {\mathop{\mathrm{aut}}\nolimits (F)} = M,\end{array} }$$

where the inequality above follows from (6). In other words, \(\mathbf{P}(\overline{Y }) \geq 1/2\) and, therefore, \(\mathbf{P}(\overline{X} \cap \overline{Y }) \geq 1/4\), so there exists a graph G such that \(\overline{X} \cap \overline{Y }\) holds. By our earlier observations, this completes the proof.

3 Concluding Remarks

Beginning with Fox and Sudakov [10], much of the recent work on induced Ramsey numbers for graphs has used pseudorandom rather than random graphs for the target graph G. The results of this paper rely very firmly on using random hypergraphs. It would be interesting to know whether comparable bounds could be proved using pseudorandom hypergraphs.

It would also be interesting to prove comparable bounds for the following variant of the induced Ramsey theorem, first proved by Nešetřil and Rödl [15]: for every graph F, there exists a graph G such that every q-coloring of the triangles of G contains an induced copy of F all of whose triangles receive the same color. By taking F = K t and q = 4, we see that | G | may need to be double exponential in | F |. We believe that a matching double-exponential upper bound should also hold.