Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Gödel’s paper on formally undecidable propositions in first order Peano arithmetic (Gödel 1931) showed that any recursive axiomatic system containing Peano arithmetic still admits propositions which are not decidable. Gödel’s original example of such a proposition was not that illuminating. It was merely a kind of formalization of the well known antinomy of the liar. This raised the problem to look for intuitively meaningful propositions which are independent of Peano arithmetic. Paris and Harrington (1977) showed that a straightforward variant of the finite Ramsey theorem is independent of Peano arithmetic, thus witnessing Gödel’s first incompleteness theorem .

The original short and elegant proof of Paris and Harrington uses model theoretic tools. A different, purely combinatorial explanation of the unprovability by means of fast growing functions was given by Ketonen and Solovay (1981). In this section we present a simplification of the Ketonen-Solovay argument due to Loebl and Nešetřil (1991). We start with some background on fast growing hierarchies.

1 The Hardy Hierarchy

Let \(\gamma _{1} =\omega\) and \(\gamma _{n+1} =\gamma _{ n}^{\omega }\) for every n < ω, i.e.,

$$\displaystyle{\gamma _{n} {=\omega }^{{\omega }^{{\cdot }^{{\cdot }^{{\cdot }^{\omega }} }} }\Big\}n - \text{times}.}$$

Moreover set

$$\displaystyle{\epsilon _{0} {=\omega }^{{\omega }^{{\cdot }^{{\cdot }^{\cdot } }} } =\lim _{n\rightarrow \infty }\gamma _{n}.}$$

Then ε 0 is the least ordinal solution to the equation \({\omega }^{\lambda } =\lambda\). Throughout this section we are only concerned with ordinals below ε 0.

First note that every ordinal below ε 0 admits a unique representation known as the Cantor normal form of α:

Let \(\alpha <\epsilon _{0}\) be a positive ordinal and k be a positive integer. Then α can be represented uniquely as

$$\displaystyle{\alpha {=\omega }^{\alpha _{1}} \cdot n_{1} {+\omega }^{\alpha _{2}} \cdot n_{2} +\ldots {+\omega }^{\alpha _{k}} \cdot n_{k},}$$

where \(\alpha >\alpha _{1} >\alpha _{2} >\ldots >\alpha _{k} \geq 0\) are ordinals and \(n_{1},\ldots,n_{k}\) are positive integers.

Such a coding of ordinals α < ε 0 by positive integers can be defined straightforwardly, compare for example Schütte (1977).

Next we define fundamental sequences which we will subsequently use in order to define the Hardy hierarchy. We need these fundamental sequences in order to handle limit ordinals properly. To every limit ordinal α < ε 0 we associate a strictly monotone sequence α[n], n < ω, which approaches α from below. If \(\alpha <\epsilon _{0}\) is given in Cantor normal form \(\alpha =\alpha ^{\prime} {+\omega }^{\alpha _{k}} \cdot n_{ k},\) where α k is the minimal exponent, let

$$\displaystyle{\alpha [n] = \left \{\begin{array}{l@{\qquad }l} \alpha ^{\prime}\ {+\ \omega }^{\alpha _{k}} \cdot (n_{ k} - 1)\ {+\ \omega }^{\alpha _{k}[n]}, \qquad &\mbox{ if $\alpha _{k}$ is a limit ordinal,} \\ \alpha ^{\prime}\ {+\ \omega }^{\alpha _{k}} \cdot (n_{ k} - 1)\ {+\ \omega }^{\alpha _{k}-1} \cdot (n + 1),\qquad & \mbox{ if $\alpha _{k}$ is a successor ordinal.} \end{array} \right.}$$

For example, \(\omega [n] = n + 1\), \({\omega }^{\omega }[n] {=\omega }^{n+1}\), \({\omega }^{k+1}[n] {=\omega }^{k} \cdot (n + 1)\), and \({\omega }^{k} \cdot (k + 1)[n] {=\omega }^{k} \cdot k {+\omega }^{k-1} \cdot (n + 1)\).

With the help of these fundamental sequences we define functions \(H_{\alpha }(\cdot )\) for all α < ε 0:

$$\displaystyle{\begin{array}{llcl} &H_{0}(n) & =&n, \\ &H_{\alpha +1}(n)& =&H_{\alpha }(n + 1), \\ &H_{\alpha }(n) & =&H_{\alpha [n]}(n)\qquad \mbox{ for limit ordinals.} \\ \mbox{ Finally, define}\ H_{\epsilon _{0}}\ \mbox{ by}& & & \\ &H_{\epsilon _{0}}(n) & =&H_{\gamma _{n}}(n).\\ \end{array} }$$

This is the Hardy hierarchy , introduced by Wainer (1972). This hierarchy is based on a sequence of functions first defined by Hardy (1904) to construct sets of real numbers of cardinality 1. It is not difficult to see that each H α is strictly increasing and \(H_{\alpha }(n) < H_{\alpha +1}(n)\) for every nonnegative integer n.

The significance of the Hardy hierarchy in connection with unprovability results stems from the following theorem, cf. Wainer (1970, 1972) and Buchholz and Wainer (1987).

Theorem 8.1.

Let f: ω →ω be a provably total and recursive function (provably total with respect to Peano arithmetic). Then f is eventually dominated by some H α for an α < ε 0 . Moreover, \(H_{\epsilon _{0}}\) eventually dominates every provably total recursive function but it itself is not provably total.

2 Paris-Harrington’s Unprovability Result

A set Lω is called large, if \(L\neq \varnothing \) and \(\min L \leq \vert L\vert \). So {4, 5, 6, 7} is a large set but not {4, 10, 15}. Let k, n and r be positive integers. With this terminology at hand we can state the following variation of the classical Ramsey theorem that follows from the infinite Ramsey theorem using a compactness argument.

Theorem 8.2.

Let k and r be positive integers. Then there exists a least positive integer n = PH(k,r) such that for every r-coloring Δ: [n] k → r there exists a large subset L ⊆ n with |L| > k such that \(\varDelta \rceil {[L]}^{k}\) is a constant coloring.□

While for the classical Ramsey theorem it is difficult to obtain tight bounds it will turn out that for this seemingly small variation of the classical Ramsey theorem it is already difficult to obtain any kind of bound.

Theorem 8.3 (Paris and Harrington).

The statement

  1. (PH)

    for every pair k,r of positive integers there exists a least positive integer n = PH(k,r) such that for every r-coloring Δ: [n] k → r there exists a large subset L ⊆ n with |L| > k such that \(\varDelta \rceil {[L]}^{k}\) is a constant coloring

is not provable in Peano arithmetic.

For the reader who is not used to work in Peano arithmetic we mention that for statements about natural numbers Peano arithmetic is equivalent to the result of replacing the axiom of infinity by its negation in the usual axioms of Zermelo-Fraenkel set theory (see, e.g., Jech (1978) for these axioms). Obviously, the principle (PH) can be formulated in this theory. In this way Theorem 8.3 should be understood as: the formula of Peano arithmetic corresponding to the principle (PH) is not provable in Peano arithmetic.

Intuitively, a reason for the unprovability of (PH) in Peano arithmetic is that the function PH(k, r) grows too rapidly. Recall that a recursive function f: ωω is provably recursive if one can show in Peano arithmetic that f is total, i.e., defined for all natural numbers. Now it turns out that the function PH(k, k) grows faster than any provably recursive function f, i.e., f(k) < PH(k, k) for all but finitely many k. However, by Theorem 8.2 the function PH(k, k) is total, hence, (PH) is not provable in Peano arithmetic.

The aim of this section is to prove the Paris-Harrington result by purely combinatorial means following an approach of Ketonen and Solovay (1981). Here we follow a simplified approach by Loebl and Nešetřil (1991).

Let α < ε 0 be an ordinal and let \(\alpha {=\omega }^{\alpha _{1}} \cdot n_{1} {+\omega }^{\alpha _{2}} \cdot n_{2} +\ldots {+\omega }^{\alpha _{k}} \cdot n_{k}\) be the Cantor normal form of α. Let \(S_{i}(\alpha ) {=\omega }^{\alpha _{i}} \cdot n_{i}\) be the ith summand in the Cantor normal form of α, let \(C_{i}(\alpha ) = n_{i}\) be the coefficient of the ith summand and let \(E_{i}(\alpha ) =\alpha _{i}\) be the corresponding exponent. If \(\gamma _{h-1} \leq \alpha <\gamma _{h}\) then α is said to be of height h which is abbreviated by h(α) = h. The weight w(α) of α is defined recursively as follows:

$$\displaystyle{w(\alpha ) = \left \{\begin{array}{l@{\qquad }l} \alpha, \qquad &\mbox{ if $\alpha $ is an integer,}\\ \max \{n_{ 1},\ldots,n_{k},w(\alpha _{1}),\ldots,w(\alpha _{k}),k\},\qquad &\mbox{ otherwise.} \end{array} \right.}$$

Let n be an integer. Then (α, n) is called a good pair if \(n > w(\alpha ) + h(\alpha )\). Let \((\alpha,n)\) be a good pair. We define a predecessor function R(α; n) as follows.

$$\displaystyle{R(\alpha;n) = \left \{\begin{array}{l@{\qquad }l} (\alpha -1;n + 1), \qquad &\mbox{ if $\alpha $ is a successor ordinal,}\\ (\alpha [n - h(\alpha )]; n + 1),\qquad &\mbox{ if $\alpha $ is a limit ordinal.} \end{array} \right.}$$

Since \(w(\alpha [n - h(\alpha )]) \leq \max \{ w(\alpha ) + 1,\:n - h(\alpha )\}\), it follows that R(α; n) again is a good pair. As an example, consider γ h , the stack of h many ω’s. Observe that \(h(\gamma _{h}) = h + 1\). Hence, (γ h ; h + 3) is a good pair and so is \(R(\gamma _{h};\:h + 3) = (\gamma _{h}[2];\:h + 4)\).

Let R 0(α; n) = (α; n) and \({R}^{k+1}(\alpha;n) = R({R}^{k}(\alpha;n)).\) By \(\mathcal{R}(\alpha;n)\) we denote the family of all pairs which can be generated by successively applying this predecessor operation, i.e., \(\mathcal{R}(\alpha;n) =\{ {R}^{i}(\alpha;n)\,\vert \,i <\omega \}.\) Finally, let \(r(\alpha,n) = \vert \mathcal{R}(\alpha;n)\vert \). This function can be related to the Hardy hierarchy.

Lemma 8.4.

Let α < ε 0 be an ordinal and let n be a non-negative integer. Then

$$\displaystyle{r(\alpha,\:n + h(\alpha ))\ \geq \ H_{\alpha }(n) - n.}$$

Proof.

We apply transfinite induction on α. Obviously, for every natural number k, r(k, n) is the length of the sequence \((k,n),(k - 1,n + 1),\ldots,(0,n + k)\). Thus \(r(k,n + 1) = k + 1 > H_{k}(n) - n = k\).

In the induction step we have either α + 1 being a successor ordinal. i.e.,

$$\displaystyle\begin{array}{rcl} r(\alpha +1,n + h(\alpha ))& =& 1 + r(\alpha,n + h(\alpha ) + 1) {}\\ & \geq & 1 + H_{\alpha }(n + 1) - n - 1 {}\\ & =& H_{\alpha +1}(n) - n, {}\\ \end{array}$$

or α being a limit ordinal and therefore

$$\displaystyle\begin{array}{rcl} r(\alpha,\:n + h(\alpha ))& =& r(\alpha [n],\:n + h(\alpha ) + 1)\; \geq \; H_{\alpha [n]}(n) - n\; =\; H_{\alpha }(n) - n, {}\\ \end{array}$$

as claimed. □

A family of good pairs is called a good family. If there is a member of such a family of height h and, moreover, the height of each member is at most h then this family is said to be a good family of height h. A good family \((\beta _{0};n_{0}),\ldots,(\beta _{t-1};n_{t-1})\) is monotone if \(\beta _{i} >\beta _{j}\) and \(n_{i} < n_{j}\) for every pair 0 ≤ i < j < t. For instance, \(\mathcal{R}(\gamma _{h};h + 3)\) is a monotone family of height h + 1 for every h < ω.

The following coloring lemma plays the key rôle in the proof of the Paris-Harrington result.

Lemma 8.5.

Let h ≥ 2 be an integer and let \(S =\{ (\beta _{0};n_{0}),\ldots,(\beta _{t-1};n_{t-1})\}\) be a good family of height h such that n i > h + 1 for every i < t. Then there exists a coloring of the (h + 1)-subsets of S with less than 3 h colors such that no monotone subfamily \(S^{\prime} =\{ (\alpha _{0};m_{0}),\ldots,(\alpha _{s-1};m_{s-1})\}\) of S of size |S′| > m 0 is monochromatic.

Proof.

Let \(\epsilon _{0} >\alpha _{0} >\alpha _{1} >\alpha _{2}\) be ordinals in Cantor normal form. Then let \(\varDelta (\alpha _{0},\alpha _{1}) =\min \{ i\mid S_{i}(\alpha _{0})\not =S_{i}(\alpha _{1})\}\) be the index of the largest summand where α 0 and α 1 differ. Recall that \(C_{i}(\alpha _{1}),C_{i}(\alpha _{2})\) denotes the coefficient of the ith summand of \(\alpha _{1},\alpha _{2},\) respectively. We define \(\delta (\alpha _{0},\alpha _{1},\alpha _{2}) < 3\) as follows.

$$\displaystyle{\delta (\alpha _{0},\alpha _{1},\alpha _{2}) = \left \{\begin{array}{l@{\qquad }l} 0,\qquad &\mbox{ if $\varDelta (\alpha _{0},\alpha _{1}) >\varDelta (\alpha _{1},\alpha _{2})$,} \\ 1,\qquad &\mbox{ if $x =\varDelta (\alpha _{0},\alpha _{1}) \leq \varDelta (\alpha _{1},\alpha _{2}) = y$ and $C_{y}(\alpha _{1}) < C_{x}(\alpha _{0})$,} \\ 2,\qquad &\mbox{ otherwise.} \end{array} \right.}$$

Iterating this scheme we associate to every strictly monotone decreasing sequence \(\alpha = (\alpha _{0},\ldots,\alpha _{t-1})\) of ordinals a vector \(\delta (\alpha ) = (\delta _{0},\ldots,\delta _{t-3}) \in {3}^{t-2}\) where \(\delta _{i} =\delta (\alpha _{i},\alpha _{i+1},\alpha _{i+2}).\)

Let \(S =\{ (\beta _{0};n_{0}),\ldots,(\beta _{t-1};n_{t-1})\}\) be a good family of height h such that \(\beta _{0} >\ldots >\beta _{t-1}\). We define a coloring of the (h + 1)-subsets of S by induction on h.

First assume that S is of height 2. Then color every monotone 3-element subset \(\{(\alpha _{0};m_{0}),(\alpha _{1};m_{1}),(\alpha _{2};m_{2})\}\) of S with color \(\delta (\alpha _{0},\alpha _{1},\alpha _{2})\). This is clearly a 3-coloring. Assume that \(S^{\prime} =\{ (\alpha _{0};m_{0}),\ldots,(\alpha _{s-1};m_{s-1})\}\) is a monotone subfamily of S which is monochromatic. Recalling the definition of w(α 0) and the fact that \(m_{0} > w(\alpha _{0}) + h(\alpha _{0})\) we show in the following that | S′ | ≤ m 0. The assumption that S′ is monochromatic with color 0 implies that | S′ | is bounded by the number of summands in the Cantor normal form of α 0 plus one. The assumption that S′ is monochromatic with color 1 implies that | S′ | is at most one more than the size of the coefficient of the largest summand in the Cantor normal form of α 0. Finally, the assumption that S′ is monochromatic with color 2 implies that | S′ | is bounded by the size of the exponent of the first summand in the Cantor normal form of α 0 plus one. This is because h(α 0) = 2, i.e., the exponent is an integer.

Next assume the validity of the lemma for all good families of height h for some h ≥ 2 and assume that S is of height h + 1 and therefore n i > h + 2 for every i < t.

We associate a family H(S) of height h to S as follows. To any 2-subset of S, say \(\{(\alpha _{0};m_{0}),(\alpha _{1};m_{1})\}\), we associate a pair \((\eta _{0};p_{0})\) choosing \(p_{0} = m_{0} - 1\) and \(\eta _{0} = E_{x}(\alpha _{0})\) where \(x =\varDelta (\alpha _{0},\alpha _{1})\). Observe that each such pair is a good pair of height at most h. Let H(S) be the set of all pairs which can be obtained this way. Then H(S) is a good family and p > h + 1 for every pair (η, p) ∈ H(S) is valid. Without loss of generality we can assume that H(S) is of height h. Hence by inductive assumption there exists a coloring of the (h + 1)-subsets of H(S) with less than 3h colors such that no monotone subfamily \({H}^{^{\prime}} =\{ (\eta _{0};p_{0}),\ldots,(\eta _{r-1};p_{r-1})\}\) of H(S) of size | H | > p 0 is monochromatic.

Now color the monotone (h + 2)-subfamilies of S as follows. Let \(T =\{ (\alpha _{0};m_{0}),\ldots,(\alpha _{h+1};m_{h+1})\}\) be such a family. Color T with \(\delta (\alpha _{0},\ldots,\alpha _{h+1}) \in {3}^{h}\) if \(\delta (\alpha _{0},\ldots,\alpha _{h+1})\not =(2,\ldots,2)\). Otherwise consider the (h + 1)-subfamily \(H(T) =\{ (\eta _{0};p_{0}),\ldots,(\eta _{h};p_{h})\}\) of H(S) and color T with the color assigned to H(T) by the inductive assumption.

Obviously, this defines a coloring of all (h + 2)-subfamilies of S with less than \(2 \cdot {3}^{h} < {3}^{h+1}\) many colors. Assume that \(S^{\prime} =\{ (\alpha _{0};m_{0}),\ldots,(\alpha _{s-1};m_{s-1})\}\) is a monotone subfamily of S which is monochromatic. Assume that S′ is monochromatic in some color δ ∈ 3h which is not a constant vector. Then | S′ | ≤ h + 2, but m 0 > h + 2. If S′ is monochromatic with color \((0,\ldots,0) \in {3}^{h}\) or with color \((1,\ldots,1) \in {3}^{h}\) similar arguments as in the case h = 2 show that \(\vert S^{\prime}\vert \leq w(\alpha _{0}) + 1\) but \(m_{0} > w(\alpha _{0}) + h(\alpha _{0})\). It remains to consider the case that S′ is monochromatic with color \((2,\ldots,2) \in {3}^{h}\). But then the family \(H(S^{\prime}) =\{ (\eta _{0};p_{0}),\ldots,(\eta _{s-2};p_{s-2})\},\) where \(p_{i} = m_{i} - 1\) and \(\eta _{i} = E_{x}(\alpha _{i})\) with \(x =\varDelta (\alpha _{0},\alpha _{1})\) for every is − 2, is a monotone subfamily of H(S) and, by definition of the coloring of S, monochromatic. Hence, by inductive assumption, \(\vert H(S^{\prime})\vert \leq p_{0} = m_{0} - 1\) and so | S′ | ≤ m 0. □

Lemma 8.6.

Let h ≥ 2. Then

$$\displaystyle{\mathit{PH}(h + 2,\;{3}^{h+1} + 2h)\ >\ H_{\gamma _{ h}}(h) + h.}$$

Proof.

Consider the monotone family \(\mathcal{R}(\gamma _{h};2h + 1) =\{ (\alpha _{0};m_{0}),\ldots,(\alpha _{t-1};m_{t-1})\}.\) Obviously, \(\alpha _{0} =\gamma _{h}\), \(m_{0} = 2h + 1\) and \(m_{i+1} = m_{i} + 1\) for every i < t − 1. By Lemma 8.4 we have that \(t \geq H_{\gamma _{h}}(h) - h.\) Since \(\mathcal{R}(\gamma _{h};2h + 1)\) is a monotone family of height h + 1 and m i > h + 2 for every i < t, by Lemma 8.5 there exists a coloring of the (h + 2)-subsets of \(\mathcal{R}(\gamma _{h};2h + 1)\) with less than 3h+1 colors such that no monotone subfamily S′ ⊆ S of size | S′ | > 2h + 1 is monochromatic. This induces obviously a coloring of the (h + 2)-subsets of \(M =\{ 0,\ldots,2h,2h + 1,\ldots,m_{t-1}\}\) with \({3}^{h+1} + 2h\) many colors having the property that there is no large subset of L which is monochromatic. Hence

$$\displaystyle\begin{array}{rcl} \mathit{PH}(h + 2,\;{3}^{h+1} + 2h)& \geq & m_{ t-1} + 1\ \geq \ 2h + t + 1 {}\\ & \geq & H_{\gamma _{h}}(h) + h + 1. {}\\ & & {}\\ \end{array}$$

Proof of Theorem 8.3.

$$\displaystyle{PH(h + 2,\;{3}^{h+1} + 2h)\ >\ H_{\gamma _{ h}}(h) + h + 1\ \geq \ H_{\epsilon _{0}}(h),}$$

and so, by Theorem 8.1, PH(h, h) is not a provably total and recursive function. □

It should be mentioned that other variants of Ramsey-type theorems give rise to functions which grow even much faster than the Paris-Harrington function. For example, in Prömel et al. (1991) fast growing functions based on Ramsey’s theorem are investigated which grow faster than any recursive function which can proved to be total in the formal system ATR0.