1 Introduction

It is well known that first-order modal logic faces fundamental limitations in expressive power. Some standard examples used to illustrate this include:

  1. (E)

    There could have been things other than there actually are.Footnote 1

  2. (R)

    Everyone who is actually rich could have been poor.Footnote 2

Informally, the first is true iff there is a possible world where something exists that does not exist in the actual world. The second has multiple readings, but on one reading, it is true iff there is a possible world where everyone who is rich in the actual world is poor in that world. It can been shown that no formula in basic first-order modal logic with actualist quantifiers (i.e., quantifiers ranging over existents) is equivalent to (E) or to (R).Footnote 3 We can regiment (E) using a possibilist existential quantifier \(\Sigma {}\) (i.e., a quantifier ranging over all possible objects) and an existence predicate \({\textsf {E}} \) as follows:

$$\begin{aligned} \Sigma {x}\left( \lnot {\textsf {E}} (x) \wedge \Diamond {\textsf {E}} (x)\right) . \end{aligned}$$
(1)

But one can prove that even with these additions, there is still no formula that is equivalent to (R).Footnote 4

In response to these expressive limitations, many authors have considered extending first-order modal logic with an “actually” operator \(@\).Footnote 5 They then point out that in the presence of \(@\) and the possibilist universal quantifier \(\Pi {}\), the following is equivalent to (R):

$$\begin{aligned} \Diamond \Pi {x}\left( @{\textsf {Rich}} (x) \rightarrow {\textsf {Poor}} (x)\right) . \end{aligned}$$
(2)

However, even with possibilist quantifiers and the actuality operator, sentences like the following seem to remain inexpressible:Footnote 6

  1. (NR)

    Necessarily, everyone who was rich could have been poor.

One could try to fix this problem by adding more and more operators to the language, some of which we will discuss below. But many such languages face further expressivity limitations themselves.Footnote 7 Corresponding expressive limitations also arise for first-order temporal logic, though we will mostly focus on the modal versions until the end of this paper.

Very often, these inexpressibility claims are justified in the literature only by example: all of the most straightforward attempts at formalizing these English sentences into first-order modal logic fail. While this style of argument may be convincing, it does not constitute a proof. One can sometimes find rigorous proofs for a variety of inexpressibility claims.Footnote 8 But only (Hodes 1984a, b, c) provides proofs of the inexpressibility of (R), (E), and sentences like them in extensions of first-order modal logic with two-dimensional operators such as \(@\).Footnote 9 And while these proofs are very interesting and involve a number of underappreciated techniques, they are quite complicated and difficult to generalize to other formal languages of interest.

In this paper, I will use a modular notion of bisimulation to characterize the expressive power of extensions of first-order modal logic with two-dimensional operators. After reviewing basic first-order modal logic (Sect. 2), I will provide a single proof method for characterizing the expressive power of a wide variety of first-order modal languages using bisimulations (Sect. 3). I will then present a variety of inexpressibility proofs using this technique (Sect. 4). I will conclude by generalizing these results to temporal logics and higher-dimensional logics (Sect. 5). The more intricate details are left to appendices (Appendices 1–4).

2 First-order modal logic

In this section, we review the standard possible worlds semantics for first-order modal logic. The technical details below are fairly standard except our points of evaluation need to be two-dimensional to account for operators like the actuality operator \(@\). While we have picked a particularly simple formulation of first-order modal logic, the inexpressibility results we explore in this paper apply to a wide range of formulations of first-order modal logic.Footnote 10

The signature for our plain vanilla first-order modal language contains:

  • \({\textsf {VAR}} \) = (the set of (object) variables);

  • \({\textsf {PRED}} ^n\) = for each \(n \ge 1\) (the set of n-place predicates);

The set of formulas in or -formulas is defined recursively:

where \(P^n \in {\textsf {PRED}} ^n\) for any \(n \ge 1\), and . The usual abbreviations for \(\bot \), \(\vee \), \(\rightarrow \), \(\leftrightarrow \), , and \(\Diamond \) apply. We may drop parentheses for readability. If the free variables of \(\varphi \) are among , we may write “” to indicate this.

To talk more easily about extensions of , we will introduce a convention. Let be some new symbols with pre-defined syntax. The language obtained from by adding is . Some symbols that might be added include:

where \(\approx \) is the identity relation, \({\textsf {E}} \) is an existence predicate, \(@\) is an “actually” operator, \({\mathop {\downarrow }}\) is a diagonalization operator (the inverse of \(@\)),Footnote 11\(\mathcal {F}\) is a “fixedly” operator,Footnote 12 is a quantifier over all actual objects,Footnote 13 and \(\Pi {}\) is the possibilist universal quantifier (its existential counterpart is \(\Sigma {}\)). The usual abbreviations apply. In what follows, we will let “\(\mathcal {L}\)” stand for any arbitrary where are among the symbols above.

Definition 1

(Models) A model is a tuple \(\mathcal {M} \) = \(\left\langle W,R,F,D,\delta ,I\right\rangle \) where:

  • W is a nonempty set (the state space);

  • \(R \subseteq W \times W\) (the \(\Box \)-accessibility relation);

  • \(F \subseteq W \times W\) (the \(\mathcal {F}\)-accessibility relation);

  • D is a nonempty set disjoint from W (the (global) domain);

  • \(\delta :W \rightarrow \wp \left( D\right) \) is a function (the local domain assignment), where for each \(w \in W\), \(\delta (w)\) is the local domain ofw;

  • \(I :{\textsf {PRED}} ^n \times W \rightarrow \wp (D^n)\) (for all \(n \ge 1\)) is a function (the interpretation function).

We will let \(\mathcal {M} \)’s state space be \(W^\mathcal {M} \), \(\mathcal {M} \)’s \(\Box \)-accessibility relation be \(R^\mathcal {M} \), etc. We will define (and likewise for F). If \(R = W \times W\), we will say R is universal. If \(D =\bigcup _{w \in W} \delta (w)\) (i.e., D does not contain impossible objects), we will say D satisfies the domain constraint. We will let \(\mathbf U \) be the class of models where R and F are universal, \(\mathbf D \) be the class of models where D satisfies the domain constraint, and UD be their intersection.

Let \(\mathcal {M} \) be a model. A variable assignment for\(\mathcal {M} \) is a function g mapping variables to elements in D. Let the set of variable assignments for \(\mathcal {M} \) be \({\textsf {VA}} (\mathcal {M} )\). If \(g \in {\textsf {VA}} (\mathcal {M} )\), then \(g^x_a\) is the result of modifying g by mapping x to a.

For readability, if is a sequence (of terms, objects, etc.), we may write “\(\overline{\alpha } \)” in place of “”. \(\overline{\alpha } \) is assumed to be of the appropriate length, whatever that is in a given context. Let \(\left| \overline{\alpha } \right| \) be the length of \(\overline{\alpha } \). When f is some unary function, we may write “\(f(\overline{\alpha })\)” in place of “”. For instance, if g is a variable assignment, “\(g(\overline{x})\)” on this notation stands for “”. Likewise, “\(g^{\overline{x}}_{\overline{a}}\)” stands for “”.

Since we want to consider operators like \(@\), our possible worlds semantics will be two-dimensional [as suggested in, e.g., Davies and Humberstone (1980, pp. 4–5)]. That is, indices will have to contain two worlds. The first world is to be interpreted as the world “considered as actual” and the second as the world of evaluation.

Definition 2

(Satisfaction for ) The -satisfaction relation\(\Vdash \) is defined recursively, for all models \(\mathcal {M} = \left\langle W,R,F,D,\delta ,I\right\rangle \), all \(w,v \in W\) and all \(g \in {\textsf {VA}} (\mathcal {M} )\):

If \(\left| \overline{x} \right| \le \left| \overline{a} \right| \), then \(\mathcal {M} ,w,v \Vdash \varphi [\overline{a} ]\) if for all \(g \in {\textsf {VA}} (\mathcal {M} )\), \(\mathcal {M} ,w,v,g^{\overline{x}}_{\overline{a}} \Vdash \varphi (\overline{x})\).

Definition 3

(Validity) Let \(\mathbf C \) be a class of models. We will say \(\varphi \) is (generally)C-valid—written as \(\Vdash _\mathbf C \varphi \)—if \(\mathcal {M} ,w,v,g \Vdash \varphi \) for all \(\mathcal {M} \in \mathbf C \), all \(w,v \in W^\mathcal {M} \), and all \(g \in {\textsf {VA}} (\mathcal {M} )\). We will say \(\varphi \) is diagonallyC-valid—written as \(\Vdash _\mathbf C ^\text {d} \varphi \)—if \(\mathcal {M} ,w,w,g \Vdash \varphi \) for all \(\mathcal {M} \in \mathbf C \), all \(w \in W^\mathcal {M} \), and all \(g \in {\textsf {VA}} (\mathcal {M} )\). If \(\varphi \leftrightarrow \psi \) is (diagonally) \(\mathbf C \)-valid, we will say \(\varphi \) and \(\psi \) are (diagonally)C-equivalent. If \(\mathbf C \) is the class of all models, we may drop mention of \(\mathbf C \) and just say “valid” or “equivalent”.

We could have defined some of the additional symbols above in terms of others, assuming the others are present. For instance, , , and are all valid (we will invoke these throughout without explicit mention of them). Thus, by the following lemma, we could have taken the lefthand side of these biconditionals to be abbreviations for their righthand side:

Lemma 4

(Replacement of equivalents) Suppose \(\varphi \) and \(\psi \) have the same free variables. Let \(\theta '\) be a formula that results from replacing any number of instances of \(\varphi \) in \(\theta \) with \(\psi \). Then \(\Vdash _\mathbf C \varphi \leftrightarrow \psi \) implies \(\Vdash _\mathbf C \theta \leftrightarrow \theta '\).

This follows by a straightforward induction. If we replace \(\Vdash _\mathbf C \) in Lemma 4 with \(\Vdash _\mathbf C ^\text {d}\), then the result no longer holds (for instance, \(\Vdash ^\text {d} @P(x) \leftrightarrow P(x)\), but  \(\nVdash ^\text {d} \Box @P(x) \leftrightarrow \Box P(x)\)). However, if \(\Vdash _\mathbf C ^\text {d} \varphi \leftrightarrow \psi \), then \(\Vdash _\mathbf C \star \varphi \leftrightarrow \star \psi \), where , in which case we can replace \(\star \varphi \) with \(\star \psi \).

We can think of Definition 2 as specifying a translation from the modal language into the language of possible worlds. We can make this more precise by formally defining the language of possible worlds, which is often called the correspondence language.Footnote 14 The correspondence language is a two-sorted first-order language: one sort for objects, and one sort for worlds. The signature for our two-sorted first-order language contains \({\textsf {VAR}} \), \({\textsf {PRED}} \), and:

  • \({\textsf {SVAR}} \) = (the set of state variables).

The set of formulas in or -formulas is defined recursively:

where \(P^n \in {\textsf {PRED}} ^n\), \(x,y,\overline{y} \in {\textsf {VAR}} \), and \(s,t \in {\textsf {SVAR}} \). I will typically use \(\alpha ,\beta ,\gamma ,\dots \) for -formulas to distinguish them from -formulas.

To illustrate, here are the intended formalizations of (E), (R), and (NR), where \(s^*\) is meant to be interpreted as the actual world (which we will assume is the same as our starting world of evaluation, just for simplicity):Footnote 15

  1. (E)

    There could have been things other than there actually are.

    (3)
  2. (R)

    Everyone who’s actually rich could have been poor.

    (4)
  3. (NR)

    Necessarily, everyone who’s rich could have been poor.

    (5)

To define ’s semantics, we need to modify the definition of a variable assignment for\(\mathcal {M} \) so that not only do variable assignments map variables to elements of D, but they also map state variables to elements of W. Then satisfaction in is just the standard notion of satisfaction for two-sorted first-order logic:

Definition 5

(Satisfaction for ) The -satisfaction relation\(\vDash \) is defined recursively for all models \(\mathcal {M} \) and all \(g \in {\textsf {VA}} (\mathcal {M} )\):

We say \(\alpha \) is C-valid—written as \(\vDash _\mathbf C \alpha \)—if \(\mathcal {M} ,g \vDash \alpha \) for all \(\mathcal {M} \in \mathbf C \) and all \(g \in {\textsf {VA}} (\mathcal {M} )\). Equivalence is defined likewise.

We can now make more precise the thought that Definition 2 is specifying a translation:

Definition 6

(Standard translation) Let \(\varphi \) be a \(\mathcal {L}\)-formula, and let \(s,t \in {\textsf {SVAR}} \). The standard translation of\(\varphi \)with respect to\(\left\langle s,t\right\rangle \), , is defined recursively:

where \(s'\) and \(t'\) are state variables not occurring anywhere in . If \(\varPhi \) is a set of \(\mathcal {L}\)-formulas, then we will let .

The following lemma, which can be proved using a simple induction on formulas, states that \({\textsf {ST}} \) translates every \(\mathcal {L}\)-formula into an equivalent -formula:

Lemma 7

(Translation) Let \(\mathcal {M} \) be a model, \(w,v \in W^\mathcal {M} \), \(g \in {\textsf {VA}} (\mathcal {M} )\), \(s,t \in {\textsf {SVAR}} \), and \(\varphi \) an \(\mathcal {L}\)-formula. Then \(\mathcal {M} ,w,v,g \Vdash \varphi \) iff .

In other words, Lemma 7 tells us that we can think of “\(\mathcal {M} ,w,v,g \Vdash \varphi \)” as a notational variant of “”. In what follows, we will implicitly identify an extension of with its equivalent fragment of the two-sorted language. The question now is to what extent we can find a translation that goes the other way. To help answer this question, we can define a formal notion of expressivity relative to as follows:

Definition 8

(Expressivity) Let \(\mathbf C \) be a class of models. We will say a set of -formulas \(\varGamma \)C-expresses a set of -formulas \(\varDelta \) if \(\varGamma \) is \(\mathbf C \)-equivalent to \(\varDelta \)—that is, for all \(\mathcal {M} \in \mathbf C \) and all \(g \in {\textsf {VA}} (\mathcal {M} )\), we have that \(\mathcal {M} ,g \Vdash \varGamma \) iff \(\mathcal {M} ,g \Vdash \varDelta \). If either \(\varGamma \) or \(\varDelta \) are singletons, we can drop the set brackets for readability. Where \(\mathcal {L}\) is a fragment of , we will say \(\varGamma \) is C-expressible in \(\mathcal {L}\) if there is a set of \(\mathcal {L}\)-formulas that \(\mathbf C \)-expresses \(\varGamma \). Where \(\mathcal {L}_1\) and \(\mathcal {L}_2\) are fragments of , we will say \(\mathcal {L}_2\)C-expresses\(\mathcal {L}_1\) or \(\mathcal {L}_1\) is C-included in \(\mathcal {L}_2\)—written as \(\mathcal {L}_1 \le _\mathbf C \mathcal {L}_2\)—if for any set of \(\mathcal {L}_1\)-formulas \(\varGamma \), there is a set of \(\mathcal {L}_2\)-formulas \(\varDelta \) that \(\mathbf C \)-expresses \(\varGamma \). We will write \(\mathcal {L}_1 <_\mathbf C \mathcal {L}_2\) if \(\mathcal {L}_1 \le _\mathbf C \mathcal {L}_2\) and \(\mathcal {L}_2 \nleq _\mathbf C \mathcal {L}_1\), and \(\mathcal {L}_1 \equiv _\mathbf C \mathcal {L}_2\) if \(\mathcal {L}_1 \le _\mathbf C \mathcal {L}_2\) and \(\mathcal {L}_2 \le _\mathbf C \mathcal {L}_1\).

These definitions apply to extensions of , viewed as fragments of . Thus, where \(\varGamma \) is a set of -formulas, and where \(\mathcal {L}\) is an extension of , we will say \(\varGamma \) is C-expressible in \(\mathcal {L}\) if there is a set of \(\mathcal {L}\)-formulas \(\varPhi \) such that \(\varGamma \) is \(\mathbf C \)-equivalent to . Likewise, if \(\mathcal {L}_1\) and \(\mathcal {L}_2\) are extensions of first-order modal logic, we will write \(\mathcal {L}_1 \le _\mathbf C \mathcal {L}_2\) if for any set of \(\mathcal {L}_1\)-formulas \(\varPhi \), there is a set of \(\mathcal {L}_2\)-formulas \(\varPsi \) such that is \(\mathbf C \)-equivalent to . Similarly for \(<_\mathbf C \) and \(\equiv _\mathbf C \).

3 Bisimulation

To show that no formula (or set of formulas) of a modal language \(\mathcal {L}\) can express a certain formula \(\alpha \) of , one must generally construct two models such that (a) they agree in \(\mathcal {L}\) on all \(\mathcal {L}\)-formulas (i.e., they are \(\mathcal {L}\)-equivalent), and (b) they disagree in on \(\alpha \). To make showing that such models are \(\mathcal {L}\)-equivalent easier, we can appeal to the notion of a bisimulation.Footnote 16 The notion of a bisimulation for first-order modal logic has not been discussed much until recently.Footnote 17 Below, we extend the notion of bisimulation in order to ensure modal equivalence for formulas involving two-dimensional operators.

A bisimulation is basically a back-and-forth game. In the standard back-and-forth game for (non-modal) first-order logic, there are two players, Abelard and Eloïse. Abelard aims to refute Eloïse’s attempt to show that the two models satisfy the same closed formulas. Abelard starts by picking an object from one of the models. Eloïse must then pick a matching object from the other model that satisfies the same atomic formulas. They continue in this manner, making sure at all times that the objects picked out so far from one model satisfy exactly the same atomic formulas that the objects picked out from the other model satisfy. If at any point the objects picked out from one model do not satisfy the same atomic formulas as the objects picked out from the other model, then Abelard wins. But if Eloïse manages to extend the game out for an infinite number of rounds, she wins. Two first-order models are elementarily equivalent (i.e., satisfy the same closed first-order formulas) if (but not only if) Eloïse has a winning strategy in this game for those models.

Likewise, two modal models satisfy the same -formulas if Eloïse has a winning strategy for a back-and-forth game like the one above, with some modifications. In the modified game, the game is “located” at some world(s) in the two models. When Abelard picks an object from the model, he must pick an object that exists at the world where the game is located; likewise with Eloïse. Now the catch: Abelard can choose, at any time, to change the location of the game in either model to any accessible world from the current location. In order to keep playing, Eloïse must likewise pick a matching accessible world in the other model to relocate the game to. The game then relocates to those accessible worlds, and the game continues. As before, if the objects that have been picked out from one model do not satisfy the same atomic formulas at the game’s current location that are satisfied by the objects picked out from the other model, then Abelard wins. But if Eloïse manages to extend the game out for an infinite number of rounds, she wins. Two worlds in two models will satisfy the same -formulas if Eloïse has a winning strategy in this game, where the game starts at those two worlds. More variations arise when different extensions of are considered. More precisely:

Definition 9

(Bisimulation) Let \(\mathcal {M} \) and \(\mathcal {N} \) be models. An -bisimulation between\(\mathcal {M} \)and\(\mathcal {N} \) is a nonempty variably polyadic relation Z such that for all \(w,v \in W^\mathcal {M} \), all \(w',v' \in W^\mathcal {N} \), all finite \(\overline{a} \in D^\mathcal {M} \), and all finite \(\overline{b} \in D^\mathcal {N} \) where \(\left| \overline{a} \right| = \left| \overline{b} \right| \), we have that \(Z(w,v,\overline{a};w',v',\overline{b})\) only if:

  • (Atomic) :

  • (Zig)

  • (Zag)

  • (Forth)

  • (Back) .

We may write “\(\mathcal {M} ,w,v,\overline{a} \leftrightarrows \mathcal {N} ,w',v',\overline{b} \)” (where possibly \(\left| \overline{a} \right| = \left| \overline{b} \right| = 0\)) to indicate that \(\mathcal {M} ,w,v,\overline{a} \) and \(\mathcal {N} ,w',v',\overline{b} \) are -bisimilar, i.e., that there is a bisimulation Z between \(\mathcal {M} \) and \(\mathcal {N} \) such that \(Z(w,v,\overline{a};w',v',\overline{b})\).

The notion of an -bisimulation between\(\mathcal {M} \)and\(\mathcal {N} \) is defined similarly, except one must add the corresponding condition(s) below:

We may write “\(\mathcal {M} ,w,v,\overline{a} \leftrightarrows _{\mathcal {L}} \mathcal {N} ,w',v',\overline{b} \)” to indicate that \(\mathcal {M} ,w,v,\overline{a} \) and \(\mathcal {N} ,w',v',\overline{b} \) are \(\mathcal {L}\)-bisimilar. We may also sometimes write “”, where , for readability.

Here are the various conditions phrased in terms of games. (Atomic) says that Eloïse loses unless \(\overline{a} \) satisfy the same atomic formulas in \(\mathcal {M} ,w,v\) that \(\overline{b} \) satisfy in \(\mathcal {N} ,w',v'\). (Zig) says that if Abelard decides to move the game to \(\left\langle w,u\right\rangle \) in \(\mathcal {M} \) where \(u \in R^\mathcal {M} [v]\), Eloïse must choose a \(u' \in R^\mathcal {N} [v']\) and relocate the game in \(\mathcal {N} \) to \(\left\langle w',u'\right\rangle \) to continue. Likewise for (Zag). (Forth) says that if Abelard picks an object \(a'\) from v, Eloïse must pick an object \(b'\) from \(v'\) to match it with. Likewise for (Back). (Eq) says that if Abelard picks an object that was already chosen, Eloïse must pick the matching object. (Ex) says that the objects picked have to agree in terms of existence, even when the game relocates. (Act) says that Abelard can force the game to relocate to \(\left\langle w,w\right\rangle \) and \(\left\langle w',w'\right\rangle \). Likewise for (Diag). The other clauses are as before, except with respect to different domains and relations.

Definition 10

(Modal equivalence) \(\mathcal {M} ,w,v,\overline{a} \) and \(\mathcal {N} ,w',v',\overline{b} \) are -equivalent or modally equivalent if for all \(\mathcal {L}\)-formulas \(\varphi (\overline{x})\) (where \(\left| \overline{x} \right| \le \left| \overline{a} \right| \)), \(\mathcal {M} ,w,v \Vdash \varphi [\overline{a} ]\) iff \(\mathcal {N} ,w',v' \Vdash \varphi [\overline{b} ]\). We may write “\(\mathcal {M} ,w,v,\overline{a} \equiv _\mathcal {L}\mathcal {N} ,w',v',\overline{b} \)” to indicate that \(\mathcal {M} ,w,v,\overline{a} \) and \(\mathcal {N} ,w',v',\overline{b} \) are \(\mathcal {L}\)-equivalent.

Theorem 11

(Bisimulation implies modal equivalence) Where , if \(\mathcal {M} ,w,v,\overline{a} \leftrightarrows _{\mathcal {L}} \mathcal {N} ,w',v',\overline{b} \), then \(\mathcal {M} ,w,v,\overline{a} \equiv _{\mathcal {L}} \mathcal {N} ,w',v',\overline{b} \).

In general, modal equivalence does not imply bisimulation.Footnote 18 However, it does when infinitary conjunction is present in the language. Consider the symbol with the following formation rule: if \(\varPhi \) is a set of well-formed formulas (of any size), then is a well-formed formula. Then it can be shown that iff .Footnote 19 Thus, bisimulation is equivalent to infinitary modal equivalence. No bisimulation clauses need to be added for .

Adding infinitary conjunction to the language clearly increases the expressive power of the language. For example, one can regiment the sentence “There are infinitely many rich people” as , where is short for the formula , and is short for the formula . However, infinitary conjunction does not increase the expressive power enough to overcome the particular expressive limitations discussed here, so we set it aside in what follows.

Now, recall the definition of expressibility (Definition 8).

Corollary 12

(Translation implies invariance) Let \(\alpha (\overline{x};s,t)\) be an -formula. If , and if \(\alpha \) is equivalent to the translation of some set of -formulas, then \(\mathcal {M} \vDash \alpha [\overline{a};w,v]\) iff \(\mathcal {N} \vDash \alpha [\overline{b};w',v']\). In other words, if \(\mathcal {M} ,w,v,\overline{a} \) and \(\mathcal {N} ,w',v',\overline{b} \) are -bisimilar, but they disagree on \(\alpha \), then \(\alpha \) is not expressible as a set of -formulas.

Corollary 12 says that if a -formula is equivalent to the translation of an \(\mathcal {L}\)-formula (or a set of \(\mathcal {L}\)-formulas), then it is preserved under \(\mathcal {L}\)-bisimulations. As in propositional modal logic, the converse also holds (see Appendix 1 for the proof).Footnote 20

Theorem 13

(van Benthem Characterization Theorem) Let \(\alpha (\overline{x};s,t)\) be an -formula such that \(\mathcal {M} \vDash \alpha [\overline{a};w,v]\) iff \(\mathcal {N} \vDash \alpha [\overline{b};w',v']\) given that \(\mathcal {M} ,w,v,\overline{a} \leftrightarrows _{\mathcal {L}} \mathcal {N} ,w',v',\overline{b} \). Then \(\alpha \) is equivalent to the translation of some \(\mathcal {L}\)-formula.

This together with Theorem 11 implies that \(\mathcal {L}\) is just the \(\mathcal {L}\)-bisimulation invariant fragment of . For our purposes, however, Corollary 12 will be the key result in generating the inexpressibility results below.

4 Inexpressibility

While Corollary 12 and Theorem 13 exactly characterize the expressive power of and its various extensions, the characterization is a bit abstract, and it does not automatically tell us what the expressive power of these extensions are relative to one another. We now turn to illustrating the expressive limitations of and its extensions with concrete examples. Note that all of our models in this section fall in the class UD, so these inexpressibility results therefore apply to any class that includes UD.

To warm up, we start by showing that (E) is not expressible in . Recall (E) says that there could have been things other than there are, which is formalized in as (3):

(3)

The proof strategy will always be the same: construct two modal models that are bisimilar, but that disagree on (3). Because we do not have \(\approx \) in by default, this is actually very easy. Let \(\mathcal {E} = \left\langle W,R,F,D,\delta ,I\right\rangle \), where , R and F are universal, , and \(I(P,u) = \emptyset \). Let \(\mathcal {E} ' = \left\langle W,R,F,D',\delta ',I\right\rangle \), where everything is as in \(\mathcal {E} \), except . See Fig. 1 for a picture.

Fig. 1
figure 1

-bisimilar models that disagree on (E)

It is easy to see that w in \(\mathcal {E} \) does not satisfy (3): every possible object exists at w. However, w in \(\mathcal {E} '\) does satisfy (3): b is a possible object that does not exist at w. So \(\mathcal {E},w\) and \(\mathcal {E} ',w\) disagree on (3). So we just need to show that \(\mathcal {E},w,w \leftrightarrows \mathcal {E} ',w,w\). In fact, in this case, we can just take Z to map every world to every world, and every object to every object any number of times. To show this is a bisimulation, we just check each of the clauses from Definition 9 holds, which is easy to do. One might initially think that we will run into problems in trying to show (Back) holds; for if we consider Z(wvawva), and we decide to pick b from \(\delta '(v)\), then we cannot pick b from \(\delta (v)\) to match it with. But since a and b do not disagree on any predicates, and since \(\approx \) is not present in the language, cannot tell that a and b are distinct objects anyway. We do not have to match a to a and b to b every time. We can just as well match b in \(\delta '(v)\) with a in \(\delta (v)\).

What made this proof easy was the absence of \(\approx \). Now we will show that even cannot express (E).Footnote 21 Consider first \(\mathcal {E} _1 = \left\langle W_1,W_1^2,W_1^2,\mathbb {N} ,\delta _1,I_1\right\rangle \), where the global domain is \(\mathbb {N} \) and the accessibility relations are both universal. For each nonempty finite or cofinite \(S \subseteq \mathbb {N} \), there is a world \(w_S \in W_1\) such that \(\delta _1(w_S) = S\). No other worlds are in \(W_1\). Again, the extension of all non-logical predicates will be empty at all worlds. The second model is similar to the first, but now the global domain contains an additional object \(\infty \). For each nonempty set S that is either finite or cofinite in , there is a world \(w_S \in W_2\) such that \(\delta _2(w_S) = S\). See Fig. 2 for a picture.

Fig. 2
figure 2

-bisimilar models disagreeing on (E)

Since \(\delta _1(w_\mathbb {N} ) = \mathbb {N} = D\), \(w_\mathbb {N} \) in \(\mathcal {E} _1\) does not satisfy (3). But \(w_\mathbb {N} \) in \(\mathcal {E} _2\) does satisfy (3), since \(\infty \notin \delta _2(w_\mathbb {N} )\). So \(\mathcal {E} _1,w_\mathbb {N} \) and \(\mathcal {E} _2,w_\mathbb {N} \) disagree on (3). So we just need to show that \(\mathcal {E} _1,w_\mathbb {N} ,w_\mathbb {N} \leftrightarrows _{\approx } \mathcal {E} _2,w_\mathbb {N} ,w_\mathbb {N} \). Constructing the bisimulation is fairly straightforward (albeit tedious) once we work out what Eloïse’s winning strategy is. The construction of the bisimulation is given in Appendix 2, but the idea in terms of games is sketched below. To help describe the proof, let us introduce the following useful definition:

Definition 14

(Partial isomorphism) A partial isomorphism between\(\mathcal {M} ,w,v,\overline{a} \)and\(\mathcal {N} ,w',v',\overline{b} \) is a finite partial map \(\rho :D \rightarrow D'\) such that \(\rho (a_i) = b_i\) for \(i \le \left| \overline{a} \right| \) and:

  1. (Predicate)

    :

We write \(\mathcal {M} ,w,v,\overline{a} \simeq \mathcal {N} ,w',v',\overline{b} \) to indicate that there is a partial isomorphism between \(\mathcal {M} ,w,v,\overline{a} \) and \(\mathcal {N} ,w',v',\overline{b} \).

To say that \(\mathcal {M} ,w,v,\overline{a} \simeq \mathcal {N} ,w',v',\overline{b} \) is essentially to say that Eloïse can continue the game (i.e., Abelard has not won yet) at this stage of the game (even with \(\approx \) present).

Proposition 15

(Inexpressibility of (E)) \(\mathcal {E} _1,w_\mathbb {N} ,w_\mathbb {N} \leftrightarrows _{\approx } \mathcal {E} _2,w_\mathbb {N} ,w_\mathbb {N} \). But \(\mathcal {E} _2 \vDash \) (3)\([w_\mathbb {N} ]\) while \(\mathcal {E} _1 \nvDash \) (3)\([w_\mathbb {N} ]\). Hence, (3) is not expressible in .

Proof

(Sketch) Our game starts at \(\mathcal {E} _1,w_\mathbb {N} ,w_\mathbb {N} \) and \(\mathcal {E} _2,w_\mathbb {N} ,w_\mathbb {N} \). We will describe a strategy for Eloïse such that, at every stage of the game, which we will represent as \(\left\langle w_\mathbb {N} ,v_1,\overline{a};w_\mathbb {N} ,v_2,\overline{b} \right\rangle \), we have that \(\mathcal {E} _1,w_\mathbb {N} ,v_1,\overline{a} \simeq \mathcal {E} _2,w_\mathbb {N} ,v_2,\overline{b} \) (in other words: Eloïse can continue the game at every stage of the game). We construct the strategy by induction on Abelard’s move.

Vacuously, \(\mathcal {E} _1,w_\mathbb {N} ,w_\mathbb {N} \simeq \mathcal {E} _2,w_\mathbb {N} ,w_\mathbb {N} \). So suppose \(\left\langle w_\mathbb {N} ,v_1,\overline{a} '; w_\mathbb {N} ,v_2,\overline{b} \right\rangle \) is the current stage of the game, where \(\mathcal {E} _1,w_\mathbb {N} ,v_1,\overline{a} \simeq \mathcal {E} _2,w_\mathbb {N} ,v_2,\overline{b} \) and where \(\left| \delta _1(v_1) \right| = \left| \delta _2(v_2) \right| \), i.e., the size of \(\delta _1(v_1)\) and \(\delta _2(v_2)\) is the same. Abelard can decide either to pick an object from \(\delta _1(v_1)\) or \(\delta _2(v_2)\), or to relocate the game. By the fact that \(\mathcal {E} _1,w_\mathbb {N} ,v_1,\overline{a} \simeq \mathcal {E} _2,w_\mathbb {N} ,v_2,\overline{b} \) and that \(\left| \delta _1(v_1) \right| = \left| \delta _2(v_2) \right| \), it follows that . So if Abelard decides to pick a new object from one of \(v_1\) and \(v_2\), Eloïse can always pick a matching object from the local domain of the other world to continue the game.

Suppose instead that Abelard decides to relocate the game. Eloïse should then choose a world in the other model so that the following holds of the new locations \(u_1\) and \(u_2\): (i) \(\left| \delta _1(u_1) \right| = \left| \delta _2(u_2) \right| \), and (ii) \(a_i \in \delta _1(u_1)\) iff \(b_i \in \delta _2(u_2)\). Since there are only finitely many \(\overline{a} \) at any given stage of the game, this will always be possible. And as long as (ii) holds, we will have that \(\mathcal {M} ,w_\mathbb {N} ,u_1,\overline{a} \simeq \mathcal {M} ,w_\mathbb {N} ,u_2,\overline{b} \). \(\square \)

Notice that this proof will fail under a variety of extensions of . This is easy to see if the extension can express (3) directly, as does and :

(6)
(7)

But it is also instructive to see where the proof for Proposition 15 fails for these extensions. If we were to add \(\Pi {}\), then Abelard would be allowed to pick any object from the global domain of either model. In that case, partial isomorphism is no longer enough to guarantee that Eloïse can continue in this game. For Eloïse to continue the game at \(\left\langle w_\mathbb {N} ,v_1,\overline{a};w_\mathbb {N} ,v_2,\overline{b} \right\rangle \), we also need to ensure that \(\left| \mathbb {N} - \delta _1(v_1) \right| = \left| \mathbb {N} ^\infty - \delta _2(v_2) \right| \). Otherwise, if say \(\left| \mathbb {N} - \delta _1(v_1) \right| < \left| \mathbb {N} ^\infty - \delta _2(v_2) \right| \), Abelard could keep picking objects from \(\mathbb {N} - \delta _1(v_1)\) until he ran out (since \(\left| \mathbb {N} - \delta _1(v_1) \right| < \left| \mathbb {N} ^\infty - \delta _2(v_2) \right| \) would imply that \(\mathbb {N} - \delta _1(v_1)\) is finite). Then he could pick whatever unmatched objects remain in \(\mathbb {N} ^\infty - \delta _2(v_2)\). In response, Eloïse would be forced to match Abelard’s object either with an object not in \(\mathbb {N} - \delta _1(v_1)\), thus violating (Ex) from Definition 9, or with an object in \(\mathbb {N} - \delta _1(v_1)\) that was already chosen, thus matching a previously unmatched object to a previously matched object and violating (Eq). So we need to be able to ensure that \(\left| \mathbb {N} - \delta _1(v_1) \right| = \left| \mathbb {N} ^\infty - \delta _2(v_2) \right| \) at every stage of the game, which we can do except at one very crucial point, viz., the beginning: \(\left| \mathbb {N} - \delta _1(w_\mathbb {N} ) \right| \ne \left| \mathbb {N} ^\infty - \delta _2(w_\mathbb {N} ) \right| \). In other words, Abelard can force a win just by picking \(\infty \) from \(D_2\), leaving Eloïse unable to pick a matching object while meeting (Ex). Without \(\Pi {}\), this winning strategy for Abelard is blocked.

If we add \(@\), then Abelard can force the location of the game in both models to move back to \(w_\mathbb {N} \). In the proof of Proposition 15 above, it is crucial that Eloïse can choose to relocate to a world similar enough to the actual world. For instance, suppose on round 1, Abelard chooses to move to in \(\mathcal {E} _2\). Then Eloïse must choose to move to some in \(\mathcal {E} _1\)—let us say . Abelard can then choose \(\infty \) from , forcing Eloïse to choose 5. At this point, if Abelard decided to relocate the game back to \(w_\mathbb {N} \) in both models, then he could choose 5 from \(\mathcal {E} _1\), and Eloïse would lose by violating (Eq). But without \(@\), while Abelard can choose to relocate the game to \(w_\mathbb {N} \) in \(\mathcal {E} _2\), say, Eloïse does not have to relocate the game to \(w_\mathbb {N} \) in \(\mathcal {E} _1\); she can, for instance, pick to relocate to in \(\mathcal {E} _1\).

Let us now turn to showing a more difficult inexpressibility result, viz., that (R) is not expressible in .Footnote 22 Recall (R) says that everyone who is actually rich could have been poor, which is formalized in as (4):

(4)

Let \(\mathbb {N} ^- \,{:}{=}\, \mathbb {Z} - \mathbb {N} \). We let \(\mathcal {R} _1 = \left\langle W_1,R_1,F_1,D_1,\delta _1,I_1\right\rangle \) and \(\mathcal {R} _2 = \left\langle W_2,R_2,F_2,D_2,\delta _2,I_2\right\rangle \), where \(D_1 = D_2 = \mathbb {Z} \) and the accessibility relations are universal for both models. There is a world \(w \in W_1\) that will act as our actual world, where every positive integer is rich (top half of circle), and every negative integer is poor (bottom half of circle). For each nonempty finite subset \(S \subseteq \mathbb {N} \), there is a world \(v_S \in W_1\) where the members of S do not exist, and otherwise the rich and the poor are flipped with respect to w; so at \(v_S\), the negative integers are rich, and the positive integers not in S are poor, and the positive integers in S do not exist. The extension of all other predicates is empty. The only difference between \(\mathcal {R} _1\) and \(\mathcal {R} _2\) is that \(\mathcal {R} _2\) includes an additional world \(v_\emptyset \in W_2\), where no integer fails to exist, and where the rich and poor are flipped with respect to w. See Fig. 3 for a picture.

Fig. 3
figure 3

-bisimilar models disagreeing on (R). The top half of each circle satisfies \({\textsf {Rich}} \), while the bottom half satisfies \({\textsf {Poor}} \); at each \(v_S\), the members of S do not exist

\(\mathcal {R} _2 \vDash \) (4)[w], since \(v_\emptyset \) is the world where everyone rich in w is poor. But \(\mathcal {R} _1 \nvDash \) (4)[w], since in every world where something that is rich in w is poor, something that is rich in w does not exist (and hence is not poor). And once again, \(\mathcal {R} _1,w,w \leftrightarrows _{\approx ,@} \mathcal {R} _2,w,w\). The details are left to Appendix 2, but a proof is sketched in terms of games below.

Proposition 16

(Inexpressibility of (R)) \(\mathcal {R} _1,w,w \leftrightarrows _{\approx ,@} \mathcal {R} _2,w,w\). But \(\mathcal {R} _2 \vDash \) (4)[w] even though \(\mathcal {R} _1 \nvDash \) (4)[w]. Hence, (4) is not expressible in .

Proof

(Sketch) Our game starts at \(\mathcal {R} _1,w,w\) and \(\mathcal {R} _2,w,w\). As before, we will describe a winning strategy for Eloïse such that, at every stage \(\left\langle w,v,\overline{a};w,v',\overline{b} \right\rangle \) of the -bisimulation game, \(\mathcal {R} _1,w,v,\overline{a} \simeq \mathcal {R} _2,w,v',\overline{b} \).

Again, vacuously, \(\mathcal {R} _1,w,w \simeq \mathcal {R} _2,w,w\). So suppose \(\left\langle w,u_1,\overline{a};w,u_2,\overline{b} \right\rangle \) is the current stage of the game, where \(\mathcal {R} _1,w,u_1,\overline{a} \simeq \mathcal {R} _2,w,u_2,\overline{b} \) and where \(a_i\) is positive (i.e., in \(\mathbb {N} \)) iff \(b_i\) is positive. We will show that this continues to be true regardless of Abelard’s move. If Abelard decides to pick a new object from \(\delta _1(u_1)\) or \(\delta _2(u_2)\), it will either be positive or negative. Since there are infinitely many of both, Eloïse will have no trouble picking a new one; and since there was a partial isomorphism between \(u_1\) and \(u_2\), Eloïse only needs the new objects to agree on their sign.

Suppose instead that Abelard decides to relocate the game. If he decides to move the game in both models back to w, since \(a_i\) is positive iff \(b_i\) is positive, we will have \(\mathcal {R} _1,w,w,\overline{a} \simeq \mathcal {R} _2,w,w,\overline{b} \). (Likewise, if Abelard chooses to relocate to w in one model but lets Eloïse choose the other new location, she should still choose w for the reason above.) If he decides to relocate to some \(v_S\) where \(S \ne \emptyset \) in, say, \(\mathcal {R} _1\), let T be any set with the same cardinality as S such that \(a_i \in S\) iff \(b_i \in T\). Since there are only finitely many \(a_i\)s, there will always be such a T. Eloïse can choose to relocate to \(v_T\), and again, since \(a_i\) is positive iff \(b_i\) is positive, \(\mathcal {R} _1,w,v_S,\overline{a} \simeq \mathcal {R} _2,w,v_T,\overline{b} \). Likewise if Abelard chooses to relocate to some \(v_S\) in \(\mathcal {R} _2\).

The tricky part is determining what to do when Abelard decides to relocate to \(v_\emptyset \) in \(\mathcal {R} _2\). But since there are only finitely many \(a_i\)s, Eloïse can just choose a \(v_S\) where . Then it will still be the case that \(\mathcal {R} _1,w,v_S,\overline{a} \simeq \mathcal {R} _2,w,v_\emptyset ,\overline{b} \). So no matter where Abelard decides to relocate, Eloïse can continue the game. \(\square \)

Notice that the proof fails if we try to add either \(\Pi {}\), \({\mathop {\downarrow }}\), or \(\mathcal {F}\). It is easy to see this for \(\Pi {}\), since we can express (4) as (2):

$$\begin{aligned} \Diamond \Pi {x}(@{\textsf {Rich}} (x) \rightarrow {\textsf {Poor}} (x)). \end{aligned}$$
(2)

To see where the proof above fails for , consider what happens when Abelard decides to move from \(u_2\) to \(v_\emptyset \) in \(\mathcal {R} _2\). Eloïse will try to match that move in \(\mathcal {R} _1\) by moving from \(u_1\) to some \(v_S\) where . But now, because of \(\Pi {}\), Abelard is free to pick any object in S (and hence not in \(\delta _1(v_S)\)), forcing Eloïse to match it with an object in \(\delta _2(v_\emptyset )\) (since \(\delta _2(v_\emptyset ) = D_2 = \mathbb {Z} \)), and hence violating (Atomic).

As for \({\mathop {\downarrow }}\) and \(\mathcal {F}\), the models above disagree on both of these formulas:

(8)
(9)

In particular, \(\mathcal {R} _2,w,w \vDash \) (8) and \(\mathcal {R} _2,w,w \vDash \) (9) (take \(v_\emptyset \) to be the world we shift to by \(\Diamond {\mathop {\downarrow }}\) or \(\left\langle \mathcal {F}\right\rangle @\)), but \(\mathcal {R} _1,w,w \nvDash \) (8) and \(\mathcal {R} _1,w,w \nvDash \) (9). While this does not show that (4) can be expressed as an -formula, it does show \(\mathcal {R} _1\) and \(\mathcal {R} _2\) cannot be used to settle the matter. Using modified models, however, it is possible to settle the matter in the negative: (4) is not even UD-expressible in (see Appendix 3).Footnote 23

As a final example, we will show that even cannot express (NR), which is formalized as (5):

(5)

Consider two models \(\mathcal {N} _1\! =\! \left\langle W_1,R_1,F_1,D_1,\delta _1,I_1\right\rangle \) and \(\mathcal {N} _2 \!=\! \left\langle W_2,R_2,F_2,D_2,\delta _2,I_2\right\rangle \). Again, \(D_1 = D_2 = \mathbb {Z} \) and the accessibility relations are universal. However, now all of \(\mathbb {Z} \) exists at every world in either model. Our actual world is z, an egalitarian world where no integer is either rich or poor. For every finite set \(S \subseteq \mathbb {N} \) and finite set \(T \subseteq \mathbb {N} ^-\), there is a world \(w_S^T\) where all the integers in \((\mathbb {N} - S) \cup T\) are rich, while all the integers in \((\mathbb {N} ^- - T) \cup S\) are poor (so our old w is now just \(w_\emptyset ^\emptyset \)). And for every nonempty finite set \(S \subseteq \mathbb {N} \), and every finite set \(T \subseteq \mathbb {N} ^-\), there is a world \(v_S^T\) like before, where the rich and poor are flipped with respect to \(w_S^T\). The only difference between \(\mathcal {N} _1\) and \(\mathcal {N} _2\) is the presence of worlds of the form \(v_\emptyset ^T\) in \(\mathcal {N} _2\). See Fig. 4 for a picture.Footnote 24

Fig. 4
figure 4

-bisimilar models disagreeing on (NR)

\(\mathcal {N} _1,z,z\) and \(\mathcal {N} _2,z,z\) both agree that (4) is true, since nothing in z is rich. But they disagree on whether (5) is true; without the presence of \(v_\emptyset ^T\), there is no world where everyone rich in \(w_\emptyset ^\emptyset \) (i.e., \(\mathbb {N} \)) is poor. Thus, \(\mathcal {N} _1 \nvDash \) (5)[z] but \(\mathcal {N} _2 \vDash \) (5)[z]. Furthermore:

Proposition 17

(Inexpressibility of (NR)) \(\mathcal {N} _1,z,z \leftrightarrows _{\approx ,@,\Pi {}} \mathcal {N} _2,z,z\). But \(\mathcal {N} _2 \vDash \) (5)[z] while \(\mathcal {N} _1 \nvDash \) (5)[z]. Hence, (5) is not expressible in .

Proof

(Sketch) Our game starts at \(\mathcal {N} _1,z,z\) and \(\mathcal {N} _2,z,z\). We will describe a winning strategy for Eloïse such that at every stage of the game \(\left\langle z,u_1,\overline{a};z,u_2,\overline{b} \right\rangle \), not only do we have \(\mathcal {N} _1,z,u_1,\overline{a} \simeq \mathcal {N} _2,u_2,\overline{b} \), but also we have \(u_1 = z\) iff \(u_2 = z\), and we have \(u_1 = w^{T_1}_{S_1}\) for some \(S_1\) and \(T_1\) iff \(u_2 = w^{T_2}_{S_2}\) for some \(S_2\) and \(T_2\). Clearly this holds for the initial stage \(\left\langle z,z;z,z\right\rangle \). So suppose \(\left\langle z,u_1,\overline{a};z,u_2,\overline{b} \right\rangle \) is such a stage.

Because for each world u and each , \(I_k({\textsf {Rich}},u)\) and \(I_k({\textsf {Poor}},u)\) are empty or infinite, the back-and-forth step is easy. So suppose Abelard decides to relocate the game in \(\mathcal {N} _1\). If he relocates to z, then Eloïse should also just relocate to z in \(\mathcal {N} _1\). Suppose now Abelard decides to relocate to some \(w^{T_1}_{S_1}\) in \(\mathcal {N} _1\). Define and . Then one can check that \(\mathcal {N} _1,z,w^{T_1}_{S_1}, \overline{a} \simeq \mathcal {N} _2,z,w^{T_2}_{S_2}, \overline{b} \). For instance, suppose \(a_i \in I_1({\textsf {Rich}},w^{T_1}_{S_1})\). Either \(b_i \in \mathbb {N} \) or \(b_i \in \mathbb {N} ^-\). If \(b_i \in \mathbb {N} \), then \(b_i \notin S_2\), so \(b_i \in I_2({\textsf {Rich}},w^{T_2}_{S_2})\). If \(b_i \in \mathbb {N} ^-\), then \(b_i \in T_2\), so \(b_i \in I_2({\textsf {Rich}},w^{T_2}_{S_2})\). Likewise, if \(a_i \in I_1({\textsf {Poor}},w^{T_1}_{S_1})\), then \(b_i \in I_2({\textsf {Poor}},w^{T_2}_{S_2})\). The same strategy works if Abelard chooses to relocate to some \(v^{T_1}_{S_1}\) in \(\mathcal {N} _1\). It also works if Abelard decides to relocate in \(\mathcal {N} _2\), except when he chooses \(v^{T_2}_{S_2}\) where the corresponding \(S_1\) as defined above would be empty. In that case, for no \(a_i \in \mathbb {N} \) is \(b_i \in I_2({\textsf {Rich}},v^{T_2}_{S_2})\). So Eloïse can choose \(S_1\) to be any nonempty subset of . \(\square \)

Once again, however, this inexpressibility proof does not extend to languages with \({\mathop {\downarrow }}\), since we can express (5) as:

$$\begin{aligned} \Box {\mathop {\downarrow }}\Diamond \Pi {x}\left( @{\textsf {Rich}} (x) \rightarrow {\textsf {Poor}} (x)\right) . \end{aligned}$$
(10)

Likewise, if we restrict to the class of models where \(R = F\), we can express (5) with:

$$\begin{aligned} \mathcal {F}@\Diamond \Pi {x}\left( @{\textsf {Rich}} (x) \rightarrow {\textsf {Poor}} (x)\right) . \end{aligned}$$
(11)

5 Generalizations

In the previous section, we saw a number of examples demonstrating how bisimulations can be used to prove inexpressibility results for a variety of two-dimensional logics. In this section, we turn to some more general results regarding the expressive power of two-dimensional modal languages and beyond.

Fig. 5
figure 5

The (\(\mathbf D \)-)expressive hierarchy for two-dimensional languages between and . Arrows represent strict increase in expressive power. If there is an upward path from \(\mathcal {L}_1\) to \(\mathcal {L}_2\) in the diagram on the left, then the inclusions between \(\mathcal {L}_1\), \(\mathcal {L}_2\), and their extensions with \({\textsf {E}} \), \(\approx \), or \(\Pi {}\) are represented in the right diagram

First, given that sentences like (E), (R), and (NR) are expressible in some languages but not others, it is natural to ask what exactly the relative expressive power of all these various languages are. For instance, combining the results in Sect. 4, we know that . But how do languages like and compare? Is one stronger than the other? What if we add a possibilist quantifier to one or the other?

Using bisimulation techniques like the ones in Sect. 4, we can mostly settle these questions for the two-dimensional languages discussed in this paper (though in what follows, I have excluded languages that include ). The inclusions relative to the class of all models can be diagrammed as in Fig. 5 (for a proof of the accuracy of the diagram, see Appendix 4). The diagram is still accurate relative to \(\mathbf D \). Relative to \(\mathbf U \), adding \({\mathop {\downarrow }}\) or \(\mathcal {F}\) without \(@\) present is redundant. Moreover, .Footnote 25 Relative to UD, we will have more inclusions. For instance, while and , we do have , using the translation . Similarly, . However, there are still limitations: and . For more details, see Appendix 4.

Second, all of these inexpressibility results carry over to temporal logic. In temporal logic, one also includes a backward-looking operator \({\Box }^{-1} \), in addition to \(\Box \), with the following semantic clause (where ):

Usually, \(\Box \) and \({\Box }^{-1} \) are written respectively as \({\mathop {\mathsf {G}}}\) and \({\mathop {\mathsf {H}}}\) (for “it is always going to be” and “it has always been”), \(@\) is written as \({\mathop {\mathsf {N}}}\) (for “now”), and \({\mathop {\downarrow }}\) is written as \({\textsf {T}} \) (for “then”). The notion of a bisimulation can easily be generalized to temporal logic by including Zig-Zag clauses for both R and \({R}^{-1} \) (which are often written respectively as < and >).

All of the sentences considered in this paper have natural temporal analogues. Here are a few variations on some of the sentences we have been considering (where \({\textsf {R}} \) is replaced by <):

  1. (FE)

    There will be things other than there are now.

    (12)
  2. (PR)

    It was the case that everyone now rich was poor.

    (13)
  3. (FPR)

    Henceforth, everyone who is rich will have once been poor.

    (14)

All of our models in Sect. 4 have universal accessibility relations. However, allowing < to be universal would be too permissive in the context of temporal logic (there would be no difference between future and past!). Often, < is required to be at least a strict partial order (i.e., irreflexive, asymmetric, and transitive), thereby excluding models where it is universal. Thus, none of the results in Sect. 4 immediately carry over to temporal logic.

Fortunately, we can still piggyback on these inexpressibility results with the following trick. Suppose where the accessibility relations of \(\mathcal {M} \) and \(\mathcal {N} \) are universal. Assume for simplicity that \(W^\mathcal {M} \) and \(W^\mathcal {N} \) are countable. Let \(f^\mathcal {M} :\mathbb {N} \rightarrow W^\mathcal {M} \) be a surjection such that \(f^\mathcal {M} (0) = w\) (and likewise for \(f^\mathcal {N} \)). Define a new model \(\mathcal {M} _{\mathbb {Z} \times \mathbb {N} }\) where \(W^{\mathcal {M} _{\mathbb {Z} \times \mathbb {N} }} {:}{=}\mathbb {Z} \times W^\mathcal {M} \), , \(D^{\mathcal {M} _{\mathbb {Z} \times \mathbb {N} }} {:}{=}D^\mathcal {M} \), \(\delta ^{\mathcal {M} _{\mathbb {Z} \times \mathbb {N} }}(\left\langle i,f(n)\right\rangle ) {:}{=}\delta ^\mathcal {M} (f(n))\), and \(I^{\mathcal {M} _{\mathbb {Z} \times \mathbb {N} }}(P,\left\langle i,f(n)\right\rangle ) {:}{=}I^\mathcal {M} (P,f(n))\). That is, each \(i \in \mathbb {Z} \) contains a copy of \(\mathcal {M} \), and the integer-world pairs are ordered lexicographically. Define \(\mathcal {N} _{\mathbb {Z} \times \mathbb {N} }\) similarly from \(\mathcal {N} \) using \(f^\mathcal {N} \).

From this, it follows that . For instance, suppose we are playing the back-and-forth game with \(\mathcal {M} _{\mathbb {Z} \times \mathbb {N} }\) and \(\mathcal {N} _{\mathbb {Z} \times \mathbb {N} }\), and we are currently located at some stage \(\left\langle \left\langle i_1,u_1\right\rangle ,\left\langle i_2,u_2\right\rangle ,\overline{a};\left\langle i_1',u_1'\right\rangle ,\left\langle i_2',u_2'\right\rangle ,\overline{b} \right\rangle \). Then whenever Abelard makes a move, Eloïse need only consult the back-and-forth with \(\mathcal {M} \) and \(\mathcal {N} \), and see how she would respond at stage \(\left\langle u_1,u_2,\overline{a};u_1',u_2',\overline{b} \right\rangle \). In particular, if Abelard chooses to relocate the game in \(\mathcal {M} _{\mathbb {Z} \times \mathbb {N} }\) to \(\left\langle i_3,u_3\right\rangle \) where \(\left\langle i_3,u_3\right\rangle > \left\langle i_2,u_2\right\rangle \), then Eloïse can pick whatever \(u_3'\) she would have chosen had Abelard chose \(u_3\) in the back-and-forth game with \(\mathcal {M} \) and \(\mathcal {N} \), and then she can relocate the game in \(\mathcal {N} _{\mathbb {Z} \times \mathbb {N} }\) to \(\left\langle i_3',u_3'\right\rangle \) where \(\left\langle i_3',u_3'\right\rangle > \left\langle i_2',u_2'\right\rangle \) (she will always be able to find one, since there are infinitely many copies of \(\mathcal {N} \) after \(i_2'\)). The same strategy applies if Abelard picks \(\left\langle i_3,u_3\right\rangle \) with \(\left\langle i_3,u_3\right\rangle < \left\langle i_2,u_2\right\rangle \). Thus, our results in Sect. 4 can be extended to temporal logic.Footnote 26

Finally, it seems as though two-dimensions is not enough to overcome the kind of expressive limitations discussed in this paper in full generality. Recall that while (NR) is not expressible as an -formula, it is expressible as an -formula. However, more complicated sentences can be constructed that reveal the expressive limitations of even . The most natural examples use temporal modalities or mix modalities. For instance, here is a temporal example from Cresswell (1990, p. 29):

  1. (H)

    Once, everyone now alive who was not then miserable would eventually be happy.

To formalize this, it seems that we need to store two reference times, not just one. In , we would formalize (H) as follows:

(15)

We can also get examples with metaphysical modality, although the more powerful the language is, the more contrived the examples have to be:

  1. (RC)

    There could have been a brave man such that everyone who was poor but kind in reality necessarily received money from that man.

The problem is that we need to be able to go back to both the actual world and the first world we shifted to while we are at the second world we shifted to; but we can only keep track of one reference world at a time. It has been noted in the literature that this point seems to generalize to higher-dimensional languages.Footnote 27 One gets the feeling that for any n, we can concoct a -formula that requires keeping track of \((n + 1)\)-many worlds in our points of evaluation. But no proof of this claim has been offered in the literature.Footnote 28 Using the power of bisimulations, we can actually verify this claim. I will conclude by explaining how to generate further inexpressibility results for higher-dimensional languages. We will only explicitly prove that a certain three-dimensional language is not expressible in any two-dimensional language. Hopefully, it will be clear how the method can be schematized to show that some \((n+1)\)-dimensional languages are not expressible in any n-dimensional language.

First, we define an n-dimensional model to be any tuple of the form where each \(R_i \subseteq W \times W\), and otherwise everything is as before. For instance, the models we have been working with in this paper have all been 2-dimensional models (where \(F = R_1\)). Second, for each \(k \ge 1\), we will introduce operators \(\Box _k\), \(@_k\), and \({\mathop {\downarrow }}_k\), which we might add to . In what follows, we will define and . Third, where is a sequence of worlds, and where \(1 \le k \le n\), let \(\sigma ^k_v\) be the result of replacing \(w_k\) in \(\sigma \) with v. Finally, satisfaction for will be relativized to \((n+1)\)-dimensional models, as well as a sequence of n-many worlds , a world v, and a variable assignment \(g \in {\textsf {VA}} (\mathcal {M} )\), with the semantic clauses for the new operators stated below for \(1 \le k \le n\):

Thus, \(\mathcal {F}\), \(@\), and \({\mathop {\downarrow }}\) are \(\Box _1\), \(@_1\), and \({\mathop {\downarrow }}_1\) respectively. Since and is essentially , it makes sense to call with this semantics an n-dimensional language. Generalizing the definition of a bisimulation to is straightforward.

Fig. 6
figure 6

Summary of \(\mathcal {N} _3\) and \(\mathcal {N} _4\). For each , we have that \(S_k \subseteq \mathbb {N} _k\) and . For \(\mathcal {N} _3\), there are no worlds of the form \(\gamma _{\emptyset ,S_2,S_3,S_4}\)

Using models similar to \(\mathcal {N} _1\) and \(\mathcal {N} _2\) from Fig. 4, we can show that is not included in . We have already shown with Proposition 17 that is not included in (or even in ), since distinguishes \(\mathcal {N} _1,z,z\) and \(\mathcal {N} _2,z,z\), even though \(\mathcal {N} _1,z,z \leftrightarrows _{\approx ,@_1,\Pi {}} \mathcal {N} _2,z,z\). We will show that the following -formula is not expressible in :Footnote 29

(16)

The proof that is not expressible in is a straightforward generalization of the proof below.

First, we describe our models \(\mathcal {N} _3\) and \(\mathcal {N} _4\). These models have been summarized in Fig. 6. All of the accessibility relations will be universal, and \(\mathcal {N} _3\) and \(\mathcal {N} _4\) will both be constant domain models (so the local domain of every world is the global domain). Let \(\mathbb {N} _1\), \(\mathbb {N} _2\), \(\mathbb {N} _3\), and \(\mathbb {N} _4\) be disjoint copies of \(\mathbb {N} \). In both models, there will be a unique world z where \(I_k(P_1,z) = I_k(P_2,z) = I_k(P_3,z) = \emptyset \) for . There will be three types of worlds: \(\alpha \)-worlds, \(\beta \)-worlds, and \(\gamma \)-worlds. Each type of world will be uniquely specified by four sets \(S_1\), \(S_2\), \(S_3\), and \(S_4\), where \(S_k \subseteq \mathbb {N} _k\) and for . Where , we will denote the worlds as \(\eta _{S_1,S_2,S_3,S_4}\). Each of \(S_1\), \(S_2\), \(S_3\), and \(S_4\) is generally allowed to be empty, but in \(\mathcal {N} _3\), only \(\gamma \)-worlds where \(S_1 \ne \emptyset \) are allowed. For each and each , \(I_k(P_i,\eta _{S_1,S_2,S_3,S_4}) = \emptyset \) with the following exceptions:

  • \(I_k(P_1,\alpha _{S_1,S_2,S_3,S_4}) = (\mathbb {N} _1 - S_1) \cup (\mathbb {N} _2 - S_2) \cup S_3 \cup S_4\)

  • \(I_k(P_2,\beta _{S_1,S_2,S_3,S_4}) = (\mathbb {N} _1 - S_1) \cup (\mathbb {N} _3 - S_3) \cup S_2 \cup S_4\)

  • \(I_k(P_3,\gamma _{S_1,S_2,S_3,S_4}) = (\mathbb {N} _1- S_1) \cup (\mathbb {N} _4 - S_4) \cup S_2 \cup S_3\).

We will start by explaining why \(\mathcal {N} _3,\left\langle z,z\right\rangle ,z \nVdash \) (16) while \(\mathcal {N} _4,\left\langle z,z\right\rangle ,z \Vdash \) (16). First, to explain why \(\mathcal {N} _3,\left\langle z,z\right\rangle ,z \nVdash \) (16), it suffices to note the following (where \(\alpha \) and \(\beta \) below have set \(S_1 = S_2 = S_3 = S_4 = \emptyset \), and w is any world):

This is simply because \(I(P_1,\alpha ) \cap I(P_2,\beta ) = \mathbb {N} \), but no \(\gamma \)-world has all of \(\mathbb {N} \) in its interpretation of \(P_3\). Second, to explain why \(\mathcal {N} _4,\left\langle z,z\right\rangle ,z \Vdash \) (16), consider the following formula:

(17)

Notice that \(\mathcal {N} _4,\left\langle w,v\right\rangle ,u \Vdash \) (17) holds vacuously unless w is an \(\alpha \)-world and v is a \(\beta \)-world. So suppose \(w = \alpha _{S_1,S_2,S_3,S_4}\) and \(v = \beta _{T_1,T_2,T_3,T_4}\). Then:

$$\begin{aligned} I_4(P_1,w) \cap I_4(P_2,v)&= (\mathbb {N} _1 - (S_1 \cup T_1)) \cup (T_2 - S_2) \cup (S_3 - T_3) \cup (S_4 \cap T_4). \end{aligned}$$

Now pick \(u = \gamma _{(S_1 \cup T_1),(T_2 - S_2),(S_3 - T_3),S_4'}\), where \(S_4'\) is disjoint from \(S_4 \cap T_4\). (We can always find such a \(\gamma \) since \(S_1 \cup T_1\) is allowed to be empty.) Then it is easy to see that . Thus, \(\mathcal {N} _3,\left\langle z,z\right\rangle ,z\) and \(\mathcal {N} _4,\left\langle z,z\right\rangle ,z\) disagree on (16).

Now, in what follows, let us say that a 2D-partial isomorphism between \(\mathcal {M} ,\left\langle w,z\right\rangle ,v,\overline{a} \) and \(\mathcal {N} ,\left\langle w',z'\right\rangle ,v',\overline{b} \) is a map \(\rho \) such that \(\rho \) is a partial isomorphism between \(\mathcal {M} ,w,v,\overline{a} \) and \(\mathcal {N} ,w',v',\overline{b} \), and also a partial isomorphism between \(\mathcal {M} ,w,w,\overline{a} \) and \(\mathcal {N} ,w',w',\overline{b} \). In other words, 2D-partial isomorphisms must also satisfy (Predicate) and (Existence) at w and \(w'\). We will write \(\simeq _\text {2D}\) in place of \(\simeq \) for 2D-partial isomorphisms. It is easy to verify that 2D-partial isomorphism allows for Eloïse to continue the -bisimulation game.

Let us write \(\mathcal {M} ,w,z,v,\overline{a} \leftrightarrows ^2 \mathcal {N} ,w',z',v',\overline{b} \) to mean that \(\mathcal {M} ,w,z,u,\overline{a} \) and \(\mathcal {N} ,w',z',u',\overline{b} \) are -bisimilar (I am dropping the angle brackets). Note \(\leftrightarrows ^2\) is just -bisimilarity, with an extra argument place for worlds in the middle; no clause in this notion of bisimilarity can do anything to or with the z and \(z'\) worlds. The conventions from before regarding adding additional operators, quantifiers, etc. all apply.

Theorem 18

(Higher-dimensional inexpressibility) \(\mathcal {N} _3,z,z,z \leftrightarrows _{\approx ,@_2,\Pi {}}^2 \mathcal {N} _4,z,z,z\).

Proof

Clearly, \(z,z,z \simeq _\text {2D} z,z,z\). Now, suppose \(w,z,v,\overline{a} \simeq _\text {2D} w',z,v',\overline{b} \), where:

  • \(w = z\) iff \(w' = z\) (likewise for v and \(v'\))

  • w is an \(\alpha \)/\(\beta \)/\(\gamma \)-world iff \(w'\) is an \(\alpha \)/\(\beta \)/\(\gamma \)-world (likewise for v and \(v'\))

  • \(w = v\) iff \(w' = v'\).

Using these assumptions, we will show that no matter what move Abelard makes, Eloïse can match his move so as to preserve these assumptions.

First, observe that no matter what worlds w and v are, for any , all of the following sets are either empty or infinite (where ):

Thus, if Abelard picks a new \(a \in \delta _3(u)\), then no matter what predicates it satisfies in w or v, Eloïse will have infinitely many new \(b \in \delta _4(u')\) to choose from which satisfy the same predicates in \(w'\) and \(v'\). Likewise if Abelard picks a new \(b \in \delta _4(u')\).

Next, suppose Abelard decides to relocate the game in \(\mathcal {N} _3\). Since this is the -bisimulation game, he can either choose to replace w with another world, or v with another world (there are no clauses for reseting z). Clearly if he replaces one of these with z, Eloïse should match his move by replacing the corresponding world with z. If he replaces w with v, Eloïse should replace \(w'\) with \(v'\). Likewise if he replaces v with w. Suppose WLOG that he decides to replace v with a new \(\alpha \)-world \(u = \alpha _{S_1,S_2,S_3,S_4}\). Define:

Define \(u' = \alpha _{T_1,T_2,T_3,T_4}\). It is easy to check that \(a_i \in I_3(P_1,u)\) iff \(b_i \in I_4(P_1,u')\). (Suppose \(a_i \in I_3(P_1,u)\), and reason by cases depending on which \(\mathbb {N} _k\) contains \(b_i\). Likewise with \(a_i \notin I_3(P_1,u)\).) The other cases are symmetric (for both Zig and Zag), except if Abelard decides to replace \(v'\) with a \(\gamma \)-world where the corresponding \(S_1\) we define would be empty. In that case, define the other sets as before, and pick a new and set . \(\square \)

6 Conclusion

As we have seen, there are a number of English modal claims that seem to resist regimentation in first-order modal logic, even if we add two-dimensional operators. Proofs of these claims in the literature are often quite complicated. But as we have shown, they can be simplified by first regimenting these English sentences as formulas in an extensional two-sorted language and then constructing bisimilar models that disagree on these extensional formulas. We illustrated this technique by showing that (E) is not expressible in , that (R) is not expressible in , and that (NR) is not expressible in . We then classified the relative expressive power of the extensions of with operators like \(@\), \({\mathop {\downarrow }}\), and \(\mathcal {F}\), and finally showed how these inexpressibility results generalize to temporal languages and to higher-dimensional languages.

There are still a number of questions about the expressivity of extensions of first-order modal logic that have yet to be resolved. For one thing, we have yet to complete the classification of the expressive power of these languages relative to \(\mathbf U \) and UD, and one might also wonder whether these results hold in other kinds of models, such as the class of models with finite global domains.Footnote 30 More broadly, there is still a question about whether we can formally characterize sentences like (E), (R), and (NR).Footnote 31 Finally, there is still much to be learned about even more powerful extensions of first-order modal logic. For instance, the results in Sect. 5 suggest that the key to overcoming all of these expressive limitations is to move to a hybrid language, or some weaker Vlachian languages.Footnote 32 Bisimulation techniques can also be used to characterize the expressive power of first-order Vlachian logics and hybrid logics.Footnote 33 But there is still much to uncover about the full expressive landscape for these languages and their extensions.