1 Introduction

As a rough approximation, regularity theories of causation hold that, given the circumstances, causes are nomically sufficient for their effects. As a matter of law, if the cause is present in these circumstances, then the effect will be present too. As a rough approximation, counterfactual theories of causation hold that, given the circumstances, causes are nomically necessary for their effects. As a matter of law, were the cause to have been absent in these circumstances, so too would the effect have been absent. I, like many, have been persuaded that counterfactual theories are roughly correct about the relation of singular, token, or actual causation. Here, I will argue that nomic sufficiency accounts are roughly correct about what I will call structural determination relations. On the theory advanced here, these structural determination relations provide truth conditions for causal counterfactual conditionals. For counterfactual theorists, this means that regularity accounts still have an important role to play in the metaphysics of causation.

Singular causal relations—the relations expressed by sentences of the form “c’s F-ing caused e to G”, where c’s F-ing and e’s G-ing are particular events or facts—are familiar philosophical fare. Structural determination, less so. As I will explain in further depth below, structural determination relations link qualities or quantities of particular parts of the world. We can represent these qualities or quantities with variables. When we do so, structural determination relations are representable as structural equations, which establish functional relationships between those variables.

Structural equations modeling has been used to provide novel semantics for causal counterfactual conditionals (Hiddleston 2005; Shulz 2011, and Briggs 2012), to investigate traditional metaphysical questions about singular causation (Hitchcock 2001; Woodward 2003, ch. 2, Menzies 2004; Halpern and Pearl 2001, 2005a; Menzies 2007; Handfield et al. 2008; Halpern 2008; Hitchcock and Knobe 2009; Glymour et al. 2010; Halpern and Hitchcock 2010, forthcoming, Paul and Hall 2013; Livengood 2013; Baumgartner 2013, and Weslake forthcoming), to explicate the nature of causal enquiry and scientific explanation (Woodward 1999; Halpern and Pearl 2005b; Woodward and Hitchcock 2003a, b, and Woodward 2003, ch. 7), and to undergird novel statistical techniques for drawing inferences about the causal structure of the world on the basis of sample data (Pearl 2000, 2009 and Spirtes et al. 2000). However, relatively little has been done to get clear about what exactly someone commits themselves to when they endorse one of these models—what exactly, that is, a structural equations model says about the world.Footnote 1

To the extent that it has been addressed, the consensus view appears to be that structural equations represent patterns of causal counterfactual dependence amongst variable values (in particular, see Hitchcock 2001; Woodward and Hitchcock 2003a; Hall 2007; Hitchcock 2007; Menzies 2008; Halpern and Hitchcock 2010, and Glynn 2013).Footnote 2 In Sect. 3 below, I will explain why causal counterfactuals are not well-suited to provide a semantics for structural determination relations. My contention there will be that the counterfactual account is not capable of securing the independent manipulability of the structural determination relations represented in a structural equations model—a property of structural equations models known as modularity. In contrast, I will suggest that one variable is structurally determined by others just in case, within a certain region of modal space, the values of the latter variables are sufficient for the value of the former (Sect. 4). This account will allow us to understand the technical notion of an intervention; it will guarantee the modularity of a correct structural equations model; and it will allow us to provide a semantics for causal counterfactual conditionals which neatly solves a problem case for the standard ‘closest possible world’ semantics of Stalnaker (1968) and Lewis (1973).

2 Structural equations models

A structural equations model \({\mathcal {M}}\) is a triple \(( {\mathcal {U}}, {\mathcal {V}}, {\mathcal {E}})\) of a vector of exogenous variables \({\mathcal {U}}= (U_1 \dots U_M )\), a vector of endogenous variables \({\mathcal {V}}= ( V_1 \dots V_N )\) and a vector of structural equations \({\mathcal {E}}= ( \phi _{V_1} \dots \phi _{V_N} )\), one for each endogenous variable.Footnote 3 Formally, a variable is a partial function from a set of possibilities (or ‘worlds’) \({\mathcal {W}}\) to the real line \({\mathbb {R}}\). What makes the function partial is just that it needn’t map each and every possibility \(w \in {\mathcal {W}}\) to some real number. So, for instance, I might be interested in the variable S = number appearing on the digital scale at \(t\). This variable assigns a value, \(s\), to every world at which the scale displays a number at \(t\). However, it will not assign any value to a world at which the scale does not display a number at \(t\), or does not exist at \(t\). Here’s another (equivalent) way to understand a variable: it is an assignment of values to a set of pairwise inconsistent propositions \(\{ \mathrm {P}_i \} \subset \wp ({\mathcal {W}})\). Which value the variable takes on depends upon which of these propositions is true. For instance, the variable \(F\) could assign the value \(f\) to the proposition you exert a force of f Newtons on the surface of the scale at t, for every \(f\) in some specified range. In general, variables stand to their values as determinables stand to their determinants; just as being red is one way for an object to be colored, having the property represented by \(V=v\) is one way for a part of the world to have the property represented by \(V\). (A word on notation: I’ll write ‘\(V_w = v\)’ to mean that the value of \(w\), under the function \(V\), is \(v\) and I will often use ‘\(V = v\)’ to denote the set of worlds \(w\) such that \(V_w = v\)).

The structural equations in \({\mathcal {E}}\) establish functional relationships amongst the variables in \({\mathcal {U}}\cup {\mathcal {V}}\). For instance, suppose that the digital scale is accurate at \(t\), zeroed out in the appropriate way (so that it reads ‘\(0\)’ when subjected to the earth’s gravitational force and the ambient air pressure), and nothing else (besides you) is exerting any force upon the surface of the scale. Then, the value of \(S\) will be determined by the value of \(F\). Since an object weighing 0.2248 pounds exerts 1 Newton on the surface of the Earth, if \(F = f\), then \(S= \lceil 0.2248 \cdot f \rceil\).Footnote 4 We can thus write down the structural equation

$$\begin{aligned} S \,{:}{=}\,\lceil 0.2248 \cdot F \rceil \end{aligned}$$

What makes this equation structural is that it is asymmetric; it matters which variable is to the left of the ‘\(\,{:}{=}\,\)’. In addition to claiming that the value of \(S\) is a function of the value of \(F\), the structural equation makes the further claim that the value of \(S\) is determined by the value of \(F\) in a way that the value of \(F\) isn’t determined by the value of \(S\). Here’s a way of getting at this asymmetry: if there were a certain kind of intervention on the value of \(F\)—if, for instance, you were to put one foot on the floor—then the value of \(S\) would have been different—the scale would have displayed a different number. If, however, there were an intervention on the value of \(S\)—if, for instance, the scale was re-zeroed to read ‘\(0\)’—then the value of \(F\) would not be different—you would not suddenly exert \(0\) Newtons on the surface of the scale. (To emphasize this asymmetry, I use ‘\(\,{:}{=}\,\)’ to distinguish that asymmetrical relation from the symmetrical ‘\(=\)’).Footnote 5

These variables and this structural equation together constitute a structural equations model, or a causal model (I’ll use these terms interchangeably throughout). A causal graph provides an intuitive representation of much of the information contained in a causal model. If \(U\) appears on the right-hand-side of \(V\)’s structural equation \(\phi _V\), then there will be an arrow with its tail at \(U\) and its head at \(V\) in the causal graph. For instance, the model of the scale’s display and the force you exert on the scale’s surface generates the causal graph shown in Fig. 1. This causal graph tells us that the value of \(F\) determines the value of \(S\), without telling us exactly how. It tells us that the numbers on the scale’s display are determined by the force you exert upon it, but it doesn’t, for instance, tell us what units the scale’s display is set to. For that information, we must look to \(S\)’s structural equation in \({\mathcal {E}}\).

Fig. 1
figure 1

A causal graph

A causal model can involve many more variables and structural equations than this. Also, a single structural equation can relate more than two variables. Adapting an example from Pearl (2000, ch. 7), suppose that there are two riflemen, one standing on the left, the other standing on the right, who have their rifles aimed at a deserter. If the captain gives the order, then both riflemen will fire, and the deserter will die. We can model the causal structure of this case with \({\mathcal {M}}_{2} = ( ( C ), ( L, R, D ), {\mathcal {E}}_{2} )\), where \(C\) is a binary variable which takes the value \(1\) if the captain gives the order to fire and takes the value \(0\) otherwise, \(L\) is a binary variable which takes the value \(1\) iff the left rifleman fires, \(R\) is a binary variable which takes the value \(1\) iff the right rifleman fires, and \(D\) is a binary variable which takes the value \(1\) iff the deserter dies. The structural equations in \({\mathcal {E}}_{2}\) are shown in Fig. 2. (There, \(x \vee y\) is the truth function \(\max \{ x, y \}\).) These equations tell us that the left rifleman will fire iff the captain gives the order, and likewise for the right rifleman. And the deserter will die iff at least one of the riflemen fire. \({\mathcal {M}}_{2}\) tells us that the value of \(C\) determines the values of \(L\) and \(R\) and that the values of \(L\) and \(R\) jointly determine the value of \(D\).

We may use the language of genealogy to talk about the structural relationships between variables. Thus, all of the variables in a causal model which appear on the right-hand-side of \(V\)’s structural equation, \(\phi _V\), are called \(V\)’s structural parents. I’ll use ‘\({\mathbf{PA}}(V)\)’ to refer to a vector of \(V\)’s structural parents.Footnote 6 (If \(U\) is exogenous, then \({\mathbf {PA}}(U)\) is the empty vector.) In the model shown in Fig. 2, e.g., \({\mathbf {PA}}(D) = ( L, R )\). In a similar fashion, we can define \(V\)’s structural descendants—with the slight wrinkle that we stipulate that every variable \(V\) is one of its own descendants. I’ll use ‘\({\mathbf {DE}}(V)\)’ to refer to a vector of \(V\)’s structural descendants. In the model shown in Fig. 2, \({\mathbf {DE}}(L) = ( L, D )\).

Fig. 2
figure 2

The system of structural equations \({\mathcal {E}}_{2}\)

A few paragraphs back, I invoked the notion of an intervention. Formally, an intervention is a way of setting the values of some of the variables in \({\mathcal {U}}\cup {\mathcal {V}}\) without directly affecting any of the other variables in \({\mathcal {U}}\cup {\mathcal {V}}\), or their determination structure. To illustrate, suppose that in the model shown in Fig. 2, the value of \(L\) is set to \(1\) via an intervention. Suppose, that is, that we perform an intervention to make the left rifleman fire. The way this is modeled is by replacing \(L\)’s structural equation, \(L \,{:}{=}\,C\), with \(L = 1\) (indicating that \(L\) has been set to \(1\) via an intervention) and leaving all other structural equations unchanged. We thus get the mutilated model \({\mathcal {M}}_{2, \,L= 1} = (( C ), ( L, R, D ), {\mathcal {E}}_{2, \,L= 1} )\), shown in Fig. 3. In \({\mathcal {M}}_{2, \,L= 1}\), whether the left rifleman fires is no longer determined by whether the captain gives the order. However, whether the deserter dies is still determined by whether the left rifleman fires. In general, the graphical result of an intervention on a variable \(V\) is to remove all of the arrows leading into \(V\) (if such there be), to destroy all of the structural determination relations between \(V\) and \({\mathbf {PA}}(V)\), while leaving all other structural determination relations intact.

Fig. 3
figure 3

The system of equations \({\mathcal {E}}_{2, \,L=1}\) models an intervention on the variable \(L\)

This property of a structural equations model—that there are in-principle hypothetical interventions upon the variables which leave all the other structural determination relations intact—is known as modularity. Without modularity, structural equations models do not tell us anything about the results of hypothetical interventions, since without the assumption that the structural equations other than \(\phi _V\) remain in place post-intervention, we cannot calculate the down-stream effects of setting the value of \(V\).

Notice that not every way of setting the value of \(V\) will have this result. Some ways of setting the value of \(V\) will affect other variables in the graph as well. For instance, one way of setting \(S\) to \(0\), one way of making the scale read ‘\(0\)’, is to simply lift you off of the scale. But this wouldn’t count as an intervention on the value of \(S\), since it wouldn’t alter the manner in which the value of \(F\) determines the value of \(S\). It wouldn’t be correct to model this way of setting \(S\) to \(0\) by replacing \(S \,{:}{=}\,\phi _S(F)\) with \(S = 0\), since the determination relation represented by \(S \,{:}{=}\,\phi _S(F)\) would still be in force. It would be this very determination relation that we would exploit in order to affect the value of \(S\). Additionally, we could set the value of \(F\) in such a way that we affect the manner in which the value of \(F\) determines the value of \(S\)i.e., our meddling could have the result of changing the structural equation \(\phi _S\). For instance, we might decide to keep you from stepping onto the scale by placing a dead five-pound rat on the scale. In that case, our method for setting the value of \(F\) would alter \(S\)’s structural equation, replacing it with

$$\begin{aligned} S \,{:}{=}\,\lceil 0.2248 \cdot F + 5 \rceil \end{aligned}$$

So only certain methods of setting the value of a variable in a causal model will count as interventions on the value of that variable, in our technical sense.Footnote 7

Once we have this method for modeling interventions, a method for evaluating causal counterfactual conditionals comes along for free. On this account, the counterfactual \(\text {A} \;\Box \!\!\rightarrow \text {C}\) is true at a world \(w\) according to the model \({\mathcal {M}}\) just in case \({\mathcal {M}}_{\text {A}}, {\mathcal {U}}_w \,\models \, \text {C}\). That is: the counterfactual \(\text {A} \;\square \!\!\rightarrow \text {C}\) is true at a world \(w\), according to the model \({\mathcal {M}}\), given the exogenous variable assignment \({\mathcal {U}}= {\mathcal {U}}_w\), iff \(\text {C}\) is true in the model that we get by minimally mutilating \({\mathcal {M}}\) so as to make \(\text {A}\) true. To illustrate: suppose that, in the causal model shown in Fig. 2, the actual value of \(C\) is \(0\).Footnote 8 Suppose, that is, the captain doesn’t actually give the order to fire. Then, neither the left nor the right rifleman fires, and the deserter does not die. And suppose that we want to evaluate the causal counterfactual ‘If the left rifleman were to have fired, then the deserter would have died’—or ‘\(L = 1 \;\square \!\!\rightarrow D = 1\)’. To evaluate this causal counterfactual, we simply perform an intervention on the value of \(L\) so as to make the antecedent true; we mutilate the model, so that the value of \(L\) is no longer determined by the value of \(C\), we set \(L\) to \(1\), and then we calculate the values of \(R\) and \(D\) in the mutilated model in accordance with their structural equations and the values for the exogenous variables. If the consequent comes out true in the mutilated model \({\mathcal {M}}_{2, \,L=1}\), then the counterfactual was true in the original model \({\mathcal {M}}_{2}\). According to \({\mathcal {M}}_{2}\), then, ‘\(L = 1 \;\square \!\!\rightarrow D = 1\)’ is true. If the left rifleman were to fire, then the deserter would have died. Note that, without modularity, we would not be able to evaluate these counterfactual conditionals, since, without modularity, there is no guarantee that the downstream structural determination relations would remain intact post-intervention.

A theory of structural determination should explain why structural equations models have the properties they do. In particular, it should explain why they allow us to correctly evaluate causal counterfactual conditionals in this way, and it should explain why a correct system of structural equations is modular. It would be a benefit of an account if it could explain why only certain ways of setting the values of variables leave the downstream structural determination relations unaffected, as well as providing a principled way of distinguishing the ways of setting the values of variables which do from those which do not constitute interventions, in our technical sense. In Sect. 4, I will provide a theory which meets each of these explanatory demands. I will not attempt to account for probabilistic structural determination relations (of the sort that I believe are implicated in probabilistic causation). Nor will I be concerned with backwards structural determination relations, in which the future state of the world structurally determines the past state of the world. That’s not because I think that there aren’t, or couldn’t be, probabilistic or backwards structural determination relations. Considering these issues here would simply muddy already murky waters. Another task for another day.

3 The causal counterfactual understanding

Let’s say that a structural equation \(V \,{:}{=}\,\phi _V({\mathbf {PA}}(V))\) is descriptively adequate at world \(w\) just in case \(V_w = \phi _V({\mathbf {PA}}(V)_w)\). Just as mere descriptive adequacy is not sufficient for a universal generalization to be a law of nature, mere descriptive adequacy is not enough for a structural equation to be correct. There must additionally be some kind of genuine determination of \(V\) by \({\mathbf {PA}}(V)\). So an account of structural equations models must say something about what it takes, beyond mere descriptive adequacy, for a structural equation to be correct.

One of the more popular ways of understanding structural equations models appeals to causal counterfactual conditionals. Hitchcock articulates this view in his (2001, pp. 283–284):

A system of structural equations is an elegant means for representing a whole family of counterfactuals...The correctness of a set of structural equations, and of the corresponding graph, depends upon the truth of these counterfactuals.

On this account, what it is for an isolated structural equation \(V \,{:}{=}\,\phi _V({\mathbf {PA}}(V) )\) to be correct is just for it to be the case that, for any subvector \({\mathbf {X}} \subseteq {\mathbf {PA}}(V)\), were \(\mathbf {X}\) to take on the values \({\mathbf {x}}, V\) would take on the value \(\phi _V({\mathbf {PA}}(V)_{\mathbf {X} = \mathbf {x}} )\),

\((V)\) :

\(\begin{aligned} \forall {\mathbf {X}} \subseteq {\mathbf {PA}}(V) \quad \forall {\mathbf {x}} \quad ( {\mathbf {X}} = {\mathbf {x}} \; \square \!\!\rightarrow \; V = \phi _V({\mathbf {PA}}(V)_{\mathbf {X} = \mathbf{x}})) \end{aligned}\)

where \({\mathbf {PA}}(V)_{\mathbf {X} = \mathbf {x}}\) is the assignment given to \(V\)’s structural parents by \(\mathbf {X}=\mathbf {x}\) (if \(\mathbf {X}=\mathbf {x}\) doesn’t assign any value to one of \(V\)’s structural parents, then \({\mathbf {PA}}(V)_{\mathbf {X} = \mathbf {x}}\) gives that parent its actual value). \((V)\) tells us that \(V\)’s value counterfactually depends upon those of its parents, in the way specified by \(\phi _V\).

More generally, we can say that what it is for a structural equation \(\phi _V\), in a causal model \({\mathcal {M}}= ( {\mathcal {U}}, {\mathcal {V}}, {\mathcal {E}})\), to be correct at a world \(w\), is just for it to be the case that, for every subvector \({\mathbf {X}} \subseteq {\mathcal {U}}\cup ({\mathcal {V}}- V)\), and any assignment of values \(\mathbf {x}\) to \(\mathbf {X}\), were \(\mathbf {X}\) to take on those values, \(V\) would take on the value \(\phi _V({\mathbf {PA}}(V)_{\mathbf {X} = \mathbf {x}})\)

\(({\mathcal {V}}1)\) :

\(\begin{aligned} \forall V \in {\mathcal {V}}\quad \forall {\mathbf {X}} \subseteq {\mathcal {U}}\cup ({\mathcal {V}}- V) \quad \forall {\mathbf {x}} \quad ( {\mathbf {X}} = {\mathbf {x}} \; \square \!\!\rightarrow \; V = \phi _{V}({\mathbf {PA}}(V)_{\mathbf {X} = \mathbf {x}} )) \end{aligned}\)

where \({\mathbf {PA}}(V)_{\mathbf {X} = \mathbf {x}}\) assigns \(V\)’s parents the values determined by \(({\mathcal {U}}- \mathbf {X})_w, \mathbf {x}\), and \({\mathcal {E}}- \bigcup _i (\phi _{X_i}),\) for each endogenous \(X_i \in \mathbf {X}\). That is, \({\mathbf {PA}}(V)_{\mathbf {X} = \mathbf {x}}\) assigns \({\mathbf {PA}}(V)\) the values determined by the mutilated model \({\mathcal {M}}_{\mathbf {X} = \mathbf {x}}\), with the actual assignment of values to the exogenous variables. \(({\mathcal {V}}1)\) tells us that \(V\)’s value counterfactually depends upon the values of its parents, and only the values of its parents, in the way specified by \({\phi_{V}}\).

The causal counterfactuals in \((V)\) and \(({\mathcal {V}}1)\) are to be evaluated in the standard way (see Stalnaker 1968; Lewis 1973). To evaluate a counterfactual \(\text {A} \;\square \!\!\rightarrow \text {C}\), we consider a set of possibilities determined by \(\text {A}\) and the world of evaluation \(w\), and check to see whether \(\text {C}\) is true in those possibilities. Exactly which possibilities we ought to check is a complicated and controversial matter. However, for the most part, we can sidestep these issues here. We need only endorse the following general framework: there is a selection function, \(f\), which is a function from pairs of propositions, \(\text {A}\), and worlds, \(w\), to sets of worlds, \(f({\text {A}}, w)\). Whenever a counterfactual conditional \(\text {A} \;\square \!\!\rightarrow \text {C}\) is true at a world \(w\), what makes it the case that \(\text {A} \;\square \!\!\rightarrow \text {C}\) is true is that \(f({\text {A}}, w) \,\models \, \text {C}\).Footnote 9 Different accounts of the selection function will yield different truth conditions for counterfactual conditionals. However, for my purposes, it won’t matter what we say about \(f\), so long as we agree that it satisfies the following three properties.Footnote 10

(\(f1\)):

\(f({\text {A}}, w) \,\models \, \text {A}\)

(\(f2\)):

if \(\text {A} \,\models \, \text {B}\) then \(f({\text {B}}, w) \cap {\text {A}} \subseteq f({\text {A}}, \,w\))

(\(f3\)):

at \(w\), there is a hypothetical intervention to set any \(\mathbf {V} \subseteq {\mathcal {U}}\cup {\mathcal {V}}\) to any values \(\mathbf {v}\) which yields a world \(w_{\mathbf {V} = \mathbf {v}} \in f({\mathbf {V}} = {\mathbf {v}}, w)\)

Returning to \(({\mathcal {V}}1)\): that condition imposes two constraints on a structural equation, \(\phi _{V}\), in a causal model \({\mathcal {M}}\). In the first place, it says that the value of \(\phi _{V}\)’s left-hand-side variable, \(V\), is sensitive to the values of \(\phi _{V}\)’s right-hand-side variables, \({\mathbf {PA}}(V)\), and they are sensitive in precisely the way specified by \(\phi _{V}\). Changes in the values of those variables would lead to changes in the value of \(V\), and they would lead to precisely the changes specified by \(\phi _{V}\). Additionally, it says that the value of \(V\) is only directly sensitive to the values of \({\mathbf {PA}}(V)\). Holding fixed those values, interventions to change the values of the other variables in the model would not lead to changes in the value of \(V\).

Note that requiring a causal model to satisfy \(({\mathcal {V}}1)\) is stronger than merely requiring it to satisfy \((V)\), for each endogenous variable \(V \in {\mathcal {V}}\).

Fig. 4
figure 4

The system of structural equations \({\mathcal {E}}_{4}\)

To illustrate: suppose that the right rifleman takes the day off, so that the causal model shown in Fig. 4 correctly describes the structural determination relations between the captain’s giving the order (\(C\)), the left rifleman’s firing (\(L\)), and the deserter’s dying (\(D\)). Suppose that, at the actual world, the captain doesn’t give the order. Given the method for evaluating causal counterfactuals introduced in Sect. 2, this model entails (1).

$$\begin{aligned} C=1 \;\square \!\!\rightarrow D = 1 \end{aligned}$$
(1)

However, (1) does not follow from the truth of \((V)\), for each of the structural equations in \({\mathcal {E}}_{4}\),

$$\begin{aligned}C=0 &\;\square \!\!\rightarrow L =0 \\C=1 &\;\square \!\!\rightarrow L =1 \\L=0 &\;\square \!\!\rightarrow D=0 \\L=1 &\;\square \!\!\rightarrow D=1 \end{aligned}$$

since the counterfactual conditional is not transitive. \(({\mathcal {V}}1)\), by contrast, will require, inter alia, that both (2) and (3) be true in order for the system of structural equations \({\mathcal {E}}_{4}\) to be correct.

$$\begin{aligned}C=1 \;\square \!\!\rightarrow L=1\end{aligned}$$
(2)
$$\begin{aligned} (C = 1 \wedge L=1) \;\square \!\!\rightarrow D=1 \end{aligned}$$
(3)

And (2) and (3) do entail (1), given \((f1)\) and \((f2)\).

However, the truth of \(({\mathcal {V}}1)\) is not sufficient for the correctness of a structural equations model. Take the familiar example of Suzy and Billy throwing their rocks at a window. Both Suzy and Billy have excellent aim, so if either of them throws their rock, then the window will shatter; and the window is sturdy enough that if neither of them throw their rock, then the window will not shatter. Suppose that Suzy actually throws and Billy doesn’t, and that (411) are all true.

$$\begin{aligned} (B = 0 \wedge S = 0) \; \square \!\!\rightarrow \, W = 0 \end{aligned}$$
(4)
$$\begin{aligned} (B=0 \wedge S=1) \;\square\!\!\rightarrow W=1 \end{aligned}$$
(5)
$$\begin{aligned} (B = 1 \wedge S=0) \;\square\!\!\rightarrow W=1 \end{aligned}$$
(6)
$$\begin{aligned} (B=1 \wedge S=1) \;\square\!\!\rightarrow W=1 \end{aligned}$$
(7)
$$\begin{aligned} S=1 \;\square\!\!\rightarrow W=1 \end{aligned}$$
(8)
$$\begin{aligned} S=0 \;\square\!\!\rightarrow W=0 \end{aligned}$$
(9)
$$\begin{aligned} B=1 \;\square\!\!\rightarrow W=1 \end{aligned}$$
(10)
$$\begin{aligned} B=0 \;\square\!\!\rightarrow W=1 \end{aligned}$$
(11)

(where \(B, S\), and \(W\) are binary variables with the natural interpretation). If the truth of \(({\mathcal {V}}1)\) were sufficient for the correctness of a structural equations model, then the system of structural equations \({\mathcal {E}}_{5}\), shown in Fig. 5, would have to be correct.

Fig. 5
figure 5

The system of structural equations \({\mathcal {E}}_{5}\)

However, \({\mathcal {E}}_{5}\) says more than the counterfactuals (411) do. When \(B = 0\) and \(S = 1, {\mathcal {E}}_{5}\) entails that were Billy to have thrown, Suzy (still) would have, \(B = 1 \;\square \!\!\rightarrow S=1\). But it is consistent with the truth of (411) that Suzy wouldn’t have thrown her rock if Billy had thrown his. That is, it is consistent with (411) that the value of \(S\) is determined by the value of \(B\). And if \(S\) is determined by \(B\), then \({\mathcal {E}}_{5}\) would be incorrect in virtue of missing a necessary determination relation.

The problem with \(({\mathcal {V}}1)\) is that, by itself, it only places a constraint on the endogenous variables, yet a structural equations model gives us information not only about the endogenous variables, but about the exogenous variables as well—in particular, the model tells us that the exogenous variables are counterfactually independent of each other. On the counterfactual approach, there is a straightforward fix to this problem: we simply require that each exogenous variables is counterfactually independent of all the other variables in the model—i.e., at a world \(w\), for any exogenous variable \(U \in {\mathcal {U}}\), and any assignment of values \(\mathbf {x}\) to any subvector \({\mathbf {X}} \subseteq ({\mathcal {U}}- U) \cup {\mathcal {V}}\), were \(\mathbf {X}\) to take on those values, \(U\) would (still) take on its actual value, \(U_w\).

\(({\mathcal {U}}1)\) :

\(\begin{aligned} \forall U \in {\mathcal {U}}\quad \forall {\mathbf {X}} \subseteq ({\mathcal {U}}- U) \cup {\mathcal {V}}\quad \forall {\mathbf {x}} \quad ( {\mathbf {X}} = {\mathbf {x}} \;\square \!\!\rightarrow U = U_w )\end{aligned}\)

This gives us the following account of the correctness of a causal model \({\mathcal {M}}\):

\(({\mathcal {M}}1)\) :

\(\begin{aligned}&{\mathcal {M}} {\text{ is correct at }} w \text{ iff: } \\&1. \quad {\mathcal {V}} \text{ satisfies }\, {({\mathcal {V}}1)} \text{ at } w \\&2. \quad {\mathcal {U}} \text{ satisfies }\, ({\mathcal {U}}1) \text{ at } w \end{aligned}\)

That is: \({\mathcal {M}}\) is correct iff (1) for every endogenous variable \(V \in {\mathcal {V}}\), (a) the value of \(V\) counterfactually depends upon the values of the variables in \({\mathbf {PA}}(V)\) in precisely the manner specified by \(\phi _V\), and (b) holding fixed the value of \({\mathbf {PA}}(V), V\)  doesn’t counterfactually depend upon the values of the variables in \(({\mathcal {U}}\cup {\mathcal {V}}) - {\mathbf {PA}}(V)\); and (2) for every \(U \in {\mathcal {U}}\), the value of \(U\) is counterfactually independent of the values of the other variables in the model.

3.1 Problems with modularity

In order for a structural equations model to be correct, the equations in \({\mathcal {E}}\) must be modular—that is, that there be in-principle interventions to set the values of any subset of \({\mathcal {U}}\cup {\mathcal {V}}\) which leaves the structural equations of the non-intervened-upon endogenous variables intact. The problem is that modularity does not follow from \(({\mathcal {M}}1)\) alone; nor can we formulate the requirement of modularity in terms of any finite number of counterfactuals (even when the potential values of the variables in \({\mathcal {U}}\cup {\mathcal {V}}\) are finite).

Distinguish two kinds of modularity: weak and strong. According to weak modularity, when there is an intervention or interventions to set the values of variables in a correct causal model, the structural equations of the non-intervened-upon endogenous variables in the model will still be descriptively adequate. That is, when we perform hypothetical interventions on the values of the variables in \({\mathcal {U}}\cup {\mathcal {V}}\), taking us to the world \(w_i\), then, for every non-intervened-upon endogenous variable \(V\),

$$\begin{aligned} w_i \,\models \, V = \phi _V({\mathbf {PA}}(V)) \end{aligned}$$

According to strong modularity, whenever there is an intervention or interventions to set the values of variables in a correct causal model, the structural equations of the non-intervened-upon endogenous variables in the model will still be correct. That is, when we perform hypothetical interventions on the values of the variables in \({\mathcal {U}}\cup {\mathcal {V}}\), taking us to the world \(w_i\), for every non-intervened-upon endogenous variable \(V\),

$$\begin{aligned} w_i \,\models \, V \,{:}{=}\,\phi _V({\mathbf {PA}}(V)) \end{aligned}$$

On the counterfactual account, this means that it must at least be true that \(w_i \,\models \, ({{\mathcal {V}}1})\). That is, for every non-intervened-upon endogenous variable \(V\),

$$\begin{aligned} w_i \,\models \, \forall {\mathbf {X}} \subseteq {\mathcal {U}}\cup ({\mathcal {V}}- V) \quad \forall {\mathbf {x}} \quad ( {\mathbf {X}} = {\mathbf {x}} \; \square \!\!\rightarrow \; V = \phi _{V}({\mathbf {PA}}(V)_{\mathbf {X} = \mathbf {x}} )) \end{aligned}$$
(12)

\(({\mathcal {M}}1)\) does not guarantee strong modularity because it does not guarantee the truth of (12) for any endogenous variable \(V\).

To see why, consider again the structural equations model shown in Fig. 4. For that model, \(({\mathcal {M}}1)\) requires that \(C=1 \;\square \!\!\rightarrow D=1\): at all the closest worlds at which the captain gives the order, the deserter will die, \(f(C=1, @) \,\models \, D =1\).Footnote 11 However, \(({\mathcal {M}}1)\) does not require that \(f(C=1, @) \,\models \, L=0 \;\square \!\!\rightarrow D=0\). It does not require that, at all the closest worlds at which the captain gives the order, whether the deserter dies counterfactually depends upon whether the left rifleman fires. And that means that \(({\mathcal {M}}1)\) does not require that, at the worlds at which a hypothetical intervention setting \(C\) to \(1\) occurs, \(D\)’s value is still structurally determined by \(L\)’s value. So \(({\mathcal {M}}1)\) will not guarantee that the equations in \({\mathcal {E}}\) are strongly modular.

Weak modularity is not modularity enough. Structural equations don’t merely represent accidentally true patterns amongst variable values. They represent determination relations between variable values. To say that the structural equation \(V \,{:}{=}\,\phi ({\mathbf {PA}}(V))\) is unaffected by an intervention on another variable should be to say that the determination of \(V\) by \({\mathbf {PA}}(V)\) is unaffected.

Here’s a thought about how to achieve strong modularity: we don’t merely require that \(({\mathcal {V}}1)\) be satisfied. We additionally require that \({\mathcal {V}}\) meets the following condition.

\(({\mathcal {V}}2)\) :

\(\begin{aligned} \forall {\mathbf {X}} \subseteq {\mathcal {U}}\cup {\mathcal {V}} \quad &\forall {\mathbf {x}} \quad ( {\mathbf {X}} = {\mathbf {x}} \;\square \!\!\rightarrow ( \forall W \in {\mathcal {V}}- {\mathbf {X}} \\\forall {\mathbf {Y}} \subseteq {\mathcal {U}}\cup ({\mathcal {V}}- W) \quad &\forall {\mathbf {y}} \quad ( {\mathbf {Y}} = {\mathbf {y}} \;\square \!\!\rightarrow W = \phi _W({\mathbf {PA}}(W)_{\mathbf {X} = \mathbf {x}, \mathbf{Y}=\mathbf{y}} ) ) )) \quad \end{aligned}\)

\(({\mathcal {V}}2)\) says that, if there were an intervention to set the values of any set of variables \(\mathbf {X}\), then \(({\mathcal {V}}1)\) would still hold for all the non-intervened-upon variables. This solution is not satisfying. With this new account, we are told that what it is for a causal model to be correct is, inter alia, for both \(({\mathcal {V}}1)\) and \(({\mathcal {V}}2)\) to be satisfied. But while \(({\mathcal {V}}2)\) guarantees that, at the world where the hypothetical intervention occurs, \(({\mathcal {V}}1)\) will hold, we have as yet no guarantee that, at that world, \(({\mathcal {V}}2)\) will be satisfied. But if what it is for a structural equation to be correct is for both \(({\mathcal {V}}1)\) and \(({\mathcal {V}}2)\) to hold, then this account fails to guarantee that the structural equation will still be correct post-intervention; that is, it fails to secure strong modularity.

It’s actually a bit worse than that. \(({\mathcal {M}}1)\) cannot even guarantee weak modularity. For it could turn out that, for three variables \(V_1, V_2\), and \(V_3\),

$$\begin{aligned} f( V_1 = v_1 \wedge V_2 = v_2, @) \,\models \, V_3 = \phi _{V_3} ( {\mathbf {PA}}(V_3)) \end{aligned}$$
(13)

even thoughFootnote 12

$$\begin{aligned} f( V_2 = v_2, f(V_1 = v_1, @)) \not \models \, V_3 = \phi _{V_3} ( {\mathbf {PA}}(V_3)) \end{aligned}$$
(14)

While \(({\mathcal {M}}1)\) guarantees (13), it is consistent with (14). But this means that \(({\mathcal {M}}1)\) fails to guarantee that the equations in \({\mathcal {E}}\) will even be descriptively adequate after multiple sequential interventions. And the number of potential interventions is unbounded (we can always just set the value of \(X\) to \(x\), then set it to \(x' \ne x\), then set it back to \(x\), then back to \(x'\), and so on and so forth, indefinitely). So there is no finite number of counterfactuals that is sufficient to guarantee that the equations in \({\mathcal {E}}\) are even weakly modular (even when the number of potential values of the variables in \({\mathcal {U}}\cup {\mathcal {V}}\) is finite).

4 The nomic sufficiency understanding

In this section, I will suggest that we can retain all of the virtues of the causal counterfactual understanding of structural equation models, without running into the problems with modularity raised in Sect. 3.1 above, by moving to an understanding of structural equations according to which what makes them correct is that they are descriptively adequate throughout an area of modal space meeting certain constraints. For instance, an isolated structural equation \(V \,{:}{=}\,\phi _V({\mathbf {PA}}(V))\) is correct just in case, for every world \(w\) in some set of worlds \({\mathfrak {F}}_V, V_w = \phi _V({\mathbf {PA}}(V)_w)\).

A useful orienting picture here is Mackie (1965)’s notion of a causal field. Mackie argues that causal claims must be evaluated relative to a set of alternate states of affairs within which the causes are parts of an occurrent minimally sufficient condition for the effect.Footnote 13 He calls this set of alternative states of affairs the causal field. This is roughly how I am thinking of the set of worlds \({\mathfrak {F}}_V\). Just as, on Mackie’s account, the causes are the parts of an occurrent minimally sufficient condition for the effect within the causal field, on the nomic sufficiency account, the values of a variable \(V\)’s structural parents, \({\mathbf {PA}}(V)\), are minimally sufficient for the value of \(V\) within \({\mathfrak {F}}_V\).Footnote 14 In virtue of this resemblance, I will call the set of possibilities \({\mathfrak {F}}_V\,V\)’s causal field.

Of course, this is far too rough. For any structural equation \(\phi _V\), it will be easy to find some set of worlds within which \(\phi _V\) is descriptively adequate. A structural equation according to which my height structurally determines the size of the earth will be descriptively adequate throughout \({\mathfrak {F}}_V\) if I only include worlds in \({\mathfrak {F}}_V\) in which the earth’s diameter is a constant multiple of my height. But my height does not determine the size of the earth. The question of which possibilities to consider when evaluating the determination of one variable by another is a complicated one, but it is one that is faced by the nomic sufficiency account and the counterfactual account both. The counterfactual account solves it by appeal to some suitable selection function \(f\). And I see no reason why the nomic sufficiency account cannot similarly avail itself of this very selection function—whichever one we fancied for the counterfactual understanding of structural equations models—to characterize the worlds which must be included in \({\mathfrak {F}}_V\).Footnote 15

As a first step, if we’re considering an isolated structural equation \(V \,{:}{=}\,\phi _V({\mathbf {PA}}(V))\) at a world \(w\), then we can require that, for every assignment \(\mathbf {x}\) to any \({\mathbf {X}} \subseteq {\mathbf {PA}}(V)\), every world in \(f({\mathbf {X}} = {\mathbf {x}}, w)\) must be included in \({\mathfrak {F}}_V\). Similarly, if we’re considering a structural equation \(V \,{:}{=}\,\phi _V({\mathbf {PA}}(V))\) in a causal model \({\mathcal {M}}\) at a world \(w\), then we can require that, for every assignment of values \(\mathbf {x}\) to any \({\mathbf {X}} \subseteq {\mathcal {U}}\cup ({\mathcal {V}}- V)\), all the worlds in \(f({\mathbf {X}} = {\mathbf {x}}, w)\) must be included in \({\mathfrak {F}}_V\).

\(({\mathfrak {F}}1)\) :

\(\begin{aligned} \forall {\mathbf {X}} \subseteq {\mathcal {U}}\cup ({\mathcal {V}}- V) \quad \forall {\mathbf {x}} \quad f({\mathbf {X}} = {\mathbf {x}}, w) \subseteq {\mathfrak {F}}_V\end{aligned}\)

Putting this together with the requirement that \(\phi _V\) be descriptively adequate throughout \({\mathfrak {F}}_V\), we can say that a causal model \({\mathcal {M}}= ({\mathcal {U}}, {\mathcal {V}}, {\mathcal {E}})\) is correct at a world \(w\) only if \(({\mathcal {V}}3)\).

\(({\mathcal {V}}3)\) :

\(\begin{aligned} \forall V \in {\mathcal {V}}\quad \exists {\mathfrak {F}}_V \text{ such that } {\mathfrak {F}}_V \text{ satisfies } {(\mathfrak {F}}1) \text{ and } {\mathfrak {F}}_V \,\models \, V = \phi _V({\mathbf {PA}}(V)) \end{aligned}\)

If we stop here, then, in the presence of \(({\mathcal {U}}1)\), we get an account which is equivalent to the counterfactual account’s \(({\mathcal {V}}1)\). That is, given \(({\mathcal {U}}1), {\mathcal {M}}\) satisfies \(({\mathcal {V}}1)\) iff \({\mathcal {M}}\) satisfies \(({\mathcal {V}}3)\). (Theorem 1, proved in the appendix, establishes the equivalence.)

Since this condition on the endogenous variables is equivalent to the counterfactual account’s, if we stop here, we will run into the problems with modularity that we encountered in Sect. 3.1. However, we needn’t stop here. We can additionally require that the condition imposed by \(({\mathfrak {F}}1)\) holds, not only for the world of evaluation, but for every other world in \({\mathfrak {F}}_V\) as well.

\(({\mathfrak {F}}2)\) :

\(\begin{aligned} \forall w \in {\mathfrak {F}}_V \quad \forall {\mathbf {X}} \subseteq {\mathcal {U}}\cup ({\mathcal {V}}- V) \quad \forall {\mathbf {x}} \quad f({\mathbf {X}} = {\mathbf {x}}, w) \subseteq {\mathfrak {F}}_V \end{aligned}\)

This amounts to the requirement that the set \({\mathfrak {F}}_V\) is closed under counterfactual suppositions about the values of any of the variables in \({\mathcal {U}}\cup ({\mathcal {V}}- V)\). At any world \(w \in {\mathfrak {F}}_V\), making counterfactual suppositions about the values of any of the variables in \({\mathcal {U}}\cup ({\mathcal {V}}- V)\) will deliver a set of worlds inside of \({\mathfrak {F}}_V\).

Putting this together with the requirement that \(\phi _V\) be descriptively adequate throughout \({\mathfrak {F}}_V\), we get an account according to which a causal model \({\mathcal {M}}\) is correct at a world \(w\) only if \(({\mathcal {V}}4)\).

\(({\mathcal {V}}4)\) :

\(\begin{aligned} \forall V \in {\mathcal {V}}\quad \exists {\mathfrak {F}}_V \ni w \text{ such that } {\mathfrak {F}}_V \text{ satisfies } {(\mathfrak {F}}2) \text{ and } {\mathfrak {F}}_V \,\models \, V = \phi _V({\mathbf {PA}}(V)) \end{aligned}\)

A structural equation belonging to a causal model satisfying \(({\mathcal {V}}4)\) will continue to belong to a causal model satisfying \(({\mathcal {V}}4)\) after any number of interventions to set the values of any of the other variables in the model. In the appendix, I prove the following theorem.

Theorem 2

Given (f3), if \(\phi _V\) belongs to a causal model satisfying \(({\mathcal {V}}4)\) at a world \(w_0\), then \(\phi _V\) will continue to belong to a causal model satisfying \(({\mathcal {V}}4)\) after any number of consecutive hypothetical interventions to set the values of any \({\mathbf {X}} \subseteq {\mathcal {U}}\cup ( {\mathcal {V}}- V)\).

This means that the nomic sufficiency account is not subject to the objection I raised for the counterfactual account in the previous section—viz., that it could not guarantee that a structural equation \(\phi _V\) would continue to be correct after multiple sequential interventions to set the values of the variables other than \(V\).

In Sect. 3, we saw that a structural equations model says more than just that the endogenous variables \(V \in {\mathcal {V}}\) are structurally determined by their parents, and are not structurally determined by any of the other variables in \({\mathcal {U}}\cup {\mathcal {V}}\). It additionally says that the exogenous variables aren’t determined by any of the other variables in \({\mathcal {U}}\cup {\mathcal {V}}\). We can accomplish this within the nomic sufficiency account in the following way. Say that one model \({\mathcal {M}}' = ( {\mathcal {U}}', {\mathcal {V}}', {\mathcal {E}}' )\) eclipses another model \({\mathcal {M}}= ({\mathcal {U}}, {\mathcal {V}}, {\mathcal {E}}), {\mathcal {M}}\sqsubset {\mathcal {M}}'\), iff \({\mathcal {M}}'\) and \({\mathcal {M}}\) share all the same variables and \({\mathcal {M}}'\) contains strictly more structural determination relations between those variables. That is:

$$\begin{aligned}&{\mathcal {M}}\sqsubset {\mathcal {M}}' \text{ iff: } \\&1. \quad {\mathcal {U}}\cup {\mathcal {V}}= {\mathcal {U}}' \cup {\mathcal {V}}' \\&2. \quad \forall V \in {\mathcal {U}}\cup {\mathcal {V}}, \quad {\mathbf {PA}}(V) \subseteq {\mathbf {PA}}'(V) \\&3. \quad \exists V \in {\mathcal {U}}\cup V, \quad {\mathbf {PA}}(V) \,\subsetneq\, {\mathbf {PA}}'(V) \\ \end{aligned}$$

(where ‘\({\mathbf {PA}}'(V)\)’ is a vector of \(V\)’s structural parents in the model \({\mathcal {M}}'\).) Now, we can enrich our account of the correctness of causal models by requiring that a model not be eclipsed by any other model which satisfies \(({\mathcal {V}}4)\).

\(({\mathcal {M}}2)\) :

\(\begin{aligned}&{\mathcal {M}} \text{ is correct at } { w} \text{ iff: } \\&1. \quad {\mathcal {M}} \text{ satisfies } {(\mathcal {V}}4) \text{ at } { w} \\&2. \quad \lnot \exists {\mathcal {M}}' \text{ such that } {\mathcal {M}}' \text{ satisfies } {(\mathcal {V}}4) \text{ at } w \text{ and } {\mathcal {M}}\sqsubset {\mathcal {M}}' \end{aligned}\)

Returning to the example of Billy, Suzy, and the window (shown in Fig. 5): if Billy is eager to see the window shatter, and will throw his rock if (but only if) Suzy doesn’t throw hers, then, given some assumptions about the selection function \(f\), the system of structural equations shown in Fig. 6 will satisfy \(({\mathcal {V}}4)\). (‘\(\overline{x}\)’ is the truth function \(1 - x\)). And this causal model eclipses the model consisting of the sole structural equation \(W \,{:}{=}\,S \vee B\), shown in Fig. 5. So, according to \(({\mathcal {M}}2)\), the causal model in Fig. 5 will not be correct, if this one is. So, if Billy’s decision about whether or not to throw is determined by whether Suzy throws, then the model consisting of just the equation \(W \,{:}{=}\,S \vee B\) is not correct. That model tells us that whether Billy throws isn’t determined by whether Suzy throws, which is false.

Fig. 6
figure 6

\({\mathcal {E}}_{6}\) eclipses \({\mathcal {E}}_{5}\)

4.1 Interventions and modularity

The nomic sufficiency account of causal models affords an understanding of hypothetical interventions. On this understanding, a hypothetical intervention on an endogenous variable \(V\) is a counterfactual supposition which takes one outside of the causal field \({\mathfrak {F}}_V\), while remaining inside the causal fields of all the other endogenous variables in the model. Consider, for instance, the causal model of the captain, the riflemen, and the deserter shown in Fig. 2. Suppose that this causal model is correct at \(w_0\), and that \(C_{w_0} = 0\)—the captain doesn’t give the order at \(w_0\). Suppose that we wish to entertain a hypothetical intervention to set the value of \(L\) to \(1\). We know that this is to be modeled in the following way: we take the original system of structural equations \({\mathcal {E}}_{2}\) and replace it with \({\mathcal {E}}_{2, \,L = 1}\), shown in Fig. 3. In the mutilated system of equations \({\mathcal {E}}_{2 s, \,L = 1}, L\) does not merely take on the value of \(1\). Additionally, the value of \(L\) is not structurally determined by the value of \(C\). Whether the left rifleman fires is not determined by whether the captain gives the order. This follows immediately from the correctness of the system of equations \({\mathcal {E}}_{2, \,L = 1}\), given \(({\mathcal {M}}2)\), since if \(L\) were still structurally determined by \(C\), then \({\mathcal {M}}_{2, \,L = 1}\) would be eclipsed by \({\mathcal {M}}_{2}\).

Since we’ve said that a structural equation \(\phi _V\) is in force at a world \(w\) iff \(w\) lies inside of a causal field \({\mathfrak {F}}_V\) satisfying \(({\mathfrak {F}}2)\), this means that a hypothetical intervention to set the value of \(L\) must take us to a world \(w_1\) which lies outside of \({\mathfrak {F}}_L\) (since \(L\) is not structurally determined by \(C\)), but still inside of \({\mathfrak {F}}_R\) and \({\mathfrak {F}}_D\) (since \(D\) is still structurally determined by \(L\) and \(R\), and \(R\) is still structurally determined by \(C\)), as shown in Fig. 7a. This provides a semantic interpretation of what’s going on when we model an intervention on \(L\) by removing \(L\)s’s structural equation and leaving the other structural equations in place.

Fig. 7
figure 7

Interventions on the nomic sufficiency understanding

It also provides an explanation of why only certain methods of setting the value of \(L\) to \(1\) count as interventions, and it provides a criterion for distinguishing those ways of setting the values of the variables which do from those which do not constitute interventions. For instance, if we were to get the left rifleman to fire by bribing the captain to give him the order, then this would not constitute an intervention on \(L\), since it would leave us inside of the causal field \({\mathfrak {F}}_L\), as shown in Fig. 7b. Similarly, suppose that the captain does not want to kill the deserter, but would welcome an opportunity to let the riflemen blow off some steam. Then, we might be able to get the left rifleman to fire by putting up a bullet-proof partition between the riflemen and the deserter. Then, the captain would give the order, and the left rifleman would fire. Even though this is an intervention which makes the left rifleman fire, it is not an intervention on on the value of \(L\). Rather, since it leaves \(L\) and \(R\)’s determination by \(C\) intact, but severs the determination of \(D\) by \(L\) and \(R\), as shown in Fig. 7c, it constitutes an intervention on \(D\).

This generalizes. A causal model \({\mathcal {M}}\) will be correct throughout \({\mathfrak {F}}_{\mathcal {M}}\mathop {=}\limits ^\mathrm{{{def}}}\bigcap _{V \in {\mathcal {V}}} {\mathfrak {F}}_V\). This is the area of modal space in which every endogenous variable’s structural equation \(V \in {\mathcal {V}}\) is in force—it is the area in which all of the causal fields of the endogenous variables overlap. Theorem 3, proved in the appendix, establishes that, given \(({\mathcal {M}}2)\), this area of modal space will contain every assignment of values to \({\mathcal {U}}\cup {\mathcal {V}}\) which is consistent with the structural equations in \({\mathcal {E}}\), and no assignments of values to \({\mathcal {U}}\cup {\mathcal {V}}\) which is inconsistent with the structural equations in \({\mathcal {E}}\).

Theorem 3

Given (f1), if a causal model \({\mathcal {M}}= ( {\mathcal {U}}, {\mathcal {V}}, {\mathcal {E}})\) is correct according to \(({\mathcal {M}}2)\), then \({\mathfrak {F}}_{\mathcal {M}}\) \(\mathop {=}\limits ^\mathrm{{{def}}}\) \(\bigcap _{V \in {\mathcal {V}}} {\mathfrak {F}}_{V}\) contains all and only the allowed assignment of values to the variables in \({\mathcal {U}}\cup {\mathcal {V}}\), where an assignment is allowed just in case it is a solution to the equations in \({\mathcal {E}}\).

In particular, this means that, for any assignment of values to the exogenous variables, there will be some area of modal space inside \({\mathfrak {F}}_{\mathcal {M}}\) where that assignment of values is realized. So there are in-principle hypothetical interventions to set the values of any of the exogenous variables without disrupting any of the structural determination relations in \({\mathcal {E}}\).

Moreover, it follows from \(({\mathcal {M}}2)\), (f3), and the definition of intervention provided above that, for any correct causal model \({\mathcal {M}}= ({\mathcal {U}}, {\mathcal {V}}, {\mathcal {E}})\), any \(\mathbf {V}\, \subsetneq\, {\mathcal {V}}\), and any assignment of values to \(\mathbf {V}\), there will always be an area of modal space which is outside the causal fields of all the members of \(\mathbf {V}\) but still inside the causal fields of all of the members of \({\mathcal {V}}- \mathbf {V}\), and which contains every possible assignment of values to \(\mathbf {V} \cup {\mathcal {U}}\).

Theorem 4

Given \(({\mathcal {M}}2)\) and (f3), for any \(\mathbf {V} \,\subsetneq \,{\mathcal {V}}\),

$$\begin{aligned} \bigcap _{W \notin \mathbf {V}} {\mathfrak {F}}_{W} - \bigcup _{V \in \mathbf {V}} {\mathfrak {F}}_{V} \end{aligned}$$

is non-empty and contains every assignment of values to the variables in \(\mathbf {V} \cup {\mathcal {U}}\).

This means that, if a structural equations model is correct, according to \(({\mathcal {M}}2)\), then there is an in-principle intervention to set any subset of the variables in \({\mathcal {V}}\) to any assignment of values which will leave the structural equations of the non-intervened-upon variables intact. This, together with theorems 2 and 3, guarantees that the structural equations in \({\mathcal {E}}\) are strongly modular.

4.2 Causal counterfactual dependence

\(({\mathcal {M}}2)\) allows us to provide an account of causal counterfactual conditionals in terms of relations of structural determination. On this account, a causal counterfactual \(\text {A} \;\square \!\!\rightarrow \text {C}\) is true at a world \(w\) iff there is a causal model \({\mathcal {M}}\), correct at \(w\), such that, given the exogenous variable assignment \({\mathcal {U}}_w\), if \({\mathcal {M}}\) is minimally mutilated so as to make \(\text {A}\) true, then \(\text {C}\) is true in the resulting model.

$$(\square \!\!\rightarrow _{{\mathcal {M}}}) \quad\quad {\text {A}} \;\square \!\!\rightarrow {\text {C}} \quad \iff \quad {\mathcal {M}}_{\text {A}}, {\mathcal {U}}_{w} \,\models \, \text {C}$$

With this kind of account, we could take structural determination to be more primitive than causal counterfactual dependence, and use the former to provide an account of the latter. That is to say: with this account, we need not define causal counterfactual dependence directly in terms of the selection function; rather, \(f\) can be used to provide truth conditions for \({\mathcal {M}}\), which can be used to provide truth conditions for \(\;\square \!\!\rightarrow\).

The counterfactual understanding, in contrast, retained an account of causal counterfactual conditionals according to which \(\text {A} \;\square \!\!\rightarrow \text {C}\) is true at the world of evaluation, \(w\), iff all the worlds in \(f({\text {A}}, w)\) are worlds at which \(\text {C}\) is true.

$$(\square \!\!\rightarrow _f) \quad\quad {\text {A}} \;\square \!\!\rightarrow {\text {C}} \quad \iff \quad f({\text {A}}, {w}) \,\models \, \text {C}$$

Depending upon our semantics for \(f\), there may be cases in which \((\square \!\!\rightarrow _{{\mathcal {M}}})\) and \((\square \!\!\rightarrow _f)\) diverge. Just to fix ideas: consider an account roughly like that of Lewis (1979) or Maudlin (2007). On Maudlin’s account, \(f({\text {A}}, w)\) is the set of worlds that you get by performing a surgical alteration on \(w\) so as to make \(\text {A}\) true at the relevant time, and then time-evolving the resulting state of the world forward in time according to the fundamental laws of nature. While Lewis (1979)’s account is slightly more complicated, it will achieve the same results as Maudlin’s in the case I’ll be considering.Footnote 16

Fig. 8
figure 8

The system of structural equations \({\mathcal {E}}_{8}\)

Imagine that I’ve got a tychistically chancy coin—whether it lands heads is not determined by the previous microphysical state of the universe and the laws of nature; rather, given that it’s flipped, the previous state of the universe and the laws of nature assign a probability of one half to the coin landing heads and a probability of one half to the coin landing tails. I’m going to flip the coin, and I offer you a bet on whether or not the coin lands heads. I’m an honest player, so if you take the bet and the coin lands heads, then you’ll win some money. If you take the bet and the coin lands tails, then you’ll lose some money. If you don’t take the bet, then you’ll neither win nor lose any money, independent of whether or not the coin lands heads. Let’s stipulate that the chance that the coin lands heads is unaffected by whether you take the bet. In this scenario, it appears that the structural equations model shown in Fig. 8 is correct, where \(B\) is a binary variable that takes the value \(1\) if you accept the bet and \(0\) if you don’t accept the bet, \(H\) is a binary variable which takes the value \(1\) if the coin lands heads and \(0\) if the coin lands tails, and \(W\) is a ternary variable which takes the value \(0\) if you neither win nor lose money, \(1\) if you lose money, and \(2\) if you win money. Let’s say that, at the actual world, you refuse the bet and the coin lands heads.

Suppose that we adopt the Maudlin account of the selection function. Then, \(({\mathcal {M}}1)\) and \((\square \!\!\rightarrow _f)\) will tell us that this structural equations model is not correct, since condition \(({\mathcal {U}}1)\) will not be satisfied. \(({\mathcal {U}}1)\), recall, required that, were some of the exogenous variables to have taken on different values, the other exogenous variables would have retained their actual values. However, \(f(B=1, @) \not \models \, H = 1\), since when we surgically alter the state of the world so as to make \(B=1\) true and time-evolve the resulting state of the world into the future according to the fundamental laws of nature, there are two possibilities: one in which the coin lands heads and one in which the coin lands tails. In contrast, the causal model in Fig. 8 will satisfy the second clause of \(({\mathcal {M}}2)\)—(\({\mathcal {M}}2\), \(2\))—so long as there is no other structural equations model which satisfies (\({\mathcal {M}}2\), \(1\)) according to which \(W\) is determined by \(B\) and \(H\), and either \(B\) is determined by \(H\) or \(H\) is determined by \(B\). Since \(f(B=1, @)\) contains both \(H=0\) worlds and \(H=1\) worlds, there will be no set of worlds containing \(f(B=1, @)\) which entails that \(H\) is any function of \(B\). So \(H\) is not determined by \(B\), according to (\({\mathcal {M}}2\), \(1\)). Assuming that, under the counterfactual supposition that the coin lands tails, you still refused the bet, the structural equations model shown in Fig. 8 will satisfy (\({\mathcal {M}}2\), \(2\)).

Of course, there’s no reason that the counterfactual account can’t replace \(({\mathcal {U}}1)\) with the requirement that a causal model be uneclipsed. On an account like this, a model \({\mathcal {M}}= ({\mathcal {U}}, {\mathcal {V}}, {\mathcal {E}})\) will be correct iff \({\mathcal {V}}\) satisfies \(({\mathcal {V}}1)\) and there is no other structural equations model \({\mathcal {M}}'\) which satisfies \(({\mathcal {V}}1)\) and eclipses \({\mathcal {M}}\). Even this emended counterfactual account will fail to say that the system of equations in Fig. 8 is correct. For \(f(B = 1, @) \not \models \, W=2\), since there are some worlds in \(f(B = 1, @)\) where the coin lands tails and you therefore lose the bet. So, \(({\mathcal {V}}1)\), wedded with a Maudlin-esque account of the selection function, entails that the system of structural equations in Fig. 8 is incorrect.

Independent of its ability to vindicate the system of structural equations \({\mathcal {E}}_{8}\), the fact that this account of the selection function, together with \((\square \!\!\rightarrow _f)\), entails the falsity of \(B=1 \;\square \!\!\rightarrow W=2\) strikes me, as it has struck many,Footnote 17 as the wrong result. Whether the coin lands heads is entirely unaffected by whether you took the bet. Since the coin actually landed heads, if you had taken the bet, you would have won. Now, there are moves to be pulled here—we can alter our account of the selection function so that, if the coin actually lands heads, then only the worlds where the coin lands heads are included in \(f(B=1, @)\). Note, however, that the account consisting of \(({\mathcal {M}}2)\) and \((\square \!\!\rightarrow _{{\mathcal {M}}})\) need not avail itself of those maneuvers. Even with the bare Maudlin account of the selection function, that account entails that, were you to take the bet, you would have won. According to \(({\mathcal {M}}2)\), the correctness of the structural equations model in Fig. 8 does not depend upon whether the worlds in \(f(B = 1, @)\) are worlds in which the coin lands heads or tails, or whether they are worlds in which you win or lose. \(({\mathcal {M}}2)\) only requires that, at all the worlds in \(f(B = 1, @)\) at which the coin lands tails, you lose; and that, at all the worlds in \(f(B=1, @)\) at which the coin lands heads, you win. Assuming that similar remarks apply to all the other worlds in \({\mathfrak {F}}_W\), the structural determination relations shown in Fig. 8 will be in force. Then, those structural determination relations will entail, via \((\square \!\!\rightarrow _{{\mathcal {M}}})\), that \(B=1 \;\square \!\!\rightarrow W=2\). So, according to \(({\mathcal {M}}2)\) and \((\square \!\!\rightarrow _{{\mathcal {M}}})\), it is possible for a counterfactual \(\mathrm {A} \;\square \!\!\rightarrow \mathrm {C}\) to be true at a world \(w\), even though \(f(\mathrm {A}, w) \not \models \, \mathrm {C}\). (See Fig. 9).

Fig. 9
figure 9

The relationship between the causal field \({\mathfrak {F}}_W\), the selection function \(f\), and the counterfactual \(B=1 \;\square \!\!\rightarrow W=2\)

This is not easily mimicked by the counterfactual account, for that account is committed to both \((\square \!\!\rightarrow _{{\mathcal {M}}})\) and \((\square \!\!\rightarrow _f)\). For instance, the counterfactual theorist might want to attempt to adopt the nomic sufficiency account’s treatment of the coin toss case by emending \(({\mathcal {V}}1)\) to read:

$$({\mathcal {V}}5) \quad \forall V \in {\mathcal {V}}\quad \forall {\mathbf {X}}\subseteq {\mathcal {U}}\cup ({\mathcal {V}}- V) \quad \forall {\mathbf {x}}\quad ( {\mathbf {X}} = {\mathbf {x}} \quad \;\square \!\!\rightarrow \quad V = \phi _{V}({\mathbf {PA}}(V) ) )$$

\(({\mathcal {V}}5)\), unlike \(({\mathcal {V}}1)\), does not require that, were \(\mathbf {X}\) to take on the values \(\mathbf {x}, V\) would take on the value it is given in the mutilated model \({\mathcal {M}}_{\mathbf {X}=\mathbf {x}}\), with the actual assignment of values to the exogenous variables. It simply requires that, whatever values \(V\)’s parent variables end up taking on when the values of \(\mathbf {X}\) change, the value of \(V\) remains a function \(\phi _V\) of those values. This would allow the counterfactual theorist to agree with the nomic sufficiency theorist that were you to have taken the bet, you would have won, \(B = 1 \;\square \!\!\rightarrow W=2\). However, since the counterfactual theorist is still committed to \((\square \!\!\rightarrow _f)\), so long as they retain the simple Maudlin account of the selections function, they must also deny that were you to have taken the bet, you would have won, since \(f(B=1, @) \not \models \, W=2\). And this is a straightforward contradiction.

The counterfactual theorist might want to respond to these kinds of considerations by denying \((\square \!\!\rightarrow _f)\), and reformulating their account of the correctness conditions of causal models directly in terms of the selection function \(f\), saying nothing of counterfactuals. That is, they could replace \(({\mathcal {V}}5)\) with \(({\mathcal {V}}6)\).

$$({\mathcal {V}}6) \quad \forall V \in {\mathcal {V}}\quad \forall {\mathbf {X}} \subseteq {\mathcal {U}}\cup ({\mathcal {V}}- V) \quad \forall {\mathbf {x}} \quad f({\mathbf {X}} = {\mathbf {x}}, @) \,\models \, V = \phi _{V}({\mathbf {PA}}(V) )$$

It follows from Lemmas 1 and 2 (Sect. 6) that, in the presence of \(({\mathcal {U}}1), ({\mathcal {V}}6)\) is equivalent to \(({\mathcal {V}}1)\). Of course, we just encountered reason for such a theorist to abandon \(({\mathcal {U}}1)\)—namely that, together with the Maudlin-esque account of \(f\), it is inconsistent with the correctness of \({\mathcal {E}}_{8}\). And once \(({\mathcal {U}}1)\) has been replaced with a condition along the lines of (\({\mathcal {M}}2\), \(2\)), \(({\mathcal {V}}6)\) will no longer be equivalent to \(({\mathcal {V}}1)\).Footnote 18 Given that it denies any direct connection between \(f\) and counterfactual conditionals, we might well wonder whether the resulting account deserves the name ‘counterfactual’ any longer, but put that question to the side. Whatever we call the resulting account, it is only able to avoid complicating its account of \(f\) by inching ever closer to the nomic sufficiency account. The only thing separating the two accounts at this point is the nomic sufficiency account’s closure condition, \(({\mathfrak {F}}2)\). This is the aspect of the account which solves the problems with modularity raised in Sect. 3.1 above. It appears that any counterfactual account built around \(({\mathcal {V}}6)\) which was able to solve those problems with modularity would end up being equivalent to (or would entail) the nomic sufficiency account. For it appears that the only way to solve those problems is to impose a constraint on which worlds are reachable by repeated counterfactual supposition; for worlds that are so reachable, put them in the set \({\mathfrak {F}}_V\), and modularity will then guarantee that \({\mathfrak {F}}_V \,\models \, V = \phi _V({\mathbf {PA}}(V))\). If that’s right, then such an account would impose all the same constraints as the nomic sufficiency account; and counterfactual theorists would have mimicked nomic sufficiency theorists only by becoming nomic sufficiency theorists themselves.

4.3 A remaining worry

Above, I defined modularity as the thesis that any number of interventions on a set of variables \(\mathbf {V}\) leaves the structural equations associated with every endogenous variable \(V \notin \mathbf {V}\) unaffected. Theorems 24 guarantee that a correct system of structural equations will be modular in this sense. Note, however, that modularity does not guarantee that there will always be an intervention on a set of variables \(\mathbf {V}\) such that, post-intervention, the variables in \(\mathbf {V}\) are no longer determined by any of the variables in \({\mathbf {PA}}(\mathbf {V})\). For a schematic example, consider the causal fields shown in Fig. 10. In that diagram, the causal model \({\mathcal {M}}\) will be correct at the world \(w_0 \in {\mathfrak {F}}_{\mathcal {M}}\mathop {=}\limits ^\mathrm{{{def}}}\bigcap _{V \in {\mathcal {V}}} {\mathfrak {F}}_V\). Now, theorem 4 guarantees us that there is a set of worlds \(f({\mathbf {V}} = {\mathbf {v}}, w_0)\) which lie outside of \(\bigcup _{V \in \mathbf {V}} {\mathfrak {F}}_V\), yet inside of \(\bigcap _{U \notin \mathbf {V}} {\mathfrak {F}}_U\), and at which \(\mathbf {V}=\mathbf {v}\), for any assignment \(\mathbf {v}\). However, we have no guarantee that this set of worlds doesn’t lie within some other causal field \({\mathfrak {F}}'_V\) such that \({\mathfrak {F}}'_V \,\models \, V = \phi _V'({\mathbf {PA}}(V), \dots )\), for some \(V \in \mathbf {V}\). And that means that, even though we have a guarantee that an intervention on a set of variables will sever the actual structural determination relations between \(\mathbf {V}\) and \({\mathbf {PA}}(\mathbf {V})\), we don’t have any guarantee that the intervention won’t make it the case that some other structural determination relations link \(\mathbf{PA}(V)\) to \(V\).

Fig. 10
figure 10

On the nomic sufficiency account, an intervened-upon variable may still be determined by its structural parents

For a concrete example which might give rise to a case like this, consider the steam vent illustrated in Fig. 11. There, a switch, which may be placed to the left or to the right, will either divert the steam to the left or the right. (If the switch is left, the steam will go right, as shown in Fig. 11a; if the switch is right, the steam will go left, as shown in Fig. 11b). There is a lid on the right steam vent. If the steam is directed up to the right vent, then the lid will heat up. Consider the variables \(S\) and \(L\). \(S\) is \(1\) if the switch is to the left, and is \(0\) if the switch is to the right. \(L\) is \(1\) if the lid is hot and is \(0\) if the lid is not hot. When the system is as depicted in Fig. 11a, the structural equation \(L \,{:}{=}\,S\) will be in force. Whether the lid is hot is determined by whether the switch is to the left or right. In Fig. 11a, both \(S\) and \(L\) will be \(1\). Now, suppose that the lid is attached to a hinge, so that it can be pivoted to sit atop either the left or the right steam vent. There is then an intervention we may perform to set \(L\) to \(0\). That is, there is a method for making the lid not hot which will take us outside of the causal field \({\mathfrak {F}}_L\). We may simply pivot the lid on its hinge to put it atop the left steam vent, as in Fig. 11c. Then, it will no longer be the case that \(L=1\), nor will it be the case that \(S\) determines \(L\) according to the equation \(L \,{:}{=}\,S\). However, even after this intervention has taken place, the value of \(S\) will determine the value of \(L\). It will now do so according to the equation \(L \,{:}{=}\,\overline{S}\). If the switch is set to the left, then the lid will not be hot, and if the switch is set to the right, then the lid will be hot.

Fig. 11
figure 11

An example of how an intervened-upon variable may still be determined by its structural parents

Given the account of interventions provided in Sect. 4.1 above, this will count as an intervention on the value of \(L\). However, it would be inappropriate to model the result of this intervention by mutilating the model, removing \(L\)’s structural equation, and replacing it with nothing. For, in order for the mutilated model to be correct, it must be uneclipsed by any correct structural equations model. And, in this case, the mutilated model in which \(S\) does not structurally determine \(L\) would be eclipsed by the model containing the equation \(L \,{:}{=}\,\overline{S}\).

Cases such as these might also make trouble for \((\square \!\!\rightarrow _{{\mathcal {M}}})\), as \((\square \!\!\rightarrow _{{\mathcal {M}}})\) would predict that, if we are in the situation depicted in Fig. 11a, then, were the lid to not be hot, then, if the switch were moved to the right, the lid would not be hot—\(L = 0 \;\square \!\!\rightarrow (S = 0 \;\square \!\!\rightarrow L = 0)\). However, if we think that there are worlds in \(f(L=0, @)\) at which the lid has been pivoted on its hinge, then we might think that this counterfactual should be false.

It is unclear whether this ought to be regarded as a problem for the nomic sufficiency account. To the extent that one is inclined to think that \(f(L=0, @)\) includes worlds at which the lid has been pivoted on its hinge, it seems to me entirely correct to say that we ought not model an intervention on \(L\) which pivots the lid on its hinge by removing the structural determination relation between \(L\) and \(S\), and it seems a mark in the nomic sufficiency account’s favor that it says so. To the extent that one is inclined to think that \(f(L=0, @)\) contains worlds at which the lid is removed from its hinge, or perhaps worlds at which some kind of Lewisian miracle keeps the lid from getting hot even though the steam is being directed up towards it, it seems entirely correct to model this kind of intervention by removing the structural determination relation between \(S\) and \(L\). Once there is a miracle to set \(L\) to \(0\), changes in the value of \(S\) will not affect the value of \(L\), so long as God’s hand is steady. If one is unhappy with the possibility of \(L\) being determined by \(S\) post-intervention, then one may simply require that \(f(L=0, @)\) contain only worlds at which Lewisian miracles determine the value of \(L\). Of course, nothing in the account guarantees that there will always be some possible Lewisian miracle which will constitute an intervention. However, if there are cases in which Lewisian miracle interventions are impossible, then I’m inclined to say just what I said above about the case in which \(f(L=0, @)\) contained worlds at which the lid was pivoted on its hinge: in such cases, it is incorrect to model the result of the intervention by mutilating the model, and it would be a mark against an account of structural determination if it said otherwise. So my settled judgment is that this is a feature, rather than a bug, of the nomic sufficiency account.

Some will disagree. They will have the following reaction to this case: the structural equations model containing just the equation \(L \,{:}{=}\,S\) ought not be deemed correct by an account of structural determination. For there is another, better, model which contains an additional variable describing the position of the lid. For instance, if we use the variable \(P\), which is \(1\) if the lid is pivoted to the left and \(0\) if the lid is pivoted to the right, then the model \(((S, P), (L), (L \,{:}{=}\,S\, \veebar\, P))\) will be correct.Footnote 19 \(^,\) Footnote 20 I agree that this new model is correct; however, it appears to me that, so long as the lid is pivoted to the right, the original model is correct, too. And I see no reason why these two models cannot both be correct together. There are a great many structural equations models to which we could, if we chose, add additional exogenous variables, but they are none the worse for that. A model in which whether the match lights is determined by whether it is struck is correct, even though we could add to it an additional variable for the presence of oxygen. We often wish to ignore certain background properties of the world which determine an outcome; and an account of structural equations models ought to permit this. However, if one is not persuaded by these considerations, and one wishes to rule out structural equations like \(L \,{:}{=}\,S\) when there are correct structural equations like \(L \,{:}{=}\,S \,\veebar\, P\) to replace them, then the nomic sufficiency account could be emended to achieve this in a variety of ways.Footnote 21 From my perspective, such emendations are unnecessary and ill-advised; but I would not be surprised to learn that others disagree.

5 In summation

After developing the counterfactual understanding of structural determination, I argued that it faces difficulties in securing the modularity of structural determination relations. I advanced an alternate understanding of structural determination and I demonstrated that it guarantees that structural determination relations are modular. I showed that it provides a clear and straightforward way of thinking about hypothetical interventions, as well as a criterion for distinguishing hypothetical changes in the values of variables which constitute interventions from those that do not. By treating structural determination relations as more fundamental than causal counterfactuals, the resulting theory was able to yield a clean solution to a problem case for ‘closest possible world’ semantics for counterfactuals.