1 Preliminaries

1.1 Proportionality

Some events are more general than others. Suzy’s throwing the rock is more general than Suzy’s throwing the rock gleefully, and less general than Suzy’s throwing something. This can be understood in terms of the idea that some events determine others (Yablo, 1992a, b): an event A is more general than an event B just in case B determines A (in the sense that B’s occurring is a way for A to occur.)Footnote 1

When it comes to causation, generality seems to matter. Take Yablo’s familiar example:

(Sophie) Sophie the pigeon is trained to peck at red stimuli. A scarlet tile is presented. She pecks.

Intuitively, the tile’s being red is a cause of Sophie’s pecking. The tile’s being scarlet is not a cause, because it is too specific; Sophie would still have pecked if she saw some other shade of red (crimson, say). Conversely, the tile’s merely being colored is also not a cause, because it is too general; not any colored tile would have led to pecking.

Such examples suggest that causes obey some form of ‘proportionality’ principle: at a first pass, they involve all and only those details which are relevant to their effect. Various authors have defended different versions of this idea (e.g. Yablo, 1992a, 2003; Strevens, 2008; List & Menzies, 2009). Fleshing it out requires characterizing both what proportionality is—that is, what it is for a cause to be proportional to its effect—and the role it plays with respect to causation.

Proportionality may be understood in various ways in terms of counterfactuals, probabilities, and/or laws. For concreteness, I focus on the following simple schematic characterization:Footnote 2

c is proportional to e iff: (i) c is causally sufficient for e, and (ii) no event which is more general than c is causally sufficient for e.

For example, the tile’s being colored is too general for pecking since it fails to satisfy the first clause, and the tile’s being scarlet is too specific since it fails to satisfy the second. As a first pass, causal sufficiency may be understood in terms of c’s entailing e in the relevant background circumstances via some causal law (à la Strevens, 2008). Of course, developing an account along these lines (for example, extending it to cover non-deterministic cases of causation) is no trivial task. However, my aim in this paper is not to defend any particular characterization of proportionality, but to address a problem which arises for any characterization. Moreover, the problem I focus on pertains to the idea that causes are general, irrespective of how their sufficiency is spelled out.

As for proportionality’s role, the most straightforward idea is that proportionality is a necessary condition: if c causes e, then c is proportional to e.Footnote 3 Again, I will focus on this proposal for concreteness. Putting it together with the schematic characterization of proportionality above yields the following principle:

Proportionality: Causes are maximally general sufficers for their effects.

Now, many have thought that this ‘winner-takes-all’ approach is overly demanding: in many cases, two events seem to cause the same effect even though one is more general than the other (Yablo, 1992b: 420, Bontly, 2005: 340, List & Menzies, 2009: §5, McDonnell 2017: §5.1, Vaassen, 2022: §3.2). For example, the window’s shattering may be caused both by Suzy throwing the rock and by Suzy throwing something heavy.

Various less demanding proportionality principles have been suggested. Some approaches characterize proportionality in such a way that an event and its lower-level realizer (like Suzy’s throwing something heavy and Suzy’s throwing the rock) may each be proportional to the same effect (e.g. List & Menzies, 2009, Woodward, 2018). Another idea is that proportionality must be balanced against other factors, such as naturalness (Yablo, 2003): causes are then those events which constitute local maxima, or whose overall eligibility exceeds some threshold (perhaps with respect to some degreed characterization of proportionality). Suzy’s throwing the rock and Suzy’s throwing something heavy may each constitute local maxima, or may each exceed the relevant threshold. Another approach relativizes proportionality—and correspondingly, causation itself—to some contextually salient domain of contrast events (Yablo, 1992b; Touborg, 2022). On this view, Suzy’s throwing the rock may be proportional relative to some default context, whilst invoking the more general event of Suzy’s throwing something heavy brings it into relevant domain, making it proportional in the new context. (This would still be ‘winner takes all’ within any given context, though different events ‘win’ in different contexts.) Finally, proportionality might not constrain causation at all, but rather causal explanation: either explanations must cite proportional causes, or those which do are (ceteris paribus) better (Weslake, 2010; Woodward, 2018). On this view, Suzy’s throwing the rock and Suzy’s throwing something heavy are each causes of the window’s shattering, though invoking one of them may be more explanatory (depending on how proportionality is characterized).

Although the simplicity of winner-takes-all is appealing, some combination of these alternative ideas may well prove to be on the right track. For the purposes of this paper, however, it doesn’t matter exactly how proportionality constrains causation(/causal explanation). As discussed shortly, some version of the problem I address arises whichever role it plays, and my proposed solution carries over.Footnote 4

Before considering this problem, it is worth emphasizing the attractiveness of the idea that proportionality (somehow) constrains causation.

Firstly, the kind of intuition elicited by Sophie is not merely a feature of curious philosophical cases but extremely general: it is naturally elicited by almost any ordinary example of causation. Our overall theory of causation should capture intuitions like these if it can.

Secondly, as Yablo (1992a: 274) notes, proportionality naturally falls out from an attractive general conception of causes as difference-makers for their effects (Woodward & Hitchcock, 2003; Strevens, 2004). Difference-making relations are highly sensitive to levels of generality: an event can be too specific or too general to be the true difference-maker.

Thirdly, and most importantly, proportionality promises a significant advance with two related and long-sought holy grails in the philosophy of mind and the philosophy of science: understanding mental causation, and understanding (more broadly) the goodness of high-level scientific explanations. Indeed, Yablo (1992a) originally formulated his proportionality principle as a response to Kim’s (1989) exclusion problem. The rough idea is that determinable mental events are eligible to cause behavior, despite the causal sufficiency of their more specific physical determinates, in virtue of being proportional to behavior. More generally, proportionality illuminates the distinctive goodness of the causal explanations provided by high-level sciences, despite the apparent ‘causal completeness’ of fundamental physics: these sciences explain high-level phenomena by citing causes that are proportional to them.

1.2 The disjunction problem

Proportionality appears to recommend implausibly disjunctive causes (McGrath, 1998: 171-3, Shapiro & Sober, 2012: 90 -1, Franklin-Hall, 2016: 566-7). Consider:

(Disjunctive Sophie) Sophie the pigeon is trained to peck at red stimuli and at green stimuli. A scarlet tile is presented. She pecks.

It seems that Proportionality rules out the tile’s being red (call this event ‘R’) as a cause of Sophie’s pecking in Disjunctive Sophie. For the tile’s being red-or-green (R-or-G) is a sufficer for pecking which is more general than R. Hence, R is not a maximally general sufficer. Yet, intuitively, R should be a cause of Sophie’s pecking (whatever we think about R-or-G itself).

And it gets worse: even in the case of Sophie, R seems too specific to be a cause, since there are surely alternative possible sufficers for Sophie’s pecking. For example, Franklin-Hall (2016: 566) imagines that Sophie pecks whenever her chin is tickled. Now consider the event of the tile’s being red or Sophie’s chin being tickled (R-or-T). Assuming that causal sufficiency is closed under disjunction, this disjunctive event suffices for pecking—thus, again, R is not a maximally general sufficer.Footnote 5 Yet, whatever we think about the causal credentials of R-or-T, it seems absurd to claim that its sufficiency excludes R from being a cause of pecking.

More broadly, Proportionality seems to recommend as eligible cause the event constructed by disjoining all possible sufficers for the effect, since this event will always be the maximally general sufficer.Footnote 6 This casts severe doubt on the potential benefits of Proportionality. It renders Proportionality unable to capture the motivating intuitions, recommending instead highly counter-intuitive causes. And it no longer seems to support the high-level causes cited by psychology and other special sciences, since these causes are not similarly repugnant disjunctions.

On less demanding versions of proportionality—those which do not embody a ‘winner-takes-all’ approach—one aspect of the disjunction problem may be avoided: the proportionality of repugnant disjunctions does not automatically exclude the proportionality of intuitive causes. Nonetheless, an important aspect of the problem remains: why aren’t disjunctive candidates like R-or-G and R-or-T proportional—or, if they are, what distinguishes them from bona fide high-level causes?Footnote 7 If proportionality is balanced against other factors, then the difficulty is to provide a factor which outweighs the greater proportionality of disjunctive candidates. (In §4.1 I explain why I don’t think naturalness is the required factor.) If proportionality is relativized to a domain, then the difficulty is to explain why disjunctive candidates are excluded from the relevant domain (at least in typical contexts). If proportionality constrains causal explanation rather than causation, the difficulty is to explain why invoking disjunctive candidates is not more explanatory. We can confidently expect that, on any approach, disjunctions will need to be dealt with somehow.

1.3 Cohesion

Some ways of generalizing events (e.g. from the tile’s being scarlet, S, to R) seem intuitively acceptable whereas others (e.g. from R to R-or-G, and from R to R-or-T) do not. This motivates a formulation of Proportionality which favors R over both S and R-or-G in Disjunctive Sophie, and over both S and R-or-T in Sophie.

A natural approach is to distinguish cohesive from disjunctive events: the acceptable ways of generalizing are those which preserve cohesion (intuitively, those which don’t yield repugnantly disjunctive events).Footnote 8 We can then reformulate Proportionality by restricting it to cohesive events:Footnote 9

Cohesive Proportionality: Causes are maximal cohesive sufficers for their effects.

That is: (i) causes are cohesive sufficers for their effects, and (ii) causes cannot be ‘cohesively extended’ into more general cohesive sufficers for their effects.

Thus, on the assumption that R is itself cohesive, and that R-or-G, R-or-T, and any other sufficers for pecking that are more general than R fail to be cohesive, R is a maximal cohesive sufficer in both Sophie and Disjunctive Sophie. Hence, Cohesive Proportionality does not exclude R as a cause of pecking.

Approaches which do not embody a ‘winner takes all’ assumption may also employ the notion of cohesion to address the disjunction problem. For example, one view is that causes (or those which are invoked by causal explanations) are sufficers which best balance cohesion and generality (Strevens, 2004: 172), or whose balance of cohesion and generality exceeds a certain threshold. Another view is that non-cohesive events are automatically excluded from the relevant domain of contrasts.

Vindicating the required assumptions about which events are cohesive requires some account of cohesion, to which I now turn. I will sketch the idea that cohesion arises from the similarity structure of property-spaces. The point of this paper is not to defend this account in detail, but to argue that the right way to solve the disjunction problem is to employ the notion of cohesion. For this purpose, it suffices to make it plausible firstly that cohesion can be accounted for (in a way that secures the needed verdicts), and secondly that solutions which do without cohesion are inadequate. The rest of the paper takes up these tasks in turn.

2 Accounting for cohesion

2.1 The general approach

Providing an account of the relevant notion of cohesion is no easy task. In the first instance, ‘disjunctiveness’ seems to apply to representation rather than reality. But accounting for cohesion in terms of representation is a non-starter. Any event has some non-disjunctive description: letting ‘gred’ denote the property of being either red or green, R-or-G may be described as the tile’s being gred. Moreover, appealing to ‘ordinary’ language won’t help: the tile’s being either red or orange seems to be a cohesive generalization of R despite its disjunctive ordinary description.

It is tempting to invoke the idea that events (and/or the properties they involve) have some metaphysically privileged description, which reveals their disjunctive/cohesive nature. For example, perhaps they have a disjunctive definition in ‘more natural’ terms (Langton & Lewis, 1998), or a disjunctive ‘essence’ (Skiles, 2016).Footnote 10 However, ‘metaphysical’ forms of disjunctiveness may not coincide with the ‘scientific’ (or ‘nomic’) kind at issue here. Many apparently cohesive events, like R, are multiply realizable and/or involve determinable properties. We might expect the privileged descriptions of multiply realizable events to be vast disjunctions of possible fundamental realizers. Similarly, on some accounts, determinable properties like redness are disjunctions of their determinates, making the events involving them metaphysically disjunctive (Bigelow & Pargetter, 1990; Clapp, 2001). Hence, it is unclear that this metaphysical approach draws the required distinction.

My account is consistent with the metaphysical approach, but avoids any contentious assumptions about the metaphysical analysis of high-level events. As I understand it, the cohesion of events arises from the similarity structure of property-spaces. Possible events vary along well-defined dimensions corresponding to determinable properties.Footnote 11 For example, the event R in Sophie can be compared to possible events which vary with respect to color, such as the tile’s being green, the tile’s being scarlet (S), and the tile’s being red-or-green (R-or-G). Likewise, the event of Jones’s greeting Smith loudly can be compared to possible events which vary with respect to volume, such as Jones’s greeting Smith quietly, Jones’s greeting Smith at 80 decibels, and Jones’s greeting Smith loudly-or-quietly. These dimensions of variation are causally relevant: the color of the presented stimulus makes a difference to whether Sophie pecks, and the volume of Jones’s greeting makes a difference to whether Smith replies.

When comparing possible events along dimensions like these, they are naturally represented as regions within a space whose structure is inherited from a corresponding property-space. For example, we can embed the possible events R, S, and R-or-G within a space structured like color space. The regions of color space correspond to color properties, such as being red, being scarlet, and being red-or-green. Color space’s structure encodes the relations between color properties. These properties stand in generality relations—for example: scarlet is contained within red, and red within red-or-green. And they stand in similarity relations—for example: orange is between red and yellow; scarlet is closer to crimson than to cyan. A corresponding space of possible events inherits this structure. For example, S is contained within R, and S is closer to the tile’s being crimson than to the tile’s being cyan. (Likewise, we can embed possible events involving Jones greeting Smith which vary with respect to volume within a space whose structure is inherited from the structure of the space of volume properties, such as being loud and being 80 decibels.)Footnote 12

Call a space of possible events whose structure is inherited from some corresponding property-space a state-space. The idea is then that state-space structure suffices to distinguish cohesive from disjunctive events within it: the cohesive events are those that correspond to ‘cohesive regions’ of the state-space.

Not all events seem to belong to a state-space, however. Consider the event of the tile’s being red-or-round. Since redness and roundness belong to different property-spaces (color space and shape space), being red-or-round does not belong to any property-space. Or consider the event of the tile’s being red or Sophie’s being green. This event could not be taken to correspond to the red-or-green region of color space: what would distinguish it from the event of the tile’s being green or Sophie’s being red? In order, then, for an event to belong to a state-space, it must satisfy the following (somewhat rough) condition: it must correspond to the instantiation of some property(/relation) belonging to an appropriate property-space.

Let the events corresponding to cohesive regions in state-spaces be ‘basic cohesive’ events. Then we can say that an event is cohesive iff it is either basic cohesive or the conjunction of some basic cohesive events. Hence, there are two ways an event can be disjunctive. Events which correspond to a region in a state-space can be disjunctive by corresponding to a non-cohesive region (like R-or-G). And events which do not correspond to any region in a state-space (like the tile’s being red-or-round) are disjunctive unless they are the conjunction of some basic cohesive events. R-or-T is disjunctive in the latter way: it does not belong to any state-space, nor is it a conjunction of basic cohesive events.Footnote 13

An adequate development of this approach must answer three questions:

  1. i)

    How does the similarity structure of a property-space determine cohesion?

  2. ii)

    How is the similarity structure of property-spaces determined?

  3. iii)

    How are properties divided into property-spaces?

The rest of this section briefly takes up these questions. Providing comprehensive answers is well beyond my scope here; instead, my discussion will aim to indicate some available resources and the direction in which I expect the answers to lie.Footnote 14

Before proceeding, it is worth noting that this approach focuses exclusively on the properties involved in causes. Nonetheless, it also has consequences for the involvement of objects. The basic cohesive events involve objects instantiating cohesive properties (presumably at times/time-intervals), with cohesive events formed from these by conjunction. Firstly, this disallows generalizing to object-free ‘existential events’, such as from the tile is red to something is red: such events are not conjunctions of basic cohesive events. Secondly, it allows for abstraction with respect to the object involved, such as from the tile is red to the left half of the tile is red.Footnote 15

2.2 Cohesion from similarity structure

How we define the notion of a cohesive region within a property-space precisely will depend on the particular structure of the space in question. But two principles serve as an initial guide. Firstly, the points of a space (corresponding to the most specific properties it represents) are cohesive. Secondly, the cohesion of a region is preserved by extending it ‘continuously’ (where the particular structure of the space determines the relevant notion of continuity.) Thus, the cohesive regions of a space are generated by continuously extending its points.Footnote 16

In continuous topological spaces, for example, cohesion is naturally captured by connectedness: intuitively, a region is connected when it does not consist of entirely separate regions.Footnote 17 Take the space of real numbers. The connected regions in this space are all and only the intervals: for example, (0,1) and (2,3) are each connected regions, but their union is not. Color space is naturally conceived as a continuous topological space (sometimes depicted as a cylinder, with dimensions corresponding to hue, saturation, and brightness). Given this structure, intuitively cohesive color properties such as being scarlet and being red will count as cohesive (since they correspond to connected regions) and intuitively disjunctive color properties such as being red-or-green will count as disjunctive (since they correspond to disconnected regions). Hence, in the corresponding state-space of possible colored tile presentations in Sophie, S and R are both cohesive, and R-or-G is disjunctive.

The connectedness criterion is only a starting point. Something else is needed for discrete spaces, where only individual points are connected. And even in continuous spaces, we might want a degreed notion on which highly irregular and spread out regions (consisting, for example, of two islands connected by a narrow bridge) are to be counted as less cohesive. If the space has metric structure—a measure of the degree to which any two points within it are similar—then many measures of cohesion are definable. But nothing as rich as metric structure is needed: viable notions of cohesion are definable given only betweenness structure (which points of the space lie between which others) or graph-theoretic structure (which points of the space are directly connected i.e. nearest neighbors).Footnote 18

By way of illustration, consider a discrete property-space representing a simplified spectrum of political persuasions. It has five points, representing far-left, center-left, center, center-right, and far-right. It seems implausible that this space has determinate metric structure: for example, is center-left more similar to far-left or to center? Nonetheless, it has betweenness structure—center-left is between far-left and center—and graph-theoretic structure—far-left is directly connected to center-left but not to center. This allows us to define a region as cohesive iff: (i) it contains any point which lies between two of its points (corresponding to ‘convexity’); or (ii) every point within it is (directly or indirectly) connected to every other (corresponding to ‘path-connectedness’). On either of these notions, for example, being far-left-or-center-left is cohesive, whereas being far-left-or-center is not.

Ultimately, I don’t expect there to be a single correct way of measuring cohesion, nor for there to always be determinate verdicts about whether, or to what extent, an event is cohesive. What I do expect—and what I think solving the disjunction problem requires—is that whenever an event is intuitively too disjunctive to be a cause, there is a suitable notion of cohesion which (when plugged into the approach I have outlined) delivers this verdict. To the extent that cohesion turns out to be indeterminate and/or relative to a context or background model, we should expect causal claims to inherit these features.

2.3 Similarity structure and property-spaces

This approach to cohesion relies on similarity structure pertaining to high-level determinable properties such as color, volume, etc. This structure is sparse in that it only pertains to an elite class of high-level properties: arbitrary gerrymandered properties do not belong to the relevant kind of property-space. It is also autonomous in that it does not flow directly from the structure of fundamental physics: given the multiple realizability of the relevant high-level properties, it is unlikely that fundamental physics alone distinguishes cohesive from disjunctive properties.Footnote 19

By relying on sparse, autonomous high-level similarity structure, my approach acquires the burden of accounting for this structure. Needless to say, I cannot give an account of it here. However, as I argue in §3, it is very plausible that there is such structure; hence, this burden should be viewed as a research project rather than a problem. Moreover, it is important to emphasize that cohesion facts do not require particularly rich similarity structure. We certainly don’t need ‘global’ facts about the relative similarity of utterly unrelated properties (is redness more similar to squareness than to loudness?!) All we need are facts about which properties belong to the same property-space (redness and blueness do, redness and squareness do not), and ‘local’ facts about the structure within a given space. As discussed above, facts about ‘continuity’ within a given property-space, as captured for example by topological and/or graph-theoretic features, suffice to exclude many properties as disjunctive.

If we wish to avoid taking property-spaces and their similarity structure as primitive, there are promising avenues for reduction to explore. To my mind, the most plausible approach is a functionalist one, according to which property-spaces are both populated and structured according to the nomic roles of their properties. On this approach, properties belong to the same property-space in virtue of sharing a schematic nomic role. For example, colors are unified by their role in robust regularities characterizing their interaction with light and visual systems, and political persuasions are unified by their role in robust regularities connecting them to voting behavior and to socio-economic factors. The fruitfulness of sciences dedicated to uncovering these nomic roles is testament to this underlying unification.

Nomic roles don’t only unify properties into property-spaces: they also unify the various instances of the properties themselves. Red objects interact with light and visual systems in similar ways, whilst red-leaning people tend to share voting behaviors and socio-economic backgrounds. This suggests that similarity structure within property-spaces should also be determined by nomic role. Roughly speaking, the similarity between red and orange is determined by similarity in the ways their instances interact with light and visual systems, and the similarity between staunch conservativeness and moderate conservativeness is determined by similarity in the voting behaviors and socio-economic backgrounds of their instances.Footnote 20

3 In defense of cohesion

In this section, I defend my proposed solution to the disjunction problem by arguing that cohesion is not objectionably mysterious: there are good reasons to think that there are cohesion facts, and that we have epistemic access to them when evaluating causal relations.

3.1 Cohesion isn’t naturalness

It is sometimes suggested that the disjunction problem may be solved by appeal to naturalness: the basic idea is that disjunctive events like R-or-G and R-or-T are excluded because they are (in a sense which may be fleshed out in various ways) unnatural. Indeed, I suspect that reactions to the disjunction problem more or less divide between two camps: naturalness-users, who think that it is straightforwardly solved by appeal to naturalness, and naturalness-sceptics, who think that appeal to any ‘metaphysical’ posit like naturalness is methodologically suspect. I address naturalness-users below, arguing that cohesion is a better tool for the job (§4.1). But first, I should address naturalness-sceptics—they are likely to find my appeal to cohesion instead of naturalness little more than a rebranding exercise.

The first thing to note is that cohesion and naturalness are distinct. Those who recognize naturalness facts can think of cohesion as one aspect of, or a contributing factor to, naturalness. It is standard to suppose that the disjunctiveness of a property detracts from its naturalness. For example, being-green-or-blue is less natural than being-green. There is more to naturalness than cohesion, however: conjunctions and negations are also thought to detract from naturalness. For example, although they both seem cohesive, having-charge-and-mass and not-having-charge are each less natural than having-charge. Whereas conjoining natural properties detracts from naturalness, conjoining cohesive properties preserves cohesion.

The question is whether the reasons to be skeptical of naturalness extend to cohesion. Franklin-Hall (2016) and Blanchard (2020) note that resorting to naturalness will strike some defenders of proportionality as an overly ‘metaphysical’ solution, where this appears to mean: offends against empiricist tendencies in the philosophy of science. To my mind, such concerns are misplaced: pace ostrich empiricism, avoiding any appeal to naturalness (or similar resources) in our theorizing about science is neither sustainable nor desirable.Footnote 21 But I won’t press the point here—there is a genuine challenge for the naturalness-user in the vicinity: how are we to account for naturalness (especially, high-level naturalness) in a way which vindicates its alleged role in science? In particular, how does naturalness get its grip on theorizing about causation? In the absence of a satisfying answer, appealing to naturalness seems objectionably mysterious and ad hoc.Footnote 22

The question here is whether the analogous challenge can be met in the case of cohesion. I will now argue that it can be: the similarity structure of property-spaces provides a firm foundation for cohesion in scientifically respectable resources. Hence, even naturalness-sceptics should be happy to appeal to it.Footnote 23

3.2 Similarity structure isn’t mysterious

In §2, I argued that the cohesion facts needed to implement Cohesive Proportionality can be accounted for in terms of similarity structure on property-spaces. I will now argue that it should be uncontroversial that there is such structure, and that we have epistemic access to it. Hence, cohesion is in good metaphysical and epistemic standing.

Structured property-spaces play a crucial role in special sciences: they are posited by the sciences themselves, and exploited in discovering and describing simple generalizations. For example, the rationality of ubiquitous extrapolation practices—in which simple curves are fitted to finite data samples—crucially relies on high-level similarity structure: without it, there is nothing with respect to which these curves are genuinely simple.

For a particularly vivid illustration, consider neuronal tuning curves, widely used in psychophysics for characterizing how neurons’ firing rates vary with environmental features—such as the orientation, shape, or color of a visual stimulus.Footnote 24 The extrapolation of tuning curves from data relies on a parametrization representing the relevant features of the stimulus. For example, we parametrize stimulus orientation by using numbers to represent angles, allowing us to mathematically describe its relationship to neural activity. This parametrization is chosen to reflect the relevant high-level similarity structure.

Without a structured color space, for example, psychophysicists would be unable to describe how the various neurons in Sophie’s visual cortex respond to the color of the presented stimulus (e.g. these neurons respond most strongly to this shade of red). Take some set of points encoding how stimulus color and firing rates covary across some number of trials. This data is evidence for a particular relationship between color and neural firing, as described by a neuronal tuning curve. But an infinite number of curves fit the data, so it can only be evidence for a particular relationship given some way of comparing the simplicity of various alternatives. This comparison relies on some background structure embedding the properties in question (as encoded by the axes used): we can make curves more or less complex by choosing suitably gerrymandered axes, which do not reflect the real similarity structure.

This particular illustration is especially powerful since it covers the rich variety of environmental features to which neurons are sensitive. But the general point extends to all high-level sciences which investigate the nomic relationships between properties—think of temperature vs. reaction rates in chemistry, predator vs. prey populations in biology, and supply vs. demand in economics. Describing such relationships using graphs and equations relies on the high-level similarity structure of property-spaces. Moreover, the point extends beyond formal science to mundane inductive inferences: traffic vs. arrival time, music volume vs. anger of neighbor, time in oven vs. state of dinner. The crude rough-and-ready estimations which enable us to navigate daily life can only be reliable if we track the similarity structure of the relevant properties, with respect to which certain hypotheses stand out from the many consistent with our evidence.

Thus, the vision behind Cohesive Proportionality is that our intuitive inferences about difference-making causation recruit the same evaluations of high-level similarity structure which are implicit in inductive inference more broadly. Our ability to routinely and reliably perform such inference provides excellent reason to think that we must be tracking the kind of structure which suffices to support cohesion facts. Even naturalness-sceptics should view this structure as unmysterious. If it turns out that cohesion cannot be accounted for without appeal to naturalness, the lesson is not that this structure is objectionable after all, but rather that naturalness-sceptics must abandon their scruples.

3.3 Cohesion is useful

So far, I have argued that the cohesion of events can be accounted for in terms of scientifically respectable resources, and thus that cohesion is not mysterious (in the way that naturalness is alleged to be). Let me briefly add a further consideration supporting my appeal to cohesion: it is not an ad hoc notion wheeled in to rescue proportionality, but can (and in my view, should) be put to useful work elsewhere.

First, avoiding disjunctive explanations is an issue for any account of high-level explanation, whether tied to proportionality or not. It seems that something about the generality or abstractness of these explanations makes them good, where this must be distinguished from the spurious kind of generality associated with disjunctive explanations. Relatedly, avoiding disjunctive causes seems desirable for any account of causation, irrespective of one’s attitude towards proportionality. Given an account in terms of counterfactuals, covering laws, or energy transfer, it is easy to construct disjunctive events from those counted as genuine causes, and hard to discount them from being causes themselves without cohesion.Footnote 25

Further afield, making sense of genuine similarity between objects seems to require distinguishing cohesive and disjunctive properties: sharing of disjunctive properties, like grueness and scarlet-or-cyanness, does not make for genuine similarity in the way that sharing of cohesive properties like redness does. This naturally extends to events: only cohesive events make for genuine similarity between the situations or worlds in which they occur. These similarity facts seem required in turn to make sense of counterfactuals: if we want to account for claims about what would have happened in counterfactual situations—which surely, for many reasons, we do—then we likely need to evaluate similarity between situations.

The upshot is that even an ardent naturalness-sceptic, with the least interest in defending proportionality, has good reason to recognize the notion of cohesion that suffices to insulate proportionality from the disjunction problem. Independently of this particular application, accounting for cohesion is a worthwhile (and tractable) research project.

4 Doing without cohesion?

This final section considers attempts to avoid the disjunction problem without cohesion, based on naturalness, interventionism, and contrastivism. I argue that none are adequate as they stand: to do the work of Cohesive Proportionality (or some similar principle), they must be supplemented by appeal to cohesion or the kind of similarity structure which, by my lights, underlies it.

4.1 Naturalness

As mentioned above, it is sometimes suggested that the disjunction problem may be solved by appeal to naturalness. Indeed, this was Yablo’s own recommendation: “proportionality is not pursued at all costs but traded off against naturalness” (2003: 326). The rough idea is that disjunctive events such as R-or-G and (presumably even more so) R-or-T are less natural, and this damages their eligibility as causes. So whilst proportionality pushes us towards general, and hence disjunctive, causes, naturalness pushes us back towards suitably non-disjunctive causes.

One obvious challenge for this proposal is that it requires an account of the naturalness of events (presumably via an account of the naturalness of corresponding properties) which vindicates the idea that intuitively disjunctive events are less natural. It is far from obvious how such an account would go. For example, Lewis (1986b: 61) suggested that naturalness is given by length of definition in fundamental terms: the shorter the definition, the more natural the property. But this approach faces a serious challenge: multiple realizability makes it unlikely that naturalness neatly correlates with simplicity of definition. I don’t want to lean on this particular criticism, however. Various alternative approaches are available to the naturalness-user.Footnote 26 Besides, I have not provided a complete account of cohesion, and such an account may ultimately rely on high-level naturalness facts itself.

So let’s grant the naturalness-user some (suitably non-mysterious) account of high-level naturalness. Even so, this is the wrong tool for solving the disjunction problem. The reason is that naturalness pulls apart from cohesion, in both directions: relatively unnatural events may be suitably cohesive, and objectionably disjunctive events may be relatively natural.

The first kind of case arises when a sufficer may be generalized in a way which reduces naturalness but preserves cohesion. The comparative lack of naturalness does not compromise the more general event’s eligibility to cause (as a comparative lack of cohesion would). For example, suppose a weighing scale flashes whenever something between 1 and 2 kg is placed on it. A 1.3 kg mass is placed on the scale, and it flashes. The mass’s being 1.3 kg and the mass’s being between 1 and 2 kg are each sufficers for the flash. Taking complexity of fundamental definition as a heuristic guide, the former event is plausibly more natural than the latter: it is defined by reference to a single plausibly perfectly natural property (being 1.3 kg), whereas the latter is defined by reference to two such properties and the betweenness relation on masses. But this comparative lack of naturalness does not seem to tell against the latter’s eligibility to be a cause. This is plausibly because it involves no loss of cohesion.Footnote 27

The second kind of case arises when an intuitively cohesive sufficer may be generalized in a way which enhances naturalness but destroys cohesion. The availability of a more general and more natural sufficer does not exclude the more specific event (as it would if the more general sufficer were cohesive). For example, suppose we discover that the property of being red-or-green is far more natural than expected: there are two fundamentally different kinds of photon, and all and only red-or-green things absorb the first kind of photon whilst reflecting the second kind.Footnote 28 Presumably, the newly apparent naturalness of this property would extend to the disjunctive event R-or-G. But this wouldn’t seem to make it any more palatable as a cause of Sophie’s pecking: it seems that R-or-G remains disjunctive in the relevant sense, and hence R remains an eligible cause.Footnote 29

Thus, relatively unnatural events may be eligible causes, if they are cohesive, and relatively natural events may be ineligible, if they fail to be cohesive. Whilst disjunctive events tend to be unnatural, it is their disjunctiveness, not their unnaturalness, which prevents them from being causes.

4.2 Interventionism

According to interventionism, causal explanation aims at ‘identifying interventions that would have changed the explanandum’ (Blanchard, 2020: 634). Woodward (2018) and Blanchard (2020) have recently argued that interventionism yields a ‘metaphysically lightweight’ understanding of proportionality which avoids the disjunction problem. The basic idea is that explaining Sophie’s pecking in terms of R (rather than more specific or more general events) best identifies those interventions which would have prevented pecking: namely, interventions on the tile’s redness.

Interventionists use ‘structural equation models’ to represent causation (discussed in more detail in the appendix). These models represent events as variables taking values, and causal relations via equations connecting the variables. For example, a simple model of Sophie has three binary variables: RED, with values 1/0 representing R/ the tile’s being some color other than red; TICKLE, with values 1/0 representing Sophie’s chin being tickled/not being tickled; and PECK, with values 1/0 representing Sophie’s pecking/not pecking. The equation ‘PECK = max(RED, TICKLE)’ represents the fact that Sophie pecks just in case the tile is red or her chin is tickled.

Intuitively, citing disjunctive events does a poor job of identifying explanandum-changing interventions. For example, suppose we combine the binary variables RED and TICKLE into a single binary variable LUMP, taking value max(RED, TICKLE). Explaining Sophie’s pecking in terms of R-or-T, represented by LUMP = 1, fails to pin down explanandum-changing interventions: if the tile is red, then the way to prevent pecking is to change the tile’s color, but if Sophie’s chin is tickled, then it is to stop the tickling.

This suggests that LUMP = 1 is too general to explain Sophie’s pecking because it is equivalent to a disjunction, RED = 1 or TICKLE = 1, only one of whose disjuncts (RED = 1) is relevant to the actual pecking.Footnote 30 We can cash out relevance as follows: a variable X is relevant to an event Y = y just in case there is some non-actual value of X such that intervening to set X to that value (and doing nothing else) would prevent Y = y.Footnote 31 For example, in Sophie, RED is relevant to PECK = 1 since setting RED to 0 and doing nothing else would prevent pecking, whereas TICKLE is not relevant since intervening to set it to its only non-actual value, 1 (i.e. tickling Sophie’s chin) would overdetermine pecking rather than preventing it.

We can then capture the interventionist idea as follows:

Specificity: X = x is too general to explain Y = y just in case X = x is equivalent to some disjunction X1 = x1 or X2 = x2 … or Xvn = xn, where X1 is relevant to Y = y but X2, …, Xn are not.Footnote 32

However, Specificity overshoots: it only excludes disjunctive causes like R-or-T at the cost of excluding intuitively cohesive causes like R. The problem is that even cohesive events are equivalent to many disjunctions. For example, RED = 1 is equivalent to the disjunction X1 = 1 or X2 = 1, where

X1 = 1 iff RED = 1 & it’s raining in Cambridge; 0 otherwise;

X2 = 1 iff RED = 1 & it’s not raining hard in Cambridge; 0 otherwise.Footnote 33

Now imagine that, while the experiment in Sophie takes place, it’s raining hard in Cambridge. X1 is relevant to pecking: intervening to change its value from 1 to 0 (and doing nothing else) involves removing the red tile, which would prevent pecking. But X2 is not relevant to pecking: its actual value is 0, and setting it to 1 would not prevent pecking. Hence, by Specificity’s lights, just as citing R-or-T fails to identify whether RED or TICKLE is relevant to pecking, citing R fails to identify whether X1 or X2 is relevant. A more appropriately specific explanans for Sophie’s pecking must mention the Cambridge weather!

It might be objected that X1 is not relevant since some (rather convoluted) ways to intervene on it prevent the rain in Cambridge, and this kind of intervention would not prevent pecking. But they involve ‘doing something else’, in addition to intervening on X1: they would also change the value of X2, from 0 to 1. Indeed, the situation of RED vis-à-vis X1 and X2 perfectly parallels that of LUMP vis-à-vis RED and TICKLE. Intervening on RED to change its value from 1 to 0 (and doing nothing else) will prevent pecking. But there are (convoluted) interventions on RED which would not prevent pecking: changing the stimulus color whilst tickling Sophie’s chin. These do not count against RED’s relevance, since they involve doing something else: changing the value of TICKLE, from 0 to 1.

Of course, there is an intuitive difference here: changing TICKLE from 0 to 1 in the course of intervening on RED seems like genuinely ‘doing something else’, whereas changing X2 from 0 to 1 in the course of intervening on X1 does not. But the challenge lies in spelling out the requisite notion of something else (without relying on implicit appeal to cohesion or the like). If we were willing to rest on intuitive differences, we could have excluded LUMP = 1 from the start for being intuitively disjunctive!

The problem as I see it lies in allowing variables like X1 and X2 with values representing disjunctive events. For example, X2 = 0 represents the event of either the tile not being red or its raining hard in Cambridge. Unsurprisingly, interventionism goes awry when applied to such variables, since the corresponding ‘interventions’ are not appropriately targeted: there are genuinely distinct ways of bringing about a disjunctive event. Although my discussion has focused on one proposal, the point clearly generalizes: it is difficult to see how interventionism itself could address the disjunction problem without relying (perhaps implicitly) on some way of restricting variables.

4.3 Contrastivism

Contrastivism is the view that causation is a four-place relation involving merely possible events as causal and effectual contrasts (e.g. Schaffer, 2005). For example, in the case of Sophie, the following contrastive claims hold:

  1. (i)

    The tile’s being red rather than not red caused pecking rather than not pecking.

  2. (ii)

    The tile’s being scarlet rather than crimson did not cause pecking rather than not pecking.

  3. (iii)

    The tile’s being scarlet rather than cyan caused pecking rather than not pecking.

  4. (iv)

    The tile’s being red rather than Sophie’s chin being tickled did not cause pecking rather than not pecking.

Shapiro and Sober (2012) contend that the disjunction problem shows that proportionality is misguided, and that the intuitions which supported proportionality in the first place may be explained away using contrastivism. In Sophie, for example, we judge R to cause pecking in contexts where (i) is the salient contrastive claim, and we judge S not to cause pecking in contexts where (ii) is the salient contrastive claim. However, relative to other possible contrasts, such as those in (iii) and (iv), S is a cause and R is not. Contra proportionality, then, difference-making is not simply a matter of how cause relates to effect, but rather a matter of the difference between the cause and some salient contrast(s) making the difference between the effect and some salient contrast(s).

Unlike the other two proposals I have considered, the idea here is not to cure disjunctivitis but to prevent it: instead of providing some countervailing pressure against grotesquely disjunctive causal claims (via naturalness or interventionist specificity), this proposal rejects the idea which created the pressure to prefer such claims in the first place. Now, this is not directly a challenge to my main thesis that proponents of proportionality should solve the disjunction problem by appeal to cohesion, since I am presupposing that some version of the proportionality idea is worth holding onto. However, if contrastivism shows that this presupposition is unmotivated, that would certainly make my thesis less interesting. Thus, I want to close by suggesting that, even if we adopt a contrastivist framework, the motivations for proportionality—and hence, the need to solve the disjunction problem—remain.

First, a dialectical point: avoiding disjunctive causes is everyone’s problem. Granted, the issue is especially serious for proponents of proportionality (at least on a winner-takes-all approach): disjunctive causes threaten to exclude ordinary causes. But this more serious version of the problem may be solved the same way: whatever excludes disjunctive causes also reinstates ordinary causes. Moreover, for contrastivists who wish to avoid disjunctive causes, cohesion remains a valuable resource. It would be odd, then, to take the disjunction problem to motivate abandoning proportionality in favor of a view on which a closely related problem arises and is naturally solved in the same way.

Set this aside, however: let’s assume that there are good reasons to adopt contrastivism, and consider whether this undermines proportionality in itself. It is important to note that, since proportionality is a constraint on a two-place relation, it is not obvious how it should even be formulated in a contrastivist framework. On a ‘linguistic’ approach, we might tie proportionality directly to causal claims, as the thesis that ‘c causes e’ is only true when c is proportional to e. However, on a natural implementation of contrastivism, ‘c causes e’ is true in a context C iff for all causal and effectual contrasts c* and e* which are salient in C, c rather than c* causes e rather than e*. I grant that, as Shapiro and Sober maintain, there are contexts in which non-proportional causal claims are true (given this contrastivist semantics). For example, there are contexts in which ‘S caused pecking’ is true, since the only salient contrastive claim is (iii) (e.g. because the question at issue is whether pecking is caused by scarlet or cyan tiles).Footnote 34

However, this leaves open another (more interesting) formulation of the proportionality thesis, which ties it to causal theorizing rather than ordinary language. The idea is that there is a natural, theoretically important two-place relation, in the vicinity of ordinary causal claims, which respects proportionality. In particular, there is the relation defined in terms of the objective similarity structure of corresponding state-spaces as follows: c proportionally causes e iff, for all the nearest contrasts c*, c rather than c* causes e rather than ~ e.Footnote 35 This relation deserves its name since, if c ‘proportionally causes’ e, then c is a sufficer for e which cannot be cohesively extended into a more general sufficer (since all ways of cohesively extending it would include some nearest alternative).Footnote 36 Thus, assuming a cohesion requirement on contrastive causation, proportional causes are maximal cohesive sufficers.

Shapiro and Sober may be ‘radical contrastivists’, who hold that the only theoretically important relation is four-place, and there is no such two-place relation corresponding to any objective restriction on contrasts. However, the motivations for a proportionality constraint in the orthodox setting also motivate a more moderate contrastivist view which recognizes proportional causation.

First, the intuitions about cases like Sophie show that, at least in typical contexts, nearby alternatives are salient: it is the salience of the tile’s being crimson which drives the intuition that S is not a cause. This suggests that proportional causation lies in the vicinity of our ordinary causal thinking. Second, the popular idea that causes are difference-makers is vindicated by the failure of nearby alternatives to yield the effect, in a way that it is not by the failure of faraway alternatives. The difference between c and some distant alternative c* is not a good way to measure c’s impact on e, since distant alternatives which preserve or prevent e can (almost) always be found. Third, and related to the last point, recognizing proportional causation allows contrastivists to capture the objective goodness of high-level explanations. Being true with respect to any old contrasts would not suffice to make the causal claims provided by special sciences explanatory; being true with respect to nearby contrasts in corresponding state-spaces does. Objective similarity-structure supports the explanatory credentials of high-level causes (for example, by privileging (i) over (iv)) whilst undermining those of their low-level realizers (for example, by privileging (ii) over (iii)).

There is, then, good reason for contrastivists to recognize proportional causation: doing so allows them to explain our intuitions, vindicate the difference-making idea, and capture the objective goodness of high-level explanations. Moreover, since similarity-structure may be outweighed or overridden by more ‘subjective’, context-sensitive factors, moderate contrastivists needn’t hold that ‘c causes e’ is only true if c proportionally causes e. They can have their cake and eat it too: they can hold that, as a linguistic matter, there are contexts in which non-proportional causal claims are true, whilst, as a theoretical matter, there is an important two-place relation of difference-making, explanation-backing causation which respects proportionality.

5 Conclusion

The idea that proportionality somehow constrains causation (or causal explanation) is an attractive one, but it faces the disjunction problem. I have argued that this problem should be avoided by supplementing proportionality with a cohesion constraint, understood in terms of the similarity structure of property-spaces. This notion is not objectionably mysterious, and we should not hope to avoid the disjunction problem without it.