1 Introduction

In multi-candidate elections, voters are sometimes better off voting for a candidate who is not their most preferred, since doing so may prevent a less preferred candidate from winning. In these situations, strategic voters, those who maximize their expected payoff from voting, will have an incentive to misalign their votes.

Strategic voting has received significant attention from both the theoretical and empirical literature. Existing theories by and large build on the rational voting paradigm.Footnote 1 In these models, voter utility depends only on the election outcome, and strategic behavior is dictated by the likelihood that a single vote alters the outcome. Little consideration is given to how other motives for voting, for example ethical and expressive ones, may impact voting behavior,Footnote 2 even though empirical evidence suggests such concerns play an important role (Blais et al. 2000; Coate and Conlin 2004; Funk 2010; DellaVigna et al. 2016; Pons and Tricaud 2018; Spenkuch 2018). Some scholars have studied turnout decisions in the presence of ethical motives (Feddersen and Sandroni 2006a, b) and their findings are more consistent with voting data than previous theories (Coate and Conlin 2004). Thus, going beyond instrumental motives seems fruitful in the study of strategic voting in multi-candidate elections.

In this paper, we explore strategic voting in a model where voters have ethical, expressive, and instrumental concerns. In particular, we consider an election with three candidates and a large electorate. There are two “centrist” candidates and an “extremist” candidate. The electorate is comprised of a minority of extremist supporters and a majority of centrist supporters, who prefer either of the centrist candidates to the extremist. In sum, the extremist is the Condorcet loser, though he has sufficient support so that misaligned voting among centrist voters is necessary for his defeat.

Centrist supporters care both about the election outcome and who they actually vote for. These are the instrumental and expressive concerns, respectively. Centrist voters derive heterogeneous expressive benefits from voting for the two centrist candidates. Moreover, an unknown fraction of centrist supporters are ethical or “rule-utilitarian” in the spirit of Harsanyi (1977) and Feddersen and Sandroni (2006a, b). These voters follow a voting rule that maximizes the aggregate welfare of centrist voters.Footnote 3 Such a rule may prescribe some ethical voters to misalign their votes. In determining the degree of vote misalignment, ethical centrists trade off lower expressive benefits for a higher probability of centrist victory.

Our model is tractable and delivers comparative statics of misaligned voting with respect to electoral parameters such as the relative importance of instrumental and expressive concerns, and the popularity of the extremist candidate. We show that, when the popularity of the extremist is low, ethical voters are willing to sacrifice their expressive benefits and coalesce behind one of the two centrist candidates. On the other hand, if the extremist’s popularity is sufficiently high, ethical voters refrain from misaligned voting. In this case, the extremist’s victory is certain.

The main insight of our model is that the degree of misaligned voting is non-monotonic in the popularity of the extremist. Misaligned voting is increasing for moderate popularity levels. However, ethical centrists have no incentive to engage in misaligned voting when the extremist’s popularity is too high (but still remains a minority). Intuitively, when the support for the extremist increases, the probability of a centrist victory decreases. Ethical voters partially offset this by increasing misaligned voting insofar the loss in expressive utility is lower than the gain in instrumental utility. This is the case when the extremist’s popularity is below some threshold. When the extremist’s popularity is sufficiently high, misaligned voting does not improve the probability of victory sufficiently to compensate for the loss in expressive benefits. Therefore, ethical voters revert to sincere voting.

Empirical evidence so far has failed to yield a consistent verdict on the relationship between misaligned voting and the popularity of the Condorcet loser.Footnote 4 This may be due to the difficulty in identifying misaligned voting,Footnote 5 the lack of exogenous variation in electoral parameters, and differences in electoral contexts. While there is a lack of causal evidence on how misaligned voting responds to the popularity of the Condorcet loser, the sometime conflicting correlational evidence found across contexts could be the result of a non-monotonic relationship between vote misalignment and extremist support.

In addition to the non-monotonicity result, we show that misaligned voting is increasing in the importance of the election for centrists, as measured by the weight on the instrumental component of their utility. This result follows intuitively from the trade-off between instrumental and expressive motives faced by ethical voters. This conclusion fits one of the findings in Spenkuch (2018), who shows that more voters abandoned non-contender candidates in the critical 2005 German federal election, which followed a non-confidence vote in the Parliament, compared to the less important 2009 election.

This paper complements previous theoretical studies of strategic voting. In these models, voter behavior is predominantly driven by pivotal vote considerations. Seminal works by Palfrey (1989), Cox (1994, 1997) and Myerson and Weber (1993) identify conditions for misaligned voting. In their studies, voters either abandon all but two candidates (i.e., Duvergerian equilibrium) or split their votes in equilibrium.Footnote 6 More recent works by Piketty (2000) and Myatt (2007) provide a theoretical foundation for partial misaligned voting by taking into account incomplete information. The latter argues that misaligned voting is increasing in the popularity of the Condorcet loser.Footnote 7 Our paper contributes to this literature by presenting an alternative approach that incorporates non-instrumental motives for voting.

Our model also contributes to an emerging theoretical literature on how ethical, or “rule utilitarian”, behavior affects political outcomes. In two seminal papers, Harsanyi (1977, 1992) developed the concept of “rule utilitarian” and applied it to study voter turnout. Feddersen and Sandroni (2006a, b) build on Harsanyi’s idea and present a seminal theory of turnout in large elections where voters are ethical and have heterogeneous preferences. Using data from Texas liquor referenda, Coate and Conlin (2004) find support for the predictions of Feddersen and Sandroni (2006a, b). More recently, studies have used ethical motivations to account for other behaviors. For example, Piolatto and Schuett (2015) use ethical voters to explain the demand for political news. Our study contributes to this literature by extending the ethical voter framework to the analysis of strategic voting.

In a contemporaneous paper, Bouton and Ogden (2018) explore a model of multi-candidate elections with ethical voters. Our approaches and results differ in key aspects. In our model, ethical voters face expressive considerations, which generates the key trade-off in our framework. Also, Bouton and Ogden (2018) predict that misaligned voting is increasing in the extremist’s popularity while our novel observation is that this relationship is non-monotonic.

The rest of this paper is organized as follows. Section 2 describes the model. Section 3 presents our main results. Section 4 discusses several extensions to the model. Section 5 concludes.

2 The model

We consider a model of large elections with three candidates. To approximate a large election, we assume a unit continuum of voters. Voting is costless,Footnote 8 and the winner of the election is decided by plurality. For convenience, we refer to one of the candidates as the “extremist” and to the other two as the “center-left” and the “center-right.”

There is a known measure \(k_{e}\) of voters who will vote for the extremist.Footnote 9 This simplifying assumption allows us to focus on the strategic incentives of centrist voters and it is found in other models of strategic voting (see for example Myerson and Weber 1993; Myatt 2007). For the problem to be non-trivial, we impose that \(\frac{1}{3}<k_{e}<\frac{1}{2}\). The upper bound ensures that the extremist supporters are a minority (i.e., the extremist candidate is the Condorcet loser). The lower bound implies that some misaligned voting by the centrist supporters is necessary to defeat the extremist. That is, the extremist will win with certainty if centrist supporters evenly split their votes among the two centrist candidates.

The centrist supporters, of measure \(1-k_{e}\), prefer either centrist candidate to the extremist. Specifically, centrist supporters obtain utility \(w>0\) if either centrist candidate wins, and receive zero utility if the extremist wins. Thus, w is the instrumental value of a centrist victory. In future discussions, we also refer to w as the importance of the election. The assumption of an homogeneous w is made to simplify exposition. In Sect. 4.1, we show that our insights hold when centrist voters have different values of w for the two centrist candidates.

In addition to instrumental utility, centrist supporters derive expressive utility from voting for the two centrist candidates. Formallly, centrist voters have “bliss points” that are uniformly distributed on the interval [0, 1], and the center-left and center-right candidates are located at 0 and 1, respectively. A voter’s expressive utility for voting for a candidate is determined by the distance between the voter’s bliss point and the location of the candidate. A voter with bliss point \(x\in [0,1]\) (voter x for short) receives an expressive benefit of \(\theta (1-x)\) for voting for the center-left and \(\theta x\) for voting for the center-right.Footnote 10 The scalar \(\theta \) measures the intensity of expressive motive for voting.

In sum, voter x’s expected (personal) utility of voting for the center-left and the center-right are, respectively, \(wp+\theta (1-x)\) and \(wp+\theta x\), where p is the probability that a centrist candidate wins the election. Note that if voter x acts based on her personal utility, she votes for the center-left if \(x<\frac{1}{2}\) and for the center-right otherwise.Footnote 11 We refer to this as sincere voting.

So far, we have incorporated both instrumental and expressive motives for voting. We also assume that each centrist voter is ethical or “rule-utilitarian” with probability \(q_{c}\). The precise value of \(q_{c}\) is unknown, but it is common knowledge that \(q_{c}\) is drawn from the uniform distribution on [0, 1]. As in Harsanyi (1977) and Feddersen and Sandroni (2006a, b), ethical voters follow a voting rule that maximizes the expected aggregate welfare of centrist supporters.Footnote 12 In Sect. 4.5, we consider the alternative setting in which centrists are divided into two ethical voter groups, the center-right and center-left, based on voters’ bliss points, and each group follows their own ethical voting rule.Footnote 13

Without loss of generality, we focus on voting rules that are defined by a threshold on the interval [0, 1].Footnote 14

Definition 1

A voting rule is a cut-off \(\sigma _{c}\in [0,1]\) that instructs ethical voters with bliss points \(x \ge \sigma _{c}\) to vote for the center-right and those with bliss points \(x<\sigma _{c}\) to vote for center-left.

The ethical voting rule, \(\sigma _{c}^{*}\), is the voting rule that maximizes the expected aggregate welfare of centrist voters. That is,

$$\begin{aligned} \sigma _{c}^{*}\in \text {argmax}_{\sigma _{c}}wP(\sigma _{c})+\int _{0}^{1}q_{c}\cdot B_{E}(\sigma _{c})+(1-q_{c})\cdot B_{S}dq_{c} \end{aligned}$$
(1)

where

  • \(P(\sigma _{c})\equiv \text {Pr}\left[ q_{c}\max \left\{ \sigma _{c},1-\sigma _{c}\right\} +\frac{1}{2}(1-q_{c}) \ge \frac{k_{e}}{1-k_e}\right] \) is the probability of a centrist victory.Footnote 15

  • \(B_{E}(\sigma _{c})\equiv \int _{0}^{\sigma _{c}}\theta (1-x)dx+\int _{\sigma _{c}}^{1}\theta xdx\) is the aggregate expressive benefit of the ethical centrists.

  • \(B_{S}\equiv \int _{0}^{1}\max \{\theta x,\theta (1-x)\}dx=\frac{3}{4}\theta \) is the aggregate expressive benefit of those centrists who vote sincerely.

The ethical voting rule \(\sigma _{c}^{*}\) exists since the objective function is continuous on a compact domain. Note that the objective function is symmetric around \(\frac{1}{2}\): the expected aggregate welfare under \(\sigma _{c}\) is the same as that under \(1-\sigma _{c}\). Thus, ethical voting rules always come in pairs. For simplicity, we restrict attention to the ethical voting rule \(\sigma _{c}^{*}\) in the interval \([0,\frac{1}{2}]\). In a more general setting where centrist voters have heterogeneous instrumental utilities, the ethical voting rule is generically unique (see Sect. 4.1). Alternatively, one can introduce uncertainty about which centrist candidate ethical voters will rally for. For example, suppose that an opinion poll takes places before the election in which interviewees express their sincere opinion on who their most preferred candidate is. If the poll sample is unbiased, both centrist candidates will be the most popular centrist with probability equal to one half. It can then be assumed that, if ethical voters misalign their vote, they will do so in favor of the most popular centrist candidate according to the poll.Footnote 16

There is misaligned voting when \(\sigma _{c}<\frac{1}{2}\). Here, ethical voters with bliss point \(x\in [\sigma _{c}^{*},\frac{1}{2}]\) vote for the center-right instead of their most preferred candidate, the center-left. We refer to \(m\equiv \frac{1}{2}-\sigma _{c}^{*}\) as the degree of misaligned voting.

3 Results

To simplify exposition, we define \({\tilde{k}}_{e}\equiv \frac{k_{e}}{1-k_{e}}\). Since \({\tilde{k}}_{e}\) is a monotonic transformation of \(k_{e}\), it retains the interpretation of the popularity of the extremist. Note that the restriction on \(k_{e}\) translates to \(\frac{1}{2}<{\tilde{k}}_{e}<1\).

Proposition 1 below states that ethical voters misalign their votes when the extremist has moderate levels of popularity but they vote sincerely if the extremist is sufficiently popular (but still a Condorcet loser).

Proposition 1

There exists a threshold on\({\tilde{k}}_{e}\), denoted\({\bar{k}}_{e}\), that is less than 1, such that

  • if\({\tilde{k}}_{e}<{\bar{k}}_{e}\), ethical voters misalign their vote (i.e. \(\sigma _{c}^{*}<\frac{1}{2}\) ). In particular,\(\sigma _{c}^{*}<1-{\tilde{k}}_{e}\).

  • if\({\tilde{k}}_{e}>{\bar{k}}_{e}\), ethical voters vote sincerely (i.e. \(\sigma _{c}^{*}=\frac{1}{2}\) ),

  • if\({\tilde{k}}_{e}={\bar{k}}_{e}\), then ethical voters are indifferent between voting sincerely and misaligning their votes.

In deciding whether to vote against their preferences, the ethical voters balance the instrumental value of a centrist victory and the expressive benefit of voting. Misaligned voting improves the probability of a centrist victory but there is an opportunity cost in terms of foregone expressive benefits for ethical voters with bliss points \(x\in [\sigma _{c}^*,\frac{1}{2}]\). These voters derive greater expressive utility from voting for the center-left but are instructed by the rule to vote for the center-right. Generally, a minimum level of misaligned voting is needed for it to change the election outcome. For example, when \({\tilde{k}}_{e}=\frac{3}{4}\), at least a quarter of the ethical centrists need to misalign their votes in order for centrists to have a positive probability of victory. Thus, there is a fixed cost associated with (effective) misaligned voting. When the extremist’s popularity is not too high (i.e., \({\tilde{k}}_{e}<{\bar{k}}_{e}\)), ethical voters are willing to bear this fixed cost and misalign their votes to increase the chance of a centrist victory. When the extremist’s popularity is very high (i.e., \({\tilde{k}}_{e}>{\bar{k}}_{e}\)), the fixed cost for misaligned voting becomes prohibitively high.Footnote 17 Ethical voters are unwilling to bear this cost and they will vote sincerely.

The following corollary describes how the threshold \({\bar{k}}_{e}\) depends on w (i.e., the importance of the election) and \(\theta \) (i.e., the intensity of expressive benefits).

Corollary 1

The threshold\({\bar{k}}_{e}\)is increasing inwand decreasing in\(\theta .\)Specifically, it is decreasing in the ratio\(\frac{\theta }{w}\).

Thus, the threshold at which ethical voters switch from misaligned voting to sincere voting is increasing in the instrumental value of voting and decreasing in the expressive value of voting. This is intuitive given our previous discussion on the trade-off faced by ethical voters. When ethical voters care more about defeating the extremist (i.e., higher values of w), they are more willing to bear the cost of misaligned voting. On the other hand, when the intensity of expressive motive increases (i.e., higher values of \(\theta \)), ethical voters face a greater cost of voting against their true preferences. Hence, they are less willing to engage in misaligned voting.

Two empirical implications emerge from the previous results. First, misaligned voting is more likely to occur in elections in which the outcome has a large impact on public welfare. Second, misaligned voting is more likely to arise when centrist voters are relatively homogeneous in ideology or when there is little differentiation between centrist candidates in platforms or valence. In these cases, the foregone expressive benefit due to voting against preference should be low.

The next set of results provides a more detailed characterization of misaligned voting.

Proposition 2

Full misaligned voting (i.e. \(\sigma _{c}^{*}=0\)) occurs if and only if\(\frac{1}{2}+\frac{\theta }{8w}\le {\tilde{k}}_{e}\le 1-\frac{\theta }{16w}\).

The proposition states that, conditional on misaligned voting being optimal (i.e., \({\bar{k}}_{e}=1-\frac{\theta }{16w}<{\tilde{k}}_{e}\)),Footnote 18 ethical voters fully misalign their votes when the extremist is sufficiently popular (i.e., \(\frac{1}{2}+\frac{\theta }{8w}\le {\tilde{k}}_{e}\)). Intuitively, if the extremist’s popularity is low, the marginal benefit of misaligned voting is low since centrists are already in a strong position. Therefore, ethical voters would only engage in partial misaligned voting (i.e., \(\sigma _{c}^*>0\)).Footnote 19 Note that the condition \(\frac{1}{2}+\frac{\theta }{8w}\le {\tilde{k}}_{e}\le 1-\frac{\theta }{16w}\) holds only if \(\frac{\theta }{w}\le \frac{8}{3}\). In other words, a necessary condition for full misaligned voting is that the importance of the election is sufficiently high or the intensity of expressive benefits is sufficiently low.

Figure 1 below maps out regions in the space of electoral parameters (i.e., \({\tilde{k}}_{e}\) and \(\frac{\theta }{w}\)) where misaligned or sincere voting occurs.

Fig. 1
figure 1

Ethical voting rule

Proposition 3 shows that the degree of misaligned voting is increasing in the importance of the election and the extremist’s popularity, and is decreasing in the intensity of expressive benefits.

Proposition 3

When misaligned voting is optimal (i.e.\({\tilde{k}}_{e}<{\bar{k}}_{e}\)), the degree of misaligned voting,\(m(w,\theta ,{\tilde{k}}_{e})\equiv \frac{1}{2}-\sigma _{c}^{*}\), is increasing inwand\({\tilde{k}}_{e}\), and is decreasing in\(\theta \).

Proposition 3 together with Proposition 1 implies a non-monotonic relationship between misaligned voting and the popularity of the extremist. The degree of misaligned voting is increasing in the extremist’s popularity, \({\tilde{k}}_{e}\), conditional on \({\tilde{k}}_{e}\le {\bar{k}}_{e}\). However, once the extremist’s support exceeds \({\bar{k}}_{e}\), ethical voters revert to sincere voting.

The comparative statics of misaligned voting with respect to w and \(\theta \) are intuitive and in line with Corollary 1. When w increases, centrist supporters are more concerned with defeating the extremist. Therefore, ethical voters have a stronger incentive to misalign their votes. The intensity of the expressive benefit, \(\theta \), determines the opportunity cost of misaligned voting. When \(\theta \) increases, so does the cost. This leads to less misaligned voting.

Proposition 4 below presents comparative statics of the (ex-ante) probability of a centrist victory with respect to the electoral parameters.Footnote 20 We show that the probability of a centrist victory is increasing in the importance of the election, and decreasing in both the intensity of the expressive benefit and the extremist’s popularity.

Proposition 4

The probability of a centrist victory given the ethical voting rule,\(P(\sigma _{c}^{*})\), is increasing inwand decreasing in\(\theta \)and\({\tilde{k}}_{e}\).

The intuition behind the comparative statics with respect to \(\theta \) and w is straightforward. We have shown that the degree of misaligned voting is increasing in w and decreasing in \(\theta \). Since the probability of a centrist victory is increasing in misaligned voting, it must be increasing in w and decreasing in \(\theta \). On the other hand, the effect of an increase in the extremist’s popularity is less obvious. It decreases directly the likelihood of a centrist victory, but the ethical voters may have a greater incentive to misalign their votes (see Proposition 3). It happens that the ethical voters’ response is of a second order effect. Hence, the probability of a centrist victory is decreasing in the extremist’s popularity.

3.1 Empirical evidence

In the real world, voter decisions are driven by multiple considerations (e.g., expressive, ethical, and instrumental), which can be difficult to disentangle empirically. Survey data provide some insights into voters’ motives, but, due to its self-reported nature, it is not necessarily conclusive. As voter ideological positions are not directly observable and must be inferred, distinguishing between different motivations is not trivial. For example, Degan and Merlo (2009) show that the hypothesis that voters vote sincerely is not falsifiable when using single election data. Moreover, when using data for the same electorate across elections, the number of elections needs to be greater than the number of dimensions of the policy space for the hypothesis to be testable.

In spite of these obstacles, there is evidence that ethical and expressive concerns drive voter behavior in large elections. Coate and Conlin (2004) show that an structurally estimated group rule-utilitarian model provides a good fit of Texas liquor referenda data. Using the same data, Coate et al. (2008) find that a simple model based on expressive motives provides a better fit than a pivotal-voter model. Moreover, Pons and Tricaud (2018) find support for expressive motives by showing how the presence of a third candidate decreases the vote share of the two leading candidates in French parliamentary elections. DellaVigna et al. (2016) uses a field experiment to infer that, in the 2010 Congressional election, voters in the Chicago suburbs had an expressive value of voting (voting “to tell others”) of close to US $15.

While the expressive and ethical motives appear to be important, instrumental concerns remain a factor. Voting models that incorporate multiple considerations may provide a better fit of electoral data. Spenkuch (2018) takes advantage of Germany’s electoral system to estimate that close to two thirds of voters do not behave according to the pivotal voter model while close to a third do not behave according to expressive considerations.Footnote 21

While existing evidence suggests that various motives play a role in voter behavior, it does not offer a direct validation of our model. An ideal experiment to falsify our main result would be one in which the support for the extremist (or Condorcet loser), \(\frac{1}{3}<k_e<\frac{1}{2}\), varies exogenously across multiple electoral constituencies in the same election.Footnote 22 Our model predicts a non-monotone relationship between \(k_e\) and the degree of misaligned voting. Ceteris paribus, this should translate into a non-monotone relationship between \(k_e\) and the ratio of votes of the most-voted centrist party to the votes of the least-voted centrist party. Unfortunately, finding a plausibly exogenous variation in \(k_e\) is difficult. Extremist support depends on unobservable socio-demographic characteristics, which may be correlated with other key parameters such as the fraction of ethical voters and the intensity of the expressive motive.Footnote 23

Existing studies do not offer a conclusive answer as to whether our model fits electoral behavior due to the lack of causal evidence on how misaligned voting responds to the popularity of the Condorcet loser. Most of the empirical evidence on the matter is suggestive at best. For example, using survey data in English constituencies, Fisher (2000) finds a positive correlation between misaligned or “tactical” voting and the margin of victory. Since the margin of victory should be increasing in extremist support, one can interpret this finding as a positive correlation between misaligned voting and the popularity of the extremist. On the contrary, Blais and Nadeau (1996) use Canadian data to find that close races (i.e., races with small margins of victory) are positively correlated with more misaligned voting. Fujiwara (2011) also finds support for a negative relationship between misaligned voting and extremist support. Using data from Brazilian mayoral elections, he provides causal evidence on the effect of a single-ballot versus a dual-ballot plurality system and finds that the top two candidates experience a bigger drop in their vote shares due to a change to a dual-ballot system in areas with close elections. This means that more misaligned voting occurs in marginal elections. Using both voting and survey data from multiple UK elections, Kiewiet (2013) finds that, in some elections, strategic voting increases with the closeness of the election while in others it decreases. The lack of a consistent message may reflect the difficulty in casually identifying misaligned voting, the lack of exogenous variation in electoral parameters, and differences in electoral contexts. But it may also suggest that there is a non-monotonic relationship between vote misalignment and extremist support.

Finally, regarding the effect of the relative importance of instrumental motives, it is worth noting that the findings in Spenkuch (2018) support our result in Proposition 3. Using German data, the author finds that voters were more likely to abandon non-contender candidates in the critical 2005 federal election than in the less important 2009 election. This supports our conclusion that misaligned voting is increasing in the importance of the election.

4 Extensions

4.1 Heterogeneity in instrumental utility

In the benchmark model, centrist supporters receive the same instrumental utility, w, regardless of which centrist candidate wins. In this section, we explore an extension of our model where centrists derive heterogeneous instrumental benefits from victory, and the benchmark results are shown to hold qualitatively.

Keeping other elements of the benchmark model the same, let the centrists’ instrumental utilities for the centrist candidates be described by two measurable functions \(w_{l}:[0,1]\rightarrow {\mathbb {R}}_{++}\) and \(w_{r}:[0,1]\rightarrow {\mathbb {R}}_{++}\). Specifically, \(w_{l}(x)\) and \(w_{r}(x)\) are the instrumental utilities that voter x receives when the center-left and the center-right candidate wins, respectively. Note that this formulation allows for very general correlation between the instrumental and expressive motives, e.g., voters with strong expressive motive also have strong instrumental motive. The ethical voting rule takes into account the distribution of instrumental utilities among the centrists. The aggregate welfare for centrists is now:

$$\begin{aligned} \int _{0}^{1}q_{c}\cdot B_{E}(\sigma _{c})+(1-q_{c})\cdot B_{S}dq_{c}+P(\sigma _{c})\cdot {\left\{ \begin{array}{ll} \int _{0}^{1}w_{l}(x)dx &{} \text { if }\sigma _{c}>\frac{1}{2}\\ \int _{0}^{1}w_{r}(x)dx &{} \text { if }\sigma _{c}\le \frac{1}{2} \end{array}\right. } \end{aligned}$$
(2)

where \(B_{E}(\sigma _{c})\) and \(B_{S}\) are defined in formula (1). Unlike in the benchmark setting, centrists’ aggregate welfare depends on whether the center-left or the center-right wins. Specifically, if \(\sigma _{c}>\frac{1}{2}\), then the center-left always obtains more votes than the center-right. In this case, the aggregate instrumental utility is given by \(\int _{0}^{1}w_{l}(x)dx\). Similarly, if \(\sigma _{c}<\frac{1}{2}\), the center-right always obtains more votes than the center-left, and the aggregate instrumental utility is \(\int _{0}^{1}w_{r}(x)dx\).

In the benchmark model, voting rule \(\sigma _{c}\) induces the same aggregate welfare as \(1-\sigma _{c}\). That is, ethical voters are indifferent between which centrist candidates to misalign their votes for. In the current setting, ethical voters prefer to misalign their vote for the candidate whose victory gives the highest average instrumental utility. Formally,

Proposition 5

The ethical voting rule \(\sigma _{c}^{*}\) is in the interval \([0,\frac{1}{2}]\) if \(\int _{0}^{1}w_{r}(x)dx>\int _{0}^{1}w_{l}(x)dx\) and is in the interval \([\frac{1}{2},1]\) if \(\int _{0}^{1}w_{r}(x)dx<\int _{0}^{1}w_{l}(x)dx.\)

This result is intuitive. As in the benchmark case, the expressive component of the welfare function is symmetric in \(\sigma _{c}\), and the direction of misaligned voting under the ethical voting rule can only depend on the distribution of instrumental utilities. Clearly, it is optimal for ethical voters to misalign their votes for the candidate whose victory induces the highest aggregate instrumental utility.

Except for breaking the indifference between voting rules, ethical voters face the same trade-off as in the benchmark setting. Indeed, treating \(\max \{E(w_{l}),E(w_{r})\}\) as w, one may proceed with the analysis as before. The characterization of the ethical voting rule and the comparative statics with regard to \(\theta \) and \({\tilde{k}}_{e}\) would be the same as in the benchmark case. The comparative statics with respect to the importance of the election also go through given that one uses \(\max \{{\mathbb {E}}(w_{r}),{\mathbb {E}}(w_{l})\}\) in place of w.

4.2 Turnout decision

In this section, we consider an extension of the model where centrist voters face stochastic turnout costs.Footnote 24 Let centrist voters be indexed by a tuple (xt) (the voter’s type), where x is the voter’s bliss point, and \(\theta t>0\) is the voter’s turnout cost. We let the intensity of expressive motives \(\theta \) be a scaling factor only to simplify notation; it is not material for the results. Because voting is now costly, centrist voters may choose to abstain. Formally, the (personal) utility of a type (xt) voter is now given by:

$$\begin{aligned} \begin{array}{cc} p\cdot w &{} \text { if abstain}\\ p\cdot w+\theta (1-x-t) &{} \text {if vote for center-left}\\ p\cdot w+\theta (x-t) &{} \text {if vote for center-right}. \end{array} \end{aligned}$$

We assume that voter types are drawn uniformly from \([0,1]\times [0,1]\). In the benchmark setting, we imposed that \(\frac{1}{2}\le {\tilde{k}}_{e}\le 1\) so the problem is non-trivial. Here, abstentions due to turnout costs means that the extremist can win even if his vote share is less than \(\frac{1}{3}\). Thus, we need to revise the restriction on \({\tilde{k}}_{e}\) to \(\frac{3}{8}<{\tilde{k}}_{e}<1\). The new lower bound for \({\tilde{k}}_{e}\) reflects the fact that a quarter of non-ethical centrists will abstain.Footnote 25

The aggregate welfare of the centrists depends on the expressive benefits and turnout costs. A priori, it is not clear what the form of voting rules is e.g., whether they are thresholds. Thus, we broaden the definition of a voting rule, which is now a pair of sets \(L,R\subset [0,1]\times [0,1]\), where voters of type \((x,t)\in L\) vote for the center-left, voters of type \((x,t)\in R\) vote for the center-right, and voters of type \((x,t)\in (L\cup R)^{c}\) abstain. Given this, the aggregate welfare of centrists can be written as:

$$\begin{aligned} wP(L,R)+\int _{0}^{1}q_{c}\cdot C_{E}(L,R)+(1-q_{c})\cdot C_{S}dq_{c} \end{aligned}$$

where

  • P(LR) is the probability that a centrist candidate wins the election given the voting rule.

  • \(C_{E}(L,R)=\int _{(x,t)\in L}\theta (1-x-t)dxdt+\int _{(x,t)\in R}\theta (x-t)dxdt\) is the per-capita expressive utility net of turnout cost faced by ethical voters under the voting rule.

  • \(C_{S}=\int _{\max \{1-x,x\}>t}\theta (\max \{1-x,x\}-t)dxdt\) is the per-capita expressive utility net of turnout cost faced by non-ethical (i.e., sincere) voters.Footnote 26

Interestingly, even though voter type is two-dimensional, the ethical voting rule can be characterized by a one-dimensional threshold. As in the benchmark model, we assume, without loss of generality, that ethical voters misalign their votes for the center-right.Footnote 27

Proposition 6

The ethical voting rule is characterized by a cut-off\(0\le \sigma _{c}^{*}\le \frac{1}{2}\)such that voter (xt)

  • votes for the center-right if\(\sigma _{c}^{*}\le x\)and\(t\le 1-2\sigma _{c}^{*}+x\).

  • votes for the center-left if\(x<\sigma _{c}^{*}\)and\(t<1-x\).

  • abstains otherwise.

Figure 2 below provides an illustration of the ethical voting rule. In general, voters who misalign their votes under the ethical voting rule fall into one of two categories: (1) voters who would turnout and vote for the center-left based on their personal utility, and (2) voters who would abstain based on their personal utility. The opportunity cost of misaligned voting for voters in the first category is the difference between the expressive benefit of voting for the center-left and that of voting for the center-right. This depends only on the voter’s bliss point and is equal to \(1-2x\). On the other hand, the opportunity cost of misaligned voting for voters in the second category is \(t-x\), which is the voter’s turnout cost minus the expressive utility from voting for the center-right. To maximize aggregate welfare, the ethical voting rule equates both opportunity costs. This leads to the set of restrictions in Proposition 6.

Fig. 2
figure 2

Ethical Voting Rule

We will not provide a detailed characterization of the ethical voting rule with turnout costs. The structure of the problem (i.e., the objective function for the ethical voters) takes the same form as before and therefore our insights would continue to hold. In particular, the objective function is of the same form as (1) with \(P(\sigma _{c})\), \(C_{E}(\sigma _{c})\) and \(C_{S}\) now defined as follows:Footnote 28

$$\begin{aligned} P(\sigma _{c}) & = {} \max \left\{ 0,1-\frac{{\tilde{k}}_{e} -\frac{3}{8}}{\frac{5}{8}-\sigma _{c}-\sigma _{c}^{2}}\right\} \\ C_{E}(\sigma _{c}) & = {} \frac{\theta }{2}\left( \sigma _{c}-\sigma _{c}^{2} +\frac{\sigma _{c}^{3}}{3}\right) +\frac{\theta }{2} \left( \frac{1}{3}-(1-2\sigma _{c})^{2}- \frac{\sigma _{c}^{3}}{3}-(1-2\sigma _{c})^{2}\sigma _{c}\right) \\ C_{S} & = {} \frac{7\theta }{24}. \end{aligned}$$

Given the objective function, one can verify that our main results (i.e., Propositions 1 and 3 ) hold. In particular, it is still the case that misaligned voting is optimal for moderate values of \({\tilde{k}}_{e}\), and sincere voting is optimal when \({\tilde{k}}_{e}\) is sufficiently high. Intuitively, the trade-off faced by ethical voters is the same as before. A positive level of misaligned voting is required to obtain a centrist victory and this level is increasing in the extremist’s popularity. When the extremist’s support is sufficiently high, the instrumental utility under misaligned voting does not compensate for the loss in expressive benefits. Hence, ethical voters revert to sincere voting.

4.3 Random extremist turnout

In the baseline model, we assumed full turnout by the extremist voters. We relax this assumption in this section. For tractability, we assume that the turnout of extremist voters is a uniform random variable \(\tau \) with support \([0,k_{e}]\). Note that the center-right now wins whenever

$$\begin{aligned} q_{c}\ge \frac{\frac{\tau }{1-k_{e}}-\frac{1}{2}}{\frac{1}{2}-\sigma _{c}}. \end{aligned}$$

We show below that a main observation of the benchmark model—the non-monotonicity of misaligned voting with respect to extremist popularity—continues to hold qualitatively. Specifically, the degree of misaligned voting is non-monotonic in the upper-bound of the extremist turnout if the intensity of expressive motives is sufficiently high relative to the instrumental motive.Footnote 29 If the intensity of expressive motives is low, then the degree of misaligned voting is always increasing in the extremist’s popularity.

Proposition 7

If\(\frac{\theta }{w}\le \frac{1}{2}\), then the degree of misaligned votingmis increasing in the extremist’s popularity (i.e.,\(\frac{1}{2}-\sigma _{c}^{*}\)is increasing in\({\tilde{k}}_{e}\)). If\(\frac{\theta }{w}>\frac{1}{2},\)then there exists a threshold\({\bar{k}}\)such that the degree of misaligned voting is continuously increasing in\({\tilde{k}}_{e}\) if \({\tilde{k}}_{e}\le {\bar{k}}\), and is continuously decreasing in\({\tilde{k}}_{e}\)if\({\tilde{k}}_{e}>{\bar{k}}\).

4.4 General bliss point distribution

In this section, we consider more general distributions of voter ideological positions. Suppose the bliss points of the centrists are distributed according to CDF G over the interval [0, 1]. The two key components of the ethical voter’s objective function, probability of a centrist of victory, \(P(\sigma _{c})\), and the aggregate expressive benefit of the ethical centrist, \(B_{E}(\sigma _{c})\), are now:

  • \(P(\sigma _{c})=\text {Pr}\left[ \max \left\{ q_{c}G(\sigma _{c})+(1-q_{c})G(\frac{1}{2})\ ,\ q_{c}\left( 1-G(\sigma _{c})\right) +(1-q_{c})\left( 1-G(\frac{1}{2})\right) \right\} >{\tilde{k}}_e\right] \)

  • \(B_{E}(\sigma _{c})=\int _{0}^{\sigma _{c}}\theta (1-x)dG(x) +\int _{\sigma _{c}}^{1}\theta xdG(x)\)

Unlike in the benchmark setting, there is no ambiguity about the direction of misaligned voting so long as G is not symmetric around \(\frac{1}{2}\). However, contrary to what may seem obvious, the direction of misaligned voting is not simply pinned down by whether the median centrist voter is left or right leaning (i.e., whether \(G(\frac{1}{2})\) is greater or less than \(\frac{1}{2}\)).Footnote 30 Suppose \(G(\frac{1}{2})\) is less than but close to \(\frac{1}{2}\) (i.e., the center-right has slightly more support than the center-left), but a substantial number of center-left voters have extreme ideologies (i.e., with bliss points close to 0) while a substantial number of center-right voters are moderate (i.e., with bliss points close to \(\frac{1}{2}\)). It could be optimal for the ethical voters to align for the center-left candidate since it is less costly for the center-right voters to misalign their votes because of their moderate stance. However, we can derive a sufficient condition on the “skewness” of the distribution that ensures the ethical voters strictly prefer to misalign their votes for the candidates with more support. We state the result for the case where the distribution of voters is skewed to the right.

Proposition 8

If\(G(x)\ge 1-G(1-x)\ \forall x\ge \frac{1}{2}\)and the inequality is strict for\(x=\frac{1}{2}\). Then, the ethical voters have a strict incentive to misalign their votes for the center-right candidate (i.e., the ethical voting rule\(\sigma _{c}^{*}\)is strictly less than\(\frac{1}{2}\))conditional on misaligned voting being optimal.

Unfortunately, the comparative statics with respect to various parameters are no longer straightforward given a general distribution of bliss points. Indeed, even the concavity of the ethical voter’s objective function is no longer guaranteed.

One way to allow for more generality while preserving tractability is to allow a general distribution of bliss points for the ideological voters but maintaining that the ethical voters’ bliss points are uniformly distributed.Footnote 31 In this case, the direction of misaligned voting would be pinned down by the location of the median ideological voter. Specifically, letting \(G_{id}\) be the CDF of the bliss points of ideological voters, then ethical voters prefer to misalign their votes for the center-right (left) candidates iff \(G_{id}(\frac{1}{2})<(>)\frac{1}{2}\). Intuitively, ethical voters prefer to misalign their votes for the candidates with the higher support from the ideological voters since there is no cost differential for them in terms of misaligning votes in favor of the center-left vs. the center-right. In addition to the uniqueness of ethical voting rule, the comparative statics from the benchmark setting go through. Since the ideological voters’ behavior is fixed, the marginal benefit and marginal cost of misaligned voting by the ethical voters would be independent of the distribution of ideological voters.Footnote 32 Consequently, the ethical voters’ incentives at the margin are unaffected by the distribution of ideological voters, and the comparative statics of misaligned voting with respect to the various electoral parameters are the same as in the benchmark setting.

4.5 Two ethical centrist groups

In this subsection, we depart from the assumption that the ethical voters treat centrist voters as one “group” when optimizing the voting rule. Specifically, we suppose now that there are two ethical voter groups: the center-left, made up of voters with bliss points \(x\in [0,\frac{1}{2}]\), and the center-right, made up of voters with bliss points \(x\in [\frac{1}{2},1]\).Footnote 33 Each center-left and center-right voter is ethical, or “rule-utilitarian”, with probability \(q_l\) and \(q_r\), respectively. As before, the centrist voters, of measure \(1-k_{e}\), prefer either centrist candidate to the extremist. To simplify notation, we re-normalize the value of a centrist victory for the two groups to be 2w.

Let \(\sigma _l\in [0,\frac{1}{2}]\) and \(\sigma _r\in [\frac{1}{2},1]\) be cut-off type voting rules followed by center-left and center-right ethical voters, respectively.Footnote 34 Then, the expected aggregate welfare of center-left and center-right voters are respectively:

$$\begin{aligned} G_l(\sigma _l, \sigma _r)& = {} wP(\sigma _l, \sigma _r) + \int _{0}^{1} q_{l} \left( \int _{0}^{\sigma _l} \theta (1-x)dx + \int _{\sigma _l}^{\frac{1}{2}} \theta x dx\right) +(1-q_{l})\cdot B_{S} dq_{l} \end{aligned}$$
(3)
$$\begin{aligned} G_r(\sigma _l, \sigma _r)& = {} wP(\sigma _l, \sigma _r) + \int _{0}^{1} q_{r} \left( \int _{\frac{1}{2}}^{\sigma _r} \theta (1-x)dx + \int _{\sigma _r}^{1} \theta x dx\right) +(1-q_{r})\cdot B_{S} dq_{r}, \end{aligned}$$
(4)

where

  • \(P(\sigma _l, \sigma _r)\equiv \text {Pr}\left[ \max \left\{ q_l \sigma _l + q_r(\sigma _r - \frac{1}{2}) + \frac{1}{2} (1-q_l) , q_l (\frac{1}{2} - \sigma _l) + q_r(1-\sigma _r) + \frac{1}{2} (1-q_r)\right\} \ge {\tilde{k}}_e\right] \) is the probability of a centrist victory.

  • \(B_{S}\equiv \int _{0}^{\frac{1}{2}}\max \{\theta x,\theta (1-x)\}dx=\frac{3}{8}\theta \) is the aggregate expressive benefit for each centrist group.

Ethical voters are assumed to follow a voting rule which maximize the expected welfare of their group, subject to the behavior of other voters. Hence, as in (Feddersen and Sandroni 2006b), the analysis will focus on ethical rules which satisfy the following consistency requirement:

Definition 2

A pair \((\sigma _l^*, \sigma _r^*)\) is a consistent rule profile if \(G_l(\sigma _l^*, \sigma _r^*) \ge G_l(\sigma _l, \sigma _r^*)\) for all \(\sigma _l \in [0,\frac{1}{2}]\), and \(G_r(\sigma _l^*, \sigma _r^*) \ge G_r(\sigma _l^*, \sigma _r)\) for all \(\sigma _r \in [\frac{1}{2},1].\)

Next, we consider the case that the share of ethical voters is the same in the two groups, i.e., \(q_l=q_r=q_c\), where \(q_{c}\) is drawn from the uniform distribution on [0, 1]. We then consider the case where \(q_l\) and \(q_r\) are independent draws from the uniform distribution on [0, 1].

Common Draw of\(q_l\)and\(q_r\) Suppose that \(q_l=q_r=q_c\), with \(q_c\) drawn from the uniform distribution, we argue the pair (\(\sigma _l^*=\sigma _c^*,\sigma _r^*=\frac{1}{2}\)), which is equivalent to the optimal voting rule in our baseline framework, is a consistent profile. This would imply that our comparative statics would continue to hold.

Proposition 9

\(\sigma _l^*=\sigma _c^*\) and \(\sigma _r^*=\frac{1}{2}\) is a consistent rule profile.

Intuitively, when the fractions of ethical voters are the same for both groups, it is not optimal for both groups to misalign their votes at the same time i.e., whenever one group misaligns their votes, it is optimal for the other group to vote sincerely. It follows that the group that is misaligning their votes faces essentially the same trade-off as the ethical voters in the original framework. Therefore, the patterns of misaligned voting remain the same.

Independent Draws of\(q_l\)and\(q_r\) The case where the fractions of ethical voters in the two groups are uncorrelated is more complex. Specifically, it is no longer straightforward to argue in general that the two groups would not find it optimal to misalign their votes simultaneously. However, we can demonstrate that for a range of parameters, the pair (\(\sigma _l^*=\sigma _c^*,\sigma _r^*=\frac{1}{2}\)) is a consistent profile. Moreover, misaligned voting exhibits non-monotonicity.

Proposition 10

For parameters where \(\sigma _c^*=\frac{1}{2}\) and \(\sigma _c^*<\epsilon \) for some \(\epsilon >0\) small, \(\sigma _l^*=\sigma _c^*\) and \(\sigma _r^*=\frac{1}{2}\) are a consistent rule profile.

Recall that misaligned voting is increasing in \({\tilde{k}}_e\) (i.e.,\(\sigma _c^*\) is decreasing) when \(\sigma _c^*\) is close to 0. At the same time, for sufficiently large \({\tilde{k}}_e\), there is no incentive to misalign votes (i.e., \(\sigma _c^*=\frac{1}{2}\)). Thus, Proposition 10 implies that the non-monotonicity of misaligned voting with respect to extremist support is still present in the current context even though we cannot characterize the consistent rule profiles for all parameters.

5 Conclusion

In this paper, we examine strategic voting in multi-candidate elections when voters have ethical and expressive concerns in addition to instrumental ones. The model is parsimonious and provides a clear mechanism of when and to what degree misaligned voting occurs. We also show how misaligned voting can vary significantly with changes in the importance of the election, the intensity of the expressive motive, and the popularity of the Condorcet loser. The novel insight is that misaligned voting is non-monotonic in the popularity of the Condorcet loser.

Our paper illustrates how ethical and expressive motives can help provide additional insights regarding voting patterns. The model can be the basis for studying other important issues in electoral politics. For example, by adapting the model to a dynamic environment, one can examine the incentives of centrist parties to form coalitions. In general, ethical agent models can prove useful in explaining how individuals act when faced with collective action problems such as political protests. This study also highlights the need for more empirical evidence on misaligned voting. While our results can reconcile some of the existing evidence, more research is needed to test the different implications of our model.