In modern US presidential elections, economic voting strongly impacts election outcomes (Erikson 1989; Fair 1996; Healy and Malhotra 2013; Hibbs 1989; Kramer 1971; Lewis-Beck and Stegmaier 2000; Markus 1992; Tufte 1980; Zaller 2004).Footnote 1 Economic voting refers to voters’ tendency to reward or punish incumbent candidates at the polls based on economic growth preceding the election. The economy also influences election results in other countries (Dassonneville and Lewis-Beck 2017; Lewis-Beck 1988).

Comparative studies of economic voting show that the economy matters across a variety of contexts (Dassonneville and Lewis-Beck 2017, 2019; Singer and Carlin 2013; Lewis-Beck and Ratto 2013). These studies, however, almost exclusively examine modern economies, which have characteristics that may facilitate economic voting. In modern economies, incumbents have the Keynesian toolset to manage the economy. Governments play a large role in economies. They also collect and disseminate measures of the economy. Voters are educated, somewhat economically literate, and exposed to news media coverage of the economy. Most voters are also integrated into a national economy. Are these modern qualities necessary for economic voting to occur? Or, would economic voting occur under non-modern conditions where these factors are absent? Put differently, how minimal are the requirements for economic voting?

In this article, we contribute by examining the extent of retrospective voting when these conditions are generally not met: presidential elections back to 1792 and especially the 1792–1944 period. Using historical economic data, we investigate whether voters have held presidents accountable for the economy throughout US history. Despite these modern conditions not being met, we find evidence for retrospective economic voting all the way back to George Washington. Economic contractions predict incumbents choosing to retire and losing when they did run. We also find that the economy appears to have influenced the extent of these victories, as measured with vote share, but may have done so less when the economy influenced incumbents’ decisions to run again or retire. Our findings contribute to the literature on economic voting by showing that the conditions necessary for retrospective economic voting appear quite minimal.

Context and Economic Voting

Scholars began studying the conditions necessary for economic voting when they started examining it in different national contexts (notably, Lewis-Beck 1988). Subsequent studies have notably tested theories about the conditioning impact of institutions on economic voting (Anderson 2007; Dassonneville and Lewis-Beck 2017; Duch and Stevenson 2008; Powell and Whitten 1993) and about the impact of the integration of the national economy in the world economy (Duch and Stevenson 2008; Hellwig and Samuels 2007).

Nevertheless, we lack a full understanding of the conditions necessary for economic voting. Existing studies either focus on comparing different national contexts or on assessing changes over the post-World War II period (e.g. Dassonneville and Lewis-Beck 2019). By restricting analyses to the past 70 years, these analyses do not tell us whether economic voting requires the conditions that are particular to this period.

Most importantly, we do not know whether economic voting depends on a strongly interventionist government following Keynesian principles. Keynesianism did not become widely accepted among policymakers until after World War II. Most American and comparative studies of economic voting thus focus on the period after World War II, during which governments play a major role in the economy and frequently use Keynesian interventions to limit the severity of economic downturns. It would seem more reasonable for voters to hold incumbents accountable for the economy after the advent of Keynesianism when the government can better intervene to limit downturns.

We also do not know if voters need government-provided information about the economy to engage in economic voting. Governments began systematically collecting and disseminating economic data in the 1930s and 1940s (Coyle 2015). Before then, voters had to rely on anecdotes, media reports of business activity, stock market values, bankruptcies, etc. to gauge the state of the economy. Economists have noted that major economic busts were plainly evident without government data because of news coverage of large firms going bankrupt, especially banks and investment houses (Davis 2006). But slower declines and slower rises in economic activity may have been hard to detect. Even economic news that was available to voters was also less ubiquitous and less national than it has been since World War II.

We also do not know if economic voting requires some level of education. According to a prominent argument, education and information increase voters’ ability to consider short-term factors like the economy when voting (Dalton 1984). Consistent with this argument, more informed voters appear to rely more on the economy (Duch 2001) and have more accurate economic assessments (Krause 1997). However, there is also evidence that less informed voters shift their votes more in response to the economy (Zaller 2004) and that economic voting has not increased in recent decades as education has increased (Dassonneville and Lewis-Beck 2019).

Finally, we do not know the degree to which voters need to be connected to the national economy for economic voting to occur in the aggregate. The economy was much more regional and many voters were probably less tied to national conditions through much of the 1792–1944 period. Transportation costs were higher and markets were consequently more regional. Indeed, per capita income across states converged dramatically starting in the 1940s (Barro and Sala-i-Martin 1992).

By examining economic voting back to the 1790s, we can therefore shed light on the degree to which these conditions are necessary for economic voting. Do voters punish incumbents for economic busts and reward them for booms even under minimal conditions?

Although most research has examined the post-WWII period, several US studies have examined earlier data. Kramer (1971) examines presidential elections back to 1896, finding mixed results. Lynch (1999) and Norpoth (2004) consider economic voting for the president from 1872 to 1996 and find evidence that the economy mattered throughout this period, though Lynch found that it mattered more after World War II, when the government’s intervention in the economy increased considerably. Achen (2016) briefly analyzes the effect of recessions on two early presidential elections. A handful of studies also examine congressional elections back to the 1890s (Kramer 1971; Kiewiet and Udell 1998). However, none of these studies go all the way back to the earliest days of US democracy and so do not fully assess the impact of contextual variables that change over time.

Besides shedding light on the conditions necessary for economic voting, examining pre-WWII economic voting has implications for other important questions in political science. In particular, it may inform discussions about whether economic voting simply reflects “blind retrospection,” where voters reward or punish incumbent politicians for outcomes over which governments have no control (Achen and Bartels 2016). Presidents’ ability to increase growth may always be somewhat limited. However, their ability to do so was presumably more limited before the acceptance of Keynesianism following the Great Depression. If economic voting was as strong before this period as after, it would suggest that voters’ response to the economy does not depend on the magnitude of potential government intervention. Most importantly though, if economic voting was as strong in the earlier period, it would support the conclusions of recent studies showing that economic voting is a pervasive phenomenon (most notably, Dassonneville and Lewis-Beck 2017, 2019).

Were Economic Rationales Available in the Early Republic?

In modern US campaigns, voters are regularly exposed to economic rationales for voting. Incumbent presidents campaign on strong economies—if they are fortunate enough to preside over one—while challengers attack incumbents for any signs of economic weakness (Vavreck 2009). Were economic rationales available to voters in the early years of the United States? To answer this question, we read the scattered and highly informal coverage of elections from newspapers in the first several decades of US elections. We did not conduct a formal content analysis because of the inconsistent availability, inconsistent formats, etc., of newspaper coverage. We did, however, readily find examples of newspaper coverage blaming incumbents for economic contractions all the way back to George Washington.

For example, George Washington faced an economic contraction in his 8th year in office. It began with a land speculation bubble centered around Washington, DC, where speculators hoped to cash in on the construction of the national capital. The real estate bubble collapsed in early 1796 and appears to have started a mild recession. Benjamin Bache’s Aurora General Advertiser directly blamed Washington: under his leadership “our ears are dinned with the tales of bankruptcy, the ruin of our commerce, and the distress of our citizens” (Mann 2002, p. 168). Newspaper coverage of Thomas Jefferson also frequently blamed him for an ongoing economic contraction in 1808—an easy connection to draw because his Embargo Act likely caused the hard times. Newspapers in 1816 similarly connected James Madison to the hard times under his presidency, which stemmed from a financial panic in Britain.

We have compiled quotes illustrating these examples from election-year coverage during the first few decades of US history in the Supporting Information (SI). We think this coverage makes it clear that economic rationales for voting decisions were available to at least some voters during the earliest years of the Republic.

Data

To examine whether economic voting occurred throughout US history, we rely on annual real GDP estimates from Johnston and Williamson (2018), which attempt to measure GDP back to US independence. Other scholars have estimated pre-1930s GDP (McCusker 2000), but Johnston and Williamson (2018) provide the most comprehensive estimates. They rely on decennial census data and then attempt to bridge those once-a-decade estimates with economic series available on a yearly basis. Since they have greater confidence in the estimates of GDP levels than in the year-to-year changes, Johnston and Williamson (2018) caution against using these data for time-series analysis. To address such concerns, we examine whether the findings hold up with alternative economic measures and when we simply categorize years into expansions and contractions. In particular, we check robustness with GDP estimates from the Maddison Project (Bolt et al. 2018), updated versions of NBER recession dating (Davis 2006; National Bureau of Economic Research 2018), data on historical industrial production (Board of Governors of the Federal Reserve System 2018; Davis 2004), and annual qualitative analyses of the economy through 1926 (Thorp and Thorp 1926). Although the magnitude of booms and busts may be imprecise, these indicators agree on whether the economy expanded or contracted in any given year almost 90% of the time (202 of 229 years). With multiple measures of the economy, we could use a measurement error approach, but the measures depend on similar sources to estimate year-to-year changes in between census years (where we have the best estimates of GDP levels), so the measures are not independent.

The electoral data are from the American Presidency Project (Woolley and Peters 2020). For analyses of presidents' decisions to run again, we include the 52 contests in which incumbents did not face 22nd amendment term limits, thereby excluding five elections (1960, 1988, 2000, 2008, and 2016).

Appendix Table 4 provides descriptive statistics for the variables, and Supporting Information (SI) section 2 describes the sources for the data.

The Economy and Incumbent Presidents' Decisions to Run Again

Political scientists usually focus on incumbent (or incumbent party) vote share when studying economic voting. However, if incumbents notice that their popularity depends on the state of the economy, their decision about whether to run again may depend on how the economy is faring. Thus, focusing only on vote share likely understates the effect of the economy.Footnote 2 This is less of a problem for recent decades as most incumbents sought and secured renomination, but more of an issue pre-WWII. However, when considering earlier elections, we consider incumbents’ decisions to run again in addition to vote share.

Based on the historical data described above, GDP growth toward the end of incumbents' terms is a strong predictor of whether presidents managed to retain office throughout US history, that is, whether they chose to run and then, if they did, whether they won. Figure 1 depicts the relationship between GDP growth and presidential electoral outcomes from 1792 to 2016. We calculate GDP growth with the log change, which approximates percent change and addresses the symmetry and additivity problems with percent change (Wetherell 1986). The top shows all incumbents who were eligible to run again. Seventy-one percent of them faced re-election in years of economic growth and 29% in years with recessions. The figure shows that, when GDP contracted, presidents rarely ran and/or won. In the 15 elections occurring during election-year contractions, seven incumbents sought re-election (47%) and only two won (13%). In contrast, in the 37 elections during economic expansions, 31 incumbents sought re-election (84%) and 20 won (54%). Using a chi-squared test, the difference on running is significant at p = 0.006, while the difference on running and winning is significant at p = 0.011.

Fig. 1
figure 1

Election-year GDP change and electoral outcomes, 1790–2016. Note: This figure shows the relationship between election-year GDP change and incumbent presidents' choice to run and, if they did run, whether they were reelected. The top panel shows all incumbent presidents who were eligible to run again. The middle panel shows those who chose to run. The bottom panel shows those who won. The figure shows that, when GDP contracted, presidents rarely ran and/or won. In the 15 elections occurring during election-year contractions, seven incumbents ran (47%) and only two won (13%). In contrast, in the 37 elections during economic growth, 31 incumbents ran (84%) and 20 won (54%). Using a chi-squared test, the difference on running is significant at p = 0.006, while the difference on running and winning is significant at p = 0.011. Only two incumbents with negative growth ran and won: Abraham Lincoln in 1864 and Theodore Roosevelt in 1904—both exceptions proving the rule (see text). For a similar graph with year-3 results, see SI section 2

Presidents therefore rarely sought and won reelection with declining election-year GDP. The two exceptions to this pattern prove the rule: Abraham Lincoln in 1864 likely benefited from Civil War victories (Shafer 2016), while Theodore Roosevelt saw economic recovery set in by the fall of 1904, before voters cast their ballots (Flood 2009; Davis 2006; Thorp and Thorp 1926). Of course, the economy is not the only factor influencing incumbents’ decisions. Almost half of incumbents still left office during economic expansions. Most notably, Millard Fillmore, Rutherford B. Hayes, Lyndon B. Johnson, and Gerald Ford each retired or lost likely for non-economic reasons, ranging from slavery to war to Watergate.Footnote 3

We show the results for all of US history in Fig. 1, but the same patterns hold up when we examine the 1792–1944 period. In fact, all but one of the election-year economic contractions with an eligible incumbent occurred during these earlier years. Using a chi-squared test, the lower rate of running again during economic contractions is significant at p = 0.048, while the lower rate on the combined running and winning variable is also significant at p = 0.048.

To examine the strength of this finding and its consistency over time, we present regression estimates in Tables 1 and 2. Our two dependent variables capture whether an incumbent sought another term and whether they attained one. We code Run Again 1 if the incumbent ran for office and 0 if they retired, while we code Run-Win 1 for a victory and 0 for a retirement or loss. Run-Win is the most comprehensive measure of incumbents returning to power. It accounts for both their decision to run again and whether voters reelected them. Some of those coded as having sought another term may surprise. John Tyler, Millard Fillmore, Franklin Pierce, Woodrow Wilson, Harry Truman, and Lyndon B. Johnson each tried and failed to win renomination (SI section 2, pages 6–9, for citations).Footnote 4

Table 1 Effect of Election-Year GDP Change on Presidents’ Decisions to Run Again and on Whether They Ran and Won, 1792–2016
Table 2 Run again, run-win, and election year GDP growth robustness checks

Table 1 presents regression estimates using linear probability models. The top half of the table shows the 1792–2016 results, and the bottom half 1792–1944. We first discuss the overall results in the top half. Columns 1 and 5 show the bivariate estimates, revealing that GDP growth moderately affects the chances of running again and winning. The run-again coefficient of 0.031 implies that, per 1% increase in GDP growth, the incumbent’s probability of running again rises by about 3%. The bivariate estimated effect on winning of 0.039 in Column 4 suggests that a 1% increase in GDP growth increases the incumbent’s chances of running-winning by about 4%. These coefficients imply that the effects could be substantial: a 5% GDP swing would imply a 15-point change in the probability of running again and a 20-point change in the probability of running-winning. Historically 5% GDP changes were common, occurring in 22% of years between 1790 and 2016. Of course, the GDP growth measure is noisy and the coefficients are imprecisely estimated, though they are generally statistically significant at conventional levels (tests fail to find evidence of autocorrelation).Footnote 5 We employ “bootstrap” standard errors throughout the analysis, but they generally match plain-vanilla standard errors, as shown in Table 2.

We next control for variables that could correlate with growth and/or with the dependent variables. To capture the impact of poor health on a president’s decision to (not) run again, we code a Dies Within One Term variable to 1 when a president died naturally within one term of leaving office. To capture the effect of wars, we subjectively code the variable War 1 for a politically popular war near the election, − 1 for an unpopular conflict, and 0 otherwise based on our reading of election coverage and historical work from these elections.Footnote 6 Given the subjective nature of this variable, we also tried several alternative war codings but the results remained unchanged in part due to the negligible correlation between war and GDP growth.Footnote 7 We also include Party Years in Power, which captures how long a party had continuously controlled the presidency. We also try to account for lengthy periods in US history when one party failed to effectively contest national elections by including the variable Dominant Party, subjectively coding 1 for years in which incumbents faced comparatively weak opposition, − 1 when facing comparatively strong opposition, and 0 otherwise (see SI section 3 for details). Because even strong candidates may lose elections if their party fails to coalesce or significant third-party bids siphon votes, we code Multicandidate 1 if three or more candidates each secured 10% or more of the popular vote. This covariate is likely posttreatment and only relevant for winning. Across a variety of specifications, Table 1 shows that these variables leave the GDP estimate unchanged.Footnote 8

Of course, researchers need little convincing that the economy mattered to presidential elections post-World War II—the evidence is strong. These data suggest, however, that this relationship existed beforehand, before presidents possessed the Keynesian toolset to manage growth. The bottom half of Table 1 addresses this directly by re-estimating these models only for elections from 1792 to 1944. The estimated effects of election-year GDP growth decrease somewhat for both dependent variables but remain substantively large. The estimates for running again do not quite reach statistical significance at conventional levels, but the estimates for run-win do.

In Table 2, we present numerous other robustness checks. The table displays the coefficient, standard error (bootstrapped), r-squared, and n across a variety of specifications. The first set of columns (1–4) shows these estimates for the dependent variable Run Again, and the second set (5–9) shows Run-Win. Rows 1 and 2 reiterate the baseline results from Table 1.

Rows 3–9 show how the results vary across US history. Row 3 shows the estimates for 1948–2016. It does not show run-again estimates because we cannot estimate the run-again models after 1928 because all incumbents ran again, by our coding, unless termed out. Row 4 displays the years before the government formally began measuring GDP, 1792–1928, and row 5 shows 1932–2016. Row 6 displays 1792–1840—capturing the critical early years of the Republic—and row 7 presents 1844–2016. Finally, row 8 presents 1792–1856 estimates and row 9 shows 1860–2016. The estimates for both dependent variables remain notably similar across these timeframes. The GDP change coefficients for Run-Again remain in the range 0.025–0.072. The estimated effect of GDP change on Run-Win ranges between 0.025 and 0.116.

Table 2 next shows that the results are not a function of long-run trends in politics or economics by controlling for time. It uses the baseline model from row 1. The estimates remain nearly identical to the baseline model when we control for the year, year2, and year3 (rows 10–12). Row 13 demonstrates their robustness to outliers: excluding the four election years that had the greatest expansions (1916, 1936) and the deepest contractions (1908, 1932) strengthens the results. Row 14 shows that the estimates are insensitive to excluding any one of the elections—the smallest Run Again coefficient occurs when we exclude the 1908 election, while the smallest Run-Win coefficient occurs when we exclude the 1940 election. We next present two alternative standard errors (row 15, robust standard errors and row 16, regular standard errors), both suggesting statistical significance below conventional levels. Row 17 shows results using probit rather than linear regression.

Next, we test the results using indicators of expansion rather than GDP change. These indicators help address heteroskedasticity and concerns about noise in the GDP estimates. In row 18, we replace the continuous GDP variable with a dummy that captures positive growth (coded as 1) or negative growth (0). Row 19 presents the Davis indicator of expansions (1) and contractions (0), based on estimates of industrial production through 1914 and NBER recession history afterward (Davis 2006; National Bureau of Economic Research 2018). Unsurprisingly, we find that recessions decreased the chances that an incumbent ran again and reduced their chances of reelection if they did run.

To further check robustness, we employ other economic measures, although these share similar sources (SI section 2 describes their methodologies). In row 20 we use the Maddison Project GDP dataset—drawn from numerous estimates but focused on cross-country comparison—and find similar results, albeit suggesting a milder association between GDP and electoral outcomes (Bolt et al. 2018). Next, row 21 shows results based on industrial production indexes (Board of Governors of the Federal Reserve System 2018; Davis 2004). We also use a qualitative measure based on Thorp and Thorp (1926), who provided annual descriptions of the US economy from 1790 to 1926. We had three coders independently rate each year on a 3-point scale (− 1 to + 1) based on Thorp’s descriptions, and use the median score they assigned (Cronbach’s alpha = .84). Although economists have noted flaws in his work (such as a tendency to miss signs of recovery and over-reliance on commodity prices, thereby exaggerating volatility), row 22 shows that his descriptions also predict running and Run-Win, albeit imprecisely. These variables cover somewhat different timespans: the Maddison project starts before the 1804 election, while Thorp’s qualitative descriptions end in 1926, explaining some of the variation in estimates for these measures.

Consistent with studies finding that what matters most to election outcomes is the state of the economy toward the end of an incumbent’s term, row 23 shows that total growth over the previous 4 years matters substantially less than election-year growth. In section 3 of the SI (page 16), we also report estimates for each year individually and the first and second halves of presidential terms. In those estimates, we find that only the election year predicts both run again and run-win. Growth in year 3 is less predictive of incumbents' choice to run again, suggesting less bias from selection for year 3 in analyses of incumbent re-election and vote share. Consistent with less bias from selection, year-3 growth predicts running and winning quite well (consistent with Wlezien 2015). In fact, in the pre-World War II period, year-3 growth predicts run-win as well as year 4 does, including in models that include both. Year-3 and year-4 growth are also considerably more correlated pre-World War II. We therefore report a version of Table 2 for year-3 growth in SI section 3. We expand on this interesting year-3 finding in the next section.

Finally, row 24 assesses whether GDP growth’s relationship with the choice to run again holds up with a more conservative coding of who ran again. In particular, we restrict the run again cases to only those presidents who continued to run after the nomination conventions (using the nomination itself does not work because of incumbents who ran on third party tickets). We thus switch three cases to retiring—Wilson, Truman, and Johnson—rather than seeking further terms. As row 24 shows, the estimate remains identical in size.

Do economic contractions also influence party turnovers? We examine whether the economy affects parties by using incumbent party reelection as the DV, finding largely similar if weaker relationships. We present these results in SI section 3.

Finally, in analyzing this data, we were struck by a pattern that may help explain the unusual peaceful turnover of power in the early Republic. Early presidents faced surprisingly bad luck with the economy. In non-election years, presidents faced recessions in 41% of years from 1792 through 1840. However, in election years during this span presidents faced recessions 54% of the time (1796, 1808, 1812, 1816, 1828, 1836, and 1840). In fact, Washington, Jefferson, Madison, and Jackson each retired during contractions. The only immediate male heir of an early president, John Quincy Adams, attempted to win a second term amid the 1828 contraction and lost, abbreviating the first American dynasty. Martin van Buren also lost reelection in 1840 amidst an apparent recession. These recessions appear to have been generally mild—strong enough to potentially make presidents less popular, but maybe not strong enough to produce widespread dissatisfaction with the fledgling democratic institutions.

The Economy and Vote share

When studying economic voting, researchers usually analyze incumbent (or incumbent party) vote percentage as the dependent variable, not running and winning. Doing so, however, may underestimate the economy's effect on election results if it ignores the economy's impact on incumbents’ decisions to run again. In the previous section, we saw that election-year growth was a strong determinant of incumbents’ decisions about whether to run again up to 1928: only 38% of eligible incumbents sought reelection with negative election-year GDP growth, while 73% did so with positive growth. After 1928, incumbents ran again regardless of the state of the economy, perhaps feeling confident that the government’s Keynesian toolkit would ensure growth in time for the election. Because the election-year economy influenced incumbents’ decisions to run again prior to the war, we expect estimates of the election-year economy's effect on incumbent (and incumbent party) vote share to be smaller in that earlier period.

However, that does not mean the economy did not matter to vote choice in the earlier period. While research on economic voting has found that the election-year economy is the strongest determinant of vote share, the state of the economy in the year before the election also has some impact on vote share (Achen and Bartels 2016; Healy and Lenz 2014, though see Wlezien 2015, which finds an equal effect for the last 2 years of an incumbent’s term). Since we found that the election-year economy influences the choice to run, we would expect it to depress the impact of the election-year economy on vote share prior to World War II, but not depress the impact of the previous-year economy (year 3 of presidents' terms). Furthermore, because incumbents' choice to run again appears to have been less influenced by the economy since World War II, both the state of the economy in the 3rd and 4th years of an incumbent president’s term should influence their vote share in that period, although the latter should matter more (Achen and Bartels 2016; Healy and Lenz 2014).

Starting in 1824, states generally recorded the popular vote. In 1824, however, all candidates ran in the same party and President Monroe chose not to run again, so the election lacked an incumbent candidate. We therefore start our vote-share analysis in the following election, 1828, and consequently miss the critical early years of the Republic.Footnote 9

We begin by analyzing incumbent party vote share and then turn to incumbent candidate vote. Figure 2 presents scatterplots of the incumbent party vote percent by the GDP growth rate, showing this relationship for pre-World War II on the left and post-World War II on the right. The figure shows incumbent party vote for year 3 of incumbents' terms at the top and year 4 at the bottom. We calculate incumbent party vote percent over the two highest vote-getters. Each scatterplot also includes an OLS regression line. The plots reveal a positive relationship in both periods, though, as expected, a noticeably weaker relationship prewar for year 4. Consistent with voter myopia, the impact of year-4 growth exceeds that of year-3 growth during the post-war period.

Fig. 2
figure 2

Economic growth and incumbent party vote share

Table 3 shows Ordinary Least Squares (OLS) regression estimates, with each row presenting a separate regression, as in Table 2. The first set of rows show regressions of incumbent party vote percent on election-year GDP change. These rows show that year-4 GDP change somewhat influenced incumbent vote share throughout US history, but did so more clearly post-World War II when the economy had less influence on incumbents’ decisions about running again. Between 1828 and 1944, a 1% increase in GDP corresponded with a 0.3 increase in incumbent party vote percent. By contrast, after World War II, a 1% increase corresponded with a 1.2-percentage point increase in vote share. The estimates are imprecise, and the pre-World War II estimates are not statistically significant at conventional levels, though close.

Table 3 GDP growth and presidential vote share

Table 3 next examines incumbent candidate vote percent, as opposed to incumbent party vote percent. The estimates of GDP's effect are larger, consistent with evidence that voters hold incumbent presidents more accountable for the economy than incumbent parties (e.g. Norpoth 2004), but the estimates remain imprecise. As noted above, we are only observing the candidates who chose to run in that earlier period, and we know that the ones facing the stiffest headwinds chose not to, likely suppressing the relationship between growth and incumbent vote share. Moreover, the 47% who do choose to run during recessions may do so precisely because they believe they have sufficient electoral advantages to offset the electoral disadvantages from economic contractions, such as when President Lincoln ran in 1864 during a recession but in the wake of capturing Atlanta and numerous other Civil War victories. Because of this downward bias, we believe that the Run-Win measure we use above is a better indicator of the economy's role in elections.Footnote 10

Other factors may also have weakened the relationship in year 4 simply by chance. In 1852, for example, internal debates about slavery split asunder the incumbent Whig Party, harming their nominee's chances despite a strong economy. The Whigs then vanished as an organized force in American politics. As we noted above, the 1904 outlier likely represents an exception that proves the rule, as the monthly industrial production data shows that growth had returned by the fall of 1904, just in time to reelect Theodore Roosevelt. 1924 is another such case: industrial production declined in most months between the summer of 1923 and the summer of 1924, but then grew consistently starting in August 1924, just in time for the election.

As shown in Fig. 2, GDP change in the year before the election was a stronger predictor of incumbent vote percent pre-World War II. So, Table 3 next shows regressions with this variable, rather than election-year GDP change, as the independent variable. The estimates are statistically significant at conventional levels pre-World War II.

It is intriguing that, while election-year growth mattered most for incumbents’ decisions to run again prior to the development of Keynesian economic stimulus tools, year-4 growth may have mattered less for vote share. It may be the case that, prior to Keynesianism, incumbent presidents felt compelled to step down when facing a recession, while, with the advent of Keynesian economics, they now feel less bound by the economy because they have the tools to stimulate the economy. Incumbents may also have felt more constrained by the state of the economy in early decades of US history because the two-term norm was only being established and the 22nd amendment, enforcing that norm, only became part of the Constitution in 1951. If incumbents used to decide whether to run again based on year-4 growth, that would account for the weaker impact of growth in that year on voters in the earlier period. With the modern primary system, incumbents likely must make decisions earlier, and can always hope that stimulative policies will kick in by Election Day.

Conclusion

While countless studies document the impact of the state of the economy on election results in the United States and elsewhere, nearly all those studies begin with elections in the mid-twentieth century. Doing so prevents us from assessing whether economic voting depends on governments having a Keynesian toolset to steer the economy, governments playing a large role in their economies, governments collecting and disseminating economic metrics, voters being educated and exposed to media coverage of the economy, and the economy being strongly integrated at the national level.

Ours is the first study to assess the impact of the economy on elections throughout American history. We found that incumbent presidents are more likely to run and win in a favorable economic context. When GDP declines in the election, presidents less often attempted to hold on to power, with only 47% of eligible incumbents running for reelection, and even fewer winning, just 13%. With positive GDP growth, by contrast, 84% sought another term and 54% succeeded. We found less clear results on the relationship between election-year economic growth and incumbent vote percent for the pre-World War II period, possibly because incumbents decided whether to run again on the basis of year-4 growth, thus potentially limiting the degree to which voters consider election-year growth when voting. We did, however, find a strong relationship between year-3 growth and election results, potentially because selection was weaker with year 3.

These findings suggest that the societal and economic preconditions necessary for economic voting may be more minimal than previously thought. Just as Dassonneville and Lewis-Beck (2017) recently showed that the state of the economy influences election results in a wide variety of institutional configurations, the economy seems to have mattered to election outcomes in the early United States even though American economic, societal, and electoral systems functioned very differently than today. Finally, they also suggest that voters held presidents accountable for the economy before presidents had the Keynesian toolset and before the federal government played as large a role in the economy. Our findings thus support the notion of blind retrospection discussed by Achen and Bartels (2016). Voters hold incumbents responsible for economic conditions regardless of the influence incumbents have on the economy.