Abstract
Punctuated equilibrium theory (PET) describes policy change as occurring mostly through incremental movements with infrequent periods of dramatic change. An impressive body of empirical literature relating to budgeting supports this view, but virtually all empirical tests have focused on examining distributions of annual changes, thus nullifying chronology. In this article, we focus on the time element. Using the same databases as previously used in canonical PET studies, we explore multi-year trends, not only annual observations. For our analyses, we identify directional series of changes (while allowing for one-year changes in direction if these are immediately offset in the following year) on a U.S. budget distribution dataset covering the period of 1947 through 2014, with 60 categories of spending consistently defined over time and adjusted for inflation. We then assess the robustness of the PET findings when incorporating a longer time units of trending series of annual changes into the analysis. We find that almost 65% of changes occur in series of 4 years or more. Nonetheless, the signature PET literature pattern of high kurtosis is equally present in these series as well as in shorter series. Moreover, within growing and trending series, we find that 21% of these series generate 80% of positive budget change. Within these series, we identify a small group of “super-trends” that account for a large share of the overall change. We conclude that expanding methodologies for the study of budgetary change to incorporate longer-term dynamics helps to better understand policy change, but such findings remain consistent with the PET perspective.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Dan Carpenter’s magisterial treatment of the development of the institutional power of the US Food and Drug Administration (FDA) describes two major punctuations in the development of the Agency. First was its initial creation in 1938 in response to deaths from unregulated “patent medicines” (particularly the elixir sulfanilamide in 1937; see Carpenter, 2010, chapter 2). Second was the 1962 expansion of the Agency’s powers following the thalidomide tragedy (chapter 4). In between these periods, however, was hardly a time of stasis. Rather, it was a long period of building up of the organizational capacity and, more importantly, the scientific rationale for a more effective regulatory model. Carpenter describes a slow build-up of institutional power based on changes in conceptual understanding of what the agency should be doing and what role it should play. These understandings were hotly contested and changed slowly over time. However, over a period of more than a decade they were at a very different place than where they had been before, helping to set the stage for the next major expansion of the Agency’s power. Kingdon (1995) would describe such a thing as “softening-up” period. Baumgartner and Jones (1993) would describe it as the development of a new “policy image” that in turn would undergird the power of a new “institutional venue” of power.
Carpenter’s description of the establishment and expansion of the powers of the FDA is consistent with punctuated equilibrium theory (PET). PET describes policy change as occurring mostly through incremental movements with infrequent periods of rapid and dramatic changes. Whereas early work in the PET tradition focused on the chronological development of policy changes in particular policy domains (e.g., Baumgartner & Jones, 2009), a new type of budgetary analysis introduced in subsequent writings used a method that ignored both chronology and the characteristics of individual policy domains (see, e.g., Jones et al., 1997, 1998, 2003; Jones & Baumgartner, 2005).
Jones and Baumgartner (2005) noted that one could look at changes in a single budget category over time; one could look at the distribution of changes across a series of categories in a particular year; or one could put all of these into a single distribution and assess the patterns of change across many budget categories and many years (see their Fig. 4.14, p. 111). This “budget change frequency distribution” has become a canonical demonstration of PET dynamics, with a signature high central peak, “weak shoulders” and “fat tails” reflecting an over-abundance of very small changes, an under-production of moderate changes, and an over-abundance of dramatic adjustments. The empirical evidence is impressive and has been reproduced in many countries (see for example the studies reviewed in Baumgartner, Jones and Mortensen 2023 or Kuhlmann & van der Heijden, 2018). Further, it can easily be assessed by a single statistical attribute: the kurtosis value of a distribution. If the value is near that of a normal distribution, then there is no evidence of PET.
We have no question about the value of this approach, particularly since it avoids overgeneralizing from a short historical time period or a single year or from a single policy domain. However, like any approach, it has some merits and some drawbacks. Compared to Carpenter’s approach or to the early PET literature, the newer literature is striking for its ahistorical nature, the elimination of the time element. In this article we therefore focus on the time element. Using the same databases as previously used in canonical PET studies, we explore multi-year trends, not only annual observations. We can therefore provide a useful robustness test and determine whether the literature’s ahistorical approach is causing it to miss important insights.
Our interest in looking at more than just annual changes is driven by the fact that long-term trends matter. Some examples can illustrate this very easily. In the U.S. federal budget, the housing assistance budget function underwent a sharp and drastic change in 1974 and 1975, rising from 7.5 billion to 147 billion dollars; clearly a dramatic punctuation. In contrast, Medicare climbed from 19 billion to almost half a trillion dollars from 1967 to 2014 in a relatively gradual and stable pattern of growth. A glance at the two series over time would show one with great volatility, with some huge, but momentary, adjustments, and the other with a slow but steady rise accumulating to massive change. If some of the large adjustments in the short volatile series were immediately counter-balanced by adjustments in the opposite direction, then the series could show no net change over time, whereas the slowly building one could accumulate to a transformation of that policy domain. Moreover, the slowly growing or slowly declining series over time obviously differ from ones where the increases and decreases alternate: one trends, whereas the other is erratic around a mean of zero. Putting all of these in a single change distribution may hide important information. It seems plausible, then, that the standard tests of PET theory, impressive and revealing as they are, have been omitting an important question: how changes accumulate over long periods of time.
In light of these observations and findings, our central underlying question is simple: are the PET findings confirmed or contradicted when we use a different methodology, one which explicitly incorporates the possibility of multi-year trends? What else can we discover when we incorporate longer time periods into the analysis? To examine this, we develop a measure of a “trend” in budgeting: periods during which annual changes move consistently in the same direction. Utilizing this concept, we pose three more specific questions: (A) what share of budget changes occur in multi-year trends? (B) Are the temporal dynamics that characterize such trends consistent with PET or do they support a gradualist view of budgeting? and (C) When we incorporate trends, does this change our understanding of what series have the greatest degree of change?
Previous studies suggesting the need to look at trends
Several authors, a few within the PET perspective and some critical of it, have published findings that motivate our approach. These studies suggest that important insights may come from delving deeper into the dynamics of change than is possible with the analysis of a single aggregated distribution of all annual changes across every available budget category. For example, Breunig and Koski (2012) examined budgets at the state level and found that higher levels of punctuation in a state’s budget function correspond to smaller long-term growth. This suggests that gradual changes may accumulate to more significant change in the long term than highly volatile or punctuated series. Robinson et al. (2014) found that budgetary punctuations occur in clusters. In other words, the probability of a budgetary punctuation is positively related to having had a recent punctuation. Flink and Robinson (2020) analyzed over 1000 Texas school districts over a nearly 20-year period and found that a significant number of punctuations were immediately counter-balanced by adjustments in the opposite direction. This points to the possibility that volatile budget series could actually be of little or no net change over time, whereas the slowly building ones may accumulate to much more significant change.
In addition, two studies examine temporal dynamics of two macro budget categories in the US, domestic and defense spending. They do so over very long time periods, from the eighteenth century till nowadays, and they show the significance of gradual accumulation over time. Jones and Breunig (2007) found high levels of autocorrelation in both budget categories, a straightforward indication for trending periods. Jones et al. (2014) show that federal budgeting in the US is a “self-reinforcing, recursive incremental system whose solution is exponential growth, termed exponential incrementalism” (575). This pattern is interrupted by disjoint shifts, caused by critical events such as major wars or severe economic downturns. In addition, temporally localized policy dynamics are also in play, causing bends in the exponential path as well as punctuations. Hence, this analysis over longer periods of time exhibits a prominent role for gradual accumulation. Studies within the historical institutionalist approach portray similar patterns in gradual shifts of institutions, sometimes undetected, occurring over decades (Hacker, 2004; Mahoney & Thelen, 2010; Pierson, 2004; Thelen, 2004). Finally, Epp and Baumgartner (2017) find that budgetary instabilities are affected by institutional capacity, whether the policy area is strongly affected by slowly moving demographic trends, and external crises. All these studies motivate our attention to multi-year trends in budgets.
A number of scholars have argued for the significance of gradual changes slowly accumulating over time; these come from outside the PET perspective and are potentially more critical of it. A handful of case studies demonstrate cases in which gradual change accumulated to significant change and even paradigmatic change, external to the explanatory frameworks of PET. Such cases have been found to occur in various policy areas, such as: forestry (Cashore & Howlett, 2006, 2007), agriculture (Coleman et al., 1996; Daugbjerg, 1997, 2003; Skogstad, 1998), administrative reform (Capano, 2003) policy towards indigenous peoples (Howlett, 1994) and air pollution (Segal, 2018). Benjamin Cashore and Michael Howlett (2007) raise explicit concerns with regard to gradually accumulating incremental changes, and they argue that PET findings overlook this important mechanism of policy change. They note gradual movements in the same direction, none of which would constitute a punctuation when considered on an annual basis, but which might accumulate to substantial change this corresponds with Mahoney and Thelen’s (2004) influential theory of gradual institutional change.
Charles Lindblom understood incrementalism as “political change by small steps” (1979, 517), he also explicitly recognized that “a fast-moving sequence of small changes can more speedily accomplish a drastic alteration of the status quo than can an only infrequent major policy-change” (Lindblom 1979, 520). Hence, the potential for significant gradual accumulation was recognized in the early years of the literature on budgetary change. Budgetary scholars including Wildavsky (1975), Padgett (1981), Larkey et al. (1981), Kamlet and Mowery (1987), and Su et al. (1993) have wrestled with issues of trends and annual changes, and we build on this work here. Summing up, previous research points to the possibility that gradual accumulation (or decline) may be a significant driver of change and that therefore any conclusions regarding patterns of policy outputs should also be sensitive to temporal dynamics. We provide that perspective here.
Gradually accumulating change and PET
In order to understand the theoretical disparity between gradual accumulating change and PET, we need to examine PET’s behavioral foundation, which is Disproportionate Information Processing (DIP) theory (Jones & Baumgartner, 2005). At its core, DIP presents the cognitive overload model of decision-making. Simply put, policy makers suffer from an overload of signals and therefore process information disproportionately. For most issues, most of the time, they under-respond, because their attention is focused on a small number of other issues with the greatest urgent need for response. When an issue does capture their attention, they may over-respond because of the accumulated lag in policy response during the period when they were focused on other issues.
Within a DIP framework, gradual accumulation may occur if signals are pointing in a certain direction over extended periods of time (Segal, 2018). These may be demographic trends or program costs that can be predicted steadily to increase over time (Epp & Baumgartner, 2017). In Segal’s (2018) study, it was steadily increasing environmental threats. For Epp and Baumgartner, it was slowly moving demographic trends. But just because policymakers respond steadily in the face of social indicators moving in a common direction year after year does not mean that they respond in a proportionate manner. Trending series, in other words, may still exhibit the distinctive pattern of under- and over-response, reflecting DIP theory.
Several of the studies reviewed above support a counter-argument, one that suggests that accumulated change over time may stem from a different behavioral model, one of relatively proportionate adjustment. Cashore and Howlett (2006, 2007) argue that a central mode of gradual changes is overlooked by PET. They suggest a “progressive incremental” dynamic of change consisting of incremental changes as an alternative model of policymaking. They demonstrate this through a case study of gradual policy response over time in forest policy in the Pacific Northwest, eventually generating paradigmatic changes in logging practices in response to growing demand to protect endangered species. The dynamic they describe suggests relatively efficient policymaking: pressure from the outside environment pushes policymakers progressively to adjust policy in response, eventually leading to a major policy change.
Other scholars have also identified cases of gradual rather than punctuated policy response. Skogstad (1998) described a gradual change in the European state-assisted agricultural paradigm in the 1990s increasing the paradigm’s endurance, while the more stagnant paradigm in the United States was eventually overthrown. This is consistent with the emphasis on the “build-up” of pressure, or “error-accumulation” in DIP theory. During times when decision makers under-respond to signals, those signals accumulate. Either they dissipate, by random luck, or they accumulate over time if not incorporated into the decisions. This error-accumulation is then the source of punctuations, in the DIP model, as decision makers eventually have to respond to the accumulated crisis, since they did not respond sufficiently to it as it was building. If decision makers were to respond proportionately to signals as they accumulate, then there would be no build-up, and no need for punctuation; this is essentially the Skogstad (1998) example. Efficient processes can go on forever; inefficient ones eventually generate a break-down leading to a punctuation. But if policymakers can “get it right,” as Skogstad suggests, then they can avoid policy punctuations altogether, something that is clearly inconsistent with PET and DIP.
Theory and hypotheses
We sum up the gradual change argument in the form of a rival hypothesis to DIP and PET: gradual accumulation is a central mechanism of change occurring in a more efficient and proportional response to incoming policy-relevant signals. We assess these changes by returning an important time-element into the study of budget changes, trends. If slowly accumulating changes are proportionate to shifting policy needs, then there will be no need for punctuations. In sum, we assess whether the overwhelming consensus in the budgetary literature in support of the “general punctuation hypothesis” (Jones et al., 2009) remains if we allow for trends. After all, that literature has been based on a common, but untested, methodological assumption, that ignoring the time element is inconsequential. Here we test this directly.
We test the following hypotheses. First are three hypotheses related to budgetary gradualism:
H1.
Annual budget changes occurring in trends will show significantly less kurtosis than changes that are not part of trends.
H2.
The longer the trend in a budget series, the lower the kurtosis.
H3.
Analysis of the percent change in budget value from the start to the end of a trend will show lower kurtosis than analysis of the annual budget changes.
Analysis of trends rather than only annual values will generate new insights into the budgetary process. If H1 through H3 are disconfirmed, and series trending in the same direction for long periods are just as likely to be punctuated as series that do not trend (or do so for a shorter period), then a small share of series that are both punctuated and trending may generate extraordinary levels of change. This leads to these expectations:
H4.
A small share of trending series will generate a large share of the overall budget change.
H5.
These “super-trends” will incorporate both clear budget punctuations as well as sustained movement in the same direction.
Finally, as others have noted (Jones et al., 2009), growing and declining trends are mathematically distinct: Growing series are unbounded but declining series cannot decline by more than 100%. Therefore, the patterns we observe are sensitive to whether we are describing growing or declining series:
H6.
The concentration of all change in just a few trending budgetary series will be greater for growing series than for declining ones.
Methodology
We begin by taking the U.S. budget distribution previously used in many published articles from the Comparative Agendas Project web siteFootnote 1; it covers the period of 1947 through 2014, with 60 categoriesFootnote 2 of spending consistently defined over time and adjusted for inflation. We use this dataset without updating it so that our results can most directly be compared with the previous literature. We use inflation-adjusted budget amounts as is common in most budget studies. In addition, we replicate our analysis using a complementary 'change-in-share' analysis that controls for budget drift and reach similar results; see Appendix B. For this, we convert all budget numbers into shares of the annual budget, summing to 100% for each year. We then analyze changes in this share. This means that we are analyzing changes in the share of the budget, understanding that the budget itself may be growing over time. Hence, the change-in-share method is also more reflective of prioritization on behalf of decision makers.
The concept of a “trend” is fundamental to our analysis. For each observation, we simply note whether the change continues in the same direction (e.g., growing or declining) as the previous change, or if it reverses. If there is a reversal, we note if the reversal was itself immediately offset in the following year. In such a case, we call the reversal a single-year irregularity. We call a series a trend if it moves in the same direction for four years or longer, not counting single-year irregularities.Footnote 3 This methodology identified 1043 series from the total 3699 observations in the dataset. Note that our use of four years as a cutoff for defining a trend is arbitrary, but four years of continued growth or decline in a budget is substantively important. Further, whether we define the trend with a cutoff of, 3, 4, 5, or 6 years makes little difference with regard to the hypotheses tested; see Appendix Table 6. We assess levels of kurtosis in Table 3 below by the length of the series and show no particular break-points in the series. Therefore, we are confident that our results are robust with respect to the question of the use of a four-year cutoff to define a trend.
As aptly described by Johanna Kuhlmann and Jeroen van der Heijden (2018, 328–329), Baumgartner and Jones have repeatedly stressed that punctuations are identified by looking at entire distributions of changes, not by tracking a single series over time. The literature provides no objective definition for the cutoffs between small, medium and large changes within single budget series. In an entire distribution, on the other hand, punctuations are identified the presence of “fat tails” or a set of outlier values that would not be expected given the overall level of variability in the series. This means looking at the level of kurtosis in the series, which also implies looking at large numbers (several hundred) of observations rather than only a few. If the series of changes, taken together, is statistically similar to a bell curve (Normal distribution), then it is not punctuated even if there are some large changes. If, on the other hand, there is a high degree of kurtosis (e.g., a high central peak, weak shoulders, and fat tails, also known as leptokurtosis), then the distribution is punctuated (see also Eissler et al., 2016; Boydstun, 2013; Cavalieri, 2023). We follow these strategies here.
Results
Trending series
Breaking down the 3699 annual budget changes into series consistently moving in one direction or another generated 1043 series. Table 1 shows the length of each of the series we identified, separately for series that were growing and declining. It shows both the number of distinct budget series as well as the corresponding number of annual budget observations within each series. While there are relatively few very long trends, a much larger share of all budget observations are part of such trends, since by definition these longer trends have many annual observations. More precisely, just 31% of the series show a trend of four years or more, but 65% of all observations are part of such trends.
Table 1 makes clear that trending series are quite common, and that some data series trend, for many years in a row; 9 declining trends and 53 growing trends lasted for 10 years or more. These were fewer than 6% of all the series, but more than 25% of all budget observations. Table 2 shows summary statistics for growing and declining budget series according to whether they are part of a trend lasting at least four years.
Series trending for four or more years are more common than non-trending series. These findings provide a clear and definitive answer to our first question concerning the extent of accumulating changes: Trends have an important role in the logic of budgets. We can only gain by understanding whether annual budget shifts may be part of multi-year trends.
Looking into the size of budget changes in both types of series, trending and non-trending, we find that annual changes in non-trending series are larger than those in trending series. The medians of annual changes in non-trending series are 12.1% and − 9.6%, as compared to 6.8% and − 6.1% in the trending series. Of course, trending series see accumulation of these effects whereas non-trending series, by definition, experience less accumulation as well as a quicker offset by the following reversing series. This explains why the median of accumulated growth in growing trending series was 111.9%, as opposed to 30.6% in growing non-trending series. The median for accumulated decline in declining trending series was − 39.8%, as opposed to − 21.4% in declining non-trending series. At a minimum, we can say that attention to accumulated changes over many years provides insight into the nature of policy change. Some important policy changes take several years to accumulate.
Disproportionality in trending series
The previous short assessment should clearly answer the question of whether trending changes have a significant role in the budgetary process; they do. We now move on to examine trend dynamics. The fact that a series accumulates in the same direction for several years does not necessarily mean that it is gradual rather than punctuated. If, however, the distinctive characteristic of PET budgetary findings, high kurtosis, is limited only to non-trending series, then the methodological convention within the literature of ignoring time dynamics longer than one year would be problematic. Therefore, we want to know if evidence of proportionality in budgeting differs significantly in those series which are parts of significant trends compared to those which are not part of a trend.
Fortunately, we can devise a simple but robust test for this. Disproportionality is traditionally measured in the PET budgeting literature by using the kurtosis statistic, K, or its more robust analog, L-Kurtosis (LK). LK provides a single value, ranging from 0 to 1, indicating the degree of kurtosis in a distribution. Distributions with high kurtosis, compared to Normal, have a higher central peak, more slender shoulders, and longer tails. LK scores, compared to the more conventional K scores, are less sensitive to single cases in the tail, and therefore provide a more robust and reliable general test, more suitable for our analysis. LK values range from zero to one, with 0.123 the LK of a normal distribution. Table 2 breaks out each annual budget change observation by the length of a series to which it belongs and shows the LK scores for each set of observations.
As we saw in Table 2, 2397 changes are part of trends lasting four years or more, and 1,302 are part of series lasting less than four years. Table 3 shows that the distributions have almost identical LK values: 0.584 for trending series and 0.574 for non-trending series. Thus, we clearly reject H1, which posed that annual budget changes occurring in trends will show significantly less kurtosis than changes that are not part of trends.
Looking at the individual series (clustered by trend length so that the N’s are roughly comparable and over 400 in each case), values are high. These range from 0.511 to 0.634, very high in comparison to the LK of a normal distribution (0.123). We therefore reject H2, which posed that the longer the trend in a budget series, the lower the kurtosis, since there is no tendency for kurtosis to decline as the series grows longer. (And in addition, none of the groups comes close to the value expected in a normal distribution.) Overall, Table 3 shows that consistency in the direction of change is not correlated with a less erratic pattern of budgeting. Decision-making processes within trending series of changes also evolve in a disproportionate dynamic consistent with PET and DIP expectations. The results reject the hypothesis that gradual accumulation is evidence of a more efficient and proportionate pattern of policy adjustment than what has been shown in the PET literature. They also present a strong indication that governmental decision-making processes beyond budgets occur in PET patterns even if change is advancing in a certain direction over an extended period.
Figure 1 shows the distribution of changes at the core of the main result in Table 3: The left panel shows those changes not part of a trending series, and the right panel shows those observations which are part of a trend of four years or longer. The figure shows the number of observations showing various percentages of change from the previous year. A large peak of observations is around zero percent, and there are a number of outliers. A normal curve with similar variance is overlaid on the figure, showing that the observed values: (a) over-produce cases in the central peak; (b) under-produce cases in the moderate “shoulders” of the distribution; and (c) over-produce outliers. This is consistent with dozens of articles in the previous literature (see Baumgartner et al., 2023).
Figure 1 confirms the results from Table 3. No matter if we look at shorter or longer trends, we see the signature of a punctuated equilibrium process: a high-kurtosis distribution.
Finally, we performed another series of tests on the accumulated change across the length of each trend, from its beginning to its end. If trends have a central role in budgeting then perhaps we should widen our perspective from snapshots of annual changes to assess the cumulative change over an entire trend, however long it may be. For example, budgeting may be tied through a “mandatory spending” formula for many years, automatically changing in response to demographic trends, numbers in poverty, or other social indicators. Or, general budgeting policy may be set in the first year of a new administration, continuing for four or eight years until the next one. In such cases, a cumulative change would be a better representation of decision-making dynamics than analysis only of annual changes without respect to how these accumulate. For example, if a series moved up by 5% each year for five years before reversing, we would call this a trend accumulating to 26.5% total change. In contrast, in Fig. 1 and in Table 2 it would be recorded as five values of five. The left panel in Fig. 2 shows the distribution of cumulative changes of all 1,043 trends that we identified, and the right panel shows the same values for all trends lasting four years or longer (N = 323). (Note that the N’s are lower here than in Table 1 because the unit of analysis here is the series, not the number of annual observations within each series.)
When we look at the cumulative change from the beginning to the end of a budgeting trend, of course the degree of change is greater than if we look at just one year at a time. We cluster the values at + 1000% change because many of them go beyond that level. But in looking at all the trends our algorithm identified, or in looking only at the trends lasting at least four years, the LK value is nearly identical: 0.72 or 0.71. Still the general shape of change that we showed in Fig. 1, and which is common in the literature, remains. Trending series show the same kind of PET distribution of changes as non-trending series and shifting our analysis from annual changes to the endpoints of trends of various lengths does not change this finding. Thus, we reject H3, which posed that analysis of the percent change in budget value from the start to the end of a trend will show lower kurtosis than analysis of the annual budget changes, as we rejected H1 and H2.
Isolating super-trends
Our final analysis concerns the largest trends, i.e. those that accumulate over long periods of time to generate the greatest degree of percent change in budget value. The motivation for this is double. First, it is of obvious substantive interest to examine the dynamics of the most significant policy changes. Second, our analysis of whole distributions in the previous part provided a strong indication of PET temporal dynamics though still it did not examine them directly.
Our focus here is on trends that grow for at least four years. Table 2 showed that 48% of the federal budget consists of such cases. This group consists of 202 trending series with 1,571 annual observations. (Appendix B shows the parallel analyses for all series as well as other robustness tests.) Fig. 3 focuses on those 202 series and compares them with respect to the total accumulated change that each generates. Similar to a plot that might show what share of wealth is controlled by what share of the population, the presentation allows a simple graphical demonstration of how important a small number of series is in the overall accumulation of all budget change. Note that if all the budget series contributed equally to the budget changes, the result would be a straight line along the 45-degree line from bottom-left to top-right of the Figure.Footnote 4
Figure 3 is annotated with horizontal and vertical lines indicating the percent of the series generating 20, 50, and 80% of the change in budget value; the corresponding shares of series are 0.5, 2.5, and 21% In fact, just six series out of 202 (3.8%) generate half of all the change described here; these are visible as individual dots in the upper-right of the Figure. We confirm H4, which posed that a small share of trending series will generate a large share of the overall budget change.
Table 4 describes these six “super-trends” in greater detail, and Appendix Table 5 provides the annual values for each of the series, allowing a more detailed assessment of the combination of gradual and punctuated changes that make up each.
Table 4 shows the most remarkable positive changes in the US budget over the entire period. Housing Assistance, for example, increased from 1954 to 1975 from approximately $326 million to almost $150 billion, more than 400 times the earlier value. Expressed in terms of the percent of the federal budget, it grew from less than 0.07% to almost 11%, clearly a remarkable change. While this is the largest change in the entire federal budget, the six other series documented here also show very large changes.
The six “super-trends” we document here combine punctuations with continuing trends (see Table 5). That is, taken as a group, the trending series we study here are not stories of simple proportionate growth in response to a trending social problem such as demographic growth; each is subject to an important single punctuation, or several of them. However, neither are these series characterized by a flat series, a single interruption, and then a flat line at a new value. They are just as punctuated as the other series in the budget (as we showed in Table 2), but they combine this common characteristic with another one: they trend in the same direction for many years. Appendix Table 5 shows no case without a major annual budgetary shift, and most of the series have several extremely large changes. Combined with the previous analysis, these results clearly indicate that the super-trends combine punctuations with continued change; many show multiple large changes in a short time period. We confirm H5, which posed that “super-trends” will incorporate both clear budget punctuations as well as sustained movement in the same direction.Footnote 5
Figure 4 shows the equivalent to Fig. 3 for declining series. The level of disproportionality is significantly lower, though it remains.
Declining series do appear to show some significant differences from growing series, as Fig. 4 differs markedly from Fig. 3. Most obviously, the largest changes are not concentrated to the same extent in just a few series but are spread more evenly across all the categories. Thus, we confirm H6, which posed that the concentration of all change in just a few trending budgetary series will be greater for growing series than for declining ones. This is for several reasons: First, of course a series cannot decline by more than 100%. And once it does, it must, by definition, reverse in the following period. Growing trends, with no upper limit, can continue to grow even after they have seen rapid growth in the past. Second, many of the largest declines come in single years, often in such series as farm stabilization programs, emergency preparedness, and other series explicitly designed to respond to temporary crises but not to remain at high levels year after year. There are a few long-term declines that accumulate to very large percentages, almost eliminating the spending program altogether; these have come in various veterans’ education programs as wars have faded into history (for example Veterans Education, Training, and Rehabilitation declined by 96% from 1957 to 1966 and again by 98% from 1977 to 1990). Other examples include Space Flight (− 66% from 1966 to 1975); Veterans Benefits (1951 to 1960; − 50%; Community Development (− 53% between 1984 and 1989 and − 79% between 2009- and 2014).
Even though Fig. 4 shows significant differences from Fig. 3, it remains the case that reductions are not evenly spread across budget series. As the lines in the Figure illustrate, 55% of the series account for 80% of the total accumulated budget declines; 27.5% account for half of the decline, and 9.2% of the series account for 20% of the total decline. This concentration is much lower than in the positive series, but it remains substantively important.
Trends and proportionality
Because a series is gradually trending upwards or downwards over time does not imply that policymakers are “getting it right” or adjusting proportionately to a steadily growing (or disappearing) problem. Rather, whether a series is trending up or down, policymakers may still be under-responding to the change in social need. Therefore, we can see errors continue to accumulate even as budgets are continually adjusted in the same direction over many years (see Jones & Baumgartner, 2005 in general for a discussion of error-accumulation, friction, and the occasional need to make radical adjustments). Punctuations and trends can go together. When they do, we can see even greater accumulated changes than would be suggested by a focus only on annual budget changes. Of course, as Skogstad (1998) described, occasionally, policymakers may very well get it right. Our results suggest that this is the exception rather than the rule, even when series trend in a common direction over many years.
Conclusion
We set forth in this study with two central questions in mind: do PET findings hold when we use a methodology which explicitly incorporates the possibility of multi-year trends?; and what else can we discover when we incorporate longer time periods into the analysis with focus on the trends characteristics and impact on change? For this, we re-examined patterns of change in budgetary data, assessing whether the consistent findings in the PET literature could be due to a methodological quirk, that of focusing only on cumulative distributions of annual changes. We also gained insights by directly examining temporal dynamics and looking at trends rather than only annual changes. Our results were quite straightforward. A large proportion of changes in budgets occur within trending series, almost 65% when using a four-year cutoff but still significant proportions when five- and six-year cutoffs are used. However, the signature pattern of change previously repeated across the literature in analyses of annual budget changes is repeated when we look at accumulated changes from the beginning to the end of multi-year trends. This suggests that the DIP theory of cognition that is at the core of PET remains valid even while budgets may be shifting steadily in one direction over many years. The fact that a budget may trend in a common direction does not mean policymakers are responding proportionately to the severity of the underlying problem. We find the same tendency for punctuations and high kurtosis within trending and non-trending series.
We also found that a small group of punctuated and long trends, that we label “super-trends,” are responsible for exceptional shares of the total accumulated change in the entire system. In fact, within growing and trending series, 21% of the budget series generate about 80% of the total budgetary change. Overall, we validate the central findings of the PET approach, but we also point to reasons to expand our empirical focus beyond only annual assessments of “the shape of change.” A more complete understanding of budgets requires attention to dynamics over time, and particularly to a small set of “super-trends” that generate a large share of the universe of budgetary reallocations. Understanding these dynamics is both of theoretical and substantive interest.
Future research could possibly focus on identifying patterns of accumulation and decrease over time within trends, as well as the factors that influence such accumulation and decrease patterns. Analyzing change patterns within super-trends could be an interesting place to start.
Notes
We included functions that represent policymaking and excluded financial functions, such as annual offsets (True 2009); we excluded annual changes in share for which the base year was smaller than 100 million 2009 adjusted dollars. These practices are standard in the literature.
Consider this series of percent changes: + 3, + 2, + 5, -2, + 4, + 2, − 1, −1. It is a trend of six years. This series: + 3, + 2, + 5, -2, + 1, + 2, − 1, − 1 lasts only three years because the -2 in the fourth year is not completely offset by the + 1 in the following period; the offset must be larger than the irregularity, otherwise it is not fully reversed.
Note that punctuations should be identified with respect to being outliers in a large distribution of values. When we refer to punctuations in Table A-1 in this section, we are referring to observations that are extreme outliers with respect to the others. This, however, is not a formal test but rather an illustration using limited data.
References
Baumgartner, F. R., & Jones, B. D. (2009). Agendas and Instability in American Politics, 2nd edn. The University of Chicago Press.
Baumgartner, F. R., Jones, B. D., & Mortensen P. B. (2023). Punctuated equilibrium theory: Explaining stability and change in public policy. In C. M. Weible, & C. M. Sabatier (Eds.), Theories of the policy process (5th ed., pp. 65–99). Westview Press.
Boydstun, A. E. (2013). Making the News: Politics, the Media, and Agenda Setting. University of Chicago Press.
Breunig, C., & Koski, C. (2006). Punctuated equilibria and budgets in the American States. Policy Studies Journal, 34(3), 363–379.
Breunig, C., & Koski, C. (2012). The tortoise or the hare? Incrementalism, punctuations, and their consequences. Policy Studies Journal, 40(1), 45–68.
Capano, G. (2003). Administrative traditions and policy change: When policy paradigms matter, the Case of Italian Administrative Reform during the 1990s. Public Administration, 81(4), 781–801.
Carpenter, D. (2010). Reputation and power: Organizational image and pharmaceutical regulation at the FDA. Princeton University Press.
Cashore, B., & Howlett M. (2006). Behavioral thresholds and institutional rigidities as explanations of punctuated equilibrium processes in Pacific Northwest Forest Policy Dynamics. In R. Repetto (Ed.), Punctuated equilibrium and the dynamics of U.S. Environmental Policy (pp. 137–61). Yale University Press.
Cashore, B., & Howlett, M. (2007). Punctuating which equilibrium? Understanding thermostatic policy dynamics in Pacific Northwest Forestry. American Journal of Political Science, 51(3), 532–551.
Cavalieri, A. (2023). Italian budgeting policy. Between punctuations and incrementalism. Palgrave Macmillan.
Coleman, W. D., Skogstad, G. D., & Atkinson, M. M. (1996). Paradigm shifts and policy networks: Cumulative change in agriculture. Journal of Public Policy, 16(3), 273–301.
Daugbjerg, C. (1997). Policy networks and agricultural policy reforms: Explaining deregulation in Sweden and Re-Regulation in the European Community. Governance, 10(2), 123–142.
Daugbjerg, C. (2003). Policy feedback and paradigm shift in EU agricultural policy: The effects of the Macsharry reform on future reform. Journal of European Public Policy, 10(3), 421–437.
Eissler, R., Russell, A., & Jones, B. D. (2016). The transformation of ideas: The origin and evolution of punctuated equilibrium theory. In B. Guy Peters & P. Zittoun (Eds.) Contemporary approaches to public policy (pp. 95–112). Palgrave Macmillan. https://doi.org/10.1057/978-1-137-50494-4_6
Epp, D. A., & Baumgartner, F. R. (2017). Complexity, capacity, and budget punctuations. Policy Studies Journal, 45(2), 247–264.
Flink, C. M., & Robinson, S. E. (2020). Corrective policy reactions: Positive and negative budgetary punctuations. Journal of Public Policy, 40(1), 96–115.
Hacker, J. (2004). Privatizing risk without privatizing the welfare state: The hidden politics of social policy retrenchment in the United States. The American Political Science Review, 98(2), 243–260.
Holyoke, T. T., Henig, J. R., Brown, H., & Lacireno-Paquet, N. (2009). Policy dynamics and the evolution of state charter school laws. Policy Sciences, 42(1), 33–55.
Howlett, M. (1994). Policy paradigms and policy change: Lessons from the old and new Canadian policies towards aboriginal peoples. Policy Studies Journal, 22(4), 631–651.
John, P., & Bevan, S. (2012). What are policy punctuations? Large changes in the legislative agenda of the UK Government, 1911–2008. Policy Studies Journal, 40(1), 89–108.
Jones, B. D., & Baumgartner, F. R. (2005). The Politics of attention: How government prioritizes problems. University of Chicago Press.
Jones, B. D., Baumgartner, F. R., Breunig, C., Foucault, M., Francois, A., Green-Pedersen, C., John, P., Koske, C., Mortensen, P. B., Robinson, S., Soroka, S., Varone, F., Walgrave, S., & Wlezien, C. (2009). A general empirical law of public budgets: A comparative analysis. American Journal of Political Science, 53(4), 855–873.
Jones, B. D., Baumgartner, F. R., & True, J. L. (1998). Policy punctuations: US Budget Authority, 1947–95. Journal of Politics, 60(1), 1–33.
Jones, B. D., Sulkin, T., & Larsen, H. (2003). Policy punctuations in American Political Institutions. American Political Science Review, 97(1), 151–170.
Jones, B. D., True, J. L., & Baumgartner, F. R. (1997). Does incrementalism stem from political consensus or institutional gridlock? American Journal of Political Science, 41(4), 1319–1339.
Jones, B. D., Zalányi, L., & Érdi, P. (2014). An integrated theory of budgetary politics and some empirical tests: The U.S. National Budget, 1791–2010. American Journal of Political Science, 58(3), 561–578.
Kamlet, M. S., & Mowery, D. C. (1987). Influences on executive and congressional budgetary priorities, 1955–1981. American Political Science Review, 81(1), 155–178.
Kingdon, J. W. (1995). Agendas, alternatives, and public policies (2nd ed.). Longman.
Kuhlmann, J., & van der Heijden, J. (2018). What is known about punctuated equilibrium theory? And what does that tell us about the construction, validation, and replication of knowledge in the policy sciences? Review of Policy Research, 35, 326–347. https://doi.org/10.1111/ropr.12283
Larkey, P. D., Stolp, C., & Winer, M. (1981). Theorizing about the growth of government: A research assessment. Journal of Public Policy, 1(2), 157–220.
Mahoney, J., & Thelen, K. (Eds.). (2010). Explaining institutional change: Ambiguity, agency, and power. Cambridge University Press.
Padgett, J. F. (1981). Hierarchy and ecological control in federal budgetary decision making. American Journal of Sociology, 87(1), 75–128.
Pierson, P. (2004). Politics in time: History, institutions, and social analysis. Princeton University Press.
Robinson, S. E. (2004). Punctuated equilibrium, bureaucratization, and budgetary changes in schools. Policy Studies Journal, 32(1), 25–39.
Robinson, S. E., Flink, C. M., & King, C. M. (2014). Organizational history and budgetary punctuation. Journal of Public Administration Research and Theory, 24(2), 459–471.
Sabatier, P. A., & Jenkins-Smith H. C. (Eds.). (1993). Policy change and learning: An advocacy coalition approach. Westview Press.
Segal, E. (2018). Patterns of Gradual Change in Public Policy PhD diss. Hebrew University of Jerusalem.
Skogstad, G. (1998). Ideas, paradigms and institutions: Agricultural exceptionalism in the European Union and the United States. Governance, 11(4), 463–490.
Su, T. T., Kamlet, M. S., & Mowery, D. (1993). Modeling U.S. Budgetary and Fiscal Outcomes: A disaggregated, systemwide perspective. American Journal of Political Science, 37(1), 213–245.
Thelen, K. (2004). How institutions evolve: The political economy of skills in Germany, Britain, the United States and Japan. Cambridge University Press.
Wildavsky, A. (1975). Budgeting: A comparative theory of the budgeting process. Little, Brown.
Acknowledgments
I would like to express my sincere appreciation to my primary advisor Professor Eran Feitelson for his guidance, insights, and constant encouragement throughout the research period. I would also like to thank Professor Bryan Jones for his kind support and guidance. His expertise and wisdom were an invaluable asset to this project.
Funding
Open access funding provided by Hebrew University of Jerusalem.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare none.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A: Further details on the six super-trends
Table 5 presents the annual data associated with the summary laid out in Table 4 in the main text.
The details presented in Table 5 allow a detailed understanding of the concept of a trend. Looking at the Percent Change of Budget Value column, these generally increase, but sometimes have negative values. Recall from the text that our definition of a trend is one that is not reversed for two years in a row. Also, note that the percent of budget values, and the changes calculated from these, sometimes show different patterns.
Substantively, a review of the last column of the table allows an assessment of whether these series are subject to stability followed by a single punctuation followed by a new stable value, or if there is a more complicated pattern. Clearly, the pattern is more complicated than a single stable—change—stable pattern would suggest. Some of the very large adjustments are preceded by very substantial build-ups, and some very large changes are then quickly followed by even more change in the same direction. In sum, Table 5 makes clear that the six Super-Trends generally involve a number of dramatic adjustments, not just a single one. It also shows that these dramatic long-term shifts are not examples of gradualism, either. They show punctuations as well as trends.
Appendix B: Robustness tests
Alternative cutoffs to define a “Trending Series”
We define a “trend” as a series that moves in the same direction for four years or more. Our findings robust to different cutoff points, however. Table 6 replicates the findings in Table 2 in the main text using different cutoff points, three, five, and six years in Parts A, B, and C respectively.
Defining a “trending” series in different ways of course alters the number of them that are observed in the same data. Table 2 in the main text showed 65% of all budget observations being part of a trend of at least four years; this number is 78% if we define a trend with a cutoff of three years (Part A, above); 52% if defined as five years or longer (Part B); and 46% with a cutoff of six years (Part C). Also, by construction, the median lengths of the trending and non-trending series change.
The annual and cumulative change values, however, do not change enough to merit any revisions in our substantive conclusions. This, combined with the more important test in Table 3 of the main text (LK scores for series of different lengths) shows that the findings are robust to the question of what is the cutoff for a “trending series.” We would reach the same substantive and theoretical conclusions no matter which particular cutoff we were to use.
Changes in annual values vs. changes in annual budget shares
We use percent changes in value rather than changes in "share" in the main text of the paper. This can be an important distinction because if the entire budget increases by a large percent in any particular year, then an increase by a smaller amount can lead to a reduction in the share of the annual budget. Similarly in a year of decline, a smaller decline than average would be an increase in share. Our complementary analysis of changes in share is for theoretical reasons (we believe it has merit in reflecting budgetary choices and priorities made by policymakers by normalizing for policy drift or changes in the overall size of government), but our results are robust to either methodology.
Table 7 replicates Table 2 in the main text using percent changes in values rather than percent changes in shares.
Similar to the results showed in Table 5, changing the definition of a trend makes some small differences in the results. We see 61% of all series appearing as part of a trend, rather than 65% as in the main text. However, the differences are small.
Figure
5 is perhaps a more important test, replicating Fig. 1 in the text using changes in values rather than changes in shares.
Figure 5 shows very similar results to Fig. 1 in the main text, indicating no substantive differences in interpretation from using shares or values of annual budgets in calculating percentage change values. Note that the N’s are slightly different in this presentation, as some series continue as “trends” when calculated as percent change in values, but not when calculated as percent change in series as in the main text. However, these differences are small.
Figure
6 replicates Fig. 2 in the same manner.
As in the previous figure, differences here suggest no change in the substantive interpretation of the results compared to the data presented in the main text. In this presentation, there are more series, because many series continue in the same direction when considered by values, but not when considered as shares. Because of this difference, a larger number of trends last much less. The maximum length when considered by shares is 21 years, with just 10 series lasting 15 years or longer. When looking at values, the longest trend lasts 61 years, and 24 trends last 15 years or longer. The overall cumulative change from beginning to end of these series is therefore greater. However, the signature finding of high kurtosis remains no matter if we look at values or shares of the budget, and the overall shape of the distributions are very similar.
Extensions of Figs. 3 and 4
Figure 3 in the main text included only growing trends (lasting 4 years or more), and Fig. 4 included declining trends, with the same format. Figures
7 and
8 replicate these Figures but show all series, including series not lasting long enough to be called part of a trend.
Figure 3 was limited to 202 growing trends of four years or more. Expanding the analysis to all growing series, Fig. 7 shows little difference. Whereas Fig. 3 showed a high concentration of change coming from just a few series (21% of the series generating 80% of the change; 2.5% generating 50% of the change; and 0.5% generating 20% of the change), the corresponding values for Fig. 7 are 9.1, 0.7, and 0.2% of the series generating 80, 50, and 20% of the change. This suggests that addition of more than 200 additional observations only adds to the remarkable concentration in overall changes coming from a small share of the series.
Figure 4 was limited to 109 declining series lasting four years or more. Figure 8 shows all declining series, no matter the length, and shows little difference from Fig. 4; as with Fig. 7, the revised data in Fig. 8 show a slightly higher degree of concentration than what was shown in the main text.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Segal, E., Baumgartner, F.R. How budgets change: punctuations, trends, and super-trends. Policy Sci (2024). https://doi.org/10.1007/s11077-024-09542-9
Accepted:
Published:
DOI: https://doi.org/10.1007/s11077-024-09542-9