Introduction

The collapse of large social systems, also called “civilizations” or “empires,” is a well-known and highly studied subject. In many cases, the historical record does not provide quantitative data on these events, but in some cases it is possible to quantify the collapse phenomenon in terms, for instance, of the extent of the areas controlled by the central government as reported by Tageepera (1979) or of the output of the economic system as reported by Sverdrup et al. (2013) and McConnell et al. (2018). In these studies, we can observe how collapses are often rapid in comparison to the build-up of the social and economic structures of a civilization. This behavior is consistent with Diamond’s definition of collapse as, “a drastic decrease in human population size and or political/economic/social complexity, over a considerable area, for an extended time.” (Diamond 2005).

Nevertheless, despite the number of studies in this area, there is little agreement on the causes of societal collapses and, in particular, on the possibility of a common mechanism causing them. Edward Gibbon was probably the first to attempt an interpretation of the fall of a large empire, the Roman one, attributing it mainly to the decline of the traditional values (Gibbon 1783). Later authors explained the fall of Rome in widely different ways and Demandt (1984) lists about 210 different theories on this subject, probably an incomplete list. The same variety of interpretations affects the studies of the collapse of other societies in history, as described, for instance, by Tainter (2008).

No consensus appears to exist in this field but, overall, we can divide the interpretations of collapses into two main subsets: theories based on several independent causes (concauses) and theories based on a single cause that generates a cascade of different effects. An example of the first approach—several independent concauses—is the study by Cline on the collapse of the Late Bronze Age Mediterranean Civilization (Cline 2014). According to Cline, multiple negative effects occurred at the same time to generate the collapse, including climate change, earthquakes, foreign invasions, and more. An extreme example of the multiplication of causes is the study Bury published in 1923 who argued that the collapse of the Roman Empire resulted from several contingent events all occurring at about the same time. Tainter comments (2008) stating that Bury considers that “The collapse was just bad luck.”

There are several examples of the second approach, single cause followed by a cascade of related events. One is Douglas Reynolds’ interpretation of the fall of the Soviet Union in 1991 (Reynolds 2016). Reynolds attributes it to mineral depletion and, specifically, to the cascade of negative effects generated by the growing costs of oil production which affected the whole Soviet economic system. Another single-factor model of civilization collapse factor has been proposed by Joseph Tainter in his study “The Collapse of Complex Societies” (Tainter 1988) and in later papers (Tainter 2006, 2008). Tainter identifies “diminishing returns,” a well-known concept in economics, as the general factor in the decline and fall of civilizations. The idea is that, as societies become larger, more complex control structures are needed to maintain the cohesion of society and solve the problems that appear along their path. These structures can be described in terms of governments, the nobility, armies, bureaucracy, and the like. According to Tainter, as these structures become larger, they become less efficient, to the point that the economic returns they provide is smaller than their cost. At this point, society becomes unable to cope with the challenges it faces and must decline, or even collapse.

The contrast between single/multiple causes in the interpretation of the fall of societies highlights a general methodological problem. Not only data are often scarce on these historical phenomena, but their interpretation is often based on the author’s personal judgment. It goes without saying that the collapse of civilizations is not amenable to experimental studies but, even taking this point into account, one may ask how proposing a specific interpretation of the fall of—say—the Roman Empire can be justified. Here, we have several problems, including the fact that the very concept of “causation” is hard to approach in a quantitative manner (Pearl and Mackenzie 2018). Nevertheless, we can choose to rely on the basic scientific concept that the preferable interpretation of an event is not only one that’s compatible with the available data, but also which is of general validity—that is, can explain more than a single event of the same class. In this sense, Tainter’s interpretation of “diminishing returns of complexity” provides a general framework to interpret a large number of cases and it is therefore an interesting idea in view of understanding the general phenomenon of societal collapse.

In the present study, we looked at Tainter’s ideas using the modern concept of “system science” (Mobus and Kalton 2015). By using the modeling method known as “system dynamics” (Richardson 2013) we developed a simple biophysical model describing the evolution of a society. The model includes the effects of overshoot (Catton 1982) and of diminishing returns in the exploitation of natural resources. It is not supposed to describe specific social systems but to provide a “mind-sized” (Bardi 2013) model the main factors that cause collapse. We find that the complexity of the system, assumed to be proportional, to the size of a stock such as “bureaucracy” follows a trajectory that makes the model compatible to the one proposed by Tainter. That is, the system shows a hysteresis that makes its trajectory non-reversible: reducing the costs of bureaucracy does not return society to the previous conditions of prosperity. Unlike Tainter’s hypothesis, our model does not support the idea that the diminished returns of complexity are the cause of the collapse, rather it suggests that the diminishing returns of resource exploitation are the cause of the decline of the complexity of the system.

Tainter’s Model

Tainter describes his model in his 1988 book “The Collapse of Complex Societies” (Tainter 1988). Here is an excerpt from the book.

More complex societies are more costly to maintain than simpler ones, requiring greater support levels per capita. <..> It is the thesis of this chapter that return on investment in complexity varies, and that this variation follows a characteristic curve. More specifically, it is proposed that, in many crucial spheres, continued investment in sociopolitical complexity reaches a point where the benefits for such investment begin to decline, at first gradually, then with accelerated force. Thus, not only must a population allocate greater and greater amounts of resources to maintaining an evolving society, but after a certain point, higher amounts of this investment will yield smaller increments of return. Diminishing returns, it will be shown, are a recurrent aspect of sociopolitical evolution, and of investment in complexity.

The graphic representation of Tainter’s mechanism is shown in Fig. 1 (redrawn from Tainter’s book).

Fig. 1
figure 1

(redrawn from Tainter’s book)

The diminishing returns to complexity, graphically described by Tainter (1988).

Tainter’s thesis is not directly based on quantitative data or models, but historical data are used to support it. For example, the decline of the content of silver in the Roman denarius for a period that goes from the 1st to the third century AD is interpreted by Tainter as an indication that the Roman government was experiencing increasing financial difficulties. Tainter attributes this phenomenon to the increasing cost of the Roman bureaucracy, including the imperial court, resulting from the expansion of the Roman civilization during the first century BC and the first century AD.

Surprisingly, Tainter never mentions in his 1988 book the depletion of the Roman silver and gold mines in Spain, another possible cause of diminishing returns. The subject of mineral depletion in the ancient world is difficult to study for the lack of specific data on mineral production, but it is known that depletion was a problem for the Roman mines and that mining required progressively more efforts, for instance in terms of deeper mines (Edmondson 1989). Recent data show that the Roman mining of various metals and the Roman industrial activity collapsed together with the third century crisis (McConnell et al. 2018) These data do not necessarily imply that depletion was the problem, but do indicate that the lack of precious metals was due to a real decline in availability, not just something generated by the expansion of the economic system or by the growth of the population. In a more recent discussion, Tainter (2006) himself identifies the diminishing returns of mineral exploitation as related to the concept of “EROI” (energy returned for energy invested) developed by Gupta and Hall (2011). The concept of EROI cannot be directly applied to the extraction of metals and other non-energy producing minerals, but the basic mechanism that leads to diminishing EROI for fossil fuels is valid for all mineral resources. The less costly resources are exploited first, and this leads to a gradual and irreversible increase in the energetic and monetary costs of extraction. This is one of the causes of diminishing returns in a complex society (Tainter 1996).

Collapse: The Systems Science Approach

By developing the concept of “diminishing returns,” Tainter links historical collapses to the modern concept of “systems science” (Mobus and Kalton 2015) a field dedicated to the study of complex systems, all different, but all tending to show a similar behavior in similar circumstances. Complex systems are often described in the framework of system dynamics (Richardson 2013) and described in terms of “stocks” and “flows,” while the dynamical evolution of the system is determined by the concepts defined as “forcing” and “feedback.” A forcing is an external perturbation to the system which generates a series of enhancing and/or damping feedbacks, in turn defined as a property of the system for which flows are affected by the size of the stocks they connect. Enhancing feedbacks tend to amplify the forcing, damping feedbacks have the opposite effect. The result of feedbacks may well be to perturb the system to such a point that it crosses a “tipping point” (Gladwell 2002) and moves to a different state. This transition can be described as a collapse if the new state corresponds to a condition of lower complexity. In particular, the concept of feedback may explain the wide variety of attribution of societal collapses to different causes: the initial perturbation which unbalanced the system goes unnoticed when it is masked by the large feedbacks it generates. Feedbacks often act as “control” for the system, leading to a steady state condition (homeostasis) but in the models described here such a condition cannot be attained since the system runs on non-renewable resources.

In the present study, we use systems dynamics to analyze a well-known biophysical concept that we apply to the concept of civilization: the trophic chain. It assumes that a socio-economic system can be represented as an ecosystem where different trophic levels participate in the degradation, or the dissipation, of the thermodynamic potential associated with the highest potential stock (or lowest trophic level). In the case of civilizations, the lowest trophic level is the one defined as “natural resources,” e.g., fossil fuels in the case of the modern civilization. Fossil fuels have a high thermodynamic potential which can be dissipated in the form of the heat produced when the carbon and the hydrogen contained in the fuels combine with atmospheric oxygen. Then, the resulting cascade of trophic levels corresponds to different elements of the economic system: the extractive industry, the manufacturing industry, the bureaucracy, and—lastly—pollution (or waste): the end result of the economic process. This approach is part of the concept of “world modeling” developed first by Forrester (1971) and then applied to studies such as the well-known 1972 study titled “The limits to growth” (Meadows et al. 1972). A modern example of a world model is the “MEDEAS” model (http://www.medeas.eu), which aims at describing the whole world system in detail with several hundred parameters. Other models based on the same approach describe the behavior of a complex system by a small number of parameters and may be termed “mind sized” (Bardi and Lavacchi 2009). One of these relatively simple models is the “HANDY” model created by Motesharrei et al. (2014).

The model developed here is a simple trophic chain of stocks, in its simplest possible form is a two-stock model, already described in a previous paper (Bardi 2013). The model is shown in Fig. 2.

Fig. 2
figure 2

It is a convention described in Bardi (2013)

A simple 2-stock system dynamics model describing the flow and the dissipation of natural resources in a complex society. All the flows in the model are determined by constant multipliers, not shown in the diagram. Note that we represent the flows in the model as going “down” from higher thermodynamic potentials toward lower thermodynamic potentials.

The model is represented in a form that emphasizes the unidirectional flows from higher potential stocks to lower potential stocks, going “down” in the arrangement. We can say that the stock termed “The Economy” dissipates the energy potentials contained in the “Resources” stock, turning them into waste heat and non-recyclable materials. For instance, if the resource is crude oil, then the economy transforms it into all the products that can be made using crude oil, fuels, plastics, chemicals, etc. The model takes into account the entropic loss involved in the passage from one stock to the other. Note also that the model does not consider the effects of pollution, while other factors, such as human population, social structure, bureaucracy, etc., are all aggregated in the “Economy” stock.

In the model, the flow between stocks is regulated by feedbacks. We assume that each flow is linearly proportional to the size of the stocks connected by it, multiplied by a proportionality constant. It would be also possible to assume that there exist exponents to the stock sizes analogous to the ones called “elasticities” in some macroeconomic models. Here, however, we will use a simple assumption of linear proportionality.

The size of the stocks of the model can be measured in terms of energy: natural resources such as crude oil can be measured in terms of the chemical energy they contain, while other mineral resources can be measured in terms of the “embedded energy” (Odum 1973) necessary for their extraction, refining, and production. The stocks aggregated in the “economy” stock can also be described in terms of embedded energy—e.g., a piece of machinery can be measured in terms of the energy necessary to create it, including the cost of producing the materials needed. The stock sizes can also be measured in monetary units, assumed to be “proxies” of energy units. Here, no quantitative assessment of real-world stocks is attempted, but the fact that they can be all measured in the same units ensures that the transformations described in the model are physically possible.

If the natural resources are supposed to be non-renewable (i.e., the flow into the resource stock is set to zero or to a much smaller value than the outflow) this simple two-stock model generates a symmetric, bell-shaped curve for the flow of energy that goes from the upper stock to the lower stock, a flow that we may call “production.” Varying the values of the parameters does not change the qualitative behavior of the model. The production curve corresponds to the “Hubbert curve” which approximately describes the cycle of extraction of mineral resources (Bardi and Lavacchi 2009). If, instead, the natural resource stock is assumed to reform at a rate proportional to the stock size, as it happens for a biological stock, the model is equivalent to the well-known Lotka–Volterra one (Lotka 1925; Volterra 1926) and it generates continuous oscillations in the size of both stocks. Here, we assume that the Resources stock is non-renewable or slowly renewable, so we consider only one cycle of growth and decline (Fig. 3).

Fig. 3
figure 3

Typical behavior of the stocks in the “2-stock” model for non-renewable natural resources. The initial values for the stocks are assumed to be 1 for the Resources stock and 0.01 for the Economy stock. The constants determining the flows are 0.1 for the “production” flow and 0.5 for the depreciation flow

The simple two-stock model is the basis of more complex models where we consider further trophic levels. The next step involves adding a third stock, labeled as “pollution”—this model has been described in a previous paper (Bardi 2013) where it was termed the “Seneca Model” because it produces asymmetric bell-shaped curves where decline is faster than growth, corresponding to an observation put forward long ago by the Roman philosopher Lucius Annaeus Seneca (Bardi 2017).

A further step consists in adding one more level, which we term here as “Bureaucracy” supposed to be a stock that aggregates all the non-productive societal structures: the army, the nobility, the court, the priests, etc. The trophic chain, at this point, is the following.

  1. 1.

    Resources

  2. 2.

    The Economy

  3. 3.

    Bureaucracy

  4. 4.

    Pollution

The model is shown in Fig. 4.

Fig. 4
figure 4

The four-stock SD model used in this paper. All the flows are regulated by constants not shown in the diagram

The flow is unidirectional, and it goes in this way: Natural Resources → The Economy → Bureaucracy → Pollution. As in the simpler two-stock model, resources are transformed into economic capital—here referred to as “The Economy.” This capital is partly turned into “Bureaucracy” whose damping effect may be taken into account as an indirect effect caused by the drawdown of resources from the economy stock, which are not available to exploit natural resources. Note that this stock is connected to the “production” flow. We may assume that Bureaucracy may have an enhancing effect on production. In the real world, this effect could play out, as an example, by providing the extractive industry with a legal framework that allows them to exploit the resource they control without the need of defending them from competitors. This effect can be neglected in a simplified form of the model.

All stocks generate pollution, the necessary result of all the ongoing operation of society. Pollution may take the form of gases emitted by the combustion of fossil fuels, heavy metals dispersed in the environment, as well as the destruction of the fertile soil and the general disturbance of the ecosystem. The pollution stock may be assumed to abate slowly as the polluting substances are re-absorbed by the environment. All stocks in the model are affected by feedbacks which make flows proportional to the size of the stocks connected by it. Note that Bureaucracy affects production in terms of a multiplying factor assumed to be (1 + Bureaucracy). This assumption accounts for the fact that Bureaucracy is a less important factor than economic capital in facilitating production.

If the resources are assumed to be non-renewable, all the stocks of the system go to zero one after the other. However, the 4-stock model produces a more abrupt collapse than the 2-stock model. This behavior often takes the form of the “Seneca Collapse” (Bardi 2017). It is the result of a stock being subjected to a feedback-dominated drawdown by another stock, while at the same time being unable to maintain a replenishing flow from a depleted stock (this can be termed the “candle burning at both ends” effect).

A robust feature of the model is the how the decline in the stock of natural resources is related to a cycle of growth and decline of the other stocks. In most cases, the decline turns out to be faster than growth, a characteristic defined here as “collapse.” Note also that the behavior of the model can vary depending on the initial assumptions. In some cases, the bureaucracy stock may go to zero before the capital stock does. In this case, we may see the model as describing a civilization that loses some of its centralized control structure (e.g., an imperial court) and moves toward simpler delocalized structures. It could describe how the Middle Ages feudal structure developed after the collapse of the Western Roman Empire. A different behavior may also occur, that is the bureaucracy stock surviving the collapse of the capital stock. In this case, society maintains for a certain time an overgrown central control structure while the productive structures have disappeared. This might be the case of the Eastern Roman Empire which saw the capital city of Constantinople surviving even though most of the territory of the empire had been lost. In this respect, the model behaves in a way similar to that of the “HANDY” model (Motesharrei et al. 2014).

The question is now how these models can be related to the qualitative descriptions given by Tainter. A dynamic stock and flow model such as the one developed here does not have a parameter that can directly describe complexity, but we can find a proxy for this entity. We note first that Tainter (1988) describes complexity as:

Complexity is generally understood to refer to such things as the size of a society, the number and distinctiveness of its parts, the variety of specialized social roles that it incorporates, the number of distinct social personalities present, and the variety of mechanisms for organizing these into a coherent, functioning whole.

From this definition, it seems clear that complexity is related to the size of the entities that Tainter labels as “parts of the society.” In the case of the four-stock model, the size of the “bureaucracy stock” can be reasonably assumed to be a proxy for the overall complexity of society. This stock is surely composed of people whom Tainter describes “specialists not directly involved in resource production.” For the simpler 2-stock model, bureaucracy is embedded in the “economy” stock and we can take the whole stock as a proxy for complexity.

Then, what should we see as “benefits of complexity” in the model? The stocks of the system can be replenished only as long as the capital stock is large enough to extract energy and materials from the resource stock. We may also note that the “production” flow correspond to the aggregate concept that in the real world is referred to as “gross domestic product” (GDP) and it is generally believed in the ongoing debate that the larger this flow is, the more people are happy, (of course, there are other indexes that can be used to measure people’s happiness, but that is outside the scope of the present discussion). Therefore, we can quantify the benefits of complexity using the “production” parameter as a proxy.

On the basis of these consideration we can use the model to describe the diminishing returns to complexity by plotting production as a function of the size of the bureaucracy/the economy stock. We do this first for the simplified 2-stock model (Figs. 6, 7) and then for the more complex 4-stock model (Fig. 8). In both cases, the parameters are those described in the captions for the Figs. 4 and 5. The results are qualitatively similar. The obtained plot can be directly compared to the plot proposed by Tainter (1988).

Fig. 5
figure 5

Typical behavior of the 4-stock model. In this figure, all the four stocks are shown. The initial values are 1 units for “Resources,” 0.1 units for “The Economy,” 0.01 units for “Bureaucracy,” and 0.001 units for “Pollution.” The constants are 0.38 for the “Production” flow, 0.15 for the “Depreciation” flow, 0.3 for the “Bureaucracy Creation Flow” and 0.5 for the “Bureaucracy Decay” flow. All the other flows constants are assumed to be zero—in other words the resources are assumed to be non-renewable and pollution to be persistent

Fig. 6
figure 6

A comparison of the results of the two-stock model reported here and Tainter’s model as reported in his 1988 book. The two curves are not identical, but the similarity is evident

Fig. 7
figure 7

Production vs “The Economy” for the two-stock model. The values of the constants are the same as those described in Fig. 3

Fig. 8
figure 8

The curve of Production vs. Bureaucracy for the 4-stock model. The values of the constants in the model are the same as those in Fig. 5. This diagram has a different shape in comparison with the 2-stock model, but the qualitative dependency of the two stocks, bureaucracy and production, remains the same: growth and decline, followed by an inversion in the trend

The curve for the simpler 2-stock model is qualitatively similar to the one proposed by Tainter, (Fig. 6) although the two-stock model shows the whole loop (Fig. 7) whereas Tainter’s illustration shows only the growing phase of complexity. Note the hysteresis of the curve: the bureaucracy stock declines after having reached a maximum value, but its relation to the productivity of the system does not go back to the earlier values. In other words, halving the costs of bureaucracy during the decline phase does not lead back to the same conditions of when the system was growing. The behavior is qualitatively similar for the 4-stock model although, in this case, Bureaucracy initially favors growth but later on becomes a burden. This phenomenon appears to correspond to the current conditions of the modern society. Governments everywhere are cutting their bureaucratic expenses, but the system is not returning to the efficiency of the earlier times.

We may also use a more detailed model to check the behavior that we observe for our “mind sized” model. A well-known such model is the one called “World3,” which was used for the 1972 study “The Limits to Growth.” (Meadows et al. 1972). We used the most recent available version of World3 to calculate a Tainter plot for the “business as usual” or “base case” scenario. The World3 model has an the “industrial output” that we may consider a proxy for Tainter’s “benefits of complexity.” It does not have a “bureaucracy” or “complexity” stock but, after examining the model, we located a parameter that can be considered as a proxy for complexity: “Relative Services Output.” Indeed, the production of services is a task for the bureaucracy sector and it can be considered proportional to its size. The results are shown in Fig. 8 and are very similar to those obtained using the simpler models described here: we can see the typical hysteresis of these phenomena (Fig. 9).

Fig. 9
figure 9

Tainter plot for World 3 where the relative industrial output is reported as a function of the relative services output

A real-world example of this behavior can be found with the Roman empire where the data have been taken from Sverdrup et al. (2013) who studied the connection between resources depletion and the stability of Roman civilization. In particular, the depletion of silver content in coins that started at the beginning of the first century AD continued well into the third century AD. As a consequence, the social structure of the empire, such as the army, became so expensive that it was impossible to maintain them at the previous level of organization and size. This phenomenon reflects exactly the concept of diminishing return of complexity: the state becomes larger and larger non-producing social structures are needed to maintain it. This is true until the expenses to maintain the bureaucracy and the army do not become so high to be unsustainable. In this case, we took the “cash flow” as a measure of the productivity of the Roman Empire and the army size as proportional to the size of the non-productive sector of the economy. The results are shown in (Fig. 10), qualitatively showing the same hysteresis behavior produced by the models.

Fig. 10
figure 10

Data from Sverdrup et al. (2012)

Tainter plot for the Roman empire where the relative cash flow in coinage (state income) is reported as a function of the relative Roman army size. The cash flow is taken as a proxy for Tainter’s “benefits of complexity” whereas the army size is taken as a proxy for the complexity of the state system.

A similar behavior could be observed in the dynamic of exploitation, or better said, in the dynamic of overexploitation of fish (Pauly and Zeller 2016; Watson et al. 2013; Cai and Leung 2017; Carpenter et al. 2016; Zeller et al. 2018). Using the data reported by Perissi et al. (2017), we found that the behavior of the “production” flow as a function of the size of the fishing industry (complexity) provides also a similar curve to the others shown in this paper.

Conclusion

The models studied here are not supposed to describe specific cases of the collapse of human societies, rather they are thought of as a simplified playground to examine the effect of some parameters on the trajectory of complex systems intended as dissipative structures based on finite or slowly renewable resources. The models are based on a simple concept: that of the trophic chain. If we assume that the natural resources available are non-renewable, as they are in the case of mineral resources (e.g., gold and silver for the Roman Empire, fossil fuels for the modern global empire), the disappearance of the trophic structures exploiting the resources is unavoidable—unless new resources can be found. The same is true for those resources which are slow to renew in comparison to the rate of exploitation. The models tell us how the dissipation of the natural resources goes by the progressive filling and emptying of the stocks at lower thermodynamic potential—every step implying a loss of exploitable potential energy which disappears in the form of pollution, e.g., low temperature heat. This phenomenon generates “bell-shaped” curves for the filling/emptying of the stocks. These curves can also take the “Seneca shape” (Bardi 2017) when the decline is faster than the growth. As this phenomenon goes on, the stocks interact with each other.

The time delay in the filling/emptying of the stocks generates a trajectory where stocks move in the phase space along a hysteresis curve. The system continuously evolves in an irreversible manner and it can never return to an earlier condition, unless the resources are assumed to be renewable and, in this case, the system circles around an attractor in phase-space. In other words, simply reducing the size of the “Bureaucracy” stock will not return the system to a condition in which it was during the growing cycle, an observation which seems to correspond to the current situation. This behavior qualitatively corresponds to Tainter’s proposal about the relationship between complexity and productivity of a social system and the curves produced by the model are well comparable to the qualitative ones that Tainter reports in his work (Tainter 1988). However, there is a fundamental difference: the models tell us that the decline in complexity of the system is the result of the diminishing returns of the exploitation of natural resources rather than, as proposed by Tainter, the result of an intrinsic property of diminishing returns of complexity itself. It is nevertheless remarkable how Tainter’s insight can be reproduced by dynamical modeling.