"Theory without data is sterile, while data without theory is uninterpretable."

Levin (1989), page 244.

"The progress of ecosystem science is limited simultaneously by both theory and methods."

Sala and others (2000), page 1.

In the proceedings of the 7th Cary Institute of Ecosystem Studies Conference (Pace and Groffman 1998), Steve Carpenter argued that multiple lines of inference are necessary to understand how ecosystems function, drawing on the metaphor of a four-legged table composed of theory, observations in space, observations in time, and experiments (Carpenter 1998, Figure 1A). We agree with this overall premise, but would like to elevate the importance of theory using an alternative metaphor (Figure 1B), in which theory is the basis for ecosystem science (as shown by the “floor”), interacting with a three-legged stool in which theory is tested using data generated by experiments, observations, and quantitative modeling (see Box 1 for definitions), and then modified as needed in response to data in an iterative process. Given this metaphor, we propose that (1) couching research in a theoretical context accelerates the efficiency and advancement of ecosystem science; (2) experiments, observations, and quantitative models provide complementary tests of theory that can inform applied solutions; (3) graduate students should be exposed to theory and to each of these approaches during their training even if they anticipate focusing on a particular approach; and (4) opportunities abound to better integrate theory and multiple approaches for collecting data into graduate training, at individual to institutional levels. Below, we explain our thinking on these issues.

Figure 1
figure 1

Metaphors for how ecosystem science should operate. A Carpenter’s (1998) “table”. B Our proposed alternative, which makes theory the foundation upon which to place data collection that relies on the interplay of experiments, quantitative models, and observations. Failure to include a strong theoretical foundation, or to balance different approaches to data collection, may lead to falling off the stool. In addition, the floor itself (that is, theory) is shaped by the combined activity and resulting findings from experiments, models, and observations.

Box 1 Working definitions used in this paper for theory, models, observations, and experiments

The Current Role of Theory in Ecosystem Science

Theory provides a critical framework for organizing fundamental scientific assumptions, uniting seemingly disconnected concepts with underlying principles, comparing empirical data to mathematical expectations, and generalizing findings from disparate study systems (Marquet and others 2014) and across spatial and temporal scales (Carpenter 1998). However, one of our Dartmouth colleagues likes to spark discussion by charging that ecosystem ecologists fail to engage theory. A cursory analysis using Web of Science suggests that there may be some validity to this claim. We calculated the annual percentage of papers containing the topic ‘theor*’ in all papers published in the journals Ecosystems, Ecology, and The American Naturalist from 1998 to 2015 (Figure S1). On average, Ecosystems included a lower percentage of theory-based papers (7%) than Ecology (17%) or The American Naturalist (27%), and since its first publication year, the percentage of papers with the topic theor* in Ecosystems has declined, whereas it has increased slightly over the same time frame in both Ecology and Am Nat (Figure S1). This analysis surely overlooks papers that engage theory without explicitly stating the term and fails to characterize the extent of engagement with theory (Scheiner 2013). At face value, however, it suggests that papers in Ecosystems engage with theory both less often and increasingly less frequently than two other leading ecological journals.

Towards a Greater Integration of Theory and Data in Ecosystem Science

Buoyed by the contributions of theory to other areas of ecology (Kendall 2015), we encourage ecosystem scientists to put theory to greater use in informing project development and data collection. The most pragmatic way to approach a particular problem in ecosystem science (or indeed, any science) is to start with an understanding of how relevant theories relate to the specific problem at hand, in order to develop a framework for understanding that can be brought to bear on a particular problem. For example, if a new graduate student is interested in investigating the biogeochemical response of northeastern forests to climate warming, there is no need to “start from scratch.” A wealth of theories already provides a valuable perspective to guide the start of this inquiry (Suppl. Material, Appendix 2), including the metabolic theory of ecology (Brown and others 2004), biological stoichiometry (Sterner and Elser 2002), biodiversity-ecosystem function (Loreau and others 2001), and plant-soil feedbacks (van der Putten and others 2013).

Integration of theory and data provides many benefits. First, it enables the research to become an immediate part of an existing structure of knowledge and minimizes the risk of missing relevant work that can inform best approaches to addressing a particular question. These benefits can accelerate scientific progress, which is increasingly important giving funding constraints. Moreover, beyond providing qualitative background information, theories with some overlapping assumptions or postulates may provide alternative pathways to explaining similar phenomena (Chamberlin 1965). Good theory also explicitly identifies the independent and dependent variables necessary to verify assumptions and test predictions and leads the researcher toward appropriate methodological approaches, which may include experiments, observations (across space and through time), and quantitative models. Thus, framing ecosystem science questions in the context of relevant theories streamlines research planning and maximizes limited resources. Moreover, theory provides a common language through which scientists with different methodological skill sets (for example, modelers vs. empiricists) can communicate.

Once data are collected, the challenging process commences of determining how resulting information supports or challenges the motivating theory. Theories are approximations of some true underlying process and therefore are in continual need of refinement (Marquet and others 2014). Integrating data and theory may be as simple as qualitatively confirming that, under the original set of assumptions, predictions from theory hold in a new study system. Alternatively, data may lead to increasing generality of a theory by requiring mathematical tweaks to theoretical postulates or revision, addition, or rejection of the underlying assumptions. Exciting advancements in science often occur when data don’t fit theoretical predictions despite adhering to underlying assumptions (Duarte and others 2003). In such cases, data may drive the development of a new competing theory.

Research projects in ecosystem science will vary in their extent of engagement with theory [for example, drawing on vs. developing vs. testing theory; sensu (Scheiner 2013)], depending on the question and empirical findings at hand, the scientists’ area of expertise and skill sets, and the degree to which the study system meets the general assumptions of some theory. We don’t expect every project to require the development of new theory; instead we suggest that drawing on theory to inform hypothesis development and data collection serves as the least common denominator in ecosystem research. In short, because incorporating theory provides benefits at the level of the individual researcher and the scientific field, it should underpin all research projects, regardless of the methodological approach employed.

Improving Exposure to Theory, Observations, Experiments, and Quantitative Models

We believe that exposure to the major theories of ecosystem science (for example, Box S1) and to each of the major approaches for data collection (quantitative models, observations, and experiments) is an essential part of a well-rounded training program. Even if a particular student plans to rely primarily on either modeling or empiricism, understanding the strengths and limitations of the other approach(es) will improve one’s own project as well as one’s ability to communicate more effectively within collaborative research teams. Because quantitative models are abstractions of reality, only as good as the input data and assumptions used to construct them (Duarte and others 2003), much can be learned from the mismatch between model predictions and empirical data. It is important for modelers to know how the empirical data used to develop and test models are obtained, to be able to assess the potential limitations of the data (and model) and to meaningfully interpret a predicted difference across different modeling scenarios. On the other hand, it is also important for empiricists to know how their data are (or might be) used by modelers, as well as what the limitations of models might be. Understanding how quantitative models are developed may allow empiricists to tweak and fine-tune data-collection protocols, thereby maximizing the likelihood of data being used in future models. For example, designing experiments to collect data on specific quantitative, rather than nominal, treatments can facilitate model development (Cottingham and others 2005).

Additional advantages of broad exposure to theory, quantitative models, observations, and experiments include developing a strong understanding of the flow of information between data (whether from observations, experiments, or quantitative models) and theory. Trying different approaches also offers early career researchers opportunities to determine what they like and what they are good at, which may inform both their dissertations and long-term career trajectories. Further, we hope that broad exposure promotes open-mindedness about the complementarity of different approaches, which may facilitate cross-talk among researchers with different specialties—an important skill-set for conducting the kinds of interdisciplinary, team-based projects typical of ecosystem science.

Importantly, we are not advocating top-down mandates whereby all graduate students need whole dissertation chapters devoted to each type of research approach, that is, dissertations with a chapter each on modeling, field observations, and mesocosm experiments. Rather, there are less-prescriptive ways to introduce trainees to different approaches and to build a strong skill set in at least one of these areas (Table 1). Moreover, exposure does not have to happen solely in graduate school; it can begin as an undergraduate, continue between degrees, and extend into postdoctoral research. In fact, intentional exposure may not even be needed, if program directors develop structures that deliberately nudge students toward experiencing multiple approaches and toward being opportunistic about unexpected opportunities (Table 1).

Table 1 Strategies to Help Graduate Students Become Exposed to, and Comfortable with, Both Theory and a Breadth of Research Approaches

Synthesis

In conclusion, we suggest that aspiring ecosystem scientists learn the core theories of ecosystem science and use them in framing their research questions; get exposed to and become comfortable with experiments, observations, and quantitative models; build a strong skill set in at least one of those approaches; and be open to opportunities to learn new things when needed. Although many opportunities exist for advisors and programs to nudge graduate students toward integrating theory and multiple approaches into their training (Table 1), ultimately the onus is on individuals to develop the self-confidence to be fearless about crossing disciplinary boundaries to learn whatever new tools and approaches are needed to address a particular research question.