Introduction

Ecological communities typically comprise numerous species, and their biological diversity can be remarkably rich. This high biodiversity has always fascinated ecologists and conservationists. The ‘world records’ of plant species richness are held by tropical rainforests at larger spatial scales and by temperate grasslands at smaller spatial scales (Wilson et al. 2012). Differences in species diversity of ecological communities have been attributed to natural processes and anthropogenic impacts (Lepš 2005). However, simple comparisons of raw numbers of observed species offer little insight, since they fail to reveal how many suitable species are absent (Pärtel et al. 2011), or, in other words, how complete communities are.

Each ecological community has its species pool – a set of species that can potentially reach and inhabit it (Eriksson 1993; Zobel 1997). The fraction of the species pool absent at any particular point in time is called dark diversity (Pärtel et al. 2011). Which species constitute the species pool? Firstly, the probability that a species can disperse to a given community must be reasonably high. In this respect, the arrival of species inhabiting other regions or even continents is unlikely. We should therefore include only species already present in the region surrounding a given community. Secondly, only a fraction of species able to disperse to a community can actually tolerate the community’s particular environmental conditions. In other words, membership in a species pool demands a match between habitat requirements of species (beta niche, habitat niche or Grinnellian niche) and environmental conditions in a local community. As a result, species pools are habitat-specific, and different community types within the same region have different species pools. Species pools are formed by speciation, which often occurs in areas beyond the location of our target community. Because of niche conservatism (Wiens et al. 2010; Peterson 2011), however, species inhabit regions with conditions analogous to those of their evolutionary origin. Species have reached new areas through historical migrations between and within continents. How many species from the regional species pool actually form a community depends, in turn, on dispersal limitations within the region and on biotic interactions within the community, which might exclude some species and support others (known as assembly rules, cf. Götzenberger et al. 2012). Biotic interactions are not limited to competition, although competition has been considered historically to be among the most important interactions, but also on interactions between plants and organisms of other tropic levels, which might exhibit a positive effect (such as mycorrhizal fungi or pollinators) or a negative one (such as predators). Last but not least, the presence or absence of particular species in a particular community is also affected by an array of biotic and abiotic stochastic variations (e.g. population dynamics, disturbances through fire or flooding). As summarized by Vellend (2010), community ecology is governed by only a few principal processes: speciation (determining the number of species and the beta niche of species), dispersal (both historical migration and dispersal within a given region), selection (both abiotic filtering and assembly rules) and drift (abiotic and biotic stochastic variation).

Only two numbers, the number of species actually present and the number of species potentially present, determine the completeness of a local community. Dark diversity, though locally absent, constitutes an important characteristic of a local community since these species are both present in the region and can potentially inhabit the site. Analogous ideas have been suggested earlier, but here we introduce and formalize the community completeness concept, introduce the Community Completeness Index and suggest practical strategies of how it can be applied in theoretical ecology and nature conservation.

Community Completeness: The Concept

The idea of community completeness (and even the term itself) originates from early Russian plant community ecology. Ramensky (1924) defined completeness (in Russian полночленность, ‘polnochlennost’) as the situation in which all potential inhabitants that are able to disperse into a community and inhabit it (i.e., pass both abiotic and biotic filters) are actually present in the community. This concept was further developed and discussed by Rabotnov (1984), who mentioned that the degree of community completeness could be assessed with the help of field experiments involving species introductions. However, to the best of our knowledge, no empirical analysis of such experiments exists. In another work, Rabotnov (1987) distinguished between floristic completeness (the degree of presence of potential species) and coenotic completeness (whether abundances of species correspond to typical natural conditions). He also emphasized the need to address community completeness at different trophic levels, not only in plants. The English-language scientific literature contains only a single article containing the term ‘community completeness’ in its title (a study of bird communities, Cam et al. 2000). Community completeness metrics (relative richness, see below) have more often been implicitly used in studies concerning regional species pools (e.g., Zobel and Liira 1997; Ingerpuu et al. 2001; Pärtel et al. 2007a).

The species pool concept originated as a hypothesis suggesting that local richness in communities is related to the historical and regional commonness of a habitat type (Taylor et al. 1990), or in other words, how many suitable species are present in a region (Zobel 1992; Eriksson 1993; Pärtel et al. 1996). We advocate a more general species pool concept, which declares that local communities can be better described with respect to their species pools. Moreover, even community assembly rules governed by local biotic interactions can sometimes be revealed efficiently only after prior consideration of the species pool (de Bello et al. 2012; Lessard et al. 2012). The main purpose of the species pool concept is to estimate which species composing the species pool are absent and therefore form dark diversity.

Community completeness is a new perspective on how to advance the species pool concept. In essence, completeness expresses how much of the species pool is realized within a local community. Expressing community completeness on a relative scale facilitates comparisons among similar communities in different geographical regions, among different communities within a region and among communities of different trophic levels, for example, insects, birds and plants (Pärtel et al. 2011).

The locally absent component of the species pool – dark diversity – shares characteristics with the concept of beta-diversity, yet it is fundamentally distinct (Pärtel et al. 2011). Both dark diversity and beta-diversity are derived from common local richness or alpha-diversity. Beta-diversity is determined by gamma-diversity whereas dark diversity is derived from the species pool. Gamma-diversity is basically just the sum or an extrapolation of the number of species encountered locally, that is, directly related to alpha-diversity. The species pool, by contrast, is determined by regional distribution and habitat requirements of individual species and is principally independent of which species are actually found locally. Moreover, beta-diversity was originally defined as the extent of community composition change across environmental gradients, for example, altitude (Whittaker 1960). This means, however, that species pools for these particular communities differ as well. Species that inhabit low altitude zones, for example, cannot be considered as dark diversity for high altitude communities, yet both constitute beta-diversity. Dark diversity always entails ecologically ‘filtered’ sets of species that can potentially inhabit a particular local community, whereas beta-diversity is defined by all species which occur in the study region but are absent from a particular locality. This difference is also highly relevant for nature conservation: High beta-diversity is typically considered a good indicator of biodiversity at the landscape scale, whereas high dark diversity is rather an indicator of community impoverishment (see below).

One question related to the species pool concept is whether local communities are unsaturated or saturated with species (Cornell and Lawton 1992). In unsaturated communities, local richness is unaffected by local biotic interactions and increases proportionately with species pool size (Szava-Kovats et al. 2012). In saturated communities, by contrast, biotic interactions cause local richness to gradually reach an upper asymptote limit with increasing species pool size. The concepts of community completeness and community saturation are related yet distinct. Community completeness can be defined for a single local habitat (Fig.  1 ). However, to recognize saturation requires observations across multiple sites. A test for saturation determines whether community completeness is negatively related to species pool size. Community completeness indicates how far a community is from its potential diversity. It does not address how this diversity is regulated, however. Community saturation, by contrast, indicates how biotic interactions regulate the number of coexisting species. It does not reveal how much of the species pool is realized within a local community. Consequently, both measures are useful, but for different purposes.

Fig. 1
figure 1

The species pool concept examines local species richness with respect to the part of the species pool that is absent – dark diversity. Local richness is always a subset of the species pool, so all communities lie below the 1 : 1 line. Community completeness expresses how much of the species pool is realized within a local community, i.e. how close a community is situated to the 1 : 1 line. Community completeness can be assessed for single communities. Community saturation addresses the shape of the relationship between species pool size and local communities and requires several communities. Community saturation examines whether community completeness varies with species pool size

Community Completeness: Methods

In order to evaluate community completeness, we need to estimate the extent of dark diversity, i.e. compile a list of species potentially able to live and reproduce in a given community but which are currently absent. One method to estimate the size of dark diversity is to draw on available biogeographic and ecological information. The first step is to define a region from which species can potentially reach a given community. This region can be a political region that contains an essentially ‘complete’ flora or fauna, for example, a small country, a county or a nature reserve (Pärtel et al. 1996). A more logical approach, however, is to include species from a single relatively uniform biogeographic region (Pärtel et al. 2011). Provided that distribution ranges of individual species are known, circular areas with defined radii can been used (Graves and Gotelli 1983), although more sophisticated approaches, such as dispersion fields (Lessard et al. 2012), might be preferable. The probability of arrival of a species in a community can be further specified by using species’ regional abundance and dispersal potential (Lessard et al. 2012). The second step is to ascertain the ecological requirements of species in the regional flora or fauna. This can be determined by inventorying similar habitats, using habitat suitability models created with GIS (Guisan and Rahbek 2011; Mokany and Paini 2011), and analysing species co-occurrence patterns (Ewald 2002; Münzbergová and Herben 2004). Regional lists of species can be filtered for ecological requirements through (semi)quantitative habitat requirement characteristics (e.g. Ellenberg indicator values, de Bello et al. 2012) or measured traits (Sonnier et al. 2010). If communities can be assigned to predefined habitat types, local expert knowledge can be applied to define species pools for each habitat type (Sádlo et al. 2007; Zobel et al. 2011).

Dark diversity can also be determined experimentally. For example, early Russian works suggested that completeness of plant communities can be estimated experimentally by sowing seeds of a large number of species (Ramensky 1924; Rabotnov 1987). Seed sowing experiments have been a common tool in plant community ecology for many years and have provided valuable insights into the habitat preferences of species as well as into the role of dispersal limitation in determining local diversity (e.g. Myers and Harms 2009; Vítová and Lepš 2011). It is exceedingly difficult, however, to test a very large number of species experimentally. For long-living plant species, germination might also need a proper “window of opportunity” (Eriksson and Fröborg 1996); failure to establish after sowing does not necessarily exclude a species from the species pool. Initial establishment of a species does not guarantee long-term persistence either (Gustafsson et al. 2002). Some species may die out during more extreme years (e.g. because of drought or cold), while other species may become locally extinct due to long-term pressure from predators. Moreover, experimental introduction is technically demanding when animal communities are concerned. Although experimental determination of the size of dark diversity sounds appealing, its practical difficulties may overshadow its advantages.

The estimation of dark diversity is always confounded by whether some species are really absent from the community or whether they simply have evaded detection. Studies of animal communities have suggested that expected richness be used rather than raw inventories (Cam et al. 2000). Sampling sufficiency can be estimated using sampling intensity-richness curves and Jackknife or Chao estimates of richness (Gotelli and Golwell 2011). Some species in a community can be detected only when active (e.g. butterflies, Sang et al. 2010). Some plant species can be dormant and hidden in the soil. Molecular identification of roots can offer a solution for this (e.g. Hiiesalu et al. 2012). Environmental meta-genomics is a developing field, which may detect rare species in ecosystems from DNA samples (Taberlet et al. 2012).

Relative richness has traditionally been expressed in terms of simple proportions, for example, observed richness/species pool, which are constrained to values between 0 and 1. Differences in this ratio reflect absolute differences in observed richness. We argue that relative differences are more meaningful: an increase from 5 to 10 observed species is not equivalent to an increase from 50 to 55 species but rather an increase from 50 to 100 species. Ln(observed richness/species pool), which is constrained to values <0, expresses differences in observed richness on a relative basis, but is mathematically incoherent, inasmuch as the difference between two dissimilar communities depends on whether we compare completeness or incompleteness. Dark diversity is by definition species poolobserved richness, yet differences expressed as ln(observed richness/species pool) are not necessarily equivalent to those expressed as ln(dark diversity/species pool). We therefore introduce the Community Completeness Index based on a log-ratio (or logistic) expression, ln(observed richness/dark diversity) (sensu Szava-Kovats et al. 2012). This expression preserves not only a relative description of a difference but maintains the same magnitude with respect to either observed richness or dark diversity. Moreover, this metric places data on an unbounded line in real space (theoretically from –∞ to + ∞), which is a fundamental assumption in all forms of conventional statistics (Bacon-Shone 2011). Our logistic Community Completeness Index cannot be applied if either local observed richness or dark diversity is totally absent. This, however, is basically just a theoretical hindrance. In nature we typically cannot define a community without species. An empty site might seem like a totally ‘incomplete community’ after a major disturbance such as poisoning or a volcanic eruption, but even then we would expect species to arrive soon thereafter. Similarly, it is not very likely to have a fully complete community with zero dark diversity; even a site at which the entire species pool is represented at one particular moment is likely to exhibit temporal species loss with multiple sampling.

Just as species pools can be approached hierarchically depending on the spatial scale of interest (Zobel 1997), so can community completeness. Species pools are typically defined at the regional scale (regional species pool), but it is possible to define species pools at smaller scales as well. For instance, if we are interested in how communities assemble within landscapes during shorter time periods (say decades), a local species pool can be defined by including species from the surrounding landscape within a radius of a few kilometres. This approach can be useful if our aim is to explore how dynamic landscapes influence species arrival and persistence (e.g. through loss of habitat size and connectivity). Furthermore, if we want to study community assembly processes within a community, we can address local richness in a fixed sample (e.g. 1 × 1 m sample in a grassland) and consider all species actually present in a habitat patch as a community species pool. Small-scale community completeness can indicate a response to biotic interactions, including anthropogenic effects.

Community Completeness: Applications in Ecological Theory

In order to understand local communities, it is important to know why their completeness differs. Various processes can ‘sentence’ potential species to be part of dark diversity: dispersal limitation, biotic interactions or simple stochastic variation.

We can expect that areas that have experienced long-term (evolutionary, geological) stability will be more complete than communities in areas which are comparatively young (e.g. post-glacial) simply because stable sites have had more time to ‘collect’ arriving species from the surrounding region. In Northern Europe many species have not yet reached their climatically suitable areas, probably due to dispersal limitation and the short time since the last Ice Age (Normand et al. 2011). In addition, evolutionary older communities on oceanic islands likely have a greater share of endemic species, which also comprise the regional species pool (Zobel et al. 2011).

We can also expect successionally stable communities to be more complete. Succession entails that species themselves modify environmental conditions through soil formation (e.g. primary succession in areas newly exposed by tectonic uplift) and through decreasing light availability (e.g. old-field succession). Early and late successional stages might therefore be represented by different species pools (Pykälä 2004). Indeed, during succession the species pool associated with early successional stages is replaced by that of later successional stages. Dynamic plant communities may have a transient rich mixture of species from different species pools, but long-term persistence of these species is impossible. Consequently, completeness is low regardless of how the species pool is defined: many species from the early-successional species pool are already lost, but species from the late-successional species pool have not yet arrived.

Community completeness is clearly linked to the landscape patterns surrounding the community. Large and well-connected local communities are expected to be more complete because local extinctions are less likely and the arrival of new species from neighbouring habitat patches is more probable (MacArthur and Wilson 1967). If different habitat types are present within a landscape, community completeness can be compared among them. For example, if a stable landscape pattern has changed recently, community completeness likely reflects extinction debt or colonization credit (Kuussaari et al. 2009). This, however, might be different for plant and animal communities (Brändle et al. 2003; Krauss et al. 2010).

Occasional atypical disturbances within communities can weaken local populations, leading to local extinctions and consequentially to a decrease in community completeness. Disturbed microhabitats are occasionally invaded by ruderal or opportunistic species, thereby increasing local species richness (Moora et al. 2007; Questad and Foster 2007; Schnoor and Olsson 2010). This increase, however, will not increase completeness because these invading species do not belong to the original species pool of the target community. Instead, they belong to the species pool of disturbed habitats. At the same time, a historically constant disturbance regime might be associated with successional stability and thus represent a ‘natural part’ of local habitat conditions (Moles et al. 2012). For instance, semi-natural calcareous grasslands (alvars) on thin calcareous soil experience regular disturbances caused by animal grazing and frost upheaval of the topsoil but still exhibit high vegetation diversity and community completeness (Pärtel et al. 1999; Rosén and van der Maarel 2000). These disturbances are historically relevant and form part of the ‘normal environment’ of this grassland type. Cessation of grazing reduces the level of local disturbance but also leads to overgrowing of open grasslands with scrub and a decline in diversity.

Community completeness depends on negative and positive biotic interactions among members of a community. Local richness in temperate regions is often unimodally related to habitat productivity (Pärtel et al. 2007b). This relationship was initially attributed to competition; that is, weaker species are out-competed when productivity is high (Al Mufti et al. 1977). Subsequent studies have additionally attributed lower species richness in highly productive communities to a smaller species pool (Taylor et al. 1990; Pärtel et al. 2007b). Zobel and Liira (1997) addressed small-scale community completeness (referred to as ‘relative richness’) and still found a unimodal relationship with habitat productivity, although the relationship is more pronounced when raw richness values are used. This finding suggests that both the species pool and competition might play a role in producing the unimodal diversity-productivity relationship.

Community completeness depends on other trophic levels in the ecosystem. For instance, positive and negative microbial feedback mechanisms both affect the number of coexisting plant species (Bever et al. 2010), thereby changing the completeness of communities. The actual numbers of coexisting plant species may also depend of herbivory (Hillebrand et al. 2007), pathogens (Allan et al. 2010) and other biotic interactions. It is, however, too early to ascertain whether general patterns of community completeness depend on the impact of other tropic levels.

Humans can act both as predators/parasites and mutualists in ecological communities, and anthropogenic influence can be a major determinant of community completeness. Small-scale community completeness in semi-natural alvar grasslands in Estonia is unimodally related to current human population density, indicating that extreme human influence (be it too low or too high) is unfavourable (Pärtel et al. 2007a). Community completeness might therefore be a valuable indicator for nature conservation.

Community Completeness: Applications in Nature Conservation

Community completeness allows meaningful comparisons of local diversity across communities from different regions, landscapes, vegetation types, successional stages as well as across different trophic levels. Community completeness has important implications for conservation. No single principle can be applied when selecting areas for conservation (Tuvi et al. 2011), but community completeness provides a useful metric. Although it is prudent to protect all major habitat types within a territory, community completeness allows identification of the most complete locations for each habitat type.

Community completeness might be a vital metric indicating the status of local biodiversity in natural communities (de Bello et al. 2010). It can also quantify the success of ecological restoration for different regions and different trophic levels (Suding 2011). Restoring a lake, for example, can entail reintroduction of fish species and macrophytes, while invertebrates and algae are expected to disperse spontaneously. The community completeness of all these trophic levels can be compared in order to assess restoration success at the ecosystem level.

For conservation purposes, it is valid to distinguish between native and alien species since recent human-induced invasions pose an increasing threat to local biodiversity. A long-standing debate revolves around whether diverse plant communities are more resistant to biological invasions than less diverse ones (Fridley et al. 2007). The use of relative measures will provide a dimensionless approach to addressing this question. Akatov and Akatova (2010) showed that higher completeness in plant communities is associated with lower invasibility of alien species. Recently, Catford et al. (2012) suggested two general relative measures of ecosystem invasibility, relative alien species richness and relative alien species abundance. Future studies need to examine whether relative alien richness and abundance are significantly dependent on community completeness.

Conclusions

Species diversity and coexistence have been major topics in both basic and applied community ecology (Lepš 2005). Community completeness explores how many species from the species pool are actually present locally and how many are absent. The absent part of the species pool – dark diversity – can be estimated by means of various ecoinformatics and experimental methods. We introduce the Community Completeness Index as a statistically sound log-ratio expression. Community completeness can be linked to the theory of ecological communities and can be utilized in conservation biology, where it can serve as a suitable indicator for nature conservation prioritization and for ecosystem restoration monitoring or as a measure of how biodiversity responds to different threats.