Introduction

Large-scale patterns in body size diversity have been studied extensively from both the paleontological and macroecological perspectives. One widely cited trend in the fossil literature is an increase in body size within lineages over time known as Cope’s rule (Cope 1877; Newell 1949; Damuth 1993). A net increase in average size over macroevolutionary time scales has been observed in many, although not all, animal taxa (Polly 1998; Jablonski 1997a; Knoll and Bambach 2000; Hone and Benton 2005). Stanley (1973) noted that an evolutionary expansion into an ecologically or physiologically bounded size space from a relatively small ancestral size would result in a passive trend (sensu McShea 1994) towards larger average size within lineages (hereafter referred to as the ‘Stanley effect’). Indeed many vertebrate taxa are thought to have originated at a relatively small size as compared with the range of sizes into which they eventually diversified (Alroy 1998; Shu et al. 2003; Laurin 2004). May (1988) argued from theory that a simple size diffusion model with an absorbing (i.e. not reflecting) lower boundary, with size change resulting from many multiplicative anagenetic events, will produce log-normal rather than log-skewed distributions, and (Maurer et al. 1992) confirmed this result using simulation studies.

Yet in most taxonomic groups, the majority of species have relatively small body sizes, as compared with the full range of sizes for that group (Hutchinson and Macarthur 1959). From a macroecological perspective, the size-frequency distribution of most clades is right-skewed, even on a log scale, with more scatter to the right of the median than to the left (Brown and Maurer 1986; Gaston and Blackburn 2000; Maurer et al. 2004). In a right-skewed distribution (i.e. with positive skewness) the tail on the left side of the probability density function is longer than the right side, and the bulk of the values (possibly including the median) lie to the right of the mean (e.g. Fig. 1). Right-skewed size distributions have been interpreted variously as evidence for the selective advantages of small size (e.g. Damuth 1993; Blanckenhorn 2000; Maurer et al. 2004), increased rates of diversification (more speciation and/or less extinction) at small size (Jablonski 1997b; Maurer 1998; Gardezi and da Silva 1999; Knouft and Page 2003), or the long-term survival risks of large size (Maurer et al. 1992; Purvis et al. 2003; Clauset and Erwin 2008; Clauset et al. 2009).

Fig. 1
figure 1

Smoothed body-size distributions of extant fish species. a Size data for 24,260 species pooled. Note this distribution is right skewed (P < 0.001) such that the median (vertical dashed line at 14.8 cm) is less than the average (not shown at 27.3 cm). Note also the tails have similar rarity values indicating approximately symmetrical boundary effects. b. Size-frequency distributions of four extant post-Ordovician fish clades; Myxinoidei (thin line), Petromyzontiformes (thick dashes), Chondrichthyes (thick line), Actinopterygii (thin dashes). Data for non-tetrapod sarcopterygians (11 species) not shown. Note these hump-shaped size distributions are maintained as a balance between the short-term (microevolutionary) effects of neutral diffusion and selection away from an ancestral condition, and the long-term macroevolutionary risks of extinction in taxa with extreme sizes. Note also the absence of a lower boundary effect in fishes

Why are the size-frequency distributions of most higher taxa right-skewed, even on a log scale? Should a right-skewed size distribution be viewed as representing a transient state, i.e. the expansion of species to larger sizes, or as a leftward shift in the mass of the distribution (with more species at smaller sizes)? Does a skewed distribution indicate the operation of different evolutionary processes (i.e. anagenesis, speciation, extinction) at extreme (large and small) sizes (Stanley 1990; Arnold et al. 1995; Jablonski et al. 2006)? Or does skewness indicate an asymmetrical response to the influence of the same processes among taxa at either end of the size spectrum (Gaston 1998; Clauset and Erwin 2008; Clauset et al. 2009)?

Here we use an integrative approach to assess trends of size evolution, employing paleontological, macroecological, and phylogenetic information in fishes (datasets in Albert et al. 2009). Fishes (non-tetrapod craniates) are an excellent taxonomic group for testing theories on the evolution of continuous traits such as size. Fish clades are phenotypically and ecologically diverse, representing more than 50% of all living vertebrate species (Haussler et al. 2009), and exhibit body size diversity approximating that of all vertebrates combined (Gillman 2007) (Table 1). Fishes also have a rich fossil record from throughout the Phanerozoic (Long 1995; Janvier 1996). We introduce a new use of a method for measuring rates of evolution in darwins (Haldane 1949) of continuous traits in an explicitly phylogenetic context (i.e. on a tree), and demonstrate how non-directional mechanisms of evolution (i.e. passive diffusion processes) can produce an hitherto unperceived bias to larger size, when size is measured on the conventional log scale. This approach interprets the shape of a size distribution as a balance between the competing forces of diversification, allowing taxa to evolve away from ancestral values, and of conservation, drawing taxa closer to a central tendency (Fig. 1a). Our goal is to describe the major patterns of size evolution in fishes, and to use these patterns to evaluate particular evolutionary models of size evolution (e.g. Cope’s rule).

Table 1 Summary of statistics for the extant fish species dataset

Materials and Methods

Size and Stratigraphic Data

Size and stratigraphic data are for 465 fish species are reported in Albert et al. (2009), and a taxonomic summary of this dataset is provided in Table 2. Size was assessed as maximum recorded total length (TL) in cm as measured from the tip of the snout to the posterior margin of the caudal fin. Total length was measured directly from specimens, published photographs or reconstructions of articulated specimens in primary sources, or published lengths (e.g. Frickhinger 1995). Total length is highly correlated with body mass (in grams), where mass may be estimated from TL from the empirical equation: y = 0.0217x 2.861 (Albert et al. 2009). Importantly, the skewness of log transformed size distributions are identical, regardless of whether size is measured as length or mass.

Table 2 Taxonomic summary of the fossil fish dataset

All analyses were conducted using log transformed total length data and the geometric average was used as a measure of central tendency. Because fishes exhibit indeterminate growth, taxa for which morphologically mature specimens are not known were excluded, as were taxa for which adult body size cannot be reliably estimated from known fossilized fragments (e.g. †Polymerolepis margaritifera, †Lophosteus sp.). Many macroevolutionary studies of size in organisms with indeterminate growth use maximum rather than average size, despite the statistical difficulties presented by extreme values (Jablonski 1997b; Roy et al. 2000). Mature specimens are recognized by osteological criteria when available; i.e. the shape of bones in the sphenoid and palatoquadrate regions of the skull, and the scapulocoracoid region of the pectoral girdle (Arratia 1997). Size of some Paleozoic forms was estimated from large body fragments (e.g. †Andinaspis suarezorum, †Pituriaspis doylei). Fossil species were dated to one of 24 geological epochs or series (e.g. Upper Devonian, Paleocene) with geological dates from (Gradstein et al. 2004). These time intervals range from 5 MY (Upper Silurian) to 46 MY (Upper Cretaceous), with an average of 23.6 ± 12.5 MY.

Taxon Sampling

We used a clade-based approach to taxon sampling designed to represent all higher craniate taxa for which reliable size estimates are available. The taxon sampling strategy was to include whenever possible, for each node (branching point) in the tree, at least one species in each of the two daughter taxa. Such a clade-based approach to taxon sampling maximizes representation of phylogenetic diversity among major lineages, and has been shown to be less sensitive to preservational biases than stratigraphically based taxon-counting methods (Griswold et al. 1998; Prendini 2001, Lane et al. 2005). This taxon sampling strategy produced a dataset of sufficiently broad temporal (107–108 MY) and taxonomic (102–104 species) scope to avoid nonrandom sampling errors arising from community assembly processes, convergent evolution, or investigator bias (Ackerly 2000).

This taxon-sampling scheme resulted in a dataset with 465 fossil species, of which 425 are known only as fossils. The fossil dataset includes representatives of 26% (164 of 622) of all fish families for which fossils are known, 51% of the 324 fish families known only as fossils (Benton 1993), and 68% (71 of 105) of non-teleost actinopterygian genera known only as fossils (Benton 1993; 106 non-teleost actinopterygian genera in Sepkoski (2002). Conodonts were excluded from analysis due to uncertainties in body size and detailed phylogenetic information (Donoghue and Sansom 2002; Janvier 2003; Dong et al. 2005; Northcutt 2005; Blieck et al. 2005). Triassic neoselachians are known only from teeth (Underwood 2006) and were also excluded from analysis.

In order to more completely sample certain clades that are poorly represented as fossils (Hurley et al. 2007) (Albert et al. 2009), size data for 13 terminal taxa are presented as geometric averages of the log maximum total lengths of the extant species, using data from Froese and Pauly (2005). These clades include the seven extant myxiniform (hagfish) genera, two of the extant petromyzontiform (lamprey) genera, one extant dipnoan genus (Protopterus), and five extant actinopterygian genera (Polypterus, Acipenser, Scaphyrinchus, Psuedoscaphyrinchus, Clupea). The addition of these living terminal taxa can potentially affect the analysis of size trends because extant fishes often exhibit larger sizes than fossil representatives of their respective higher taxa (Albert et al. 2009). However excluding these extant taxa resulted in large gaps in the taxon sampling of the major craniate lineages, and so they were retained in the analysis.

Phylogeny

Size-change was traced on a composite tree topology constructed from literature sources (Fig. 2; see Albert et al. 2009 and references therein). The phylogeny of Paleozoic vertebrates (i.e. Ordovician Radiations sensu Blieck and Turner 2003) follows Long (1995), Janvier (1996), and Janvier (2003), and references in these papers, with emendations as noted by taxon in Albert et al. (2009). These sources were used to construct a tree topology with 843 branches and 86 polytomies, or a tree that is about 91% resolved. Two chordate outgroup taxa were used to root the size optimizations: Cephalochordata and Yunnanozoa (Mallatt and Chen 2003). The phylogenetic positions of †Haikouichthys as a non-craniate deuterostome follows (Shu 2003). Data for Hyperoartia are from Janvier and Lund (1983), Gess et al. (2006) and Janvier et al. (2006), for post-Paleozoic chondrichthyans from Cappetta (1987), Underwood (2006), for sarcopterygians from Clouthier (1996), Clouthier (1997), Clouthier and Forey (1991), and Clouthier and Ahlberg (1996), for basal actinopterygians from Arratia (1997), Coates (1998), Arratia (1999), Dietze (2000), Arratia and Cloutier (2002), Arratia and Cloutier (2004), Lund (2000), and Friedman and Blom (2006), and for teleost from de Pinna (1996), Johnson and Patterson (1996), Arratia (1997, 2004) and Santini et al. (2009).

Fig. 2
figure 2

Phylogeny of 465 fossil fish species with rates of size evolution (in darwins) estimated at 928 branches. Tree topology from Albert et al. (2009) and references therein. Extinct clades in gray font; extant clades in colored fonts. Size data as ln(cm). Branch lengths in MY (not to scale) estimated from stratigraphic data (see text for methods) (Color figure online)

Branch lengths were estimated from stratigraphic data from fossils (Benton 1993, 2005; Benton and Donoghue 2007) in millions of years (MY) as the absolute age difference between nodes. Minimum branch lengths were set at 1.0 MY so that all branch lengths were non-zero. Increasing the minimum branch length reduces absolute estimates of size evolution for short branches but does not affect the skew of the size-change frequency distribution (data not shown).

Ancestral Trait Reconstruction

Linear Parsimony (LP) and Least-Squared Parsimony (LSP) optimization methods were employed to estimate total length values at interior tree nodes using the Mesquite v.1.06 software package (Maddison and Maddison 2006). LP minimizes the total amount of trait change along tree branches such that the cost of a change from state x to y is |x − y|. LSP, also referred to as Squared-Change Parsimony, follows a Brownian motion model of evolutionary change in which the cost of a change from state x to state y is (x − y)2 (Maddison 1991). LP differs from LSP, and also from model-based (i.e. Bayesian and Likelihood) approaches to character state optimization, in that LP permits the reconstruction of discontinuous events, or of large changes in trait values (Butler and Losos 1997; Pagel 1999; Albert 2006). Although evolutionary change is often thought of as gradual, large differences in trait values between internal tree nodes may result from a variety of real biological processes, including punctuated evolution (Pagel et al. 2006; Monroe and Bokma 2009) or extinction of taxa with intermediate trait values (Butler and Losos 1997). LP also permits the reconstruction of ambiguous ancestral state values when data are insufficient to provide an unambiguous resolution. Nevertheless, estimates of average size among fossil fishes per epoch using LP and LSP are significantly correlated (Albert et al. 2009). All ancestral reconstruction methods assume that trait evolution is conservative enough for node reconstruction techniques to be useful, even in the face of large standard errors (Polly 2001). LP optimization was performed using 10 replicates on arbitrarily fully resolved trees and the qualitative results of the size-change analysis were similar in all replicates (data not shown).

Rates of Evolution

As a direct test of Cope’s rule we assessed the proportion and magnitude (per unit time) of changes to larger and smaller sizes, over the internal branches (inter-nodes) of the well-resolved phylogenetic tree of fossil fishes (Fig. 2). All size data were log transformed prior to analysis. Evolutionary rates were expressed in darwins (Haldane 1949) as the difference in size per unit time along internal branches of a phylogenetic tree:

$$ d = [\ln S_{2} - \ln S_{1} ]/t_{2} - t_{t} $$
(1)

where S n is the inferred size at node n, and t n is the estimated geological age of node n in Ma. Equation 1 can be expressed more simply as:

$$ d = dS/dt $$
(2)

where dS is the difference in size between nodes per million year of branch length (dt). In this metric scheme one darwin is a change in the value of the trait by a factor of e in one million years.

Measuring evolutionary rates in darwins has been criticized on the grounds that rates are expected to diminish when measured over larger time intervals due to averaging of higher-order frequency fluctuations in the direction of trait change (Gingerich 1983; Pérez-Claros and Aledo 2007), and should be corrected by plotting rates against time intervals (Gould 1984; Sheets and Mitchell 2001). However, branch lengths used in this study showed low variance (21.8 ± 3.3 MY), and we found no correlation between rate values (in darwins) and branch length (R 2 = 0.0004; directional P = 0.30). Rates were not expressed in haldanes (phenotypic standard deviations per generation (Gingerich 1993) due to uncertainties of estimating generation times.

Results

Macroecological Patterns

Extant fish species range in size over about three orders of magnitude as measured on an arithmetic scale (1.0–1,265 cm), or about seven orders of magnitude as measured on a log scale (Table 1; Figs. 1 and 3a). Extant fish families also exhibit a broad range of skewness values, from −2.16 (n = 7 species) in the sawfishes (Pristidae, Chondrichthyes) to 2.19 (n = 173 species) in the perches and darters (Percidae, Actinopterygii). Although species richness and average body size per se do not predict skewness values in extant fish families, most species-rich families have skewness values near zero (median skewness 0.04), and most families with extreme skewness values (≥|2|) have few (<10) species (Percidae being an exception to both these trends) (Fig. 3b). There is a correlation in the relationship between skewness and size (Fig. 3c).

Fig. 3
figure 3

Relationships between species richness and measures of size diversity (average and skewness of size distributions) among extant fish families. Data for 24,260 species in 515 families from (Froese and Pauly 2005). In all panels each data point represents a single fish family, and dashed vertical lines represent average (skewness) or median (size) values of all fish families. Solid vertical line in a represents average size of all fish species (solid curve). a Right-skew size distribution of all extant fish species and families. b The skewness values of most fish families are close to zero, and few (1% or 5/515) fish families exhibit extreme skewness values >|2|. The median skewness value of all fish families is 0.04, and of all fish species is 0.37. c Body size is not correlated with skewness among extant fish families

Positive skewness is a pervasive feature of body-size distributions in fish taxa. Most large datasets of fish diversity are right-skewed, even when size as assessed as ln[cm]. The data sets examined for this study include: all extant fish species (Fig. 1; n = 24,260; skewness = 0.37; P < 0.001), four of the five extant post-Ordovician fish clades (Table 1, Sarcopterygii excluded), the fossil species dataset of Table 2 (n = 465; skewness = 0.09; P ≤ 0.001), and the distribution of size-change events along the branches of the fossil fish phylogeny in Fig. 2 (see also Table 3; n = 929; skewness = 0.29; P = < 0.001).

Table 3 Summary of statistics for rates of size evolution (in darwins) for fossil fishes during the Phanerozoic

Macroevolutionary Patterns

Diversification of fishes into size space through the Phanerozoic occurred in two readily distinguishable phases (Figs. 4a, b). The first was a relatively rapid Expansion Phase from the Lower Cambrian to the Upper Silurian (542–417 Ma), a duration of about 23% of the Phanerozoic. During this Expansion Phase average arithmetic size (in cm) increased about 10-fold, from about 2.7 cm in the Lower Cambrian (n = 5) to 28.7 cm (n = 59), and the range of sizes increased from 0.5 cm to about 200 cm. By the Upper Devonian (c. 375 Ma.) giant species like the arthrodire placoderm †Dunkleosteus terrelli at about 800 cm had evolved, approaching the upper size limit of fishes as a whole. Further, the size distribution of fossil fishes during the Expansion Phase has a negative skewness (Table 3). Fishes achieved approximately modern dimensions of size diversity (i.e. average size in cm and size range as ± SEM) by the Lower Devonian (c. 416 Ma; 28.7 ± 4.5 cm, compared to the extant 27.2 ± 2.6 cm; Fig. 4c, d). The size range of Devonian fishes spans about three orders of magnitude in metric units, from miniaturized yunnanolepid antiarchs (c. 5 cm) to giant brachythoracid arthrodires (c. 800 cm). The range of sizes for fishes in the Devonian and Recent are therefore very similar, especially considering the limited taxon sampling and taphonomic filters that obscure understanding the full range of diversity in paleofaunas (Albert et al. 2009).

Fig. 4
figure 4

Average body size of fossil fishes through the Phanerozoic. a LP estimates for all branches on the phylogeny of Fig. 2. b Average size of fossils for each stratigraphic interval (epoch). Error bars ± one standard deviation of the average of branches or fossils per epoch. Data fit to 3rd order polynomials with adjusted R2 values. See text for methods. c Right-skewed size distribution of species in the fossil fish dataset (skewness = 0.09; P < 0.01; Kuiper test). d Right-skewed distribution of average size in extant fish families (n = 515; skewness = 0.46; P < 0.0001; Kuiper test). The right tail of the extant fish family distribution is truncated to facilitate comparison with the size range of fossil fishes; the top four size bins are not plotted, containing 8/515 (1.5%) families (with 22 species), each with an average maximum length >4.0 m. By comparison, the smallest size bin contains four families (with 29 species), each family with an average total length <2.7 cm. Note the mode and spread of the size distribution for fishes have been maintained in a state of quasi-equilibrium over the last c. 400 Ma

The second phase of diversification into size space was a much longer time interval, extending from the Lower Devonian to the Recent (416–0 Ma.), or over about 77% of the Phanerozoic. During this interval the average size and size diversity of fishes fluctuated non-directionally in a state of quasi-equilibrium. Size estimates of fossil fishes during this Quasi-equilibrium Phase have an arithmetic average (or mean) value of 51.7 cm (SEM = 4.7 cm; Fig. 3b), as compared with an average of 27.3 cm for all extant fish species (Fig. 1), or 56.7 cm for all extant fish families (Fig. 3d). These data show that the major features of body size diversity in living fishes, especially measures of central tendency (average and mode) and spread (range and standard deviation) had all become established relatively early, when vertebrates had achieved about 23% of their current age. Indeed the limits of size diversity in fishes were largely established within a few tens of millions of years after the origin of Gnathostomata during the Silurian (444–416 Ma).

Patterns of size evolution (as measured in darwins) in fossil fishes through the Phanerozoic are similar to those of absolute size itself. The range of size-change values (d) along branches of the fossil fish phylogeny increased rapidly during the early Paleozoic, and then exhibited no directional change for the remainder of the Phanerozoic (Table 3; Fig. 5). Such a rapid size expansion during the early Paleozoic is observed in many metazoan taxa (e.g. Sepkoski 1981; Miller 1997, 1998; Stanley 1998; Novack-Gottshall and Lanier 2008; Alroy 2010). As with the frequency distribution of absolute body size in fossil fishes, the distribution of size-changes (in darwins) is also right-skewed (Fig. 5b; skewness = 0.61; P < 0.001). The right-skewed distribution of evolutionary rates reported here in a phylogenetic context for fossil fishes closely resembles those of other fossil and extant vertebrate clades examined to date with comparable taxonomic and temporal resolution (Gingerich 1993; Millien 2006). The values of evolutionary rates in fishes have a median of d = 0.00, and range from d = −1.79 to 1.90. These are similar to values reported for other fossil (Clyde and Gingerich 1994) (Guthrie 2003) and some (Koch 1986) but not all (Reznick et al. 1997) extant vertebrate groups.

Fig. 5
figure 5

Patterns of size-evolution in fossil fishes through the Phanerozoic. a Rates of size-change in darwins (d) estimated along 928 branches on the phylogeny of fossil fishes in Fig. 2. b–d Frequency distributions of size-change events for the: b Phanerozoic (542–0 Ma; n = 928; skewness = 0.37), c Expansion Phase (542–417 Ma; n = 280; skewness = −0.32), and d Quasi-equilibrium Phase (416–0 Ma.; n = 648; skewness = 0.49). Note skewness is negative during the Expansion Phase and positive in the Quasi-equilibrium Phase. Additional summary statistics for these phases are provided in Table 2

Discussion

Size-Biased Extinction Risks

The most extensive macroecological analyses of size-frequency distributions to date have focused on extant mammals (including late Quaternary fossils) and extant birds (Clauset and Erwin 2008; Glazier 2008; Clauset et al. 2009; Monroe and Bokma 2009; Olson et al. 2009; Capellini et al. 2010). In both these clades, the shape of the size distribution was shown to closely match a three-parameter simulation model of cladogenetic diffusion (Clauset and Erwin 2008; Clauset et al. 2009). The parameters of this model are: (1) biased rates of anagenesis to larger sizes (Cope’s rule), (2) higher extinction risk at larger sizes, and (3) taxon-specific ecophysiological limits at lower sizes. However, the generality of studies on mammals and birds for understanding body size evolution in other animal groups remains unclear. For example, the distribution of body size in fish clades is qualitatively different from that of terrestrial mammals and birds in the apparent absence of a lower boundary effect (Fig. 1). Whereas there are fewer species of terrestrial mammals and birds at the smallest sizes, the proportions of fish species in the two tails (smallest and largest size categories) are similar.

Unlike the situation in mammals and birds, there does not appear to be a size-biased extinction risk for fish taxa with small adult body sizes (Olden et al. 2007). For example, minimum body size has not been correlated with geographic range size (Reynolds et al. 2005; Griffiths 2006; Albert et al. 2011a, b), elevation (Matthews 1998; Fu et al. 2004), hypoxia tolerance (Nilsson and Östlund-Nilsson 2008), or oceanic depth (Smith and Brown 2002). Further, miniaturization is a widespread life-history strategy in many fish groups, both freshwater (Weitzman and Vari 1988a) (Knouft and Page 2003) and marine (Munday 1998). However, large body size is associated with elevated extinction risk in many living fish taxa, due to commercial fishing and anthropogenic habitat degradation (Olden et al. 2007). Although the relative roles of speciation, extinction and adaptive evolution (anagenesis) have been parsed in an explicitly phylogenetic context in some fish taxa (Knouft 2003; Near et al. 2005; Hardman and Hardman 2008), the generality of these processes among fishes as a whole remain poorly understood (see e.g. Smith 1981; Smith et al. 2010).

The generality of the Clauset and Erwin (2008) model may be further limited by the different ways in which homeothermic and poikilothermic species respond to metabolic constraints on size evolution. As compared with other vertebrates, mammals and birds have an elevated and relatively constant basal metabolic rate, and they also exhibit determinate growth in which somatic development slows abruptly at sexual maturity. By contrast the great majority (~74%, Haussler et al. 2009) of vertebrate species, including all fishes, are poikilotherms, in which mass-specific metabolic rate increases with ambient temperature (Makarieva et al. 2005), and in which the aerobic capacity (power density) of mitochondria is about one order of magnitude lower than homeotherms (Wieser 1995). Poikilotherms also exhibit indeterminate growth, in which growth rate and maximum adult body size are more plastic (less constrained by genetics), and in which lifetime reproductive output has a higher variance (Stebens 1987) than in homeotherms. Because fishes exhibit indeterminate growth, size is a better parameter than chronological age to use as an independent variable in the study of metabolic rates (Kaufmann 1981). Also in contrast to homeotherms, the upper limit to body size in poikilotherms is constrained by a minimum mass-specific metabolic rate, below which biological performance is impaired (Makarieva et al. 2005). Thus the maximum body size of a poikilothermic species inhabiting warmer environments should be larger than that of a closely related species inhabiting cooler environments. Such physiological constraints due to size do not apply to homeothermic animals that have metabolic control of body temperature.

Evolution Near Extreme Sizes

Examination of the total extant fish size distribution (Figs. 1 and 5), and of fish species near the limits of the total size distribution (Table 4), indicate little or no asymmetry in upper and lower boundaries to body size. In other words these data do not support the hypothesis that it is harder to scale the left side of the size distribution than the right (contra Clauset and Erwin 2008; Clauset et al. 2009). If extreme body size is defined as a total length greater than three standard deviations from the average value of all fishes, then 33 extant fish species may be regarded as extreme giants (>825 cm TL), and 32 extant species as extreme miniatures (<1.4 cm TL). A total of 21 genera include species that are extreme giants and another 24 genera include extreme miniatures. Thus, assuming the monophyly of these genera, we can infer that living fish species with extreme body sizes, both giants and miniatures, have evolved multiple times, and also that fishes have minimally evolved about the same numbers of times to either size extreme.

Table 4 Summary of data for fish species with extreme body size; i.e. those beyond three standard deviations from the average

Does the transition to very smallest mature body sizes require more adaptive (ecophysiological) modifications than change to the largest sizes? Species at both ends of the size spectrum in fishes exhibit phenotypic specializations associated with extreme size. One of the main constraints to maturation at extremely small size is the energetic demand of securing sufficient energy to allocate to eggs in a small animal with high tissue specific metabolic rates (Weitzman and Vari 1988b; Ruber et al. 2007). Ecological and physiological limits to extremely large size in fishes may be associated with demographic factors (e.g. small effective population sizes, long generation times) that may ultimately confer elevated extinction risk (Knouft 2003; Knouft and Page 2003; Hardman and Hardman 2008).

It is interesting to note that the pool of higher taxa (18 families) from which extreme giants are recruited is (1.5 times) larger than the pool of taxa (12 families) from which the extreme miniatures evolved. Considered within the context of all fish families from which species of extreme size could potentially be drawn, the difference between 18/515 (3.5%) and 12/515 (2.3%) is not significant (two tail Chi Square test P = 0.34 when Yates corrected), and such a difference could easily have arisen by chance alone. Yet body size diversity has a strong phylogenetic component (e.g. Ramirez et al. 2008), and species with extreme size are not distributed randomly among all fish families (Table 4). Further, clades within certain fish families (i.e. Characidae, Cyprinidae, Gobiidae, Poeciliidae) do appear ecophysiologically or ontogenetically predisposed to produce extreme miniatures (Weitzman and Vari 1988b; Ruber et al. 2007). However, the family-level clades from which fish species that have extreme sizes are recruited do not have a predictable bias in their skewness; 8/18 (44%) of families with extreme giants have positive skewness (longer right tail), and 5/18 (28%) have negative skewness. The numbers for extreme miniatures are 8/12 (67%) families with positive skewness and 3/12 (25%) with negative skewness (numbers do not total to 100% because skewness values for families with few species cannot be calculated). Thus, although the proportion of clades with positive skewness is slightly greater in the pool of families that gave rise to extreme miniatures than to extreme giants, both pools exhibit similar proportions of clades with negative skewness.

The range of sizes exhibited by species with extreme sizes could also provide information on the symmetry of evolution near the limits of size space. Extreme giants range in size 2.60 times more (on a log scale) than do the extreme miniatures (Table 4). One interpretation of this asymmetry in size ranges near the limits of size space is that the lower limit is a more difficult barrier to approach closely. It is also interesting to note the size range of species near the two limits are almost precisely the same when size measured on a power-log scale (see below). To summarize, data on fishes with extreme sizes indicate that the ecophysiological or ontogenetic barriers to achieving the largest and smallest body sizes do not appear to differ quantitatively. This means that the total amount of phenotypic modification (adaptation) required to achieve very large or very small body sizes is similar.

Lower Boundary Effect

The lower boundary effect observed in the body size distributions of homeothermic vertebrate clades (birds and mammals) is qualitatively different from that of poikilothermic fishes. Phenomenologically it appears that the special constraints on evolving to the very smallest sizes observed in homeotherms, are not present in fish clades. In other words, although the overall frequency distribution of fishes is right skewed, both tails of the distribution (at largest and smallest sizes) similarly taper to the lowest values gradually. By contrast the tails of size distributions for homeotherms are highly uneven, with what appears to be veil-line (sensu Preston 1948)) over the smallest size values. Why are there no bird or mammals species below this apparent veil line?

One possible explanation is the different physiological responses of poikilothermic and homeothermic animals to temperature changes. Whereas metabolic rate in poikilotherms is largely monotonic response to ambient temperatures, most homeotherms exhibit a thermoneutral zone of non-stressful metabolic activity within a range of temperatures to which they are adapted, and thermostress zones at lower and higher temperatures (Lindstedt and Boyce 1985; Ruel and Ayres 1999). The idea is that homeotherms with very small adult sizes reach their maximum tolerable metabolic rate at both low and high temperatures, whereas stressfully high metabolic rates are never achieved by poikilotherms at low temperatures. Therefore, homeotherms experience metabolic constraints on the evolution of small sizes not imposed on poikilotherms.

A possible complication to this argument is that most (or all) homeothermic vertebrates employ a range of behavioral responses to thermoregulation that buffer them from experiencing very low temperatures, whereas fish taxa in an aquatic medium are almost always strict thermoconformers (McNab 2002). However, the consequences of behavioral thermoregulation in shaping the size-frequency distribution of a large clade is probably very limited. Most extant vertebrate species (both poikilotherms and homeotherms) inhabit tropical environments, and most of the clades to which they belong (e.g. families) originated in relatively warm and thermally stable environments (Wiens and Donoghue 2004; Wiens and Graham 2005; Wiens 2007). The differential macroevolutionary responses of poikilothermic and homeothermic taxa to low temperatures may therefore be regarded as negligible.

Power Scaling in Ecology

Studies on body size diversity have generally assessed the parameter ‘size’ in units of mass or length on the natural log scale: loge (S), where S is a measure of size in cm or g, and the natural logarithm is the inverse function of the exponential function: S = ln[e S] (if S > 0). The natural log is an intuitive scale for the study size diversity, in part because the exponential function (e S) is a common feature of growth and other dynamic processes (Haldane 1949; Hayami 1978; Kaufmann 1981; Demetrius 1997, 2000; Glazier 2005), and perhaps also because the log function is (or closely matches) the native scale of human mensural perception (Dehaene et al. 2008; Dehaene 2011). Nevertheless, the natural log differs only in scale from that of logarithms in other fixed bases, and many historically important investigations have successfully analyzed size diversity on a log10 scale (Schmidt-Nielsen 1984; McNab 2002).

In addition, a lengthy history of physiological studies has shown how many important ecophysiological and evolutionary variables, such as metabolic rate, generation time, and mutation rate, do not scale with size as an isometric log function, but rather according to allometric power functions of the form: y = aS b, or ln[y] = ln[a] + b ln[S], where a and b are empirically derived constants (Rubner 1883; Kleiber 1932, 1947; McNab 1990, 2002). In this relationship the y-intercept (a) is a proportionality coefficient, and the scaling exponent (b) expresses the (negatively allometric) relationship between the values of phenotypic traits S (e.g. size) and y (e.g. mass specific metabolic rate at a given size; Rolfe and Brown 1997; Darveau et al. 2002).

Power scaling theory posits that changes in adult body size are constrained by the hierarchical, fractal-like design of resource distribution networks within organisms (Gillooly et al. 2001; West et al. 2003). Body size directly influences many of the demographic factors involved in adaptation and phenotypic diversification, including rates of reproduction, population growth, population density, organismal vagility, and gene flow (Brown et al. 2004). Individual size and metabolic rate have been correlated with species-specific rates of development (Reiss 1989; Gillooly et al. 2008) and molecular evolution (Allen et al. 2005; Gillooly et al. 2005), and with energy use and tropic level (Brown et al. 2007; Romanuk et al. 2011). It is important to note that the relationships between metabolic rate, temperature and body mass differ substantially among taxa (Glazier 2005, 2006, 2009; Capellini et al. 2010; Cassemiro and Diniz 2010; Glazier 2010; Isaac and Carbone 2010; Lima-Ribeiro et al. 2010; McCain and Sanders 2010).

The scaling exponent (b) between size and metabolic rate exhibits a range of empirical values among vertebrate clades, from 0.67 to 1.06, with an average value for teleost fishes of 0.80 (Clarke and Johnson 1999; McNab 2002; Julian et al. 2003). Further, empirically derived allometric relationships between size and metabolic rate are not always log-linear (Glazier 2005, 2006). Thus, despite the well-characterized macroecological relationships between size and metabolic rate (Brown et al. 2004; Brown and Sibly 2006; Allen and Gillooly 2007; Gaston et al. 2009), the macroevolutionary consequences of allometric constraints on body size diversification remain incompletely understood.

Interestingly, positive skewness in log size and size change data can be removed by applying a power term to the logged size data:

$$ y = a[\ln S]^{b} $$
(3)

where b = 1.0 represents the special case where size (or size change) is measured on the conventional log-only scale. Could power laws act on the log scale for size change evolution rather than the traditional linear scale? If so, then this would provide an alternative explanation for the widespread occurrence of right-skewed size distributions. Or is this just a phenomenological relationship with no mechanistic basis? While our current knowledge of power law relationships occur at the linear scale, we encourage more thought into the relevance of power laws at alternative scales.

During the Quasi-equilibrium Phase of size evolution, the skewness of the size-change distribution was positive when size is measured on the log-only scale (Fig. 5d; n = 392; skewness = 0.50; P = 0.11; Kuiper test). However, the skewness of the size-change distribution does not differ significantly from zero when size-change is measured on the power-log scale (Fig. 6b; 0.36 < b < 0.67; P < 0.05; Kuiper test). The best fit to these data are at b = 0.50 (P < 0.0001 that the power function equals 1.0). In other words, the size-change distribution converges to normal below the theoretically expected exponent value of three-quarters (McNab 2002).

Fig. 6
figure 6

A new hypothesis for the origin of right-skewed size distributions. a. Size distributions expected from a passive diffusion (non-directional) model of trait evolution; i.e. no size-biased rates of anagenesis, cladogenesis or extinction. Note both curves represent the same data, plotted alternatively on a conventional log-only (red) and power-log (blue) mensural scales, where b is the allometric scaling exponent. The red curve is right-skewed (at b = 1.0), with more scatter to the right of the median than to the left, whereas the blue curve is symmetrical or non-skewed (at b = 0.5). b Relationship between the power function exponent (b) and the empirical skewness values of size-change (d) data for fossil fishes. Black line traces skewness (with 95% CI in grey lines) of the size-change (d) distribution for all branches during Quasi-equilibrium Phase from the Lower Devonian to Recent (n = 648 branches). Note the empirical distribution does not deviate from that of a standard normal distribution in the range: 0.36 < b < 0.67 (Color figure online)

Such an unexpectedly low value of b may reflect real differences in the physiological scaling of the fishes that constitute the fossil versus extant fishes datasets, or it may reflect a history of selection within multiple taxa for larger body sizes; i.e. Cope’s rule. In principle, right-skewed body size distributions could arise from directional trends in size evolution, as a result of differential rates of anagenesis (selection and drift), speciation (cladogenesis), or extinction (Tables 5 and 6, models 1–5). For example, a right-skewed size distribution could be seen to represent an excess of small-bodied species (Gaston and Blackburn 2000). In such as case the mass of the distribution (e.g. mode) would have shifted historically towards the smaller end of the size spectrum, in comparison to the mode of a normal distribution (i.e. with the same range of sizes and with a skewness of zero). However, this explanation does not apply to the fish datasets because the mode, skewness and range values of size distributions have not shifted substantially since the Lower Devonian. The Stanley effect (Tables 5 and 6, model 6) can also contribute to the production of right-skewed size distributions by affecting rates of anagenesis towards larger versus smaller sizes in the non-equilibrium Expansion Phase (Fig. 4c). Comparing the qualitative expectations of these alternative models with empirical patterns observed in the fossil fish datasets shows partial although incomplete matches (Table 6).

Table 5 Alternative models of size-evolution proposed to generate right-skewed size distributions, based on differential rates of anagenesis (selection and drift), speciation (cladogenesis) and/or extinction
Table 6 Qualitative patterns of size evolution in fossil fishes and expectations of alternative models of size evolution

Another possibility is suggested by the empirical distribution of size-change data in Fig. 6, namely that the right-skewed body size distribution arose from symmetrical diffusion on a metabolically relevent power-log scale (Brown et al. 1993); i.e. the scaling exponent b was significantly less than 1.0 (Table 5, model 7). In this case, skewness arises from excesses in the magnitude (as opposed to the number or frequency) of changes to larger sizes. Empirical patterns in the skewness of body sizes, and of size-change events, in fossil fishes are largely consistent with the hypothesis of symmetrical diffusion of size on a power-log scale, with the exception of the sign of the skewness of the size-change distribution during the Expansion Phase (Table 6). Among the models outlined in Tables 5 and 6, only model 1 (Cope’s rule) successfully predicts the later.

Power Scaling and the Metabolic Theory of Ecology

From an ecophysiological perspective, species can and do respond adaptively to different environments by allometric changes in body size (Damuth 1993; Demetrius 2000). Biological rate processes that constrain changes in adult size, including cardiac and respiratory cycles, circulation of blood volume, growth rate and generation time, all scale with body size as a power function with exponent values below unity: b < 1 (Glazier 2009). From this ecophysiological perspective therefore, a power-log mensural scale [ln cm]b may be a more relevent scale than the conventional log-only scale, on which to assess directional hypotheses of size evolution like Cope’s rule. Passive diffusion of a continuous trait like body size on a power-log scale (where b < 1) into a bounded size space is expected to produce a symmetrical (non-skewed) distribution when assessed on that same power-log scale. However, this passive diffusion produces a right-skewed distribution when size assessed on a log-only scale (where b = 1). The biological mechanisms by which metabolic scaling at the organismal level translate into asymmetries in the evolutionary responses of populations (due to selection, drift, and the probabilities of speciation and extinction) are as yet poorly understood (Brown et al. 1993, 2004; West et al. 1997). Although skewness in the logged data can be accounted for by a power-log transformation, our current knowledge suggests that power laws act on the linear sizes, not logged sizes. The fit between the expectations of the power-log model and empirical data must therefore be regarded at this point as phenomenological.

Statistical and Sampling Considerations

The major patterns of size evolution recovered in this analysis do not appear to be statistical or taphonomic artifacts (Albert et al. 2009). If, as predicted by Cope’s rule for example, there were persistent and general tendencies to increase body sizes within lineages, ancestral size estimates obtained from analysis of terminal (fossil or extant) taxa would be systematically overestimated (Stanley 1998; Polly 1998; Hone and Benton 2005). This is because methods for optimizing ancestral-trait values are not capable of estimating values outside the range of the terminal taxa. As a result sizes at internal tree nodes may be overestimated regardless of optimization method used (i.e. LP vs. LSP). However, the principal qualitative results of this study do not seems to be artifacts of parsimony-based optimization methods; among the 23 Phanerozoic Epochs for size data are evaluated, estimates of internal node values from LP and LSP approaches are significantly correlated (at P < 0.001) with each other, and with that of the direct stratigraphic approach that does not use phylogenetic methods (Albert et al. 2009).

Estimates of body size taken from the taxonomic literature have been found to be slightly larger than the average of bulk samples, which is expected since type materials are generally selected from mature and more well preserved specimens. This effect has been shown in marine brachiopods and bivalves, where whole body size is well preserved in the size of the fossil shells (Krausse et al. 2007). However, this study also concluded that the average size of the monograph sample, though biased, closely approximates that of the bulk sample, and that relative changes in size through time should be detected equally well by both data types.

In general, large scale trends of size evolution in vertebrates through the Phanerozoic have not been reported to reflect variation in the nature of the fossil record or of fossil bearing sediments (Alroy et al. 2001). Average body size has not been correlated with the amount of sedimentary rock volume (Peters and Foote 2001), or the average size of top predators (Janvier 1996; Twitchett et al. 2005). Madin et al. (2006) found that ecological interactions such as predation and bioturbation did not drive Phanerozoic macroevolutionary patterns in a large dataset of fossil benthic marine invertebrates.

One of the limitations of the macroevolutionary approach to studying size diversity its heavy use of statistical associations among taxa selected at an equivalent taxonomic rank (e.g. species, families). This approach makes several necessary assumptions, with varying degree of veracity, including the assertion that all the ranked taxa are monophyletic, and that the evolution of the quantitative trait under consideration (e.g. size) is not structured phylogenetically within or between the ranked taxa (Garland et al. 1992; Blomberg et al. 2003; Albert 2006). Regarding the former assumption, fish systematics of the past several decades has made exceptional progress towards generating an entirely phylogenetic classification, and most family-level taxa are now well supported monophyletic groups (Wiley and Johnson 2010). Regarding later assumption however, many aspects of size diversity have been shown to evolve within (Knouft 2003; Knouft and Page 2003; Hardman and Hardman 2008) and between (Albert et al. 2009) nominal fish families. A formal analysis using modern comparative methods will therefore be required to test the macroecological correlations presented here.