Introduction

Morphometrics is usually considered to be a tool for the quantitative description and statistical analysis of morphology. The variety of applications of morphometrics to evolutionary biology and the frequency with which they have been applied have increased notably in the past decade, thanks in part to the ease with which geometric methods for simultaneously analyzing and visualizing complicated phenotypes can now be performed (Bookstein 1991; Dryden and Mardia 1998; Richtsmeier et al. 2002; Zelditch et al. 2004; Hammer and Harper 2005).

Evolutionary studies for which morphometrics are central include phenotypic integration and modularity (e.g., Marroig et al. 2004; Goswami 2006; Richtsmeier et al. 2006; Mitteroecker and Bookstein 2007), quantitative trait locus (QTL) analysis (e.g., Klingenberg et al. 2004; Albert et al. 2008), study of the regulation of morphological variance through ontogeny (e.g., Cardini and O’Higgins 2005; Zelditch et al. 2006), assessing the genetic dimensionality of morphological traits (Mezey and Houle 2005), and measuring the tempo and mode of evolutionary change (e.g., Roopnarine et al. 2005; Hannisdal 2007; Hunt 2007; Wood et al. 2007; Drake and Klingenberg 2008). All of these studies use empirical data to determine associations among organisms or among the variables that represent their phenotypes.

Another class of evolutionary studies uses morphometric spaces, or morphospaces, to extrapolate trajectories, either through empty space between empirical data points or inside an empty morphospace that has been constructed with the mathematical properties that characterize a particular evolutionary hypothesis. This class of studies includes: (1) evolutionary quantitative genetic predictions of change in the phenotypic means and variances (e.g., Lande 1976, 1979; Arnold et al. 2001; Marroig et al. 2004; Jones et al. 2007; Hohenlohe and Arnold 2008); (2) “random skewers” methods for evaluating the evolutionary effects of a phenotypic covariance matrix by measuring the response to a series of random selection vectors (Cheverud 1996; Cheverud and Marroig 2007); (3) long-term modeling of phenotypic evolution over paleontological time scales (Polly 2004); (4) mapping phenotypes onto a phylogenetic tree and reconstructing ancestral phenotypes at the nodes (Felsenstein 1985; Grafen 1989; Maddison 1991; Martins and Hansen 1997; Rohlf 2001; Steppan 2004; Wiley et al. 2005; Corbin 2008; Polly 2008); and (5) building phylogenetic trees from quantitative phenotypic traits (Felsenstein 1973, 1988). Not all of these methods are self-described as involving evolutionary extrapolations through a morphospace, but the geometry of their equations allows them to be viewed as such.

A common feature of these methods is that they project mathematical trajectories through a morphospace based on statistical models derived from evolutionary quantitative genetic theory. These evolutionary transformations are interpreted as good estimates of evolutionary change given the assumptions implicit in the transforming equations and the geometry of the space. Such evolutionary transformations usually have the following properties: (1) they trace a continuous gradation of phenotypes; (2) they involve transformations in a fixed number of phenotypic variables, which are chosen as part of the study design; and (3) the geometry of the morphospace and the transforming equations involve only phenotypic variables, though in most cases they are based on theory that connects them to underlying genetic, environmental, or developmental processes. These properties are generally compatible with multivariate evolutionary quantitative genetic theory (e.g., Simpson 1944; Lande 1976, 1979; Arnold et al. 2001; Blows and Brooks 2003; Gavrilets 2004; Hansen 2006; Blows 2007). The evolutionary transformations listed above are founded explicitly or implicitly on this body of evolutionary quantitative genetic theory, which assumes that phenotypic change is continuous; that the direction of change is a function of genetic covariances (the G matrix), selection, itself a function of adaptive and fitness landscapes, selective covariances, and drift; and that the magnitude of change is a function of genetic variance, selection intensity, and population size.

Another body of theory, derived primarily from evolutionary developmental biology, suggests that such evolutionary transformations in morphospace may be founded on unrealistic views of how phenotypic transformations are expected to occur (e.g., Alberch 1982, 1985; Gilchrist and Nijhout 2001; Salazar-Ciudad and Jernvall 2004; Salazar-Ciudad 2006, 2007). This literature emphasizes that development is the set of mechanisms by which the phenotype is constructed from genetic and epigenetic interactions, as well as the interactions between those and the environment: phenotypes are emergent products of these interactions. Continuous changes in underlying genetic and environmental parameters may not map additively into continuous transitions in the phenotype (Rice 2002, 2004a, b).

Developmental interactions of this kind have three consequences that may hamper the accurate prediction of evolutionary transformations in morphospace. One consequence is that evolutionary transitions in underlying genetic parameters—the rate of expression of developmental activators or inhibitors, for example—may be quasi-continuous, but the corresponding change in the phenotype may jump discontinuously from one point in morphospace to another without traversing intermediate points (Fig. 1). If so, then the same quantitative genetic equations cannot simultaneously describe changes in the phenotype and in the underlying factors. A second consequence is that developmental interactions may limit production of some phenotypic variants, biasing variation in certain directions that, in turn, bias the direction of evolutionary transformations (Alberch 1982; Goodwin 1994; Newman and Müller 2000; Arthur 2004). Some argue that these biases are adequately accounted for in evolutionary genetic theory by the structure of variances and covariances in the G matrix (e.g., Maynard Smith et al. 1985; Hansen 2006), whereas others argue that many of the biases come from non-additive interactions and so are not adequately described by G (e.g., Salazar-Ciudad 2007). The third consequence is that developmental interactions can produce novel structures or cause the loss of existing structures. By strict definition, a novel structure is one that is neither homologous to any structure in an ancestor nor equivalent to any other structure in the same organism (Müller and Wagner 1991). More loosely, a novel structure can also be the repetition of an already existing structure, such as a vertebra. The gain and loss of such features may be impossible to represent in some morphospaces because the dimensionality of the space is fixed and because the quantitative variables associated with the dimensions of the space may be mapped onto the phenotype in such a way that the gain or loss of phenotypic components is mathematically prohibited. Several authors have recently begun extending evolutionary quantitative genetic theory to represent these developmental phenomena (Johnson and Porter 2001; Rice 1990, 2000, 2002, 2004a, b; Wolf et al. 2001, 2004; Hunt et al. 2007).

Fig. 1
figure 1

Non-linear mapping of genetic-developmental parameter space onto phenotypic space. Continuous evolutionary changes developmental genetic parameter space, such as a change the rate of expression of activators or inhibitors of cellular division (bottom panel), may produce discontinuous changes in phenotypic space

This review explores the potential consequences of developmental interactions for modeling evolutionary transformations in morphospace. I will first review the mathematical properties of morphospaces that are relevant to such transformations. I will then review how evolutionary quantitative genetic theory is tied to existing methods for estimating evolutionary trajectories through morphospace and how those estimates may differ form predictions derived from evolutionary developmental biology. I will also review how the recent extensions of evolutionary quantitative genetic theory try to account for the non-linear developmental mapping of the phenotype onto genetic and environmental factors. The specific consequences of these issues for modeling evolutionary transformations in morphospace will be explored and a revised concept for modeling them based on the extended evolutionary quantitative genetics models will be considered.

Morphospaces and Their Properties

Morphometric spaces, or morphospaces, are mathematical constructions for the orderly arrangement of phenotypes using one or more variables as the criteria for the ordering. Morphospaces are often used for visualizing multivariate phenotypic similarities and differences; for statistical analysis of phenotypes to one another or to external factors such as environment, sex, age, or geography; and for generating theoretical phenotypic models (Bookstein 1991; Dryden and Mardia 1998; McGhee 2007). Morphospaces can have one or more dimensions, each of which is defined by a phenotypic variable (such as the length of a structure or a Cartesian coordinate of a landmarks) or by a transformed combination of such variables. Each position within a morphospace represents a single, unique phenotype whose identity is determined by the geometry of the space and how the associated morphometric variables are mapped onto the phenotype.

Morphospaces are sometimes categorized as empirical or theoretical depending on whether their geometry, especially their coordinate system, is derived from real data or from the parameters of equations used to generate theoretical morphotypes (Raup 1966; McGhee and McKinney 2002; McGhee 1999, 2001, 2007). This distinction is useful in many contexts, though it is fairly arbitrary (MacLeod 2002; Polly 2004). For the purposes of this paper, however, the distinction between empirical and theoretical morphospaces will be ignored because my focus is on trajectories through empty parts of morphospace that are “theoretical” regardless of whether they are drawn in an empirical morphospace or not.

Morphospaces have several properties that relate directly to the question of evolutionary transformations within them: the criteria used to order the phenotypes, the coordinate system of the space, the scaling of the axes, the dimensionality of the space, and the methodological philosophy with which variables associated with the morphospace are mapped onto the phenotype.

Ordering of Phenotypes, Coordinates Systems, and Scaling in Morphospaces

The criteria used to order phenotypes within a morphospace, the coordinate system of the space, and the scaling of its axes are closely related to one another and to the variables used to characterize the phenotypes. The simplest morphospaces have the original variables as their axes, often log transformed if they are size variables (Fig. 2a). Phenotypes in these morphospaces are therefore ordered by the scale of the original variables and the coordinate system of the space is defined by the same variables. The distribution of phenotypes is often correlated on two or more of the axes, as is usually the case with size variables.

Fig. 2
figure 2

(a) A simple morphospace with two measured variables, length and width, that form the axes, and therefore the coordinate system, of the space. Note that the length and width data are correlated. (b) A principal component (PC) morphospace, which is a transformation of the space in A by rigid rotation. Note that the distances among the data points are unchanged, but that there is no correlation in the data between PC 1 and PC 2

In multivariate morphometrics, the coordinate system of the space is often transformed to the principal components (PCs) of the covariance matrix of the original variables (Reyment 1984). The PC coordinate system is centered on the phenotypic mean and the primary axis runs through the major axis of variation in the data with subsequent axes running at right angles through the minor axes of variation. This transformation, if based on the covariance matrix, is a simple rigid rotation of the original variable space: even though the axes and coordinate system are new, the ordering and spacing of phenotypes is unchanged (Fig. 2b). Evolutionary transformations within a covariance-based PC morphospace are exactly the same as in the original variable space, except for the coordinates used to describe the trajectory. If the PC system is based on a correlation matrix, the ordering of phenotypes is unchanged by the transformation, but their spacing is altered. Evolutionary transformations in this space would be different than in the original variable space.

Other transformations of scaling and coordinate systems are used in morphometrics, one of the most common being the canonical variates (CV), which maximizes differences among pre-specified groups (Reyment 1984). Technically speaking, the ordering of phenotypes is unchanged in a CV space, but the changes in scaling are so radical and so closely tied to the properties of a data set that they are not easily predictable. The nearest neighbor of an object in CV space can be considerably different from its nearest neighbor in the original variable space (Klingenberg and Monteiro 2005).

The ordering of phenotypes within a morphospace is thus normally the same regardless of the coordinate system or the scaling of the axes and is determined by the ordering of the original variables. The ordering of the phenotypes is preserved when the transformed axes are linear combinations of the original axes. The spacing between phenotypes is the same if the scaling of the original variables is preserved in the transformation of the axes, but it is changed if the scaling is altered. When a PC system is based on a correlation matrix, for example, the variance of the original variables is standardized to 1.0 prior to the rotation of the coordinate system.

The consequence of these properties is that evolutionary trajectories drawn between two particular phenotypes in a morphospace will pass through intermediate phenotypes in approximately the same order, regardless of changes in coordinate system or scaling. The scaling of phenotypic change along the trajectory depends heavily on the scaling of the axes of the morphospace, however, which has implications for methods that estimate distance between phenotypes or rates of change between one phenotype and another. The nearest neighbor to any particular phenotype also depends on the scaling of the axes, which has implications for tree-building in morphspaces. The effect of scale changes can have effects that are more complicated than what is described here (e.g., Rohlf 2000; Klingenberg and Monteiro 2005) and one should never presume that a linear evolutionary trajectory between two phenotypes in one space will pass through the exactly the same intermediate phenotypes in a transformed space unless the transformation is known to be a rigid rotation.

Dimensionality of Morphospaces

The dimensionality of a morphospace is fixed, either by the number of variables used in its construction or by the degrees of freedom in the data if the morphospace axes are based on an empirical sample (Reyment 1984; Bookstein 1991). Dimensionality places stringent limits on the variety of phenotypes that can be represented, limits that are determined by the choice of variables used to represent the phenotype. Imagine a morphospace where each axis represents a different phenotypic trait. An evolutionary trajectory drawn through that space can describe change in any one or more of the traits, but a new trait, an evolutionary novelty, cannot be added nor can a trait be lost because the number of traits is implicitly defined by the choice of variables. Thus, morphospaces cannot easily be used to model evolutionary transformations that involve the gain and loss of traits.

The limitation of dimensionality is not absolute, however, because the evolutionary gain and loss of features can be modeled in a morphospace if each axis represents an independent feature and if the value of the phenotype on the axis can move to zero. In a morphospace with these properties, evolutionary loss of a feature could be modeled by restricting trajectories to a subspace that does not include the axis associated with that feature. The gain of that feature could then be modeled by allowing trajectories to move away from the zero point of that axis. This approach would still be limited because the number of features that can be gained or lost is predetermined by the choice of variable axes in the morphospace.

It should be noted, however, that all finite-dimensional morphospaces can be viewed as subspaces of an infinite dimensional Hilbert space. Such an infinite-dimensioned space would allow modeling of the evolutionary gain and loss of any number of features, but the relation of each axis to a phenotype would have to be specified to make the model biologically meaningful.

Homology and Homology-Free Characterizations of the Phenotype

The mapping of morphometric variables onto the phenotype is important for modeling evolutionary transformations in morphospace. Even though the mapping is not directly related to the other properties of the morphospace, it determines how individual points in the space are translated into a picture of the phenotype. The choice of mapping will determine what kind of phenotypic transformations can be described by evolutionary trajectories through a morphospace and how the phenotypes are ordered in the space. Many classifications of kinds of morphometric variables exist (Reyment 1984; Bookstein 1991; Zelditch et al. 2004; Hammer and Harper 2005), but here I will concentrate on the distinction between homology and homology-free variables. I will also concentrate on geometric variables that can be used to graphically represent the phenotype, though some of this discussion could be extended to other types of variables.

Homology-based variables are ones that are associated directly with a particular biologically homologous structure. Landmark points that are applied to specific homologous substructures of the phenotype are the most familiar example (Fig. 3a). Regardless of whether such points are analyzed as Cartesian coordinates (Bookstein 1991) or as interpoint distances (Strauss and Bookstein 1986; Lele and Richtsmeier 1991), there is a fixed correspondence between variables and homologous substructures of the phenotype.

Fig. 3
figure 3

(a) Homology-based mapping of landmark points on a lower third molar of Marmota caligata in occlusal view. (b) Homology-free outline coordinates around the perimeter of the same tooth. (c) Homology-free 3D grid-coordinates on the surface of the tooth

Homology-free variables are ones that are applied to the phenotype using a mathematical algorithm that is “homologous” in its orientation to place individual points at regularly specified intervals that may or may not fall on the same homologous substructure. Outlines (Younker and Erlich 1977; Lohmann 1983; MacLeod and Rose 1993) and 3D surfaces (Salazar-Ciudad and Jernvall 2004; Wiley et al. 2005; Polly 2008; McPeek et al. 2008), whether quantified as polynomial functions, Fourier descriptors, angular deformations, or Cartesian coordinates, are examples of homology-free variables (Fig. 3b, c). Homology-free characterizations have a fixed number of variables that help define the dimensions of the associated morphospace, but the number of homologous substructures is not necessarily fixed. This property of homology-free characterizations of the phenotype potentially allows structures to be gained or lost in evolutionary transformations in a morphospace, though the potential is limited to structures that are identifiable from the contours of a curve, an outline, or a surface. Homology-free variables hold the most promise for modeling the evolution of novelty.

The pros and cons of homology-free variables have been discussed at length in the morphometric and systematics literature (e.g., Bookstein et al. 1982; Erlich et al. 1983; Bookstein 1991; Zelditch et al. 1995; MacLeod 1999). Homology-based variables are required if one wants to measure variation in particular homologous substructures (though some methods are better suited for this task than others (Lele and Richtsmeier 1992)), to maintain a one-to-one correspondence between morphometric structures and homologous characters in a phylogenetic analysis, or, arguably, to validly use morphometric variables as data for phylogeny reconstruction. Homology-free variables are required if one wants to characterize shape variation that includes the gain and loss of homologous substructures. Because of existing disagreements about the validity of homology-free methods for characterizing geometric shapes, three things are worth emphasizing: (1) the “homology-free” characterization I am describing is homology-based in that it is applied to biologically homologous structures in a biologically comparable orientation; (2) that relative morphometric distances between phenotypes as wholes are closely correlated, regardless of which characterization is used because both approaches are fundamentally measuring the same phenotype; and (3) homology-based and homology-free characterizations grade into one another because homologous landmarks can be used to restrict homology-free curves, semi-landmark constellations, or subsurfaces to particular homologous substructures (MacLeod 1999; Bookstein et al. 2003; Wiley et al. 2005). The difference between the approaches is really a matter of how variables are mapped onto substructures and which interpretations are possible given that mapping.

The potential for modeling evolutionary novelty is most fully realized with full three-dimensional characterizations of entire structures. Morphometric methods for the analysis of 3D surfaces are in their infancy and have logistical and mathematical issues waiting to be resolved. A handful of studies have appeared, however, that demonstrate the power of surface characterizations for representing some kinds of evolutionary gains and losses. Salazar-Ciudad and Jernvall (2004), for example, used a simple grid of 3D points whose x y base was placed on a mammalian tooth row and whose z heights were used to characterize the shape of the tooth row. This method is able to describe variation in the number of teeth, presence and absence of cusps and cingulae on individual teeth, as well as relative shape and proportions of teeth, cusps, and dentitions. Polly (2008) used a flexible grid of 3D points fit to the surface of the calcaneum to characterize variation across taxonomic and locomotor groups of the mammalian Carnivora. This system was able to capture the presence and absence of the peroneal process, as well as the functional proportions of the bone and the curvature of articulating joints. Wiley et al. (2005) used a combination of homologous landmark coordinates, sliding semi-landmark curves, and surface patches of semi-landmarks to characterize entire primate skulls. McPeek et al. (2008) used 3D spherical Fourier harmonics to describe the entire surface structure of elements of the male genitalia in damselflies. Male and female structures interlock in these insects in a complicated way that is associated with prezygotic reproductive isolation. Transformations in the phenotype from one species to another are structurally dramatic. Each of the last three studies modeled evolutionary change in the respective phenotypes—including the gain, loss and transformation of substructures like the peroneal process—by applying statistical models derived from evolutionary quantitative genetics to extrapolate empirically measured phenotypes over a phylogenetic tree.

Linking Evolutionary Genetics and Development to Morphospaces

Evolutionary Quantitative Genetics of the Phenotype

Most existing models for mapping evolutionary transformations in phenotypic morphospaces are derived from multivariate evolutionary quantitative genetic theory. The theory for phenotypic evolution is extrapolated from the theory for allele frequencies and uses basically the same equations (Fisher 1930; Wright 1968; Lande 1976; Roff 1997). Simply put, change in phenotypic means is a function of drift, selection and the matrix of additive genetic variances and covariances, G. Drift is a function of population size and G, selection is a function of fitness, which can be described by an adaptive landscape (Lande 1976; Arnold et al. 2001; Gavrilets 2004). Long-term evolution of the phenotype is the summation of the changes in mean phenotype over many generations of selection and drift.

More specifically, the multivariate phenotype P (or z in some notations) is composed of two major components, genetic, G, and environmental, E. Change in mean multivariate phenotype is described by the multiplication of a vector of selection coefficients and the additive genetic covariance matrix, such that \( {\Updelta }\bar z_{\text{s}} = G\beta \), where \( {\Updelta }\bar z_{\text{s}} \) is change in the mean phenotype due to selection, G is the additive genetic covariance matrix, and β is the vector of selection coefficients (Arnold et al. 2001; Lande 1979). Added to this is change in the mean phenotype due to drift: \( {\Updelta }\bar z_{\text{d}} = G/N_{\text{e}} \), where \( {\Updelta }\bar z_{\text{d}} \) is change in the mean phenotype due to drift, and N e is the effective population size (Lande 1979). The direction and magnitude of selection coefficients is a function of reproductive fitness, which is represented by the shape of an adaptive landscape, \( W(\bar z) \) (Arnold et al. 2001). Selection on the adaptive landscape normally moves the mean phenotype toward the peak. The literature on quantitative phenotypic evolution is rich and deals with many issues not touched on here (see the succinct review by Arnold et al. 2001).

An important feature of these evolutionary quantitative genetic equations is they describe changes solely in terms of the phenotype. Genetic and developmental correlations are described by G, which is estimated from phenotypes experimentally, perhaps with offspring-parent regression (Roff 1997). Likewise, selective and environmental correlations are described by \( W(\bar z) \) and β, which are estimated from phenotypes using an appropriate experimental design. These evolutionary genetic equations do not require specific knowledge of the mapping of the phenotype onto specific underlying genetic, developmental, or environmental factors, though the additive phenotypic effects of those factors are described by G and so influence the direction and magnitude of evolutionary changes predicted by these equations (Maynard Smith et al. 1985; Steppan et al. 2002; Hansen 2006).

The fact that the evolutionary quantitative genetic equations are expressed exclusively in terms of the phenotype make them particularly applicable to modeling phenotypic transformations in morphospace in a theoretically informed, probabilistic manner (Fig. 4). Indeed, a morphospace can be constructed directly from G if it is available, or from P, which is autocorrelated with G (Cheverud 1988; but see Willis et al. 1991) if G is not known. All of the methods used to estimated evolutionary trajectories in morphospace—short and long-term modeling of phenotypic evolution, random skewers, phylogenetic character mapping, ancestral state reconstructions, and maximum likelihood phylogenetic reconstructions—are founded on these evolutionary genetic equations (e.g., Felsenstein 1988; Martins and Hansen 1997; Rohlf 2001; Polly 2004).

Fig. 4
figure 4

Typical mapping of evolutionary genetic processes (b) into morphospace (a). Evolutionary quantitative genetic models of phenotypic evolution usually make a direct link between phenotype and fitness without intermediate consideration of interactions between molecular genetic, developmental, or environmental components of the phenotype. Evolutionary changes, such as movement of the mean phenotype up the slope of an adaptive landscape map linearly into morphospace because there is a one-to-one correspondence of phenotypic traits in the two systems. Any process that is continuous on the adaptive landscape produces continuous change in the phenotype

Alternative Developmental Models of Phenotypic Evolution

Despite the proven effectiveness of evolutionary quantitative genetic theory and the ease of applying it to evolution in morphospaces, the criticisms derived from developmental biology that phenotypic evolution does not necessarily behave according to the assumptions of this theory deserves consideration. Three issues deserve special attention: (1) the mapping of phenotypes onto underlying genetic, developmental, and environmental factors may be non-linear so that evolutionary transformations that are continuous at one level may not be continuous at the other; (2) developmental interactions may bias phenotypic variation in non-additive ways, perhaps prohibiting some variants altogether or making variation meristic instead of continuous; and (3) developmental interactions may produce novel phenotypic structures even though the underlying genetic, developmental, and environmental processes are not novel. (Novelty can, of course, arise from non-developmental transformations; I am not intending to imply that this last is the only source of novelty).

These issues have long been a part of debate about whether the developmental interactions play a role in evolution independent of adaptation and selection. The roots of developmental side of this debate extend back to Cope (1887), Bateseon (1894), Morgan (1916), and Vavilov (1922), but have been elaborated recently by Riedl (1978), Alberch (1982, 1985), Gould (1985, 2002), Müller and Wagner (1991), Kauffman (1993), Raff (1996), Salazar-Ciudad et al. (2001), Salazar-Ciudad and Jernvall (2004), and Salazar-Ciudad (2006, 2007). The issues raised by these authors could once have been considered to be outside the domain of evolutionary quantitative genetics because the transformations involved were difficult to express as quantitative traits. However, the heightened ability for geometric morphometrics to quantitatively describe complicated, multivariate phenotypes, the ease with which evolutionary transformations can be modeled using quantitative genetic equations, and the increase in knowledge about how genetic, developmental, and environmental interactions produce phenotypes make it worth reconsidering the consequences of the for modeling phenotypic evolution in morphospaces.

The developmental issues have three important consequences for modeling phenotypic change in morphospaces: (1) transitions in the phenotype often involve evolutionary gains or losses that can be quantitatively equivalent to the gain and loss of variables or morphospace dimensions, except possibly when homology-free variables are used; (2) evolutionary transitions within phenotypic space cannot be expected always to be continuous, even though the transitions in their underlying genetic parameters might be; and, therefore, (3) the equations that describe evolution of the phenotype may need to be fundamentally different from the equations that describe evolution of allelle frequencies, molecular sequences, gene expression levels, or other factors that underlie the phenotype.

One key to understanding the potential conflict between developmental models of the phenotype and the expectation of phenotypic evolution inherent in evolutionary quantitative genetic models is that standing variation in the phenotype does not directly give information about the underlying developmental processes (Salazar-Ciudad 2007); any observed pattern of phenotypic covariance might be the product of one of several possible systems of interaction among underlying factors (Rice 2004a). Depending on the interaction of the underlying factors, evolutionary change might take the course predicted from the additive effects of G or it might not depending on the situation. Instead of proceeding continuously, phenotypic evolution may wander continuously in one region of morphospace before jumping discontinuously to another. Instead of e being able to take on all its possible values, a phenotypic variable may have limits that cannot be crossed because they describe developmentally prohibited phenotypes. Instead of a multivariate phenotype being composed of a fixed number of variables, structures may be gained and lost such that terms would have to drop into and out of modeling equations to describe the changes.

Mammalian teeth provide an example of how developmental interactions can produce such biased and discontinuous phenotypes even though the underlying changes in underlying molecular genetic factors may be continuously gradational (see reviews by Thesleff and Sharpe 1997; Jernvall and Thesleff 2002). Teeth form along the dental arch as a series of interactions between ectodermal and neural-crest derived mesodermal layers, which are regionalized into incisors, canines, premolars and molars by HOX-like expression domains. Individual teeth start as buds in the ectodermal epithelium, which are surrounded by a proliferation of mesenchyme cells. Iterative signaling within and between these layers, initiated by the primary enamel knot cells in the epithelium, causes the bud to invaginate and fold into the rough shape of the tooth crown, after which mineralization forms the enamel and dentine structures that will later erupt into the mouth as a functional tooth. While involving many gene products, the developmental system that produces the crown shape can be characterized by the epithelial rate of cell division, the rate of expression of activator molecules, which signal affected epithelial calls to differentiate into new signaling centers when a threshold concentration is reached, inhibitor cells, which inhibit production of activator molecules and which stimulate growth in the mesenchymal layer, and diffusion coefficients for the activators and inhibitors (Salazar-Ciudad and Jernvall 2002). Small changes in any of these parameters can change the pattern of folding to increase or decrease cusp sharpness, increase or decrease the number of cusps, transform cusps into lophs, or to reorganize structures on the tooth crown. Changes in expression of signaling molecules can have dramatic effects on several structures in several teeth. Up or down regulation of ectodysplasian, for example can change the number of cusps, the position of cusps on the crown and the number of teeth in the tooth row (Kangas et al. 2004). These dramatic, discontinuous changes across the entire tooth row are possible because signaling happens by diffusion. Changes in growth of the developing tooth and the folding of its tissues can affect the outcome equally as much as actual changes in the level of expression of the signaling molecules. Change in one parameter thus can produce non-linear knock-on effects in the entire system.

Phenotypic differences like the ones associated with ectodysplasian are characteristic of differences found among phylogenetically distant species, but also of variation within species (Wolsan 1989; Jernvall 1995, 2000; Szuma 2002). Differences in cusp number, tooth number, and the topography of features on the tooth crown are arranged in geographic clines similar to ones characteristic of body size, for example (Szuma 2007). Furthermore, continuous variation in underlying developmental parameters can produce patchy and discontinuous distributions of phenotypes in morphospace (Salazar-Ciudad and Jernvall 2002). It would clearly be of interest to be able to model population and evolutionary changes in such traits in a way that is consistent with these patterns, a goal that is not obviously obtainable from existing models of evolutionary transformations in morphospaces.

Evolutionary Quantitative Genetics Extended

Theoretical work has already begun to extend evolutionary quantitative models to link underlying genetic, developmental and environmental factors with the phenotype instead of treating both levels independently with the same equations (Johnson and Porter 2001; Rice 1990, 2000, 2002, 2004a, b; Wolf et al. 2001, 2004; Hunt et al. 2007). These approaches insert a quantitative level between the adaptive landscape, on which genetic parameters evolve in response to selection and drift, and the phenotypic level (Fig. 5). The new level is the phenotypic landscape, which describes how interactions between genetic, developmental, environmental are translated into phenotypes. Each phenotypic trait (or independent component of a multivariate phenotypic trait) has an equation that describes its relation to the underlying factors (Rice 2004a). The relationships may be additive, pleiotropic (if the same factor appears in more than one equation), or non-additive as is appropriate. The equations that describe real phenotypic traits must be based on information from developmental, genetic, or environmental studies, though hypothetical equations could be used purely for the purposes of simulation (as was done to generate Fig. 5). The equations for the various components of the phenotype can be combined into a multivariate phenotypic landscape, which describes how the phenotype changes in response to changes in the underlying parameters (Rice 2004a). In situations when the effects of the underlying factors on the phenotype are purely additive, then the system behaves as expected under the more standard quantitative genetic equations reviewed above; however, when some effects are not additive, the system behaves in a more complicated manner that includes biased patterns of multivariate phenotypic variance, discontinuous transformations of the phenotype in response to continuous transformations in the underlying factors, and the appearance of novel traits in response to changes in how the factors underlying the phenotype interact. Whether these extended quantitative genetics models can capture the morphodynamic aspects of development described by Salazar-Ciudad and Jernvall (2002) is not clear, but they certainly capture many of the properties of phenotypic variation considered to be important by evolutionary developmental biologists.

Fig. 5
figure 5

Alternative mapping of genetic processes (c) into morphospace (a) using phenotypic landscapes (b) as an intermediate step. The complicated, multivariate phenotypes described by the morphospace can be biologically decomposed into separate phenotypic traits, each of which has its own equation to describe its relationship to underlying environmental and genetic factors. The independent contribution of each equation to the complete phenotype, as well as non-linear terms in the equations, permits non-linear, discontinuous phenotypic transitions to result from continuous transitions on the adaptive landscape. If the morphospace is constructed from homology-free variables, the discontinuous phenotypic changes can include the gain and loss of structures, as well as transformations in their shape and relative size

A Revised Approach to Morphospace Modeling?

Both the discontinuity in morphospace of phenotypic transitions and the evolution of novel features, which is another type of mathematical discontinuity, present challenges for modeling evolution in morphospace. Generally speaking, evolutionary genetic models of morphological evolution describe the effects of selection and drift completely in terms of changes in phenotypic means and variances. Quantitative constructs like the phenotypic adaptive landscape can be mapped directly into morphospace because there can be a one-to-one correspondence between variables in the two systems (Fig. 4). Continuous evolutionary transitions on an adaptive landscape map directly to continuous phenotypic transitions in morphospace, though it should be noted that the topography of the adaptive landscape can itself be complicated, perhaps including non-linear discontinuities (Gavrilets 2004).

One issue with the mapping of evolutionary process into morphospace is whether developmental interactions and other constraints can be realistically taken into account when evolutionary transformations are modeled. A strong argument has been made that developmental interactions that influence phenotypic variation are described by G (Maynard Smith et al. 1985; Hansen 2006). Processes that introduce biases or correlations in phenotypic variation will manifest themselves as correlations in the adult phenotype. Because G is derived from such phenotypic data, it incorporates the effects of developmental interactions even though those may not be known directly. Even the effects of morphodynamic processes are expected to be imprinted in G, making it possible to predict evolutionary transformations in the phenotype that allow for developmental interactions. One objection to this view is that heritable variance may exist for a trait in an existing population, but developmental interactions may inhibit the production of new variance for that trait outside the observed range. Standard evolutionary genetic equations would allow the phenotypic mean to move into a developmentally prohibited range because G describes variance in that direction. Furthermore, G only describes additive covariances. The non-additive effects of interactions among underlying factors will not be represented in G, making it a poor predictor of evolutionary response in some situations (Rice 2004a).

The extended evolutionary quantitative genetic system of Rice and others could be used to revolutionize the way that evolutionary transformations are modeled in morphospace. Their new system models many of the properties of phenotypic variation considered important by the evolutionary developmental biologists listed above (Rice 2004a, b). Non-linear interactions at the level of the phenotypic landscape can result in phenotypic variances that are not necessarily even close to multivariate normal, but which can be biased in surprising ways by the topography of the phenotypic landscape. Novel features can arise as traits with no heritable variation become heritable due to changes in the underlying factors, after which the new feature will start to evolve over the landscape. Discontinuous transformations in the phenotype can result from continuous transformations in the underlying genetic factors. An example of such discontinuity is illustrated in Fig. 5, where the non-linear phenotypic landscapes transform the simple linear change in two genetic factors in response to selection on a Gaussian adaptive landscape into a series of jumps in morphospace. While the teeth associated with the phenotypic clusters in Fig. 5 are merely illustrative of the possibility of what those clusters represent, the clusters themselves result from the mathematics of the two underlying layers.

A homology-free method for characterizing the phenotype would be required for such a system to model evolutionary novelties (or the loss of phenotypic structures). Many of the interesting phenomena in evolution of the phenotype involve the gain and loss of features, the origin of novelty (Raff 1996; Nitecki 1990; Müller and Wagner 1991). Despite the desirable properties of the extended quantitative genetic system, the origin of novel features cannot be modeled if the variables used to describe the phenotype are exclusively tied to existing homologous structures. With homology-free variables novel features can be visually represented by particular combinations of points at the appropriate topographical position, just like new shapes can be created on a bitmap computer screen by changing the colors of particular combinations of pixels. Salazar-Ciudad and Jernvall (2004) used a simple version of such a homology-free system to assess disparity and complexity of tooth shapes generated by their computer model of tooth development. For phenotypes like the dentition, where the novel features are cusps, crests, and teeth (Salazar-Ciudad and Jernvall 2004), for phenotypes like the mammalian tarsus, where novel features are processes and bones, (Polly 2008), or for phenotypes like insect genitalia, where novel features are projections and invaginations (McPeek et al. 2008), homology-free surfaces are capable of representing most of what one would need to describe the gain and loss of homologous substructures. Other kinds of evolutionary novelties, such as internal organs, may not be easy to capture with existing methods.

The primary challenge for using this system to model morphospace transformations is that the equations that map phenotypic traits onto underlying factors must be known from experimental studies, even though no such information is available for many of the phenotypic traits of interest to evolutionary biologists, systematists, or paleontologists. (One can, however, imagine a Bayesian estimation of the parameters of the phenotype equations from observations of phenotype, environment, and gene expression).

Conclusions

Morphometric spaces are based purely on variables that represent phenotypes, with no reference to factors that influence evolution of one phenotype to another, regardless of whether those factors are developmental interactions, genetic linkages, selection, or whatever. Nevertheless, morphospaces are often used in conjunction with evolutionary quantitative genetic equations to model the short or long-term course of phenotypic evolution, to map phenotypic traits onto phylogenetic trees, to reconstruct ancestral phenotypes at tree nodes, or even to reconstruct phylogenetic relationships from the quantitative traits that define the morphospace. All of these procedures rely on certain assumptions about continuity and direction of phenotypic change within the space in response to typical evolutionary processes such as drift and selection. Furthermore, the geometry of most morphospaces and the way that their associated variables are mapped onto the phenotype mean that evolutionary transformations in morphospace can only describe changes in a static set of features, but not the gain and loss of features.

The theoretical underpinnings of such transformations in morphospace are founded on evolutionary quantitative genetic theory of phenotypic evolution. Key features of this theory are that population means follow continuous trajectories through phenotypic space; the direction of change is a function of genetic covariances, selection (and selective covariances), drift; and the magnitude of change is function of genetic variance, selection intensity, and population size. Because this theory forms the basis for existing methods for projecting evolutionary trajectories through morphospace, these trajectories have the following properties: (1) changes in phenotype are continuous; (2) nothing except selection or additive variance prevent the phenotype from moving infinitely in a given direction; and (3) there is no obvious provision for gain or loss of features.

Nevertheless, studies of development have demonstrated that in many systems phenotypic variation may be discontinuous, it may be biased in certain directions, the interactions of genetic, developmental, and environmental factors may have non-linear effects on the phenotype, and “novelties” may arise from these non-linear interactions. Morphometric and quantitative genetic analysis of the phenotype are only indirectly indicative of the underlying developmental, genetic and environmental factors that channel variation in the phenotype. Some but not all of the underlying developmental factors are captured in measures of phenotypic variance and covariance.

A multi-level system for modeling evolutionary change in morphospaces using the extended evolutionary genetic equations developed by Rice and others for phenotypic landscapes deserves further attention. Of special interest would be integrating the phenotypic landscape equations with existing geometric morphspaces. While this elaboration is unnecessary for morphometric studies of existing phenotypic diversity, it could become very powerful tool for modeling long-term phenotypic evolution, including radiations involving evolutionary novelties, with the discontinuous, non-linear phenotypic transformations that characterize so many biological systems.

Does any of this matter? Are quantitative genetic extrapolations, such as random skewers simulations of phenotypic change over a single generation, likely to be dramatically altered by incorporating non-additive interactions of genetic, developmental and environmental factors? Are ancestral reconstructions based on extrapolating multivariate phenotypes across a phylogenetic tree likely to be dramatically different for having included these effects? Probably not. Most morphometric studies to date have self-selectively focused on systems where the assumptions of morphospace modeling are met, where the traits under consideration are consistently present across the taxa being studied, where the phenotypic changes are likely to have been fairly continuous, and where additive genetic variance for the range of phenotypes under consideration is possible. The non-additive effects of developmental interactions may be more important for long-term simulations of morphological change over paleontological timescales, though it should be remembered that evolutionary “novelty” can be an issue even within populations, as demonstrated by variation in the presence and absence of cusps and crests in mammalian teeth. But if geometric morphospace modeling can be extended to deal with discontinuous variance, with discontinuous evolutionary change, and with the origin of novelty, then these same powerful methods may in future be applied to new problems, such as the analysis of discontinuous variation or to phenotypic transitions in the early radiation of metazoan animals. Better methods for homology-free representation of phenotypic variation will be required, as will sophisticated, non-linear algorithms for creating morphospaces, for drawing trajectories through them, and for ordinating extremely disparate morphologies in the same morphospace. Are these improvements possible, or necessary, in the near future? We shall see.