Introduction

Most of evolution has happened in geological time, so if we wish to understand the origin of organismal diversity, it would be paramount to examine the contributions that fossils can make to this task (Raff 2007; Sánchez-Villagra 2012). The study of several morphological transformations in vertebrate evolution, such as those concerning the origin of the tetrapod limb (Shubin et al. 2009), the mammalian middle ear (Luo 2007; Luo et al. 2007) and the turtle shell (Scheyer et al. 2012) provide examples of the illuminating integration of embryological with paleontological data. Fossils serve to date the tree of life and provide estimates on when and under which paleoecological circumstances new developmental programs arose in evolution (Peterson et al. 2007; Schoch 2009). On the other hand, the more direct role that paleontology can play in studies of development is far from obvious. Fossils are incomplete and no experiments can be conducted with them. What, then, is the place of paleontology in the larger context of studies of morphology and its developmental origin? This is a large question, and to address it some fundamental historical developments about current evolutionary theory are here first summarized.

Gene-centric conceptualization characterized the Modern Synthesis (MS) that surged in the 1930s and 1940s. The paleontologist George Gaylord Simpson, considered one of the “architects” of the MS, assumed that microevolutionary data can be simply extrapolated to explain macroevolutionary patterns (Simpson 1944). The importance of processes above population-level was largely disregarded, and with time several authors raised the issue (e.g., Rensch 1959; Eldredge and Cracraft 1980; Gould 1980a; Vrba 1983; Erwin 2000; Leroi 2000; Grantham 2007). Population-genetic approaches (Charlesworth et al. 1982) face difficulties when addressing long-term patterns of evolution because they are unable to determine and account for the frequency and intensity of mass extinctions, and the large-scale modes and rates of evolution (e.g., Hunt 2007; Jablonski 2007, 2010). The growing interest and major developments of the last decades in evolutionary developmental biology (EvoDevo) have also led to the recognition of the incompleteness of the MS (Gould 1977; Alberch 1989). Some authors argued for an “extension” of the MS to incorporate development (Goodwin 1988; Gilbert et al. 1996; Pigliucci and Müller 2010). Central here has been the critique to an adaptationist program as it surged from the MS (Gould and Lewontin 1979), and the growing importance of the concept of constraints in its different forms (Alberch 1980, 1989; Smith et al. 1985; Oster et al. 1988; see Urdy and Chirat 2006 for a review). The approach of Konstruktionsmorphologie and the triad of factors involved in evolution advocated by Seilacher (1970) already contained the ideas of constraint that became so dominant in the English-speaking literature in later years.

During the time of the MS, researchers outside Anglo-American circles also developed approaches that took into account morphology and the developmental origin of organic form (Olsson et al. 2010). For example, the works of Alexei Nikolaevich Severtsov were influential to other authors around the time of the MS (Rensch 1959; Schmalhausen 1949). Most notably, Severtsov (1912) detailed reasons why he thought morphology was of key importance among the evolutionary sciences. These centered upon how evolutionary theory could not be independent of morphology (see Adams 1980 for review). One argument Severtsov put forth was that without knowing of the changes that have occurred in life’s history (without examining the fossil record), it is impossible to formulate a theory of how such changes might have happened. Essentially, paleontological data, in a unique manner, represent, “all we know directly about the actual course of life’s history” (Gould 1980b, p. 153), and thus define the patterns to be explained.

In the Anglo-American discourse of the late 1970s and early 80s embryology and morphology were reconnected to evolutionary questions (Gould 1977; Alberch 1980; Raff and Kaufman 1983; review by Gilbert 2003). Some authors repeatedly argued for the search of general rules of development advocating that at least some amount of biological order was caused by the dynamics of development (Gould and Lewontin 1979; Alberch 1980, 1989; Webster and Goodwin 1982; Alberch and Gale 1985; Goodwin 1988; Oster et al. 1988; Kauffman 1993). The debate concentrated mainly on the nature and clarification of the concept of constraints (Alberch 1980; Smith et al. 1985) and the specific role that one has to attribute to natural selection and development, respectively, to account for the origin of order (Kauffman 1993). But the debate on constraints also produced some ambiguity, given the different conceptions of constraint (Amundson 1994). One relevant point, raised by Salazar-Ciudad (2006), is that the selection/developmental constraints debate relies on two different assumptions about the relationship between genotype and phenotype (linear/non-linear) and what kind of morphological variation is produced by development (gradual/discrete, unbounded/limited).

Developmental Biases in Evolution

Historically, the concept of developmental constraints was cited to argue against the view that without selection, phenotypic variation would be random (e.g., Alberch 1980). Putting aside the complications in defining “random phenotypic variation” (Eble 1999), it appeared that in practice, the MS assumed that variation was gradual and “in every direction.” This view has been much debated since the late 1970s under the umbrella of “developmental constraints,” to bring about the notion that, as all morphologies are constrained by the rules of chemistry, physics, and geometry (Thompson 1917), development is a source of order structuring variation into discrete phenotypes. The dynamical interactions of various developmental factors then set out the possibilities for variation of morphologies during development and evolution. For instance, it has been argued that the relative level of conservation of body plans could be an expression of the limited possibilities of development rather than an expression of adaptation by natural selection (Hall 1996). Likewise, the broad distribution of homoplasies could also be due to developmental constraints (Wake 1991).

The term “constraints” has also been used in a somehow different meaning than discussed above, as in the mathematical constraints that correspond to the initial and boundary conditions of the system under study. In mathematical models, the behaviors of developmental systems are described, characterized, and predicted thanks to the rules of interaction between molecules, proteins, cells, and/or tissues under a particular set of constraints and initial boundary conditions. Without such constraints, the behavior of a system cannot be predicted. In this view, constraints are given a decisive and “creative” role (Urdy 2012).

When critics of the MS pointed out that not every kind of variation is developmentally possible (first sense), they were arguing that development was limiting the range of possible variation on one side, and that development was creative on the other side (second sense). In this way, the first and second meanings of constraints partially overlap, the first building extensively on the second. Probably because of this ambiguity, it has been advocated that this term would be best replaced by “developmental bias” (Arthur 2004).

Independent of the operational issues related to constraints and developmental biases, it is currently agreed that some phenotypes are possible and extremely probable whereas others are unlikely or even impossible. The morphological space occupied by a clade of organisms is then a reflection of the robustness of development on one side and a reflection of its developmental plasticity on the other side (Kaneko 2011). Looking at the space occupied by a clade considering only extant species can be misleading since fossils often record a vast number of phenotypes that do not exist anymore. As Wilson (2013) stated, “the vast record of geological time provides the richest account of what is possible to do.” Modern molecular EvoDevo methods will not inform us about the development of phenotypes that no longer exist, and a theory of evolution that does not account for those cannot be complete.

Unique Patterns of Life History Revealed by the Vertebrate Fossil Record

Adult phenotypes of extinct organisms can inform us about developmental processes by showing a combination of features or levels of integration unrecorded in living species. Fossils chronicle how phenotypic evolution progressed and the (paleo)ecological factors that had shaped them. These phenotypic character transformations serve to confirm the constraint hypotheses of developmental evolution based on extant forms, or expand the range of morphospace that a clade can occupy. Without inspecting this evidence, the subject matter of developmental studies remains incomplete (Wagner and Larsson 2003). The study of vertebrate fossilized ontogenies is mostly restricted to postnatal and late stages of growth, but nevertheless can deliver great insights into life history and evolutionary mechanisms affecting development in general. The following examples serve to illustrate this point.

Island Mammals’ Metabolism and Bone Growth

Paleohistology, the fastest growing area of research in developmental paleontology of vertebrates, can address many aspects of postnatal/post-hatching life history. It can serve to estimate the age of sexual maturity and of death, to understand activity cycles and reproductive cycles as well as to decipher growth patterns (e.g., Chinsamy-Turan 2005; Sander and Klein 2005; Cubo et al. 2011). Among the main features studied by paleohistologists are the “lines of arrested growth” (LAGs), which mark the cessation of appositional bone growth. Endogenous and exogenous conditions most likely trigger the formation of these growth marks, and their cyclical appearance, probably coupled with seasonal changes in the environment, is used to estimate the age of fossil individuals at the time of their death. It has been generally assumed that this feature is “erased” in animals with a high metabolic rate and an associated high degree of bone remodeling. Whereas LAGs are commonly recorded and studied in reptiles, mammals are not expected to have them or rather have them mostly obliterated during growth. However, a recent report of a dwarf bovid from the Pleistocene of Mallorca, Myotragus balearicus, indicated the presence of numerous LAGs in adult individuals. This finding, coupled with a low degree of bone resorption and remodeling, suggest that Myotragus had a crocodile-like mode of growth and physiology unrecorded among living species (Köhler and Moyá-Solá 2009). Current work on other island mammal dwarf forms, including the deer Candiacervus from the Pleistocene of Crete, shows that the bone specializations in Myotragus are not universal for other island mammals (Kolb et al. 2011).

Skeletal Formation in Placodont Reptiles

Placodontia is a group of armored marine reptiles restricted to the Triassic period and part of the sauropterygian radiation that also consists of plesiosaurs among other forms. All placodont species possess dermal armor plates, some building a single row dorsal to the spine, others superficially resembling turtles in forming an armor shell. In a study of bone microstructures in this group, Scheyer (2007) discovered the unique presence of cartilaginous tissue in some postcranial armor plates of placodonts. The developmental pathways leading to the “postcranial fibro-cartilaginous bone” tissue of placodont armor plates is unique among tetrapods, in which otherwise osteoderms develop intra-membraneously or through metaplastic ossification without cartilaginous preformation. Placodonts were aquatic, and as other groups living in this kind of environment, they possessed pachyostotic limb bones (e.g., de Ricqlès and de Buffrénil 2001). Scheyer (2007) interpreted the unique presence of compact “postcranial fibro-cartilaginous bone” as an osteosclerotic trend in the armor plates, which, as in the limbs, aided in buoyancy control, affecting manoeuvrability and swimming speed. The paleoecological context of placodont evolution in which the skeletogenetic innovation arose is well known (Scheyer et al. 2011).

The Marine Reptiles’ Case: Iguanas and Pachypleurosaurs

High bone compactness has evolved in several lineages of land vertebrates that have secondarily and independently adopted an aquatic lifestyle, related to the need for buoyancy control. Hugi et al. (2011) provided an example of how in some extinct marine reptiles bone compactness developed in a different way than in other, analogous marine reptiles. The groups in question are the pachypleurosaurs (extinct sauropterygians from the Triassic), and the marine iguana. In pachypleurosaurs from the Triassic of Monte San Giorgio in Switzerland, a medullary cavity never forms, and high bone compactness develops by adding layers of compact bone around a mineralized cartilaginous center (Hugi et al. 2011). In the iguana, in contrast, a medullary cavity is visible, and a higher bone compactness is achieved only by increased periosteal ossification (Hugi and Sánchez-Villagra 2012). The comparison of the living with the fossil provides an example of the flexibility of variation of developmental mechanisms.

Body Size and Life History

A seemingly simple feature that fossils document with large implications for life history evolution is body size. In the fossil record, the maximum body size appears to be much larger than one may have predicted from the living species, such as 73-ton sauropods, half-a-ton rodents, and 3-ton diprotodontian marsupials (Geiger et al. in press; Mazzetta et al. 2004; Wroe et al. 2004). This apparently simple variable is correlated with many life history variables. To attain those extreme sizes, growth could have occurred over a long period of time or have been accelerated. This matter has been examined in many groups of land vertebrates, with a main focus on dinosaurs. Scheyer et al. (2010) recently presented a summary of these studies (see also Erickson 2005). Living ectotherm reptilians like crocodylians and turtles display lower growth rates but prolonged time spans of continued growth between sexual and skeletal maturity, as well as extended life spans afterwards in which growth virtually ceases. With few exceptions (i.e., elephants), mammals reach sexual maturity shortly after reaching full adult size, and life spans are usually shorter. Birds reach full adult size extremely fast with extremely high growth rates, and determinate growth coupled to high metabolic rates is also characteristic of this growth pattern (Scheyer et al. 2010). Like other reptile groups (e.g., crocodylians or turtles), dinosaurs also exhibited continued growth until skeletal maturity, and extended life spans afterwards. By having high growth rates, non-avian dinosaurs could achieve giant sizes, as seen in large theropods or in the gigantic sauropods. Non-avian dinosaurs may thus have had a life history influenced by “increased physiological demands and/or predation exposure associated with reproduction” (Erickson et al. 2009, p. 1514).

Vertebral Numbers in Amniotes

The vertebral number in each region of the axial skeleton, easily recorded in fossils of adult individuals, provides indirect information about somitogenesis and Hox-gene expression boundaries (Thewissen et al. 2012). The number of somites has a one-to-one correspondence with that of segments in the axial skeleton, as two somite-halves are involved in the development of each vertebra (Head and Polly 2007). Another coupling is that of the boundaries among regions of the vertebral column—including cervical, thoracic, and lumbar anterior to the sacrum—and the expression domains of some Hox-genes, which then determine the morphological identity of the sections of the body (Wellik and Capecchi 2003). With this background, Müller et al. (2010) examined presacral vertebral counts across extant and extinct amniotes and mapped them onto phylogeny. They used the relationship between presacral and cervical numbers to infer the relative influence of homeotic effects and meristic changes, and found no correlation between somitogenesis and Hox-mediated regionalisation. Furthermore, they reconstructed ancestral states of major clades in amniote evolution using squared-change parsimony, thus tracing the evolution of segmentation and regionalization in that clade. The mammalian and the reptilian lineages show early in their evolutionary histories clear divergences in axial developmental plasticity, with basal stem mammals (the early synapsid lineage) sharing the conserved axial configuration of crown mammals, and basal reptiles exhibiting the plasticity of extant taxa. These results contradicted the hypothesis that a developmental constraint involving high metabolism is characteristic of mammals (see also Hautier et al. 2010), as the stem forms with a reptilian physiology already were conservative (Sánchez-Villagra 2010). Müller et al. (2010) also found that whereas conservatism in presacral numbers characterized early synapsid lineages, in some cases reptiles and synapsids exhibit the same developmental innovations in response to similar selective pressures. Conversely, increases in body mass are not coupled with meristic or homeotic changes, but mostly occur in concert with postembryonic somatic growth.

This examination of segmentation using the fossil record is preceded by a series of elegant works examining that in trilobites. In that group, a major discovery has been the increased canalization or lack of plasticity during trilobite evolution and how the diminution of regionalization led to more plasticity in some clades (Hughes et al. 2006), the latter a phenomenon also recorded in amniotes (Müller et al. 2010).

Exceptional Preservation of Mineralized Fossils and Soft Parts

Exceptional fossils of rarely preserved stages can resolve issues of species identification and document evolutionary changes in reproductive modes. In the case of vertebrates, a comprehensive survey of around 1,600 references on fossilized ontogenies (www.developmental-palaeontology.net) served to identify topics and taxa which are the subject of much investigation and others with much potential for research. There are extensive records of embryos or juveniles for “fish” (Cloutier 2010), amphibians (Fröbisch et al. 2010), reptiles (Delfino and Sánchez-Villagra 2010), synapsids (Sánchez-Villagra 2010), and hominins (Zollikofer and Ponce de Leόn 2010). In the latter case, further insights may be revealed through the study of dental development to reconstruct growth and the timing of developmental milestones over the course of hominin evolution (e.g., Humphrey et al. 2008; Dean 2010; Humphrey 2010). Advances in non-invasive imaging methods have started to allow the extraction of previously inaccessible data (Tafforeau et al. 2006). This is particularly true for very tiny specimens, such as fossilized embryos from the Neoproterozoic Doushantou Formation of China, which have been examined with the aid of synchrotron microtomography (e.g., Donoghue et al. 2006a, b). Such methods have also allowed the reconstruction of the movements of the oro-pharyngeal elements of conodonts operated during feeding, lending strong support to their interpretation as vertebrates (Goudemand et al. 2011). The origin of jaws can best be understood by comparing fossil and living jawless vertebrates, using insights from developmental studies and from the evolutionary record (Kuratani 2012).

Mineralized Skeletons and Biomineralization Modes

Ontogenetic information on fossil “invertebrates” is mostly restricted to animals with a hard or mineralized exoskeleton (e.g., Klug 2001; Nützel et al. 2007; Sumrall and Wray 2007; De Baets et al. 2012; Korn 2012). These exoskeletons usually preserve a record of ontogeny due to their accretionary growth.

Biomineralization has evolved many times independently in many clades of animals and plants, as also documented paleontologically (Murdock and Donoghue 2011). The study of the geochemical properties of fossils has revealed unsuspected capabilities to change a biomineralization system. However, developmental plasticity among groups varies. This must have affected past patterns of evolution and may potentially affect the future of shelled organisms in marine ecosystems, as exemplified by corals, which are among the most prolific biomineralizing organisms. Their case is of particular interest nowadays, as the increased atmospheric CO2 levels increase the acidity of sea water, leading to their decalcification (Anthony et al. 2008). What is the fate of “naked” corals in evolutionary terms? Can they resume calcification if atmospheric CO2 levels decrease? Looking at the geological past provides examples of several responses to environmental changes in biomineralizing organisms, including corals. The first corals to appear in the Middle Triassic—after the major extinction event at the end of the Permian—were scleractinian corals (Brayard et al. 2011). Their ancestors are supposed to have been “naked”, anemone-like corals that survived the Permian mass extinction. It had been assumed that scleractinian corals form purely aragonitic skeletons. But an exceptionally preserved fossil from the Upper Cretaceous possessed a purely calcitic skeleton (Stolarski et al. 2007). This implies that these corals could form skeletons of different carbonate polymorphs, as do some other but not all groups of marine, calcium carbonate-producing organisms (Stolarski et al. 2007).

Preservation of Soft Parts

Among arthropods the best fossils yielding ontogenetic data from their exoskeletons are trilobites (e.g., Barrande 1852; McNamara 1978; Tripp and Evitt 1986; Edgecombe et al. 1988, 1997; Chatterton and Speyer 1989; Chatterton et al. 1990, 1994; Lee and Chatterton 1997, 2003, 2005; Zhang and Pratt 1999; Clarkson and Ahlberg 2002; Lerosey-Aubril and Feist 2005a, b; Hughes et al. 2006, 2008) and bivalved arthropods, such as ostracods (e.g., Spjeldnaes 1951; Maness and Kaesler 1987; Hoare 1991; Tinn and Meidla 2003, 2004), bradoriids (in former times misidentified as ostracods; Zhang and Pratt 1993; Zhang 2007; Hou et al. 2010), or spinicaudate crustaceans (e.g., Olempska 2004; Tasch 1961; and references therein). Even for large arthropods with unmineralized exoskeletons such as sea scorpions (eurypterids), different developmental stages are preserved in the fossil record (Leutze 1958; Andrews et al. 1974; Cuggy 1994).

In some fossil sites there has been unusual preservation of soft parts, thus going beyond the usual exoskeleton or shells (Briggs et al. 2005). These sites include the Crato Formation in the Cretaceous of Brazil, Solnhofen in the Jurassic of Bavaria in Germany, Rhynie Chert in the Devonian of Scotland, and the “Orsten” Lagerstätten worldwide, best known from Sweden and expanding from the Lower Cambrian to the Lower Ordovician (Maas et al. 2006). As an example of how fossils from these exceptional sites can address the subjects treated in this article, we elaborate on the case of the “Orsten” fauna, which provides tiny fossils preserved in three dimensions, with minute details such as setae, setules, membraneous areas, and eyes.

The “Orsten” fauna is dominated by larval forms and some tiny adults of arthropods, mainly crustaceans of various groups. Based on the detailed material that is best studied with scanning electron microscopy (SEM), more or less complete ontogenetic series of fossil arthropods have been reconstructed (e.g., Müller and Walossek 1988; Walossek 1993) and have contributed to the knowledge of developmental patterns, as summarized in four major aspects:

  1. (1)

    Segment addition patterns absent in any extant taxon. Examples are (a) the extremely gradual mode of segment addition in the branchiopod Rehbachiella kinnekullenis Müller, 1983 that resembles the development in the ground pattern of Eucrustacea (the crown group of Crustacea; Walossek 1993), and (b) the trilobite-like development of the early crustacean Henningsmoenicaris scutula (Walossek and Müller, 1990) reflecting the development in the ground pattern of Crustacea sensu lato (Haug et al. 2010a).

  2. (2)

    Specialised larval stages. An example is the nauplius larva (one with three pairs of appendages), an autapomorphy of Eucrustacea. It is present in some taxa from the “Orsten,” rendering them representatives of the crown group (e.g., Müller and Walossek 1986, 1988; Walossek 1993), already in the lower Cambrian (Zhang et al. 2010), while other species possess a more plesiomorphic type of arthropod larva (e.g., Haug et al. 2009a, 2010a, b).

  3. (3)

    Specialized developmental rate. An example is the limb-bud delay, with first delayed and then accelerated development of trunk limbs in extant barnacles and their relatives (Thecostraca). This pattern is also found in Bredocaris admirabilis Müller, 1983 from the middle Cambrian, identifying this species as a representative of the thecostracan lineage (Müller and Walossek 1988).

  4. (4)

    Specialized developmental timing of appearance of certain structures. An example is the proximal endite, an important feeding structure medio-proximally on the limbs, which appears in different stages of the larval sequence in several species of early Crustacea sensu lato (Fig. 1; Haug et al. 2009a, 2010a, b).

    Fig. 1
    figure 1

    Simplified phylogeny of Crustacea sensu lato highlighting some heterochronic events within this lineage (cf. Haug et al. 2010b). Top: af 3D models of Cambrian “Orsten” crustacean larvae (head larva stages) and their left appendages two and three in anterior view. a, b Late planktotrophic head larva stage of Henningsmoenicaris scutula. Although already feeding, the legs lack the proximal endite, an important feeding structure that will not develop before stage seven, in which already several trunk segments are present. c, d Head larva stage of Goticaris longispinosa. A proximal endite is already developed on the third appendage (arrow), but not yet on the second appendage. e, f Martinssonia elongata, early head larva stage. Although lecitotrophic and still lacking a mouth opening, proximal endites are already developed (arrows). Bottom: Based on the phylogeny and the pattern of appearance of the proximal endites within the larval sequences, two heterochronic events can be identified, both pre-displacement (arrows). The proximal endite appears both times earlier in ontogeny than in the more plesiomorphic condition

Based on phylogenetic analyses including the developmental patterns discovered in fossils, the reconstruction of evolutionary scenarios and the detection of heterochronic events becomes feasible (Fig. 1; Haug et al. 2010a, b). Besides crustacean fossils, also other taxa are represented within the fauna with their early developmental stages, such as larvae or embryos of agnostines (trilobite-like arthropods, sister group to Crustacea sensu lato; Müller and Walossek 1987), chelicerates (Waloszek and Dunlop 2002), and different taxa of nemathelminths (Müller and Hinz-Schallreuter 1993; Maas et al. 2007, 2009; Haug et al. 2009b).

In addition to the “Orsten” in the strict sense, there are also different fossil deposits with “Orsten”-like fossils. Among these, several specimens of various taxa with preserved soft-parts yield developmental information. These taxa comprise ostracods (Weitschat 1983a, b; Smith 2000), decapod larvae (Maisey and de Carvalho 1995; Tanaka et al. 2009), insects (Duncan et al. 1998), nemathelminths (Dong et al. 2004; Dong 2007; Zhang et al. 2011), and forms of yet unknown affinities (Steiner et al. 2004).

Macroevolutionary Patterns of Allometry and Modularity

Allometry

First coined by Huxley and Teissier (1936a, b), the term allometry refers to the pattern of covariation among the size of two morphological traits. Stemming from the early works of Dubois (1897) and Lapique (1907) that examined relationships between brain weight and body weight in mammals, allometry studies have a long history in the literature, and have more recently begun to be examined from a mechanistic perspective, as EvoDevo has focused more sharply on identifying specific genes and developmental pathways that are responsible for the evolution of ontogenies (West and Brown 2005; Li et al. 2007; Sears et al. 2007).

The advent and application of geometric morphometrics—the statistical analysis of the variation in the Cartesian geometric coordinates of homologuous landmarks (Dryden and Mardia 1998; Bookstein 1991)—has reformed the ways in which morphological form can be described and has permitted intuitive visualizations of ontogenetic trajectories as vectors in multivariate morphospace (e.g., O’Higgins 2000; Ponce de Léon and Zollikofer 2001; Monnet et al. 2009). The use of ontogenetic data to create developmental morphospaces has great potential to yield a wealth of results comparable to those currently documented by the studies of adult form, which at present dominate the literature. Importantly, the study of morphological evolution in fossils and extant species is equally possible using these techniques.

Understanding the dynamics of allometric evolution can be greatly enriched by incorporating data from the fossil record. Such data provide the only opportunity to evaluate the evolutionary persistence of factors affecting morphospace occupation, on a geological time scale. Ontogenetic series are known for many fossils. Rodents, the most speciose mammalian order, represent an excellent case study. Their unparalleled taxonomic success among mammals, coupled with phenomenal levels of morphological diversity and a rich fossil record, provide a wealth of opportunities to explore phylogenetic, ecological, or functional hypotheses relating to morphospace occupation and structure. Among the Caviomorph rodents, a group endemic to South America, well-preserved ontogenetic material is known for several fossil Ctenomyid rodents. These include the Pliocene rodents Actenomys, Xenodontomys, and Praectenomys (e.g., Verzi 2008). Comparison of allometric patterns between extant and fossil ctenoymids has already revealed several insights into ontogenetic evolution above the species level, particularly in relation to the acquisition of digging adaptations (Vassallo and Mora 2007). In this regard, Actenomys is a key taxon, as it shows a plesiomorphic condition for several traits usually considered to be adaptations for digging in the Ctenomyidae. By comparing ontogenetic series of fossil and extant ctenomyids, Verzi et al. (2010) showed how changes in ontogenetic trajectories, for instance among traits for incisor and mandibular morphology, have led to diversification in cranial form within the group. These data can be further used to explore ontogenetic dynamics on a macroevolutionary scale, and to investigate the factors leading to the patterning of species in morphospace. Wilson and Sánchez-Villagra (2010) examined patterns of allometric trajectory evolution in two extant clades of rodents with differing levels of morphological diversity. Their study revealed that dietary habit played a crucial role in patterning allometric evolution in the cranium, regardless of phylogenetic relatedness. The temporal persistence of the adaptive evolutionary patterns revealed by Wilson and Sánchez-Villagra (2010) may be evaluated by incorporating growth series of fossil species (Fig. 2) that will allow a suite of questions to be addressed relating to the importance of ecological constraints in rodent evolution.

Fig. 2
figure 2

Ontogenetic series of the Pliocene rodent Actenomys priscus, which is recognized as an early fossorial representative of the Ctenomyidae. Specimens from Museo de Ciencias Naturales “Lorenzo Scaglia,” Mar del Plata, Argentina. Scale: 2 cm

In a wider context, ontogenetic trajectories are commonly represented using the major axis of covariance, and consequently through the exploration of developmental morphospaces, the extent and patterning of modification to covariance structure can be revealed, reflecting the possibilities that remain for the functional or developmental differentiation of integrated phenotypes (Eble 2004). Genetic (G) and phenotypic (P) covariance matrices have been cited as the quantitative expression of constraints that shape evolution (Arnold 1992), and much attention has been devoted to the interaction between the two (e.g., Cheverud 1984; Marroig and Cheverud 2004; Revell 2007; Arnold et al. 2008) in an attempt to bridge the gap between microevolutionary processes and macroevolutionary patterns.

Integrating fossil data into estimates of phenotypic covariance structure—through allometric evolution, integration, or modularity studies—is of particular interest because one major issue of concern is the temporal stability of the P matrix (Arnold et al. 2008). Logic confers that over a long enough time period the P matrix will alter; otherwise all organisms would have morphologies of approximately the same form. What are the time frames for covariance structure change? How do these compare between clades and how are these differences related to evolutionary success and morphological diversity? These macroevolutionary questions center on the understanding of how morphologies are generated, and measurements from fossils provide the patterns to be explained by morphogenetic models.

Modularity

Modularity refers to the differential integration of sets of characters and is reflected in the relatively high degree of covariation of units within a module and the relative independence of these units from other modules (Berg 1960; Callebaut and Raskin-Gutman 2005; Klingenberg 2008). Since the seminal paper of Wagner and Altenberg (1996), modularity has become a central concept in evolutionary biology. Morphometric methods used to identify phenotypic modules (Goswami and Polly 2010) can be applied to extinct taxa, as has been done for some mammals (Goswami et al. 2011), and trilobites (Webster and Zelditch 2011). In the paleontological context, the goal is to identify integrated morphological traits acting as units of evolutionary change among species (Eble 2004; Schoch 2006; Young et al. 2010; Gerber and Hopkins 2011).

There has been much theoretical discussion regarding whether modularity constrains or facilitates morphological evolution (e.g., Wagner and Altenberg 1996; Kirschner and Gerhart 1998; Budd 2006; Marroig et al. 2009). Theoretically, both situations are equally likely (Fig. 3), and empirical data are needed to explicitly test these assumptions. Elucidating the dynamics of trait integrations and interactions is particularly important for the explanation of macroevolutionary trends.

Fig. 3
figure 3

A hypothetical phenotypic morphospace with species (circles) unequally distributed. Each extant or fossil species may be represented by a number of landmarks recorded on adult individuals or an ontogenetic series. Traits measured may have connective relationships with one another such that they form modules or sets of highly integrated traits that behave relatively autonomously. Modularity may constrain or facilitate morphological evolution, and thus the ability of a clade to explore morphospace, that is, to generate phenotypic variation. Under the constraint hypothesis (left: purple point cloud), strong correlations among traits within a module may limit the potential of an individual trait to vary and hence comparatively increased amounts of modularity may result in limited morphospace occupation. Fewer trait interactions and modules (right: green point cloud) may thus permit a greater dispersion in phenotypic space. Under the facilitation hypothesis, the reverse would be the case. Modularity evolves, as indicated by the different hypothetical patterns and magnitudes illustrated in the caption boxes, and therefore a clade-wide study of modularity essentially reflects a lineage-specific study of the evolution of evolvability, most basically the ability of an organism to evolve. Modules are expected to arise through developmental or functional interactions among traits. Urotrichus talpoides cranium illustrated after Wilson 2012

Concerning vertebrates, the complexity of mammalian skull growth and development provides an excellent opportunity to test hypotheses about the factors responsible for variability and evolutionary change, and for this reason many studies have taken advantage of the wealth of knowledge already available for this system. Among mammals, several macroevolutionary studies of primates and carnivorans have documented patterns of modularity in the adult cranium (e.g., Goswami 2006; Marroig et al. 2009; de Oliveira et al. 2009; Shirai and Marroig 2010). These studies have revealed differences among clades, indicating that modularity changes over time.

Macroevolutionary Patterns of Covariation, Convergences, and Morphogenetic Models

Segmentation

Although no genetic experiments can be performed on fossils, strong inferences on genetic mechanisms underlying development in extinct taxa can be made (Luo et al. 2007; Schmid and Sánchez-Villagra 2010; Schmid 2012). For example, the segmental morphology of extinct taxa can be studied by making comparison with patterns of morphological expression for regulatory genes in extant arthropods (Hughes et al. 2006; Hughes 2007) or vertebrates (Müller et al. 2010).

The consideration of developmental genetic changes in extant taxa can serve to understand morphological variation in extinct ones (Schmid 2012; Thewissen et al. 2012). In particular, extant mutants can serve as models to infer mechanisms that may have been responsible for morphological diversity in fossil lineages. For example, the molecular underpinning of the morphological diversification of the Triassic basal actinopterygian fish Saurichthys was inferred by Schmid and Sánchez-Villagra (2010) on these grounds. Originating from an ancestor covered entirely by uniform rhomboid scales with numerous, highly segmented fin rays, Saurichthys radiated into species diagnosed by different degrees of loss in rays, scales, and dermal bones (Romano et al. 2012). These changes are analogous to those reported in mutants of different extant species, such as zebrafish, sticklebacks, and medaka. Schmid and Sánchez-Villagra (2010) suggested one of two alternatives: (1) a mutation or a regulatory change of a signaling pathway, in which either the fibroblast growth factor pathway was affected, assuming that a gene duplication had occurred, for example in the closely related acipenserids or in teleosts; (2) the ectodysplasin pathway was involved, assuming that its pleiotropic effects led to viable morphological diversification.

The Tetrapod Limb

The vertebrate limb is one classic example illustrating the successful integration of developmental and evolutionary studies. Based on the consideration of living species alone, one could wrongly conclude that the last common ancestor of tetrapod vertebrates (land vertebrates) must have had five fingers and five toes and that evolutionary “innovations” have largely involved reductions in the number of digits. But stem tetrapods that lived between 385 and 360 million years ago possessed more than five fingers (Coates and Clack 1990). Polydactyly in the stem tetrapods would not have been predicted based on the study of the crown group alone (Shubin et al. 2009); fossil discoveries reshaped our knowledge of ancestral limb morphology. Furthermore, it has inspired the examination of phenotypes of extant species that are suggestive of polydactyly, such as that of the pseudo-thumb of some anurans (Tokita and Iwai 2010) and talpid moles (Mitgutsch et al. 2012).

A recent mathematical model simulating the behavior of chondrogenic cells in the limb was capable of reproducing skeletal morphologies recorded in stem tetrapods (Zhu et al. 2010). This study is a significant achievement in that it brings together information on the regulatory network known for limb development in recent species, mathematical modeling, and information provided by fossils. The core mechanism for chondrogenesis consists of an activation subnetwork, an inhibition subnetwork, as well as adhesive and extracellular matrix molecules that promote pre-cartilage condensations. In conjunction with a gradient of a growth factor emanating from the apical ectodermal ridge situated at the distal part of the limb bud, the model accounts for the proximo-distal and antero-posterior layout of cartilaginous primordia in the chicken limb, as well as distal truncations and other patterns resembling mutational and experimental variants in various species. The model is able to reproduce the patterns of limbs observed in stem tetrapods like the transitional forms between fishes and amphibians or the secondarily marine ichthyosaurs. These morphologies result from variation in the kinetic parameters of topologically conserved networks acting in distinctly shaped limb buds. The model shows that the limb regulatory system is endowed with generic properties (Urdy 2012). Major features of normal, experimentally manipulated, genetically aberrant and evolutionary transitional limb forms emerge from the inherent self-organizing properties of this core mechanism. The model’s predictive power shows that the mechanism acting in the present, especially the network topology, can be used to explain the early evolution of the tetrapod limb. Reciprocally, information gleaned from the morphology of fossils provides an independent test for the relative success of morphogenetic models.

Molar Proportions

Much progress has been made in recent years in understanding the mechanisms of tooth development. For instance, a recent model, integrating gene networks and tissue mechanics, suggests that despite the complexity of development and teeth, there may be a simple basis for variation (Salazar-Ciudad and Jernvall 2010). These authors argued that changes in single parameters regulating signaling may underlie variation among individuals of ringed seals, whereas changes in the parameters regulating the growth of the epithelium may underlie tooth-to-tooth variation along the jaw. The model also generates 3D patterns of gene expression, changing over the course of development, as well as 3D morphologies that can be compared to real teeth using morphometric methods. Moreover, proportions of molars can be easily connected to experimentally developed models that predict a dynamic balance between inter-molar inhibition and mesenchymal activation, due to the sequential initiation of molar buds along the jaw (Kavanagh et al. 2007). It has been proposed that the second molar always makes up one-third of the total molar area. This simple coupling of proportions with developmental mechanisms makes the examination of fossils in a developmental perspective possible (Polly 2007; Renvoisé et al. 2009; Sánchez-Villagra 2010). Whereas this rule has been shown to hold among murine rodent species with various diets (Kavanagh et al. 2007), exceptions have been described in bears, horses (Polly 2007), voles (Renvoisé et al. 2009), and some extinct clades of South American “meridiungulates” (Wilson et al. 2012). Thus, exceptions to this simple rule point to slightly different mechanisms acting along the jaw, related to the increase or decrease of the inhibition of one molar relative to the next to form. Such comparisons have led to the discovery that in the Cenozoic radiation of now extinct ungulates from South America, novel molar proportions evolved that were outside those recorded among living species (Wilson et al. 2012). These novel proportions are related to abrasive diets and testify to the developmental plasticity under ecological conditions (Wilson et al. 2012). The abrasiveness of the foodstuff is related with the ashes produced by volcanic activity, a major geological feature in the South American Cenozoic (Williams and Kay 2001).

Evolutionary Recurrent Patterns in Mollusks

Thompson (1917) inspired a powerful approach to the study of biological development, which consists in unraveling the “laws of form” describing how developmental systems generate and constrain the variation of biological forms over short and long time scales. One of his favorite examples concerned the shell shape of mollusks, which generally conforms well to the logarithmic spiral. Mollusks are indeed well suited to address issues of developmental palaeontology, as they have an excellent fossil and living record and their accretionary mode of growth preserves shell ontogeny. The mantle, a soft, thin elastic tissue, secretes the shell. The shape of shell increments is equivalent to that of the mantle edge poking out of the aperture at the time of shell growth. However, the study of evolutionary changes occurring in fossil molluscan lineages relies nearly exclusively on the interpretation of shell morphologies, and the evolution of the molluscan shell is characterized by frequent convergences in form and ornamentation. Important taxonomic features of mollusks include the shape of the aperture, the degree of coiling of the shell tube, the ornamentation (ribs, tubercles, spines, keels) and growth features (growth halts, constrictions, varices).

The comparison of shell shape between and within different clades of mollusks can be informative with regards to the basic rules of accretionary growth. Common rules of growth could underlie the morphogenesis of the shell and its evolution in ammonoids and gastropods (Bucher 1997; Bucher and Guex 1990; Bucher et al. 1996; Checa and Jimenez-Jimenez 1997; Checa et al. 1998, 2002). Evidences come from the comparison of intraspecific and/or interspecific patterns of covariation among shell characters (Westermann 1966; Morita 1991a, b, 2003; Dagys and Weitschat 1993; Checa et al. 1996; Hammer and Bucher 2005a), from the description of changes occurring at maturity in different species or clades (Thompson 1917; Burnaby 1966; Bucher 1997; Chirat et al. 2008), and from the analysis of teratological shells in response to injuries (Thompson 1917; Guex 1967, 1968; Bayer 1970; Landman and Waage 1986; Bond and Saunders 1989; Hammer and Bucher 2005b) or to change in living conditions (Linsley 1977; Checa and Jimenez-Jimenez 1997; Checa et al. 2002).

It has been often observed that in variable ammonoid species, there is a tendency for the aperture shape to covary with the degree of whorl overlap and the robustness of ornamentation. The most ornamented specimens tend to exhibit a depressed aperture and small whorl overlap, whereas smooth specimens exhibit a laterally compressed aperture and a large whorl overlap (Fig. 4). These patterns of intraspecific variation, known as Buckman’s first law of covariation, have been observed in Triassic boreal ammonoids (Rieber 1972; Dagys and Weitschat 1993; Checa et al. 1996; Bucher 1997; Dagys et al. 1999; Hammer and Bucher 2005a; Monnet and Bucher 2005), and in Jurassic (Westermann 1966) and Cretaceous ammonites (Kennedy and Cobban 1976). As these patterns of covariation have been observed in phylogenetically distant ammonoids at several different time periods, they are evolutionary recurrent. Hammer and Bucher (2005a) proposed that the negative correlation between the compression of the aperture and the intensity of ornamentation can be satisfactorily accounted for by assuming that lateral rib heights increase isometrically with aperture width, whereas ventral rib heights increase isometrically with aperture height. Simple scaling relationships lead to produce proportionally stronger lateral ribs on depressed specimens than on compressed specimens, which only exhibit strong ribs on venter (Fig. 4). These observations indicate that molluscan shell shape variation is remarkably structured. Some studies highlighted the generic rules underlying the morphogenesis of the molluscan shell, using either geometrical (Thompson 1917; Raup 1961; Raup and Michelson 1965; Okamoto 1988a, b; Illert 1990; Savazzi 1990; Rice 1998; Ubukata 2003; Hammer and Bucher 2005b; Urdy et al. 2010a, b), mechanical (Morita 1991a, b; Morita 1993; Vermeij 2002; Hammer and Bucher 2005a), or chemical models (Hammer and Bucher 1999; Guex et al. 2003). Some other studies laid emphasis on the role of life orientation in the determination of growth direction (Linsley 1977, 1978; Checa and Jimenez-Jimenez 1997; Checa et al. 2002). Others suggested that the preceding whorl played a role in the regulation of coiling (Hutchinson 1989; Checa et al. 1998; Morita 2003).

Fig. 4
figure 4

Variation in an assemblage of juveniles of Amaltheus margaritatus from Jurassic, illustrating the so-called Buckman’s laws of covariation, which state that the coarseness of ribs tends to correlate negatively with the degree of aperture compression (ratio of whorl height against whorl width), whorl overlap (ratio of umbilical diameter against shell diameter) and spacing between septa. In this assemblage, the most robust variants (top) display relatively few strong ribs, a small whorl overlap and widely spaced septa, whereas compressed variants (bottom) exhibit numerous faint ribs, a high whorl overlap and closely spaced septa. During ontogenesis, specimens tend to display relatively narrower apertures, increased whorl overlap and fainted ornamentation, so that the most extreme variation is observed in the juveniles samples. Photographs by Noel Podevigne (Université Claude Bernard Lyon (UCBL), Lyon, France)

Indeed, teratological shells, including fossil ones, often provide a useful source of information about the way development generally proceeds. For instance, planispiral ammonites that were infested by epizoans during their lifetime exhibit alterations of their coiling geometry (Checa et al. 2002). These authors pointed out that, most commonly, the epizoans settled on the venter of ammonoids, and constituted an obstacle to the subsequent growth. This disturbance probably initiated changes in the hydrostatic conditions of the ammonite and caused a lateral shifting of the growth direction compared to the previous whorl in attempts to avoid the obstacle. Using a hydrostatic model, Checa et al. showed that the shell tube should periodically cross the venter, thus leading to zigzag coiling, if the ammonite tried to maintain the growth direction perpendicular to the substrate (sea bottom). If the epizoan was positioned on the midventer, the whorl could be detached from the previous whorl. Under constant growth direction relative to the substrate, a lateral placement of the epizoan would rather result in trochospiral coiling (Fig. 5), especially if the epizoan had a certain non-negligible weight, which could cause the tilting of the ammonite.

Fig. 5
figure 5

Example of teratological shells found in an assemblage of Aplococeras vogdesi from Triassic, Nevada. First row a regular planispiral specimen. Second and third rows two specimens exhibiting trochospiral coiling. No scars are visible and the trochospiral coiling appears in the early growth stages. The settlement of an epizoan on the left side of these specimens could have induced the tilting of the ammonite because of changes in the hydrostatic condition, especially if the weight of the epizoan was comparable to the weight of the ammonite. A trochospiral coiling is subsequently produced, if a constant growth direction with respect to the sea floor is maintained. Photographs by Rosi Roth (Paläontologisches Institut und Museum der Universität Zürich (PIMUZ), Zürich, Switzerland)

A similar role for life orientation in determining the growth direction has been experimentally tested in gastropods. In the benthic freshwater Planorbidae (Gastropoda), specimens experimentally altered by extra weights on one side of the shell revealed that the growth direction remained perpendicular to the substrate (Checa and Jimenez-Jimenez 1997). Similarly, the benthic prosobranch gastropods exhibiting a tangential aperture with regards to the coiling axis have been shown to live with the aperture parallel to the substrate (Linsley 1977). These gastropods have the ability to regulate the amount of torsion/detorsion of the foot to place the center of gravity of the shell and body over the midline of the cephalopodial mass, thus allowing the maintenance of a constant life orientation. A well-known example of the influence of change of mode of life on shell morphology is provided by the gastropod Distorsio, which, once settled on the substrate, displays distorted coiling. Well-known heteromorph ammonites like the late Cretaceous Nipponites were first viewed as a challenge to the simulation of their ontogeny and their evolution. However, Okamoto (1988b) derived a mathematical model of the peculiar meandering whorls of Nipponites. Assuming neutral buoyancy and a constant aperture angle relative to sea bottom, Okamoto‘s model showed that the meandering whorls morphology is controlled, under hydrostatic constraints, by the permitted range of variation in the growth direction relative to the sea bottom. The simulations suggest that the morphological transition from a simple helicoidal form to a meandering form occurs abruptly without any intermediary form as a result of a minor change in the upper and lower limits of variation in the growth direction.

In ammonoids, regenerated shells after damage are often found (Guex 1967, 1968; Bayer 1970; Landman and Waage 1986; Bond and Saunders 1989; Hammer and Bucher 2005b). Particularly, some changes in the ornamental features have been described in response to the location of injuries reaching the mantle (Guex 1967, 1968; Bayer 1970; Hammer and Bucher 2005b). For example, some shells with a ventral keel associated with ribs on the flanks can lose their keel in response to a wound located on the venter. Then, the ribs in the post-damaged shell cross the venter, whereas before they were interrupted by the keel. Some other shells bearing bifurcating ribs on the venter rather construct simple ribs after being damaged on one side. These examples are described in terms of “ornamental compensation” (Guex 1967, 1968). This phenomenon can be seen as a generic outcome of modes of shell growth, whether one interprets such results in terms of reaction–diffusion (Guex et al. 2003; Hammer and Bucher 2005b), or mechanical effects (Hammer and Bucher 2005a).

But probably the most famous work on fossil mollusks stems from Raup’s seminal (Raup 1961) paper, which triggered the emergence of theoretical morphology. Raup’s morphospace, particularly convenient to compare planispiral ammonites, has been extensively used to record changes in the patterns of morphological diversity of ammonites during the Mesozoic (Raup 1961, 1966; Raup and Michelson 1965; McGhee 1999). Since then, geometrical models progressively shifted from shape to growth description, by considering the timing of growth processes (Rice 1998; Urdy et al. 2010a, b). Such studies highlighted the role of growth rates and timing in the generation of allometries and phenotypic plasticity, providing a theoretical link between variation at the ontogenetic, population, and species levels (Urdy et al. 2010a, b). These approaches are expected to facilitate the comparison of theoretical and empirical data in the future, and to help interpreting mollusk fossil morphologies in a developmental, ecological, and evolutionary context.

Conclusions

Several examples show how paleontology can benefit from, but also contribute to the current conceptual and empirical advances to understand the evolution of development and constraints. Paleontological data can address directly mostly late aspects of ontogeny—a rich subject of study involving the origin of morphological diversity. Indirect information on developmental patterns can also be gathered from fossils. Even with a uniformitarian approach in which only mechanisms known from living organisms are used to make predictions about past phenomena, new discoveries about developmental evolution have been made. The fossil record provides numerous examples of extinct morphologies and developmental patterns. Given that 99.9 % of species are now extinct, neglecting the fossil species to construct morphospaces and infer developmental constraints may be highly misleading. Thus, the conceptual and empirical studies of development in a paleontological context constitute a significant contribution to the ongoing expansion of evolutionary theory, with the incorporation of EvoDevo themes not considered in the MS.