Introduction

Lineages of functionally equivalent forms are generally expected to evolve along allometric trajectories, at least within size ranges that can accommodate the requisite allometric transformations (Huxley 1932; Gould 1975; Sweet 1980; Strauss 1985; Marroig and Cheverud 2005). Tree squirrels would seem to be one of those functionally uniform groups, but some studies suggest that evolution of jaw morphology in tree squirrels is not allometric but rather is isometric over most of their size range (Ball and Roth 1995; Thorington and Darrow 1996; Velhagen and Roth 1997). Most of the exceptions to the general isometry are at the extremes of the size range: pygmy squirrels (<80 g), and giant squirrels (>2000 g). Within the range of more typical species (100–1200 g), tree squirrels have a reputation for being morphologically uniform relative to other rodents, a uniformity that includes the anatomical position of jaw muscle attachments and the proportions of input and output lever arm lengths. The explanation for this uniformity might be particular to jaws or particular to squirrels.

Jaws of all animals may maintain isometric scaling of linear dimensions for purely mechanical reasons; allometric scaling is necessary only when functional features of different dimensionality must keep pace with each other. In one classic example of allometry, limb cross sections (areas) must keep pace with the mass they support (a function of volume); therefore the square root of area must increase faster than the cube root of mass. Jaw function is not so directly tied to the mass of the body; the jaws may be provisioning a larger body, but the requirement to eat more food does not necessarily translate to a requirement for larger foods. Carnivores can reduce the costs of hunting by eating larger prey, but herbivores derive no particular benefit from eating larger leaves. More important, the performance of jaws as levers for exerting force on food items may be determined by the ratios of just a few lever arm lengths (Davis 1955; Turnbull 1970; Bramble 1978; Greaves 1982). Because the lengths that determine these ratios all have the same dimensionality, they should be expected to scale isometrically with respect to each other to maintain functional relationships such as their mechanical advantages. In fact, ratios of jaw lever arm lengths have been shown to be quite stable in several mammalian lineages that are ecologically and functionally distinct from tree squirrels (Radinsky 1982, 1985; Greaves 2000). Stability of these ratios produces a reliable correspondence between muscle force production and bite force, and thus a fairly direct link between a selective input (selection on bite force) and an evolutionary output (change in muscle mass). This could, in turn, permit diversification by size stratification and niche partitioning, as Radinsky (1982) suggested for mammalian carnivores. In addition, stability of these few ratios, by insuring a reliable response to selection on bite force, may free other aspects of jaw morphology to evolve independently of that same selective pressure, perhaps in response to other functional demands. Thus, conservation of lever arm ratios need not extend to other features of jaw shape.

In tree squirrels, uniformity of jaw lever arms may actually be associated with a general uniformity of jaw shape. In fact, tree squirrels appear to be highly conservative in nearly all their anatomical features, not just their jaws (Bryant 1945; Moore 1959; Black 1963; Emry and Thorington 1982). Only such major functional and ecological shifts as the evolution of ground squirrels or flying squirrels have led to notable qualitative anatomical changes. The close similarity between extant species of Sciurus and forms found in the early Oligocene (<30 mya) has earned Sciurus, and tree squirrels in general, a reputation as living fossils (Emry and Thorington 1984). Such strong conservatism bespeaks a highly constrained morphology, one that might resist even the pressures of size scaling. The source of that conservatism may lie in the extraordinary functional demands of arboreality and durophagy. Durophagy places a premium on the ability to generate large bite forces to fracture hard foods (Werdelin 1989; Binder and Van Valkenburgh 2000; Herrel et al. 2002; Dumont et al. 2009). Selection for rapid arboreal locomotion would not only favor eyes positioned to capture a complex three-dimensional visual scene, but also a relatively large brain that can quickly process complex spatial data and manage the correspondingly complex locomotor demands (Lemen 1980; Eisenberg and Wilson 1981). Because the bones that encompass the brain and eyes are also the attachment surfaces for the jaw muscles, the skull may have a limited ability to accommodate these two sets of demands (Dullemeijer 1958; Cartmill 1980; Roth 1996). Limitations on skull morphology would entail corresponding limitations on the lower jaw, which must maintain effective occlusion.

In this study, we examine evolution of jaw morphology in relation to jaw size in sciurine tree squirrels (Sciurini Fischer de Waldheim 1817), which encompasses nearly all New World tree squirrel species and a few Old World species (Steppan et al. 2004; Thorington and Hoffmann 2005). Recent comprehensive molecular analyses have demonstrated that this is a monophyletic group, corroborated several key branch points and established their ages (Mercer and Roth 2003; Steppan et al. 2004), providing a framework for phylogenetic analyses of jaw size and shape. We described jaw morphology using both geometric shape variables and lever arm lengths comparable to those used in previous studies (Thorington and Darrow 1996; Velhagen and Roth 1997). Comparison of these two data sets allowed us to corroborate the stability of the lever arm ratios, and evaluate the diversity of shapes within that constraint.

Methods

Data Acquisition

All specimens (Appendix Table 7) were photographed in a standardized orientation to minimize variation in orientation as a source of measurement error. In addition, all specimens were photographed at distances approximately proportional to their sizes, so that they occupied no more than the central third of the field of view, minimizing the effect of parallax on apparent shape and maintaining a depth of field greater than the distance between the medial and lateral corners of the angular process. A ruler was included in all images for calculation of lengths and centroid size.

Twelve landmarks and 75 semilandmarks were digitized on each image using tpsDig (Fig. 1a). Procrustes superimposition to remove non-shape differences between configurations was performed in TpsRelw; the bending energy optimization was used to slide semilandmarks between the endpoints of their respective curves and remove effects of arbitrary placement of semilandmarks on curves. Centroid sizes of the jaws were also computed from these configurations. Mathematical details of these and related methods can be found in Marcus et al. (1996), Zelditch et al. (2004), and references therein.

Fig. 1
figure 1

Data collection schemes shown for a representative specimen of Sciurus arizonensis. a Landmarks and semilandmarks. Numbered circles indicate locations of landmarks. Dashed curves indicate marginal edges sampled at equal intervals. The numbers of semilandmarks for each curve are: a-9, b-9, c-13, d-18, e-15, f-15. b Lever arm lengths, measured as line distances from the posterior of the condyle to the following points: T1––tip of coronoid process, proximal end of temporalis insertion; T2––base of coronoid process, distal end of temporalis insertion; SM1––posterolateral corner of angular process, posterior end of superficial masseter insertion; SM2––medioventral corner of angular process, ventral end of superficial masseter insertion; AM––anterior end of the anterior deep masseter insertion, INC––tip of the incisor, MOL––distal end of the first molar

Seven linear distances from the posterior of the condyle (landmark 10, Fig. 1b), representing lever arm lengths pertinent to jaw closing mechanics, were also calculated for each specimen. Mechanical advantages (input arm length/output arm length) were computed for all 10 combinations of input and output arms. To compute two of the lever arms (INC and SM2), we digitized two points that were not included in the landmark set for reasons discussed below. This set of lengths was modeled after those used in previous analyses of squirrel jaws (Thorington and Darrow 1996; Velhagen and Roth 1997). Lengths were computed in tmorphgen after format conversion using CoordGen.

The incisor tip point was not included in the landmark set because it exhibits severe seasonal wear in some populations; some individuals have incisors worn to within a few millimeters of the alveolus, which would strongly influence the superimposition of those specimens and the estimates of mean size and shape in those species. Excluding the incisor tip point from the landmark set allowed inclusion of these specimens in the analysis of mean shape and centroid size. To facilitate comparisons between this study and previous work, incisor output arm was included in the analysis of lever arm lengths, and those specimens with extremely worn incisors were excluded from the lever arm data set.

The point at the distal end of SM2 represents the most medial point of the angular process, which bends first medially, then laterally in sciurids. This point is often, but not always, the most ventral point on the angular process. From a mechanical perspective, its importance lies in the fact that it represents the part of the superficial masseter insertion that is farthest from the condyle and has the highest mechanical advantage. Although the point is easily recognized on specimens in hand, and in properly lit 2D images, it is not a corner, but an extreme point on a broad curve in most of the taxa in this study. Thus we do not regard it as landmark in the strictest sense––as a discrete, anatomically defined point location that is comparable across specimens. Accordingly, we included SM2 in the lever arm lengths, but omitted its end point from the landmark set (however, the contribution of this projection to the shape of the angular process is captured by the semilandmarks following the ventral edge of that process).

Analysis

Canonical variates analysis (CVAGen) was used to determine whether mean jaw shapes differed among species. Principal components analysis (PCAGen) was performed on all 209 individuals, to determine the number of axes needed to account for at least 90% of the shape variation, then CVA was performed on the scores for this subset of PCs. Classifications based on CV scores were evaluated by a jackknife procedure, treating each specimen as an unknown and comparing its classification based on CVA of the remaining specimens to its original species assignment. The misclassifications produced by this procedure identify which species have overlapping distributions that cannot be separated by CVA.

Pairwise distances between mean shapes (tpsSmall) were generated to identify the most similar species. Goodall’s F test was performed in TwoGroup to determine if the means of those species were significantly different.

All other analyses of both data sets (coordinates and lengths), were based on the means computed for each species. Interspecific regressions and correlations were evaluated for both the observed species means and the phylogenetically independent contrasts between sister taxa (Felsenstein 1985), using MorphoJ for shape variables and the PDAP module in Mesquite for length measurements.

Phylogeny

Hypothesized phylogenetic relationships (Fig. 2a) were based on several sources. The starred nodes and their dates are based on molecular analyses by Mercer and Roth (2003), which are calibrated by estimated ages of key fossils and ages of geologic events that would have separated descendant lineages. The ages of these nodes and the compositions of the descendant lineages are corroborated by Steppan et al. (2004). The age of the separation of the two Tamiasciurus species from each other is within the range delimited by Arbogast et al. (2001).

Fig. 2
figure 2

Phylogenetic hypotheses used to infer historical transformations and phylogenetically independent contrasts. a Branching pattern based on published analyses. Starred nodes are inferred from Mercer and Roth 2003. b Random rearrangement of poorly supported groups. Scale is 106 years

Treatment of Sciurus vulgaris and Sciurus lis as separate species is supported by Oshida and Masuda (2000). They also estimate the ages of the split between Tamiasciurus and Sciurus and that between Old World Sciurus (represented by S. vulgaris and S. lis) and New World Sciurus (represented by all other Sciurus in this study). Their estimates for these earlier events are much older than those given by Mercer and Roth, and as noted by Oshida and Masuda, greatly predate the geological events that most likely would have divided the ancestral populations. Accordingly, we have relied on Mercer and Roth for the ages of the earlier events and rescaled all time estimates from Oshida and Masuda to be consistent with the Mercer and Roth estimates. The rescaled estimate for the divergence of S. vulgaris and S. lis is used here.

Within the New World Sciurus, anatomical similarities and biogeographic relationships (Nitikman 1985; McGrath 1987; Hoffmeister and Hoffmeister 1991; Best 1995a, b, c; Best and Riedel 1995; Thorington and Hoffmann 2005; Villalobos and Cervantes-Reza 2007) provide support for three clades: (A) S. niger and the sister group that includes S. arizonensis and S. nayaritensis, (B) the S. aureogaster group that includes S. colliaei, S. variegatoides and S. yucatanensis, and (C) S. deppei and its sister group composed of all the South American species (including the polyphyletic Microsciurus). Biochemical differences among North American species are small and not consistently nested among taxa, suggesting a recent radiation within a narrow time interval. Because more precise dating is not available, divergence events along the main stem were approximately evenly spaced within this interval. Similarly, branching events for South American taxa that were not analyzed by Mercer and Roth are evenly spaced between dates established for the taxa that they did include. The resulting tree (Fig. 2a) was used in computation of independent contrasts and in estimation of hypothetical ancestral shapes. To guard against making inferences that were dependent on poorly supported estimates of branching patterns and divergence times, analyses were repeated using a second, partially randomized tree (Fig. 2b). In this second tree, the strongly supported nodes form the backbone and a random number generator determined the sequence of branching events for the remaining taxa. The ages of the randomized branching events were evenly distributed between the limits defined by the strongly supported nodes.

Software

Tps software (tpsDig, tpsRelw, tpsSmall, tpsTree) is available from F. J. Rohlf at http://life.bio.sunysb.edu/morph. IMP software (Coordgen, CVAGen, PCAGen, tmorphgen, DisparityBox, TwoGroup) is available from H. D. Sheets at http://www2.canisius.edu/~sheets/morphsoft.html. Independent contrasts regressions were performed in Mesquite (W. Maddison and D. Maddison) using the PDAP module (P. E. Midford et al.) for lever arm lengths, and in MorphoJ (C. P. Klingenberg) for shape variables. All other calculations were performed in Excel.

Results

Comparison of Lever Arm Lengths

Length of the molar output arm scales isometrically relative to the incisor output arm: the two lengths are highly correlated and the slope is also not significantly different from 1.0 (Table 1). Regression of most input lever arms lengths on the incisor output lever arm produced similar results. The exceptions were a relatively low r 2 for T1 (posterior temporalis), and a marginally significant negative allometry for T2 (anterior temporalis). The deviation from isometry for T2 appears to reflect the strength of the correlation (r 2 = 0.991) as much as the shallowness of the slope (0.955). Regressions of the muscle input arms on the molar output arm were comparable.

Table 1 Regression of molar output arm length and all muscle input arm lengths on incisor output arm length

The tight correlations among the lever arm lengths resulted in very low diversity for the mechanical advantages of the input arms (lengths scaled by output arm length, Table 2): the variances in species means are <0.2% of their respective grand means. For comparison, the diversity of jaw centroid size is 200 times greater. PCA using the five scaled lengths was used to obtain Euclidean distances of species means from the grand mean. These distances also demonstrated very low diversity with an average of 0.026, and variance (disparity sensu Foote 1993) of 8.29 × 10−5, about 0.3% of the mean.

Table 2 Average mechanical advantages for each species, and the mean for all species

Regression of the lever arm lengths on jaw centroid size also found extremely high correlations (Table 3). All lengths except the longer superficial masseter arm (SM2) exhibit a slight but statistically significant negative allometry relative to jaw centroid size. This result suggests that some linear dimensions of the jaw are changing length relatively faster than those represented by the lever arm lengths. Because the dorso-ventral thickness of the ramus and diastema is not well represented by the lever arm lengths, we infer that this dimension has a slight positive allometry relative to the measured lengths. This pattern would be consistent with production of larger forces in larger animals.

Table 3 Regression of lever arm lengths on jaw centroid size

The plots of muscle input arm lengths against incisor output arm length demonstrate that the smallest species in this study do not deviate from the isometry exhibited by larger species (Fig. 3). The smallest species here are the two Microsciurus species, which have body weights around 80 g. Although they are distinctly larger than most other dwarf and pygmy squirrels, they are still much smaller than the next smallest species in this study, 200 g Tamiasciurus. Even so, it is the Tamiasciurus species that have much shorter than expected values for T1. Several other species have comparably low values for T1, suggesting the value in Tamiasciurus does not reflect a size related trend, but the generally greater variability of this length. The largest species in this study, S. spadiceus, is one of the species with a relatively short T1, and it also has shorter than expected lengths for SM1 and SM2. This combination of deviations from the general pattern of relative lever arm lengths may indicate an important functional or ecological shift in the evolution of S. spadiceus.

Fig. 3
figure 3

Relationships of muscle input arm lengths to incisor output arm length

Use of independent contrasts in the regressions did not lead to substantially different conclusions. One difference from the conventional regressions is that both temporalis arms have slightly negative allometries relative to incisor arm length (Table 4); the other is that SM1 has a slight negative allometry relative to centroid size (Table 5). Randomizing the weakly supported branches did not substantially affect the conclusions, which is not surprising given the high r 2 values for the conventional regressions. The rearrangement did alter the specific combinations of large and small species in the South American lineages, and the number of nodes with distinctly large contrasts, but the change in tree topology did not change the inferences of isometry or slightly negative allometry.

Table 4 Regression results using independent contrasts for input lever arm lengths and incisor output arm length
Table 5 Regression results using independent contrasts for lever arm lengths and jaw size

All plots of the independent contrast scores indicate that there are three nodes with large contrasts in centroid size and in all seven lever arm lengths (Fig. 4). All three contrasts compare a lineage that contains a Microsciurus species and a lineage that does not. At these few nodes, size divergence is most dramatic and the divergences in lever arm lengths are correspondingly large. These contrasts might be expected to exert disproportionate influences on the analyses of correlations; however, the smaller contrasts also tend to be close to the predicted values, especially for the output lever arms. Only for T1 does it appear doubtful that there would be a significant correlation without the three largest contrasts.

Fig. 4
figure 4

Relationships of lever arm lengths to centroid size, inferred from independent contrasts. Contrasts were standardized using the branching pattern and branch lengths shown in Fig. 2a

Comparison of Shapes

Regression of Shape on Size

Regression of mean shape on mean size accounted for only 22% of the diversity of jaw shape, but was highly significant; in 10,000 permutations, none of the other data sets had a regression that accounted for a higher fraction of the shape variation (P 10,000 = 0.0001). The axis of shape change that is correlated with size change predicts that smaller jaws will have a relatively long condyloid process with a proportionately large condyle, a slightly reduced coronoid process, a more posterior and relatively square angular process, relatively short and more sharply curved diastema, and a masseteric fossa that extends more anteriorly under the cheek teeth (Fig. 5a). The converse will be true of larger jaw sizes (Fig. 5b).

Fig. 5
figure 5

Correlation of jaw shape and size. a Predicted pattern of shape change as size decreases, b predicted pattern as size increases, c biplot of shape scores versus CS. Shape changes are shown as deformation grids, with arrows indicating relative landmark displacements. Line is fit to all taxa except Microsciurus

The biplot of scores on this shape axis against centroid size (Fig. 5c) has a much more elliptical distribution than might be expected for a regression that explains <1/4 of the variation in the dependent variable. However, the variable plotted in the graph, and illustrated by the deformations, does not represent all of the shape variation in the data set. It only represents the trajectory through the shapes that is most highly correlated with size. The positions of the taxa along this trajectory are not perfectly correlated with size, only 60% of the variation in scores is explained by variation in size. That fraction of the variation in scores represents 22% of the variation in shape.

The distribution of scores on the size axis indicates that both Microsciurus lie far outside the range of shapes observed for the other taxa; i.e., they have much more change along this trajectory than do any other squirrels in the study. The larger of the two, M. flaviventer, also has much more change along this trajectory than is predicted for its size. In addition, a line fit only to the scores of the other species predicts a much smaller shape deviation than is observed for squirrels in the Microsciurus size range. Thus, the inferred regression may be strongly influenced by the unusual size and shape of the jaw in the two Microsciurus.

When Microsciurus were excluded from the data set, the picture of shape change associated with size is, in fact, quite different (Fig. 6). Based on this regression, small jaws are expected to have a condyloid process that is relatively thin, but not relatively long, with no change in the relative size of the condyle. The diastema and incisor are expected to be thinner than in large jaws, but less so than when Microsciurus were included in the data. Also, when Microsciurus are excluded, the inferred allometry does not include extension of the masseteric fossa under the molars in small squirrels, and the angular process does not change position with size. There is a thinning of the angular process as the curve between angular and condyloid processes is deeper in smaller squirrels. In addition, the tip of the coronoid process is extended posteriorly, giving it a more hooked shape in smaller squirrels. More important than the difference between the two patterns is the smaller amount of shape diversity that can attributed to allometry: although still highly significant (P 10,000 = 0.0185), this correlation accounts for only 11% of the shape variation in these taxa. Not only does this indicate that Microsciurus did unduly influence the previous analysis, it also indicates that the distributions of jaw size and shape are largely incongruent in the other taxa.

Fig. 6
figure 6

Correlation of jaw shape and size when Microsciurus are excluded. a Predicted pattern of shape change as size decreases, b predicted pattern as size increases, c biplot of shape scores versus CS

Regression using independent contrasts does not lead to substantially different conclusions. The axis of shape difference that was found to be associated with size differences predicts larger jaws will have a relatively deeper ramus and diastema, more elongate and posteriorly displaced coronoid process, relatively smaller condyle and shorter condylar process, and a slightly elongate and proportionately shallower angular process when size increases (Fig. 7a). However the magnitude of this expected shape change is minute even in the largest shape contrast, that between the clade containing the two large South American species, S. spadiceus and S. igniventer, and the clade containing all the other South American species. Most of the 22 contrasts have smaller size differences (Fig. 7b): 19 (86%) have size differences <1/2 the magnitude of the largest difference and 16 (72%) have size differences that are <1/4 of the largest. These small size contrasts often have magnitudes of shape change along this axis that are several times larger or smaller than predicted, and some have the reverse direction of shape change. Consequently, the regression accounts for only 25% of the variation in shape contrasts. The regression is still very significant (P 10,000 = 0.0001), but this can be explained by the fact that it connects a small number of contrasts with very large size differences. Even in these large contrasts, the amounts of shape difference represented by their scores on this axis comprise a small fraction of the total shape difference inferred for those nodes. Consequently, the implication of the independent contrast result is that there is no meaningful relationship between the evolution of jaw size and the evolution of jaw shape.

Fig. 7
figure 7

Correlation of jaw shape and size inferred from independent contrasts. a Predicted pattern of shape change as size increases, exaggerated by a factor of 10 to enhance visibility, b biplot of contrasts for shape scores and CS. OW––Old World, NW––New World, NA––North American, SA––South American. “Sciurus, NA versus SA” designates the contrast between the South American clade, including Microsciurus, and its sister group among the North American species

Significance of Differences Among Mean Shapes

Although the regression results indicate that size explains a small fraction of the shape variation, the possibility remains that the shape differences are too small to reject the hypothesis of isometry (invariance of shape). To address this issue, CVA was performed on all 209 individual specimens to test whether species differed in mean shape. Because the shape data comprise 95 dimensions, PCA was used to reduce the dimensionality of the data. The first 15 axes account for only 90% of the shape variation; but the remaining axes each account for less that 1.0% of variation. Accordingly, the CVA was performed using the scores on the first 15 PC axes. Significant interspecific differences (P < 0.0009) were found on the first 11 canonical variates.

Plots of the first three canonical variates (Fig. 8), show that the Microsciurus and most of the South American Sciurus have distinct shapes; the only species in this lineage that cannot be distinguished from North American Sciurus is S. stramineus. Results of the jackknife classification test (Table 6) are comparable. There are no misclassifications between Microsciurus and Sciurus, and none between North American taxa and three of the South American Sciurus (S. spadiceus, S. igniventris and S. ignitus). For the other two South American Sciurus, three specimens of S. granatensis and all S. stramineus were misclassified as North American species.

Fig. 8
figure 8

Scores on first three CVA axes for all individuals used to compute mean shapes for each species

Table 6 Jackknife classification rates using canonical variates scores

The CVA also indicates that Tamiasciurus are strongly differentiated from Sciurus, and that the Old World species (S. vulgaris and S. lis) are distinct from the New World species. There is some overlap of Tamiasciurus with Sciurus on the first three axes, especially with S. aberti and the Old World species, but no Tamiasciurus was misclassified as a Sciurus. Of the Old World species, only S. lis was misclassified as a New World species (S. aberti or S. ignitus); misclassifications between S. lis and S. vulgaris were much more frequent.

Within the North American Sciurus, three species (S. aberti, S. deppei and S. variegatoides) are relatively easy to distinguish using scores on the first few CVs, but each of the others broadly overlaps with several species. Even so, the jackknife analysis indicates that in most of the North American species samples only two or three individuals are misclassified, usually one individual per overlapping species. The distribution of classification errors is consistent with close packing of species (many near neighbors), but only slight overlap between neighbors. The contrast between the slight overlaps inferred from the jackknife and the broader overlaps indicated on the CVA biplots indicates that differences among most species are not well aligned with the canonical variates, which are constrained to be mutually orthogonal.

To further validate the significance of differences between species means, Procrustes distances between mean shapes were computed for all species pairs, and the four pairs of species with the smallest distances (<0.024) were compared using Goodall’s F. Three of the pairs were composed of all combinations of S. aureogaster, S. alleni and S. yucatanensis; the other pair was S. nayaritensis and S. arizonensis. All four pairwise differences were highly significant (F > 2.0, df > 170, 2000 and P < 0.00001). Thus despite the partially overlapping distributions of CVA scores and higher misclassification rates, even these very similar North American species have statistically significantly differences in mean jaw shape.

Large shape differences may be functionally important as well as statistically significant. For example, S. spadiceus has large score on CV1 and differs from the mean shape in several features (Fig. 9a). The diastema is somewhat elongated relative to the tooth row and also greatly thickened. The incisors are also relatively thicker in this dimension. Both observations suggest large forces loading on the incisors. The coronoid process of S. spadiceus is also modified, extending farther posteriorly and having a more robust tip. The condylar process is reoriented, placing the condyle closer to the occlusal plane, and the angular process is shallower and slightly shorter. These changes suggest that the role of the temporalis muscle may be relatively more important in S. spadiceus than in other sciurines. It also indicates that there have been changes in the angles between input and output lever arms; this type of change is most apparent in the ventral shift of the condyle relative to both the tooth row and the tip of the coroniod.

Fig. 9
figure 9

Shape divergence from the sample grand mean by species with high scores on the first three CVs. a S. spadiceus––CV1, b S. variegatoides––CV2, c M. flaviventer––CV1 and CV2, d T. hudsonicus––CV3

S. spadiceus is not the only taxon with a substantial change in jaw shape. S. variegatoides has a relatively large score on CV2, and another combination of transformations to the coronoid, condyloid and angular processes that would tend to change the angles between the muscle lever arms. In M. flaviventer, with large scores on both CV1 and CV2, most features of the shape change suggest relatively reduced muscle loads: the diastema is relatively thin, the angular process is shorter and squarer and the condyloid process is long and relatively slender. However, there is also a forward extension of the masseteric fossa that may indicate there are increases in the anterior parts of the anterior deep masseter that compensate for decreases in more posterior components. Also, when with anterior extension of the fossa is combined with the posterior extension of the condyloid process, a substantial reorientation of the anterior masseter lever arm is also apparent. Similarly, T. hudsonicus, with a large score on PC3, exhibits yet another combination of changes in the shapes of the coronoid and condyloid processes with concomitant in lever arm angles, as well as smaller changes in the diastema and angular process.

The diversity of shapes illustrated by the CVA confirms that there is substantial amount of shape diversity that is independent of size. As shown in the Fig. 9, the magnitudes interspecific shape differences can be quite large. The average Procrustes distance of species means to the grand mean is 0.0354, a little less than half of the distance of S. spadiceus. The Foote disparity for shape is 0.0015 or 4% of the mean. Although this is small in comparison to the variation in size, it is still more than 10 times greater than the disparity in mechanical advantages. Thus, the lack of variation in mechanical advantages is not associated with a similar limitation on shape variation.

Diversity of Evolutionary Directions

The regression results suggest that the evolutionary histories of size and shape are incongruent. This was investigated by phylogenetic analyses of size and shape. The inferred nodal values implied similar histories for the magnitudes of change (Fig. 10). For both size and shape, there apparently was little if any change along most of the main stem. The exceptions were early in sciurine history when Tamiasciurus, Old World Sciurus and New World Sciurus diverge. These nodes are widely separated in time, so large changes here do not necessarily imply high rates of evolution. Within the North American radiation, there is very little evidence of shared evolutionary change. Even among those small clusters that share relatively more recent common ancestors, the largest changes are inferred for the branches leading to the terminal taxa (cf., niger and aureogaster groups). This pattern is also found in the South American radiation, which differs primarily in having much larger divergences of terminal taxa from the main branch. These large changes may be partly due to the less complete sampling of that lineage, but they also reflect the much greater diversity of sizes and shapes observed in that clade. The two Microsciurus are part of that greater diversity, but even if they were excluded there are still several distances among South American species that are greater than any distance among North American Sciurus.

Fig. 10
figure 10

Evolutionary changes in a jaw centroid size and b shape. Branching pattern is the same as in Fig. 2a; vertical distances are proportional to magnitudes of changes inferred by squared change optimization. For centroid size, black indicates size increases, gray indicates size decreases, and the vertical scale bar represents a change of 1 cm. For shape, branch lengths represent Procrustes distances between shapes inferred for adjacent nodes; the vertical scale bar is in units of Procrustes distance

The pattern of short internodal branches and long terminal branches reflects frequent parallelism and reversals in a one-dimensional trait like size, but in a multidimensional trait like shape, it may represent a great diversity of evolutionary directions taken by the terminal lineages. The divergent evolutionary directions taken by sister taxa are particularly evident for species at the ends of long terminal branches (Fig. 11). For example, S. aberti (Fig. 11a) differs from the ancestor it shares with S. griseus in having a taller and more hooked coronoid process and a thinner proximal condyloid process (i.e., thinner near the joint). S. aberti also has a somewhat elongate horizontal ramus and a posteriorly placed angular process. Its sister species, S. griseus, is inferred to have diverged from their most recent common ancestor in the opposite direction (Fig. 11b). A more striking example can be seen in the divergences of S. igniventris and S. spadiceus from their common ancestor, which differ in both magnitude and direction (Fig. 11c, d). M. flaviventer also has diverged farther than S. ignitus from their common ancestor (Fig. 11e, f), but more notable, these two clearly exhibit different combinations of localized changes. The posterior tip of the condyloid process is elongated in both, but the coronoid is transformed in opposite directions (shortening in M. flaviventer, elongating in S. ignitus), and some regions are transformed in only one species (the angular and distal condyloid in M. flaviventer, the posterior diastema and anterior tooth row in S. ignitus). M. alfari and S. granatensis provide another example of sister species with different combinations of localized divergence from their common ancestor (Fig. 11g, h). It is also evident that the dwarf species M. flaviventer and M. alfari diverge in different directions from their respective common ancestors (cf. Fig. 11e, g), and converge on a common form only in shortening the tip of the coronoid and elongating the condyloid. When shape evolves in such a great diversity of directions, there can be little correspondence between the evolution of shape and the evolution of size.

Fig. 11
figure 11

Inferred shape changes in four pairs of sister taxa as deformations of their respective most recent common ancestors. a S. aberti, b S. griseus, c S. igniventris, d S. spadiceus, e M. flaviventer, f S. ignitus, g M. alfari, h S. granatensis

Discussion

Mechanical advantages of the input lever arms were extremely stable across the size range of animals that we studied, agreeing with prior studies by Velhagen and Roth (1997) and Thorington and Darrow (1996). And like Velhagen and Roth, we found that the masseter moment arms were isometric relative to the output arms, but the temporalis arms were negatively allometric. These results indicate mechanical properties of the tree squirrel jaw are extremely stable, as Radinsky (1982, 1985) found for several other mammalian lineages.

In contrast to the lever arm ratios, evolution of jaw shape in these squirrels was much less restricted. The shape differences were small, consistent with the squirrels’ reputation for conservatism, but they were not aligned along a single trajectory correlated with size. Once phylogenetic relationships were taken into account, it was clear that there was no meaningful correlation between the evolution of size and the evolution of shape. There is a statistically significant correlation, but it is mostly due to a small number of lineages with large transformations in both size and shape. Even in those few large contrasts, the amount of shape change that is correlated with the size change is a small fraction of the total. Furthermore, reconstruction of shape changes between nodes demonstrated that the lineages with the largest shape transformations evolved along several different trajectories, not just up and down a single one. Thus, despite the extremely consistent proportions of functionally important lengths, jaw shape evolved in a variety of directions, producing statistically significant differences in jaw shape.

The shape data also lead us ask what might be the value of conserving lever arm length ratios when so many other features of jaw shape are more labile. Given the mechanical principles elaborated previously (Davis 1955; Bramble 1978; Greaves 1982), it may seem reasonable to suppose that there might be a special advantage to these particular values as they relate to the effectiveness of the jaw closing musculature. In fact, the tree squirrels’ remarkable consistency may seem like an example of the many-to-one mapping of phenotypes to function discussed by Alfaro et al. (2004, 2005). However, the shape variation that exists within the apparent constraint of these length ratios suggests substantial differentiation in the forces borne by these jaws, which may indicate equally substantial differentiation in selection of food items. In addition, the consistency of relative lengths from the condyle to a muscle’s attachment points do not necessarily equate to a consistent perpendicular distance from the condyle to that muscle’s line of action (Bramble 1978). The latter distance is the effective lever arm length, which determines the true mechanical advantage (also called ‘mechanical potential’ by Carraway and Verts 1994). The shape data analyzed in this study suggest that positions of muscles relative to the condyle are shifting, and that these changes may be independent for anterior and posterior components of the muscles (as demonstrated by the base and tip of the coronoid process). Consequently, lines of action and effective lever arm lengths may be undergoing much greater transformations than would be inferred from the distances of muscle attachment sites from the condyle. By extension, jaw mechanical properties and the loading regimes jaws experience may be more diverse than might be inferred from the stability of lever arm length ratios.

Despite their small magnitudes, the shape changes are also functionally important. For example, the two Microsciurus species and several of the other small taxa diverge from their larger relatives in having deeper curves between the posterior processes (coronoid, condyloid and angular). These deeper curves not only reduce the relative size of the area available for muscle attachment, they also result in a more slender and less tapered process that is less able to resist deformation than a thicker, more tapered process with a broader base. This pattern is consistent with the scaling of bite forces seen in some studies (Herrel et al. 2002; Herrel et al. 2005; Lemen 2008). Smaller taxa have relatively smaller muscles that produce relatively smaller strains requiring less robust processes. Conversely, the greater dorso-ventral thickness of the ramus and diastema in taxa like S. spadiceus or both Tamiasciurus species is consistent with these animals generating proportionately larger forces to gnaw through harder objects than relatives and competitors. The functional significance of these changes is not immediately obvious because expanded muscle attachment areas of the coronoid and angular processes are consistent with both generating comparatively larger peak forces and greater differentiation of muscle activity patterns between anterior and posterior segments of the attached muscles. Greater differentiation would allow an animal to exert large forces at many different gape angles, or a sustained force through a large arc and over an extended period of time. The latter muscle activity pattern would be consistent with shaving long slivers off of a hard shell, like a carpenter planing a board, but either would help to explain how squirrels are able to gnaw through rinds, shells or cones that approach or exceed the diameters of their heads.

The above analysis leaves the question concerning the value of conserving lever arm length ratios unanswered, and indeed, the question cannot be answered with the available data. Rather the shape data examined here serve as a warning that jaws exhibit more morphological diversity than is represented by an analysis of a few lever arm lengths. Even in taxa that have only recently radiated and have not undergone substantial ecological or functional transformations, shape differences can be both statistically significant and functionally relevant. In this regard, a more instructive analysis might be one that focuses on taxa that have diverged for a longer period of time or have undergone a larger ecological shift. However, squirrels have not just evolved from generalized to specialized; they have also shifted ecological types. Recent phylogenetic analyses (Harrison et al. 2003; Steppan et al. 2004; Herron et al. 2004) suggest that the transformations between arboreal durophage, terrestrial granivore and grazer may have occurred multiple times. These more substantial functional shifts may have more profound effects on how the anatomy and variation of the jaw are structured and integrated.