Keywords

Introduction

Modern humans engage in sophisticated social interactions, including division of labor, long-term planning, and recreation, all of which support human culture and its continued evolution. The influential social brain hypothesis posits that the human evolution has largely been shaped by the increasingly more complex social structure over the course of primate evolution. In particular, the size of the neocortex in social primates increases in direct relation to the number of members in a typical social group for that species (Dunbar and Shultz 2007).

Social interactions require language and communication. The interactive nature of communication dictates that all members in the social group understand and then use the same language to exchange information, implying that imitation is critical. A model for imitation is the shared circuits model (SCM) (Hurley 2008). This model has implicated several cerebral cortical structures including the mirror neuron system (Rizzolatti et al. 1988, 2002). Already prominent as the most basic element of the SCM (layer I in the SCM model), however, is active perception . Known functions and patterns of anatomical connectivity of the cerebellum suggest that it may be a critical component, if not the major center, of all types of active perception. To assess the role of the cerebellum in the evolution of language and social communication, a comparative study was carried out to examine the evolution of the cerebellum for clues that may address the SCM and in particular active perception and its roles in imitation and other complex behaviors. As language and communication are largely supported by the function of cerebral cortical structures and that human communication is uniquely characterized by high information content, the present emphasis is the evolution of cerebro–cerebellar relationship and the information coding capacity of the cerebellum. These findings are discussed with a focus on potential contributions of the cerebellum to human evolution and to complex behaviors as adaptive responses to selection pressure.

Comparative Changes in the Cerebellum and Information Processing

Estimated Comparative Changes in the Volume Fraction of the Neocerebellum

All the data used here have been published previously. Data for cerebellar volume fractions of 95 species of fish with a broad sampling were obtained from Bauchot et al. (1989); Lisney and Collin (2006); Platel et al. (1977); Wagner (2001); Yopak et al. (2007). Volume fractions of ten amphibian species were obtained from Taylor et al. (1995), and 35 species of reptiles from Platel (1976). Cerebellar volume fractions for 141 species of birds, with broad taxonomic sampling, were obtained from Portmann (1947), who reported the volumes of the brainstem, optic lobes, cerebellum, and striatum. Additional data from Iwaniuk and Hurd (2005); Iwaniuk et al. (2005, 2006); Kalisinska (2005) brought the total to 247 species of birds. Cerebellar volume fractions for mammals were obtained from Stephan et al. (1981) as summarized in Clark et al. (2001). This data set includes 75 species of insectivores and primates, with data on the volume as well as volume fractions of the medulla, mesencephalon, diencephalon, telencephalon, and cerebellum. Additional data were obtained from Knudsen et al. (2002); Marino et al. (2000); Pilleri (1967); Pirlot and Kamiya (1985).

The detailed data analysis have been reported. Briefly, the volume (assumed to be proportional to mass) of a component of the brain can be related allometrically to the volume of the entire brain by a power relationship of the form (e.g., see Barton and Harvey 2000)

$$ {V}_{\mathrm{comp}}=a\, \cdot {V}_{\mathrm{brain}}^b $$
(1)

where the value of the exponent [b] describes the rate of increase in the component relative to the whole brain. With a logarithmic transformation, Eq. 1 becomes

$$ \log \, {V}_{\mathrm{comp}}=\, \log \, a\, +\, b\, \log \, {V}_{\mathrm{brain}} $$
(2)

In Fig. 1, the logarithms of the volumes of the cerebellum and the telencephalon increase approximately linearly as a function of the logarithm of the total volume of the brain in fish, amphibians, reptiles, birds, and mammals. A regression analysis showed that [b] is close to 1 (Table 1). Thus, both the cerebellum and the telencephalon remained a constant proportion of the brain over the whole range of size (isometry) and a good approximation for Eq. 1 becomes

$$ {V}_{\mathrm{comp}}=a\, \cdot {V}_{\mathrm{brain}} $$
(3)

where [a] equals [Vcomp/Vbrain], or [Fcomp], the volume fraction of the brain component in question.

Fig. 1
figure 1

The volume (assumed to be proportional to mass) of the cerebellum (left) and the telencephalon (right) as a function of the total volume of the brain in fish, amphibians, reptiles, birds, and mammals

Table 1 Volume fractions (assumed to be proportional to mass fractions) of the telencephalon, cerebellum, and the neocerebellum in five classes of vertebrates, and the means and standard deviations of [Fcbl] and [Ftelen]. Note: se = standard error of the estimate, sd = standard deviation of the distribution of species values. This method of estimation does not allow the determination of values for the SD of fneocbl

An analysis of covariance was next used to determine the relationship of log10-transformed component size to brain size with the different taxonomic groups (fish, amphibian, reptile, bird, and mammal) as fixed effects in the model. To facilitate comparison among animals with widely varying brain volumes, the fractions that the cerebellum and telencephalon constitute of whole brain were also examined (Fig. 2).

Fig. 2
figure 2

The volume fraction of the cerebellum (left) and the telencephalon (right, as a fraction of the total volume of the brain) in fish, amphibians, reptiles, birds, and mammals

For the cerebellum, the interaction term between brain size and taxon was significant (F = 4.0, df = 4426, P = 0.004), but it explained less than 0.1% of the total variance and the interaction was deleted from the model. Thus, the cerebellum/brain relationship was treated as having parallel slopes within each of the vertebrate classes. The common slope of the logarithmic cerebellum/brain relationship was 0.967 ± 0.010 se (P < 0.0001), which was close to but significantly less than 1 (isometry). Thus, the volume fraction of the cerebellum (as a fraction of the whole brain) decreased slightly with increasing brain size. Overall, the analysis of covariance explained 98.1% of the total variance in cerebellar volume (F = 4460, df = 5430, P < 0.0001).

For the telencephalon (striatum in birds), the interaction term between brain size and taxon was significant (F = 17.4, df = 4319, P < 0.0001), but explained only 0.2% of the total variance. An initial analysis of the telencephalon/brain relationship was made assuming parallel slopes for each of the groups. Accordingly, the common slope of the logarithmic telencephalon/brain relationship was 1.088 ± 0.010 se (P < 0.0001), which significantly exceeded 1. Thus, the telencephalon fraction (as a fraction of the whole brain) increased with increasing brain size. Overall, the analysis of covariance explained 99.0% of the total variance in telencephalon volume (F = 6272, df = 5323, P < 0.0001).

Table 1 is a summary showing that the volume fraction of the cerebellum [Fcbl] decreased sharply from 0.274 in fish to 0.016 and 0.029 in amphibians and reptiles, respectively, followed by an increase to 0.119 and 0.128 in birds and mammals, respectively. By contrast, the volume fraction of telencephalon [Ftelen] increased progressively from fish (0.185), to amphibians (0.243), reptiles (0.43), birds (0.631), and mammals (0.683). Values of [Fcbl] were statistically indistinguishable for birds and mammals (Table 1). The same was true for [Ftelen]. In Fig. 3, the volume fraction of the cerebellum (as a fraction of the whole brain) in mammals and in birds remained at 10–15% of the overall volume of the brain as the brain increased in size. This observation was consistent with the analysis of (Clark et al. 2001). The invariance of [Fcbl] in birds and mammals is remarkable despite the absence of morphologically discernable cerebellar hemispheres in the bird cerebellum (but see Brodal et al. 1950) and despite the variation of the cerebellar hemispherical fraction from ∼50–60% in typical mammals to 90–95% in human (Ghez 1991; Ito 1984). In the mammalian cerebellum, the lack of comparative change in [Fcbl] contrasts sharply with the findings on the comparative changes at the cellular and synaptic level (see later).

Fig. 3
figure 3

Volume fraction of the cerebellum (as a fraction of the whole brain) remains relatively constant with increasing brain volume

To estimate the size of the neocerebellum as a fraction of the cerebellum, an assumption was made: the volume of the various areas or subdivisions of the cerebellum dedicated to the analysis of information from the telencephalon and those from “other” brain structures should be proportional to the size of afferent projections from the telencephalon and “other” brain structures, respectively, or,

$$ {\mathrm{F}}_{\mathrm{cbl}}={\mathrm{g}}_{\mathrm{t}}\cdot \, {\mathrm{F}}_{\mathrm{t}\mathrm{elen}}+{\mathrm{g}}_{\mathrm{o}}\cdot \, {\mathrm{F}}_{\mathrm{o}\mathrm{ther}} $$
(4)

In Eq. 4, [Fcbl], [Ftelen], and [Fother] are the volume fractions, [gt] and [go] are the appropriate proportionality constants with their magnitudes reflecting the influence of the telencephalon and other afferent brain structures, respectively, on the volume fraction of the different subdivisions of the cerebellum. This assumption is supported by several lines of evidence. First, unlike the cerebral cortex, the cerebellar cortex contains no cortico-cortical pathways or cortical regions for the analysis of information from other areas of the cerebellar cortex. A given circumscribed area of the cerebellum is connected with an extra-cerebellar target via dedicated connectivities (Ramnani 2006; Strick et al. 2009; Sturrock 1989). Detailed results from neuroanatomical tracing studies suggest that the cerebral cortical area that is the main target of each output channel from the cerebellum is also the major source of input to that channel. Thus, a closed-loop circuit would best represent the major architectural unit of cerebro–cerebellar interactions. Second and consistent with the above idea, the local cerebellar cortical synaptic circuitry forms a corticonuclear complex and is everywhere the same within the cerebellum, suggesting that cerebellar function expressed by the input–output transfer function is everywhere the same within the cerebellum (Ito 2006). If these broad assumptions are justified, it follows then that the volume fraction of the neocerebellum (as a fraction of the cerebellum) may be estimated when the proportionality constants [gt] and [go] can be evaluated. This is accomplished first by recalling the normalization statement,

$$ {\mathrm{F}}_{\mathrm{cbl}}+\, {\mathrm{F}}_{\mathrm{telen}}+{\mathrm{F}}_{\mathrm{other}}=1 $$
(5)

Substituting (Eq. 5) into (Eq. 4) to eliminate Fother,

$$ {\mathrm{F}}_{\mathrm{cbl}}=\left[\left({\mathrm{g}}_{\mathrm{t}}-{\mathrm{g}}_{\mathrm{o}}\right)/\left(1+{\mathrm{g}}_{\mathrm{o}}\right)\right]\, \cdot {\mathrm{F}}_{\mathrm{t}\mathrm{elen}}+\left[{\mathrm{g}}_{\mathrm{o}}/\left(1+{\mathrm{g}}_{\mathrm{o}}\right)\right] $$
(6)

To estimate [gt] and [go] for each class of animal, a regression analysis of [Fcbl] was carried out based on Eq. 6 (Fig. 4). The slope of the regression yielded estimates of [(gt – go)/(1 + go)] and the intercept yielded estimates for [go/(1 + go)]. Values of the slope and intercept were then used to compute [gt] and [go], which, in turn, made it possible to further estimate [gt Ftelen]. This represented the portion of the cerebellum that may be dedicated to the telencephalon, which was, by definition, the neocerebellum (fneocbl in Table 1).

Fig. 4
figure 4

The cerebellar volume fraction (of the whole brain) as a function of telencephalon volume fraction (also of the whole brain) in fish, amphibians, reptiles, birds, and mammals

For fish, amphibian, and reptile, the values of [fneocbl] were ≤4%. The cases of the bird and the mammalian cerebellum (particularly the primate) are of interest, as these classes of animals are considered by behavioral scientists to exhibit considerably more complex behaviors and have been the subjects of various investigations in cognitive function (Briggs et al. 2002; Emery and Clayton 2004). Complex behaviors in birds and mammals are consistent with the relatively high values of [Ftelen] in mammals and in birds compared with those in amphibians and reptiles (Fig. 4). In addition, the slope [(gt – go)/(1 + go)] and the intercept [go/(1 + go)] in Fig. 4 for birds and mammals were generally larger than those in amphibians and reptiles. The value of [fneocbl] was, on average, higher in mammals (∼58%) than in birds (∼11%). The lower [fneocbl] in birds was consistent with the observation that the bird cerebellum did not have visually noticeable cerebellar hemispheres at the gross morphological level. These results also lend support to the hypothesis that the transformation of the cerebellum toward being a general-purpose computational engine may have begun in a common ancestor of birds and mammals. The higher [fneocbl] in mammals suggest that full attainment as a general-purpose computational engine, however, is most evident in the mammalian cerebellum.

Comparative Changes in the Number of Granule-Cell-Purkinje-Cell (gcPc) Synapses in Mammals

All data used here have been published previously. The detailed data analysis has also been reported. Briefly, the average number of gcPc synapses per single granule cell was estimated by dividing the average parallel fiber length with the average inter-varicosital distance (Huang et al. 1999). This approach entails several assumptions. One, there must be a one-to-one correlation between synapses and synaptic varicosities. This assumption is supported by results from combined Golgi and electron microscope studies of the cerebellar parallel fibers (Harris and Stevens 1988; Palay and Palay 1974; Xu-Friedman et al. 2001). Two, the synapses between granule cells and inhibitory neurons may be ignored by assuming that all the synapses on the parallel fibers are gcPc synapses. This assumption is also well supported (Huang et al. 2006a; Palay and Palay 1974; Palkovitz et al. 1971c). Three, the gcPc synapses in the ascending segments of the granule cell axons may be ignored as they are far fewer (<7%) than those in the parallel fibers (Huang et al. 2006a; Napper and Harvey 1988a, b; but see (Gundappa-Sulur et al. 1999). Four, relatively constant values for inter-varicosital distances have been reported in studies of cerebellar fibers from a number of animal species (Harvey and Napper 1988; Palkovitz et al. 1971a, b, c; Napper and Harvey 1988a, b; Huang et al. 1999, 2006a; Pichitpornchai et al. 1994; Shepherd et al. 2002). This leaves the number of gcPc synapses per granule cell directly proportional to the parallel fiber length. Five, the status of the cerebellar synaptic network may change with age and may not be adequately defined by a snapshot from one single point in time. In general, the ages of the animals used in this analysis were not available in the literature. The number of gcPc synapses per mouse granule cell in Table 2 was derived from the 3-month-old mouse, in which this value was at its peak and approximately twice the magnitude at 6 months (Huang et al. 2006b). The data on parallel fiber length in the human cerebellum were derived from the cerebellar hemisphere of a 50-year-old male with the methods previously described (Huang et al. 1999) and most likely did not represent a peak value but is in good agreement with estimated values reported previously (Smolyaninov 1966, also cited in (Ito 1984). Taken together, uncertainties introduced by variation in age and other factors were not likely to change the estimates in Table 2 by a factor of 2, whereas many of the microscopic parameters for the human cerebellum in Table 2 were much larger and differed by an order of magnitude or more from those of the other species. Once the number of gcPc synapses per granule cell has been determined, the total number of gcPc synapses was derived by multiplying this number by the number of granule cells. Granule cell counts were obtained from the literature for mouse, rat, cat, rhesus monkey, and human (see references in Table 2). Additional data on the mean length of parallel fibers and mean inter-varicosital distances were obtained from data in the literature (Brand et al. 1976; Harvey and Napper 1988; Huang et al. 1999, 2006b; Napper and Harvey 1988a, b).

Table 2 Comparative changes in cerebellar cellular and synaptic parameters. The numbers in italics are calculated whereas others are measured data. Although the parallel fiber length of the cat has been estimated stereologically to be from 2 to 4 mm (Palkovitz et al. 1971a), the experimental data from Brand et al. (1976) are preferred. This puts the cat parallel fiber length at 6 mm. This value is longer than that of the rhesus monkey and may represent a special functional adaption. Still, note that the data on human cerebellum often stand out by a factor of ten or more from the rodents

In Table 2, only five primary parameters of the cerebellum must be measured (rather than calculated from other parameters): the volume of the cerebellum, the number of granule cells, the number of Purkinje cells, the length of parallel fibers, and the linear density of synapses along parallel fibers. Comparisons of measured or calculated parameters are facilitated by normalizing the values relative to the mouse (Table 3). For example, the body mass ratio from mouse to rat, cat, monkey, and man is 1, 12, 120, 800, and 2400. This normalization separates the parameters in Table 3 into different categories. Category I parameters change proportionately with body mass: the volume of the cerebellum (1, 6, 48, 180, 3000) and the total number of granule cells (1, 3.7, 81.5, 220, 4000). The proportional relationship between the volume of the cerebellum and the number of granule cells has been noted previously (Fox and Barnard 1957; Ito 1984; Mayhew 1991). Category II parameters also increase with body mass, but less rapidly: number of Purkinje cells (1, 2, 6.25, 15, 150) and number of gcPc synapses per Purkinje cell (1, 3, 38, 40, 133). Category III parameters – ratios, densities, fiber lengths – with no obvious reasons to be related to body or brain volume but nonetheless increase with body mass: length of parallel fibers and the number of gcPc synapses per granule cell (both at 1, 1.5, 3, 2.9, 5), and the density of gcPc synapses per unit volume (1, 0.91, 5, 3.5, 6.6). Category IV parameters remain constant with body mass: density of granule cells per unit volume (1, 0.6, 1.75, 1.25, 1.25). This should be evident as the number of granule cells and the total cerebellar volume are both category I parameters (but see Lange (1975) who has reported a change of ∼1/2 order of magnitude over five orders of magnitude of cerebellar volume). Another category IV parameter is the inter-varicosital distance (1, 1, 1, 1, 1) (Harvey and Napper 1988; Huang et al. 1999; Napper and Harvey 1988a, b; Palkovitz et al. 1971a, b, c; Pichitpornchai et al. 1994). One category V parameter, the density of Purkinje cells per unit volume (1, 0.33, 0.13, 0.08, 0.05), decreases with body mass. Finally, one category VI parameter, the number of gcPc synapses (1, 5.5, 236, 620, 20,000), increases faster than the volume of the cerebellum, particularly from monkey to human (Fig. 5). The increase from monkey to human (32X) is largely due to an increase in the number of granule cells (18X), together with an increase in the length of parallel fibers (1.7X). The number of gcPc synapses is of potential interest as the capacity of information storage in brain systems has been suggested to be of the order of 1–2 bits per synapse (Chklovskii et al. 2004).

Table 3 Comparative changes in cerebellar cellular and synaptic parameters. The values for the mouse are set at 1. The parameters and numbers in italics are calculated whereas others are measured data
Fig. 5
figure 5

Ratio of cerebellar circuitry parameters (Y-axis) as a function of cerebellar volume ratio (X-axis). The values for the mouse are set arbitrarily as one (mouse data as the origin). The other points are rat, cat, rhesus monkey, and man, in that order. Legends: number of gcPc synapses (crosses), granule cells (small solid squares), Purkinje cells (larger empty squares), and length of parallel fibers (horizontal line segments). Comparative changes of the parallel fiber length and the number of Purkinje cells are less steep than the number of granule cells, which scales proportionally with the volume of the cerebellum. The number of gcPc synapses varies more steeply than the number of granule cells. Both the X- and the Y-axes are logarithmic

Summary

Results of experimental measurements at the gross morphological level have shown that amphibians and reptiles have very small cerebellum (<3% of the total brain volume). The volume fraction of the cerebellum increases to 10–15% of the whole brain in mammals and in birds. The neocerebellum in fish, amphibians, and reptiles is likely to be insignificant. Birds (neocerebellum at ∼11% of the whole cerebellum, on average) and mammals (neocerebellum at ∼58%, on average) have significantly larger neocerebellum fractions compared with all other animals. Even though the volume of the cerebellum largely remains proportional to the telencephalon in mammals such that the volume fraction of the cerebellum remains constant, the number of gcPc synapses increases dramatically according to 1:5.5:236:620:20,000 from mouse, rat, cat, rhesus monkey, to human.

The Role of the Cerebellum in Complex and Adaptive Behavior

Mammals, which as a group have generally larger neocerebellum than birds, also exhibit the largest repertoire of complex behaviors. In mammals, humans exhibit the most complex behaviors and the human neocerebellum accounts for 90–95% of the cerebellar volume (Ghez 1991). Do the cerebellum and its evolution play any important role in adaptive and complex behavior? This question is examined in three sections: (1) the role of the cerebellum in adaptive behavior, (2) the role of information-processing capacity in complex behavior, and (3) the implications of comparative changes of the cerebellum in complex and adaptive behavior.

Adaptive Behavior and the Cerebellum: Action Perception and Learning and Memory

Much of the complexity in human behavior, particularly modern human behavior, is adaptive in nature and is acquired after birth. The emergence of adaptive changes in behavior is dependent upon how the brain responds to novel stimuli from the environment. In active perception, the brain may encounter for the first time a novel sensory input from the environment, analyze it, and then make modifications on brain structure and function via intrinsic mechanisms of neuroplasticity. On subsequent encounter with the similar sensory stimuli, a modified and more experienced brain can make better use of the afferent information and generate better responses in order to improve upon a behavior or action. In the shared circuits model (SCM), active perception forms part of the foundation for more complex behavior such as imitation (Hurley 2008).

In many known cerebellar operations, adaptive cerebellar neuroplasticity allows new behavioral solutions to emerge in response to novel or unexpected stimuli that have become repetitive. Detailed cellular and molecular mechanisms of cerebellar neuroplasticity are known and the part concerning the gcPc synaptic circuitry is particularly well known (Hansel et al. 2001; Ito 2006; Jorntell and Hansel 2006). A generally accepted hypothesis is that cerebellum-bound climbing fibers carry error messages or information about an unexpected event to the Purkinje cell (e.g., see Apps and Garwicz (2005); Bloedel and Bracha (1998)). These messages contribute to associative learning by providing information on timing and other properties of the error message or the unexpected event of interest. An important function of climbing-fiber discharges is to mark the time for Purkinje cells to search for coincidental mossy-fiber events, whose associated gcPc synapses are then subjected to long-term depression (LTD). After many encounters with such events, changes in behavior can occur. The gradual acquisition of newly learned behavior or response to such an event is often accompanied by reduction or absence of climbing-fiber discharges, signaling that error messages are no longer needed. In this way, the cerebellum helps to generate the most appropriate responses to the novel stimuli.

For example, normal vestibulo-ocular reflex (VOR) causes eye movement in a direction opposite to the angular acceleration of the head so that there is no retinal slip and the fovea can better track the visual target of interest. In a subject wearing magnifying or minifying glasses, a “learned” VOR with a different gain can develop with time so that once again retinal slip can be minimized or eliminated (Blazquez et al. 2003). Here the archicerebellum (mainly dedicated to the vestibular system) and the vestibular nuclei, which are phylogenetically related to the cerebellar nuclei, accomplish this feat by learning-related cellular mechanisms which modify the details of the VOR neural circuitry and thereby the equations relating the direction of angular acceleration of the head to the direction of the eye movement. In the paleocerebellum (mainly dedicated to the spinal cord), the cerebellar neuroplasticity similarly modifies the equation relating muscle spindle length to contractile force of the skeletal muscle so that the stretch reflex and other postural reflexes are appropriately and dynamically modulated to make way for, and indeed facilitate, purposeful movement (Ito 1984; Lisberger 1988). In the neocerebellum (mainly dedicated to the cerebral cortex), eyeblink conditioning is another example of such procedural learning in which a tone (as unconditioned stimulus) may have no biological significance at first but can quickly drive the eyeblink (as conditioned response) if the tone is paired in an associative manner with an air puff to the eye (Attwell et al. 2002; De Zeeuw and Yeo 2005; Thomson et al. 1997).

Cerebellar analyses of afferent information are therefore constantly modified in an interactive manner based on climbing-fiber discharges, leading to active perception supported by LTD and subsequent functional and structural changes within the gcPc synaptic system of the cerebellar cortex without affecting the peripheral sensory mechanisms. It is therefore possible to view active perception and procedural learning as overlapping, if not identical, concepts. The cerebellum is therefore a major center for active perception of signals from the environment in the generation of adaptive behavior .

Complex Behavior and the Cerebellum: Information Coding and Processing Capacity

All animals are exposed to the same information from the environment. They ignore irrelevant signals from the environment, recognize and subsequently utilize biologically significant information in environmental signals, and generate complex behaviors via associative learning or active perception in order to avoid extinction. As the mammalian cerebellum evolves, it acquires granule cells at a rate faster than Purkinje cells while it gradually lengthens its parallel fibers (Fig. 5). Whenever prompted by a climbing-fiber discharge with an error message, higher granule-cell-Purkinje-cell ratio and longer parallel fiber would allow each Purkinje cell to search more exhaustively for coincidental mossy-fiber events (in reference to a climbing-fiber event) for the purpose of better associative learning. The enhanced sensitivity of the associative learning process leads to greater and more effective behavioral plasticity. The neocerebellum, which has a demonstrated ability to recognize biological significance in environmental stimuli based on associative learning principles (e.g., as in eyeblink conditioning), can be expected to be particularly important to cognitive behavior such as those in social interactions.

Relative to eyeblink conditioning, many other procedural learning processes involve considerably more complex movement sequences which require high information content for their execution. The same can be expected to procedural leaning involving cognitive behavior. Indeed, while social interactions including division of labor, long-term planning, and recreation may be observable in other animal societies such as those of social insects (recreation may be a noticeable exception in social insects but play is common in social mammals and can have social implications (Van Leeuwan et al. 2010)), the foremost feature that characterizes human culture, particularly modern human culture, from all other culture may be in its high information content in complex human behavior.

The mammalian cerebellum contains an enormous number of granule cells. Cerebellar granule cells can reach to >70% of all brain cells in a number of rodents and are likely to be even more in human (Herculano-Houzel et al. 2006). In humans, there is a massive mossy-fiber-granule-cell (mfgc) afferent system from the neocortex to the cerebellum with the collective afferents via the pons containing 40 million fibers (Leiner et al. 1989). Next to the corpus callosum, these fibers constitute one of the largest fiber tracts in the entire central nervous system. This is matched by an equally impressive gcPc synaptic system, estimated to contain 2.2 × 1014 synapses in the human cerebellum (Table 2). Marr (1969) has proposed that the massive cerebellar system provides a large capacity to code in detail all kinds of sensory information at the input stage in cerebellar information processing. Data from eyeblink conditioning studies suggest that the gcPc synaptic system may offer storage for information on motor timing and the coordination of intricate movement sequences available for motor programming at the output stage as procedural or non-declarative memory via immediate recall without further involvement of climbing fibers (Ito 2006). A large cerebellar information coding and processing capacity can therefore improve the analysis and extraction of biological significance from environmental signals on the sensory side and support a more precise (and therefore information-laden) motor response including gesturing and speech motor control. In communication, it is necessary that increases in the information coding and processing capacities for the analysis of subtle sensory cues are matched by appropriate increases in the generation of codes with which to execute movements with a capacity to express equally subtle cues. Comparative increases in the number of gcPc synapses (Tables 2, 3, Fig. 5) suggest, but do not prove, that humans are better supported in terms of information coding capacity which is needed for motor control precision in both the spatial and the temporal dimension. Examples of these high-information-content motor behaviors include the fine and skilled manipulation of voluntary movement sequences in upper limbs and their digits, gesturing and facial expression, the considerable speech motor control, as well as the willful manipulation of abstract symbols in the use of language and logic (think of them as mental gymnastics) (Molinari et al. 1997). With few candidate brain structures rivaling the cerebellum in the scope of its connectivities with other brain structures (Ramnani 2006; Strick et al. 2009), the capacity for information processing (Marr 1969), the ability to provide dynamic neuroplasticity (Hansel et al. 2001; Ito 2006; Jorntell and Hansel 2006), and the unique role serving both sensory analysis and motor programming (Bower 1997; Gao et al. 1996), the cerebellum is an underappreciated but ideal brain center supporting complex behaviors such as language and communication – the foundation of social interaction.

Comparative Changes of the Cerebellum and Behavior

The volume fraction of the cerebellum takes a significant reduction from fishes (∼27%) to amphibians and reptiles (<3%). This is accompanied by a reduced role of the lateral-line system in locomotion and perhaps also the role of locomotion in survival (Videler 1993). For amphibians and reptiles, their smaller cerebellum is not likely to bring about significant active perception in behavior, including motor behavior. The volume fraction of the cerebellum jumps to 10–15% in birds and mammals. Although such increase in birds has been attributed to flying (Paulin 1993; Witmer et al. 2003), the increasingly more significant role of the neocerebellum is surely a more important factor leading to the expansion of the mammalian cerebellum. Within major cerebellar subdivisions, the neocerebellum fraction is insignificant in amphibians and reptiles but becomes larger in birds (11% of the cerebellum) and more so in mammals (58%). Indeed, in both birds and mammals, the expansion of the cerebellum is nearly proportional to the telencephalon (Clark et al. 2001; Sultan 2002, Fig. 3). An interpretation of the constancy of the cerebellar fraction in birds and mammals (∼10–15% of the whole brain) may be that the cerebellum serves as a general-purpose computational engine for the whole brain (Herculano-Houzel 2010; Herculano-Houzel et al. 2006, but also see Clark et al. 2001). In view of the role of the cerebellum in active perception and procedural learning, it may be further suggested that the nature of cerebellar computation is to provide the dimension of neuroplasticity. The larger neocerebellar fraction in mammals further suggests a role of cerebellar neuroplasticity in cerebral cortical function, including cognition. Comparative changes in the number gcPc synapses are also evident in mammals even though the volume fraction of the cerebellum remains constant. Increased number of gcPc synapses can enhance the sensitivity and effectiveness of climbing-fiber-related learning and neuroplasticity and improve the spatial and temporal precision of motor as well as cognitive behaviors in climbing-fiber-free cerebellar operations. These ideas help to underscore the significance of active perception and/or associative procedural learning in brain modeling as well as the role of information-processing capacity of the cerebellum in complex behavior and its acquisition.

Cerebellum and Human Evolution

What aspect of being human is most prominent in human evolution? If complex behaviors in animals are effective to stave off extinction, then they should be interactive, preferably proportional, and adaptive in response to the changing environment. Practically for all species in the animal kingdom successfully surviving extinction, changes in their repertoire of behavior have kept pace with (the relatively slow) changes in environmental complexity over hundreds of millions of years. Humans may represent an exception in that human behavioral complexity has increased at a faster rate compared with most other animals and compared with changes in environmental stimuli since the appearance modern humans about two hundred thousand years ago (for examples in which human culture with advanced language and communication suffered catastrophic failures, see Diamond (2005)). The pace of this change may arguably be quickened in recent times. The reason for such increases in human behavioral complexity is in the relationship between selection pressure and response. Under selection pressure, a basic measure on the potency of a given behavioral response should be defined as how information content of the complex behavior is matched against information content of the environmental complexity. One reason for this exception, therefore, may be that human social interaction generated complex pressure which demands high-information-content responses with human language and communication as a major example. A second reason is that language and communication can also represent sensory stimuli which, in turn, further enrich the sensory world beyond what can be provided by environmental stimuli. The responses to social selection pressure, therefore, can become additional sources of selection pressure. Language and communication are complex adaptive behaviors in response to the kind of selection pressure described in the social brain hypothesis (Dunbar and Shultz 2007), but they can foster further escalation of social and cultural evolution.

What is the fundamental difference between various adaptive behaviors exhibited in many animal species and complex adaptive behaviors such as human communication and language? Results of behavioral genetic studies in the domestication of fox suggest that adaptive behavior such as aggression can be manipulated significantly by selective breeding over a matter of mere 50–60 years (Kukekova et al. 2008). In the mean time, although many animal species have been domesticated for approximately 8–10,000 years or more and their genes manipulated exhaustively in selective breeding, they do not display complex behavior at the same level as humans. Chimpanzees with a genetic makeup more closely related to that of the humans than any other animals have been exposed to the human environment and even raised with humans since birth apparently are far less likely than humans to recognize and utilize the biological significance in cues from the environment (Smith 2003). The patterns of such limitations in many animal species are consistent with a lack of the required information-processing capacities in hardware. This hardware limitation, possibly manifested by a more limited gcPc synaptic system in the cerebellum, will surely lead to limitations in active perception , which, in turn, cast limitations on the types of complex behavior they can acquire, including imitation (but see Tomasello (2006)). Thus, low information capacities can support many types of adaptive behavior, but high information capacities are required for manifestation of complex adaptive behaviors. One such complex behavior may be imitation in which the quality of the imitated behavior may involve functions of error messages and the cerebellar gcPc system. In SCM, the extent in which the repertoire of complex behaviors can evolve is therefore dependent on active perception as well as the information coding and processing capacities (Hurley 2008). The human brain may be unlike the brain of many other animals in that it contains elements with significantly higher information coding and processing capacities.

What are the brain structures or systems that are likely to be important for supplying high information coding and processing capacities? The cerebellar information-processing system should rank high as such a brain structure (at least for procedural learning). Among the parameters in Table 2, the number of gcPc synapses is the one that shows the most comparative change, implying that the information coding and processing capacities of gcPc synapses are more responsive to selection pressure than other parameters in that table. The comparative expansion of the neocerebellum from fish, to amphibian, reptile, bird, and mammal strongly suggests that the mammalian cerebellum, and human cerebellum in particular, works in close association with cerebral cortical structures including those supporting imitation. As most of the afferent mossy fibers to the neocerebellum are cerebral cortical in origin, it is likely that information conveyed in such afferents are processed at the conscious level (whereas similar operations in the archicerebellum and the paleocerebellum are largely at the subconscious level). A greatly expanded neocerebellum may be the basis of the ability to be conscious of and, therefore, better able to attend to the generalized “errors” or some unexpected events of interest. The expansion of the human neocerebellum and improved cerebro–cerebellar interaction may have further brought about a blending of cerebellar procedural learning and other mechanisms of declarative learning intrinsic to cerebral cortical structures. The result can be a hybrid memory and learning mechanism that involves both the cerebral cortex and the cerebellum routinely and simultaneously while operating at a high level of information coding capacity (Squire 2004). For example, the skill or lack of it in playing tennis, basketball, music instruments, and in writing this chapter is most likely the result of such hybrid learning. The tendency toward an ever-increasing information-processing capacity may have helped to elevate the quality of human consciousness in general and facilitated the evolution of human language and communication in particular (Zimmer 2010) .

Conclusions and Future Directions

The observations on the coincidental increases in brain volume, social complexity, and cerebellar information coding capacity beg the question whether these increases are related by a causal relationship as have been suggested for the overall brain size and social complexity (Dunbar and Shultz 2007). To establish such a causal relationship would require considerable investigations in the future. In the shared circuits model (SCM), one level above active perception is imitation (Hurley 2008). Interestingly, imitation plays an important role in the acquisition of nearly all complex human behaviors. During the course of evolution, humans may have become more likely to imitate on account of a highly elaborated mirror neuron system and other related brain structures. Humans may also be better at imitation on account of a highly evolved cerebellar system that supports active perception and/or procedural learning with a large information coding capacity. The resultant impact of these ideas should be relevant to the evolution of the cerebellum as the center piece of the shared circuits model (SCM) and a key structure in imitation and other forms of complex behavior (Hurley 2008).