Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 1 Introduction

Chemosensation is responsible for the detection of chemicals in the external environment and is essential for an organism's survival and reproduction (Prasad and Reed 1999). Chemosensation originated very early in evolution, as even bacteria can respond to chemical changes in the environment. This type of chemoreception is known as the general chemical sense and is universal among organisms (Smith 2000). In this review, however, we will not study this general chemical sense. Instead, we will focus on two types of chemoreception that are animal-specific: olfaction (detection of odorants and pheromones) and gustation (detection of tastants). Owing to space limitations, we will only discuss vertebrates. It is widely thought that chemoreception plays multiple important roles in a vertebrate's daily life, including food detection and discrimination, toxin and predator avoidance, mating, and territoriality (Prasad and Reed 1999). Vertebrate chemosensory systems include the olfactory system, which detects odorants and pheromones in the nasal cavity, and the gustatory system, which perceives different tastants with the tongue. Within the olfactory system there are two anatomically distinct organs: the main olfactory epithelium (MOE) and the vomeronasal organ (VNO) (Dulac and Torello 2003). It was initially thought that the MOE and the VNO have distinct functions, as the MOE is largely responsible for the detection of ordinary odorants, while the VNO detects pheromones (Dulac 1997; Buck 2000), although the current view is that the two systems can both detect odorants and pheromones. For the gustatory system, the tongue can perceive five basic tastes: sour, salty, bitter, sweet, and umami (Kinnamon and Margolskee 1996; Lindemann 2001). Among them, the sweet and umami tastes can influence appetitive reactions and generally reflect the identification of nutrients, whereas the bitter taste may result in aversion and therefore is a defensive mechanism against ingestion of toxins (Herness and Gilbertson 1999).

The ability of the chemosensory system to detect a diverse array of chemicals is mediated by the distinct chemoreceptors encoded by several gene families. The characterization of chemoreceptor genes started in 1991 with the Nobel-prize-winning discovery of 18 rat odorant receptor (OR) genes (Buck and Axel 1991). ORs have seven transmemebrane domains and belong to the G-protein-coupled receptor (GPCR) family A. It has been proposed that the potential odorant-binding pocket is formed by the third, fifth, and sixth transmembrane domains (Emes et al. 2004). OR genes have no introns and the coding region of each gene has about 1,000 nucleotides. They are mainly expressed in sensory neurons of MOEs. It is widely accepted that a single OR allele is expressed in each olfactory sensory neuron, known as the “one neuron—one gene” hypothesis (Mombaerts 2004). Recently, trace amine-associated receptors (TAARs) were demonstrated to be the second class of chemosensory receptors in the MOE (Liberles and Buck 2006). TAARs and ORs share many features, including the gene structure and expression profile (Liberles and Buck 2006), but TAARs and ORs are not coexpressed in any neurons (Liberles and Buck 2006).

Two distinct superfamilies of GPCRs, V1Rs and V2Rs, have been identified as vomeronasal receptors (Dulac and Axel 1995; Herrada and Dulac 1997; Matsunami and Buck 1997; Ryba and Tirindelli 1997). Like ORs and TAARs, V1R genes have intronless coding regions. They are coexpressed with the G-protein subunit Gαi2 in sensory neurons whose cell bodies are located in the apical part of the vomeronasal epithelium (Dulac and Torello 2003; Mombaerts 2004). In contrast, V2Rs are characterized by the presence of a long, highly variable N-terminal domain. They are encoded by multiexon genes expressed in GαO-positive neurons whose cell bodies are located basally in the vomeronasal epithelium (Dulac and Torello 2003; Mombaerts 2004). Neurons expressing V1R and V2R receptors project to the anterior and posterior accessory olfactory bulb, respectively, where they form multiple glomeruli in spatially conserved domains (Dulac and Torello 2003). Interestingly, the four olfactory-system-related gene families (OR, TAAR, V1R, and V2R) are not evolutionarily related, although all of them belong to GPCRs.

In mammals, T1Rs and T2Rs have been identified as sweet/umami and bitter taste receptors, respectively. In additional to the distinct physiological functions, T1Rs and T2Rs also differ in expression pattern and molecular structure. Multiple T2Rs are coexpressed in individual cells that also express α-gustducin, a G-protein subunit (Adler et al. 2000; Nelson et al. 2001; Behrens et al. 2007). T1Rs, on the other hand, are not coexpressed with α-gustducin. T1R1 and T1R2 are expressed in distinct taste receptor cells, but they are always coexpressed with T1R3. Consistent with the expression feature, T1R3 forms a heteromeric receptor with T1R1 to detect L-amino acids and monosodium L-glutamate, which is the taste of umami, or combines with T1R2 to broadly respond to sweet tastants (Nelson et al. 2001; Li et al. 2002). Different from T1R genes, which contain multiple introns, T2R genes are intronless in the coding region. T1R proteins are characterized by a long N-terminal extracellular domain, whereas T2Rs have a short N-terminal domain (Hoon et al. 1999; Adler et al. 2000; Meyerhof 2005).

Chemoreception varies substantially among vertebrates, probably because of the tremendous diversity of chemical stimuli in the external environments of various species. We summarize in this review recent comparative genomic studies on the variation of gene number and gene sequence of chemoreceptor gene families among vertebrates. We also discuss the genetic mechanisms and evolutionary consequences of these features with an emphasis on the adaptive diversification of chemosensory receptor genes.

2 2 Chemoreceptor Gene Families in the Main Olfactory System

2.1 2.1 The OR Gene Family — the Largest Gene Family in Mammals

The OR gene family is known to be the largest gene family in the vertebrate genome. Since the original discovery in 1991 (Buck and Axel 1991), OR genes have been partially cloned from many vertebrates (Mombaerts 1999). The first near-complete OR gene repertoire was not unveiled until 2001, when the draft human genome sequence became available (Glusman et al. 2001; Zozulya et al. 2001). Since then, the complete OR gene repertoires have been characterized in several major vertebrate lineages for which the genome sequences are available, including teleosts (pufferfish, fugu, and zebrafish), amphibians (frog), birds (chicken), and mammals (human, mouse, rat, dog, cow, opossum, and platypus) (Alioto and Ngai 2005; Niimura and Nei 2005, 2006, 2007; Grus et al. 2007). As shown in Table 1, the number of functional OR genes varies greatly among species, ranging from 44 genes in fugu to over 1,200 genes in rat (Alioto and Ngai 2005; Niimura and Nei 2005, 2007). The gene number is substantially smaller in fishes than in birds and mammals, while that of the amphibian frog appears to be in the middle. The largest known fish functional OR gene repertoire is in zebrafish, with at least 102 genes (Alioto and Ngai 2005; Niimura and Nei 2005). However, this number is still much lower than that in mammals, even when we consider only human and platypus, two mammals that are believed to have reduced main olfactory sensitivity (Niimura and Nei 2006; Grus et al. 2007). Variation in the number of functional OR genes also exists among species of the same class. In mammals, the smallest numbers are 387 and 262, found in human and platypus, respectively (Young and Trask 2002; Niimura and Nei 2003, 2007 Malnic et al. 2004; Grus et al. 2007), while the largest numbers are 1,207 and 1,188, for rat and opossum, respectively (Niimura and Nei 2007).

Table 1 Sizes of chemosensory receptor gene repertoires in vertebrates

Phylogenetic analysis helps us understand the evolutionary history and mechanism of the extraordinary diversity of the vertebrate OR gene family. The phylogenetic tree shows that the OR gene family can be classified into two groups, type 1 and type 2 (Fig. 1a). The divergence of these two types predated the split between jawed vertebrates and jawless vertebrates (Fig. 1b). The phylogenetic analysis revealed at least nine ancestral OR genes (or gene lineages) in the most recent common ancestor (MRCA) of fishes and tetrapods (Niimura and Nei 2005) (Fig. 1b). Eight of the nine ancestral gene lineages have been maintained in fishes (Fig. 1b), probably because the extant teleosts share a similar environment with the MRCA of fishes and tetrapods. By contrast, only two ancestral gene lineages have been retained in mammals or birds, but these gene lineages have expanded extensively in mammals and birds, giving rise to the largest gene family in the mammalian genome. By contrast, these two gene lineages, although present in today's fishes, have not expanded in evolution (Fig. 1b). On the other hand, four gene lineages that are present in fishes have been lost completely in mammals and birds (Fig. 1a,b). Taken together, these observations show that the long-term evolutionary dynamics of OR genes follows the “birth-and-death” process, characterized by frequent gene duplication and gene loss (Nei et al. 1997). The large amount of turnover of OR genes in vertebrate evolution probably reflects the functional requirement for different olfactory abilities in different evolutionary lineages. This view is further supported by the fact that frogs have both mammal-like and fish-like OR genes (Niimura and Nei 2005).

Fig. 1
figure 1_4

Evolution of vertebrate odorant receptors (ORs). a Condensed phylogenetic tree for 310 functional OR genes from fishes, frogs, chickens, and humans at the 70% bootstrap level. The nine major clades are labeled α, β, γ, δ, ε, ζ, η, θ, and κ. Human class I genes are within the α clade and human class II genes are within the γ clade. Open circles and closed circles at nodes represent branches with bootstrap values greater than 90% and greater than 80%, respectively. b Evolutionary dynamics of vertebrate OR genes. There are at least nine ancestral genes in the most recent common ancestor (MRCA) of fishes and tetrapods. Fishes maintain eight of nine ancestral genes, whereas mammals contain only two of them. c OR gene gains and losses in mammals. The numbers with plus and minus signs for each branch indicate the numbers of gene gains and losses, respectively, and the numbers in rectangular boxes denote the functional OR genes for the extant or ancestral species. MYA million years ago. (a, c Modified from Niimura and Nei 2003. b Modified from Niimura and Nei 2007)

Many OR gene gains and losses have also been observed within mammals. Comparative genomic and phylogenetic analyses show that gene family expansions occurred independently in monotremes, marsupials, and placental mammals. Consequently, many lineage-specific genes are observed in today's mammalian genomes (Fig. 1c). The largest gene family expansion occurred in the marsupial lineage, with at least 750 gene gains. Similarly, more than 400 genes were gained in the cetartiodactyl and rodent lineages (Fig. 1c). On the other hand, the number of gene losses in the primate lineage is much greater than that in other lineages (Niimura and Nei 2007). As shown in Fig. 1c, since the human—mouse split, 385 ancestral genes have been inactivated in the human lineage, while only 55 new OR genes have been acquired. In the mouse lineage, the two numbers are 277 and 623, respectively. These gene gains and losses explain the dramatic difference in OR gene family size between human and mouse. Furthermore, even when the gene number is similar between two species, the gene content may differ, owing to rapid gene gains and losses (Niimura and Nei 2007).

OR gene number variation also exists among closely related species. Rouquier et al. (2000) found a higher fraction of OR pseudogenes in ten primates than in mice. A subsequent study by Gilad et al. (2004), based on an analysis of 100 orthologous OR genes in 19 nonhuman primate species, found that the percentage of OR pseudogenes is significantly higher in humans, apes, and Old World monkeys than in most New World monkeys and mice (Gilad et al. 2004). Interestingly, in the howler monkey, the only New World monkey with a full trichromatic vision as humans, apes, and Old World monkeys have, approximately 30% of the genes are pseudogenes, similar to the number in Old World monkeys. On the basis of this result, the authors suggested that the loss of OR genes in humans, apes, and Old World monkeys is a result of the acquisition of trichromatic vision. On the basis of the analysis of 50 orthologous OR genes, Gilad et al. (2003) suggested that humans also have fewer functional genes and more pseudogenes than chimpanzees, which was later substantiated by a genome-wide comparison of human and chimpanzee OR repertoires (Gilad et al. 2005). More recently, the variation of OR gene number was also identified among different human individuals, which is known as the copy number variation (Trask et al. 1998; Wong et al. 2007). Interestingly, the level of interspecific divergence relative to that of intraspecific variation in OR gene copy number is not significantly different between functional genes and pseudogenes, suggesting that human intraspecific and human—chimpanzee interspecific OR gene number variations may not have any fitness consequence (Nozawa et al. 2007; Zhang 2007).

In addition to the vast variation in gene family size among vertebrates, the remarkable diversity of ORs is also reflected in the high sequence (and potentially functional) variation among alleles found within species. For example, in humans, pygmy populations tend to have higher frequencies of intact alleles than Caucasians in 32 OR genes examined (Gilad and Lancet 2003). A further study of 51 human OR loci in 189 ethnically diverse individuals reached the same conclusion and suggested that different evolutionary forces may have shaped the OR repertoire in different human populations (Menashe et al. 2003). Similarly, great allelic diversity was found in different mouse strains and dog breeds, respectively (Zhang et al. 2004; Tacher et al. 2005). In mice, single nucleotide polymorphisms (SNPs) were counted by comparing two mouse genome sequences, which were derived from different strains (Zhang et al. 2004). It was estimated that there are 2.68 SNPs per OR gene coding region, about twice that in other mouse GPCR genes (Zhang et al. 2004). In dogs, a high level of allelic variation among 20 different breeds was observed in a survey of 16 OR genes among 95 individuals. All genes were found to have SNPs and 50% of SNPs are nonsynonymous. More interestingly, some SNPs are breed-specific and they may be the basis of breed-specific olfactory sensitivity (Tacher et al. 2005).

2.2 2.2 The TAAR Gene Family — the Second Class of Olfactory Receptor Gene Family

TAAR genes were initially identified to respond to trace amines in rodents and were later shown to be chemosensory receptors in the MOE (Borowsky et al. 2001; Liberles and Buck 2006). Gloriam et al. (2005) performed the first genome-wide investigation in zebrafish and identified 57 intact TAAR genes, which is almost 10 times the number in human. More recently, a comprehensive scan of ten vertebrate genomes found a large variation in the size of this gene family among vertebrates (Hashiguchi and Nishida 2007). In sharp contrast to the OR gene family, which is larger in tetrapods than in fishes, the TAAR gene family is smaller in tetrapods than in some fishes such as zebrafish and stickleback (Table 1). The largest TAAR gene repertoire, found in zebrafish, has 102 intact genes, whereas the smallest repertoire, in chicken, has only three intact genes (Hashiguchi and Nishida 2007). The comparative genomic and phylogenetic analyses suggested that the large gene repertoires in some fishes are attributable to the genome duplication in teleosts followed by additional gene duplications (Hashiguchi and Nishida 2007). By contrast, in frog, chicken, and the majority of mammals, the TAAR family lost several ancestral genes but gained virtually no new members. Opossum, cow, mouse, and rat are exceptions, with some gene duplications (Fig. 2). These findings suggest that the TAAR gene family is also subject to the birth-and-death evolutionary process observed in the OR gene family and that biogenic amine odorants are more important for fish than for tetrapods (Hashiguchi and Nishida 2007).

Fig. 2
figure 2_4

Phylogenetic tree of 268 trace amine-associated receptor genes identified from ten vertebrates. The tree was reconstructed by the maximum-likelihood method using nucleotide sequences. Bootstrap percentages greater than 50 are shown on interior branches. White dots on nodes indicate the MRCA of fishes and tetrapods. (Modified from Hashiguchi and Nishida 2007)

3 3 Vomeronasal Receptor Gene Families

The vomeronasal system is present in most tetrapods (amphibians, reptiles, and mammals), but is absent in fishes (Dulac and Torello 2003). Thus, the nomenclature of the fish chemosensory receptors that are homologous to mammalian vomeronasal receptors has been confusing. On the basis of sequence homology, some authors termed them “V1R-like” and “V2R-like” genes (Hashiguchi and Nishida 2005, 2006; Pfister and Rodriguez 2005; Pfister et al. 2007), while some authors separately designated them as “olfactory receptor A family GPCR” and “olfactory receptor C family GPCR” by considering their expression pattern and phylogenetic position in the GPCR family (Alioto and Ngai 2006; Saraiva and Korsching 2007). Here we use the former terminology for three reasons. First, most, if not all, VR-like genes have been identified by sequence homology and their expression pattern and biological function are usually unknown. Second, although teleost fishes do not have a morphologically distinct vomeronasal organ, they may have a primordial vomeronasal system (Grus and Zhang 2006). Third, the latter nomenclature is also confusing and undistinguishable from that for other chemosensory receptors expressed in the MOE.

3.1 3.1 The V1R Gene Family — the Family with the Highest Among-Species Variation in Gene Family Size

In 1995, V1R genes were first identified in rats by comparative hybridization of complementary DNA libraries from individual VSNs (Dulac and Axel 1995). The first complete V1R repertoire was described for mouse by Rodriguez et al. (2002). In this work, they identified 137 functional V1R genes from the mouse draft genome sequence and subsequently classified them into 12 subfamilies according to protein sequence identity (Rodriguez et al. 2002). In human, the entire V1R repertoire, including functional genes and pseudogenes, has approximately 200 members. However, the functional V1R repertoire is small, with only four open reading frames in most individuals (Rodriguez and Mombaerts 2002; Zhang and Webb 2003). Grus et al. (2005) identified functional V1R genes from five orders of placental and marsupial mammals (Table 1). The intact V1R repertoire size varies by at least 23-fold among mammals with functional VNOs and this size ratio represents the greatest among-species variation in gene family size of all mammalian gene families (Grus et al. 2005). Young et al. (2005) made similar findings. A more recent study found that the platypus, a semiaquatic monotreme, has the largest V1R repertoire characterized to date, with 270 potentially functional genes and 579 pseudogenes. Thus, the functional V1R repertoire size varies by at least 34-fold among mammals with functional VNOs (Grus et al. 2007). This large variation in V1R repertoire size is also observed between closely related species. For example, mouse has 191 functional V1R genes, about 50% more than the number in rat (117) (Shi et al. 2005; Young et al. 2005; Zhang et al. 2007). The number of V1R genes also varies tremendously among nonmammalian vertebrates (Shi and Zhang 2007). No V1R genes were found in chicken, consistent with the fact that birds have neither VNO nor VNO-mediated olfaction (Keverne 1999). A total of 21 functional genes and two pseudogenes was found in the western clawed frog (Shi and Zhang 2007). In contrast to mammals, fishes have highly conserved V1R-like repertoires, containing four genes in two pufferfish species and five genes in zebrafishes, stickleback, and medaka, respectively (Hashiguchi and Nishida 2006; Saraiva and Korsching 2007; Shi and Zhang 2007). Interestingly, the number of intact V1R genes is positively correlated with the morphological complexity of the VNO, suggesting that VNO morphology is a good indicator of vomeronasal sensitivity (Grus et al. 2005).

A phylogenetic analysis of all vertebrate V1Rs suggests that V1R genes can be divided into at least three major clades that diverged from one another before the separation of tetrapods and teleosts (Shi and Zhang 2007) (Fig. 3a). Clade 1 now contains genes from frog and mammals, but was lost in fishes. Clade 2 and clade 3 include frog and fish genes, which are absent in mammals. Major expansions of the V1R gene repertoire occurred in some mammals (clade 1), whereas minor expansions occurred in frog (clades 1 and 2) (Fig. 3a) (Shi and Zhang 2007). Very recently, two very divergent V1R-like genes were found in teleost fishes (Saraiva and Korsching 2007). It appears that the evolutionary diversity of V1R genes in fishes is much larger than that in tetrapods, although the gene family size is smaller in fishes than in tetrapods.

Fig. 3
figure 3_4figure 3_4

Neighbor-joining trees of intact vomeronasal receptors from vertebrates. a The V1R tree. b The V2R tree. The trees were reconstructed with protein Poisson distances. Bootstrap percentages for some major groups are presented. Vomeronasal receptors from the mammals, frogs, and fishes are shown by light grey background, dark grey background, and black background, respectively. (Modified from Shi and Zhang 2007)

In mammals, both gene duplicate and pseudogenization have played important roles in generating the remarkable among-species variation in V1R gene repertoire (Grus et al. 2005; Young et al. 2005). On one hand, substantial numbers of gene duplication events occurred independently in monotremes, marsupials, and placentals, giving rise to platypus-specific, opossum-specific, and placental-specific gene clusters (Grus et al. 2005, 2007) (Fig. 3a). Following the initial gene duplications in the MRCA of placental mammals, additional expansions occurred most prominently in rodents, in which the evolution of the V1R repertoire is characterized by rapid gene turnover and species-specific phylogenetic clustering (Grus and Zhang 2004; Lane et al. 2004; Grus et al. 2005; Shi et al. 2005; Young et al. 2005) (Fig. 3a). Extreme examples include two subfamilies of V1R genes that appear in mouse but not in rat (Grus and Zhang 2004; Shi et al. 2005). On the other hand, human, cow, and dog lost many ancestral V1R genes. In humans, only three of the 12 ancestral family groups were observed. This is also the case in cow and dog, where at least four ancestral family groups are missing (Grus et al. 2005; Young et al. 2005).

3.2 3.2 The V2R Gene Family — Independent Origins of Two Types of V2Rs

The first extensive characterization of any V2R gene repertoire was conducted in 2005 (Yang et al. 2005), 8 years after the initial identification of the gene family (Dulac 1997; Herrada and Dulac 1997; Ryba and Tirindelli 1997; Yang et al. 2005). Comparative genomic studies showed that the across-vertebrate variation in gene number for V2R genes is not lower than that for V1R genes. The largest V2R repertoire is unexpectedly found in frog, with 249 intact genes and 408 disrupted genes. By contrast, no intact V2R genes are found in chicken, cow, dog, and human. In addition, 4, 18, 44, 90, 120, and 70 intact V2R genes are present in the green spotted pufferfish, fugu, zebrafish, opossum, mouse, and rat, respectively (Shi and Zhang 2007; Young and Trask 2007) (Table 1).

In comparison with the V1R gene repertoire, the V2R repertoire is frequently lost in terrestrial vertebrates. There are at least three independent losses of the entire V2R repertoire in chicken, human, and cow/dog, respectively, which is consistent with the loss of certain morphological features of the VNO (Shi and Zhang 2007). By contrast, the V2R gene repertoire expanded in teleosts with prominent patterns of lineage-specific gene amplifications (Alioto and Ngai 2006; Hashiguchi and Nishida 2006; Shi and Zhang 2007) (Fig. 3b). The V2R repertoire also expanded in rodents and opossums and exhibit the characteristics of rapid gene turnover and species-specific gene clustering (Yang et al. 2005; Young and Trask 2007), as seen in V1Rs. Interestingly, the V2R gene family tree has an unique branching pattern, where not all V2R genes cluster in one monogenetic clade. The phylogenetic analysis shows that V2Rs of family C (also termed V2R2 subfamily) are quite different from those of families A and B and are evolutionarily closer to Ca2+-sensing receptors than to V2Rs of families A and B. This observation suggests that family C and families A and B had independent origins (Yang et al. 2005) (Fig. 3b). This evolutionary history may explain the differences in expression pattern and transport mechanism between the two types of V2Rs and suggests that family C V2Rs may be functionally distinct from those of families A and B (Yang et al. 2005; Young and Trask 2007).

3.3 3.3 Diversity of Protein Families Interacting with Vomeronasal Receptors

In mice, there are two gene families that are known to function in concert with V2Rs. One of them is the M10 family of major histocompatibility class Ib molecules, which appear to function as escort molecules in the transport of some V2Rs to the cell membrane of vomeronasal sensory neurons (Loconto et al. 2003). The second is the exocrine gland peptide (ESP) family, which can activate the V2R-expressing vomeronasal sensory neurons and have been suggested to be ligands of some V2Rs (Kimoto et al. 2005). A recent study compared these two gene families in 11 vertebrates and found them to have been coevolving with V2Rs (Shi and Zhang 2007). Consistent with the absence of V2R genes in dog, cow, and human, neither M10 nor ESP genes are found in these species. Unexpectedly, however, M10 and ESP genes are not found in the opossum genome, despite the presence of numerous intact V2R genes, suggesting that the requirement of M10 molecules for the transport of some V2Rs to cell membranes is probably a rodent-specific phenomenon and the use of ESPs as potentially V2R-recognizing pheromones is also rodent-specific (Shi and Zhang 2007). Interestingly, these two gene families share with the V2R family the rapid birth-and-death evolutionary pattern. Very recently, major urinary proteins were identified as V2R-recognizing pheromones in mice and the major urinary protein family size was found to covary with the V2R family size across vertebrates (Chamero et al. 2007).

A similar story can be told for the transient receptor potential channel C2 (TRPC2) gene, which encodes an ion channel indispensable for vomeronasal signal transduction. TRPC2 is absent in the catarrhine primates (humans, apes, and Old Word monkeys), which possess only vestigial VNOs and have no or significantly reduced ability of pheromone detection (Liman and Innan 2003; Zhang and Webb 2003). Consistently, the majority of V1R genes and all V2R genes have disrupted open reading frames in catarrhine primates (Zhang and Webb 2003; Shi and Zhang 2007; Young and Trask 2007). Similarly, the lack of the TRPC2 gene and V1R and V2R genes is observed in chicken, reflecting the ancient loss of the VNO in birds. Conversely, the TRPC2 open reading frame is maintained in all vertebrates known to have functional vomeronasal receptors (Grus and Zhang 2006).

4 4 Taste Receptor Gene Families

4.1 4.1 T2R Gene Family — the More Variable Group of Taste Receptors

Taste receptor genes were the last chemoreceptor genes to be isolated. In 2000, the T2R gene family (also known as TRBs or Tas2Rs) was identified and two mouse T2R genes were shown to be bitter taste receptors (Adler et al. 2000; Chandrashekar et al. 2000; Matsunami et al. 2000). To date, the complete T2R gene repertoires have been described in mammals, birds, amphibians, and some fishes (Conte et al. 2002, 2003; Go 2006; Shi and Zhang 2006). In addition, a small number of T2R genes have also been described in several nonhuman primates (Parry et al. 2004; Wang et al. 2004; Fischer et al. 2005; Go et al. 2005). As shown in Table 1, the T2R gene repertoire varies extremely among vertebrates, ranging from three genes in chicken to 50 genes in amphibians (Go 2006; Shi and Zhang 2006). This observation is consistent with the fact that bitter taste perception, as a mechanism of guarding against the ingestion of toxins, varies enormously among vertebrates that have different diets and environments. Most interestingly, the comparative genomic analysis shows that the size of the gene family appears to be positively correlated with the number of bitter toxins that an organism is likely to encounter (Shi and Zhang 2006). Omnivorous mammals tend to have the largest T2R gene repertoires and the lowest fractions of T2R pseudogenes, probably because they consume both animal and plant tissues and consequently encounter more toxic compounds than herbivorous and carnivorous mammals do. By contrast, carnivores have a small number of functional T2R genes than herbivores, because animal tissues contain fewer toxins than plant tissues do (Shi and Zhang 2006). Cow was found to have the highest proportion of T2R pseudogenes (44%), suggesting that detecting poisons in the diet is not as important in ruminants as in other animals, probably owing to the detoxification role of cow's rumen microbes (Shi and Zhang 2006). These hypotheses need to be scrutinized in more mammalian species.

A phylogenetic analysis of all vertebrate T2R genes suggests that there were multiple T2R genes in the common ancestor of tetropods and teleosts, because T2R genes from teleost fishes do not cluster into one monophyletic clade (Fig. 4a). In addition, the overall evolutionary pattern of vertebrate T2R genes follows the birth-and-death process, similar to that observed in several other chemoreceptor gene families. Specifically, the T2R gene repertoire expanded considerably in the common ancestor of tetrapods, followed by additional independent expansions in frogs and mammals and contraction in chicken (Go 2006; Shi and Zhang 2006). The comparative genomic analysis of human and mouse T2R genes shows that some T2R genes exhibit one-to-one orthologous pairing, whereas others form species (lineage) specific clusters, in which the genes from the same species cluster together in the phylogenetic tree. These species-specific genes are the results of tandem gene duplications and are probably used for detecting species-specific bitter tastants (Shi et al. 2003). One-to-one orthologous genes were found to be subject to stronger selective constraints than species-specific genes, suggesting that each of the one-to-one orthologous genes is possibly detecting one or several distinct bitter compounds that are encountered by a wide range of animals (Shi et al. 2003). This still requires further verification by functional analysis of the receptors, although two recent evolutionary studies (Go 2006; Shi and Zhang 2006) that extended the study of T2Rs to nine additional vertebrate species supported the hypothesis.

Fig. 4
figure 4_4figure 4_4

Phylogenetic relationships of vertebrate intact T1R and T2R genes. a The T1R tree. b The T2R tree. The tree was reconstructed by the neighbor-joining method with protein Poisson distances. Bootstrap percentages greater than 50 are shown on interior branches. Groups A, B, and C in the T2R tree were previously defined on the basis of the tree of human and mouse T2R genes (Shi et al. 2003). The arrow indicates the root of the tree, which was determined by using vertebrate V1R genes as outgroups for T2R genes, and using V2R genes as outgroups in the T1R tree. (Reprinted from Shi and Zhang 2006, copyright 2006, with permission from Oxford University Press)

Comparative analysis of the T2R gene family between several species of primates and rodents revealed that the genes were under reduced selective constraints in primates compared with rodents (Parry et al. 2004; Wang et al. 2004; Fischer et al. 2005; Go et al. 2005). The proportion of pseudogenes in the T2R repertoire is lower in mice (15%) than in apes (21–28%), which is in turn lower than that in humans (31%) (Fischer et al. 2005; Go et al. 2005). The prevalence of lineage- or species-specific pseudogenes in primates further supports this conclusion (Go et al. 2005). In addition, the ratio of nonsynonymous to synonymous substitution rates for T2R genes is lower in rodents than in primates (Wang et al. 2004; Fischer et al. 2005; Go et al. 2005). The most likely explanation is that primates have reduced bitter taste needs owing to changes in the environment and diet (Go et al. 2005). Actually, some ecological studies support this explanation. For instance, meat accounts for 2–13% of diet in chimpanzees, whereas it has never been found in the diet of other apes (Wang et al. 2004). Furthermore, there were significant changes in human diet, such as decreased intake of plant tissues and the controlled use of fire to detoxify food (Wang et al. 2004). Both factors may have caused a reduction in the importance of bitter taste and consequently triggered a functional relaxation on T2Rs in humans, as has been observed (Wang et al. 2004).

4.2 4.2 The T1R Gene Family — the More Conserved Group of Taste Receptors

In 1999, two GPCR genes that are now named T1R1 and T1R2 were described in subsets of taste receptor cells (Hoon et al. 1999). The third T1R gene, T1R3, was identified almost simultaneously by six groups in 2001 (Kitagawa et al. 2001; Max et al. 2001; Montmayeur et al. 2001; Nelson et al. 2001; Sainz et al. 2001; Zhao et al. 2003). In contrast to the T2R gene family, the T1R family is evolutionary conserved in both gene family size and gene sequence (Shi and Zhang 2006). In terms of the family size, the number of T1R genes is virtually constant across mammals, which might reflect the necessity of both sweet and umami tastes among mammals (Shi and Zhang 2006). But the number of T1R genes varies in some nonmammalian vertebrates, including gene duplications observed in pufferfish and stickleback and gene losses found in western clawed frog and chicken (Shi and Zhang 2006; Hashiguchi et al. 2007). Interestingly, the western clawed frog does not have any T1R genes, although it has many T2R genes. In chicken, the T1R2 gene is missing (Shi and Zhang 2006) (Fig. 4b). In addition, cats and closely related carnivores are also known to miss the T1R2 genes, which is likely the cause of the insensitivity to sweet tastants in these species (Li et al. 2005). Thus, pseudogenization of T1R2 happened multiple times in evolution.

At the protein sequence level, T1R genes evolve more slowly than T2R genes at both interspecific and intraspecific levels. For interspecific comparison, sequence divergence among orthologs is significantly lower for T1R genes than for T2R genes when human, mouse, rat, and opossum genes were compared (Shi and Zhang 2006). Within human populations, the mean pairwise difference per nucleotide between sequences of T1Rs is also lower than that for T2Rs (Kim et al. 2005, 2006).

In sum, the contrasting evolutionary modes between T1R and T2R gene families suggest the relative constancy in the number and type of sweet and umami tastants encountered by various vertebrates or low binding specificities of T1Rs but a large variation in the number and type of bitter compounds detected by different species.

5 5 Adaptive Diversification of Chemoreceptor Gene Repertoires

Why do the chemosensory receptor gene families vary so much among vertebrates? One potential answer is the variable functional requirements for different species to adapt to their specific environments. Here we summarize evidence for the adaptive hypothesis at three levels: (1) gene family, (2) newly duplicated paralogous genes, and (3) intraspecific variation.

Adaptive evolution at the gene family level may be detected by comparing the gene repertoires of nasal chemosensory receptors in terrestrial vertebrates with those in aquatic vertebrates because terrestrial vertebrates tend to encounter volatile chemicals, while aquatic vertebrates encounter water-soluble chemicals. A recent analysis of V1R, V2R, and OR gene families in several vertebrate genomes showed that the ratio of the number of intact V1R genes to that of intact V2R genes increased by approximately 50 fold in the evolutionary transition from water to land. Note that circumstantial evidence suggests that V1Rs tend to recognize airborne molecules, while V2Rs tend to recognize water-soluble ligands. The comparison of the number of class II ORs to that of class I ORs, which have been suggested to bind to volatile and water-solvable molecules, respectively, also shows a similar pattern of change during the evolutionary transition of vertebrates from aquatic to terrestrial environments. By contrast, a comparison of pairs of randomly chosen gene families from the zebrafish and mouse genomes does not show such dramatic changes, indicating that the observation made in the nasal chemoreceptor genes is unlikely caused by random gene turnovers (Shi and Zhang 2007). Rather, the two nasal chemosensory systems appear to show a consistent pattern of a shift from receptors for water-soluble molecules to those for volatiles in the vertebrate transition from water to land, reflecting a rare case of adaptation to terrestrial life at the gene family level.

Gene duplication is believed to be the primary source of new genes with novel functions (Zhang 2003). In chemoreceptor gene families, gene duplication occurs frequently and the newly generated genes may acquire the ability to recognize new ligands, which could increase an organism's fitness. As expected, analysis of ORs, V1Rs, V2Rs, T1Rs, and T2Rs revealed positive selection acting on newly duplicated genes in most vertebrate lineages (Hughes and Hughes 1993; Mundy and Cook 2003; Shi et al. 2003, 2005; Emes et al. 2004; Alioto and Ngai 2005, 2006; Yang et al. 2005; Shi and Zhang 2006). More interestingly, positive selection tends to happen in potential ligand-binding regions. For instance, the analysis of eight closely related human T2R genes suggested positive selection in extracellular domains, while purifying selection in transmembrane and intracellular domains (Shi et al. 2003). Although no crystal structure of T2Rs has been solved, existing functional data suggest that extracellular domains of T2Rs are involved in binding to ligands (Soranzo et al. 2005). Positive selection was also detected in extracellular domains in V1Rs, which are most closely related to T2Rs in sequence (Mundy and Cook 2003; Shi et al. 2005). These results contrast the majority of positively selected sites in ORs, which are located in transmembrane domains that are thought to be the binding pocket in ORs (Emes et al. 2004). For the V2R and T1R gene families, most positively selected sites are mapped to the long N-terminus, which is believed to be involved in heterodimerization or homodimerization and ligand binding (Yang et al. 2005; Shi and Zhang 2006). Together, these positive selection analyses suggest that newly generated chemoreceptor genes tend to be subject to diversifying selection, probably because of the ability to recognize a diverse array of chemicals that the animals encounter in exploring new habitats and foods.

As described above, sequence variation of chemoreceptor genes is prevalent even within species. If these polymorphisms affect an individual's fitness such as mate selection and sibling sustenance, some of them may be under positive selection. There is now evidence supporting this possibility. For example, Zhang et al. (2004) compared V1R gene sequences from two mouse draft sequences which were derived from different inbred mouse strains. They found a high ratio of nonsynonymous to synonymous substitutions, a possible result of positive selection on these genes. In humans, most chemoreceptor genes are under relaxed selective constraints. However, positive selection has been detected in the human T2R16 gene, which encodes a β-glucopyranoside receptor (Bufe et al. 2002). By analyzing the sequences from different human populations, Soranzo et al. (2005) detected signatures of positive selection on a derived allele, which was found in all human populations except Africans. Compared with the ancestral allele, the derived allele exhibits increased sensitivity in detecting β-glucopyranoside. It was suggested that the derived allele may provide better protection against harmful cyanogenic plant foods and natural toxins (Soranzo et al. 2005). Another interesting case is the T2R38 gene, which is largely responsible for the human polymorphism in tasting phenylthiocarbamide (Kim et al. 2003). Balancing selection was suggested to maintain both taster and nontaster T2R38 alleles in human populations (Wooding et al. 2004), although a subsequent analysis found the evidence for balancing selection unconvincing (Wang et al. 2004). Interestingly, chimpanzees are also known to have tasters and nontasters of phenylthiocarbamide, but the nontaster allele is apparently a null allele (Wooding et al. 2006).

6 6 Conclusions

The hallmark of vertebrate chemoreceptor gene family evolution is the extremely high diversity of gene family size and gene sequence among species. The general genetic mechanisms involved in generating this pattern include frequent gene duplication and pseudogenization, conforming to the “birth-and-death” process (Nei et al. 1997). Adaptation to changing environments and diets is likely the major selective force behind this evolutionary process, in addition to the random factor of genomic drift (Nei 2007). Although evolutionary and genomic studies have resulted in enormous advances in this field in the last several years, many fundamental questions are yet to be answered (Mombaerts 2004; Meyerhof 2005). It is expected that evolutionary analysis, coupled with functional assays of chemoreceptors, will yield useful information on the molecular mechanisms and selective forces behind vertebrate chemoreceptor gene diversification.