Introduction

The evolution of reptilian venoms has previously been inferred to have occurred at the base of a strongly supported clade termed the Toxicofera (Anguimorpha, Iguania and Serpentes) (Fry et al. 2006; Vidal and Hedges 2005; Wiens et al. 2012). Whilst nuclear gene sampling has been unable to resolved the relative relationships within this clade, the use of SINES has been useful (Piskurek et al. 2006). Critical to this conclusion of a single early evolution of reptile venom was evidence that toxin genes sampled from a variety of toxicoferan taxa demonstrated monophyly of lizard and snake toxins to the exclusion of non-venom homologues and that in some cases snake and lizard genes were not reciprocally monophyletic (Fry et al. 2006, 2010a). One limitation to this approach is the potential influence incomplete sampling of gene homologues (i.e. both venom and non-venom duplicate genes) could have upon phylogenetic analyses—potentially resulting in gene trees that falsely depict toxin genes as monophyletic (cf. Casewell et al. 2012). However, the incorporation of recently sampled non-venom gene homologues into toxin family gene trees provided further support for the single early origin hypothesis, with character reconstruction analyses supporting the premise that reptile venom evolved at the base of the Toxicofera (Casewell et al. 2012).

Historically, the two members of the genus Heloderma were considered the only venomous lizards. However, recent discoveries on the origins of venom and associated structures in snakes and lizards have however, led to a paradigm shift in our understanding of the evolution of squamate venom systems (Vidal and Hedges 2005; Fry et al. 2006, 2009a, b, 2010a, b, 2012). The current data suggest that venom in reptiles appeared approximately 170 million years ago and that all modern venomous reptiles share a common venomous ancestor. As a result, these animals have been collectively placed in the clade Toxicofera. The ancestral toxicoferan reptile possessed both maxillary and mandibular serous dental glands of relatively simple structure. These glands produced pharmacologically active compounds that acted as a substrate for the subsequent evolution of all the reptilian toxins. Iguanian lizards diverged whilst this system was still in a primitive state, and thus their dental glands apparently conferred no ecological advantage and have either been secondarily lost completely or retained in a seemingly incipient condition. The remainder of the clade subsequently split into two major radiations: the snakes, which evolved complex maxillary glands and in many cases lost the mandibular glands entirely and the anguimorph lizards in which the mandibular glands were retained and diversified whilst the maxillary glands were lost completely in all but one species examined to-date.

As the toxin-secreting systems of anguimorph lizards have received less attention than those of the more medically important venomous snakes, it remains unknown to what extent different toxin types have undergone structural and functional innovation within this major clade. Detailed knowledge of the Anguimorpha toxins is crucial for understanding of the structure, function and composition of the ancestral Toxicofera venom system, as well as for investigation of the parallel diversification of the venom system in the two primary clades of venomous reptiles.

Helodermatid lizards produce clinically complex envenomations with symptoms including extreme pain, acute local swelling, nausea, fever, faintness, myocardial infarction, tachycardia, hypotension and inhibition of blood coagulation (Bogert and del Campo 1956; Bouabboud and Kardassakis 1988; Hooker and Caravati 1994; Strimple et al. 1997; Cantrell 2003; Beck 2005). Venom studies have revealed a great diversity of components present in the venom of helodermatid lizards (Table 1). Previous work by us has shown that all anguimorph lizards share a core toxin arsenal produced by the mandibular toxin-secreting gland (Table 1; (Fry et al. 2006, 2009b, 2010b)).

Table 1 Anguimorpha lizard toxin mRNAs sequences recovered in this study

In our studies to-date, we have shown that Anguimorpha mandibular gland transcripts indeed encode bioactive peptides and proteins, with targets ranging from anti-platelet (type III phospholipase A2) to novel peptides acting upon the cardiovascular system (celestoxin, cholecystokinin, goannatyrotoxin, helokinestatin and natriuretic) (Fry et al. 2006, 2009a, b, 2010a, b). Despite even non-helodermatid anguimorph species being a rich source of novel compounds, within the family Anguidae only three species have previously had their venom systems studied: Gerrhonotus infernalis (terrestrial), Celestus warreni (semi-fossorial) and Pseudopus apodus (fossorial) (Fry et al. 2010b). In particular, arboreal lineages have not been studied to-date. We address this imbalance by conducting in-depth analyses of the transcripts from the mandibular toxin-secreting gland of Abronia graminea, an arboreal generalist predator of both invertebrates and small vertebrates such as skinks. The results provided insights into toxin evolutionary history and the selection pressures that might have shaped them. Of particular focus were the natriuretic and helokinestatin peptides, which uniquely share a precursor, with the helokinestatins being a de novo evolution within the natriuretic precursor.

Materials and Methods

Species Examined

Four A. graminea specimens were field collected by BGF and ACS from Esperance, Mexico under the University of Melbourne Animal Ethics approval number 03126. Animals were placed under surgical level anesthesia using Zoletil and euthanized by decapitation whilst under anesthesia. Glands were then dissected or heads preserved in formalin.

Histology

Histological sections were prepared from one specimen. Whole heads were removed and a cut was made to the underside to allow fast penetration of the fixative (10 % neutral buffered formalin). After a minimum of 2 days excess tissue was removed and the head immersed in Kristensen’s decalcification solution and placed on a rotor for 3 weeks. Before processing the head was bisected longitudinally for cutting transversely, at three microns, in two separate blocks. The processing schedule was: 10 % Formalin 2 h; Absolute ethanol 4 × 1 h; Histolene 3 × 1 h; Paraffin wax 2 × 90 min. The sections were taken every 100 microns stained Masson’s Trichrome stain.

cDNA Library Construction and Analysis

Mandibular toxin-secreting glands of three specimens were pooled and total RNA extracted using the standard TRIzol Plus method (Invitrogen). Extracts were enriched for mRNA using standard RNeasy mRNA mini kit (Qiagen) protocol. mRNA was reverse transcribed, fragmented and ligated to a unique 10-base multiplex identifier (MID) tag prepared using standard protocols and applied to one PicoTiterPlate for simultaneous amplification and sequencing on a Roche 454 GS FLX+ Titanium platform (Australian Genome Research Facility). Automated grouping and analysis of sample-specific MID reads informatically separated A. graminea sequences from the other transcriptomes on the plates, which were then post-processed to remove low quality sequences before de novo assembly using the MIRA software programme. Assembled contigs were processed using CLC Main Work Bench (CLC-Bio) and a variety of other bioinformatic tools to provide Gene Ontology, BLAST and domain/Interpro annotation (Conesa et al. 2005; Conesa and Gotz 2008; Gotz et al. 2008, 2011). The above analyses assisted in the rationalisation of the large numbers of assembled contigs into phylogenetic ‘groups’ for detailed phylogenetic analyses outlined below.

Phylogenetic Analyses

Phylogenetic analyses were performed to allow reconstruction of the molecular evolutionary history of each toxin type for which transcripts were bioinformatically recovered. Toxin sequences were identified by comparison of the translated DNA sequences with previously characterised toxins using BLAST search (Altschul et al. 1997) of the Swiss-Prot/Uni-Prot protein database (http://www.expasy.org/tools/blast/). Molecular phylogenetic analyses of toxin transcripts were conducted using the translated amino-acid sequences. Comparative sequences from other venomous reptiles and physiological gene homologues identified from non-venom gland transcriptomes were included in each dataset as outgroup sequences. To minimize confusion, all sequences obtained in this study are referred to by their Genbank accession numbers (http://www.ncbi.nlm.nih.gov/sites/entrez?db=Nucleotide) and sequences from previous studies are referred to by their UniProt/Swiss-Prot accession numbers (http://www.expasy.org/cgi-bin/sprot-search-ful). Resultant sequence sets were aligned using the programme CLC Mainbench. When presented as sequence alignments, the leader sequence is shown in lowercase and cysteines are highlighted in black. > and < indicate incomplete N/5′ or C/3′ ends, respectively. Datasets were analysed using Bayesian inference implemented on MrBayes, version 3.0b4 (Ronquist and Huelsenbeck 2003). The analysis was performed by running a minimum of 1 × 107 generations in four chains, and saving every 100th tree. The log-likelihood score of each saved tree was plotted against the number of generations to establish the point at which the log-likelihood scores of the analysis reached their asymptote, and the posterior probabilities for clades established by constructing a majority rule consensus tree for all trees generated after the completion of the burn-in phase.

Selection Analyses

In order to test whether natriuretic peptides and helokinestatins were influenced by positive- or negative-selection pressures, we employed the maximum likelihood models of coding-sequence evolution implemented in CODEML in PAML (Yang 2007) programme package version 4, which compares the maximum likelihood estimates of dN and dS across an alignment to a predefined distribution and uses empirical Bayes methods to identify individual positively selected site (Nielsen and Yang 1998).

Site-specific models (Nielsen and Yang 1998) were employed for detecting diversifying selection across the sites in the helokinestatin and the natriuretic propeptide alignments. They statistically detect diversifying selection as a non-synonymous-to-synonymous nucleotide substitution rate ratio (ω) significantly greater than 1. Since no priori expectation exists for the distribution of ω values, we compared likelihood values for three pairs of models with different assumed ω distributions: M0 (constant ω rates across all sites) versus M3 (allows the ω to vary across sites within ‘n’ discrete categories, n ≥ 3); M1a (a model of neutral evolution) where all sites are assumed to be either under negative (ω < 1) or neutral selection (ω = 1) versus M2a (a model of positive selection) which in addition to the site classes mentioned for M1a, assumes a third category of sites; sites with ω > 1 (positive selection) and M7 (Beta) versus M8 (Beta and ω), and models that mirror the evolutionary constraints of M1 and M2 but assume that ω values are drawn from a beta distribution (Nielsen and Yang 1998). The estimations are considered significant, only if the alternative models (M3, M2a and M8: allow sites with ω > 1) show a better fit in Likelihood-Ratio Test (LRT) relative to their null models (M0, M1a and M8: do not show allow sites ω > 1). LRT is estimated as twice the difference in maximum likelihood values between nested models and compared with the χ2 distribution with the appropriate degree of freedom (the difference in the number of parameters between the two models). The Bayes empirical Bayes (BEB) approach (Yang et al. 2005) was used to identify amino acids under positive selection by calculating the posterior probabilities that a particular amino acid belongs to a given selection class (neutral, conserved or highly variable). Sites with greater posterior probability (PP ≥ 95 %) of belonging to the ‘ω > 1’ were inferred to be positively selected.

We further employed a lineage-specific two-ratio model as well as the optimized branch-site test (Yang and Nielsen 2002; Zhang et al. 2005) to assess selection pressures acting upon individual lineages. A LRT was conducted by comparing the two-ratio model that allows omega to be greater than 1 in the foreground branch, with the null model that does not. The branch-site model by comparison, allows omega to vary both across sites of the protein and across branches in the tree and has reasonable power and accuracy to detect short bursts of episodic adaptations (Zhang et al. 2005).

Detection of positive selection using the aforementioned lineage-specific methods requires the foreground (lineage to be tested for positive selection) and background branches (rest of the lineages) to be defined a priori. We used GA-Branch test (Pond and Frost 2005b) implemented in the HyPhy (Pond et al. 2005c) package that does not require this information and works on the principle that there could be many models that better fit the data than a single priori hypothesis and uses a robust multi-model inference to collate results from all models examined and provides confidence intervals on dN/dS for each branch. We also used the branch-site Random-effects likelihood (REL) (Pond et al. 2011) that performs a series of LRTs to identify lineages in the phylogeny that have a proportion of sites evolving under episodic diversifying selection pressures. The more advanced Mixed Effects Model Evolution (MEME) (Pond et al. 2011) was also used to detect episodic diversifying selection. MEME employs Fixed-effects likelihood (FEL) along the sites and REL across the branches to detect episodic diversifying selection. We employed REL, FEL and Single Likelihood Ancestor Counting (SLAC) models (Pond and Frost 2005a) to further provide significant support to the above analyses and to detect sites under positive and negative selection. For clear depiction of the proportion of sites under selection, an evolutionary fingerprint analysis was carried out using the ESD algorithm (Pond et al. 2011) implemented in Datamonkey.

It has been suggested that the maximum-likelihood method of evaluating positive-selection produces false-positive results when no positively selected sites exist (Suzuki and Nei 2004) or when positively and negatively selected sites are mixed (Anisimova et al. 2002). For this reason, further support for the PAML results was sought using a complementary protein level approach implemented in TreeSAAP (Woolley et al. 2003) which measures the selective influences on 31 structural and biochemical amino-acid properties during cladogenesis, and performs goodness-of-fit and categorical statistical tests based on ancestral sequence reconstruction. Amino-acid variability was estimated using the Selecton (Doron-Faigenboim et al. 2005; Stern et al. 2007) web server.

Test for Recombination

Recombination can confound phylogenetic and evolutionary selection analyses (Posada and Crandall 2002). Hence, we evaluated the effect of recombination on natriuretic and helokinestatin domains by employing Single Breakpoint Recombination and Genetic Algorithms for Recombination Detection (GARD) implemented in the Datamonkey server (Pond et al. 2005c, 2006; Delport et al. 2010). Potential breakpoints were detected using the small sample Akaike Information Criterion (AIC) and the sequences were compartmentalized before conducting the aforementioned analyses.

Results

Histology

The mandibular toxin-secreting gland of A. graminea consists of numerous discrete compartments. Similar to those of other anguid lizards examined to-date (Fry et al. 2010b), there is an arrangement of one compartment per tooth (Fig. 1a). These glandular structures are enclosed by basally nucleated secretory epithelial cells that form a large vesicular region along the medial surface of the gland (Fig. 1b). The toxin vesicles appear to be released via exocytosis into the lumen of the toxin-secreting gland (Fig. 1b).

Fig. 1
figure 1

a Masson’s trichrome-stained transverse histology section of A. graminea showing the mixed seromucous lobules. Connective tissues surround individual regions or capsules of the toxin-secreting glands. b 100× view showing that these glandular structures consist of secretory epithelium (SE) with basally nucleated (N) cells that are rich in enlarged vesicles (V) throughout their apical regions. Scale Bar = 5 μm

Toxin Molecular Evolution

BLAST analyses of transcripts recovered from A. graminea toxin-secreting gland cDNA libraries identified seven previously characterized toxicoferan toxins: CRiSP, cystatin, kallikrein, lectin, B-type natriuretic peptide, nerve growth factor, type III phospholipase A2 and vascular endothelial growth factor. Phylogenetic analysis showed that they clustered together with homologous sequences from the mandibular toxin-secreting gland of other anguimorph lizards and also homologous sequences produced by the maxillary venom glands of snakes. The VEGF sequences were an exception, forming a clade with snake similar snake venom gland forms not secreted into the venom rather than with the derived form secreted in snake venoms (Fig. 2).

Fig. 2
figure 2

Molecular phylogeny of VEGF. Outgroups are the non-toxin sequences from Anser anser (P83300) and Rhea americana (P84617)

Key functional residues were conserved for CRiSP and cystatin sequences and CRiSP sequences also conserved the cysteine pattern. The A. graminea lectin sequences contain the EPN tripeptide motif that facilitates hemotoxic activity via the mannose-binding site (Drickamer 1992; Fry et al. 2008, 2010b). Phylogenetically the A. graminea lectin sequences grouped with the sequence from Varanus indicus (which also contains the EPN motif), rather than with the sequence from the more closely related Pseudopus [Ophisaurus] apodus (which contains instead the galactose-binding QPD motif) (Fig. 3). We recovered the first full-length anguimorph lizard PLA2 sequence obtained to-date and this was revealed to have an extraordinarily long pro-peptide domain within the precursor region (Fig. 4). The A. graminea PLA2 also had a long C-terminal tail containing multiple di-basic post-translational cleavage sites previously noted as a characteristic of Anguimorpha lizard PLA2 (Fry et al. 2006, 2009b, 2010b).

Fig. 3
figure 3

Molecular phylogeny of lizard and snake lectin toxins. Outgroups are the non-toxin sequences from Anser anser (P83300) and Rhea americana (P84617)

Fig. 4
figure 4

Sequence alignment of lizard Type III phospholipase A toxins 1. A. graminea (c9); 2. Heloderma suspectum (P16354); 3. Varanus komodoensis (B6CJU9); 4. Heloderma suspectum (P80003); 5. Heloderma suspectum (C6EVH0); 6. Varanus gilleni (E2E4K7); 7. Varanus varius (Q2XXL5); 8. C. warreni (E2E4K8) and 9. Heloderma suspectum (C6EVG9). Dibasic cleavage sites in bold and pro-pep is underlined

We also recovered kunitz peptides for the first time from any lizard oral secretion—this toxin had previously only been recorded from snake venoms. The fragment recovered was of the two-domain type only sequenced once from a snake (Austrelaps labialis) (Doley et al. 2008). For comparison’s sake, we also sequenced a fragment from the mandibular venom gland of Heloderma suspectum cinctum and both anguimorph lizards contained the c-terminal extension (Fig. 5). Phylogenetic analysis recovered the lizard and A. labialis two-domain forms as basally divergent from all the mono-domain forms sequenced from snake venoms, and the snake venom forms were non-monophyletic (Fig. 6).

Fig. 5
figure 5

Sequence alignment of kunitz peptides 1. A. graminea; 2. Heloderma suspectum; 3. B2BS84| A. labialis; 4. Q6T6S5 Bitis gabonica; 5. B5L5M7 Austrelaps superbus; 6. A7X3V4 Telescopus dhara; 7. B2KTG1 Bungarus fasciatus; 8. G9I929 Micrurus tener; 9. A7X3V7 Philodryas olfersii; 10. P24541| Eristicophis macmahoni

Fig. 6
figure 6

Molecular phylogeny of lizard and snake kunitz toxins and related non-toxin sequences

The highest degree of molecular evolution was displayed in the natriuretic transcripts. All A. graminea natriuretic sequences share a pattern along with those from G. infernalis of five helokinestatin variants contained with the precursor region (Fig. 7). The A. graminea sequences also contain a new variant PPPFIPFIP inserted after the third shared domain. This repeat preserves the conserved helokinestatin pattern of PPPxxPxxP, where the x residues are almost invariably hydrophobic (F, I, V, L). The domain encoding the natriuretic peptide shows the same high level of conservation shared with other anguimorph venom forms of this peptide. However, the A. graminea natriuretic peptide sequences share with those of Gerrhonotus the change to glutamic acid (E) from the ancestral key functional residue aspartic acid (D) at ring position 7. This is one of two mutations previously shown by us to greatly reduce the aortic smooth muscle relaxing potency of the Gerrhonotus form (Fry et al. 2010b). The A. graminea sequences also uniquely contained the helokinestatin variant PPPFLPLVPR inserted after the fifth helokinestatin repeat.

Fig. 7
figure 7

Sequence alignment of lizard B-type natriuretic peptide toxins. 1. A. graminea (c648); 2. A. graminea (c448); 3. A. graminea (c731); 4. A. graminea (c776); 5. G. infernalis (E2E4J3); 6. Heloderma suspectum (C6EVG7); 7. Heloderma horridum (E8ZCG5); 8. Heloderma suspectum (D7FB57); 9. Varanus glauerti (E2E4J0); 10. Varanus scalaris (E2E4J1); 11. Varanus komodoensis (B6CJV0); 12. Varanus varius (Q2XXL8); and 13. Varanus glauerti (E2E4J2). Helokinestatin domains are highlighted in light grey, natriuretic domains in dark grey

The assumption of a single phylogenetic history for all sites in a sequence, which is a prerequisite for most phylogenetic analyses, can be violated by recombination which may cause elements with different genetic backgrounds to blend together. Thus, recombinant genes cannot be accurately described by a single/unique tree topology and recombination can influence selection analyses significantly by elevating false positives. Recombination can have more impact on the site-to-site and branch-to-branch omega estimations relative to the global estimates. We screened natriuretic peptide and helokinestatin domains for recombination using GARD and SBP algorithms. GARD failed to identify any regions that underwent significant recombination. SPB however, identified a single potential breakpoint at the 100th position with a model-averaged support of 100 % and an IC (model with two trees) improvement of 79.73 over the base (single tree) model when using the AIC. Subsequent analyses were conducted by compartmentalizing the multiple sequence alignment into two non-recombinant units.

Initial selection analyses of full-length helokinestatin/natriuretic precursor (site-models, branch-models, branch-site models, SLAC, FEL, REL and evolutionary fingerprinting) unanimously confirmed the high degree of conservation, a trend observed in the natriuretic domain of this proprotein in other anguimorph venoms investigated to-date (Tables 2, 3; Fig. 8) (Fry et al. 2006, 2009b, 2010a, b). This was confirmed by selection analyses at the level of the proteome conducted using TreeSAAP which apparently confirmed that none of the 31 biochemical or structural properties of this proprotein were under significant influence of selection.

Table 2 Maximum-likelihood parameter estimates for the helokinestatin and natriuretic peptides
Table 3 Comparison of natriuretic and helokinestatin domain evolution
Fig. 8
figure 8

Evolutionary fingerprinting of natriuretic and helokinestatin propeptide and the test for detection of sites under episodic diversifying selection. a Evolutionary fingerprint: Estimates of the distribution of synonymous (alpha) and non-synonymous (beta) substitution rates inferred for natriuretic and helokinestatin proprotein. The ellipses reflect a Gaussian-approximated variance in each individual rate estimate, and coloured pixels show the density of the posterior sample of the distribution for a given rate. The diagonal line represents the idealized neutral evolution regime (ω = 1), points above and below the line correspond to positive selection (ω > 1) and negative selection (ω < 1), respectively. The legend label shows the omega estimation under the site model M8 (Codeml) and the number of positive and negatively selected residues detected by the integrative analyses (SLAC, FEL, REL and MEME). b GA-branch test: Lineages under different regimes of selection pressures are coloured differently and the accuracy with which they can be ascribed to that regime is denoted above them. The legend labels show the total percentage of the tree with respective dN/dS. c Branch-site REL: The strengths of different selection pressures, namely, positive, negative and neutral are indicated by different hues (red, blue and grey, respectively) with the width of each colour component indicating the proportion of sites in the corresponding class. Branches detected as undergoing episodic diversifying selection by the sequential LRTs at corrected P ≤ 0.05 are denoted by arrows (Color figure online)

Although the branch and branch-site models failed to detect any significant positive selection, the free-ratio model, GA-branch and Branch-site REL tests identified a small proportion of sites within the A. graminea and Heloderma lineages that had undergone significant diversifying selection. In contrast, most of the varanid lizard natriuretic sequences appear to have evolved under negative selection. The observed diversifying selection in the Abronia and Heloderma sequences is a result of the presence of helokinestatin regions within them, which is unique to this clade and absent entirely in the varanid lizard venom form (Fry et al. 2006, 2009b, 2010a, b).

The two-speed rate of evolution was subsequently confirmed by compartmentalizing the sequences into helokinestatin and natriuretic domains and performing selection analyses in CODEML which revealed that the former exhibits weaker evolutionary constraints than the latter (ω = 0.60 and 0.30, respectively). Furthermore, investigations identified sites under the influence of positive Darwinian selection in the helokinestatin domain, in comparison with the natriuretic domain that largely remains under negative selection (Table 3). All the positively selected sites in the helokinestatin/natriuretic precursor detected by SLAC, FEL and REL came only from those that specifically encode the helokinestatin domain. On the other hand, 27 % of the sites in natriuretic peptides were under negative selection in comparison with the 5 % of the total residues of the helokinestatin domain. Out of the four codons detected by MEME test as under the influence of episodic diversifying selection (significance 0.05), three belonged to the helokinestatin domain whilst only one belonged to the natriuretic domain, further providing evidence for the differential selection pressures acting on the two domains of the proprotein (Table 3). Assignment of sites into variable and invariable categories using the selecton web server revealed a greater number of variable codons in the helokinestatin domain than the natriuretic domain (21.36 and 12.19 % of total codons, respectively). In addition, the helokinestatin domain contained fewer sites evolving under constrained selection pressures than the natriuretic domain (27.27 and 41.46 % of total codons, respectively—Table 3).

It should be noted that some varanid lizard lineages, namely Varanus komodoensis and one of the hypothetical ancestral clades (node 15: Fig. 8b and node 14: Fig. 8c), also exhibit a small proportion of sites with significant diversifying selection. This indicates that the positive evolutionary selection pressures act on the natriuretic peptide domain, a phenomenon previously observed in other toxicoferan venom components (Fry et al. 2006, 2009b, 2010a, b; Sunagar et al. 2012) and presumed to be the result of predator–prey co-evolutionary arms races. This could also be indicative of an active role for these components in the venom arsenal of these species.

Discussion

A ‘use it or lose it’ evolutionary trajectory has been documented previously in toxicoferan reptiles. Secondary loss or reduction of the venom system has been documented in several lineages of snakes including within highly toxic elapid clades (Li et al. 2005a, b; Fry et al. 2008, 2009a, b, Fry et al. 2010b, 2012). Thus, the presence of robust glands and evidence of active molecular evolution of some of the encoded proteins suggests that the toxin-secreting system is biologically and ecologically relevant for A. graminea. The A. graminea toxin-secreting glands examined in this study exhibited signs of active secretion, with cells full of secretory granules. In addition, the molecular evolution of A. graminea toxins displayed varying rates of evolution between different protein types, which provides further evidence that these components are biologically relevant and are ‘seen’ by active selection pressures. Some molecular derivations appear lineage-specific. For example addition to the ancestral kallikrein cysteine arrangement, a form was found in the A. graminea library that has only ever been recovered from the mandibular toxin-secreting glands other anguid lizards (C. warreni and G. infernalis—(Fry et al. 2010b). (This anguid-specific isoform includes a deletion resulting in the loss of 12th ancestral cysteine and the insertion of a new cysteine in between ancestral cysteines 9 and 10. We also showed for the first time that the kunitz peptide toxin type is ancestral to all toxicoferan reptiles, as evidenced by the snake sequences being non-monophyletic relative to the lizard sequences.

The helokinestatin encoding domain which has evolved de novo within the natriuretic precursor pro-pep region displayed the most active molecular evolution of any toxin transcript recovered in this study. Several sequences of this toxin type possessed sites undergoing significant episodic diversifying selection, which may be indicative of the active role of this component in the toxic arsenal of A. graminea. Thus, it would be quite interesting for future studies to assess the effect of the presence/absence of helokinestatins on the pharmacological activity of the oral secretions. In contrast, the natriuretic peptide encoding domain located just downstream of this element exhibits tremendous conservation. This shows that even very closely related toxins can undergo differential selection pressures. The natriuretic peptide region highlights the usefulness of toxins in understanding the structure–function relationships of the ancestral body form. We previously showed that the change from D (glutamic acid) to E (aspartic acid) at ring position 7 and I (isoleucine) to V (valine) at ring position 9 both resulted in significant reduction in the potency of action upon vascular smooth muscle (Fry et al. 2010b). Both changes would normally be considered as conserved substitutions as they are negative/negative and hydrophobic/hydrophobic, respectively. In both cases however, whilst the substitutions have essentially the same chemical properties, they result in steric differences. This mirrors and reinforces the exquisite subtlety of venom peptide targeting. The extraordinarily long pro-pep region of the A. graminea PLA2 may ultimately be shown to contain also post-translationally liberated peptides, as has been demonstrated for some of the other precursors with lengthy pro-pep regions such as the helokinestatin/natriuretic precursors.

The NGF and VEGF sequences were interesting in their relationships to non-toxin sequences. The NGF sequences also seem to be under profound evolutionary constraints. Intriguingly, the A. graminea and snake venom gland transcripts of this protein are virtually identical to the G. infernalis (E2E4J3) and Heloderma suspectum (C6EVG7) nuclear gene sequences obtained for use in taxonomical studies (Wiens et al. 2010). This indicates that the NGF expressed in venom may be the same gene as is used in the body and therefore may be a rare case of a venom protein resulting from a non-duplicated gene. The VEGF sequences reinforce the value of phylogenetic analyses to establish homology, as there are two clades of VEGF that have been sequenced from snake venom glands: one type that is a part of the vasculature of the venom gland and the other type which is the form actually secreted in the venom. The form recovered in this study was of the former type and thus the VEGF toxins are not known to-date as a basal toxicoferan venom component and appear to be restricted to the advanced snakes. Consistent with the distinction between the body form of a protein and its toxin homologue, the vascular VEGF display much less sequence variation than the venom sequences, indicating that they are evolving under negative selection whilst the actively secreted forms are evolving under positive selection.

Our results show that even small arboreal anguimorpha lizard lineages retain the ancestral venom system and that the continued diversification is indicative of continued evolution operating under selection pressure. Thus, these lizards are technically venomous. It should be stressed, however, that in no way do we suggest that these animals are ‘venomous’ from the perspective of a threat to human health and thus should not be considered as ‘venomous’ from the stand-point of dangerous animal legislation. We instead consider the analogy with spiders to be relevant: like spiders, anguimorpha lizards are venomous from a biological/evolutionary perspective but are harmless from the perspective of human health.

Due to the relative sampling employed, the toxin types recovered in this study from the mandibular venom gland of A. graminea are no doubt but a subset of the total diversity encoded. More intensive sampling would almost certainly recover novel isoforms of known toxin types, perhaps even changing our knowledge of the timing of recruitment events, as well as even recovering entirely new suites of bioactive compounds. Regardless, this study has advanced our knowledge of lizard venom evolution by showing that even obscure arboreal species can be a rich source of novel sequences. Such new information is not only just of use for evolutionary investigations but also provides a pragmatic platform for the investigation of novel components as lead compounds in drug design and development. The strong evidence of positive selection on certain secretory toxins as well as the evolutionary conservation of others is highly suggestive of the active use of these secretory toxins in the ecology (either predatory or defensive) of this anguimorph lizard. Confirmation in future studies of the presence of these toxins in the secretory-proteome of A. graminea will add further strength to the hypothesis that the toxin-secreting oral glands of many toxicoferan lizards are not merely examples of ‘exaptation’ (for future development into ‘true venom glands’) but play an important role in the ecology of these species and are thus are true ‘venom glands’ by any definition.