Introduction

Genes were originally conceived as static entities confined to specific loci along the chromosomes. The discovery of transposable elements (TEs) in eukaryotes challenged that concept by revealing a previously unknown mechanism for the production of genetic and genomic variation (Hua-Van et al. 2011). TEs are mobile DNA sequences with an inherent ability to move within and among genomes (Pritham 2009). This mobility allows them to exert control over their local genetic environments. Based on their mode of transposition, TEs are grouped into two major categories. Class I elements (retrotransposons; copy and paste) transpose through an RNA intermediate whereas Class II elements (DNA transposons; cut and paste) transpose directly as DNA (Pritham 2009).

TEs change the evolutionary trajectory of genomes in many ways. TE activity can (a) facilitate diversification of the genome by the emergence of new genes; (b) modify gene expression patterns; and (c) promote chromosomal rearrangements that ultimately increase the adaptability of the species (Oliver and Greene 2009). On the other hand, excessive TE proliferation can (a) decrease organismal fitness by mutagenic effects of insertions; (b) increase metabolic costs associated with transcription; and (c) induce translation and chromosomal damage due to ectopic recombination (Oliver and Greene 2009; Nuzhdin 1999). Thus, genomes have evolved mechanisms to repress the uncontrolled proliferation of TEs. These mechanisms include epigenetic processes such as RNAi, a transcriptional and post-transcriptional gene silencing system that relies on short, non-coding RNAs (Blumenstiel 2011).

Nonetheless, physiological stress can activate TEs and disrupt epigenetic silencing, thus leading to TE proliferation (Capy et al. 2000; Slotkin and Martienssen 2007; Zeh et al. 2009). For instance, in Drosophila simulans, developmental temperature can affect both somatic and germline TE excision rates (Chakrani et al. 1993; Garza et al. 1991). Similarly, the invasion of new habitats may challenge genomes and lead to TE activation. For example, the increased TE copy number in D. melanogaster compared to D. simulans could be associated with its worldwide colonization (Vieira and Biémont 2004). TE proliferation has also been observed in colonizing populations of Drosophila subobscura and Arabidopsis lyrata (García Guerreiro et al. 2008; Lockton et al. 2008), presumably due to the physiological stresses associated with the invasion of new habitats. Such TE proliferations may facilitate the invasion of novel habitats through genome dynamism.

Herein, we analyze TE proliferations in the human parasite Schistosoma mansoni, which has recently invaded new habitats and thus may be exposed to novel environmental and evolutionary stressors. The parasites of the genus Schistosoma are the causative agents of schistosomiasis, a chronic and debilitating human disease of significant medical importance, contributing to 200 million cases in the tropics (Brunette 2014; Chitsulo et al. 2000). Eggs produced by adult parasites become lodged in viscera, provoking granulomatous reactions and portal hypertension (Lambertucci et al. 2000; Fig. 1). Six Schistosoma species are known to infect humans; one of the most common is S. mansoni, the etiological agent of intestinal schistosomiasis (Crompton 1999; Brindley and Hotez 2013). In fact, S. mansoni shows the most widespread distribution, spanning both old world and new world. Intestinal schistosomiasis is found in 54 countries, including the Arabian Peninsula, sub-Saharan Africa, Caribbean islands and South America (Chitsulo et al. 2000; Brunette 2014).

Fig. 1
figure 1

Life cycle of Schistosoma mansoni. Adult male and female worms inhabit definitive vertebrate hosts, mating and producing eggs. The eggs are released from the host through feces, hatch into miracidia in freshwater and infect snail intermediate hosts. The cercariae released from intermediate hosts infect vertebrate hosts by skin penetration (Figure modified from Wijayawardena et al. 2013).

The diversification of the S. mansoni lineage has intrigued both parasitologists and molecular biologists. Molecular evidence indicates that the genus Schistosoma originated in Asia (Snyder and Loker 2000; Lockyer et al. 2003; Morgan et al. 2005) and descendants of this lineage then moved to Africa approximately 12–19 million years ago, possibly by means of a widespread mammalian migration (Lockyer et al. 2003). African lineages underwent extensive radiation and gave rise to S. mansoni about 0.3–0.43 million years ago, concomitant with the emergence of modern humans (Morgan et al. 2005). From Africa, S. mansoni likely invaded the new world through the Atlantic slave trade within the last 500 years and quickly became established in the new world (Morgan et al. 2005; Files 1951, Fletcher et al. 1981, Desprès et al. 1993). This rapid colonization of the new world may have been facilitated by its genomic structure.

The genomes of three major human infecting schistosomes have been sequenced recently (Berriman et al. 2009; Protasio et al. 2012; Zhou et al. 2009; Young et al. 2012). Interestingly, 40–50 % of the schistosome genome consists of transposable elements (TEs). The estimated copy numbers of TEs in the genome of S. mansoni varies between 50 (e.g. SINBAD class I LTR TE; Thomas et al. 2010) and 20, 000 copies (e.g. SR1, class I non-LTR TE; Thomas et al. 2010). Several families of both class I and class II TE sequences have been identified in S. mansoni, most of them actively transcribed (Protasio et al. 2012; Thomas et al. 2010). For instance, LINE elements of S. mansoni show evidence of recent bursts of transposition that were not paralleled in the conspecific S. japonicum, possibly associated with the move from Asia to Africa and establishment and consequent speciation (Venancio et al. 2010). Given its phylogeography, physiological stress associated with the invasion of new habitats may have also shaped the TE repertoire of S. mansoni (Venancio et al. 2010). Herein, we test this hypothesis by performing qPCR to compare relative copy numbers of both class I and class II TEs in new and old world strains. Our null hypothesis is that there is no difference in relative copy numbers between new and old world strains. However, if increased TE content in S. mansoni results from physiological stress associated with the invasion of the new world, then we expect the new world strains of S. mansoni to have an increased copy number of TEs compared to the old world strains.

Materials and methods

Parasite strains

In total, four new world strains and two old world strains of S. mansoni were used in our study. Either single (adults or cercariae) or multiple (adults and cercariae) life cycle stages were used from each strain (Table 1). We obtained PR-1 and NMRI strains from the Biomedical Research Institute (Rockville, MD). The NMRI strain originated from eggs isolated from Puerto Rican school children in the early 1940’s whereas the PR-1 strain was collected from infected snails in Arecibo, Puerto Rico, in 1950 (Table 1; Lewis et al. 1986). The LE strain originated from Belo Horizonte, Minas Gerais, Brazil, in 1965 from an infected human patient. We also used a Brazilian field strain from Americaninhas (AMH) first isolated in 2005 from human fecal samples (Thiele et al. 2008). The hatched miracidia of AMH isolate were passed through Biomphalaria glabrata snails and Balb/cj mice to harvest the adult worms. Our old world strains included EGY-1, collected from infected humans in 1978, Qualhum, Egypt (Fletcher et al. 1981) and a Kenyan field isolate (KY: cercariae obtained from infected snails Biomplalaria pfeifferi, in 2013 from Mwea, Kenya).

Table 1 S. mansoni strains used in the analysis, their origins, and life cycle stage used

All laboratory strains were maintained in the recommended snail host/strain combinations (Lewis et al. 1986, Fletcher et al. 1981). At 4–6 weeks after infection by miracidia, Biomphalaria snails were exposed to fluorescent light for 2 h to release the cercarial stage of the parasite from the snail (Fig. 1). BALB/cJ mice (n = 3–6) were infected with approximately 120 cercariae each, using the tail emersion method modified from Oliver and Stirewalt (1952). After 6–8 weeks, mice were humanely euthanized with CO2 and dissected to obtain adult worms and eggs. Adult worms were extracted from hepatic portal and mesenteric veins. The hatched eggs were used to infect the snails to maintain the parasite life cycle. All animal use followed protocols approved by the Purdue Animal Care and Use Committee (PACUC).

DNA extraction and cDNA synthesis

We used male parasites as they are considerably larger than the females, yielding sufficient quantity of good quality DNA. DNA was extracted from 30 fresh individual male worms using standard proteinase K digestion followed by organic extraction (Sambrook and Russell 2001). Extracted DNA samples were stored at −20 °C until PCR. Cercariae were collected from pooled samples, by shedding ~100 infected snails per strain. QiaAmp® (Qiagen) DNA extraction kits were used to extract DNA from single cercariae. Individual cercariae were sexed using the primers designed for the female-specific w1 repeat (Grevelding 1995). Only male cercariae were used in our analysis (n = 30 per strain), to avoid sex bias.

TRIzol® reagent (Invitrogen) was used to extract RNA from adult male worms according to the manufacturer’s instructions. As single worms yielded insufficient RNA, pooled worms were used. Each pool contained 5 adult worms, and 20 such pools per strain were used in the analysis (i.e. 20 biological replicates per strain; corresponding to strains NMRI, PR-1, LE and EGY-1). The quantity and quality of RNA was analyzed using a spectrophotometer (NanoDrop 3000, ThermoFisher Scientific) and by 1 % agarose gel electrophoresis followed by ethidium bromide staining. All RNA samples were treated with DNase I (New England Biolabs) prior to cDNA synthesis to remove trace amounts of contaminating genomic DNA. First strand cDNA synthesis was performed using the ClonTech SMART cDNA library construction kit with a modified CDS III/3′ cDNA synthesis primer (5′ TAGAGGCCGAGGCGGCCGACATGTTTTGTTTTTTTTTCTTTTTTTTTTVN 3′) (Zhu et al. 2001). A control first strand synthesis reaction was performed without reverse transcriptase to eliminate second strand amplification products that would be non-specific, i.e. generated from residual DNA contamination in the RNA samples. ClonTech PCR Advantage II polymerase mix was used for cDNA second-strand synthesis, using the following thermo cycling parameters: initial denaturation at 95 °C for 10 min followed by 40 cycles of 95 °C for 15 s and 56 °C for 1 min. Water was substituted in place of cDNA template as a negative control for second strand synthesis.

qPCR relative quantification

A single transposable element can span more than 1,000 base pairs and carry multiple genes that enable its excision and proliferation (Pritham 2009). PCR-based estimation of TE copy number can be challenging because amplification efficiency decreases as the template length increases (Huber et al. 2009). Thus, we designed primers corresponding to 100–150 bases of specific TE genes (Table 2; DeMarco et al. 2004; DeMarco et al. 2006; Feschotte 2004). All primers were designed to have optimum binding at 56 °C, so that minimum effort was required in primer optimizing (IDT® Oligo Analyzer). Six representative of the various categories of TEs in S. mansoni genome (Table 2) were selected for copy number analysis; these include four class I elements (Saci-1, Saci-2, Saci-3, Perere-1), and two class II elements (Merlin-sm1 and SmTRC1). All of these elements are known to be transcriptionally active (DeMarco et al. 2004; DeMarco et al. 2006; Feschotte 2004). To determine if the use of different genes has a significant effect on copy number estimation, two primers sets were designed for Saci-3, targeting the RNaseH gene and Protease gene (Table 2). Primer/target selection was based on primers having the same amplification efficiency as our internal control gene, GAPDH (see qPCR relative quantification method below).

Table 2 Transposable elements and primers used in the qPCR analysis

To determine the copy number of TE genes, qPCR relative quantification was performed using StepOnePlus™ Real Time PCR system (Applied Biosystems) and Power SYBR Green PCR master mix, according to the manufacturer’s instructions (Applied Biosystems). GAPDH (Glyceraldehyde Phosphate DeHydrogenase) was used as the internal control gene (separate primers for DNA and cDNA; Messerli et al. 2006). At the beginning of each assay, five serial dilutions of the template (DNA and cDNA separately) were used in the real-time PCRs on both target and reference genes to assess efficiency and to confirm that the amplification efficiencies of both genes were equal (Schmittgen and Livak 2008). qPCR mixtures consisted of 1 × SYBR Green, 3.6 μM of each primer and 1 ng of template. The thermal profile for all genes was as follows; 95 °C for 10 min, followed by 40 cycles of 95 °C for 15 s and 56 °C for 1 min. A melting curve analysis was conducted from 50 to 90 °C with 0.5 °C increases per cycle for a total of 80 cycles to insure there was no mis-annealing or contaminated cDNA in the sample. Each reaction was performed in three replicates and alongside a no-template negative control to rule out contamination (see Doyle et al. 2011). Samples with Ct values higher than the negative control and multiple Tm peaks (indicative of contamination) were removed from the analysis.

Statistical analysis

The fold differences of TE copy number in the genome and the transcriptome were calculated by the comparative Ct method, with the formula 2−ΔΔCt (Schmittgen and Livak 2008).

$${\text{Fold difference }} = { 2}^{{ - \Delta \Delta {\text{Ct}}}} = { 2}^{{ - \left[ {\left( {{\text{Ct TE gene}},\,{\text{ New world}} - {\text{Ct GAPDH}},\,{\text{ New world}}} \right) \, - \, \left( {{\text{Ct TE gene}},\,{\text{ Old world}} - {\text{Ct GAPDH}},\,{\text{ Old world}}} \right)} \right]}}$$

Subsequently, the fold differences of each TE were compared between the old world and new world strains (separately for adults and cercariae). TE copy number between the field and laboratory strains were also compared to identify possible TE dynamics associated with the demographic history of the population (Fold difference = 2−ΔΔCt = 2−[(Ct TE gene, Laboratory strain−Ct GAPDH, Laboratory strain)−(Ct TE gene, Field strain−Ct GAPDH, Field strain)]). The significance was determined using non-parametric Mann Whitney U test (equal variance) and Kolmogorov–Smirnov test (unequal variance). Bonferroni corrections were used to adjust for multiple comparisons (Bland and Altman 1995).

We expected more abundant and/or transcriptionally active TEs to show a greater difference in copy number between old world and new world parasites compared to rare/inactive TEs. Pearson correlation was used to quantify the relationship between TE abundance and the difference in copy number between old world and new world parasites (i.e. estimated copy number of the TE in the genome x the average difference in copy number between the new world and old world parasites). TE abundance and activity data were adapted from DeMarco et al. (2004). All statistics were calculated using R software release 3.1.1 (www.r-project.org).

Results

TE copy number in genomic DNA

We used an array of old and new world laboratory and field strains of the parasite S. mansoni to quantify TEs. As some field strains were available only as adults or cercariae, we conducted separate comparisons for each life cycle stage. The genomes of our new world parasites (both adults and cercariae) carried more copies of both class I and class II TEs than the old world parasites for both lab and field strains (Figs. 2, 3). This difference was significant in the adults (Strains NMRI, PR-1, LE, AMH, EGY-1; Kolmogorov–Smirnov test Bonferroni corrected p < 0.05; Supplementary Table S1), yet mostly non-significant in the cercariae (Strains NMRI, PR-1, LE, EGY-1, KY; five out of the seven comparisons non-significant; Supplementary Table S1).

Fig. 2
figure 2

Class I TE copy number difference in genomic DNA of old world and new world S. mansoni cercariae (male cercariae, n = 20, field and laboratory strains NMRI, PR1, LE, EGY1, KY) and adults (adult worm DNA, n = 30 per strain, field and laboratory strains NMRI, PR1, LE, AMH, EGY1). The x axis indicates the strains while the y axis represents the fold difference in TE copy number compared to the internal control GAPDH. The level of significance is indicated by asterisks (bonferroni corrected p < 0.05: *; p < 0.005: **; p < 0.0005: ***). The error bars represent standard errors. The difference in copy number is significant in adults. In cercariae, only Saci-1 and Saci-2 TE genes are significantly different between new world and old world strains

Fig. 3
figure 3

Class II TE copy number difference in genomic DNA of old world and new world S. mansoni strains (Cercariae DNA n = 20 per strain; Adult worm DNA n = 30 per strain; comparison includes both field and laboratory strains, as in Fig. 1). The x-axis indicates the strains whereas the y-axis represents the fold difference in TE copy number compared to the internal control GAPDH. The level of significance is indicated by asterisks (bonferroni corrected p < 0.05: *; p < 0.005: **; p < 0.0005: ***). The error bars represent standard errors. The difference in class II TE copy number is significant between adult samples of old world and new world strains

When only laboratory strains (NMRI, PR-1, LE, EGY-1) were compared, new world adults still had significantly more copies of class I and class II TEs than the old world adults (Kolmogorov–Smirnov test Bonferroni corrected p < 0.05; Supplementary Figure S1; Supplementary Table S2). In cercariae of the laboratory strains, the TE copy number difference was non-significant, but the new world strains still had more copies than the old world strains (Supplementary Figure S1; Supplementary Table S2).

Overall, class I elements were more abundant in the genomes of new world strains than were class II elements (Figs. 2, 3). The correlation between the estimated TE abundance and the difference in copy number between old world and new world adult parasites was moderate (correlation coefficient = 0.44; Supplementary Figure S2). However, we observed a strong correlation (correlation coefficient = 0.80; Supplementary Figure S3) between TE activity (estimated copy number of the TE in the genome*estimated transcriptional activity of the TE) and the average difference in copy number between the new world and old world parasites. Saci-1, a retrotransposon present in 70–700 genomic copies, has the highest transcriptional activity in adults and shows the highest copy number difference between old world and new world parasites (Supplementary Figure S3; DeMarco et al. 2004). Perere, an abundant retrotransposon with relatively low transcriptional activity shows the second largest difference between the new world and the old world adult parasites. SmTRC1, the TE with the lowest copy number exhibits the lowest copy number difference between the strains (Supplementary Figure S3; DeMarco et al. 2004). We identified no significant correlation between genomic TE abundance and difference in copy number between old world and new world cercariae (data not shown).

TE expression

We used cDNA synthesized from laboratory strains to calculate the differential expression of TEs at the transcriptome level. Unlike the genome, the transcriptome data revealed no consistent significant difference between the old and new world strains (Supplementary Table S3). Only class II Merlin-sm1 TE had a significantly higher expression level in the new world samples (Kolmogorov–Smirnov test Bonferroni corrected p < 0.05; Supplementary Table S3).

Variation in copy number estimates between genes

We estimated the copy number of Saci-3 using two different TE genes (RNaseH and Protease) to determine whether the amplicon has a significant effect on TE copy number estimation. Both RNaseH and Protease data indicate greater copy numbers of Saci-3 in new world strains (Fig. 2e–h), and our calculations indicate that estimates are not significantly affected by the TE gene assayed (\({\text{Fold difference}} = 2^{{ - \Delta \Delta {\text{Ct}} }} = 2^{{ - [\left( {{\text{Ct}}Saci{\text -}3,{\text{ RNaseH{-}Ct GAPDH}}} \right) - ({\text{Ct}}Saci{\text -}3,{\text{ protease{-}Ct GAPDH}})]}}\); Kolmogorov–Smirnov test p > 0.05).

TE copy number in laboratory versus field strains

Laboratory and field parasites are subjected to differential evolutionary pressures and therefore could exhibit differences in their genetic makeup (Fletcher et al. 1981; Stohler et al. 2004). Therefore, we calculated the TE copy number differences between laboratory and field strains in two different ways. First, we compared TE copy number between laboratory and field strains of geographically proximate sampling sites (i.e. Brazilian field strain vs. Brazilian lab strain and Egyptian lab strain vs. Kenyan field strain) to reduce the effects of differential evolutionary history while identifying such disparities. Second, we compared the pooled new world adults of laboratory strains (NMRI, PR1 and LE) against the new world field strain (AMH) to identify significant differences in TE copy number, despite the geographical distance. Our analysis indicates that the TE copy numbers are different between well-established laboratory strains and field collected parasites in both old and new world samples (Table 3). The differences were significant in all adult worm comparisons, but only marginally significant in a single cercarial comparison (Table 3). Interestingly, the direction of change was the same in all the comparisons, except in Perere and SmTRC1 of cercariae EGY-1 versus KY. Specifically, both adult and cercarial field samples seem to carry more copies of Saci-1 (RNase H), Saci-2 (RNase H), Saci-3 (RNase H), Saci-3 (Protease) and Merlin-sm1 (Transposase). The comparisons were significant in adults (Table 3). In both adult worm comparisons, four out of seven significant comparisons indicated more TE copies in the field strains. Despite these differences, both field and laboratory samples showed the same overall trend (i.e. new world parasites have more TE copies), when they were analyzed separately for old world vs. new world (Figs. 2, 3; Supplementary Figure S1; Supplementary Tables S1–S2).

Table 3 p values associated with laboratory versus field strain comparisons at the DNA level

Discussion

Schistosoma mansoni is one of the most notorious human parasites due to its pathogenicity and widespread distribution (Brunette 2014). Phylogenetic analyses suggest that S. mansoni invaded the new world during the trans-Atlantic slave trade (Morgan et al. 2005; Files 1951; Fletcher et al. 1981; Desprès et al. 1993) and quickly became established (Dejong et al. 2001). This suggests that S. mansoni must have adapted rapidly to the new world, where it no doubt faced novel intermediate and definitive hosts (Dejong et al. 2001, 2003), different environmental/climatic regimes, and dissimilar suites of pathogens, predators, and competitors. Beyond such ecological factors, genomic factors such as TE proliferation might have helped the parasite to adapt to new habitats (Venancio et al. 2010). We used qPCR quantification to test the idea that TE repertoires differ between parasites from established versus new habitats, by calculating the copy number of TEs in new and old world strains of S. mansoni relative to a control gene (GAPDH). Our results indicate that new world strains of S. mansoni do indeed carry more TE copies than old world strains. The trend is consistent in both cercarial and adult samples, albeit only significant in adults.

Both class I and class II elements are more abundant in the genomes of new world strains than in old world strains. Class I TEs, which copy and paste via an RNA intermediate, create new copies of themselves and thus the population of elements in a genome can grow rapidly. As a consequence, large genomes often consist primarily of class I retrotransposons (Venancio et al. 2010; Kidwell 2002). Conversely, class II elements increase their copy numbers through indirect mechanisms that rely on the host machinery such as DNA replication and double-strand break repair (Engels et al. 1990; Ros and Kunze 2001). Although not extensively as class I, class II DNA transposons can also proliferate due to stress (Naito et al. 2009; Feschotte et al. 2003).

We were surprised to observe a disparity in TE copy number between adult and cercarial life history stages (Figs. 2, 3). One possible explanation for this difference is that mitotic chromosomal recombination is often observed in schistosome cercariae (Bayne and Grevelding 2003; Semyenova et al. 2007), despite their asexual proliferation within the snail host (Fig. 1). Intra- and inter-chromosomal recombination events result in cercariae having differential numbers of repetitive elements (Rinaldi et al. 2012; Bayne and Grevelding 2003). Mitotic recombination can produce populations that are more diverse within hosts than among hosts (Korsunenko et al. 2012). TEs can facilitate ectopic recombination events, and could be associated with increased TE expression in cercariae (DeMarco et al. 2004). Another possible explanation for the disparity in TE copy number between life history stages is the use of parasites from different generations, as parasites with complex life cycles undergo several bottlenecks that can produce differences in genetic and genomic structure (Stohler et al. 2004).

Despite the differences in genomic copy number, there was no significant difference in the expression of most TEs between new world and old world strains. There are at least three possible explanations for this: pooling, bias, and TE repression. Pooling is often carried out in transcriptome analysis when insufficient RNA is available from individuals or individual tissues (Marinov et al. 2014). However, pooling can reduce the likelihood of identifying true biological variation in TE gene expression (Jolly et al. 2005) and thus cannot be discounted. However, there are also biological reasons that might explain the discordance in copy number variation between the genome and transcriptome. One of these is bias, whereby truncated TE copies are much more likely to become fixed in populations than full length copies—yet not necessarily more likely to be more highly expressed—because of purifying selection against active full length copies (Tollis and Boissinot 2013).

Finally, the lack of expression differences could correspond to reactivation of TE repression systems sometime after new world colonization. We favor this idea because our data are consistent with the emerging body of evidence summarized here. Elevated transposition rates cannot be maintained for long without deleterious effects caused by TE insertions (Zeh et al. 2009). In schistosomes, hyper-methylation of TEs has been identified as a mechanism for silencing TE transcription (Geyer et al. 2011). Furthermore, schistosomes utilize an RNAi pathway to silence the expression of genes (Batista and Marques 2011; Gomes et al. 2009; Chen et al. 2010). Hao et al. (2010) identified and characterized endogenous siRNAs which were primarily derived from S. japonicum TEs. Several genes encoding core and accessory proteins that function in RNAi pathways are conserved in the genome of schistosomes and gene silencing induced by introduction of exogenous dsRNA can be readily induced (Krautz-Peterson et al. 2007). This suggests that the TEs in schistosome genomes are now under tight control to prevent TE proliferation.

Repression is an arms race that is not always effective. Consider Merlin-sm1, a short class II element with an estimated 50–500 copies in the genome (Feschotte 2004) and the only expressed TE with significantly higher copy number in the new world. This suggests that Merlin-sm1 escaped TE repression systems, continuing to actively transcribe long after the new world invasion of S. mansoni. The patchy phylogenetic distribution of Merlin-like sequences suggests the possibility that it might have spread across distantly related species (including schistosomes) through repeated horizontal transfer (Feschotte 2004).

TE proliferations are susceptible to environmental changes (Zeh et al. 2009). The link between stress and TE activation is not always straightforward, but emerging patterns point towards underlying processes (Casacuberta and Gonzalez 2013). TE activation is often a consequence of the relaxation of epigenetic control induced by environmental changes (Slotkin and Martienssen 2007; Zeh et al. 2009; Rebollo et al. 2010). According to the “epi-transposon hypothesis”, epigenetic regulatory mechanisms (e.g., RNA interference, DNA methylation, and histone modifications) help maintain genome stability by suppressing TE mobilization. However, physiological stress, as induced by climate change or the invasion of new habitats, disrupts epigenetic regulation and unleashes TEs. Such TE proliferations can also induce the expression of adjacent genes when TEs act as transcription enhancers. For instance, activation of mPing elements in rice, can upregulate the expression of nearby genes in a stress inducible manner (Naito et al. 2009). Conversely some TEs, such as TF1 in Schizosaccharomyces pombe, specifically integrate near stress-responsive genes and are activated during stress (Bowen et al. 2003).

Other processes could also lead to increased proliferation of TEs. According to the “founder event hypothesis” TE proliferation occurs due to founder effects associated with genetic drift (García Guerreiro et al. 2008; García Guerreiro and Fontdevila 2007; Vieira and Biémont 2004). The differential fixation of TE copies in colonized versus native populations of Drosophila subobscura (García Guerreiro et al. 2008) and Arabidopsis lyrata (Lockton et al. 2008) strongly suggests that demographic history can influence the fate of TEs in the genome. Such founder effects are evidenced by low nucleotide diversity and have been identified in the new world strains of S. mansoni (Morgan et al. 2005). TE dynamics are governed by transposition, selection, and drift. Purifying selection purges deleterious copies of TEs in large populations, but selection is less effective at purging TEs in small populations where genetic drift predominates (Lynch and Conery 2003). Therefore, deleterious TEs are expected to proliferate most rapidly in small populations governed by genetic drift. This scenario is consistent with the evolutionarily recent establishment of S. mansoni in the new world, where colonizers were no doubt subject to bottlenecks and founder effects. Likewise, laboratory strains are notoriously affected by inbreeding and founder events as they are often established from one or few individuals (Fletcher et al. 1981). In addition, they are more susceptible to population bottlenecks and differential selection pressures (i.e. due to the maintenance in the mouse host) than field isolates (Blank et al. 2010; Stohler et al. 2004). Our results indicate differential TE profiles in laboratory and field strains, possibly due to their different demographic histories (Table 3).

Once unleashed due to environmental stressors, TE proliferations can inadvertently foster genetic innovations required for rapid adaptation (Zeh et al. 2009). Indeed, transposable elements have the ability to generate mutations through the process of transposition, which can create ectopic rearrangements, inversions, translocations and gene duplications (Casacuberta and Gonzalez 2013). Mutations are the ultimate source of all genetic variation and serve as the raw material for evolutionary adaptations. TE-induced mutations are known to confer adaptive advantages to host genomes (Schlenke and Begun 2004; Zeh et al. 2009). For instance in Drosophila simulans, the Doc element within a P450 gene provides increased resistance to insecticides (Schlenke and Begun 2004). Following this logic, TE proliferations might have assisted colonizers such as S. mansoni in the invasion of new habitats (and new worlds).

Conclusions

Dynamic environmental conditions pose a major challenge to the survival of organisms and positive selection acts upon those genetic variants that foster organismal adaptation (Visser 2008). TEs act as “mutator loci” that can give rise to new innovations that serve as a basis for adaptation. We report one of the first studies of TE dynamics in parasites. We used genes from multiple TEs and two life cycle stages to substantiate our results. Using qPCR, we demonstrated that new world strains of S. mansoni carry more copies of TEs in their genomes than old world strains. This finding is consistent with a burst of stress-induced TE activity concomitant with the invasion of the new world by S. mansoni. Such amplifications might have positively influenced the parasite’s ability to invade new habitats, but if so the signature of TE bursts was absent in transcriptomes, perhaps due to the reactivation of TE repression systems and/or indicating the presence of inactive TE copies. Although new world strains possess more TE copies in both laboratory strain and field strain comparisons, we found that field and laboratory samples also differ in their TE repertoires. Our results demonstrate the value of studying TE dynamics at multiple hierarchical levels in order to better understand TE and genome evolution.