Introduction

Giardiasis is one of the most common gastrointestinal protozoan diseases of humans and also affects many other mammalian species. It occurs more frequently in developing countries where approximately 200 million people suffer symptomatic giardiasis every year (WHO 1992). Clinical symptoms, when present, include diarrhoea, flatulence, stomach cramps and nausea, which sometimes can lead to weight loss and dehydration (Gardner and Hill 2001).

Currently, six species are recognized in the Giardia genus, G. duodenalis is the only species that infects humans, albeit it is found in many other mammalian hosts (Thompson and Monis 2011). In order to understand the epidemiology of giardiasis, particularly the role of zoonotic transmission, and the correlation between the genetic variation of the parasite and its pathogenicity in the host, a large number of isolates have been characterised genetically (Amar et al. 2002; Caccio et al. 2005; Sprong et al. 2009; Vanni et al. 2012). These data have shown that isolates of G. duodenalis, which cannot be distinguished morphologically, can be assigned to at least eight distinct genetic groups (assemblages). Among these, assemblages A and B have a broad host range and infect humans and many other mammals (Alyousefi et al. 2013; Caccio and Sprong 2010; Eligio-Garcia et al. 2005; Feng and Xiao 2011; Geurden et al. 2009; Yang et al. 2010). The other assemblages are thought to be more host specific (assemblages C and D for canids, assemblage E for cloven-hoofed domestic mammals, assemblage F for felids, assemblage G for rats and assemblage H for marine mammals), although recent studies have detected some of these assemblages in humans from developing countries (Foronda et al. 2008; Gelanew et al. 2007).

Further genetic variation exists within each assemblage: subgroups within assemblages A and B were originally defined by isoenzyme analysis and classified as AI and AII, BIII and BIV (Monis et al. 1999). This technique, however, is not suitable for routine genotyping studies because it requires large amount of biological material that can only be obtained by in vitro growth of Giardia isolates. Currently, the genetic characterization of Giardia is based on PCR amplification and sequencing of fragments of the small subunit rRNA, beta-giardin (bg), triose phosphate isomerase (tpi) and glutamate dehydrogenase (gdh) genes, which differ in terms of sensitivity and degree of genetic polymorphism (Wielinga and Thompson 2007). Genotyping data on human isolates have been generated from many parts of the world, albeit most of the data are from Europe, Australia and the USA (Caccio et al. 2002; Read et al. 2004; Sulaiman et al. 2004; van der Giessen et al. 2006). Recent genotyping studies have also been performed in tropical, developing or newly industrialized countries such as India, Philippines, Mexico, Argentina, Brazil, Peru, Nicaragua, Egypt, Guinea Bissau and Ethiopia (Eligio-Garcia et al. 2008; Ferreira et al. 2012; Foronda et al. 2008; Gelanew et al. 2007; Kohli et al. 2008; Lebbad and Svard 2008; Minvielle et al. 2008; Traub et al. 2004; Yason and Rivera 2007). These studies reported high genetic variability, mixed assemblage infections and assemblages not typically associated with human infections, thus confirming the complexity of Giardia molecular epidemiology.

To obtain an informative picture of the epidemiology of giardiasis, the analysis of single markers is not sufficient. Indeed, recent studies based on a multilocus typing approach have revealed an unexpected complexity due to mixed infections, allelic sequence heterogeneity and lack of concordance in the assignment of isolates to a specific assemblage (Almeida et al. 2010; Caccio et al. 2002; Geurden et al. 2009; Laishram et al. 2012; Lebbad et al. 2008, 2010, 2011; Soliman et al. 2011; Sprong et al. 2009; Wielinga et al. 2011).

In Germany, approximately 4,700 cases of giardiasis are officially registered yearly (RKI 2008), but, due to underreporting and under diagnosing, the actual number might be much higher. Most cases of human giardiasis in Germany are related to travel, particularly to tropical and subtropical areas.

The present study was aimed to perform a molecular analysis through multilocus genotyping on isolates collected from German patients with a recent history of travel to endemic regions, which may provide further insights on the distribution of Giardia molecular patterns in different geographical areas.

Materials and methods

Sources of isolates

Faecal samples from travel returnees were collected at the Institute of Tropical Medicine and International Health in Berlin (Germany). The detection of Giardia cysts and/or trophozoites in those samples was based on light microscopy using conventional MIF concentration as well as direct immunofluorescence (Merifluor Crypto & Giardia, Meridian Bioscience, Cincinnati, OH, USA). In 1 year, a total of 66 patients mostly suffering gastrointestinal disturbances were found positive for Giardia. For 57 patients, data about previous stays abroad were available: recent travels to Latin America (Mexico, Guatemala, Honduras, Argentina, Bolivia, Brazil), India, Southeast Asia (Thailand, Cambodia, Laos), western Africa (Ghana, Burkina Faso, Benin), eastern Africa (Sudan, Tanzania, Kenya), southern Africa (Namibia, Malawi) and northern Africa (Egypt) were reported.

Sample treatment and DNA extraction

About 3 g of faecal material were dissolved in phosphate-buffered saline (PBS), vortexed, filtered through a mesh and centrifuged for 5 min at 4,700 rpm. The supernatant was discarded, and the pellet was dissolved in 3 ml of PBS. The faecal suspension was laid over 10 ml of a 1-M sucrose solution and centrifuged at 2,000 rpm for 10 min without brake. The intermediate layer containing the Giardia cysts was recovered, brought to 50 ml with PBS and concentrated by centrifugation at 4,700 rpm for 5 min. The pelleted cysts were used for DNA extraction using the QIAamp Stool kit (Qiagen, Hildesheim, Germany), with an initial incubation of 10 min at 95 °C and a final elution in 100 μl of elution buffer instead of 200 μl. The eluted DNA samples were stored at −20 °C before use.

Molecular and phylogenetic analysis

Nested PCR amplification of fragments of the bg (511 bp), tpi (530 bp) and gdh (530 bp) genes was performed using published protocols from Lalle et al. (2005), Sulaiman et al. (2003) and Caccio et al. (2008), respectively. A published quantitative TaqMan real-time PCR assay (qPCR) targeting the bg locus (Guy et al. 2004) was performed on samples for which assignment to assemblages was non-concordant.

PCR products were purified using spin columns (GE Healthcare, Germany) and sequenced from both strands. Sequences were edited using SeqMan 7.0 software (DNASTAR, WI, USA) and aligned with reference sequences retrieved from the GenBank using Clustal W (Aiyar 2000).

Representative sequences of subgroups AI (accession numbers EU014394, L40509, AF069556), AII (accession numbers AY072723, L40510, AF069557), BIII (accession numbers AY072726, AF069059, AF069561) and BIV (accession numbers AY072725, L40508, AF069560) were retrieved from GenBank and used as reference. As reference for sub-assemblages A3–A6 at the bg locus, the sequences submitted by Cacciò et al. (2008) were used.

Results

PCR amplicons were obtained from 61 of the 66 samples at the bg locus, from 29 samples at the tpi locus and from 17 samples at the gdh locus. All positive isolates were sequenced and the sequences aligned with those from reference strains representing sub-assemblages AI, AII, BIII and BIV and D (Baruch et al. 1996; Homan et al. 1998; Monis et al. 1999). The degree of polymorphisms detected at the three loci in the different assemblages is shown in Fig. 1 and in the Supplementary Tables S2, S3, S4. The heterogeneity index reported in Fig. 1 is a graphical representation of the degree of polymorphism expressed as the mean number of (1) polymorphic sites generating a novel sequence (single-nucleotide polymorphism (SNP)); (2) the heterogeneous positions common to assemblages A and B, representing mixed assemblage infection (mixAssem) and (3) the heterogeneous positions (double peaks) at random sites not common to assemblage A or B (Heter), where the denominator is the number of isolates of a given assemblage.

Fig. 1
figure 1

Heterogeneity index (number of SNP/number isolates of a given assemblage) of Giardia isolates at the three different loci considered

The novel sequences without heterogeneous positions detected in the present study are available in the GenBank database under accession numbers FJ007840–FJ007859.

Multilocus typing analysis

The assignment of the samples to different assemblages through standard PCR and sequencing and qPCR by using assemblage-specific primers is reported in Table 1.

Table 1 Genotyping results (assemblage, sub-assemblage) of human isolates of Giardia duodenalis obtained at the bg, gdh and tpi loci by standard PCR and by qPCR at the bg locus and geographical origin of infection (recent travel)

Sequence data for all three loci were available for 14 samples. By comparing the assignment to assemblages at the bg, tpi and gdh loci, nine isolates were consistently classified as assemblage B (seven samples) or assemblage A (two samples), whereas the remaining five isolates were not. In two cases, assemblage A was detected at the bg and gdh loci and assemblage B was detected at the tpi locus, whereas three isolates were typed as assemblage A at bg, but as assemblage B at the tpi and gdh loci.

Furthermore, the two samples typed at the gdh locus as novel genotypes of assemblage D were classified as novel genotypes of assemblage A at bg locus. Among the 32 samples for which genotyping data were available for at least two markers, the majority of them showed incongruent assignment to assemblages (21 out of 32 samples), in particular assemblage A at bg and assemblage B at tpi (19 out of 32 samples).

In order to further analyse cases of inconsistent assignment to assemblage, ten samples with available multilocus typing data were tested by qPCR assays with assemblage-specific primers (A and B) at the beta-giardin. The samples BfR163, BfR164, BfR167, BfR169 and BfR172, typed as assemblage A at bg, showed also an amplification signal for assemblage B, while in samples BfR162 and BfR168, unambiguously typed as B at bg, gdh, and tpi, the assemblage A was also detected by qPCR.

Polymorphisms at single loci

At the bg locus, 51 out of 61 samples were classified as assemblage A, 43 being identical to genotype A3 (accession number AY072724), whereas two were identical to genotype A2 (accession number AY072723). Four samples of these (8 % isolates) were new variants with unique SNP patterns, distinct from the six subtypes (A1–A6) previously described by Cacciò et al. (2008).

Among the ten isolates classified as assemblage B, the percentage of new variants was much higher (six out of ten) than among assemblage A isolates.

At gdh locus, the majority of isolates were typed as assemblage B (11 out of 17) and were characterised by heterogeneous positions at random sites (Supplementary Table S3). In six of the isolates, a sequence corresponded to a mixture of sub-assemblages BIII and BIV, and all of the isolates also contained mixed positions. Only four out of 17 isolates were classified as assemblage A, where no heterogeneous positions were observed. Interestingly, two samples (BfR163 and BfR172) were classified as assemblage D, which is considered to be adapted to canid hosts. These isolates were from two patients with symptomatic giardiasis who spent holidays both in Southeastern Asia. This result was confirmed by repeating PCR and sequencing.

Sequence analysis at the tpi locus showed 26 isolates genotyped as assemblage B and only three typed as assemblage A, with novel polymorphisms of assemblage B in six samples with unique SNPs (Supplementary Table S4). Eight isolates of assemblage B were characterised by heterogeneous positions at random sites not to be ascribed to mixed assemblages. No evidence of heterogeneous template was detected among the isolates grouped in assemblage A.

Geographical origin of the infection

Information about travel destinations (where most likely the Giardia infection was acquired) was available for 57 patients. The information concerning the areas visited by the infected patients was compared with figures about German travellers visiting yearly each area considered in this study (DESTATIS 2006). Although the data were insufficient for statistical comparison, India and west Africa seemed to be the areas associated to the highest risk of acquiring Giardia infection (Fig. 2). Further, in Table 1, the geographical origin of the isolates next to the genotyping results of at least two loci is reported. The most common genotype combination typed as assemblage A at bg and B at tpi (18 isolates) locus was detected in all the geographical areas considered in the present study. The seven isolates consistently assigned to assemblage B at all three markers (a relatively uncommon multilocus assemblage typed in this study), originated from India (five samples) and Central America (two samples). The two isolated typed as assemblage D at gdh marker were both originating from southern Asian continent (India and Southeast Asia).

Fig. 2
figure 2

Prevalence of giardiasis in patients returning from the areas considered compared to the destinations of German travellers

Discussion

The main objective of the present study was to investigate the genetic variability of Giardia duodenalis isolates collected from returning travellers by a multilocus analysis at the bg, gdh and tpi loci. This analysis, however, was negatively impacted by the low rate of amplification observed at the gdh and tpi loci (26 and 48 % of samples, respectively), compared with that obtained at the bg locus (92 %). These markers are all single-copy genes, thus the different rate of amplification cannot be explained by difference in target copy number. The presence of mismatches between the genomic sequences and the primers used for PCR, that may cause a strong reduction or even a lack of amplification, cannot be excluded. Interestingly, similar findings were reported by other researchers (Amar et al. 2002; Covacin et al. 2011; Robertson et al. 2006; Soliman et al. 2011). Soliman et al. (2011) reported a rate of amplification close to 50 % at the tpi and gdh loci, whereas 100 % of the samples could be amplified at the bg locus; Covacin et al. (2011) reported even only 7 % successful amplification at gdh locus.

Despite this limitation, our study provided several interesting results, which are discussed in turn below.

The most intriguing observation was the difficulty in assigning isolates to specific G. duodenalis assemblages. Indeed, only ten of the 31 samples for which sequence information was available at more than one locus were unequivocally assigned to a specific assemblage. On the contrary, most of the samples classified as assemblage A at the bg locus were classified as assemblage B at the tpi or the gdh locus. This lack of concordance in the assignment to assemblages has been reported in many recent studies (e.g. Cacciò et al. 2008, references) and can be explained by two very different mechanisms, namely mixed infections or recombination (Caccio and Sprong 2010). In the first case, two different assemblages are responsible for the infection of a single host and their detection depends on the relative proportion (with the majority population being favoured by PCR) and on the lack of bias during amplification. In the second case, genetic exchanges within or among assemblages can create recombinants that may contain specific sequences of assemblage A in a genome of assemblage B (or vice versa).

High frequency of multiple/mixed infections has been increasingly reported from multilocus studies in other countries in both humans and dogs and indeed hypothesis of meiotic recombination or preferential amplification of one assemblage over another in mixed infections have been suggested (Caccio et al. 2005; Covacin et al. 2011; Sprong et al. 2009; Wielinga and Thompson 2007).

To detect mixed infections, both conventional and real-time PCR assays, based on the use of assemblage-specific primers have been developed (Almeida et al. 2010; Geurden et al. 2009) and tested on DNA extracted from stools or purified cysts. In this study, a real-time PCR assay detected DNA of assemblages A and B in six of eight samples tested (two isolates were negative); four of these six isolates gave an inconsistent assignment when tested by conventional PCR (A or B in two cases, A or D in the other two cases), whereas the other two isolates were classified as assemblage B. Therefore, mixed infections can potentially explain only a fraction of cases where different assemblages were detected using different PCR assays. The occurrence of recombinants could not be investigated.

A second relevant finding was the identification of assemblage D (at the gdh locus) in two patients returning from India and Southeast Asia, G. duodenalis. The occurrence of the dog-adapted assemblage D has been already reported in human isolates from India (Traub et al. 2004) when genotyping was performed at the 18S rDNA locus, but this finding could not be supported by genetic characterization at other loci (ef1-α, tpi), as was the case in our study (see Table 1). It should be noted, however, that assemblage E was found in 15 % of the human isolates from Egypt by genotyping at the tpi locus (Foronda et al. 2008), and more recently, also assemblage C was found at bg locus in human patients from the same country (Soliman et al. 2011). Further, a study conducted in Ethiopia revealed the presence of assemblage F in three human isolates (Gelanew et al. 2007). The epidemiology of giardiasis in developing countries is greatly influenced by poor hygienic conditions and the closeness between humans and animals, which could favour the transmission of unusual parasitic strains. However, passive carriage of cysts from those assemblages cannot be excluded.

Our results reinforced previous data concerning the amount of polymorphism and the occurrence of allelic sequence heterogeneity (ASH) in isolates from assemblages A and B. Among the isolates classified as assemblage A (at the bg locus), the majority were identical to an already described genotype (A3), albeit three of them represented novel variants. In more than 30 kb of sequenced DNA, only four heterogeneous positions were found in three isolates, which do not represent mixed infections with other known assemblage A genotypes. At the gdh and tpi loci, the few samples ascribed to assemblage A were identical to the reference strain AII (Monis et al. 1996; Souza et al. 2007), and novel variants or heterogeneous sites were not detected. This is illustrated in Fig.1, where the heterogeneity of assemblage A isolates was at the lowest level at bg locus, and even absent at gdh and tpi locus. This result largely agrees with previous studies in which ASH in assemblage A isolates was very low or absent (Caccio et al. 2008; Morrison et al. 2007). On the contrary, isolates classified as assemblage B showed a larger amount of polymorphism and of ASH at all the three loci investigated, in agreement with previous studies (Caccio et al. 2008; Lebbad et al. 2008, 2011;Robertson et al. 2006; Sulaiman et al. 2004; Teodorovic et al. 2007; Winkworth et al. 2008). Novel genotypes were observed at all three markers, and the heterogeneity index for SNPs and heterogeneous positions was high, in particular at gdh and bg loci. The reasons for the different ASH levels in assemblage A and B are not understood, but a recent study has demonstrated, by typing single cysts isolated by micromanipulation, that ASH occurs between the two nuclei of single cysts but also between different cysts taken from the same patient (Ankarklev et al. 2012). Therefore, the genetics of Giardia is complex, and the occurrence of ASH complicates the assignment of isolates to specific genotypes, especially for assemblage B.

Finally, interesting observations were made with respect to the geographical origin of the infection. Indeed, by comparing the number of Giardia-positive travellers to the total number of tourists visiting these areas yearly, a higher risk of acquiring the infection was found to be associated with travel to India and west Africa. Although statistically weak, this finding agrees with previous studies that showed a higher prevalence of giardiasis among travellers returning from the Indian subcontinent and from west Africa compared to other tropical areas (Ekdahl and Andersson 2005; Jelinek and Loscher 2000). This increased risk could be associated to the poor management of water supply and wastewater treatment in areas with high-population density. The comparison of the genotyping results with the geographical origin of the isolates allows only limited conclusion, mainly due to the limited number of isolates. The majority of the isolates originated from India and were typed as assemblage A at bg and assemblage B at tpi, the most common multilocus assemblage registered in this study. Similarly, in other studies conducted in India, Giardia isolates in human samples were typed as assemblage A1 at bg and as assemblage B at tpi (Khan et al. 2011; Sulaiman et al. 2003; Traub et al. 2004). The samples originating from west Africa were typed mostly (five out of six samples) as assemblage A at bg, which is in contrast with another study conducted in west Africa (Guinea Bissau), where the predominance of assemblage B was detected at bg locus in 82 % of samples analysed (Ferreira et al. 2012). No particular geographical clustering pattern of multilocus assemblage was detected, but these results may provide a preliminary contribution to further spatial analysis of multilocus assemblage distribution, if assessed together with more genotyping data.

In conclusion, the results of this study have confirmed that the epidemiology of giardiasis still presents several unclear aspects. Further studies are needed to clarify the host specificity of this parasite, given the finding of unusual assemblages in human and animal isolates. Moreover, the molecular typing of isolates from travellers returning from endemic areas may provide a useful framework for comparing the genotype variability from different geographic regions and may help in investigating the pathogenic potential of different strains and in identifying risk-associated areas.