Introduction

Gastrodiscidae species are important paramphistomes parasiting in the large intestine of small ruminants such as goats, sheep and cattles, and Homalogaster paloniae (Poirier, 1883) is one of the most common species (Li 2011; Taylor et al. 2007). As a neglected pathogen, H. paloniae have been reported in Burma, China, Formosa, India, Indonesia, Japan, Philippines, and Thailand (Yamaguti 1971). H. paloniae (Poirier, 1883) usually inhabits in the caecum of animals. Mature eggs are expelled with feces from the intestine into the environment and will develop into miracidium after several days under favor conditions. Subsequently, miracidium will invade into freshwater snails (intermediate host). Miracidium will develop into cercaria, and escape from the freshwater snails, then develop into metacercaria on the water plants. Ruminants infect H. paloniae by intaking water plants polluted by metacercaria. Although animals infected with H. paloniae usually do not show obvious symptoms, it can cause considerable losses to the breeding of sheep and cattle under heavy burden (Guoqing 2006).

Since no effective vaccine is available, application of chemical drugs is the main methods for the prevention and control of H. paloniae and other paramphistomes. Accurate diagnosis of H. paloniae infection is essential for the prevention and control of this species. Traditional morphological methods have been widely used for a long time; however, these methods are time-consuming and inaccurate (Bott et al. 2009). Based on these restricted factors, molecular methods based on PCR were developed for species identification (Itagaki et al. 2003; Morgan and Blair 1995). Recently, the mitochondrial genome has been used for species identification, population diversity, phylogenetic analysis, and so on (Cheng et al. 2016; Choi et al. 2012; Liu et al. 2012, 2015). Besides, advances in PCR-coupled sequencing together with bioinformatics analysis have been proved to be useful for the molecular study of species (Jex et al. 2010). In the present study, we aimed (i) to characterize the H. paloniae mt genome, and this is the first report about full mt genome of Gastrodiscidae; (ii) to assess the phylogenetic relationship between Gastrodiscidae and other family; and (iii) to provide useful information for further study of species identification, biology, population genetics and phylogenetic analysis.

Materials and methods

Parasites and DNA extraction

Adult flukes were collected from the caecum of a naturally infected black goat post-mortem in Macheng, Hubei province, PR China. After sufficiently washed with physiological saline, the specimens were fixed in 75 % (v/v) ethanol and preserved at −20 °C until use. These flukes were identified to be H. paloniae based on key morphological characteristics and parasitic positions (Guoqing 2006; Li 2011; Taylor et al. 2007).

Subsequently, total genomic DNA of H. paloniae was isolated from single worm by a kit (E.Z.N.A.® Tissue DNA Kit, D3396-01). The ITS-2 (the second internal transcribed spacer) region of H. paloniae was amplified and sequenced for further molecular identification (Itagaki et al. 2003); the sequence was 100 % similarity with a sequence available for H. paloniae (GenBank accession no. KM281535.1).

PCR-coupled sequencing of H. paloniae mitochondrial genome

Firstly, seven pairs of primers (Table 1) were designed and synthesized to amplify short fragments from nad1, nad4, cox1, cox2, nad5, rrnS, and cytb based on the conserved regions of the mt genomes of Fasciola hepatica (Le et al. 2001a) and Paramphistomum cervi (Yan et al. 2013). PCR reactions (25 μl) were as follow: 1 × Taq polymerase buffer, 0.2 mM each of dNTP, 0.5 μM of each primer, 2 U Taq polymerase (Takara), and 2.5 μl genomic DNA in a thermocycler (Biometra) under the following conditions: 94 °C for 5 min, followed by 35 cycles of 94 °C/30 s, 50 °C/30 s, 72 °C/1 min, and a final extension of 72 °C/7 min. Subsequently, PCR products were identified by agarose gel electrophoresis, and positive amplicons were sent for sequencing in both directions. Later, seven pairs of primers were designed based on the obtained seven short fragments to amplify the remaining sequences of the complete mt genome in seven long-PCR reactions (Table 1). The long-PCR reactions (25 μl) were performed in 0.8 mM of each dNTPs, 2.5 μl 10× LA Taq buffer, 0.5 μM of each primer, 2.5 U LA Taq polymerase (Takara), and 2.5 μl genomic DNA sample. PCR reactions were carried out under the following condition: 94 °C for 5 min, then 35 cycles at 94 °C/30 s, and annealed at 50 °C/30 s, followed by extension at 72 °C/8 min, and a final extension of 72 °C/7 min. Then, positive PCR amplicons were purified and cloned into pGEM-T vector (Promega, USA) for sequencing using a primer-walking strategy.

Table 1 Primers used for amplifying the mitochondrial genome of Homalogaster paloniae

Assembly, annotation, and bioinformatics analyses

The complete H. paloniae mt sequences were assembled by bioinformatics analysis and manually adjustment. Gene boundaries of the mt genome sequence of H. paloniae were predicted by aligning against that of F. hepatica (Le et al. 2001a) and P. cervi (Yan et al. 2013) using the software Clustal X 1.83 (Thompson et al. 1997). The protein-coding genes, transfer RNA, ribosomal RNA, and non-coding regions were annotated as reported (Yan et al. 2013).

Nucleotide comparison of the complete mt genome of H. paloniae and other 17 selected digeneans was conducted. Meanwhile, the AT-skew and GC-skew were calculated as previous study (Baek et al. 2014; Yan et al. 2014).

Sliding window analysis of nucleotide variation

Pairwise alignment of the complete mt genome of H. paloniae, Fischoederius elongates, P. cervi, Gastrothylax crumenifer, and Ogmocotyle sikae was accomplished by MEGA v6.0 to predict variable nucleotide sites (Tamura et al. 2013). Subsequently, a sliding window analysis of H. paloniae, F. elongates, P. cervi, G. crumenifer, and O. sikae was accomplished using DnaSP v.5.0 to assess the nucleotide variation diversity for the 12 protein-coding genes among these five paramphistomes (Librado and Rozas 2009).

Phylogenetic analysis

All 12 protein-coding genes of H. paloniae mt genome and other 17 selected digeneans were translated, concatenated, and aligned for phylogenetic analysis, including Clonorchis sinensis (NC_012147) (Shekhovtsov et al. 2010), Fasciola gigantica (NC_024025) (Liu et al. 2014), F. hepatica (NC_002546) (Le et al. 2001a), Fischoederius elongatus (KM_397348) (Yang et al. 2015b), G. crumenifer (KM_400624), Haplorchis taichui (NC_022433.1) (Lee et al. 2013), Hypoderaeum conoideum (KM111525) (Yang et al. 2015a), Metagonimus yokogawai (KC330755.1), O. sikae (KR006934) (Ma et al. 2015), Opisthorchis felineus (EU_921260) (Shekhovtsov et al. 2010), Opisthorchis viverrini (JF729304.1) (Cai et al. 2012), P. cervi (NC_023095.1) (Yan et al. 2013), Schistosoma haematobium (NC_008074) (Littlewood et al. 2006), Schistosoma japonicum (AF215860) (Le et al. 2001b), Schistosoma mekongi (NC_002529) (Le et al. 2000), Schistosoma spindale (NC_008067) (Littlewood et al. 2006), and Trichobilharzia regent (NC_010976) (Webster et al. 2007). And Taenia solium (NC_004022.1) (Nakao and Sako 2003) was included as an outgroup control.

The amino acid sequences were aligned and subjected to phylogenetic analysis by maximum likelihood methods using MEGA v.6.0 with default settings (Tamura et al. 2013).

Results

Genome content and organization

The complete mitochondrial (mt) genome of H. paloniae (GenBank accession no. KT266674) is 14,490 bp in length (Fig. 1) and contains 12 protein-coding genes, 22 transfer ribonucleic acid (tRNA) genes, two ribosomal ribonucleic acid (rRNA) genes (rrnS and rrnL), and two non-coding regions (Table 2). All the genes are transcribed in the same direction, which is in accordance with other digeneans (Le et al. 2001a; Yan et al. 2013). The gene arrangement is similar with other digeneans except for S. haematobium and S. spindale (Littlewood et al. 2006).

Fig. 1
figure 1

Arrangement of the mitochondrial genome of Homalogaster paloniae. Genes organization of 12 protein-coding genes, 22 tRNA gene, two RNA genes, and two non-coding regions in the mitochondrial genome of Homalogaster paloniae

Table 2 Content of the Homalogaster paloniae mitochondrial genome

As for the nucleotide composition, H. paloniae mt genome is obviously favor in T (Table 3). The nucleotide contents in the complete mt genome are 21.92 % (A), 9.28 % (C), 43.21 % (T), and 25.6 % (G). And the A + T content of mt genes range from 63.23 to 71.04 %, total A + T content is 65.12 %.

Table 3 Nucleotide composition of the mitochondrial genome of Homalogaster paloniae

Annotation of H. paloniae mt genome

The H. paloniae mt genome has 12 protein-coding genes (Fig. 1). For these genes, the most commonly used start codon is ATG (nine of 12 protein genes), and GTG is used by the remaining genes (three of 12 protein genes) (Table 2), which is in agreement with other digeneans (Cai et al. 2012; Le et al. 2001a; Littlewood et al. 2006; Yan et al. 2013; Yang et al. 2015a). The termination codon is TAA for nad5, and TAG for the rest genes. No incomplete codons are used in the mt genome of H. paloniae.

The 12 protein-coding genes encode 3359 amino acids excluding the termination codons (Table 4). Among all the amino acids, Phe (TTT 10.00 %) is the most used, followed by Leu (TTG 7.09 %), and Leu (TTA 6.91 %). The least used codon is Arg (CGC 0.12 %), followed by Leu (CTC 0.15 %), and Arg (CGG 0.15 %).

Table 4 Codon usage for 12 protein-coding genes in Homalogaster paloniae mitochondrial genome

As for the tRNA genes and rRNA genes, the length of the 22 tRNA genes ranged from 59 to 72 bp (Table 2). The size of rrnS and rrnL were 741 and 984 bp, respectively (Table 3). The location of rrnS is between trnC and cox2 and rrnL is between trnT and trnC, and their A + T content was 63.70 and 65.14 %, respectively (Table 3).

In the H. paloniae mt genome, two non-coding regions were recognized based on their AT-rich features and locations (Yan et al. 2013), one short non-coding region (SNR 71 bp) and one long non-coding region (LNR 1022 bp) (Table 2). The location of SNR is between cytb and nad4L, and LNR is located between trnE and cox3.

Comparative analyses of the mt genomes of H. paloniae and other digeneans

Nucleotide composition, AT skews and GC skews of the mt genome of H. paloniae and other digeneans were presented in Table 5. All the 18 digeneans mt genomes are rich in A + T. The nucleotide composition of H. paloniae is biased to T compared with A (AT skew = −0.327), and biased to G compared with C (GC skew = 0.468), which is in accordance with that of other digeneans.

Table 5 Comparisons of nucleotide composition of the full genome of selected digeneans, including Homalogaster paloniae

Nucleotide variability

The sliding window analysis was showed in Fig. 2; the highest level of nucleotide variability was within cox1, and the lowest was within cox3. In our study, cox1 and nad6 are the most conserved genes, and cox3 and cytb are the least conserved.

Fig. 2
figure 2

A sliding window analysis of complete mt genome sequences of Homalogaster paloniae, Fischoederius elongates, Paramphistomum cervi, Gastrothylax crumenifer, and Ogmocotyle sikae. The black line in the picture showed nucleotide diversity in the sliding window analysis (windows = 300 bp; steps = 10 bp). There are two overlapping genes in the protein-coding genes, one is between Nad4L and nad4, and the other is between cox2 and nad6. All the 12 protein-coding genes are indicated using grey boxes

Phylogenetic analyses

Waeschenbach and colleagues reported that the complete mt sequences are more reliable for phylogenetic analyses (Waeschenbach et al. 2012). Based on previous study, the concatenated amino acid sequence data of the 12 protein-coding genes of H. paloniae and other 17 digeneans (C. sinensis, F. gigantica, F. hepatica, F. elongatus, G. crumenifer, H. taichui, H. paloniae, H. conoideum, M. yokogawai, O. sikae, O. felineus, O. viverrini, P. cervi, S. haematobium, S. japonicum, S. mekongi, S. spindale, and T. regent) and one tapeworm (T. solium, as an outgroup) were used for the phylogenetic study. The relationship of H. paloniae with selected digeneans was showed in Fig. 3. The phylogenetic tree contains two clades with significantly strong support (100 %), one contains 13 members from eight families (Opisthorchiidae, Heterophyidae, Echinostomatidae, Fasciolidae, Notocotylidae, Gastrodiscidae, Paramphistomidae, and Gastrothylacidae), and the other contains five members from the Schistosomatidae family. The tree indicated that H. paloniae was together with other paramphistomes including F. elongatus, G. crumenifer, O. sikae, and P. cervi in one sub-clade, but separated from F. gigantica and F. hepatica from Fasciolidae, and H. paloniae has the closest relationship with members from Paramphistomidae and Gastrothylacidae that inhabiting in small ruminants. Nevertheless, more mt genomes from digeneans are needed for further phylogenetic analyses in the future.

Fig. 3
figure 3

The phylogenetic relationships of Homalogaster paloniae and other digeneans based on concatenated amino acid sequence data representing 12 protein-coding genes using Taenia solium as an outgroup

Discussion

As an important paramphistome, H. paloniae can lead to considerable economic losses to the breeding industry of small ruminants under heavy burden. Although the development of advances in technology, knowledge about epidemiology, biology, and genetics is still limited.

The present study firstly characterized the mt genome of H. paloniae. The gene content and organization are the same as other digeneans. Knowledge of the H. paloniae mt genome should provide useful for comparative study of this species and other digeneans.

As for the complete mt of H. paloniae, the gene arrangement is the same as other digeneans except for S. haematobium and S. spindale (Littlewood et al. 2006). All the protein-coding genes use complete codons, which is in accordance with other selected digeneans. Among all the 18 digeneans, all species show strand asymmetry (AT skew = −0.513 ∼ −0.159; GC skew = 0.359 ∼ 0.512). With the accomplishment of sliding window analysis, cox1 gene is the most conserved region among these four paramphistomes; this is in accordance with previous studies, which indicated the conserved characteristics of cox1 gene (Chibwana et al. 2013; Pérez-del-Olmo et al. 2014; Rollinson et al. 2009). Phylogenetic analyses can provide a basic understanding of the relationship of H. paloniae with other digeneans.

Although H. paloniae is closer to Fasciolidae in shape, phylogenetic analysis based on the complete mt genome of H. paloniae and other digeneans indicated that H. paloniae is closely related to paramphistomes, this is in accordance with their relationship in taxonomy.

Now, the H. paloniae mt genome is available; this should provide useful information for the study of epidemiology, biology, species identification, population genetic, and phylogenetic analyses.

Conclusions

In conclusion, our study firstly reported the complete mt genome sequence of H. paloniae and compared the mt genome of H. paloniae with other selected digeneans. The H. paloniae mt genome is the first mt genome available for Gastrodiscidae. Knowledge of mt genome of H. paloniae should enrich the mt genome databases of digeneans and also provide useful information for the study of epidemiology, biology, population genetics, as well as phylogenetic analyses.