Introduction

The intracellular protozoan, Toxoplasma gondii, is an opportunistic parasite infecting all warm-blooded animals including human beings (Shaddel et al. 2014b). Although T. gondii often causes subclinical infection, it may cause neuropsychological manifestations including hydrocephalus, blindness, mental retardation and encephalitis and high rates of morbidity and mortality in congenitally-infected and immunocompromised patients (Shaddel et al. 2007; Sanecka et al. 2016). Furthermore, T. gondii infection may lead to remarkable economic losses due to stillbirths, abortion, and neonatal deaths in livestock which is a main source of the infection to human (Bai et al. 2012; Shaddel et al. 2014a). Therefore, the development of effective vaccines against this pathogen would be a crucial purpose to control the infection and reduce economic losses in the livestock industry (Meng et al. 2012). DNA vaccine could be an ideal technique against T. gondii because it is safer, able to induce a wider range of immune response types and easy to produce. DNA-based vaccination is considered as one of the most efficient strategies against intracellular parasites (Dumonteil 2007). Efforts to identify and construct DNA vaccine against T. gondii using surface proteins as ideal targets are increasing (Zhou et al. 2016). Based on the results from many studies, the most promising vaccine candidates of T. gondii include the dense granule (GRA), surface antigens (SAG proteins), rhoptry and microneme proteins (Couper et al. 2003; Martin et al. 2004).

It has been reported that the GRA antigens play a vital role in parasite survival and virulence and in forming the parasitophorous vacuole (Mercier et al. 2005; Mozaffarian et al. 2014). These proteins are highly expressed in the acute and chronic phases of the infection and can stimulate host immunity responses. Several GRA antigens in particular GRA-1, GRA-4, GRA6 and GRA-7 have been considered as suitable DNA vaccine candidates to control toxoplasmosis (Bivas-Benita et al. 2003) but how to decide which of these proteins should be chosen for in vivo studies remains a challenge. The identification of protein epitopes characters using bioinformatics tools will be useful for diagnostic purposes and development of DNA vaccine (Wang et al. 2013).

Bioinformatics, as a nexus of computer science, mathematics and biotechnology, is one of the newest scientific fields involved in biological issues. Bioinformatics approaches present useful results from large amounts of raw data (Romano et al. 2011). They have been extensively used to analyze gene and protein expression and predict structural, immunogenicity and general features of proteins. Study and comparison of the physical, chemical and immunogenicity characteristics of proteins can increase our knowledge regarding them and help the researchers select the proper epitopes for DNA vaccine investigations.

Therefore, the present study aimed at analyzing and comparing the structural, physical and chemical characters and immunogenicity of the GRA-1, GRA-4, GRA-6 and GRA-7 proteins using bioinformatics approaches.

Materials and methods

Data resources

The protein sequences of T. gondii GRA-1, GRA-4, GRA-6 and GRA-7 were obtained from ToxoDB 10.0 (http://toxodb.org/toxo/). T-Coffee server (http://tcoffee.crg.cat/apps/tcoffee/do:regular) was used to align the protein sequences. The secondary Structure of proteins was predicted using SOPMA secondary structure prediction method, available at (https://npsa-prabi.ibcp.fr/cgi-bin/npsa_automat.pl?page=npsa_sopma.html).

Determination of immunogenic epitope of each protein

In order to determine the immunogenic epitopes of GRA-1, GRA-4, GRA-6 and GRA-7 proteins, different parameters including hydrophilicity (Parker et al. 1986), beta turn prediction (Chou and Fasman 1979), antigenicity (Kolaskar and Tongaonkar 1990), flexibility (Karplus and Schulz 1985) and surface accessibility (Emini et al. 1985) were separately calculated for each protein by Immune Epitope Database (IEDB) Analysis resources (http://tools.iedb.org/bcell/). The epitopes with high score were then selected. The general features of the proteins including molecular weight, the estimated half-life, amino acid composition and stability index were predicted by Prot-Param (http://web.expasy.org/protparam/). The selected epitopes of proteins were then linked together using a linker to form different multi-hybrid epitopes.

Comparison of the single and multi-hybrid epitopes

To determine the proper epitopes for DNA vaccine study against T. gondii, all single and multi-hybrid epitopes of the GRA proteins were compared together based on their immunogenicity, secondary structure and general features using mentioned software and bioinformatics tools.

Results

Protein information and Sequence Alignment Analysis of the GRA proteins

The GRA-1, GRA-4, GRA-6 and GRA-7 genes encoded proteins of 190, 345, 179 and 236 amino acids, respectively. The molecular weight (MW) and Theoretical isoelectric point (T pI) of the proteins were as follow: GRA-1 (MW: 20 kDa, T pI: 4.15), GRA-4 (MW: 36 kDa, T pI: 7.05), GRA-6 (MW: 18 kDa, T pI: 6.75) and GRA-7 (MW: 26 kDa, T pI: 5.08).

Multiple sequence alignment of GRA-1, GRA-4, GRA-6 and GRA-7 showed that the four proteins sequences had low similarity (Fig. 1).

Fig. 1
figure 1

Protein sequence alignment of GRA-1, GRA-4, GRA-6 and GRA-7 using T-Coffee

Secondary Structure for GRA-1, GRA-4, GRA-6 and GRA-7 Proteins

SOPMA method was used to predict the secondary structures of the proteins. Figure 2 showed that GRA-1 protein has 12 alpha helices, 6 extended strands, 9 -turn and 14 coils; GRA-4 protein has 10 alpha helices, 13 extended strands, 15 -turn and 19 coils; GRA-6 protein has 11 alpha helices, 7 extended strands, 9 -turn and 15 coils, and GRA-7 protein has 10 alpha helices, 6 extended strands, 5 -turn and 13 coils.

Fig. 2
figure 2

Secondary structure analysis of the GRA proteins according to the SOPMA secondary structure prediction method. h: alpha helix, c: random coil regions, e: extended strand regions, t: -turn

Results indicated that GRA-1 and GRA-6 have similar secondary structure except for a little discrepancy.

Prediction of hydrophilicity

In the present study, the hydrophilic/hydrophobic regions of the proteins were predicted using the scale Hphob./Abraham & Leo in the ProtScale program. Multiple hydrophilic regions and some classical high hydrophilic domains of each epitope can be observed in Fig. 3.

Fig. 3
figure 3

The hydrophilicity/hydrophobicity of the GRA proteins

Prediction of immunogenic epitopes

The immunogenicity of the GRA-1, GRA-4, GRA-6 and GRA-7 epitopes was predicted by IEDB Analysis resources (Fig. 4). This program evaluates immunogenicity of the epitopes based on parameters described previously and extracts the data as score. Numeral indexes of the half-time, stability index, antigenicity property and immunogenicity of the single (Table 1) and multi-hybrid epitopes (Table 2) of GRA-1, GRA-4, GRA-6 and GRA-7 proteins are summarized.

Fig. 4
figure 4

Immunogenic epitopes of the GRA-1, GRA-4, GRA-6 and GRA-7 proteins

Table 1 Immunogenicity, antigenicity and general features of the single epitope of GRA-1, GRA-4, GRA-6 and GRA-7 proteins (using ProtParam, Predicted Antigenic Peptides software)
Table 2 Immunogenicity, antigenicity and general features of the multi-hybrid epitope of GRA-1, GRA-4, GRA-6 and GRA-7 proteins (using ProtParam, Predicted Antigenic Peptides software)

As observed in Table 1, the GRA-1 and GRA-4 epitopes are stable and GRA-4 shows the highest degree of antigenicity. Although the GRA-7 epitope has the highest score of immunogenicity, this epitope is instable and has the lowest degree in antigenicity and half-time in eukaryotic cell. Based on data presented in Table 2, the only stable multi-hybrid epitope is GRA1–GRA4. Also, the results show that GRA4–GRA7 epitope and GRA6–GRA7 have the highest degree of antigenicity and immunogenicity, respectively. Given the information presented in this study, GRA-4 epitopes seems to be the most suitable candidate for DNA vaccine study among the studied epitopes.

Discussion

Bioinformatics approaches play an undeniable role in vaccine development to predict protein structures, functions, and other biological characteristics and to understand the specific immune response of selected vaccine epitopes (Lundegaard et al. 2010; Zhang et al. 2013). One of the main goals of the bioinformatics is to increase our knowledge about the biological processes and improve the health level using new vaccines design. DNA vaccination is a powerful technique with strong potential to induce specific humoral and cellular immune responses (Hiszczynska-Sawicka et al. 2011). Given the importance of toxoplasmosis in human health and also recent developments in designing DNA vaccine based on GRA proteins against this disease, we carried out a study to analyze and compare the structural, physical and immunogenicity features of the single and multi-epitopes of GRA-1, GRA-4, GRA-6 and GRA-7 proteins using bioinformatics tools and websites to design proper DNA vaccine against T. gondii. To the best of our knowledge, there have been no reported studies about the bioinformatics analysis of GRA proteins in Iran and there are few researches regarding the subject in other countries. Bioinformatics analyses of protein structure to predict the immunity contributed to the development of a new vaccine candidate against T. gondii (Cong et al., 2010).

In the present study, hydrophilicity/hydrophobicity of the proteins was predicted. Analysis of amino acid hydrophilicity/hydrophobicity presents suitable information to understand the protein folding, analyze the protein interaction sites and predict the protein secondary structure (Bai et al. 2012). The secondary structures have a major impact on the epitopes. The α-helices and β-turn, located in the inner parts of the protein, have a high hydrogen-bond energy which can preserve the structure of a protein and make strong interactions with antibodies. We use IEDB to predict the immunogenicity of the epitopes based on five parameters including hydrophilicity, beta turn prediction, antigenicity, flexibility and surface accessibility. Wang et al. (2014) reported that these methods reliably predicted the proper epitopes for DNA vaccine development. The IEDB Analysis Resource database uses NetMHC pan as prediction method since 2011. This method generates a quantitative prediction of the affinity of any peptide-MHC class I interaction, covering HLA-A and HLA-B for humans as well as chimpanzee, macaque, gorilla, cow, pig and mouse. (Hoof et al. 2009). Hydrophilicity, beta-turns, secondary structure, and surface accessibility are the important features of the amino acids which can present significant and useable data of epitopes for biological studies including DNA vaccine. (Wee et al. 2010; Zhao and Li 2010). Epitope prediction methods are mostly based on several features (not one) of the proteins because researchers believe that only by evaluation one character we cannot access to sufficient and accurate information regarding the epitope (Kringelum et al. 2012).

In the present study, among single and multi-epitopes, GRA-4 showed the highest degree of antigenicity. Given the scores obtained from the studied parameters, GRA-4 seems to be a proper antigen to use as a target in developing of DNA vaccine against T. gondii. This protein is considered as one of the most important T. gondii’s surface antigens that has been used to design DNA vaccine. The main epitope in GRA-4 structure is epitope B which includes 11 amino acids from the C terminal of GRA-4. The second most important epitope of the GRA-4 is located between amino acids 318 and 334 (Mévélec et al. 1998). In the present study, for GRA-4 epitopes, amino acids 236–273 showed the highest degree of immunogenicity. Meisel et al. (1996) reported that in GRA-4 protein, amino acids ranging from 229–242 to 231–245 induce cellular and humoral immunity against T. gondii. The GRA4 231–245 peptide is immunogenic and considered a suitable target for DNA vaccine design (Wang et al. 2014). We demonstrated that peptide 205–231 amino acids of GRA-7 had the greatest degree of immunogenicity based on studied parameters. However, stability index and the estimated half-time of this epitope were lower than other studied proteins.

In the current study, single epitopes had higher degree of antigenicity compared with multi-epitopes. Nevertheless, some studies reported that multi-epitope antigens are of the most important antigens for the development of diagnostic kits for toxoplasmosis screening (Dai et al. 2012). Among the studied multi-epitopes, GRA7–GRA1 epitope showed high degree of immunogenicity. This finding is consistent with the results recorded by Jongert et al. (2007). Moreover, immunization of T. gondii-infected mice with DNA vaccine encoding GRA-1 and GRA-7 leads to protective immune responses and reduction of brain cysts. DNA vaccination with GRA1–GRA7 (in mixture form) induced protection against infection with different virulent T. gondii strains in mice (Jongert et al. 2007).

Bioinformatics analysis in the present study indicated that the GRA-6, one of the favorite antigens for DNA vaccine researches against T. gondii, had the longest half-time compared with other epitopes. GRA-6 is a highly immunogenic candidate to design vaccine against toxoplasmosis. This protein is the source of the immunodominant HF10 peptide which is located at the carboxyl terminus of GRA6 and presented by the H-2Ld major histocompatibility complex class I molecule (MHC class I). It induces a protective immune response in BALB/c mice carrying the H-2Ld molecule against T. gondii infection (Nicolas and Federico 2008).

The GRA6 gene encodes a 230-amino-acid polypeptide and does not contain any introns. The deduced polypeptide contains two hydrophobic regions with the characteristics of transmembrane domains. The central hydrophobic domain is surrounded by two hydrophilic domains which contain four blocks of amino acids. The C-terminal hydrophilic region comprises 24% of glycine residues indicating a structural role for GRA6 in the network (Lecordier et al. 1995).

Bioinformatics can play a significant role in study of antigenic proteins, by which we can understand the structures and functions of an antigen and select proper epitopes without running the risks involved in cultivating the pathogen of interest. Epitopes prediction can reveal the pathogenesis and immune mechanisms of pathogens, and prepare crucial data for the development of diagnostic reagents and design of immunogenic peptides and new vaccines (Bai et al. 2012). This approach presents some privileges such as faster output and less cost compared with traditional vaccinology methods (Tong and Ren 2009).

Conclusion

The present study showed the information of structure and basic characteristics of the GRA proteins. The results from the current study should contribute to the development of a highly protective DNA vaccine against T. gondii. Taking account of the predicted results, it can be concluded that GRA-4 and GRA-7 are potential proteins with strong antigenicity; hence, they can be powerful DNA vaccine candidates against T. gondii. The present study provided a theoretical basis for the screening of effective DNA vaccine candidate components against infection with T. gondii. The results of the present study should be considered in further experimental researches to determine the suitable GRA epitopes for DNA vaccine study against T. gondii.