Abstract
Glycoside hydrolases (GHs) have been classified in the CAZy database into 153 GH families. Currently, there might be four α-amylase families: the main family GH13, the family GH57 with related GH119 and, eventually, also GH126. The family GH57 was established in 1996 as the second and smaller α-amylase family. In addition to α-amylase, it contains 4-α-glucanotransferase, α-glucan branching enzyme, amylopullulanase, dual-specificity amylopullulanase–cyclomaltodextrinase, non-specified amylase, maltogenic amylase and α-galactosidase. The family GH57 enzymes employ the retaining reaction mechanism, share five typical conserved sequence regions and possess catalytic (β/α)7-barrel succeeded by a four-helix bundle with the catalytic machinery consisting of catalytic nucleophile and proton donor (glutamic acid and aspartic acid at strands β4 and β7, respectively). The present bioinformatics study delivers a detailed sequence comparison of 1602 family GH57 sequences with the aim to highlight the uniqueness of each enzyme’s specificity and all eventual protein groups. This was achieved by creating the evolutionary tree focused on both the enzyme specificities and taxonomical origin. The substantial increase of numbers of sequences from recent comparisons done more than 5 years ago has allowed to refine the details of the sequence logos for the individual enzyme specificities. The study identifies a new evolutionary distinct group of α-galactosidase-related enzymes with until-now-undefined enzyme specificity but positioned on the evolutionary tree on a branch adjacent to α-galactosidases. The specificity of α-galactosidase is, moreover, the only one of the entire family GH57 for which there is no structural support for the proposal of the proton donor based on sequence analysis. The analysis also suggests a few so-called “like” protein groups related to some family GH57 enzyme specificities but lacking one or both catalytic residues.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
α-Amylase (EC 3.2.1.1) is a glycoside hydrolase (GH) catalyzing in an endo-fashion the hydrolysis of α-1,4-glucosidic linkages in starch and related polysaccharides and α-glucans. Despite the fact that the catalytic action of any α-amylase should be, in principle, the same, different protein molecules may have evolved even within the same organisms to possess the same catalytic activity of the α-amylase (Janecek et al. 2014). This means that in the Carbohydrate-Active enZymes database (CAZy; http://www.cazy.org/; Cantarel et al. 2009), there have been created more α-amylase GH families reflecting especially unambiguous differences in amino acid sequences. Currently, the CAZy families GH13, GH57, GH119 and eventually also GH126 are considered as α-amylase families (Janecek et al. 2014).
The main α-amylase family, the family GH13, was established in 1991 (Henrissat 1991). At that time, regardless of the newly introduced concept of sequence-based classification of glycosidases, several α-glucan-active enzymes, e.g. cyclodextrin glucanotransferase, α-glucosidase and pullulanase, grouped around the α-amylase, were recognized to share some sequence similarities, catalytic residues and overall fold of their catalytic domain (Svensson 1988; MacGregor and Svensson 1989; Jespersen et al. 1991, 1993; Takata et al. 1992). At present, the α-amylase family GH13 represents one of the largest GH families within the CAZy database counting more than 57,000 members covering more than 30 various enzyme specificities (Janecek et al. 2014; Lombard et al. 2014). The really huge number of sequences that has still been rapidly increasing led to definition of subfamilies (Oslancova and Janecek 2002), which resulted in dividing the family into official GH13 subfamilies by CAZy curators (Stam et al. 2006), the subfamily members exhibiting a higher degree of sequence similarity to each other than to members of other GH13 subfamilies. Overall, the members of the α-amylase family GH13 employ the retaining reaction mechanism and share four–seven conserved sequence regions (CSRs; Janecek 2002), catalytic machinery and the (β/α)8-barrel (i.e. TIM-barrel) fold of the catalytic domain (Kuriki and Imanaka 1999; MacGregor et al. 2001; van der Maarel et al. 2002). In a wider sense, the family GH13 constitutes the clan GH-H together with related families GH70 and GH77 (Cantarel et al. 2009; Janecek and Gabrisko 2016).
The family GH57, created in 1996 (Henrissat and Bairoch 1996) has subsequently been established as the second and smaller α-amylase family (Janecek et al. 2014). In fact, it was based on the existence of sequences of two assumed α-amylases, one from thermophilic bacterium Dictyoglomus thermophilum (Fukusumi et al. 1988) and the other one from hyperthermophilic archaeon Pyrococcus furiosus (Laderman et al. 1993a), that were mutually similar, but obviously have lacked the sequence features, i.e. CSRs characteristic of the family GH13 (Janecek 1997). Although both these fundamental family GH57 members are now recognized as 4-α-glucanotransferases (Laderman et al. 1993b; Nakajima et al. 2004; Janecek et al. 2014), the family has remained to be known as the α-amylase family despite the fact that the only amylolytic enzyme characterized as the α-amylase was shown to exhibit also the pullulanase specificity (Kim et al. 2001).
The family GH57 possesses its own basic characteristics that discriminate it from the family GH13 as follows: (1) the catalytic domain adopts the fold of the so-called incomplete TIM-barrel, i.e. a seven-stranded (β/α)7-barrel (Imamura et al. 2003; Dickmanns et al. 2006; Palomo et al. 2011; Santos et al. 2011; Park et al. 2014; Na et al. 2017); (2) the catalytic machinery consists of two residues—a glutamic acid as a catalytic nucleophile and an aspartic acid as a proton donor located at the strands β4 and β7, respectively, of the incomplete TIM-barrel (Imamura et al. 2001; Palomo et al. 2011); and (3) there are five CSRs representing the “sequence fingerprints” of the family GH57 members (Zona et al. 2004; Blesak and Janecek 2012). Both families GH57 and GH13, however, are similar to each other in employing the same retaining reaction mechanism (Rye and Withers 2000).
With regard to families GH119 and GH126, the former containing only one experimentally confirmed α-amylase from Bacillus circulans (Watanabe et al. 2006) was found closely related to family GH57 (Janecek and Kuchtova 2012), whereas the latter with the only characterized amylolytic enzyme from Clostridium perfringens (Ficko-Blean et al. 2011) exhibiting, depending on the substrate, both endo- and exo-type of activity cannot be considered a pure α-amylase family since it exhibits also structural homology to β-glucan active endoglucanases from inverting families GH8 and GH48 (Janecek et al. 2014).
The main goal of the present study was to perform a detailed and overall bioinformatics analysis of the entire α-amylase family GH57. The study was undertaken in an effort to compare and divide the family GH57 members to as many as possible groups/subfamilies that could reflect the individual GH57 enzyme specificities and/or protein groups. The objective was to refine the “sequence fingerprints” covering the five previously described CSRs of the individual established enzyme specificities and, eventually, to identify novel, until now unrecognized GH57 groups, and thus to contribute further to the evolutionary picture of this interesting enzyme family.
Materials and methods
Sequence collection and comparison
Sequences were collected according to the information for the family GH57 in the CAZy database (Lombard et al. 2014), except for the specificity of maltogenic amylase (or maltose-forming amylases) that as yet has not been assigned to any CAZy family despite the fact that it was demonstrated to exhibit all the sequence-structural features characteristic of the family GH57 (Blesak and Janecek 2013; Jeon et al. 2014; Jung et al. 2014; Park et al. 2014). The sequences of maltogenic amylases, currently kept in CAZy among the “non-classified” sequences, were obtained by protein BLAST search (Altschul et al. 1990) using the sequence of maltogenic amylase from Pyrococcus sp. ST04 (Jung et al. 2014; UniProt accession no.: I3RE04) as a query. All studied sequences were retrieved from GenBank (Benson et al. 2018) and/or UniProt (UniProt Consortium 2017) sequence databases.
Sequences were preliminary aligned using the Clustal-X (Larkin et al. 2007) with regard to five CSRs typical for the family GH57 (Zona et al. 2004; Blesak and Janecek 2012). Those obviously lacking any of the five CSRs were eliminated from further analysis. If there was an uncertainty in identifying any of the five CSRs, a three-dimensional structure was modelled using the Phyre-2 server (http://www.sbg.bio.ic.ac.uk/phyre2/; Kelley and Sternberg 2009) and the CSR was confirmed by structure-based alignment and also structure overlay by the programme MultiProt (http://bioinfo3d.cs.tau.ac.il/MultiProt/; Shatsky et al. 2004).
The approach described above has resulted in collecting 1602 GH57 sequences (Table S1) that were divided into clusters covering previously determined enzyme specificities and those reflecting potentially novel protein groups (Table 1).
For CSRs of each identified enzyme specificity and/or protein group sequence, logos were created using the WebLogo 3.0 server (http://weblogo.threeplusone.com/; Crooks et al. 2004).
Evolutionary analysis
The evolutionary tree of all 1602 GH57 sequences (Table 1) was calculated based on the alignment of all five CSRs as a Phylip-tree type using the neighbour-joining clustering (Saitou and Nei 1987) and the bootstrapping procedure with 1000 bootstrap trials (Felsenstein 1985) implemented in the Clustal-X package (Larkin et al. 2007). The tree was displayed with the program iTOL (http://itol.embl.de/; Letunic and Bork 2011).
Structure comparison
Three-dimensional structure for Thermococcus litoralis 4-α-glucanotransferase (PDB code: 1K1Y; Imamura et al. 2003), as the family GH57 representative, was retrieved from the Protein Data Bank (PDB) (Rose et al. 2015). Three-dimensional structural models for the α-galactosidase from Pyrococcus furiosus (UniProt Acc. No.: Q9HHB5) and two members of the newly identified group of the α-galactosidase-related enzymes, i.e. from Clostridium kluyveri (UniProt Acc. No.: A5MZ16) and Shewanella baltica (UniProt Acc. No.: A3D6T3), were created with the Phyre-2 server (http://www.sbg.bio.ic.ac.uk/phyre2/; Kelley and Sternberg 2009). The obtained structural models were superimposed with the real structure of T. litoralis 4-α-glucanotransferase using the programme MultiProt (http://bioinfo3d.cs.tau.ac.il/MultiProt/; Shatsky et al. 2004) and the structures were visualized with the program WebLabViewerLite (Molecular Simulations, Inc.).
Results and discussion
Evolutionary relationships
The present study may represent the most complete and detailed bioinformatics analysis of the α-amylase family GH57 since it delivers a comparison of 1602 GH57 sequences (Table S1). Of these, 1568 sequences were retrieved from the family GH57 of the CAZy database directly, whereas remaining 34 sequences of the specificity of maltogenic amylase were obtained using the BLAST. This was because the three biochemically characterized maltogenic amylases have still not been classified within the family GH57, although previous in silico analysis (Blesak and Janecek 2013) along with cloning, sequencing and structural studies (Comfort et al. 2008; Jeon et al. 2014; Jung et al. 2014; Park et al. 2014) have clearly suggested they exhibit all sequence/structural features characteristic of the family GH57.
The alignment of all family GH57 proteins was originally performed using the complete amino acid sequences; however, since the sequences are too variable and, in fact, not alignable on their entire lengths (Zona et al. 2004; Blesak and Janecek 2012), further work and analysis have been based on the alignment of their five CSRs (Table S1). Since the CSRs exhibit sequence features characteristic of the individual enzyme specificities, it was reasonable to group the sequence newly collected in the present study with the already recognized enzyme specificities of the family GH57.
The overall division of all 1602 sequences from the studied set into the individual enzyme specificities and/or protein groups is illustrated by the evolutionary tree (Fig. 1a). The tree contains clusters of family GH57 enzymes, such as α-amylase, 4-α-glucanotransferase, amylopullulanase, bifunctional amylopullulanase–cyclomaltodextrinase, maltogenic amylase, α-galactosidase, non-specified amylase and α-glucan branching enzyme, well established by previous studies (Blesak and Janecek 2012, 2013). With regard to their mutual evolutionary relationships, α-amylase are clustered together with 4-α-glucanotransferase; both being in a closer relatedness with amylopullulanases and their bifunctional counterparts possessing also the cyclomaltodextrinase specificity. Next to them, there is cluster covering closely related maltogenic amylases with α-galactosidases, containing interestingly also a potential newly discovered group α-galactosidase-related enzymes. This group may eventually represent even a new GH57 specificity because the sequences of its members possess a complete family GH57 catalytic machinery. In the remaining part of the evolutionary tree, at the site opposite to α-amylases and 4-α-glucanotransferases, there are clusters of the non-specified amylases and α-glucan branching enzymes.
In addition, there are several groups of hypothetical GH57 proteins, which are closely related to a given enzyme specificity, but they lack one or even both catalytic residues. These are very probably not able to play the role of a typical family GH57 enzyme. Such a special group with incomplete catalytic machinery was first observed for the specificity of α-amylase and was named as the group of α-amylase-like proteins (Janecek and Blesak 2011). This study brings the analogous groups of “like” proteins for a few additional enzyme specificities, i.e. 4-α-glucanotransferase, amylopullulanase-cyclomaltodextrinase and α-glucan branching enzyme (Fig. 1a). It should be pointed out that the groups of the “like” proteins are not in all four cases absolutely homogeneous, i.e. there are some exceptions that contain the complete catalytic machinery, e.g., not only mainly among the α-amylase-like proteins but also, although rather rarely, among the “like” proteins related to 4-α-glucanotransferases and α-glucan branching enzymes. Note that there is only one GH57 sequence (originating from Opitutaceae bacterium; Fig. 1a) that—due to sequence differences even within the five CSRs—has been positioned on a separate branch and thus has not been classified with any of the above-mentioned family GH57 groups.
The evolutionary tree shown in Fig. 1b illustrates the division of the individual sequences with regard to taxonomy, i.e. their either bacterial or archaeal origin. It is evident that overall Bacteria dominates over the Archaea within the family GH57 (Lombard et al. 2014). It is of interest that some enzyme specificities like that of the α-glucan branching enzymes are almost completely of bacterial origin although, on the other hand, e.g. all maltogenic amylases originate only from archaeons (Fig. 1b). Interestingly, all the sequences from the groups of α-glucan branching enzyme-like, 4-α-glucanotransferase-like and amylopullulanase–cyclomaltodextrinase-like proteins are represented by bacterial producers only. The α-galactosidases and α-galactosidase-related enzymes occupying the two adjacent branches of the tree are of archaeal and bacterial origins, respectively (Fig. 1b).
Sequence logos
The importance of the five CSRs defined first by Zona et al. (2004) has already been demonstrated using their sequence logos (Blesak and Janecek 2012, 2013) that can be used as the so-called sequence fingerprints of the individual enzyme specificities. This is due to the fact that all five CSRs (Fig. 2) represent the best conserved segments of the family GH57 members, there are also some unique positions within the logos attributable to a given enzyme specificity only discriminating that specificity from remaining ones (Janecek et al. 2014). This study delivers thus a revisited view of the five family GH57 CSRs because the previous in silico studies (Janecek and Blesak 2011; Blesak and Janecek 2012, 2013) mapped the situation more than 5 years ago (2011–2013) with the total number of compared sequences reaching 554 sequences in comparison with 1602 sequences studied here identifying also some novel enzymatic and/or protein groups (Fig. 1). The approximately 300% increase in the number of sequences has, however, resulted in the fact that some sequence features described in original studies (Blesak and Janecek 2012, 2013) as absolutely unique for some specificities have not necessarily kept their total uniqueness until now.
For sequences of α-amylases, positions 1 (CSR-1) and 12 (CSR-3) occupied mostly by a glutamic acid (or glutamine) and arginine (or glutamic acid) can be considered as important ones, but the most important sequence feature of the logo has been identified at its end in positions 35 and 36 with two tyrosines in the case of α-amylases. With regard to position 12 (beginning of the CSR-3), there is almost exclusively found a tryptophan (or at least an aromatic residue) in all specificities except for the α-amylase and α-amylase-like proteins. The two last positions of the logo represent, on the other hand, the best sequence feature distinguishing the individual family GH57 enzyme specificities from each other (Fig. 2). For example, the non-specified amylases possess there two phenylalanines, maltogenic amylases have phenylalanine succeeded by tryptophan, amylopullulanases have two tryptophans (although not unambiguously), whereas α-glucan branching enzymes possess there just a phenylalanine succeeded by a non-invariantly conserved non-aromatic residue and α-galactosidases contain only the tryptophan preceded by an invariant glycine (Fig. 2). As far as the newly identified group of α-galactosidase-related enzymes is concerned, this seems to have no one of the two positions (35 and 36) occupied by any aromatic residue; only threonine and lysine (histidine) can be found there (Fig. 3).
Despite the slight modifications within some positions proposed previously as exclusive features of the individual enzyme specificities (Janecek and Blesak 2011; Blesak and Janecek 2012, 2013), it is still obvious that, in addition to the end of the logo described above (positions 35 and 36), the position 23 defined by Blesak and Janecek (2012) should still be seriously taken into account. For most of specificities, there is always an almost invariantly conserved but different residue, such as threonine/serine/alanine for α-amylases/non-specified amylases/maltogenic amylases, lysine for 4-α-glucanotransferases, leucine for α-glucan branching enzymes and phenylalanine for α-galactosidases and α-galactosidase-related enzymes (Fig. 2). The position occupied by a cysteine residue proposed previously as unique for α-glucan branching enzymes (Blesak and Janecek 2012) has also lost its total invariance since in a few cases the cysteine is replaced by a methionine, like in the case of the α-glucan branching enzyme from Thermus thermophilus (Palomo et al. 2011).
With regard to the groups of family GH57 proteins named as the “like” proteins, their sequence logos also exhibit their own characteristic features, the most prominent one being the substitution of one or both catalytic residues, as originally defined by Janecek and Blesak (2011). Their detailed in silico analysis will be, however, described elsewhere.
Tertiary structure comparison
Since the three-dimensional structures of the family GH57 enzymes that can be considered as being “amylolytic” ones have already either been experimentally determined (4-α-glucanotransferase, α-glucan-branching enzyme and maltogenic amylase) or modelled (α-amylase, non-specified amylase and amylopullulanase), the structural analysis here has been focused on the group of less deeply studied α-galactosidases and on the new group found on the adjacent branch of the evolutionary tree, the so-called group of α-galactosidase-related enzymes (Fig. 1). The three-dimensional structure models were prepared by the homology modelling for the biochemically characterized α-galactosidase from Pyrococcus furiosus (van Lieshout et al. 2003) and two hypothetical α-galactosidase-related enzymes (differing between each other in their length) from Shewanella baltica (640 residues; a typical length for this group) and Clostridium kluyveri (747 residues).
It should be pointed out, however, that for the α-galactosidase neither the homology modelling using the Phyre-2 server (Kelley and Sternberg 2009) nor the structural comparison using the MultiProt server (Shatsky et al. 2004) has supported the sequence comparison completely with regard to identification of both catalytic residues. While the catalytic nucleophile, i.e. Glu117 in the Pyrococcus furiosus α-galactosidase, has perfectly matched with the corresponding glutamic acid counterpart from all real template structures of α-glucan branching enzymes and 4-α-glucanotransferase (Imamura et al. 2003; Dickmanns et al. 2006; Palomo et al. 2011; Santos et al. 2011; Na et al. 2017), the catalytic proton donor, i.e. Asp248 suggested within the frame of the CSR-4 (Fig. 2), has not overlapped with the required aspartic acid residue from any of structural templates mentioned above. Note that the proton donor was found neither in the only experimental study dealing with α-galactosidases (van Lieshout et al. 2003). It is thus reasonable to suggest that to solve this issue satisfactorily, more experimental work is still necessary, e.g., preparing and characterizing the site-directed mutants of the Asp248 in the α-galactosidase from Pyrococcus furiosus and of other candidate positions, in addition to crystallography trials.
With regard to the two α-galactosidase-related enzymes, their homology models have unambiguously confirmed the predicted catalytic machinery from sequence logos (Fig. 2), i.e. catalytic glutamic acid nucleophile in the CSR-3 and proton donor aspartic acid in the CSR-4. The models have spanned the regions 5-516 of 640 and 1-613 of 747 residues, respectively, in Shewanella baltica and Clostridium kluyveri α-galactosidase-related enzymes. Of the models, 481 and 541 residues were superimposed with the template real structure of the 4-α-glucanotransferase from Thermococcus litoralis (Imamura et al. 2003) in the former and latter enzymes with the zero Å root mean-square deviation value in both cases. The potential catalytic machinery for α-galactosidase-related enzymes from Shewanella baltica and Clostridium kluyveri should thus consist of Glu126 and Asp225 and of Glu125 and Asp223, respectively.
Abbreviations
- CSR:
-
Conserved sequence region
- GH:
-
Glycoside hydrolase
- PDB:
-
Protein Data Bank
References
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215:403–410. https://doi.org/10.1016/S0022-2836(05)80360-2
Angelov A, Liebl S, Ballschmiter M, Bömeke M, Lehmann R, Liesegang H, Daniel R, Liebl W (2010) Genome sequence of the polysaccharide-degrading, thermophilic anaerobe Spirochaeta thermophila DSM 6192. J Bacteriol 192:6492–6493. https://doi.org/10.1128/JB.01023-10
Ballschmiter M, Fütterer O, Liebl W (2006) Identification and characterization of a novel intracellular alkaline α-amylase from the hyperthermophilic bacterium Thermotoga maritima MSB8. Appl Environ Microbiol 72:2206–2211. https://doi.org/10.1128/AEM.72.3.2206-2211.2006
Benson DA, Cavanaugh M, Clark K, Karsch-Mizrachi I, Ostell J, Pruitt KD, Sayers EW (2018) GenBank Nucleic Acids Res 46:D41–D47. https://doi.org/10.1093/nar/gkx1094
Blesak K, Janecek S (2012) Sequence fingerprints of enzyme specificities from the glycoside hydrolase family GH57. Extremophiles 16:497–506. https://doi.org/10.1007/s00792-012-0449-9
Blesak K, Janecek S (2013) Two potentially novel amylolytic enzyme specificities in the prokaryotic glycoside hydrolase α-amylase family GH57. Microbiology 159:2584–2593. https://doi.org/10.1099/mic.0.071084-0
Brumm P, Hermanson S, Hochstein B, Boyum J, Hermersmann N, Gowda K, Mead D (2011) Mining Dictyoglomus turgidum for enzymatically active carbohydrases. Appl Biochem Biotechnol 163:205–214. https://doi.org/10.1007/s12010-010-9029-6
Cantarel BL, Coutinho PM, Rancurel C, Bernard T, Lombard V, Henrissat B (2009) The Carbohydrate-active EnZymes database (CAZy): an expert resource for Glycogenomics. Nucleic Acids Res 37:D233–D238. https://doi.org/10.1093/nar/gkn663
Choi KH, Cha J (2015) Membrane-bound amylopullulanase is essential for starch metabolism of Sulfolobus acidocaldarius DSM639. Extremophiles 19:909–920. https://doi.org/10.1007/s00792-015-0766-x
Comfort DA, Chou CJ, Conners SB, VanFossen AL, Kelly RM (2008) Functional-genomics-based identification and characterization of open reading frames encoding α-glucoside-processing enzymes in the hyperthermophilic archaeon Pyrococcus furiosus. Appl Environ Microbiol 74:1281–1283. https://doi.org/10.1128/AEM.01920-07
Crooks GE, Hon G, Chandonia JM, Brenner SE (2004) WebLogo: a sequence logo generator. Genome Res 14:1188–1190. https://doi.org/10.1101/gr.849004
Dickmanns A, Ballschmiter M, Liebl W, Ficner R (2006) Structure of the novel α-amylase AmyC from Thermotoga maritima. Acta Crystallogr D Biol Crystallogr 62:262–270. https://doi.org/10.1107/S0907444905041363
Dong G, Vieille C, Zeikus JG (1997) Cloning, sequencing, and expression of the gene encoding amylopullulanase from Pyrococcus furiosus and biochemical characterization of the recombinant enzyme. Appl Environ Microbiol 63:3577–3584
Erra-Pujada M, Debeire P, Duchiron F, O’Donohue MJ (1999) The type II pullulanase of Thermococcus hydrothermalis: molecular characterization of the gene and expression of the catalytic domain. J Bacteriol 181:3284–3287
Felsenstein J (1985) Confidence limits on phylogenies: an approach using the bootstrap. Evolution 39:783–791. https://doi.org/10.1111/j.1558-5646.1985.tb00420.x
Ficko-Blean E, Stuart CP, Boraston AB (2011) Structural analysis of CPF_2247, a novel α-amylase from Clostridium perfringens. Proteins 79:2771–2777. https://doi.org/10.1002/prot.23116
Fukusumi S, Kamizono A, Horinouchi S, Beppu T (1988) Cloning and nucleotide sequence of a heat-stable amylase gene from an anaerobic thermophile, Dictyoglomus thermophilum. Eur J Biochem 174:15–21. https://doi.org/10.1111/j.1432-1033.1988.tb14056.x
Guan Q, Guo X, Han T, Wei M, Jin M, Zeng F, Liu L, Li Z, Wan Y, Cheong GW, Zhang S, Jia B (2013) Cloning, purification and biochemical characterisation of an organic solvent-, detergent-, and thermo-stable amylopullulanase from Thermococcus kodakarensis. Process Biochem 48 KOD1:878–884. https://doi.org/10.1016/j.procbio.2013.04.007
Henrissat B (1991) A classification of glycosyl hydrolases based on amino acid sequence similarities. Biochem J 280:309–316. https://doi.org/10.1042/bj2800309
Henrissat B, Bairoch A (1996) Updating the sequence-based classification of glycosyl hydrolases. Biochem J 316:695–696. https://doi.org/10.1042/bj3160695
Imamura H, Fushinobu S, Jeon BS, Wakagi T, Matsuzawa H (2001) Identification of the catalytic residue of Thermococcus litoralis 4-α-glucanotransferase through mechanism-based labeling. Biochemistry 40:12400–12406. https://doi.org/10.1021/bi011017c
Imamura H, Fushinobu S, Yamamoto M, Kumasaka T, Jeon BS, Wakagi T, Matsuzawa H (2003) Crystal structures of 4-α-glucanotransferase from Thermococcus litoralis and its complex with an inhibitor. J Biol Chem 278:19378–19386. https://doi.org/10.1074/jbc.M213134200
Imamura H, Jeon BS, Wakagi T (2004) Molecular evolution of the ATPase subunit of three archaeal sugar ABC transporters. Biochem Biophys Res Commun 319:230–234. https://doi.org/10.1016/j.bbrc.2004.04.174
Janecek S (1997) α-Amylase family: molecular biology and evolution. Prog Biophys Mol Biol 67:67–97. https://doi.org/10.1016/S0079-6107(97)00015-1
Janecek S (2002) How many conserved sequence regions are there in the α-amylase family? Biologia 57(Suppl. 11):29–41
Janecek S, Blesak K (2011) Sequence-structural features and evolutionary relationships of family GH57 α-amylases and their putative α-amylase-like homologues. Protein J 30:429–435. https://doi.org/10.1007/s10930-011-9348-7
Janecek S, Gabrisko M (2016) Remarkable evolutionary relatedness among the enzymes and proteins from the α-amylase family. Cell Mol Life Sci 73:2707–2725. https://doi.org/10.1007/s00018-016-2246-6
Janecek S, Kuchtova A (2012) In silico identification of catalytic residues and domain fold of the family GH119 sharing the catalytic machinery with the α-amylase family GH57. FEBS Lett 586:3360–3366. https://doi.org/10.1016/j.febslet.2012.07.020
Janecek S, Svensson B, MacGregor EA (2014) α-Amylase—an enzyme specificity found in various families of glycoside hydrolases. Cell Mol Life Sci 71:1149–1170. https://doi.org/10.1007/s00018-013-1388-z
Jeon BS, Taguchi H, Sakai H, Ohshima T, Wakagi T, Matsuzawa H (1997) 4-α-Glucanotransferase from the hyperthermophilic archaeon Thermococcus litoralis—enzyme purification and characterization, and gene cloning, sequencing and expression in Escherichia coli. Eur J Biochem 248:171–178. https://doi.org/10.1111/j.1432-1033.1997.00171.x
Jeon EJ, Jung JH, Seo DH, Jung DH, Holden JF, Park CS (2014) Bioinformatic and biochemical analysis of a novel maltose-forming α-amylase of the GH57 family in the hyperthermophilic archaeon Thermococcus sp. CL1. Enzyme Microb Technol 60:9–15. https://doi.org/10.1016/j.enzmictec.2014.03.009
Jespersen HM, MacGregor EA, Sierks MR, Svensson B (1991) Comparison of the domain-level organization of starch hydrolases and related enzymes. Biochem J 280:51–55. https://doi.org/10.1042/bj2800051
Jespersen HM, MacGregor EA, Henrissat B, Sierks MR, Svensson B (1993) Starch- and glycogen-debranching and branching enzymes: prediction of structural features of the catalytic (β/α)8-barrel domain and evolutionary relationship to other amylolytic enzymes. J Protein Chem 12:791–805. https://doi.org/10.1007/BF01024938
Jiao YL, Wang SJ, Lv MS, Xu JL, Fang YW, Liu S (2011) A GH57 family amylopullulanase from deep-sea Thermococcus siculi: expression of the gene and characterization of the recombinant enzyme. Curr Microbiol 62:222–228. https://doi.org/10.1007/s00284-010-9690-6
Jung JH, Seo DH, Holden JF, Park CS (2014) Maltose-forming α-amylase from the hyperthermophilic archaeon Pyrococcus sp. ST04. Appl Microbiol Biotechnol 98:2121–2131. https://doi.org/10.1007/s00253-013-5068-6
Kelley LA, Sternberg MJE (2009) Protein structure prediction on the Web: a case study using the Phyre server. Nat Protoc 4:363–371. https://doi.org/10.1038/nprot.2009.2
Kim JW, Flowers LO, Whiteley M, Peeples TL (2001) Biochemical confirmation and characterization of the family-57-like α-amylase of Methanococcus jannaschii. Folia Microbiol 46:467–473. https://doi.org/10.1007/BF02817988
Kuriki T, Imanaka T (1999) The concept of the α-amylase family: structural similarity and common catalytic mechanism. J Biosci Bioeng 87:557–565. https://doi.org/10.1016/S1389-1723(99)80114-5
Labes A, Schonheit P (2007) Unusual starch degradation pathway via cyclodextrins in the hyperthermophilic sulfate-reducing archaeon Archaeoglobus fulgidus strain 7324. J Bacteriol 189:8901–8913. https://doi.org/10.1128/JB.01136-07
Laderman KA, Asada K, Uemori T, Mukai H, Taguchi Y, Kato I, Anfinsen CB (1993a) α-Amylase from the hyperthermophilic archaebacterium Pyrococcus furiosus. Cloning and sequencing of the gene and expression in Escherichia coli. J Biol Chem 268:24402–24407
Laderman KA, Davis BR, Krutzsch HC, Lewis MS, Griko YV, Privalov PL, Anfinsen CB (1993b) The purification and characterization of an extremely thermostable α-amylase from the hyperthermophilic archaebacterium Pyrococcus furiosus. J Biol Chem 268:24394–24401
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, Thompson JD, Gibson TJ, Higgins DG (2007) ClustalW and Clustal X version 2.0. Bioinformatics 23:2947–2948. https://doi.org/10.1093/bioinformatics/btm404
Letunic I, Bork P (2011) Interactive Tree Of Life v2: online annotation and display of phylogenetic trees made easy. Nucleic Acids Res 39:W475–W478. https://doi.org/10.1093/nar/gkr201
Li X, Li D (2015) Preparation of linear maltodextrins using a hyperthermophilic amylopullulanase with cyclodextrin- and starch-hydrolysing activities. Carbohydr Polym 119:134–141. https://doi.org/10.1016/j.carbpol.2014.11.044
Li X, Li D, Park KH (2013) An extremely thermostable amylopullulanase from Staphylothermus marinus displays both pullulan- and cyclodextrin-degrading activities. Appl Microbiol Biotechnol 97:5359–5369. https://doi.org/10.1007/s00253-012-4397-1
Li X, Zhao J, Fu J, Pan Y, Li D (2018) Sequence analysis and biochemical properties of an acidophilic and hyperthermophilic amylopullulanase from Thermofilum pendens. Int J Biol Macromol 114:235–243. https://doi.org/10.1016/j.ijbiomac.2018.03.073
Lombard V, Golaconda Ramulu H, Drula E, Coutinho PM, Henrissat B (2014) The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res 42:D490–D495. https://doi.org/10.1093/nar/gkt1178
MacGregor EA, Svensson B (1989) A super-secondary structure predicted to be common to several α-1,4-d-glucan-cleaving enzymes. Biochem J 259:145–152. https://doi.org/10.1042/bj2590145
MacGregor EA, Janecek S, Svensson B (2001) Relationship of sequence and structure to specificity in the α-amylase family of enzymes. Biochim Biophys Acta 1546:1–20. https://doi.org/10.1016/S0167-4838(00)00302-2
Murakami T, Kanai T, Takata H, Kuriki T, Imanaka T (2006) A novel branching enzyme of the GH-57 family in the hyperthermophilic archaeon Thermococcus kodakaraensis KOD1. J Bacteriol 188:5915–5924. https://doi.org/10.1128/JB.00390-06
Na S, Park M, Jo I, Cha J, Ha NC (2017) Structural basis for the transglycosylase activity of a GH57-type glycogen branching enzyme from Pyrococcus horikoshii. Biochem Biophys Res Commun 484:850–856. https://doi.org/10.1016/j.bbrc.2017.02.002
Nakajima M, Imamura H, Shoun H, Horinouchi S, Wakagi T (2004) Transglycosylation activity of Dictyoglomus thermophilum amylase A. Biosci Biotechnol Biochem 68:2369–2373. https://doi.org/10.1271/bbb.68.2369
Oslancova A, Janecek S (2002) Oligo-1,6-glucosidase and neopullulanase enzyme subfamilies from the α-amylase family defined by the fifth conserved sequence region. Cell Mol Life Sci 59:1945–1959. https://doi.org/10.1007/PL00012517
Palomo M, Pijning T, Booiman T, Dobruchowska JM, van der Vlist J, Kralj S, Planas A, Loos K, Kamerling JP, Dijkstra BW, van der Maarel MJ, Dijkhuizen L, Leemhuis H (2011) Thermus thermophilus glycoside hydrolase family 57 branching enzyme: crystal structure, mechanism of action, and products formed. J Biol Chem 286:3520–3530. https://doi.org/10.1074/jbc.M110.179515
Park KH, Jung JH, Park SG, Lee ME, Holden JF, Park CS, Woo EJ (2014) Structural features underlying the selective cleavage of a novel exo-type maltose-forming amylase from Pyrococcus sp. ST04. Acta Crystallogr D Biol Crystallogr 70:1659–1668. https://doi.org/10.1107/S1399004714006567
Park YU, Jung JH, Seo DH, Jung DH, Kim JH, Seo EJ, Baek NI, Park CS (2018) GH57 amylopullulanase from Desulfurococcus amylolyticus JCM 9188 can make highly branched cyclodextrin via its transglycosylation activity. Enzyme Microb Technol 114:15–21. https://doi.org/10.1016/j.enzmictec.2018.03.005
Paul CJ, Leemhuis H, Dobruchowska JM, Grey C, Önnby L, van Leeuwen SS, Dijkhuizen L, Karlsson EN (2015) A GH57 4-α-glucanotransferase of hyperthermophilic origin with potential for alkyl glycoside production. Appl Microbiol Biotechnol 99:7101–7113. https://doi.org/10.1007/s00253-015-6435-2
Rose PW, Prlic A, Bi C, Bluhm WF, Christie CH, Dutta S, Green RK, Goodsell DS, Westbrook JD, Woo J, Young J, Zardecki C, Berman HM, Bourne PE, Burley SK (2015) The RCSB Protein Data Bank: views of structural biology for basic and applied research and education. Nucleic Acids Res 43:D345–D356. https://doi.org/10.1093/nar/gku1214
Rye CS, Withers SG (2000) Glycosidase mechanisms. Curr Opin Chem Biol 4:573–580. https://doi.org/10.1016/S1367-5931(00)00135-6
Saitou N, Nei M (1987) The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol 4:406–425. https://doi.org/10.1093/oxfordjournals.molbev.a040454
Santos CR, Tonoli CC, Trindade DM, Betzel C, Takata H, Kuriki T, Kanai T, Imanaka T, Arni RK, Murakami MT (2011) Structural basis for branching-enzyme activity of glycoside hydrolase family 57: structure and stability studies of a novel branching enzyme from the hyperthermophilic archaeon Thermococcus kodakaraensis KOD1. Proteins 79:547–557. https://doi.org/10.1002/prot.22902
Shatsky M, Nussinov R, Wolfson HJ (2004) A method for simultaneous alignment of multiple protein structures. Proteins 56:143–156. https://doi.org/10.1002/prot.22902
Stam MR, Danchin EG, Rancurel C, Coutinho PM, Henrissat B (2006) Dividing the large glycoside hydrolase family 13 into subfamilies: towards improved functional annotations of α-amylase-related proteins. Protein Eng Des Sel 19:555–562. https://doi.org/10.1093/protein/gzl044
Svensson B (1988) Regional distant sequence homology between amylases, α-glucosidases and transglucanosylases. FEBS Lett 230:72–76. https://doi.org/10.1016/0014-5793(88)80644-6
Tachibana Y, Fujiwara S, Takagi M, Imanaka T (1997) Cloning and expression of the 4-α-glucanotransferase gene from the hyperthermophilic archaeon Pyrococcus sp. KOD1, and characterization of the enzyme. J Ferment Bioeng 83:540–548. https://doi.org/10.1016/S0922-338X(97)81134-8
Takata H, Kuriki T, Okada S, Takesada Y, Iizuka M, Minamiura N, Imanaka T (1992) Action of neopullulanase. Neopullulanase catalyzes both hydrolysis and transglycosylation at α-(1,4)- and α-(1,6)-glucosidic linkages. J Biol Chem 267:18447–18452
The UniProt Consortium (2017) UniProt: the universal protein knowledgebase. Nucleic Acids Res 45:D158–D169. https://doi.org/10.1093/nar/gkw1099
van Lieshout JFT, Verhees CH, van der Oost J, de Vos WM, Ettema TJG, van der Sar S, Imamura H, Matsuzawa H (2003) Identification and molecular characterization of a novel type of α-galactosidase from Pyrococcus furiosus. Biocatal Biotransform 21:243–252. https://doi.org/10.1080/10242420310001614342
van der Maarel MJ, van der Veen B, Uitdehaag JC, Leemhuis H, Dijkhuizen L (2002) Properties and applications of starch-converting enzymes of the α-amylase family. J Biotechnol 94:137–155. https://doi.org/10.1016/S0168-1656(01)00407-2
Wang H, Gong Y, Xie W, Xiao W, Wang J, Zheng Y, Hu J, Liu Z (2011) Identification and characterization of a novel thermostable gh-57 gene from metagenomic fosmid library of the Juan de Fuca Ridge hydrothemal vent. Appl Biochem Biotechnol 164:1323–1338. https://doi.org/10.1007/s12010-011-9215-1
Watanabe H, Nishimoto T, Kubota M, Chaen H, Fukuda S (2006) Cloning, sequencing, and expression of the genes encoding an isocyclomaltooligosaccharide glucanotransferase and an α-amylase from a Bacillus circulans strain. Biosci Biotechnol Biochem 70:2690–2702. https://doi.org/10.1271/bbb.60294
Zona R, Chang-Pi-Hin F, O’Donohue MJ, Janecek S (2004) Bioinformatics of the glycoside hydrolase family 57 and identification of catalytic residues in amylopullulanase from Thermococcus hydrothermalis. Eur J Biochem 271:2863–2872. https://doi.org/10.1111/j.1432-1033.2004.04144.x
Acknowledgements
This work was financially supported by Grant no. 2/0146/17 from the Slovak Grant Agency VEGA.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
Both authors declare that they have no conflict of interests.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Martinovičová, M., Janeček, Š. In silico analysis of the α-amylase family GH57: eventual subfamilies reflecting enzyme specificities. 3 Biotech 8, 307 (2018). https://doi.org/10.1007/s13205-018-1325-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13205-018-1325-9