Introduction

Restriction–modification (RM) systems are largely accepted to be the cellular defense machinery of prokaryotes against invading bacteriophages, plasmids, or other foreign DNAs (Raleigh and Brooks 1998; Vasu and Nagaraja 2013). The RM system consists of a restriction endonuclease (REase, R) that recognizes and cleaves specific DNA sequences, and a cognate mehtyltransferase (MTase, M) which protects the self DNA from the REase by selectively methylating the adenine and/or cytosine bases of the same recognition sequence (Bickle and Kruger 1993). In addition to the role of RM systems in protecting cell against foreign DNA, there is emerging information on these as selfish genes to their involvement in stabilizing genomes, controlling speciation etc (Vasu and Nagaraja 2013). RM systems are observed to be widely distributed in eubacteria and archaebacteria (Oliveira et al. 2014). Even certain phages are known to carry the RM systems (Chaturvedi and Chakravorty 2003; Dempsey et al. 2005; Hadi et al. 1983; Rao et al. 2014; Xia et al. 1986; Xia and Etten 1986). The function of the phage-encoded RM enzymes are proposed to provide selective survival advantage by preventing infection of the host by other viruses (Agarkova et al. 2006; Joshi et al. 1982; Dempsey et al. 2005) and in degradation of the host DNA (Szekeres et al. 1983). REBASE (Roberts et al. 2015), a comprehensive database of proteins of RM systems, reports about 4290 different restriction enzymes genetically or biochemically characterized until date, and more than 48,175 putative REases are predicted from genome sequences (http://rebase.neb.com/rebase/statlist.html).

The REases are classified primarily into four categories, Type I–IV, based on the DNA recognition sequence, cleavage position, cofactor requirements, and the protein subunit composition (Loenen et al. 2014). Of the four types of REases, Type II enzymes are considered as the “work horses” of modern molecular biology (Roberts 2005; Loenen et al. 2014). Type II REases exhibit high sequence specificity with respect to both recognition and cleavage, and usually require Mg+2 ions as the only cofactor (Pingoud et al. 2014). Type II enzymes are further grouped into 11 subtypes, with the most commonly used enzymes that recognize Palindromic sequences belonging to subtype IIP. About 43% of the putative restriction enzymes are classified as Type II REases and greater than 96% of the characterized REases belong to the Type II category with 412 distinct specificities (http://rebase.neb.com/rebase/statlist.html).

Most restriction enzymes have been isolated from mesophilic bacteria (Roberts et al. 2015), but thermostable REases with both novel and known specificities (isochizomers) are desirable from enzyme properties and protein processing perspectives (Sharma et al. 2013). Thermophilic bacteria found in a broad range of geothermally active habitats, such as geysers, hot springs, and deep sea hydrothermal vents, are source of thermostable enzymes. In this paper, we describe isolation and purification of a moderately thermophilic REase, TfoI, from a thermophile (Tepidimonas fonticaldi PL17) found in the water of the Polok hot spring in Northeast India. Sequence analysis and biochemical characterization of purified TfoI reveal it to be a Type IIP REase, and as a new isoschizomer of MseI enzyme.

Materials and methods

Strain and growth conditions

T. fonticaldi PL17 used in this study was isolated from one of the two hot springs in Polok (Latitude: 27°21.00°, Longitude: 88°19.353°) in Sikkim, Northeast India (Thakur et al. 2013). The temperature and pH of water in this hot spring was measured to be at 58 °C and 8.0, respectively. Bacteria were typically cultured in R2A broth (Reasoner and Geldreich 1985) (HiMedia) at 55 °C, which was determined to be optimal based on the temperature dependence of growth of the isolate. To determine the optimum time of growth for maximum production of the TfoI, 100 ml of R2A broth inoculated with 2% overnight seed culture was grown at 55 °C for 20 h. Cells were harvested from 1.5 ml of the culture broth at different hours of growth and stored at −80 °C until further use.

16S rDNA amplification, sequencing, and phylogenetic analysis

The 16S rDNA gene sequence was PCR amplified using the genomic DNA of the isolate (ZR Fungal/Bacterial DNA MiniPrep kit) as a template and E. coli 16S rDNA specific primers, 27F and 1492R (Weisburg et al. 1991). The PCR product was directly subjected to ExoSap (Affymetrix, China) treatment before sequencing. DNA sequencing was performed using primers, 27F, 536R, and 1492R; the numbers correspond to the E. coli 16S rDNA positions (Weisburg et al. 1991; Wang and Qian 2009). The individual sequences were aligned and the complete 16S rDNA sequence was assembled using SeqManPro in DNAStar (v7.0). The ez-taxon module in EzBioCloud (Kim et al. 2012) was used to determine the phylogenetic neighbors and the phylogenetic tree was constructed using the Neighbor-Joining method (Saitou and Nei 1987) with 1000 bootstrap replications to assess the nodal support in the tree. The dendrogram was created from the Molecular Evolutionary Genetics Analysis software (MEGA, version 6).

Restriction endonuclease (REase) activity assay in cell lysate

Frozen cells from 1 to 1.5 ml of the isolate grown in R2A broth were thawed, and re-suspended in 250 µl of 50 mM Tris (pH 7.4). Re-suspended cells were lysed by sonication (30% amplitude; 10 s on-15 s off per cycle; total time of 2 min) and the supernatant (soluble cell lysate) was separated by centrifugation at about 18,000×g for 15 min. The supernatant (2 µl) was used to digest about 200 ng of plasmid DNA (pBR322, Fermentas; pUC18, Merck) and phage DNA (Lambda DNA, Takara or NEB; M13mp18, Merck and φX174, Merck) in a reaction volume of 10 µl for 1 h at 55 °C and the digestion patterns were obtained on a 1% agarose gel. The fragmentation pattern obtained for various DNA substrates cleaved with the enzyme present in the cell lysate were compared with that obtained for the same substrates digested with the purified TfoI (data not shown).

In the experiment to determine the TfoI production as a function of the cell growth, the total protein content in the cell lysate at different timepoints of the growth curve was estimated using the BCA protein estimation kit (Thermo Scientific). The pBR322 plasmid DNA (100 ng) was then digested with around 1–4 µl of cell lysate containing 500 ng of total protein in a reaction volume of 15 µl for 10 min at 55 °C in buffer B (Thermo Scientific). The digestion products were visualized on a 1% agarose gel.

Purification of TfoI

Cell pellet from two liters growth of T. fonticaldi PL17 (∼3 g of cell wet-weight per 2 L of culture) at 55 °C for 12 h at 200 rpm in R2A broth (HiMedia) was obtained after centrifugation of culture at 2430×g for 20 min at 4 °C. The cell pellet was re-suspended in 40 ml of 50 mM Tris, pH 7.4 (USB) and stored at −80 °C until further use. The frozen cells were thawed, sonicated (amplitude 30%, 10 s on-15 s off per cycle) for a total time of 30 min. The cell lysate was clarified by centrifugation at 18,500×g for one hour at 4 °C.

Four different chromatography matrices, Q-Sepharose fast flow (GE Healthcare), SP-Sepharose fast flow (GE Healthcare), Affi-Gel Blue gel (BioRad), and phenyl-Sepharose fast flow (GE Healthcare), were used in sequence for purification of the active protein. The buffer used for purification with all columns was 50 mM Tris, pH 7.4 and protein elution was carried out in a step gradient mode with 50 mM Tris (pH 7.4) containing NaCl at varying concentrations. In brief, the supernatant after cell lysis was loaded on the Q-Sepharose resin (30 ml); active protein primarily present as unbound and in low salt (0.1 M NaCl) wash fractions was pooled and loaded on the SP-Sepharose resin (20 ml). Active TfoI maximally eluted in buffer containing 0.2 M NaCl and its NaCl concentration was adjusted to a final concentration of 0.4 M before loading on to the Affi-Gel Blue gel matrix (12 ml). Active protein which eluted between 0.5 M and 1.0 M NaCl containing buffer was pooled, and the NaCl concentration was increased to a final concentration of 2 M before applying it on the phenyl-Sepharose fast flow resin (2 ml). The final active protein was eluted in water from this column and further transferred into 50 mM Tris (pH 7.4) using 1 M Tris stock solution. Fractions containing active TfoI in each step of the purification protocol were determined by REase activity assay using pBR322 as substrate. The total protein content in active fractions for each of the purification steps was estimated using the BCA protein estimation kit (Thermo Scientific), and the pooled fractions with active TfoI were analyzed on a silver-stained 15% SDS–PAGE.

Unit activity determination

To determine the total TfoI enzyme units at different stages in the purification process, one µg of Lambda DNA in buffer B (Thermo Scientific) was digested with different volumes of pooled active fractions in a reaction volume of 50 µl and the reaction mixture was incubated at 55 °C for 1 h. The digested products were analyzed on a 1% agarose gel. One unit of TfoI activity was estimated to be present in the minimum volume of the active fraction that resulted in complete digestion of one µl of Lambda DNA under the above mentioned reaction condition.

Assay for ATP-dependent nucleases

Lambda DNA (200 ng) was digested with two units of TfoI for 1 h at 55 °C in buffer B (Thermo Scientific) in the presence and absence of 1 mM ATP (Thermo Scientific) from different stages of purification. The reactions were analyzed on a 1% agarose gel.

TfoI recognition and restriction site determination

REase activity assay was performed using two units of purified TfoI with 500 ng of plasmid DNA (pBR322, Fermentas; pUC18, Merck) and 300 ng of phage DNA substrates (EcoKI methylated-Lambda DNA, Takara; M13mp18, Merck and φX174, Merck) as described earlier. The fragmentation pattern from the digestion of these DNA substrates was input in the REBpredictor (http://tools.neb.com/maint.php/REBpredictor/) to predict the likely recognition sequence. The largest DNA fragment (of about 1.6 kb) from digesting pBR322 DNA with TfoI was purified from gel, and sequenced using a primer designed as an internal primer, 1500F, based on the prediction by REBpredictor. A 1045 bp pBR322 fragment (base position 871–1915) containing a single predicted recognition site by REBpredictor was PCR amplified using primers 871F and 1915R. About 500 ng of the PCR product was digested with ten units of purified TfoI at 55 °C for 1 h in buffer B (Thermo Scientific). The two fragments thus obtained were gel-purified and sequenced using the 1500F and 1915R primers, respectively. The number in all primer names is with respect to the start base position in the pBR322 plasmid that these are derived from; and forward and reverse orientations of the primers are referred to as ‘F’ and ‘R’, respectively.

TfoI cleavage site was confirmed by testing for TfoI end compatibility with the NdeI restriction site using a directional cloning approach. The pET28a(+) plasmid was double digested with NdeI and XhoI restriction enzymes to generate the vector for cloning. A fragment of pET29b(+) plasmid encompassing its multiple cloning site was PCR amplified using primers corresponding to T7 promoter and T7 terminator sequences, and was subjected to TfoI-XhoI restriction digestion to generate the insert for cloning. The vector and insert fragments purified from gel were ligated using T4 DNA ligase (NEB), and transformed into E. coli DH5α competent cells. Plasmid was isolated from the transformant cells, and DNA sequenced to determine the ligation sites. The NdeI-XhoI and MseI-XhoI digests of pET29b(+) PCR fragment were used as control inserts in this experiment.

Cut-ligate-cut assay

One µg of Lambda DNA was digested with ten units of purified TfoI in a reaction mixture of 50 µl for 1 h followed by heat inactivation of TfoI at 80 °C for 20 min. About 200 ng DNA (cut sample) was removed from this reaction mixture and stored for later analysis. The remaining reaction mixture was subjected to overnight DNA ligation at 16 °C by the addition of ligation buffer and T4 DNA ligase (NEB). The T4 DNA ligase was inactivated at 65 °C for 20 min and 200 ng DNA (ligated sample) was removed for analysis. Ten units of purified TfoI was added to the remaining ligated reaction mixture and incubated for 1 h at 55 °C (re-cut sample). About 200 ng of re-cut DNA from the final reaction mixture was removed and all three (cut, ligated and re-cut samples) were analyzed on a 1% agarose gel.

Overdigestion assay

Increasing amount of purified TfoI (10, 20 and 30 units) was used to digest one µg of Lambda DNA in buffer B (Thermo Scientific) in a reaction volume of 50 µl for 16 h. About 200 ng DNA from the reaction mixtures was loaded on a 1% agarose gel for analysis.

Optimal temperature and buffer conditions for TfoI activity

The temperature dependence of TfoI activity was tested by incubating one µg of Lambda DNA and pBR322 plasmid DNA with one unit of the enzyme at different temperatures (25–75 °C) for 1 h. TfoI activity in buffers supplied with commercial REases from NEB (NEBuffers 1, 2, 3 and CutSmart) and Thermo Scientific (TANGO, B, G, O, R, and FastDigest) was assessed by digesting one µg of Lambda DNA with one unit of the purified enzyme for 1 h at 55 °C. About 200 ng DNA from the reaction mixture in both sets of the reaction were analyzed on a 1% agarose gel.

Temperature effect on TfoI activity

Two units of TfoI in buffer B (Thermo Scientific, without DNA substrate) was incubated at 55, 65, and 80 °C for 30 min. The reaction was brought to room temperature and 100 ng of pBR322 DNA was added as substrate followed by incubation at 55 °C for 30 min and the digestion products were analyzed on a 1% agarose gel. As controls, 100 ng of pBR322 DNA was digested with two units of TfoI at 55, 65, and 80 °C for 30 min. Heat inactivation of TfoI at 80 °C was also assessed by performing restriction digestion of a DNA substrate (plasmid DNA or genomic DNA) at 55 °C followed by heating the enzyme at 80 °C for 20 min and then assaying for residual activity of heat-inactivated TfoI in digesting a second DNA substrate (Fig. A9).

Dependence of the restriction enzyme activity on divalent metal ions and NaCl

About 100 ng of pBR322 DNA was digested with one unit of TfoI for 30 min at 55 °C in buffer B (Thermo Scientific) containing MgCl2 (Fisher Scientific), MnCl2 (Fisher Scientific), CoCl2 (Sigma) and CaCl2 (Sigma) at a final concentration of 10 mM. About 100 ng of pBR322 DNA was digested with one unit of TfoI for 30 min at 55 °C in buffer B containing increasing concentrations of Mg2+ (0–200 mM, supplied as MgCl2 salt). The effect of NaCl was measured by incubating 100 ng pBR322 DNA with one unit of TfoI for 30 min at 55 °C in buffer B in the presence of 0–500 mM NaCl. All digests were analyzed on a 1% agarose gel.

Effect of disulfide-reducing agents and organic solvents on enzyme activity

One unit of TfoI was incubated in buffer B (Thermo Scientific) containing 0–5 mM dithiothrietol (DTT, Sigma) and 0–5 mM beta-mercaptoethanol (β-ME, Sigma) for 12 h at 4 °C. The plasmid pBR322 (100 ng) was added as substrate to the mixture followed by incubation at 55 °C for 30 min. About 100 ng of pBR322 DNA was digested with one unit of TfoI in buffer B (Thermo Scientific) in the presence of 0 to 15% (v/v) of glycerol (Sigma) and 0 to 10% (v/v) dimethyl sulfoxide (DMSO, Bio Basic Inc.) for 30 min at 55 °C. All reactions were analyzed on a 1% agarose gel.

Identification of TfoI RM gene cluster

The T. fonticaldi PL17 whole genome shotgun sequencing was performed on MiSeq Illumina sequencing platform using MiSeq reagent kit, version 2 (Genotypic Technology Pvt. Ltd.). The de novo genome was assembled using CLC Genomics Workbench 7.5.1 to obtain minimum contig length of 500 bp and the 67 contigs obtained are submitted to the NCBI database (accession number: LZDH00000000). The protein coding sequences were annotated by NCBI Prokaryotic Genome Annotation Pipeline (PGAP) version 3.3 (Tatusova et al. 2016), Rapid Annotation using Subsystems Technology (RAST) version 2.0 (Aziz et al. 2008), and GLIMMER-3 (Delcher et al. 1999). The annotated genes belonging to restriction–modification systems were analyzed. The absence of a complete Type II RM system in the annotation led us to further analyze the only two MTase subunit genes one each as part of the Type II (YeeA type) and Type III RM systems that were annotated by RAST and GLIMMER. BLAST analysis of these two-gene products revealed that the protein corresponding to the MTase subunit annotated as Type III MTase shares 53.9% sequence identity with MseI MTase. The DNA sequence encompassing this MTase gene was analyzed next for the presence of an open reading frame (ORF) in its vicinity. An un-annotated ORF was found downstream of the MTase subunit of the Type III MTase and was subsequently used as a query sequence to search the Genbank sequences using protein BLAST. Gene identity search based on the sequence alignment of this RAST-annotated Type III MTase along with its immediate downstream un-annotated ORF using BLAST indicates this two-gene cluster to be a –TTAA– recognizing RM system (described in “Results”).

Determination of methylation status of genomic DNA of T. fonticaldi PL17

The genomic DNA (50 ng) of T. fonticaldi PL17 was digested with ten units of purified TfoI, Tru1I (Thermo Scientific, isoschizomer), and HaeIII (NEB, 5′GG^CC3′) for 1 h. The reaction conditions were used as appropriate for each enzyme. The genomic DNA of Pseudomonas aeruginosa MTCC1934 and E. coli BL21(DE3) was also subjected to digestion with all three REases and served as controls.

Results

One of the Gram-negative, moderately thermophilic bacterium isolated in our study of microbial biodiversity in water of a hot spring in Polok, Sikkim, Northeast India screened positive for REase activity (Fig. A1). Comparison of the 16S rRNA gene sequence of this isolate clustered it with the Tepidimonas genus (Fig. 1). Based on its 99.86% sequence identity with Tepidimonas fonticaldi AT-A2T, the isolate is assigned to be Tepidimonas fonticaldi PL17. The REase produced by T. fonticaldi PL17 was named TfoI according to REase nomenclature rules (Roberts et al. 2003).

Fig. 1
figure 1

Phylogenetic positioning of T. fonticaldi PL17. The phylogenetic tree of T. fonticaldi PL17 was derived from the 16S rDNA gene sequence comparison of 20 closely related bacterial species with sequence identity >94% using the Neighbor-Joining method. The numbers indicated at the branch nodes are percentage of 1000 bootstrap replications and only values >50% are shown here. E. coli ATCC11775 was used as an outgroup. The bar represents 0.02 substitutions per nucleotide position

TfoI Purification

REase activity assay in cell lysates performed as a function of growth of T. fonticaldi PL17 indicates that cells produce the enzyme from early log phase until the stationary phase of growth (Fig. A1). The enzyme produced by the isolate after 12 h of growth was purified to homogeneity using a four-step purification procedure (Fig. 2a). The final yield of the purified protein was about 2500 units from a liter of culture which was a recovery of around 15% of the total enzyme produced with specific activity of at least 150,000 U/mg of the protein. REase activity assayed in the presence of ATP at different stages of the purification indicates that ATP-dependent nuclease contamination is successfully removed after the second column of SP-Sepharose (SP) fast flow resin. The final protein eluted from the phenyl-Sepharose (PS) matrix is essentially free of any ATP-dependent nuclease as indicated by the identical digestion pattern observed in the activity assay performed with and without ATP (Fig. A2). The monomeric enzyme has a molecular weight of about 20 kDa (marked by an arrow in Fig. 2a), which was also confirmed by the mass spectrometric data (Fig. A11).

Fig. 2
figure 2

TfoI enzyme purification. a 15% silver-stained SDS–PAGE with pooled fractions containing active TfoI sample from each step of the protein purification is shown here. The arrow marks the position of TfoI. The lane labels indicate whole cell lysate as crude; clarified cell lysate after sonication as lysate; and pooled active fractions from Q-Sepharose, SP-Sepharose, Affi-Gel Blue gel, and phenyl-Sepharose columns as QS, SP, AG, and PS, respectively. TfoI enzyme activity in the pooled active fractions from each purification step using Lambda DNA as a substrate is shown in (b)

The absence of any other DNases in the purified TfoI was further confirmed by performing the overdigestion assay. Unaltered band pattern observed for the Lambda DNA digested with 30-fold excess of enzyme compared with that of cleavage with one unit of the enzyme indicates both absence of non-specific nuclease contamination as well as absence of any star activity of TfoI under the test conditions (Fig. 3a). The ligation of fragments generated by TfoI digestion of Lambda DNA and an identical digestion profile observed for the substrate DNA (Lambda DNA) in the ‘cut’ and ‘re-cut’ samples in the cut-ligate-cut assay, indicates that the 5′- and the 3′-termini of the digested products are intact, i.e., the final purified enzyme is free of any nucleases and phosphatases (Fig. 3b, A3).

Fig. 3
figure 3

Quality control assays for TfoI purity. a Overdigestion of a microg of the Lambda DNA with upto 30 units of pure TfoI for 16 h, and b cut-ligate-re-cut assay of the Lambda DNA with the purified enzyme was performed to assess the purity of purified TfoI. Digestion of Lambda DNA with one unit and 16 units of the enzyme for 1 h served as control in the overdigestion assay in (a)

Determination of TfoI recognition and cleavage site

Analysis of digestion patterns obtained on TfoI digestion of different phage DNA and plasmid DNA using the REBpredictor program suggested that TfoI recognizes 5′–TTAA–3′ as a target sequence (Fig. 4, A4). The largest sized band of about 1.6 kb obtained from digesting pBR322 was subjected to DNA sequencing to determine the sequence at the cleavage site. The –TTAA– DNA sequence is a recognition sequence of the known REases, such as MseI and its isoschizomers. We, therefore, also sequenced the same fragment of pBR322 plasmid digested with MseI. The sequence at the fragment ends for both TfoI (Fig. A5) and MseI (data not shown) was identical, indicating that the recognition site of TfoI is the same as that of MseI, i.e., TfoI is an isoschizomer of MseI. The 4-base –TTAA– sequence as the recognition site was reconfirmed by digestion of a PCR fragment containing a single –TTAA–site generated from the pBR322 plasmid DNA, and sequencing of both fragments (Fig. 5). However, the DNA sequencing data posed an ambiguity regarding the cleavage site due to the presence of a false peak of adenine base at the 3′-end in the chromatogram (marked by ‘a’ in Figs. A5, 5), likely due to the non-template dependent terminal nucleotide addition activity of AmpliTaq polymerase used for DNA sequencing (Hu 1993; Samuelson et al. 2004).

Fig. 4
figure 4

TfoI digests of various DNA substrates. Digestion pattern obtained upon TfoI digestion of different phage DNA (Lambda, λ; PhiX174, φX174; M13), plasmids (pBR322 and pUC18) and a pBR322 PCR fragment containing a single –TTAA– site as DNA substrate. The labels ‘’ and ‘+’ indicate samples with and without the added enzyme, respectively

Fig. 5
figure 5

Determination of TfoI cleavage site. a DNA fragment (base positions 871 to 1915) of pBR322 plasmid amplified (using primers 871F and 1915R, ‘-’ in b) containing a single –TTAA– sequence at position 1720 (in bold and underlined) was used as a substrate to determine TfoI cleavage site. The enzyme digest (cut) yielded two fragments (b) which were subjected to DNA sequencing using primers 1500F and 1915R. The predicted sequence above the chromatogram refers to the sequence with –T^TAA– as the TfoI cleavage pattern. Template-independent addition of adenine base by AmpliTaq® used in the sequencing reaction is marked as ‘a’ in the predicted sequence

MseI and NdeI enzymes are known to produce compatible restriction overhangs. Therefore, to resolve if the cleavage site of TfoI is same or different from that of MseI, the end compatibility of TfoI with NdeI was checked (Fig. 6a). A successful ligation of the TfoI-digested pET29b(+) fragment with the NdeI-digested pET28a(+) established that TfoI produces DNA ends that are compatible with those produced by NdeI. The DNA sequencing data of the transformed plasmid obtained after ligation (Fig. 6b) conclusively show that TfoI cleaves between the two thymines in its recognition sequence (5′–T^TAA–3′).

Fig. 6
figure 6

TfoI and NdeI create compatible cohesive ends. The directional cloning approach followed to test end compatibility of TfoI with NdeI enzyme is outlined in (a). The TfoI-XhoI digested fragment (158–311) of pET29b(+) was cloned into NdeI-XhoI digested pET28a(+) vector, and the plasmid isolated from the transformants were DNA sequenced. Successful ligation of the NdeI/TfoI ends, marked by a dotted box, leads to loss of both NdeI and TfoI sites at the ligation site. b Chromatogram of the DNA sequence of a transformant obtained from the cloning described in (a). The predicted sequence above the chromatogram has the insert segment underlined and marked in bold. The ligation site at the NdeI/TfoI end is boxed by a dotted line

Enzyme characterization

Excess of enzyme with prolonged incubation time resulted in complete digestion of substrate DNA indicating that TfoI functions under a broad range of reaction conditions including buffer composition and temperature (data not shown). To determine the optimal buffer and temperature, the enzyme activity was assayed for digestion of one µg of DNA substrate with one unit of the enzyme in one hour. TfoI is observed to be optimally active in the CutSmart NEbuffer and B, G and O buffers from Thermo Scientific (Fig. A6). Of the various tested divalent metal ions, Mg+ 2 and Co+ 2 showed as preferred cofactors, while no activity is observed in the presence of Ca+ 2 ions (Fig. 7). Higher concentrations of Mg+ 2 ions (75 mM and above) is inhibitory to TfoI activity (Fig. A7a). The enzyme is fully functional up to 200 mM NaCl, and shows considerable activity even at high salt concentrations (0.5 M NaCl). However, NaCl is not required for the REase activity (Fig. A7b). TfoI functions best in the temperature range of 55–70 °C (Fig. 8a, A8). In the absence of substrate DNA, TfoI is inactivated after 30 min incubation at 80 °C (Fig. 8b). Under the conditions used in our experiment, heat treatment of TfoI at 80 °C for 20 min in the presence of DNA also led to inactivation of the enzyme (Fig. A9). The banding pattern of TfoI digests of pBR322 plasmid in the absence and presence of 15% glycerol (Fig. A10a) or 10% DMSO (Fig. A10b) remained identical. The enzyme activity is also observed to be unaffected by up to 5 mM concentration of disulfide reducing agents such as DTT and β-ME (Fig. A10c).

Fig. 7
figure 7

Divalent metal ion dependence of DNA cleavage by TfoI. The plasmid pBR322 DNA digested with TfoI in presence of different divalent metal ions indicates preference of Mg+ 2 and Co+ 2 as a cofactor for endonuclease activity. REase activity assay set in the Thermo Scientific buffer B (B) serves as a positive control, and ‘’ refers to the uncut plasmid

Fig. 8
figure 8

Effect of temperature on TfoI activity. a Temperature dependence of TfoI cleaving Lambda DNA indicates optimum temperature range as 55–70 °C for its activity. b TfoI is heat inactivated at 80 °C. The pBR322 plasmid DNA was used in the heat inactivation assay and TfoI activity at 55, 65 and 80 °C without pre-incubation served as controls. The partial digestion of the pBR322 plasmid observed at 80 °C is due to the digestion of the DNA that occurs before the enzyme is fully heat denatured. The undigested DNA is marked by the lane labeled as ‘

TfoI RM gene cluster identification in the T. fonticaldi PL17 genome

The whole genome shotgun project of T. fonticaldi PL17 is deposited at DDBJ/ENA/GenBank, accession number LZDH00000000. The T. fonticaldi PL17 genome annotation using NCBI PGAP, RAST and GLIMMER did not yield any gene as a Type II REase. However, two genes were marked as methylase (MTase) subunit (YeeA) of a Type II restriction enzyme and methylation subunit of a Type III restriction–modification system by RAST and GLIMMER annotation (and not by PGAP). The protein sequences for these two MTases were subjected to protein BLAST analysis. The protein corresponding to the MTase subunit annotated as Type III MTase present on contig 6 [location: 462674-461760 (-ve strand)] shows 53.9% sequence identity with the MseI MTase (Micrococcus sp. NEB 446). Analysis of the neighboring sequence of this gene yielded an ORF downstream [location: 461759-461213 (-ve strand)] which when subjected to BLAST analysis shows significant sequence identity with other –TTAA– recognizing REases of the Type II RM systems; 47.22, 62.22, and 64.64% with MseI, RspRSORF124P, and Tam77409Ip, respectively (Fig. 9). The sequence analysis data thus clearly suggest that the two-gene cluster on contig 6 [location: 462,674 − 461,213 (-ve strand)] codes for a –TTAA– recognizing Type II RM system. It should be mentioned here that ORF analysis of the 3 kb DNA sequence encompassing the annotated YeeA Type II MTase gene in all six reading frames indicates that there was no sequence that showed similarity to known REases.

Fig. 9
figure 9

Comparison of TfoI sequence with known orthologous REases. Multiple sequence alignment of the deduced amino acid sequence of putative TfoI (TfoIp) with the protein sequence of other –TTAA– cleaving REases from Micrococcus sp. NEB 446 (MseI), Roseiflexus species RS-1 [RspRSORF124P] (RspRS-1p) and Thermus amyloliquefaciens YIM 77,409 (Tam77409Ip) using T-Coffee (Di Tommaso et al. 2011). The conserved residues among the sequences are highlighted; with 100% conserved residues (asterisk) in white on black background and similar amino acid residues are depicted as white on grey background. The 33 residue extension at the N-terminus of Tam77409Ip protein is not included in the alignment analysis. The final letter ‘p’ in the enzyme name indicates that the protein sequence is of a putative enzyme

Discussion

TfoI from T. fonticaldi PL17 is a first of the restriction enzymes from the Tepidimonas genus. It is 4-base cutter, cleaving within the palindromic sequence of 5′–TTAA–3′ between the two thymine bases to generate a 2-base 5′-overhang (5′–T^TAA–3′) and requires a divalent metal ion as the only cofactor for activity. Therefore, TfoI can be classified as a Type IIP REase. Although the PGAP, RAST, and GLIMMER gene annotation algorithms did not mark the presence of a complete RM system in T. fonticaldi PL17 genome, protein sequence analysis of the MTase subunits of RM system allowed us to fish out the TfoI RM gene cluster. The gene located at position 461759-461213 of contig 6 in the negative strand of the genome codes for putative-TfoI REase encoding a protein of 181 amino acid residues. Four-step purification of the native enzyme yielded functionally pure protein free of any potential nucleases as indicated by the quality control assays known for restriction enzymes. The pure monomeric protein on an SDS–PAGE revealed a band between 15 and 25 kDa bands of the molecular weight ladder. Mass spectrometry of the sample indicated a mass of 20.696 kDa (Fig. A11), which is in agreement with the calculated molecular weight of 20.694 kDa for the putative monomeric protein from the gene sequence.

T. fonticaldi PL17 is a slightly thermophilic bacterium with 55 °C as optimum growth temperature which is in agreement with the temperature of the source hot spring from where it is isolated. Nevertheless, the organism grows in a temperature range of 37–60 °C under the laboratory conditions. Interestingly, the enzyme TfoI from T. fonticaldi PL17 is also found to be functional in a broad range of reaction conditions, including temperature between 37–70 °C and salt concentration. Such a phenotype may be considered beneficial, since the temperature of the Polok hot springs are reported to show seasonal variations, below 60–75 °C (Das et al. 2012). TfoI, like most REases uses Mg2+ as an obligatory cofactor for activity. In addition, Co2+ can equally well substitute for Mg2+ as a cofactor, while the enzyme activity is weak in the presence of Mn+ 2. Even though TfoI cleaves DNA at 70 °C, it is inactivated at 80 °C. Incubation of TfoI at 80 °C leads to irreversible heat denaturation of the enzyme and no residual activity is observed when a DNA substrate is digested with the 80 °C-preheated TfoI (Fig. A9). Unaltered activity of TfoI in high concentrations of glycerol and DMSO indicate that there is no detectable star activity of TfoI (Wei et al. 2008). Disulfide reducing agents did not have any effect on TfoI activity suggesting that the protein is unlikely to contain disulfide bond(s) of any structural consequence.

REase is a component of the restriction–modification system in an organism. The production of TfoI by T. fonticaldi PL17 hints at the presence of a methyltransferase in this bacterium, which is also supported by the presence of a gene similar to MseI MTase in its genome. The T. fonticaldi PL17 genomic DNA (gDNA) subjected to digestion with purified TfoI and Tru1I (an isoschizomer) clearly indicates gDNA protection from cleavage (Fig. 10), revealing the presence of a functional TfoI restriction–modification unit. T. fonticaldi PL17 has a 69.5% GC-rich genome, and is likely to contain fewer AT-rich sites. Therefore, genomic DNA of two other organisms, P. aeruginosa (high GC content, 66% (Labaer et al. 2004)) and E. coli BL21(DE3) (GC content, 50.8% (Jeong et al. 2009)), were also subjected to TfoI and Tru1I. A 4-base cutter, HaeIII, cleaving at –GG^CC– sites was used as yet another control to test the GC-rich genomic DNA sensitivity. As evident from Fig. 10, while BL21(DE3) gDNA shows a smear with both types of cutters, a similarly GC-rich genome of P. aeruginosa shows smearing with TfoI and Tru1I. As expected, no intact gDNA is observed for both P. aeruginosa and T. fonticaldi PL17 digestion with HaeIII. These data show that high GC content alone is not sufficient to protect gDNA from (A + T)-rich recognition sequence cutting REases. Thus, T. fonticaldi PL17 harbors a complete functional Type II restriction–modification system for TfoI, and this is a first report of an RM system in the Tepidimonas genus.

Fig. 10
figure 10

Determination of T. fonticaldi P17 genomic DNA sensitivity to TfoI. Both the –TTAA– cleaving enzymes (Tru1I and TfoI) do not digest the T. fonticaldi P17 genomic DNA. Restriction digestion of P. aeruginosa MTCC1934, and E. coli BL21(DE3) genomic DNA were set as controls for TfoI and Tru1I activity, while HaeIII, a GC-cutter, served as an REase control with a different specificity. The lane labeled as ‘’ marks undigested genomic DNA

The widely used –TTAA– cutter MseI produced by Micrococcus species (Morgan 1988) has other known commercially available isoschizomers, such as RspRSII (Park et al. 2014), SaqAI from Salinibacterium aquaticus RFL1, Tru9I from Meiothermus ruber 9, and Tru1I from Meiothermus ruber RFL1 (the genus name Meiothermus mentioned here is as per the re-classification of Thermus ruber (Nobre et al. 1996)). While MseI and SaqAI are mesophilic enzymes, the other three isoschizomers are thermophilic enzymes with optimal activity at 60 °C for RspRS II or 65 °C for M. ruber enzymes. TfoI from T. fontacaldi PL17 is a new isoschizomer of MseI, and differs from its thermophilic counterparts in terms of exhibiting activity over a broad temperature range.

It is noteworthy that of the seven reported –TTAA– cutters in the REBASE along with TfoI (this study), a site-specific endonuclease with identical recognition sequence is produced by Gram-positive mesophiles (MseI and SaqAI), Gram-negative moderate thermophiles (RspRSII, Tru9I, Tru1I, and TfoI) and a phage (SphSshM2 from Synechococcus phage S-ShM2). Such an observation may indicate that this REase is acquired by horizontal gene transfer. This idea also gains support from the fact T. fonticaldi PL17, a 69.5% GC rich organism, carries the TfoI RM-encoding gene cluster with a GC content of 42.6% which is surrounded by ORFs with greater than 65% GC content.

TfoI being a 4-base cutter or a frequent cutter is not a suitable REase for routine gene cloning. However, TfoI may be used for limited random DNA fragmentation and coupled with its property to generate cohesive ends suitable for ligation at a compatible site (for, e.g., NdeI), it can be used in preparing genomic DNA libraries. In addition, similar to the widely accepted application of MseI and its isoschizomers in AFLP (Amplified Fragment Length Polymorphisms) analysis (Vos et al. 1995; Vuylsteke et al. 2007) for the identification of genetic variation among closely related species (Zhao et al. 2006), karyotyping of chromosomes (Ludena et al. 1991) or in the study of genomic repetitive sequences, such as teleomere sequences (Vaquero-Sedas and Vega-Palas 2012), TfoI is an alternative isoschizomer of MseI in such analyses.