A robust two-step PCR method of template DNA production for high-throughput cell-free protein synthesis

Yabuki, Takashi; Motoda, Yoko; Hanada, Kazuharu; Nunokawa, Emi; Saito, Miyuki; Seki, Eiko; Inoue, Makoto; Kigawa, Takanori; Yokoyama, Shigeyuki

doi:10.1007/s10969-007-9038-z

Published: 01 January 2008

Volume 8, pages 173–191, (2007)

Journal of Structural and Functional Genomics

Abstract

A two-step PCR method has been developed for the robust, high-throughput production of linear templates ready for cell-free protein synthesis. The construct made from the cDNA expresses a target protein region with N- and/or C-terminal tags. The procedure consists only of mixing, dilution, and PCR steps, and is free from cloning and purification steps. In the first step of the two-step PCR, a target region within the coding sequence is amplified using two gene-specific forward and reverse primers, which contain the linker sequences and the terminal sequences of the target region. The second PCR concatenates the first PCR product with the N- and C-terminal double-stranded fragments, which contain the linker sequences as well as the sequences for the tag(s) and the initiation and termination, respectively, for T7 transcription and ribosomal translation, and amplifies it with the universal primer. Proteins can be fused with a variety of tags, such as natural poly-histidine, glutathione-S-transferase, maltose-binding protein, and/or streptavidin-binding peptide. The two-step PCR method was successfully applied to 42 human target protein regions with various GC contents (38–77%). The robustness of the two-step PCR method against possible fluctuations of experimental conditions in practical use was explored. The second PCR product was obtained at 60–120 μg/ml, and was used without purification as a template at a concentration of 2–4 μg/ml in an Escherichia coli coupled transcription-translation system. This combination of two-step PCR with cell-free protein synthesis is suitable for the rapid production of proteins in milligram quantities for genome-scale studies.

Introduction

The demand for high-throughput and flexible protein expression has increased in contemporary genome-scale research. For massive, high-throughput experiments using robotics, the simplicity and robustness of the experimental procedures are especially important. Greater complexity of the experimental protocol usually increases the rates of human error, other experimental errors, and contamination, and thus reduces the overall success rate of the experiment.

In the case of a protein expression experiment, one of the keys is the protein expression method itself, and the other is the template DNA preparation method for protein expression. For the protein expression method, cell-free protein synthesis is most suitable for high-throughput use. First, the proteins can be produced from a linear template DNA. Second, the reaction conditions can be optimized for a target protein by adding or deleting components. For example, ¹⁵N/¹³C-labeled proteins were produced and subjected to structural analyses by NMR [1, 2, 3, 4]. Selenomethionine-substituted proteins for X-ray crystallography were synthesized, and the structures were solved [5, 6, 7]. The addition of chaperones or protein disulfide isomerases allowed some proteins to be expressed in soluble and active forms [8, 9]. Functional membrane proteins were produced in the presence of detergents [10]. Furthermore, the protocol is simple and suitable for automation. Parallel protein expression can easily be achieved in a multi-well format. The reaction mixtures can then be directly subjected to protein analysis or purification. Complicated procedures, such as cell fermentation and cell disruption, are not necessary for producing proteins by cell-free protein synthesis. Thus, cell-free protein synthesis is suitable for massive and high-throughput applications [11, 12].

For the preparation of a template DNA for protein expression, complicated procedures such as cloning, including ligation, transformation and culture, are used in many cases. Gateway cloning or Ligation Independent Cloning can simplify these procedures; however, these methods still require transformation and time-consuming culture steps. Some template DNA preparation methods for cell-free protein synthesis without cloning steps have been reported [13].

Recently, another method for linear DNA template production by two-step PCR was published [14] and successful protein synthesis was obtained with the two-step PCR product. In the first PCR step, a linear DNA containing the coding sequence was amplified by PCR using gene-specific primers, and was purified. In the first stage of the second PCR step, the product DNA was combined with the dsDNA for the T7 promoter and terminator elements. In the second stage of the second PCR step, the extended template DNA was amplified with an additional primer. The resultant PCR product was used as the template DNA for cell-free protein synthesis. This method was rapid, and generated high yields and high success rates for many coding sequences. However, there were still some complicated steps, such as the purification step and the separate PCR stages, which are not suitable for automation.

In the present study, we developed a simpler and more robust two-step PCR protocol to prepare template DNA for high-throughput cell-free protein synthesis. Our two-step PCR protocol does not involve DNA purification. The second PCR step does not require a break for primer addition, and therefore is not separated into two stages. This protocol provided a high yield and a high success rate, and the protocol was applicable to target DNAs with various GC contents. In the second PCR step, the 5′- and 3′-termini of the target coding sequence can be connected to additional sequences, such as N- and/or C-terminal tags and transcription and translation elements, e.g., a promoter, a ribosome binding site, and a terminator. We explored the robustness of the two-step PCR against possible fluctuations in experimental conditions in practical use. We also expressed proteins fused with various tags by cell-free synthesis. Both of the present methods of two-step PCR and cell-free protein synthesis are suitable for robotics, because they involve only simple procedures and are free from any cloning process. Thus, we have established a practical platform to produce and analyze proteins on a genome scale in a high-throughput manner.

Materials and methods

Materials

Oligonucleotides were purchased from Invitrogen and SIGMA Genosys. The purification grades of the oligonucleotides were ‘desalted’ for Invitrogen and ‘cartridge’ for SIGMA Genosys, respectively. The Expand Hi-Fi PCR kit was obtained from Roche. The iProof Hi-Fi PCR kit was obtained from Bio-Rad. The pCR2.1-TOPO cloning vector was from Invitrogen.

In our experience, it is important to check the quality of the primers and to select the appropriate primer manufacturer and grade carefully, especially when working with many samples, because the quality of the primers was strongly dependent on the manufacturers and grades. The concentrations of the primers in some lots from some manufacturers were far less than the specified ones. Some primer lots contained a large amount of defective product. The two-step PCR sometimes failed when using these primers. The two-step PCR product obtained using some primer lots contained more nucleotide deletions in the primer region than in other regions, as confirmed by sequencing after cloning into a vector followed by single colony isolation, although the products seemed to have the proper length, as confirmed by agarose gel electrophoresis.

Target clones

Human cDNA clones (Ultimate ORF clones, Invitrogen), which varied in GC contents, were used as test clones (Table 1). A total of 18 clones were for excised domain expression, and 24 clones were for full-length expression. The pk7-Ras plasmid [15, 7], which encodes the human Ha-Ras protein, was used as the standard. For full-length expression, the clones for which the localization of the translated proteins was predicted as cytoplasmic or nuclear by PSort2 [16] were selected. The cDNA vectors were used to transform E. coli strain DH10B, and the cells were cultured in a 96-well plate in LB medium with 50 μg/ml ampicillin and 7% glycerol to the steady-state phase without shaking. For the culture of cells with the pk7-Ras plasmid, kanamycin was used as the antibiotic.

Table 1 Test clones and PCR results

In our experience, the quality of the cDNA clones differed widely among the manufacturers. Some clones from some manufacturers had a mismatch between the actual sequence and the provided sequence information. Quite a few samples that failed in the two-step PCR had this mismatch, and the unique primers were designed according to the mismatch region. For trouble-shooting of problems with the two-step PCR, reconfirming the sequence of the target cDNA clone is strongly recommended.

T7 promoter (T7P) fragment

A fragment was excised by PCR using the U2T7PL primer (GCTCTTGTCATTGTGCTTCG CATGATTACGAATTCAGATCTCGATCCCG) and the c(NL1) primer (CCCGAGGAGCCGCTGG) from pk7b2-NHisRas, a derivative of pk7-Ras, which has a natural poly-histidine affinity tag (NHis), a tobacco etch virus (TEV) protease recognition sequence [17], and the NL1-linker sequence (CCCGAGGAGCCGCTGG) upstream of the c-Ha-Ras coding sequence (NHis-TEV-NL1 fragment, Fig. 1a). The NHis tag is a modified version of the HAT tag [18], which is part of the chicken lactate dehydrogenase-A gene [19]. In addition to the NHis tag fragment, fragments with glutathione-S-transferase (GST), maltose-binding protein (MBP) or streptavidin-binding peptide (SBP) [20] tags were also constructed (the GST-TEV-NL1, MBP-TEV-NL1, and SBP-TEV-NL1 fragments, respectively). In a similar way, a derivative tag that contains the TV2-linker sequence (ACTGAGAACCTGTACTTCCAGGG), instead of the TEV recognition site and the NL1-linker, was also constructed (the NHis-TEV-TV2 fragment). Some fragments were cloned into the pCR2.1-TOPO vector (Invitrogen), sequence-verified, and amplified from the cloned vector by PCR, using Pyrobest polymerase (Takara) with the U2 primer (GCTCTTGTCATTGTGCTTCG) and the c(NL1) primer (CCCGAGGAGCCGCTGG) or c(TV2) primer (CCCTGGAAGTACAGGTTCTCAGTAGTTGGGATATCG) for the fragments with the NL1-linker or TV2-linker, respectively. The resultant fragment was purified by agarose gel electrophoresis. The gels were stained with SYBR-Gold and were visualized by excitation with blue light. Gel slices with the appropriate DNA bands were excised, and the DNA was purified by adsorption to a glass surface under chaotropic conditions, using a GFX column (Amersham Biosciences) or a QIAGEN Plasmid Midi Kit (QIAGEN).

T7 terminator (T7T) fragment

The CL1-Term T7T fragment was excised by PCR using the c(CL1-Term-LN1) primer (GCGGTGGCAGCAGCCAACTCAGCATCAATCAATTATTATCCTGACGAGGGCCCCG; the sequences complementary to the termination codons are underlined) and the U2T7TL2 primer (GCTCTTGTCATTGTGCTTCG CCAAGCTTGCATGCCTGCAGCTC) from the pk7b2-NHisRas vector. The sequence of the fragment was confirmed, and the fragment was purified in a similar manner as the T7P fragment (Fig. 1b). The CL1-Term fragment contains the CL1-linker sequence (CCTGACGAGGGCCCCG in the anti-sense strand) followed by the tandem repeat of termination codons (TAATAATTGATTGAT). Derivative fragments with the SBP- or MBP-coding sequence with the TEV protease recognition sequence, instead of the termination codon repeat (the CL1-TEV-SBP and CL1-TEV-MBP fragments, respectively), and a derivative fragment in which the CL1-linker sequence was replaced by the DT2-linker (GGGCGGGGATCAATCAATCATT in the anti-sense strand) were also constructed (the DT2-Term fragment). The linkers are shown in Fig. 2, and the entire sequences of the fragments are shown in Supplementary Table 1.

Unique primers for two-step PCR

The forward (FW) unique primer for the NL1 linker consisted of the NL1-linker sequence and the unique sequence: 5′-CCAGCGGCTCCTCGGGA-X_n-3′. The sequence X_n was identical to the 5′-terminal sequence of the target coding sequence. The reverse (RV) unique primer for the CL1 linker consisted of the CL1-linker sequence and the unique sequence: 5′-CCTGACGAGGGCCCCG-Y_n-3′. The sequence Y_n was complementary to the 3′-terminal sequence of the target coding sequence. The lengths of the unique sequences (X_n and Y_n) were designed to be 14 nt or longer, to provide a T_m of at least 46°C (Fig. 3a). The unique sequences used for test construction are shown in Supplementary Table 2. For the forward TV2-linker, the FW unique primer sequence was 5′-ACTGAGAACCTGTACTTCCAGGGA-X_n-3′. For the reverse DT2-linker, the RV unique primer sequence was 5′-GGGCGGGGATCAATCAATCATT-Y_n-3′.
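The primer layout described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the text does not say how T_m was calculated, so the simple Wallace rule (2°C per A/T, 4°C per G/C) is assumed here, and all function names are hypothetical. The linker prefixes, the 14-nt minimum and the 46°C threshold are taken from the text.

```python
# Sketch of unique-primer design for the NL1/CL1 linkers.
# ASSUMPTION: T_m is estimated with the Wallace rule; the source does not
# specify the T_m method. Function names are hypothetical.

NL1_FW_PREFIX = "CCAGCGGCTCCTCGGGA"   # 5' part of the FW unique primer (from the text)
CL1_RV_PREFIX = "CCTGACGAGGGCCCCG"    # 5' part of the RV unique primer (from the text)
MIN_UNIQUE_LEN = 14                   # nt, from the text
MIN_TM_C = 46.0                       # degrees C, from the text

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def wallace_tm(seq):
    """Wallace-rule T_m estimate: 2 C per A/T plus 4 C per G/C (assumed method)."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2.0 * at + 4.0 * gc


def shortest_unique_part(template):
    """Shortest 5' prefix of `template` with >= 14 nt and T_m >= 46 C."""
    for n in range(MIN_UNIQUE_LEN, len(template) + 1):
        if wallace_tm(template[:n]) >= MIN_TM_C:
            return template[:n]
    raise ValueError("target is too short to reach the required T_m")


def design_unique_primers(cds):
    """Return (FW, RV) unique primers for a target coding sequence."""
    cds = cds.upper()
    x_n = shortest_unique_part(cds)                      # identical to the 5' terminus
    y_n = shortest_unique_part(reverse_complement(cds))  # complementary to the 3' terminus
    return NL1_FW_PREFIX + x_n, CL1_RV_PREFIX + y_n
```

A different T_m estimate (for example a nearest-neighbor model) would change the chosen lengths but not the overall layout of linker prefix plus gene-specific part.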

Two-step PCR

We defined the ‘Standard (Std)’ two-step PCR conditions as follows. The first PCR was carried out in a reaction mixture (20 μl) with 3 μl of 50-fold diluted culture medium of the cDNA clone as the template, 50 nM each of the FW and RV unique primers, 0.2 mM each of dNTPs, 1× Expand-Hi-Fi buffer and 0.5 U Expand-Hi-Fi Enzyme (Roche) with hot start (Fig. 3a). The PCR program began with a 2 min denaturation step at 94°C. This step was followed by 40 cycles of denaturation at 94°C for 30 s, annealing at 60°C for 30 s and extension at 72°C for 1 min (after the 20th cycle, the extension duration was prolonged for 5 s per cycle). The last step was an incubation at 72°C for 7 min. The resultant product was immediately cooled to 10°C. The second PCR was carried out in a reaction mixture (20 μl) with 5 μl of 5-fold diluted first PCR product, 50 pM T7P fragment, 50 pM T7T fragment, 1 μM U2 universal primer (GCTCTTGTCATTGTGCTTCG), 0.2 mM each of dNTPs, 1× Expand-Hi-Fi buffer and 0.5 U Expand-Hi-Fi Enzyme (Roche) with hot start (Fig. 3b). The PCR program began with a 2 min denaturation step at 94°C. This step was followed by 30 cycles of denaturation at 94°C for 30 s, annealing at 60°C for 30 s and extension at 72°C for 2 min for the N-NHis tag (the NHis-TEV-NL1 and CL1-Term fragments), 3 min for the N-SBP tag (the SBP-TEV-NL1 and CL1-Term fragments) and the N-NHis/C-SBP tag (the NHis-TEV-NL1 and CL1-TEV-SBP fragments) and 4 min for the N-MBP tag (the MBP-TEV-NL1 and CL1-Term fragments), the N-GST tag (the GST-TEV-NL1 and CL1-Term fragments) and the N-NHis/C-MBP tag (the NHis-TEV-NL1 and CL1-TEV-MBP fragments). After the 10th cycle, the annealing temperature was changed to 64°C, and the extension duration was prolonged for 5 s per cycle. The last step was an incubation at 72°C for 7 min. The resultant product was immediately cooled to 10°C. The concentration of the resultant product was determined with a PicoGreen dsDNA quantification kit (Invitrogen). 
All of the dilution steps in the two-step PCR protocol were carried out using the dilution buffer (1 mM Tris–HCl, 0.01 mM EDTA, pH 8.0).
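The ‘Std’ cycling programs can be written out per cycle as follows. This is a sketch, not instrument code: it assumes that "prolonged for 5 s per cycle" means a cumulative 5-s increment starting with the cycle after the stated one, and the function and key names are hypothetical. Temperatures, cycle numbers and base extension times are those given above (the default second-PCR extension of 120 s corresponds to the N-NHis tag).

```python
# Per-cycle view of the 'Std' two-step PCR programs.
# ASSUMPTION: the 5-s prolongation accumulates from the cycle after the
# stated one (e.g. cycle 21 = base + 5 s, cycle 22 = base + 10 s).

def extension_seconds(cycle, base_s, ramp_after_cycle, step_s=5):
    """Extension time for a 1-based cycle number."""
    return base_s + step_s * max(0, cycle - ramp_after_cycle)


def std_first_pcr():
    """40 cycles: 94 C 30 s, 60 C 30 s, 72 C 1 min (+5 s/cycle after cycle 20)."""
    return [
        {"cycle": c, "denature_C": 94, "anneal_C": 60,
         "extend_s": extension_seconds(c, 60, 20)}
        for c in range(1, 41)
    ]


def std_second_pcr(base_extend_s=120):
    """30 cycles; after cycle 10 the annealing is at 64 C and extension grows by 5 s/cycle."""
    return [
        {"cycle": c, "denature_C": 94,
         "anneal_C": 60 if c <= 10 else 64,
         "extend_s": extension_seconds(c, base_extend_s, 10)}
        for c in range(1, 31)
    ]
```

Passing `base_extend_s=180` or `base_extend_s=240` gives the schedules for the longer tags listed above.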

Two-step PCR with DMSO conditions

The ‘Std’ PCR conditions were modified as follows for the ‘+DMSO’ conditions. DMSO (5% v/v) was added to both the first and second PCR reaction mixtures. The first denaturation temperature was 95°C. This step was followed by 30 cycles of denaturation at 95°C for 30 s, annealing at 60°C for 30 s and extension at 72°C for 3 min (after the 10th cycle, the extension duration was prolonged for 5 s per cycle). The last step was an incubation step at 72°C for 7 min. The resultant product was immediately cooled to 10°C. The same cycle protocol was used for both the first and second PCRs.

High-fidelity and fast two-step PCR

We defined the ‘High fidelity and Fast (HF)’ two-step PCR conditions as follows. The first PCR was carried out in a reaction mixture (20 μl) with 10 ng of purified cDNA vector as the template, 50 nM each of the FW and RV unique primers, 0.2 mM each of dNTPs, 1× iProof-HF buffer and 0.4 U iProof enzyme (Bio-Rad) with hot start. The PCR program began with a 30 s denaturation step at 98°C. This step was followed by 25 cycles of denaturation at 98°C for 5 s, annealing at 60°C for 10 s and extension at 72°C for 30 s. The last step was an incubation at 72°C for 5 min. The resultant product was immediately cooled to 10°C. The second PCR was carried out in a reaction mixture (20 μl) with 5 μl of 5-fold diluted first PCR product, 50 pM T7P fragment, 50 pM T7T fragment, 1 μM U2 universal primer, 0.2 mM each of dNTPs, 1× iProof-HF buffer and 0.4 U iProof enzyme (Bio-Rad) with hot start. The PCR program began with a 30 s denaturation step at 98°C. This step was followed by 25 cycles of denaturation at 98°C for 5 s, annealing at 60°C for 10 s and extension at 72°C for 45 s. The last step was an incubation at 72°C for 5 min. The resultant product was immediately cooled to 10°C.

Robustness of two-step PCR

Dilution factor of template culture

To explore the effects of the dilution factor of the template culture on the two-step PCR, a culture of E. coli cells with the pk7-Ras plasmid, which was grown to an OD₆₀₀ of 0.8, was diluted to various relative culture concentrations (from 1/16- to 4-fold). A relative culture concentration of one-fold corresponds to a 50-fold culture dilution, which is used in the ‘Std’ PCR conditions. The two-step PCR was carried out with the diluted cultures as the template for the first PCR.
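The relation between the relative culture concentration and the actual culture dilution is simple arithmetic and can be sketched as below. The function name is hypothetical. The only input taken from the text is that a relative concentration of one-fold equals a 50-fold dilution.

```python
# Convert a relative culture concentration into the culture dilution factor.
# A relative concentration of 1 corresponds to the 'Std' 50-fold dilution.

STD_DILUTION_FOLD = 50.0


def culture_dilution_fold(relative_concentration):
    """Dilution factor of the culture for a given relative concentration."""
    if relative_concentration <= 0:
        raise ValueError("relative concentration must be positive")
    return STD_DILUTION_FOLD / relative_concentration
```

Under this relation, the tested range of 1/16- to 4-fold corresponds to culture dilutions of 800-fold down to 12.5-fold.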

Cell density in culture

To clarify the effects of the cell density in the culture, cells harboring the pk7-Ras plasmid were grown to various densities (OD₆₀₀ from 0.05 to 2), and the resultant cultures were subjected to the two-step PCR with the ‘Std’ conditions.

Plasmid content in a unit quantity of cells

To determine the effect of plasmid contents in a unit quantity of cells, cultures of cells with the pk7Ras plasmid (pk7Ras culture) and without the plasmid (blank culture) were grown to 0.8 OD₆₀₀. The pk7Ras culture was diluted by the blank culture to various extents (from 2⁻¹⁵- to 1-fold), and the resulting cultures were subjected to the two-step PCR with the ‘Std’ conditions.

Ramp rate of heating and cooling in PCR program

To explore the effects of the ramp rate of heating and cooling in the PCR programs, two-step PCR under the ‘Std’ conditions with restricted ramp rates for heating and cooling (1, 2 and 3°C/s) was carried out for test constructs (Nos. 1–10).

Primer concentration

To determine the effects of FW and RV primer concentration variations, the concentration of each primer was calibrated by its A₂₆₀ value, using the molar absorbance coefficient for each primer, which was provided by the primer manufacturer, and two-step PCR was carried out with various primer concentrations (from 1/4- to 8-fold concentrations for the ‘Std’ conditions for both the FW and RV primers) for the test constructs (Nos. 1–10). To determine the effect of an imbalance between the FW and RV primer concentrations, FW and RV primers with 1/2-, 1- and 2-fold of the ‘Std’ concentration were mixed with each other, and two-step PCR was carried out using the primer mix for the first PCR.
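Calibrating a primer stock by its A₂₆₀ value follows the Beer-Lambert law, c = A / (ε · l). The sketch below shows that calculation and the conversion to a fold of the ‘Std’ 50 nM concentration. The function names, the 1-cm path length default and any example values are illustrative assumptions; the molar absorbance coefficient would come from the primer manufacturer, as stated above.

```python
# Primer calibration from A260 (Beer-Lambert law) and fold relative to 'Std'.
# ASSUMPTIONS: 1-cm path length by default; function names are hypothetical.

STD_PRIMER_NM = 50.0  # 'Std' concentration of each unique primer, from the text


def primer_concentration_uM(a260, epsilon_per_M_cm, path_cm=1.0, dilution_fold=1.0):
    """Stock concentration in micromolar from a (possibly diluted) A260 reading."""
    if epsilon_per_M_cm <= 0 or path_cm <= 0:
        raise ValueError("epsilon and path length must be positive")
    molar = a260 / (epsilon_per_M_cm * path_cm)   # mol/l in the cuvette
    return molar * dilution_fold * 1e6            # back-correct dilution, convert to uM


def fold_of_std(primer_nM):
    """Primer concentration expressed as a fold of the 'Std' 50 nM."""
    return primer_nM / STD_PRIMER_NM
```

For instance, the tested range of 1/4- to 8-fold corresponds to 12.5 nM to 400 nM of each primer in the first PCR.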

Cell-free protein synthesis

The cell-free protein synthesis reaction was carried out at 30°C overnight, with the 30-μl scale dialysis method [21]. The second PCR product (1 μl) was used without purification as the template for cell-free protein synthesis. An aliquot of the resultant product was reserved (total fraction), and then the remainder was centrifuged at 15,000 g at 4°C for 5 min, and the supernatant was reserved (sup. fraction). The fractions were analyzed by SDS-PAGE (Perfect NT Gel, DRC) and were stained by Quick-CBB (Wako Pure Chemicals). The gel images were acquired by LAS3000 (Fuji) or FAS III (Toyobo) imagers. The target protein was quantified from the density of the band in the image, using BSA as the standard. The quantification error was estimated from multiple sets of assays of a subset of the clones. For the tag cleavage assay, the cell-free reaction mixture was incubated with 15 μg/ml of TEV protease at 30°C for 3 h after protein synthesis.

PCR error rate analysis and assessment of effects on HSQC spectra

In order to quantify the error rate for the two-step PCR, the second PCR products of construct Nos. 7 and 35 were cloned into the pCR2.1-TOPO vector, and clones were picked and sequenced. The error rate was calculated from the resultant sequences.

The ¹H-¹⁵N-HSQC spectrum provides information about protein folding and may be used to assess the feasibility of determining the structure of the protein. We examined the effect of template error rates higher than those in the normal two-step PCR products on the ¹H-¹⁵N-HSQC spectrum. The two-step PCR product with the c-Ha-Ras coding sequence and the NHis-TEV-NL1 and CL1-Term fragments was cloned into the pCR2.1-TOPO vector. The clones were sequenced and a clone with no errors was selected. The selected vector was used as the error-free template for cell-free protein synthesis. Linear templates with a higher error rate than the typical two-step PCR product were produced from a two-step PCR product by error-prone PCR, i.e., either by repeated rounds of PCR or by mutagenic PCR [22]. Cell-free protein synthesis reactions were carried out with ¹⁵N-labeled amino acids, based on the previously described protocol [23], with the error-free vector template as the standard and linear templates with increased error rates of 9.2 × 10⁻⁴ and 3.9 × 10⁻³ mutation/bp. The NHis-Ras proteins were purified with Ni-affinity resin. The NHis tag was cleaved in a reaction mixture with 7.6 μg/ml TEV and 0.4–0.6 mg/ml of the tagged protein at 30°C overnight, and then the ¹H-¹⁵N-HSQC spectrum was acquired using the tag and Ras protein mixture.

Simulation of effect of template error rate on HSQC spectra

About 1000 randomly mutated DNA sequences with a given error rate (E) were generated from the NHis-Ras open reading frame sequence (654 bp, including tag, linker sequences and a termination codon). Each sequence was translated and the mutations in the amino acid sequence were analyzed. The average number of point mutations per sequence (Np(E) and Ap(E) for nucleotide and amino acid mutations, respectively) and the relative amount of nonsense mutants (Rn(E)) in the set were calculated.
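The original calculations were done in Microsoft Excel and Visual Basic; the Python sketch below only illustrates the same kind of simulation. Several details are assumptions, because the text does not specify them: mutations are modeled as independent base substitutions (no insertions or deletions), a sequence is counted as a nonsense mutant if a stop codon appears before the wild-type stop, and amino acid mutations are counted as non-stop residue changes across all sequences. The function names are hypothetical.

```python
# Monte Carlo sketch of random point mutations in an open reading frame.
# ASSUMPTIONS: substitutions only; nonsense = premature stop codon;
# amino acid changes exclude stops. Not the authors' original code.
import random
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(orf):
    """Translate an ORF codon by codon ('*' marks a stop codon)."""
    usable = len(orf) - len(orf) % 3
    return "".join(CODON_TABLE[orf[i:i + 3]] for i in range(0, usable, 3))


def mutate(orf, error_rate, rng):
    """Substitute each base independently with probability `error_rate`."""
    out = []
    for base in orf:
        if rng.random() < error_rate:
            out.append(rng.choice([b for b in BASES if b != base]))
        else:
            out.append(base)
    return "".join(out)


def simulate(orf, error_rate, n_seq=1000, seed=0):
    """Return (Np, Ap, Rn) for a pool of `n_seq` randomly mutated sequences."""
    rng = random.Random(seed)
    orf = orf.upper()
    wt_protein = translate(orf).rstrip("*")
    nt_total = aa_total = nonsense = 0
    for _ in range(n_seq):
        mutant = mutate(orf, error_rate, rng)
        nt_total += sum(a != b for a, b in zip(orf, mutant))
        body = translate(mutant)[:len(wt_protein)]
        if "*" in body:
            nonsense += 1
        aa_total += sum(a != b and b != "*" for a, b in zip(wt_protein, body))
    return nt_total / n_seq, aa_total / n_seq, nonsense / n_seq
```

Running `simulate` three times with different seeds would give the spread from which a standard deviation can be estimated.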

The relative peak height (H(E)) of the ¹H-¹⁵N-HSQC amide cross peaks of the random mutant pool relative to that from an error-free ensemble can be roughly estimated as:

$$ H(E) = 1 - \mathit{Rn}(E) - \mathit{Ap}(E) \cdot \frac{M}{N}, $$

where M is the number of peaks with chemical shifts that change by more than the peak width at half height of the residue of the wild type, and N is the total number of amino acid residues of the translation product from the open reading frame. Here, it was assumed that neither the shifted peaks nor the peaks from nonsense mutants were observed in the ensemble.

N is 217 (amino acid residues) for the NHis-Ras protein. M was estimated to be 26 (a.a.), which was derived from a comparison of the ¹H-¹⁵N-HSQC spectra of the wild type and the Y32W mutant Ras protein (personal communication from T. Matsuda).
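With these values of M and N, the estimate of H(E) reduces to a one-line calculation. The sketch below is illustrative only; the function name is hypothetical, and Rn and Ap must be supplied from a simulation, since no simulated values are given at this point in the text.

```python
# Rough relative HSQC peak height: H(E) = 1 - Rn(E) - Ap(E) * M / N.
# M = 26 and N = 217 are the values given in the text for NHis-Ras.

M_SHIFTED_PEAKS = 26
N_RESIDUES = 217


def relative_peak_height(rn, ap, m=M_SHIFTED_PEAKS, n=N_RESIDUES):
    """Estimated peak height of the mutant pool relative to an error-free pool."""
    return 1.0 - rn - ap * m / n
```

Because M/N is about 0.12, each amino acid substitution per molecule lowers the estimate by roughly 12%, whereas a nonsense mutant removes its whole contribution.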

All of the calculations were carried out using Microsoft Excel and Visual Basic. Standard deviations of the values were determined from three sets of simulations.

Results

Design of two-step PCR protocols

In the first step of the two-step PCR (Fig. 3a), a linear DNA fragment containing the sequence encoding the target protein is amplified by PCR using two “unique primers”, consisting of the gene-specific sequences and the N- and C-terminal linker sequences (“NL1 and CL1 linkers”). In the second PCR step (Fig. 3b), the product of the first PCR step is treated with two dsDNA fragments (“T7P and T7T fragments”) and a single PCR primer (“U2 primer”). The T7P and T7T fragments have the promoter and terminator sequences for T7 RNA polymerase, the optional N- and/or C-terminal tag-coding sequences, and the NL1 and CL1 linker sequences. The NL1 and CL1 linkers were designed based on the following concept. These linkers encode six small hydrophilic amino acid residues, Ser-Ser-Gly-Ser-Ser-Gly and Ser-Gly-Pro-Ser-Ser-Gly, respectively, to be connected to the N- and C-termini of the target protein (Figs. 1 and 3a), so that their influence on the structure and other properties of the target protein is minimized. The GC contents of the linkers were set to be as high as about 75%, in order to shorten the length of the unique primers and thus reduce their costs. The NL1 and CL1 linkers are the standard linkers for our high-throughput protein expression system for structural analysis. In addition to them, we designed the TV2 and DT2 linkers in order to minimize the number of residual amino acids after tag cleavage with the TEV protease (Fig. 2). The TV2 N-terminal linker encodes the TEV protease recognition sequence, and only one glycine remains at the N-terminus of the target protein region after TEV protease cleavage. The DT2 C-terminal linker provides four termination codons, UAA-UAA-U-UGA-U-UGA, where the tandem in-frame UAA codons are used to avoid read-through, and the two out-of-frame UGA codons are used to prevent read-through by frame shifting in the target protein region. 
As compared to the NL1 and CL1 linkers, the TV2 and DT2 linkers require longer unique primers because of their lower GC contents. The DT2 linker is designed to make the same C-terminal amino acid residue as that of the target protein. Generally, an additional adenine overhang is often attached to the 3′-terminus of the amplified dsDNA product, in PCR using the family of Taq polymerases. Thus, the bases around the linkers were designed in order to completely match the fragments with the first PCR product, even if the adenine overhangs were attached (Fig. 3b, bold ‘A’ bases).

The U2 primer, designed with an artificial sequence to avoid mispriming to popular vectors, was used to amplify the final product in the second PCR step (Figs. 1 and 3b). The U2 primer-binding site is incorporated at both the 5′-terminus of the sense strand of the T7P fragment and the 5′-terminus of the anti-sense strand of the T7T fragment. Therefore, the second PCR amplification is performed with the single U2 universal primer. Amplification with the single primer could also inhibit dimerization of the primer and increase the yield of the proper PCR product [24].

During the preparation of the T7P and T7T fragments, UV-light excitation for visualizing the DNA band for extraction from the agarose gel should be avoided, in order to minimize the damage to the DNA. When the fragment band was visualized by UV-light excitation, the following second PCR step tended to fail (data not shown).

The concentrations of the FW and RV “unique primers” in the first PCR step are important. The primer concentrations should be as low as 50 nM each, in order to increase the priming rate per primer for amplification of the target-coding region. This is to reduce the production of “primer dimers”, in which the two primers are connected head-to-head, without the target-coding region, but with an insertion or deletion of several nucleotides. The primer dimers generate a byproduct, the direct concatemer of the T7P and T7T fragments, in the second PCR step. Therefore, the low concentrations of the unique primers enable the direct use of the first PCR product, without any purification, as the template in the second PCR step. The concentrations of the four materials in the second PCR step should be in the following order: the U2 universal primer ≫ the first PCR product ≫ the T7P and T7T fragments, which is to avoid direct concatenation of the T7P and T7T fragments and to obtain mostly a single PCR product, by consuming the first PCR product, the T7P fragment, and the T7T fragment. Thus, the second PCR product can be used as the template for protein synthesis without any purification step. About 40 cycles of amplification were used in the first PCR step, to ensure amplification even from cell cultures with poor growth as the template of the two-step PCR.
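The concentration ordering can be checked with simple arithmetic on the ‘Std’ values. The sketch below computes an upper bound for the first PCR product carried into the second PCR, using the fact that each double-stranded product molecule incorporates one FW and one RV primer, so the product cannot exceed the 50 nM primer concentration. The actual product concentration is not stated in the text and will be lower; the function name is hypothetical.

```python
# Upper bound on first-PCR product in the second PCR versus U2 and fragments.
# The product is bounded by the 50 nM of each unique primer in the first PCR;
# the real yield is not given in the text.

def carry_over_nM(conc_nM, predilution_fold=5.0, added_ul=5.0, reaction_ul=20.0):
    """Concentration in the second PCR after pre-dilution and addition."""
    return conc_nM / predilution_fold * added_ul / reaction_ul


U2_PRIMER_NM = 1000.0        # 1 uM U2 universal primer
FRAGMENT_NM = 0.05           # 50 pM each of the T7P and T7T fragments
FIRST_PCR_PRIMER_NM = 50.0   # each unique primer in the first PCR

product_upper_bound_nM = carry_over_nM(FIRST_PCR_PRIMER_NM)
u2_excess = U2_PRIMER_NM / product_upper_bound_nM
product_excess_max = product_upper_bound_nM / FRAGMENT_NM
```

Under these numbers the carried-over product is at most 2.5 nM, so the U2 primer is in at least 400-fold excess over it, and the product can exceed each fragment by up to 50-fold. The same bound applies to any unused unique primers carried over from the first PCR.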

Performance and robustness of the two-step PCR protocols

The fragments used for two-step PCR are shown in Fig. 4. Two-step PCR experiments were performed for a test set of human cDNA clones, with various GC contents (Table 1). The results for the N-NHis and C-Term (no tag) fragments are summarized in Table 1 and Fig. 5. For constructs of the proper length, the concentration of the second PCR product was about 60–120 μg/ml. Under the standard PCR conditions (Fig. 5a, ‘Std’ condition), construction for target Nos. 37, 40, 43 and 44, which have relatively high GC contents, failed and a 400-bp byproduct was observed in the second PCR products. The byproduct was the direct concatemer of the N-NHis and C-Term fragments, as confirmed by sequencing (data not shown). Construction for a high GC content target tended to fail in the two-step PCR under the ‘Std’ conditions. Moreover, a stronger correlation was observed between the two-step PCR results and the maximum values of GC content, scanned over the whole sequence with a 150-bp window (GCMax150) (Table 1). These failed constructs were successfully recovered by the use of the two-step PCR under the ‘+DMSO’ conditions (Fig. 5a, ‘+DMSO’ conditions). This indicates that PCR under the ‘+DMSO’ conditions would be effective for target protein regions with a high GCMax150 value.
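A GCMax150-style value can be computed with a sliding window, as sketched below. The function names are hypothetical, and returning the overall GC content for sequences shorter than the window is an assumption; the text only defines the value as the maximum GC content scanned over the whole sequence with a 150-bp window.

```python
# Overall GC content and the maximum GC content in any 150-bp window.
# ASSUMPTION: sequences shorter than the window return their overall GC content.

def gc_percent(seq):
    """GC content of a DNA sequence in percent."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def gc_max_window(seq, window=150):
    """Maximum GC content (percent) over all windows of `window` bp."""
    seq = seq.upper()
    if len(seq) <= window:
        return gc_percent(seq)
    count = sum(base in "GC" for base in seq[:window])  # GC count in the first window
    best = count
    for i in range(window, len(seq)):
        # Slide by one base: add the entering base, drop the leaving one.
        count += (seq[i] in "GC") - (seq[i - window] in "GC")
        best = max(best, count)
    return 100.0 * best / window
```

A target with a moderate overall GC content can still contain a GC-rich stretch, which is the situation such a windowed maximum is meant to flag when choosing between the ‘Std’ and ‘+DMSO’ conditions.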

Construction by two-step PCR was successful at least for target protein regions with 100–2200-bp lengths and with 45–84% GCMax150 values or 37–75% GC contents. Irrespective of the tag size, the N-NHis, N-SBP, N-GST, and N-MBP tagged constructs were successfully obtained (Table 1, Fig. 5b). The N-NHis/C-SBP tagged constructs and the N-NHis tagged constructs with the TV2 and DT2 linkers were also successfully obtained (Table 1). The two-step PCR construction could cover a large variety of target protein regions and tags with a few modifications of the reaction conditions. Moreover, the experimental procedure is simple, because it contains neither a purification step nor a separation step.

Experimental protocols for high-throughput use, i.e., parallel processing of many samples, should be tolerant of fluctuations in experimental conditions, because it is difficult to equalize the conditions precisely for all of the samples. For example, cumulative liquid-dispensing on the micro-liter scale using robotic systems may sometimes cause an accumulated error in the total concentration of two-fold or more, and the cell density and the plasmid contents of cultures for PCR templates may vary. Therefore, we investigated the tolerance of the two-step PCR protocols against some possible variations in practical use.