Introduction

Ginseng (Panax ginseng C. A. Mey.) is known as the “king of herbs” in China, Korea, Japan and several other eastern Asian countries. It has been cultivated as a medicinal herb for over 2,000 years in China. Because ginseng has been shown to have numerous bioactive effects on human health, such as immune system stimulation, anti-carcinogenic activity, alleviation of fatigue stress and reduction of blood glucose levels, it has been widely used as a herbal medicine and feedstock, especially its major medically effective component, ginsenosides, in a number of industries, including medicine, beverage, food, cosmetic, etc. Most of the world’s ginseng is produced in northeast Asia, 70 % in the Changbai Mountains, Jilin, China. Currently, the annual business revenue stimulated by ginseng in Jilin, China exceeds US $3.0 billion. It is estimated that this number will approach US $15.0 billion by 2020.

However, ginseng research in the past has been focused on chemical components and their medical bioactivities (Popovich et al. 2012). Research into modern genetics, genomics and breeding of the species is limited. This limits ginseng genetic improvement and the isolation, characterization and utilization of genes economically important in ginseng, such as those involved in the biosynthesis pathway of ginsenosides. To facilitate such research, hundreds of DNA markers including randomly amplified polymorphic DNA, inter-simple sequence repeat, amplified fragment length polymorphism, restricted fragment length polymorphism and simple sequence repeat (Bang et al. 2004; In et al. 2005; Kim et al. 2005; Ma et al. 2007; van Dan et al. 2010), over 10,000 expressed sequence tags (ESTs) (Jung et al. 2003) and a large-insert bacterial artificial chromosome (BAC) library (Hong et al. 2004) have been developed. These DNA markers, ESTs and BAC library have provided useful resources and tools for modern genomics, genetics and breeding research in ginseng, but they are not sufficient for all research purposes and are underdeveloped, in comparison with other economically important plant species such as rice and tomato. Moreover, most of these researches were carried out with Korean ginseng; few genome resources and tools have been developed for Chinese Jilin ginseng, even though it contributes over 70 % of the world’s ginseng to ginseng industries. Finally, the existing ginseng BAC library (Hong et al. 2004) only has a genome coverage of 3.34× that is not well-suited for many genomic studies and is not competent for direct transformation through either Agrobacterium or biolistic bombardment, which is necessary for functional analysis and use of economically important genes for ginseng breeding and molecular farming through genetic engineering.

Ginseng is perennial, usually taking 3–4 years from planting to seed setting, and special soils and climates are required for its growth and production. Moreover, research and technical practices for breeding of the species are limited, even though it has been cultivated for over 2,000 years. These limitations significantly restrict ginseng genetic improvement and production. It has been demonstrated in several plant species that the binary bacterial artificial chromosome (BIBAC) technology promises to provide efficient tools for enhanced plant genetic improvement, molecular breeding, and isolation, characterization and utilization of economically important genes for agricultural production (Liu et al. 1999; Song et al. 2004; Chang et al. 2011). Compared with BACs, BIBACs have several advantages in their use for genomics research, especially functional and post-genomics research, and molecular breeding. This is because BIBACs not only are capable of cloning and stably maintaining DNA fragments of over 300 kb in bacteria as BACs (Hamilton et al. 1996; Tao and Zhang 1998), but also can be directly used for high-molecular-weight (HMW) DNA transformation in plants by the particle bombardment (Chang et al. 2011) and Agrobacterium-mediated methods (Hamilton et al. 1996; Tao and Zhang 1998; Liu et al. 1999). The plant transformation competence of BIBACs expedites many functional genomics, molecular breeding and molecular farming researches, including economically-important gene and quantitative trait locus (QTL) cloning and genetic engineering (Zhang et al. 2012b). Moreover, a BIBAC having an insert size of 100–150 kb potentially contains 10–30 genes (Chang et al. 2011). Transforming such a large-insert BIBAC allows transfer of the genes or gene clusters and regulatory elements involved in a single pathway or biological process (Song et al. 2004), such as the genes involved in the biosynthesis of ginsenosides in ginseng. Furthermore, large-insert BIBAC transformation is well suited for engineering genes from a distantly or unrelated species: for instance, engineering the genes involved in the biosynthesis of ginsenosides from ginseng to tractably cultivated crop plants such as carrot and tomato for molecular farming. This is because the native regulatory sequences and neighboring genes of the donor species are crucial to the expression of transgenes in host species (Song et al. 2004; Chang et al. 2011). Therefore, large-insert BIBAC transformation will more likely produce transgenics that express the target genes in an appropriate spatial and temporal manner due to the reduced problems of gene silencing that are frequently encountered in the traditional gene transformation method.

Techniques for large-insert BIBAC transformation have been well established in plants, either via Agrobacterium-mediated transformation (Hamilton et al. 1996; Liu et al. 1999, 2002; Frary and Hamilton 2001; He et al. 2003) or by particle bombardment (Chang et al. 2011). Large-insert BIBACs transformed into plants are inherited stably (Hamilton et al. 1996; Frary and Hamilton 2001; Song et al. 2004; Chang et al. 2011), and the transgenes contained in large-insert BIBACs are actively expressed, yielding new or varied traits in host plants (Song et al. 2004; Chang et al. 2011). Therefore, BIBACs are emerging as an important tool for gene/QTL cloning (Hamilton et al. 1996; Zhang 2007) and large-scale genome functional analysis (Song et al. 2004; Chang et al. 2011). A large number of large-insert plant-transformation-ready BIBAC libraries have been constructed for a variety of plant species (e.g., Liu et al. 1999, 2002; Men et al. 2001; He et al. 2003; Chang et al. 2001; Meksem et al. 2000; Xu et al. 2005; Zhang et al. 2010, 2012b, c). Nevertheless, no large-insert BIBAC library has yet been available for ginseng.

In this study, we constructed a large-insert BIBAC library from Chinese Jilin ginseng cv. Damaya, a major cultigen of ginseng in northeastern China. To characterize the library, further validate its genome coverage and to identify the clones containing the genes potentially involved in the biosynthesis pathway of ginsenosides, we printed the library on nylon membranes in high clone density and screened it using overgo probes designed from nine genes potentially involved in the pathway. We isolated a total of 36 positive BIBAC clones, with an average of 4.0 positive clones per gene probe. Moreover, we also tested the transformability and stability of ginseng BIBAC clones in Agrobacterium, because they are crucial to BIBAC transformation in plants (Song et al. 2003; Chang et al. 2011). Therefore, this BIBAC library provides resources and tools for modern genome research in several aspects, particularly ginseng molecular breeding, and isolation, characterization and utilization of its economically important genes in molecular farming.

Materials and methods

Plant materials

Chinese Jilin ginseng cv. Damaya was used as the source DNA of the BIBAC library. Fresh root systems were collected from the plants growing in the field and used as materials for preparation of megabase-sized nuclear DNA.

Preparation of megabase-sized nuclear DNA

Megabase-sized nuclear DNA was prepared according to Zhang et al. (1995, 2008, 2012a). Because ginseng roots are abundant in starch and polysaccharides that potentially influence the digestion of the resultant DNA embedded in agarose, nuclei were washed for two additional times to minimize the contamination of these metabolites. The nuclei were suspended at a concentration of about 1.5 × 107 nuclei/ml for preparation of low-melting-point agarose plugs, thus making approximately 5 μg nuclear DNA per 100-μl plug.

Preparation of BIBAC vector

The BIBAC library was constructed in the widely-used BIBAC vector pCLD04541 (Tao and Zhang 1998; Meksem et al. 2000; Wu et al. 2000; Men et al. 2001; Tao et al. 2002; Fang et al. 2004; Ortiz-Vázquez et al. 2005; Feng et al. 2006; Zhang et al. 2010; Chang et al. 2011). The isolation and purification of the vector DNA and vector preparation followed Zhang et al. (2012b).

Library construction

The BIBAC library was constructed as described by Zhang et al. (2012b). We first conducted partial digestion tests using a series of amounts of BamHI per reaction to determine the optimal condition, particularly the amount of restriction enzyme (BamHI) per reaction and digestion incubation time. The partial digestion tests indicated that 20 units of BamHI per reaction containing three of the nine slices derived from a 100-μl megabase-sized DNA plug and 8-min incubation at 37 °C generated the fragments with most being between 80 and 200 kb. Therefore, the condition was selected for large-scale partial digestion for the BIBAC library construction.

We partially digested 15 100-μl megabase-sized DNA plugs with 20 units of BamHI per reaction at 37 °C for 8 min to construct the BIBAC library. The partially digested megabase-sized DNA was selected on a 1 % agarose gel by pulsed-field gel electrophoresis. The fragments ranging from 80 to 200 kb were selected, electroeluted, dialyzed and ligated into the dephosphorylated pCLD04541 vector (Zhang et al. 2012b). The ligated DNA was transformed into Escherichia coli strain DH10B (Invitrogen, USA) by electroporation using the Cell Porator™ Device (Gibco BRL, USA) as described by Zhang et al. (2012b).

Library characterization and identification of clones containing genes important in ginseng

A random sample of 124 clones was analyzed to estimate the insert size of the library according to Zhang et al. (2012b). BIBAC DNA was isolated, digested with NotI and run on a 1 % agarose gel by pulsed-field gel electrophoresis. The insert size of each clone was estimated by adding up all insert band(s) of each clone lane using the lambda ladder PFG marker (New England BioLabs, USA) as the molecular weight standard.

The BIBAC library was double-spotted onto 22.5 × 22.5 cm Hybond N+ membrane (Amersham-Pharmacia, USA) in a 4 × 4 format using the GeneTAC Robotic Workstation (Genomic Solutions, USA). Each membrane contained a total of 18,432 (48 × 384) double-printed clones and the entire library was spotted on 7.5 membranes (Zhang et al. 1996a; Zhang 2000). The library was screened by hybridization with overgo probes designed from the single-copy regions of nine genes potentially involved in the biosynthesis pathway of ginsenosides, using a two-step procedure. The library membranes were first hybridized with a probe prepared from a mixture of nine gene-specific overgos, then, the positive clones were re-arrayed into a 96-well plate, printed onto nylon membrane and re-hybridized with probes prepared from individual overgos of the genes. The hybridization was carried out at 60 °C overnight. After the hybridization, the membranes were washed in 1× SSC, 0.1 % (w/v) SDS at 60 °C twice, 15 min each time, followed by 0.5× SSC, 0.1 % SDS (Sambrook et al. 1989) at 60 °C for 10 min.

Stability of large-insert ginseng genomic DNA BIBACs in Agrobacterium

Panax ginseng has a large complex genome (3,300 Mb/1C, Hong et al. 2004), which may affect the stability of its large-insert BIBACs in Agrobacterium (Song et al. 2003). To explore the feasibility of transforming the ginseng genomic DNA large-insert BIBACs in plants via Agrobacterium for ginseng molecular breeding and ginseng gene molecular farming through genetic engineering, we randomly selected two large-insert BIBACs from the ginseng library with insert sizes of approximately 100 and 150 kb, respectively. The two clones were transformed into Agrobacterium tumefaciens strains COR308 and C1C58, respectively, by electroporation using the Cell Porator™ Device (Gibco BRL, USA) and tested for their stability in the strains. The conditions of electroporation were 4-kΩ resistance for the Voltage Booster setting; and low-ohm impedance, 330-μF capacitance and 360-V for the Cell-Porator settings. Two–five Agrobacterium BIBAC clones were selected from each BIBAC Agrobacterium transformation and grown in LB medium containing appropriate antibiotics at 28 °C, 250 rpm for 100 generations. DNA of the clone Agrobacterium cultures was isolated and re-transformed into E. coli strain DH10B (Invitrogen, USA) by electroporation as described above. Random bacterial clones were selected from the re-transformed BIBACs, and DNA of the clones was isolated and analyzed against the original BIBAC DNA used in the Agrobacterium transformation by pulsed-field gel electrophoresis (Zhang et al. 2012b).

Results and discussion

Construction and characterization of the BIBAC library

We constructed a BIBAC library from the megabase-sized nuclear DNA of Jilin ginseng cv. Damaya partially digested with BamHI in the BIBAC vector pCLD04541 (Jones et al. 1992; Tao and Zhang 1998). This vector has been widely used to construct BIBAC libraries for a variety of plant species and its transformability demonstrated in Agrobacterium and plants (Meksem et al. 2000; Wu et al. 2000; Men et al. 2001; Tao et al. 2002; Fang et al. 2004; Ortiz-Vázquez et al. 2005; Feng et al. 2006; Zhang et al. 2010; Chang et al. 2011). We isolated megabase-sized nuclear DNA from ginseng roots. Because they are abundant in metabolic substances such as starch and polysaccharides that may influence large-insert DNA library construction, we washed the nuclei extensively to minimize them, thus enhancing the DNA clonability. Moreover, we also conducted pre-size selection on the megabase-sized DNA embedded in agarose plugs to further minimize the metabolic substances and to remove the small fragments in the plugs (Zhang et al. 2012b). These measures significantly facilitated the construction of the BIBAC library.

The ginseng BIBAC library contains 143,312 clones arrayed in 368 384-well microtiter plates. To determine the insert sizes of the library clones, we analyzed a random sample of 124 clones from the library (Fig. 1a). The result showed that the clones of the library had insert sizes ranging from 45 to 200 kb, with an average insert size of 110 kb (Fig. 1b). Of the 124 clones analyzed, 97 (78.23 %) had insert sizes larger than 80 kb, each thus likely containing 10–30 genes. Therefore, the BIBACs could be possibly used to engineer clusters of genes involved in a biological process or pathway such as those for the biosynthesis of ginsenosides. Since ginseng is estimated to have a genome size of some 3,300 Mb/1C (Hong et al. 2004), the BIBAC library has a haploid genome coverage of 4.8×, with a probability of >99 % of obtaining at least one positive clone from the library using a single-copy probe (Zhang et al. 1996b; Wu et al. 2004). Therefore, the library provides useful resources and tools for genomics and genetics research on the species. Furthermore, since the library was constructed in a BIBAC vector that is competent in plant transformation with either the widely-used Agrobacterium-mediated method or the widely-used particle bombardment method (Meksem et al. 2000; Wu et al. 2000; Men et al. 2001; Tao et al. 2002; Fang et al. 2004; Ortiz-Vázquez et al. 2005; Feng et al. 2006; Zhang et al. 2010; Chang et al. 2011), this has added additional utilities to the library for ginseng genome research, particularly molecular breeding and molecular farming.

Fig. 1
figure 1

BIBACs randomly selected from the Jilin ginseng cv. Damaya BIBAC library (a) and their insert size distribution (b). M lambda ladder PFG marker, N the number of random clones analyzed

Identification of BIBACs containing genes important in ginseng

To validate the genome coverage of the library, test its utility for genome research and identify the BIBACs containing genes potentially involved in ginsenoside biosynthesis, we searched Genbank for the genes potentially involved in the pathway (Jung et al. 2003). As a result, we found nine genes likely involved in the biosynthesis pathway of ginsenosides (Table 1). We then designed single-copy overgos from the genes and used them as probes to screen the library. The library screening with the nine gene-specific overgos resulted in a total of 36 positive clones, with each gene probe having 1–7 positive clones and an average of 4.0 positive clones (Fig. 2; Table 1). Interestingly, we found that two of the genes, DS and OSC, shared five positive clones, suggesting that these two genes are physically clustered in the ginseng genome. The results further confirmed the genome coverage of the library estimated above by statistical analysis and suggested that the BIBAC library is suited for isolation and characterization of genes important in ginseng. The positive clones identified provide tools useful for ginseng molecular breeding and molecular farming via genetic engineering.

Table 1 Positive clones of gene-specific single-copy overgos identified from the BIBAC library
Fig. 2
figure 2

Example of positive BIBACs containing genes potentially involved in the ginsenoside biosynthesis pathway. Arrows indicate the positive BIBAC clones containing the genes identified from the BIBAC library

Stability of ginseng genomic DNA large-insert BIBACs in Agrobacterium

Previous studies have demonstrated that large-insert BIBACs can be readily transformed and engineered using the traditional Agrobacterium-mediated and particle bombardment methods in plants (Hamilton et al. 1996; Liu et al. 1999; Frary and Hamilton 2001; Liu et al. 2002; He et al. 2003; Chang et al. 2011). The key to the successful transformation of large-insert BIBACs into plants through Agrobacterium was shown to be their stability in Agrobacterium (Song et al. 2003; Chang et al. 2011). Therefore, to test the utility of the ginseng large-insert BIBACs in genetic engineering through genetic transformation, we selected two random clones having insert sizes of 100 and 150 kb, respectively, named Pg100 and Pg150, from the library and tested their stability in two Agrobacterium strains COR308 and C1C58. Analysis of two random clones of Pg100 transformed into Agrobacterium strain C1C58 showed that both of them had identical NotI restriction pattern to that of the original BIBAC DNA after being grown in the Agrobacterium strain for 100 generations (Fig. 3a). Analysis of four random clones of Pg150 transformed into Agrobacterium strain COR308 showed that three of them had identical NotI restriction patterns to that of the original BIBAC DNA while one was clearly rearranged after growing in the Agrobacterium strain for 100 generations (Fig. 3b). These results indicated that the ginseng large-insert BIBACs could be readily transformed into Agrobacterium by electroporation and were largely stable in Agrobacterium, even though their stability may vary among different Agrobacterium strains. Further studies are needed to test the utility of the library in ginseng gene genetic engineering in different plant species.

Fig. 3
figure 3

Stability of large-insert ginseng DNA BIBACs in widely-used Agrobacterium strains. a BIBAC Pg100 (100 kb) digested with NotI and fractionated on a pulsed-field gel. Lane 1 BIBAC DNA before transformed into Agrobacterium, lanes 2 and 3 DNA of the BIBAC transformed into Agrobacterium strain C1C58 and grown in the Agrobacterium strain for 100 generations. b BIBAC Pg150 (150 kb) digested with NotI and fractionated on a pulsed-field gel. Lane 1 BIBAC DNA before transformed into Agrobacterium, lanes 2, 3, 4 and 5 DNA of the BIBAC transformed into Agrobacterium strain COR308 and grown in the Agrobacterium strain for 100 generations