Introduction

Energy needs and environmental concerns have induced worldwide interest in developing fuels based on renewable non-fossil fuel sources. Biodiesels from plants [1], algae [2], bacteria [3] and yeast [4] are rapidly emerging as potential renewable fuel sources. Most plant oils are primarily composed of five fatty acids: palmitate (16:0), stearate (18:0), oleate (18:1), linoleate (18:2) and linolenate (18:3) [1, 5, 6]. Palmitate and stearate are saturated fatty acids, oleate is a monounsaturated fatty acid, and linoleate and linolenate are polyunsaturated fatty acids. However, most plant oils used commercially as biodiesel feedstocks have high levels of polyunsaturated fatty acids, which decreases the stability of the oil, affects the cetane number [7] and directly limits their use as biofuels. Biodiesel with a high monounsaturated fatty acid (oleate) content has desirable characteristics for fuel performance such as ignition quality, nitrogen oxide (NO x ) emissions and oxidative stability [8]. Therefore, from a compositional standpoint, plant oils high in oleic acid and low in palmitic acid are good candidates for developing a high-performance biodiesel [9].

Soybean oil is essentially composed of triacylglycerol, a product of fatty acid esterification, and its performance as a fuel depends directly on its fatty acid composition. The metabolic manipulation of fatty acids in soybeans as it relates to improving oil quality and performance has been studied [1, 10]. Plants have been developed with genetically modified oils via mutation [11] and recombinant DNA technology [10], followed by selective breeding programs. Both approaches require screening to ensure optimal expression, stability across diverse growing environments and acceptable agronomic performance of the derived lines. Mutant soybean lines have been developed and introduced into a breeding program aiming at manipulating their fatty acid composition [11]. However, the results thus far have indicated that in addition to the length of time required to develop advanced varieties via sexual crossings, the main challenge in developing commercial lines is achieving the required agronomic characteristics, such as high seed yield. Early studies have indicated that these constraints could be due to pleiotropy or unfavourable linkages during sexual crossings [12]. Over the last decade, the fatty acid composition of oil seeds such as soybean and Jatropha curcas has been successfully altered via metabolic engineering [1315, 10, 16]. Down-regulating FAD2-1 and FatB using interference RNA has enabled the production of soybeans with significantly higher oleic acid content (up to 85 %) and reduced levels of palmitic acid (<5 %), which improve its agronomic and biodiesel fuel performances [10]. These results demonstrate that engineering the fatty acid metabolism of soybean seeds allows the production of biodiesel with improved fuel performance, stability and NO x emissions [10].

Monitoring the composition and structural quality of the fatty acids in the metabolically engineered seed lines of T0 transgenic plants/T1 seeds using heterozygous individual segregating seed, while maintaining germination without waiting for further progeny generation, is essential. These possibilities provide the basis by which one can decide whether to propagate advanced progenies for generating homozygous lines. A routine approach to evaluating fatty acid composition was developed using optimised capillary GC-MS [17]. This new ability to determine the structures of the fatty acids was the result of significant technological advances in mass spectrometry [18, 19]. Despite the availability of a range of mass spectrometry procedures [2022], the structural analysis of fatty acids has been performed on few metabolically engineered oil seed plants [14, 10].

In our study, we have generated metabolically engineered transgenic soybean lines with high levels of oleic acid and decreased levels of palmitic acid by down-regulating the expression of Δ12 desaturase (FAD2-1) and palmitoyl thioesterase (FatB1a) under the control of a constitutive promoter and RNA interference technology. We have also determined the composition and structural characterisation of the fatty acids in these lines via mass spectrometry.

Materials and methods

Synthetic fatty acid sequences and the selective marker

To generate the fatty acid down-regulated RNAi soybean construct, we used two gene-specific 300-bp fragments corresponding to the cDNA coding regions of SoyFAD2-1 (GenBank accession L43920.1, nt 272 to 571) and SoyFatB1a (GenBank accession DQ861997.1, nt 65 to 364). These cDNA fragments were assembled in tandem to form a 600-bp fragment. An in silico restriction analyses of the 600-bp sequence indicated an XbaI site (TCTAGA) at position 533. The first base, T (underlined), was deleted to remove the restriction site. A synthetic 599-bp fragment corresponding to the FAD and FAT coding regions flanked by 5′ XbaI/XhoI 3′ and 5′ KpnI/BamHI 3′ restriction sites was produced (DNA2.0 Inc., Menlo Park, CA 94025, USA). The synthetic fragment was inserted in the sense direction into the XhoI/KpnI sites and in the antisense direction into the BamHI/XbaI sites of pKannibal [23] to generate the pSoybiofuel1 RNAi vector. In the pSoybiofuel1 vector, the total hairpin sequences were under the control of the constitutive 35SCaMV promoter and OCS terminator. The pAC321 vector carrying the mutated ahas gene from Arabidopsis thaliana, which confers tolerance to the herbicide Imazapyr®, was used as a selective marker [24].

Soybean gene transfer

The plasmids pSoybiofuel1 and pAC321 [25, 24] were co-bombarded in a 1:1 ratio into the apical meristematic region of somatic embryonic axes from mature soybean seeds (cultivar conquista, Embrapa, Brazil) via particle bombardment as previously described [24]. The putative plants grown in Imazapyr®-selective medium were analysed by PCR for both transgenes. The PCR-positive T0 plants were acclimatised and cultivated in soil under greenhouse conditions until they produced seeds. All seeds were harvested and used individually for molecular and biochemical analyses.

PCR analyses

A small segment (approximately 1 mm2) of the cotyledon from each T1 and T2 seed was excised for DNA isolation. The segment from the seed was removed from the opposite side to the embryonic axis to allow later seed sowing and plant development. The plant DNA was isolated according to the technique of Doyle and Doyle [26]. The PCRs contained 10 mM Tris-HCl (pH 8.4), 50 mM KCl, 2 mM MgCl2, 160 mM each of dNTP, 2 U Taq polymerase (Invitrogen), 20 ng of DNA and 200 nM of one of the following pairs of primers: KANNIf (CCACTATCCTTCGCAAGACC) and PDKr (TAACTATTTTATACTAAAAGGA); PDKf (TTTCTAAATAATACTTGTAGTT) and KANNIr (CGATCATAGGCGTCTCGCAT) or ahasp124 (ACTAGAGATTCCAGCGTCAC) and ahas500c (GTGGCTATACAGATACCTGG). The DNA was pre-denatured at 95 °C for 5 min and amplified for 35 cycles of 95 °C for 1 min, 55 °C for 1 min and 73 °C for 1 min, with a final cycle at 72 °C for 7 min. The expected amplification products using the given primers contained 728 and 835 bp for the forward and reverse bands, respectively, and 685 bp for the ahas gene.

Preparing the oil samples for mass spectrometry

Oil from the soybean seeds was extracted and esterified as previously described [27], with modifications. Soybean pods from T0 transgenic and non-transgenic plants were harvested. The T1 and T2 seeds were collected and stored individually below 4 °C. Using a scalpel blade, a 1–2-mm-thick section weighing approximately 10 mg was excised from the seed cotyledon, as previously described. The section was ground using a mortar and pestle. The sample powder was transferred to a glass flask and dried overnight in an oven at 90 °C. To each flask, 500 μL of a 0.5 N KOH-methanol solution was added, and the mixture was vortexed for 15 s. Then, the flasks were heated in an oven at 80 °C for 60 min. To each flask, 2 mL of an acid-methanol solution (6 mL H2SO4 in methanol) was added, and the samples were vortexed for 3 min and heated in an oven at 80 °C for an additional 60 min. The samples were cooled at room temperature for 10 min, and 2 mL of hexane was added, followed by vortexing for 3 min. The samples were allowed to separate into phases for 45 min. Then, 1 mL of the supernatant was collected and placed in a capped vial for a GC autosampler. Standards of palmitic acid (Sigma-76119), stearic acid (Sigma-85679), linolenic acid (Sigma-62160), linoleic acid (Sigma-62230) and oleic acid (Sigma-75040) were also esterified using the aforementioned method and used for GC-MS calibration. For the LC-MS analysis, 400 μL of the GC-MS prepared sample corresponding to line 65-25 was placed in a new glass vial and dried in a N2 flux. Methanol (1 mL, LC-MS grade) was added, and the sample was vortexed for 30 s. This solution was directly infused into the MS.

GC-MS analysis

Gas chromatography mass spectrometry was performed using a Shimadzu GC-2010 and a Shimadzu GCMS-QP2010 equipped with an autosampler. Two microlitres was applied to an OmegaWax 250 capillary column (30 m × 0.25 mm × 0.25 μm) from Supelco (Sigma-24136). The following chromatographic conditions were set: initial temperature, 50 °C; final temperature, 240 °C; injection temperature, 250 °C; and rate, 5 °C/min up to 240 °C then hold at 240 °C for 10 min (total run time = 58 min). A linear flow of 2 mL/min with a helium pressure of 117.6 kPa was used in a total flow of 25 mL/min. The linear velocity was set at 51.3 cm/s with a purge at 3 mL/min and a split ratio of 10. The mass spectrometry was operated at a temperature of 250 °C with an 833-Hz scan speed from 100–500 m/z over 58 min. The chromatography peaks and MS fragmentation data were searched in the FAME fragment database.

Exact mass spectrometry and ion mobility mass spectrometry

The samples previously prepared for GC-MS were analysed in a MicrO-TOF Q II electrospray ionisation mass spectrometer (Bruker Daltonics) and a Synapt G2 high definition mass spectrometer (Waters, USA). For the Synapt G2 analysis, a nanospray source (at 600 nL/min) was used in the positive ion mode with the following conditions: analyser, resolution mode; capillary, 3.0 kV; sampling cone, 30.0; extraction cone, 6.0; source temperature, 70 °C; desolvation temperature, 200 °C; cone gas flow, 0.0 L/h; nanoflow gas pressure, 0.2 Bar; purge gas flow, 800.0 mL/h; desolvation gas flow, 800.0 L/h; LM resolution, 4.9; and HM resolution, 15.0. The mass spectrometry was externally calibrated using ion series from a sodium formate solution. The fragment analyses were performed in the MS/MS mode, and the data were collected from 10 to 300 m/z over 5 min. The collision energy was applied in a trap cell, ramping from 6 to 35 eV, and the LM resolution was set to 23.9. Ion mobility was also evaluated in the resolution mode, adjusting the quadrupole radio frequency to select each FAME with an LM resolution of 23.9. The IMS gas flow was set to 90 mL/min, and the helium gas flow to 180 mL/min. The IMS wave velocity and wave height were set such that the TICs were observed in the centre of the drift tube, and these values were maintained for all samples. The palmitic methyl ester spectra were obtained from 270 to 275 m/z. The linolenic methyl ester spectra were obtained from 290 to 297 m/z. The linoleic methyl ester spectra were obtained from 294 to 300 m/z. The oleic methyl ester spectra were obtained from 295 to 300 m/z. The exact mass and MS/MS experiments were also performed in a MicrO-TOF Q II electrospray ionisation mass spectrometer using a standard electrospray probe and internal calibration for sub-ppm mass errors in the MS mode.

Results and discussion

Soybean transformation

Transgenic soybean plants were generated by co-bombardment using a developed protocol as previously described [24]. The microparticle bombardment experiments included 3,600 embryonic axes. The embryonic axes were cultured in vitro for 6 weeks, and the putative transgenic plantlets were analysed via PCR. Eight independent soybean T0 lines were obtained and presented both the 728 and 835 bp fragments corresponding to two internal regions of the pSoybiofuel1 vector expression cassette that were amplified using the KANNIf/PDKr and PDKf/KANNIr primer combinations, respectively. The transgenic plants were also analysed for the presence of the selectable marker gene, ahas [25]. All transgenic plantlets presented the 685 bp fragment corresponding to the amplification of the 5′ internal segment of the mutated ahas gene. The transgenic plants were transferred to soil under greenhouse conditions to set seeds. The T1 heterozygous individual seed lines were then analysed by mass spectrometry to evaluate the oil composition and structure.

GC-MS analysis

After soybean oil extraction and derivatisation, the methyl esters were analysed by GC-MS (Table 1; Figs. 1 and 2). Different T1 lines exhibited an improved profile for production of oleic acid and a reduction in palmitic acid (Electronic Supplementary Material, Table S1). Seed line 65-25 presented the highest performance with 94.58 % oleic acid and 2.12 % palmitic acid (Table 1). To verify the extraction and derivatisation stability over time, lines 65-25, 65-14 and 122-22 were injected into the GC-MS once a week throughout the analytical experiments. The standard deviations (Table 1) demonstrated that the analyses can be performed with a week between the extraction, derivatisation and GC-MS analysis without significant variation in the results. Furthermore, the 65-25 line presented higher concentrations of oleic acid and lower concentrations of palmitic acid than the oil extracted from the non-transgenic seeds (Fig. 1) [10], suggesting that the structural oil characteristics determined in line 65-25 meet the international standards for biodiesel production [10]. Thus, all T2 seeds (Table 1) were generated from line 65-25.

Table 1 GC-MS results yielding the oil composition of the transgenic soybean seed line 65 and its sub-lines. Sample number 22 is a non-transgenic event and was used as a negative reference. The standard deviation varies from 0.1 to 0.6 % from injection-to-injection
Fig. 1
figure 1

Soybean fatty acid GC-MS profile. Percentage of soybean fatty acids in the oil extracts from transgenic seed line 65

Fig. 2
figure 2

Soybean fatty acid GC-MS profiles for the oil extracts from transgenic line 65 and its sub-lines. Retention times from the different soybean lines. Each colour represents one line

Ion mobility analysis

The structural characterisation of the fatty acid profile in the transgenic and non-transgenic soybean seeds was performed using high-resolution/accurate mass spectrometry and ion mobility-MS. The exact mass spectra of line 65-25 (Fig. 3) indicated no change in the masses of palmitic methyl ester (271.2 [M + H] m/z), linolenic methyl ester (293.2 [M + H] m/z), linoleic methyl ester (295.2 [M + H] m/z) and oleic methyl ester (297.2 [M + H] m/z). However, the mass of stearic methyl ester (299.2 [M + H] m/z) could not be determined because it matched that of one isotope of oleic methyl ester, resulting in a contaminated fragmentation spectrum.

Fig. 3
figure 3

MicrO-TOF Q exact mass spectra (m/z) from the oil extracted and esterified from line 65-25

Figures 5, 6, 7 and 8 demonstrate the fragmentation of the fatty acid methyl esters (FAMEs). The results indicate no differences between the standard spectra and those of the soybean 65-25 esterified oil. The fragmentation process in the collision cell produces ions that can be assigned as signatures for the FAMEs. Figure 4 shows the three most common FAME rearrangements due to the fragmentation process. The Mclafferty rearrangement often occurs [28, 18], forming ions with M + H-31 and M + H-74 (Fig. 4b, c). The hydrocarbon chain has a signature of M + H-14, corresponding to the consecutive break of CH2. A series of ions was also detected based on the 87 m/z ion (Fig. 4a), forming ions with masses of 101, 115, 129 and 143 m/z (CH3OCO[CH2] N ) with low intensity. These signatures can be observed in Figs. 5, 6, 7 and 8, which depict ions 197 m/z and 240 m/z for palmitic acid, 225 m/z and 268 m/z for stearic acid, 223 m/z and 266 m/z for oleic acid, 262 m/z and 219 m/z for linolenic acid and 264 m/z and 221 m/z for linoleic acid, representing M + H-31 and M + H-74, respectively. In addition, Table 2 contains a list of all relevant masses found in the spectra. We found a series of ions that have M + H-18, M + H-50 as precursors with two other series that start with ions of 55 m/z and 53 m/z. These ions were also detected in the GC-MS spectra (data not shown). We can theorise that M + H-18 represents the resulting water loss in the compound, CH3(CH2) N CCHOCH2 + H2O. The M + H-50 series could be a multiple rearrangement of the hydrocarbon chain following water loss (CHCCC[CH2]NCOCH2 + C2H5 + 3H + H2O). The ion series starting at 55 m/z could begin with a compound modified from Fig. 4b, CH2CH(CH2) N CO. We do not believe that these ions could be formed by contamination either in the ion source or in the sample because the ion mobility of each methyl ester exhibited only one conformation (Fig. 9), indicating that the observed fragments were obtained from ions isolated from the quadrupole.

Fig. 4
figure 4

Molecular rearrangements of the fatty acid methyl acids produced in the collision cell. A Formation of the base peak of 87 m/z. B Loss of OCH3 (31 m/z). C Loss of CH2COHOCH3 (74 m/z)

Fig. 5
figure 5

Palmitic methyl ester fragmentation patterns. The molecular structure of this FAME and the CH2 loss are shown in detail. The arrows indicate the ion series of 87 m/z

Fig. 6
figure 6

Linolenic methyl ester fragmentation patterns. The molecular structure of this FAME and the CH2 loss are shown in detail. The arrows indicate the ion series of 87 m/z

Fig. 7
figure 7

Linoleic methyl ester fragmentation patterns. The molecular structure of this FAME and the CH2 loss are shown in detail. The arrows indicate the ion series of 87 m/z

Table 2 Mass values (m/z) of important ions and series as determined by the mass spectrometry experiments. The ion series (M + H-18) corresponds to a possible water loss. Ion series (M + H-50) corresponds to a possible rearrangement in carbonic chain with a water loss (C2H3 + 3H + H2O). Ion series 87 was described by Mclafferty [28, 18]. Ion series 55 and 53 was observed in the spectra

Furthermore, the ion mobility of each FAME was evaluated for multiple structures and/or contamination between line 65-25 and a non-transgenic line. The results are provided in Fig. 9, with the following drift times for each FAME: palmitic = 4.09 ms, linolenic = 3.79 ms, linoleic = 4.13 ms and oleic = 4.44 ms (Fig. 9a). Figure 9b depicts the isotopes of each FAME, showing a uniform distribution. These data indicate that a single structural conformation is formed, and no difference is observed between the FAMEs from line 65-25 and those from the non-transgenic line.

Fig. 8
figure 8

Oleic methyl ester fragmentation patterns. The molecular structure of this FAME and the CH2 loss are shown in detail. The arrows indicate the ion series of 87 m/z

Fig. 9
figure 9

Ion mobility analysis of the FAMEs from transgenic and non-transgenic seeds. A Superimposition of the drift time for each analysed FAME. B 3D representation of the positions of the FAMEs in the drift tube and their isotopes

These results are in agreement with the analysis and comparison of the soybean oil samples from the transgenic and non-transgenic lines previously performed by other groups using different techniques [23, 9]. For example, Luna et al. [29] analysed the oil content from transgenic and non-transgenic soybean seeds using infrared spectroscopy to distinguish and classify the soybean oil for process monitoring and control. Their results indicated no significant differences among the samples. Similar results were observed by Graef et al. [10], who compared oil performance from transgenic soybean seeds and demonstrated improved quality in transgenic lines for the production of oil for biofuel.

Historically, vegetable oils have been of great industrial importance in the preparation of margarines and other foodstuffs. They have now become important as raw materials for biofuel. The worldwide demand for vegetable oil for both foods and biofuels requires increased production. Biotechnology presents a variety of solutions to improve the fatty acid composition for food and industrial uses. Through metabolic engineering, the oil content can be altered in a variety of crops, such as soybeans, to produce the oils needed by industry. For example, the omega-3 polyunsaturated fatty acids found in fish oil can now be produced in plants [30], improving the quality of food for humans and animals. Each gene responsible for the production of oil in soybeans has been identified [10, 27, 3133] and we have developed a soybean line that expresses an oleic acid content of up to 95 % and a palmitic acid content below 4 %. The oils from the metabolically engineered soybean seed lines produced have similar structural characteristics to those from non-transgenic seeds. These lines may form the foundation for the next generation of homozygous lines and the large-scale production of improved performance oils for the bioenergy sector.

Conclusions

Metabolically engineered soybean plants exhibited a significant increase in oleic acid and a reduction in palmitic acid in their seed oil content. Structural characterisation, performed using high-resolution/accurate mass spectrometry and ion mobility-MS, revealed no differences in the fatty acids of the transgenic and non-transgenic oil extracts. Using this approach, it is possible to check potential modifications in the first plant generation, keeping the seeds viable for germination and progeny development