Introduction

Algal biomass is associated with high biodiesel potential, and besides the presence of raw material for biodiesel production, mainly neutral lipids [triglycerides (TG) and free fatty acids (FFA)] and polar lipids (glyceroglycolipids and phospholipids), contains proteins, sugars, vitamins, etc. The neutral lipids are important components of interest to the energy sector because of their high potential to produce biodiesel with high growth rates of seven to 30 times those of traditional food crops. Many microalgae, such as Chlorella vulgaris, Scenedesmus ecornis, and Dunaliella, have the ability to produce a substantial amount of biomass with 20–50 % of dry cell weight of TG as storage lipids on account of photosynthesis [16]. Basically, the fatty acid composition of neutral lipids has the same profile as that of vegetable/fish oil (C12–C20/C24), including n − 3 C18:3, C20:5 (eicosapentaenoic acid, EPA), and C22:6 (docosapentaenoic acid, DHA) [57]. The biomass and lipid productivity, composition, and nature of the components are dependent on the algal species, the cultivation media, the light flux, and the feed used for their cultivation. Stressed culture conditions such as nitrogen limitation or nutrient imbalance for stimulating an increase in the neutral lipid content in the microalgal cells are used for growth of algae of high biodiesel potential. The types of cultivation system (open pond, closed or hybrid bioreactor) for cultivation of algae also influence the yield and quality of the algal biomass [1, 5]. Lipids are extracted by chemical methods which involve extraction by solvents such as hexane/cyclohexane, chloroform, methanol, 2-propanol, and their mixtures aided by Soxhlet extraction, accelerated solvent extraction, sonication, and supercritical fluid chromatography methods [1, 3, 4, 8, 9].

Analytical techniques such as chromatography [gas chromatography (GC), high-pressure liquid chromatography (LC), gel permeation chromatography, supercritical fluid chromatography], spectroscopy [NMR spectroscopy, IR spectroscopy, mass spectrometry (MS), UV/fluorescence spectroscopy], and atomic microscopy (scanning electron microscopy, transmission electron microscopy) are being used extensively for lipid content and compositional analyses, including fatty acid profile, and monitoring the production and quality of feedstocks, and their conversion to biodiesel [736]. The determination of lipid content in algal biomass is done by methods based on solvent extraction followed by gravimetric lipid quantification [3, 4, 6, 9, 27], fat cellular staining by fluorescence microscopy using a lipid stain such as Nile red [19], solid-state time-domain NMR spectroscopy for direct estimation [15], liquid-state NMR spectroscopy [16, 23, 25, 26, 31], Fourier transform IR spectroscopy with statistical correlation [28], and fatty acid methyl ester (FAME) conversion determined by GC–MS [30, 34]. Specifically, high-pressure LC can be used to quantify different types of lipids in algal extracts, e.g., monoglycerides, diglycerides, and TG [21]. GC alone and coupled with MS (GC–MS) is used routinely to identify and quantify FAMEs from algal and yeast extracts [7, 17, 26, 30]. The review by Han et al. [28] presents a brief overview and description of the methods used for lipid quantification in microalgal biomass.

High-resolution NMR spectroscopic techniques because of their inherent sensitivity, high resolution, and multinuclear capability offer many advantages over existing analytical methods owing to the ease of direct, rapid, simultaneous, and nondestructive analyses in the multicomponent system of fat, algal oils, and biodiesel obtained from different sources, such as edible oil, nonedible oil, and algal and yeast biomass [8, 1016, 18, 2226]. More important is the diverse analytical capabilities of multinuclear one-dimensional (1D) and two-dimensional (2D) NMR (1H/13C/31P) to provide detailed quantitative compositional analyses, such as lipid content, polyunsaturated fatty acid (PUFA) content, fatty acid profile, and iodine value, and production and quality monitoring of oil and biodiesel from different sources [8, 1016, 18, 2226, 31]. However, NMR spectroscopy lacks sensitivity when compared with LC–MS and GC–MS. Usually, the identification and quantification of the components in algal extracts from the NMR spectral interpretation is done by comparison of the generated with the literature chemical shift assignments and coupling constant data of standard compounds. Single-component standards such as triglyceride oleate (TGO), triglyceride stearate, or triglyceride palmitate have been used to study the fatty acid composition of algal extracts. The matrix effect has not been taken into account when single-component standards have been used for quantitative analyses of lipid content in algal extracts [16, 23, 31]. If matrix effects are not considered, the results may be misleading owing to overlapping chemical shift regions or resemblance with the matching structures in complex matrices containing a multicomponent system of TG, FFA, and polar lipids as observed in the present studies.

Knowledge of the detailed chemical composition of algal biomass is essential to understand the chemistry of neutral and polar lipids to explore their biodiesel and high-value-product potential as well as prediction of quality, yield, and physiochemical properties of fuels derived from algae. The accurate determination of neutral lipids in the algal biomass is a challenging task owing to the presence of a complex multicomponent matrix comprising glyceroglycolipids/phospholipids, proteins, carbohydrates, and minor constituents. Methods based on analytical techniques are not standardized, and have been found to be laboratory specific as well as the composition of biomass [30]. Thus, there is an essential requirement for standardization of methods in order to ensure reproducibility and accurate assessment of lipid yield and productivity of the produced biomass.

In the present investigation, the potentials of 1H and 13C NMR (1D and 2D) spectroscopic techniques have been explored to derive directly detailed compositional information on the fatty acid profile and to develop a direct method for the absolute determination of TG, FFA, and polar lipids in the solvent extracts of microalgal biomass samples generated on a laboratory scale while systematically addressing the matrix effect by internal and external reference standardization.

Materials and methods

Samples

The ten biomass samples of S. ecornis and C. vulgaris were cultivated in flasks and bottles, under white fluorescent light illumination of 135 and 300 μmol photon m−2 s−1, respectively. Five different media were used: WC medium (WC; see [37] for the components of this medium); WC where the source of nitrogen was replaced by Chilean saltpeter (WCS) or urea (WCU); WC where the source of phosphorus was replaced by superphosphate (WCSP); and WC where the sources of phosphorus and nitrogen were replaced by superphosphate and Chilean saltpeter (WCSPS). The source of CO2 was air which entered the cultivation media by agitation or by air pumping. The cellular suspensions were centrifuged and lyophilized. The biomasses lyophilized were frozen until extractions were performed. The designation of the biomasses as mentioned in the text is given in Table 1. The freeze-dried samples were reduced to a powder in a mortar. The samples were extracted with a mixture of solvents. After extraction, the extracts were kept in a refrigerator at −4 °C to protect them from degradation.

Table 1 Nomenclature of extracts of algal biomasses cultivated under different conditions/in different media

Mixoil14 blend for reference

Mixoil14 blend containing 13 vegetable oils—Jatropha (from Brazil and from India), corn, sunflower, soybean, sesame, coconut, cotton, babassu (Attalea speciosa), palm kernel, canola, wild mango (Mangifera indica), and fish oil—and one biodiesel from soybean oil in different proportions was prepared for use as an internal and external reference for quantitative analyses of lipid content in different algal oils.

Extraction and analytical strategy

Single-step ultrasonic extraction

The extraction of neutral lipids (TG and FFA) and polar lipids from the algal biomasses of S. ecornis and C. vulgaris was done by single-step ultrasonic (200 W, 50 Hz) extraction methods using a mixture of chloroform, methanol, and water (2:1:0.3; ClMe for 60 min as per the procedure described by Sarpal et al. [26] and presented in scheme 1 in Fig. 1. The sample amount taken was 120–800 mg in 15–60 mL of the solvent or solvent mixture depending on the sample quantity available. Extracts were designated according to the species followed by the cultivation medium and by the kind of extract, such as SNDWCSPSClMe and CHLWCSPSClMe. These mean S. ecornis (i.e., SND) or C. vulgaris (i.e., CHL) cultivated in WCSPS and extracted by the solvent mixture ClMe, respectively. Some of the extracts were further extracted successively three to four times with a cyclohexane–methanol (2:1) blend in order to preconcentrate the neutral lipids in the cyclohexane layer and the polar lipids in the methanol layer (scheme 2 in Fig. 1). These extracts are designated as SNDWCSPSCH, SNDWCCH, CHLWCSPSCH, or CHLWCCH. The nomenclature of the solvent extracts of algal biomass samples cultivated using different species and media is presented in Table 1. The extraction efficiency of lipids from the algal biomass was optimized and monitored by Fourier transform IR analyses as described by Sarpal et al. [26]. The amounts of extracts obtained are given in Table 6.

Fig. 1
figure 1

Ultrasonic solvent extraction schemes 1(ClMe extraction) and 2 for the extraction of lipids from algal biomass. ClMe chloroform–methanol–water (2:1:0.3)

The repeatability of the standard deviation of the extraction procedure developed and the amount of extracts obtained was checked by doing the extraction three times for each algal biomass in the required solvent amount. The standard deviation in the 2.92–11.99 % range as given in Table 2 for five samples is self-evident of the excellent repeatability demonstrated by the ultrasonic procedure developed for the extraction of lipids in view of the number of parameters involved, such as temperature (30 °C), solvent ratio, and homogeneity of the cultivated biomass.

Table 2 Repeatability of solvent (CHCl3–MeOH–H2O) extraction results (% w/w)

Transesterification for FAME production

Lipids from lyophilized biomasses were converted into FAMEs by adoption of the usual methanol–KOH in situ transesterification using the procedure described by Menzes et al. [32]. Lipids extracted from the biomasses using different mixtures of solvents were transesterified and transformed into FAMEs using this same method. The conversion efficiency was around 70–90 % depending on the amount of polar lipids present in the biomass or extract.

Instrumental analyses

NMR recordings

All the 1H NMR spectra were recorded with a Bruker 500-MHz NMR spectrometer (AVANCE 111) equipped with broadband (BB) probe and a broadband inverse (BBI) probe. The solutions were prepared by dissolving approximately 5–10 mg of algal extracts, FAMEs, vegetable oils, and biodiesels in 0.7–0.8 mL of CDCl3 containing the internal reference tetramethylsilane. Instrument parameters such as the relaxation delay and the receiver gain were optimized, and the 90° pulse width was calibrated in order to sufficiently relax the nuclei to obtain the quantitative spectra as described in our previous work [26]. The 90° pulse width was calibrated for each sample. Variation in the pulse width was observed in the 8.1–8.64-μs range depending on the concentration of the sample. To study the effect of relaxation delay on the quantitative analyses, the spectra of two extracts (SNDWCSPS, CHLWCSPS) and soybean oil were recorded under relaxation delays of 5, 10, and 15 s, and the integral areas of functional groups were measured. The results did not show any variation in the integral intensities of protons of functional groups of esters, unsaturations, methyls, etc., at relaxation delays of 10 and 15 s. Similar observations were reported in our earlier work on the analysis of yeast extracts [26] and also by Oromí-Farrús et al. [13] in their work on the analysis of biodiesels. The following parameters were used for each recording: relaxation delay 10 s; acquisition time 3.4 s, 90° pulse width 8.13–8.64 μs; number of scans 16 or 32, chemical shift range 0–12 ppm.

All the spectra were integrated three times after proper phase and baseline corrections, and average integral areas were taken for quantitative analyses. The 13C NMR spectra were recorded with a Bruker 500-MHz NMR spectrometer using a BBI probe in a CDCl3 solution of algal extracts (40–70 mg in 0.8 mL). The overnight recordings (approximately 10,000 scans) were performed with a relaxation delay of 10 or 15 s in composite pulse decoupling mode, a pulse width of 15 μs, and a sweep width of 0–240 ppm. However, 25 % w/v solutions of vegetable oil or biodiesel were used for 13C NMR recordings, with a maximum number of scans of 2,500–5,000 under identical conditions.

Two-dimensional NMR recordings

The 2D NMR TOCSY experiments were conducted with a 15–20 mg of sample in 0.7 mL CDCl3 using a Bruker AVANCE 111 500-MHz NMR spectrometer equipped with a BBI probe as per the pulse program described in [33] with 128 increments, eight transients, 1 s relaxation delay, and a sweep width of 7 ppm in both dimensions. A mixing time (d9) of 100 ms was found to be optimum for algal extracts. The data were processed with TopSpin (Bruker) using a 90°-shifted sine-squared window function for a total of 512 × 512 data points in the F2 and F1 dimensions. The HSQC–TOCSY spectra were recorded for a solution of 30–50 mg of sample in 0.7 mL CDCl3 with 512 increments, 16 transients, 2 s relaxation delay, a mixing time of 100 ms, and a sweep width of 7 and 150 ppm for the 1H and 13C dimensions, respectively. The data were processed with TopSpin using a 90°-shifted sine-squared window function for a total of 1,024 × 512 data points in the F2 and F1 dimensions [33].

GC–MS analyses

FAMEs produced directly from the algal biomasses or from the extracts were analyzed by GC–MS with use of a Shimadzu gas chromatograph equipped with a quadrupole mass spectrometer using electron impact ionization. A capillary polar PEG wax column (30-m length, 0.25-mm diameter, 0.25-μm film thickness) was used for separation of FAMEs. The temperatures of the injector and column were maintained at 265 and 250 °C, respectively. Heptane solutions (3–5 mg in 1 mL) were injected in a split ratio of 150:1 under application of the following ramp: 70 °C for 3 min, 10 °C per minute from 70 to 240 °C, hold time 13 min, and finally 5 °C per minute up to 250 °C and held for 10 min. The identification of all types of saturated and unsaturated components, including DHA and EPA, in algae was done by comparison with the profile of NIST SRM 2772, PUFA standard, and FAMEs of fish oil.

Results and discussion

Fatty acid profile determined by GC–MS

The fatty acid profiles of extracts of different algal biomasses given in Table 3 indicate that the extracts were composed of both saturated (C14–C18) and unsaturated fatty acids, including higher carbon number unsaturated fatty acids such as C20:3, 20:4, and C22:6 compared with other biomasses. The fatty acid profiles of extracts of biomasses of SNDWCSPS, CHLWCSPS, SNDWC, and CHLWC contain C20:3 and C20:6 besides other fatty acids compared with other samples. The fatty acid profiles of algal biomasses grown in other media are similar except for the variation in the concentration of saturated and unsaturated fatty acids. These biomasses were found to contain a negligible amount of higher carbon number unsaturated fatty acids.

Table 3 Fatty acid composition of algal and fish oils determined by gas chromatography–mass spectrometry (GC–MS) (area %)

1H NMR spectral features of algal extracts

The spectral features of algal extracts were described in detail in our earlier work [25]. However, to correlate them with the present objectives, the 1H NMR features are briefly described. The algal extracts of algal biomasses generated by extraction with various solvents using ultrasonic methods from different species cultivated in different media as described in Table 1 were characterized for the nature of neutral lipids (TG and FFA) and polar lipids and their fatty acid profile. The 1H NMR spectra of ClMe and cyclohexane extracts of S. cornis algal biomass (SNDSPSClMe, SNDSPSCH), fish oils, and soybean oils are given in Fig. 2. The algal extracts clearly indicate signals at 4.05–4.38 ppm (OCH2; sn-1, sn-3), 5.25 ppm (OCH; sn-2), 5.27–5.5 ppm (CH = CH), 2.6–2.9 ppm (bisallylic, CH = CH–CH 2 –CH = CH), 2.1–2.5 ppm (CH2C = O), 2.1–1.9 ppm (allylic, CH 2 –CH = CH–), 1.61 ppm (CH 2 -CH2C = O), 1.24 and 1.30 ppm [long alkyl chain (CH2) n ], 0.88 ppm (terminal CH3 groups of saturated C14–C18, etc., or n − 6/n − 9 types of unsaturated components C18:1, C18:2), and 0.98 ppm (terminal CH3 of n − 3 types of three or more than three double bonds) characteristic of TG comprising both saturated and unsaturated fatty acid chains. The spectra also indicate less intense signals at 2.40 and 2.42 ppm (CH 2 C = O), which have been specifically assigned to C22:6 (DHA). The spectra shows the presence of CH 2 C = O signals at 2.35 ppm due to FFA, which slightly overlap with the ester signals (CH 2 C = O) of fats at 2.30–2.31 ppm. The characteristic signals due to different unsaturated fatty acids/esters in the 2.6–3.0-ppm region, such as C18:2 (2.77 ppm), C18:3 (2.81 ppm), C22:6 (2.84 ppm), and FFA, are well marked in the spectra, as shown in the expanded part of the spectra of different algal extracts (Fig. 3).

Fig. 2
figure 2

500-MHz 1H NMR spectra of algal extracts. 1 Scenedesmus ecornis (SNDSPSClMe; see Table 1 for an explanation of the nomenclature), 2 SNDSPSCH, 3 soybean oil, and 4 fish oil

Fig. 3
figure 3

Expanded view of 500-MHz 1H NMR spectra of algal extracts of S. ecornis and Chlorella vulgaris cultivated in different media: WC medium [37], superphosphate and saltpeter, and blends of free fatty acids (FFA) in Mixoil14, and algal extracts showing demarcation of CH2C = O of FFA (2.35 ppm) and triglyceride (TG) (2.25–2.45 ppm), and bisallylic (2.70–2.9 ppm) regions. SPS1 and SPS2 are extracts from repeated extractions, and FFA1 and FFA2 are free fatty acids with different amounts

The low-intensity signals in the regions from 4.38 to 5.2 ppm and from 3.0 to 4.05 ppm have been assigned to OCH2 and OCH ester groups, and CHOH and CH 2 OH groups from the glycerol part of glyceroglycolipids and phospholipids (polar lipids), as shown in Fig. 2. This region may also contain a contribution from diglycerides and TG, if they are present in a sample. The presence of phospholipids in the extracts was confirmed by 31P NMR analyses, which showed the presence of a very small amount of phosphorus components in the region from −5.0 to 6.0 ppm [25]. The spectra also show a signal of very weak intensity at 3.67 ppm, resembling signals due to phospholipids. This might be due to the OCH3 ester signal of biodiesel, and it is a unique revelation as the presence of naturally occurring FAMEs in biomass is rarely reported. The 1H NMR spectra of algal extracts also show very weak intensity signals due to olefinic protons (CH = CH or –C = CH–) in the 5.8–6.8-ppm region and aldehyde protons in the 8–11-ppm region (8.0, 8.55, 8.69, 9.40, 9.50–9.53, and 10.4 ppm), as shown in the expanded part in Fig. 2. These signals have been assigned to conjugated/nonconjugated unsaturated protons of oxylipins (lipoxygenase-derived oxygenated acid products) of PUFAs (polyunsaturated fatty esters) such as DHA and EPA [29].

The cyclohexane extract (SNDSPSCH) (Fig. 2) shows spectral features similar to those observed for the ClMe extract, except the variation in the integral intensities, which are particularly very low in the 3–4 ppm region assigned to glyceroglycolipids/phospholipids compared with the ClMe extract. Similar spectral features are demonstrated by the different solvent extracts of C. vulgaris and S. ecornis biomasses cultivated in different media, as shown in Fig. 4. The expanded view of the region of the terminal CH3 of n − 3 fatty acid components at 0.98 ppm (0.93–1.0 ppm) in the spectra of ClMe extracts shows interesting features compared with the spectra of cyclohexane extracts (Fig. 4). Each singlet of a triplet is split into a doublet with the same coupling constant as shown in spectra 4–6 in Fig. 4. This might be due to either the presence of different types of CH3 groups or polarization of these methyls by the presence of epoxy fatty acid components, as revealed later.

Fig. 4
figure 4

Partial 500-MHz 1H NMR spectra of cyclohexane and ClMe extracts of S. ecornis and C. vulgaris biomasses cultivated in different media. The partial spectra in the 0.925–1.01-ppm region are due to terminal methyls

The identities of the signals observed at 2.42–2.43 ppm, symmetrical multiplets at 2.45–2.6 ppm (marked “A A” in Fig. 4), and signals in the 1.64–1.8-ppm region (marked “C20:5 ?” in Fig. 2) in the extracts could not be revealed. The spectra show a triplet at 1.69 ppm matching the features of C20:5. However, its presence could not be confirmed owing to it overlapping with other signals in the same region. The nature of these signals was characterized by 13C and 2D NMR spectral analyses as discussed in succeeding sections.

TG constitute more than 95 % of the total monoglycerides, diglycerides and TG, as evident from the expanded part of the spectra in the 4–5-ppm region as shown in Fig. S1. The chemical shift assignments corresponding to various components as discussed above was done by comparison with spectral features of soybean oil, fish oil (Fig. 2), and standards of FAMEs. The chemical shifts of characteristic signals due to different components of various algal extracts described in Figs. 2, 3, and 4 are summarized in Table 4.

Table 4 1H NMR chemical shifts (ppm) of functional groups of algal extracts and oils

13C NMR analyses

The 13C NMR spectra of the ClMe extract of S. ecornis biomass (SNDSPSClMe) as depicted in Fig. 5 show signals characteristic of glycerides due to the carbonyl group of saturated and unsaturated fatty esters between 172 and 174.6 ppm, unsaturated carbons (CH = CH) between 126 and 133 ppm, OCH2 (sn-1, sn-3) and OCH (sn-2) of TG at 62.12 and 68.96 ppm, CH2C = O at 34.05–34.3 ppm, and long alkyl saturated and unsaturated fatty chains at 31.95 ppm (γ-CH2), 22.7 ppm (β-CH2), 14.1–14.3 ppm (α-CH3), 29–30 ppm (CH2) n , and 23–28 ppm (CH2) n . The expanded part of the spectral regions corresponding to different functional groups clearly shows the characteristic signals corresponding to unsaturated fatty acid, particularly for C18:N, N = 1–3, and C22:6 (14.29, 132.04, and 172.17 ppm). The signals due to C20:5 (14.29, 128.0, 128.98, 173.04, and 173.11 ppm) were absent, as revealed by comparison with the 13C NMR spectra of fish oil. The confirmation regarding the absence of C20:5 was provided by the 2D NMR HSQC–TOCSY analyses as described in the next section. The carbonyl signal characteristic of FFA is indicated at 178.9 ppm. The 13C NMR spectra of ClMe extracts of algal biomasses SNDSPS and CHLSPS cultivated under different conditions indicate signals at 40.93, 67.69, and 169.17 ppm, which are uncommon to glycerides. The multiplicity of these signals was revealed to be OCH, OCH2, and C = O, respectively, by analyses by means of distortionless enhancement by polarization transfer with a 135° angle (spectra not given), and the nature was confirmed by 2D NMR analyses.

Fig. 5
figure 5

Full 500-MHz 13C NMR spectra (5–185 ppm) and expanded regions of terminal CH3 (13.9–14.4 ppm); OCH2/OCH esters (60–70 ppm), fatty acid/ester chain (5–50 ppm), carbonyl ester and acid (169–182 ppm); and unsaturated carbons (126–133 ppm) of ClMe extracts of S. ecornis cultivated in superphosphate and Chilean saltpeter. Refer to Table 5 for peak assignment

The signals of very weak intensity between the intense signals, particularly between 38 and 72 ppm, are due to CH2O, CH2OH, CHO, and CHOH groups of glycolipids, and may also be due to monoglycerides and diglycerides. These signals were more intense in ClMe extracts than in cyclohexane extracts. The presence of these signals was also indicated by the 1H NMR spectral analyses of these extracts [16, 20, 25, 35]. The region from 170 to 220 ppm did not show signals due to aldehyde (CH = O) and keto (C = O) groups, although signals of very weak intensity were clearly observed in the 7–11-ppm region in the 1H NMR spectra. This is due to the long relaxation times of carbonyl carbons which inhibited the appearance of these very weak intensity signals with the acquisition parameters applied (relaxation delay 10 s). The 13C NMR chemical shifts of the characteristic signals due to different components of various algal extracts are summarized in Table 5.

Table 5 13C NMR characteristic chemical shifts (ppm) for algal extracts and oils

Two-dimensional NMR analyses

Two-dimensional (2D) NMR (HSQC–TOCSY and TOCSY) analyses were conducted with the following two objectives:

  1. 1.

    To confirm the 1H and 13C chemical shift assignments made corresponding to various fatty acid components.

  2. 2.

    To resolve several overlapping signals, especially in the regions from 1.4 to 3.0 ppm and from 5 to 5.6 ppm assigned beta to CH2C = O, allylic, bisallylic, and olefinic protons in the 1H NMR spectra.

It is evident that each class of fatty acid components exhibited characteristic signals that could be selected as NMR fingerprints for their identification in a mixture such as algal extracts as demonstrated in the next section. The 2D NMR techniques were used to resolve signals corresponding to various fatty acids, particularly unsaturated fatty acids such as C18:N (N = 1–3), 20:3, C20:5, and C22:6 in order to confirm unambiguously their assignments. The identities of the signals at 1.65–1.71, 2.42, and 2.5–2.7 ppm, which were observed in ClMe extracts, were also revealed by 2D NMR TOCSY and HSQC–TOCSY contour analyses. The 1H–1H TOCSY analyses are well described in Fig. S2. Although the TOCSY plots clearly resolved the several overlapping peaks, especially in the 1.4–3.0-ppm region, the identities of the signals at 2.42 and 2.5–2.65 ppm (marked “A A” in Fig. 4) remained a mystery. The HSQC–TOCSY analyses resolved and identified these signals, as discussed in the next section.

HSQC–TOCSY analyses

The 1H–13C HSQC–TOCSY experiment combines 1H–13C HSQC and 1H TOCSY experiments to give through-bond correlations between a 13C-attached 1H and all other coupled 1H. The coupled 1H nuclei can be seen along a line at the same 13C chemical shift from the carbon atom attached to the primary 1H. These NMR techniques are powerful analytical tools for structural elucidation of complex molecules where overlapping resonances are difficult to resolve by the TOCSY spectrum alone [33]. In the present studies, combining conventional structure determination by 1D NMR spectral techniques with HSQC–TOCSY experiments, we achieved unambiguously the 1H and 13C assignments in the overlapped regions, and applied these assignments to the identification of unsaturated and saturated fatty acid chains of glycerides in the algal extracts (Fig. 6) and fish oil (Fig. S3). The signals at 2.42 and 2.39 ppm in the 1H NMR spectrum due to CH2C = O of DHA (C22:6) are very close to each other. The HSQC–TOCSY contour plot shows a cross peak indicating connectivity to the unsaturated carbons in the regions of PUFA types of components (line a). The symmetrical signals in the 2.5–2.65-ppm region in the 1H NMR spectra show direct connectivities to signals at 5.26 ppm (1H NMR) and 67.64 and 127–128 ppm (13C NMR) in Fig. 6a and b (lines b and h). Thus, these signals are assigned to epoxy types of components of higher unsaturated fatty esters. The GC–MS analyses also revealed a FAME component at a retention time higher than that of C20:5 and C22:6. The lines indicating connectivities corresponding to saturated and unsaturated components are marked in the spectra of the algal extract and fish oil, and are described in the captions for Figs. 6 and S3, respectively. The contour plots show the absence of C20:5, as marked by line q in Fig. 6b in comparison with the contour plot of fish oil shown in Fig. S3 (lines O, P, and q).

Fig. 6
figure 6

a Full 500-MHz 1H–13C 2D heteronuclear single quantum coherence (HSQC)–total correlation spectroscopy (TOCSY) contour plot of algal extract SNDSPSClMe. a, b C18:3, C22:6; c, e epoxy ester; n C22:6; q absence of C20:5; f, g OCH2 (sn-1, sn-3); h OCH (sn-2). b Partial 500-MHz 1H–13C 2D HSQC–TOCSY contour plot of algal extract SNDSPSClMe. Connectivities are shown as lines: a C18:3 and C22:6; b, h epoxy ester; c 18:3, e C22:6; g 18:1; i, f, j, k, l CS; n C12–C18. Lines P, q, and n (C20:5) are missing in SNDSPSClMe compared with fish oil (Fig. S3). c Partial 500-MHz 1H–13C 2D HSQC–TOCSY contour plot of algal extract SNDSPSClMe. Connectivities are shown as lines: l epoxy ester; m, n glycerides

From the detailed interpretation of the TOCSY and HSQC–TOCSY spectra above, it is evident that 2D NMR spectral analyses resolved the chemical shifts and unambiguously assigned them to the saturated and unsaturated fatty acid components, particularly C22:6, C20:5, C18:3/C20:3, and epoxy fatty acid, present in a complex matrix comprising multiple components of TG, FFA, and glyceroglycolipids/phospholipids.

From the detailed NMR spectral interpretation of solvent extracts of algal biomasses and GC–MS analyses, we summarize that algal extracts are composed of the following components:

  1. 1.

    Neutral lipids

    1. (a)

      Fats (more than 95 % as TG) (major)

    2. (b)

      FFA (low to medium)

    3. (c)

      Saturated fats (as C14–C18)

    4. (d)

      Unsaturated fats [C16:N, N = 1–3, C18:N, N = 1–3, C20:3, C22:6 (DHA), epoxy esters]

    5. (e)

      Biodiesel (minor)

    6. (e)

      Oxylipins (minor)

  2. 2.

    Polar lipids: glyceroglycolipids/phospholipids

Quantitative analyses of neutral (TG, FFA) and polar lipids

As a general practice reported in the literature, the fat or glyceride content in algal or vegetable oils is determined by use of a single-component external reference such as TGO, triglyceride stearate, or triglyceride palmitate [16, 23, 31]. This approach might not be appropriate in the case of algal oils for the following reasons:

  1. 1.

    The fatty acid composition comprises both saturated and unsaturated C12–C24, including EPA and DHA, as evident from the NMR and GC–MS analyses (Table 3). Therefore, whether it is a appropriate to determine the composition against an external single-component reference needs to be investigated.

  2. 2.

    The algal oils comprise a complex matrix of neutral and polar lipids with structures quite different from the components of vegetable oils, which are mainly composed of TG.

  3. 3.

    Overlapping of signals of glyceroglycolipids/phospholipids with the signals of glycerides, particularly in the region of interest from 4.0 to 4.40 ppm.

In the present studies, the effect of these factors on the quantitative determination was investigated in detail by both external reference and internal reference blended processes.

Internal reference standardization by blending processes

In NMR investigations, internal standard calibration involves the addition of standard organic compounds to the solution containing the target analytes, and comparing the integral areas of the protons of resonances originating from the standard compound with the integral areas of the protons of resonances originating from the analytes. Since the areas of the resonances are proportional to the number of protons present, the purity or concentration of target compounds can be measured quantitatively in mole or weight percentages. The standards are either certified reference materials or matching standards of target analytes. The internal reference standardization takes care of the matrix effect as matching standards with nature similar to that of the analytes are used, in contrast to external standards, where integral areas are measured separately for a standard and the analytes [3436]. In the present studies, the matching internal standard of Mixoil14 was used for quantitative determination of the lipid content as per the approaches described later.

The following equation was developed for the determination of the TG content:

$$ TG\;\mathrm{content}={K}_{tg}\times 2{I}_{tg}\left(4.05-4.35\mathrm{ppm}\right) $$
(1)

where K tg is a proportionality constant, and I tg is the integral intensity in the 4.25–4.35-ppm region in the 1H NMR spectra. The approaches adopted for the development of Eq. 1 are described in the following sections.

Polar lipid component matrix interference

Generation of polar lipid matrix

To distinguish the 1H NMR spectral features of polar lipids from the features of neutral lipids, the polar lipids were separated from the ClMe extracts of SNDWCSPS, SNDWC, CHLWCSPS, and other biomasses by successive extractions with the cyclohexane–methanol (2:1) blends (Fig. 1, scheme 2). The 1H NMR spectra of the methanol layers (polar lipids) as shown in Fig. 7 indicate practical absence of TG as evident from the structural features in the region from 3.0 to 5.0 ppm and at 5.26 ppm. The spectral features resemble the features of glyceroglycolipids/phospholipids owing to the appearance of intense signals in the 3–5-ppm region [25]. The fatty acid chain mostly comprises unsaturated carbons as evident from the higher iodine values compared with those for the cyclohexane layer. Since the polar lipid region largely extends into the TG region (4–4.4 ppm), this will result in a larger integral area for TG owing to the signal overlapping effect. Therefore, it was considered essential to minimize or eliminate the overlapping effect in order to accurately quantify the lipid content.

Fig. 7
figure 7

a. 500-MHz 1H NMR spectra of fractions of algal extracts (scheme 2). Spectra 1, 2, and 3 are for polar lipids (methanol layer, ME), and spectra 4, 5, and 6 are for neutral lipids (cyclohexane layer, HEX) of S. ecornis and C. vulgaris. b 500-MHz 1H NMR spectra of fractions of algal extracts (scheme 2). Spectra 1 and 2 are for polar lipids (methanol layer), and spectra 3 and 4 are for neutral lipids (cyclohexane layer) of C. vulgaris. PUFA polyunsaturated fatty acid

To study the matrix effect, Mixoil14 and FFA were blended in different proportions in the polar lipids (methanol layer) in order to generate blends with large variation of polar and neutral lipids. Mixoil14 was used as an internal reference in order to generate a fatty acid composition matching that of the algal extracts (Table 3). In such a blending process, 60 blends were generated, each containing different proportions of TG, FFA, and polar lipids. The amount of polar lipids in the blends was judged from the integral intensity in the 3–4.0-ppm region.. The plot of the integral intensities of blended TG, i.e., theoretical, I tg (Thr), versus the experimentally determined intensities of the TG, i.e., I tg (Exp), in the 4.05–4.40-ppm region (sn-1, sn-3; OCH2) was obtained as shown in Fig. 8 (top left). The regression coefficient R 2 = 0.674 is indicative of the large variation on the order of 20–40 % w/w in the experimentally determined TG content compared with the blended TG content. This is ascribed to a signal overlapping effect as discussed in the preceding paragraph.

Fig. 8
figure 8

Correlation curves for the NMR integral intensities in the 4.05–4.38-ppm region of blended (theoretical, Thr) TG versus determined (experimental, Exp) TG in the polar lipid matrix (top left) and for twice the integral intensity in the 4.25–4.35-ppm region (top right), and for the blended TG content versus twice the NMR integral intensities (I tg) in the 4.25–4.35-ppm region in the polar lipid matrix (bottom left) (Eq. 2) and for the blended FFA content versus the NMR integral intensities (I ffa) in the 2.34–2.40-ppm region in the polar lipid matrix (bottom right) (Eq. 5)

The overlapping effect was minimized or practically eliminated by considering one part of the OCH2 signals (sn-1) as demonstrated in the plot of the integral intensities I tg (Thr) and I tg (Exp) in the 4.25–4.35-ppm region (sn-1, OCH2) as shown in Fig. S1. R 2 = 0.983 indicates a reasonably good agreement between both determinations as shown in Fig. 8 (top right). I tg is twice the integral intensity in the 4.25–4.35-ppm region. Consequently, a plot of blended TG content (% w/w) versus I tg (%) using Eq. 1 resulted in R 2 = 0.984 with the following equation (Fig. 8, bottom left):

$$ TG\;\mathrm{content}\left(\%w/w\right)=26.06{I}_{tg}-0.62 $$
(2)

To determine directly the TG content by Eq. 2, I tg should be calculated as twice the integral intensity in the 4.25–4.35-ppm region.

External reference method

To evaluate the effect of a single-component and a multicomponent external reference in the present studies, TGO, soybean, Jatropha, and Mixoil14 oils were used as external standards to determine K tg by Eq. 1. The values of K tg were determined as 26.0, 26.6, 26.3, and 26.9 for TGO, soybean, Jatropha, and Mixoil14 oils, respectively. The constants were calculated by dividing the TG content (100 %) by theoretical integral intensities of OCH2 groups in the 4.05–4.38-ppm region. The TG content for algal extracts determined by applying these constants using Eqs. 3 and 4 shows insignificant variation in the range from 1.5 to3.5 % w/w irrespective of the nature of the external reference as demonstrated for TGO and Mixoil14 (Table 6):

Table 6 1H NMR composition (% w/w) of CHCl3–MeOH–H2O extracts of algal biomasses (S. cornis and C. vulgaris) cultivated under different conditions obtained by application of Eqs. 1, 2, 3, 4, and 5 (extraction scheme 1)
$$ TG\;\mathrm{content}(TGO)=26.0{I}_{tg}, $$
(3)
$$ TG\;\mathrm{content}\left(\mathrm{Mixoi}14\right)=26.88{I}_{tg}. $$
(4)

The results for different algal extracts obtained with Eqs. 2, 3, and 4 are given in Table 6.

Determination of FFA in the algal extracts

The 1H and 13C NMR analyses of the different solvent extracts of algal biomass samples confirmed the presence of FFA. There is an overlapping of the FFA signal at 2.35 ppm with the ester signal at 2.30 ppm, as shown in the expanded part of the spectra in Fig. 3. The 2.34–2.38-ppm region as marked in the spectra was selected for the derivation of Eqs. 5 and 6. The blends prepared for the derivation of Eq. 1 for TG in a polar matrix were used for the derivation of Eq. 5 (Fig. 8, bottom right) and the blends of FFA in Mixoil14 were used for the derivation of Eq. 6 (curve not shown). The results obtained with these equations for algal extracts show standard deviations in the 0.15–0.8 range (Table 6) (Fig. 8, bottom right), which is insignificant in view of the large differences in the matrices between the two approaches. Equation 5, which takes care of the matrix effect, was used for the determination of FFA in the algal extracts (Table 6):

$$ FFA\;\mathrm{content}\left(\%w/w\right)=23.57{I}_{ffa}-7.84, $$
(5)
$$ FFA\;\mathrm{content}\;\left(\%w/w\right)=24.74{I}_{ffa}-7.28. $$
(6)

Comparison of internal and external methods of quantification

The standard deviation graphs among the results for the TG content obtained with Eqs. 2, 3, and 4 (Table 6) show standard deviation in the 0.22–1.7 range (Fig. 9a). The standard deviation indicating variation in results may be insignificant in view of the large matrix difference between algal extracts and vegetable oils. However, the higher standard deviation between the externally applied reference Mixoil14 (Eq. 4) and the internally applied Mixoil14 (Eq. 2) for algal extracts certainly demonstrates some degree of matrix effect. This can be significant when real-time productivity is compared and measured while screening the algal species or strains for neutral lipid enhancement efficiency. In such an experimental design, application of Eq. 2 would be an appropriate choice. Similarly, higher standard deviation was observed between TGO and soybean oil when they were used internally and externally in the algal matrix. The standard deviation curves for determination of FFA in the algal matrix (Eq. 5) and without the algal matrix (Eq. 6) show similar results, with insignificant variation, as seen in Table 6 and Fig. 9b.

Fig. 9
figure 9

Standard deviation (SD) curves for TG (a) and FFA (b) for C. vulgaris and S. ecornis algal extracts from different media obtained with Eqs. 2, 3, 4, 5, and 6

From these systematic designed experimental protocols, it is evident that the matrix has a signal overlapping effect. Once this has been eliminated by the proper demarcation of the chemical shift region (4.25–4.35 ppm), the external and internal single-component or multicomponent reference approaches both demonstrate an insignificant matrix effect on variation in the results.

Limitation of Eqs. 2 and 5 for lipid content determination

Equations 2 and 5 were applied to the determination of TG and FFA in the ten solvent extracts (ClMe) from ten algal biomasses from two species cultivated in different media (Table 6). The results reported in Table 6 show large compositional differences in the content of TG (5.24–56.4%w/w), FFA (8.48–59.38 % w/w), and polar lipids (23.30–84.32 % w/w) in the solvent extracts. This trend is quite supportive and indicative of the flexibility of the methods developed covering wide ranges of TG, FFA, and polar lipids in the algal biomasses grown under different conditions and with different species. The determinations by Eqs. 2 and 5 are independent of the composition and type of neutral and polar lipids of an algal biomass and, therefore, can be applied directly for the determination of TG and FFA in the solvent extracts.

Equation 2 estimates a higher TG content when the polar lipid content in the extracts is more than 80 % w/w, as shown in Table 6 for extracts of SNDWCS (84.25 % w/w) and SNDWCU (83.93 % w/w). This is due to the intense polar lipid signal overlapping effect as shown in the spectra in Fig. S1 and in Table 6. In such cases, it is recommended to further preconcentrate the ClMe extract for neutral lipids by the successive extraction with a cyclohexane–methanol (2:1) blend (Fig. 1, scheme 2). This is evident from the 1H NMR spectral features of cyclohexane layers shown in Fig. 7. The cyclohexane layers contained 80–90 % of the neutral lipids. Since the methanol layer comprises polar lipids, extraction scheme 2 will allow polar lipids to be determined gravimetrically. The results obtained with schemes 1 and 2, and gravimetric extractions show good agreement (Table 7).

Table 7 Comparison of TG and FFA (% w/w) content from application of scheme 1 and scheme 2 determined by NMR spectroscopy and fatty acid methyl ester (FAME) content determined by NMR spectroscopy and GC–MS

Validation of results obtained by Eqs. 2 and 5

  1. 1.

    The plot of I tg (Exp) (determined) versus I tg (Thr) (blended) shows a good correlation, with R 2 = 0.983 (Fig. 8, top right).

  2. 2.

    The average alkyl chain of the fatty acid components was C18.2:1.2 as determined from the fatty acid profile by GC–MS (Table 3). The value of K = 25.7 (102.8/4) determined from the average fatty acid alkyl chain is very close to the values determined by Eq. 2.

  3. 3.

    The amounts of FAMEs, neutral lipids, and polar lipids of five algal biomasses and a fish oil determined by gravimetric extractions and GC–MS were compared with the amounts calculated by Eq. 7:

$$ \begin{array}{l}NMR\;\mathrm{FAME}\;\mathrm{content}\;\left(\%w/w\right)=TG\;\mathrm{content}\times 1.005+FFA\;\mathrm{content}\\ {}\times 1.05+\mathrm{polar}\;\mathrm{lipid}\;\mathrm{content}\times 0.67.\end{array} $$
(7)

The results are in good agreement, as shown in Table 7. The factors in Eq. 7 were calculated on the basis of the structural determination of neutral and polar lipids by NMR spectroscopy taking the average alkyl chain as C18:1 [34].

Precision of method

The repeatability of the standard deviation for TG and FFA determination by Eqs. 2 and 5 was demonstrated on nine algal extracts obtained from different algal biomasses generated by S. ecornis and C. vulgaris in different media and in a blend. The NMR spectra of each sample were recorded three times, each spectrum was integrated three times, and the average integral intensities were used to determine the standard deviation (Table 8). The standard deviation was in the range from 0.11 to 3.79 % and from 0.67 to 12 % for TG and FFA, respectively. The higher standard deviation in the case of FFA is due to the overlapping of signals with the ester signal in the 2.33–2.38-ppm region.

Table 8 Precision data for S. ecornis (extracts 1–4), C. vulgaris (extracts 5–9), and the blend

Precision of the extraction process for the nature of the components

To study the precision of the extraction process with regard to the nature of the components extracted, each of three samples of algae was extracted three times under identical conditions by the ultrasonic method developed. These biomasses were generated in sufficient amount (approximately 1 g) to allow a component nature extraction repeatability study. The 1H NMR spectral interpretation of functional groups corresponding to different components of the extracts such as fatty esters and acids comprising unsaturated and saturated fatty acids revealed practically no variation in the nature and concentration of these components present in three extracts of each algal biomass. The integral areas and the repeatability of the standard deviation of different functional groups in the range from 0.81 to 7.48 is self-evident of the presence of similar types of components in the solvent extracts obtained by repeated extraction of the same algal biomasses (Table 9). This exercise proved the excellent precision of both the solvent extraction process and identification and interpretation skill as demonstrated by the NMR methods developed. The results presented in Tables 2, 6, 7, 8, and 9 show that internal reference standardization provides absolute quantification of lipid contents.

Table 9 Repeatability of the standard deviation of the ultrasonic extraction procedure for algal biomasses

Conclusions

NMR spectroscopic techniques provide direct, rapid, and convenient methods for the determination of glycerides, FFA, and polar lipids in the solvent extracts of algal biomass without any sample pretreatment and prior separation. The detailed 1D and 2D NMR spectral interpretation has provided unambiguous assignments of various chemical shift regions corresponding to the profiles of unsaturated (C18:N, N = 1–3, C20:5, C22:6) and saturated fatty acids/esters, including identification of epoxy fatty acid esters in a way similar to that of those determined by GC/GC–MS analyses. The systematic designed experimental protocol for the study of internal and external reference standardization made it possible to derive Eqs. 2 and 5 for absolute quantification. The matrix effect of polar lipids is due only to the signals overlapping, and it was practically eliminated by an appropriate demarcation of chemical shift regions due to TG (4.25–4.32 ppm) and FFA (2.35 ppm). Since the extracts of biomass of algae are generated in small quantities on a laboratory scale, the analytical protocol developed based on NMR spectroscopy and GC–MS can be applied to monitor rapidly the extraction efficiency of various solvents, the content of neutral lipids, and quality aspects with regard to the nature of fatty acids. The results will facilitate optimization of cultivation parameters for the enhancement of neutral lipid efficiency with regard to the fatty acid profile. The NMR methods offer great potential for analyses of a large number of samples requiring approximately 10 mg of an extract for rapid screening and comparison of algal strains for generation of algal biomasses with the desired neutral lipid content in view of the cost economics of the overall cost of generation of the biomass.