Introduction

Most vegetable oils have limited applications in their original forms as a result of their specific chemical compositions [1]. For instance, frying oils rich in saturated fatty acids are good in terms of crisping and stability, but they have high melting points, poor operating performances, and are unhealthy. On the contrary, oils rich in polyunsaturated acids are healthy but rather prone to becoming oxidized and generating harmful substances during deep frying [2]. So, blended oils give manufactures greater flexibility to tailor the products to accomplish specific functional properties or to satisfy nutritional requirements. Considering the production cost and the availability of plant sources, blended oils have high market share in developing countries like China and India [3, 4]. Among the oils used for producing blended oils, five kinds of pure oils [soybean (SBO), corn (MZO), peanut (GNO), rapeseed (RSO), sunflower (SFO)] are the main and important materials in China [5]. Edible blended oils, composed of different vegetable oils, have different nutritional qualities and prices. Adulteration of oil products, involving the replacement of expensive ingredients with cheap substitutes, could be very lucrative for a vendor or a raw material supplier. Therefore, the determination of the quality and authenticity of blended oils is needed to prevent economic losses, mislabeling problems, and unfair trading practices [6,7,8,9]. Moreover, the most important index to evaluate the quality of blended oils is to determine the proportions of the constitutive components in it.

To this day, many studies have been focusing on the detection of blended or adulterated vegetable oils; many analytical methods have been proposed, such as spectroscopy [infrared spectroscopy (IR), ultraviolet spectroscopy (UV), nuclear magnetic resonance (NMR), etc.] [10,11,12,13], chromatography (GC, LC, etc.) [14, 15], biotechnology (DNA, enzyme, etc.) [16], often followed by chemometric tools. Chemometrics is applied in both qualitative analysis and quantitative analysis. There are two major categories of qualitative analysis techniques—unsupervised (PCA, cluster analysis, etc.) and supervised [partial least squares–discriminant analysis (PLS-DA), etc.] [17]. Multivariate calibration methods have been applied to construct models for quantification analysis, such as PLS [18,19,20,21], principal component regression (PCR) [22], and artificial neural networks (ANN) [23]. With the launch and implementation of Commission Regulation EU No. 29/2012, an analytical method for verifying if the percentage of olive oil in a blended oil is lower or higher than 50% [24] is demanded; therefore, the detection and quantification of the co-occurring components in a blended oil are required in this new view of quality control for blended oils.

Currently spectroscopy techniques (particularly NIR, Raman, MIR spectroscopy) and chemometrics have been used to detect for authenticity in blended oils as rapid and low-cost methods [25]. These works report MIR spectroscopy as an effective technique to detect various kinds of adulteration [26,27,28,29,30,31,32]. Oussama et al. [12] focused on the detection and quantification of extra virgin olive oil adulterated with soybean oil or sunflower oil using MIR spectroscopy and PLS [33]. A PLS regression model was able to predict the proportion of sunflower oil or soybean oil adulterants in a range from 1% to 24% (w/w) with relative prediction errors lower than 3% for external validation samples. Özdemir and Özturk [34] measured NIR transmittance of pure olive oils and olive oils adulterated with sunflower oil and corn oil. The proposed method was able to predict olive oil adulteration in binary blends in a range of 5–95% (w/w). However, in the case of ternary blends of olive oil, sunflower oil, and corn oil, a good prediction was observed for the olive oil content but a limited prediction was observed for the other two vegetable oils. Compared with the spectral method, the biggest advantage of chromatography is that it is a powerful separation technique [17]. Measurement of TAGs by a high temperature gas chromatography (HTGC)–MS system was applied for the quantification of olive oil in binary blends with vegetable oils (sunflower, corn, seeds, sesame, and soya) with several percentages (10–90%) using multivariate statistical analyses, such as soft independent modeling of class analogy (SIMCA), PLS, and genetic algorithm–partial least squares (GA-PLS) [15]. Qualitative classification and compositional quantification of multivariate blends represent a complex multi-scale question which has been studied much less to this point in time. Xu et al. [35] reported an accurate and effective method for the identification and quantification of blends of sunflower oil, soybean oil, and peanut oil with three-dimensional fluorescence spectra (3DFS). 3DFS contains more information including greater numbers of samples and more excitation wavelengths (or emission wavelengths) to distinguish overlapping analytics. However, it has not been widely used because of its high price.

All the vegetable oils including blended oils are generally mixtures of TAGs which make up to 95–98% (w/w) of composition and show characteristic distributions with different concentration levels. The characteristics and the composition of the oils extracted from various vegetable seeds depend on several factors, such as soil, climate, processing technology, harvesting, and chemical degradation process occurring during storage [36, 37]. Although TAGs are more stable than other compositions, like sterols and vitamin E, pure oils do not have exact similarity in their whole TAGs profiles, let alone blended oils. In order to apply this model extensively, large samples are needed to be collected and analyzed. It is only through this database that the composition of blended oil can be routinely quantified [38].

TAG analysis methods have attracted significant attention over the last decades. Papers dealing with the chromatographic analysis of TAGs in edible oils can be found elsewhere [17]. These include gas chromatography (GC) [39, 40], liquid chromatography (LC) [41,42,43], and supercritical fluid chromatography (SFC) [44]. Depending on different separation principles, these techniques have unique strengths, which can complement each other. However, there have been few studies on the combination of TAG values from GC and HPLC to build mathematical models.

Proportions of co-occurring components in a blended oil are governed by additive–dilutive processes which obey a simple rule [45], i.e., the simulation method complies with the law of conservation of mass. Multiple linear regression (MLR) is commonly used in chromatography to determine relevant proportions of different miscible solvents for target separations of co-occurring chemical components in complex matrices [46]. In this study, the independent variables in the complex matrix chromatogram are the TAG compositions of pure oils and a blended oil, with which the proportions of constitutive components, i.e., the dependent variables, can be obtained.

The purpose of this study was to verify whether it was possible to quantify the co-occurring components in multi-blended oils with varied proportions of soybean oil, corn oil, peanut oil, rapeseed oil, and sunflower oil by TAGs and chemometric tools, which has not previously been described. There were 20 groups of binary and ternary blended oils consisting of two or three oils out of those five kinds of oils, which were analyzed and processed separately. Chemometrics tools, such as PCA and MLR, were applied in this study.

Materials and Methods

Pure Oils

Most of the oils used to construct the blended oil are refined oils. The refined vegetable oils were provided by manufacturers. The unrefined oil (peanut oil, GNO) was extracted by mechanical processing in order to maintain more flavor substances. The vegetable oils comprised soybean oil (SBO) (147 pure oils), peanut oil (GNO) (179 pure oils), maize oil (MZO) (107 pure oils), sunflower oil (SFO) (65 pure oils), and rapeseed oil (RSO) (116 pure oils). SBO, MZO, and GNO were from China; SFO was from Australia, Argentina, and China; and RSO was from the Netherlands, Australia, and China.

Lab-made Blends Oils

An original simulation approach based on complex networks was carried out to evaluate the effect on the MLR method [45]. In simplex networks, geometrical locations of blends are governed by the within-blend constraint under which the coordinates (mass) of co-occurring oil varieties obey the unit-sum rule:

$$ \sum\nolimits_{j = 1}^{q} {\beta_{j}^{\prime } = \beta } $$
(1)

where q is the number of the total components in the considered blended oil, \( \beta_{j}^{\prime } \) is the mass of component j, and β is the total mass of the blended oil. Then the mass \( \beta_{j}^{'} \) is directly converted into proportions β j by \( \beta_{j}^{\prime } /\beta \). The blended oil is made by q (≥2) additive (dilutive) components j (j = 1 to q) which obey a unit (constant) sum constraint:

$$ \sum\nolimits_{j = 1}^{q} {\beta_{j} = 1} $$
(2)

As a result of the unit sum of q proportions, the relative variations of q components can be geometrically represented within a (q − 1)-dimensions simplex space (Fig. 1). In other words, simplex is a variation space with (q − 1) dimensions in which one can represent the relative variations of q components.

Fig. 1
figure 1

Geometrical representations for different simplex systems showing different blended parameters

Figure 1 shows a simplex space corresponding to a segment or an equilateral triangle, representing blended oils with q = 2 or 3 components, respectively. The different points in a simplex space have q coordinates corresponding to the proportions β j of q components.

The q apexes of the simplex space correspond to an extreme “blend” exclusively made by one component (pure oil state). For example, for q = 3, there are three extreme states corresponding to the three apexes of an equilateral triangle with the coordinates (β 1 = 1, 0, 0), (0, β 2 = 1, 0), (0, 0, β 3 = 1) (Fig. 1b). Points located inside segments represent blends of two components (bi-varietal blends) excluding a third one, e.g., (β 1, β 2, 0), (β 1, 0, β 3), and (0, β 2, β 3). Points located within simplex space correspond to blends of three components without any null mass: (β 1, β 2, β 3) with all β j  > 0.

Random sampling is a method to simulate this application in a real-life situation, since it is unknown which pure oils would be used in the blended oil. There were 20 groups of binary and ternary systems which consisted two or three oils out of those five kinds of oils (SBO, MZO, GNO, RSO, SFO) by means of random sampling. They were prepared step by step with fixed intervals (interval of binary system was 5%, interval of ternary system was 10%). The proportions of constitutive components in blended oils varied from 5% to 95% in the binary system and from 10% to 80% in the ternary system.

Binary system (10 groups, 150 blended oils): SBO–MZO, SBO–GNO, SBO–RSO, SBO–SFO, MZO–RSO, MZO–SFO, GNO–RSO, RSO–SFO, GNO–SFO, MZO–GNO.

Ternary system (10 groups, 360 blended oils): SBO–MZO–GNO, SBO–MZO–RSO, SBO–MZO–SFO, SBO–GNO–RSO, SBO–GNO–SFO, SBO–RSO–SFO, MZO–GNO–RSO, MZO–GNO–SFO, MZO–RSO–SFO, GNO–RSO–SFO.

All the vegetable oil samples were stored in dark bottles at −4 °C until analysis.

Chemicals

Glyceryl triundecanoate and tripentadecanoin (purity ≥ 95%) were purchased from Sigma. HPLC-grade hexane, acetonitrile, and ethyl acetate were purchased from Fisher Scientific. Other chemicals and reagents of analytical grade were obtained from reliable commercial sources.

Internal Standard Solution Preparation

Tripentadecanoin was diluted with toluene to a concentration of 1.0 mg/mL. This internal standard solution (IS-a) was used for sample preparation of GC–FID analysis.

Glyceryl triundecanoate was diluted with ethyl acetate to a concentration of 20 mg/mL. This internal standard solution (IS-b) was used for sample preparation of HPLC–RID analysis.

Sample Preparation

For GC–FID analysis, the controlled oil was dissolved to a final concentration of 0.4% (w/w) with 2 mL internal standard solution (IS-a) and 8 mL hexane. The well-mixed diluted oil sample was directly injected into the GC system, without any chemical derivation or purification step prior to chromatographic analysis.

For HPLC–RID analysis, the controlled oil was dissolved to a final concentration of 20 mg/mL with 100 μL internal standard solution (IS-b) and 900 μL ethyl acetate. The well-mixed diluted oil sample was injected directly into the HPLC system.

GC–FID and HPLC–RID Analysis of TAG Species in Oil Samples

GC–FID analyses of TAG species in oil samples were evaluated following the AOCS official method Ce 5-86 (2009). All separations were performed with an Agilent 7890 gas chromatograp (USA) equipped with a split/splitless injector coupled to a flame ionization detector. A split injection was used with a ratio of 25:1. The samples were introduced using a robotized autosampler module and then a sample volume of 1 μL was injected into the GC system.

A capillary column coated with 65% diphenyl–35% dimethylpolysiloxane stationary phase (Restek Rtx-65TG; 30 m × 0.32 mm × 0.25 μm; maximum temperature 370 °C; Restek Corp., Bellefonte, PA, USA) was used. The GC oven temperature was programmed from 250 to 350 °C at 10 °C/min, and held for 20 min. The injection port was held at 370 °C isothermally. Helium (99.995%) was used as the carrier gas and its flow rate was 1.0 mL/min; FID, 360 °C.

HPLC–RID analyses of TAG species in oil samples were evaluated following the AOCS official method Ce 5b-89 (2009). Chromatographic analysis was carried out using a reversed-phase HPLC system (Agilent 1260, USA) equipped with a differential refraction detector coupled to an Hibar RP-18 analytical column (Merck LiChrosorb® RP-18, 3 × 150 mm, 5 μm). The column temperature was kept at 40 °C. The injection volume was kept at 10 μL.The mobile phase composed of acetonitrile and ethyl acetate (8:2) was used for isocratic separation. The flow rate was 1.8 mL/min. RID conditions were polarity, positive; peak width >0.2 min.

Data Analysis

The TAG profiles from GC–FID and HPLC were handled using Open LAB Data Station and results were calculated using Microsoft Excel 2007. The internal normalization method was used to complete the quantitative analysis, assuming that the detector response was the same for all compounds. All experiments were performed in triplicate.

The statistical analyses including PCA and MLR were developed by Matlab (Ver. 7.10 (2010a): The MATHWORKS, USA).

PCA is frequently used for the purpose of generating a reduced set of variables which account for the most of the variability in the original data. In this study, PCA created the three-dimensional score plot, which was based on the 11 TAG composition data sets collected by GC–FID, to visualize the differences of those five vegetable oils.

The main steps of the MLR with constrained linear least squares method and exhaustion method for proportions prediction of constitutive components are shown as follows.

Step1. The database for TAG composition of pure oils was prepared.

Step 2 (Input). Matrix representations of simplex mixture systems were given.

For instance, q = 3,

$$ T_{iBlended oil} = \beta_{1} T_{i1} + \beta_{2} T_{i2} + \beta_{3} T_{I3} $$
(3-1)

Some restrained conditions for solving linear program can be written as

$$ 0 \le \beta_{1} ,\beta_{2} , \ldots \ldots , \beta_{j} $$
(3-2)

where \( T_{{i{\text{Blended oil}}}} ,T_{i1} ,T_{i2} ,T_{i3} \) are the contents of TAG i in the blended oil, type 1, type 2, and type 3, respectively; and β 1, β 2, and β 3 are the proportions of type 1, type 2, and type 3 in blended oil.

Matrices representing mixture systems were analyzed under an exhaustive method. Each of the matrices was carried out by exhaustive sampling of TAG compositions which were shared in the three pure oil groups corresponding to their proportions β j . The ternary system SBO–MZO–GNO (30%, 50%, 20%) consists of β SBO = 30%, β MZO = 50%, and β GNO = 20% TAG compositions exhaustively taken from the first (SBO), second (MZO), and third (GNO) oil groups, respectively. In this paper, the database includes five groups, i.e., SBO (147 pure oils), MZO (107 pure oils), GNO (179 pure oils), SFO (65 pure oils), and RSO (116 pure oils). Therefore, N corresponding to the total number of the TAG composition combinations from its constitutive pure oils is \( N = C_{147}^{1} \times C_{107}^{1} \times C_{179}^{1} = 2815491 \).

Step 3 (Output). Calculation of constitutive component proportions.

The composition of the TAGs (11 TAGs–GC or six TAGs–HPLC) from pure oils and blends was considered as independent variable. According to the example in step 3 (Input), the whole calculation process needs to be executed 2,815,491 times. Each of the matrices/mixtures will be calculated to obtain a set of predicted results/proportions. The standard technique for performing the MLR is least squares regression. However, the results could be unreasonable. In some cases, each composition in the objective function is not absolutely independent, but is restrained by some physical and mathematical conditions. As shown in Eq. 2, Eq. 3-1, and Eq. 3-2, there are three conditions in this model: all proportions should fit in the linear regression; the proportions should be always positive; the sum of the proportions in the blended oil should equal 1. To overcome the drawback, constrained linear least squares (CLLS) is used to resolve the unreasonable fitting results when proportions of constitutive components in blended oil were predicted. The program will provide all the statistical parameters (e.g., minimum, maximum, mean, standard deviation (SD), etc.) as output. Data processing by MLR with constrained linear least squares method and exhaustion method in this study was carried out by a homemade Matlab program.

Results and Discussion

TAG Analysis by GC–FID

Compared to the other TAG analysis methods, GC has features of simple operation, stable running, and does not require a long time to start analysis. Therefore, GC has been applied as a measurement of the quality and the purity of vegetable oils in this paper. TAG analysis of these oils reveals a quite homogeneous composition dominated by C50, C52, and C54 groups which are mainly composed of palmitic (P-16:0), stearic (S-18:0), oleic (O-18:1), linoleic (L-18:2), and linolenic (Ln-18:3) acid residues, and these groups account for 95–98% of the investigated vegetable oils. TAG chromatograms of GC–FID are shown in Fig. 2: C50 group includes POP and PLP, C52 group includes POS, POO + PLS, PLO, and PLL, and C54 group includes SOO, OOO + SLS + SLO, OLO + SLL, OLL, and LLL.

Fig. 2
figure 2

GC–FID chromatograms of five kinds of vegetable oils (all kinds of oils analyzed in the study). Numbers (1–11) represent 11 typical TAGs: 1 POP, 2 PLP, 3 POS, 4 POO + PLS, 5 PLO, 6 PLL, 7 SOO, 8 OOO + SLS + SLO, 9 OLO + SLL, 10 OLL, 11 LLL

Statistical Analysis

Table 1 shows the relative TAG compositions and the minimum, maximum, and mean values for each TAG of 614 pure vegetable oils by GC–FID. The one-way ANOVA for each TAG of different kinds of oils was applied by using the SPSS software. The results of the ANOVA (the significance levels are less than 10−4, P = 10−4) indicated there was a statistically significant difference in the content of TAGs within the different kinds of oils used in this experiment.

Table 1 Eleven TAG compositions (% mass) in five kinds of vegetable oils by GC–FID and ANOVA results

Afterwards, the data set of five kinds of pure vegetable oils (614) having different origins and refining conditions was classified by PCA (Fig. 3). The PCA graphic shows the similarities and differences among the pure oils, and similar oils tend to form clusters and dissimilar oils are separated away. As shown in Table 1 and Fig. 3, except for SBO and MZO, the other kinds of pure vegetable oils, which form three clusters, are noticeably separated. The TAG compositions of SBO and MZO are very similar; hence, it can be inaccurate when predicting the proportions of SBO and MZO in their co-existence groups via the developed MLR simulation.

Fig. 3
figure 3

Principal components analysis (PCA) plot for TAGs of the overall vegetable oils

Furthermore, the current analysis condition of GC–FID leads to poor peak shapes when samples contain longer fatty acids and TAGs that have the same carbon number of long chains which cannot be separated individually. Therefore, reversed-phase high-performance liquid chromatography (RP-HPLC) is also employed for the separation of TAGs, because it operates on the principle of both chain length and degree of unsaturation of fatty acids, thus providing a better separation of individual TAG molecules, especially TAGs containing polyunsaturated fatty acids.

TAG Analysis by HPLC–RID

Chromatograms of TAGs in SBO, MZO, GNO, RSO, and SFO using HPLC are shown in Fig. 4. Peak 1 eluted at 12.13 min and only appears in SBO. Furthermore, there is a great difference in the content of peak 2 between SBO and MZO. Peak 1 and peak 2 were identified to correspond to 3-linolenoyl-1,2-dilinoleoryl-sn-glycerol (LLLn) and trilinolein (LLL) by LC–MS/MS, respectively. These MS results (protonated molecule [M + H]+ and “diacylglycerol” ions, [DG]+) were in accordance with the report published by Neff and Byrdwell in 1995 [47]. Table 2 shows the relative TAG compositions and the minimum, maximum, and mean values for each TAG of 614 pure vegetable oils by HPLC–RID. These TAGs could be new noticeable markers to determine the proportions of the constitutive components in blended oil which cannot be distinguished by the GC. Thus, all the SBO and MZO were re-analyzed by HPLC–RID to find better independent variables and to offer more accurate proportion results.

Fig. 4
figure 4

HPLC-RID chromatograms of five kinds of vegetable oils (all kinds of oils analyzed in the study). Numbers (1–6) represent six typical TAGs: 1′ LLLn, 2′ LLL, 3′ LLO, 4′ LLP, 5′ OOL, 6′ POL

Table 2 Six TAG compositions (% mass) in five kinds of vegetable oils by HPLC-RID and ANOVA results

Development of a MLR-based Method for Determination of the Proportions of the Constitutive Components in Blended Oils

Mass balance is a universal principle. TAG compositions of pure oils and a blended oil were considered as the independent variables and proportions of constitutive components were defined as the dependent variables. MLR was used to determine the relationship between multiple independent predictor variables (the TAGs composition of pure oils and a blended oil) and dependent variables (the proportions of constitutive components). The details of such simulation have been shown in Sect. “Data Analysis”.

TAG compositions of GC as independent variables include 1, POP (dipalmitoyl oleoyl glycerol); 2, PLP (dipalmitoyl linoleoyl glycerol); 3, POS (palmitoyl oleoyl stearoyl glycerol); 4, POO + PLS (dioleoyl palmitoyl glycerol + palmitoyl linoleoyl stearoyl glycerol); 5, PLO (palmitoyl linoleoyl oleoyl glycerol); 6, PLL (dilinoleoyl palmitoyl glycerol); 7, SOO (dioleoyl stearoyl glycerol); 8, OOO + SLS + SLO (triolein + distearoyl linoleoyl glycerol + stearoyl linoleoyl oleoyl glycerol); 9, OLO + SLL (dioleoyl linoleoyl glycerol) + dilinoleoyl stearoyl glycerol); 10, OLL (dilinoleoyl oleoyl glycerol); and 11, LLL (trilinolein).

In order to improve the prediction accuracy, the binary and ternary blended oils with SBO and MZO co-occurring were analyzed and processed by MLR twice, and the independent variables of MLR were from GC–FID and HPLC-RID, respectively.

TAG compositions of HPLC as independent variables include 1′, LLLn (3-linolenoyl-1,2-dilinoleoryl-sn-glycerol); 2′, LLL (trilinolein); 3′, LLO (1, 2-dilinoleoyl-3-oleoyl-rac-glycerol); 4′, LLP (dilinoleoyl palmitoyl glycerol); 5′, OOL (dioleoyl linoleoyl glycerol); and 6′, POL (palmitoyl linoleoyl oleoyl glycerol).

The calculation steps to analyze SBO and MZO co-existence in blended oils are as follows: firstly, the sum of β SBO and β MZO and the proportions of other oils are determined using MLR with the TAG variables from GC–FID. Secondly, β SBO and β MZO are further predicted using MLR with the TAG variables from HPLC–RID.

Prediction and Evaluation of the MLR Method in the Real Blended Oils of Binary and Ternary Systems

Considering the extensive sources of collected oils, exhaustive algorithm was applied to calculate the proportions of all the possible combinations. The program provided all the statistical parameters (e.g., minimum, maximum, mean, standard deviation (SD), etc.) as output. The accuracy of these values is not constant for all blended oils as it depends on the type and the proportion of the constitutive components in blended oils. The mean value was used as the parameter to calculate the relative error in the statistical analysis.

The relative error e βj of β j was calculated with the following formula:

$$ e_{\beta j} = {{\left| {\beta_{j,actual} - \beta_{j,pred} } \right|} \mathord{\left/ {\vphantom {{\left| {\beta_{j,actual} - \beta_{j,pred} } \right|} {\beta_{j,actual} \times 100\% }}} \right. \kern-0pt} {\beta_{j,actual} \times 100\% }} $$
(4)
$$ e_{\beta } = \max_{j} e_{\beta j} $$
(5)

where e β is the max j  e βj in a blended oil, also regarded as the relative error of each oil and illustrated in Fig. 5, β j,actual denotes the actual additive proportion of co-occurring components when preparing a lab-made blended oil, and β j,pred denotes the predicted additive proportion of co-occurring components in a blended oil.

Fig. 5
figure 5

Relative errors, e βj , of each kind of the five oils in related groups of the binary and the ternary system

To analyze the relative errors of each of the five oils (SBO, MZO, GNO, RSO, SFO) in related groups in the binary system and the ternary system, another approach named “pinwheel and chessboard” was applied.

For binary systems, the center of the pinwheel represents a kind of pure oil. For instance, A1 in Fig. 5 represents SBO, where the 4 axes represent the other oils (MZO, GNO, RSO, SFO), and the proportion of SBO decreases gradually while the proportions of other oils increase as the axis extends farther out from the center. The extent and tendency of the deviation can be observed from colors, lengths, and locations of the bars, i.e., the white representing e βj is less than 20%, the gray representing e βj is from 20% to 30%, and the black representing e βj is more than 30%. The formula to calculate e βj is shown as Eq. 4.

For ternary systems, the center of the chessboard represents a kind of pure oil. For instance, B1 in Fig. 5 represents SBO, where each triangle in the chessboard represents a group which contains SBO, and the other two triangle vertexes represent the other two oils. Similar to the binary system, the proportion of SBO decreases gradually while the proportions of the other oils increase as vertexes extend outward from the center. The extent and tendency of the deviation can be observed from the colors and locations of the points, same as in the binary system.

$$ P_{{N_{n} }} = N_{{e_{\beta } }} /N_{n} $$
(6)

In Eq. 6, \( N_{{e_{\beta } }} \) is the number of e β (considering only the maximum of e βj in a blended oil sample) within a given range,\( N_{n} \) is the number of samples in the sample collection n, and \( P_{{N_{n} }} \) is defined as the percentage of the relative errors within a given range in a sample collection, and is regarded as an evaluation indicator for the samples. N 1 and N 5 are for the statistical results of binary system and ternary system based on GC–FID variables; N 2N 4 and N 6N 7 are for the statistical results of binary system and ternary system based on GC–FID variables combined with HPLC variables. The results are shown in Table 3. Regarding a given range, \( P_{{N_{n} }} \) of the method with GC–FID variables and HPLC variables (\( P_{{N_{4} }} \) = 98.2% and P N7 = 84.7%, e βj  < 20%) is always larger than \( P_{{N_{n} }} \) of the method with GC–FID variables (\( P_{{N_{1} }} \) = 61.3% and \( P_{{N_{5} }} \) = 48.3%, e β  < 20%). In addition, the proposed method requires a minimum proportion of each oil, i.e., at least 10% for MZO, 15% for RSO, 20% for SBO, and 30% for GNO and SFO in a binary blend, and at least 10% for GNO, 20% for MZO, RSO, and SFO, and 30% for SBO in a ternary blend, respectively (e β  < 30%). It is interesting to note that the larger proportion βj (Eq. 2) is added in blended oils, the lower e βj (Eq. 4) is obtained in the binary/ternary system, i.e., the predicted results become more accurate in the binary/ternary system. As shown in the Fig. 5, there are almost all white dots/bars near the centers of the pinwheels and the chessboards. The proportions βj are getting smaller as they move away from the centers, and the further from the centers, the more possibilities of getting high e βj since βj is the denominator in Eq. 4. In other words, there is a negative correlation between βj and e βj (e.g., given that prediction errors are 10%, e βj is 50% when βj is 20%, while e βj is 12.5% when βj is 80%); therefore it is better to take βj into consideration when the standards were set up for e βj in the process of a practical application. Furthermore, this mathematical model can be extended to determine the proportions of the constitutive components in other kinds of blended oils.

Table 3 Statistical results of the percentage of the relative errors in different ways for the binary system and the ternary system

Conclusions

In view of the results attained within this study, it can be concluded that the proposed method of MLR with constrained linear least squares and exhaustion calculation based on the combination of TAG values from GC and HPLC analysis is a promising and reasonable way to verify the proportion of constitutive components in some blended vegetable oils. To verify the effectiveness of the experimental protocols and the models, it needs to be validated in more blended oils and in a larger database with a greater diversity and quantity of pure oils. Once approved, it can be applied for quick segregation and quality control in routine analysis to help regulate the blended oil market. Additionally, more specific and stable markers will be further studied to enhance the accuracy of the predicted proportions of constitutive components in the edible blended oils.