Introduction

Biodiesel is an alternative fuel for petroleum fuel that can be commercially produced through transesterification of vegetable oils or animal fats and methanol or ethanol with an alkaline or acidic catalyst [1, 2]. Biodiesel is a mixture of fatty esters with each ester component contributing to the properties of the fuel. The properties of biodiesel are decided by the structure of the fatty acid methyl esters (FAME) that constitute biodiesel [35].

The main properties of biodiesel fuels include density, viscosity, flash point (FP) and high heating value (HHV). Generally, the properties of biodiesel are measured by standard method such as ASTM [68]. The values from standard test are accurate and acceptable. However some methods of testing always incur time and material resources.

For these reasons, a Quantitative Structure–Property Relationship (QSPR) model can be considered as a useful means of relating the properties of biodiesels with their molecular structure. This method has been successfully used for some organic substances [9, 10]. However, reported QSPR methods by different researchers can only be used for an individual component but not for the mixture. It cannot reflect the difference in physical properties among several mixtures which have same components but different mass fractions.

To use the QSPR method in predicting the properties of biodiesel fuels successfully, a topological index based on the distance matrix and adjacent matrix of the molecular structure was developed in our laboratory [11]. However, the molecular structure parameters were not introduced fully, so the calculated results were not sufficiently accurate and only the viscosity of biodiesel fuels was calculated and predicted.

Based on these considerations, the purpose of this work is to attempt to establish a novel topological index which will include more information on the molecular structure such as molar mass, bond length and bond energy of FAME. Then by introducing the modified Grunberg–Nissan or Hind equation to get the topological index for FAME mixture [12, 13], we may obtain the mathematical relationships between the topological index of FAME mixture with the properties of biodiesel, such as density, viscosity, flash point (FP) and high heating value (HHV).

Using this method, there is a lower requirement for expensive measurement of the properties of the FAME mixture and the feedstock can be appropriately selected to meet the specifications necessary for biodiesel production.

Model for Mixture Topological Index

Foundation of the Individual Component Topological Index

The Quantitative Structure–Property Relationship (QSPR) models are widely used nowadays to perform property estimation. A QSPR model consists of a correlation between a property of substance and a topological index derived from molecular structure of the substance.

The relationship between topological index and molecular structure may be developed as follows: firstly, the molecular structure graph is described according to the molecular structural formula by using a connected graph, then, the topological structure is obtained by translating the molecular structure graph into matrix. Finally, the molecular matrix diagram is numerically represented. In the mean time, to connect the molecular structural graph with physical properties, it is necessary to translate the matrix into a dimensionless factor to ultimately develop the relationship.

Traditionally, the molecular structure graph was hydrogen-depleted graphs which the hydrogen atom was ignored, based on this consideration, the topological index is expressed in Eq. (1):

$$ Xu = \sqrt n \lg \left({{\sum\limits_{i}^{n} {v_{i} s_{i}^{2} } } \mathord{\left/ {\vphantom {{\sum\limits_{i}^{n} {v_{i} s_{i}^{2} } } {\sum\limits_{i}^{n} {v_{i} s_{i} )} }}} \right. \kern-\nulldelimiterspace} {\sum\limits_{i}^{n} {v_{i} s_{i} } }}\right) $$
(1)

where n is the vertex number of the molecular structure graph, s i is distance matrix, v i is the vertex value which is not consider the number of hydrogen [11].

The topological index Xu consists of a number of carbon and oxygen atoms in a molecule, the vertex value and the bond length. Nevertheless there are some disadvantages in this topological index Xu.

  1. (1)

    The parameter of Xu is not sufficiently comprehensive.

  2. (2)

    It is difficult to distinguish when the molecular structure is similar but the atoms are different.

  3. (3)

    The topological index Xu is not available during the chemical reaction since it can not reflect the bond’s cleavage.

To overcome these disadvantages for the topological index Xu, the bond length (C–C, C=C, C–O and C=O), bond energy and molar mass will be taken as the basis to develop a novel topological index in this work.

Firstly, the molar mass M is directly proportional with to the physical properties of FAME. It is an indispensable parameter of the molecule. So the molar mass M is introduced into the topological index expression.

Secondly, the vertex degree v i is corrected:

The number of hydrogen atoms connected to each carbon atom or heteroatom is different, this fact was taken into account in the corrected vertex degree v i in our work [14].

  1. (1)

    The point valence adjustment of the hetero-atom is δ v  = (z v − h)/(z − z v − 1). Here z is the total number of extranuclear electrons, z v is the number of valence electrons and h is the number of hydrogen atoms linking with the hetero-atom [15].

  2. (2)

    With relation to the unsaturated bonds, a double bond is treated as two single bonds and a triple bond as three single bonds. The point valence after adjustment is δ v  = z v − h.

Thirdly, the relative intensity of the bond is modified. By substituting the topological distance with relative bond distance defined as the ratio of topological distance with corresponding C–C distance when the latter is set to be 1.0. The ratio is defined as f ij . For example, the real length of the C–C bond is 0.154 nm and the C–O bond is 0.134 nm in esters, so the relative bond distance of the C–O bond in esters is 0.134/0.154 = 0.8701. By substituting the topological bond energy with relative bond energy defined as the ratio of topological energy with the corresponding C–C bond energy when the latter is set to be 1.0. The ratio is defined as e ij . For example, the real bond energy of C–C bond is 345.6 KJ/mol and that of the C–O bond is 357.9 KJ/mol in esters, so the relative bond energy of the C–O bond in esters is 357.9/345.6 = 1.035. If we consider that the bond energy is smaller when the bond length is longer, then the relative bond intensity r ij is defined as the ratio of relative bond energy e ij and relative bond length f ij .

$$ \left\{ \begin{aligned} r_{ij} &= e_{ij} /f_{ij} \,\,(i \ne j) \\ r_{ij} &= 0 (i = j) \\ \end{aligned} \right. $$
(2)

The r ij is the sum of the relative bond intensity for all the bonds from the vertex i to j. Using this modification, the molecular relative bond intensity matrix R is found, then, the column sum of matrix R is calculated and a new matrix R i is obtained.

The bond parameters of some ordinary bonds are tabulated in Table 1.

Table 1 Bond parameter of some ordinary bonds

Combined, the M, V, R and a novel topological index A is expressed in Eq. (3):

$$ A = M\ln \left({{\sum\limits_{i}^{n} {v_{i} R_{i}^{2} } } \mathord{\left/ {\vphantom {{\sum\limits_{i}^{n} {v_{i} R_{i}^{2} } } {\sum\limits_{i}^{n} {v_{i} R_{i} )} }}} \right. \kern-\nulldelimiterspace} {\sum\limits_{i}^{n} {v_{i} R_{i} } }}\right) $$
(3)

The M, V, R contain an abundance of molecular structure information, so a new topological index A can be assumed to reflect the molecular structure more accurately and obtain a better correlation results with the properties of FAME.

Foundation of the Mixture Topological Index to Correlate Properties

The Grunberg–Nissan equation is expressed in Eq. (4), and the Hind equation is expressed in Eq. (5):

$$ \ln \eta_{\text{m}} = \sum\limits_{i}^{n} {x_{i} } \ln \eta_{i} + \sum\limits_{i}^{n} {\sum\limits_{j\, > \,i}^{n} {x_{i} x_{j} S_{ij} } } + \sum\limits_{i}^{n} {\sum\limits_{j\, > \,i}^{n} {\sum\limits_{k\, > \,j}^{n} {x_{i} x_{j} x_{k} S_{ijk} } } } $$
(4)
$$ \eta_{\text{m}} = \sum\limits_{i}^{n} {x_{i}^{2} } \eta_{i} + 2\left( {\sum\limits_{i}^{n} {\sum\limits_{j\, > \,i}^{n} {x_{i} x_{j} S_{ij} } } + \sum\limits_{i}^{n} {\sum\limits_{j\, > \,i}^{n} {\sum\limits_{k\, > \,j}^{n} {x_{i} x_{j} x_{k} S_{ijk} } } } } \right) $$
(5)

where, η m is the mean value of the mixture’s properties, η i the value of the individual component’s properties, x i , x j and x k the mole fraction of i, j and k component, and S ij the binary interaction parameter, S ijk the ternary interaction parameter, n the number of components.

Biodiesel is the mixture of FAME in which the chemical structures are similar (aliphatic chains) and it may be regarded as a non-ideal solution due to their similarity. Therefore, the Grunberg–Nissan equation or Hind equation can be modified to simplify computation. Because the individual component topological index can reflect the structural information of the fatty acid methyl esters component, Eqs. (4) or (5) are applied to obtain the relationship of topological index between individual components and mixtures. Some simplification and assumptions may be made here.

  1. (1)

    Since the biodiesel can be regarded as a near-ideal solution, the binary and ternary interaction parameters in Eqs. (4) and (5) would be small and thus can be neglected, as in other works [16].

  2. (2)

    Since the mass fraction is commonly used in current literatures, the mole fraction is replaced by the mass fraction in calculating properties of biodiesel.

  3. (3)

    The properties η i of i component was replaced by its topological index value A i .

With these modifications, Eqs. (4) or (5) were modified as Eqs. (6) or (7) respectively and used to calculate the topological index of the FAME mixture.

$$ \ln A_{\text{m}}^{\text{G}} = \sum\limits_{i\, = \,1}^{n} {x_{i} \ln A_{i} } $$
(6)
$$ A_{\text{m}}^{\text{H}} = \sum\limits_{i\, = \,1}^{n} {x_{i}^{2} A_{i} } $$
(7)

where, A m is the mixture topological index value, A i the topological index value of the individual i component, and x i the mass fraction of the individual i component. Superscripts G and H represent the method of Grunberg–Nissan and the Hind, respectively.

The correlation process of the mixture topological index values with the properties of biodiesels can be realized as follows. Firstly, the individual topological index value (A i ) of FAME was calculated according to Eq. (3). Secondly, combining the individual topological index value with the mass fraction (Z i ) for each kind of FAME in biodiesel, the mixture topological index values (A m) of biodiesels were calculated according to Eqs. (6) or (7), respectively. Thirdly, two quantitative regression equations were obtained from the mixture topological index values and properties of biodiesels. Lastly, the coefficients of regression (R c) of these two regression equations were compared and the better solution was chosen to illustrate the correlation of the mixture topological index values and properties of biodiesels.

Results and Discussion

The Calculation Process of Topological Index Value

It is known from Eq. (6), that the mixture topological index values of biodiesels are decided by the topological index value of individual components (FAME) and their mass percentage in the biodiesels. The fatty acid methyl esters components and their mass percentage for eleven biodiesels are tabulated in Table 2 [16].

Table 2 Fatty acid methyl esters and their mass percentage of 11 types of biodiesels provided by Allen et al

CH3(CH2)6COOCH3 (Caprylic methyl ester, CME) is a common component of different biodiesel fuels, and its structure diagram is shown in Fig. 1.

Fig. 1
figure 1

The structure diagram of CME

The molar mass of CME is 0.158 kg/mol. Matrix R, R i and the vertex degree matrix v i of CME are obtained as follows.

Firstly, taking r 15 as an example to illustrate the calculation process for matrix element r ij of relative bond intensity matrix R:

$$ r_{15} = 2{\text{ Relative bond intensity}}\left( {{\text{C--O}}} \right) + 2{\text{ Relative bond}}\,\,{\text{intensity}}\left( {{\text{C--C}}} \right) = 4.379 $$

These relative bond intensities can be seen in Table 1.

Similarly, other matrix elements r ij can be obtained, and the relative bond intensity matrix R of CME is shown as follows.

$$ R = \left[ {\begin{array}{*{20}c} 0 & {1.1895} & {2.379} & {3.379} & {4.379} & {5.379} & {6.379} & {7.379} & {8.379} & {9.379} & {5.0924} \\ {1.1895} & 0 & {1.1895} & {2.1895} & {3.1895} & {4.1895} & {5.1895} & {6.1895} & {7.1895} & {8.1895} & {3.9029} \\ {2.379} & {1.1895} & 0 & 1 & 2 & 3 & 4 & 5 & 6 & 7 & {2.7134} \\ {3.379} & {2.1895} & 1 & 0 & 1 & 2 & 3 & 4 & 5 & 6 & {3.7134} \\ {4.379} & {3.1895} & 2 & 1 & 0 & 1 & 2 & 3 & 4 & 5 & {4.7134} \\ {5.379} & {4.1895} & 3 & 2 & 1 & 0 & 1 & 2 & 3 & 4 & {5.7134} \\ {6.379} & {5.1895} & 4 & 3 & 2 & 1 & 0 & 1 & 2 & 3 & {6.7134} \\ {7.379} & {6.1895} & 5 & 4 & 3 & 2 & 1 & 0 & 1 & 2 & {7.7134} \\ {8.379} & {7.1895} & 6 & 5 & 4 & 3 & 2 & 1 & 0 & 1 & {8.7134} \\ {9.379} & {8.1895} & 7 & 6 & 5 & 4 & 3 & 2 & 1 & 0 & {9.7134} \\ {5.0924} & {3.9029} & {2.7134} & {3.7134} & {4.7134} & {5.7134} & {6.7134} & {7.7134} & {8.7134} & {9.7134} & 0 \\ \end{array} } \right] $$

Next, the matrix R i is obtained by the summation of each column of matrix R.

$$ R_{i} = \left[ {53.3139\ 42.6084\ 34.2819\ 31.2819\ 30.2819\ 31.2819\ 34.2819\ 39.2819\ 46.2819\ 55.2819\ 58.2819} \right] $$

Finally, taking v 3 (double bond), v 11 (hetero-atom) as examples, the vertex degree matrix v i of CME is calculated as follows:

$$ v_{3} = (z^{\text{v}} - h) = 4\quad (z^{\text{v}} \, = \,4,\,\,h\, = \,0) $$
$$ v_{11} = (z^{\text{v}} - h)/(z - z^{\text{v}} - 1) = 6\quad (z^{\text{v}} = 6,\,\,h = 0,\,\,z = 8) $$

The vertex degree matrix v i is obtained as follows:

$$ v_{i} = \left[ {\begin{array}{*{20}c} 1 & 6 & 4 & 2 & 2 & 2 & 2 & 2 & 2 & 1 & 6 \\ \end{array} } \right] $$

The topological index value of CME is calculated according to Eq. (3), and the topological index value of CME is 0.2203.

The individual topological index value A i of FAME is calculated and the process is similar. According to Eqs. (6) or (7), combined the A i with mass percentage of FAME in biodiesel, the mixture topological index value A m of biodiesel is calculated. For example, the mixture topological index value of Soybean oil is 0.3562 using Eq. (6) and 0.1300 using Eq. (7).

The mixture topological index values of biodiesels were calculated based on Eqs. (6) and (7), respectively, correlated with each property such as density, viscosity, flash point, cloud point, and higher heating value of biodiesel, respectively, so that the correlated precision can be compared by two regression coefficients (R c).

Density

Density (D) is an important property of biodiesel, where it is the weight of a unit volume of fluid. The density of biodiesels was discussed in more recent literature [17].

The densities and topological index values of six types of biodiesels are tabulated in Table 3.

Table 3 Density and topological index of six types of biodiesels

The correlation between the density (D) and the mixture topological index value (A m) of biodiesel is expressed in regression Eqs. (8) and (9) respectively:

$$ D^{\text{G}} = 981.3291 - 327.7233A_{\text{m}}^{\text{G}}\quad R{\text{c = 0}} . 9 8 3 4 $$
(8)
$$ D^{\text{H}} = 858.0202 - 18.7781A_{\text{m}}^{\text{H}}\quad R{\text{c = 0}} . 0 9 7 2 $$
(9)

It can be seen that these two equations have a large difference in the coefficient of regression. The Hind equation shows a lower correlated precision.

Figure 2 shows the correlation results between the topological index values and densities of biodiesels. The densities of biodiesels decrease with increasing mixture topological index values.

Fig. 2
figure 2

Correlation between A Gm and density for biodiesels

The properties of FAME are decided by their molecular structures. The topological index A is established based on the molecular structure of FAME so it can indicate the properties of FAME and biodiesels. The topological index values of individual components increase with increasing chain length (M) and saturation (R i ) from Eq. (3). The same rule is also applicable to the mixture topological index. Although the densities of individual components decrease with increasing chain length and increase with decreasing saturation. However, the decreasing effect of saturation is much less than the decreasing effect of chain length. Thus, the densities have smaller values when the biodiesels contain longer chain components. Accordingly, the densities of biodiesels decrease with an increase in mixture topological index values.

Viscosity

Viscosity is another important property of biodiesel since it affects the fluidity of the fuel and the operation of fuel injection equipment. High viscosity leads to poorer atomization of the fuel spray and less accurate operation of fuel injectors. The viscosity values of vegetable oil methyl esters are between 2.8 and 5.2 Cts and greatly decrease after transesterification process [8].

The viscosities [17] and topological index values of seven types of biodiesels are tabulated in Table 4.

Table 4 Viscosity and topological index of seven types of biodiesels

The correlation between the viscosity (V) and the mixture topological index values (A m) of biodiesel is expressed in regression Eqs. (10) and (11) respectively:

$$ V^{\text{G}} = - 3.0781 + 20.1877A_{\text{m}}^{\text{G}}\quad R{\text{c = 0}} . 9 8 7 5 $$
(10)
$$ V^{\text{H}} = 4.5752 - 1.6478A_{\text{m}}^{\text{H}}\quad R{\text{c = 0}} . 1 5 6 7 $$
(11)

It can be seen that these two equations have a large difference in the coefficient of regression. The Hind equation shows a lower correlated precision.

Figure 3 shows the correlation between the topological index values and viscosities.

Fig. 3
figure 3

Correlation between A Gm and viscosity for biodiesels

The viscosities of individual components increase with increasing chain length and decrease with decreasing saturation. However, the decreasing effect of saturation is much less than the increasing effect of chain length. Thus the viscosities have larger values when the biodiesels contain more long chain components. Accordingly, the viscosities of biodiesels increase with an increase in the value of the mixture topological index values.

Flash Point

The flash point (FP) is the lowest temperature at which a liquid produces flammable vapor near its surface that will ignite when brought in contact with air and a spark or a flame.

Flash point is one of the most important and widely used flammability characteristics of liquids and low melting substances. It provides a simple, convenient index of the flammability and combustibility of substances since it provides the knowledge needed for the handling and transporting of the liquid in bulk quantities [18].

The flash points [17] and topological index values of five types of biodiesels are tabulated in Table 5.

Table 5 Flash point and topological index values of five types of biodiesels

The correlation between the flash point and the mixture topological index values (A m) of biodiesel is expressed in regression Eqs. (12) and (13) respectively:

$$ {\text{FP}}^{\text{G}} = 263.5402 + 494.2471A_{\text{m}}^{\text{G}}\quad R{\text{c = 0}} . 9 9 1 6 $$
(12)
$$ {\text{FP}}^{\text{H}} = 445.5889 - 26.3687A_{\text{m}}^{\text{H}}\quad R{\text{c = 0}} . 1 5 8 4 $$
(13)

It can be seen that these two equations have a large difference in the coefficient of regression. The Hind equation shows the lower correlated precision.

Figure 4 shows the correlation between the topological index values and flash points.

Fig. 4
figure 4

Correlation between A Gm and flash point for biodiesels

The values of flash points of individual components increase with increasing chain length and decrease with decreasing saturation. However, the decreasing effect of saturation is much less than the increasing effect of chain length. Thus the flash points have larger values when the biodiesels contain longer chain components. Accordingly, the values of flash points of biodiesels increase with increasing topological index values of the mixture.

Higher Heating Value

The higher heating value (HHV) is an important property defining the energy content and thereby efficiency of fuels. The HHV is obtained by an oxygen-bomb calorimeter method as the latent heat of moisture in the combustion products is recovered. The HHVs of biomass-derived fuels have been estimated using their proximate and ultimate analysis data [19].

The higher heating values [17] and topological index values of six types of biodiesels are tabulated in Table 6.

Table 6 Higher heating value (HHV) and topological index of six types of biodiesels

The correlation between the higher heating value and the mixture topological index values (A m) of biodiesel is expressed in regression Eqs. (14) and (15) respectively:

$$ {\text{HHV}}^{\text{G}} = 38.3080 + 8.4274A_{\text{m}}^{\text{G}}\quad R{\text{c = 0}} . 9 4 5 5 $$
(14)
$$ {\text{HHV}}^{\text{H}} = 41.4086 + 0.4389A_{\text{m}}^{\text{H}}\quad R{\text{c = 0}} . 1 6 8 0 $$
(15)

It can be seen that these two equations have a large difference in the coefficient of regression. The Hind equation shows a lower correlated precision.

Figure 5 shows the correlation between the topological index and higher heating values.

Fig. 5
figure 5

Correlation between A Gm and HHV for biodiesels

The higher heating values of individual components increase with increasing chain length and decrease with decreasing saturation. However, the decreasing effect saturation is much less-than the increasing effect of chain length. Thus the higher heating values have larger values when the biodiesels contain longer chain components. Accordingly, the higher heating values of biodiesels increase with increasing mixture topological index values.

Average Relative Errors

The average relative errors (ARE) were calculated according to Eq. (16):

$$ ARE\,(\% ) = \frac{{\sum {\left| {(y_{\text{E}} - y_{\text{C}} )/y_{\text{E}} } \right|} }}{n} $$
(16)

here, y E and y C are experimental values reported in the literature and calculated values in this work, respectively.

The ARE of density, viscosity, flash point (FP) and high heating value (HHV) are presented in Table 7. It can be seen from this table, that values for all ARE is less than 2%. From the reported literature, the maximum allowable error for a prediction equation to estimate the viscosities of the ester mixtures was set at 5% [16]. So, the average relative error of less than 2% is acceptable.

Table 7 The average relative errors (ARE)

Conclusion

Combining the novel topological index which contains more molecular structure information such as the molar mass, bond length and bond energy with the modified Grunberg–Nissan or Hind equations, the topological index values of biodiesel (mixture of FAME) were calculated then related with the density, viscosity, flash point and higher heating value of biodiesel, by using the QSPR model. The correlated results show that the Grunberg–Nissan equation more closely agrees with the properties of biodiesel than the Hind equation.

Due to the different effects of chain length and saturation toward the properties of FAME, the topological index values and the properties present different regression relations. It can be seen that the density of the biodiesel decrease with increase in topological index value. The viscosity, flash point and higher heating value of the biodiesel increase with increase in the topological index value.