Interest in alternative fuels and engines which can run on biofuels and their blends with petroleum fuels has been stimulated by the need to control environmental pollution and by the pursuit of energy independence [1,2]. Accordingly, in this work we have analyzed the relationship between the content of two types of biodiesel in blends with petroleum diesel fuel and the density of these blends. The major objective of this work was to obtain linear equations relating the density of the blend to the biodiesel content and allowing us to make a preliminary estimate of the applicability of formulated biofuel recipes.

As we know, biodiesel is obtained by transesterification of oil or fat by alcohol, usually methanol, in the presence of a catalyst: sodium or potassium hydroxides, or increasingly more often, alkoxides [3,4]. Besides biodiesel, the product contains unreacted starting materials plus alcohol, catalyst, and glycerol. The glycerol is separated in the biodiesel purification stage, but nevertheless commercial biodiesel may contain glycerol in trace amounts.

Biodiesel obtained in the laboratory was analyzed to determine how its characteristics corresponded to the specifications of the standards. Table 1 gives the characteristics of the two types of biodiesel studied, the specifications of the standard EN14214 for biodiesel, and also the characteristics of the petroleum diesel fuel used in this work and the specifications of the Greek standard for this fuel. Biodiesel based on blended feedstock (synthetic biodiesel) was obtained from a mixture of 30% waste cooking oil, 5% waste household cooking oil, 5% palm oil, 5% animal fat, and 55% sunflower oil.

Table 1

The following blends were formulated using the fuel samples provided:

  • I:petroleum diesel fuel + biodiesel based on soybean oil;

  • II:petroleum diesel fuel + biodiesel based on blended feedstock (synthetic biodiesel);

  • III:petroleum diesel fuel + the two types of biodiesel in 1:1 ratio.

The densities of the blends, measured according to ASTM D1298-99 (2005) at a temperature of 15°C, are given in Table 2.

Table 2

Preliminary statistical analysis allowed us to propose three linear equations of the following general form that relate the density to the composition of the three studied blends:

$$ {Y_i}={b_0}+{b_1}X+{U_t} $$

where Y i is the dependent variable (the density); X is the independent variable (the mix, i.e., the composition of the blend); b 0, b 1 are constants; U t is the residual which we assume follows a normal distribution.

X and Y represent the attributes of the variables which they express. According to the attributes, the signal connecting the two model variables has a plus (+) sign, and the connection can be expressed by an equation of the form Y = αX + β. Obviously according to this equation, a higher value of X (i.e., higher biodiesel content in the blend) means a higher density of the blend. For blend I:

$$ Y=0.817461+0.000521X $$
(1)

For Eq. (1), the value of the Fisher statistic is F = 174.0493 and the p-value is equal to 0, which is evidence for statistical significance of this model.

For blend II, we can propose the equation:

$$ Y=0.830749+0.00031X $$
(2)

Analogously for blend III:

$$ Y=0.826332+0.00041X $$
(3)

It was established that the dependent variable Y is statistically significant in all the equations. The positive values of the coefficient of determination R 2, close to 1, are evidence for partial or complete dependence of the density of the blend on the biodiesel content in the blend. In other words, the closer R 2 is to 1, the higher the significance of the model. The coefficient of determination (R 2) for model (1) is equal to 0.95; 0.9 for model (2); 0.96 for model (3). Consequently, in these equations, the independent variable determines the dependent variable by respectively 95%, 90%, and 96%.

Table 3 gives the Student’s t-statistics and the p-values for the variables b0 and b1 for the three models obtained. The p-value for models (1) and (2) is close to 0, which confirms that they have high statistical significance.

Table 3

In order to study the applicability of the models, we carried out diagnostic tests. First we found the standardized residuals and the squared standardized residuals (Table 4). As shown by the results, the values of the statistical term of the Ljung—Box (LB) test on the standardized residuals and squared standardized residuals of model (1) are not statistically significant [6]. In lags 1-3 and 5-9, we can see serial correlation of errors, which is confirmed by the values of the Q-statistic in the LB(n) column in Table 4. Serial correlation of errors is also characteristic for the squared standardized residuals, which is evidence for heteroskedasticity. As in model (1), the values of the squared standardized residuals for models (2) and (3) are not statistically significant [6]. As for the standardized residuals, we observe serial correlation of errors.

Table 4

In order to identify the presence of heteroskedasticity, we carried out the Breusch—Godfrey serial correlation test (Table 5). We see that autocorrelation is not observed for all the models. According to the Breusch-Godfrey tests, for all the models the probability is greater than 0,763661, i.e., the probability is greater than 0.05 (5%).

Table 5

Testing of the proposed models was continued using the autoregression conditional heteroskedasticity (ARCH) test (Table 6), and the results of the Durbin—Watson test on the standardized residuals and squared standardized residuals was confirmed by the independence test. Heteroskedasticity is not observed according to the test results.

Table 6

Figure 1 shows the results of the stability studies for models (1)-(3): the trajectory followed by each variable. The graphs plot the growth in the variables with serial number. Thus we have confirmed that there is a linear relationship between the variables Y and X. As we see, the lines on the graphs have the shape of straight lines or straight lines with inflection points, which is associated with the chemical composition of the fuels.

Fig. 1
figure 1

Results of stability studies for the models: a) (1); b) (2); c) (3): density (lines 1); biodiesel content in blend (lines 2).

The calculated values of the density of the blended fuels can be used to estimate their calorific value, since the heat of combustion Q can be calculated sufficiently accurately from the equation [8]:

$$ Q=a-b{d^2} $$

where a, b are constants; d is the density at 15°C.

For blend III, we can expect high accuracy of the heat of combustion calculation, since the coefficient of determination R 2 of model (3) is the highest (0.96) among the three models obtained. For a small data file, achieving a high coefficient of determination is complicated. In other words, increasing the number of experiments and accordingly increasing the sample size results in higher accuracy of the models. The equations obtained can be used for a preliminary estimate of the applicability of formulated recipes for blended fuels.