1 Introduction

Microalgae biofuels have attracted considerable attention as a replacement for fossil fuels due to algae’s fast and season independent growth, and relatively high lipid content (Fortier et al. 2014). In spite of these advantages, a life cycle assessment (LCA) should be performed to evaluate whether biofuel produced from algal feedstock leads to lower life-cycle impacts than fossil fuels. Numerous LCA studies of microalgae biofuels have been published covering very different processes and leading to contrasting results and conclusions. Previous reviews of microalgae biofuels LCAs focused on both qualitative analysis of methodological aspects, such as system boundaries and handling of co-products (e.g., Collet et al. 2015; Collotta et al. 2016), and comparison of impact assessment results with fossil fuels and other biofuels (e.g., Figueiredo et al. 2017; Carneiro et al. 2017). These reviews showed high heterogeneity between studies regarding both methodological choices and reported impacts, including GHG intensity results, and highlighted the difficulty in selecting preferable pathways due to high variability.

The variability in LCA results of microalgae biodiesel has been approached using harmonization techniques. Liu et al. (2012) developed a meta-model by harmonizing six studies and adopting common methodological choices in the modelling assumptions and system boundaries, so as to reduce differences in results. Tu et al. (2017) proposed a harmonized inventory data set to minimize the variation in results and applied it to assess impacts of algae biofuel production pathways in the U.S. (Tu et al. 2018). These studies required the adjustment of several system (Liu et al. 2012; Tu et al. 2017) and technical (Tu et al. 2017) features to consistently reduce variability (Tu et al. 2017) or to find a consistent scope of assessment to assess the inherent variability of different production pathways (Liu et al. 2012).

Another approach to summarize LCA results and elucidate variables of interest is the application of meta-regression analysis (MRA) techniques. MRA can be regarded as a statistically harmonization method complementary to other harmonization methods used in the LCA literature, such as the work of Tu et al. 2017 and Liu et al. (2012). MRA reduces subjectivity by using raw data with as little data manipulation as possible – it does not eliminate subjectivity in the modelling that originates the data but aims to discern statistically significant patterns from the raw data (Menten et al. 2013).

An MRA of life-cycle GHG emissions of advanced biofuels was performed by Menten et al. (2013) covering microalgae biodiesel studies published in 2009–2010 (8 studies; 69 results). The descriptive variables in the regression model included system boundaries (well-to-wheels or well-to-gate), type of co-products (e.g., glycerine, electricity, heat) and multifunctionality approach (e.g., allocation, substitution), but left out the different technologies used in the various stages of the algae biofuel pathway (except cultivation). Since then, the LCA literature on the topic has expanded, with studies covering different locations, algae species, and productivity ranges, testing different harvesting, drying, extraction and conversion technologies, and culture mediums, and assessing multiple uses for by-product valorisation. These aspects introduce new layers of variability not evaluated in the previous MRA.

This article describes and synthesizes the new range of LCA literature results, including studies published from 2009 to 2017, and builds on previous work by Figueiredo et al. (2015, 2017), by (i) performing a qualitative assessment of the literature, focusing on key aspects, including modelling choices and technologies to identify variables of interest; (ii) assessing the variability in the reported GHG intensity; and (iii) quantifying the effect of the most influential variables on the GHG intensity of microalgae biodiesel through MRA, expanding the range of variables analysed in Menten et al. (2013). The main goal is to (i) investigate whether it is possible to identify specific variables (e.g., technologies, modelling choices) that may be significantly associated with higher or lower GHG emissions per MJ of biodiesel and (ii) provide recommendations on which technologies should be avoided or promoted based on currently available LCA data.

2 Methods

2.1 Overview of microalgae biodiesel life-cycle chain

A microalgae biodiesel production chain typically includes the following life-cycle stages: microalgae cultivation, harvesting and drying, oil extraction, conversion, distribution and use (e.g. in a vehicle). Figure 1 shows the available technologies for each life-cycle stage. The diversity of technologies indicates that microalgae biodiesel production processes are at the early stage of development and the optimal technologies are not yet clearly established.

Fig. 1
figure 1

Life-cycle chain (well-to-wheels) of microalgae biofuel, including alternative technologies for each phase. * Optional step – depends on the bio oil recovery technology. PBR: Photo-bioreactor; OP: Open pond; CF: Centrifugation; CFP: Chamber filter press; DAF: Dissolved air flotation; VBD: Vacuum belt dryers; DE: Dry extraction; WE: Wet extraction; SC-CO2: Supercritical CO2 extraction; HTL: Hydrothermal liquefaction; P: Pyrolysis; Trans: Transesterification; HT: Hydrotreating; U&DT: Ultrasonication and direct transerterification; SC-CH3OH: Supercritical methanol; MBR: Microalgae biomass residues

The cultivation of microalgae can be conducted in three ways (Stephenson et al. 2010):

  1. i)

    In photo-bioreactors (PBR), which are closed systems with a productivity of cellular biomass generally higher than in open ponds, but more expensive to build and operate;

  2. ii)

    In open ponds (OP), also termed raceways, which are open systems, where a paddlewheel is used to mix the biomass. These systems can suffer from high losses by evaporation, requiring large volumes of fresh water. Furthermore, productivity can be affected by contamination with microorganisms (Stephenson et al. 2010); and

  3. iii)

    In hybrid systems, which are a combination of PBR and OP. A small volume is cultured in a closed system (Khoo et al. 2011) (PBR) where the inoculum is prepared, which is then transferred to an OP where the algae grow.

The growth media can be: (i) fresh water or marine water, depending on the algal species, with synthetic nutrients (synthetic medium); and (ii) wastewater (WW) containing the nutrients required for microalgae growth. The CO2 required during microalgae growth can be sourced from pure CO2 from e.g. an ammonia plant (e.g., Hou et al. 2011; Campbell et al. 2011; Monari et al. 2016), CO2 extracted from natural deposits (e.g., Connelly et al. 2015; Liu et al. 2013), and flue gas from an adjacent plant (e.g., Brentner et al. 2011; Orfield et al. 2015; Mu et al. 2017). The latter can be provided by direct injection or first separated into pure CO2 via monoethanolamine scrubbing and then injected into the cultivation system (Zaimes and Khanna 2014).

For harvesting, a number of technologies are available and different combinations are employed. For instance, microalgae may be first decanted, filtered, and/or flocculated and centrifuged or filtered. Depending on the bio-oil extraction technology, microalgae may need to be dried (e.g., solar drying; heat drying with different fuels). In the bio-oil extraction step, oil can be extracted from seven ways:

  1. i)

    Dry lipid extraction (DE), which is similar to the extraction of other vegetable oils (e.g., rapeseed, soybean). Microalgae biomass is previously dried and then exposed to extraction by a solvent (e.g. hexane, most common);

  2. ii)

    Wet lipid extraction (WE), in which wet microalgae biomass is exposed to extraction by a solvent (e.g. hexane);

  3. iii)

    Supercritical CO2 extraction (SC-CO2), which uses a supercritical fluid (CO2) to extract thermally sensitive lipid fractions without degradation; it has the advantages of avoiding the use of organic solvents and allowing for the selection of the relevant lipids for biodiesel production during the extraction process (Brentner et al. 2011);

  4. iv)

    Hydrothermal liquefaction (HTL), a thermochemical process in which algae is heated in a pressurized wet slurry; under these conditions, algal cells are broken down to mainly liquid components (Gollakota et al. 2018);

  5. v)

    Pyrolysis (P), a thermochemical process that decomposes biomass into liquid oil, gases and solid products, at temperatures ranging from 350 to 700 °C and in the absence of air (Goyal et al. 2008). For this process, algae need to be dry;

  6. vi)

    Ultrasonication with direct transesterification (U&DTrans), where bio-oil extraction and transesterification occur simultaneously. Methanol is added directly to dried biomass using sulphuric acid as a catalyst (Brentner et al. 2011); and

  7. vii)

    Supercritical methanol (SC-CH3OH), which combines lipid extraction and transesterification of oil from wet microalgae, requiring no catalyst (Brentner et al. 2011).

Besides oil, the extraction process also produces microalgae biomass residue (MBR), for which different management options can be applied, such as: direct land application as fertilizer; anaerobic digestion, generating biomethane that can be combusted for heat and electricity in a combined heat and power plant; and sacharification and fermentation to produce ethanol. When microalgae is cultivated in a synthetic medium, MBR can also be used in food supplements, pharmaceutical and cosmetic products, and for animal feed.

The conversion phase can be made, for example, through transesterification or hydrotreating. The process of transesterification involves the catalysed reaction of lipids with methanol, producing fatty acid methyl esters (i.e. biodiesel) and the co-product glycerine. Glycerine can be used for energy purposes, for animal feed or for pharmaceutical products, among other uses. The hydrotreating process involves an hydrogenation step, which produces synthetic hydrocarbons, followed by hydrocracking and distillation (at the end of hydrotreating, bio jet-fuel, naphtha and a small quantity of biodiesel are produced) (Cox et al. 2014). Finally, biodiesel is distributed and combusted in a vehicle.

2.2 Selection process for the literature review

An online search of articles published between 2009 and 2017 consisting of LCA studies of microalgae biodiesel was performed in Web of Knowledge. The keywords used for the literature search (in TITLE) included a combination of synonyms of the terms “life cycle assessment” (OR “LCA” OR “life cycle analysis” OR “life cycle” OR “lifecycle” OR “life-cycle”) OR “greenhouse gas emissions” (OR “GHG”), AND “algae” (OR “algal” OR “microalgae” OR “microalgal”), AND “biofuel” (OR “biodiesel” OR “bioenergy”). The search retrieved 87 results. From these, only peer-reviewed journal articles written in English were considered (−5). Comments on published articles were excluded (−4) and articles with corrigendum were accounted for as a single reference (−2). Review articles were also not included (−8), but the references cited were scanned for relevant articles not retrieved in the previous literature search. The articles included in previous reviews by Figueiredo et al. (2015, 2017) were also skimmed. Articles were selected for further analysis if they detailed information on the methodology, assumptions, data used, and the GHG balances of the system (see Tables S1 and S2 in the Online Resource 1). To allow for comparisons, the results of the reviewed studies were normalized to 1 MJ of biodiesel (measured in terms of the lower heating value). Studies were excluded mainly due to: (i) addressing other bioenergy routes than biodiesel (e.g. bio jet fuel); (ii) lack of transparency and insufficient quantitative information (e.g. the study considered other functional unit than MJ of biodiesel and it was not possible to convert the results due to lack of data), (iii) the assessment being limited to one single life-cycle stage of biodiesel production, and (iv) the study focused the analysis in other metrics than GHG emissions (e.g., energy use, water footprint). In the end, 36 articles were included in the review and provided 308 estimates of life cycle GHG emissions of microalgae biofuel.

2.3 Meta-regression model and variables

The MRA was performed using STATA® (StataCorp 2012). A General-to-Specific methodology was employed to specify the regression model, i.e. estimating a general model containing a wide range of potentially important variables and then stepwise exclude the least significant variable, one by one, until only statistically significant (p < 0.05) variables remain in a specific or final model.

The proposed model is a multiple linear regression model with the following structure:

$$ {Y}_i={\beta}_1+{\beta}_2\ {X}_{2 i}+{\beta}_3\ {X}_{3 i}+\dots +{\beta}_k\ {X}_{ki}+{\varepsilon}_i $$
(1)

where Y is the dependent variable that represents the GHG emissions associated with one MJ of produced biodiesel; Xj = (X2,…,Xk) are the independent variables (or regressors); β1 is a common factor (intercept); and βj = (β2,…, βk) are the partial regression coefficients that measure characteristics of the biodiesel case study under analysis; i = (1,…,n) is the ith observation, and ε is the error term. The model was estimated using ordinary least square (OLS) method assuming a linear relationship between the dependent and independent variables. Due to the skewness of the original data (see Fig. S1 in Online Resource 1), we log transformed both the continuous independent and depend variable data to approximately conform to normality; therefore, a log-log model was used. Inevitably, all observations containing negative GHG emissions per MJ were excluded (<6%). However, as there were few negative values in the sample, the risk that these observations would change the interpretation of the relationship (the direction of the coefficient) is small. To account for potential non-linear relationships a quadratic term was included in the robustness checks.

2.4 Selection of variables of interest

The selection of variables of interest was based on a qualitative review of the LCA studies on microalgae biodiesel previously selected, focusing on: (i) relevant aspects related with microalgae biodiesel production systems which differed between studies and may influence GHG emission results (e.g., biomass productivity, lipid content, and microalgae species – additional details for each study can be found in Table S1 in Online Resource 1); and (ii) the main methodological choices adopted in the LCA studies (e.g., geographical scope, system boundary, and multifunctionality – additional details for each study can be found in Table S2 in Online Resource 1). Studies were performed using laboratory or pilot scale data, which may limit its use to represent full commercial applications.

The microalgae species used in LCA studies were set as variables. Different species of microalgae were reported, including both marine and freshwater species. One study reported results for more than one microalgae species. The most common genus of microalgae was Chlorella spp. (31%), particularly Chlorella vulgaris (19%), followed by Nannochloropsis ssp. (22%). Scenedesmus was mentioned in two studies (5%). In addition to those, three other species of microalgae were identified (Pleurochrysis carterae, Tetraselmis, Neochloris) and 42% of the studies did not indicate the kind of microalgae considered.

Another variable included was the type of culture medium: synthetic medium (included in 81% of the studies) and wastewater (25%). Two studies (5%) compared both culture mediums.

Both lipid content and microalgae productivity show a large variation in the reviewed studies and are seen as key factors for life cycle GHG emissions of microalgae biofuels (Menten et al. 2013). Lipid content varied from 2% to 61%. The productivities varied from 2 g DM m−2 day−1 (Sills et al. 2013) to 646 g DM m−2 day−1 (Brentner et al. 2011). From the point of view of biodiesel production, the important factor is the oil productivity, i.e. the product of lipid content and microalgae productivity; therefore, oil productivity was selected as a variable of interest.

The CO2 source also differed between studies and was set as variable. We distinguish studies that use CO2 from natural or industrial origins and studies that use waste CO2 (from flue gas from adjacent plant).

Regarding the different technology pathways, all technologies found in the literature for the different biodiesel production steps (see section 2.1), namely microalgae cultivation (OP [69% of the studies], PBR [25%], hybrid [19%]), bio-oil extraction (DE [61%], WE [31%], HTL [19%], P [2.5%]) and biodiesel conversion (transesterification, hydrotreating), were set as variables.

Key LCA methodological choices were also selected as variables of interest. Three alternative system boundaries were identified and set as variables. The most common was well-to-wheels (WtW) (53%), also referred to as cradle-to-combustion, cradle-to-grave, or pond-to-wheels, and covers the entire life cycle including the combustion of the biodiesel. One third considered a well-to-gate (WtG) boundary, also denominated as cradle-to-gate, which includes emissions only up to the gate of the biodiesel conversion process. Finally, 14% of the studies considered a well-to-pump (WtP) boundary (also known as well-to-tank), which also includes the distribution of the biodiesel, but not its combustion.

The inclusion or exclusion of capital goods in the system boundary was also assessed. Most of the studies excluded capital goods from the system boundary (53%), whilst one third accounted for the infrastructures and 14% did not provide any information on that aspect.

Dealing with multifuncionality in LCA studies is a challenging and controversial issue (Malça and Freire 2006) and different choices may yield very different results. The reviewed studies used different approaches to deal with multifunctionality. One quarter considered exclusively the substitution method or system expansion, including alternative functions for the co-products (Sander and Murthy 2010; Sills et al. 2013; Zaimes and Khanna 2013; Mu et al. 2014; Soh et al. 2014; Soratana et al. 2014; Chowdhury and Freire 2015; Monari et al. 2016; Mu et al. 2017). Different substitution options were compared in 8% of the studies (Zaimes and Khanna 2013; Soratana et al. 2014; Zaimes and Khanna 2014). One quarter of the revised studies considered exclusively an allocation method based on mass, energy or economic value (Brentner et al. 2011; Hou et al. 2011; Yanfen et al. 2012; Frank et al. 2013; Collet et al. 2014; Maranduba et al. 2015; Orfield et al. 2015; Souza et al. 2015; Bello et al. 2017). A combination of system expansion and allocation was used by 14% of the studies, namely considering the anaerobic digestion of MBR to energy and/or fertilizers combined with economic or energy allocation between biodiesel and glycerine (Stephenson et al. 2010; Frank et al. 2012; Adesanya et al. 2014; Woertz et al. 2014) or biodiesel and methane or ethanol (Soratana et al. 2013). Finally, 14% of the studies compared the substitution and allocation methods (Batan et al. 2011; Passell et al. 2013; Adesanya et al. 2014; Zaimes and Khanna 2014; Yuan et al. 2015). Detailed information concerning the allocation method considered by each study is shown in Table S2 (Online Resource 1). The different ways in which multifunctionality was handled were also set as variable of interest.

Studies also differed in the alternative functions for the co-products. Microalgae biomass residue was used for: i) animal feed; ii) anaerobic digestion with combustion of methane for electricity and digested biomass for fertilizer; iii) co-generation; and iv) ethanol production (substituting ethanol from corn). Glycerine was either used by pharmaceutical industries, displacing fossil glycerine, or combusted for energy production. For the particular cases in which WW was used as culture medium, different ways of dealing with the co-production of biodiesel and biological nutrients removal (BNR) were considered: not accounting for the nutrient gains or the environmental burdens associated with the use of WW effluent prior to microalgae cultivation and the avoided burdens from using microalgae as a BNR process (Soratana et al. 2014; Woertz et al. 2014; Monari et al. 2016); substitution, assuming that the conventional BNR process in a wastewater treatment plant is replaced by the microalgae cultivation system (and hence applied credits from avoided BNR) (Mu et al. 2014); substitution, assuming avoided production of synthetic fertilizers (Chowdhury and Freire 2015); substitution of both conventional BNR process and synthetic fertilizers production (Soratana et al. 2013); and allocation of the environmental burdens from WW treatment prior to microalgae cultivation to biodiesel production (Sander and Murthy 2010).

The geographical location of the case studies was selected as a variable of interest. Location can influence insolation and the GHG intensity of background processes, such as energy used in microalgae production. Half of the reviewed studies were located in North America (USA) and one-sixth in Europe (two in the UK, two in France, one in Denmark, and one undefined countries). Remaining locations were Australia, China, and Brazil, with two studies each, and India, Israel and Singapore (all with one study). The geographical scope of three studies was undefined.

Table 1 presents all independent variables (Xj in Eq. 1) included in the meta-regression analysis, which were selected based on the identified potential factors for the higher variability associated with the GHG emissions of 1 MJ of biodiesel. In addition, the year of publication was considered as variable of interest. It is important to note that, with the exception of microalgae oil productivity and publication year, all variables are discrete, and therefore are inevitably represented in a binary manner, i.e. quantified as either 0 or 1.

Table 1 Independent variables included in the meta-regression analysis

3 Results and discussion

3.1 Reported GHG intensity of microalgae biodiesel

Figure 2 presents the GHG emissions reported in the 36 studies, which range between −700 and 3800 g CO2 eq. MJ−1 (corresponding to 308 data points, mean: 245 g CO2 eq. MJ−1). In Fig. 3, the reported results were grouped by: (a) culture medium (synthetic or wastewater); (b) CO2 source (pure CO2 or waste CO2); (c) cultivation system (OP, PBR and hybrid); (d) extraction technology (dry, wet, HTL, pyrolysis or other); (e) conversion technology (hydrotreating or transesterification); (f) location (North America, Europe, Asia, Australia and Latin America); (g) system boundary (WtW, WtG and WtP); and (h) multifunctionality approach (mass, energy, and price allocation, substitution or hybrid).

Fig. 2
figure 2

GHG intensity of microalgae biofuel reported in the reviewed studies

Fig. 3
figure 3

GHG intensity of microalgae biofuel reported in the reviewed studies grouped by: (a) culture medium (synthetic or wastewater); (b) CO2 source (pure CO2 or waste CO2); (c) cultivation system (open pound [OP], photobioreactor [PBR] and hybrid); (d) extraction technology (dry, wet, hydrothermal liquefaction [HTL], pyrolysis and other); (e) conversion technology (hydrotreating and transesterification); (f) location (North America [NA], Europe [EU], Asia [AS], Australia [AU] and Latin America [LA]); (g) system boundary (well-to-wheels [WtW], well-to-gate [WtG] and well-to-pump [WtP]); and (h) multifunctionality approach (mass, energy, and price allocation, substitution [Subs] and hybrid); n indicates the number of observations

About half of the reported biodiesel results are higher than fossil diesel GHG intensity, i.e. 94 g CO2 eq. MJ−1 (European Parliament Union and Council of the European Union 2018). The results with the highest GHG emissions (above 1000 g CO2 eq. MJ−1) were essentially associated with five studies (Stephenson et al. 2010; Mu et al. 2014; Soh et al. 2014; Soratana et al. 2014; Monari et al. 2016). These were caused by low lipid content (2%) (Soh et al. 2014) and a combination of the worst possibilities (low microalgae productivity, low lipid content, low technological efficiency, high energy consumption) (Mu et al. 2014). Results from Monari et al. (2016), and Stephenson et al. (2010) were associated with a high energy consumption in cultivation (pumps, paddlewheels), harvesting (blowers and centrifugation) and oil extraction. Soratana et al. (2014) did not discuss the high environmental impacts associated with biodiesel production, because the study was focused on assessing different pathways for co-products. Studies reporting results over 2500 g CO2 eq. MJ−1 (Soratana et al. 2014; Monari et al. 2016) all used wastewater as culture medium (Fig. 3a), PBR as cultivation system (Fig. 3c), transesterification as extraction technology (Fig. 3e) and substitution as the approach to deal with co-products (Fig. 3h).

In ten studies, negative GHG emissions were reported due to: i) using substitution or system expansion to deal with co-products (see Table S2 in Online Resource 1) (Sander and Murthy 2010; Batan et al. 2011; Adesanya et al. 2014; Mu et al. 2014; Cox et al. 2014; Chowdhury and Freire 2015; Monari et al. 2016); and ii) accounting for biogenic CO2 absorbed during photosynthesis but not when biodiesel is combusted (Pardo-Cárdenas et al. 2013; Mu et al. 2014; Bennion et al. 2015; Malik et al. 2015; Monari et al. 2016). In addition to the large variation between the different studies, many studies show very wide ranges, because different scenarios comparing technologies, co-product uses and future system changes were assessed.

Although we grouped the results of different studies according to different aspects (e.g., technologies, culture medium, multifunctionality approach), it was not possible to draw specific conclusions regarding the best pathways, because of the number of available technologies for each life-cycle phase and the number of possible combinations (19 different system combinations were evaluated, as shown in Table S3 in the Online Resource 1). A meta-regression analysis is thus presented in an attempt to extrapolate clearer conclusions.

3.2 Meta-regression analysis results

Estimated results for the microalgae biodiesel MRA are presented in Table 2. We included in the model all variables at p < 0.05 significance levels. The standard errors of the coefficient estimates are reported in parenthesis. N indicates the number of observations used to perform the regression and the strength of the regression is reported as adjusted R-square (Adj. R2). The final model after employing the General-to-Specific methodology is as follows:

$$ \ln \left({YgCO}_2 eq.{MJ}^{-1}\right)=4.459-0.550{Loc}_{EU}+1.036 CultSy{s}_{PBR}-0.470 Fee{d}_{lnOilProd}+0.785 ExtTe{c}_{Dry}+0.533 ConvTe{c}_{Tr}+0.951 WtP+0.646 Mult\_ Subst $$
(2)
Table 2 Results of meta-regression analysis of GHG emissions from microalgae biofuel (standard errors within parenthesis)

Our model suggests the following preferences for minimizing GHG emissions of microalgae biodiesel production:

  • Location: preference for EU;

  • Cultivation system: avoidance of PBRs;

  • Oil extraction technology: avoidance of dry extraction;

  • Conversion technology: avoidance of transesterification.

Despite the statistical significance of the variables (at 95%), the Adj. R2 of the model generated is not indicative of goodness of fit (48% of the variability around the mean is explained by the model), which is also shown in Fig. 4. In some instances, the model generated goes with the expectations, for example, the productivity of oil is negatively correlated with GHG emissions, indicating that a higher productivity is associated with a lower emission level, which confirms the findings of other scholars (e.g. Menten et al. 2013). PBRs appear to be associated with higher GHG emissions, as reported for instance in Menten et al. (2013) and Stephenson et al. (2010), but contrary to Tu et al. (2018) conclusions. The study presenting the highest GHG emissions per MJ of biodiesel (above 3.5 kg CO2 eq. MJ−1) uses PBR as cultivation system (Soratana et al. 2014), which might be driving the MRA results, and is not included in Tu et al. (2018) analysis – this hypothesis is tested in the robustness check (section 3.2.1).

Fig. 4
figure 4

Actual versus predicted values. The actual values refers to the dependent variable, ln(Y g CO2 eq. MJ−1) and the predicted values are based on the model specification of the dependent variable presented in Table 2

In terms of oil-extracting technologies, our model indicates that none of the technologies considered is statistically significant, with the exception of dry extraction, which is correlated positively with GHG emissions. In terms of the processing of algae oil into biodiesel, our model shows that transesterification is associated with higher emissions than hydrotreatment, in opposition to Menten et al. (2013) conclusions. Again, this result might be prompted by the high GHG emissions reported in one study (Soratana et al. 2014).

In terms of methodological choices, a mid-level system boundary (WtP) is unexplainably associated with higher emissions, as is the handling of co-production via substitution. The later suggests that dealing with multifunctionality in any way other than substitution (e.g. allocation) may underestimate emissions relative to using a system-expansion-and-substitution approach, which is the prioritized algorithm by the ISO 14044:2006 standards. Nevertheless, handling co-products via substitution often leads to net negative emissions as a result of the avoided burdens from displaced products; the use of a log-log model, which excludes negative values of GHG emissions, may overestimate the effect of substitution in microalgae biodiesel GHG emissions. Furthermore, location in the EU is associated with lower emission levels, which contradicts Menten et al. (2013) findings. The location may influence several parameters, such as insolation and the GHG intensity of energy used in microalgae production processes, but these aspects could not be captured by our model as the studies do not provide sufficient information to allow for a detailed assessment. Variables that presented no statistically significant effect include the algae species, the CO2 source, the culture medium, and the publication year.

3.2.1 Robustness checks

The data consists of several binary variables that are mutually exclusive within each category but not across categories, hence, imperfect multicollinearity is likely to exist between the independent variables. Multicollinearity produces unexpectedly large standard errors of the OLS estimate, and reduces the precision of the estimate coefficients and weakens the statistical power of the regression model. Hence, the stepwise approach of sequentially removing the least significant variables based on their associated p values might exclude the wrong variables. The presence of imperfect multicollinearity suggests it will be problematic to estimate accurately one or more of the partial effects using the data at hand (Stock and Watson 2015).

To test the robustness of the regression model an alternative approach, using the Variance Inflation Factor (VIF) as the exclusion criteria, was used in the general-to-specific procedure, as suggested by Zuur et al. (2010). The VIF estimates how much the variance of a regression coefficient is inflated due to multicollinearity in the model, hence, independent variables in the model with the highest VIF-value were excluded sequentially until no multicollinearity was detected (VIF below 2). Then, subsequent exclusion of variables by significance was performed. This procedure resulted in a different model specification (as shown in Table S4 in Online Resource 1), which underpins the concerns about the robustness of the baseline model as different significant variables were identified (ExtTec_Wet; ExtTec_HTL; Loc_NA, all with a negative contribution to GHG emissions).

An additional issue questioning the validity of our models is the low explanatory power, the R2. In an attempt to address this issue, several alternative model specifications were tested. First, a quadratic term was included to allow for curvature in the linear relationship between the dependent variable and oil productivity. The quadratic term was highly insignificant, and thus removed. Second, we included an interaction term to deal with the outliers in the data. There are a number of ways to deal with outliers, although they are often seen as data problems that must be fixed, outliers can also be of substantive interest and studied as unique phenomena that may lead to novel theoretical insights (Aguinis et al. 2013). In our case, no discernible insights could be extrapolated, except for the inexistence of clear patterns, which is an insight in itself, suggesting that LCA results are very variable between themselves and should not be compared. Since the exclusion of outliers post hoc might affect the objectivity of the results, they were not excluded. Instead, we investigated the traits among the outliers and added an interaction term explaining these common traits (see Table S4 in Online Resource 1).

The common traits are handling co-production via substitution and the use of PBRs and transesterification. There are 21 such observations. However, there is no intuitive explanation to why these three traits would yield the extreme estimate of emissions, as this group consists of observations in the upper and the lower tails of the emission distribution, ranging between −210 and 3800 g CO2 eq. MJ−1, although negative values were excluded from the MRA after the log transformation. Emissions on the upper tail of the emission distribution correspond to two studies: Monari et al. (2016) and Soratana et al. (2014). In both studies, GHG emissions are dominated by the biodiesel production process train and credits from substitution only account for less than 3% of those emissions. The high energy consumption in PBRs for CO2 pumping and nutrient mixing is reported by Monari et al. (2016) as the main reason for the high GHG emissions associated with biodiesel from microalgae. On the other hand, emissions on the lower tail of the emission distribution are associated with three studies: Batan et al. (2010) and Monari et al. (2016) when using waste CO2, whose negative results are mainly due to considering CO2 sequestration by the algae, but not biodiesel combustion; and Mu et al. (2014), which considers credits from avoided wastewater treatment, besides CO2 sequestration. Regardless of the interpretation of these common traits, the predictive power (R2) is improved when accounting for these by using an interaction term and different conclusions are supported regarding the other explanatory factors. For instance, whilst oil productivity and studies located in Europe are associated with lower GHG emissions and dry extraction with higher emissions as in our first regression model (see section 3.2), the new model (with an interaction term, see Table S4) also associates the use of CO2 from industrial or natural origin to higher GHG emissions and the use of synthetic medium to lower GHG emissions.

Notwithstanding, the application of the different methods to generate regression models did not circumvent the fact that no explainable patterns could be observed from the data and chosen variables.

4 Conclusions

We conducted a systematic literature review and meta-regression analysis of studies that report on the climate change effects of microalgae biodiesel, and reported their differences. The reviewed 36 studies altogether reported 308 data points in terms of g CO2 eq. MJ−1. The published values are widely variable, ranging from −0.7 to 3.8 kg CO2 eq. MJ−1biodiesel. Despite the high variability demonstrated, results suggest that reported GHG emissions from microalgae biodiesel are, on average, 2.6 times as high as those from its fossil counterpart, i.e. 94 g CO2 eq. MJ−1 (European Parliament Union and Council of the European Union 2018). To investigate whether significant variables and preferred technologies can be inferred from currently available LCA data, regression models were formulated considering several variables of interest describing a number of technical and modelling choices. Despite the comprehensive range of variables included in the models tested, none of the models generated could plausibly explain that the variability depends on the specific technologies considered nor on the methodological choices adopted. However, some relationships could be observed across most models. Specifically, our models indicate that location in the EU and high oil productivity are associated with lower emissions, as expected, whilst dry extraction should be avoided as it is associated with higher GHG emissions, on average.

The data collected does not follow any robust patterns, except confirm previous findings (e.g., importance of oil productivity and avoidance of energy-intensive processes such as dry extraction). While there were some studies showing large benefits, there was wide variation between the various algae technologies and it transpired that none performed consistently better than fossil diesel. The reported high levels of, and large variability in, GHG emissions question the climate-mitigation potential of microalgae biodiesel.