Introduction

The transport sector is responsible for about 23% of the world’s CO2 emissions. Moreover, global demand for transport is unlikely to decrease in the short term, given the growth of 80% in emissions for some means of transport during the 1990–2012 period (IEA 2014). Taking this into consideration, the use of alternative fuels to reduce vehicle emissions has become a major topic of discussion nowadays. However, it is quite hard to predict market success for alternative fuel vehicles because of the difficulty in forecasting the outcomes of policy measures that promote alternative fuels (Janssen et al. 2006).

International experience shows that market success of alternative fuel is not trivial. There are successful cases where the demand for alternative fuel vehicles has had a consistent rise. Such as the cases of compressed natural gas in Argentina (Yeh 2007; Janssen et al. 2006); compressed natural gas in Italy (Engerer and Horn 2010; DG MOVE 2015) and biofuels in Sweden (DG MOVE 2015).

Norway has the highest market share for electric vehicles, which is 23% (IEA 2016). Critical incentives to new car users are tax purchase price exemptions, road toll exemptions, bus lane access (Bjerkan et al. 2016), and increasing availability of charging stations (Mersky et al. 2016). However, bus lane access can be controversial because this can lead to a situation that could impede bus traffic (Myklebust 2013), a phenomenon that has already happened in the main cities. The Netherlands has the second highest market share for electric vehicles (10%), with incentives on registration and circulation taxes (IEA 2016).

However, there are also cases where, despite a promising start, the demand for alternative fuel has slowed down, mainly because, these vehicles have encountered several barriers to their massification. For instance, the U.S. has experienced infrastructure development barriers (Melaina et al. 2017) because the users and media attention has skipped among various fuel vehicle technologies across the years (Melton et al. 2016). In the Chinese case, the relatively high price ratio between natural gas and gasoline is impeding further natural gas vehicle development (Wang et al. 2015). Similarly, the case of natural gas vehicles in Germany (von Rosenstiel et al. 2015) where there was a market failure for alternative fuels, despite tax incentives and investments. Nevertheless, knowing market conditions and individual preferences toward alternative fuels and their characteristics is crucial to define strategies to introduce these vehicles in the market successfully.

In the Colombian case, the gasoline-fuelled vehicles have an 81.9% market share, and the diesel-fuelled vehicles have 13.2% (however, most of the diesel vehicles are trucks for freight transport). Meanwhile, despite all of the recent efforts to promote alternative fuel vehicles with initiatives such as tax exemption and the encouraging of an electrical vehicles fleet for transport in the capital of the country, compressed natural gas only has a 4.9% market share. Across the country, electric and hybrid vehicles have not achieved a significant market share to appear in the statistics.

Improvement in competition refers to the continuous progress in engine efficiency to achieve lower emissions and the decline of conventional fuel prices over time, which makes the process of replacing traditional fuel a hard task. Also, according to Romm (2006), the six most common barriers to the widespread use of alternative fuel vehicles are a high first cost for an alternative fuel vehicle, limited range, safety and liability concerns, high fuelling costs, lack of fuel stations, and improvements in the competition. Due to evidence of changes in attitudes and perceptions after real-life experience with alternative fuel vehicles (Jensen et al. 2014), policies to promote the use of alternative fuels in the vehicle market must be well planned.

Much of the research has focused on evaluating the effect of subsidies on the purchase price, showing a substantial impact on the market share of these new technologies and a lower share for conventional fuel vehicles (Horne et al. 2005; Daziano and Bolduc 2013). Gasoline taxes to disincentive the traditional fuels have also been studied, but the results seem to indicate that carbon taxes alone would have a limited effect on people’s purchase intentions (Ewing and Sarigollu 1998), as can be seen by the slight decrease in market shares even with a high tax. The increase of recharging infrastructure is another policy analysed to overcome the adoption barriers. However, the necessary investment in the refuelling network seems to be huge in comparison to the market share augmentation (Achtnicht et al. 2012; Daziano and Bolduc 2013; Hackbarth and Madlener 2013).

Also, some scenarios have been evaluated that enhance the major drawbacks of alternative fuel vehicles: an increase in the range of electric cars and a decreased battery recharging time (Ewing and Sarigollu 1998; Mabit and Fosgerau 2011; Hackbarth and Madlener 2013; Tanaka et al. 2014), showing a remarkable increase in demand for electric vehicles. Results from analysing non-monetary incentives like free parking areas and access to HOV lanes indicate that free parking is not significant for respondents, even in limited parking areas, if the parking charges are low (Potoglou and Kanaroglou 2007). Similarly, if the trips are short, the time saved with the access to HOV lanes is not significant (Ewing and Sarigollu 1998).

One isolated policy in itself is not going to achieve a substantial change in consumers. Also, the necessary effort is enormous because market shares are not going to rise easily. Therefore, a combination of different policies may help lead to the adoption of alternative fuel vehicles, but they need to be adequately studied. Nevertheless, there are cases of conversion programs where the respondents would have adopted the alternative fuels (converting the vehicle from gas to compressed natural gas) even if they do not have incentives to do so (Saldarriaga-Isaza and Vergara 2009).

The use of hybrid choice models to evaluate policy scenarios opens a new horizon in the field of qualitative improvements that have been unexplored (Daziano and Chiew 2012). Daziano and Bolduc (2013) estimate a scenario where the environmental concern of the population increased to its mean value through a marketing campaign. However, the policy analysis and scenario evaluation using discrete choice models is an open issue. Some problems can be addressed when forecasting with hybrid choice models, such as endogeneity respect behaviour, presence of cross-sectional data, measured only at one point in time (Chorus and Kroesen 2014; Vij and Walker 2016), not knowing the factor scale when forecasting, and the establishment of an acceptable variation range for the measure evaluation.

In this research, hybrid models were used to study the preference for alternative-fuel vehicles and the impact on consumer behaviour under several policy scenarios. The paper examines the Colombian vehicle market, considering differences and similarities in preferences across cities. In the country, the vehicle ownership rate is about 83 cars per 1000 inhabitants, which is small when compared with high-income countries. However, the stock of vehicles has been growing at a 5% average per year during the last decade. The Colombian vehicle owners do not have a high purchasing power, and the vehicle cost is highly relevant in their choice, which restrains vehicle access to a specific population with higher income, which causes the purchase to be highly sensitive to the capital cost. Besides, vehicle lifespan is longer than in other countries, making the use of cleaner fuels vital, despite their higher price.

This paper aims to analyse the peculiarity of the Colombian market, comparing differences in elasticities among cities and with other countries. Also, it evaluates the impacts of transport policies, attitudes and perceptions on the choice of alternative fuel vehicles. Even though the econometric approach is not the primary focus of the article, this paper tackles the influence of a latent factor on other latent factors, which remains as a novel aspect in the topic.

The organisation of the paper is as follows: “Data” section provides information about the data used; “Model framework” section shows the model framework. The estimated results are given and analysed in “Results” section. “Policy evaluation” section presents the analysis of policies and scenario simulations. Finally, “Conclusions” section is the concluding chapter.

Data

The data was collected from a survey conducted in 2013 among vehicle owners in the five most populous Colombian cities: Bogotá, Medellin, Cali, Barranquilla and Bucaramanga. These five cities cover more than 65% of the total vehicles in the country. The recruitment process consisted of interviewing people in workplaces, universities, shopping malls, and at home. The interview places were selected to cover the socioeconomic variability across the population. Within the sites, people were randomly selected.

The instrument was applied face to face by trained interviewers to randomly selected car owners who, by having the experience of owning a car, were expected to have a better understanding of a car purchase situation, by taking into account the variables presented, not only purchase price. The absence of non-car owners is a limitation of the study. However, this research only considers car owners to avoid the possible bias that can be produced because of limited knowledge regarding actual vehicle costs by non-owners. Second, in Colombia, most of the new vehicles are acquired by current vehicle owners. Most of the people buying a car for the very first time purchase a used car. The survey consisted of the following parts:

  • Part 1 Respondent’s socioeconomic information (sex, age, the number of household members, the number of vehicles in the family, occupation, education level, field of study).

  • Part 2 Current vehicle characteristics: fuel type, capital cost, type of vehicle, model year and engine size.

  • Part 3 Stated preference (SP) survey: the respondents were asked to purchase a vehicle in ten hypothetical situations.

  • Part 4 Perceptions Questionnaire: rating of some statements related to environmental care, car dependency, and government policies towards green transport.

  • Part 5 Additional information: average monthly income and socioeconomic status of the respondent. Socioeconomic status is a classification system used in Colombia that divides households into six class levels sorted in ascending order (from lower to higher) and primarily used for tax billing purposes. The respondents were also classified into three categories according to their income: high, medium and low.

In the SP survey (part 3), different hypothetical situations were generated for three types of vehicles: Small cars (S), Midsize-Large cars (M) and SUV’s. For each individual, the SP survey applied was customised according to the type and purchase price of their current vehicle.

We performed a first pilot study with a sample size of 124 individuals to define attribute levels and test the understanding of the instrument. The estimation results of the models from the pilot test were used as priors to generate the design. Regarding the experimental design, an efficient block design, using MNL specification was coded in NGENE®. Table 1 shows the survey attributes and levels. The experimental design uses the standard gasoline-powered vehicle as a reference for the other types of fuels, with each having three attribute levels. The SP design was selected following the D-optimality criterion and assuming the MNL model, with 30 choice situations divided into three blocks.

Table 1 SP design attributes and levels (expressed as percentage deviations from base values of standard gasoline-powered vehicles)

Four technological alternatives were always presented within each choice scenario, taking into account that the diesel vehicle alternative was only presented to SUV’s owners instead of the electric vehicle alternative which was available for the rest of the car users. They are standard gasoline vehicle (G), natural gas vehicle (NGV), hybrid electric vehicle (HEV), electric vehicle (EV) for small cars (S) and midsize-large cars (M) owners, and diesel vehicle (D) for SUV’s.

The vehicle attributes considered in the experiment were based on focus groups and literature review. These attributes are:

  • Capital Cost the purchase price of the vehicle.

  • Range Number of kilometres travelled by the vehicle with a full charge of fuel.

  • Refuelling Cost Total cost of refuelling.

  • Annual Tax Public tax paid by the car owner in a year.

  • Fuel Availability Presented as a ratio of 1 to X gasoline service stations. In the models, the variable was included as a proportion (range 0–1).

A total of 1065 valid questionnaires were collected out of 1200 issued. Since each respondent answered ten choice situations, 10,650 observations were used in the model estimation. Surveys not fully finished by respondents were considered as invalid responses, representing a non-response rate of 11%.

The survey also included a questionnaire of perception indicators in which respondents rated some statements on a 1–5 point Likert scale. The purpose was to evaluate the influence of underlying factors and allow the incorporation of latent factors that might affect the decision-making process.

Initially, some pairs of statements related to government policies were presented. The task consisted of expressing the degree of acceptance of those policies taking into account that if the respondent agrees with one, they disagree with the other. The pairs of statements were those indicated in Table 2. It should be noticed that a rating of 1 means a stronger preference for the left alternative, while a score of 5 means the opposite. It is important to clarify that statements regarding pro-environmental behaviour sometimes appeared as the left alternative while some other times as the right alternative. The reason to present the statements representing pro-environmental behaviour in different locations was to avoid position bias. The treatment given to these indicators is similar to a rating survey, where the ordinal logit defines preference probabilities between policies, rather than extrapolation.

Table 2 Support government policies indicators

In addition to those indicators, the respondents were also asked to evaluate some statements about environmental concerns and their propensity towards the use of cars and technologies using a Likert scale. Through a factorial analysis, some indicators were discarded. Table 3 presents the final structure considered for modelling purposes.

Table 3 List of indicators and expected sign

Most of the participants were men (see Table 4). This gender gap is expected, as men have 74% of driving licenses in Colombia. Also, most of the respondents are working people and students, and about 80% of the interviewees have a college education or higher. Regarding income classes, almost all of the respondents belong to middle or high-income classes. The former can be easily explained because in the country where the survey was applied, vehicles are not affordable for most low-income users and the small cars are the most popular. Table 5 show similar sample distributions across cities.

Table 4 Respondents socioeconomic characteristics
Table 5 Respondents choices

Table 4 shows that small cars and SUV’s owners have a predisposition to choose conventional fuel vehicles. However, the hybrid electric vehicle achieves high acceptance rates among respondent’s choices—about the same as natural gas. Electric cars were the least chosen vehicles (Table 5). This phenomenon could be explained by the lack of knowledge about the technology, or by the presence of habit or inertia effect (Cantillo et al. 2007; Valeri and Cherchi 2016). It is interesting the case of Bucaramanga, where most respondents chose the HEV option. Also, in Table 6, the percentages above 3 of each indicator responses per city are shown. There are some differences between cities, with relatively high standard deviations; however, the tendencies are, in general, the same despite some individual cases.

Table 6 Indicator responses for each city

Model framework

The discrete choice approach based on the random utility theory was used. This theoretical framework postulates that individuals possess perfect information about the alternatives and act rationally, choosing the alternative that maximises their personal utility or satisfaction (Ortúzar and Willumsen 2011).

Different approaches have been used to estimate demand for alternative fuel vehicles. On the one hand, there are macroeconomic approaches like time series analysis (Sobh 2015) and diffusion models (See Gnann and Plötz (2015) for a comprehensive review of diffusion models). On the other hand, there are microeconomic approaches like those based on random utility maximisation theory. Within microeconomic approaches, the discrete choice models are accepted as a robust framework. Also, because this research aims to evaluate individual behaviour and choices, disaggregate demand models are more efficient than aggregate models in terms of information usage.

Several studies on the estimation of preferences for alternative fuel vehicles have been conducted in past years using discrete choice models. The majority of them analyse the effect of car features like purchase price, fuel costs and range (Beggs et al. 1981; Brownstone et al. 2000; Horne et al. 2005; Potoglou and Kanaroglou 2007; Collantes 2010; Caulfield et al. 2010; Mabit and Fosgerau 2011; Achtnicht et al. 2012; Hess et al. 2012; Daziano and Bolduc 2013; Hackbarth and Madlener 2013; Hoen and Koetse 2014; Tanaka et al. 2014; Galván et al. 2016).

Because individual preferences toward alternative fuels may not be homogeneous across the whole population and could also be affected by latent factors,Footnote 1 it is common to consider more flexible choice frameworks, such as the hybrid choice models (Ben-Akiva et al. 2002; Walker 2001). Hybrid choice models integrate unobservable variables within the traditional discrete choice framework and also accommodate preference heterogeneity. In the context of the study of alternative fuel vehicle demand, latent class models (Hackbarth and Madlener 2016; Hidrue et al. 2011) and integrated latent variable and discrete choice models have been used to analyse the effect of environmental concern (Bolduc et al. 2008; Daziano and Bolduc 2013; Jensen et al. 2013; Soto et al. 2014); safety (Daziano 2012); convenience and vehicle lease (Glerum et al. 2013); appreciation of car features (Mabit et al. 2015); social influences (Kim et al. 2016; Cherchi 2017) and habitual behaviour (Valeri and Cherchi 2016) in the decision making process when there are alternative fuel vehicles involved.

Taking advantage of having preference data from several cities in Colombia, a multinomial logit with the panel effect feature was used, allowing for scale differences among the cities. Considering disagreements over the scale parameters gives an added value for the model regarding data enrichment (Louviere et al. 2000).

We considered differences in taste preferences among individuals of different cities. Primarily, separate models for each city were estimated, following the approach by Hensher et al. (1998) when estimating models with different data sources. Then, for the pooled model, Bucaramanga was defined as the base city.

The latent variables were initially obtained through an exploratory factor analysis considering the ratings to the indicators questionnaire. Four latent variables were defined: (a) support for green transport policies (GP); (b) environmental concern (EC); (c) attitudes pro-car use (PC) and (d) attitudes pro-technology (TEC).

The latent variable model presented here consists of two parts: the structural equations, which are equations that relate socioeconomic characteristics with the latent variables; and the measurement equations, that express the relationship between the latent variables and the indicators (see Fig. 1). Table 7 shows the variables used in the estimated models.

Fig. 1
figure 1

Hybrid choice model (HCM) framework

Table 7 List of used variables

The structural equations for an individual q and latent variable l can be expressed as (1). The measurement equation for a given indicator m can be formulated according to (2), whether it refers to continuous indicators, or as with (3), if categorical indicators are used. In the last case, when the continuous latent variable lies within some defined threshold limit values (τ), then it will assume a particular value of the categorical indicator through a censorship mechanism.

$$X_{lq}^{ *} = \mathop \sum \limits_{s} \gamma_{ls} S_{sq} + \delta_{lq}$$
(1)
$$I_{mq}^{*} = \mathop \sum \limits_{l} \lambda_{lm} X_{lq}^{*} + \omega_{mq}$$
(2)
$$I_{mq} = \left\{ {\begin{array}{ll} 1 \hfill &\quad {si \,\tau_{0} < I_{mq}^{*} < \tau_{1} } \hfill \\ 2 \hfill &\quad {si \, \tau_{1} < I_{mq}^{*} < \tau_{2} } \hfill \\ {} \hfill & \ldots \hfill \\ c \hfill &\quad {si \, \tau_{c - 1} < I_{mq}^{*} < \tau_{c} } \hfill \\ \end{array} } \right.$$
(3)

where \({\mathbf{X}}_{{\mathbf{q}}}^{*}\) = Vector of latent variables for individual q; \({\mathbf{S}}_{{\mathbf{q}}}\) = Vector of socioeconomic characteristics (denoted by s); \({\mathbf{I}}_{{\mathbf{q}}}\) = Vector of attitudinal and perception indicators; λ, γ = Vectors of unknown parameters to be estimated; \(\varvec{\delta}_{\varvec{q}} ,\varvec{ \omega }_{\varvec{q}}\) = Error terms.

If F is the cumulative distribution function, the probability of observing Imq, within a discrete indicator or category c, can be written as (4):

$$Pr\left\{ {I_{mq} = c |X_{lq}^{*} } \right\} = F\left( {\tau_{c} - \mathop \sum \limits_{l} \lambda_{lm} X_{lq}^{*} } \right) - F\left( {\tau_{{\left( {c - 1} \right)}} - \mathop \sum \limits_{l} \lambda_{lm} X_{lq}^{*} } \right)$$
(4)

The error terms (\(\omega\)) were assumed to follow a logistic distribution and are independent from the set of parameters λ. The logistic distribution form was used due to its close cumulative form that reduces the difficulty on the evaluation of the function (Daly et al. 2012). In this case, the scale factor associated with the logistic function was normalized to one. Then the probability of observing an outcome category c on the indicator m for individual q can be expressed as (5):

$$Pr\left\{ {I_{mq} = c |X_{lq}^{*} } \right\} = \frac{{e^{{\left( {\tau_{c} - \mathop \sum \nolimits_{l} \lambda_{lm} X_{lq}^{*} } \right)}} }}{{1 + e^{{\left( {\tau_{c} - \mathop \sum \nolimits_{l} \lambda_{lm} X_{lq}^{*} } \right)}} }} - \frac{{e^{{\left( {\tau_{{\left( {c - 1} \right)}} - \mathop \sum \nolimits_{l} \lambda_{lm} X_{lq}^{*} } \right)}} }}{{1 + e^{{\left( {\tau_{{\left( {c - 1} \right)}} - \mathop \sum \nolimits_{l} \lambda_{lm} X_{lq}^{*} } \right)}} }}$$
(5)

If there are m indicators’ categories, only m 1 thresholds (τ) need to be estimated. The first and the last threshold were fixed to − \(\infty\) and \(\infty\), respectively.

In this paper, the discrete choice model (DCM) component was formulated using a mixed logit framework (ML), considering panel effect and different scale factors for different cities. ICLV models are already a mixture model and therefore can capture a part of the randomness due to panel effects. However, to separate effects due to multiple error terms, we decided to estimate the variance for the panel effect. The significance of the variance of the panel effect suggests that correlation among responses of the same individual does not have to be neglected. Also, it is important to clarify that we estimated models with and without considering the panel effect. The ICLV model including the panel effect resulted in being better fitted. Finally, because of the presence of 3 SP games, each one with 4 technological alternatives, there are 12 utility functions in the model. The utility functions have the following general structure:

$$U_{ijqt} = \mu_{j} \left( {ASC_{i} + \mathop \sum \limits_{k} \beta_{ik} X_{ikt} + \mathop \sum \limits_{l} \alpha_{il} X_{lq}^{*} + \varepsilon_{ijqt} + \varphi_{ijq} } \right)$$
(6)

where Xit: Is the attribute vector for observed variables k of technological alternative i, for a given choice scenario t; \({\mathbf{X}}_{{\mathbf{q}}}^{*}\): Is the latent variables vector; \(ASC_{i} :\) is the alternative specific constant; \({\varvec{\upalpha}}, {\varvec{\upbeta}}\): are vectors of parameters to be estimated; \(\varepsilon_{ijqt} :\) are the error terms for the DCM component. They are assumed to be iid Gumbel; \(\varphi_{q}\): Are the error terms associated with the panel effect; \(\mu_{j}\): Is the scale factor for city j.

For identification issues, it was necessary to set the specific constant and the coefficients for the latent variables associated with the gasoline alternative for all SP games as zero. Then the total likelihood was estimated as the product of the likelihoods of the DCM component and the latent variable model, using a simultaneous estimation code written in OX (Doornik 2007), and defined as follows:

$$L_{Hybrid} = \mathop \prod \limits_{q} \left\{ {\mathop \smallint \limits_{{{\varvec{\updelta}},{\mathbf{\varphi }}}}^{{}} \left[ {\left[ {\mathop \prod \limits_{t} \frac{{e^{{V_{iqt} }} }}{{\mathop \sum \nolimits_{r} e^{{V_{rqt} }} }}} \right]*\mathop \prod \limits_{m} \left[ {\mathop \sum \limits_{c} \left( {\frac{{e^{{\left( {\tau_{c} - \mathop \sum \nolimits_{l} \lambda_{lm} X_{lq}^{*} } \right)}} }}{{1 + e^{{\left( {\tau_{c} - \mathop \sum \nolimits_{l} \lambda_{lm} X_{lq}^{*} } \right)}} }} - \frac{{e^{{\left( {\tau_{{\left( {c - 1} \right)}} - \mathop \sum \nolimits_{l} \lambda_{lm} X_{lq}^{*} } \right)}} }}{{1 + e^{{\left( {\tau_{{\left( {c - 1} \right)}} - \mathop \sum \nolimits_{l} \lambda_{lm} X_{lq}^{*} } \right)}} }}} \right)* D_{cmq} } \right]} \right]dF_{1} \left( \delta \right)dF_{2} \left( \varphi \right)} \right\}$$
(7)

In (7), \(D_{cmq}\) is a dummy taking the value of one if the individual q rated c on the indicator m; F1(δ), F2(φ) are density probability functions assumed to distribute independent normal with mean zero and unknown covariance to be estimated. For identification purposes, the variances associated to F1 (δ) were set to one.

The relationship between indicators and latent variables were calculated through a factorial analysis using maximum likelihood estimates and then confirmed in the hybrid choice model. The measurement equations are as shown in Table 8. Each “1” represents a relationship among variables (arrows represent the same relationships in Fig. 1).

Table 8 Structure of the measurement equations

Even though four latent variables are estimated in the model, only three of them have a direct impact on the utility function. Pro-technology attitudes (TEC) are hypothesised to influence the vehicle purchase through the environmental concern (EC) and the pro-car use attitudes (PC) instead of affecting the choice directly. The fact that the TEC variable is considered to be an explanatory variable of two other latent variables using a factor on factor regression defines what we called a second-order latent variable structure. To our best knowledge, no previous studies of alternative fuel vehicle purchases have used such a framework within an integrated choice and latent variables model.

Results

We used modified Latin Hypercube Sampling (Hess et al. 2006) to generate quasi-random numbers to estimate the simulated log-likelihood of the HCM. Results for MNL and the choice component of the second and first order HCM are shown in Table 9. Table 10 presents the socioeconomic and latent variables of the choice models.

Table 9 Car choice models
Table 10 Car choice models. Socioeconomic and latent variables

When analysing the results presented in Table 9, several noteworthy inferences can be made. Regarding the socioeconomic variables included in the reduced form MNL model (Table 10), it can be inferred that the higher the income, the lower the preference for NGV vehicles. Meanwhile, at bigger engine sizes, NGV vehicles are preferred. The former result could be because of operational costs savings when compared to conventional fuels. Highly educated users prefer EV; however, when cars per household rates increase, EV’s become less attractive. HEV’s become more likely to be chosen when income increases.

All parameter signs are consistent with their economic expectations. Indeed, all the cost related variables (purchase price, annual taxes and refuelling costs) have a negative orientation and are statistically significant. Both alternative fuel availability and vehicle range have a meaningful and positive influence on the choices. All the scale parameters are significant, confirming the initial assumption about differences among cities.

Most of the latent variable coefficients of the second order model are statistically significant. The environmental concern (EC) coefficients are significant in all available technologies. The support for green transport policies (GP) parameters are significant in almost all alternatives (except natural gas), and the pro-car use attitudes (PC) are only significant in electric and diesel vehicles. In the first order model, the latent variable GP is not significant at all; the latent variable TEC only has a significant coefficient for the electric vehicles, meaning high tech individuals are more likely to purchase an electric car. Besides, in Table 11, the goodness of fit of both MIMIC models is presented. Results demonstrate a relative improvement of the indicators when the second order relationship is considered.

Table 11 Goodness of fit MIMIC models

The coefficients of variables associated with the HCM have reasonable orientations. As expected, the direction of the Environmental Concern (EC) coefficient is positive, so the people with a higher environmental conscience are more likely to buy alternative fuel vehicles. Also, GP coefficients have a positive direction, which means that the users open to policies that encourage green transport modes are more likely to buy alternative fuel vehicles.

Regarding the latent variables PC, those that achieved statistical significance have negative orientations, except the NGV’s in the first order model. The previous results could be caused by a lack of knowledge of alternative technologies (e.g. Electric Vehicles) or because of certain prejudices or misconceptions about alternative technologies. This result seems to indicate that the users of conventional fuel vehicles are more attracted to continue using traditional cars than to buy alternative fuel vehicles.

Tables 12 and 13 contain the estimated parameters for the structural and measurement model components, respectively. According to the results of the latent variable TEC, women have less interest in technology, contrasting with younger people, who show the opposite behaviour. Also, the higher the number of cars per household, the higher the tech attitudes. Regarding the latent variable GP, the ratio between the number of vehicles and the number of household members was not found to be significant. On the other hand, users with a higher education level, income and age are more willing to support sustainable transport policies. In the latent variable PC, there are no significant variables at a 95% confidence level. It seems to indicate that those users with a higher income, a higher rate of vehicles per household and bigger vehicles overall, have preferences towards car use.

Table 12 Latent variable structural model
Table 13 Latent variable measurement model

Some socioeconomic variables included in the structural model have different signs across latent variables. For instance, the elderly are the least pro-technology (TEC); meanwhile, according to the model, they are more prone to support green transport policies (GP). In the case of the variable motor, respondents with low environmental concern (EC) and high pro-car use attitudes (PC) prefer bigger engine sizes.

Although the second-order latent variable, TEC, has a clear structure, it is not significant in the pro-car attitudes (PC) variable. However, the second order relationship between the EC results is highly significant with a negative coefficient, meaning that some users are more worried about technology but less concerned about the environment. These findings are in line with Kollmuss and Agyeman (2002), who indicate that people with a strong belief in growth and technological solutions might not see the need and will be less willing to engage in the pro-environmental behaviour. On the other hand, Jansson (2009) show that, in Sweden, green consumers have more AFV’s replacement intentions and an earlier adoption rate. In Colombia, according to the survey, approximately half of the respondents would instead use proven technology than an unproven novelty, suggesting a more traditionalist behaviour.

The second order model tries to provide a better understanding of the psychology underlying the attitudes. An attitude might not have a straightforward influence on the utility, but it may have an effect through another latent variable, which indeed reinforces the complexity of the decision process. So, it seems that the second order relationship is a successful way of dealing with poorly specified structural equations, but it needs further research and more testing.

Policy evaluation

Despite the recent developments of hybrid models, few studies use them either to forecast or to evaluate policies. Some of them include the role of individual attitudes and perceptions predicting demand for electric vehicles (Glerum et al. 2013) and cycling (Maldonado-Hinarejos et al. 2014). Regarding their use for policy evaluation, changes in the forecast choice probabilities may arise from variations in the explanatory variables (usually socioeconomic, which affect the choices implicitly through the latent variables) or changes in the tangible attributes (which affect the decisions directly through the utilities) (Yánez et al. 2009). However, there is still a gap in using hybrid models for prediction. That is because, on the one hand, latent attitudes and perceptions are partly endogenous on travel behaviour, precluding a strong inference of causality and, on the other hand, such variables are measured by a single data moment in time (cross section data), precluding an inference of within-person variation.

One of the potential attractions of hybrid choice models is that they appear to offer a mean to predict the impact of policies designed to change the perceptions and attitudes of decision makers (so-called soft measures). However, a challenge is how to predict the effect of policy interventions on latent variables and associated indicators. That is, how a real-world policy (e.g., a specific advertising campaign) might change perceptions. What is easier to predict is how changes in socioeconomic variables affect perceptions (through structural equations), and through this, the utility function and then, the choice probabilities.

Several authors have highlighted the problems of using hybrid choice models (Chorus and Kroesen 2014; Vij and Walker 2016) and stated preference data (Fujii and Gärling 2003) to predict demand. Although it is a matter of discussion, the use of stated preferences to evaluate the sensibility of the demand is extensive, especially when assessing several policy scenarios that are not currently available or alternatives that are not in the market. In this particular case, given that alternative fuel vehicles have a low market share, and some technologies lack a market presence in Colombia, it makes it impossible to use revealed preference data. Therefore, the estimated hybrid choice model was used to evaluate the expected impact of several attributes in car purchase behaviour.

Elasticity analysis

In discrete choice models, the coefficients represent the marginal effect of the attribute over the utility function. This effect can be better understood with the estimation of the elasticity. Table 14 shows the direct and the cross-elasticities and Table 15 compares our estimates with other studies found in the literature. Analysis of the direct elasticities lets us infer that the capital costs have the greater impact on the choice probability. The average direct purchase price elasticity of the demand is between values of − 0.9 to − 1.4, indicating a higher sensitivity compared to other attributes. This value is lower than the − 2.05 elasticity found by Mabit and Fosgerau (2011) and values within − 1.3 to − 2.4 found by Dagsvik et al. (2002). However, it is in the same order of magnitude of − 1.02 found by Jensen et al. (2013) in Denmark and values around − 0.8 found by Bittencourt et al. (2016) in Brazil.

Table 14 Direct and cross elasticities
Table 15 Elasticity comparison

At the same time, a rise in the alternative fuel vehicle’s range does not produce a significant increase in demand. The direct elasticity for range increase in electric cars is between 0.12 and 0.25, which is lower than Mabit and Fosgerau’s (2011) documented value of 0.34. Jensen et al. (2013) find values of 0.66 and 0.44 for a single car and multiple car households, respectively. These values were even higher after the respondents had hands-on experience with electric vehicles. The 0.19 elasticity for conventional fuel vehicles is in the same order of magnitude than Jensen et al. (2013).

Fuel availability is another relevant attribute according to the elasticities, giving the results of 0.44 in gasoline and hybrid vehicles. On the other hand, the direct elasticity for natural gas and electric vehicles are 0.22 and 0.13, respectively. This could mean that if some planning policies for alternative fuel vehicles are proposed, it will be necessary to produce a significant raise (not marginal) in the fuel availability and range of the AFV to increase the demand for them to obtain a comparable result with a price subsidy policy.

However, the results of the cross elasticities show a high sensitivity of the alternative fuel vehicles capital cost over the demand for conventional fuel vehicles. In that case, a price subsidy for AFV will reduce the market share of gasoline and diesel vehicles. At the same time, an increase in the charging costs of conventional fuels and annual taxes will have a positive impact on the demand for alternative fuel vehicles.

Table 16 shows the model obtained across the five Colombian cities included in the analysis. Bucaramanga is treated as the base, and there are other city indicator dummy variables with each of the choice attribute variables. All the base estimates are statistically significant at the 5% level. Using Medellín as an example, the capital cost parameter is not significant; in consequence, the preferences for such attribute are not different between the base (Bucaramanga) and Medellín. Nevertheless, the parameters for refuelling cost and fuel availability are higher in Medellín than in Bucaramanga, while the coefficients for range and annual taxes are lower than in the base city. Different trends apply to other cities.

Table 16 Estimation results across five Colombian cities

The direct elasticities for each city are estimated (Table 17). Regarding capital cost and range, cities have similar elasticities, except for higher values in Bogotá. This case can be explained because Bogota is the largest city in the country, where the car market is very price competitive, and travel distances are usually longer than other cities.

Table 17 Cities direct elasticities

In the case of annual taxes, similar values can be found across cities, with the exception of Barranquilla and Cali that show a lower and a higher sensitivity, respectively. People living in Medellin also show the highest sensitivity for refuelling cost and fuel availability, which can be due to the city topography and high-grade roads. Barranquilla has the lowest elasticities for annual taxes, refuelling cost and fuel availability, suggesting that the car market in Barranquilla is mainly (and almost only) influenced by capital cost.

According to the results, there are some differences in sensitivity to changes in attributes among cities. However, in all cases, the highest elasticity is concerning capital cost. Although there are variations, which are explained by the marked regional differences in the country, trends in behaviour are similar. On that matter, policy transferability regarding alternative fuel vehicles could be probable for Colombian cities.

Willingness to pay

The willingness to pay (WTP) for an attribute may be estimated as the marginal rate of substitution between such alternative attribute and the price at constant utility. In other words, how much someone is willing to pay, more, or less, for the increase or decrease of an attribute, keeping the same level of utility (Daziano and Bolduc 2013). In Table 18 the variation of the attributes equal to a willingness to pay of $COP 1,000,000 (500 $USD as $1 USD = 2000 $COP at the time of the study) of capital costs.

Table 18 Willingness to pay

As a result, there is a WTP $COP 1,000,000 of additional capital cost if the vehicle provides an extra range of 35 km (14 $USD/km, 11 €/km). The previous value lies within the interval € 8.32–€ 16.82/km (Hackbarth and Madlener 2013); is lower than € 16–€ 34/km values (Jensen et al. 2013); but higher than 3.44 $USD/km (Tanaka et al. 2014).

Consumers are also willing to pay $COP 1,000,000 additional capital cost for an annual reduction of $COP 117,000 (60 $USD) in annual taxes. This value is significantly lower than € 2327–€ 4704 (Hackbarth and Madlener 2013), € 1500 (Hoen and Koetse 2014), and CAN$ 2104–CAN$ 4985 (Potoglou and Kanaroglou 2007). However, it is important to acknowledge that in those three cases, the incentive offered was the tax exemption, not the annual tax reduction.

Also, regarding the refuelling cost, the users expect a reduction of $COP 8600/charge for $COP 1,000,000 additional capital cost; which corresponds a reduction of $COP 34,400/month ($USD 17.2/month), assuming four recharges per month. These values are in line with those presented in Daziano and Bolduc (2013). If the fuel availability is increased by 3.4%, consumers will be willing to pay $COP 1,000,000 more (147 $USD/%). This value is larger than €45–€92 (Hackbarth and Madlener 2013), but in the same order of magnitude as $CAN 166 (Daziano and Bolduc 2013), and within the €73–€824 interval (Achtnicht et al. 2012).

Nevertheless, it is important to highlight that the previous estimations were calculated on an existing base of service stations, and the willingness to pay to increase fuel availability decreases as fuel availability itself increases (Hackbarth and Madlener 2016; Achtnicht et al. 2012). In this case, Hackbarth and Madlener (2016) found a WTP within the range of €35–€960 for an increase of 1% in fuel availability using a 20% base and a €7–€194 range using a 99% base. Our experiment used fuel availability base values from 12.5 to 25%, which makes reasonable the estimated WTP.

Policy simulations

Finally, to appraise possible scenarios for AFV implementation, twelve scenarios were simulated (Table 19). The main barriers addressed were the financial one, regarding the capital cost, operational costs and fuel availability; the technical barrier, concerning the driving range; and the awareness of people through campaigns that change the latent factors of each individual.

Table 19 Scenarios description

The simulated scenarios involve possible policies such as price subsidies, tax exemptions, fuel availability increases and improving driving range for AFVs. It is important to acknowledge that forecasting market share is a challenging task in the case of innovations and products with a low market share (Jensen et al. 2016). Therefore, the baseline scenario was estimated using the experimental design and does not follow the current market conditions, but it helps to evaluate the impact of the different policies simulated. The estimate was made by calculating individual probabilities and using sample enumeration to obtain the aggregated probabilities.

The first simulated scenario considered the same purchase price for all vehicles. Noticing this scenario is quite extreme, we evaluated another one for a 10% decrease in the purchase price for AFVs. Also, to account for the negative externality produced by conventional fuels, we simulated a scenario with a gasoline price raised by 10%. On the other hand, the effect of a gasoline price reduction was simulated as well, because it is a crowd-pleaser policy and it is highly demanded in the country.

One of the major concerns of the EV is the driving range. This concern is often increased when the person has the opportunity to drive an electric vehicle (Jensen et al. 2013). Therefore, some scenarios were simulated with the intention to estimate the effect of the increase of driving range in electric (50%) and hybrid vehicles (15%); and the impact of the expansion of the fuel availability for AFV (50 and 25%).

Also, as moving people towards a more advanced state-of-change might be a more cost-effective strategy for alternative fuel vehicles purchase (Langbroek et al. 2016), we simulated three scenarios to evaluate the variation of the consumer demand for AFV when the population has different values for the latent constructs. (1) The scenario in which the environmental concern is higher for the respondents. (2) The scenario in which the support for sustainable transport policies and environmental concern is higher for the respondents. (3) The scenario in which the attitudes towards car use increases, while the support for sustainable transport policies decreases. We considered a censorship mechanism on the density function of the latent factors, constraining the respondents to have certain levels of each one according to the scenario. To be precise, in scenario 10, the environmental concern was constrained in all consumers to have an EC at least equal to the mean value of the previously calculated distribution.

Indeed, those last three scenarios were estimated to evaluate the impact on choice probabilities when latent variables are higher (or lower) than actual values amongst respondents. The mechanism used to achieve those increases (or decreases) need to be assessed, taking into account the scale of the latent variable, and studying a reliable and credible change in those perceptions post-measures, to better frame the impact.

The last scenario was a combination of specific simulated scenarios, which were thought to be the likeliest to encourage the use of alternative fuel vehicles around the country. The outcomes of the simulation scenarios are reported in Table 20 and Fig. 2.

Table 20 Choice probabilities across policy scenarios
Fig. 2
figure 2

Simulated scenarios

The results show that, at equal prices, a significant increase occurs for the alternative fuel vehicles. At the same time, an increase in gasoline taxes, fuel availability and operational range have a positive impact on the demand as well, especially on hybrid and electric vehicles. It is interesting to note that in the policy-mix scenario 13, the reduction of 10% in the gasoline price (5) counteracts the effect of the other policies favouring alternative fuel vehicles (2, 6 and 8).

The analysis results indicate that if users had higher environmental awareness (latent variable EC), the demand for alternative fuel vehicles would be higher. In the case of users supporting the implementation of sustainable transport measures (latent variable GP, and indirectly EC), the demand for alternative fuel vehicles would be higher. Otherwise, if users are more attached to the car and support pro-vehicle policies (Increase in latent variable PC and a decrease in latent variable GP), this would lead to an increase in demand for conventional fuel vehicles.

Those policies that combine several interventions using more realistic thresholds achieve a higher increase in the demand for CNG vehicles over that of the EV and HEV vehicles. In other words, under more realistic scenarios (according to the Colombian context), natural gas becomes an important option for alternative fuel vehicles. Perhaps it is the best mid-way alternative to promote the transition to environmentally friendly cars.

Conclusions

This investigation estimated hybrid choice model to study the demand behaviour for alternative fuel vehicles in Colombia. Results show that the inclusion of latent variables in the estimation of discrete choice models could be a more appropriate representation of the consumer behaviour for new vehicle purchases when alternative fuel vehicles are available. The study also shows the relative convenience of including second-order latent variables in the econometric model, as it allows to conclude that some users of alternative technologies could be more excited about technology but less worried about the environment.

A strategy to encourage the use of vehicles powered by alternative energy sources must identify niche markets, considering that there are different segments. Therefore, specific strategies are needed for each segment. The analysis shows that two elements of segmentation of potential consumers could be the attitude of users towards sustainability and their car dependence level.

The estimated models suggest that for the Colombian car user, environmental concern and the support for sustainable transport policies have a positive influence on alternative fuel vehicle acceptability and purchase decision. On the other hand, the users who reveal a propensity for car use prefer well-known fuelling technologies as they are more prone to purchase conventional fuel vehicles.

According to the models, in Colombia, the purchase price along with the refuelling costs and taxes are the most relevant factors in vehicle purchase decision making. For this reason, to increment market share of alternative fuel vehicles, several incentive policies have to be launched, especially subsidies and tax exemptions. New NGV, hybrid and electric cars in the country have been exempt from the sales tax since 2012, in contrast to the 16% tax applied to gasoline and diesel vehicles. Despite this tax benefit, the purchase price for cars with alternative energy is still much higher than for conventional cars. In specific, NGV is exempt from the surtax (20%) that applies to gasoline and diesel. This former incentive has had a significant impact on the NGV level of penetration, particularly in vehicles with high consumption engines.

As it can be observed in the policy scenarios, in the Colombian context, it is necessary to consider more aggressive incentives to make the use of alternative energies more attractive and encourage their use. As successful cases, several policies must be implemented, but they depend on the stakeholder. Also, advertising campaigns have to approach consumers in different ways, considering their socioeconomic characteristics and attitudes. For that matter, the use of policies that promote tariff exemptions or subsidies of capital costs increases the tendency to choose the alternative fuel vehicles and it seems that the most appropriate fuel technology to begin the introduction of alternative fuel vehicles in Colombia is the natural gas, which could be used as a stepping stone to fuel transition in the country.

Another important subject that must be addressed is consumer awareness and acceptability of these new technologies; including environmental awareness, green policies acceptability and less dependence on car use. In that case, there is a need to evaluate measures which could change minds and consumer perceptions, and they should be used to encourage AFV use. For instance, if the governmental policies promote environmental awareness and sustainable transport, people are more likely to choose alternative fuel vehicles. On the contrary, if the policies and measures support car use, the conventional fuel vehicles will remain the market leaders with an increasing market share.

Further research should evaluate the behaviour of new car buyers. This study did not take them into account, in order to diminish the possible bias that can be produced because of insufficient knowledge regarding actual vehicle costs by non-owners. Another suggestion is the inclusion of indicators regarding how familiar are respondents with existing and emerging fuel technologies. The addition of such questions could help to isolate the possible presence of habit or inertia effect.

Another contribution of the paper is the assessment of the preferences in different Colombian cities for the analysed context. The analysis shows some differences in sensitivity to changes in attributes among cities. Although there are variations, which are explained by the marked regional disparities across the country, trends in behaviour are similar. In consequence, transferability of policies to encourage alternative fuel vehicles may be applied in various cases. More research is needed to evaluate if those similarities are also maintained at other geographic scales; maybe between countries or regions, extending the range of application of the results.