1 Introduction

Distance is often treated as a well-behaved, continuous variable in economic models describing human behavior. However, human behavior and perceptions of the landscape do not conform to a continuum. Landscapes are discontinuous entities marked by barriers (Tilley 1994; Appleton 1996; Coeterier 1996). There are natural barriers such as rivers, mountains, lakes, and forests, as well as manmade barriers such as roads, railroads, fences, and buildings which essentially slice the landscape into sections. Examples of such sections which are meaningful to people are, e.g., an island, a town, a suburb, a neighborhood, a valley, a beach, a forest. The sections and boundaries provide both restrictions and pathways through and into the landscape. People may experience these sections as changing landscapes and the boundaries may represent physical barriers that increase the costs of traveling (Weng and Lu 2009).

From a mathematical point of view, landscape barriers and sections can be seen as frictions across the landscape. Failing to understand or recognize these different frictions, we are unlikely to understand the spatial contexts of human behavior and the values provided by the landscapes. When the relation between people’s Willingness-To-Pay (WTP) for an environmental good and their distance to said good is unclear, advising environmental policymakers becomes challenging, e.g., when defining the extent of the market and when aggregating individuals’ WTPs for use in cost–benefit analysis (Bateman et al. 2006). It is widely recognized that the physical consequences of environmental policies or projects are more often than not unevenly distributed across the landscape. Theory and empirical evidence would suggest that the location of environmental goods and services has significant bearings on the welfare that individuals obtain from these (Bateman et al. 2006; De Valck and Rolfe 2018; Hanley et al. 2003; Schaafsma et al. 2013). Welfare estimates of environmental quality changes are likely to be affected by several different spatial aspects such as the geographical size of the area or resource affected by the change, the geographical boundary within which the relevant target population is identified, and the availability of relevant substitutes and complements (Schaafsma 2015; Johnston et al. 2017a, b). These aspects are all related to the concept of distance decay which refers to the commonly observed tendency of welfare estimates to decrease as the distance between the environmental good and the individual increases (Bateman et al. 2006).

Distance decay is relevant to consider when addressing use values such as recreational values. However, it is less clear—from the point of view of economic theory as well as empirically—to what extent non-use values would be subject to distance decay (e.g., Bateman et al. 2006; Jørgensen et al. 2013; Schaafsma et al. 2013; Rolfe and Windle 2012; Bakhtiari et al. 2018). Hanley et al. (2003) found distance decay in both use and non-use values, but the decay was more rapid for use values.

When assessing the values of environmental goods and services, it is commonly found that individuals exhibit heterogeneous preferences for the often wide range of different ecosystem service outcomes that are affected by ecological changes caused by a new project or policy initiative. Different individuals are thus likely to be interested in and motivated by different ecosystem services (Boyd et al. 2016). While some respondents in the given context obtain utility mainly from regulating services (e.g., wildlife conservation, habitat provision, and nutrient recycling), others obtain utility primarily from cultural services (e.g., recreation, physical and mental well-being, and preservation of local livelihoods and lifestyles). The former likely holds significant non-use value components while the latter by definition is directly linked to use values. Since the value components (use and non-use) differ between the many final ecosystem services that are affected by the same underlying ecological or policy change, this raises the question whether distance decay functions will differ between groups of respondents that are particularly motivated by different ecosystem services. In the context of distance decay, De Valck et al. (2017) and De Valck and Rolfe (2018) conclude that to improve our understanding of the influence of heterogeneity of preferences across respondents, more research is warranted. Hence, notwithstanding the ambiguous findings in the literature concerning differences in distance decay for use and non-use values, we follow Hanley et al. (2003) and hypothesize that distance decay will be more pronounced for respondents particularly motivated by cultural ecosystem services than for respondents motivated by other types of ecosystem services.

In distance decay analysis, welfare estimates are typically modeled parametrically as monotonic and continuous functions of distance (Holland and Johnston 2017). This requires the analyst to make assumptions concerning the functional form of the distance decay relationship. Though economic theory predicts the presence of a distance decay relationship, at least for use values, it does not provide any guidance regarding the functional form of the relationship (Kling 1989; Ferrini and Fezzi 2012). A wide range of functional form specifications have been empirically tested within environmental economics (e.g., Concu 2007), transportation (e.g., Beckmann 1999), and geography (e.g., Taylor 2010). Given that the relationships between distance and welfare measures will inevitably be case- and context-specific, the question of which functional form to assume is inherently an empirical one, with limited if any theoretical foundation (Ferrini and Fezzi 2012). As noted by Concu (2007, p. 178) “…a search for the best functional form is necessary”.

The analyst’s choice of functional form specification is, however, far from trivial. Assuming a functional form that is not supported by the data has been shown to lead to biased welfare estimates (Kling 1989). Even though measures of statistical fit may help to identify the functional form that best describes the data in a specific case, the identification will be made within a limited set of candidate functions determined by the analyst. While the common parametric assumptions concerning monotonicity and continuity may be appropriate in some cases, they are likely to be overly restrictive in others—implying a risk of biased welfare estimates (Bateman et al. 2006). Johnston and Ramachandran (2014) highlight that spatial patterns in non-market WTP may indeed be much more complex, e.g., in the presence of local patchiness and hot spots. This potentially renders the use of the commonly used parametric distance decay functions insufficient and misleading. Along the same lines, Jørgensen et al. (2013), Schaafsma et al. (2013) and De Valck et al. (2017) show that non-random spatial distribution of substitutes may also complicate distance decay patterns in WTP beyond what is accounted for when using the commonly used parametric distance decay functions.

Geographic barriers in the landscape, whether naturally occurring or in the form of manmade constructions, separating the individual from the utility-generating environmental good or service are likely to result in discontinuous distance decay patterns (Longley et al. 2005). In case such barriers are present, imposing a continuous distance decay function in the parametric model would likely generate biased welfare estimates (Ferrini and Fezzi 2012). These barriers can work on very different geographical scales, from a local gravel road with a low speed limit that only affects those living close by, to crossing between two islands with a ferry or using a toll bridge.

To avoid the parametric assumptions that are inherently necessary when implementing the typical parametric approaches to modeling distance decay, Ferrini and Fezzi (2012) suggest using Generalized Additive Models (GAM) instead. The non-parametric GAMs incorporate flexible, nonlinear and differentiable functions as smoothing terms to identify the functional form directly from the data, thus avoiding the otherwise necessary assumptions concerning model flexibility and functional form in the parametric modeling approach. Comparing the use of GAMs to more traditional parametric models used in nonmarket valuation studies involving distance decay, Ferrini and Fezzi (2012) find GAMs to provide better model fit as well as a reduced median WTP in a dichotomous choice contingent valuation (CV) study. Similarly, in a payment card CV study assessing the value of new urban parks, Andrews et al. (2017) find GAMs to produce lower welfare measures than standard parametric specifications.

As underlined by e.g. Schaafsma et al. (2012) and De Valck and Rolfe (2018) reliable estimation of the spatial distribution of welfare measures is essential for reliable welfare aggregation in environmental valuation. Given the lack of theoretical justification for a parametric specification of distance decay and the fact that GAM avoids this, it is somewhat surprising that the only applications of GAMs in environmental valuation distance decay contexts so far are Ferrini and Fezzi (2012) and Andrews et al. (2017). While both these applications are set in a CV context, Ferrini and Fezzi (2012) note that it would seem worthwhile to apply GAMs in a stated Choice Experiment (CE) setting.

We contribute to this literature by providing the first empirical test of the performance of GAM for modelling spatial welfare heterogeneity in a CE context, while at the same time taking preference heterogeneity for ecosystem services into account. We test the suitability of GAMs in a setting where common parametric assumptions concerning the shape of distance decay will often fall short, namely in an empirical case where the distance decay function is expected to show a clear jump discontinuity. Specifically, the case concerns welfare impacts of coastal water quality improvements in a specific geographic area. Respondents are sampled from the population on two islands, one close to the area of interest and the other one further away. The two islands are only connected by a toll bridge. We hypothesize that the toll bridge will cause a jump discontinuity in the distance decay of welfare measures due to the jump in travel costs. We test this using a range of standard parametric distance decay models as well as the non-parametric GAM.

A priori knowledge of barriers causing a jump discontinuity will be available in some empirical cases, but unavailable in others, and we therefore conduct our tests in two different “level of knowledge” scenarios. In the first scenario, we assume that the analyst has a priori knowledge of the barrier, i.e., the toll bridge. The analyst can thus utilize this knowledge when setting up empirical models, e.g., by estimating separate choice models for subsamples that are spatially separated by the jump or by simply incorporating a dummy variable in the parametric distance decay function to capture the jump discontinuity. In the second scenario, we assume that the analyst has no a priori knowledge about jump discontinuities being present. Hence, models will have to uncover the presence of the toll bridge entirely from the data with no auxiliary information from the analyst. While this scenario is artificial for our empirical case as the toll bridge is well known, it serves as an example of how well these models perform when jump discontinuities are present, but unknown to the analyst.

Furthermore, we use an experimental setup where respondents are classified according to which type of final ecosystem service they are primarily motivated by. Some of these are use value oriented, and thus sensitive to travel costs, while others are more non-use value oriented and likely less sensitive to travel costs. We hypothesize that the distance decay, and potentially also the impacts of the jump discontinuity, will differ across these different types of respondents.

The following section provides the methodological background, including econometrics. Section three elaborates on the experimental setup and the data collected while section four presents results. Finally, the last section provides conclusions.

2 Methodology

Similar to Campbell et al. (2009), Johnston and Ramachandran (2014) and Czajkowski et al. (2017), we utilize a two-stage modeling procedure to investigate our research questions by first modeling respondents’ preferences for location and level of water quality improvements using a mixed logit (MXL) specification. This model allows for heterogeneity in preferences across respondents, and it enables estimation of individual-specific preference parameters through the derivation of the conditional distribution based on the sampled respondents’ choices. Applying Bayes theorem to derive the expected ratio between the estimated water quality attribute parameters and the parameter estimate for the cost, individual-specific WTP estimates can then be extracted. As noted by Hensher et al. (2005), these individual-specific WTP estimates should not be considered as the individual respondent’s unique set of preference estimates. Strictly speaking, they are rather same-choice-specific parameter estimates for the subgroup of respondents who made identical choices when faced with the same choice tasks. In the second stage, the individual-specific WTP estimates obtained in the first stage are explained using distance decay models. Here we explore a range of typically used parametric distance decay specifications as well as the non-parametric GAM specification, with a particular focus on the ability to capture jump discontinuities.

2.1 Choice Modeling (Stage 1)

A standard Random Parameter Error Component Logit (RPECL) model is used for extracting information about preferences from the observed choices made by respondents (e.g., Train 2009; Scarpa et al. 2005, 2008). Focus group interviews and a pilot test had indicated that preferences for the water quality and location attributes could be expected to be heterogeneous. Hence, these attributes are specified as normally distributed random parameters. A full variance–covariance matrix is specified to allow for correlation patterns across these random parameters (Train and Weeks 2005; Scarpa et al. 2008; Hess and Train 2017). The price parameter is treated as a fixed parameter, even though it implies a fixed marginal utility of income. While specifying a random parameter for the price parameter may be more behaviorally plausible, it is often associated with severe limitations for welfare estimation (Johnston et al. 2017a, b). For instance, the well-known fat tail problems of the lognormal distribution will lead to extreme observations of WTP, which would be detrimental to the second stage of our procedure.Footnote 1 An Alternative Specific Constant (ASC) is specified for the status quo alternative to capture the systematic component of a potential status quo effect. Furthermore, an error component additional to the usual Gumbel-distributed error term is incorporated into the model to capture any remaining status quo effects in the stochastic part of utility. The error component is implemented as an individual-specific zero-mean normal distributed random parameter assigned exclusively to the two experimentally designed alternatives. This is to allow for correlation patterns in utility over these alternatives, capturing any separate variance associated with the cognitive effort of evaluating experimentally designed hypothetical alternatives as opposed to the real-life status quo alternative (Hensher and Greene 2003; Scarpa et al. 2007, 2008). The utility structure can be formally described as:

$$U_{ntj} = \left\{ {\begin{array}{*{20}l} {V(ASC) + \varepsilon_{ntj} ,} \hfill & {j = 1\,(Status\,quo);} \hfill \\ {V(x_{ntj} ,\tilde{\beta }_{n} ,\beta ,\mu_{n} ) + \varepsilon_{ntj} ,} \hfill & {j = 2,3} \hfill \\ \end{array} } \right.$$
(1)

where the indirect utility, V, is a function of the vector of explanatory variables, xntj, as well as the vectors of individual-specific random parameters, \(\tilde{\beta }_{n}\), and the fixed price parameter, \(\beta\). For the two experimentally designed policy alternatives, the common individual-specific error component μn enters the indirect utility function, while the ASC replaces it for the status quo alternative. The unobserved error term εntj is assumed Gumbel-distributed. Individual respondents are denoted by n, while j denotes the alternative and t the choice set. The \(\tilde{\beta }_{n}\) varies over individuals in the population with density \(f(\beta |\uptheta)\), where θ represents a vector of the unknown parameters to be estimated.

The specified RPECL model accommodates estimation of individual-specific estimates of preference parameters for each n. Following, e.g., Greene et al. (2004), Hensher et al. (2006) and Hess (2010), this is achieved by applying Bayes’ rule to construct the conditional density for the random parameters of interest:

$$f(\beta_{n} \left| {Y_{n} ,X_{n} ,\Omega )} \right. = \frac{{f(Y_{n} \left| {\beta_{n} ,X_{n} ,\Omega )f_{\beta } (\beta_{n} \left| {\Omega )} \right.} \right.}}{{f(Y_{n} \left| {X_{n} ,\Omega )} \right.}}$$
(2)

where Ω denotes the underlying parameters of the distribution of βn. The sequence of choices made by individual n is denoted by Yn, and Xn encompasses all elements of xntj for all alternatives and choice tasks. The conditional mean can then be approximated for each random parameter by simulated maximum likelihood:

$$\hat{E}(\beta_{n} \left| {Y_{n} ,X_{n} ,\Omega )} \right. = {\raise0.7ex\hbox{${\frac{1}{R}\mathop \sum \limits_{r = 1}^{R} \hat{\beta }_{nr} \mathop \prod \limits_{t = 1}^{T} \frac{{\exp \left( {\hat{\beta }_{nr} x_{ntj} } \right)}}{{\mathop \sum \nolimits_{k = 1}^{J} \exp \left( {\hat{\beta }_{nr} x_{ntk} } \right)}}}$} \!\mathord{\left/ {\vphantom {{\frac{1}{R}\mathop \sum \limits_{r = 1}^{R} \hat{\beta }_{nr} \mathop \prod \limits_{t = 1}^{T} \frac{{\exp \left( {\hat{\beta }_{nr} x_{ntj} } \right)}}{{\mathop \sum \nolimits_{k = 1}^{J} \exp \left( {\hat{\beta }_{nr} x_{ntk} } \right)}}} {\frac{1}{R}\mathop \sum \nolimits_{r = 1}^{R} \mathop \prod \nolimits_{t = 1}^{T} \frac{{\exp \left( {\hat{\beta }_{nr} x_{ntj} } \right)}}{{\mathop \sum \nolimits_{k = 1}^{J} \exp \left( {\hat{\beta }_{nr} x_{ntk} } \right)}}}}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\frac{1}{R}\mathop \sum \nolimits_{r = 1}^{R} \mathop \prod \nolimits_{t = 1}^{T} \frac{{\exp \left( {\hat{\beta }_{nr} x_{ntj} } \right)}}{{\mathop \sum \nolimits_{k = 1}^{J} \exp \left( {\hat{\beta }_{nr} x_{ntk} } \right)}}}$}}$$
(3)

where \(\hat{E}\) expresses the average of β for the individual n, over the r = 1,,R simulated draws.

Once the individual-specific βs are estimated for each of the random parameters associated with water quality and with the location, the individual-specific WTPs are calculated by simply dividing by (the negative of) the price parameter estimate. Obtaining the individual-specific WTP estimates is the main objective of the first stage of analysis, as these estimates will serve as dependent variable in the second stage of the analysis.

2.2 Distance Decay Models Explaining WTP (Stage 2)

The spatial distance decay of the estimated WTP from the choice models is investigated using both parametric and non-parametric models. The parametric models cover commonly used distance decay specifications where WTP is modelled as a linear, a log, an inverse or a squared function of the distance, d, to the good—see functions in Eq. (4). Furthermore, for each of the parametric models, we specify an additional model incorporating a dummy variable indicating whether the respondent lives on Zealand. This is intended to directly capture the potential jump discontinuity caused by the toll bridge between Zealand and Funen (see Fig. 2).Footnote 2 Models including this dummy variable are intended to reflect a scenario where the analyst has a priori knowledge of the jump discontinuity, while the parametric models excluding it are intended to reflect a scenario where such knowledge is not available to the analyst.

$$\begin{aligned} & WTP_{n} = \varphi_{1} d_{n} + \epsilon ,\quad WTP_{n} = \varphi_{1} { \log }(d_{n} ) + \epsilon , \\ & WTP_{n} = \varphi_{1} \frac{1}{{d_{n} }} + \epsilon ,\quad WTP_{n} = \varphi_{1} d_{n} + \varphi_{2} d_{n}^{2} + \epsilon \\ \end{aligned}$$
(4)

The parametric models are estimated using a Generalized Linear Model (GLM) setup. The non-parametric model is estimated using a GAM where the respondents’ WTPs are explained as a smooth function over the geolocation of respondents. The location of respondents was geocoded based on the centroid of the postal code of the area in which they live. Distances were measured as the shortest possible road distance between this location and nearest entry point at the localized good.

The GAM is a special case of the GLM where a continuous explanatory variable can be treated non-parametrically in smoothing functions (Wood 2006). In this case, WTP is estimated as a function of the x, y coordinates of the respondent’s location.

$$WTP_{n} = f\left( {x_{n} ,y_{n} } \right) + \epsilon$$
(5)

In Eq. (5) \(f\) is the smoothing function over the x, y spatial coordinates of individual n. The smoothing function is made up of the sum of k thin plate regression spline bases, bh, each multiplied by the coefficient α : f\(= \sum\nolimits_{h = 1}^{k} {\alpha_{h} b_{h} \left( {x,y} \right)}\). The thin plate bases consist of a series of complex polynomials which enter into the model similar to other explanatory variables. The α coefficient is estimated similar to regular parameters. Separately these smoothing parameters do not provide any useful information. However, combined they produce a data-driven functional form relationship with the dependent variable. In this case the smoothing function includes two variables which means that instead of a line, they describe a surface in x, y space (von Graevenitz and Panduro 2015). The advantage of the GAM is that the functional relationship is constructed with no a priori assumptions of the specific relationship between WTP and location. In contrast, traditional parametric models a priori restrict the functional form to a specific assumed relationship.

The smoothing function will capture more of the variance of the dependent variable with the increase of thin plate splines. However, with a large number of splines, the risk of overfitting increases. To reduce the risk of overfitting, a penalty of “wiggliness”, \(\theta\), is included in the estimator. The penalty term enters into the object function as an additional term. The parameters of the model, described by B, are estimated based on the following expression.

$$\hat{B} = \arg min_{B} 2(l_{max} - l\left( B \right) + \theta \int {\int {\left( {\frac{{\delta^{2} (f\left( \alpha \right)}}{{\delta x^{2} }} + \frac{{\delta^{2} (f\left( \alpha \right)}}{\delta x*y} + \frac{{\delta^{2} (f\left( \alpha \right)}}{{\delta y^{2} }}} \right)} } \delta xy$$
(6)

The first term of the equation is the deviance measure which measures the difference between the satiated likelihood, \(l_{max}\), and the likelihood of the reduced model \(l\left( B \right)\). The second term is the penalty on the variance of the second derivatives of the smoothing function over the x, y coordinates. The estimator therefore explicitly contains the trade-off between bias and variance, which reduces the risk of overfitting even with a large number of splines (von Graevenitz and Panduro 2015).

3 Experimental Design and CE Implementation

Denmark is characterized by an intensely farmed landscape with more than 60% of the landmass being used for agriculture. This agricultural intensity, combined with a relatively long coastline and shallow fjords, estuaries and coastal waters, has led to problems related to coastal eutrophication (Dalgaard et al. 2014). This has negative impacts on water quality and consequently on the wide range of associated ecosystem services. To address our research questions, we use empirical dataFootnote 3 from a CE study aimed at assessing the WTP for achieving good ecological status in Danish coastal waters in line with the targets of the EU Water Framework Directive (European Commission 2000).

Questionnaire development was based on three focus group interviews with lay audiences and two workshops with water quality experts. This provided valuable insights on the survey design in general as well as on potential preference heterogeneity in relation to different ecosystem services affected by water quality changes. The focus groups were conducted over a period of three weeks in February 2015. The focus groups were conducted in different regions of the country, covering both the capital region, a suburban region, and a rural region. For each focus group, participants were recruited using a local nominator ensuring variation in the group according to gender, age, and educational background. The final set of attributes and levels used in the design of policy alternatives are summarized in Table 1.

Table 1 Attributes and levels used in the experimental design

Choice tasks were generated using the Ngene software (ChoiceMetrics 2014), optimizing the experimental design for D-efficiency, assuming a multinomial logit model. The design was updated using priors obtained from a pilot study with 200 respondents drawn from the same sample frame as the main survey in March 2015. The optimization algorithm was stopped after running for 24 hours without identifying further design efficiency improvements. The resulting design consisted of a sequence of 12 choice sets per respondent, with no blocking involved. Each choice task consisted of two policy alternatives that would improve water quality in particular geographical areas, and a third alternative, namely the zero-cost status quo option with no water quality improvements. Input from the lay audience focus groups and expert workshops were used to construct and describe a 5-level water quality ladder, similar in concept to water quality ladders or indices used in previous stated preference surveys (e.g., Hime et al. 2009; Van Houtven et al. 2014; Walsh and Wheeler 2013). The approach follows the standard classification within the EU Water Framework Directive (European Commission 2000). An example of a choice task as it appeared in the questionnaire is shown in Fig. 1.

Fig. 1
figure 1

Choice task example

Poor ecological status is only considered an option in the status quo alternative since an improvement to the current water quality level is required according to the EU Water Framework Directive. Variations in WTP associated with improvements in different policy areas are modeled by including attribute level dummy variables for each of two policy areas, Lillebaelt and Limfjorden, relative to the third policy area, Oresund, which serves as a reference level. Figure 2 displays the location of the policy areas. Due to the design of the CE, dummies for policy areas are only included for alternatives with water quality improvements. Hence, WTP estimates for policy areas are conditional on some improvement of the current water quality.

Fig. 2
figure 2

Map of Denmark, outlining the sample areas as well as the policy areas and the location of the toll bridge between Funen and Zealand. The small insert in the upper right corner displays the location of Denmark in Europe

To identify which type of ecosystem service each respondent was mainly motivated by, a series of questions consisting of nine attitudinal statements was developed based on a semi-quantitative Q-methodology analysis of the focus group interviews (Armatas et al. 2014). Each of these attitudinal statements were found to indicate preferences for a particular type of ecosystem services (Jensen 2019). Based on the nine attitudinal statements, three survey questions were developed in which the respondent had to choose a preferred statement among three competing statements. The statements and questions are provided in “Appendix 1”. Based on answers to these three questions, each respondent was sorted into one of four distinct groups. The first group was primarily motivated by improving conditions for habitats and biodiversity (regulating services). The second group was primarily motivated by improvements in recreational opportunities (cultural services). The third group was primarily motivated by improving conditions for the fishing industry (provisioning services). The fourth and last group contained remaining respondents who could not be classified as being motivated by any single type of ecosystem service.

The CE data collection was conducted online during April and May 2015, using a pre-recruited panel maintained by a market research firm in Denmark. Sampling was stratified to ensure representativeness on age, gender, geography, and education. Follow-up reminders were sent 2 and 3 weeks after the first contact, resulting in a final response rate of 22%. Having identified and excluded protesters,Footnote 4 a final sample of 1653 respondents living on Zealand and 408 respondents living on Funen was realized, i.e. a total of 2061 respondents providing 24,732 choice observations for the following analysis. The sample displays a wide variation in terms of spatial distribution and, thus, distance to the policy areas.

The original full datasetFootnote 5 also included respondents from the rest of Denmark, but for specifically investigating jump discontinuities in distance decay, we use only the subset of respondents living on Funen and Zealand. The reason is that since these two islands are only connected by a toll bridge, we know that there will be a sudden jump in the relationship between travel costs and travel distance caused by the payment required for crossing the bridge. The price for crossing the bridge one-way was 235 DKK (~ €31) at the time of data collection, with discounts available in the form of same-day return trips for 375 DKK (~ €50). Furthermore, in our distance decay analysis, we choose to focus on a new policy scenario that improves the water quality in the Lillebaelt coastal area to good ecological status. The relevant aggregate WTP thus consists of the WTP for improving water quality from medium to good ecological status plus the WTP for moving the location of the water quality improvement (the policy area) from Oresund to Lillebaelt. Due to the reference attribute levels used in the choice models (see Table 1), the aggregate WTP measure of interest thus relates to a change from a baseline policy in which the water quality in the Oresund coastal area is improved to medium ecological status, to a new policy instead obtaining good ecological status in the Lillebaelt coastal area. The reason for focusing on this policy change is that Lillebaelt, which is located on the western coast of Funen, is relatively close to respondents living on Funen, while all respondents living on Zealand are further away—plus they would have to pay for crossing the bridge to visit the Lillebaelt coastal area. As is evident from Fig. 2, the opposite holds true for the Oresund coastal area. We note that this also implies that there is a natural correlation between the distance and the dummy variable for respondents living on Zealand. As mentioned in Table 1, a third location, the Limfjorden coastal area, was also included as a level in the policy area location attribute. However, since this is located in the northern part of Denmark, which may also be reached by ferry from Zealand, this is left out of the distance decay analysis in stage 2.

4 Results

To provide an overview of the sample as well as a brief comparison of the two subsamples of respondents located east and west of the toll bridge, Table 2 displays a selection of relevant sample characteristics. If the subsamples differ substantially in characteristics that affect WTP, then incorporation of the toll bridge in the subsequent modeling would capture not only the jump in travel costs, but also these other differences between respondents living on Funen and Zealand.

Table 2 Selected characteristics of the sample

Most of the sociodemographic variables, as well as the motivations for particular ecosystem services,Footnote 6 are similar across the two subsamples, suggesting that any effects of the toll bridge on distance decay in the subsequent models would primarily be a consequence of the jump in travel costs. However, as indicated in Table 2, we do find significantly higher household incomes in the Zealand sample, and we also find a higher share of urban dwellers. Higher income should, according to economic theory concerning decreasing marginal utility of income, be associated with lower cost sensitivity, which would translate into generally higher WTP, ceteris paribus. Furthermore, as noted in Hassan et al. (2019), urban dwellers are often found to have higher WTPs than rural dwellers, and this might also translate into displacements of the distance decay relationship (Logar and Brouwer 2018). Hence, these observed demographic differences across the two islands would likely confound the effects of the toll bridge, potentially posing a limitation for our investigation. Essentially, any jump discontinuity observed at the bridge could thus potentially also be caused by these observed differences across the two islands. However, since the higher household income and the greater share of urban dwellers are located on Zealand, the effects on WTP would run counter to that of the sudden increase in travel cost for reaching the Lillebaelt area. In other words, while the increase in travel costs is expected to lead to generally lower WTPs for respondents from Zealand, the higher household income and the greater share of urban dwellers would have the opposite effect.

Underlining the potential relevance of distance in relation to individuals’ WTP for localized water quality changes, Table 2 also documents the geographic fact observed in Fig. 2, that the Funen subsample is much closer to the Lillebaelt area than is the case for the Zealand subsample. Since both Funen and Zealand are islands, this naturally entails that the distance to substitute coastal waters relative to the Lillebaelt coastal area is smaller on Zealand. This may be one of the key drivers behind an observed distance decay.

4.1 Stage 1: The Choice Model

The choice model described in Sect. 2 is estimated for the two subsamples of respondents living on Funen and on Zealand, as well as the full sample. Results are reported in Table 3. All random parameters are assumed to be normally distributed, and the full covariance matrix is estimated in order to allow for all forms of correlation among the random parameters (Hess and Train 2017). All models were estimated using Nlogit 6. Parameter estimates were found to stabilize when using around 700–800 Halton draws for simulations, so we used 1000 Halton draws in all the models presented in Table 3.

Table 3 Choice model results for separate as well as full sample

While the utility coefficients as such are not the focus of this paper, a few remarks concerning the results in Table 3 are in place. The choice models fit well to the data, and almost all model parameters are highly significant. Though preferences are heterogeneous, the majority of respondents have positive preferences for improvements in water quality. At the mean, respondents have significantly stronger preferences for very good ecological status compared to good ecological status (95% confidence limits for the mean parameter estimates do not overlap). Combined with the fact that the random parameter covariances for good and very good ecological status are significantly positive in all samples, this suggests that respondents are generally sensitive to scope.

Given the focus of this paper, we are particularly interested in the utility coefficients associated with the policy areas, i.e. the geographical location of water quality improvements. Models 1a and 1b in Table 3 may indicate that distance decay could be present. While the vast majority of respondents on Funen have positive preferences for improvements in Lillebaelt (relative to Oresund), the opposite is the case for respondents on Zealand. This, however, could be caused by other differences between the two regions, and even if it to some extent is caused by distance decay, these results provide no indications regarding the functional form of the distance decay. This will be investigated more thoroughly in Sect. 4.2.

The models reported in Table 3 are used for generating the individual-specific WTP estimates (as outlined in Eq. 3) which serve as input for the second stage. Given the policy scenario of interest described above, the individual-specific WTPs used in the second stage are calculated as the sum of the parameter estimates for Lillebaelt and good ecological status divided by the price parameter estimate for each individual respondent.

4.2 Stage 2: Modelling Distance Decay

In the following models, the individual-specific WTP estimates extracted in stage 1 are explained by the road network distance to Lillebaelt. Distance decay is estimated using a range of parametric distance decay specifications (Eq. 4) as well as the non-parametric GAM (Eq. 5). Specifications with an additional income variable were tested but are not presented here as the income variable was found to be insignificant across all model estimations. Initially, this raised some validity concerns as we would normally expect higher income to lead to higher WTP as previously mentioned. We thus performed an income sensitivity validity check in the choice models. This confirmed the presence of the expected positive correlation between income and the cost parameter, in line with economic theory. However, it also revealed that respondents with relatively higher income tended to have slightly lower preference parameter estimates for water quality improvements in the Lillebaelt area. Hence, when dividing the estimated individual-specific parameter for a water quality improvement with the cost parameter in order to calculate WTP, the two effects more or less offset each other. Alleviating the initial validity concerns, this is the likely explanation why we find no significant effect of income on the WTP for good water quality in the Lillebaelt area.

In total, nine different model specifications are tested given that each parametric distance decay model is estimated with and without the Zealand dummy variable. The models describe the distance decay in WTP for a new policy that would result in good ecological status in the Lillebaelt area relative to a baseline policy that instead would result in medium ecological status in the Oresund area. We run all nine models for three different segments of respondents; (1) those primarily motivated by cultural ecosystem services, (2) those primarily motivated by regulating ecosystem services, and (3) those not belonging to any of these two groups.

4.2.1 Utilizing a Priori Knowledge Concerning the Presence of a Jump Discontinuity

We first model distance decay using the individual-specific WTP estimates obtained from choice models utilizing the a priori knowledge that the toll bridge represents a jump discontinuity in travel costs, i.e., models 1a and 1b from the first stage. Tables 4, 5 and 6 summarize the model results. Figures 3, 4 and 5 in “Appendix 2” provide graphical illustrations of the estimated distance decay functions for the nine different model specifications and the three segments of respondents. The grey shaded areas illustrate the 95% confidence interval around each regression and the black dots indicate the estimated median WTP at the given distance. The horizontal axis is the distance in kilometers.

Table 4 Distance decay models for cultural services segment
Table 5 Distance decay models for regulating services segment
Table 6 Distance decay models for segment covering remaining respondents

First of all, we note that the distance parameters are significant and of the anticipated signs, implying that there is indeed distance decay in WTP for improvements in the ecological status of water in Lillebaelt relative to Oresund. When comparing across respondent segments, results confirm our hypothesis that distance decay is more pronounced among respondents particularly motivated by cultural ecosystem services. Specifically, we find the strongest distance decay in the WTP for cultural services where, e.g., the linear specification shows distance decay of around 15 DKK/km, compared to 10 DKK/km in the regulating services segment and 9 DKK/km in the segment covering remaining respondents. Visual inspection of Figs. 3, 4 and 5 in “Appendix 2” further confirm that the distance decay functions have a steeper slope for the cultural services segment.

The Zealand dummy parameter, which is intended to explicitly capture the jump discontinuity caused by the toll bridge, is significant and negative in all the parametric distance decay models, and model fit generally improves significantly when adding this parameter. Again, the impact is strongest for the cultural services segment, for which the estimated parameter indicates a sudden vertical jump downwards in WTP of 1492 DKK in the linear specification. Though not significantly lower from a statistical point of view,Footnote 7 the equivalent parameter estimate of around 835 DKK obtained in the linear model for the regulating services segment is substantially lower from a practical perspective, e.g., for use in applied cost–benefit analysis.

The four distance transformations used in the parametric models assume different relations between distance and WTP. Except for the model specifying inverse distance, model performance is similar, e.g., considering R2 or LogLikelihood measures. Differences in model fit across model specifications are most pronounced in the cultural services segment in Table 4. This was also the segment exhibiting the strongest distance decay. Across all specifications and segments in Tables 4, 5 and 6, we note that the simple linear distance decay model with a dummy variable to capture the jump discontinuity is among the best performing models, though differences in model fit are generally small.

The non-parametric GAM approach is persistently among the best performing models when considering model log likelihoods, UBRE, and proportion of variance explained as indicated by the adjusted R2. Especially for the cultural services segment, it is worth noting that the LR-tests in Table 4 indicate that GAM significantly outperforms all the parametric specification models when the Zealand dummy is not included to explicitly capture the jump discontinuity in these models. Hence, in our empirical case GAM seems to offer an approach to account for distance decay that is just as good as any of the commonly used parametric approached—but without requiring the analyst to make restrictive assumptions about the distance decay relationships. Visual inspection of the graphical representations of the distance decay functions in “Appendix 2” confirms that GAM captures the variation in WTP over distance just as well as any of the parametric specifications.

4.2.2 A Priori Knowledge Concerning Jump Discontinuity is not Available

To test the importance of prior knowledge concerning the presence of a jump discontinuity, we re-ran all models in Sect. 4.2.1 using the individual-specific WTP estimates obtained from the choice model based on the merged sample, i.e., model 2 in Table 3. In other words, this analysis scenario pretends that we are not aware of the toll bridge. While artificial in our case, it serves the purpose of mimicking a common situation for analysts applying CEs in practice. Results reported in tables and graphs equivalent to those in Sect. 4.2.1 and in “Appendix 2” are provided in “Appendix 3”. To enable full comparison, this includes models incorporating the Zealand dummy variable even though this would not be relevant if the discontinuity was truly unknown. These models generally still find distance decay in the individual-specific WTP estimates. This is indicated by the significant distance parameters in the models and the downward slope of the distance decay functions in the graphical representations. However, the jump discontinuity is much less evident in these results than in the previous section. Furthermore, the jump is much less pronounced in the graphical representations of the distance decay functions. Hence, it would seem that the distance decay models are incapable of uncovering the jump discontinuity unless it is directly controlled for in the estimation of WTPs. Model-wise, the non-parametric GAM still performs just as well as the best of the parametric models when considering R2 and LogLikelihood. The graphical illustrations leave the impression that GAM is similarly incapable of uncovering the jump discontinuity.

4.3 Consequences for Aggregate Welfare Measures

To assess the potential impacts of the different distance decay modeling assumptions on policy advice, we calculate the aggregate welfare measure of a policy scenario for the entire region based on the assumption that a priori knowledge of the toll bridge is utilized, i.e., based on Sect. 4.2.1. The proposed policy used as an example here involves improving water quality in the Lillebaelt area to good ecological status instead of improving water quality in the Oresund area to medium ecological status.

We calculate the aggregate welfare measure using all nine distance decay models for all three respondent segments. Results are reported in Table 7. The user segment WTPs and distance decay functions are applied to the entire population of 2.7 million people living on Zealand and Funen. Table 7 reports the aggregate welfare measure for each segment as well as the total aggregate welfare measure calculated as the sum of the three segments. This aggregation is based on the segment proportions found in Table 2. For comparison, the last row reports aggregate welfare measures obtained when distance decay is completely disregarded.

Table 7 Aggregate welfare measures in million DKK per year (95% confidence limitsa in brackets)

The regulating services segment obtains a positive welfare measure equivalent to just above 800 million DKK, similar across all models. For the cultural services segment, the proposed policy results in a negative welfare measure, around − 100 million DKK, but here we find slightly greater differences across model specifications. This is likely due to distance decay being more pronounced in this segment than in the other segments. Realizing that more people live in Zealand than on Funen, it is not surprising that the aggregate welfare measure for this segment is negative. Rather than an improvement from poor to good ecological status in Lillebaelt, the majority of these people would likely prefer water quality to improve from poor to medium ecological status in Oresund which is close by and, thus, easier and cheaper to reach for the recreational purposes that are particularly important for this group of people. On the other hand, people who are more interested in regulatory services on average care more about water quality improvements, than where they occur. Hence, for these people the fact that somewhere the ecological status is increased from low to high instead of low to medium is more important than the fact that the change is happening further away. This explains why the aggregate welfare measure for this group is positive even though distance decay does affect WTP and most people live on Zealand—the negative impact of distance is simply dominated by the positive impact of the larger improvement in water quality.

The total aggregate welfare measure calculated from the different distance decay models vary from 644 million DKK to 754 million DKK. Though not significantly different according to the largely overlapping confidence intervals, these differences could potentially matter for conclusions if e.g. used in cost–benefit analysis. Hence, the choice of distance decay modeling approach is not trivial in this respect. Furthermore, compared to the last row in Table 7 which reports the simple non-parametric mean, i.e., assuming no distance decay, it is evident that distance decay, in general, is important to take into account when generating this kind of policy advice. Most of the distance decay specifications produce total aggregate welfare measures for the proposed policy change close to 645 million DKK, which is around 15% lower than when distance decay is ignored.

5 Conclusion

There is good reason to suspect that distance decay in WTP for environmental goods is not continuous. In this paper, we show that barriers in the landscape, here represented by a toll bridge, may result in discrete jumps in the distance decay function. We furthermore show that the aggregate welfare measures are sensitive to the analyst’s choice of whether and how distance decay is accounted for.

We use a common two-stage process to investigate distance decay in the WTP for localized water quality improvements. In the first stage, we estimate individual-specific WTPs based on a typical choice model. In the second stage, we model the variation in these individual-specific WTPs by incorporating distance between the respondent and the good as an explanatory variable. Confirming findings in several previous studies, we find that WTP does indeed decay as distance increases. Testing a range of distance transformations commonly used when assuming some functional form of the distance decay in parametric models, we find that a simple linear specification performs well.

Inspired by Ferrini and Fezzi (2012), we also test the non-parametric GAM approach which based on smoothing terms identify the functional form of the distance decay directly from the data. This approach is somewhat similar to the spatial expansion method used in Schaafsma et al. (2013) and Cameron (2006) which essentially also smoothes over distance to allow for multi-directional spatial patterns. However, a linear relationship is assumed for the distance between the spatial coordinates in these applications. In this respect, the GAM is more flexible. The chosen approach was also inspired by recent developments in the hedonic house price models literature where the spatial coordinates of properties have been used to capture spatial omitted variable bias (e.g. Panduro et al. 2018; von Graevenitz and Panduro 2015). Overall, for our empirical case, we find the GAM approach capable of capturing the distance decay relationships just as well as any of the parametric models. Given that GAM avoids having to make functional form assumptions concerning the ‘true’ distance decay relationship—which not only lacks theoretical foundation but also introduces a risk of biasing welfare estimates if making the wrong assumptions—this would, in line with Ferrini and Fezzi (2012), suggest that GAM presents a useful alternative to the parametric approaches that are more commonly used to incorporate distance decay in WTP for non-marketed environmental goods.

Realizing that water quality changes lead to a wide range of changes in different ecosystem services, some of which are associated with use values while others are more associated with non-use values, and people are likely to have heterogeneous preferences for these ecosystem services, we identify three distinct segments of respondents. The first segment is particularly interested in and motivated by regulating ecosystem services, while the second segment is rather motivated by cultural services. The third segment covers respondents not identified as belonging to any of the first two segments.

In line with Hanley et al. (2003) and Bateman et al. (2006), we initially hypothesized that distance decay would be relatively more pronounced in the cultural services segment which for our empirical case is arguably the most directly linked to recreational use values, whereas regulating services to a greater extent would also encompass non-use values. Though differences are not statistically significant across all model specifications, our results are generally supportive of this hypothesis. This is further supported by visual inspections of the graphical illustrations of distance decay, which generally exhibit a steeper slope for the cultural services segment. As a consequence of this, we find that the choice of distance decay modeling approach has the biggest impacts on estimated welfare measures, and thus obtained policy advice, for this particular segment of the population. This, however, should be considered case-specific as cultural ecosystem services cannot generally be said to be more strongly associated with use-values than other ecosystem service categories.

A more novel investigation in our paper concerns the modeling of jump discontinuities. We utilize an empirical case, where the presence of a toll bridge between two islands presents a clear and well-known jump discontinuity in travel costs. Hence, a discontinuous downwards shift in the distance decay function is expected. Utilizing our a priori knowledge regarding the presence of the toll bridge in the first stage of our procedure, we estimate separate choice models for the two samples of respondents living on one of the two islands. In the second stage, the parametric distance decay models are generally capable of capturing and confirming the jump discontinuity when a dummy variable for the toll bridge is incorporated explicitly in the model. The non-parametric GAM approach performs neither better nor worse than these models.

In practice, CE analysts often do not have solid a priori knowledge concerning geographical barriers in the landscape that may lead to jump discontinuities in WTP. We thus re-ran the first stage choice model on a single, merged dataset, essentially disregarding our knowledge of the toll bridge and instead treating respondents from the two islands as one sample. Using individual-specific WTPs from this choice model made it much harder for the models in the second stage to capture the jump discontinuity. Some of the parametric models with a dummy variable incorporated did recover a jump discontinuity caused by the toll bridge, but at the expense of the rest of the distance decay function not fitting particularly well to the data.

An important limitation in our empirical case is that the population on the two islands differ in other aspects than merely the presence of the toll bridge. Specifically, people on the island that is farther away from the environmental good on average have significantly higher income and are more prone to living in urban areas. Both aspects are likely to affect WTP and may thus be confounding the effect of the toll bridge in terms of driving the distance decay jump discontinuity. While we are unable to disentangle these effects in our empirical case, we note that both these effects would be expected to cause an upwards jump discontinuity in the distance decay function, i.e., the opposite directional impact as that of the toll bridge. Since our models do find a significant downwards jump discontinuity, we conclude that these confounding effects are at least not offsetting the effect of the toll bridge. Had we been able to control for these other differences, we would most likely just have found an even bigger jump discontinuity caused by the toll bridge.

Ferrini and Fezzi (2012) suggested that the non-parametric GAM approach could be beneficial in a CE context. We provide a first effort in this direction by employing a two-stage procedure and with a particular focus on addressing potential differences in distance decay patterns for segments of people who differ in their interests in different ecosystem services categories. A logical next step for future research is the possibility of setting up a simultaneous modelling approach, incorporating the GAM directly in the choice models rather than using a two-stage procedure. Further research should also be dedicated to investigating the performance of GAM in the presence of more complicated non-continuous distance decay patterns such as spatial patchiness and hot spots.