1 Introduction

Environmental regulations can have substantial economics impacts. As a result, there is considerable interest in knowing the precise relationship between emissions and ambient pollution levels in order to develop the optimal stringency pollution regulation. As long as the relationship between emissions and ambient pollution is known, any uncertainty reduces to Weitzman (1974) uncertainty in damages associated with a given level of ambient pollution or the costs borne by emitters for reducing emissions by a given amount. This type of uncertainty, which we call “distributional uncertainty” to reflect the presence of stochasticity in the costs or benefits of emissions, continues to receive considerable attention in the economics of pollution control.

In this paper, we examine the implications of a different kind of uncertainty we term “structural uncertainty”. Structural uncertainty occurs when the relationship between pollution emissions and ambient pollution levels is unknown. Examples include when the regulator is unaware of specific pollution sources or substantively misinformed about the magnitude of pollution coming from a source. We show that this form of uncertainty gives rise to a unique set of modeling implications and regulatory issues.

A great deal of evidence in the physical science literature shows there is considerable uncertainty in the relationship between emissions and ambient concentrations of various pollutants. For example, it was recently discovered that up to 44% of sulfates in Los Angeles, the most studied air basin in the world, come from large ships burning bunker fuel in the Los Angeles-Long Beach harbors (Dominguez et al. 2008). This far exceeded previous apportionments of ambient sulfate concentrations to harbor traffic.Footnote 1 Even though sulfates were already known to come from ships, the level of contributions of their emissions to ambient pollution levels was very inaccurate. As a result, known sulfate emissions were “misapportioned”. Recent work regarding the U.S. suggests that yearly anthropogenic emissions of methane, a powerful greenhouse gas (GHG), are 50–70% higher than previously thought (Miller et al. 2013). Put another way, there are missing emissions inventories for methane. As a result, each ton of known methane was previously over-attributed to causing extant atmospheric methane concentrations. To that end, fugitive emissions from leaks in industrial processes of methane and Volatile Organic Compounds (VOCs) are thought to be both large and notoriously difficult to estimate (Chambers et al. 2008; Wu et al. 2014). Similarly, the relationship between mercury emissions and ambient mercury levels is subject to considerable uncertainty even though the health risks of mercury are now well-known. The uncertainty exists in large part since “it is difficult to obtain data [on mercury emissions] from national authorities” leading to uncertain emissions inventories (Pacyna et al. 2010). Regardless, mercury emissions have been subject to recent regulatory action. China’s carbon emissions are notoriously challenging to track: the government announced in late 2015 they had been burning 17% more coal per year than previously announced.Footnote 2 In each case, and in many others, ambient levels of pollution are known but the physical relationship between different emission sources and ambient pollution levels is unknown due to imprecision in attributing the set of total actual emissions to observed ambient pollution levels.Footnote 3

Potential welfare gains from a better understanding of the structural relationship between emissions and ambient pollution seem to be largely ignored. For example, the 2010 EPA budget requested $842 million (less than 9% of the total agency budget) for improvement of science and technology. Of that amount, only a very small fraction is spent on improved modeling of the pollution output-ambient quality relationship, more accurately recording emissions or searching for new pollution sources. Such expenditures are orders of magnitude less than the direct costs of pollution regulation by firms, households and local governments nationwide (Gray 2002; Gray and Shimshack 2011). Therefore, it is possible there may be economic gains from basing pollution control regulations on a more precise understanding of the relationship between emissions and ambient pollution levels.

The U.S. EPA, and its counterparts in OECD countries and many developing countries, utilizes both an air pollutant emissions monitoring system and an ambient air quality monitoring system to measure air pollutants. Controlling for other factors such as temperature and wind, the two systems are linked to each other using an air dispersion model which takes emissions as an input affecting the stock of ambient pollution. The Community Multiscale Air Quality (CMAQ) model is one such model delivering estimates for ozone and particulates and its approach has been widely adopted in the atmospheric modeling literature (Kim and Ching 1999). The potential for discord between the two monitoring systems is well known among air quality modelers (McKeen et al. 2004, 2009) and difficulties in reconciling the two systems has taken on increasing importance as efforts are made to produce real time forecasts of ambient quality (Wang et al. 2011; Zhang et al. 2012a, b).

Possible problems in translating emissions into ambient concentrations was occasionally mentioned in early economic work on pollution control (Spence and Weitzman 1978; Crandall 1981). The implications of imperfect information governing the relationship between emissions and ambient pollution, however, has received attention limited only to relatively narrow pollution control problems rather than pollution control problems more broadly. Examples include evaluating optimal pollution control when the effect of emitters on ambient pollution is stochastic (Horan et al. 1998; Hamilton and Requate 2012), when damages from emissions vary spatially (Muller 2011; Fowlie and Muller 2013), when precisely which out of a set of possible non-point source emitters do in fact emit (Segerson 1988; Hansen 1998), and non-point source water pollution permit allowances (Horan and Shortle 2005). This literature considers structural uncertainty but do not explicitly consider cases where there is uncertainty over if the full set of all polluters are identified.

We build on this literature by investigating the more general issue of how allowing uncertainty in how known emission sources maps to observed ambient pollution levels impacts (1) expected damages from emissions and (2) the optimal stringency of pollution control.Footnote 4 in doing so we focus on air pollutants but the qualitative results generalize other forms of pollution, such as the water pollution examples discussed above. We do this in three parts, each of which is a distinct contribution. First, we develop a model of structural uncertainty. A key aspect of our model is that the regulator must estimate the marginal contribution of observed emissions on ambient levels of pollution. In that sense, our model is akin to a structural model rather than a reduced form model of damages from emissions.Footnote 5 Second, we use the model to identify how resolving uncertainty through discovering new emission sources impacts optimal stringency of ambient pollution levels. This approach is motivated by discoveries of new sources in the physical sciences literature discussed above. Third, we investigate how explicitly accounting for the presence of structural uncertainty impacts optimal pollution stringency levels. This final approach seeks to be more normative, whereas the previous two are positive economic models.

2 Model

This section develops our theoretical model. Throughout we assume the vector of optimal emission levels is achievable via a tax or tradable quota system. We do not address the myriad issues with actual implementation of pollution control mechanisms here. We focus only on the stringency of pollution control in this paper.

Each firm, industry or sector i chooses a level of emissions, \(x_{i}\), to maximize profits \(\pi _{i}(x_{i})\). The indicator ‘i’ could also index countries or anthropogenic versus geological sources. We assume that \(\pi _{i}(x_{i})\) is initially increasing in x and twice differentiable with \(\pi _{i}''(x_{i})<0\). Similar in in spirit to Horan and Shortle (2005), each unregulated firm emits at a level \(x^{*}_{i}\) such that \(\pi _{i}'(x^*_{i}) = 0\). Let the regulator know the exact form of \(\pi _i(x_{i})\) for all i with certainty. There are N total emitters. Firm profit functions can be either homogeneous or heterogeneous (e.g., \(\pi _i()\) or \(\pi ()\). The steepness and level of marginal profits from emissions can vary by source.Footnote 6

Emissions from firms contribute to the ambient pollution level, y, which decreases social welfare according to a strictly increasing damage function D(y). We make two simplifying assumptions about ambient pollution and associated damages. First, the regulator knows the damage function D(y) perfectly. For exposition we assume that marginal damages are linear in y: \(D'(y) \propto c y\) although we don’t leverage it in a proof until our third Proposition. Second, the ambient level of pollution, y, is perfectly observable. We make this assumption throughout. Ambient pollution levels are actually only measured at discrete locations (e.g., monitoring stations) at discrete points in time (e.g., when readings are taken). Ambient levels of pollution between these spatio-temporal fixed points is itself uncertain. We do not investigate the implications of this source of uncertainty.

We asume ambient pollution is related to emissions according to the simple linear function \(y=f(\{ x \}_{x=1}^N)= \Sigma _i x_i \beta _i+\epsilon \), where \(\epsilon \) is an idiosyncratic term and \(\beta _i\) reflects the marginal physical impact of each emitter’s contribution to, y, the ambient pollution level. The idiosyncratic term, \(\epsilon \), explicitly models the typically assumed imperfections of any pollution dispersion model.Footnote 7 The coefficient \(\beta _i\) represents the contribution of different emitters to ambient pollution levels at a particular location. The existence and importance of these “transfer coefficients” is well-understood in the atmospheric chemistry literature (Fu et al. 2012).

A linear relationship between ambient pollution and emissions is appealing for simplicity but it is not innocuous. First, while linearity is a reasonable approximation in many cases, such as heavy metals and primary particulate matter, it is not a reasonable assumption in many other cases such as where the relationship with precursor emissions and ambient pollution is known to be non-linear. Second, since there is no explicit spatial dimension in this model, our ambient pollution y should be thought of as ambient pollution in a particular location.Footnote 8 Third, there is no temporal element in this initial flow pollutant model. We will later show that the important aspects of this model carry through directly to a stock pollutant when the stock’s decay rate is known. While our model can be generalized, we restrict our analysis to the simplest case possible to clearly show the first order effects of structural uncertainty on optimal pollution control.

Emissions from the ith polluter in the model, \(x_i\), increases ambient pollution levels at some rate \(\beta _i\) (e.g., \(E[y] = \Sigma _i x_i \beta _i\)). We initially assume all polluters’ emissions have the same physical impact on ambient pollution levels (e.g., \(\beta _i = \beta _j\)), but later relax this assumption.Footnote 9

If the marginal physical effects of emissions on ambient levels of pollution are known the social planner (e.g., regulator) sets the vector of emissions to maximize expected social welfare. As a result, the regulator maximizes the sum of all profits less expected damages from ambient levels of pollution (recall the expectations operator is needed since realizations of y are stochastic):

$$\begin{aligned} \max _{\{x\}} \Sigma _{i=1}^{N} \pi _{i}(x_{i}) - E[D(y)]. \end{aligned}$$
(1)

The N first order conditions for optimality are:

$$\begin{aligned} \pi _{i}'(x^{*}_{i})=E[D'(y^*)\beta _{i}] = D'(y^*)\beta _{i} \quad \forall \; i = 1,...,N. \end{aligned}$$
(2)

The first order conditions show the standard result that the change in profits from reducing a unit of emissions by the firm should equal the expected marginal damages caused by an increase in firm i’s contribution to the ambient pollution levels. Importantly, marginal damages of a firm’s emissions are weighted by the contribution of their emissions to ambient pollution levels, \(\beta _{i}\). Given our assumptions that marginal damages are linear and the relationship between emissions and ambient levels of pollution, \(\beta _i\), is known we can drop the expectations operator in Eq. (2). Finally, due to our homogeneity assumption across profits, \(\pi _{i}\), and the relationship between emissions and ambient levels, (e.g., \(\beta _{i} = \beta _j\)), the restricts all polluters to emit at the same level \(x^{*}\) satisfying Eq. (2). This represents the classic non-depletable externality problem.

2.1 Modeling Uncertainty Between Emissions and Ambient Pollution

We now relax the assumption that the marginal physical effect of known emitters on ambient pollution, \(\beta _i\), is known and that all emitters are identified. We model the regulator’s task as both (1) estimating the marginal physical effect of known emitters and (2) using that parameter to inform the stringency of pollution regulation. The regulator’s task in this subsection is estimating marginal physical impacts, \(\beta \), of each emitters’ contribution to ambient pollution levels (call the estimate \({\hat{\beta }}\)). We discuss the EPA’s procedure for doing this in the “Appendix”. To make our example as clear as possible, we initially assume the regulator completely ignores that they observe an incomplete set of emitters. We also initially assume emitter homogeneity (e.g., for both \(\beta \) and \(\pi (\cdot )\)).

Formally, assume that the regulator has identified only a subset of \(K = \alpha _0 N\) out of N total emitters where \(\alpha _0 < 1\). There are two helpful ways of formalizing “\(\alpha _0\)” in this example. First is as the percentage of identified emitters. A second way of representing the percentage of known emitters, \(\alpha _0\), in this situation is as a Nx1 vector of ones and zeros. If the ith emitter is known, then the ith place define a vector \(\underline{\alpha _0}\) will have a one. Alternatively, if the jth emitter is unknown then the jth place in the vector \(\underline{\alpha _0}\) will have a zero.Footnote 10 Since all emitters are homogeneous except for being identified or unidentified, then, we can interpret observed emissions as \(\alpha _0\) percent of the sum of total emissions observed by the regulator without loss of generality. Recall, though, that the regulator does not know \(\alpha _0\). For much of this section, we use the scalar notation for notational ease.Footnote 11 Introducing the \(\alpha \) parameter in this way is the key distinguishing factor in our approach relative to other structural uncertainty approaches such as those on water quality trading non-point source pollution (Segerson 1988; Horan and Shortle 2005) or transfer coefficients (Fowlie and Muller 2013).

Assume that the regulator observes a series of observations for emissions and ambient pollution levels: \(\{{\tilde{x}}_t, y_t \}_{t=1}^T\). Stack ambient pollution levels in a vector Y and observed emissions for each time period t in a vector \(\tilde{X}\). As before, assume the true data generating process is \(y_t=\Sigma _{i=1}^N x_{i,t} \beta _i+\epsilon _t\) but also assume \(\epsilon _t\) is a well-behaved normally distributed random error component and that emissions affect ambient pollution identically. For illustrative purpose, suppose the regulator estimates the coefficient vector \(\beta \) by regressing Y on \(\tilde{X}\) using Ordinary Least Squares (OLS). While regulators may not actually perform OLS to parameterize air dispersion models, we want to show very precisely in the simplest case how inferring transfer coefficients from an incomplete emissions inventory leads to biased estimates of transfer coefficients due to over-attribution of known emitters to ambient pollution levels.

Estimates from attributing known emissions to observed ambient pollution levels lead to upwardly biased estimates. Specifically, estimates take the form \({\hat{\beta }} = \frac{Y'\tilde{X}}{\tilde{X}'\tilde{X}} = \frac{Y'X}{\alpha _{0} X'X} = \frac{\beta }{\alpha _{0}}\). Because \(\alpha _{0} \in (0,1)\) due to incomplete information of the regulator, the estimate of marginal physical effects of known pollutants, \({\hat{\beta }}\), is upwardly biased in this case.Footnote 12 Note that if the regulator knew \(\alpha _0\) with certainty, they could correct their biased estimate of \({\hat{\beta }}\), which we discuss in detail below.Footnote 13

Consider a regulator who uses the estimated marginal effect of known emitters on ambient pollution from the misspecified model above, \({\hat{\beta }}\), to set emissions levels as a function of expected ambient pollution levels. The resulting optimality condition obtained from maximizing expected welfare is:

$$\begin{aligned} \max _{\{x\}_1^K}&\Sigma _{i=1}^{K} \pi _{i}(x_{i}) - E[D(y)|\alpha _0] = \Sigma _{i=1}^{K} \pi _{i}(x_{i}) - E[D(\Sigma _i x_{i,t} \hat{\beta _i})|\alpha _0] \end{aligned}$$
(3)
$$\begin{aligned} \pi _{i}'(x^{*}_{i})= & {} E[D'(y^*){\hat{\beta }}] = D'(y^*){\hat{\beta }} \; \; \; \forall \; i = 1,...,K = \alpha _0 N. \end{aligned}$$
(4)

where \(E[y^{*}] = \tilde{X}^{*}{'}{\hat{\beta }}\). Note that conditioning on \(\alpha _0\) indicates the regulator estimates \(\hat{\beta _i}\) for all identified emitters \(i=1,...,K = \alpha _0 N\). Given our homogeneity assumption, \(\hat{\beta _i} = \hat{\beta _j} = {\hat{\beta }}\). Since \(\alpha _{0}<1\) and \(\pi _{i}''(\cdot )<0\), the \({\tilde{x}}^{*}_{i}\) implicitly defined by the set of Eq. (4) must be lower than in the case above when \(\beta \) is estimated without upward bias.Footnote 14 The main implication for individual firms is that known sources are over-regulated in this misspecified model relative to when marginal damages of known emissions are known. The reason is that perceived marginal damages of known emitters are higher than their actual damages.

If the regulator’s estimate of marginal physical effects, \(\hat{\beta _{i}}\), is an unbiased estimator of \(\beta _{i}\), then expected marginal costs of a firm’s emissions equals actual marginal costs and the problem reduces to a standard treatment of multiplicative uncertainty. The main difference between this treatment and the seminal work on multiplicative uncertainty in Hoel and Karp (2001) is that our multiplicative uncertainty maps to damages and profits from emissions (via uncertainty in \(\beta \)) rather than assumed uncertainty over costs of abatement themselves.

This structural uncertainty model can be extended to fit the stock pollutant model of Newell and Pizer (2003). Assume yearly contributions to a stock of pollution, \(S_t\) is \(y_t\). Assume that \(S_t\) is perfectly observed. As in Newell and Pizer (2003), the equation of motion is \(S_t = (1-\delta )S_{t-1}+y_t\) where \(\delta \) is the stock pollutant decay rate. When \(\delta \) is known, \(y_t = S_t - (1-\delta )S_{t-1}\). In any given year, the set of known emitters contributes to the stock of ambient pollution as in the flow pollutant case since the decay rate is known (e.g., \(S_t - (1-\delta )S_{t-1} = \Sigma x_t \beta + \epsilon _t\)). As a result, uncertainty over \(\beta \) can be extended to stock pollutants so long as the net present value of damages is considered. If the decay rate \(\delta \) is not known then the stock pollutant case presents a unique set of challenges we leave to future research.

3 Analytical Results

This section presents analytical results from the theoretical model when the regulator must estimate marginal physical effects of known emissions from an incomplete set of emitters. The marginal benefit of emissions (e.g., \(\pi _i(\cdot )\)) and the marginal physical effects of emissions (e.g., \(\beta _i\)) are allowed to vary across firms.

3.1 Discovering New Emission Sources

If a scientific discovery reveals more emission sources than were previously known, then there are more emissions to attribute to ambient pollution levels (e.g., \(\alpha _0\) increases to \(\alpha _1\)). As a result the marginal damage associated with emissions from each emitter that was identified before the scientific discovery is lower since the marginal contribution of their emissions to ambient levels of pollution is lower. Put another way, \(\beta _i\) is revised down when the vector of transfer coefficients are re-estimated since observed emissions are now larger than they were before.Footnote 15 This is similar to implications of the recent discovery that there were substantial unaccounted for inventories of fugitive methane emissions (Miller et al. 2013).

Figure 1 shows the intuition for this result graphically. As the transfer coefficient of known emitters becomes less upward biased (e.g., \(\hat{\beta _0}>\hat{\beta _1}>\beta \)), the curve representing perceived marginal damage of emissions rotates down.Footnote 16 Figure 1 holds ambient pollution levels, y, fixed in order to highlight how the rotation of the marginal damage curve is exclusively tied to changes in the estimated marginal physical effect of known emitters on ambient pollution levels, \({\hat{\beta }}\).

Fig. 1
figure 1

Optimal firm emissions under different \({\hat{\beta }}\)

An important result from discovering new emissions sources is that it is always strictly cheaper to achieve a given level of ambient pollution so long as at least one newly discovered emitter has lower marginal abatement costs than that of a previously regulated firm: \(\frac{\partial \pi _{i} (x^{*}_{i,0})}{\partial x} > \frac{\partial \pi _{j} (x_{un}^{*})}{\partial x} =0\), where \(x_{un}^{*}\) is the emissions level of the unregulated emission source. An increase in the set of emitters increases the emissions of previously known emitters and decreases the emissions of a newly discovered polluter if the marginal profit of the unregulated source is less than the regulator’s expectations of their marginal damages.

It is always weakly cheaper to achieve a given level of ambient pollution if a new emissions source is discovered. However, it may not be optimal to decrease ambient levels of pollution. Whether it is optimal or not for actual ambient pollution to decrease when a new emissions source is identified depends on what happens to expected ambient pollution levels when a new emitter is discovered (all proofs in the “Appendix”):

Proposition 1

An increase in the set of known emitters may decrease the expected level of ambient pollution if 1) marginal physical effects are sufficiently biased upward and 2) the set of new emitters is sufficiently small.

Intuition suggests that discovering new emitters leads expected emissions to increase. Proposition 1 says that once new emitters are discovered, expected ambient pollution levels could be lower than before. The level of expected ambient pollution can change once the new emitters are discovered because (1) their emissions are now counted in inventories, (2) marginal physical effects of previously identified emitters were biased upward and (3) the regulator must revise their estimates of marginal physical effects of known emitters and newly discovered emitters. The new expected ambient pollution level is a function of the interaction these three parameters. Thus, expected ambient levels of pollution can change even before the regulator decides to restrict emissions from the newly identified emitter.

Proposition 1 results from non-linearity in revisions of marginal physical effects from previously identified emitters. Sometimes the change in the estimated physical effects from previously known emitters dominates the additional contribution from newly discovered sources and vice-versa. As a result, expected ambient pollution levels can increase or decrease as new sources of emissions are discovered. If marginal physical effects are biased downward; then expected emissions always increase.

While Proposition 1 deals with changes in expected pollution, there is the larger question of how newly discovered sources or transfer coefficient revisions affect optimal expected pollution levels. If expected ambient pollution falls upon the discovery of new emitters, marginal expected damages also fall, so long as the damage function is convex leading to the following two Propositions:

Proposition 2

If expected ambient pollution decreases upon the discovery of new emitters, then previously known emitters’ optimal emissions levels strictly increase. Conversely, if expected ambient pollution increases, previously known emitters’ optimal emissions levels strictly decrease.

Proposition 3

An increase in the set of known emitters can decrease the optimal level of ambient pollution if benefits of emissions for newly discovered emissions is sufficiently large (e.g., decreasing emissions is sufficiently costly).

Proposition 2 relates the nature of the damage function and to updated estimates of the marginal effect of emissions on ambient levels of pollution. Expected damages from ambient pollution levels are a function of three things: (1) the nature of the damage function, (2) the emission levels of known emitters, and (3) the estimated contribution of those known emissions on ambient pollution levels. The emission levels of known polluters and the estimated contribution of those known emissions on ambient pollution levels form expected ambient pollution levels.Footnote 17 If the joint effect of changes in damages from expected ambient levels of pollution and updated estimates for a particular emitter is strong enough (e.g., if expected marginal damages attributable to a specific emitter increase), then that emitter should be regulated more intensely.

Proposition 3 identifies the impact of discovering additional emitters on optimal ambient levels of pollution. There are four effects which all occur simultaneously when a new emitter is discovered which all affect whether expected ambient pollution levels increase or decrease upon discovery of additional emitters. First, the estimates of marginal physical effects of emissions on ambient pollution levels decrease. Second, there are now more known emitters in each period. Third, each additional known emitter is now regulated. Fourth, the level of emissions from the previously regulated firms can be adjusted. The first and third effects act to decreases ambient pollution levels while the second acts to increase them. The final effect is the response of the regulator: in response to effects one through three the regulator could increase, decrease or leave regulated emission levels unchanged. Since there are effects inducing the regulator to both increase and decrease regulated emissions, it is not clear whether previously identified emitters will be regulated more or less stringently.

Fig. 2
figure 2

Example of discovery of new emissions sources

The key intuition behind Proposition 3 is from allowing the marginal benefits of emissions to vary by source (e.g. \(\pi _i \ne \pi _j\)). If the marginal benefit of emissions of the newly discovered polluter is large (e.g., \(\pi _j'(x_j)\) steep), then the increase in emissions from the previously identified source (\(x_{i,1}^* - x_{i,0}^*\)) could be smaller than the decrease in emissions from the previously unidentified source (\(x_{j,0}^* - x_{j,1}^*\)) for a given revision in the marginal impact of emissions on ambient pollution levels. As a result, an increase in ambient pollution may be optimal, as shown in Fig. 2. This situation arises when a new source of emissions is discovered that cannot be changed for any plausible cost, as is the case when new emissions are part of the earth’s natural processes. Such discoveries were made by Keppler et al. (2006) and Etiope and Ciccioli (2009) when they found that both plants and the ocean floor are major sources of potent greenhouse gases.Footnote 18 Alternatively, if new emissions sources can be cheaply reduced, such a discovery would result in lower optimal total ambient pollution levels. Recent recognition of the greenhouse gas potential of atmospheric black carbon from stoves in India is one such example (Ramanathan and Carmichael 2008).

Taken together, Propositions 2 and 3 imply that if the resource regulator has misspecified the relationship between emissions and ambient pollution levels, then known emitters are often over-regulated relative to the full information case. Hence, known emitters in this scenario may have an incentive to learn more about the true nature of the polluting process.

Both of these Propositions rely on the assumption that the regulator under counts emissions. If they instead over count emissions then (e.g., account for too much fugitive dust) then the results reverse. These two Propositions are explicit about this with their key assumption: if \(\alpha \) is assumed to be greater than one the converse of these results would be true. As a result, each Proposition has an associated Corollary which we omit here in the interest of conserving space. If the regulator over counts fugitive emissions the opposite would be true and known emitters would be under-regulated relative to the full information case.

3.2 Accounting for Uncertain Emission Inventories

Here we consider the case when the regulator accounts for the possibility they may observe only a subset of emitters.Footnote 19 We primarily assume that all emitters are homogeneous in most, but not all, of this section to show the first order effects of accounting for structural uncertainty.

Assume the regulator knows that \(\alpha \;\le \; 1\), but only has prior beliefs over the true fraction of identified emitters at any point in time: \(\alpha \; \sim \; \mu (\alpha )\). By acknowledging imperfect information, the regulator knows their estimates of identified emissions’ marginal effect on ambient pollution levels are upwardly biased and knows there is a fraction of unknown emitters of size \((1-\alpha )\).

There are two implications of allowing for priors \(\mu (\alpha )\) over the fraction of identified emissions. First, it impacts the stock of total expected emissions inventories. Second, it impacts estimates of marginal physical impacts of emissions on ambient pollution levels. Continuing with the OLS intuition from above, conditional on a particular percentage of known emitters, \(\alpha \), the regulator can correct for the bias in marginal physical impacts by estimating \({\hat{\beta }}\) if they multiply by by \(\alpha \): \(\alpha {\hat{\beta }} = \alpha \frac{\beta }{\alpha } = \beta \). Put another way, it is possible that conditional on knowing \(\alpha \), it is possible that \(E[{\hat{\beta }}|\alpha ] - \beta < {\hat{\beta }} - \beta \) if \({\hat{\beta }}\) is the estimated effect when not accounting for bias; in this way, conditional on knowing the level of incomplete information it is possible to reduce or eliminate bias in an estimated transfer coefficient. In this way, priors \(\mu (\alpha )\) imply a distribution over physical impacts as a function of the model used to estimate marginal physical impacts by atmospheric chemists.

Maintaining homogeneity of all emitters for simplicity, take \({\tilde{x}}\) to denote the actual emissions of each of the \(\alpha N\) identified emitters and \(x_{un}^{*}\) to represent unidentified emitters. When accounting for structural uncertainty, the regulator can account for the contributions of the unknown emitters in contributing to ambient levels, \((1-\alpha ) N x_{un}^{*} \beta \) because they condition on \(\alpha \). However, at any point in time the regulator does not know the true fraction of identified emitters but instead has prior beliefs \(\mu (\alpha )\) over the distribution of \(\alpha \). As a result, the objective function of the regulator when accounting for structural uncertainty is:

$$\begin{aligned} \max _{\{{\tilde{x}}\}_{1}^{\alpha N}} \; \int (\alpha N)\pi _{i}({\tilde{x}}) + (1-\alpha )N\pi _{i}(x^{*}_{un}) - D(\alpha N {\tilde{x}} \beta + N(1-\alpha )x^{*}_{un} \beta ) \mu (\alpha )d\alpha . \end{aligned}$$
(5)

Note that the regulator has only \(\alpha N\) control variables since only \(\alpha N\) emitters are known with certainty. The resultant first order conditions for the known bias case is therefore:

$$\begin{aligned} \pi _{i}'({\tilde{x}})= & {} \int D'(\alpha N {\tilde{x}} \beta + N(1-\alpha )x^{*}_{un} \beta ) \beta \mu (\alpha ) d\alpha \; \; \forall \; i= 1,...,\alpha N \nonumber \\= & {} E [D'(Y)\beta |\mu (\alpha )] \end{aligned}$$
(6)

In this circumstance, the regulator accounts for the known bias in both calculating the expected level of ambient pollution and in correcting their estimates of the marginal physical effect of emissions on ambient pollution, making this management regime dynamically consistent. No elements of the unknown portion of ambient pollution, \((1-\alpha ) N x_{un}^{*} \beta \), are known to the regulator. However, if \(E[\alpha |\mu (\alpha )] < 1\), then there must be more total emissions than the set of observed emissions. In the first order condition, Eq. (6), we can see precisely where multiplicative uncertainty enters the regulators problem since the estimated transfer coefficient, \(\beta \), is a function of priors, \(\mu (\alpha )\). The expectations operator now is evaluated over \(\beta \).

Now consider the case of the regulator setting optimal equilibrium ambient pollution and emission levels when accounting for structural uncertainty versus the case when only a subset of emitters are known but the marginal effect of all emitters is correctly identified in both cases, akin to Sect. 2.1 above:

Proposition 4

If only a subset of homogeneous emitters is accounted for by the regulator but marginal emissions are estimated unbiasedly, it is optimal to over-regulate known emitters when accounting for structural uncertainty with Bayesian priors relative to when it is not accounted for.

Proposition 4 stems from the regulator acknowledging their policy will have less influence on actual ambient pollution levels than a regulator with naive beliefs. Regulators in this case know they have biased estimates of marginal physical effects due to the possibility of unknown unapportioned sources of emissions. They balance the loss in profits of known emitters with the social welfare gains of lower expected ambient pollution.

Consider the regulator setting optimal ambient pollution and emission levels when accounting for structural uncertainty versus the case when they are not accounting for structural uncertainty. Because \(\alpha \) can impact the estimate of marginal physical effects of emissions on ambient levels of pollution non-linearly (e.g., the OLS example has that \({\hat{\beta }} = \frac{\beta }{\alpha _0}\)), deriving analytical results are more challenging. We therefore make the strong assumption that the slope of marginal damages with respect to ambient pollution levels is linearly increasing. Even with this restrictive assumption we are unable to determine whether currently known emitters should always be regulated more or less stringently:

Proposition 5

For linearly increasing marginal damages and upwardly biased estimates, it is not optimal to regulate known emitters more stringently when accounting for structural uncertainty if expected marginal damages when accounting for structural uncertainty are lower than when not accounting for structural uncertainty.

There are three key pieces of intuition for Proposition 5. First, if the set of unknown emitters is believed to be large, then there is a large quantity of emissions that the regulator believes they cannot control. This is the direct effect of structural uncertainty on the regulator’s problem. This gives the regulator incentive to increase regulatory burden on known emitters when they account for structural uncertainty since unapportioned emission sources increase the marginal cost of an additional unit of pollution from known emitters. Second, estimated marginal physical effects of emissions from known emitters are always lower when the regulator accounts for structural uncertainty relative to when they do not. This gives the regulator incentive to reduce the regulatory burden on known emitters when they account for structural uncertainty. This is the indirect effect of uncertainty on the regulator’s problem. Third, if the marginal profit from emissions is small, then the importance of the first two effects is magnified since, from a social welfare perspective, it is costly to significantly alter emissions of known polluters. Even for linearly increasing marginal damages, then, we cannot say whether known emitters should always be regulated more stringently when accounting for structural uncertainty. Lastly, while full bias correction is feasible conditional on priors \(\mu (\alpha )\) is feasible, the proof of Proposition 5 only relies on estimates conditioning on \(\mu (\alpha )\) being less biased than when not.

The indeterminate result from Proposition 5 is unsatisfying but there are still policy lessons embedded in it. For example, if the regulator is fairly certain they have identified all emitters then it is more likely that accounting for structural uncertainty leads to increased stringency on known emitters. The reason is the non-linearity in the bias when naively estimating marginal emissions. In the OLS case, for example, as \(\alpha \) approaches one, the marginal change in \(\alpha \) is constant but the marginal change in \(\frac{1}{\alpha }\) gets smaller. The converse is true for lower levels of \(\alpha \). Similarly if marginal profits are very steep, then \(x^*\approx x^*_{un}\), and accounting for structural uncertainty will lead to decreased stringency on known emitters.

Applying this model and the insights from Proposition 5 is relatively straightforward, but the correct functional relationships between pollution and damages, the general functional form on profits, and the non-linearities induced by biased estimation will all be context specific. This naturally lends itself to simulation based exercises to size the magnitude of the problem in different contexts. One example is bounding exercises where efficiency losses from not accounting for structural uncertainty are going to be largest versus smallest. This allows regulators to focus effort on use cases which have the highest expected value in terms of increased regulatory efficiency.

4 Discussion

While we have focused on upwardly biased transfer coefficient estimates over counting fugitive emissions could lead to downwardly biased estimates. In that situation, our results would be reversed. It is unclear if fugitive emissions are over or under counted in general. In the case of methane, for example, fugitive emissions were under counted (Miller et al. 2013). In other contexts like particulate matter, though, the opposite could be true. This possibility highlights the complexity associated with the general problem of structural uncertainty in the relationship between emissions and ambient levels of pollution. Complicating matters further, the regulator might face punitive legal or political costs in over or under regulation of regulated firms if they acknowledge scientific uncertainty.

There are also welfare implications of accounting for structural uncertainty in the pollution process. Increasing the accuracy of information assumptions in any economic model should, in expectation, increase total welfare of the solution to the regulator’s problem. Indeed, the change in welfare is equal to the social value of information. We note, though, that the social value of information is a function to the regulator’s response to it; in this way the social value of the discovery of new emission sources is a function of whether the regulator explicitly accounts for structural uncertainty.

Our work suggests there may be substantial efficiency gains to investing resources to help resolve and reduce structural uncertainty since we show it affects the optimal stringency of environmental regulation. Currently, budget allocations suggest that pollution control agencies passively react to scientific discoveries that alter the understanding of the pollution process. Our work can be used as the starting point to formally estimate the expected gains from R&D programs aimed at quantifying the total amount of known emissions, identifying unapportioned sources of a particular pollutant and reducing structural uncertainty. While greenhouse gases are an obvious candidate, a much smaller system with a well-understood physical model for translating emissions into ambient pollution levels might be a better test case. In particular, extensions of this model could include optimal experimentation to learn about structural uncertainty faced by the regulator.