The optimal harvesting problem under price uncertainty: the risk averse case

Pagnoncelli, Bernardo K.; Piazza, Adriana

doi:10.1007/s10479-015-1963-9

The optimal harvesting problem under price uncertainty: the risk averse case

Published: 27 August 2015

Volume 258, pages 479–502, (2017)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Annals of Operations Research Aims and scope Submit manuscript

The optimal harvesting problem under price uncertainty: the risk averse case

Download PDF

Bernardo K. Pagnoncelli¹ &
Adriana Piazza²

425 Accesses
8 Citations
Explore all metrics

Abstract

We study the exploitation of a one species, multiple stand forest plantation when timber price is governed by a stochastic process. Our model is a stochastic dynamic program with a weighted mean-risk objective function, and our main risk measure is the Conditional Value-at-Risk. We consider two stochastic processes, geometric Brownian motion and Ornstein–Uhlenbeck: in the first case, we completely characterize the optimal policy for all possible choices of the parameters while in the second, we provide sufficient conditions assuring that harvesting everything available is optimal. In both cases we solve the problem theoretically for every initial condition. We compare our results with the risk neutral framework and generalize our findings to any coherent risk measure that is affine on the current price.

A Population Harvesting Model with Time and Size Competition Dependence Function

Article 25 September 2022

Adaptive economic and ecological forest management under risk

Article Open access 24 February 2015

Stochastic Mitra–Wan forestry models analyzed as a mean field optimal control problem

Article 02 August 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The presence of uncertainty in decision making can be seen in most fields, and the need to incorporate stochastic elements has been acknowledged by several authors. In forestry models, the most common elements that are considered stochastic are timber prices, interest rates, growth processes, and forest fires. A variety of tools have been proposed to deal with these problems, including Markov decision processes (Lembersky and Johnson 1975), stochastic programming (Boychuk and Martell 1996), simulation (Kim et al. 2009; Carmel et al. 2009) and others. As demonstrated by Lönnstedt and Svensson (2000), forest owners are more concerned by direct economic risks than biological hazards so in this work we focus on price variations.

The majority of papers that deal with price uncertainty have risk neutral objective functions such as maximizing expected profit or minimizing expected costs. The literature of risk averse models in forestry is still scarce. In Gong and Löfgren (2003), the authors investigate the effect of risk aversion in a two-period harvest investment decision model and show that the optimal harvesting policy depends on the sign of a marginal variance function. In Alvarez and Koskela (2006), the authors show that higher risk aversion shortens the expected rotation period, and that increased forest value volatility decreases the optimal harvesting threshold, which is not true under risk neutrality.

The work Alvarez and Koskela (2007) is the first to consider risk aversion in stochastic ongoing rotation models. It is shown that under risk aversion the optimal harvesting threshold is lower, which translates into a shorter rotation period. The effect of risk aversion on the length of the rotation period is also studied in Gong and Löfgren (2008) using a model with regeneration costs. In this case, the risk averse rotation period can be either longer or shorter than the risk neutral one depending on these costs. Clarke and Harry (1989) find the rotation period when both stock size and price are stochastic. The single-rotation problem is solved theoretically as an optimal stopping problem and the authors propose a numerical iterative scheme to solve the ongoing-rotation problem. It is worth mentioning that all these papers consider single stand models.

The works Mosquera et al. (2011) and Tahvonen and Kallio (2006) consider multiple stand models in which risk aversion is represented via expected utility. Both papers study how the introduction of risk aversion changes the optimal policy by numerical resolution. In particular, Tahvonen and Kallio (2006) concludes that risk aversion completely changes the optimal harvesting policy, favoring smooth forest configurations and making harvesting dependent of price level and forest owner’s time preference and wealth. Also considering a multiple stand model but representing risk aversion via Conditional Value-at-Risk (CVaR), Piazza and Pagnoncelli (2015) prove analytically that risk aversion shortens the optimal rotation period and study how the variation of the model’s parameters affect the optimal harvesting policy.

The main goal of this paper is to study theoretically a risk averse formulation of a harvest scheduling problem for a multiple stands forest under stochastic timber prices. While the works of Brazee and Mendelsohn (1988), Thomson (1992) and Yoshimoto and Shoji (1998) assume normality and adopt a geometric Brownian motion (GBM) to represent the evolution of timber prices, others, like Alvarez and Koskela (2005), Gjolberg and Guttormsen (2002), argue that a mean reverting process, or Ornstein–Uhlenbeck (O–U) is a better description of the timber price path due to empirical data that has been collected for several species. It is not our intention to go any further into this discussion, and we refer the reader to Insley and Rollins (2005) for more details. As the issue seems far from being settled, and following Plantinga (1998) and Tahvonen and Kallio (2006), we decided to consider both stochastic processes in our work.

Due to an increase of the available computational power, multistage stochastic problems have gained significant attention recently, in areas such as finance (Blomvall and Shapiro 2006; Valladão et al. 2014), pension funds (Haneveld et al. 2010) and energy (Philpott et al. 2013), among others. We propose a multistage stochastic programming model in which the decision maker optimizes a weighted mean-risk objective, maximizing expected return while avoiding decisions that exhibit high variability. We build upon our previous work Piazza and Pagnoncelli (2014) and extend the results for the risk averse case. The dynamics and the assumptions of the model are the same as in our previous work, so we briefly describe the model in Sect. 2 for completeness.

The main contributions of the current paper are as follows. First, we completely characterize the optimal policy in the case where prices follow GBM and, for the O–U case, we find a sufficient condition for the optimality of harvesting every mature tree that can be translated into a reservation price. Second, we establish a connection between the risk-neutral and risk averse worlds, that is, we explain how the inclusion of risk affects the optimal harvest policy. We will work mostly with the CVaR, but closed form results are also derived for the Mean Deviation Risk of order 2. In particular, we extend our results for any risk measure that is affine on the current price.

There are very few benchmark problems in stochastic harvest scheduling, and to the best of our knowledge, our work is the first that finds closed form solutions when multiple stand forest models and stochastic prices are considered. Furthermore, coherent risk measures have been widely used in applications, and we believe this is one of the first papers in the literature to incorporate them into forestry.

The rest of the paper is as follows: In Sect. 2 we give a complete description of our model. In Sect. 3 we write dynamic programming equations for the model and discuss the introduction of risk measures in a dynamic framework. With the assumption that prices follow a GBM, we completely characterize the optimal policy in Sect. 4 and compare the results obtained with the risk neutral case. In Sect. 5 we assume prices follow an O–U process and obtain sufficient conditions involving the current price. In Sect. 6 we generalize some of our results for risk measures other than the CVaR. Section 7 concludes the paper and discuss possible extensions. All proofs can be found in the “Appendix”.

2 Model formulation

The forest growth model we use was introduced by Rapaport et al. (2003) and considers a one species forest plantation of total area S with maturity age of n years. In contrast with the case of wild forests, the state of a forest plantation may be described by specifying the areas occupied by trees of different ages, making the assumption that trees are planted within a pre-specified and constant distance of each other.

For each period $t\in {\mathbb {N}}$ we denote $x_{a,t}\ge 0$ the area of trees of age $a=1,\ldots ,n$ in year t, and $\bar{x}_t\ge 0$ the area occupied by trees beyond maturity (older than n). Using a single state variable to represent the over-mature trees conveys the underlying assumption that the growth of trees is negligible beyond maturity. Each period we must decide how much land $c_t\ge 0$ to harvest. Assuming that only mature trees can be harvested we must have

$$\begin{aligned} 0\le c_t\le \bar{x}_t+x_{n,t}, \end{aligned}$$

(1)

and then the area not harvested in that period will comprise the over-mature trees at the next step, namely, $\bar{x}_{t+1}=\bar{x}_t+x_{n,t}-c_t.$ We neglect natural mortality at every age, again an assumption valid in managed forest plantations but not in wild forests. Hence, the transition between age classes is given by $x_{a+1,t+1}=x_{a,t} \ \forall a=1,\ldots ,n-1.$ The harvested area is immediately allocated to new seedlings that will comprise the one year old trees in the following year: $x_{1,t+1}=c_t.$

We represent the state of the tree population by the vector state

$$\begin{aligned} {\mathbb {X}}=(\bar{x},x_n,\ldots , x_1)^T, \end{aligned}$$

and the dynamics described in the previous paragraph can be represented by

$$\begin{aligned} {\mathbb {X}}_{t+1}=A{\mathbb {X}}_t+Bc_t, \end{aligned}$$

(2)

where

$$\begin{aligned} A=\begin{pmatrix} 1&{}\quad 1 &{}\quad &{}\quad &{}\quad \\ &{}\quad &{}\quad 1 \\ {} &{}\quad &{}\quad &{}\quad \ddots \\ &{}\quad &{}\quad &{}\quad &{}\quad 1\\ 0 &{}\quad \dots &{}\quad \dots &{}\quad \dots &{}\quad 0 \end{pmatrix} \quad \text{ and } \quad B=\begin{pmatrix} -1\\ 0\\ \vdots \\ 0\\ 1\end{pmatrix}. \end{aligned}$$

The expression of constraint (1) in terms of the defined matrices is

$$\begin{aligned} 0\le c_t\le CA{\mathbb {X}}_t, \quad \text{ with } \quad C=\begin{pmatrix}1&\quad 0&\quad&\quad \dots&\quad \dots&\quad 0\end{pmatrix}. \end{aligned}$$

(3)

The order of events is the following: at every period t the decision maker observes the state of the forest and the current price and decides how much she will harvest, $c_t$, obtaining a benefit $p_tc_t$. Her decision changes the current state of the system from ${\mathbb {X}}_t$ to ${\mathbb {X}}_{t+1}$ according to (2). Then timber price $p_{t+1}$ is observed, leaving her in position of deciding the following harvest $c_{t+1}$.

3 Weighted mean-risk formulation

Decision makers usually want to maximize expected returns while avoiding risk. To achieve this goal we propose a weighted mean-risk objective function ${\mathcal {R}}$ given by

$$\begin{aligned} {\mathcal {R}}[X] := \lambda {\mathbb {E}}[X] + (1-\lambda ){\mathbb {\rho }}[X], \end{aligned}$$

(4)

where ${\mathbb {\rho }}$ is some risk measure, X represents losses and $\lambda \in [0,1]$ balances the two objectives. In Piazza and Pagnoncelli (2014) the authors find the optimal policy for the case $\lambda = 1$.

The variance is a classical risk measure, and it dates back to the work Markowitz (1952). It has been used extensively in many fields and it is a popular risk measure in forestry (Reeves and Haight 2000; Gong 1998; Gong and Löfgren 2003; Ollikainen 1993). Another popular measure is the Value-at-Risk (VaR), defined by

$$\begin{aligned} {\text {VaR}}_{\alpha }[X] = \inf \{x:{\mathbb {P}}(X\le x) \ge 1-\alpha \}, \quad \alpha \in (0,1). \end{aligned}$$

The VaR is the left side $(1-\alpha )$ quantile of the distribution of X and it is also used in forestry (Zhong-wei and Yan 2009; Roessiger et al. 2011).

3.1 The Conditional Value-at-Risk

A risk measure that gained increasing popularity since the publication of Rockafellar and Uryasev (2000) is the Conditional Value-at-Risk (CVaR). For a continuous random variable X, the CVaR is defined as the average of losses above the VaR level:

$$\begin{aligned} {\text {CVaR}}_{\alpha }[X] = {\mathbb {E}}[X | X > {\text {VaR}}_{\alpha }[X]] = \inf \left\{ t\in {\mathbb {R}} : t + \alpha ^{-1}{\mathbb {E}}[X-t]_+\right\} . \end{aligned}$$

(5)

In the seminal work of Artzner et al. (1999), the authors defined a set of axioms that a risk measure must satisfy in order to be called coherent. While the CVaR satisfies all those properties, nor the variance or the VaR are coherent. We are going to use the CVaR extensively throughout our paper since it will be our prototype example of risk measure.

The majority of papers in forestry use expected utility to model risk aversion. Although risk measures and expected utility criteria are related through dual representations of the risk functional (see example 6.14 of Shapiro et al. 2009), both have its own advantages and drawbacks. Our goal here is to apply risk measures to forestry and a comparison of both approaches is out of the scope of this paper.

3.2 Dynamic programming formulation

In this paper we deal with risk averse stochastic dynamic problems and, in order to write dynamic programming equations, we need to extend the concept of conditional expectation to accommodate arbitrary risk measures. If X and Y are random variables, we can consider the value of ${\mathcal {R}}$ at the conditional distribution of X given $Y=y$, which we denote as ${\mathcal {R}}_{|Y}[X]$ and refer to as a conditional risk mapping (Ruszczynski and Shapiro 2005). Following Shapiro (2009), we consider a nested risk averse formulation based on conditional risk mappings:

$$\begin{aligned} \left\{ \begin{array}{l} V_{0}({\mathbb {X}}_{0},p_{0})=\mathop {\mathrm{Min}}_{c_{0}}\Big \{ - p_{0}c_{0} + \delta {\mathcal {R}}_{|p_{0}}\Big [ \mathop {\mathrm{Min}}_{c_{1}} -p_{1}c_{1} + \delta {\mathcal {R}}_{|p_1}\Big [\cdots \Big ]\Big ]\\ \text{ s.t. }\quad (2) \text{ and } (3), \end{array}\right\} \end{aligned}$$

(6)

where ${\mathcal {R}}_{|p_t}$ is a conditional risk mapping for $t\in {\mathcal {T}}$ and $\delta \in (0,1)$ is the discount factor. In this paper we will work with ${\mathcal {R}}_{|p_t}$ of the form (4). Throughout the paper we will consider two different types of sets ${\mathcal {T}}$: (i) ${\mathcal {T}}=\{1,\ldots , T\}$ for the finite horizon case, where T is the time horizon of the problem, and, (ii) ${\mathcal {T}}={\mathbb {N}}$ for the infinite horizon case, with particular attention to the convergence properties of the objective function of (6). Details can be found in Ruszczynski and Shapiro (2005), where the authors show that the corresponding dynamic programming equations are,

$$\begin{aligned} V_{t}({\mathbb {X}}_{t},p_{t}) = \mathop {\mathrm{Min}}_{c_t\in [0,CA{\mathbb {X}}_t]}\Big \{ - p_tc_t + \delta {\mathcal {R}}_{|p_t}\left[ V_{t+1}(A{\mathbb {X}}_t+Bc_t,p_{t+1})\right] \Big \}, \end{aligned}$$

(7)

for all $t\in {\mathcal {T}}$ and in the finite horizon case $V_{T}({\mathbb {X}}_{T},p_{T})= -p(T)c^*(T)$, with $c^*(T)=CA{\mathbb {X}}(T)$ which corresponds to harvest the maximum possible amount.

If the conditional risk mappings are positive homogeneous (see Shapiro 2009), it is easy to check that if

$$\begin{aligned} -p_{T-1}-\delta {\mathcal {R}}_{|p_{T-1}}[-p_T]\le 0, \end{aligned}$$

(8)

the optimal solution for $t=T-1$ is $c^*_{T-1}=CA{\mathbb {X}}_{T-1}$.^{Footnote 1}

Remark 1

In the general case, the determination of the optimal policy for an arbitrary t is more difficult, as we do not know future policies. However, we will see that in other time stages, conditions analogous to (8), i.e.,

$$\begin{aligned} -p_t- \delta {\mathcal {R}}_{|p_t}[-p_{t+1}]\le 0\quad \text{ with } \quad t\in {\mathcal {T}}, \end{aligned}$$

(9)

play a fundamental role in the characterization of the optimal policy in this article. Observe that these conditions do not consider the entire future, but only the price of timber one period ahead of time.

4 Geometric Brownian motion

We first study problem (6) when $\rho = {\text {CVaR}}_\alpha $ and the dynamics of prices follow a geometric Brownian motion (GBM). This dynamics has been extensively used to model asset prices in financial markets and therefore represents a natural choice for timber prices. It is a well known process, hence, we define it without further detail,

(10)

where $\mu \in {\mathbb {R}}$ is the drift of the GBM, $\sigma > 0$ is the constant variance and $W_t$ denotes the Wiener process.

For the GBM, the price $p_{t+1}$ conditional on $p_t$ follows a lognormal distribution, and it is possible to compute ${\text {CVaR}}_{\alpha |p_t}[-p_{t+1}]$ explicitly:

$$\begin{aligned} {\text {CVaR}}_{\alpha |p_t}[-p_{t+1}] = -p_t\frac{e^{\mu }}{\alpha }\varPhi (z_{1-\alpha }-\sigma ), \end{aligned}$$

where $\varPhi $ is the cumulative distribution function of the normal random variable with mean 0 and variance 1 and $z_{\alpha } = \varPhi ^{-1}(1-\alpha )$ (see Theorem 6.2 of Shapiro et al. 2009).

It is easy to see that (9) is equivalent to

$$\begin{aligned} p_t[-1+\lambda \delta e^\mu + (1-\lambda ) \delta e^\mu \kappa ] \le 0,\quad \text{ with } \; 0<\kappa =\frac{1}{\alpha }\varPhi (z_{1-\alpha }-\sigma )<1. \end{aligned}$$

(11)

Therefore, condition (9) is equivalent to

$$\begin{aligned} \delta e^\mu [\lambda +(1-\lambda )\kappa ]\le 1, \end{aligned}$$

that depends only on the parameters of the problem ($\delta , \lambda , \alpha $) and those of the price process ($ \mu , \sigma $). This implies that when prices follow a GBM, condition (9) is satisfied for all $p_t$ or for none.

However, throughout this section we work with the strict inequality version of the condition above

$$\begin{aligned} \delta e^\mu [\lambda +(1-\lambda )\kappa ]< 1, \end{aligned}$$

(12)

to assure convergence of the objective function in the infinite horizon case (see the proof of Lemma 1).

We are now in condition to state the main result of this section, the characterization of the greedy policy, i.e., $c^*_t=CA{\mathbb {X}}_t$ for all t, when prices follow a GBM. The theorem encompasses both finite and infinite horizon cases, and we present a brief technical lemma showing that the value function is well defined in the latter case.

Lemma 1

Consider problem (6) with ${\mathcal {T}}={\mathbb {N}}$ and ${\mathcal {R}}=\lambda \mathbb E +(1-\lambda )\text{ CVaR }_{\alpha }$ assume prices evolve according to (10). If Condition (12) holds, the value function $V_0$ is well defined.

Theorem 1

Consider problem (6) with ${\mathcal {R}}=\lambda \mathbb E +(1-\lambda )\text{ CVaR }_{\alpha }$ and assume prices evolve according to (10). If condition (12) holds, the greedy policy is optimal.

4.1 Discussion and insights

In this section we compare the optimality conditions for the risk neutral and risk averse cases. In Piazza and Pagnoncelli (2014), it is shown that in the risk neutral setting under GBM, the greedy policy is optimal if and only if

$$\begin{aligned} \delta e^{\mu } { < } 1. \end{aligned}$$

(13)

Besides the obvious observation that when $\lambda =1$ the risk neutral case is retrieved, other interesting interpretations of condition (12) can be found. Since $\lambda $ and $\kappa $ belong to [0, 1], we have the intuitive result that whenever the greedy policy is optimal for the risk neutral case it will be automatically optimal for the risk averse case for the same choice of parameters.

In Sect. 4.3 we show how the solution changes with $\lambda $; we now concentrate in the effect of the other parameters. In Fig. 1 we can see that for values of $\sigma $ close to zero, that is, in the case where GBM is less volatile, the constant $\kappa $ is close to one and conditions (12) and (13) become essentially equivalent. For larger values of $\sigma $, that is, for higher volatility, we observe that $\kappa $ is closer to zero and, unless the drift $\mu $ is sufficiently high, the optimal policy for a risk averse decision maker will be greedy.

Let us now fix the value of $\sigma $ and turn our attention to the role of the risk aversion parameter $\alpha $. In Fig. 1, as we increase $\alpha $, that is, as the decision maker becomes less risk averse, the multiplicative factor $\kappa $ increases. In the limiting case, where $\alpha =1$, it can be seen from (11) that $\kappa =1$ and the risk neutral policy is retrieved.

It may seem that the consideration of both parameters $\alpha $ and $\lambda $ is redundant as they both are related to the level of risk aversion. This idea is reinforced by the fact that extreme values of $\alpha $ and $\lambda $ produce the same result, i.e., both $\alpha =0$ and $\lambda =1$ yield the risk neutral solution. But, they are conceptually different. Indeed, the value of $\alpha $ establishes which fraction of the worst possible outcomes are taken into consideration, while $\lambda $ assigns the weight these outcomes will have in the objective function. Besides, for other risk measures the only parameter controlling risk aversion is $\lambda $.

4.2 Another optimal policy

We show that Condition (9) is tight for the optimality of the greedy policy in the sense that if it does not hold then the optimal policy is not greedy. If (9) does not hold, not only Lemma 1 does not apply, but we can prove that the value function is not defined whenever ${\mathcal {T}}={\mathbb {N}}$.^{Footnote 2} Hence, we only consider ${\mathcal {T}}=\{0,\ldots ,T\}$. It is natural to think that the decision maker should postpone the harvest as much as possible. Hence, before the final time T, harvesting should be stopped altogether in order to have the maximum surface available at time T. However, observe that every land plot harvested and planted n or more time steps before T will contain mature trees available for harvesting at time T. Hence, it is convenient to harvest every mature tree at time $T-n$, since there is enough time for seedlings to mature before reaching T. Repeating this reasoning we can conjecture that the only time steps when harvesting is allowed are $T-kn$ for $k=0,1,\ldots , \lfloor T/n\rfloor $ and that everything available then should be harvested. This is to say,

$$\begin{aligned} c_t=\left\{ \begin{array}{ll} CA{\mathbb {X}}_t &{}\quad \text{ if } \; t=T-kn\\ 0&{}\quad \text{ else }. \end{array} \right. \end{aligned}$$

We call this harvesting policy the accumulating policy (AP), and we have the following result.

Theorem 2

Consider problem (6) with ${\mathcal {R}}=\lambda \mathbb E +(1-\lambda )\text{ CVaR }_{\alpha }$ and assume prices evolve according to (10). If condition (12) does not hold, then the AP is optimal in the finite horizon case.

4.3 Efficient frontier

Following Markowitz (1952) and Zheng (2009), we construct an efficient frontier by varying the value of $\lambda $ when $\rho ={\text {CVaR}}$. For $\lambda \in [0,1]$, we calculate the total value at time $t=0$ under the greedy and accumulating policies. From Theorem 1 we know that the greedy policy will be optimal for $\delta e^\mu [\lambda +(1-\lambda )\kappa ]$ less than one, and, beyond that value, Theorem 2 indicates that the accumulating policy is optimal.

For a value of $\lambda $ in the y axis, Fig. 2 shows the negative of the total value at time $t=0$ under the greedy policy (continuous line) and the accumulated policy (dashed line) in the x-axis. The figure is generated with parameters $\mu =0.2, \sigma =0.4, p(0)=1, \delta =0.95, \alpha =0.1$ and with $T=20$ periods for the initial state ${\mathbb {X}}(0) = (0,0,0,1)$. With these values, condition (12) gives a value of $\lambda \le 0.742 $. Observe that the two lines intersect and the change of optimal policies occurs for $\lambda $ between 0.74 and 0.75, as we predicted.

5 Ornstein–Uhlenbeck process

We now study problem (6) when prices follow a mean-reverting Ornstein–Uhlenbeck process (O–U) and focus on $\rho = {\text {CVaR}}$. It is well-know that GBM does not capture some behaviors of price movement such as mean-reversion, which is best emulated by the O–U process. Since O–U is a well-known process, we define it without further detail.

$$\begin{aligned} dp_t&= \eta (\bar{p} - p_t) dt + \sigma dW_t \quad&({\text {O--U}}), \end{aligned}$$

(14)

where $\eta > 0$ is the rate of mean-reversion to an equilibrium $\bar{p}$, $\sigma > 0$ is the constant variance and $W_t$ denotes the Wiener process. A closed expression for the value of the price $p_{t+1}$ given the price $p_t$ can be written as follows:

$$\begin{aligned} p_{t+1}=e^{-\eta } p_t + (1-e^{-\eta })\bar{p} + \int _t^{t+1}\sigma e^{\eta (s-(t+1))} dW_s. \end{aligned}$$

In an O–U process, it can be shown (see for example Maller et al. 2009) that at every time period t the random variable $p_t$ follows a normal distribution^{Footnote 3} with conditional mean ${\mathbb {E}}_{|p_t}[p_{t+1}] = \bar{p} + (p_t - \bar{p})e^{-\eta },$ and conditional variance given by ${\text {Variance}}_{|p_t}[p_{t+1}] = \frac{\sigma ^{2}}{2\eta }(1-e^{-2\eta }).$

If X follows a normal distribution with mean $\mu $ and variance $\sigma $, the $\text{ CVaR }_{\alpha }[X]$ can be explicitly computed:

$$\begin{aligned} \text{ CVaR }_{\alpha }[X] = \mu + \frac{\sigma }{\alpha \sqrt{2\pi }}e^{-z^2_{\alpha }/2}, \end{aligned}$$

(15)

where $z_{\alpha } = \varPhi ^{-1}(1-\alpha )$ and $\varPhi $ is the cumulative distribution function of the normal random variable with mean 0 and variance 1 (see Theorem 6.2, Shapiro et al. 2009).

Using (15) we can calculate the CVaR at each time period as follows:

$$\begin{aligned} {\text {CVaR}}_{\alpha |p_t}[-p_{t+1}] = - p_te^{-\eta } - \bar{p}(1-e^{-\eta })+ \frac{\sigma }{\sqrt{2\pi }} \sqrt{\frac{1-e^{-2\eta }}{2\eta }}\frac{e^{-z_{\alpha }^2/2}}{\alpha }. \end{aligned}$$

Using that $\delta e^{-\eta }\in (0,1)$, condition (9) can be expressed as

$$\begin{aligned} p(t)\ge \frac{\delta }{1-\delta e^{-\eta }}\left[ \bar{p}(1-e^{-\eta })- \frac{(1-\lambda )\,\sigma }{\sqrt{2\pi }} \sqrt{\frac{1-e^{-2\eta }}{2\eta }}\frac{e^{-z_{\alpha }^2/2}}{\alpha }\right] , \end{aligned}$$

(16)

where the right hand side can be interpreted as a reservation price, i.e., a price threshold above which it is optimal to harvest everything available.

In the previous section, when prices follow a GBM, we have condition (12) that does not depend on $p_t$ and assures the satisfaction of (9) for all t. We do not have anything similar when prices follow an O–U process, as we see that condition (16) depends on $p_t$. Hence, when solving Bellman equation (7) for t, we have no information of what may happen at $t+1$ or after and we have to consider every possible situation.

However, we will show in Theorem 3 that condition (16) is sufficient to assure that the greedy policy is optimal at t, i.e., $c_t=CA{\mathbb {X}}_t$. We must stress here that (16) is not necessary and that we do not have any information about what the optimal policy is when it does not hold.

To simplify notation we name

$$\begin{aligned} a=e^{-\eta } \hbox { and } b= \bar{p}(1-e^{-\eta })-\frac{\sigma (1-\lambda )}{\sqrt{2\pi }} \sqrt{\frac{1-e^{-2\eta }}{2\eta }}\frac{e^{-z_{\alpha }^2/2}}{\alpha }, \end{aligned}$$

(17)

and hence ${\mathcal {R}}_{|p_t}[-p_{t+1}] = -ap_t-b$. The idea of the proof consists in showing that the coefficient of c in (7) is composed of terms of the form

$$\begin{aligned} \varDelta ^{m_i}_{j_i}(p_t)=\delta ^{m_i}\left\{ -p_ta^{m_i}(1-\delta ^{j_i}a^{j_i})-\frac{b}{1-a}\Big [1-\delta ^{j_i}-a^{m_i}(1-\delta ^{j_i}a^{j_i})\Big ]\right\} \end{aligned}$$

(18)

for some values of $m_i$ and $j_i$ such that $j_i\in \{0,\ldots ,n-1\}$.

Instead of checking the sign of $\varDelta ^m_j(p_t)$ for every value of m and j, we use the equivalence

$$\begin{aligned} \varDelta ^m_j(p_t) \le 0\iff p_t \ge \frac{b}{1-a}\Big [1-\frac{1-\delta ^j}{a^m(1-\delta ^ja^j)} \Big ] \end{aligned}$$

(19)

and prove in the next lemma that $\varDelta ^m_j(p_t)\le 0$ when condition (16) holds.

Lemma 2

If $a\in (0,1)$ and $b\ge 0$, condition (16) implies that

$$\begin{aligned} \varDelta ^m_j(p_t) \le 0,\quad \hbox {for all }\; m\; \hbox { and }\; j \in {\mathbb {N}}. \end{aligned}$$

When price follows an O–U process, we have that $a<1$, but for some values of the parameters we could have $b<0$. Hence, the second hypothesis of Lemma 2 has to be explicitly required in the statement of the main result of this section.

Lemma 3

Consider problem (6) with ${\mathcal {T}}={\mathbb {N}}$ with ${\mathcal {R}}=\lambda \mathbb E +(1-\lambda )\text{ CVaR }_{\alpha }$ and assume prices evolve according to (14). The value function $V_0$ is well defined.

Theorem 3

Consider problem (6) with ${\mathcal {R}}=\lambda \mathbb E +(1-\lambda )\text{ CVaR }_{\alpha }$ and assume prices follow (14). If $b\ge 0$ and condition (16) holds, then $c_t=CA{\mathbb {X}}_t$ is optimal at time t.

The right-hand side of (16) can be used as a reservation price, as we know that if $p_t$ is above that value it is optimal to harvest every mature tree. The condition is sufficient but not necessary: if (16) does not hold, we are not able to discard the greedy policy.

5.1 Discussion and insights

In Piazza and Pagnoncelli (2014) the risk neutral version of this problem was studied, and the following reservation price was found

$$\begin{aligned} p_r= \frac{\delta }{1-\delta e^{-\eta }}\,\bar{p} \,(1-e^{-\eta }), \end{aligned}$$

which is exactly the positive term of the right hand side of (16). Therefore, for every value of the parameters $\lambda , \eta ,\alpha $ and $\sigma $, the greedy policy is more likely to be optimal in the risk averse framework than in the risk neutral one.

If we choose $\alpha =1$ in the calculation of the CVaR we obtain the usual expected value operator. Indeed, observe that the term $e^{-z_{\alpha }^2/2}/\alpha $ in (16) goes to zero when $\alpha $ approaches 1, implying that resulting reservation price would be $p_r$, retrieving the solution for the risk neutral harvesting problem.

It is straightforward to see that if the volatility $\sigma $ goes to zero we have that the reservation price also coincides with $p_r$. This result is rather intuitive and it mimics what we obtained for the GBM case: if the volatility is small the sufficient conditions for the optimality of the greedy policy are essentially the same under risk neutrality and risk averseness. However, when the volatility $\sigma $ increases the two cases are significantly different.

When $\eta $ goes to infinity, b goes to $\bar{p}$ and a goes to zero. The reservation price simplifies to $\delta \bar{p}$. In this case one can expect that the optimal policy will be greedy for any t since the speed of the mean reversion is so high that the price is essentially equal to $\bar{p}$ for all time periods and therefore greater than the reservation price $\delta \bar{p}$. This is consistent with the result for constant deterministic prices presented in Rapaport et al. (2003) that establishes that the greedy policy is optimal for all time periods.

In the right hand side of (16), the term $\sqrt{\frac{1-e^{-2\eta }}{2\eta }}$ is strictly decreasing with $\eta $ and it lies between zero and one. When the speed of the mean reversion $\eta $ vanishes, the whole expression converges to 1. Since the first term in the expression of b goes to zero as $\eta $ approaches zero, we have in this case that $b=\frac{-\sigma (1-\lambda )}{\sqrt{2\pi }\alpha }e^{-z^2_\alpha /2}<0$. It is interesting to note that in this case Theorem 3 does not apply. We deal with this particular situation in the next section.

6 Extension to affine weighted mean-risk measures

It is worth investigating whether the results of the previous section can be generalized for other values of a and b. This would eliminate the hypothesis $b\ge 0$ of Theorem 3, and, more importantly, it would allow the extension of the results to other price processes and to risk measures other than $\text{ CVaR }_{\alpha }$.

Table 1 Parameters regions

Full size table

The proof of Theorem 3 relies heavily in the fact that

$$\begin{aligned} {\mathcal {R}}_{|p_t}[-p_{t+1}]=-ap_t-b \end{aligned}$$

and is valid for $(a,b)\in (0,1)\times {\mathbb {R}}_+$. In the following we study the extension of this theorem to $(a,b)\in (0,1/\delta )\times {\mathbb {R}}$. We start with the generalized version of Lemma 3.

Lemma 4

Consider problem (6) with ${\mathcal {T}}={\mathbb {N}}$ and assume that ${\mathcal {R}}_{|p_t}[-p_{t+1}]=-ap_t-b.$ The value function $V_0(\cdot ,\cdot )$ is well defined if $a\in (0,1/\delta )$.

We divide the semi-plane of parameters in five regions as shown in Table 1 and study the variation of the right-hand side of (19) in each of these regions. The following theorem summarizes the sufficient conditions for the optimality of the greedy policy, that can be found with this method.

Theorem 4

Consider problem (6) and assume ${\mathcal {R}}_{|p_t}[-p_{t+1}]=-ap_t-b$.

1.
In region (i), $p_t\ge b\delta /(1-\delta a)$ is sufficient to assure $c^*(t)=CA{\mathbb {X}}_t$.
2.
In region (ii), $p_t\ge b\delta /(1-\delta a)$ is sufficient to assure $c^*(t)=CA{\mathbb {X}}_t$.
3.
In region (iii), no sufficient condition assuring $c^*(t)=CA{\mathbb {X}}_t$ is found if ${\mathcal {T}}={\mathbb {N}}$. In the finite horizon case $p_t\ge \frac{b}{1-a}\Big [1-\frac{1-\delta }{a^{T-t}(1-\delta a)}\Big ]$ is sufficient.
4.
In region (iv), $p_t\ge b/(1-a)$ is sufficient to assure $c^*(t)=CA{\mathbb {X}}_t$.

Remark 2

Although regions (i) and (ii) deliver the same sufficient condition, we decided to keep them separated as case (i) is exactly Theorem 3. The theorem does not provide information about the region (iii) when ${\mathcal {T}}={\mathbb {N}}$, because the method of proof used delivers the condition $p_t=\infty $. However, this method can be applied in the finite horizon case to retrieve a meaningful condition. Concerning region (v), not only Lemma 4 does not apply, but it is possible to prove that the objective function is not well defined.^{Footnote 4}

Theorem 1 is a corollary of Theorem 4. Indeed, when prices follow a GBM and we use ${\mathcal {R}}=\lambda {\mathbb {E}}+(1-\lambda )\text{ CVaR }$, we have $a=\lambda e^\mu +(1-\lambda )e^\mu \kappa $ and $b=0$. When $b=0$, by simple inspection we see that conditions (i) and (ii) reduce to $p_t\ge 0$, that is always satisfied. Furthermore, $a<1/\delta $ is equivalent to Condition (12).

Corollary 1

If $b=0$ and $p_t\ge 0$ for all t (as is the case for a GBM price process) and $0<a<1/\delta $, the greedy policy is always optimal.

In the next subsection we compute explicit values of a and b for another risk measure, showing the relevance of Theorem 4.

6.1 Mean deviation risk

Another important risk measure is the Mean Deviation Risk (MDR) of order p:

$$\begin{aligned} {\text {MDR}}[X] := {\mathbb {E}}[X] + c{\mathbb {E}}\left[ |X - {\mathbb {E}}[X] |^p \right] )^{1/p}, \end{aligned}$$

where $p\in [1,+\infty )$ and $c>0$. The MDR has similarities with the expression of the variance, and under mild hypothesis, e.g, nonatomic probability measures, is a coherent risk measure for $p>1, c\ge 0$, and $p=1,c \in [0,1/2]$ (Shapiro et al. 2009). For GBM and O–U we have respectively:

$$\begin{aligned} {\text {MDR}}_{|p_t}(-p_{t+1})= & {} -p_t e^{\mu } - c p_t e^\mu \left( {\mathbb {E}}_{}\left[ \left| e^{-\frac{\sigma ^2}{2}+\sigma W}-1\right| ^p\right] \right) ^{1/p}, \\ {\text {MDR}}_{|p_t}(-p_{t+1})= & {} -p_te^{-\eta } - \bar{p}(1-e^{-\eta }) + c \left( {\mathbb {E}}\left[ \left| \int _t^{t+1} \sigma e^{\eta (s-(t+1))}dW(s) \right| ^p \right] \right) ^{1/p}. \end{aligned}$$

The MDR is much less tractable than the CVaR. However, for the particular case of $p=2$ and a O–U process, we are able to obtain explicit expressions using Itô’s isometry^{Footnote 5}:

$$\begin{aligned} {\text {MDR}}_{|p_t}(-p({t+1})) = -p_te^{-\eta } - \bar{p}(1-e^{-\eta }) + c\sigma \left( \frac{1}{2\eta } - \frac{e^{-2\eta }}{2\eta } \right) ^{1/2}. \end{aligned}$$

Therefore, when the MDR of order 2 is used as a weighted risk objective as in (4), it is affine in $p_t$ for O–U and the coefficients are

$$\begin{aligned} a=e^{-\eta },\quad b= \bar{p}(1-e^{-\eta }) - c \sigma \left( \frac{1 - e^{-2\eta }}{2\eta } \right) ^{1/2}. \end{aligned}$$

7 Conclusions

We study a harvest scheduling problem under price uncertainty. We depart from the usual risk neutral framework and incorporate weighted mean-risk measures in the objective function. Considering both the finite and infinite time horizon frameworks, we obtain conditions on the parameters of the model that characterize the optimal policy.

Our theoretical results characterize completely the optimal policy for the GBM case and provide a closed expression of a reservation price for the O–U case. We focus first on the Conditional Value-at-Risk, and later extend our results for any affine risk measure. We also prove that the Mean-Deviation Risk is affine so our results apply directly to this case.

Future work includes considering a more complex forest growth model. For example, taking into account natural mortality needs only a very simple modification in the matrix that defines the growth dynamics. However, proofs may need a significant adaptation.

Considering multi-species forests would allow us to compare our results with an important part of the present literature. The traditional paradigm indicates that a risk averse decision maker would favor a more homogeneous land allocation, establishing more ecologically friendly forests. Indeed, through numerical experiments, Hildebrandt et al. (2010), Knoke et al. (2005) and Roessiger et al. (2011) show that higher risk aversion implies higher mixture of species.

Notes

When $-p_{T-1}-\delta {\mathcal {R}}_{|p_{T-1}}[-p_T]=0$ the optimum of (7) is reached for any $c\in [0,CA{\mathbb {X}}_{T-1}]$, hence, for this particular value we may adopt the convention $c^*_{T-1}=CA{\mathbb {X}}_{T-1}$.
Details available from the authors upon request.
Even though the arithmetic O–U can lead to negative values, the process is frequently used to model the evolution of prices (see, for example, Alvarez and Koskela 2005; Gjolberg and Guttormsen 2002). The discussion of which process best represents timber prices is far from being settled. We refer the reader to Dixit and Pindyck (1994), Insley and Rollins (2005) and the references therein for more information.
Details available from the authors upon request.
The calculations are done in the “Appendix” of this manuscript.
We assume that at every step from $t+1$ onwards, we either harvest nothing at all or everything available. Due to the linearity of the forestry model, this assumption is equivalent to requiring that the coefficient of c in (7) is never zero, but having a zero coefficient is an event with zero probability.
In the particular case that $1-\delta a=0$ we observe that $\varDelta ^m_j(p_t)$ does not depend of $p_t$ and that Condition (28) can be verified a priori. The study of this particular case is straightforward and we omit it.
Although, regions (i) and (ii) yield the same sufficient condition, we keep them as independent regions because the proof we present needs to separate the cases $a\le 1$ and $a>1$, and because this theorem applied in region (i) corresponds to Theorem 3 (see Remark 2).

References

Alvarez, L., & Koskela, E. (2005). Wicksellian theory of forest rotation under interest rate variability. Journal of Economic Dynamics and Control, 29(3), 529–545.
Article Google Scholar
Alvarez, L., & Koskela, E. (2006). Does risk aversion accelerate optimal forest rotation under uncertainty? Journal of Forest Economics, 12(3), 171–184.
Article Google Scholar
Alvarez, L., & Koskela, E. (2007). Taxation and rotation age under stochastic forest stand value. Journal of Environmental Economics and Management, 54, 113–127.
Article Google Scholar
Artzner, P., Delbaen, F., Eber, J., & Heath, D. (1999). Coherent measures of risk. Mathematical Finance, 9(3), 203–228.
Article Google Scholar
Blomvall, J., & Shapiro, A. (2006). Solving multistage asset investment problems by the sample average approximation method. Mathematical Programming, 108(2–3), 571–595.
Article Google Scholar
Boychuk, D., & Martell, D. (1996). A multistage stochastic programming model for sustainable forest-level timber supply under risk of fire. Forest Science, 42(1), 10–26.
Google Scholar
Brazee, R., & Mendelsohn, R. (1988). Timber harvesting with fluctuating prices. Forest Science, 34(2), 359–372.
Google Scholar
Carmel, Y., Paz, S., Jahashan, F., & Shoshany, M. (2009). Assessing fire risk using monte carlo simulations of fire spread. Forest Ecology and Management, 257(1), 370–377.
Article Google Scholar
Clarke, J., & Harry, R. (1989). The tree-cutting problem in a stochastic environment: The case of age-dependent growth. Journal of Economic Dynamics and Control, 13(4), 569–595.
Article Google Scholar
Dixit, A., & Pindyck, R. (1994). Investment under uncertainty. Princeton: Princeton University Press.
Google Scholar
Gjolberg, O., & Guttormsen, A. (2002). Real options in the forest: What if prices are mean-reverting? Forest Policy and Economics, 4(1), 13–20.
Article Google Scholar
Gong, P. (1998). Risk preferences and adaptive harvest policies for even-aged stand management. Forest Science, 44(4), 496–506.
Google Scholar
Gong, P., & Löfgren, K. (2003). Risk-aversion and the short-run supply of timber. Forest Science, 49(5), 647–656.
Google Scholar
Gong, P., & Löfgren, K. (2008). Impact of risk aversion on the optimal rotation with stochastic price. Natural Resource Modeling, 21(3), 385–415.
Article Google Scholar
Haneveld, W. K. K., Streutker, M. H., & van der Vlerk, M. H. (2010). An alm model for pension funds using integrated chance constraints. Annals of Operations Research, 177(1), 47–62.
Article Google Scholar
Hildebrandt, P., Kirchlechner, P., Hahn, A., Knoke, T., & Mujica, R. (2010). Mixed species plantations in Southern Chile and the risk of timber price fluctuation. European Journal of Forest Research, 129(5), 935–946.
Article Google Scholar
Insley, M., & Rollins, K. (2005). On solving the multirotational timber harvesting problem with stochastic prices: A linear complementarity formulation. American Journal of Agricultural Economics, 87, 735–755.
Article Google Scholar
Kim, Y. H., Bettinger, P., & Finney, M. (2009). Spatial optimization of the pattern of fuel management activities and subsequent effects on simulated wildfires. European Journal of Operational Research, 197(1), 253–265.
Article Google Scholar
Knoke, T., Stimm, B., Ammer, C., & Moog, M. (2005). Mixed forests reconsidered: A forest economics contribution on an ecological concept. Forest Ecology and Management, 213(1), 102–116.
Article Google Scholar
Lembersky, M., & Johnson, K. (1975). Optimal policies for managed stands: An infinite horizon Markov decision process approach. Forest Science, 21(2), 109–122.
Google Scholar
Lönnstedt, L., & Svensson, J. (2000). Non-industrial private forest owners’ risk preferences. Scandinavian Journal of Forest Research, 15(6), 651–660.
Article Google Scholar
Maller, R., Müller, G., & Szimayer, A. (2009). Ornstein–Uhlenbeck processes and extensions. Handbook of financial time series (pp. 421–438). Berlin, Heidelberg: Springer.
Chapter Google Scholar
Markowitz, H. (1952). Portfolio selection. The Journal of Finance, 7(1), 77–91.
Google Scholar
Mosquera, J., Henig, M., & Weintraub, A. (2011). Design of insurance contracts using stochastic programming in forestry planning. Annals of Operations Research, 190(1), 117–130.
Article Google Scholar
Ollikainen, M. (1993). A mean-variance approach to short-term timber selling and forest taxation under multiple sources of uncertainty. Canadian Journal of Forest Research, 23(4), 573–581.
Article Google Scholar
Philpott, A., de Matos, V., & Finardi, E. (2013). On solving multistage stochastic programs with coherent risk measures. Operations Research, 61(4), 957–970.
Article Google Scholar
Piazza, A., & Pagnoncelli, B. (2014). The optimal harvesting problem with price uncertainty. Annals of Operations Research, 217(1), 425–445.
Piazza, A., & Pagnoncelli, B. K. (2015). The stochastic mitra-wan forestry model: Risk neutral and risk averse cases. Journal of Economics, 115, 175–194.
Plantinga, A. (1998). The optimal timber rotation: An option value approach. Forest Science, 44(2), 192–202.
Google Scholar
Rapaport, A., Sraidi, S., & Terreaux, J. (2003). Optimality of greedy and sustainable policies in the management of renewable resources. Optimal Control Applications and Methods, 24(1), 23–44.
Article Google Scholar
Reeves, L., & Haight, R. (2000). Timber harvest scheduling with price uncertainty using markowitz portfolio optimization. Annals of Operations Research, 95(1–4), 229–250.
Article Google Scholar
Rockafellar, R., & Uryasev, S. (2000). Optimization of Conditional Value-at-Risk. Journal of Risk, 2, 21–42.
Article Google Scholar
Roessiger, J., Griess, V., & Knoke, T. (2011). May risk aversion lead to near-natural forestry? A simulation study. Forestry, 84(5), 527–537.
Article Google Scholar
Ruszczynski, A., & Shapiro, A. (2005). Conditional risk mappings. Mathematics of Operations Research, 31, 544–561.
Article Google Scholar
Shapiro, A. (2009). On a time consistency concept in risk averse multistage stochastic programming. Operations Research Letters, 37(3), 143–147.
Article Google Scholar
Shapiro, A., Dentcheva, D., & Ruszczyński, A. (2009). Lectures on stochastic programming: Modeling and theory (Vol. 9). Philadelphia: SIAM.
Book Google Scholar
Tahvonen, O., & Kallio, M. (2006). Optimal harvesting of forest age classes under price uncertainty and risk aversion. Natural Resource Modeling, 19(4), 557–585.
Article Google Scholar
Thomson, T. (1992). Optimal forest rotation when stumpage prices follow a diffusion process. Land Economics, 68(3), 329–342.
Article Google Scholar
Valladão, D. M., Veiga, Á., & Veiga, G. (2014). A multistage linear stochastic programming model for optimal corporate debt management. European Journal of Operational Research, 237(1), 303–311.
Article Google Scholar
Yoshimoto, A., & Shoji, I. (1998). Searching for an optimal rotation age for forest stand management under stochastic log prices. European Journal of Operational Research, 105(1), 100–112.
Article Google Scholar
Zheng, H. (2009). Efficient frontier of utility and cvar. Mathematical Methods of Operations Research, 70(1), 129–148.
Article Google Scholar
Zhong-wei, W., & Yan, P. (2009) Measurement of forest fire risk based on var. In International conference on management science and engineering, 2009. ICMSE 2009. IEEE.

Download references

Acknowledgments

This research was partially supported by Programa Basal PFB 03, CMM. B.K. Pagnoncelli acknowledges the financial support of FONDECYT under projects 11130056 and 1120244. A. Piazza acknowledges the financial support of FONDECYT under Project 11090254 and of CONICYT Anillo ACT1106 and CONICYT REDES 140183.

Author information

Authors and Affiliations

Escuela de Negocios, Universidad Adolfo Ibáñez, Diagonal las Torres 2640 Peñalolén, oficina 533 C, Santiago, Chile
Bernardo K. Pagnoncelli
Departamento de Matemática, Universidad Técnica Federico Santa María, Avenida España 1680, Casilla 110-V, Valparaiso, Chile
Adriana Piazza

Authors

Bernardo K. Pagnoncelli
View author publications
You can also search for this author in PubMed Google Scholar
Adriana Piazza
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bernardo K. Pagnoncelli.

Appendices

Appendix

1.1 Preliminaries

Before getting into the proof of the lemmas and theorems stated in the main body of the paper, we present some definitions that will be necessary throughout the proofs.

In (7), the optimal control $c_t$ in each period depends only on the current state of the forest and price through the decision function $c_t=\pi _t({\mathbb {X}}_t,p_t)$. A sequence $\varPi =\{\pi _t\}_{t\in {\mathcal {T}}}$ is called a policy. Of course, a policy is feasible if ${\mathbb {X}}_t$ and $c_t=\pi _t({\mathbb {X}}_t,p_t)$ satisfy (2) and (3) for every possible value of ${\mathbb {X}}_t$ and $p_t$ at every instant $t\in {\mathcal {T}}$. Observe that the non-negativity of the state variables $\bar{x}_t$ and $x_{a,t}$ for $a=1,\ldots ,n$ is assured by (2) and (3).

The expected benefit of a given policy $\varPi $ for an initial state ${\mathbb {X}}_{0}$ and an initial price $p_{0}$ is

$$\begin{aligned} Q^\varPi _0({\mathbb {X}}_{0},p_{0})= - p_{0}\pi _0({\mathbb {X}}_0,p_0) + \delta {\mathcal {R}}_{|p_{0}}\left[ -p_{1}\pi _1({\mathbb {X}}_1,p_1) + \delta {\mathcal {R}}_{|p_1}\left[ \cdots \right] \right] , \end{aligned}$$

(20)

if the time horizon is infinite. Correspondingly, we denote $Q^\varPi _{0,T}({\mathbb {X}}_{0},p_{0})$ the expected benefit of policy $\varPi $ whenever ${\mathcal {T}}=[1,\ldots ,T]$.

Problem (6) can be stated as the problem of finding a feasible policy that minimizes (20),

$$\begin{aligned} V_0({\mathbb {X}}_{0},p_{0}) =\left\{ \begin{array}{ll} \min _\varPi &{} Q^\varPi _0({\mathbb {X}}_{0},p_{0}) \\ \text{ s.t. } &{} \varPi \text{ is } \text{ a } \text{ feasible } \text{ policy }, \end{array}\right. \end{aligned}$$

(21)

or analogously, $V_{0,T}({\mathbb {X}}_{0},p_{0}) =\min _\varPi Q^\varPi _{0,T}({\mathbb {X}}_{0},p_{0}) $ s. t. $\varPi $ is a feasible policy.

In the sequel, we will also use the expected discounted benefit from an intermediate step

$$\begin{aligned} Q^{\varPi }_{t}({\mathbb {X}}_t,p_t)= - p_{t}c_t+ \delta {\mathcal {R}}_{|p_{t}}\left[ -p_{t+1}c_{t+1} + \delta {\mathcal {R}}_{|p_{t+1}}\left[ \cdots \right] \right] \end{aligned}$$

and the corresponding value function

$$\begin{aligned} V_{t}({\mathbb {X}}_t,p_t) =\left\{ \begin{array}{ll} \min _\varPi &{}\quad Q^\varPi _{t}({\mathbb {X}}_t,p_t) \\ \text{ s.t. } &{}\quad \varPi \text{ is } \text{ a } \text{ feasible } \text{ policy }, \end{array}\right. \end{aligned}$$

as well as the analogous definitions in the finite case.

Appendix 1: Proof of Lemma 1

For given initial state and price ${\mathbb {X}}_0$, $p_0$ we consider the cost resulting of the application of policy $\varPi $ (not necessarily optimal) up to T: $Q^\varPi _{0,T}({\mathbb {X}}_0,p_0)$. To lighten the notation we will use $Q^\varPi _{0,T}$ instead of $Q^\varPi _{0,T}({\mathbb {X}}_0,p_0)$ as ${\mathbb {X}}_0$ and $p_0$ remain constant throughout the proof. The expression of $Q^\varPi _{0,T}$ is a slight modification of (20)

$$\begin{aligned} Q^\varPi _{0,T}= - p_{0}c_{0} + \delta {\mathcal {R}}_{|p_{0}}[ -p_{1}c_{1} + \cdots + \delta {\mathcal {R}}_{|p_{T-1}} [-p_{T}\,c_{T}]], \end{aligned}$$

Due to the fact that $p_{t}c_{t}\ge 0$ for all t (when prices follow a GBM) and the monotonicity of any coherent risk measure we know that $Q^\varPi _{0,T}\ge Q^\varPi _{0,T+1}$, hence the sequence $Q^\varPi _{0,T}$ either converges to the limit or diverges to $-\infty $ when $T\rightarrow \infty $. We now prove that $Q^\varPi _{0,T}$ is bounded below for all T,

$$\begin{aligned} Q^\varPi _{0,T}= - p_{0}c_{0} - \delta e^\mu \chi p_{0}c_{1} - \cdots - (\delta e^\mu \chi )^{T}p_{0}c_{T}\ge -p_0S \frac{1- (\delta e^\mu \chi )^{T+1}}{1- \delta e^\mu \chi }, \end{aligned}$$

where S represents the total surface of the forest and $\chi =\lambda +\kappa (1-\lambda )$.

If $\delta e^\mu \chi <1$ we get $Q^\varPi _{0,T}>-p_0S \frac{1}{1- \delta e^\mu \chi }>-\infty $ for all T. This implies that the sequence $Q^\varPi _{0,T}$ converges when T goes to infinity. This limit is the value associated to the policy $\varPi $ denoted as $Q^\varPi _0$,

$$\begin{aligned} Q^\varPi _0=-p_0\sum _{t=0}^\infty (\delta e^\mu \chi )^tc_t>-\infty \end{aligned}$$

(22)

As the bound on $Q^\varPi _0$ does not depend on the policy $\varPi $, we conclude

$$\begin{aligned} V_0({\mathbb {X}}_0,p_0)= \mathop {\mathrm{Min}}_{\varPi } Q^\varPi _0>-\infty . \end{aligned}$$

Appendix 2: Proof of Theorem 1

To prove that the greedy policy is optimal, we check that the benefit associated with it, $Q^{\textit{GP}}$, satisfies the Bellman equation (7) from any initial condition. The formula of $Q^{\textit{GP}}$ is obtained using (22), where $\chi =\lambda +\kappa (1-\lambda )$.

We consider first the infinite time horizon case. If the initial state is ${\mathbb {X}}_t=(\bar{x},x_n,\ldots , x_2,x_1)$, it is easy to see that the harvests associated to the GP are $c_{t+in}=\bar{x}+x_{n}$ for all $i\in {\mathbb {N}}$ and $c_{t+in+j}=x_{n-j}$ for $j=1,\ldots ,n-1$ and $i\in {\mathbb {N}}$, and hence,

$$\begin{aligned} Q_t^{\textit{GP}}({\mathbb {X}}_t,p_t)=- p_t \sum _{i=0}^{\infty }\left( (\delta e^\mu \chi )^{in} \bar{x}+ \sum _{j=0}^{n-1}(\delta e^\mu \chi )^{in+j} x_{n-j} \right) . \end{aligned}$$

(23)

Given $c_t=c\in [0,CA{\mathbb {X}}_t]$, the state at $t+1$ is ${\mathbb {X}}_{t+1}=(\bar{x}+x_n-c,x_{n-1},\ldots ,x_1,c) $ and the value associated to the GP is

$$\begin{aligned} Q_{t+1}^{\textit{GP}}({\mathbb {X}}_{t+1},p_{t+1})&=- p_{t+1} \sum _{i=0}^{\infty }\Big ((\delta e^\mu \chi )^{in} (\bar{x}+x_n-c)\\&\quad + \sum _{j=0}^{n-2}(\delta e^\mu \chi )^{in+j} x_{n-j-1}+(\delta e^{\mu }\chi )^{in+n-1}c \Big ). \end{aligned}$$

Inserting $V_{t+1}=Q_{t+1}^{\textit{GP}}$ into the rhs of the Bellman equation (7), the argument of the $\mathop {\mathrm{Min}}$ operator is

$$\begin{aligned} \varPhi (c)= & {} -p_t c+\delta {\mathcal {R}}_{|p_t} \left[ -p_{t+1} \sum _{i=0}^{\infty }\left( (\delta e^\mu \chi )^{in} (\bar{x}+x_n-c) \right. \right. \nonumber \\&+\, \left. \left. \sum _{j=0}^{n-2}(\delta e^\mu \chi )^{in+j} x_{n-j-1}+(\delta e^{\mu }\chi )^{in+n-1}c \right) \right] \nonumber \\= & {} -p_tc - p_t \sum _{i=0}^{\infty }\Big ((\delta e^\mu \chi )^{in+1} (\bar{x}+x_n-c) \nonumber \\&+\,\sum _{j=0}^{n-2}(\delta e^\mu \chi )^{in+j+1} x_{n-j-1}+(\delta e^{\mu }\chi )^{in+n}c \Big ) \end{aligned}$$

(24)

The coefficient affecting c in $\varPhi (c)$ is

$$\begin{aligned} \text{ coeff }(c) \!=\!-p_t \left( \!1-\!\sum _{i=0}^\infty (\delta e^\mu \chi )^{in+1}\!+\! \sum _{i=0}^\infty (\delta e^\mu \chi )^{in+n} \!\right) \!=-p_t(1\!-\delta e^\mu \chi ) \sum _{i=0}^\infty (\delta e^\mu \chi )^{in}\!>\!0 \end{aligned}$$

As $\text{ coeff }(c)<0$, the minimum in (7) is attained when $c=\bar{x}+x_n$. Inserting this value of c in (24) yields

$$\begin{aligned} \varPhi (\bar{x}+x_n)= & {} -p_t(\bar{x}+x_n) - p_t \sum _{i=0}^{\infty }\left( \sum _{j=0}^{n-2}(\delta e^\mu \chi )^{in+j+1} x_{n-j-1}+(\delta e^{\mu }\chi )^{in+n}(\bar{x}+x_n) \right) \\= & {} - p_t \sum _{i=0}^{\infty }\left( (\delta e^{\mu }\chi )^{in}(\bar{x}+x_n)+ \sum _{j=1}^{n-1}(\delta e^\mu \chi )^{in+j} x_{n-j} \right) =Q^{\textit{GP}}({\mathbb {X}}_t,p_t), \end{aligned}$$

showing that (7) holds and that the GP is optimal.

The proof for the finite horizon case is similar but more involved. Indeed, let t be expressed as $T-(kn+j)$, where $k=\lfloor {\frac{T-t}{n}}\rfloor $ and $j\in \{0,\ldots ,n-1\}$ is the remainder of the integer division of $(T-t)$ by n. This way of expressing t puts in evidence that after completing k cycles there will be still j time steps to go until reaching the end of the horizon. Thus, the expected benefit associated to the GP is

$$\begin{aligned} Q^{\textit{GP}}_t({\mathbb {X}}_t,p_t)&=-p_t\left[ \sum _{i=0}^{k-1}(\delta e^{\mu }\chi )^{in}\left( \bar{x}+\sum _{l=0}^{n-1}(\delta e^{\mu }\chi )^l x_{n-l}\right) \right. \\&\quad \left. +\, (\delta e^{\mu }\chi )^{kn}\left( \bar{x}+\sum _{l=0}^j (\delta e^{\mu }\chi )^l x_{n-l} \right) \right] . \end{aligned}$$

The first term of the rhs represents the expected benefit of the k completed cycles, while the second term corresponds to the last j steps. Again, we need to check that $Q^{\textit{GP}}$ satisfies (7) and that the minimum is attained for $c=CA{\mathbb {X}}_t$. We leave the details to the reader.

Appendix 3: Proof of Theorem 2

To prove that the accumulating policy is optimal, we check that the benefit associated with it ($Q^{AP}$) satisfies the dynamic programming equation (7). Let t be expressed as $T-(kn+j)$, where $k=\lfloor {\frac{T-t}{n}}\rfloor $ and $j\in \{0,\ldots ,n-1\}$ is the remainder of the integer division of $(T-t)$ by n. After some computations we can prove that

$$\begin{aligned} Q^{AP}_t({\mathbb {X}}_t,p_t) = -p_t\Big [(\delta e^\mu \chi )^k (\bar{x}+\sum _{l=0}^j x_{n-l}) +\sum _{i=1}^k (\delta e^\mu \chi )^{in+j}S \Big ] \end{aligned}$$

(25)

where S represents the total surface of the forest and $\chi =\lambda +\kappa (1-\lambda )$. We point out that for $k=0$, we follow the convention $\sum _{1}^0 (\cdot )=0$.

For the rest of the proof we divide the study into two cases depending on the value of j: (i) $j>0$ and (ii) $j=0$.

(i) Here we have $t+1=T-(k'n+j')$ where $k'=k$ and $j'=j-1\in \{0,\ldots ,n-2\}$ and $Q^{AP}_{t+1}(A{\mathbb {X}}_t+Bc,p_{t+1})$ can be expressed as

$$\begin{aligned} - p_{t+1}\Big [ (\delta e^\mu \chi )^{j-1} (\bar{x}+x_n-c+\sum _{l=0}^{j-1} x_{n-l-1}) + \sum _{i=1}^k (\delta e^\mu \chi )^{in+j-1}S \Big ], \end{aligned}$$

where $\chi =\lambda +(1-\lambda )\kappa $. Inserting $V = Q^{AP}$ into the right-hand side of the dynamic programming equation (7), the argument of the min operator, $\varPhi (c)$, is

$$\begin{aligned}&-\,p_tc+\delta {\mathcal {R}}_{|p_t}\big [ -p_{t+1}\big [(\delta e^\mu \chi )^{j-1} (\bar{x}+x_n-c+\sum _{l=0}^{j-1} x_{n-l-1}) + \sum _{i=1}^k (\delta e^\mu \chi )^{in+j-1}S \big ] \big ] \\&\quad = -\,p_tc-p_t\big [(\delta e^\mu \chi )^{j} (\bar{x}+x_n-c+\sum _{l=0}^{j-1} x_{n-l-1}) + \sum _{i=1}^k (\delta e^\mu \chi )^{in+j}S \big ] \big ) \\&\quad = -\,p_tc(1-(\delta e^\mu \chi )^{j} )-p_t\big [(\delta e^\mu \chi )^{j} (\bar{x}+x_n+\sum _{l=1}^{j} x_{n-l}) + \sum _{i=1}^k (\delta e^\mu \chi )^{in+j}S \big ] \\&\quad = -\,p_tc(1-(\delta e^\mu \chi )^{j} )+Q^{AP}_t({\mathbb {X}}_t,p_t). \end{aligned}$$

As the coefficient of c is non-negative, the minimum is attained when $c=0$ and $\varPhi (0)$ is exactly $Q^{AP}_t({\mathbb {X}}_t,p_t),$ showing that equation (7) holds.

(ii) Case $t=T-kn$. In this case, we have $t+1=T-[(k-1)n+n-1)]$ and $Q_{t+1}^{AP}(A{\mathbb {X}}_t+Bc,p_{t+1})$ can be expressed as

$$\begin{aligned}&-\,p_{t+1}\Big [(\delta e^\mu \chi )^{n-1} (\bar{x}+x_n-c+\sum _{l=0}^{n-2} x_{n-l-1}+c) + \sum _{i=1}^{k-1} (\delta e^\mu \chi )^{in+n-1}S \Big ]\\&\quad = -\,p_{t+1}\left[ (\delta e^\mu \chi )^{n-1} S + \sum _{i=2}^{k} (\delta e^\mu \chi )^{in-1}S \right] = -p_{t+1} \sum _{i=1}^{k} (\delta e^\mu \chi )^{in-1}S. \end{aligned}$$

Inserting again $V = Q^{AP}$ into the right-hand side of the Bellman’s equation (7), the argument of the min operator is

$$\begin{aligned} \varPhi (c)= & {} -p_tc+\delta {\mathcal {R}}_{|p_t} \left[ -p_{t+1} \sum _{i=1}^{k} (\delta e^\mu \chi )^{in-1}S\right] . \end{aligned}$$

The coefficient of c is negative, and thus, the minimum is attained when $c=\bar{x}+x_n$. So we have,

$$\begin{aligned} \varPhi (\bar{x}+x_n)= & {} -p_t(\bar{x}+x_n) - p_t \sum _{i=1}^{k} (\delta e^\mu \chi )^{in}S. \end{aligned}$$

The right-hand side is exactly (25) when $j=0$, hence we have $\varPhi (\bar{x}+x_n)=Q^{AP}_t(\cdot ,\cdot )$ and equation (7) is satisfied.

In both cases, we have shown that $Q^{AP}_t(\cdot ,\cdot )$ satisfies equation (7), hence it is the value function and the proposed policy is optimal.

Appendix 4: Proof of Lemma 2

Due to (19), we only need to show that

$$\begin{aligned} \frac{\delta b}{1-\delta a}\ge & {} \frac{b}{1-a}\Big [1-\frac{1-\delta ^j}{a^m(1-\delta ^ja^j)} \Big ]. \end{aligned}$$

(26)

Using that

$$\begin{aligned} \frac{\delta b}{1-\delta a}=\frac{b}{1-a}\Big [1-\frac{1-\delta }{1-\delta a}\Big ], \end{aligned}$$

we have that (26) is equivalent to

$$\begin{aligned} \frac{1}{1-a}\Big [\frac{1-\delta }{1-\delta a}\Big ]\le & {} \frac{1}{1-a}\Big [\frac{1-\delta ^j}{a^m(1-\delta ^ja^j)} \Big ]\\&\quad \iff \frac{1}{1-a}\Big [\frac{a^m(1-\delta ^ja^j)}{1-\delta a}\Big ] \le \frac{1}{1-a}\Big [\frac{1-\delta ^j}{1-\delta } \Big ]\\&\quad \iff \frac{1}{1-a}\Big [a^m \sum _{l=0}^{j-1}(\delta a)^l \Big ] \le \frac{1}{1-a}\Big [\sum _{l=0}^{j-1}\delta ^l\Big ]. \end{aligned}$$

Given that $a\in (0,1)$, the last inequality is always valid.

Appendix 5: Proof of Lemma 3

Given initial state and price ${\mathbb {X}}_0$, $p_0$ we denote by $Q^\varPi _{0,T}({\mathbb {X}}_0,p_0)$ the cost resulting of the application of a (not necessarily optimal) policy $\varPi $ up to T. As in Lemma 1 we denote $Q^\varPi _{0,T}=Q^\varPi _{0,T}({\mathbb {X}}_0,p_0)$. Knowing that ${\mathcal {R}}_{p_t}[-p_{t+1}]=-ap_t-b$, where a and b are defined in (17), we can write $Q^\varPi _T$ as:

$$\begin{aligned} Q^\varPi _{0,T}= & {} - p_{0}c_{0} + \delta {\mathcal {R}}_{|p_{0}}[ -p_{1}c_{1} + \delta {\mathcal {R}}_{|p_{1}}[ -p_{2}c_{2}+ \cdots \\&+\, \delta {\mathcal {R}}_{|p_{T-3}} [-p_{T-2}c_{T-2}+ \delta {\mathcal {R}}_{|p_{T-2}} [-p_{T-1}c_{T-1}+ \delta {\mathcal {R}}_{|p_{T-1}} [-p_{T}\,c_{T}]]]]]\\= & {} - p_{0}c_{0} + \delta {\mathcal {R}}_{|p_{0}}[ -p_{1}c_{1} +\delta {\mathcal {R}}_{|p_{1}}[ -p_{2}c_{2} +\cdots \\&+\, \delta {\mathcal {R}}_{|p_{T-3}} [-p_{T-2}c_{T-2}+ \delta {\mathcal {R}}_{|p_{T-2}} [-p_{T-1}(c_{T-1}+\delta a c_T)- \delta bc_T]]]]\\= & {} - p_{0}c_{0} + \delta {\mathcal {R}}_{|p_{0}}[ -p_{1}c_{1} + \delta {\mathcal {R}}_{|p_{1}}[ -p_{2}c_{2} \cdots \\&+\, \delta {\mathcal {R}}_{|p_{T-3}} [-p_{T-2}(c_{T-2}+ \delta ac_{T-1}+(\delta a)^2c_{T})-b\delta c_{T-1}- b\delta ^2(a+1)c_T]]]\\&\vdots \\= & {} -p_0\sum _{t=0}^T (\delta a)^tc_t - b\sum _{t=1}^T \delta ^t c_t \frac{1-a^t}{1-a} \end{aligned}$$

We will show that $\lim _{T\rightarrow \infty } Q^\varPi _{0,T}$ exists and define the value associated with policy $\varPi $ for the infinite time horizon as the value of that limit. To this end we compute

$$\begin{aligned} |Q^\varPi _{0,T+\tau }-Q^\varPi _{0,T}|\le & {} p_0\sum _{t=T+1}^{T+\tau }c_t(\delta a)^t + |b| \sum _{t=T+1}^{T+\tau } \delta ^t c_t \frac{1-a^t}{1-a} \nonumber \\\le & {} p_0\sum _{t=T+1}^{\infty }c_t(\delta a)^t + |b| \sum _{t=T+1}^{\infty } \delta ^t c_t \frac{1}{1-a}\nonumber \\\le & {} (\delta a)^{T+1} p_0 S \frac{1}{1-\delta a} + \delta ^{T+1} S |b| \frac{1}{1-\delta }\frac{1}{1-a}. \end{aligned}$$

(27)

As the last expression converges to 0 when $T\rightarrow \infty $ for all $\tau \in {\mathbb {N}}$, we conclude that $\lim _{T\rightarrow \infty } Q^\varPi _{0,T}$ exists.

Appendix 6: Proof of Theorem 3

We state ${\mathbb {X}}_{t+1}$ and equation (7) in terms of ${\mathbb {X}}_t$ and c as follows.

$$\begin{aligned} {\mathbb {X}}_t=\begin{pmatrix}\bar{x}_t\\ x_{n,t}\\ x_{n-1,t}\\ \vdots \\ \vdots \\ x_{1,t} \end{pmatrix} \longrightarrow {\mathbb {X}}_{t+1}=A{\mathbb {X}}_t+Bc=\begin{pmatrix}\bar{x}_t+ x_{n,t}- c\\ x_{n-1,t}\\ x_{n-2,t}\\ \vdots \\ \vdots \\ c \end{pmatrix},\\ V_t({\mathbb {X}}_t,p_t)=\min _c \big \{- p_tc + \delta {\mathcal {R}}_{|p_t}(V_{t+1}({\mathbb {X}}_{t+1},p_{t+1}))\big \}. \end{aligned}$$

The main idea of the proof is to consider the role played by c in all the possible expressions of $V_{t+1}(\cdot ,\cdot )$. This is not an easy task, but despite all the possible harvesting policies, the coefficient of c has a particular structure: it is the sum of terms of the form $\varDelta ^{m_i}_{j_i}(p_t)$ (as defined in (18)), for some values of $m_i\in {\mathbb {N}}$ and $j_i\in \{0,\ldots ,n-1\}$ plus possibly one negative term $\varGamma ^m(p_t)=-\delta ^m[ p_ta^m +b\sum _{l=0}^{m-1}a^l]$.^{Footnote 6}

Indeed, from $t+1$ on, two different situations can arise: (i) nothing is harvested in the next n steps or (ii) the first harvest occurs at $t=j_0$ with $1\le j_0<n$.

In case (i), the state at $t+n$ will be

$$\begin{aligned} {\mathbb {X}}_{t+n}=(S-c,c,0,\ldots ,0)^T. \end{aligned}$$

It is easy to see that the influence of c extinguishes as the constraint on the harvest is $c_{t+n}\le S$. We do not know the complete expression of $V_{t+1}(\cdot ,\cdot )$ but we do know that the coefficient of c is simply $\varGamma ^0(p_t)=-p_t$ with no $\varDelta ^m_j(p_t)$ terms.

In case (ii), the first harvest after t takes place at $t+j_0$ and (7) can be written as:

$$\begin{aligned} V_t({\mathbb {X}}_t,p_t)= & {} \min _c \big \{- p_tc + \delta {\mathcal {R}}_{|p_{t+1}}[\delta {\mathcal {R}}_{|p_{t+2}}[\ldots \delta {\mathcal {R}}_{p_{t+j_0-1}}[-p_{t+j_0}(\bar{x}_t+\cdots +x_{n-j_0,t}-c)\\&+\, \delta {\mathcal {R}}_{|p_{t+j_0}}[V_{t+j_0+1}({\mathbb {X}}_{t+j_0+1},p_{t+j_0+1})]]]]\big \}. \end{aligned}$$

Hence, the first term of the coefficient of c is of the form

$$\begin{aligned} \varDelta ^0_{j_0}(p_t)=-p_t(1-\delta ^{j_0}a^{j_0})-\frac{b}{1-a}(-\delta ^{j_0}+\delta ^{j_0}a^{j_0}). \end{aligned}$$

There might be more terms including c in the expression of $V_{t+j_0+1}(\cdot ,\cdot )$. For a complete characterization of the coefficient of c we refer the reader to Piazza and Pagnoncelli (2014) where the analogous result in the risk neutral case is presented. The construction of the coefficient of c follows the same lines, the reader only needs to substitute the operator ${\mathbb {E}}_{|p_t}$ for ${\mathcal {R}}_{|p_t}$.

The number of terms comprising the coefficient of c, may or not be finite. In the infinite case, Lemma 3 implies that the sum converges.

The proof is completed by showing that the coefficient of c is negative. But, Lemma 2 shows that $\varDelta ^m_j\le 0$ when condition (16) holds, which finishes the proof.

Appendix 7: Proof of Lemma 4

For values of $a\in (0,1)$ the proof presented for Lemma 3 is valid. For values of $a\in [1,1/\delta )$ we need to modify the proof from (27) onwards.

We have that

$$\begin{aligned} |Q^\varPi _{T+\tau }-Q^\varPi _T| { \le } p_0\sum _{t=T+1}^{T+\tau }c_t(\delta a)^t + |b| \sum _{t=T+1}^{T+\tau } \delta ^t c_t \sum _{j=0}^{t-1}a^j. \end{aligned}$$

Using that $1\le a$ and $c_t\le S$ for all t we get

$$\begin{aligned} |Q^\varPi _{T+\tau }-Q^\varPi _T| { \le } (\delta a)^{T+1} p_0 S \frac{1}{1-\delta a} + S |b| \sum _{t=T+1}^{\infty } t (a\delta )^t. \end{aligned}$$

As the sum $\sum _{t=1}^\infty t (a\delta )^t$ converges whenever $a\delta <1$, its T-tail must go to zero when T goes to infinity.

Finally, we have that the right hand side of the inequality above converges to 0 when $T\rightarrow \infty $ and we conclude that $\lim _{T\rightarrow \infty } Q^\varPi _T$ exists.

Appendix 8: Proof of Theorem 4

This theorem is a generalization of Theorem 3 from $(a,b)\in (0,1)\times {\mathbb {R}}_+$ to $(a,b)\in (0,1/\delta )\times {\mathbb {R}}$. The proof of Theorem 3 consist in characterizing the coefficient of c in the Bellman equation (7). It is shown that this coefficient is the sum of infinite terms of the form $\varDelta ^m_j(p_t)$. This construction does not depend on the value of a and b but relies exclusively in the fact that the conditional risk measure is affine on $p_t$. Hence, this part of the proof extends directly to the more general setting of this theorem.

The characterization of the coefficient’ sign relies on Lemma 2 presented in Sect. 5 that gives a sufficient condition assuring that

$$\begin{aligned} \varDelta ^m_j(p_t)\le 0 \quad \text{ for } \text{ all } \; m\le T-t\quad \hbox { and for all }\; j\in \{1,\ldots ,n\}. \end{aligned}$$

(28)

Lemma 2 is valid for $(a,b)\in (0,1)\times {\mathbb {R}}_+$. In the following we study the extension of this lemma to $(a,b)\in (0,1/\delta ) \times {\mathbb {R}}$. We start by noticing that (18) is not valid for $a=1$. We will use the following representation of $\varDelta ^m_j(p_t)$,

$$\begin{aligned} \left\{ \begin{array}{ll} \delta ^{m_i}\left\{ -p_ta^{m_i}(1-\delta ^{j_i}a^{j_i})-\frac{b}{1-a}\Big [1-\delta ^{j_i}-a^{m_i}(1-\delta ^{j_i}a^{j_i})\Big ]\right\} &{}\quad \text{ if } \; a\ne 1\\ \delta ^m\{-p_t(1-\delta ^j)-b[m-\delta ^j(m+j)]\}&{}\quad \text{ if } \;a=1, \end{array}\right. \end{aligned}$$

The parameters’ semi-plane is divided in five regions as shown in Fig. 3 and Table 2.^{Footnote 7}

Table 2 Parameters regions

Full size table

In the following, we look for conditions implying Condition (28), as this is sufficient to prove that the coefficient of $c^*_t$ is negative and hence $c^*_t=CA{\mathbb {X}}_t$. We will see that the following conditions imply Condition (28):

1.
In region (i), $p_t\ge b\delta /(1-a\delta )$ (this is Lemma 2).
2.
In region (ii), $p_t\ge b\delta /(1-a\delta )$.^{Footnote 8}
3.
In region (iii), no sufficient condition assuring Condition (28) is found in the infinite horizon case.
4.
In region (iv), $p_t\ge b/(1-a)$

Let us denote by $r^m_j(a,b)$ to the rhs of (19) when $a\ne 1$ and the corresponding expression for $a=1$, i.e.,

$$\begin{aligned} r^m_j(a,b)=\left\{ \begin{array}{l@{\quad }l} \frac{b}{1-a}\Big [1-\frac{1-\delta ^j}{a^m(1-\delta ^ja^j)} \Big ] &{} \text{ if } a\ne 1\\ \frac{-b}{1-\delta ^j}[m-\delta ^j(m+j)] &{} \text{ if } a= 1 \end{array}\right. \end{aligned}$$

In the following, we prove the properties summarized in Fig. 3. We observe in the first place that

$$\begin{aligned} \varDelta ^m_j(p_t)\le 0 \iff p_t\ge r^m_j(a,b) \qquad \text{ if } 1-\delta a>0 \end{aligned}$$

(29)

We start by determining whether $r^0_1(a,b)=\frac{b\delta }{1-\delta a}$ bounds $r^m_j(a,b)$ (below or above) for all m and $j=1,\ldots ,n$. We study the case $a\ne 1$, leaving the easier particular case $a=1$ to the reader. We also make the observation that if $b=0$ then $r^m_j=0$ for all m and j.

(30)

If ${{\mathrm{sign}}}(b){{\mathrm{sign}}}(a-1)>0$, i.e., in regions (ii) or (iii), (30) is equivalent to

$$\begin{aligned}&a^m\frac{1-\delta ^j a^j}{1-\delta a} \lesseqqgtr \frac{1-\delta ^j}{1-\delta } \\&\quad \iff a^m\sum _{l=0}^{j-1}(\delta a)^j \lesseqqgtr \sum _{l=0}^{j-1}\delta ^j \end{aligned}$$

If $a>1$ (region (ii)), the inequality above holds with “$\ge $”. Hence, $r^0_1(a,b)\ge r^m_j(a,b)$.
If $a<1$ (region (iii)), it holds with “$\le $”. Hence, $r^0_1(a,b)\le r^m_j(a,b)$.

If ${{\mathrm{sign}}}(b){{\mathrm{sign}}}(a-1)<0$, i.e., in regions (i) or (iv), (30) is equivalent to

If $a>1$ (region (iv)), the inequality above holds with “$\le $”. Hence, $r^0_1(a,b)\le r^m_j(a,b)$.
If $a<1$ (region (i)), it holds with “$\ge $”. Hence, $r^0_1(a,b)\ge r^m_j(a,b)$.

Table 3 summarizes these results.

Table 3 Bounds for $r^m_j(a,b)$

Full size table

Putting this information together with that of (29), we conclude that to assure Condition (28) it is sufficient to have the conditions indicated in Table 4.

Table 4 Bounds for $p_t$

Full size table

In regions (i) and (ii) we are ready to give a sufficient condition assuring Condition (28):

$$\begin{aligned} p_t\ge r^0_1(a,b)=\frac{\delta b}{1-\delta a}. \end{aligned}$$

To reach some conclusion in regions (iii) and (iv) we need some extra information of $r^m_j(a,b)$.

In region (iv) it is very easy to check that $r^m_j(a,b)\le b/(1-a)$. Hence $p_t\ge b/(1-a)$ is sufficient to assure Condition (28).

In region (iii) is a bit different. We have that $\lim _{m\rightarrow \infty } r^m_j(a,b)=+\infty $, hence $p_t$ cannot be greater than $r^m_j$ for all m. Hence, no conclusion can be drawn in the infinite horizon case. However, in the finite horizon case Condition (28) only requires having $p_t\ge r^m_j$ for $m\le T-t$. Furthermore, some calculation shows that $r^{m+1}_j>r^m_j$ and $r^{m}_j>r^m_{j+1}.$ Hence, we can propose a condition depending on the value of $T-t$: $p_t\ge r^{T-t}_1=\frac{b}{1-a}\Big [1-\frac{1-\delta }{a^{T-t}(1-\delta a)}\Big ].$

Appendix 9: Mean deviation risk calculation for O–U

For the particular case of $p=2$ and a O–U process, the conditional MDR is given by:

$$\begin{aligned} {\text {MDR}}_{|p_t}(-p({t+1}))&= -p_t e^{-\eta } - \bar{p}(1-e^{-\eta }) \\&\quad +\,c\left( {\mathbb {E}}\left[ \left| \int _t^{t+1} \sigma e^{\eta (s-(t+1))}dW(s) \right| ^2 \right] \right) ^{1/2}. \end{aligned}$$

In order to calculate the stochastic integral we apply Itô’s isometry:

$$\begin{aligned} {\mathbb {E}}\left( \left| \int _0^T G(t,W_t) dW_t \right| ^2\right) = {\mathbb {E}}\left( \int _0^T \left| G(t,W_t) \right| ^2 dt \right) , \end{aligned}$$

(31)

for a stochastic process $G(t,W_t) \in {\mathbb {L}}^2(0,T)$. Using (31) and noting that in our case the process G is deterministic, we have

$$\begin{aligned} \left( {\mathbb {E}}\left[ \left| \int _t^{t+1} \sigma e^{\eta (s-(t+1))} dW(s) \right| ^2 \right] \right) ^{1/2} = \left( {\mathbb {E}}\left[ \int _t^{t+1} (\sigma e^{\eta (s-(t+1))})^2 ds \right] \right) ^{1/2} \\ = \sigma \left( {\mathbb {E}}\left[ \int _t^{t+1} e^{2\eta (s-(t+1))} ds \right] \right) ^{1/2} = \sigma \left( \frac{1}{2\eta } - \frac{e^{-2\eta }}{2\eta } \right) ^{1/2}. \end{aligned}$$

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pagnoncelli, B.K., Piazza, A. The optimal harvesting problem under price uncertainty: the risk averse case. Ann Oper Res 258, 479–502 (2017). https://doi.org/10.1007/s10479-015-1963-9

Download citation

Published: 27 August 2015
Issue Date: November 2017
DOI: https://doi.org/10.1007/s10479-015-1963-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

The optimal harvesting problem under price uncertainty: the risk averse case

Abstract

Similar content being viewed by others

A Population Harvesting Model with Time and Size Competition Dependence Function

Adaptive economic and ecological forest management under risk

Stochastic Mitra–Wan forestry models analyzed as a mean field optimal control problem

1 Introduction

2 Model formulation

3 Weighted mean-risk formulation

3.1 The Conditional Value-at-Risk

3.2 Dynamic programming formulation

Remark 1

4 Geometric Brownian motion

Lemma 1

Theorem 1

4.1 Discussion and insights

4.2 Another optimal policy

Theorem 2

4.3 Efficient frontier

5 Ornstein–Uhlenbeck process

Lemma 2

Lemma 3

Theorem 3

5.1 Discussion and insights

6 Extension to affine weighted mean-risk measures

Lemma 4

Theorem 4

Remark 2

Corollary 1

6.1 Mean deviation risk

7 Conclusions

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix

1.1 Preliminaries

Appendix 1: Proof of Lemma 1

Appendix 2: Proof of Theorem 1

Appendix 3: Proof of Theorem 2

Appendix 4: Proof of Lemma 2

Appendix 5: Proof of Lemma 3

Appendix 6: Proof of Theorem 3

Appendix 7: Proof of Lemma 4

Appendix 8: Proof of Theorem 4

Appendix 9: Mean deviation risk calculation for O–U

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation