Abstract
Current methodologies for finding portfolio rules under the Merton framework employ hard-to-implement numerical techniques. This work presents a methodology that can derive an allocation à la Merton in a spreadsheet, under an incomplete market with a time-varying dividend yield and long-only constraints. The first step of the method uses the martingale approach to obtain a portfolio rule in a complete artificial market. The second step derives a closed-form optimal solution satisfying the long-only constraints, from the unconstrained solution of the first step. This is done by determining closed-form Lagrangian dual processes satisfying the primal-dual optimality conditions between the true and artificial markets. The last step estimates the parameters defined in the artificial market, to then obtain analytical approximations for the hedging demand component within the optimal portfolio rule of the previous step. The methodology is tested with real market data from 16 US stocks from the Dow Jones. The results show that the proposed solution delivers higher financial wealth than the myopic solution, which does not consider the time-varying nature of the dividend yield. The sensitivity analysis carried out on the closed-form solution reveals that the difference with respect to the myopic solution increases when the price of the risky asset is more sensitive to the dividend yield, and when the dividend yield presents a higher probability of diverging from the current yield. The proposed solution also outperforms a known Merton-type solution that derives the Lagrangian dual processes in another way.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Continuous and time-varying allocations under the Merton [9] framework are still ignored by practitioners. As mentioned by Cochrane [4], “Merton’s theory has almost no impact on portfolio practice” and solving Merton models “remains a productive and challenging enterprise”. Evidently, ignoring the dynamic nature of markets in the portfolio allocation, as myopic solutions do, can lead to considerable welfare losses in the long term (see Larsen and Munk [8] and Castañeda and Reus [2] for examples).
To shed light on this matter, this work proposes an approximation method for a portfolio problem with one or two risky assets and a mean-reverting dividend yield process, to capture the well-known relationship between stock returns and the dividend-price ratio (see Cochrane [3]). The methodology is developed under an incomplete market, where the risk emerging from the dividend yield dynamics cannot be fully hedged. A closed-form solution is provided, using the “artificial markets”technique, developed by Karatzas et al. [7] and Cvitanić and Karatzas [5].
The method has two novel features, which are interesting for both academics and practitioners. The first is that it is very easy to implement, since the portfolio rule can be computed in a spreadsheet. Current à la Merton solutions involve implementing complex numerical methods which cannot be found in open-source programming languages. For example, Munk and Sørensen[10] use a finite-difference backwards iterative solution of the Hamilton–Jacobi–Bellman (HJB) equation to find the optimal investment in a life cycle problem with two assets and mean-reverting interest rates. Bick et al. [1] also employ the “artificial markets”technique to derive a portfolio rule for the same setting as in Munk and Sørensen [10]. However, the last step of their methodology includes a procedure for determining the near-optimal solution which is left unspecified.Footnote 1 More recently,Kamma and Pelsser [6] enhance the method from Bick et al.[1] to solve the a life cycle problem with more general return structures, general trading and liquidity constraints, and state-dependent utility functions.
The second is related to the way of determining the long-only allocations after deriving the unconstrained optimal allocation in the complete artificial market. Current methodologies prune (e.g Bick et al.[1]) or project (e.g Kamma and Pelsser [6]) the unconstrained solution to satisfy the portfolio constraints, which might lead to suboptimal solutions in the two-asset case. Instead, the method proposed derives a closed-form expression for the Lagrangian dual processes emerging from the long-only constraints. These dual processes, along with the constrained allocation, satisfy the primal-dual optimality conditions defined in Cvitanić and Karatzas[5]. It is important to remark that the formula can be applied in any portfolio problem involving two assets and long-only constraints, since the closed-form values of the dual processes are a function of nothing more than the unconstrained optimal solution and the asset’s covariance matrix.
2 Methodology
2.1 Market with one risky asset
Consider a market composed of constant risk-free rate r and a risky asset (stock) of price \(S_t\), with the following dynamics
where \(\sigma _{S}\) is the instantaneous volatility of the stock’s return, \((W_{t})_{0\le t\le T}\) is a standard Brownian Motion (BM) process. The instantaneous expected return \(\mu _{S,t}=r+\alpha d_{t}\), where \(\alpha >0\) is a constant parameter and \(d_{t}\) is a mean-reverting process aimed at capturing the relationship between stock returns and the dividend-price ratio. The process \(d_{t}\) follows
where \((\kappa ,d_{\infty },\sigma _{d},\rho _{Sd})\) are constant parameters, and \(W_{d}\) is an independent standard BM process. The investor’s preferences are represented by the constant relative risk aversion (CRRA) utility
where T is the planning horizon, \(\gamma {>0}\) is the Arrow-Pratt’s coefficient of relative risk aversion. \(X_{T}\) is the terminal value of the financial wealth process, \(X_{t}\ge 0{ \ }\forall t\in \left[ 0,T\right]\), which follows
where \(\pi _{S,t}\in [a,b]\) is the fraction invested (of financial wealth, \(X_{t}\)) in the risky asset. It is assumed that the remains are invested in a risk-free asset (with return r).
Previous market is incomplete because it is not possible to fully hedge the risk emerging from process \(W_{d}\) in \(d_t\). Thus, the “artificial markets”technique is used to complete the market with a fictitious second risky asset following \(\text {d}F_t/F_t=(r+\phi _{F,t})\text {d}t+\text {d}W_{d,t}\), where \(\phi _{F,t}\) represents the market premium of the fictitious asset. Using the martingale approach, the investment problem can be written as
where \(\xi _{t}\) is the stochastic discount factor (SDF) in the artificial financial market. In this case, the SDF satisfies \(\text {d}\xi _{t}/\xi _{t}=-r\text {d}t -\tilde{\theta }_{t}\text {d}\tilde{W}_{t}\), with \(\tilde{\theta }_{t}=(\frac{\alpha }{\sigma _S} d_{t},\phi _{F,t})^{\prime }\in \mathbb {R}^{2}\), \(\tilde{W}_{t}=(W_{S,t},W_{d,t})^{\prime }\in \mathbb {R}^{2}\). Let \(\pi _{t}^{\prime }=(\pi _{S,t},\pi _{F,t})^{\prime } \in \mathbb {R}^{2}\) be the fraction of financial wealth invested in the real and fictitious assets, and let \(\mu _{t}^{\prime }=(\mu _{S,t},\mu _{F,t})^{\prime } \in \mathbb {R}^{2}\). The financial wealth \(X_t\) in this complete market follows
with \(\tilde{\sigma }= \begin{pmatrix} \sigma _{S} &{} 0 \\ 0 &{} 1 \end{pmatrix}\). To find the unconstrained optimal solution, let \(\psi\) be the Lagrange multiplier of the budget constraint. KKT conditions in (3) deliver \(\partial U(X_T)/\partial X_T= \psi \xi _{T}\). Thus, the optimal solution satisfies \(X_T^{*}= (\psi \xi _{T})^{-1/\gamma }\). Note that budget constraint is active in the optimal solution. If not, \(\psi =0\), which contradicts KKT. Defining \(\xi _{t,T}=\xi _{T}/\xi _{t}\), then
where \(R=1-\frac{1}{\gamma }\). Define function \(g_t(d_t)\) as
From (5) and the fact that \(\textbf{E}_{0}\left[ \xi _{T}X_{T}\right] = X_{0}\), is not hard to find that \(\psi =\frac{X_0}{E_0(\xi _{T}^{R})}^{-\gamma }\), and thus the indirect utility function in this artificial market (which depends of process \(\phi _F\)) is
By applying Itô’s Lemma on \((\psi \xi _{t})^{-\frac{1}{\gamma }}\), its stochastic component equals \(-\frac{(\psi \xi _{t})^{-\frac{1}{\gamma }}}{\gamma }\tilde{\theta }_{t}\text {d}\tilde{W}_{t}\). Analogously, the stochastic component of \(g_t(d_t)\) equals \(\partial _{d}g_t(d_t)(\sigma _{d}\rho _S,\sigma _{d}\sqrt{1-\rho _{Sd}^2})\text {d}\tilde{W}_t\), where \(\partial _{d}g_t(d_t)\) denotes the derivative with respect to \(d_t\). The stochastic component of \(X_{t}^{*}\) on (5) equals \(X_{t}^{*}\frac{\tilde{\theta }_{t}}{\gamma }\text {d}\tilde{W}_{t}+X_{t}^{*}\frac{\partial _{d}g_t(d_t)}{g_t(d_t)}(\sigma _{d}\rho _S,\sigma _{d}\sqrt{1-\rho _{Sd}^2})\text {d}\tilde{W}_t\). By matching the stochastic component of this SDE with the stochastic component defining \(X_t\) in (4), the following holds
where \(\pi _{t}^{*}=(\pi _{S,t}^{*},\pi _{F,t}^{*})^{\prime }\in \mathbb {R}^{2}\) are the unconstrained optimal allocations in the risky and fictitious assets respectively. Thus
The first component of \(\pi _{S,t}^{*}\) is the solution that would be obtained if \(\mu _{S,t}\) is assumed to remain unchanged after time t. This would be the myopic solution. The second term is the hedging demand component (HD hereafter), which takes into account the time-varying nature of \(d_t\) after time t.
Since \(\pi _{S,t}\in \left[ a,b\right]\), then the artificial market proposed in Cvitanić and Karatzas [5] is used. This unconstrained market has a Lagrange multiplier process \(\lambda _{S,t} \subseteq \mathbb {R}\), with a risk-free rate of \(r+\delta _{[a,b]}(\lambda _{S,t})\), where \(\delta _{[a,b]}(\lambda _{S,t}):= \sup _{x\in [a,b]}(-\lambda _{S,t} \cdot x)\). Therefore, \(\tilde{\theta }_{t,1}=\alpha d_t/\sigma _S\) changes to \(\alpha d_t+\lambda _{S,t}/\sigma _S\). Replacing last change into (7), the new optimal solution equals to \(\tilde{\pi }_{S,t}=\pi _{S,t}^{*}+\frac{1}{\gamma \sigma _S^2}\lambda _{S,t}\). Finally, Proposition 8.3 of Cvitanić and Karatzas [5] is applied; if there is a \(\lambda _{S,t}\) satisfying
then \(\tilde{\pi }_{S,t}\) is the optimal solution in set [a, b]. Defining \(\tilde{\lambda }_{S,t}=\lambda _{S,t}/\gamma\), latter conditions can be written as
Is not hard to see that conditions (9) and (10) are satisfied with
Proof
-
Case \(\pi _{S,t}^{*} < a\): Solution \(\tilde{\pi }_{S,t} = a \Rightarrow \tilde{\lambda }_{S,t}=\sigma _S^2(a-\pi _{S,t}^{*})\). Since \(\tilde{\lambda }_{S,t}>0\) then \(\delta _{[a,b]}(\tilde{\lambda }_{S,t})=-a\tilde{\lambda }_{S,t}\), and \(\pi _{S,t}^{*}\tilde{\lambda }_{S,t}+\frac{1}{\sigma _S^2}\tilde{\lambda }_{S,t}^2-a\tilde{\lambda }_{S,t}=0\).
-
Case \(\pi _{S,t}^{*} \in [a,b]\): Solution \(\tilde{\pi }_{S,t} = \pi _{S,t}^{*}\), \(\Rightarrow \tilde{\lambda }_{S,t}=0\). Thus \(\delta _{[a,b]}(\tilde{\lambda }_{S,t})=0\), and \(\pi _{S,t}^{*}\tilde{\lambda }_{S,t}+\frac{1}{\sigma _S^2}\tilde{\lambda }_{S,t}^2=0\).
-
Case \(\pi _{S,t}^{*} > b\): Solution \(\tilde{\pi }_{S,t} = b \Rightarrow \tilde{\lambda }_{S,t}=\sigma _S^2(b-\pi _{S,t}^{*})\). Since \(\tilde{\lambda }_{S,t}<0\), then \(\delta _{[a,b]}=-b\tilde{\lambda }_{S}\), and \(\pi _{S,t}^{*}\tilde{\lambda }_{S,t}+\frac{1}{\sigma _S^2}\tilde{\lambda }_{S,t}^2-b\tilde{\lambda }_{S,t}=0\).
\(\square\)
The final step is to derive a closed-form approximation for \(\partial _{d}g_t(d_{t})/g_t(d_{t})\). In such way the method can provide a closed-form approximation for \(\pi _{S,t}^{*}\) in (7), which allows to obtain \(\tilde{\lambda }_{S,t}\) and \(\tilde{\pi }_{S,t}\) using (11). To do so, \(\phi _{F,t}\) must be estimated first. As explained in [1], the indirect utility in the artificial market is an upper bound for the indirect utility function of the true market. It is an upper bound for any values of \(\phi _{F,t}\). Thus, the best bound is the process \(\phi _{F,t}\) minimizing the indirect utility function of the artificial market. As an approximation, later function is minimized over the set \(\mathcal {A}\) composed of the non-stochastic processes \(\phi _{F,t}\). In such case, \(\int _{t}^{T}\phi _{F,u}\text {d}W_{S,u}\sim \mathcal {N}(0,\int _{t}^{T}\phi _{F,u}^2\text {d}u)\). Thus
which is minimized when \(\phi _{F,u}=0 \ \forall u \in [t,T]\). The next step is to assume that \(d_u\) is not stochastic (only for the approximation of \(g_t(d_t)\)). For example, \(d_u\) can be equal to the expected value of such process, that is, \(d_u=d_t\exp {({-}\kappa (u{-}t))}{+}d_{\infty }[1{-}\exp {({-}\kappa (u{-}t))}]\). In that case, \(\int _{t}^{T}d_{u}\text {d}W_{S,u}\sim \mathcal {N}(0,\int _{t}^{T}d_{u}^2\text {d}u)\) and thus
Hence
with
Therefore, the closed-form unconstrained allocation for the risky asset equals
The HD component in (13) increases in magnitude with (i) an increase in \(\frac{\sigma _{d} \rho _{Sd}}{\sigma _{S}}\), which represents the sensitivity of the dividend yield with respect to the asset return, (ii) an increase in the \(\alpha /\sigma _S\) ratio, (iii) increases in the current and long-run dividend yields, since \(D_{t,T}^\kappa\) increases with increases in \(d_t\) and \(d_{\infty }\), (iv) a decrease in \(\kappa\), since \(D_{t,T}^\kappa\) decreases with an increase in \(\kappa\), (v) an increase in the time to horizon, since \(D_{t,T}^\kappa\) increases with an increase in \(T{-}t\). The previous sensitivity results are as expected. The HD component increases in magnitude either when the dividend yield diverges more frequently from \(d_t\) (e.g., smaller \(\kappa\), longer horizon), or when the price of the risky asset is more sensitive to the dividend yield (e.g., higher \(d_t, \alpha\), or \(\frac{\sigma _{d} \rho _{Sd}}{\sigma _{S}}\)).
2.2 Market with two risky assets
The main complexity of this extension comes in determining the Lagrangian processes emerging from the long only-constraints. Now there are two possibly correlated risky asset with prices \(S_t=(S_{1,t},S_{2,t})\) following
where \(\sigma _{S}\in \mathbb {R}^{2\times 2}\) is a volatility matrix and \(W_{S,t} \in \mathbb {R}^{2}\). Let \(\mu _{S,t}=r{+}\tilde{d}_{t}\), with \(\tilde{d}_{t}=(\alpha _1 d_{1,t},\alpha _1 d_{2,t})^{\prime } \in \mathbb {R}^{2}\), and \(d_{t}\) following
where \(\sigma _{d}=\begin{pmatrix} \sigma _{d,1}\rho _{Sd,1} &{} 0 &{} \sigma _{d,1}\sqrt{1{-}\rho _{Sd,1}^2} &{} 0 \\ 0 &{} \sigma _{d,2}\rho _{Sd,2} &{} 0 &{} \sigma _{d,2}\sqrt{1{-}\rho _{Sd,2}^2} \\ \end{pmatrix}\). Equation (2) is adapted to
where \(\pi _{S,t}\in \mathcal {K}= \{ (\pi _{S_1,t},\pi _{S_2,t}) \in \mathbb {R}^{2}:(\pi _{S_1,t},\pi _{S_2,t}) \ge 0\), \(\pi _{S_1,t}+\pi _{S_2,t}\le 1\}\).
To complete the market, two fictitious assets are added \(F_{t}=(F_{1,t},F_{2,t})\) following \(\text {d}F_{t}/F_{t}=(r+\phi _{F,t})\text {d}t+\text {d}W_{d,t}\), where \(\phi _{F,t} \in \mathbb {R}^{2}\) represents the market premium of both assets. The SDF then satisfies \(\text {d}\xi _{t}/\xi _{t}={-}r\text {d}t {-}\tilde{\theta }_{t}\text {d}\tilde{W}_{t}\), with \(\tilde{\theta }_{t}=(\sigma _S^{-1}\tilde{d}_{S,t},\phi _{F,t})^{\prime }\in \mathbb {R}^{4}\), \(\tilde{W}_{t}=(W_{S,t},W_{d,t})^{\prime }\in \mathbb {R}^{4}\). Following the same martingale approach with one risky asset, it is straightforward to show that the optimal unconstrained solution in this complete market equals
with \(g_t(d_t)\) as
Since \(\pi _{S,t}\in \mathcal {K}\), then the artificial market proposed in Cvitanić and Karatzas [5] is used. This unconstrained market has a Lagrange multiplier process \(\lambda _{S,t} \subseteq \mathbb {R}^2\), with a risk-free rate of \(r+\delta _{\mathcal {K}}(\lambda _{S,t})\), where \(\delta _{\mathcal {K}}(\lambda _{S,t}):= \sup _{x\in \mathcal {K}}(-\lambda _{S,t}^{\prime } x)\). Therefore, \(\tilde{\theta }_{t,1}=\sigma _S^{-1}\tilde{d}_{S,t}\) changes to \(\sigma _S^{-1}\tilde{d}_{S,t}+\sigma _S^{-1}\lambda _{S,t}\). Replacing last change into (16), the new optimal solution equals to \(\tilde{\pi }_{S,t}=\pi _{S,t}^{*}+\frac{1}{\gamma } (\sigma _S\sigma _S^{\prime })^{-1}\lambda _{S,t}\). Finally, Proposition 8.3 of Cvitanić and Karatzas [5] is applied exactly as with the one-asset case; if there is a \(\tilde{\lambda }_{S,t} \subseteq \mathbb {R}^2\) satisfying
then \(\tilde{\pi }_{S,t}\) is the optimal solution in \(\mathcal {K}\). Is not hard to see that \(\delta _{\mathcal {K}}(\tilde{\lambda }_{S}):= \sup _{x\in \mathcal {K}}(-x^{\prime }\tilde{\lambda }_{S,t})=\max \{-\tilde{\lambda }_{S_1,t},-\tilde{\lambda }_{S_2,t},0\}\). Let \(\sigma _S\sigma _S^{\prime }=\begin{pmatrix} \sigma _{1}^2 &{} \sigma _{1}\sigma _{2}\rho _{12} \\ \sigma _{1}\sigma _{2}\rho _{12} &{} \sigma _{2}^2, \end{pmatrix}\), representing the covariance matrix of the two assets and denote \((\pi _1,\pi _2)=(\pi _{S_1,t},\pi _{S_2,t})\). Appendix 1 shows that the solution of Eqs. (17) and (18) is given by
where \(A_1=\pi _1^{*}\sigma _1^2{+}\pi _2^{*}\sigma _1\sigma _2\rho _{12}\) and \(A_2=\pi _2^{*}\sigma _2^2{+}\pi _1^{*}\sigma _1\sigma _2\rho _{12}\).
An interesting remark of the closed-form solution in (19) is that it can be applied on other investment opportunity sets, since it only depends on the covariance matrix of both assets and the unconstrained portfolio rule on such market setting.
The last step step is to derive a closed-form approximation for \(\partial _{d}g_t(d_{t})/g_t(d_{t})\). To derive \(\phi _{F,t}\), the same methodology as in the one-asset case is used. Similarly, \(J_0(X_0,\phi _F)\) is minimized when \(\phi _{F,u}=0\), thus:
Let \(B=diag\begin{pmatrix} \exp {({-}\kappa _1(u{-}t))} \\ \exp {({-}\kappa _2(u{-}t))} \end{pmatrix}\). To derive \(g_t(d_{t})\), \(d_u\) can be approximated with
\(d_u=Bd_t{+}d_{\infty }[I{-}B]\). Then
with
Therefore, the closed-form unconstrained allocation for the risky assets equals
2.2.1 Pruning suboptimality example
Note also that the solution proposed can be different from the one obtained by pruning the unconstrained optimal solution to the nearest solution meeting the long-only constraints.Footnote 2 For illustration purposes, suppose that the dividend yield is constant, in which case the unconstrained optimal solution does not change over time and equals the myopic component. Under this assumption and by applying Itô’s Lemma to (15), it is not hard to see that the indirect utility function is
Instead of using \(J_0(X_0)\) to measure the performance of a portfolio rule, the certainty equivalent (CE) is used. The CE is the initial wealth value \(X_0^{CE}\) such that \(U_0(X_0^{CE})=J_0(X_0)\Leftrightarrow X_0^{CE}(\pi _S)=[(1{-}\gamma )J_0(X_0)]^{\frac{1}{1{-}\gamma }}\). Note that in the example
In the following setting \(\sigma _1=5\%, \sigma _1=20\%, \mu _{S}=(0\%,10\%)^{\prime }, \rho _{12}=0.7, r=2\%, \gamma =4\), the unconstrained optimal solution \(\pi _{S}^{*}=\frac{1}{\gamma }(\sigma _S\sigma _S^{\prime })^{-1}(\mu _{S}{-}r)=(-20/3,7/3)^{\prime }\). From (19), \(\tilde{\lambda }_{S}=(0.0085,0)^{\prime }\), which implies \(\tilde{\pi }_{S}=(0,0.5)^{\prime }\). For \(X_0=1,T=50\), then \(X_T^{CE}(\tilde{\pi }_{S})=7.39\) (or annual return of 4%). The nearest feasible solution from \(\pi _{S}^{*}\) is \(\pi _S^{near}=(0,1)^{\prime }\), with \(X_T^{CE}(\pi _S^{near})=2.71\) (or annual return of 2%).
3 Results
To test the methodology with data from the US equity market, a sample of monthly dividend yields and prices were taken from the stocks of the Dow Jones index. For calibration purposes, the stocks chosen were those for which data were available from Jan-1990 to Feb-2024. Stocks that did not pay dividends for long periods of time (e.g. APPL) were not considered. The final sample consisted of 16 stocks out of the 30 stocks composing the index. Figure 4 in Appendix 3 depicts the dividend yields and prices of these 16 stocks.
The next step was to calibrate the model for each of the 16 stocks. For illustration purposes, Table 1 shows the calibration results for the eight stocks that have the highest hedging demands, relative to the unconstrained solution of Eq. (13), when \(t=0\), \(T=50\) and \(\gamma =2\). A priori, these eight stocks are the ones presenting the highest potential for the methodology to outperform the myopic solution. Calibration results for the other eight stocks can be found in Table 2 (Appendix 3).
3.1 Market with one risky asset
Figure 1 shows the increase in the CE obtained by the dynamic solution with respect to the CE of the myopic solution, in the presence of long-only constraints, i.e. \(\pi _{S,t} \in [0,1]\). Note that the formula in (21) cannot be used to estimate the CE, because the dividend yield is dynamic (thus \(\mu _{S,t}\) and \(\tilde{\pi }_{S,t}\) are dynamic too). Hence, the CE is estimated through simulations. The CE is obtained by performing 200,000 simulations of the risky asset and the dividend yield process, with a discretization of \(dt=1/200\) (200 times in a year). These simulations were done in MATLAB R2021b on a personal computer, specifically a MacBook Pro with a Quad-Core Intel Core i5 and 16 GB RAM. For \(T=50\), a simulation is completed in approximately one minute.
As expected, the stocks with higher (lower) differences in CE coincide with the ones with higher (lower) HDs/\(\hat{\pi }\) ratios from Table 1. This relationship is useful because the potential benefit given by the non-myopic solution for a stock can be measured before the simulation is run. Results for the other eight stocks can be found in Fig. 5 (Appendix 3). Figure 2 shows the average allocations to the risky asset for cases \(\gamma =\{2,6\}\). Since the HD is positive, it is expected that the dynamic solution will over-weight the allocation to the risky asset. This is because a negative shock to the price of the risky asset generally comes with a positive shock to the dividend yield (\(\rho _{Sd}\, {\ll}\,0\)), which increases the drift of that asset (recall \(\mu _{S,t}=r+\alpha d_{t}\)). The figure also shows how allocations to the risky asset increase (decrease) in the scenarios where the dividend yield is above (below) the long-term dividend yield \(d_{\infty }\). As expected, the HD decreases when approaching the horizon and the allocation to the risky asset decreases when the risk aversion is higher.
Appendix 2 shows how to implement the method explained in Bick et al.[1] (BKM hereafter) for this dividend model. For the one-asset case, the BKM method produces almost the same results as the methodology proposed in this study. The reasons are the following: (i) the approximation of \(g_t(d_{t})\) to be made in BKM is similar to \(\hat{g}_t(d_{t})\) in (12) with these data, thus both methods deliver the same unconstrained solution, and (ii) the constrained solution found using (11) coincides with the pruned solution of BKM. Even if both solutions are similar, the implementation of the BKM method is more complex than that of the method proposed in this study. As explained in Appendix 2, the BKM method requires solving an optimization problem to derive the parameters included in the artificial market, which has to be done numerically. Function \(g_t(d_{t})\) and its derivative must be computed using numerical integration methods. Thus, the solution cannot be found on a spreadsheet and finding the CE through a simulation can take a considerable amount of time.
3.2 Market with two risky assets
With these calibration results, the unconstrained solution derived in Eq. (20) is similar to the unconstrained solution derived using the BKM method. However, there are differences when delivering a solution satisfying long-only constrains. Similar to the example in Sect. 2.2.1, BKM prunes the unconstrained solution instead of finding the Lagrangian solution in (19). The following link provides an spreadsheet that demonstrates how to find the constrained solution using the method proposed in this study.
Figure 3 shows the CE increase w.r.t. the myopic solution and w.r.t the BKM solution for four pairs of the eight stocks shown in Table 1. As expected, the CE increase is higher w.r.t. the myopic solution in most cases. However, the CE increase w.r.t the myopic solution is much higher than the CE increase w.r.t the BKM solution for \(\gamma =4\) and \(\gamma =6\). In those settings, this means that considering the HD is more important than using the Lagrange duals in (19). When risk aversion increases, allocation to both stocks are reduced. Hence, the unconstrained solution satisfies long-only constraints, which explains the similarities of the proposed solution to BKM. For \(\gamma =2\), the unconstrained allocations in both stocks can add more than one. Thus deriving the Lagrange duals in (19), instead of pruning the unconstrained solution, produces more differences w.r.t. the BKM solution. In terms of computational time, it took around 25 min to derive the CE for \(T=50\), by performing 200,000 simulations of each risky asset and each dividend yield process, with a discretization of \(dt=1/200\).
4 Conclusion
This work presents an example of how to solve a portfolio problem under the Merton framework with closed-form approximations instead of hard-to-implement numerical methods. The examples illustrate how important it is to include the time-varying nature of markets to prevent welfare losses, and also how to correctly use the non-constrained solution to derive the constrained solution. The methodology presented can be reduced to solving the following tasks: (i) derive the parameters of the artificial assets (e.g., \(\phi _{F,t}\)), (ii) find closed-form approximations of the HD (e.g., \(g_t(d_t)\)) and (iii) determine the Lagrange processes emerging from portfolio constraints (e.g., \(\tilde{\lambda }_{S,t}\)). Evidently, this work can be extended by changing the investment opportunity set. For example, include another time-varying driver besides the dividend yield, or develop a method to derive \(\tilde{\lambda }_{S,t}\) for multiple assets and/or other portfolio constraints. Another possibility is to apply the method to a life cycle problem.
Data availibility statement
The dataset used in the experiments is publicly available, since it corresponds to historical dividends and prices of public traded companies.
Notes
With one risky asset, Eq. (11) shows that the solution proposed coincides with the nearest solution meeting the long-only constraints.
References
Bick, B., Kraft, H., Munk, C.: Solving constrained consumption-investment problems by simulation of artificial market strategies. Manage. Sci. 59(2), 485–503 (2013)
Castañeda, P., Reus, L.: Suboptimal investment behavior and welfare costs: a simulation based approach. Financ. Res. Lett. 30, 170–180 (2019)
Cochrane, J.H.: Permanent and transitory components of GNP and stock prices*. Q. J. Econ. 109(1), 241–265 (1994)
Cochrane, J.H.: Portfolios for long-term investors. Rev. Finance 26(1), 1–42 (2022)
Cvitanić, J., Karatzas, I.: Convex duality in constrained portfolio optimization. Ann. Appl. Prob. 767–818 (1992)
Kamma, T., Pelsser, A.: Near-optimal asset allocation in financial markets with trading constraints. Eur. J. Oper. Res. 297(2), 766–781 (2022)
Karatzas, I., Lehoczky, J.P., Shreve, S.E., Xu, G.-L.: Martingale and duality methods for utility maximization in an incomplete market. SIAM J. Control. Optim. 29(3), 702–730 (1991)
Larsen, L.S., Munk, C.: The costs of suboptimal dynamic asset allocation: general results and applications to interest rate risk, stock volatility risk, and growth/value tilts. J. Econ. Dyn. Control 36(2), 266–293 (2012)
Merton, R. C.: Lifetime portfolio selection under uncertainty: the continuous-time case. Rev. Econ. Stat. 247–257 (1969)
Munk, C., Sørensen, C.: Dynamic asset allocation with stochastic income and interest rates. J. Financ. Econ. 96(3), 433–462 (2010)
Funding
Lorenzo Reus acknowledges financial support from ANID Fondecyt project #1231660.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix 1: Optimal Lagrange multiplier process
Let \((\sigma _S\sigma _S^{\prime })^{-1}=\frac{1}{\sigma _{1}^2\sigma _{2}^2(1-\rho _{12}^2)}\begin{pmatrix} \sigma _{2}^2 &{} -\rho _{12}\sigma _{1}\sigma _{2} \\ -\rho _{12}\sigma _{1}\sigma _{2} &{} \sigma _{1}^2 \\ \end{pmatrix}\). Conditions (17) and (18) reduce to
Define \(A_1=\pi _1\sigma _{1}^2+\pi _2\sigma _{1}\sigma _{2}\rho _{12}\) and \(A_2=\pi _2\sigma _{2}^2+\pi _1\sigma _{1}\sigma _{2}\rho _{12}\)
-
1.
Case \(\pi _1<0, \pi _2 < 0\) It is not possible to have \(A_1>0\) and \(A_2>0\), otherwise term \(\pi _1 A_1+\pi _2 A_2<0 \Rightarrow \pi _1^2\sigma _{1}^2 +\pi _2^2\sigma _{2}^2 +2\pi _2\pi _1\sigma _{1}\sigma _{2}\rho _{12}<0\), which is a contradiction. Thus
Solution 1: \(\tilde{\pi } = (0,0) \Rightarrow \tilde{\lambda }_{1}=-\pi _1\sigma _{1}^2-\pi _2\sigma _{1}\sigma _{2}\rho _{12}=-A_1\) and \(\tilde{\lambda }_{2}=-A_2\). To satisfy (18), then \(\delta _{\mathcal {K}}=0\) must hold. Thus, \(\tilde{\lambda }_{1}\ge 0\) and \(\tilde{\lambda }_{2}\ge 0\), that is, \(A_1\le 0\) and \(A_2\le 0\).
Solution 2: \(\tilde{\pi }=(\pi _1+\frac{\rho _{12}\sigma _2}{\sigma _1}\pi _2,0)=(\frac{A_1}{\sigma _1^2},0)\), where \(\tilde{\lambda }=(0,{-}\pi _2(1-\rho _{12}^2)\sigma _2^2)\). Thus \(\delta _{\mathcal {K}}=0\) and (18) holds. To satisfy (17), \(0\le A_1\le \sigma _1^2\) must hold. Since \(0\le A_1\), then \(A_2\le 0\).
Solution 3: \(\tilde{\pi }=(1,0)\). In this case \(\tilde{\lambda }_{1}=(1-\pi _1)\sigma _{1}^2-\pi _2\sigma _{1}\sigma _{2}\rho _{12}=\sigma _{1}^2-A_1\) and \(\tilde{\lambda }_{2}=-\pi _2\sigma _{2}^2+(1-\pi _1)\sigma _{1}\sigma _{2}\rho _{12}=-A_2+\sigma _{1}\sigma _{2}\rho _{12}\). To satisfy (18), \(\delta _{\mathcal {K}}=-\tilde{\lambda }_{1}\)- must hold. Thus, \(\tilde{\lambda }_{1}\le 0\) and \(\tilde{\lambda }_{1}\le \tilde{\lambda }_{2}\), that is, \(A_1>\sigma _{1}^2\). Again, \(A_2\le 0\). By symmetry, there are two more solutions. In summary
-
\(\tilde{\pi }=(0,0), \tilde{\lambda }=({-}A_1,{-}A_2) \ \text {if} \ A_1 \le 0, A_2 \le 0\)
-
\(\tilde{\pi }=(\pi _1+\frac{\rho _{12}\sigma _2}{\sigma _1}\pi _2,0), \tilde{\lambda }=(0, -\pi _2(1-\rho _{12}^2)\sigma _2^2)\) if \(0\le A_1\le \sigma _1^2, A_2 \le 0\)
-
\(\tilde{\pi }=(1,0), \tilde{\lambda }=(\sigma _{1}^2-A_1,-A_2{+}\sigma _{1}\sigma _{2}\rho _{12})\), if \(A_1\ge \sigma _{1}^2, A_2 \le 0\)
-
\(\tilde{\pi }=(0,\pi _2{+}\frac{\rho _{12}\sigma _1}{\sigma _2}\pi _1), \tilde{\lambda }=({-}\pi _1(1-\rho _{12}^2)\sigma _1^2,0)\) if \(0\le A_2\le \sigma _2^2, A_1 \le 0\)
-
\(\tilde{\pi }=(0,1), \tilde{\lambda }=(-A_1+\sigma _{1}\sigma _{2}\rho _{12},\sigma _{2}^2-A_2)\), if \(A_2\ge \sigma _{2}^2, A_1 \le 0\)
-
-
2.
Case \(\pi _1<0, \pi _2 \in [0,1]\)
Solution 1: \(\tilde{\pi }=(0,0), \tilde{\lambda }=(-A_1,-A_2)\) satisfy conditions if \(A_1 \le 0, A_2 \le 0\). In this case though, \(A_2 \le 0\) implies \(A_1 \le 0\). Thus \(A_2 \le 0\) must hold.
Solution 2: \(\tilde{\pi }=(0,\pi _2+\frac{\rho _{12}\sigma _1}{\sigma _2}\pi _1), \tilde{\lambda }=(-\pi _1(1-\rho _{12}^2)\sigma _1^2,0)\) satisfy conditions if \(0\le A_2\le \sigma _2^2\).
Solution 3: \(\tilde{\pi }=(0,1), \tilde{\lambda }=(-A_1+\sigma _{1}\sigma _{2}\rho _{12},\sigma _{2}^2-A_2)\) satisfy conditions if \(A_2\ge \sigma _{2}^2\).
-
3.
Case \(\pi _1<0, \pi _2>1\): Same solution as case 2.
-
4.
Case \(\pi _1\in [0,1], \pi _2<0\): By symmetry with case 2
Solution 1: \(\tilde{\pi }=(0,0), \tilde{\lambda }=(-A_1,-A_2)\) satisfy conditions if \(A_1 \le 0, A_2 \le 0\). In this case though, \(A_1 \le 0\) implies \(A_2 \le 0\). Thus \(A_1 \le 0\) must hold.
Solution 2: \(\tilde{\pi }=(\pi _1+\frac{\rho _{12}\sigma _2}{\sigma _1}\pi _2,0), \tilde{\lambda }=(0,-\pi _2(1-\rho _{12}^2)\sigma _2^2)\) satisfy conditions if \(0\le A_1\le \sigma _1^2\)
Solution 3: \(\tilde{\pi }=(1,0), \tilde{\lambda }=(\sigma _{1}^2-A_1,-A_2+\sigma _{1}\sigma _{2}\rho _{12})\) satisfy conditions if \(A_1\ge \sigma _{1}^2\).
-
5.
Case \(\pi _1>1, \pi _2<0\): Same solution as case 4
-
6.
Case \(\pi _1\in [0,1], \pi _2\in [0,1],\pi _1+\pi _2 \le 1\): The only solution is \(\tilde{\pi }=(\pi _1,\pi _2), \tilde{\lambda }=(0,0)\)
-
7.
Case \(\pi _1\in [0,1], \pi _2\in [0,1], \pi _1+\pi _2>1\): The following cannot hold
-
(i)
\(A_1<0\), \(A_2<0\): To see this, suppose otherwise. Term \(\pi _1 A_1+\pi _2 A_2<0 \Rightarrow \pi _1^2\sigma _{1}^2 +\pi _2^2\sigma _{2}^2 +2\pi _2\pi _1\sigma _{1}\sigma _{2}\rho _{12}<0\), which is a contradiction.
-
(ii)
\(A_2<\sigma _{2}^2\), \(A_2>A_1+(\sigma _{2}^2-\sigma _{1}\sigma _{2}\rho _{12})\): \(A_2<\sigma _{2}^2\) implies \(A_2<A_1+(\sigma _{2}^2-\sigma _{1}\sigma _{2}\rho _{12})\).
-
(iii)
\(A_1<\sigma _{1}^2\), \(A_2<A_1-(\sigma _{1}^2-\sigma _{1}\sigma _{2}\rho _{12})\): \(A_1<\sigma _{1}^2\) implies \(A_2>A_1-(\sigma _{1}^2-\sigma _{1}\sigma _{2}\rho _{12})\).
-
(iv)
\(A_1<0\), \(A_2>A_1+(\sigma _{2}^2-\sigma _{1}\sigma _{2}\rho _{12})\): Since \(A_1<0\), then \(\sigma _{1}\sigma _{2}\rho _{12}<0\). This implies \((1-\pi _2)(\sigma _2^2-\sigma _{1}\sigma _{2}\rho _{12})+\pi _1(\sigma _1^2-\sigma _{1}\sigma _{2}\rho _{12})>0\), that is \(A_2<A_1+(\sigma _{2}^2{-}\sigma _{1}\sigma _{2}\rho _{12})\).
-
(v)
\(A_2<0\), \(A_2<A_1-(\sigma _{1}^2-\sigma _{1}\sigma _{2}\rho _{12})\): Since \(A_2<0\), then \(\sigma _{1}\sigma _{2}\rho _{12}<0\). This implies \((1-\pi _1)(\sigma _1^2-\sigma _{1}\sigma _{2}\rho _{12})+\pi _2(\sigma _2^2-\sigma _{1}\sigma _{2}\rho _{12})>0\), that is \(A_2>A_1-(\sigma _{1}^2-\sigma _{1}\sigma _{2}\rho _{12})\).
Solution 1: \(\tilde{\pi }=(1,0), \tilde{\lambda }=(\sigma _{1}^2-A_1,-A_2+\sigma _{1}\sigma _{2}\rho _{12})\). To satisfy (18), \(\delta _{\mathcal {K}}=-\tilde{\lambda }_1\) must hold. Thus \(A_1\ge \sigma _{1}^2\) and \(A_2 \le A_1-(\sigma _{1}^2-\sigma _{1}\sigma _{2}\rho _{12})\). Such conditions imply \(0\le A_2\).
Solution 2: \(\tilde{\pi }=(0,1), \tilde{\lambda }=(-A_1+\sigma _{1}\sigma _{2}\rho _{12},\sigma _{2}^2-A_2)\). To satisfy (18), \(\delta _{\mathcal {K}}=-\tilde{\lambda }_2\) must hold. Thus \(A_2\ge \sigma _{2}^2\) and \(A_2 \ge A_1+(\sigma _{2}^2{-}\sigma _{1}\sigma _{2}\rho _{12})\). Such conditions imply \(0\le A_1\).
Solution 3:
$$\begin{aligned} \tilde{\pi }_1 =\pi _1 - \frac{(\sigma _{2}^2{-}\sigma _{1}\sigma _{2}\rho _{12})(\pi _1{+}\pi _2{-}1)}{\sigma _2^2{-}2\sigma _1\sigma _2\rho _{12}{+}\sigma _1^2}, \quad \tilde{\pi }_2 =\pi _2 - \frac{(\sigma _{1}^2{-}\sigma _{1}\sigma _{2}\rho _{12})(\pi _1{+}\pi _2{-}1)}{\sigma _2^2{-}2\sigma _1\sigma _2\rho _{12}{+}\sigma _1^2}{,} \end{aligned}$$which implies \(\tilde{\lambda }_1 = \tilde{\lambda }_2 = {-}\frac{\sigma _1^2\sigma _2^2(\pi _1{+}\pi _2{-}1)}{\sigma _2^2{-}2\sigma _1\sigma _2\rho _{12}{+}\sigma _1^2}\). Note that \(\tilde{\pi }_1+\tilde{\pi }_2=1\) and \(\tilde{\lambda }_1 \le 0\). Thus, (18) holds. To satisfy (17), \(0\le \tilde{\lambda }_1 \le 1\) must hold. This means that the following two inequalities must hold: \(A_2 \ge A_1 - (\sigma _{1}^2-\sigma _{1}\sigma _{2}\rho _{12})\) and \(A_2 \le A_1+(\sigma _{2}^2-\sigma _{1}\sigma _{2}\rho _{12})\). Note that both inequalities imply \(A_1 \le \sigma _{1}^2\) and \(A_2 \le \sigma _{2}^2\).
-
(i)
-
8.
Case \(\pi _1 \in [0,1]\), \(\pi _2 > 1\): Solutions and conditions are the same as in case 7. What changes with respect to case 7 is that 7.(iv) can hold.
-
9.
Case \(\pi _1 > 1\), \(\pi _2 \in [0,1]\): Solutions and conditions are the same as in case 7. What changes with respect to case 7 is that 7.(v) can hold.
-
10.
Case \(\pi _1 > 1\), \(\pi _2 > 1\): Solutions and conditions are the same as in case 7. What changes with respect to case 7 is that 7.(iv) and 7.(v) can hold.
Appendix 2: BKM implementation
BKM assumes that both \(\phi _{F,t}\) and \(\lambda _t\) are affine, i.e. \(\phi _{F,t}=\phi _{0}+\phi _{1}t\) and \(\lambda _{S,t}=\lambda _0+\lambda _1t\). The first step is to estimate parameters \(\mathcal {A}:=\{\phi _{0}, \phi _{1}, \lambda _0, \lambda _1\}\) by minimizing the indirect utility function of the artificial market. For the one-asset case, this market has risk-free rate of \(r+\delta _{[0,1]}(\lambda _{S,t})=r+\max \{{-}\lambda _{S,t},0\}\), and \(\tilde{\theta }_{t,1}=\alpha d_t+\lambda _{S,t}/\sigma _S\). Thus
Using the approximation \(d_u=d_t\exp {({-}\kappa (u{-}t))}{+}d_{\infty }[1{-}\exp {({-}\kappa (u{-}t))}]\) (same as when deriving \(\hat{g}_t(d_{t})\) in (12)), then
Under this structure, \(\{\phi _{0}, \phi _1\}\) can be found independently from \(\{\lambda _0, \lambda _1\}\). Similar to the results in Sect. 2.1, is not hard to see that the optimal is found with \(\phi _{0}=\phi _{1}=0\). Thus the problem reduces in finding
which can be obtained numerically. Let \(\lambda _{S,t}^{*}=\lambda _0^{*}+\lambda _1^{*}t\) be the optimal found previously. Hence, function \(g_t(d_{t})\) can be estimated as
Thus, the approximation of the optimal allocation in the risky asset is similar to Eq. (7), that is
Numerical methods must be used to estimate \(\frac{\partial _{d}g^{BKM}_t(d_{t})}{g^{BKM}_t(d_{t})}\); first to estimate \(g^{BKM}_t(d_{t})\) and then to estimate the derivative with respect to \(d_t\). The constrained allocation is
Appendix 3: Supplementary data and results
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Reus, L. Merton portfolio allocation under stochastic dividends. Optim Lett (2024). https://doi.org/10.1007/s11590-024-02125-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11590-024-02125-w