Impulse Control of Conditional McKean–Vlasov Jump Diffusions

Agram, Nacira; Pucci, Giulia; Øksendal, Bernt

doi:10.1007/s10957-023-02370-6

Impulse Control of Conditional McKean–Vlasov Jump Diffusions

Open access
Published: 19 January 2024

Volume 200, pages 1100–1130, (2024)
Cite this article

Download PDF

You have full access to this open access article

Journal of Optimization Theory and Applications Aims and scope Submit manuscript

Impulse Control of Conditional McKean–Vlasov Jump Diffusions

Download PDF

Nacira Agram¹,
Giulia Pucci¹ &
Bernt Øksendal²

855 Accesses
1 Citation
Explore all metrics

Abstract

In this paper, we consider impulse control problems involving conditional McKean–Vlasov jump diffusions, with the common noise coming from the $\sigma $-algebra generated by the first components of a Brownian motion and an independent compensated Poisson random measure. We first study the well-posedness of the conditional McKean–Vlasov stochastic differential equations (SDEs) with jumps. Then, we prove the associated Fokker–Planck stochastic partial differential equation (SPDE) with jumps. Next, we establish a verification theorem for impulse control problems involving conditional McKean–Vlasov jump diffusions. We obtain a Markovian system by combining the state equation with the associated Fokker–Planck SPDE for the conditional law of the state. Then we derive sufficient variational inequalities for a function to be the value function of the impulse control problem, and for an impulse control to be the optimal control. We illustrate our results by applying them to the study of an optimal stream of dividends under transaction costs. We obtain the solution explicitly by finding a function and an associated impulse control, which satisfy the verification theorem.

The Stochastic Maximum Principle for a Jump-Diffusion Mean-Field Model Involving Impulse Controls and Applications in Finance

Article 04 January 2019

A Stochastic Maximum Principle for a Markov Regime-Switching Jump-Diffusion Model with Delay and an Application to Finance

Article 25 August 2017

Pointwise Second-Order Necessary Conditions for Stochastic Optimal Control with Jump Diffusions

Article 29 August 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Consider a complete filtered probability space $(\varOmega ,\mathcal {F},P, \mathbb {F}=\{\mathcal {F}_t\}_{t\ge 0})$ on which we are given a d-dimensional Brownian motion $B=(B_1, B_2, \ldots , B_d)$, a k-dimensional compensated Poisson random measure $\widetilde{N}(\text {d}t,\text {d}z)$ such that

$$\begin{aligned} \widetilde{N}(\text {d}t,\text {d}z)=N(\text {d}t,\text {d}z)-\nu (\text {d}z)\text {d}t, \end{aligned}$$

where $N(\text {d}t,\text {d}z)$ is a Poisson random measure and $\nu (\text {d}z)$ is the Lévy measure of N, and a random variable $Z \in L^2(P)$ that is independent of $\mathbb {F}$. We denote by $L^2(P)$ the set of all the d-dimensional, $\mathbb {F}$-measurable random variables X such that ${\mathbb { E}}[X^2]<\infty ,$ where ${\mathbb { E}}$ denotes the expectation with respect to P. We consider the state process $X(t) \in \mathbb {R}^d$ given as the solution of the following conditional McKean–Vlasov jump diffusion

$$\begin{aligned} X(t)&=Z+\int _0^t\alpha (s,X(s),\mu _s)\text {d}s+\beta (s,X(s),\mu _{s})\text {d}B(s)\nonumber \\&\quad +\int _0^t\int _{ \mathbb {R}^k }\gamma (s,X(s^-),\mu _{s^{-}},z)\widetilde{N}(\text {d}s,\text {d}z), \end{aligned}$$

(1)

where we denote by $\mu _t=\mathcal {L}(X(t) | \mathcal {G}_t)$ the conditional law of X(t) given the natural filtration $\mathcal {G}_t$ generated by the first components $(B_1(t),\widetilde{N}_1(t,z))$ of the Brownian motion and an independent compensated Poisson random measure, respectively, up to time t. Loosely speaking, equation of the form (1) models a McKean–Vlasov dynamics which is subject to what is called a "common noise" coming from $(B_1(t),\widetilde{N}_1(t,z))$, which is observed and is influencing the dynamics of the system. This type of equation arises naturally in the framework of a particle system in the large-scale limit where the particles interact in the mean-field way with common noise. For instance, see Erny [13] for the unconditional case, where the well-posedness and the propagation of chaos for McKean–Vlasov SDE with jumps have been studied and with locally Lipschitz coefficients.

Conditional propagation of chaos (in continuous case) has been studied in the literature; for example, we refer to Carmona et al [9], Coghi and Flandoli [11]. There the common noise is represented by a common Brownian motion, which is already presented at the finite particle system. We refer also to the recent papers by Buckdahn et al. [7] and Lacker et al. [17, 18] for related results. However, in Erny et al. [14], the common noise is only presented at the limiting level. It comes from the joint action of the small jumps of the finite size particle system. They obtain a Fokker–Planck SPDE related to the one considered in Agram and Øksendal [1]. Since we are not concern about finite particle system in this paper, we work directly with the limit system (1). However, conditional propagation of chaos for a common noise presented by both the Brownian motion and the compensated Poisson random measure will be a purpose of future research.

For application of conditional propagation of chaos in a spatial stochastic epidemic model, we refer to Vuong et al. [22] in the continuous case.

To the best of our knowledge, none of the papers mentioned above deal with the common noise coming from both the Brownian and the Poissonian measure as we consider in the current paper. Moreover, the type of Fokker–Planck SPDE with jumps obtained, in this paper, as the equation for the conditional law of the state has not been considered in the literature.

Precisely, in the present paper, we are going to study impulse control problems for conditional McKean–Vlasov jump diffusions. In particular, we will define a performance criterion and then attempt to find a policy that maximizes performance within the admissible impulse strategies. Using a verification theorem approach, we establish a general form of quasi-variational inequalities and identify the sufficient conditions that lead to an optimal function. See precise formulation below. Standard impulse control problems can be solved by using the Dynkin formula. We refer to e.g. Bensoussan and Lions [4] in the continuous case and to Øksendal and Sulem [20] in the setting of jump diffusions.

Impulse control problems naturally arise in many concrete applications, in particular when an agent, because of the intervention costs, decides to control the system by intervening only at a discrete set of times with a chosen intervention size: a sequence of stopping times $(\tau _1,\tau _2,\ldots ,\tau _k, \ldots )$ is chosen to intervene and exercise the control. At each time $\tau _k$ of the player’s $\textit{k}^{th}$ intervention, the player chooses an intervention of size $\zeta _k$. The impulse control consists of the sequence $\{ (\tau _k,\zeta _k) \}_{k\ge 1}$.

Impulse control has sparked great interest in the financial field and beyond. See, for example, Korn [15] for portfolio theory applications, Basei [2] for energy markets, and Cadenillas et al. [8] for insurance. All of these works are based on quasi-variational inequalities and employ a verification approach.

Despite its adaptability to more realistic financial models, few papers have studied the case of mean-field problems with impulse control. We refer to Basei et al. [3] for a discussion of a more special type of impulse, where the only type of impulse is to add something to the system. Precisely, they consider a mean-field game (MFG) where the mean-field (only the empirical mean) appears as an approximation of a many-player game. Moreover, they use the smooth fit principle (as used in the present work) to solve a specific MFG explicitly.

We refer also to Christensen et al. [10] for a MFG impulse control approach. Specifically, a problem of optimal harvesting in natural resource management is addressed.

A maximum principle for regime switching control problem for mean-field jump diffusions is studied by Li et al. [19] but in that paper the problem considered is not really an impulse control problem because the intervention times are fixed in advance.

In our setting, we do not consider a MFG setup, as in the above-mentioned works. Instead, we consider a decision-maker who chooses the control to optimize a certain reward. Moreover, the mean-field appears as a conditional probability distribution, and to overcome the lack of the Markov property, we introduce the equation of the law, which is of stochastic Fokker–Planck type.

In Djehiche et al. [12], the authors could handle a non-Markovian dynamics. However, the impulse control is given in a particular compact form, and only a given number of impulses are allowed. They use a Snell envelope approach and related reflected backward stochastic differential equations.

The rest of the paper is organized as follows: In the next section, we study the well-posedness of the conditional McKean–Vlasov SDE (1). Section 3 is devoted to the Fokker–Planck SPDE with jumps. In Sect. 4, we state the optimal control problem and prove the verification theorem. In Sect. 5, we apply the previous results to solve an explicit problem of optimal dividend streams under transaction costs.

2 Conditional McKean–Vlasov SDEs with Jumps

Let us first study the well-posedness of the conditional McKean–Vlasov dynamics. We mean by $X_t \in {\mathbb { R}}^d$ the mean-field stochastic differential equation with jumps, from now on called a McKean–Vlasov jump diffusion, of the form

$$\begin{aligned} \text {d}X_j(t)&=\alpha _j(t,X(t),\mu _t)\text {d}t+\sum _{n=1}^m \beta _{j,n}(t,X(t),\mu _t)\text {d}B_n(t)\nonumber \\&\quad + \sum _{\ell =1}^k\int _{\mathbb {R}^k }\gamma _{j,\ell }(t,X(t^{-}),\mu _{t^{-}},z)\widetilde{N}_{\ell }(\text {d}t,\text {d}z);\quad j=1,2, ..., d \nonumber \\ X_0&=x\in \mathbb {R}^d, \end{aligned}$$

(2)

or, using matrix notation,

$$\begin{aligned} \text {d}X(t)=\alpha (t,X(t),\mu _t) \text {d}t + \beta (t,X(t),\mu _t) \text {d}B(t) + \int _{\mathbb {R}^k} \gamma (t,X(t^{-}),\mu _t,z)\widetilde{N}(\text {d}t,\text {d}z), \end{aligned}$$

where $B(t)=(B_1(t),B_2(t),..., B_m(t))^{T} \in {\mathbb { R}}^m = {\mathbb { R}}^{m\times 1}, \widetilde{N}=(\widetilde{N}_1,..., \widetilde{N}_k)^{T}\in {\mathbb { R}}^k={\mathbb { R}}^{k\times 1}$ are, respectively, an m-dimensional Brownian motion and a k-dimensional compensated Poisson random measure on a complete filtered probability space $(\varOmega ,\mathcal {F},\mathbb {F}=\{\mathcal {F}_t\}_{t\ge 0},P)$, and $\beta =(\beta _{j,n}) \in \mathbb {R}^{d \times m}, \gamma =(\gamma _{j,\ell }) \in \mathbb {R}^{d \times k}$.

We assume that for all $\ell ; 1 \le \ell \le k,$ the Lévy measure of $N_{\ell }$, denoted by $\nu _{\ell }$, satisfies the condition $\int _{\mathbb {R}^k}z^2 \nu _{\ell }(\text {d}z) < \infty $, which means that $N_{\ell }$ does not have many big jumps (but $N_{\ell }$ may still have infinite total variation near 0). This assumption allows us to use the version of the Itô formula for jump diffusion given, e.g. in Theorem 1.16 in Øksendal and Sulem [20].

We denote, by $\mathcal {G}_t$, the sub-filtrations of $\mathcal {F}_t$, generated by the first components of both the Brownian motion B and the compensated Poisson random measure $\widetilde{N}$. The sub-filtration $\mathcal {G}_t$ satisfies the usual conditions as $\mathcal {F}_t$.

We define $\mu _t^X$ to be regular conditional distribution of $X_t$ given $\mathcal {G}_t$. This means that $\forall t \ge 0$, $\mu _t^X$ is a Borel probability measure on ${\mathbb { R}}^d$ and

$$\begin{aligned} \int _{\mathbb {R}^n} g(x)\mu _t^X(\text {d}x)= {\mathbb { E}}[g(X_t) | \mathcal {G}_t ](\omega ) \end{aligned}$$

(3)

for all functions g such that ${\mathbb { E}}[ |g(X_t) |] < \infty $. We refer to Theorem 9 in Protter [21].

We shall define the special weighted Sobolev norm on the space of measures.

Definition 2.1

Let d be a given natural number. Then, let $\mathbb {M}=\mathbb {M}^{d}$ be the pre-Hilbert space of random measures $\mu $ on $\mathbb {R}^{d}$ equipped with the norm

$$\begin{aligned} \left\| \mu \right\| _{\mathbb {M}}^{2}:= \mathbb {E[}\int _{\mathbb {R}^{d}} |\hat{\mu }(y)|^{2}e^{-y^{2}}\text {d}y]\text {,} \end{aligned}$$

(4)

where $y=(y_1,y_2,...,y_d)\in \mathbb {R}^{d}$ and $\hat{\mu }$ is the Fourier transform of the measure $\mu $, i.e.

$$\begin{aligned} \begin{array}{lll} \hat{\mu }(y):= & {} {\int _{\mathbb {R}^{d}} }e^{-ixy}\mu (\text {d}x);\quad y\in \mathbb {R}^{d}, \end{array} \end{aligned}$$

where $xy =x \cdot y = x_1 y_1 + x_2 y_2 +... + x_d y_d$ is the scalar product in $\mathbb {R}^{d}$.

If $\mu ,\eta \in \mathbb {M}$, we define the inner product $\left\langle \mu ,\eta \right\rangle _{\mathbb {M}}$ by

$$\begin{aligned} \left\langle \mu ,\eta \right\rangle _{\mathbb {M}}=\mathbb {E}{[} \int _{{\mathbb {R}^{d}}}{\text {Re}}(\overline{\hat{\mu }}(y)\hat{\eta }(y))|y|^2e^{-y^{2}}\text {d}y], \end{aligned}$$

where ${\text {Re}}(z)$ denotes the real part and $\bar{z}$ denotes the complex conjugate of the complex number z.

The space $\mathbb {M}$ equipped with the inner product $\left\langle \mu ,\eta \right\rangle _{\mathbb {M}}$ is a pre-Hilbert space. For not having ambiguity, we will also use the notation $\mathbb {M}$ for the completion of this pre-Hilbert space.

Lemma 2.1

Let $X_{1}$ and $X_{2}$ be two d-dimensional random variables in $L^{2}(\mathbb {P})$. Thus,

$$\begin{aligned}{}\begin{array}{lll} \left\| \mathcal {L}(X_{1}|\mathcal {G}_t)-\mathcal {L}(X_{2}|\mathcal {G}_t)\right\| _{\mathbb {M}}^{2}\le & {} \pi \ \mathbb {E}[(X_{1}-X_{2})^{2}]\text {.} \end{array} \end{aligned}$$

Proof

By using the relation (4), we have

$$\begin{aligned}&\left\| \mathcal {L}(X_{1}|\mathcal {G}_t)-\mathcal {L}(X_{2}|\mathcal {G}_t)\right\| _{\mathbb {M}}^{2}\\&\quad =\mathbb {E}\int _{\mathbb {R}^{d}} \left| \widehat{\mathcal {L}}(X_{1}|\mathcal {G}_t)(y)-\widehat{\mathcal {L}}(X_{2}|\mathcal {G}_t)(y)\right| ^{2}e^{-y^{2}}\text {d}y\\&\quad =\mathbb {E}\int _{\mathbb {R}^{d}} \left| \int _{\mathbb {R}^{d}} e^{-ix_1y}\text {d}\mathcal {L}(X_1|\mathcal {G}_t)(x_1)-\int _{\mathbb {R}^{d}} e^{-ix_2y}\text {d}\mathcal {L}(X_2|\mathcal {G}_t)(x_2)\right| ^{2}e^{-y^{2}}\text {d}y\\&\quad =\mathbb {E}\int _{\mathbb {R}^{d}} \left| \int _{\mathbb {R}^{d}} (e^{-ix_1y}-e^{-ix_2y})\text {d}\mathcal {L}((X_1,X_2)|\mathcal {G}_t)(x_1,x_2)\right| ^{2}e^{-y^{2}}\text {d}y \\&\quad =\mathbb {E}\int _{\mathbb {R}^{d}} \left| \mathbb {E} (e^{-iX_1y}-e^{-iX_2y})|\mathcal {G}_t)\right| ^{2}e^{-y^{2}}\text {d}y. \end{aligned}$$

Using the fact that conditioning is a contractive projection of $L^{2}$ spaces and by standard properties of the complex exponential function, we obtain

$$\begin{aligned} \mathbb {E}\int _{\mathbb {R}^{d}} \left| \mathbb {E} (e^{-iX_1y}-e^{-iX_2y})|\mathcal {G}_t)\right| ^{2}e^{-y^{2}}\text {d}y&\le \mathbb {E} \int _{\mathbb {R}^{d}} \mathbb {E}\Big [\left| e^{-iX_1y}-e^{-iX_2y}\right| ^2\Big ]e^{-y^{2}}\text {d}y\\&\le \int _{\mathbb {R}^{d}} y^{2}e^{-y^{2}}\text {d}y\mathbb {E}[\left| X_1-X_2\right| ^{2}]. \end{aligned}$$

Therefore, the desired result follows. $\square $

To study the well-posedness of the conditional McKean–Vlasov SDE (2), we impose the following set of assumptions on the coefficients $\alpha ,\beta $ and $\gamma $:

Assumption I

$\alpha (t,x,\mu ):[0,T]\times \mathbb {R}^d \times \mathbb {M}\rightarrow \mathbb {R}^d, \beta (t,x,\mu ):[0,T]\times \mathbb {R}^d \times \mathbb {M}\rightarrow \mathbb {R}^{d \times m}$ and $\gamma (t,x,\mu ,\zeta ):[0,T]\times \mathbb {R}^d\times \mathbb {M}\times \mathbb {R}^d \rightarrow \mathbb {R}^{d \times k}$ are locally bounded and Borel-measurable functions.
There exists a constant C, such that for all $t\in [0,T],x,x',\mu ,\mu '$, we have
$$\begin{aligned}&\left| \alpha (t,x,\mu )-\alpha (t,x',\mu ')\right| +\left| \beta (t,x,\mu )-\beta (t,x',\mu ')\right| \\&\quad +\int _{\mathbb {R}^{ k}}\left| \gamma (t,x,\mu ,z)-\gamma (t,x',\mu ',z)\right| \nu (\text {d}z)\\&\quad \le C (\left| x-x' \right| +\left| \left| \mu -\mu ' \right| \right| _{\mathbb {M}}) \end{aligned}$$
and
$$\begin{aligned} \mathbb {E}\int _0^T[\left| \alpha (t,0,\delta _0)\right| ^2+\left| \beta (t,0,\delta _0)\right| ^2+\int _{\mathbb {R}^{ k}}\left| \gamma (t,0,\delta _0,z)\right| ^2\nu (\text {d}z)]\text {d}t\le \infty , \end{aligned}$$
where $\delta _0$ is the Dirac measure with mass at zero.

Theorem 2.1

(Existence and uniqueness) Under the above assumptions, the conditional McKean–Vlasov SDE (2) has a unique strong solution.

Proof

The proof is based on the Banach fixed point argument. Let $\mathcal {S}^{2}$ be the space of cadlag, $\mathcal {F}_t$-progressively measurable processes equipped with the norm

$$\begin{aligned} ||X||^{2}:=\mathbb {E[}\underset{t\in [0,T]}{\sup }|X(t)|^{2}]<\infty . \end{aligned}$$

This space equipped with this norm is a Banach space. Define the mapping $\varPhi :\mathcal {S}^{2}\rightarrow \mathcal {S}^{2}$ by $\varPhi (x)=X.$ We want prove that $\varPhi $ is contracting in $\mathcal {S}^{2}$ under the norm defined above. For two arbitrary elements $(x_{1},x_{2})$ and $(X_{1},X_{2})$, we denote their difference by $\widetilde{x}=x_{1}-x_{2}$ and $\widetilde{X}=X_{1}-X_{2}$, respectively. In the following, $C<\infty $ will denote a constant which is big enough for all the inequalities to hold. Applying the Itô formula to $\widetilde{X}^{2}(t)$, we get

$$\begin{aligned} \widetilde{X}^{2}(t)&=2\int _{0}^{t} \widetilde{X}(s)(\alpha (s,x_{1}(s),\mu _{1}(s))-\alpha (s,x_{2} (s),\mu _{2}(s)))\text {d}s\\&\quad +2\int _{0}^{t} \widetilde{X}(s)(\beta (s,x_{1}(s),\mu _{1}(s))-\beta (s,x_{2}(s),\mu _{2}(s)))\text {d}B(s)\\&\quad +2\int _{0}^{t} \widetilde{X}(s)\int _{\mathbb {R}_{k}} (\gamma (s,x_{1}(s),\mu _{1}(s),z)-\gamma (s,x_{2} (s),\mu _{2}(s),z))\widetilde{N}(\text {d}s,\text {d}z)\\&\quad +\int _{0}^{t} (\beta (s,x_{1}(s),\mu _{1}(s))-\beta (s,x_{2}(s),\mu _{2}(s)))^{2}\text {d}s\\&\quad +\int _{0}^{t} \int _{\mathbb {R}_{k}} (\gamma (s,x_{1}(s),\mu _{1}(s),z)-\gamma (s,x_{2} (s),\mu _{2}(s),z))^{2}\nu (\text {d}z)\text {d}s. \end{aligned}$$

By the Lipschitz assumption combined with standard majorization of the square of a sum (resp. integral) via the sum (resp. integral) of the square (up to a constant), we get

$$\begin{aligned} \widetilde{X}^{2}(t)&\le C {\textstyle \int _{0}^{t}} |\widetilde{X}(s)|\varDelta _{s}\text {d}s\\&+| {\textstyle \int _{0}^{t}} \widetilde{X}(s)\widetilde{\beta }(s)\text {d}B(s)|+| {\textstyle \int _{0}^{t}} {\textstyle \int _{\mathbb {R}_{k}}} \widetilde{X}(s)\widetilde{\gamma }(s,z)\widetilde{N}(\text {d}s,\text {d}z )|+ C {\textstyle \int _{0}^{t}}\varDelta _{s}^{2}\text {d}s, \end{aligned}$$

where

$$\begin{aligned} \varDelta _{s}:= & {} |\widetilde{x}_s|+||\widetilde{\mu }_s||_{\mathbb {M}},\\ \varDelta _{s}^{2}:= & {} |\widetilde{x}_s|^2+||\widetilde{\mu }_s||_{\mathbb {M}}^{2},\\ \widetilde{\beta }(s)= & {} \beta (s,x_{1}(s),\mu _{1}(s))-\beta (s,x_{2}(s),\mu _{2}(s)), \\ \widetilde{\gamma }(s,z)= & {} \gamma (s,x_{1}(s),\mu _{1}(s),z)-\gamma (s,x_{2} (s),\mu _{2}(s),z), \end{aligned}$$

and the value of the constant C is allowed to vary from line to line.

By the Burkholder–Davis–Gundy inequality, for all $t\in [0,T]$:

$$\begin{aligned} \mathbb {E}[\sup _{0\le t_0\le t}|\int _{0}^{t_0} \widetilde{X}(s)\widetilde{\beta }(s)\text {d}B(s)|]\le C\mathbb {E}[(\int _{0}^{t} \widetilde{X}^{2}(s)\widetilde{\beta }^{2}(s)\text {d}s)^{\frac{1}{2}}]\le CT\mathbb {E}[\int _{0}^{t}|\widetilde{X}_s|\varDelta _{s}\text {d}s], \end{aligned}$$

and we need to use Kunita’s inequality for the jumps (see Corollary 2.12 in Kunita [16]):

$$\begin{aligned}{} & {} \mathbb {E}[\sup _{0\le t_0\le t}|\int _{0}^{t_0} \widetilde{X}(s)\widetilde{\gamma }(s)\widetilde{N}(\text {d}s,\text {d}z)|]\\{} & {} \quad \le C \mathbb {E}[(\int _{0}^{t} \widetilde{X}^{2}(s)\widetilde{\gamma }^{2}(s)\nu (\text {d}z)\text {d}s)^{\frac{1}{2}}]\\{} & {} \quad \le CT\mathbb {E}[\int _{0}^{t}|\widetilde{X}_s|\varDelta _{s}\text {d}s]. \end{aligned}$$

Combining the above and using that

$$\begin{aligned} |\widetilde{X}_s|\varDelta _{s}\le C(|\widetilde{X}_s|^{2}+\varDelta _{s }^{2}), \end{aligned}$$

we obtain

$$\begin{aligned} \mathbb {E}[\sup _{0\le t_0\le t}\widetilde{X} ^{2}(t_0)]\le CT\mathbb {E}[\int _{0}^{t}(|\widetilde{X}_s|^{2}+\varDelta _{s}^{2})\text {d}s]. \end{aligned}$$

By definition of the norms, we have

$$\begin{aligned} \varDelta _{t}^{2}\le C|\widetilde{x}_{t}|^{2}. \end{aligned}$$

Iterating the above inequality, we get, for any integer $k > 1$:

$$\begin{aligned} \mathbb {E}[\sup _{0\le t\le T}\widetilde{X} ^{2}(t)]\le ( CT)^{k}\int _{0}^{T} \frac{(T-s)^{k-1}}{(k-1)!}\mathbb {E}[|\widetilde{x}_s|^{2}]\text {d}s\le ( CT)^{k}\frac{T^k}{k!}||\widetilde{x}||^{2}. \end{aligned}$$

Hence, for k large enough $\varPhi $ is a contraction on $\mathcal {S}^2$. Therefore, the equation has a unique solution up to T for any $T<\infty .$ $\square $

3 Fokker–Planck SPDE with Jumps

In this part of the paper, we will formulate the associated Fokker–Planck SPDE with jumps for McKean–Vlasov SDE driven by jumps.

Equation (1) is not in itself Markovian, so to be able to use the Dynkin formula, we extend the system to the process Y defined by

$$\begin{aligned} Y(t)=(s+t,X(t),\mu _t); \quad t\ge 0;\quad Y(0)=(s,Z,\mu _0)=:y, \end{aligned}$$

for some arbitrary starting time $s\ge 0$, with state dynamics given by X(t), conditional law of the state given by $\mu _t$ and with $X(0)=Z, \, \mu _0=\mathcal {L}(X(0))$. This system is Markovian, in virtue of the following Fokker–Planck equation for the conditional law $\mu _t$.

For fixed $t,\mu ,\zeta $ and $\ell =1,2,... k$, we write for simplicity $\gamma ^{(\ell )}=\gamma ^{(\ell )}(t,x,\mu ,\zeta )$ for column number $\ell $ of the $d \times k$-matrix $\gamma $. Then $\nu _\ell $ represents the Lévy measure of $N_\ell $ for all $\ell $.

Define the operator $\mu ^{(\gamma ^{(\ell )})}$ on $C_0( \mathbb {R}^d)$ by

$$\begin{aligned} \langle \mu ^{(\gamma ^{(\ell )})},g \rangle :=\int _{\mathbb {R}^d} g(x) \mu ^{(\gamma ^{(\ell )})}(\text {d}x)=\int _{\mathbb {R}^d}g(x+\gamma ^{(\ell )}) \mu (\text {d}x), \text { for all } g \in C_0( \mathbb {R}^d), \end{aligned}$$

where $\langle \mu ^{(\gamma ^{(\ell )})},g \rangle $ denotes the action of the measure $\mu ^{(\gamma ^{(\ell )})}$ on g. Then, we see that the Fourier transform of $\mu ^{(\gamma ^{(\ell )})}$ is

$$\begin{aligned} F[\mu ^{(\gamma ^{(\ell )})}](y)=\int _{\mathbb {R}^d} e^{-ixy} \mu ^{(\gamma ^{(\ell )})}(\text {d}x)=\int _{\mathbb {R}^d} e^{-i(x+\gamma ^{(\ell )})y} \mu (\text {d}x)= e^{-i \gamma ^{(\ell )}y} \widehat{\mu }(y). \end{aligned}$$

Therefore, $\mu ^{(\gamma ^{(\ell )})} \in \mathbb {M}$ if $\mu $ is. We call $\mu ^{(\gamma ^{(\ell )})}$ the $\gamma ^{(\ell )}$-shift of $\mu $.

Assumption II The coefficients $\alpha $, $\beta $ and $\gamma $ are $C^1$ with respect to x with bounded derivatives.

Let a test function $\psi \in C^{2}_b\left( \mathbb {R}^d\right) $, and with values in the complex plane $\mathbb {C}$.

Theorem 3.1

(Stochastic Fokker–Planck equation with jumps) Let X(t) be as in (1) and let $\mu _t=\mu _t^X$ be the regular conditional distribution of X(t) given the sub-filtration $\mathcal {G}_t$ which is described by relation (3). Then, $\mu _t$ satisfies the following SPIDE (in the sense of distributions):

$$\begin{aligned} \text {d}\mu _{t} =A_0^{*} \mu _t \text {d}t + A_1^{*}\mu _t \text {d}B_1(t)+\int _{ \mathbb {R}^k} A_2^{*}\mu _t \widetilde{N}_1(\text {d}t,\text {d}z); \quad \mu _0=\mathcal {L}(X(0)), \end{aligned}$$

(5)

where $A_0^{*}$ is the integro-differential operator

$$\begin{aligned} A_0^{*}\mu&= -\sum _{j=1}^d D_j [\alpha _j \mu ] +\frac{1}{2}\sum _{n,j=1}^d D_{n,j}[(\beta \beta ^{(T)})_{n,j} \mu ] \nonumber \\&\quad +\sum _{\ell =1}^k \int _{\mathbb {R}}\Big \{\mu ^{(\gamma ^{(\ell )})}-\mu +\sum _{j=1}^d D_j[\gamma _j^{(\ell )}(s,\cdot ,z)\mu ]\Big \} \nu _{\ell } \left( \text {d}z \right) , \end{aligned}$$

and

$$\begin{aligned} A_1^{*}\mu = -\sum _{j=1}^d D_j[\beta _{1,j} \mu ],\quad A_2^{*}\mu =\mu ^{(\gamma ^{(1)})}-\mu , \end{aligned}$$

where $\beta ^{(T)}$ denotes the transposed of the $d \times m$ - matrix $\beta =\big [\beta _{j,k}\big ]_{1\le j \le d,1 \le k \le m}$, $\gamma ^{(\ell )}$ is column number $\ell $ of the matrix $\gamma $ and $\nu _{\ell } \left( \cdot \right) $ is the Lévy measure of $N_{\ell }\left( \cdot ,\cdot \right) .$

For notational simplicity, we use $D_j, D_{n,j}$ to denote $\frac{\partial }{\partial x_j}$ and $\frac{\partial ^2}{\partial x_n \partial x_j}$ in the sense of distributions.

Proof

We get by the Itô formula for jump diffusions (see, e.g. Theorem 1.16 in [20]):

$$\begin{aligned}&\psi \left( X_{t}\right) -\psi (x)= \int _{0}^{t}A_0\psi \left( X_{s}\right) \text {d}s +\sum _{n=1}^m\int _0^{t}A_{1,n} \psi (X_s) \text {d}B_n(s)\\&\qquad +\sum _{\ell =1}^k\int _{0}^{t}\int _{ \mathbb {R}^k}A_{2,\ell } \psi (X_s)\widetilde{N}_{\ell }(\text {d}s,\text {d}z), \end{aligned}$$

where

$$\begin{aligned} A_0\psi \left( X_s \right)&=\sum _{j=1}^d \alpha _j \left( s,X_s,\mu _s\right) \frac{\partial \psi }{\partial x_j}\left( X_s \right) \\&\quad +\frac{1}{2}\sum _{n,j=1}^d (\beta \beta ^{T})_{n,j} (s,X_s,\mu _s)\frac{\partial ^2 \psi }{\partial x_n \partial x_j} \left( X_{s}\right) \\&\quad +\sum _{\ell =1}^k \int _{\mathbb {R}}\Big \{ \psi \left( X_{s^-}+\gamma ^{(\ell )} \left( s,X_{s^-},\mu _{s^-},z \right) \right) -\psi \left( X_{s}\right) \\&\quad -\sum _{j=1}^d \frac{\partial \psi }{\partial x_j} \left( X_{s}\right) \gamma _j^{(\ell )} \left( s,X_{s^-},\mu _{s^-},z \right) \Big \} \nu _{\ell } \left( \text {d}z \right) , \end{aligned}$$

and

$$\begin{aligned}&A_{1,n}\psi (X_s)= \sum _{j=1}^d\frac{\partial \psi }{\partial x_j} (X_s) \beta _{j,n} (s,X_s,\mu _s), \\&\quad A_{2,\ell }\psi (X_s)= \psi ( X_{s^-}+\gamma ^{(\ell )} ( s,X_{s^-},\mu _{s^-},z)) -\psi \left( X_{s^-}\right) , \end{aligned}$$

where $\beta _j$ is the row number j of the $d \times m$ matrix $\beta $ and $\gamma ^{(\ell )}$ is column number $\ell $ of the $d \times k$ matrix $\gamma $. Since $(B_r,\widetilde{N}_r)_{r>1}$ are independent of $\mathcal {G}_t$, we get after conditioning

$$\begin{aligned}&\mathbb {E}[ \psi \left( X_{t}\right) | \mathcal {G}_t]-\psi (x) = \mathbb {E}[\int _{0}^{t}A_0\psi \left( X_{s}\right) \text {d}s\nonumber \\&\quad +\int _0^{t}A_{1,1} \psi (X_s)\text {d}B_1(s)+\int _{0}^{t}\int _{ \mathbb {R}^k}A_{2,1} \psi (X_s) \widetilde{N}_1(\text {d}s,\text {d}z)| \mathcal {G}_t] \nonumber \\&\quad = \mathbb {E}[ \int _{0}^{t}{\mathbb { E}}[A_0\psi \left( X_{s}\right) | \mathcal {G}_s]\text {d}s +\int _0^{t}{\mathbb { E}}[ A_{1,1} \psi (X_s)|\mathcal {G}_s] \text {d}B_1(s)\nonumber \\&\quad +\int _{0}^{t}\int _{ \mathbb {R}^k}{\mathbb { E}}[ A_{2,1} \psi (X_s) |\mathcal {G}_s] \widetilde{N}_1(\text {d}s,\text {d}z)| \mathcal {G}_t]\nonumber \\&\quad = \int _{0}^{t}{\mathbb { E}}[A_0\psi \left( X_{s}\right) | \mathcal {G}_s]\text {d}s +\int _0^{t}{\mathbb { E}}[ A_{1,1}\psi (X_s) |\mathcal {G}_s]\text {d}B_1(s)\nonumber \\&\quad +\int _{0}^{t}\int _{ \mathbb {R}^k}{\mathbb { E}}[ A_{2,1} \psi (X_s) |\mathcal {G}_s] \widetilde{N}_1(\text {d}s,\text {d}z), \end{aligned}$$

(6)

the last equality follows from the tower property of conditional expectation.

By the above, we see that

$$\begin{aligned} A_{1,1}\psi&=\sum _{j=1}^d\frac{\partial \psi }{\partial x_j} (X_s) \beta _{j,1} (s,X_s,\mu _s),\\ A_{2,1}\psi&= \Big \{ \psi ( X_{s^-}+\gamma ^{(1)} ( s,X_{s^-},\mu _{s^-},z)) -\psi \left( X_{s^-}\right) \Big \}. \end{aligned}$$

In particular, choosing, with $i=\sqrt{-1}$, $\psi \left( x \right) =\psi _{y}\left( x\right) =e^{-iyx};\quad y,x\in \mathbb {R}^d$, we get

$$\begin{aligned}&A_0\psi \left( X_{s}\right) \nonumber \\&\quad =\left( -i\sum _{i=1}^d y_i \alpha _i \left( s,X_{s},\mu _s\right) - \frac{1}{2}\sum _{n,j=1}^d y_n y_j (\beta \beta ^{(T)})_{n.j} \left( s,X_{s},\mu _s\right) \right. \nonumber \\&\quad \left. +\sum _{\ell =1}^k \int _{\mathbb {R}}\left\{ \exp \left( -iy\gamma ^{(\ell )} \left( s,X_{s^-},\mu _{s^-},z \right) \right) -1+i \sum _{j=1}^d y_i \gamma _j^{(\ell )} \left( s,X_{s^-},\mu _{s^-},z \right) \right\} \nu _{\ell } \left( \text {d}z \right) \right) e^{-iyX_{s}}. \end{aligned}$$

(7)

and

$$\begin{aligned} A_{1,1}\psi (X_s)&=-i \sum _{j=1}^\text {d}y_j\beta _{j,1}(s,X_s,\mu _s)e^{-iyX_{s}}, \\ A_{2,1}\psi (X_s)&= e^{-iy(X_{s^-}+\gamma ^{(1)} ( s,X_{s^-}),\mu _{s^-},z)}-e^{-iyX_{s^-}}. \end{aligned}$$

In general, we have (see (3))

$$\begin{aligned} \mathbb {E}\left[ g\left( X_{s}\right) e^{-iyX_{s}} | \mathcal {G}_s\right] =\int _{\mathbb {R}^d }g\left( x\right) e^{-iyx}\mu _{s}\left( \text {d}x\right) =F\left[ g\left( \cdot \right) \mu _{s}(\cdot )\right] \left( y\right) . \end{aligned}$$

Therefore, we get

$$\begin{aligned} {\mathbb { E}}[ e^{-iy\gamma ( s,X_{s^-},\mu _{s^-},\zeta )}e^{-iyX_s} | \mathcal {G}_s]&= \int _{{\mathbb { R}}^d} e^{-iy\gamma ( s,x,\mu _{s^-},\zeta )} e^{-ixy}\mu _s(\text {d}x) \nonumber \\&=\int _{{\mathbb { R}}^d} e^{-iy(x+\gamma ( s,x,\mu _{s^-},\zeta ))} \mu _s(\text {d}x)\nonumber \\&=\int _{{\mathbb { R}}^d} e^{-iyx} \mu ^{(\gamma )}_s(\text {d}x)=F[\mu ^{(\gamma )}_s(\cdot )](y), \end{aligned}$$

(8)

where $\mu _s^{(\gamma )}(\cdot )$ is the $\gamma $-shift of $\mu _s$. Recall that if $w \in {S}'={S}'({\mathbb { R}}^d)$ (the space of tempered distributions), using the notation $\frac{\partial }{\text {d}x_j}w\left( t,x\right) =:D_jw\left( t,x\right) ,$ and similarly with higher order derivatives, we have, in the sense of distributions,

$$\begin{aligned} F\left[ D_jw\left( t,\cdot \right) \right] ( y) =iy_j F\left[ w\left( t,\cdot \right) \right] \left( y\right) . \end{aligned}$$

Thus,

$$\begin{aligned} iy_jF[\alpha (s, \cdot )\mu _s](y)&= F[D_j(\alpha (s,\cdot )\mu _s)](y)\\ -\ y_ny_jF[\beta \beta ^{T}(s,\cdot )\mu _s](y)&=F[D_{n,j}(\beta \beta ^{T}(s,\cdot ) \mu _s)](y). \end{aligned}$$

Applying this and (8) to (7), we get

$$\begin{aligned}&{\mathbb { E}}[A_0\psi \left( X_{s}\right) | \mathcal {G}_s]= \int _{{\mathbb { R}}^d} \Big (-i\sum _{j=1}^d y_j \alpha _j \left( s,x,\mu _s\right) -\frac{1}{2}\sum _{n,j=1}^d y_n y_j (\beta \beta ^{(T)})_{n,j} \left( s,x,\mu _s\right) \nonumber \\&\quad +\sum _{\ell =1}^k \int _{\mathbb {R}}\Big \{ \exp \left( -iy\gamma ^{(\ell )} ( s,x,\mu _{s^-},z) \right) -1 \nonumber \\&\quad +i \sum _{j=1}^d y_j\gamma _j^{(\ell )} ( s,x,\mu _{s^-},z) \} \nu _{\ell } \left( \text {d}z \right) \Big )e^{-iyx}\mu _s(\text {d}x) =-i\sum _{j=1}^d y_j F[\alpha _j \mu _s](y) \nonumber \\&\quad -\frac{1}{2}\sum _{n,j=1}^d y_n y_j F[(\beta \beta ^{(T)})_{n,j} \mu _s] (y) +\sum _{\ell =1}^k F\Big [ \int _{{\mathbb { R}}}\Big \{ \exp \left( -iy\gamma ^{(\ell )} \left( s,x,\mu _{s^-},z \right) \right) \nonumber \\&\quad -1+i \sum _{j=1}^d y_j \gamma _j^{(\ell )} \left( s,x,\mu _{s^-},z \right) \Big \} \nu _{\ell } \left( \text {d}z \right) \mu _s\Big ](y)=F\Big [ -\sum _{j=1}^d D_j (\alpha _j \mu _s) \nonumber \\&\quad +\frac{1}{2}\sum _{n,j=1}^d D_{n,j}((\beta \beta ^{(T)})_{n,j} \mu _s)+\sum _{\ell =1}^k \int _{\mathbb {R}}\Big \{\mu _s^{(\gamma ^{(\ell )})}-\mu _s \nonumber \\&\quad +\sum _{j=1}^d D_j[\gamma _j^{(\ell )}(s,\cdot ,z)\mu _s] \Big \} \nu _{\ell } \left( \text {d}z \right) \Big ](y)=F[A_0^{*} \mu _s](y), \end{aligned}$$

where $A_0^{*}$ is the integro-differential operator

$$\begin{aligned} A_0^{*}\mu&= -\sum _{j=1}^d D_j [\alpha _j \mu ] +\frac{1}{2}\sum _{n,j=1}^d D_{n,j}[(\beta \beta ^{(T)})_{n,j} \mu ] +\sum _{\ell =1}^k \int _{\mathbb {R}}\{\mu ^{(\gamma ^{(\ell )})}-\mu \nonumber \\&\quad +\sum _{j=1}^d D_j[\gamma _j^{(\ell )}(s,\cdot ,\zeta )\mu ]\} \nu _{\ell } \left( \text {d}z \right) . \end{aligned}$$

Note that $A^{*}_0\mu _s$ exists in $\mathcal {S}^{\prime }$.

Similarly, we get

$$\begin{aligned} {\mathbb { E}}[ A_{1,1} \psi (X_s) |\mathcal {G}_s]&= \int _{{\mathbb { R}}^d}-i \sum _{j=1}^d y_j\beta _{j,1}(s,{x},\mu _s)e^{-iyx} \mu _s(\text {d}x)\\&=F[-i \sum _{j=1}^d y_j\beta _{j,1}(s,x,\mu _s)\mu _s] \nonumber \\&=F[-\sum _{j=1}^d D_j(\beta _{j,1} \mu _s)](y)=F[ A_1^{*}\mu _s](y), \end{aligned}$$

where $A_1^{*}$ is the operator

$$\begin{aligned} A_1^{*}\mu _s= - \sum _{j=1}^d D_j[\beta _{j,1} \mu _s]. \end{aligned}$$

and

$$\begin{aligned} {\mathbb { E}}[A_{2,1} \psi (X_s) | \mathcal {G}_s]&={\mathbb { E}}[e^{-iy(X_{s^-}+\gamma ^{(1)} ( s,X_{s^-},\mu _{s^-},\zeta ))}-e^{-iyX_{s^-}}| \mathcal {G}_s] \nonumber \\&=\int _{{\mathbb { R}}^d}\Big (\exp ( -iy\gamma ^{(1)} ( s,x,\mu _{s^-},\zeta )) -1 \Big )e^{-iyx}\mu _s(\text {d}x) \nonumber \\&=F[\mu _s^{(\gamma ^{(1)})}-\mu _s](y)=F[A_2^{*}\mu _s](y), \end{aligned}$$

$\mathcal {S}$ where $A_2^{*}$ is the operator given by

$$\begin{aligned} A_2^{*} \mu _s=\mu _s^{(\gamma ^{(1)})}-\mu _s. \end{aligned}$$

Substituting what we have obtained above into (6), leads

$$\begin{aligned}&\mathbb {E}\Big [ \psi \left( X_{t}\right) | \mathcal {G}_t\Big ] \nonumber \\&\quad =\psi (x)+\int _{0}^{t}{\mathbb { E}}[A_0\psi \left( X_{s}\right) | \mathcal {G}_s]\text {d}s +\int _0^{t}{\mathbb { E}}[ A_{1,1}\psi (X_s) |\mathcal {G}_s]\text {d}B_1(s) \nonumber \\&\qquad +\int _{0}^{t}\int _{ \mathbb {R}^k}{\mathbb { E}}[ A_{2,1} \psi (X_s) |\mathcal {G}_s] \widetilde{N}_1(\text {d}s,\text {d}z) \nonumber \\&\quad = \psi (x)+\int _0^{t}F [A_0^{*}\mu _s](y)\text {d}s +\int _0^{t} F[A_1^{*}\mu _s](y) \text {d}B_1(s)\nonumber \\&\qquad +\int _{0}^{t}\int _{ \mathbb {R}^k}F[A_2^{*}\mu _s](y)\widetilde{N}_1(\text {d}s,\text {d}z). \end{aligned}$$

(9)

On the other hand,

$$\begin{aligned} {\mathbb { E}}[ \psi \left( X_{t}\right) |\mathcal {G}_t]-\psi (x) ={\mathbb { E}}[e^{-iyX_{t} }- e^{-iyX_0} |\mathcal {G}_t] =\widehat{\mu }_t(y) -\widehat{\mu }_0(y). \end{aligned}$$

(10)

Combining (9) and (10), we get

$$\begin{aligned}&\widehat{\mu }_{t}(y)-\widehat{\mu }_0(y)=\int _0^{t}F [A_0^{*}\mu _s](y)\text {d}s +\int _0^{t} F[A_1^{*}\mu _s](y) \text {d}B_1(s)\\&\qquad +\int _{0}^{t}\int _{ \mathbb {R}^k}F[A_2^{*}\mu _s](y)\widetilde{N}_1(\text {d}s,\text {d}z). \end{aligned}$$

Since the Fourier transform of a distribution determines the distribution uniquely, we deduce that

$$\begin{aligned} \mu _{t}-\mu _0=\int _0^{t} A_0^{*}\mu _s \text {d}s +\int _0^{t} A_1^{*}\mu _s \text {d}B_1(s)+\int _{0}^{t}\int _{ \mathbb {R}^k}A_2^{*}\mu _s\widetilde{N}_1(\text {d}s,\text {d}z), \end{aligned}$$

or, in differential form,

$$\begin{aligned} \text {d}\mu _t=A_0^{*}\mu _t \text {d}t + A_1^{*}\mu _t \text {d}B_1(t)+\int _{ \mathbb {R}^k}A_2^{*}\mu _t\widetilde{N}_1(\text {d}t,\text {d}z); \quad \mu _0=\mathcal {L}(X(0)), \end{aligned}$$

as claimed. That completes the proof. $\square $

Remark 3.1

If the common noise is coming from the Lévy noise, then the corresponding Fokker–Planck equation is an SPDE driven by this noise. Similarly, if the common noise is coming only from the Brownian motion, we get the Fokker–Planck SPDE as in Agram and Øksendal [1]. When the conditioning is with respect to the trivial filtration $\mathcal {F}_0$, meaning that there is no common noise available, we get the Fokker–Planck PDE studied, for example, in Bogachev et al. [6].

4 A General Formulation and a Verification Theorem

Since Eq. (1) is not Markovian, in the sense that it does not have the flow-property, we construct the following process Y:

$$\begin{aligned} \text {d}Y(t)&=F(Y(t))\text {d}t + G(Y(t))\text {d}B(t) + \int _{\mathbb {R}^k} H(Y(t^-),z)\widetilde{N}(\text {d}t,\text {d}z) \nonumber \\&:=\left[ \begin{array}{clcr} \text {d}t \\ \text {d}X(t)\\ \text {d}\mu _t \end{array} \right] =\left[ \begin{array}{c} 1\\ \alpha (Y(t)) \\ A_0^{*}\mu _t \end{array} \right] \text {d}t +\left[ \begin{array}{rc} 0_{1\times m} \\ \beta (Y(t)) \\ A_1^{*}\mu _t,0,0 ...,0 \end{array} \right] \text {d}B(t)\nonumber \\&+ \int _{\mathbb {R}^k} \left[ \begin{array}{rc} 0_{1\times k}\\ \gamma (Y(t^-),z) \\ A_2^{*}\mu _t, 0, 0, ..., 0 \end{array}\right] \widetilde{N}(\text {d}t,\text {d}z),\quad s\le t \le T, \end{aligned}$$

(11)

where X(t) and $\mu _t$ satisfy Eqs. (1) and (5), respectively. Moreover, we have used the shorthand notation

$$\begin{aligned} \alpha (Y(t))&=\alpha (s+t,X(t),\mu (t)),\\ \beta (Y(t))&= \beta (s+t,X(t),\mu (t)),\\ \gamma (Y(t^-),z)&=\gamma (s+t,X(t^-),\mu (t^-),z). \end{aligned}$$

The process Y(t) starts at $y=(s, Z,\mu )$. We shall use the following notation:

Notation 4.1

We use

x to denote a generic value of the point $X(t,\omega ) \in \mathbb {R}^d$, and
X to denote a generic value of the random variable $X(t) \in L^2(P).$
When the meaning is clear from the context we use x in both situations.
$\mu $ for either the initial probability distribution $\mathcal {L}(X(0))$ or the generic value of the conditional law $\mu _t:=\mathcal {L}(X(t) | \mathcal {G}_t)$, when there is no ambiguity.

The concept of impulse control is simple and intuitive: At any time, the agent can make an intervention $\zeta $ into the system. Due to the cost of each intervention, the agent can intervene only at discrete times $\tau _1, \tau _2, \ldots $. The impulse problem is to find out at what times it is optimal to intervene and what is the corresponding optimal intervention sizes. We now proceed to formulate precisely our impulse control problem for conditional McKean–Vlasov jump diffusions.

Suppose that—if there are no interventions—the process $Y(t)=(s+t,X(t),\mu _t)$ is the conditional McKean–Vlasov jump diffusion given by (11).

Suppose that at any time t and any state $y=(s,X,\mu )$ we are free to intervene and give the state X an impulse $\zeta \in \mathcal {Z}\subset \mathbb {R}^d$, where $\mathcal {Z}$ is a given set (the set of admissible impulse values). Suppose the result of giving the state X the impulse $\zeta $ is that the state jumps immediately from X to $\varGamma (X,\zeta )$, where $\varGamma (X,\zeta ): L^2(P)\times \mathcal {Z}\rightarrow L^2(P)$ is a given function. In many applications, the process shifts as a result of a simple translation, i.e. $\varGamma (y,\zeta )=y+\zeta $.

Simultaneously, the conditional law jumps from $\mu _{t}=\mathcal {L}(X(t)|\mathcal {G}_t )$ to

$$\begin{aligned} \mu _t^{\varGamma (X,\zeta )}:=\mathcal {L}(\varGamma (X(t),\zeta )|\mathcal {G}_t). \end{aligned}$$

(12)

An impulse control for this system is a double (possibly finite) sequence

$$\begin{aligned} v=(\tau _1,\tau _2,\ldots ,\tau _j,\ldots ; \zeta _1,\zeta _2,\ldots ,\zeta _j,\ldots )_{j\le M},\quad M \le \infty , \end{aligned}$$

where $0 \le \tau _1 \le \tau _2 \le \cdots $ are $\mathcal {G}_t$-stopping times (the intervention times) and $\zeta _1,\zeta _2,\ldots $ are the corresponding impulses at these times. Mathematically, we assume that $\tau _j$ is a stopping time with respect to the filtration $\{\mathcal {G}_t\}_{t\ge 0}$, with $\tau _{j+1}>\tau _j$ and $\zeta _j$ is $\mathcal {G}_{\tau _j}$-measurable for all j. We let $\mathcal {V}$ denote the set of all impulse controls.

If $v=(\tau _1,\tau _2,\ldots ;\zeta _1,\zeta _2,\ldots ) \in \mathcal {V}$, the corresponding state process $Y^{(v)}(t)$ is defined by

$$\begin{aligned}&Y^{(v)}(0^-) = y \quad \text{ and } \quad Y^{(v)}(t) = Y(t);\quad 0< t\le \tau _1, \end{aligned}$$

(13)

$$\begin{aligned}&Y^{(v)}(\tau _j) = \Big (\tau _j,\varGamma [\check{X}^{(v)}(\tau _j^-),\zeta _j],\mathcal {L}(\varGamma [\check{X}^{(v)}(\tau _j^-),\zeta _j]|\mathcal {G}_t)\Big ), \quad j=1,2,\ldots \end{aligned}$$

(14)

$$\begin{aligned}&{\textrm{d}}Y^{(v)}(t) = F(Y^{(v)}(t)){\textrm{d}}t +G(Y^{(v)}(t))\textrm{d}B(t) \nonumber \\&\qquad \quad \hspace{17pt} +\int _{\mathbb {R}^k} H(Y^{(v)}(t^-),z)\widetilde{N}(\textrm{d}t,\textrm{d}z) \quad \hbox { for } \tau _j< t<\tau _{j+1}\wedge \tau ^*, \end{aligned}$$

(15)

where we have used the notation

$$\begin{aligned} \check{X}^{(v)}(\tau _j^-)=X^{(v)}(\tau _j^-) +\varDelta _N X(\tau _j), \end{aligned}$$

$\varDelta _N X^{(v)}(t)$ being the jump of $X^{(v)}$ stemming from the jump of the random measure $N(t,\cdot )$. Note that we distinguish between the (possible) jump of $X^{(v)}(\tau _j)$ stemming from the random measure N, denoted by $\varDelta _N X^{(v)}(\tau _j)$ and the jump caused by the intervention v, given by

$$\begin{aligned} \varDelta _v X^{(v)}(\tau _j):=\varGamma (\check{X}^{(v)}(\tau _j^-),\zeta ) -\check{X}^{(v)}(\tau _j^-). \end{aligned}$$

Accordingly, at the time $t= \tau _j$, $X^{(v)}(t)$ jumps from $\check{X}^{(v)}(\tau _j^-)$ to $\varGamma [\check{X}^{(v)}(\tau _j^-), \zeta _j]$

and $\mu _{\tau _j^-}$ jumps to

$$\begin{aligned} \mu _{\tau _j}=\mathcal {L}(\varGamma [\check{X}^{(v)}(\tau _j^-),\zeta _j]|\mathcal {G}_{\tau _j}). \end{aligned}$$

Consider a fixed open set (called the solvency region) $\mathcal {S}\subset [0,\infty ) \times \mathbb {R}^d \times {\mathbb {M}}$. It represents the set in which the game takes place since it will end once the controlled process leaves $\mathcal {S}$. In portfolio optimization problems, for instance, the game ends in case of bankruptcy, which may be modelled by choosing $\mathcal {S}$ to be the set of states where the capital is above a certain threshold. Define

$$\begin{aligned} \tau _{\mathcal {S}}=\inf \{ t\in (0,\infty ); Y^{(v)}(t)\not \in \mathcal {S}\}, \end{aligned}$$

and

$$\begin{aligned} \mathcal {T}=\left\{ \tau \, \, \text {stopping time,} \, 0 \le \tau \le \tau _\mathcal {S}\right\} . \end{aligned}$$

Suppose we are given a continuous profit function $f:\mathcal {S}\rightarrow \mathbb {R}$ and a continuous bequest function $g:\mathcal {S}\rightarrow \mathbb {R}$. Moreover, suppose the profit/utility of making an intervention with impulse $\zeta \in \mathcal {Z}$ when the state is y is $K(y,\zeta )$, where $K:\mathcal {S}\times \mathcal {Z}\rightarrow \mathbb {R}$ is a given continuous function.

We assume we are given a set $\mathcal {V}$ of admissible impulse controls which is included in the set of $v=(\tau _1,\tau _2,\ldots ;\zeta _1,\zeta _2,\ldots )$ such that a unique solution $Y^{(v)}$ of (13)–(15) exist, for all $v\in \mathcal {V}$, and the following additional properties hold, assuring that the performance functional below is well-defined:

$$\begin{aligned}{} & {} \mathbb {E}^y\Big [ \int _0^{\tau _{\mathcal {S}}} f^-(Y^{(v)}(s))\text {d}s\Big ]<\infty , \quad \hbox { for all }\,y\in \mathcal {S},\,v\in \mathcal {V},\\{} & {} \mathbb {E}^y\left[ g^-(Y^{(v)}(\tau _{\mathcal {S}}))\mathbb {1}_{[\tau _{\mathcal {S}}<\infty ]}\right] <\infty , \quad \hbox { for all }\,y\in \mathcal {S},\,v\in \mathcal {V}, \end{aligned}$$

and

$$\begin{aligned} \mathbb {E}^y\left[ \sum _{\tau _j\le \tau _{\mathcal {S}}} K^-(\check{Y}^{(v)}(\tau _j^-),\zeta _j)\right] <\infty , \quad \hbox { for all } y\in \mathcal {S}, v\in \mathcal {V}, \end{aligned}$$

where $\mathbb {E}^y$ denotes expectation, given that $Y(0)=y$, and we mean by $a^-=-min\{a,0\}$ for $a=f,g,K$.

We now define the performance criterion, which consists of three parts: A continuous time running profit in $[0,\tau _\mathcal {S}]$, a terminal bequest value if the game ends, and a discrete-time intervention profit, namely

$$\begin{aligned} J^{(v)}(y)&= \mathbb {E}^y \Bigg [ \int _0^{\tau _{\mathcal {S}}} f(Y^{(v)}(t))\textrm{d}t +g(Y^{(v)}(\tau _{\mathcal {S}}))\mathbb {1}_{[\tau _{\mathcal {S}}<\infty ]}+\sum _{\tau _j\le \tau _{\mathcal {S}}} K(\check{Y}^{(v)}(\tau _j^-),\zeta _j)\Bigg ]. \end{aligned}$$

We consider the following impulse control problem:

Problem 4.1

Find $\varPhi (y)$ and $v^*\in \mathcal {V}$ such that

$$\begin{aligned} \varPhi (y)=\sup \{ J^{(v)}(y);v\in \mathcal {V}\}=J^{(v^*)}(y),\quad y \in \mathcal {S}. \end{aligned}$$

The function $\varPhi (y)$ is called the value function and $v^*$ is called an optimal control.

The following concept is crucial for the solution to this problem.

Definition 4.1

Let $\mathcal {H}$ be the space of all measurable functions $h:\mathcal {S}\rightarrow \mathbb {R}$. The intervention operator $\mathcal {M}:\mathcal {H}\rightarrow \mathcal {H}$ is defined by

$$\begin{aligned} { \mathcal {M} h(s,X,\mu )=\sup _{\zeta \in \mathcal {Z}} \{ h(s,\varGamma (X,\zeta ),\mu ^{\varGamma (X,\zeta )})+K(y,\zeta );(s,\varGamma (X,\zeta ),\mu ^{\varGamma (X,\zeta )})\in \mathcal {S}\},}\nonumber \\ \end{aligned}$$

(16)

where $\mu ^{\varGamma (X,\zeta )}$ is given by (12).

Let $C^{(1,2,2)}(\mathcal {S})$ denote the family of functions $\varphi (s,x,\mu ):\mathcal {S}\rightarrow \mathbb {R}$ which are continuously differentiable w.r.t. s and twice continuously Fréchet differentiable with respect to $x \in \mathbb {R}^d$ and $\mu \in {\mathbb {M}}$. We let $\nabla _\mu \varphi \in \mathbb {L}({\mathbb {M}},\mathbb {R})$ (the set of bounded linear functionals on ${\mathbb {M}}$) denote the Fréchet derivative (gradient) of $\varphi $ with respect to $\mu \in {\mathbb {M}}$. Similarly, $D^2_\mu \varphi $ denotes the double derivative of $\varphi $ with respect to $\mu $ and it belongs to $\mathbb {L}({\mathbb {M}}\times {\mathbb {M}},\mathbb {R})$ (see Appendix for further details).

The infinitesimal generator L of the Markov jump-diffusion process Y(t) is defined on functions $\varphi \in C^{(1,2,2)}(\mathcal {S})$ by

$$\begin{aligned}&L\varphi = \frac{\partial \varphi }{\partial s} +\sum _{j=1}^d \alpha _j \frac{\partial \varphi }{\partial x_j} + \langle \nabla _{\mu } \varphi , A_0^{*} \mu \rangle + \tfrac{1}{2}\sum _{j,n=1}^{d} (\beta \beta ^{T})_{j,n}\frac{\partial ^2 \varphi }{\partial x_j \partial x_n} \nonumber \\&\qquad + \tfrac{1}{2}\sum _{j=1}^d \beta _{j,1}\frac{\partial }{\partial x_j}\langle \nabla _{\mu } \varphi ,A_1^{*}\mu \rangle +\tfrac{1}{2} \langle A_1^{*}\mu , \langle D_{\mu }^2 \varphi ,A_1^{*}\mu \rangle \rangle \nonumber \\&\qquad +\int _{\mathbb {R}^k}\Big \{\varphi (s, x+\gamma ^{(1)}, \mu + A_2^{*}\mu )-\varphi (s,x,\mu )\nonumber \\&\qquad -\sum _{j=1}^d\gamma _j^{(1)} \tfrac{\partial }{\partial x_j} \varphi (s,x,\mu )-\langle A_2^{*}\mu ,D_{\mu }\varphi \rangle \Big \}\nu _{1}(\text {d}z)\nonumber \\&\qquad +\sum _{\ell =2}^k \int _{\mathbb {R}^k} \Big \{ \varphi (s, x+\gamma ^{(\ell )}, \mu )) - \varphi (s,x,\mu ) -\sum _{j=1}^d\gamma _j^{(\ell )} \tfrac{\partial }{\partial x_j} \varphi (s,x,\mu ) \Big \}\nu _{\ell }(\text {d}z), \end{aligned}$$

where, as before, $A_0^{*}$ is the integro-differential operator

$$\begin{aligned} A_0^{*}\mu&= -\sum _{j=1}^d D_j [\alpha _j \mu ] +\frac{1}{2}\sum _{n,j=1}^d D_{n,j}[(\beta \beta ^{(T)})_{n,j} \mu ] \nonumber \\&\quad +\sum _{\ell =1}^k \int _{\mathbb {R}^k}\Big \{\mu ^{(\gamma ^{(\ell )})}-\mu +\sum _{j=1}^d D_j[\gamma _j^{(\ell )}(s,\cdot ,z)\mu ]\Big \} \nu _{\ell } \left( \text {d}z \right) , \end{aligned}$$

and

$$\begin{aligned} A_1^{*}\mu = - \sum _{j=1}^d D_j[\beta _{1,j} \mu ]. \end{aligned}$$

and

$$\begin{aligned} A_2^{*} \mu =\mu ^{(\gamma ^{(1)})}-\mu . \end{aligned}$$

We can now state a verification theorem for conditional McKean–Vlasov impulse control problems, providing sufficient conditions that a given function is the value function and a given impulse control is optimal. The verification theorem links the impulse control problem to a suitable system of quasi-variational inequalities.

Since the process Y(t) is Markovian, we can, with appropriate modifications, use the approach in Chapter 9 in [20].

For simplicity of notation, we will in the following write

$$\begin{aligned} \overline{\varGamma }(y,\zeta )=(s,\varGamma (x,\zeta ),\mu ^{\varGamma (x,\zeta )}), \text { when } y=(s,x,\mu ) \in [0,\infty ) \times L^2 (P) \times {\mathbb {M}}. \end{aligned}$$

Theorem 4.2

Variational inequalities for conditional McKean–Vlasov impulse control

(a)
Suppose we can find $\phi :\bar{\mathcal {S}}\rightarrow \mathbb {R}$ such that
1. (i)
  $\phi \in C^1(\mathcal {S})\cap C(\bar{\mathcal {S}})$.
2. (ii)
  $\phi \ge \mathcal {M}\phi $ on $\mathcal {S}$. Define
  $$\begin{aligned} D=\{y\in \mathcal {S}; \phi (y)>\mathcal {M}\phi (y)\}\quad \hbox { (the continuation region).} \end{aligned}$$
  Assume
3. (iii)
  $\displaystyle \, \mathbb {E}^y\left[ \int _0^{\tau _{\mathcal {S}}} Y^{(v)}(t)\mathbb {1}_{\partial D}{\textrm{d}}t\right] =0$ for all $y\in \mathcal {S}$, $v\in \mathcal {V}$, i.e. the amount of time Y(t) spends on $\partial D$ has Lebesgue measure zero.
4. (iv)
  $\partial D$ is a Lipschitz surface.
5. (v)
  $\phi \in C^{(1,2,2)}(\mathcal {S}\setminus \partial D)$ with locally bounded derivatives near $\partial D$. ((iv)-(v) are needed for the approximation argument in the proof).
6. (vi)
  $L\phi +f\le 0$ on $\mathcal {S}\setminus \partial {D}$.
7. (vii)
  $\phi (y) = g(y) \text { for all } y \not \in \mathcal {S}$.
8. (viii)
  $ \{\phi ^-(Y^{(v)}(\tau ));\tau \in \mathcal {T}\}$ is uniformly integrable, for all $y\in \mathcal {S}$, $v\in \mathcal {V}$.
9. (ix)
  $\displaystyle \mathbb {E}^y\left[ |\phi (Y^{(v)}(\tau ))| + \int _0^{\tau _{\mathcal {S}}} |L \phi (Y^{(v)}(t))| {\textrm{d}}t\right] <\infty $ for all $\tau \in \mathcal {T}, v \in \mathcal {V}, y \in \mathcal {S}$. Then,
  $$\begin{aligned} \phi (y)\ge \varPhi (y)\quad \hbox { for all }y\in \mathcal {S}. \end{aligned}$$
(b)
Suppose in addition that
1. (x)
  $L\phi +f=0$ in D.
2. (xi)
  $\hat{\zeta }(y)\in {\textrm{Argmax}}\{\phi ({\overline{\varGamma }(y,\cdot )})+K(y,\cdot )\}\in \mathcal {Z}$ exists for all $y\in \mathcal {S}$ and $\hat{\zeta }(\cdot )$ is a Borel measurable selection. Put $\hat{\tau }_0=0$ and define $\,\hat{v}=(\hat{\tau }_1,\hat{\tau }_2,\ldots ;\hat{\zeta }_1, \hat{\zeta }_2,\ldots )$ inductively by $\hat{\tau }_{j+1}=\inf \{ t>\hat{\tau }_j; Y^{(\hat{v}_j)}(t)\not \in D\}\wedge \tau _{\mathcal {S}}$ and $\,\hat{\zeta }_{j+1}=\hat{\zeta }(Y^{(\hat{v}_j)}(\hat{\tau }_{j+1}^-))$ if $\hat{\tau }_{j+1}<\tau _{\mathcal {S}}$, where $Y^{(\hat{v}_j)}$ is the result of applying $\hat{v}_j:=(\hat{\tau }_1,\ldots ,\hat{\tau }_j; \hat{\zeta }_1,\ldots ,\hat{\zeta }_j)$ to Y. Suppose
3. (xii)
  $\hat{\tau }_{j+1}>\hat{\tau }_{j}$ for all j, $\hat{v}\in \mathcal {V}$ and $\{ \phi (Y^{(\hat{v})}(\tau )); \tau \in \mathcal {T}\}$ is uniformly integrable. Then,
  $$\begin{aligned} \phi (y)=\varPhi (y)\quad \hbox { and }\,\hat{v}\, \hbox { is an optimal impulse control }. \end{aligned}$$

Remark 4.1

We give the intuitive idea behind intervention operator as in (16):

$$\begin{aligned} \mathcal {M}\varPhi (y)=\sup _{\zeta \in \mathcal {Z}} \{ \varPhi (\overline{\varGamma }(y,\zeta ))+K(y,\zeta ), \, \zeta \in \mathcal {Z}\text { and } \overline{\varGamma }(y,\zeta )\in \mathcal {S}\}. \end{aligned}$$

Assume that the value function $\varPhi $ is known. If $y=(s,x,\mu )$ is the current state of the process, and the agent intervenes with impulse of size $\zeta $, the resulting value can be represented as $\varPhi (\overline{\varGamma }(y,\zeta ))+K(y,\zeta )$, consisting of the sum of the value of $\varPhi $ in the new state $\overline{\varGamma }(y,\zeta )$ and the intervention profit K. Therefore, $\mathcal {M}\varPhi (y)$ represents the optimal new value if the agent decides to make an intervention at y.

Note that by (ii), $\varPhi \ge \mathcal {M}\varPhi $ on $\mathcal {S}$, so it is always not optimal to intervene. At the time $\hat{\tau }_j$, the operator should intervene with impulse $\hat{\zeta }_j$ when the controlled process leaves the continuation region, that is when $\varPhi (Y^{\hat{v}_j}) \le \mathcal {M}\varPhi (Y^{\hat{v}_j})$.

Proof

(a) By an approximation argument (see, e.g. Theorem 3.1 in [20]) and (iii)–(v), we may assume that $\phi \!\in \! C^2(\mathcal {S}) \cap C(\bar{\mathcal {S}})$.

Choose $\,v\!=\!(\tau _1,\tau _2,\ldots ;\zeta _1,\zeta _2,\ldots )\!\in \!\mathcal {V}$ and set $\tau _0 = 0$. By approximating the stopping times $\tau _j$ by stopping times with finite expectation, we may assume that we can apply the Dynkin formula to the stopping times $\tau _j$. Then for $j=0,1,2,\ldots $, with $Y=Y^{(v)}$

$$\begin{aligned} \mathbb {E}^y[\phi (Y(\tau _j))]-\mathbb {E}^y [\phi (\check{Y}(\tau _{j+1}^-))] =-\mathbb {E}^y \left[ \int _{\tau _j}^{\tau _{j+1}} L\phi (Y(t))\textrm{d}t\right] , \end{aligned}$$

where $\check{Y}(\tau _{j+1}^-)=Y(\tau _{j+1}^-)+\varDelta _N Y(\tau _{j+1})$, as before. Summing this from $j=0$ to $j=m$, we get

$$\begin{aligned}&\phi (y)+ \sum _{j=1}^m \mathbb {E}^y [\phi (Y(\tau _j)) -\phi (\check{Y}(\tau _j^-))] -\mathbb {E}^y[\phi (\check{Y}(\tau _{m+1}^-))] \nonumber \\&\quad =-\mathbb {E}^y \left[ \int _0^{\tau _{m+1}} L\phi (Y(t))\textrm{d}t\right] \ge \mathbb {E}^y \left[ \int _0^{\tau _{m+1}} f(Y(t))\textrm{d}t\right] . \end{aligned}$$

(17)

Now

$$\begin{aligned} \phi (Y (\tau _j))&=\phi (\varGamma (\check{Y}(\tau _j^-),\zeta _j)) \nonumber \\&\le \mathcal {M}\phi (\check{Y}(\tau _j^-))-K(\check{Y}(\tau _j^-),\zeta _j);\quad \hbox { if }\,\tau _j<\tau _{\mathcal {S}} \hbox { by } (16), \end{aligned}$$

and

$$\begin{aligned} \phi (Y (\tau _j))&=\phi (\check{Y}(\tau _j^-));\quad \hbox { if }\,\tau _j=\tau _{\mathcal {S}} \hbox { by }(vii). \end{aligned}$$

Therefore,

$$\begin{aligned} \mathcal {M}\phi (\check{Y}(\tau _j^-))-\phi (\check{Y}(\tau _j^-)) \ge \phi (Y(\tau _j))-\phi (\check{Y}(\tau _j^-)) +K(\check{Y}(\tau _j^-),\zeta _j), \end{aligned}$$

and

$$\begin{aligned}&\phi (y) + \sum _{j=1}^m \mathbb {E}^y [\{\mathcal {M}\phi (\check{Y}(\tau _j^-))-\phi (\check{Y}(\tau _j^-)) \} \mathbb {1}_{[\tau _j<\tau _{\mathcal {S}}]}] \\&\qquad \ge \mathbb {E}^y \left[ \int _0^{\tau _{m+1}} f(Y(t))\textrm{d}t+\phi (\check{Y}(\tau _{m+1}^-)) +\sum _{j=1}^m K(\check{Y}(\tau _j^-),\zeta _j) \right] . \end{aligned}$$

Letting $m\rightarrow M$ and using quasi-left continuity, i.e. left continuity along increasing sequences of stopping times, of $Y(\cdot )$, we get

$$\begin{aligned} \phi (y) \ge \mathbb {E}^y\! \left[ \!\int _0^{\tau _{\mathcal {S}}}\! f(Y(t))\textrm{d}t+g(Y(\tau _{\mathcal {S}})) \mathbb {1}_{[\tau _{\mathcal {S}}<\infty ]} + \sum _{j=1}^M K(\check{Y}(\tau _j^-),\zeta _j)\!\right] \!=\!J^{(v)}(y). \nonumber \\ \end{aligned}$$

(18)

Hence, $\phi (y)\ge \varPhi (y)$.

(b) Next assume (x)–(xii) also hold.

Apply the above argument to $\hat{v}=(\hat{\tau }_1,\hat{\tau }_2,\ldots ;\hat{\zeta }_1,\hat{\zeta }_2,\ldots )$. Then by (x), we get equality in (17) and by our choice of $\zeta _j=\hat{\zeta }_j$, we have equality in (18). Hence

$$\begin{aligned} \phi (y)=J^{(\hat{v})}(y), \end{aligned}$$

which combined with (a) completes the proof. $\square $

5 Example: Optimal Stream of Dividends Under Transaction Costs

One of the motivations for studying conditional McKean–Vlasov equations is that they represent natural models for stochastic systems where there is an underlying common noise, observable by all. This common noise could come from various types of uncertainty, e.g. uncertainty in the information available or unpredictable mechanical disturbances, e.g. market volatility, interest rate fluctuations, etc. Conditional McKean–Vlasov equations may therefore be regarded as a special class of systems subject to noisy observations, and they represent an alternative to filtering theory approaches.

In particular, such equations may be relevant the modelling of systems with noise in economics and finance.

Consider a company that wants to find the best way to distribute dividends to its shareholders while taking into account transaction costs. In this section, we illustrate our general results above by solving explicitly a financial problem, consisting of optimizing the overall anticipated dividend sum while considering the impact of transaction costs.

To this end, for $v = (\tau _1, \tau _2, \ldots ; \zeta _1, \zeta _2, \ldots )$ with $\zeta _i \in \mathbb {R}_+$, we define

$$\begin{aligned} Y^{(v)}(t)=(s+t, X^{(v)}(t),\mu _t^{(v)}) \end{aligned}$$

where $X(t)=X^{(v)}(t)$ denotes the value at time t of a company. We assume that between the intervention (dividend payout) times $\tau _i,\tau _{i+1}$ the growth of the value is proportional to the conditional current value $\mathbb {E}[X(t)|\mathcal {G}_t]$ given the common information $\mathcal {G}_t$, and satisfies the following conditional McKean–Vlasov equation:

$$\begin{aligned}&\text {d}X(t)= \mathbb {E}\left[ X(t)\mid \mathcal {G}_{t}\right] \Big (\alpha _{0}\text {d}t+\sigma _{1}\text {d}B_{1}(t)\\&\quad +\sigma _{2}\text {d}B_{2}(t) +\int _{\mathbb {R}}\kappa _{1}(z)\widetilde{N}_1(\text {d}t,\text {d}z )+\int _{\mathbb {R}}\kappa _{2}(z )\widetilde{N}_2(\text {d}t,\text {d}z )\Big ), \\&\quad \mu _t^{(v)} = \mathcal {L}(X^{(v)}(t)|\mathcal {G}_t); \quad \tau _i< t < \tau _{i+1},\\&\quad X^{(v)}(\tau _{i+1})= \check{X}^{(v)}(\tau ^-_{i+1}) - (1 + \lambda ) \zeta _{i+1} - c,\\&\quad \mu _{\tau _{i+1}}^{(v)}= \mathcal {L}(X^{(v)}(\tau _{i+1})|\mathcal {G}_{\tau _{i+1}}); \quad i = 0,1,2, \ldots , \\&\quad X^{(v)}(0^-)= x > 0; \text { a.s. } \end{aligned}$$

Here $\alpha _0, \sigma _1 \ne 0, \sigma _2 \ne 0$ are given constants, and we assume that the functions $\kappa _i$ satisfy $ -1 < \kappa _i(z)$ a.s. $\nu _i(\text {d}z)$ and $\int _{\mathbb {R}} |\kappa _i(z)|^2 \nu _i(\text {d}z) < \infty ; \quad i=1,2.$

Note that at any time $\tau _{i}, \, i=0,1,2,\ldots ,$ the system jumps from $\check{X}^{(v)}(\tau _i^-)$ to

$$\begin{aligned} X^{(v)}(\tau _{i})= \varGamma [\check{X}^{(v)}(\tau _i^-), \zeta _i] = \check{X}^{(v)}(\tau ^-_{i}) - (1 + \lambda ) \zeta _{i} - c, \end{aligned}$$

where $\lambda \ge 0$, and $c > 0$ and the quantity $c+\lambda \zeta _i$ represents the transaction cost with a fixed part c and a proportional part $\lambda \zeta _i$, while $\zeta _i$ is the amount we decide to take out at time $\tau _i$.

At the same time $\mu _{\tau _i^-}$ jumps to

$$\begin{aligned} \mu _{\tau _i}=\mathcal {L}(\check{X}^{(v)}(\tau _i^-)|\mathcal {G}_{\tau _i}). \end{aligned}$$

Problem 5.1

We want to find $\varPhi $ and $v^*\in \mathcal {V}$, such that

$$\begin{aligned} \varPhi (s,x,\mu ) = \sup _v J^{(v)}(s,x,\mu ) = J^{(v^*)}(s,x,\mu ), \end{aligned}$$

where

$$\begin{aligned} J^{(v)}(s,x,\mu ) = J^{(v)}(y) =\mathbb {E}^y \left[ \sum _{\tau _k < \tau _{\mathcal {S}}} {\textrm{e}}^{-\rho (s + \tau _k)} \zeta _k \right] \qquad (\rho > 0 \text { constant}) \end{aligned}$$

is the expected discounted total dividend up to time $\tau _{\mathcal {S}}$, where

$$\begin{aligned} \tau _{\mathcal {S}} = \tau _{\mathcal {S}}(\omega ) = \inf \{t> 0 ; P^y[ \mathbb {E}^y[X^{(v)}(t)|\mathcal {G}_t] \le 0 ] > 0\} \end{aligned}$$

is the time of bankruptcy.

To put this problem into the context above, we define

$$\begin{aligned}&Y^{(v)}(t)=\begin{bmatrix}s+t\\ X^{(v)}(t)\\ \mu ^{(v)}_t\end{bmatrix}, \quad Y^{(v)}(0^-)=\begin{bmatrix} s\\ x\\ \mu \end{bmatrix}=y,\\&\varGamma (y,\zeta )=\varGamma (s,x,\mu )=(s,x-c-(1+\lambda )\zeta , \mathcal {L}(x-c-(1+\lambda )\zeta )|\mathcal {G}), \quad x \in L^2(P),\\&K(y,\zeta )={\textrm{e}}^{-\rho s}\zeta , \\&f\equiv g\equiv 0, \\&\mathcal {S}=\left\{ y=(s,x,\mu ); \mathbb {E}^{y}[X^{(v)}(s)|\mathcal {G}_s]>0 \text { a.s. }\right\} . \end{aligned}$$

Comparing with our theorem, we see that in this case we have $d=1,m=2,k=1$ and

$$\begin{aligned} \begin{aligned} \alpha _{1}&=\alpha _{0}\left\langle \mu ,q\right\rangle ,\beta _{1}=\sigma _{1}\left\langle \mu ,q\right\rangle ,\\ \beta _{2}&=\sigma _{2}\left\langle \mu ,q\right\rangle ,\gamma _1(s,x,\mu ,z)=\kappa _{1}(z)\langle \mu ,q \rangle ,\gamma _2(s,x,\mu ,z) \\ {}&=\kappa _2(z) \langle \mu , q \rangle , \end{aligned} \end{aligned}$$

where we have put $q(x)=x$ so that $\left\langle \mu _t,q\right\rangle =\mathbb {E}\left[ X(t)\mid \mathcal {G}_{t}\right] .$

Therefore, the operator L takes the form

$$\begin{aligned}&L{\varphi }(s,x,\mu ) =\frac{\partial \varphi }{\partial s}+\alpha _{0}\left\langle \mu ,q\right\rangle \frac{\partial \varphi }{\partial x} +\left\langle \nabla _{\mu }\varphi ,A_{0}^{*}\mu \right\rangle \\&\qquad +\tfrac{1}{2}(\sigma _{1}^{2}+\sigma _{2}^{2})\left\langle \mu ,q\right\rangle ^{2}\frac{\partial ^{2}\varphi }{\partial x^{2}}+\frac{1}{2} \sigma _{1}\left\langle \mu ,q\right\rangle \frac{\partial }{\partial x} \left\langle \nabla _{\mu }\varphi ,A_{1}^{*}\mu \right\rangle \\&\qquad +\tfrac{1}{2}\left\langle A_{1}^{*}\mu ,\left\langle D_{\mu }^{2}\varphi ,A_{1}^{*}\mu \right\rangle \right\rangle \\&\qquad +\int _{\mathbb {R}}\left\{ \varphi (s,x+\kappa _{1}(z)\langle \mu ,q \rangle ,\mu +A_2^{*} \mu )-\varphi (s,x,\mu )\right. \\&\qquad \left. -\kappa _{1}(z)\left\langle \mu ,q\right\rangle \frac{\partial }{\partial x}\varphi (s,x,\mu )\right. \\&\qquad \left. -\langle A_2^{*} \mu , D_{\mu } \varphi \rangle \right\} \nu _1 (\text {d}z) +\int _{\mathbb {R}}\Big \{ \varphi (s,x+\kappa _{2}(z)\langle \mu ,q\rangle ,\mu )-\varphi (s,x,\mu ) \\&\qquad -\kappa _{2}(z)\langle \mu , q \rangle \tfrac{\partial }{\partial x}\varphi (s,x,\mu ) \Big \} \nu _2(\text {d}z), \end{aligned}$$

where

$$\begin{aligned}{} & {} A_{0}^{*}\mu =-D[\alpha _{0}\left\langle \mu ,q\right\rangle \mu ]+\tfrac{ 1}{2}D^{2}[(\sigma _{1}^{2}+\sigma _{2}^{2})\left\langle \mu ,q\right\rangle ^{2}\mu ], \\{} & {} \quad A_{1}^{*}\mu =-D[\sigma _{1}\left\langle \mu ,q\right\rangle \mu ] \end{aligned}$$

and

$$\begin{aligned} A_2^{*} \mu = \mu ^{(\kappa _1 \langle \mu , q \rangle )} -\mu . \end{aligned}$$

The adjoints of the first two operators are

$$\begin{aligned} A_{0}\mu =\alpha _{0}\left\langle \mu ,q\right\rangle D\mu +\tfrac{1}{2} (\sigma _{1}^{2}+\sigma _{2}^{2})\left\langle \mu ,q\right\rangle ^{2}D^{2}\mu , \end{aligned}$$

and

$$\begin{aligned} A_{1}\mu =\sigma _{1}\left\langle \mu ,q\right\rangle D\mu . \end{aligned}$$

In this case, the intervention operator becomes

$$\begin{aligned} \mathcal {M}h(s,x,\mu )=\sup _{\zeta } \left\{ h(s,x-c-(1+\lambda )\zeta ,\mu ^{x-c-(1+\lambda )\zeta }) +e^{-\rho t}\zeta ; \quad 0\le \zeta \le \frac{x-c}{1+\lambda }\right\} . \end{aligned}$$

Note that the condition on $\zeta $ is due to the fact that the impulse must be positive and $x-c-(1+\lambda )\zeta $ must belong to $\mathcal {S}$. We distinguish between two cases:

Case 1. $\alpha _0>\rho $.

In this case, suppose we wait until some time $t_1$ and then take out

$$\begin{aligned} \zeta _1=\frac{X(t_1)-c}{1+\lambda }. \end{aligned}$$

Noting that $\mathbb {E}^y[X(t)]= x \exp (\alpha _0 t)$ for $t<t_1$, we see that the corresponding performance is:

$$\begin{aligned} J^{(v_1)} (s,x,\mu )&= \mathbb {E}^y \Bigg [ \frac{\textrm{e}^{-\rho (t_1+s)}}{1+\lambda } (X(t_1)-c)\Bigg ] \\&=\mathbb {E}^x \Bigg [ \frac{1}{1+\lambda } \big ( x{\textrm{e}}^{-\rho s} {\textrm{e}}^{(\alpha _0-\rho )t_1} - c \,{\textrm{e}}^{-\rho (s+t_1)}\big ) \Bigg ] \\&\rightarrow \infty \ \text {as}\ t_1 \rightarrow \infty . \end{aligned}$$

Therefore, we obtain $\varPhi (s,x,\mu )=+\infty $ in this case.

Case 2. $\alpha _0<\rho $.

We look for a solution by using the results of Theorem 4.2.

We guess that the continuation region is of the form

$$\begin{aligned} D=\left\{ (s,x,\mu ): 0< \langle \mu ,q \rangle <\bar{x}\right\} \end{aligned}$$

for some $\bar{x} > 0$ (to be determined), and in D we try a value function of the form

$$\begin{aligned} \varphi (s,x,\mu )=e^{-\rho s}\psi (\langle \mu ,q \rangle ). \end{aligned}$$

This gives

$L\varphi (s,x,\mu )= e^{-\rho s} L_0\psi (\langle \mu ,q \rangle )$, where

$$\begin{aligned} L_0 \psi (\langle \mu ,q \rangle )&= -\rho \psi (\langle \mu ,q \rangle ) +\left\langle \nabla _{\mu }\psi ,A_{0}^{*}\mu \right\rangle +\frac{1}{2} \sigma _{1}\left\langle \mu ,q\right\rangle \frac{\partial }{\partial x} \left\langle \nabla _{\mu }\psi ,A_{1}^{*}\mu \right\rangle \\&\quad +\frac{1}{2}\left\langle A_{1}^{*}\mu ,\left\langle D_{\mu }^{2}\psi ,A_{1}^{*}\mu \right\rangle \right\rangle \\&\quad +\int _{\mathbb {R}}\Big \{ \psi (\langle \mu ^{(\kappa _1 \langle \mu , q \rangle )},q\rangle ) -\psi (\langle \mu ,q \rangle )-\langle \mu ^{(\kappa _1 \langle \mu ,q\rangle )}-\mu , D_{\mu } \psi \rangle \Big \} \nu _1(d\zeta ). \end{aligned}$$

By the chain rule for Fréchet derivatives (see Appendix), we have

$$\begin{aligned} \nabla _{\mu }\psi (h)= \psi '(\langle \mu ,q \rangle ) \langle h,q\rangle \quad \text { and }\quad D^2_{\mu } \psi (h,k)= \psi ''(\langle \mu ,q \rangle )\langle h,q\rangle \langle k,q\rangle . \end{aligned}$$

Therefore,

$$\begin{aligned} \langle \nabla _{\mu } \psi , A_0^{*}\mu \rangle = \psi '(\langle \mu ,q\rangle )\langle A_0^{*}\mu ,q\rangle =\psi '(\langle \mu ,q\rangle )\langle \mu ,A_0q\rangle = \psi '(\langle \mu ,q\rangle )\alpha _0 \langle \mu ,q\rangle , \end{aligned}$$

and similarly

$$\begin{aligned} \tfrac{1}{2} \langle A_1^{*}\mu ,\langle D^2_{\mu } \psi ,A_1^{*} \mu \rangle \rangle&= \tfrac{1}{2} \psi ''(\langle \mu ,q \rangle \langle A_1^{*} \mu ,q\rangle \langle A_1^{*} \mu ,q \rangle = \tfrac{1}{2} \psi ''(\langle \mu ,q \rangle )\langle \mu ,A_1q\rangle \langle \mu ,A_1q \rangle \nonumber \\&=\tfrac{1}{2} \psi ''(\langle \mu ,q \rangle ) \sigma _1^2 \langle \mu ,q\rangle ^2. \end{aligned}$$

Moreover, since $\psi $ does not depend on x we see that

$$\begin{aligned} \int _{\mathbb {R}}\left\{ \varphi (s,x+\gamma _{0}\left\langle \mu ,q\right\rangle ,\mu )-\varphi (s,x,\mu )-\gamma _{0}\left\langle \mu ,q\right\rangle \frac{\partial \varphi }{\partial x}(s,x,\mu )\right\} \nu _1 (\text {d}z)=0. \end{aligned}$$

Substituting this into the expression for $L_0 \psi $ we get, with $u=\langle \mu ,q \rangle $,

$$\begin{aligned} L_0 \psi (u)&= -\rho \psi (u) + \alpha _0 u \psi '(u) + \tfrac{1}{2} \sigma _1^2 u^2 \psi ''(u)\nonumber \\&\quad +\int _{\mathbb {R}} \Big \{ \psi (\langle \mu ^{(\kappa _1 \langle \mu ,q\rangle )},q\rangle ) - \psi (\langle \mu , q \rangle ) - \psi '(u) \langle \mu ^{(\kappa _1 \langle \mu ,q\rangle )}-\mu ,q\rangle \Big \} \nu _1(\text {d}z)\nonumber \\&\quad = -\rho \psi (u) + \alpha _0 u \psi '(u) + \tfrac{1}{2} \sigma _1^2 u^2 \psi ''(u)\nonumber \\&\quad +\int _{\mathbb {R}} \Big \{ \psi ((1+\kappa _1)u ) - \psi (u) -\kappa _1 u \psi '(u) \Big \} \nu _1(\text {d}z), \end{aligned}$$

(19)

where we have used that

$$\begin{aligned} \langle \mu ^{(\kappa _1 \langle \mu ,q \rangle )},q \rangle&= \int _{\mathbb {R}} q(x + \kappa _1 \langle \mu ,q \rangle ) \mu (\text {d}x) \nonumber \\&=\int _{\mathbb {R}} x \mu (\text {d}x) + \kappa _1 \langle \mu ,q\rangle =\langle \mu ,q \rangle +\kappa _1 \langle \mu , q \rangle =(1+\kappa _1)\langle \mu ,q \rangle , \end{aligned}$$

so that

$$\begin{aligned} \langle \mu ^{(\kappa _1 \langle \mu ,q \rangle )} -\mu ,q \rangle = \kappa _1 \langle \mu , q\rangle . \end{aligned}$$

By condition (x), we are required to have $L_0 \psi (u)=0$ for all $u \in (0,\bar{x})$. Note that by (19) we get that if we try the function $\psi (u)=u^{a}$ for some $a \in \mathbb {R}$, then

$$\begin{aligned} L_0\psi (u)= \Big [ -\rho + a \alpha _0 + \tfrac{1}{2} a(a-1) \sigma _1^2 + \int _{\mathbb {R}} \{ (1+\kappa _1(z))^{a} -1 -a \kappa _1(z) \} \nu _1(\text {d}z)\Big ] u^{a}. \end{aligned}$$

Therefore, if we choose $a=\hat{a}$ such that

$$\begin{aligned} -\rho + \hat{a} \alpha _0 + \tfrac{1}{2} \hat{a}(\hat{a}-1) \sigma _1^2 + \int _{\mathbb {R}} \{ (1+\kappa _1(z))^{\hat{a}} -1 -\hat{a} \kappa _1(z) \} \nu _1(\text {d}z)=0 \end{aligned}$$

(20)

then for all constants C the function

$$\begin{aligned} \psi (u)=C u^{\hat{a}} \end{aligned}$$

(21)

is a solution of the equation

$$\begin{aligned} L_0 \psi (u)=0. \end{aligned}$$

Define

$$\begin{aligned} F(a)=-\rho + a \alpha _0 + \tfrac{1}{2} a(a-1) \sigma _1^2 + \int _{\mathbb {R}} \{ (1+\kappa _1(z))^{a} -1 -a \kappa _1(z) \} \nu _1(\text {d}z); \quad a \in \mathbb {R}. \end{aligned}$$

Then we see that $F(1)=\alpha _0 - \rho < 0, F'(a) >0 \text { for } a\ge 1$ and $F(a) \rightarrow \infty $ when $a \rightarrow \infty $. Therefore, there exists a unique $a = \hat{a}>1$ such that $F(\hat{a})=0.$ Since we expect $\psi $ to be bounded near 0, we choose this exponent $\hat{a}$ in the Definition (21) of $\psi $.

It remains to determine C.

We guess that it is optimal to wait till $u=\langle \mu _t,q \rangle = \mathbb {E}^y[X(t)|\mathcal {G}_t]$ reaches or exceeds a value $u=\bar{u} > c$ and then take out as much as possible, i.e. reduce $\mathbb {E}^y[X(t)|\mathcal {G}_t]$ to 0. Taking the transaction costs into account, this means that we should take out

$$\begin{aligned} \hat{\zeta }(u) = \frac{u-c}{1+\lambda } \text { for } u \ge \bar{u}. \end{aligned}$$

We therefore propose that $\psi (u)$ has the form

$$\begin{aligned} \psi (u) = {\left\{ \begin{array}{ll} C u^{\hat{a}} \text {for}\ 0< u < \bar{u}, \\ \displaystyle \frac{u-c}{1+\lambda } \text { for } u \ge \bar{u}. \end{array}\right. } \end{aligned}$$

(22)

Continuity and differentiability of $\psi (u)$ at $u = \bar{u}$ give the equations

$$\begin{aligned} C \bar{u}^{\hat{a}} = \frac{\bar{u} - c}{1 + \lambda } \quad \text { and } \quad C \hat{a} \bar{u}^{\hat{a} - 1} = \frac{1}{1 + \lambda }. \end{aligned}$$

Combining these, we get

$$\begin{aligned} \bar{u} = \frac{\hat{a} c}{\hat{a} - 1}\quad \text {and}\quad C = \frac{\bar{u} - c}{1 + \lambda } \bar{u}^{-\hat{a}}. \end{aligned}$$

With these values of $\bar{u}$ and C, we have to verify that

$$\begin{aligned} \varphi (s,x,\mu ) = e^{-\rho s} \psi (\langle \mu , q \rangle ) \end{aligned}$$

with $\psi $ given by (22) satisfies all the requirements of Theorem 4.2. We check some of them:

(ii) $\varphi \ge \mathcal {M}\varphi $ on $\mathcal {S}$:

In our case, we have

$$\begin{aligned} \varGamma (s,X,\mu )=(s,X-c-(1+\lambda )\zeta , \mu ^{X-c-(1+\lambda )\zeta }), \end{aligned}$$

and hence we get

$$\begin{aligned} \mathcal {M}\varphi (s,X,\mu )&= \sup _{\zeta } \Big \{ \varphi (s,X - c - (1 + \lambda ) \zeta ), \mu ^{X - c - (1 + \lambda ) \zeta } )+e^{-\rho s} \zeta ; \ 0 \le \zeta \le \displaystyle \frac{\bar{u}-c}{1+\lambda } \Big \}\\&= e^{-\rho s} \sup _{\zeta } \Big \{C \langle \mu ^{X-c-(1+\lambda )\zeta },q\rangle ^{\hat{a}} + \zeta ; \ 0 \le \zeta \le \displaystyle \frac{\bar{u}-c}{1+\lambda } \Big \}\\&= e^{-\rho s} \sup _{\zeta } \Big \{C (\langle \mu ,q(x) - c - (1+\lambda )\zeta \rangle ^{\hat{a}} + \zeta ; \ 0 \le \zeta \le \displaystyle \frac{\bar{u}-c}{1+\lambda }\Big \}\\&= e^{-\rho s} \sup _{\zeta } \Big \{ C (\langle \mu ,q \rangle - c - (1+\lambda )\zeta )^{\hat{a}} + \zeta ; \ 0 \le \zeta \le \displaystyle \frac{\bar{u}-c}{1+\lambda }\Big \}. \end{aligned}$$

If $u- c - (1 + \lambda ) \zeta \ge \bar{u}$, then

$$\begin{aligned} \psi (u-c-(1+\lambda )\zeta ) + \zeta = \frac{u - 2c}{1 + \lambda } < \frac{u-c}{1 + \lambda } = \psi (u), \end{aligned}$$

and if $u-c-(1+\lambda )\zeta < \bar{u}$ then

$$\begin{aligned} h(\zeta ):= \psi (u-c-(1+\lambda )\zeta ) + \zeta = C (u-c-(1+\lambda )\zeta )^{\hat{a}} + \zeta . \end{aligned}$$

Since

$$\begin{aligned} h'\left( \frac{u-c}{1+\lambda }\right) = 1\ \text { and }\ h''(\zeta ) > 0, \end{aligned}$$

we see that the maximum value of $\displaystyle h(\zeta ); \ 0 \le \zeta \le \frac{u-c}{1+\lambda }$, is attained at $\displaystyle \zeta = \hat{\zeta }(u) = \frac{u-c}{1+\lambda }$.

Therefore,

$$\begin{aligned} \mathcal {M}\psi (u) = \max \left( \frac{x-2c}{1+\lambda },\frac{u-c}{1+\lambda }\right) = \frac{u-c}{1+\lambda }\ \text {for all}\ u > c. \end{aligned}$$

Hence, $\mathcal {M}\psi (u) = \psi (u)$ for $u \ge \bar{u}$.

For $0< u < \bar{u}$, consider

$$\begin{aligned} k(u):= Cu^{\hat{a}} - \frac{u-c}{1+\lambda }. \end{aligned}$$

Since

$$\begin{aligned} k(\bar{u}) = k'(\bar{u}) = 0\quad \text {and}\quad k''(u) > 0\ \text {for all}\ u, \end{aligned}$$

we conclude that

$$\begin{aligned} k(u) > 0\quad \text {for}\ 0< u < \bar{u}. \end{aligned}$$

Hence,

$$\begin{aligned} \psi (u) > \mathcal {M}\psi (u)\quad \text {for}\ 0< u < \bar{u}. \end{aligned}$$

(vi) $L_0 \psi (u) \le 0$ for $u \in \mathcal {S}\backslash \bar{D}$, i.e. for $u > \bar{u}$:

For $u > \bar{u}$, we have $\psi (u)=\tfrac{u-c}{1+\lambda }$, and therefore, since $\alpha _0 < \rho $,

$$\begin{aligned} L_0 \psi (u)&= - \rho \frac{u-c}{1+\lambda } + \alpha _0 u \, \frac{1}{1+\lambda } = \frac{(\alpha _0 - \rho ) u + \rho c}{1+\lambda }\le \frac{(\alpha _0 - \rho )\bar{u} + \rho c}{1+\lambda }\\&=\frac{(\alpha _0 - \rho )\tfrac{\hat{a}c}{(\hat{a}-1} + \rho c}{1+\lambda }=\frac{(\alpha _0 \hat{a}-\rho )c}{(1+\lambda )(\hat{a}-1)} < 0, \end{aligned}$$

since $1< \hat{a} < \tfrac{\rho }{\alpha _0}. $

Therefore, we have proved the following:

Theorem 5.1

The value function for Problem 5.1 is

$$\begin{aligned} \varPhi (s,x,\mu ) = {\left\{ \begin{array}{ll} e^{-\rho s} C u^{\hat{a}} \quad \text { for } \quad \ 0< u < \bar{u}, \\ \displaystyle e^{-\rho s} \frac{u-c}{1+\lambda } \quad \text { for } \quad u \ge \bar{u}, \end{array}\right. } \end{aligned}$$

where $u=\langle \mu ,q \rangle =\mathbb {E}[X(t) | \mathcal {G}_t]$ and

$$\begin{aligned} \bar{u} = \frac{\hat{a} c}{\hat{a} - 1}\quad \text {and}\quad C = \frac{\bar{u} - c}{1 + \lambda } \bar{u}^{- \hat{a}}, \end{aligned}$$

$\hat{a}$ being the positive solution of Eq. (20). The optimal impulse control is to do nothing while $u=\mathbb {E}[X(t) | \mathcal {G}_t] < \bar{u}$ and take out immediately

$$\begin{aligned} \hat{\zeta }(u) = \frac{u-c}{1+\lambda } \text { when } u \ge \bar{u}. \end{aligned}$$

This brings $\mathbb {E}[X(t) | \mathcal {G}_t]$ down to 0, and the system stops. Hence, the optimal impulse consists of at most one intervention.

References

Agram, N., Øksendal, B.: Stochastic Fokker-Planck PIDE for conditional McKean–Vlasov jump diffusions and applications to optimal control. SIAM J. Control. Optim. 61(3), 1472–1493 (2023)
Article MathSciNet Google Scholar
Basei, M.: Optimal price management in retail energy markets: an impulse control problem with asymptotic estimates. Math. Methods Oper. Res. 89(3), 355–383 (2019)
Article MathSciNet Google Scholar
Basei, M., Cao, H., Guo, X.: Nonzero-sum stochastic games and mean-field games with impulse controls. Math. Oper. Res. 47(1), 341–366 (2022)
Article MathSciNet Google Scholar
Bensoussan, A., Lions. J. L. : Impulse Control and Quasi-Variational Inequalities. Gauthiers Villars, Paris (1984)
Bertucci, C.: Fokker-Planck equations of jumping particles and mean field games of impulse control. Annales de l’Institut Henri Poincaré C 37(5), 1211–1244 (2020)
Article MathSciNet Google Scholar
Bogachev, V. I., Krylov, N. V., Röckner, M., Shaposhnikov, S. V.: Fokker–Planck–Kolmogorov Equations. American Mathematical Soc (Vol. 207) (2015)
Buckdahn, R., Li, J., Ma, J.: A general conditional McKean–Vlasov stochastic differential equation. Ann. Appl. Probab. 33(3), 2004–2023 (2023)
Article MathSciNet Google Scholar
Cadenillas, A., Choulli, T., Taksar, M., Zhang, L.: Classical and impulse stochastic control for the optimization of the dividend and risk policies of an insurance firm. Math. Finance Int. J. Math. Stat. Financ. Econ. 16(1), 181–202 (2006)
MathSciNet Google Scholar
Carmona, R., Delarue, F., Lacker, D.: Mean field games with common noise. Ann. Probab. 44(6), 3740–3803 (2016)
Article MathSciNet Google Scholar
Christensen, S., Neumann, B.A., Sohr, T.: Competition versus Cooperation: A class of solvable mean field impulse control problems. SIAM J. Control. Optim. 59(5), 3946–3972 (2021)
Article MathSciNet Google Scholar
Coghi, M., Flandoli, F.: Propagation of chaos for interacting particles subject to environmental noise. Ann. Probab. 26(3), 1407–1442 (2016)
MathSciNet Google Scholar
Djehiche, B., Hamadene, S., Hdhiri, I.: Stochastic impulse control of non-Markovian processes. Appl. Math. Optim. 61(1), 1–26 (2010)
Article MathSciNet Google Scholar
Erny, X.: Well-posedness and propagation of chaos for McKean–Vlasov equations with jumps and locally Lipschitz coefficients. Stochast. Process. Appl. 150, 192–214 (2022)
Article MathSciNet Google Scholar
Erny, X., Löcherbach, E., Loukianova, D.: Conditional propagation of chaos for mean field systems of interacting neurons. Electron. J. Probab. 26, 1–25 (2021)
Article MathSciNet Google Scholar
Korn, R.: Some applications of impulse control in mathematical finance. Math. Methods Oper. Res. 50(3), 493–518 (1999)
Article MathSciNet Google Scholar
Kunita, H.: Stochastic differential equations based on Lévy processes and stochastic flows of diffeomorphisms. In Real and Stochastic Analysis: New Perspectives (pp. 305-373). Boston, MA: Birkhäuser Boston (2004)
Lacker, D., Shkolnikov, M., Zhang, J.: Inverting the Markovian projection, with an application to local stochastic volatility models. Ann. Probab. 48(5), 2189–2211 (2020)
Article MathSciNet Google Scholar
Lacker, D., Shkolnikov, M., Zhang, J.: Superposition and mimicking theorems for conditional McKean–Vlasov equations. J. Eur. Math. Soc. 25(8), 3229–3288 (2022)
Article MathSciNet Google Scholar
Li, C., Liu, Z., Wu, J., Huang, X.: The stochastic maximum principle for a jump-diffusion mean-field model involving impulse controls and applications in finance. J. Syst. Sci. Complex. 33(1), 26–42 (2020)
Article MathSciNet Google Scholar
Øksendal, B., Sulem, A.: Applied Stochastic Control of Jump diffusions, 3rd edn. Springer, Berlin (2019)
Book Google Scholar
Plotter, P.E.: Stochastic Integration and Differential Equation Stochastic Modeling and Applied Probability, 2nd edn. Springer, Berlin (2004)
Google Scholar
Vuong, Y.V., Hauray, M., Pardoux, É.: Conditional propagation of chaos in a spatial stochastic epidemic model with common noise. Stochast. Partial Differ. Equ. Anal. Comput. 10(3), 1180–1210 (2022)
MathSciNet Google Scholar

Download references

Acknowledgements

We are grateful to the editor and the anonymous referees for their valuable and constructive comments, which have lead to several improvements of the paper. Nacira Agram and Giulia Pucci are gratefully acknowledge the financial support provided by the Swedish Research Council grants (2020-04697).

Funding

Open access funding provided by Royal Institute of Technology.

Author information

Authors and Affiliations

Department of Mathematics, KTH Royal Institute of Technology, 100 44, Stockholm, Sweden
Nacira Agram & Giulia Pucci
University of Oslo, Oslo, Norway
Bernt Øksendal

Authors

Nacira Agram
View author publications
You can also search for this author in PubMed Google Scholar
Giulia Pucci
View author publications
You can also search for this author in PubMed Google Scholar
Bernt Øksendal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Giulia Pucci.

Additional information

Communicated by Fausto Gozzi.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: Double Fréchet Derivatives

In this section, we recall some basic facts we are using about the Fréchet derivatives of a function $f: V \mapsto W$, where V, W are given Banach spaces.

Definition 1

We say that f has a Fréchet derivative $\nabla _xf=Df(x)$ at $x \in V$ if there exists a bounded linear map $A:V \mapsto W$ such that

$$\begin{aligned} \lim _{h \rightarrow 0} \frac{||f(x+h)-f(x)-A(h)||_{W}}{||h||_{V}} =0. \end{aligned}$$

Then we call A the Fréchet derivative of f at x and we put $Df(x) =A$.

Note that $Df(x) \in L(V,W)$ (the space of bounded linear functions from V to W), for each x.

Definition 2

We say that f has a double Fréchet derivative $D^2 f(x)$ at x if there exists a bounded bilinear map $A(h,k): V \times V \mapsto W$ such that

$$\begin{aligned} \lim _{k \rightarrow 0} \frac{||Df(x+k)(h)-Df(x)(h)-A(h,k)||_{W}}{||h||_{V}} =0. \end{aligned}$$

Example 1

Suppose $f:\mathbb {M} \mapsto \mathbb {R}$ is given by
$$\begin{aligned} f(\mu )=\langle \mu ,q\rangle ^2, \text { where } q(x)=x. \end{aligned}$$
Then
$$\begin{aligned} \begin{aligned} f(\mu +h) -f(\mu )&= \langle \mu +h,q\rangle ^2 -\langle \mu ,q\rangle ^2\nonumber \\&= 2 \langle \mu ,q\rangle \langle h,q\rangle +\langle h,q\rangle ^2, \end{aligned} \end{aligned}$$
so we see that
$$\begin{aligned} Df(\mu )(h)=2\langle \mu ,q \rangle \langle h,q \rangle . \end{aligned}$$
To find the double derivative, we consider
$$\begin{aligned}&Df(\mu +k)(h)-Df(\mu )(h)\nonumber \\&=2\langle \mu +k,q\rangle \langle h,q \rangle -2\langle \mu ,q \rangle \langle h,q \rangle \nonumber \\&=2 \langle k,q \rangle \langle h,q \rangle , \end{aligned}$$
and we conclude that
$$\begin{aligned} D^2f(\mu )(h,k)=2\langle k,q\rangle \langle h,q\rangle . \end{aligned}$$
Next assume that $g:{\mathbb {M}}\mapsto \mathbb {R}$ is given by $g(\mu )=\langle \mu ,q\rangle $. Then, proceeding as above we find that
$$\begin{aligned} Dg(\mu )(h)&=\langle h,q\rangle \text { (independent of } \mu )\\&\text { and }\nonumber \\ D^2g(\mu )&=0. \end{aligned}$$

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Agram, N., Pucci, G. & Øksendal, B. Impulse Control of Conditional McKean–Vlasov Jump Diffusions. J Optim Theory Appl 200, 1100–1130 (2024). https://doi.org/10.1007/s10957-023-02370-6

Download citation

Received: 30 January 2023
Accepted: 16 December 2023
Published: 19 January 2024
Issue Date: March 2024
DOI: https://doi.org/10.1007/s10957-023-02370-6

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Impulse Control of Conditional McKean–Vlasov Jump Diffusions

Abstract

Similar content being viewed by others

The Stochastic Maximum Principle for a Jump-Diffusion Mean-Field Model Involving Impulse Controls and Applications in Finance

A Stochastic Maximum Principle for a Markov Regime-Switching Jump-Diffusion Model with Delay and an Application to Finance

Pointwise Second-Order Necessary Conditions for Stochastic Optimal Control with Jump Diffusions

1 Introduction

2 Conditional McKean–Vlasov SDEs with Jumps

Definition 2.1

Lemma 2.1

Proof

Theorem 2.1

Proof

3 Fokker–Planck SPDE with Jumps

Theorem 3.1

Proof

Remark 3.1

4 A General Formulation and a Verification Theorem

Notation 4.1

Problem 4.1

Definition 4.1

Theorem 4.2

Remark 4.1

Proof

5 Example: Optimal Stream of Dividends Under Transaction Costs

Problem 5.1

Theorem 5.1

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendix: Double Fréchet Derivatives

Appendix: Double Fréchet Derivatives

Definition 1

Definition 2

Example 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation