1 Introduction

The popularity of healthcare insurance products and the ever-increasing demand for such products can be attributed to increased awareness of longevity risk and advancements in medical technology. Because of decreasing mortality and fertility rates, the population of many wealthy countries is rapidly aging (Cristea et al. 2020). Longevity increases contribute to rising medical expenditures and increased demand in the health insurance industry. Levantesi et al. (2020) used a machine learning technique to predict mortality and improve longevity. Individuals’ financial responsibilities are reduced by health insurance in the case of an illness or injury that necessitates hospitalization or results in a loss of income. The private health insurance market covers most of the medical expenses in nations without National Health Insurance, such as the United States. The Health Insurance Portability and Accountability Act of 1996 also requires practically all individual insurance policies to be guaranteed renewable. However, their market share remains limited due to the lack of appeal and variety in these products. Looking at the health insurance industry, we can see that spending has increased dramatically over the last 50 years (Dieleman et al. 2017). Growth and aging of the human population, increased demand for health care owing to economic development, and rising healthcare expenditures are all factors that contribute to the rise in health insurance premiums. Health policymakers in the public sector and management of private health insurance companies are concerned about the rise of health insurance and its financing.

This research suggests a new variable annuity product embedded with GLWB benefit with a dynamic withdrawal strategy dependent on the policyholder’s health status. We assume that X(t) is a continuous-time Markov chain. This choice is due to the flexibility of this class of stochastic processes in describing systems that randomly change state in time, see, e.g., D’Amico and Villani (2021) and De Blasis (2020) and due to its primary benefits of simplicity and out-of-sample forecasting accuracy. Moreover the value of health state is influenced only by its current state, and not by any prior activity. In essence, it can be predicted based solely upon the current circumstances surrounding the variable. Other main reason of using Markovian models is because they can enrich the point estimates with error estimates or even provide the whole probability distribution. A large body of literature has shown the ability of Markov chains to describe disability states (see, e.g., Pitacco 1995) or hospitalization conditions, and mode of discharge (see, e.g., Jones et al. (2019)). The works by Manton et al. (1993), Haberman and Pitacco (2018), Pritchard (2006), Baione and Levantesi (2014) and Yang et al. (2016) have used continuous-time Markov processes to model the health status transitions.

The research on variable annuities with health status influences is significantly less. The Life Care Annuity-Guaranteed Lifetime Withdrawal Benefit was firstly proposed by Hsieh et al. (2018). Fard and Rong (2014) proposed a model for valuing ruin contingent life annuities under the regime-switching variance gamma process. In the direction of hospitalization coverage based on the claims of families and people covered by the policy, Tessera (2007) developed two probabilistic models for claim size in health insurance. A statistical model to predict the incidence and cost of hospitalizations for a given chronic disease was developed by Rosenberg and Farrell (2008). D’Amato et al. (2013) analyzed the performance of a portfolio of participating variable annuities focusing on the minimum income level.

Moosavi and Payandeh (2021) developed a model for the valuation of variable long-term care annuities with static guaranteed lifetime withdrawal and limited hospitalization coverage benefits. They used two investment funds in which the Geometric Brownian Motion (GBM) model is used with constant volatility. Moreover, they considered only limited health status states.

The contribution of this article is to present a modeling framework that gives the policyholder the flexibility to withdraw different amounts according to his/her health status. A one-time lump sum is invested in a risky fund that follows a GBM. The expected life and death benefits are calculated, and the fair rate of the fee charged by the insurance company is determined as the solution of the equilibrium equation between expected benefits and premium. The remainder of the paper is laid out as follows: Sect. 2 includes the description of the continuous-discrete time model and the mathematical formulas needed to determine the fair fee. Section 3 includes data description along with numerical results of calculating the fee for \( S \& P\) 500 index and the effect of age and guaranteed withdrawal amount on the fee. Section 4 summarizes our contribution and results with some potential further work.

2 Model description

This section presents the random processes involved in analyzing the GLWB with Markovian health states. The model is mixed discrete-continuous in time because while the policyholder’s health state and the fund’s value evolve continuously in time, the payment of the benefits occurs according to a discrete-time scale. This choice agrees with Piscopo and Haberman (2011), who presented a continuous and discrete-time model with death and GLWB benefit without any health status benefit but using mortality rates. The health status changes in time according to a stochastic process \(\{X(t),t \ge 0\}\) with state-space \(E=\{1,2,3,\ldots ,D\}\). The elements \(1,2,3\ldots \) represent different health states of the policyholder and D is the death of the insured. Moosavi and Payandeh (2021) provided with a continuous-time health model with GLWB with limited health states i.e. \(\{1,2,\ldots ,7\}\), with 7 as death (see Fig. 1). In this section, we will describe the policyholder’s health status as a continuous-time Markov chain with health states \(E=\{1,2,\ldots ,D\}\). The benefits are calculated based on the policyholder’s health evolution, and the fair fee is computed according to a balance condition between premium and expected benefits. This will provide the actuarial sector with a better product that combines the benefit of GLWB and includes health insurance features.

This paper uses capital letters to denote random variables, and the following notations are considered.

Fig. 1
figure 1

The health status of the policyholder according to Moosavi and Payandeh (2021)

With the notations mentioned above and for simplicity, let us make some assumptions that the model’s description will follow.

Assumption

  1. 1.

    We analyze a single index and assume that the premium is a one-time lump sum investment of \(h_{0}\) made by the insured at the start of the contract, which makes \(\frac{h_{0}}{s_{0}}\) as the number of initial stocks.

  2. 2.

    The insurance company’s fee is deducted from the funds value when fund units are canceled at a rate equal to \(\gamma \).

  3. 3.

    We will assume the fee is charged continuously, and the guarantee (g % of \(h_{0}\)) is deducted by the policyholder over a discrete-time set.

  4. 4.

    Death benefits are paid when the insured dies.

Let us consider \(\{X(t),t \ge 0\}\) be a stochastic process describing the health status of the policyholder at time t. Let \(T_{a}\) be the set of times where the payments of benefits occur, i.e.,

$$\begin{aligned} T_{a}=\{ t \ge 0 : t= ak \text { with a } \in \mathbb {R^{+}} \text { and k } \in \mathbb {N}\}. \end{aligned}$$
(1)

For example if time is measured in months, and benefits are paid annually, then \(a=12\) so that

$$\begin{aligned} k=1, t=12 \text { months }=1 \text { year },\\ k=2, t=24 \text { months }=2 \text { year }, \end{aligned}$$

and so on.

At a first stage \(\{X(t),t \ge 0\}\) is considered to be a continuous-time Markov chain with finite state space \(E = \{1, 2, \ldots , D \}\). For \(t>s\), let us define \(p_{ij}(t-s)=\) \(\mathbb {P}(X(t)=j | X(s)=i)\) be the transition probability from state i at time s to state j at time t and let \({\textbf {Q}}\) be the generator matrix for \(\{X(t),t \ge 0\}\) such that

$$\begin{aligned} q_{ij}=\lim _{t \rightarrow 0} \frac{p_{ij}(t)}{t} ~~ \forall i\ne j ~~ \text {and}~~ q_{ii}=-\sum _{j \ne i}q_{ij}. \end{aligned}$$

Then, the following formula holds true

$$\begin{aligned} p_{ij}(t-s)=\mathbb {P}(X(t)=j| X(s)=i)=(e^{{\textbf {Q}}(t-s)})_{(i,j)}=\left( \sum _{l=0}^{\infty }\frac{(t-s)^{l}{} {\textbf {Q}}^{l}}{l!}\right) _{(i,j)}. \end{aligned}$$
(2)

Let g(t) be the health expenditure at time t with health state X(t) such that for \(t \in T_{a}\)

$$\begin{aligned} g(t)= g(X(t)). \end{aligned}$$
(3)

It is important to note that the wealth and income of an individual, as well as the likelihood or probability of getting sick, can all affect health expenditures. It is possible to take these variables into account as covariates in both the generator matrix \({\textbf {Q}}\) and the function g(t). We do not go into further details about this possibility because, while it is simple to model mathematically, it is not realistic given the lack of actual information regarding the social conditions of the policyholders.

Let G(t) be the guaranteed amount at time t then specifically, we consider \(G(t)=G\left( X(t)\right) h_{0}\) as a certain percentage of the initial investment to express the random guaranteed withdrawal rate paid by the insurer to the policyholder at time \(t= ka, k \in \mathbb {N}\) as

$$\begin{aligned} G(X(t))= W(X(t)) + g(X(t)) , \end{aligned}$$
(4)

where W(X(t)) is the random amount offered by the GLWB rider. Policyholder will get a variable amount W(X(t)) through out his/her life and along with it he/she can withdraw amounts according to his/her health status \(\{X(t),t \ge 0\}\), which may depends on the policyholder’s will also.

Now, to find out the break-even fee \((\gamma ^*)\), we equate the outflow which consists of lump sum premium \(h_{0}\) to the expected present value (EPV) of the inflows, which consists of discounted expected life benefit (ELB) and discounted expected death benefit (EDB) to that of outflows. Hence,

$$\begin{aligned} h_{0}=ELB + EDB. \end{aligned}$$
(5)

It is worth noting that a safety margin could be added to the break-even fee to ensure profits for the insurance company, or loaded premiums could be used instead, see, e.g., Furman and Zitikis (2008).

2.1 Computation of life benefit

Benefits are paid as a random amount \(G(X(t))h_{0}\) according to the set of times \(T_{a}\) periodically. The accumulated benefits over the life of the policyholder are given by

$$\begin{aligned} LB(0)=\sum _{t \in T_{a}} G(X(t))h_{0} e^{-rt} \mathbbm {1}_{\{X(t) \ne D\}}. \end{aligned}$$
(6)

Now, take the conditional expectation of Eq. (6), that is

$$\begin{aligned} ELB_{i}=\mathbb {E}[LB(0)| X(0)=i]=\mathbb {E}_{i}[LB(0)]. \end{aligned}$$
(7)

It denotes the conditional expected life benefits given health state i of the policyholder at initial time zero. The \(ELB_{i}\) can be obtained as follows

$$\begin{aligned} ELB_{i}=\sum _{t \in T_{a}} \mathbb {E}_{i}[G(X(t))\mathbbm {1}_{\{X(t) \ne D\}}]h_{0}e^{-rt}, \end{aligned}$$
(8)

where \(ELB_{D}=0\). This expected value can be evaluated according to the time scale of benefit payments

$$\begin{aligned} ELB_{i}= & {} \sum _{k \ge 1} \mathbb {E}_{i}[G(X(ka))\mathbbm {1}_{\{X(ka) \ne D\}}]h_{0}e^{-rka}, \end{aligned}$$
(9)
$$\begin{aligned}= & {} \sum _{k \ge 1}\sum _{j \ne D} G(j)\mathbb {P}(X(ka)=j| X(0)=i)h_{0}e^{-rka} , \end{aligned}$$
(10)
$$\begin{aligned}= & {} \sum _{k \ge 1}\sum _{j \ne D} G(j)(e^{{\textbf {Q}}ka})_{(i,j)}h_{0}e^{-rka}. \end{aligned}$$
(11)

Formula (11) calls for the evaluation of transition probabilities at various points in time and entails an infinite summation. A more efficient way to compute the \(ELB_{i}\) is by establishing a system of linear equations for them and solving it using matrix representation. The computation are in line with Markov reward formalism, see, e.g., D’Amico et al. (2010, 2015).

Proposition 1

The expected life benefits can be calculated according to the following matrix formula:

$$\begin{aligned} {\textbf {ELB}}= \Big (I-e^{-ra}\cdot {\textbf {P}}(a)\Big )^{-1}*\Big (e^{-ra}\cdot {\textbf {P}}(a)*{\textbf {G}}\Big ), \end{aligned}$$
(12)

where

$$\begin{aligned} {\textbf {ELB}}&= \begin{bmatrix} ELB_{1},&ELB_{2},&\ldots&,ELB_{D} \end{bmatrix}^{T} \quad \text {with} \quad \quad ELB_{D}=0, \end{aligned}$$
$$\begin{aligned} {\textbf {G}}&= \begin{bmatrix} G(1),&G(2),&\ldots ,&G(D) \end{bmatrix}^{T} \quad \text {with} \quad \quad G(D)=0. \end{aligned}$$

and \({\textbf {P}}(a)=e^{{\textbf {Q}}a}\) represents the transition probability matrix at time a, with (*) as standard matrix product operator.

Proof

According to Equation (8) we have that

$$\begin{aligned} ELB_{i}&= \mathbb {E}_{i}\Big [\sum _{k\ge 1}G(X(ka)\mathbbm {1}_{\{X(t) \ne D\}}h_{0}e^{-rka}\Big ]. \end{aligned}$$

The application of the tower property of conditional expectation gives

$$\begin{aligned}&= \mathbb {E}_{i}\Big [\mathbb {E}[\sum _{k\ge 1}G(X(ka))\mathbbm {1}_{\{X(ka) \ne D\}}h_{0}e^{-rka}/X(a)]\Big ],\\&= \mathbb {E}_{i}\Big [G(X(a))\mathbbm {1}_{\{X(a) \ne D\}}h_{0}e^{-ra}\\&\quad +\mathbb {E} \Big [\sum _{k \ge 2}G(X(ka))\mathbbm {1}_{\{X_{ak} \ne D\}}h_{0}e^{-rka}/ X(a)\Big ]\Big ]. \end{aligned}$$

Now we change variable by setting \(s=k-1\) in the former expectation to get

$$\begin{aligned} ELB_{i}&= \mathbb {E}_{i}\Big [G(X(a))\mathbbm {1}_{\{X(a) \ne D\}}h_{0}e^{-ra}\end{aligned}$$
(13)
$$\begin{aligned}&\quad +e^{-ra}\mathbb {E}\Big [\sum _{s \ge 1}G(X(sa+a))\mathbbm {1}_{\{X(sa+a) \ne D\}}h_{0}e^{-rsa}/X(a)\Big ]\Big ]. \end{aligned}$$
(14)

Observe now that

$$\begin{aligned}{} & {} \mathbb {E}\Bigg [\sum _{s \ge 1}G(X(sa+a))\mathbbm {1}_{\{X(sa+a) \ne D\}}h_{0}e^{-rsa}/X(a)\Bigg ],\\{} & {} \quad = \sum _{s \ge 1}\sum _{j \ne D}G(j)P(X(sa+s)=j/X(a))h_{0}e^{-rsa},\\{} & {} \quad = \sum _{s \ge 1}\sum _{j \ne D}G(j)(e^{{\textbf {Q}}(sa)})_{(X(a),j)}h_{0}e^{-rsa},\\{} & {} \quad = \textit{ELB}_{X(a)}. \end{aligned}$$

Then,

$$\begin{aligned} ELB_{i}&= \mathbb {E}_{i}\Bigg [G(X(a))\mathbbm {1}_{\{X(a) \ne D\}}h_{0}e^{-ra}+ e^{-ra}ELB_{X(a)}\Big ],\\&= \sum _{j \in E}(e^{({\textbf {Q}}a)})_{(i,j)}\Big [G(j)h_{0}e^{-ra}+ e^{-ra}ELB_{j}\Bigg ]. \end{aligned}$$

The previous equation can be written in matrix form by using the column-vectors ELB and G as follows

$$\begin{aligned} {\textbf {ELB}}&= e^{-ra}\cdot {\textbf {P}}(a)*\Big ({\textbf {G}}+{\textbf {ELB}}\Big ). \end{aligned}$$

Algebraic manipulations give

$$\begin{aligned} \Big (I-e^{-ra}\cdot {\textbf {P}}(a)\Big )*{\textbf {ELB}}&= e^{-ra}\cdot {\textbf {P}}(a)*{\textbf {G}}, \end{aligned}$$
(15)

which in turn provides

$$\begin{aligned} {\textbf {ELB}}&=\Big (I-e^{-ra}\cdot {\textbf {P}}(a)\Big )^{-1}* e^{-ra}\cdot {\textbf {P}}(a)*{\textbf {G}}, \end{aligned}$$
(16)

provided that \(\Big (I-e^{-ra}\cdot {\textbf {P}}(a)\Big )^{-1}\) exists.

In any case, \({\textbf {P}}(a)\) is a stochastic matrix, thus its spectral radius \(\rho ({\textbf {P}}(a))=1\) as a consequence of Perron-Frobenius theorem.

Moreover, observe that the matrix \(I-e^{-ra}\cdot {\textbf {P}}(a)\) can be rewritten as follows:

$$\begin{aligned} I-e^{-ra} \cdot {\textbf {P}}(a)=\frac{1}{e^{ra}}\Big (e^{ra}\cdot I-{\textbf {P}}(a)\Big ). \end{aligned}$$
(17)

The matrix \(e^{ra}\cdot I-{\textbf {P}}(a)\) belongs to the class of M-matrix (see Le and Tsatsomeros 2021) and is non-singular if and only if \(e^{ra} \ge \rho ({\textbf {P}}(a))=1\) which is trivially satisfied being \(e >1\) and \(a \ge 0\). \(\square \)

Remark

Proposition 1 provides an exact representation of the ELB in terms of the model’s parameters. As shown, there is no need to introduce any maximal allowed age for the policyholder as done in Piscopo and Haberman (2011) or to implement Monte Carlo simulations as developed by Moosavi and Payandeh (2021).

2.2 Computation of death benefit

According to Piscopo and Haberman (2011), we adopt the GBM hypothesis under the risk-neutral measure \(\tilde{\mathbb {P}}\). The conservative GBM model assumes that return volatility remains constant throughout time. Under a risk-neutral metric, the stock price dynamic for a GBM is

$$\begin{aligned} dS_{t}={\mu }S_{t}dt + \sigma S_{t}dB_{t}, ~~~s_{0}>0, \end{aligned}$$
(18)

here \(\mu \) is the drift coefficient, \(\sigma ^{2}\) is the variance and \(\{B_{t}, t \ge 0 \}\) is a standard Wiener process. Then the stock price at time t can be represented as

$$\begin{aligned} S_{t}=s_{0} e^{\left( \mu -\frac{1}{2} \sigma ^{2}\right) t+\sigma B_{t}}. \end{aligned}$$
(19)

Moreover, as usual, we denote by \(\gamma \) (in basis points) the insurance fee paid as a function of the asset. Furthermore we have a sequence of random variables \(\{G(X(t))\}_{t \in T_{a}}\) denoting the time-varying withdrawal from the fund at time t. Note that we do not consider a rate of withdrawal, as rather a sequence of withdrawal of random amounts depending on the health state of the policyholder. Under these assumptions, the dynamic of the fund can be described as follows (Fig. 2)

$$\begin{aligned} \left\{ \begin{array}{ll} dH_{t}=(\mu -\gamma )S_{t}dt + \sigma {S_{t}}dB_{t}, &{} \forall ~ t : \, ak< t < a(k+1) \\ H_{t}= H_{t^{-}}- G(X(t))h_{0}, &{}\forall ~ t \in T_{a} . \end{array}\right. \end{aligned}$$
(20)

Here the present value of expected death benefit can be calculated. Let \(\tau \) be the time of death of the policyholder, that is

$$\begin{aligned} \tau = \inf \,\{t \in \mathbb {R} : X(t)=D\}. \end{aligned}$$
Fig. 2
figure 2

Dynamic of fund

Let \(DB_{\beta }(\gamma )\) be the death benefit conditional on the death at time \(\tau = \beta \in \mathbb {R}_{+} \) then,

$$\begin{aligned} DB_{\beta }(\gamma )=\max \{H_{\beta },0\}. \end{aligned}$$

It results that

$$\begin{aligned}{} & {} DB_{\beta }(\gamma )=e^{(\mu -\gamma -\sigma ^{2}/2)\beta +{\sigma }B_{\beta }}\max [0,h_{0} -\sum _{v=1}^{\lfloor \beta \rfloor }G(X(va))h_{0}e^{-(\mu -\gamma -\sigma ^{2}/2)va-{\sigma }B_{va}}],\nonumber \\ \end{aligned}$$
(21)

where \(\lfloor \beta \rfloor \) = max{\(k \in \mathbb {N}: k \cdot a \le \beta \)}.

Now the expected death benefit is just the discounted expectation of the conditional death benefit with respect to the random time of death, i.e.,

$$\begin{aligned} EDB(\gamma )=\int _{0}^{\infty }e^{-r\beta }\mathbb {E}[DB_{\beta }(\gamma )]\lambda (\beta )d\beta , \end{aligned}$$

where

$$\begin{aligned} \lambda (\beta )=\lim _{h \rightarrow 0}\frac{\mathbb {P}(\beta < \tau \le \beta +h| \tau > \beta )}{h}, \end{aligned}$$

is the so-called failure rate function. For an n-state continuous-time Markov chain, it can be expressed according to Sadek and Limnios (2005)

$$\begin{aligned} \lambda ({\beta })= \left\{ \begin{array}{ll} \frac{-1}{R(\beta })\cdot \frac{dR(\beta )}{d\beta }= -\frac{\alpha _{1}e^{\beta {\textbf {Q}}_{(n-1,n-1)}}\cdot {\textbf {Q}}_{(n-1,n-1)}\mathbbm {{1}}_{(n-1)}}{{\alpha _{1}}e^{\beta {\textbf {Q}}_{(n-1,n-1)}}\mathbbm {1}_{(n-1)}} &{} \quad \hbox {if } R(\beta ) \ne 0, \\ 0 &{} \quad \hbox {if } R(\beta )=0, \end{array} \right. \end{aligned}$$
(22)

where Q is the generator matrix of the Markov chain, \(\alpha \) is the initial probability distribution, \(R(\beta )\) the reliability function and \(\mathbbm {1}_{(n-1)}\) is the unitary vector of the dimension \(n-1\). We then have

$$\begin{aligned} LB_{0}+DB_{0}=\sum _{i=1}^{D}\alpha _{i}\cdot ELB_{i} + \int _{0}^{\infty }e^{-r\beta }\mathbb {E}[DB_{\beta }(\gamma )]\lambda (\beta )d\beta , \end{aligned}$$
(23)

hence, we get the final equation as

$$\begin{aligned} h_{0}= {\alpha }^{T}*{\textbf {ELB}} +EDB(\gamma ). \end{aligned}$$
(24)

The above equation in \(\gamma \) is solved numerically to obtain the break-even value for the fee (\(\gamma ^{*}\)) charged by the insurer.

3 Numerical results and sensitivity analysis

In this section description of the data taken for the numerical calculation is mentioned, along with the numerical results for the given model.

3.1 Data description

In this paper, \( S \& P\) 500 composite index prices from the US market are considered for the numerical analysis. The index was chosen based on data availability and popularity among the people of the respective country. We considered information from January 1, 1970, until October 31, 2019. The data for \( S \& P\) 500 index is taken from Thomson and Reuters Datastream. Furthermore, annual interest rate data for the US market for the previous 50 years is taken from https://fred.stlouisfed.org/.

In evaluating an insurance health product, the health state of policyholders and their corresponding transition probabilities play critical roles. According to Moosavi and Payandeh (2021), a seven-state continuous-time Markov process has been used to classify an individual’s health status based on his/her ability to perform daily activities. Successfully performing instrumental activities of daily living (IADL) such as light housework, laundry, grocery shopping, meal preparation, getting around outside, money management, and using the telephone and activities of daily living (ADL) such as eating classified as (level 1), bathing (level 2), dressing (level 3), moving around (level 4), doing personal hygiene (level 5) and going to the toilet (level 6), see also Pritchard (2006), Haberman and Pitacco (2018) for more details.

The state of each policyholder begins from state 1 (healthy) and moves to state 2 (impairment in at least one IADLs), state 3 (ADL level = 1 and 2), state 4 (ADL level = 3 and 4), state 5 (ADL level = 5 and 6), state 6 (stay at a hospital) and finally state 7 (death). The state transition diagram for the health status of the policyholder is shown in Fig. 1. This study follows two-year transition probabilities between disability states calculated from the 1982 and 1984 National Long Term Care Survey, as a percentage, for males and females mentioned in Pritchard (2006). Table 1 shows the two-year transition probabilities. For example, the (1, 7)-th entry (i.e., 4.04) of the matrix represents the chances of dying of a healthy person in percentage within two years. Parameters \(\mu \) and \(\sigma \) are first estimated from real data using statistical method using R software in order to calculate the expected death benefit EDB. Estimated value for \(\mu \) is 4.21 and \(\sigma \) is 1.17. For the numerical calculation RStudio 2021.09.0 Build 351 is used in the system with configuration as follows: AMD Ryzen 5 3500U with Radeon Vega Mobile Gfx 2.10 GHz, 64-bit operating system, x64-based processor, and 8GB RAM.

Table 1 Two-year transition probabilities between disability states calculated from the 1982 and 1984 NLTCS, as a percentage, for males and females according to Pritchard (2006)

3.2 Numerical calculation

For the numerical analysis the following assumptions are considered:

  1. 1.

    Premium paid is a lump sum amount of 100, i.e., \(h_{0}=\$100\) and a is considered to be 24 months.

  2. 2.

    The range of break-even fee (\(\gamma ^*\)) is considered to be 0 to 1000 basis points (bp).

  3. 3.

    Initially, the State of each policyholder begins with state 1 (healthy) i.e., \(\alpha =(1,0,0,0,0,0,0)\).

  4. 4.

    Withdrawal is according to the health status of the patient such that for the health states 1 and 2, the withdrawal amount is assumed to be \(2\$\), for states 3 and 4, it is assumed to be \(3\$\), and for 5th and 6th state it is assumed to be \(4\$\). That is

    $$\begin{aligned} {\textbf {G}}&= \begin{bmatrix} 2,&2&,3&,3&, 4&,4 \end{bmatrix}^{T}. \end{aligned}$$

In order to move further with applying the model, the generator matrix \({\textbf{Q}}\) of the Markov chain must be estimated. We use the results shown in Table 1 due to rough data unavailability. This table, as previously stated, provides 2-year transition probability matrices. From the knowledge of these matrices, it is possible, in some cases, to recover the generator matrix.

Indeed, the maximum likelihood estimator \(\hat{\textbf{Q}}\) of the generator matrix \(\textbf{Q}\) satisfies the relation \(\hat{\textbf{P}}=\exp (a \hat{\textbf{Q}})\) and it can be obtained as the logarithm matrix,

$$\begin{aligned} \hat{\textbf{Q}}=\frac{\log (\hat{\textbf{P}})}{a}, \end{aligned}$$
(25)

where \(\hat{\textbf{P}}\) is the estimate of transition probability matrix with data observed periodically at every integer multiple of the real number a. The matrix \(\hat{\textbf{P}}=(\hat{p}_{i j})_{i,j\in E}\) can be estimated according to

$$\begin{aligned} \hat{p}_{i j}=\frac{K_{i j}}{K_{i}}, \end{aligned}$$

where \(K_{i j}\) is the number of transitions from state i to state j, and \(K_{i}=\sum _{j=1}^{D} K_{i j}\) is the total number of times households have been allocated to state i. The period of observation is \(a=2\) years in our application. It should be remarked that the maximum likelihood estimator of \(\textbf{Q}\) under this observational scheme is not guaranteed to exist or to be unique (see, e.g., Bladt and Sørensen 2005; Regnault 2012). It was proved in D’Amico and Regnault (2018) that estimator \(\hat{\textbf{Q}}\) exists and is unique whenever the Markov chain is irreducible, and the transition probability matrix has positive eigenvalues. The estimate of transition probability matrices is given in Table 1. Thus, the estimated generator matrix for ages 65, 71 and 76 estimated are given by \(\hat{\textbf{Q}}_{65}\), \(\hat{\textbf{Q}}_{71}\) and \(\hat{\textbf{Q}}_{76}\):

\(\hat{\textbf{Q}}_{65}= \begin{pmatrix} -0.1157 &{} 0.0396 &{} 0.0237&{}0.0054&{}0.0057&{}0.0044&{}0.0367\\ 0.5195 &{} -1.2726 &{} 0.4570&{}0.0689&{}0.0876&{}0.0275&{}0.1122\\ 0.1952&{}0.3982&{}-1.2717&{}0.3801&{}0.0983&{}0.0356&{}0.1638\\ 0.0174&{}0.00008&{}1.0290&{}-1.7539&{}0.4354&{}0.0749&{}0.1976\\ 0.0608&{}0.1349&{}0.1039&{}0.3188&{}-1.1832&{}0.1036&{}0.4548\\ 0.0883&{}0.0135&{}0.01204&{}0.0453&{}0.000004&{}-0.3273&{}0.1702\\ 0&{}0&{}0&{}0&{}0&{}0&{}0 \end{pmatrix} \)

\(\hat{\textbf{Q}}_{71}= \begin{pmatrix} -0.1913&{}0.0629&{}0.0265&{}0.0110&{}0.0117&{}0.0137&{}0.0676\\ 0.4540&{}-1.2719&{}0.5823&{}0.0258&{}0.0585&{}0.0754&{}0.1276\\ 0.0940&{}0.4279&{}-1.3158&{}0.4342&{}0.0408&{}0.1412&{}0.1764\\ 0.1069&{}0.0002&{}0.5720&{}-1.4850&{}0.6984&{}0.0047&{}0.1145\\ 0.0707&{}0.0850&{}0.1921&{}0.3470&{}-1.3976&{}0.1870&{}0.5019\\ 0.0382&{}0.0074&{}0.0305&{}0.0176&{}0.0538&{}-0.6540&{}0.5065\\ 0&{}0&{}0&{}0&{}0&{}0&{}0 \end{pmatrix} \)

\(\hat{\textbf{Q}}_{76}= \begin{pmatrix} -0.2799&{}0.1012&{}0.0507&{}0.0144&{}0.0110&{}0.0183&{}0.0865\\ 0.2782&{}-1.3390&{}0.6011&{}0.0628&{}0.1035&{}0.1340&{}0.0872\\ 0.1279&{}0.4041&{}-1.3039&{}0.4069&{}0.0797&{}0.0643&{}0.2218\\ 0.0185&{}0.0147&{}0.7587&{}-1.8751&{}0.6317&{}0.2237&{}0.2774\\ 0.0540&{}0.0900&{}0.0340&{}0.2974&{}-1.1518&{}0.2267&{}0.4483\\ 0.0212&{}0.0321&{}0.0054&{}0.0140&{}0.0275&{}-0.5715&{}0.4718\\ 0&{}0&{}0&{}0&{}0&{}0&{}0 \end{pmatrix} \)

Table 2 Break even fee (\(\gamma ^*\)) for age 65, 71 and 76

According to Eq. (24), the break-even fee is the value of \(\gamma \) for which the expected discounted benefits equal the premium \(h_{0}\). The break-even fee corresponding to different guarantee amounts for ages 65, 71 and 76 is shown in Table 2. In the table’s first column, we consider a constant variation \(\epsilon \), which is added to all the elements of the vector \({\textbf {G}}\). Thus, for example when \(\epsilon \) is 1.12 the \({\textbf {G}}\) modifies to

$$\begin{aligned} {\textbf {G}}&= \begin{bmatrix} 3.12,&3.12&,4.12&,4.12&, 5.12&,5.12 \end{bmatrix}^{T}. \end{aligned}$$

Results from Table 2 illustrate that with the increase in \(\epsilon \), the fee associated with the contract also increases. Furthermore, there is no need for a break-even charge corresponding to each guarantee percentage. Even if the cost is 0 bp, a very low guarantee will not allow the contract’s present value to match the premium. Similarly, a very high guarantee cannot make the contract’s current value equal to the premium, even if the cost is 3000 bp. Therefore the value of \(\epsilon \) has been selected such that the value of the break-even fee exists between 0 and 1000 basis points.

Additionally, in Fig. 3, plots are drawn of fee vs. value of shift parameter (\(\epsilon \)) for different ages to support our results graphically. Increases in \(\epsilon \) enhance the guaranteed withdrawal amounts while decreasing the death benefit value. The increase in the value of the life benefit overcomes the reduction in the value of the death benefit as \(\epsilon \) increases. As a result, as the value of \(\epsilon \) rises, the value of GLWB as a whole rises with it. The effect is more significant for younger policyholders, as the difference in GLWB values corresponding to different \(\epsilon \) values decreases and eventually vanishes as the policyholder’s age at inception rises. Senior policyholders might thus benefit from a larger guarantee value without having to pay a higher charge.

3.3 Sensitivity analysis

In this section, we will study the change in the expected present value of the contract with respect to the different parameters.

  1. 1.

    Age: From Fig. 3, it is observed that the initial insured age also impacts the fee, i.e., with the increment of initial age, GLWB value increases, and hence \(\gamma ^*\) increases. We examine the influence on the death and living benefits of a change in age at inception y on the GLWB value. As the initial insured age increases, contract length decreases but also increases the level of the health status, resulting in fewer withdrawals with an increment in the withdrawal amount and a lower living benefit amount. Furthermore, when the initial insured age decreases, the time it takes to discount the death benefit value is reduced, resulting in a more considerable death benefit. An increase in initial insured age increases the overall GLWB value since the loss in living benefit value is lesser than the death benefit value for younger policyholders. However, for senior policyholders, this gap between the death and living benefits keeps decreasing.

  2. 2.

    Guaranteed withdrawal rate: In Fig. 4, the plot of EPV vs. \(\epsilon \) is drawn to show the change in the behavior of the GLWB value with the change in the shift parameter \(\epsilon \). An increase in \(\epsilon \) enhances the overall withdrawal amounts while decreasing the death benefit value. The increase in the value of the life benefit overcomes the reduction in the value of the death benefit as \(\epsilon \) rises. As a result, as the value of \(\epsilon \) rises, the value of the GLWB as a whole rises with it. The effect is greater for the senior policyholders, as the GLWB value increases with an increase in age and \(\epsilon \). Younger policyholders might thus have some benefit from low charges with a larger guarantee amount.

Fig. 3
figure 3

Break even fee for different age groups

4 Conclusion and future work

This study proposed and evaluated a new product that is the variable annuity which is accompanied by the health status and hospitalization coverage benefit along with the guaranteed lifelong withdrawal benefit. In this article, a mixed continuous-discrete time model with constant volatility and rate of interest is proposed to calculate the fair value of the product.

People approaching retirement must be insured from outliving their assets and protected from aging problems. This product may provide a more comprehensive solution to all these needs. In this article, expected life and death benefits are evaluated, and the fair fee is determined.

Fig. 4
figure 4

EPV versus shift parameter (\(\epsilon \))

These products can be made more attractive if the health benefits can be modeled better to provide an increase in medical and hospitalization costs to the policyholder. As discussed in the sensitivity analysis section, changes in age and Guaranteed withdrawal amount affect the fair fee (\(\gamma ^*\)) of the product effectively. Future works include the application of more sophisticated models of health evolution like semi-Markov processes, and the consideration of path-dependent health-related rewards (see, e.g., D’Amico 2011). Generalizations in the actuarial variables, such as the valuation of surrender benefits using a suitable penalty process, can also be included in future work. The removal of various financial presumptions, the subsequent application of stochastic volatility, and the interest rate as considered for a different financial problem in Bufalo et al. (2022) is also of interest. An interesting challenge is to consider a multi-dimensional scenario where the fund value depends on a basket of correlated assets. In this case, n-dimensional diffusion equations should be considered as the process generating the fund value according to the share invested in each asset.