Abstract
In the non-life insurance industry, pricing is often done relative to individual criteria of policyholders. Various classification algorithms are in use to categorize policyholders into risk classes defined by the insurer, but classification errors may result from this process. In the light of recent automatic classification practices, it becomes important to assess the risks caused by such errors. In this paper we examine the impact of risk class misspecifications for a simple situation with two risk types. We provide a mean-variance framework for quantitatively studying the insurer’s optimization problem of specifying premiums and we analyze the tradeoff of costs and benefits when classification error probabilities are known.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Risk classification is a classical tool in actuarial practice. Indeed, the distribution of individual risks will often differ substantially, depending on personal characteristics, different exposure, environmental conditions etc. If an insurer applies the same premium across all such categories, this may lead to adverse selection, in the sense that individuals who face a lower premium than appropriate for their true risk will massively enter the contract, raising the price and squeezing out rational individuals with lower risk (see e.g. Antonio and Valdez (2012) for a general discussion). Classification is commonly used in insurance during the underwriting process and the tarification, cf. Olbricht (2012) and Vekeman et al. (2019) for health and life insurance and Weidner et al. (2016) for property lines. Different types of variables, such as quantitative or categorical can be used in various algorithms for actuarial classification purposes (Shi & Shi, 2023). For an overview of classification methods we refer to Hastie et al. (2009). In the literature, numerous authors discuss risk classification in view of adverse selection and efficiency in the Pareto sense, see (Dionne et al., 2013) for a survey on this topic. In Crocker and Snow (1986, 2013) and Hoy (1982) authors study the efficiency of imperfect categorization in a utility setup. In Dionne and Rothschild (2014) and Porrini (2015), authors consider the effect of bans on classification on the market efficiency. Thomas (2007) deals with the customers’ perspective on adverse selection and efficiency of risk classification. The contributions (Crocker & Snow, 1986; Porrini, 2015) consider also costs of categorization. Indeed, risk classification may be costly both computationally (in terms of resource-consuming algorithms in the presence of huge datasets) and in monetary terms (e.g. involving data acquisition from official statistics or from competitors).
The insurer hence needs to select appropriate criteria for rating variables, which may take different forms and are not always motivated by actuarial drivers. For instance, Finger (2001) distinguishes actuarial, operational, social and legal criteria. In some cases, classification may be inaccurate when the main criterion is unobservable or there are legal constraints to use it, e.g. for social or political reasons. For instance, gender-based discrimination is nowadays forbidden in the European Union (see e.g. Sass and Seifried (2014)), even if it is widely considered a relevant characteristic from the statistical point of view. In Schmeiser et al. (2014), authors discuss the perceptions of gender as pricing criteria by the customer and explain the complex interplay between anti-discrimination laws and actuarial principles within the insurance industry, see (Lindholm et al., 2022) for suggestions to deal with this issue. We refer to the recent book (Charpentier, 2023) for a rich source of information and ideas on this topic.
Although criteria to distinguish risk profiles will exist, they may often be unknown to the insurer due to the asymmetry of information, see e.g. Albrecher (2016); Albrecher and Daily-Amir (2017). At the same time, recent years have seen a dramatic increase in both the amount of risk information available and the ability to analyse it statistically. In many situations, risks can be analysed on an almost individual basis, but in any case as elements of much smaller rating pools, i.e. pools of risks that share characteristics such that they can be assumed to have the same loss distribution. Offered insurance cover may also take into account the current risk situation, i.e. environmental variables (e.g. time, place) or even behavioural variables. The small size of rating pools leads to larger estimation errors in estimating the expected cost of insurance cover. In the presence of competing insurance providers and given the transparency of the prices of their insurance offers, customers may tend to choose the cheapest offer, i.e. an offer that is too low compared to the true but unknown production costs. From the perspective of the insurance company, this phenomenon is known as the "winner’s curse". In order to avoid the negative economic consequences of the winner’s curse, insurance companies like to apply tailor-made surcharges to the offered premium. It is likely that these surcharges, in addition to the higher cost of more granular risk assessment, will lead to higher overall costs for the entire insured portfolio of risks. Hence, the total welfare of the community is reduced. In addition, the overall coverage ratio may be reduced as some risk owners may find the increased cost of insurance too high. An overly granular approach may therefore be counterproductive and there may be an optimal level of rating granularity, together with legal boundaries, see for instance (Finger, 2001; Albrecher et al., 2019) and Tzougas and Kutzkov (2023).
Even when the resulting loss distributions are assumed to be known, each classification method will inherently contain classification errors, which may lead to unprofitable decision making. Note that error rates tend to be higher when one class is less represented in the population, cf. Khalilia et al. (2011) and Weiss and Provost (2001). For a classical reference to empirical evidence on concrete values for error probabilities arising from frequently used statistical methods, see e.g. Bauer and Kohavi (1999). One can also apply empirical methods for the estimation of the misclassification probabilities. For instance, an empirical estimate arises from historical records, when after the loss occurrences, one obtains more information on the risk type of the policyholders. The insured individual can be reclassified using, for example, maximum likelihood techniques, and thus proportions of misclassified individuals can be identified.
In this paper, we would like to quantitatively study the effects of such classification errors in the context of a simple model with only two possible risk classes. In that case one faces two types of errors: assigning a risk of the first class as one of the second and vice versa (a false positive and false negative in statistical terms, or also sensitivity vs. specificity, see e.g. Khalilia et al. (2011), Vekeman et al. (2019)). We compare three scenarios, one where the insurer does not classify the heterogeneous risks and two others where the risks are differentiated, but once with perfect knowledge and once with some probability of misspecification of policyholder types. While the introduction of the classification mechanism allows to price the insurance risks more efficiently, it entails certain fixed costs, and the potential misspecification can impact the insurer’s profit further. In Browne and Kamiya (2012), the authors study a similar framework with incomplete information from the insurer’s side, whereas the customer knows their true risk type. During the underwriting process, the customer may be required to take a test and the result will reveal for what type of coverage they are eligible. In contrast to our present work, the authors consider a plurality of insurers and customers choosing their coverages, while the present paper deals with a one-insurer setting and customers being provided with one single offer. In a similar spirit to Gatzert et al. (2012), we develop a simple framework to assess the respective trade-off between costs of classification and profits. We consider linear and sigmoid-type demand functions of the premium for the probability that a customer accepts an offered policy (rather than deterministic demand functions as in Gatzert et al. (2012)). Our main optimization target is the expected profit for the insurer (see e.g. Kliger and Levikson (1998)), and we consider several risk measures to assess the risk part in the analysis.
The remainder of the paper is structured as follows: In Sect. 2, we start with an insurer’s expected profit approach and a piece-wise linear demand setting. We then consider different scenarios for this setup in Sect. 3. In Sect. 4, we then include the variance in our considerations, and establish mean-variance frontiers for the profit. We illustrate the results and its main drivers in Sect. 5. In order to assess the sensitivity of the results, Sect. 6 then develops a number of extensions, namely a sigmoid-type rather than piece-wise linear demand function, a lower semi-variance and a value-at-risk concept for replacing the variance in the risk assessment of the strategies, as well as a utility function approach to unite the consideration of profitability and variability in one function, together with numerical illustrations for each of these cases. Finally, Sect. 7 concludes.
2 Model setting
Assume that there is only one insurer present in the market, so there is no competition. Let us further assume a population of n individuals, who are all willing to contract insurance, and independently from each other choose whether they enter the insurance contract for a given premium or not. The individuals fall into two types: low risk type with loss random variable L and high risk type with loss random variable H, with underlying cumulative loss distribution functions \(F_L(x)\) and \(F_H(x)\), \(x\ge 0\) respectively (both L and H will typically have atoms at 0, signifying the case of no claim in the considered time period). Define the respective means and variances by
which are all assumed to be finite. All risks are assumed to be independent and identically distributed within each type. Let \(\{p_L, p_H\ge 0\}\) be the actual proportion of the low- and high-risk type among the policyholders (\(p_L+p_H=1\)).
Define an acceptance function \(f_i\) which for any proposed premium P gives the probability for an individual to enter the contract; the form of this function differs for each risk type i. Each individual is assumed to take the decision about entering independent of all the others. If m individuals are offered a premium P and all use the same acceptance function \(f_i\), we then expect \(m\cdot f_i(P)\) individuals to enter the contract. Let us first assume that \(f_i\) is piece-wise linear
where \(P^{max}_i\) is the so-called reservation price, cf. Fig. 1. Clearly, \(f_i\) is non-increasing in P. We set the condition \(\mu _i<P^{max}_i\) to ensure the possibility of positive expected profits for the company.
Our approach represents a stochastic setup for the acceptance of an offered contract, in contrast to Gatzert et al. (2012) who work with (3) as a deterministic demand function for a given price. While for expected profits as dealt with in Sect. 3 this difference does not matter, it will be important for the risk considerations in the subsequent sections. Despite the non-differentiability of the piece-wise linear \(f_i\) at \(P^{max}_i\), this form will lead to simple local solutions. An extension of the results to a more complex, but analytically better tractable sigmoid form will be considered in the Sect. 6.
Remark 2.1
In the classical work of Rothschild and Stiglitz (1978) as well as further models based on it, in a comparable setting of identifying contracts to sell, the authors find optimal solutions in terms of premiums and levels of coverage. For the present purpose and simplicity we, however, prefer to use the concept of the acceptance function, with individuals facing a binary choice of entering the contract or not. That is, we do not allow a partial coverage or deductibles here.
In the following sections, we introduce three different scenarios that the insurer may face. We start with the case where the insurer can observe the risk type of each individual, which we refer to as the full information case. We then consider the situation where the insurer can not distinguish between the two types ex-ante at all. In that case the only possible method of pricing is to not differentiate individuals, and the profitability of the insurance business will then depend on the empirical fraction of each risk type in the population. Finally, the possibility to observe and measure a certain characteristic, which can be discrete or continuous, allows us to classify an observation, but with a certain error probability. This probability depends on the true class of the observation. The introduction of the classification mechanism has certain costs, but allows to better price according to the true risk class. We are interested to quantitatively assess the respective trade-off in this simple model setup.
3 Expected profit in three scenarios
In this section, we focus on the expected profit only. Let us introduce three different scenarios that the insurer may face, starting with the case where the insurer can actually observe the risk type of each individual.
3.1 Full information
The benchmark for our analysis is the situation with no asymmetry of information. Here the insurer can observe the risk type of each individual and therefore price according to the true type. Recall that we know the actual proportion \(\{p_L, p_H\}\) of the population in each class. Therefore, we can maximize profit by differentiating between groups. If, for a given individual, the price is higher than its true risk premium, his/her willingness to accept the contract decreases. Let us denote by \(X^{(L)}_j\) the \(j^{\text {th}}\) loss random variable of the low risk type (independent copies of L) and by \(X^{(H)}_j\) the \(j^{\text {th}}\) high-risk loss variable (independent copies of H). Then, the (random) profit is given by
where \(I_j^L\) and \(I_j^H\) are independent Bernoulli random variables with probabilities \(f_L(P_L)\) and \(f_H(P_H)\), respectively. In this case, an adaptation of Gatzert et al. (2012) establishes that the optimal premiums are independent of n, \(p_L\) and \(p_H\), and they are simply the average of the mean claim size and the maximum premium that the policyholders are willing to pay.
Theorem 3.1
In the full information case, the expected profit of the insurer is maximized by the premium choice
and
for the two risk classes.
Proof
The optimal premium choice is the solution of the following optimization problem:
We notice that \( \mathbb {E}\left( \Pi \right) \) is a continuous function of \(\{x,y\}\) and that for \(\{x<\mu _L,\ y<\mu _H\}\), \( \mathbb {E}\left( \Pi \right) <0\); \(\{x=\mu _L,\ y=\mu _H\}\), \( \mathbb {E}\left( \Pi \right) =0\); \(\{x>\mu _L,\ y>\mu _H\}\), \( \mathbb {E}\left( \Pi \right) >0\). Also, \(\lim _{x\rightarrow +\infty , y\rightarrow +\infty } \mathbb {E}\left( \Pi \right) =0\), which means that \( \mathbb {E}\left( \Pi \right) \) admits a strictly positive maximum for some \(\{x>\mu _L,\ y>\mu _H\}\) (and the optimization in x and y can in fact be separated). We can characterize this point by the following equations:
where the \( \overset{!}{=}\) operator denotes a necessary condition. From (8) we have
Since the optimal solution \(P_L\) respects the condition \(P_L\le P^{max}_L\), as \(\mu _L<P^{max}_L\), it is not necessary to distinguish cases of the piece-wise function. To prove it is indeed a local and global maximum, we can easily prove that the second derivative is negative for a linear f:
The same reasoning holds for the first and second derivative w.r.t. y. \(\square \)
Remark 3.2
It is easy to check that if
then \(P_L<\mu _H\), in which case charging a low-risk type premium to a high-risk type customer results in an expected loss on the individual level. \(\Box \)
In terms of sensitivities, we simply see from (5) and (6) that
for both risk types \(i\in \{L,H\}\). That is, the reactivity of the optimal premium is constant for variation in the mean loss. Consequently, in case of increasing losses, the increase in premium will only cover half of the increase in losses, thus decreasing profits by the double effect of smaller margins and smaller acceptance rate. Similarly, shifting the endpoint \(P^{max}_i\) of the acceptance function (for invariant \(\mu _i\)) also increases the optimal chargeable premium linearly with slope 1/2.
3.2 No differentiation
Next, let us consider the situation where the insurer has no possibility to distinguish between risk types on the individual level, but still has an estimate for the fractions \(\{p_L, p_H\}\) of the population in each class (e.g. through some historical figures). So we assume these numbers to be known (\(p_L+p_H=1\)). In this scenario, the insurer proposes an identical premium P to every individual. This has the advantage that one saves the cost of identification of risk types, and provides another benchmark for the sequel. The profit in this case is
where \(I_j^L\) and \(I_j^H\) are Bernoulli random variables with probabilities \(f_L(P)\) and \(f_H(P)\) respectively, and the optimal premium then amounts to
Comparing the resulting optimization problem
with (7), we see from \(P_L\ne P_H\) (which itself is due to \(\mu _H>\mu _L\)) there that the optimal solution P to (12) will now yield a smaller profit (this is intuitive, since we have less information available than in the setup of Sect. 3.1).
Theorem 3.3
In the no differentiation case, the expected profit of the insurer is maximized by the premium choice
where \(P_L\) and \(P_H\) are the optimal premiums of the full information case given in (5) and (6) and \(a^*=\frac{{p_L}/{P^{max}_L}}{{p_L}/{P^{max}_L}+{p_H}/{P^{max}_H}}\).
Proof
Problem (12) can be solved using the first order condition
Using \(z<\min \{P^{max}_L,P^{max}_H\}\), plugging in the function f yields
which leads to
and finally (13).
The second order condition yields a strictly negative result, thus confirming the global maximum. \(\square \)
Expression (13) shows that P is the average of the optimal premiums under full information, weighted by the proportions in the population and the maximum affordable premiums. Under the assumption (10), this also establishes
Remark 3.4
One should be careful to check whether \(P>P^{max}_L\): in that case, L type customers do not enter the contract. This happens if \(a^*<\frac{P_H-P^{max}_L}{P_H-P_L}\). Consequently, the optimal premium is that for higher risk types only, meaning \(P=P_H\). If the expected profit for \(P=P_H\) is greater than the one found above, then the optimal premium will be \(P_H\) and only H types will enter the contract. \(\square \)
The change of the expected profit when compared to the case of full information can now also be expressed as
In particular, for low risk types the proposed premium P is higher than their appropriate optimal premium \(P_L\) under full information. Thus, with the decreasing shape of the acceptance function, on average the insurer loses low-risk type customers and the associated expected profit (negative first term in (14)). At the same time, those who remain bring higher profits (the second term in (14)). Correspondingly, due to the cheaper than appropriate premium \(P_H\), more high-risk type customers join (positive third term in (14)), but they pay less premium now (negative fourth term).
3.3 Differentiation in two classes
Assume now that the insurer does not know the individuals’ risk type, but has access to a mechanism that can assign (classify) the risk types correctly with a certain probability. Assume that the probability of misclassification is the same for each policyholder of the same type and given by
Remark 3.5
If \(p_{H \mid L}=p_{L \mid H}=0\), we get back to the full information setting, as there is no classification error. If \(p_{H \mid L}=p_{L \mid H}=1\), all the true H end up in the L group and all the true L are classified in the H group (which would also result in knowing the true type of each one, but having to switch the categories). In the cases when \(p_{H \mid L}=1\) and \(p_{L \mid H}=0\) or \(p_{H \mid L}=0\) and \(p_{L \mid H}=1\), all individuals are classified in the same group. Typically, there is a tradeoff between the two error types: in an attempt to classify one risk more accurately, the precision on the other one will go down. For instance, in order to minimize \(p_{L\mid H}\), we could simply attribute all observations to group H, which indeed gives \(p_{L\mid H}=0\), but \(p_{H\mid L}\) would increase drastically as all L observations are then erroneously identified as H.
The cost c(n) of applying the classification algorithm will increase with population size n (the computational cost of different algorithms is increasing in the sample size (take for instance the simplest Bayesian classifier (Zheng & Webb, 2005) with linear complexity), the human time invested in analysing data and making decisions increases, and more powerful machines may be needed to run the algorithms, just to name a few reasons). At the same time, the marginal cost is likely to decrease in n (fixed costs in the process can be divided onto more policyholders, the insurer gains experience and recognizes patterns etc.). Hence, we define
A mathematically simple candidate for such a function is
where \(\gamma \) offsets for the minimal cost amount and \(c_0\) scales for the intensity of the effect of the population size.
The insurer will propose premiums, \(P^*_L\) and \(P^*_H\), different from the ones in Sect. 3.1 under full information, and some customers receive ’wrong’ offers, leading to a different customer behaviour with respect to accepting the contract. Figure 2 visualizes the pricing process. An initial population of n customers is subdivided into groups by their true risk type, rather than their identified risk type, and finally the insurer loses some customers because of the entailed acceptance patterns of policies.
In this situation, the profit is given by
where \(I_j^{L\mid L}\), \(I_j^{H\mid L}\), \(I_j^{H\mid H}\) and \(I_j^{L\mid H}\) are Bernoulli random variables with parameters \(f_L(P^*_L)\), \(f_L(P^*_H)\), \(f_H(P^*_H)\) and \(f_H(P^*_L)\) respectively. The optimization procedure now amounts to
Theorem 3.6
In the differentiation case, the expected profit of the insurer is maximized by the premium choice
and
for the two classified risk classes, where \(P_L\) and \(P_H\) are the optimal premiums (5) and (6) of the full information case, \(b^*=\frac{{p_L(1-p_{H\mid L})}/{P^{max}_L}}{{p_L(1-p_{H\mid L})}/{P^{max}_L}+{p_Hp_{L\mid H}}/{P^{max}_H}}\) and \(c^*=\frac{{p_Lp_{H\mid L}}/{P^{max}_L}}{{p_H(1-p_{L\mid H})}/{P^{max}_H}+{p_Lp_{H\mid L}}/{P^{max}_L}}\).
Proof
We make use of the following first order conditions from (15) to determine the optimal solution:
Equation (18) yields
leading to (16). Formula (17) is obtained in a completely analogous way from (19).
The second order conditions are strictly negative and thus confirm the maximum. \(\square \)
Like in the no differentiation case, the optimal premiums can again be expressed simply as a weighted average of the optimal premiums from the full information case, and the weights now involve the error probabilities.
Remark 3.7
One needs to verify the limiting case of \(P^*_H=P^{max}_L\) and \(P^*_L=P^{max}_L\) to obtain the true maximum, since misclassified L individuals may not enter the contract after the limiting premium. This is the case when \(c^*<\frac{P_H-P^{max}_L}{P_H-P_L}\) and \(b^*<\frac{P_H-P^{max}_L}{P_H-P_L}\). One can distinguish three cases. Firstly, if both \(P^*_H<P^{max}_L\) and \(P^*_L<P^{max}_L\), then the optimal solutions are given by Equations (16) and (17). Secondly, if only \(P^*_H>P^{max}_L\), then the correct premium for the proposed H contract should be \(P^*_H=P_H\), since we correctly price for only H types entering the group. H types will always enter the contract since their premium is a weighted average of \(P_L\) and \(P_H\), and both are smaller than \(P^{max}_H\) from Sect. 3.1. Thirdly, if both \(P^*_H>P^{max}_L\) and \(P^*_L>P^{max}_L\), which could happen with a high proportion of misclassified H individuals, then the optimal solution would be to offer the contract only to H types by setting \(P^*_L=P_H\) and \(P^*_H=P_H\). It is worthwhile to notice that the second-order mixed partial derivatives \(\frac{\partial ^2 \mathbb {E}\left( \Pi \right) }{\partial v\partial w}=\frac{\partial ^2 \mathbb {E}\left( \Pi \right) }{\partial w\partial v}=0\), and therefore the optimal price for the low risk types does not depend on the optimal price for the high risk types and vice versa. \(\square \)
What is of particular interest is the situation where H individuals are wrongly classified as L. Indeed, since \(P_i>\mu _i\), this is the only situation where the insurer makes losses, so it is important to maintain control over this group. The loss (presented here as a negative gain) compared to the benchmark of the situation of full information can be decomposed into
Recall that P was the optimal uniform premium for the case without differentiation. Differentiation of risk types only makes sense, if the resulting premiums \(P_L^*,P_H^*\) satisfy \(P^*_L\le P \le P^*_H\) (cf. Fig. 3).
From Eqs. (13), (16) and (17), this amounts to the condition \(c^*\le a^*\le b^*\) which can easily translated to the following condition on the error probabilities:
This will always be fulfilled in practically relevant situations.
4 A mean-variance analysis
Proposing a unique premium P to both categories of risks attracts a higher relative proportion of H than the differentiating strategies. This heterogeneity in the portfolio composition generates a higher level of risk, which one should consider in the underwriting process. As the expected profit considered in Sect. 3 does not capture this aspect of the problem, we introduce the variance of the profit as a simple indicator that can be easily implemented in practical settings, as one only needs estimates for the first two moments of the underlying claim distributions for the analysis.
Define by \(N_L,N_H\) the (random) number of insured persons of risk type L and H, respectively, entering the contract. Their first two moments are summarized in Table 1.
For each risk type i, \(N_i=\sum _{j=1}^{n}I^i_j\), where \(I^i_j\) are independent Bernoulli random variables with probability \(f_i(P_i)\), so \( \mathbb {E}\left( N_i\right) =nf_i(P_i)\) and variance \( \mathbb {V}\text {ar} \left( N_i\right) =n f_i(P_i)(1-f_i(P_i))\). The claim sizes are \(X_j^{(i)}\) and the premium is \(P_i\). From (4), we then get
With this ingredient, we can now derive the variance of the profit in our three scenarios introduced in the previous section.
-
Full information:
$$\begin{aligned} \mathbb {V}\text {ar} \left( \Pi \right)= & {} \mathbb {E}\left( N_L\right) \sigma _L^2+(P_L-\mu _L)^2 \mathbb {V}\text {ar} \left( N_L\right) \\{} & {} + \mathbb {E}\left( N_H\right) \sigma _H^2+(P_H-\mu _H)^2 \mathbb {V}\text {ar} \left( N_H\right) . \end{aligned}$$ -
No differentiation:
$$\begin{aligned} \mathbb {V}\text {ar} \left( \Pi \right)= & {} \mathbb {E}\left( N_L\right) \sigma _L^2+(P-\mu _L)^2 \mathbb {V}\text {ar} \left( N_L\right) \\{} & {} + \mathbb {E}\left( N_H\right) \sigma _H^2+(P-\mu _H)^2 \mathbb {V}\text {ar} \left( N_H\right) . \end{aligned}$$ -
Differentiation:
$$\begin{aligned} \begin{aligned} \mathbb {V}\text {ar} \left( \Pi \right) =&np_L(1-p_{H\mid L})f_L(P^*_L)\sigma _L^2+(P^*_L-\mu _L)^2\\&np_L(1-p_{H\mid L})f_L(P^*_L)(1-f_L(P^*_L))\\&+np_Lp_{H\mid L}f_L(P^*_H)\sigma _L^2+(P^*_H-\mu _L)^2\\&np_Lp_{H\mid L}f_L(P^*_H)(1-f_L(P^*_H))\\&+np_H(1-p_{L\mid H})f_H(P^*_H)\sigma _H^2+(P^*_H-\mu _H)^2 \\&np_H(1-p_{L\mid H})f_H(P^*_H)(1-f_H(P^*_H))\\&+np_Hp_{L\mid H}f_H(P^*_L)\sigma _H^2+(P^*_H-\mu _L)^2\\&np_Hp_{L\mid H}f_H(P^*_L)(1-f_H(P^*_L)). \end{aligned} \end{aligned}$$
Remark 4.1
Note that in all of the above expressions a term containing the variance due to the randomness of claim sizes is followed by one with the variance due to the randomness of underwriting, i.e. the customer’s probability to enter the contract or not. In Sect. 5, we will illustrate this decomposition with the help of a numerical example.
For both the insurance company itself and the regulator it will be natural to also include a risk constraint into the problem of optimizing profits. Consequently, we introduce a variance constraint in the optimization problem, modifying the problem from Sect. 3 to
Varying the value of \({\bar{\sigma }}\) will lead to a mean-variance efficient frontier in the spirit of Markowitz (1952). Introduce the Lagrange multipliers
for the premium \(P_i\) in any of the optimization programs (7), (11) and (15). The optimal premiums are then obtained by the first order conditions
In order to construct the efficient frontier, we maximize the expected profit subject to the constraint of the variance being smaller than a certain level (\({{{\bar{\sigma }}}}^2\)). We use the Lagrange multiplier method in order to perform the optimization under this constraint. Thus, we obtain one point of the frontier defined by the coordinated \(\mu \) and \({{{\bar{\sigma }}}}^2\). To obtain more points and draw the frontier, we augment the \({{{\bar{\sigma }}}}^2\) and redo the analysis each time. We give here the corresponding equations for the full information case, the other cases follow in an analogous way. Equation (22) translates into
The first order conditions are given by
This results in a system of three equations for the three unknowns \(P_L,P_H\) and \(\lambda \) which can be solved numerically for every choice of involved parameters.
5 Numerical illustrations
Let us now consider concrete numerical illustrations of the results of the previous sections. The following parametrization will be used throughout this section unless otherwise stated:
For the shape of the cost function we assume \(c(n)=20 \log n\) in the plots, but note that any other choice would be feasible as well.
5.1 Expected profit
Let \(f_i(P)\) have the form (3) with \(P^{max}_L=4\mu _L=4\) and \(P^{max}_H=4\mu _H=20\). Then we get from the respective formulas of Sect. 3:
-
Full information:
$$\begin{aligned}P_L= 2.5,\quad P_H= 12.5,\quad \mathbb {E}\left( \Pi \right) =0.788\,n,\quad f_L(P_L)=f_H(P_H)=0.375.\end{aligned}$$ -
No differentiation:
$$\begin{aligned}P\approx 2.717,\quad \mathbb {E}\left( \Pi \right) \approx 0.298\,n,\quad f_L(P)\approx 0.321,\quad f_H(P)\approx 0.864.\end{aligned}$$Note that \(\mu _L<P<\mu _H\). In this case, the insurer targets the low-risk L type customers because their proportion in the population is large enough to compensate for the losses on the H types.
-
Differentiation in two classes:
$$\begin{aligned} \begin{aligned}&P^*_L\approx 2.525,\quad P^*_H= 12.5,\quad \mathbb {E}\left( \Pi \right) \approx 0.687\,n-c(n),\\&f_L(P^*_L)\approx 0.369,\quad f_H(P^*_H)\approx 0.375,\quad f_L(P^*_H)= 0,\quad f_H(P^*_L)\approx 0.874. \end{aligned} \end{aligned}$$Note that \(P^*_H>P^{max}_L\).
Applying the classification is hence only an advantage if
Conversely, the maximum cost which the insurer will be willing to pay for the classification, given a population of size n, is
It is instructive to look into the sensitivity of the results. Let us first explore the variability of the profit under different error probabilities, which can be a helpful decision tool in case of limited investment resources. For each level of error probabilities, we recompute the optimal premiums. Figure 4 features the sensitivity of the expected profit with respect to both error probabilities. The classification cost still needs to be deducted here from the expected profit. If the insurer is given a choice of different classification algorithms or investment possibilities for improvement of precision with known resulting error probabilities, one can verify whether that investment is worthwhile.
This figure may help the decision makers to judge whether with given error probabilities, a refinement in the classification may be of added value to the company.
5.2 Variance
Numerically, with the parameters defined in (23), we obtain the following results for the three cases of the linear acceptance function:
-
Full information:
$$\begin{aligned} \mathbb {V}\text {ar} \left( \Pi \right) = 2.505469n. \end{aligned}$$ -
No differentiation:
$$\begin{aligned} \mathbb {V}\text {ar} \left( \Pi \right) =1.79213n. \end{aligned}$$ -
Differentiation:
$$\begin{aligned} \mathbb {V}\text {ar} \left( \Pi \right) =2.355267n. \end{aligned}$$
As the total variance is an increasing function of the number of policyholders, it will naturally be higher under a differentiation strategy, as the insurer gets more market share. But the structure of the variance will be different. Without differentiation, the variance inside the group is much higher than the average of internal group variances from the differentiation case, that difference being larger when the two distributions are further apart.
In Fig. 5, we plot the variances in the three scenarios to illustrate their forms as a function of chosen premium. We split the variances according to the part stemming from the variability of claims (in red) and from the one of acceptance of contracts (in green). The humps indicate the region where the increase of variance due to the increasing deviation from the mean is compensated by the decrease in the number of underwritten policies. In Fig. 5b, c, we can observe two humps, appearing because of the mixture of two risk types. With the help of this decomposition, we can clearly see that the humps in the plots come from the acceptance behaviour. In Fig. 5a, we observe that if the premium becomes too high, the total variance decreases as the population does not enter the contract any more.
In Figs. 6, 7 and 8, we show the variances when varying one parameter at a time. In Fig. 6, one can observe the variance shapes for a small range acceptance function. In this case, the variance is mostly defined by the claims behaviour, since the acceptance rate remains low and the humps are less pronounced. For an acceptance function ranging up to high premiums as shown in Fig. 7, the acceptance variance dominates. The humps are more pronounced as policyholders exist in a broader range. Finally, with a bigger expected claim size difference between risk types, the relationship between the humps and the risk types becomes clearer as they are further apart, see Fig. 8.
5.3 Mean-variance efficient frontier
To complete the numerical part, we now address the illustration of the mean-variance frontier as defined in Sect. 4 for a population size of \(n=10,000\). We see in Fig. 9 that up to a certain variance level, the non-differentiation strategy dominates differentiation in terms of expected profit. This breaking point depends on the cost function c(n) and the error probabilities. One may also want to consider limiting constraints in practice such as regulatory constraints or the demands of stakeholders. The kinks in the frontier arise from the fact that for different variance limitations, a different portfolio composition becomes optimal. In other words, the optimal strategy switches in the points of the kinks by letting more of a lower or higher risk type entering the contract.
The mean-variance approach assumes variations to both sides as equally weighted since the variance is a symmetric risk measure. This framework can be extended to other risk measures, such as the lower semi-variance to take in account only one-sided deviations from the mean or the value-at-risk to consider minimal profit requirements. These adaptations are developed in Sect. 6, where we also present an alternative approach for the risk assessment based on utility functions.
6 Extensions
6.1 A sigmoid-type acceptance function
While the piece-wise linear acceptance functions used in this paper allow for intuitive and transparent results, one may want to challenge this simplistic assumption. In this section we would like to extend the previous analysis to a possibly more realistic shape that still allows for an explicit treatment. Concretely, assume that f belongs to the class of sigmoid functions, namely the logistic functions, which are smooth and monotone, thus suitable for our situation (Kyurkchiev & Markov, 2015). This form of function appears when applying a logit lapsing model with different risk factors, see e.g. Dutang (2012) and Guillén et al. (2003). An example of a model using premiums as risk factors can be found in Brockett et al. (2008) and Guillen et al. (2011) and particularly in Dutang et al. (2013). Consider the following concrete shape of the acceptance function \(f_i\) of an individual of risk type i:
where the parameters \(a_i\) and \(b_i\) need to be calibrated. We can suppose \(b_i>\mu _i\), so that the function reaches value 1/2 for premiums that are higher than the actuarially fair premium, cf. Fig. 10. As a grows, the curve becomes steeper around the pivotal position determined by the parameter b (note that the choice of b also determines the value of f for \(P=0\) which will typically be smaller than 1). From an analytical point of view, the form (24) is more attractive than the piece-wise linear shape considered in the previous sections, as it is differentiable everywhere.
Clearly, \(f_i\) is strictly decreasing in P:
Define further the price elasticity of a risk type as the change of the number of customers entering the contract with respect to the price variation:
This measure illustrates the reactivity of the portfolio size to the variation of premium, cf. for instance (Varian, 2014, Ch.15).
6.1.1 Theoretical results
We first derive the analogous results to the ones in Sects. 3, 4, under the sigmoid acceptance function. The full information case still leads to an explicit formula:
Theorem 6.1
In the full information case, the expected profit of the insurer is maximized for the premium choice
and
where W(z) denotes the (principal branch of the) Lambert W function, which is the inverse function of \(g(x)=xe^x\) (cf. Corless et al. (1996)).
Proof
The optimal premium choice is the solution of the optimization problem
We can characterize the maxima by the equations
From (27) we have
leading to (25). Equation (26) for the high-risk individuals is then obtained in a completely analogous way.
To see that the extremal point is indeed a local maximum, one needs to verify
Using \(\{P_L>\mu _L,\ P_H>\mu _H\}\) and
we can rearrange the previous condition as
Since \(-2f'_L(x)\) is always positive, \(f''_L(x)<0\) for all \(x<b_L\) and \(x=P_L>\mu _L\). The same conclusion holds for the second derivative w.r.t. y. \(\square \)
Remark 6.2
Note that we can provide a necessary condition for \(P_L\) to be smaller than \(\mu _H\):
This condition is of interest for analysing the case when the low-risk type premium yields losses in absolute terms if sold to a high-risk type. Also, one can easily obtain sensitivities of the premium with respect to the parameters by means of first derivatives.
Concerning the sign of the last term, under \(b_i>\mu _i\) the first term is positive and the second is negative. The overall difference is negative for small values of \(a_i\), but positive for larger \(a_i\), that effect manifesting itself sooner if the difference \(b_i-\mu _i\) is larger.
In case of no differentiation, we proceed as before by taking first order conditions of the expected profit defined above in Equation (11):
Plugging in the sigmoid function f yields
which can be solved numerically. We establish that \(P_L\le P\le P_H\), following the assumption in (29).
For the differentiation case, we have the problem defined in Eq. (15) to solve. Once again, as the first order conditions are symmetric, we will detail only one of them.
Similarly, we also get
These conditions characterize the optimum, which is then solved numerically.
6.1.2 Numerical illustrations
Let us look into the case of a sigmoid acceptance function (24) with parameters
All other parameters remaining identical to those from Sect. 5, we obtain the following results:
-
Full information:
$$\begin{aligned}&P_L\approx 2.567203,\\&P_H\approx 8.926367,\\&\mathbb {E}\left( \Pi \right) \approx 0.8030561n,\\&f_L(P_L)\approx 0.362 ,\quad f_H(P_H)\approx 0.745. \end{aligned}$$ -
No differentiation:
$$\begin{aligned}&P\approx 8.836827,\\&\mathbb {E}\left( \Pi \right) \approx 0.2998947n,\\&f_L(P)\approx 0.001,\quad f_H(P)\approx 0.762. \end{aligned}$$Note that \(P>\mu _H>\mu _L\). In this case, the insurer targets the high risk type audience because even if its size is smaller, with this acceptance function form he can make higher margins on them, thus compensating their smaller size.
-
Differentiation:
$$\begin{aligned}&P^*_L\approx 2.601853,\\&P^*_H\approx 8.91731,\\&\mathbb {E}\left( \Pi \right) \approx 0.6993114n-c(n),\\&f_L(P^*_L)\approx 0.354,\quad f_H(P^*_H)\approx 0.747,\quad&f_L(P^*_H)\approx 0.0009,\quad f_H(P^*_L)\approx 0.999. \end{aligned}$$Note that \(\mu _L<P^*_L<\mu _H<P^*_H\).
Consequently, differentiation is only preferable here if the population size satisfies
in our case \(n\ge 282.617\). Conversely, the maximum cost the insurer is willing to pay given a population size is given by:
For the variances, the results are as follows:
-
Full information:
$$\begin{aligned} \mathbb {V}\text {ar} \left( \Pi \right) = 1.874096n. \end{aligned}$$ -
No differentiation:
$$\begin{aligned} \mathbb {V}\text {ar} \left( \Pi \right) =1.089133n. \end{aligned}$$ -
Differentiation:
$$\begin{aligned} \mathbb {V}\text {ar} \left( \Pi \right) =1.800876n. \end{aligned}$$
Figure 11 depicts the form of the variance as function of the proposed premiums in the different scenarios and its decomposition into the two parts (the variance arising from the acceptance function and the one from the claim size variability), showing again a hump pattern. Figures 12, 13 and 14 illustrate the sensitivity of the variances of the profit with and without differentiation, when varying one of the parameters. In Fig. 12, we observe that under small price elasticity, with more customers entering the contract, the hump behaviour disappears, since at the limit there is no gap in different risk types behaviour. The total variance is now mostly due to the underwriting process via the acceptance function, and claim size variance has little effect on the total variance. In contrast, a high price elasticity pushes different risk types to stabilize around their pivotal point of their respective acceptance function, accepting contracts only below this point, see Fig. 13. All the variance of the profit can then be explained by the claim size variance. Finally, Fig. 15 gives the mean-variance frontier in case of this sigmoid acceptance function.
We observe that the no-differentiation strategy changes depending on the form of the acceptance function used in the analysis. This can be particularly relevant in the case when a company conducts a study using a simplified linear form instead of a more realistic logistic approximation.
6.1.3 Sensitivities
Let us now investigate the sensitivity of the expected profit w.r.t. to each parameter. The change in expected profits will allow to compute the variation in the maximal cost for which the differentiation policy is still advantageous.
Parameter a: In the sigmoid curve of the acceptance function f, a represents the steepness, giving the speed at which the function changes around its central point (see Fig. 10). We choose \(a_L=a_H=a\), so we will vary it as a unique parameter. We observe in Fig. 16 that for higher levels of a, the steepness of the twist increases, meaning that the values of the acceptance function grow closer to the points \(b_L\) and \(b_H\). Thus, the price can be set closer to the twisting point, allowing a higher proportion of individuals to enter the contract. As in our initial parametrization \(b_i>\mu _i\), we gain strictly positive profit when pricing around \(b_i\). In the case of no differentiation, we cannot entirely benefit from this feature, as one of our types twisting point will end up far from the unique price P. Therefore, the maximum cost the insurer is willing to invest into the classification method is increasing in the parameter a. We can determine the limit:
which in our case gives 0.63n.
Parameters \(b_L\) and \(b_H\): Now we simultaneously vary the parameters \(b_H\) and \(b_L\) (the central points of the acceptance functions), see Fig. 17. Naturally, the higher \(b_i\), the higher will be the overall profit, as customers accept premiums until higher thresholds. Therefore, it becomes more and more attractive to differentiate customers to actually get this profit. Conversely, if \(b_H\) grows ceteris paribus, the profit increase becomes smaller with differentiation as the proportion of high-risk types is too low to strongly influence the non-differentiation premium.
6.2 Other risk measures
We give a short comparative analysis for two other risk measures replacing the variance criterion (see (Pflug & Römisch, 2007, Ch.5) for a more extensive list of possible alternatives in the context of efficient frontier studies in decision making).
6.2.1 Lower semi-variance
The main drawback of a variance risk constraint is that positive deviations from the mean are also penalized. In Markovitz (1959), Markowitz suggests the concept of the lower semi-variance (LSV)
of the profit \(\Pi \) to account for asymmetry of positive and negative deviations from the profit target. In this case, analytical formulas are not feasible any more, but one can obtain similar results by Monte Carlo simulation, using 1000 simulation runs. For that purpose, rather than only specifying two moments, we need to take an assumption of the entire distribution of claim sizes. The left plot in Fig. 18 depicts the resulting efficient frontiers for the three scenarios for an assumption of Gamma distributions for the individual claim sizes with an additional atom at 0 with probability 0.25 (parameters consistent with their first two moments from (23)) and all other parameters chosen as in (23).
The right plot in Fig. 18 shows the results for H being log-normally distributed risks (and again matching the first two moments). For instance, the intersection between the no-differentiation and differentiation scenario takes place at a much higher threshold.
6.2.2 Value-at-risk
Let us now instead consider the Value-at-risk
for some level \(0<\alpha <1\). This measure is particularly focusing on the tail of the loss (negative profit), when using small values of \(\alpha \). As for the LSV, we depict Monte Carlo results for the case of Gamma-distributed H and Log-normal H (Fig. 19) risk types, where \(\alpha =0.025\). That is, the profit can be lower than the value of the abscissa in Fig. 19 only with probability \(\alpha =0.025\), so that the more left in the abscissa one gets, the more risk-averse the strategy is. One observes that high values of \(\text {VaR}_{0.025}(\Pi )\) can only be obtained by the no-differentiation case. In regions where that VaR-value can be attained by all strategies, the differentiation strategy always dominates the one without differentiation. Note that for this level of \(\alpha \), one virtually does not observe any difference between the case of light-tailed and heavy-tailed losses, which is also due to the size of the portfolio.
6.3 Utility functions
Utility theory is a classical tool to combine risk and profitability of an insurance undertaking in one function (see e.g. Rothschild and Stiglitz (1978), Pflug and Römisch (2007)), so in this subsection we would like to briefly look at the problem posed in this paper from the utility point of view. Note that in this case the knowledge of the full loss distribution is needed, and not only the first two moments as in Sect. 4. Assume that the insurer bases decisions on a risk-averse (i.e., increasing and concave) utility function u(x). The insurer’s optimization problem is then modified as follows:
where the profit \(\Pi \) is given by the (4), which we can also write as
Firstly, the moment-generating function of each \(\Pi ^L_j\) is
Analogously, \(M_{\Pi _j^H}(t)=e^{tP_H}M_H(-t)\). By independence and classical collective risk theory calculations (cf. Kaas et al. (2008)), we can then determine the moment generating function of \(\Pi \):
The same reasoning applies to the non-differentiation case with setting \(P_L=P_H=P\). Finally, for differentiating pricing, an analogous derivation gives
In each of the cases, \(M_{\Pi }(t)\) can be inverted to obtain the c.d.f. \(F_{\Pi }(x)\) of the profit, and the expected utility is then given by \( \mathbb {E}\left( u(\Pi )\right) =\int _x u(\Pi (x)) dF_{\Pi }(x)\).
For a numerical illustration, assume now that \(L\sim \) Exp\((\alpha _L)\) and \(H\sim \Gamma (\alpha _H,\lambda _H)\). To be consistent with (23), we choose \(\alpha _L=\mu _L=1,\ \alpha _H=\mu _H^2 / \sigma _H^2,\ \lambda _H=\mu _H / \sigma _H^2\). Since an explicit calculation of \( \mathbb {E}\left( u(\Pi )\right) \) is not feasible, we add here numerical results from a Monte Carlo simulation, simulating its value for each choice of \(P_L, P_H\) (across a discrete grid of mesh size 0.05) using 1000 runs. For the sake of comparison, we use three popular utility functions:
-
linear utility \(u(x)=x\) (leading to simply the expected value of the profit);
-
exponential utility \(u(x)=-e^{-Ax}\) for some risk aversion coefficient \(A>0\);
-
quadratic utility \(u(x)=x-Bx^2,\ x\le \frac{1}{2B}\).
The results in Figs. 20, 21, 22 show the expected utility for each of the available premium combinations and each strategy for these three utility functions. Figure 20 serves as a reference point since it represents the simple expected profit as before. One observes that the optimal solution clearly depends on the chosen utility function. With the chosen parametrization of the exponential utility function, the difference in the expected utility between the differentiation and not differentiation case is less prominent than in the quadratic utility as the marginal utility of the quadratic function is greater in this region.
7 Conclusion
In this paper, we investigated the problem of risk categorization under the possibility of classification errors for an insurance company. We highlighted the impact of misspecification of risk classes on the company’s profit, which is a relevant topic due to the growing use of black box techniques in classification. Resulting pricing errors may lead to adverse selection via a modified acceptance behaviour of individuals to enter a contract, potentially leading to extra costs due to lost market shares and loss of premium inflow. In a simple model with two risk types and piece-wise linear acceptance function, we distinguished three pricing scenarios: full information, undifferentiated pricing and costly price differentiation under error assumptions. In this framework, we studied the optimal solution for simply maximizing expected profit and more generally within a mean-variance framework, establishing efficient frontiers for the premium choices. The cost of the risk categorization as a function of population size will then eventually determine the optimal choice of premiums, and to what extent risk classification is profitable.
The simplicity of the introduced model allowed to quantify the effects and consequences of misspecification on the insurer’s profit. Clearly, it will be of interest in future research to generalize the model assumptions in various directions. Beyond the extensions to more general acceptance functions and risk measures that we already address to a first extent in Sect. 6 of the paper, it will be of interest to extend the study to more than two risk categories. Another important direction will be to introduce market competition into this model (cf. Dutang et al. (2013), Mouminoux et al. (2021)), as well as the lapse behavior of policyholders between the different market players (see e.g. Barsotti et al. (2016), Milhaud and Dutang (2018)). Also, while our probabilistic acceptance model already covers a certain degree of randomness in the choice of insurance policies, it could be interesting to more explicitly include bounded rationality as well as other elements of inertia of policyholders and the markets in the modelling framework.
Data availability
No data are used in this paper.
References
Albrecher, H. (2016). Asymmetric information and insurance. In Cahiers de l’Institut Louis Bachelier, 20, 12–15.
Albrecher, H., Bommier, A., Filipović, D., Koch-Medina, P., Loisel, S., & Schmeiser, H. (2019). Insurance: Models, digitalization, and data science. European Actuarial Journal, 9, 349–360.
Albrecher, H., & Daily-Amir, D. (2017). On effects of asymmetric information on non-life insurance prices under competition. International Journal of Data Analysis Techniques and Strategies, 9(4), 287–299.
Antonio, K., & Valdez, E. A. (2012). Statistical concepts of a priori and a posteriori risk classification in insurance. AStA Advances in Statistical Analysis, 96, 187–224.
Barsotti, F., Milhaud, X., & Salhi, Y. (2016). Lapse risk in life insurance: Correlation and contagion effects among policyholders behaviors. Insurance: Mathematics and Economics, 71, 317–331.
Bauer, E., & Kohavi, R. (1999). An empirical comparison of voting classification algorithms: Bagging, boosting, and variants. Machine Learning, 36, 105–139.
Brockett, P. L., Golden, L. L., Guillen, M., Nielsen, J. P., Parner, J., & Perez-Marin, A. M. (2008). Survival analysis of a household portfolio of insurance policies: how much time do you have to stop total customer defection? Journal of Risk and Insurance, 75(3), 713–737.
Browne, M. J., & Kamiya, S. (2012). A theory of the demand for underwriting. Journal of Risk and Insurance, 79(2), 335–349.
Charpentier, A. (2023). Insurance, Biases. Discrimination and Fairness. Springer.
Corless, R. M., Gonnet, G. H., Hare, D. E., Jeffrey, D. J., & Knuth, D. E. (1996). On the lambert W function. Advances in Computational Mathematics, 5, 329–359.
Crocker, K. J., & Snow, A. (1986). The efficiency effects of categorical discrimination in the insurance industry. Journal of Political Economy, 94(2), 321–344.
Crocker, K. J., & Snow, A. (2013). The theory of risk classification. Handbook of Insurance, 27, 81–313.
Dionne, G., Fombaron, N., & Doherty, N. (2013). Adverse Selection in Insurance Contracting. Springer.
Dionne, G., & Rothschild, C. (2014). Economic effects of risk classification bans. The Geneva Risk and Insurance Review, 39, 184–221.
Dutang, C. (2012). The customer, the insurer and the market. Bulletin Français d’Actuariat.
Dutang, C., Albrecher, H., & Loisel, S. (2013). Competition among non-life insurers under solvency constraints: A game-theoretic approach. European Journal of Operational Research, 231(3), 702–711.
Finger, R.J. (2001). Chapter 6 Risk classification. In Foundations of Casualty Actuarial Science (pp. 287–341).
Gatzert, N., Schmitt-Hoermann, G., & Schmeiser, H. (2012). Optimal risk classification with an application to substandard annuities. North American Actuarial Journal, 16(4), 462–486.
Guillén, M., Parner, J., Densgsoe, C., & Perez-Marin, A.M. (2003). Using logistic regression models to predict and understand why customers leave an insurance company. In Intelligent and Other Computational Techniques in Insurance: Theory and Applications (pp. 465–490). World Scientific.
Guillen, M., Perez, A.M., Alcañiz, M. (2011). A logistic regression approach to estimating customer profit loss due to lapses in insurance. InDocument de Treball No. XREAP (p. 13).
Hastie, T., Tibshirani, R., Friedman, J. H., & Friedman, J. H. (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction (Vol. 2). Springer.
Hoy, M. (1982). Categorizing risks in the insurance industry. The Quarterly Journal of Economics, 97(2), 321–336.
Kaas, R., Goovaerts, M., Dhaene, J., & Denuit, M. (2008). Technical Report. Modern Actuarial Risk TheorySpringer.
Khalilia, M., Chakraborty, S., & Popescu, M. (2011). Predicting disease risks from highly imbalanced data using random forest. BMC Medical Informatics and Decision Making, 11, 1–13.
Kliger, D., & Levikson, B. (1998). Pricing insurance contracts an economic viewpoint. Insurance: Mathematics and Economics, 22(3), 243–249.
Kyurkchiev, N., & Markov, S. (2015). Sigmoid Functions: Some Approximation and Modelling Aspects (Vol. 4). LAP LAMBERT Academic Publishing.
Lindholm, M., Richman, R., Tsanakas, A., & Wüthrich, M. V. (2022). Discrimination-free insurance pricing. ASTIN Bulletin: The Journal of the IAA, 52(1), 55–89.
Markovitz, H. M. (1959). Portfolio Selection: Efficient Diversification of Investments. John Wiley.
Markowitz, H. (1952). Portfolio selection. Journal of Finance, 7(1), 77–91.
Milhaud, X., & Dutang, C. (2018). Lapse tables for lapse risk management in insurance: A competing risk approach. European Actuarial Journal, 8, 97–126.
Mouminoux, C., Dutang, C., Loisel, S., & Albrecher, H. (2021). On a Markovian game model for competitive insurance pricing. Methodology and Computing in Applied Probability, 24(2), 1061–1091.
Olbricht, W. (2012). Tree-based methods: A useful tool for life insurance. European Actuarial Journal, 2, 129–147.
Pflug, G. C., & Römisch, W. (2007). Modeling, Measuring and Managing Risk. World Scientific.
Porrini, D. (2015). Risk classification efficiency and the insurance market regulation. Risks, 3(4), 445–454.
Rothschild, M., Stiglitz, J. (1978). Equilibrium in competitive insurance markets: An essay on the economics of imperfect information. In Uncertainty in Economics (pp. 257–280). Elsevier.
Sass, J., & Seifried, F. T. (2014). Insurance markets and unisex tariffs: Is the European court of justice improving or destroying welfare? Scand. Actuarial J., 2014(3), 228–254.
Schmeiser, H., Störmer, T., & Wagner, J. (2014). Unisex insurance pricing: Consumers’ perception and market implications. The Geneva Papers on Risk and Insurance-Issues and Practice, 39, 322–350.
Shi, P., & Shi, K. (2023). Non-life insurance risk classification using categorical embedding. North American Actuarial Journal, 27(3), 579–601.
Thomas, R. G. (2007). Some novel perspectives on risk classification. The Geneva Papers on Risk and Insurance-Issues and Practice, 32, 105–132.
Tzougas, G., & Kutzkov, K. (2023). Enhancing logistic regression using neural networks for classification in actuarial learning. Algorithms, 16(2), 99.
Varian, H. R. (2014). Intermediate Microeconomics with Calculus: A Modern Approach. WW Norton & Company.
Vekeman, F., Piña-Garza, J. E., Cheng, W. Y., Tuttle, E., Giguère-Duval, P., Oganisian, A., Damron, J., Sheng Duh, M., Shen, V., Saurer, T. B., et al. (2019). Development of a classifier to identify patients with probable Lennox–Gastaut syndrome in health insurance claims databases via random forest methodology. Current Medical Research and Opinion, 35(8), 1415–1420.
Weidner, W., Transchel, F. W., & Weidner, R. (2016). Classification of scale-sensitive telematic observables for risk individual pricing. European Actuarial Journal, 6, 3–24.
Weiss, G.M., Provost, F. (2001). The effect of class distribution on classifier learning: An empirical study. In Technical Report, Rutgers University.
Zheng, F., & Webb, G.I. (2005). A comparative study of semi-naive Bayes methods in classification learning. InAUSDM05 (pp. 141–155).
Funding
Open access funding provided by University of Lausanne.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Finger, D., Albrecher, H. & Wilhelmy, L. On the cost of risk misspecification in insurance pricing. Jpn J Stat Data Sci (2024). https://doi.org/10.1007/s42081-024-00256-z
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42081-024-00256-z