1 Introduction

Risk measures are considered in various disciplines to assess and quantify risk. Just as a premium is assigned to an insurance contract with random losses after appraising its risk, a risk measure assigns a number to a random variable with stochastic outcomes.

This paper focuses on higher order risk measures, as these risk measures combine naturally with stochastic optimization problems and with ‘learning’ objectives, since they are themselves defined as the result of optimization problems. In addition, these risk measures relate to the risk quadrangle.

In a first main result, the paper derives explicit representations of higher order risk measures for general, elementary risk measures. These characterizations are then employed to characterize stochastic dominance relations, which are built on general norms. The second main result is a verification theorem: a characterization of higher order stochastic dominance relations which is numerically tractable.

For the norm in Lebesgue spaces, stochastic dominance relations have been considered, for example, in Dupačová and Kopa (2014), Kopa et al. (2016, 2023), Post and Kopa (2017) and Consigli et al. (2023), in portfolio optimization involving commodities (cf. Frydenberg et al. (2019)), and by Dentcheva and Martinez (2012) and Maggioni and Pflug (2016, 2019) in a multistage setting; a comparison of these methods is given in Gutjahr and Pichler (2013). The paper employs the characterizations obtained to establish such relations for general norms, illustrates these connections for expectiles (Bellini et al. 2016; Bellini and Caperdoni 2007), and adds a comparison with other risk measures.


Outline of the paper The following Sect. 2 recalls the mathematical framework for higher order risk measures. Section 3 addresses the higher order risk measure associated with spectral risk measures, as these constitute an elementary building block for general risk measures. This section develops the first main result, an explicit representation of a spectral risk measure’s higher order risk measure. As a special case, the subsequent Sect. 4 relates stochastic dominance and higher order risk measures. This section presents the second main result, which allows verifying a stochastic dominance relation based on only finitely many risk levels. The final Sect. 5 addresses the expectile and establishes the relations of the preceding sections for this specific risk measure. Section 6 concludes.

2 Mathematical framework

Higher order risk measures are a special instance of risk measures, often also termed risk functionals. To introduce and recall their main properties, we consider a space \({\mathcal {Y}}\) of \({\mathbb {R}}\)-valued random variables on a probability space with measure P, containing at least all bounded random variables, that is, \(L^\infty (P)\subseteq {\mathcal {Y}}\). A risk measure then satisfies the following axioms, originally introduced by Artzner et al. (1999).

Definition 2.1

(Risk functional) Let \({\mathcal {Y}}\) be a space of \({\mathbb {R}}\)-valued random variables on a probability space \((\Omega , \Sigma , P)\). A mapping \({\mathcal {R}}:{\mathcal {Y}}\rightarrow {\mathbb {R}}\) is

  1. (i)

    monotone, if \({\mathcal {R}}(X)\le {\mathcal {R}}(Y) \), provided that \(X\le Y\) almost everywhere;

  2. (ii)

    positively homogeneous if \({\mathcal {R}}(\lambda \,Y)=\lambda \,{\mathcal {R}}(Y)\) for all \(\lambda >0\);

  3. (iii)

    translation equivariant, if \({\mathcal {R}}(c+Y)=c+{\mathcal {R}}(Y)\) for all \(c\in {\mathbb {R}}\);

  4. (iv)

    subadditive, if \({\mathcal {R}}(X+Y)\le {\mathcal {R}}(X)+{\mathcal {R}}(Y)\) for all X and \(Y\in {\mathcal {Y}}\).

A mapping satisfying (i)–(iv) is called a risk functional, or a risk measure.

The risk quadrangle (cf. Rockafellar and Uryasev (2013)) relates risk measures with the measure of regret by

$$\begin{aligned} {\mathcal {R}}(Y)= \inf _{c\in {\mathbb {R}}}\ c+ {\mathcal {V}}(Y-c), \end{aligned}$$
(2.1)

where \({\mathcal {V}}\) is called the regret function. Equation (2.1) was first introduced for the conditional value-at-risk in Rockafellar and Uryasev (2000). For the expectation type function, i.e., \({\mathcal {V}}(X)={{\,\mathrm{{\mathbb {E}}}\,}}v(X)\), the relationship (2.1) is studied in Ben-Tal and Teboulle (2007), where the resulting functional is called the optimized certainty equivalent; Krokhmal (2007) also studies the relation (2.1).

It follows from relation (2.1) that \({\mathcal {R}}\)—if given as in (2.1)—is translation equivariant, i.e., \({\mathcal {R}}\) satisfies \({\mathcal {R}}(Y+c)= c+ {\mathcal {R}}(Y)\) for any \(c\in {\mathbb {R}}\) (cf. (iii) above). In an economic interpretation, the amount c in (2.1) corresponds to an amount of cash spent today, while the remaining quantity \(Y-c\) is invested and consumed later, thus subject to \({\mathcal {V}}\).

The risk functional \({\mathcal {R}}\) is positively homogeneous, if the regret function \({\mathcal {V}}\) is positively homogeneous. If \({\mathcal {V}}\) is not positively homogeneous, then one may consider the positively homogeneous envelope

$$\begin{aligned} {\mathcal {V}}_{ {\tilde{\beta }}}(Y)=\inf _{t>0}\ t\left( {\tilde{\beta }}+{\mathcal {V}}\left( {Y \over t}\right) \right) , \end{aligned}$$

where \({\tilde{\beta }} \ge 0\) is a risk aversion coefficient. The combined functional

$$\begin{aligned} {\mathcal {R}}_{{\tilde{\beta }}}(Y)&=\inf _{c\in {\mathbb {R}}}\ c+{\mathcal {V}}_{{\tilde{\beta }}}(Y-c)\nonumber \\&=\inf _{\begin{array}{c} t>0\\ q\in {\mathbb {R}} \end{array}}\ t\left( {\tilde{\beta }}+q+{\mathcal {V}}\left( \frac{Y}{t}-q\right) \right) \end{aligned}$$
(2.2)

is positively homogeneous and translation equivariant (cf. (ii) and (iii)). The \(\varphi \)-divergence risk measure is an explicit example of a risk measure defined exactly as in (2.2), cf. Dommel and Pichler (2021).

The paper suggests a regret function for a higher order risk measure starting from a given risk measure \({\mathcal {R}}\). To this end, consider a space \({\mathcal {Y}}\subset L^1(P)\) endowed with a norm \(\Vert \cdot \Vert \). We shall assume the norm to be monotone, that is, \(\Vert X\Vert \le \Vert Y\Vert \), provided that \(0\le X\le Y\) almost everywhere. We associate the following family of risk measures with a given norm.

Definition 2.2

(Higher order risk measure) Let \(\Vert \cdot \Vert \) be a monotone norm on \({\mathcal {Y}}\subset L^1(P)\) with \(\Vert {\mathbbm {1}}\Vert =1\), where \({\mathbbm {1}}\) denotes the random variable identically equal to one. The higher order risk measure at risk level \(\beta \in [0,1)\) associated with the norm \(\Vert \cdot \Vert \) is

$$\begin{aligned} {\mathcal {R}}_\beta ^{\Vert \cdot \Vert }(Y)=\inf _{t\in {\mathbb {R}}}\ t+\frac{1}{1-\beta }\Vert (Y-t)_+\Vert , \end{aligned}$$
(2.3)

where \(\beta \in [0,1)\) is the risk aversion coefficient and \(x_+{:}{=}\max (0,x)\).

We shall also omit the superscript and write \({\mathcal {R}}_\beta \) instead of \({\mathcal {R}}_\beta ^{\Vert \cdot \Vert }\) whenever the norm is unambiguous from the context. We first demonstrate that the higher order risk measure is well-defined for every \(\beta \in [0,1)\).
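For a finitely supported random variable, the infimum in (2.3) is a one-dimensional convex minimization and can be evaluated numerically. The following Python sketch is illustrative only and not part of the formal development; the helper names `lp_norm` and `higher_order_risk` are ours, assuming the \(L^p\)-norm under the uniform empirical measure. Since the objective is convex in t, a ternary search converges.

```python
def lp_norm(xs, p):
    # empirical L^p norm (E|X|^p)^(1/p) under the uniform empirical measure
    return (sum(abs(x) ** p for x in xs) / len(xs)) ** (1.0 / p)

def higher_order_risk(ys, beta, p, lo=-100.0, hi=100.0, iters=200):
    # evaluate (2.3): inf_t  t + ||(Y - t)_+||_p / (1 - beta);
    # the objective is convex in t, so ternary search converges
    def objective(t):
        return t + lp_norm([max(y - t, 0.0) for y in ys], p) / (1.0 - beta)
    for _ in range(iters):
        m1, m2 = lo + (hi - lo) / 3.0, hi - (hi - lo) / 3.0
        if objective(m1) <= objective(m2):
            hi = m2
        else:
            lo = m1
    return objective(0.5 * (lo + hi))

sample = [-1.0, 0.5, 1.0, 2.0, 4.0]
risk = higher_order_risk(sample, beta=0.5, p=2)
# the bounds (2.4) hold:  -||Y|| <= R_beta(Y) <= ||Y|| / (1 - beta)
```

One may also check numerically that the value increases with the risk aversion \(\beta \), in line with the definition.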

Proposition 2.3

Let \(({\mathcal {Y}},\Vert \cdot \Vert )\) be a normed space of random variables. For the functional \({\mathcal {R}}_\beta \) defined in (2.3) it holds that

$$\begin{aligned} -\Vert Y\Vert \le {\mathcal {R}}_\beta (Y)\le {1 \over 1-\beta } \Vert Y\Vert , \end{aligned}$$
(2.4)

so that \({\mathcal {R}}_\beta (\cdot )\) is indeed well-defined on \(({\mathcal {Y}},\,\Vert \cdot \Vert )\) for every \(\beta \in [0,1)\).

Proof

The upper bound follows trivially from the definition by choosing \(t=0\) in the defining equation (2.3).

For \(t\le 0\), it holds that \(-t=-Y+(Y-t)\le -Y+ (Y-t)_+\). It follows from the triangle inequality that \(-t\le \Vert Y\Vert +\Vert (Y-t)_+\Vert \) and thus

$$\begin{aligned} -\Vert Y\Vert \le t+\Vert (Y-t)_+\Vert \quad \text {for all }t\le 0. \end{aligned}$$

To establish the relation also for \(t\ge 0\), we observe the following monotonicity property of the objective in (2.3): for \(\Delta t\ge 0\), it follows from the reverse triangle inequality that

$$\begin{aligned} \Vert Y_+\Vert -\Vert (Y-\Delta t)_+ \Vert \le \Vert Y_+-(Y-\Delta t)_+\Vert \le \Vert \Delta t\,{\mathbbm {1}}\Vert = \Delta t, \end{aligned}$$

where we have used that \(0\le Y_+-(Y-\Delta t)_+\le \Delta t\) together with monotonicity of the norm. Replacing Y by \(Y-t\) in the latter expression gives

$$\begin{aligned} t +\Vert (Y-t)_+\Vert \le t+\Delta t+\Vert (Y-(t+\Delta t))_+\Vert ; \end{aligned}$$

that is, the function \(t\mapsto t+\Vert (Y-t)_+\Vert \) is non-decreasing, which finally establishes that

$$\begin{aligned} -\Vert Y\Vert \le t+\Vert (Y-t)_+\Vert \quad \text {for all }t\in {\mathbb {R}}. \end{aligned}$$

The lower bound in (2.4) thus follows from the latter inequality, as \({\mathcal {R}}_0(Y)\le {\mathcal {R}}_\beta (Y)\) for any \(\beta \in [0,1)\). \(\square \)

Example 2.4

For Lebesgue spaces \(L^p(P)\) and norm \(\Vert Y\Vert _p{:}{=}({{\,\mathrm{{\mathbb {E}}}\,}}|Y|^p)^{1/p}\), \(p\ge 1\), the higher order risk measure has been introduced in Krokhmal (2007) and studied in Dentcheva et al. (2010). For the norm \(\Vert \cdot \Vert _\infty \), the higher order risk measure is

$$\begin{aligned} {\mathcal {R}}_\beta ^{\Vert \cdot \Vert _\infty }(Y)= \mathop {\mathrm {ess\,sup}}\limits Y, \quad \beta > 0; \end{aligned}$$
(2.5)

indeed, it follows from (2.3) that

$$\begin{aligned} 0\in \left[ 1-\frac{1}{1-\beta }, 1\right] = \partial _t\left. \left( t+\frac{1}{1-\beta }\Vert (Y-t)_+\Vert _\infty \right) \right| _{t= \mathop {\mathrm {ess\,sup}}\limits Y}, \end{aligned}$$

the subgradient of the convex function in the latter expression at \(t=\mathop {\mathrm {ess\,sup}}\limits Y\). The infimum in (2.3) is attained at \(t= \mathop {\mathrm {ess\,sup}}\limits Y\), and thus (2.5).

Lemma 2.5

\({\mathcal {R}}_\beta (\cdot )\) is a risk functional, provided that the norm is monotone. Further, \({\mathcal {R}}_\beta \) is Lipschitz continuous with respect to the norm, with Lipschitz constant \({1 \over 1-\beta }\).

Proof

The assertions (ii)–(iv) in Definition 2.1 are straightforward to verify; to verify (i), it is indispensable to assume that the norm is monotone.

As for continuity, it follows from subadditivity together with (2.4) that \({\mathcal {R}}_\beta (Y)- {\mathcal {R}}_\beta (Z)\le {\mathcal {R}}_\beta (Y-Z)\le {1 \over 1-\beta } \Vert Y-Z\Vert \), and \(|{\mathcal {R}}_\beta (Y)-{\mathcal {R}}_\beta (Z)|\le {1 \over 1-\beta } \Vert Y-Z\Vert \) after interchanging the roles of Y and Z. Hence, the assertion. \(\square \)

Note that the higher order risk measure as defined in (2.3) is a risk functional built from a norm. Conversely, a risk functional \({\mathcal {R}}\) defines a norm via

$$\begin{aligned} \Vert Y\Vert {:}{=}{\mathcal {R}}(|Y|) \end{aligned}$$
(2.6)

and a Banach space with \({\mathcal {Y}}=\left\{ Y\in L^1:{\mathcal {R}}(|Y|)<\infty \right\} \) (cf. Pichler (2013)). Its natural dual norm for \(Z\in {\mathcal {Z}}{:}{=}{\mathcal {Y}}^*\) is

$$\begin{aligned} \Vert Z\Vert ^*&{:}{=}\sup \left\{ {{\,\mathrm{{\mathbb {E}}}\,}}YZ:\Vert Y\Vert \le 1\right\} \nonumber \\&=\sup \left\{ {{\,\mathrm{{\mathbb {E}}}\,}}YZ:{\mathcal {R}}(|Y|)\le 1\right\} . \end{aligned}$$
(2.7)

The following relationship allows defining a regret functional to connect a risk functional \({\mathcal {R}}\) with the higher-order risk quadrangle.

Proposition 2.6

(Duality) Let \({\mathcal {R}}\) be a risk functional with associated norm \(\Vert \cdot \Vert \) and dual norm \(\Vert \cdot \Vert ^*\). For the higher order risk functional it holds that

$$\begin{aligned} {\mathcal {R}}_\beta (Y)&=\sup \left\{ {{\,\mathrm{{\mathbb {E}}}\,}}YZ:Z\ge 0,\ {{\,\mathrm{{\mathbb {E}}}\,}}Z=1\text { and }\Vert Z\Vert ^*\le \frac{1}{1-\beta }\right\} \end{aligned}$$
(2.8)
$$\begin{aligned}&=\inf _{t\in {\mathbb {R}}}\ t+ \frac{1}{1-\beta }\Vert (Y-t)_+\Vert , \end{aligned}$$
(2.9)

where \(\beta \in [0,1)\).

Remark 2.7

By the interconnecting formula (2.1), the regret function corresponding to the higher order risk functional \({\mathcal {R}}_\beta ^{\Vert \cdot \Vert }\) associated with the norm \(\Vert \cdot \Vert \) is \({\mathcal {V}}_\beta ^{\Vert \cdot \Vert }(\cdot ){:}{=}{1 \over 1-\beta }\Vert (\cdot )_+\Vert \).

Proof

It holds by the Hahn–Banach theorem and as \((Y-t)_+\ge 0\) that

$$\begin{aligned}\frac{1}{1-\beta }\cdot \Vert (Y-t)_+\Vert = \sup _{\Vert Z\Vert ^*\le \frac{1}{1-\beta }} {{\,\mathrm{{\mathbb {E}}}\,}}Z(Y-t)_+ \ge \sup _{\begin{array}{c} {{\,\mathrm{{\mathbb {E}}}\,}}Z=1,\ Z\ge 0,\\ \Vert Z\Vert ^*\le \frac{1}{1-\beta } \end{array}} {{\,\mathrm{{\mathbb {E}}}\,}}Z(Y-t)_+. \end{aligned}$$

Together with \(t+(Y-t)_+\ge Y\), this establishes the inequality ‘\(\le \)’ of (2.8) compared to (2.9), as

$$\begin{aligned} t+\frac{1}{1-\beta }\cdot \Vert (Y-t)_+\Vert&\ge \sup _{\begin{array}{c} {{\,\mathrm{{\mathbb {E}}}\,}}Z=1\\ Z\ge 0,\ \Vert Z\Vert ^*\le \frac{1}{1-\beta } \end{array}} {{\,\mathrm{{\mathbb {E}}}\,}}\bigl (t+(Y-t)_+\bigr )Z \\&\ge \sup _{\begin{array}{c} {{\,\mathrm{{\mathbb {E}}}\,}}Z=1\\ Z\ge 0,\ \Vert Z\Vert ^*\le \frac{1}{1-\beta } \end{array}}{{\,\mathrm{{\mathbb {E}}}\,}}YZ. \end{aligned}$$

As for the converse inequality, assume first that Y is bounded. Note that

$$\begin{aligned} \inf _{t\in {\mathbb {R}}} t+{{\,\mathrm{{\mathbb {E}}}\,}}(Y-t)Z = {{\,\mathrm{{\mathbb {E}}}\,}}YZ+\inf _{t\in {\mathbb {R}}} t\cdot (1-{{\,\mathrm{{\mathbb {E}}}\,}}Z) = {\left\{ \begin{array}{ll} {{\,\mathrm{{\mathbb {E}}}\,}}YZ&{}\quad \text {if }{{\,\mathrm{{\mathbb {E}}}\,}}Z=1, \\ -\infty &{}\quad \text {else}, \end{array}\right. } \end{aligned}$$

so that it follows that

$$\begin{aligned} \sup _{\begin{array}{c} {{\,\mathrm{{\mathbb {E}}}\,}}Z=1\\ Z\ge 0,\ \Vert Z\Vert ^*\le \frac{1}{1-\beta } \end{array}} {{\,\mathrm{{\mathbb {E}}}\,}}YZ = \sup _{\begin{array}{c} Z\ge 0,\\ \Vert Z\Vert ^*\le \frac{1}{1-\beta } \end{array}} \inf _{t\in {\mathbb {R}}} t+{{\,\mathrm{{\mathbb {E}}}\,}}(Y-t) Z. \end{aligned}$$

Further, it holds that \({{\,\mathrm{{\mathbb {E}}}\,}}YZ= t^*+{{\,\mathrm{{\mathbb {E}}}\,}}Z(Y-t^*)_+\) for \(t^*\le Y\) a.s. and thus

$$\begin{aligned} \sup _{\begin{array}{c} {{\,\mathrm{{\mathbb {E}}}\,}}Z=1,Z\ge 0,\\ \ \Vert Z\Vert ^*\le \frac{1}{1-\beta } \end{array}} {{\,\mathrm{{\mathbb {E}}}\,}}YZ = \sup _{\begin{array}{c} Z\ge 0,\\ \Vert Z\Vert ^*\le \frac{1}{1-\beta } \end{array}} t^*+{{\,\mathrm{{\mathbb {E}}}\,}}Z(Y-t^*)_+ = t^* +\frac{1}{1-\beta } \Vert (Y-t^*)_+\Vert \ge \inf _{t\in {\mathbb {R}}} t+\frac{1}{1-\beta }\Vert (Y-t)_+\Vert , \end{aligned}$$

thus the desired converse inequality, provided that Y is bounded; if Y is not bounded, then there is a bounded \(Y_\varepsilon \) with \(Y\le Y_\varepsilon \) (\(\varepsilon >0\)) and \(\Vert Y_\varepsilon -Y\Vert < \varepsilon \), so that

$$\begin{aligned} {{\,\mathrm{{\mathbb {E}}}\,}}Z(Y_\varepsilon -t)_+-\varepsilon {{\,\mathrm{{\mathbb {E}}}\,}}Z\le {{\,\mathrm{{\mathbb {E}}}\,}}Z(Y-t)_+\le {{\,\mathrm{{\mathbb {E}}}\,}}Z(Y_\varepsilon - t)_+, \end{aligned}$$

so that we may conclude that (2.9) holds for every \(Y\in {\mathcal {Y}}\). \(\square \)

Example 2.8

(Lebesgue spaces) The dual norm of the genuine norm \(\Vert X\Vert _p{:}{=}({{\,\mathrm{{\mathbb {E}}}\,}}|X|^p)^{1/p}\) in the Lebesgue space \(L^p(P)\) is \(\Vert Z\Vert ^*=({{\,\mathrm{{\mathbb {E}}}\,}}|Z|^q)^{1/q}\) for the Hölder conjugate exponent q with \({1\over p}+ {1\over q}= 1\). With Proposition 2.6 it follows that

$$\begin{aligned} {\mathcal {R}}_\beta ^{\Vert \cdot \Vert _p}(Y)&=\inf _{t\in {\mathbb {R}}}\ t+{1\over 1-\beta }\Vert (Y-t)_+\Vert _p\\&=\sup \left\{ {{\,\mathrm{{\mathbb {E}}}\,}}YZ:\Vert Z\Vert _{q}\le {1\over 1-\beta },\ Z\ge 0\text { and }{{\,\mathrm{{\mathbb {E}}}\,}}Z=1\right\} , \end{aligned}$$

cf. also Pichler and Shapiro (2015) and Pichler (2017).

In what follows, we elaborate on the higher order risk measure and the associated regret function for particular risk measures, most notably the spectral risk measure.

3 Higher order spectral risk

By Kusuoka’s theorem (cf. Kusuoka (2001)), every law invariant risk functional can be assembled from elementary risk functionals, each involving the average value-at-risk.

This section first develops the explicit representation of the higher order risk measures associated with spectral risk measures. This explicit representation is then extended to general risk functionals.

Definition 3.1

(Spectral risk measures) The function \(\sigma :[0,1)\rightarrow {\mathbb {R}}\) is called a spectral function, if

  1. (i)

    \(\sigma (\cdot )\ge 0\),

  2. (ii)

    \(\int _0^1\sigma (u)\,\textrm{d}u=1\) and

  3. (iii)

    \(\sigma (\cdot )\) is non-decreasing.

The spectral risk measure with spectral function \(\sigma \) is

$$\begin{aligned} {\mathcal {R}}_\sigma (Y){:}{=}\int _0^1\sigma (u)F_Y^{-1}(u)\,\textrm{d}u, \end{aligned}$$

where

$$\begin{aligned}F_Y^{-1}(u){:}{=}{{\,\mathrm{{\mathsf {V@R}}}\,}}_u(Y){:}{=}\inf \left\{ x\in {\mathbb {R}}:P(Y\le x)\ge u\right\} \end{aligned}$$

is the value-at-risk, the generalized inverse or quantile function.
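For a sample of n equally likely outcomes, the quantile function \(F_Y^{-1}\) is a step function, and the spectral risk measure reduces to a weighted sum of order statistics. The following Python sketch illustrates this (the helper name `spectral_risk` is ours; the caller supplies \(\int _a^b\sigma (u)\,\textrm{d}u\) in closed form):

```python
def spectral_risk(ys, sigma_integral):
    # R_sigma(Y) = int_0^1 sigma(u) F^{-1}(u) du; for n equally likely outcomes
    # F^{-1} is constant on each cell [i/n, (i+1)/n), so the integral becomes a
    # weighted sum of the order statistics
    n = len(ys)
    return sum(y * sigma_integral(i / n, (i + 1) / n)
               for i, y in enumerate(sorted(ys)))

# sigma(u) = 2u is a spectral function; int_a^b 2u du = b^2 - a^2
risk = spectral_risk([4.0, 1.0, 3.0, 2.0], lambda a, b: b * b - a * a)
# risk = 1*(1/16) + 2*(3/16) + 3*(5/16) + 4*(7/16) = 3.125
```

For the constant spectral function \(\sigma \equiv 1\) the same routine returns the expectation.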

The higher order risk measure of a spectral risk measure is a spectral risk measure itself. The following theorem presents the corresponding spectral function explicitly and generalizes Pflug (2000). The result is central to the main characterizations presented in the next sections.

Theorem 3.2

(Higher order spectral risk) Let \(\beta \in [0,1)\) be a risk level. The higher order risk functional of the risk functional \({\mathcal {R}}_{\sigma }\) with spectral function \(\sigma (\cdot )\) has the representation

$$\begin{aligned} \inf _{t\in {\mathbb {R}}}\ t+{1\over 1-\beta } {\mathcal {R}}_\sigma \bigl ((Y-t)_+\bigr )= {\mathcal {R}}_{\sigma _\beta }(Y), \end{aligned}$$
(3.1)

where \(\sigma _\beta \) is the spectral function

$$\begin{aligned} \sigma _\beta (u){:}{=}{\left\{ \begin{array}{ll} 0 &{}\quad \text {if }u<u_\beta ,\\ {\sigma (u) \over 1-\beta } &{}\quad \text {else}; \end{array}\right. } \end{aligned}$$
(3.2)

here, \(u_\beta \in [0,1]\) is the \(\beta \)-quantile with respect to the density \(\sigma \), that is, the solution of

$$\begin{aligned} \int _0^{u_\beta }\sigma (u)\,\textrm{d}u=\beta , \end{aligned}$$
(3.3)

which is unique for \(\beta >0\).

Proof

We remark first that \(\sigma _\beta \) indeed is a spectral function, as \(\int _0^1 \sigma _\beta (u)\,\mathrm d u= \frac{1}{1-\beta }\int _{u_\beta }^1\sigma (u)\,\mathrm d u=\frac{1-\beta }{1-\beta }=1\) by the defining property (3.3) and (ii) in Definition 3.1. The quantile \(u_\beta \) is uniquely defined for \(\beta >0\), as the function \(\sigma \) is non-decreasing by (iii). In what follows we shall demonstrate that the infimum in (3.1) is attained at \(t^*{:}{=}F_Y^{-1}(u_\beta )\). Note first that

$$\begin{aligned} F_{(Y-t)_+}^{-1}(u)={\left\{ \begin{array}{ll} 0 &{}\quad \text {if }u<F_Y(t),\\ F_Y^{-1}(u)- t &{}\quad \text {else}, \end{array}\right. } \end{aligned}$$

so that

$$\begin{aligned} {\mathcal {R}}_\sigma \bigl ((Y-t)_+\bigr )=\int _0^1\sigma (u)F_{(Y-t)_+}^{-1}(u)\,\textrm{d}u=\int _{F_Y(t)}^1\sigma (u)\bigl (F_Y^{-1}(u)-t\bigr )\,\textrm{d}u \end{aligned}$$

and

$$\begin{aligned} ({\mathcal {R}}_\sigma )_\beta (Y)=\inf _{t\in {\mathbb {R}}}\ t+\frac{1}{1-\beta }\int _{F_Y(t)}^1\sigma (u)\bigl (F_Y^{-1}(u)-t\bigr )\,\textrm{d}u. \end{aligned}$$
(3.4)

Assume first that \(t^*\le t\). The inequality \(u\le F_Y(t)\) is equivalent to \(F_Y^{-1}(u)\le t\) (cf. van der Vaart (1998); this relation of the functions \(F_Y\) and \(F_Y^{-1}\) is occasionally called a Galois connection), and thus

$$\begin{aligned} \int _{F_Y(t^*)}^{F_Y(t)}\sigma (u)\bigl (F_Y^{-1}(u)-t\bigr )\,\textrm{d}u\le 0, \end{aligned}$$

or equivalently

$$\begin{aligned} \int _{F_Y(t^*)}^1 \sigma (u)\bigl (F_Y^{-1}(u)-t\bigr )\,\textrm{d}u\le \int _{F_Y(t)}^1\sigma (u)\bigl (F_Y^{-1}(u)-t\bigr )\,\textrm{d}u. \end{aligned}$$

Assume next that \(u_\beta \le F_Y(t^*)\), then \(\int _{F_Y(t^*)}^1\sigma (u)\,\mathrm d u\le 1-\beta \) so that

$$\begin{aligned}\frac{t-t^*}{1-\beta }\int _{F_Y(t^*)}^1\sigma (u)\,\textrm{d}u\le t-t^*. \end{aligned}$$

Combining the inequalities in the latter displays gives

$$\begin{aligned} t^*+\frac{1}{1-\beta }\int _{F_Y(t^*)}^1\sigma (u)\bigl (F_Y^{-1}(u)-t^*\bigr )\,\textrm{d}u\le t+\frac{1}{1-\beta }\int _{F_Y(t)}^1\sigma (u)\bigl (F_Y^{-1}(u)-t\bigr )\,\textrm{d}u \end{aligned}$$
(3.5)

and thus the assertion, provided that \(u_{\beta }\le F_{Y}(t^*)\) and \(t^*\le t\).

Conversely, assume that \(t\le t^*\). Then the inequality \(u\le F_Y(t^*)\) is equivalent to \(F_Y^{-1}(u)\le t^*\) and thus

$$\begin{aligned} \int _{F_Y(t)}^{F_Y(t^*)}\sigma (u)\bigl (F_Y^{-1}(u)-t^*\bigr )\,\textrm{d}u\le 0, \end{aligned}$$

which is equivalent to

$$\begin{aligned} \int _{F_Y(t)}^{1}\sigma (u)\bigl (F_Y^{-1}(u)-t^*\bigr )\,\textrm{d}u\le \int _{F_Y(t^*)}^1\sigma (u)\bigl (F_Y^{-1}(u)-t^*\bigr )\,\textrm{d}u. \end{aligned}$$

Assume further that \(F_Y(t^*)\le u_\beta \), then \(\int _{F_Y(t^*)}^1\sigma (u)\,\textrm{d}u\ge 1-\beta \) so that

$$\begin{aligned} t^*-t\le \frac{t^*-t}{1-\beta }\int _{F_Y(t^*)}^1\sigma (u)\,\textrm{d}u. \end{aligned}$$

Combining the latter inequalities gives

$$\begin{aligned} t^*+\frac{1}{1-\beta }\int _{F_Y(t)}^{1}\sigma (u)\bigl (F_Y^{-1}(u)-t^*\bigr )\,\textrm{d}u\le t+\frac{1}{1-\beta }\int _{F_Y(t^*)}^{1}\sigma (u)\bigl (F_Y^{-1}(u)-t\bigr )\,\textrm{d}u. \end{aligned}$$
(3.6)

It follows from (3.5) and (3.6) that \(t^*{:}{=}F_Y^{-1}(u_\beta )\) is optimal in (3.4). That is,

$$\begin{aligned} ({\mathcal {R}}_\sigma )_\beta (Y)&=t^{*}+\frac{1}{1-\beta }\int _{u_\beta }^{1}\sigma (u)\bigl (F_Y^{-1}(u)-t^*\bigr )\,\textrm{d}u\nonumber \\&=\frac{1}{1-\beta }\int _{u_\beta }^{1}\sigma (u)F_Y^{-1}(u)\,\textrm{d}u\nonumber \\&=\int _0^1\sigma _\beta (u)F_Y^{-1}(u)\,\textrm{d}u \end{aligned}$$
(3.7)

and thus the assertion. \(\square \)

The following statement expresses the higher order risk functional through the base value \(u_\beta \) and the random variable’s deviations to the right, involving the survival function instead of the inverse distribution function.

Corollary 3.3

The higher order spectral risk measure is

$$\begin{aligned} \bigl ({\mathcal {R}}_{\sigma }\bigr )_\beta (Y)={{\,\mathrm{{\mathsf {V@R}}}\,}}_{u_\beta }(Y)+\frac{1}{1-\beta }\int _{{{\,\mathrm{{\mathsf {V@R}}}\,}}_{u_\beta }(Y)}^\infty \Sigma \bigl (F_Y(y)\bigr )\textrm{d}y \end{aligned}$$
(3.8)

(with \(u_\beta \) as in (3.3)) or, provided that Y is bounded,

$$\begin{aligned} \bigl ({\mathcal {R}}_\sigma \bigr )_{\beta }(Y)&=\mathop {\mathrm {ess\,inf}}\limits Y+\int _{\mathop {\mathrm {ess\,inf}}\limits Y}^\infty \Sigma _\beta \bigl (F_Y(y)\bigr )\textrm{d}y, \end{aligned}$$
(3.9)

where

$$\begin{aligned} \Sigma _\beta (u){:}{=}\min \left( 1,\ \frac{1}{1-\beta }\int _u^{1}\sigma (p)\,\textrm{d}p\right) \end{aligned}$$

is the cumulative spectral function and \(\Sigma (u){:}{=}\Sigma _0(u)=\int _u^1\sigma (p)\,\textrm{d}p\).

Proof

Notice first that \(\Sigma _\beta (u)=1\) for \(u\le u_\beta \), where \(u_\beta \) is given in (3.3). By Theorem 3.2, Riemann–Stieltjes integration by parts and a change of variables, it holds that

$$\begin{aligned} \bigl ({\mathcal {R}}_\sigma \bigr )_\beta (Y)&={\mathcal {R}}_{\sigma _\beta }(Y) \nonumber \\&=\frac{1}{1-\beta }\int _{u_\beta }^1\sigma (u)F_Y^{-1}(u)\,\textrm{d}u \nonumber \\&=-\int _0^1F_Y^{-1}(u)\,\textrm{d}\Sigma _\beta (u) \end{aligned}$$
(3.10)
$$\begin{aligned}&=-\left. F_Y^{-1}(u)\Sigma _\beta (u)\right| _{u=0}^1+\int _0^1\Sigma _\beta (u)\,\textrm{d}F_Y^{-1}(u) \nonumber \\&=\mathop {\mathrm {ess\,inf}}\limits Y+\int _{\mathop {\mathrm {ess\,inf}}\limits Y}^\infty \Sigma _\beta \bigl (F_Y(y)\bigr )\,\textrm{d}y, \end{aligned}$$
(3.11)

where we have used that \(F_Y^{-1}(0)=\mathop {\mathrm {ess\,inf}}\limits Y\) and \(\Sigma _\beta (1)=0\) in (3.11). This gives (3.9).

The equation (3.8) results from keeping the lower integration limit \(u_\beta \) (instead of 0) in (3.10). That is,

$$\begin{aligned} \bigl ({\mathcal {R}}_\sigma \bigr )_\beta (Y)&=-\int _{u_\beta }^1F_Y^{-1}(u)\,\textrm{d}\Sigma _\beta (u)\\&=-\left. F_Y^{-1}(u)\Sigma _\beta (u)\right| _{u=u_\beta }^1+\int _{u_\beta }^1\Sigma _\beta (u)\,\textrm{d}F_Y^{-1}(u)\\&={{\,\mathrm{{\mathsf {V@R}}}\,}}_{u_\beta }(Y)+\int _{{{\,\mathrm{{\mathsf {V@R}}}\,}}_{u_\beta }(Y)}^\infty \Sigma _\beta \bigl (F_Y(y)\bigr )\,\textrm{d}y, \end{aligned}$$

which is assertion (3.8). \(\square \)
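As an illustrative numerical cross-check of the two representations (not part of the formal development), consider the spectral function \(\sigma (u)=2u\), for which \(\int _u^1\sigma (p)\,\textrm{d}p=1-u^2\) and \(u_\beta =\sqrt{\beta }\) by (3.3); the helper names below are ours, and the empirical distribution of a sample is used throughout.

```python
import math

def survival_form(ys, beta):
    # (3.9): ess inf Y + int Sigma_beta(F_Y(y)) dy; for n equally likely
    # outcomes, F_Y equals i/n on the gap between consecutive order statistics
    n, s = len(ys), sorted(ys)
    Sigma_beta = lambda u: min(1.0, (1.0 - u * u) / (1.0 - beta))
    return s[0] + sum(Sigma_beta((i + 1) / n) * (s[i + 1] - s[i])
                      for i in range(n - 1))

def quantile_form(ys, beta):
    # R_{sigma_beta}(Y) with sigma_beta from (3.2): zero below u_beta = sqrt(beta),
    # and sigma(u)/(1 - beta) above
    n, s = len(ys), sorted(ys)
    u_beta = math.sqrt(beta)
    def cell(a, b):  # int_a^b sigma_beta(u) du
        a = max(a, u_beta)
        return (b * b - a * a) / (1.0 - beta) if b > a else 0.0
    return sum(y * cell(i / n, (i + 1) / n) for i, y in enumerate(s))
```

On the sample [1, 2, 3, 4] with \(\beta ={1\over 4}\) both forms evaluate to \(43/12\).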

Corollary 3.4

The higher order spectral risk measure has the representation

$$\begin{aligned} {\mathcal {R}}_{\sigma _\beta }(Y)=\sup \{ {{\,\mathrm{{\mathbb {E}}}\,}}[Y\cdot \sigma _\beta (U)]:U\in [0,1]\text { is uniformly distributed} \}. \end{aligned}$$

Proof

Recall first that \(Y\sim F_Y^{-1}(U)\) for U uniformly distributed. By the rearrangement inequality, \({{\,\mathrm{{\mathbb {E}}}\,}}Y\sigma _\beta (U)\le {{\,\mathrm{{\mathbb {E}}}\,}}F_{Y}^{-1}(U)\sigma _{\beta }(U)\), because \(F_Y^{-1}(U)\) and \(\sigma _\beta (U)\) are comonotone and both, \(F_Y^{-1}(\cdot )\) and \(\sigma _\beta (\cdot )\) are non-decreasing functions. The assertion follows with (3.7). \(\square \)

The celebrated formula (cf. Pflug (2000), Rockafellar and Uryasev (2000), Ogryczak and Ruszczyński (2002))

$$\begin{aligned} {{\,\mathrm{{\mathsf {AV@R}}}\,}}_\alpha (Y)= \frac{1}{1-\alpha }\int _\alpha ^1{{\,\mathrm{{\mathsf {V@R}}}\,}}_u(Y)\,\textrm{d}u =\inf _{t\in {\mathbb {R}}}\ t+\frac{1}{1-\alpha }{{\,\mathrm{{\mathbb {E}}}\,}}(Y-t)_+ \end{aligned}$$

for the average value-at-risk is a special case of Theorem 3.2 for the spectral function \(\sigma (\cdot )= \frac{1}{1-\alpha }{\mathbbm {1}}_{[\alpha ,1]}(\cdot )\).
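For the empirical distribution of a sample, both sides of this formula are elementary to evaluate: the objective of the infimum is piecewise linear in t with kinks at the observations, so it suffices to scan the observations. A Python sketch (helper names ours, \(\alpha n\) assumed integer):

```python
def avar_inf(ys, alpha):
    # Rockafellar-Uryasev form: inf_t  t + E(Y - t)_+ / (1 - alpha);
    # the objective is piecewise linear in t, so the minimum is attained
    # at one of the observations
    n = len(ys)
    def objective(t):
        return t + sum(max(y - t, 0.0) for y in ys) / n / (1.0 - alpha)
    return min(objective(t) for t in ys)

def avar_tail(ys, alpha):
    # direct form (1 - alpha)^{-1} int_alpha^1 V@R_u(Y) du: the mean of the
    # worst (1 - alpha)-fraction of the sample (alpha * n assumed integer)
    n = len(ys)
    k = round(alpha * n)
    return sum(sorted(ys)[k:]) / (n - k)

# e.g. on [1, 2, 3, 4] with alpha = 0.5, both representations give 3.5
```

This is a sketch for uniform empirical weights only; general weights require the weighted quantile.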

The following corollary establishes this risk functional’s higher order variant.

Corollary 3.5

(Average value-at-risk) The higher order average value-at-risk is

$$\begin{aligned} ({{\,\mathrm{{\mathsf {AV@R}}}\,}}_\alpha )_\beta (Y)={{\,\mathrm{{\mathsf {AV@R}}}\,}}_{1-(1-\alpha )(1-\beta )}(Y), \end{aligned}$$
(3.12)

where \(Y\in L^1\); equivalently,

$$\begin{aligned} {{\,\mathrm{{\mathsf {AV@R}}}\,}}_\beta (Y)=\inf _{t\in {\mathbb {R}}}\ t+\frac{1}{1-\frac{\beta -\alpha }{1-\alpha }}{{\,\mathrm{{\mathsf {AV@R}}}\,}}_\alpha \bigl ((Y-t)_+\bigr ), \end{aligned}$$
(3.13)

where \(\beta \ge \alpha \).

Proof

The spectral function of the average value-at-risk is \(\sigma _\alpha (\cdot )=\frac{{\mathbbm {1}}_{\cdot \ge \alpha }}{1-\alpha }\). It follows from (3.3) that \(u_\beta =\alpha +\beta (1-\alpha )=1-(1-\alpha )(1-\beta )\) and \((\sigma _\alpha )_\beta ={\left\{ \begin{array}{ll} 0 &{}\quad \text {if }u\le u_\beta ,\\ \frac{1}{(1-\alpha )(1-\beta )} &{}\quad \text {else}. \end{array}\right. }\) This is the spectral function of the average value-at-risk at risk level \(u_\beta \).

The assertion (3.13) follows by replacing \(\beta \) with \(\frac{\beta -\alpha }{1-\alpha }\) in (3.12). \(\square \)
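The identity (3.12) can be verified numerically for empirical distributions; again, all objectives are piecewise linear with kinks at the observations, so scanning the observations suffices. The following sketch (helper names ours) compares both sides:

```python
def avar(xs, alpha):
    # AV@R via  inf_s  s + E(X - s)_+ / (1 - alpha), scanning the kinks
    n = len(xs)
    def objective(s):
        return s + sum(max(x - s, 0.0) for x in xs) / n / (1.0 - alpha)
    return min(objective(s) for s in xs)

def higher_order_avar(ys, alpha, beta):
    # higher order AV@R: inf_t  t + AV@R_alpha((Y - t)_+) / (1 - beta)
    def objective(t):
        return t + avar([max(y - t, 0.0) for y in ys], alpha) / (1.0 - beta)
    return min(objective(t) for t in ys)

ys = [1.0, 2.0, 3.0, 4.0]
lhs = higher_order_avar(ys, alpha=0.5, beta=0.5)
rhs = avar(ys, 1 - (1 - 0.5) * (1 - 0.5))   # level 0.75, cf. (3.12)
```

Both values agree, as (3.12) predicts.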

Corollary 3.6

(Kusuoka representation of spectral risk measures) Suppose the risk functional is

$$\begin{aligned} {\mathcal {R}}(Y)=\int _0^1{{\,\mathrm{{\mathsf {AV@R}}}\,}}_\gamma (Y)\,\mu (\textrm{d}\gamma ), \end{aligned}$$
(3.14)

where \(\mu \) is a probability measure on [0, 1]. Then the higher order risk measure is

$$\begin{aligned}{\mathcal {R}}_\beta (Y)=\int _0^1{{\,\mathrm{{\mathsf {AV@R}}}\,}}_\gamma (Y)\,\mu _\beta (\textrm{d}\gamma ), \end{aligned}$$

where \(\mu _{\beta }(\cdot )\) is the measure

$$\begin{aligned} \mu _\beta (A){:}{=}p_0\cdot \delta _{u_\beta }(A)+\frac{1}{1-\beta }\mu \bigl (A\cap (u_\beta ,1 ]\bigr ) \end{aligned}$$
(3.15)

and \(u_\beta \) and \(p_0\) are determined by the equation and definition

$$\begin{aligned} \int _0^{u_\beta }\frac{u_\beta -\alpha }{1-\alpha }\mu (\textrm{d}\alpha )=\beta \ \text { and }\ p_0{:}{=}\frac{1-u_\beta }{1-\beta }\int _0^{u_\beta }\frac{\mu (\textrm{d}\alpha )}{1-\alpha }. \end{aligned}$$
(3.16)

Proof

Above all, \(\mu _\beta \) is a probability measure, because \(p_0\ge 0\) and

$$\begin{aligned} \mu _\beta ([0,1])&=p_0+\frac{1}{1-\beta }\int _{u_\beta +}^1\mu (\textrm{d}\alpha )\\&=\frac{1}{1-\beta }\int _0^{u_\beta }\frac{1-\alpha -(u_{\beta }-\alpha )}{1-\alpha }\mu (\textrm{d}\alpha )+\frac{1}{1-\beta }\int _{u_\beta +}^1\mu (\textrm{d}\alpha )\\&=\frac{1}{1-\beta }\int _0^1\mu (\textrm{d}\alpha )-\frac{\beta }{1-\beta }=1. \end{aligned}$$

The spectral function of the average value-at-risk at risk level \(\alpha \) is \(\sigma _\alpha (\cdot )=\frac{{\mathbbm {1}}_{\cdot \ge \alpha }}{1-\alpha }\). The quantile condition (3.3) thus is

$$\begin{aligned} \beta =\int _0^1\frac{\max (0,u_\beta -\alpha )}{1-\alpha }\mu (\textrm{d}\alpha ) \end{aligned}$$

and thus (3.16).

For \(u<u_\beta \), the spectral function corresponding to the measure \(\mu _\beta \) in (3.15) is 0, which coincides with (3.2). For \(u>u_\beta \), the spectral function of \({\mathcal {R}}_\beta \) is

$$\begin{aligned} \frac{p_0}{1-u_\beta }{\mathbbm {1}}_{u\ge u_\beta }+\int _{u_\beta }^1\frac{1}{1-\beta }\frac{{\mathbbm {1}}_{u\ge \alpha }}{1-\alpha }\mu (\textrm{d}\alpha )&={1 \over 1-\beta }\int _0^{u_\beta }\frac{{\mathbbm {1}}_{u\ge u_\beta }}{1-\alpha }\mu (\textrm{d}\alpha )+\int _{u_\beta }^1\frac{1}{1-\beta }\frac{{\mathbbm {1}}_{u\ge \alpha }}{1-\alpha }\mu (\textrm{d}\alpha )\\&=\frac{1}{1-\beta }\int _0^1\frac{{\mathbbm {1}}_{u\ge \alpha }}{1-\alpha }\mu (\textrm{d}\alpha ), \end{aligned}$$

which is the desired result in light of (3.2). \(\square \)

In situations of practical interest, the risk measure is often given as a finite combination of average values-at-risk at varying levels. The following corollary addresses this situation explicitly.

Corollary 3.7

Suppose that

$$\begin{aligned} {\mathcal {R}}(Y)=\sum _{i=1}^n p_i\cdot {{\,\mathrm{{\mathsf {AV@R}}}\,}}_{\alpha _i}(Y) \end{aligned}$$
(3.17)

with \(p_i\ge 0\), \(\sum _{i=1}^n p_i=1\) and \(\alpha _i\in [0,1]\) for \(i=1,\dots ,n\). Then

$$\begin{aligned} {\mathcal {R}}_\beta (Y)= p_0\cdot {{\,\mathrm{{\mathsf {AV@R}}}\,}}_{u_\beta }(Y)+\sum _{i:\alpha _i>u_\beta }{p_i \over 1-\beta }{{\,\mathrm{{\mathsf {AV@R}}}\,}}_{\alpha _i}(Y), \end{aligned}$$
(3.18)

where \(u_\beta \) satisfies \(\beta =\sum _{i=1}^n p_i\frac{\max (0,u_\beta -\alpha _i)}{1-\alpha _i}\) and \(p_0{:}{=}\sum _{i:\alpha _i\le u_\beta }\frac{p_i}{1-\alpha _i}\frac{1-u_\beta }{1-\beta }\).

For large risk levels \(\beta \), specifically if

$$\begin{aligned} \beta \ge 1-\Bigl (1-\max _{i=1,\dots ,n}\alpha _i\Bigr )\cdot \sum _{i=1}^n{p_i \over 1-\alpha _i}, \end{aligned}$$
(3.19)

the risk measure (3.18) collapses to a single average value-at-risk; it holds that

$$\begin{aligned} {\mathcal {R}}_\beta (Y)={{\,\mathrm{{\mathsf {AV@R}}}\,}}_{1-(1-{\tilde{\alpha }})(1-\beta )}(Y), \end{aligned}$$

where \({\tilde{\alpha }}\) is the weighted risk quantile \({\tilde{\alpha }}{:}{=}\frac{\sum _{i=1}^n\frac{p_i}{1-\alpha _i}\alpha _i}{\sum _{i=1}^n\frac{p_i}{1-\alpha _i}}\).

Proof

The result corresponds to the measure \(\mu =\sum _{i=1}^n p_i\,\delta _{\alpha _i}\) in (3.14), which is a special case in Corollary 3.6.

For \(u_\beta \ge \alpha _i\), \(i=1,\dots ,n\), it holds that \(\beta =\sum _{i=1}^n p_i\frac{u_\beta -\alpha _i}{1-\alpha _i}= \sum _{i=1}^n p_i\frac{1-\alpha _i-(1-u_\beta )}{1-\alpha _i} =1-(1-u_\beta )\sum _{i=1}^n\frac{p_i}{1-\alpha _i}\), so that \(u_\beta \ge \max _{i=1,\dots ,n}\alpha _i\) is equivalent to (3.19). It follows that

$$\begin{aligned} u_\beta&=1-\frac{1-\beta }{\sum _{i=1}^n \frac{p_i}{1-\alpha _i}} \nonumber \\&=1-(1-\beta )\left( 1-\frac{\sum _{i=1}^n\frac{p_i}{1-\alpha _i}-\sum _{i=1}^n\frac{p_i(1-\alpha _i)}{1-\alpha _i}}{\sum _{i=1}^n\frac{p_i}{1-\alpha _i}}\right) \nonumber \\&=1-(1-\beta )(1-{\tilde{\alpha }}), \end{aligned}$$
(3.20)

and \(p_0= \sum _{i=1}^n{p_i \over 1-\alpha _i}{1-u_\beta \over 1-\beta }=1\); the result thus follows with (3.18). \(\square \)

Remark 3.8

Corollary 3.5 is a special case of (3.18) in the preceding corollary, as \({\tilde{\alpha }}= \alpha \) in this case.
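For a concrete instance, the piecewise linear equation for \(u_\beta \) can be solved by bisection and checked against the collapse formulas (3.19) and (3.20). The following sketch uses illustrative weights and risk levels; the helper `u_beta` is not part of the paper's notation.

```python
# Numerical check of the collapse formulas (3.19)-(3.20) for a mixture
# of two average values-at-risk; the weights and levels are illustrative.
def u_beta(p, alphas, beta, tol=1e-12):
    """Solve beta = sum_i p_i * max(0, u - alpha_i)/(1 - alpha_i) by bisection."""
    def g(u):
        return sum(pi * max(0.0, u - ai) / (1.0 - ai)
                   for pi, ai in zip(p, alphas)) - beta
    lo, hi = 0.0, 1.0          # g is increasing with g(0) <= 0 <= g(1)
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if g(mid) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

p, alphas, beta = [0.5, 0.5], [0.0, 0.8], 0.6
threshold = 1 - (1 - max(alphas)) * sum(pi / (1 - ai) for pi, ai in zip(p, alphas))
assert beta >= threshold       # condition (3.19) holds, so the measure collapses

u = u_beta(p, alphas, beta)
alpha_tilde = (sum(pi * ai / (1 - ai) for pi, ai in zip(p, alphas))
               / sum(pi / (1 - ai) for pi, ai in zip(p, alphas)))
p0 = sum(pi / (1 - ai) for pi, ai in zip(p, alphas) if ai <= u) * (1 - u) / (1 - beta)

print(abs(u - (1 - (1 - beta) * (1 - alpha_tilde))) < 1e-9)  # True: (3.20)
print(abs(p0 - 1.0) < 1e-9)                                  # True
```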

The following statement generalizes the preceding results and provides the higher order risk functional for general risk measures.

Theorem 3.9

(Kusuoka representation of higher order risk measures) Let \({\mathcal {R}}\) be a law invariant risk measure with Kusuoka representation

$$\begin{aligned} {\mathcal {R}}(Y)=\sup _{\mu \in {\mathcal {M}}}{\mathcal {R}}_\mu (Y). \end{aligned}$$
(3.21)

The higher order risk measure is

$$\begin{aligned} {\mathcal {R}}_\beta (Y)=\sup _{\mu \in {\mathcal {M}}}{\mathcal {R}}_{\mu _\beta }(Y), \end{aligned}$$

where the truncated measures \(\mu _\beta \) are given in (3.15).

Proof

For the risk functional defined in (3.21) it follows from the min-max inequality that

$$\begin{aligned} {\mathcal {R}}_\beta (Y)&=\inf _{t\in {\mathbb {R}}}\ t+\frac{1}{1-\beta }\sup _{\mu \in {\mathcal {M}}}{\mathcal {R}}_\mu \bigl ((Y-t)_+\bigr )\nonumber \\&\ge \sup _{\mu \in {\mathcal {M}}}\inf _{t\in {\mathbb {R}}}\ t+\frac{1}{1-\beta }{\mathcal {R}}_\mu \bigl ((Y-t)_+\bigr )\nonumber \\&=\sup _{\mu \in {\mathcal {M}}}({\mathcal {R}}_\mu )_\beta (Y)\nonumber \\&=\sup _{\mu \in {\mathcal {M}}}{\mathcal {R}}_{\mu _\beta }(Y), \end{aligned}$$
(3.22)

where we have used Corollary 3.6.

For the reverse inequality in (3.22) consider the function

$$\begin{aligned} (t,\mu )\mapsto t+{\mathcal {R}}_\mu \bigl ((Y-t)_+\bigr ) \end{aligned}$$

on \({\mathbb {R}}\times {\mathcal {M}}([0,1])\), where \({\mathcal {M}}([0,1])\) collects the probability measures on [0, 1] (with its Borel \(\sigma \)-algebra). By its definition (3.14), this function is linear in \(\mu \) and convex in t. By Prokhorov’s theorem, the set \({\mathcal {M}}([0,1])\) of probability measures is sequentially compact with respect to the weak topology, as [0, 1] is compact. From Sion’s minimax theorem (cf. Sion (1958)) it follows that equality holds in (3.22). Thus, the result. \(\square \)

Theorem 3.9 provides an explicit characterization for the general higher order risk measure. The following section exploits this representation to characterize general stochastic dominance relations.

4 General stochastic dominance relations

As Sect. 2 mentions above, the risk measure \({\mathcal {R}}\) defines a norm via the setting \(\Vert \cdot \Vert {:}{=}{\mathcal {R}}(|\cdot |)\) (cf. (2.6)), and conversely, the norm \(\Vert \cdot \Vert \) defines a risk measure via \({\mathcal {R}}_\beta ^{\Vert \cdot \Vert }\), cf. (2.3). In what follows we connect a specific stochastic dominance relation with the norm. This stochastic dominance relation can be described by higher order risk measures, developed in the preceding Sect. 3.

We start by defining the stochastic dominance relation based on a monotone norm and consider the Lebesgue norm and stochastic dominance for integer orders in Sect. 4.2 below.

Definition 4.1

(Stochastic dominance) Let X, \(Y\in {\mathcal {Y}}\) be \({\mathbb {R}}\)-valued random variables in a Banach space \(({\mathcal {Y}},\Vert \cdot \Vert )\). The random variable X is dominated by Y, denoted

$$\begin{aligned} X\preccurlyeq ^{\Vert \cdot \Vert }Y, \end{aligned}$$

if

$$\begin{aligned} \Vert (t-X)_+\Vert \ge \Vert (t-Y)_+\Vert \ \text { for all }t\in {\mathbb {R}}. \end{aligned}$$
(4.1)

If the norm is unambiguous from the context, we shall also simply write \(\preccurlyeq \) instead of \(\preccurlyeq ^{\Vert \cdot \Vert }\).
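The defining inequality (4.1) can be tested empirically on samples. The sketch below does so for the empirical Lebesgue norm on a finite grid of thresholds; the function names and sample values are illustrative, and a finite grid can only falsify, not certify, the relation.

```python
# Empirical check of the dominance relation (4.1) for the L^p norm.
def lp_norm_pos_part(t, sample, p):
    """Empirical ||(t - X)_+||_p for an equally weighted sample of X."""
    return (sum(max(t - x, 0.0) ** p for x in sample) / len(sample)) ** (1.0 / p)

def dominated(sample_x, sample_y, p=2, grid=200, tol=1e-12):
    """Test X <= Y in the sense of (4.1) on a grid of thresholds t."""
    lo = min(sample_x + sample_y) - 1.0
    hi = max(sample_x + sample_y) + 1.0
    ts = [lo + (hi - lo) * k / grid for k in range(grid + 1)]
    return all(lp_norm_pos_part(t, sample_x, p)
               >= lp_norm_pos_part(t, sample_y, p) - tol for t in ts)

X = [0.0, 1.0, 2.0, 3.0]
Y = [x + 0.5 for x in X]   # pointwise improvement of X
print(dominated(X, Y))     # True: X is dominated by Y
print(dominated(Y, X))     # False
```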

The set of random variables dominating a given random variable is convex.

Lemma 4.2

(Convexity of the stochastic dominance cone) For \(X\in {\mathcal {Y}}\) given, the set

$$\begin{aligned} \left\{ Y\in {\mathcal {Y}}:X\preccurlyeq Y\right\} \end{aligned}$$

is convex.

Proof

The map \(y\mapsto (t-y)_+\) is convex, as follows from reflecting and translating the convex function \(x\mapsto x_+\). Suppose that \(X\preccurlyeq Y_0\) and \(X\preccurlyeq Y_1\). Then it follows for \(Y_\lambda {:}{=}(1-\lambda )Y_0+\lambda \,Y_1\), together with monotonicity of the norm and (4.1), that

$$\begin{aligned} \Vert (t-Y_\lambda )_+\Vert&\le \left\| \bigl ((1-\lambda )(t-Y_0)+\lambda (t-Y_{1})\bigr )_{+}\right\| \\&\le (1-\lambda )\Vert (t-Y_0)_+\Vert +\lambda \Vert (t-Y_1)_+\Vert \\&\le (1-\lambda )\Vert (t-X)_+\Vert +\lambda \Vert (t-X)_+\Vert \\&=\Vert (t-X)_+\Vert . \end{aligned}$$

That is, it holds that \(X\preccurlyeq Y_\lambda \) and thus the assertion. \(\square \)

4.1 Characterization of stochastic dominance relations

Stochastic dominance relations can be fully characterized by higher order risk measures. The following theorem presents this main result, which integrates the details developed above for these risk functionals and stochastic dominance relations.

Theorem 4.3

(Characterization of stochastic dominance, cf. Gómez et al. (2022)) The following are equivalent:

  1. (i)

    \(X\preccurlyeq ^{\Vert \cdot \Vert }Y\),

  2. (ii)

    \({\mathcal {R}}_\beta (-X)\ge {\mathcal {R}}_\beta (-Y)\) for all \(\beta \in [0,1)\), and

  3. (iii)

    \(\inf _{Z\in {\mathcal {Z}}_\beta }{{\,\mathrm{{\mathbb {E}}}\,}}ZX\le \inf _{Z\in {\mathcal {Z}}_\beta }{{\,\mathrm{{\mathbb {E}}}\,}}ZY\) for every \(\beta \in (0,1)\), where

$$\begin{aligned} {\mathcal {Z}}_\beta {:}{=}\left\{ Z\in {\mathcal {Y}}^*:\Vert Z\Vert _*\le \frac{1}{1-\beta },\ {{\,\mathrm{{\mathbb {E}}}\,}}Z=1,\ Z\ge 0\right\} \end{aligned}$$

    is the positive cone (\(Z\ge 0\)) in the dual ball with radius \(\frac{1}{1-\beta }\) (\(\Vert Z\Vert _*\le \frac{1}{1-\beta }\)), intersected with the simplex (\({{\,\mathrm{{\mathbb {E}}}\,}}Z=1\)).

Proof

Suppose that \(X\preccurlyeq ^{\Vert \cdot \Vert }Y\), then, by definition, \(\Vert (t-X)_+\Vert \ge \Vert (t-Y)_+\Vert \) for every \(t\in {\mathbb {R}}\). It follows that \(t+\frac{1}{1-\beta }\Vert (-X-t)_+\Vert \ge t+\frac{1}{1-\beta }\Vert (-Y-t)_+\Vert \) for all \(t\in {\mathbb {R}}\), and thus assertion (ii) after passing to the infimum.

For the converse, assume that (ii) holds. To demonstrate (i), note first that \(q\mapsto \Vert (q-X)_+\Vert \) is convex; indeed, with \(q_\lambda {:}{=}(1-\lambda )q_0+\lambda \,q_1\) and \((a+b)_+\le a_+ +b_+\) it holds that

$$\begin{aligned} (q_\lambda -X)_+= \bigl ((1-\lambda )(q_0-X)+\lambda (q_1-X)\bigr )_+\le (1-\lambda )(q_0-X)_+ +\lambda (q_1-X)_+ \end{aligned}$$

and thus

$$\begin{aligned} \Vert (q_\lambda -X)_+\Vert \le (1-\lambda )\cdot \Vert (q_0-X)_+\Vert +\lambda \cdot \Vert (q_1-X)_+\Vert \end{aligned}$$

by monotonicity and the triangle inequality of the norm.

For \(q\in {\mathbb {R}}\) fixed, choose

$$\begin{aligned}\alpha \in \partial _\eta \ \Vert (\eta -Y)_+\Vert \Big |_{\eta =q}, \end{aligned}$$

that is, the subdifferential (of the convex function \(\eta \mapsto \Vert (\eta -Y)_+\Vert \)) evaluated at \(\eta =q\), and note that \(\alpha \in [0,1]\). Set \(\beta {:}{=}1-\alpha \), and observe that

$$\begin{aligned} 0\in \partial _q\ -q+\frac{1}{1-\beta }\Vert (q-Y)_+\Vert \end{aligned}$$

so that

$$\begin{aligned} {\mathcal {R}}_\beta (-Y)= -q+\frac{1}{1-\beta }\Vert (q-Y)_+\Vert \end{aligned}$$

by (2.3). Employing the definition (2.3) again and assumption (ii), it follows that

$$\begin{aligned} -q+\frac{1}{1-\beta }\Vert (-X+q)_+\Vert&\ge {\mathcal {R}}_\beta (-X)\\&\ge {\mathcal {R}}_\beta (-Y)\\&= -q+ \frac{1}{1-\beta }\Vert (q-Y)_+\Vert , \end{aligned}$$

or equivalently

$$\begin{aligned} \Vert (q-X)_+\Vert \ge \Vert (q-Y)_+\Vert . \end{aligned}$$

The assertion (i) follows, as \(q\in {\mathbb {R}}\) was arbitrary; this establishes equivalence of (i) and (ii).

Finally, let \(\beta \in (0,1)\). With (ii) and Proposition 2.6 we have that

$$\begin{aligned} \inf _{Z\in {\mathcal {Z}}_\beta }{{\,\mathrm{{\mathbb {E}}}\,}}ZX\le \inf _{Z\in {\mathcal {Z}}_\beta }{{\,\mathrm{{\mathbb {E}}}\,}}ZY, \end{aligned}$$

where the infimum in both expressions is among \(Z\in {\mathcal {Z}}_\beta = \left\{ Z\in {\mathcal {Z}}:\ \Vert Z\Vert _*\le \frac{1}{1-\beta }\right\} \), as the set \({\mathcal {Z}}_\beta \) collects the constraints in (2.8). This establishes equivalence between (ii) and (iii). \(\square \)
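Assertion (ii) can also be verified numerically. The sketch below evaluates the higher order risk measure (2.3) for the empirical \(L^2\) norm by a coarse grid search over t (the grid, sample, and helper `r_beta` are illustrative choices) and confirms (ii) for a pointwise-improved sample.

```python
# Numerical check of Theorem 4.3 (ii) for the empirical L^2 norm.
def r_beta(sample, beta, p=2, grid=400):
    """Higher order risk measure (2.3), approximated by a grid search over t."""
    n = len(sample)
    lo, hi = min(sample) - 1.0, max(sample) + 1.0
    best = float("inf")
    for k in range(grid + 1):
        t = lo + (hi - lo) * k / grid
        norm = (sum(max(y - t, 0.0) ** p for y in sample) / n) ** (1.0 / p)
        best = min(best, t + norm / (1.0 - beta))
    return best

X = [0.0, 1.0, 2.0, 3.0]
Y = [x + 0.5 for x in X]   # Y improves on X pointwise, so X is dominated by Y
negX = [-x for x in X]
negY = [-y for y in Y]
betas = [0.0, 0.25, 0.5, 0.75, 0.9]
print(all(r_beta(negX, b) >= r_beta(negY, b) - 1e-9 for b in betas))  # True
```

Since the objective of (2.3) for \(-X\) dominates the one for \(-Y\) pointwise in t, the minima over a common grid inherit the ordering, so the grid approximation does not affect the comparison.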

Remark 4.4

The quantity \(-{\mathcal {R}}(-Y)=:{\mathcal {A}}(Y)\) arising naturally in Theorem 4.3 (ii) above is often called an acceptability functional, cf. Pflug and Römisch (2007).

Corollary 4.5

Suppose that

$$\begin{aligned} {{\,\mathrm{{\mathbb {E}}}\,}}ZX\le {{\,\mathrm{{\mathbb {E}}}\,}}ZY\text { for all }Z\in {\mathcal {Z}}{:}{=}\bigcup _{\beta \in (0,1)}{\mathcal {Z}}_{\beta }, \end{aligned}$$
(4.2)

then X is dominated by Y, \(X\preccurlyeq ^{\Vert \cdot \Vert }Y\). Further, the assertion (4.2) is equivalent to

$$\begin{aligned} {\mathcal {R}}_\beta (X-Y)\le 0\quad \text {for all }\beta \in (0,1). \end{aligned}$$
(4.3)

Proof

Fix \(\beta \in (0,1)\), then \(\inf _{Z\in {\mathcal {Z}}_\beta }{{\,\mathrm{{\mathbb {E}}}\,}}ZX\le \inf _{Z\in {\mathcal {Z}}_\beta }{{\,\mathrm{{\mathbb {E}}}\,}}ZY\) by (4.2). With (iii) in the preceding Theorem 4.3 it follows that \(X\preccurlyeq Y\).

With (2.7), the statement (4.3) is equivalent to \({{\,\mathrm{{\mathbb {E}}}\,}}Z(X-Y)\le 0\) for \(Z\in {\mathcal {Z}}\), and hence the assertion. \(\square \)

Remark 4.6

The assertion (4.3), however, is strictly stronger than (ii) in Theorem 4.3. Indeed, it follows with convexity and (4.3) that

$$\begin{aligned} {\mathcal {R}}(-Y) \le {\mathcal {R}}(X-Y) +{\mathcal {R}}(-X) \le {\mathcal {R}}(-X), \end{aligned}$$

and hence (ii); the reverse implication, however, does not hold true.

Example 4.7

(Uniform norm) For the uniform norm \(\Vert \cdot \Vert _\infty \), the defining relation (4.1) is equivalent to

$$\begin{aligned}X\preccurlyeq ^{\Vert \cdot \Vert _\infty }Y \iff \mathop {\mathrm {ess\,inf}}\limits X \le \mathop {\mathrm {ess\,inf}}\limits Y;\end{aligned}$$

this relation derives from the characterization (i) in Theorem 4.3 as well.

4.2 Higher order stochastic dominance

A traditional way of introducing stochastic dominance relations is by iterating integrals of the cumulative distribution function. This corresponds to the special case of the Lebesgue norm \(\Vert \cdot \Vert _p\), \(p\in [1,\infty )\), with integer \(p\in {\mathbb {N}}\).

Definition 4.8

(Higher order stochastic dominance, cf. Müller and Stoyan (2002)) The random variable X is dominated by Y in first order stochastic dominance, if

$$\begin{aligned} F_X(x)\ge F_Y(x)\text { for all }x\in {\mathbb {R}}, \end{aligned}$$

where \(F_X(x){:}{=}P(X\le x)\) is the cumulative distribution function. We shall write \(X\preccurlyeq ^{(1)}Y\). For \(p\in [1,\infty ]\), the random variable X is stochastically dominated by Y in pth-stochastic order, if

$$\begin{aligned} {{\,\mathrm{{\mathbb {E}}}\,}}(x-X)_+^{p-1}\ge {{\,\mathrm{{\mathbb {E}}}\,}}(x-Y)_+^{p-1}\text { for all }x\in {\mathbb {R}}; \end{aligned}$$
(4.4)

we write \(X\preccurlyeq ^{(p)}Y\).

By (4.1) in Definition 4.1,

$$\begin{aligned} X\preccurlyeq ^{(p+1)} Y \text { is equivalent to }X\preccurlyeq ^{\Vert \cdot \Vert _p}Y, \qquad p\ge 1, \end{aligned}$$

where \(\Vert \cdot \Vert _p\) is the usual norm in the Lebesgue space \(L^p\). It is for historical (although unfortunate) reasons that the p-indices in the preceding display do not match. Higher order stochastic dominance of integral orders has been introduced and considered in earlier publications.
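The moment condition (4.4) is straightforward to check on discrete samples. The sketch below uses a mean-preserving spread, which is dominated in second (and, consistent with (4.5), in every higher) order; all helper names are illustrative.

```python
# Empirical check of pth-order stochastic dominance via condition (4.4).
def lower_partial_moment(sample, x, q):
    """Empirical E(x - X)_+^q on an equally weighted sample."""
    return sum(max(x - v, 0.0) ** q for v in sample) / len(sample)

def dominated_order_p(sample_x, sample_y, p, grid=200, tol=1e-12):
    """Check E(x - X)_+^(p-1) >= E(x - Y)_+^(p-1) for x on a grid."""
    lo = min(sample_x + sample_y) - 1.0
    hi = max(sample_x + sample_y) + 1.0
    xs = [lo + (hi - lo) * k / grid for k in range(grid + 1)]
    return all(lower_partial_moment(sample_x, x, p - 1)
               >= lower_partial_moment(sample_y, x, p - 1) - tol for x in xs)

Y = [1.0, 2.0]   # concentrated outcomes
X = [0.0, 3.0]   # mean-preserving spread of Y
print(dominated_order_p(X, Y, p=2))   # True: second order dominance
print(dominated_order_p(Y, X, p=2))   # False
print(dominated_order_p(X, Y, p=3))   # True, consistent with (4.5)
```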

Lemma 4.9

(Cf. Ogryczak and Ruszczyński (1999, 2001)) With \(F_X^{(1)}(\cdot ){:}{=}F_X(\cdot )\), the kth (\(k=2,3,\dots \)) repeated integral is \(F_X^{(k)}(x){:}{=}\int _{-\infty }^x F_X^{(k-1)}(y)\,\textrm{d}y\). The following two statements are equivalent; they characterize stochastic dominance of integer orders (\(k=1,2,\dots \)) by repeated integrals:

  1. (i)

    \(X\preccurlyeq ^{(k)}Y\),

  2. (ii)

    \(F_Y^{(k)}(x)\ge F_X^{(k)}(x)\) for all \(x\in {\mathbb {R}}\).

Proof

It holds with Cauchy’s formula for repeated integration that

$$\begin{aligned} F_X^{(k)}(x)=\frac{1}{(k-2)!}\int _{-\infty }^x(x-y)^{k-2}F_X(y)\,\textrm{d}y. \end{aligned}$$

By integration by parts, the latter is

$$\begin{aligned} F_X^{(k)}(x)=\frac{1}{(k-1)!}\int _{-\infty }^x(x-y)^{k-1}\,\textrm{d}F_X(y), \end{aligned}$$

so that

$$\begin{aligned} F_X^{(k)}(x)=\frac{1}{(k-1)!}\int _{-\infty }^\infty (x-y)_+^{k-1}\textrm{d}F_X(y)=\frac{1}{(k-1)!}{{\,\mathrm{{\mathbb {E}}}\,}}(x-X)_+^{k-1}, \end{aligned}$$

from which the assertion follows with the defining condition (4.4) in Definition 4.8. \(\square \)

Remark 4.10

It follows from the iterated integral and (ii) in Lemma 4.9 that \(X\preccurlyeq ^{(k)} Y \implies X\preccurlyeq ^{(k+1)}Y\) for all natural numbers \(k=1,2,\dots \). We notice next that

$$\begin{aligned} X\preccurlyeq ^{(p)} Y \implies X\preccurlyeq ^{(p^\prime )} Y\ \text { for all real numbers } 1\le p\le p^\prime \in {\mathbb {R}}. \end{aligned}$$
(4.5)

To this end note first that the characterization (4.4) is equivalent to

$$\begin{aligned} \int _{-\infty }^x(x-z)^{p-1}\textrm{d}F_X(z) \ge \int _{-\infty }^x(x-z)^{p-1}\textrm{d}F_Y(z)\text { for all }x\in {\mathbb {R}}. \end{aligned}$$
(4.6)

With \(\int _z^x(x-y)^{\alpha -1}(y-z)^{\beta -1}\,\textrm{d}y =B(\alpha ,\beta ) (x-z)^{\beta +\alpha -1}\) (B is Euler’s integral of the first kind) and integration by parts it follows that

$$\begin{aligned} \int _{-\infty }^x(x-z)^{p^\prime -1}\,\textrm{d}F_X(z)&=\frac{1}{B(p,p^\prime -p)}\int _{-\infty }^x\int _z^x(x-y)^{p^\prime -p-1}(y-z)^{p-1}\,\mathrm d y\,\mathrm d F_X(z)\nonumber \\&=\frac{1}{B(p,p^\prime -p)}\int _{-\infty }^x(x-y)^{p^\prime -p-1}\int _{-\infty }^y(y-z)^{p-1}\,\mathrm d F_X(z)\,\mathrm d y\nonumber \\&\ge \frac{1}{B(p,p^\prime -p)}\int _{-\infty }^x(x-y)^{p^\prime -p-1}\int _{-\infty }^y(y-z)^{p-1}\, \mathrm d F_Y(z)\,\mathrm d y\nonumber \\&= \int _{-\infty }^x(x-z)^{p^\prime -1}\, \mathrm d F_Y(z), \end{aligned}$$
(4.7)

where we have used the characterization (4.6) in (4.7), as \(x-y\ge 0\) and that \(B(p,p^\prime -p)\) is well-defined and positive for \(p^\prime >p\). The assertion again follows with (4.6).
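The Euler integral identity employed above can be confirmed numerically. The following sketch compares a midpoint-rule quadrature with \(B(\alpha ,\beta )\) evaluated via the gamma function; the parameter values are arbitrary illustrative choices.

```python
import math

def beta_fn(a, b):
    """Euler's integral of the first kind via the gamma function."""
    return math.gamma(a) * math.gamma(b) / math.gamma(a + b)

def lhs_integral(a, b, z, x, steps=200_000):
    """Midpoint rule for the integral of (x-y)^(a-1) (y-z)^(b-1) over [z, x]."""
    h = (x - z) / steps
    total = 0.0
    for k in range(steps):
        y = z + (k + 0.5) * h
        total += (x - y) ** (a - 1) * (y - z) ** (b - 1)
    return total * h

a, b, z, x = 2.0, 1.5, 0.0, 2.0
approx = lhs_integral(a, b, z, x)
exact = beta_fn(a, b) * (x - z) ** (a + b - 1)
print(abs(approx - exact) < 1e-3)  # True
```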

4.3 Characterization of stochastic dominance for spectral risk measures

The following builds on the spectral risk measure \({\mathcal {R}}_\sigma (\cdot )\) introduced in Definition 3.1 and considers the norm

$$\begin{aligned} \Vert \cdot \Vert _\sigma {:}{=}{\mathcal {R}}_\sigma (|\cdot |) \end{aligned}$$

for the spectral function \(\sigma \). Theorem 4.3 and the characterization of higher order spectral risk measures (Theorem 3.2) give rise to the following result.

Theorem 4.11

The stochastic dominance relation

$$\begin{aligned}X\preccurlyeq ^{\Vert \cdot \Vert _\sigma }Y \end{aligned}$$

with respect to the norm associated with the spectral risk measure \({\mathcal {R}}_\sigma \) is equivalent to

$$\begin{aligned}&-\sigma _p\cdot {{\,\mathrm{{\mathsf {V@R}}}\,}}_p(Y)+\int _{-\infty }^{{{\,\mathrm{{\mathsf {V@R}}}\,}}_p(Y)}\Sigma \bigl (S_Y(y)\bigr )\,\textrm{d}y\\&\le -\sigma _p\cdot {{\,\mathrm{{\mathsf {V@R}}}\,}}_p(X)+\int _{-\infty }^{{{\,\mathrm{{\mathsf {V@R}}}\,}}_p(X)}\Sigma \bigl (S_X(x)\bigr )\,\textrm{d}x\quad \text {for all }p\in (0,1), \end{aligned}$$

where \(\sigma _p{:}{=}\int _{1-p}^1 \sigma (u)\,\textrm{d}u\) and \(S_X(x){:}{=}1-F_X(x)= P(X>x)\) is the survival function of the random variable X.

Proof

We argue with the norm \(\Vert Y\Vert _\sigma {:}{=}{\mathcal {R}}_\sigma (|Y|)\). Note that \((Y-t)_+\ge 0\), hence the defining equation (2.3) is

$$\begin{aligned} {\mathcal {R}}_\beta ^{\Vert \cdot \Vert _\sigma }(Y)&=\inf _{t\in {\mathbb {R}}} \ t+\frac{1}{1-\beta }\Vert (Y-t)_+\Vert _\sigma \nonumber \\&=\inf _{t\in {\mathbb {R}}}\ t+{1 \over 1-\beta }{\mathcal {R}}_\sigma \bigl ((Y-t)_+ \bigr )\nonumber \\&={\mathcal {R}}_{\sigma _\beta }(Y), \end{aligned}$$
(4.8)

where we have used Theorem 3.2 in (4.8).

From (3.8) we have that

$$\begin{aligned} {\mathcal {R}}_{\beta }(-Y)&={{\,\mathrm{{\mathsf {V@R}}}\,}}_{u_\beta }(-Y)+\frac{1}{1-\beta }\int _{{{\,\mathrm{{\mathsf {V@R}}}\,}}_{u_\beta }(-Y)}^{\infty }\Sigma \bigl (F_{-Y}(y)\bigr )\,\textrm{d}y\\&=-{{\,\mathrm{{\mathsf {V@R}}}\,}}_{1-u_\beta }(Y)+\frac{1}{1-\beta }\int _{-{{\,\mathrm{{\mathsf {V@R}}}\,}}_{1-u_\beta }(Y)}^{\infty }\Sigma \bigl (S_Y(-y)\bigr )\,\textrm{d}y\\&=-{{\,\mathrm{{\mathsf {V@R}}}\,}}_{1-u_\beta }(Y)+\frac{1}{1-\beta }\int _{-\infty }^{{{\,\mathrm{{\mathsf {V@R}}}\,}}_{1-u_\beta }(Y)}\Sigma \bigl (S_Y(y)\bigr )\,\textrm{d}y, \end{aligned}$$

where we have used that \(F_{-Y}(y)=P(-Y\le y)=P(Y\ge -y)= 1-F_Y(-y)=S_Y(-y)\) and \({{\,\mathrm{{\mathsf {V@R}}}\,}}_\alpha (-Y)=-{{\,\mathrm{{\mathsf {V@R}}}\,}}_{1-\alpha }(Y)\) at points of continuity of \(F_Y(\cdot )\).

Now set \(1-u_{\beta }{=}{:}p\). Then, by employing the characterizing relation (3.3) for the \(\beta \)-quantile of \(\sigma \), it holds that

$$\begin{aligned} 1-\beta =\int _{u_\beta }^1 \sigma (u)\,\textrm{d}u= \int _{1-p}^1 \sigma (u)\,\textrm{d}u= \sigma _p, \end{aligned}$$

so that

$$\begin{aligned} {\mathcal {R}}_\beta (-Y)=-{{\,\mathrm{{\mathsf {V@R}}}\,}}_{p}(Y)+\frac{1}{\sigma _p}\int _{-\infty }^{{{\,\mathrm{{\mathsf {V@R}}}\,}}_p(Y)}\Sigma \bigl (S_Y(y)\bigr )\,\textrm{d}y. \end{aligned}$$

By Theorem 4.3, the relation \(X\preccurlyeq ^{\Vert \cdot \Vert _\sigma }Y\) is equivalent to \({\mathcal {R}}_\beta ^{\Vert \cdot \Vert _\sigma }(-Y)\le {\mathcal {R}}_\beta ^{\Vert \cdot \Vert _\sigma }(-X)\) for all \(\beta \in (0,1)\). With that, the assertion follows. \(\square \)

4.4 Comparison of stochastic order relations

Different stochastic dominance relations may vary in strength (the implication (4.5) in the preceding Remark 4.10 is an example). In what follows, we provide an explicit relation to compare stochastic dominance relations, which are built on different spectral functions.

Proposition 4.12

(Comparison of spectral stochastic orders) Suppose that

$$\begin{aligned} \sigma _\mu (u) = \sigma (u)\cdot \int _0^{\beta _u}\frac{\mu (\textrm{d}\beta )}{1-\beta } \end{aligned}$$
(4.9)

for some probability measure \(\mu \), where \(\beta _u{:}{=}\sup \{\beta :u_\beta \le u\}\) is the generalized inverse of the non-decreasing map \(\beta \mapsto u_\beta \) defined in (3.3). Then the stochastic order associated with \(\sigma _\mu \) is weaker than the genuine stochastic order associated with \(\sigma \). Specifically, for different spectral functions \(\sigma \) and \(\sigma _\mu \), it holds that

$$\begin{aligned} X\preccurlyeq ^{\Vert \cdot \Vert _\sigma }Y\implies X\preccurlyeq ^{\Vert \cdot \Vert _{\sigma _\mu }}Y. \end{aligned}$$

Remark 4.13

The function \(\sigma _\mu \) in (4.9) is indeed a spectral function. It is positive, as \(\mu \) is a positive measure (thus (i) in Definition 3.1). The function is non-decreasing, as \(u_\beta \) is non-decreasing for \(\beta \) increasing. Finally, the function \(\sigma _\mu \) is a density: indeed, it holds that

$$\begin{aligned} \int _0^1\sigma _\mu (u)\,\textrm{d}u = \int _0^1\sigma (u)\cdot \int _0^{\beta _u} {\mu (\mathrm d\beta ) \over 1-\beta }\textrm{d}u = \int _0^1\int _{u_\beta }^1 \sigma (u)\,\textrm{d}u\ \frac{\mu (\mathrm d\beta )}{1-\beta } = \int _0^1 \mu (\mathrm d\beta ) = 1 \end{aligned}$$

by interchanging the order of integration, where we have used the definition of \(u_\beta \) in (3.3).

Proof of Proposition 4.12

Since \(X\preccurlyeq ^{\Vert \cdot \Vert _\sigma }Y\), it holds with Theorem 4.3 that \({\mathcal {R}}_{\sigma _\beta }(-X)\ge {\mathcal {R}}_{\sigma _\beta }(-Y)\) for all \(\beta \in (0,1)\), where \(\sigma _\beta \) is defined in (3.2). By the characterization (3.1), this is

$$\begin{aligned} \int _{u_\beta }^1{\sigma (u) \over 1-\beta }F_{-X}^{-1}(u)\,\textrm{d}u\ge \int _{u_\beta }^1{\sigma (u) \over 1-\beta }F_{-Y}^{-1}(u)\,\textrm{d}u,\qquad \beta \in (0,1). \end{aligned}$$

Integrating the latter expression with respect to \(\mu (\textrm{d}\beta )\) establishes the inequality

$$\begin{aligned} \int _\beta ^1\int _{u_{\beta ^\prime }}^1{\sigma (u) \over 1-\beta ^\prime }F_{-X}^{-1}(u)\,\textrm{d}u\,\mu (\textrm{d}\beta ^\prime ) \ge \int _\beta ^1\int _{u_{\beta ^\prime }}^1{\sigma (u) \over 1-\beta ^\prime }F_{-Y}^{-1}(u)\,\textrm{d}u\,\mu (\textrm{d}\beta ^\prime ),\qquad \beta \in (0,1). \end{aligned}$$

Interchanging the order of integration together with (4.9) gives that

$$\begin{aligned} \int _{u_\beta }^1\int _\beta ^{\beta _u}{\sigma (u) \over 1-\beta ^\prime }\,\mu (\textrm{d}\beta ^\prime )\,F_{-X}^{-1}(u)\,\textrm{d}u \ge \int _{u_{\beta }}^1\int _\beta ^{\beta _u}{\sigma (u) \over 1-\beta ^\prime }\,\mu (\textrm{d}\beta ^\prime )\,F_{-Y}^{-1}(u)\,\textrm{d}u,\qquad \beta \in (0,1), \end{aligned}$$

which in turn is

$$\begin{aligned} \int _{u_\beta }^1\sigma _\mu (u)F_{-X}^{-1}(u)\,\textrm{d}u \ge \int _{u_\beta }^1\sigma _\mu (u)F_{-Y}^{-1}(u)\,\textrm{d}u,\qquad \beta \in (0,1). \end{aligned}$$

This is the assertion. \(\square \)

5 Example: the expectile

The expectile risk measure, originally introduced by Newey and Powell (1987), has recently gained additional interest (cf. Malandii et al. (2024), Balbás et al. (2023) or Farooq and Steinwart (2018) for conditional regressions). A main reason for this additional interest is that the expectile is the only coherent risk measure which is elicitable (cf. Ziegel (2014)).

As Proposition 2.6 indicates, the higher order risk measure can be based on the dual norm. For this reason, the following section establishes the dual norm of expectiles first, as it is crucial in understanding its regret function in the risk quadrangle. Next, we provide an explicit characterization of the higher order expectiles, that is, the higher order risk measure based on the expectile risk measure.

The expectile is defined as a minimizer. Its Kusuoka representation is central in elaborating the corresponding higher order risk functional.

Definition 5.1

For \(\alpha \in (0,1)\), the expectile is

$$\begin{aligned} e_\alpha (Y)=\mathop {\mathrm {{arg\,min}}}\limits _{x\in {\mathbb {R}}}{{\,\mathrm{{\mathbb {E}}}\,}}\ell _\alpha (Y-x), \end{aligned}$$
(5.1)

where

$$\begin{aligned} \ell _{\alpha }(x)={\left\{ \begin{array}{ll} \quad \alpha \,x^2 &{}\quad \text {if }x\ge 0,\\ (1-\alpha )x^2 &{}\quad \text {else} \end{array}\right. } \end{aligned}$$

is the asymmetric loss, or quadratic error function.

The expectile satisfies the first order condition

$$\begin{aligned} (1-\alpha ){{\,\mathrm{{\mathbb {E}}}\,}}(x-Y)_+=\alpha {{\,\mathrm{{\mathbb {E}}}\,}}(Y-x)_+, \end{aligned}$$
(5.2)

and \(e_\alpha (\cdot )\) is a risk measure for \(\alpha \in [1/2,1]\). We mention that condition (5.2) provides a definition for \(Y\in L^1\); it is thus more general than (5.1), which requires \(Y\in L^2\). The Kusuoka representation of the expectile (cf. Bellini et al. (2014, Proposition 9)) is given by

$$\begin{aligned} e_\alpha (Y)=\max _{\gamma \in [0,1-\eta ]}(1-\gamma )\cdot {{\,\mathrm{{\mathbb {E}}}\,}}Y+\gamma \cdot {{\,\mathrm{{\mathsf {AV@R}}}\,}}_{1-\frac{\gamma }{1-\gamma }\frac{\eta }{1-\eta }}(Y), \end{aligned}$$
(5.3)

where \(\eta =\frac{1-\alpha }{\alpha }\), so that the risk level in (5.3) is \(1-\frac{\gamma }{1-\gamma }\frac{\eta }{1-\eta }=\frac{\alpha (2-\gamma )-1}{(2\alpha -1)(1-\gamma )}\). Involving spectral risk measures, the expectile can be recast as

$$\begin{aligned}e_\alpha (Y)=\sup \left\{ {\mathcal {R}}_{\sigma _\gamma }(Y):\sigma _\gamma \in {\mathcal {S}}\right\} , \end{aligned}$$

where \({\mathcal {S}}=\left\{ \sigma _\gamma :\gamma \in \left[ 0,1-\eta \right] \right\} \) collects the spectral functions

$$\begin{aligned} \sigma _\gamma (u)= {\left\{ \begin{array}{ll} 1-\gamma &{}\quad \text {if }u\le 1-\frac{\gamma }{1-\gamma }\frac{\eta }{1-\eta },\\ \frac{1-\gamma }{\eta } &{}\quad \text {else}. \end{array}\right. } \end{aligned}$$
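For discrete samples, the first order condition (5.2) determines the expectile by bisection, since the difference of its two sides is increasing in x. A minimal sketch, with illustrative sample values:

```python
# Empirical expectile via the first order condition (5.2), by bisection.
def expectile(sample, alpha, tol=1e-10):
    def h(x):  # increasing in x; the root is the expectile
        n = len(sample)
        up = sum(max(x - y, 0.0) for y in sample) / n
        down = sum(max(y - x, 0.0) for y in sample) / n
        return (1 - alpha) * up - alpha * down
    lo, hi = min(sample), max(sample)
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if h(mid) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

Y = [0.0, 1.0, 2.0, 9.0]
print(abs(expectile(Y, 0.5) - 3.0) < 1e-8)    # True: e_{1/2} is the mean
print(expectile(Y, 0.8) > expectile(Y, 0.5))  # True: monotone in alpha
```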

The higher order expectile can be described by involving its dual norm (cf. (2.9)), as well as its Kusuoka representation (cf. Corollary 3.6). The following two (sub)sections elaborate these possibilities for the expectile.

5.1 The dual norm of expectiles

The higher order expectile can be described with the dual representation (2.8), for which the dual norm of the expectile is necessary.

By the characterization of the loss function (5.2) it holds that \(e_\alpha (Y)\) is well-defined for \(Y\in L^1(P)\). This is enough to conclude that \({{\,\mathrm{{\mathbb {E}}}\,}}|Y|\le C_\alpha \cdot e_\alpha (|Y|)\) for some constant \(C_\alpha >0\); Lakshmanan and Pichler (2023, Corollary 2.16) elaborate the tight bound \(C_\alpha =\frac{\alpha }{1-\alpha }\). It follows that \({\mathcal {Y}}^*= L^\infty \), so that \(\Vert Z\Vert _\infty \) is well-defined for \(Z\in {\mathcal {Y}}^*\).

The following result provides the dual norm of the expectile explicitly.

Proposition 5.2

(Dual norm of the expectile) For \(\alpha \ge {1/2}\), the dual norm is

$$\begin{aligned} \Vert Z\Vert _\alpha ^*{:}{=}\sup \left\{ {{\,\mathrm{{\mathbb {E}}}\,}}YZ:e_\alpha (|Y|)\le 1\right\} \end{aligned}$$
(5.4)

(cf. (2.7)). It holds that

$$\begin{aligned} \Vert Z\Vert _\alpha ^*=\sup _{\beta \in (0,1)}(1-\beta )\cdot {{\,\mathrm{{\mathsf {AV@R}}}\,}}_{\beta }(|Z|)+\beta \frac{1-\alpha }{\alpha }\Vert Z\Vert _\infty . \end{aligned}$$
(5.5)

Notably, the norm \(\Vert \cdot \Vert _\alpha ^*\) is not a risk measure itself, and (5.5) is not a Kusuoka representation; indeed, the total weight in the representation (5.5) is

$$\begin{aligned} (1-\beta ) + \beta \frac{1-\alpha }{\alpha }< 1 \end{aligned}$$

for \(\alpha \in ({1/2},1]\).

Proof of Proposition 5.2

We may assume that \(Z\ge 0\), as otherwise we may consider \({{\,\textrm{sign}\,}}(Z)\cdot Y\) instead of Y. For arbitrary sets B and G with \(B\subset G\) and \(P(G)<1\) define the random variable

$$\begin{aligned} {\tilde{Y}}_{B,G}(\omega ){:}{=}{\left\{ \begin{array}{ll} 0 &{}\quad \text {if }\omega \in B,\\ 1 &{}\quad \text {if }\omega \in G\setminus B,\text { and}\\ \frac{1-\alpha }{\alpha }\cdot \frac{P(B)}{1-P(G)}+1 &{}\quad \text {else}. \end{array}\right. } \end{aligned}$$
(5.6)

Note that

$$\begin{aligned} (1-\alpha )\cdot P(B)(1-0)= \alpha \cdot \bigl (1-P(G)\bigr )\left( \frac{(1-\alpha )P(B)}{\alpha (1-P(G))}+1-1\right) , \end{aligned}$$

and hence \(e_\alpha ({\tilde{Y}}_{B,G})=1\) by the defining equation (5.2). It follows with (5.4) that

$$\begin{aligned} \Vert Z\Vert _\alpha ^*\ge {{\,\mathrm{{\mathbb {E}}}\,}}Z\,{\tilde{Y}}_{B,G}. \end{aligned}$$

As \(B\subset G\) are arbitrary, we conclude in particular that

$$\begin{aligned} \Vert Z\Vert _\alpha ^*\ge \bigl (1-P(B)\bigr )\cdot {{\,\mathrm{{\mathsf {AV@R}}}\,}}_{P(B)}(Z)+P(B)\frac{1-\alpha }{\alpha }\cdot {{\,\mathrm{{\mathsf {AV@R}}}\,}}_{P(G)}(Z), \end{aligned}$$

because the random variables

$$\begin{aligned} {{\tilde{Y}}}_{B,G}= \bigl (1-P(B)\bigr )\cdot \frac{1}{1-P(B)}{\mathbbm {1}}_{[P(B),1]}(U)+P(B)\frac{1-\alpha }{\alpha }\cdot \frac{1}{1-P(G)}{\mathbbm {1}}_{[P(G),1]}(U) \end{aligned}$$

satisfy all conditions from above for any uniform variable U. Now let \(P(G)\rightarrow 1\); denoting \(\beta {:}{=}P(B)\), it follows that

$$\begin{aligned} \Vert Z\Vert _\alpha ^*\ge \sup _{\beta \in (0,1)}(1-\beta )\cdot {{\,\mathrm{{\mathsf {AV@R}}}\,}}_\beta (Z)+\beta {1-\alpha \over \alpha }\mathop {\mathrm {ess\,sup}}\limits Z, \end{aligned}$$

as \({{\,\mathrm{{\mathsf {AV@R}}}\,}}_\gamma (Z)\rightarrow \mathop {\mathrm {ess\,sup}}\limits Z\) for \(\gamma \rightarrow 1\).

As for the converse, observe that we may assume \(e_\alpha (Y)=1\) for the optimal random variable in (5.4). Consider the Lagrangian

$$\begin{aligned} L(Y;\lambda ,\mu ){:}{=}{{\,\mathrm{{\mathbb {E}}}\,}}ZY-\lambda \bigl ((1-\alpha ){{\,\mathrm{{\mathbb {E}}}\,}}(1-Y)_+-\alpha {{\,\mathrm{{\mathbb {E}}}\,}}(Y-1)_+\bigr )+{{\,\mathrm{{\mathbb {E}}}\,}}\mu Y, \end{aligned}$$
(5.7)

where the Lagrange multiplier \(\lambda \in {\mathbb {R}}\) is associated with the equality constraint \(e_\alpha (Y)=1\), i.e., (5.2), and the measurable variable \(\mu \in L^1\), \(\mu \ge 0\), is associated with the inequality constraint \(Y\ge 0\). Provided that the derivative exists, the first order conditions are

$$\begin{aligned} 0&={\partial \over \partial Y}L(Y;\lambda ,\mu ), \end{aligned}$$

or

$$\begin{aligned} Z&=\lambda \bigl (-(1-\alpha ){\mathbbm {1}}_{\{Y<1\}}-\alpha \,{\mathbbm {1}}_{\{Y>1\}}\bigr )-\mu \cdot {\mathbbm {1}}_{\{Y=0\}}. \end{aligned}$$
(5.8)

Now note that the left-hand side of (5.8) involves the variable Z, while the right-hand side only involves constants, except on \(\{Y=0\}\), where \(\mu \) is not necessarily constant. The first order conditions (5.8) thus hold true on plateaus of Z, if they coincide with \(\{Y<1\}\) or \(\{Y>1\}\); for \(\{Y=0\}\), equation (5.8) is \(\mu =-Z-\lambda (1-\alpha )\); for \(\{Y=1\}\), the derivative of (5.7) does not exist or depends on the direction.

It follows that the optimal Y in (5.4) is exactly of the form (5.6), and hence the assertion. \(\square \)
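The representation (5.5) is amenable to direct numerical evaluation on discrete distributions, approximating the supremum on a finite grid of levels \(\beta \). For \(\alpha =1/2\), the expectile of \(|Y|\) is \({{\,\mathrm{{\mathbb {E}}}\,}}|Y|\), so the norm reduces to the \(L^1\) norm and its dual is the supremum norm, which the sketch below recovers; all helper names are illustrative.

```python
# Evaluating the dual norm formula (5.5) on an equally weighted sample.
def avar(sample, beta):
    """Empirical AV@R via inf_t t + E(Z - t)_+/(1 - beta); for discrete
    distributions the infimum is attained at a sample point."""
    n = len(sample)
    return min(t + sum(max(z - t, 0.0) for z in sample) / ((1.0 - beta) * n)
               for t in sample)

def dual_norm(sample, alpha, grid=999):
    """Right-hand side of (5.5), with the supremum taken on a beta grid."""
    abs_z = [abs(z) for z in sample]
    top = max(abs_z)
    best = -float("inf")
    for k in range(grid):
        beta = k / grid
        val = (1 - beta) * avar(abs_z, beta) + beta * (1 - alpha) / alpha * top
        best = max(best, val)
    return best

Z = [0.5, -1.0, 2.0, -4.0]
print(abs(dual_norm(Z, 0.5) - 4.0) < 1e-9)  # True: alpha = 1/2 gives ||Z||_inf
print(sum(abs(z) for z in Z) / len(Z) - 1e-9 <= dual_norm(Z, 0.8) <= 4.0 + 1e-9)
```

The last line checks the elementary bounds \({{\,\mathrm{{\mathbb {E}}}\,}}|Z|\le \Vert Z\Vert _\alpha ^*\le \Vert Z\Vert _\infty \), which follow from (5.5) at \(\beta =0\) and \(\beta \rightarrow 1\), respectively.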

5.2 Higher order expectiles

The Kusuoka representation (5.3) is the basis for the expectile’s higher order risk measure.

Proposition 5.3

For \(\beta \in (0,1)\), the higher order expectile is

$$\begin{aligned} \bigl (e_\alpha \bigr )_\beta (Y)=\max _{\gamma \in [0,1-\eta ]} {\left\{ \begin{array}{ll} \left( 1-\frac{\gamma }{1-\beta }\right) {{\,\mathrm{{\mathsf {AV@R}}}\,}}_{\frac{\beta }{1-\gamma }}(Y)+\frac{\gamma }{1-\beta }{{\,\mathrm{{\mathsf {AV@R}}}\,}}_{1-\frac{\gamma }{1-\gamma }\frac{\eta }{1-\eta }}(Y) &{}\quad \text {if }\frac{\gamma }{1-\beta }<1-\eta ,\\ {{\,\mathrm{{\mathsf {AV@R}}}\,}}_{1-(1-\beta )(1-{\tilde{\alpha }})}(Y) &{}\quad \text {else}, \end{array}\right. } \end{aligned}$$
(5.9)

where \(\eta =\frac{1-\alpha }{\alpha }\) (as above) and \({\tilde{\alpha }}{:}{=}1-\frac{\eta }{1-\gamma }\).

Proof

The measure in the Kusuoka representation (5.3) is \(\mu (\cdot )=(1-\gamma )\delta _0 +\gamma \cdot \delta _{1-\frac{\gamma }{1-\gamma }\frac{\eta }{1-\eta }}\). To apply Corollary 3.7 we set \(p_1{:}{=}1-\gamma \) and \(p_2{:}{=}\gamma \); the corresponding risk levels are \(\alpha _1=0\) and \(\alpha _2 = 1-\frac{\gamma }{1-\gamma }\frac{\eta }{1-\eta }\). The mixed risk level is \({\tilde{\alpha }}{:}{=}\frac{\alpha _1\frac{p_1}{1-\alpha _1}+\alpha _2\frac{p_2}{1-\alpha _2}}{\frac{p_1}{1-\alpha _1}+\frac{p_2}{1-\alpha _2}} =\frac{\alpha (2-\gamma )-1}{\alpha (1-\gamma )}= 1-\frac{\eta }{1-\gamma }\).

We distinguish the cases \(\frac{\gamma }{1-\beta }<1-\eta \) and \(\frac{\gamma }{1-\beta }\ge 1-\eta \), which are equivalent to \(u_\beta \lessgtr \alpha _2\), i.e., \(1-\frac{1-\beta }{\frac{1-\gamma }{1-\alpha _1}+\frac{\gamma }{1-\alpha _2}}\lessgtr \alpha _2\) in view of (3.20). In the first case, the critical equation (3.16) is \((1-\gamma )u_\beta = \beta \), while it is \((1-\gamma )u_\beta +\gamma \frac{u_\beta -\alpha _2}{1-\alpha _2}=\beta \) in the other case; the solutions thus are \(u_\beta =\frac{\beta }{1-\gamma }\) and \(u_\beta =\frac{\alpha (2-\beta -\gamma )-1+\beta }{\alpha (1-\gamma )}\). The corresponding weights \(p_0\) (cf. (3.16) again) are \(p_0=\frac{1-u_\beta }{1-\beta }(1-\gamma )\), or \(p_0=\frac{1-u_\beta }{1-\beta }\left( \frac{1-\gamma }{1-0}+\frac{\gamma }{1-\alpha _2}\right) =1\). Finally, note that \(u_\beta = 1-(1-\beta )(1-{\tilde{\alpha }})\).

The assertion follows with (3.17) and (3.18) in Corollary 3.7. \(\square \)

The average value-at-risk is ‘closed under higher orders’, as its higher order variant is an average value-at-risk as well (cf. (3.12)). This is not the case for the expectile, as the first term in (5.9) is not an expectation as in the genuine Kusuoka representation (5.3). Repeating the construction and passing to higher order expectiles leads to more complicated risk measures.
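Since the higher order expectile is itself the value of an optimization problem (cf. (2.3)), it can be approximated by nested numerical routines. The sketch below (bisection for the inner expectile, grid search over t, illustrative sample) checks the elementary bounds \({{\,\mathrm{{\mathsf {AV@R}}}\,}}_\beta (Y)\le (e_\alpha )_\beta (Y)\), which holds as \(e_\alpha \ge {{\,\mathrm{{\mathbb {E}}}\,}}\) for \(\alpha \ge 1/2\), and \((e_\alpha )_\beta (Y)\lessapprox \max Y\) up to the grid resolution.

```python
# Nested numerical approximation of the higher order expectile (e_alpha)_beta.
def expectile(sample, alpha, tol=1e-10):
    """Empirical expectile via the first order condition (5.2), by bisection."""
    def h(x):
        n = len(sample)
        return ((1 - alpha) * sum(max(x - y, 0.0) for y in sample) / n
                - alpha * sum(max(y - x, 0.0) for y in sample) / n)
    lo, hi = min(sample), max(sample)
    while hi - lo > tol:
        mid = (lo + hi) / 2
        lo, hi = (mid, hi) if h(mid) < 0 else (lo, mid)
    return (lo + hi) / 2

def higher_order_expectile(sample, alpha, beta, grid=400):
    """(e_alpha)_beta by grid search over t in the defining problem (2.3)."""
    lo, hi = min(sample) - 1.0, max(sample) + 1.0
    best = float("inf")
    for k in range(grid + 1):
        t = lo + (hi - lo) * k / grid
        shifted = [max(y - t, 0.0) for y in sample]
        best = min(best, t + expectile(shifted, alpha) / (1.0 - beta))
    return best

def avar(sample, beta):
    """Empirical AV@R; the infimum is attained at a sample point."""
    n = len(sample)
    return min(t + sum(max(y - t, 0.0) for y in sample) / ((1.0 - beta) * n)
               for t in sample)

Y = [0.0, 1.0, 2.0, 9.0]
hoe = higher_order_expectile(Y, alpha=0.8, beta=0.5)
print(avar(Y, 0.5) - 1e-6 <= hoe <= max(Y) + 0.05)  # True
```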

Remark 5.4

The results in Mafusalov and Uryasev (2016) on stochastic properties of the average value-at-risk reveal relations and risk measures similar to those exposed in (5.9).

6 Summary

Higher order risk measures naturally integrate with stochastic optimization, as they are stochastic optimization problems themselves. This paper presents and derives explicit forms of higher order risk measures, specifically for spectral risk measures. These risk measures constitute the central building block of general law invariant risk measures.

Extending these results, it is demonstrated that stochastic dominance relations can be characterized by employing higher order risk measures, and vice versa. We provide a verification theorem, which makes higher order stochastic dominance relations accessible to numerical computations.

The results are exemplified for expectiles, a specific risk measure with unique properties.