1 Introduction and motivation

The well-known Cramér-Wold theorem (see Cramér and Wold 1936) states that any distribution on \(\mathbb {R}^{d}\) is determined by the collection of its one-dimensional projections. In other words, if X = (X1,…, Xd) and Y = (Y1,…, Yd) are two random vectors (rvs), we have

$$ \boldsymbol{X}\overset{d}{=}\boldsymbol{Y} \Leftrightarrow \forall t_{1},\ldots,t_{d}\in \mathbb{R}, \ \sum\limits_{k=1}^{d} t_{k} X_{k} \overset{d}{=}\sum\limits_{k=1}^{d} t_{k} Y_{k}. $$

The Cramér-Wold theorem is well adapted to the case when the individual components of X and Y behave nicely with respect to summation. There are, however, important examples of situations in which this is not the case. For instance, in multivariate extreme value theory, the individual components of X may represent marginal financial or actuarial risk variables, whose distributions would typically be modeled by heavy-tailed distributions such as the Pareto distribution (see e.g. Embrechts et al. 1997; Resnick 2007). Another example is the class of multivariate max-stable distributions, where usually the marginals are assumed to be unit Fréchet. Calculations involving sums of Pareto or Fréchet distributed rvs are typically very complicated, even if these rvs are independent (see e.g. Blum 1970; Nadarajah and Pogány 2013; Nadarajah et al. 2018). The relevant operator in this kind of situation is the maximum rather than the sum, leading one to consider instead the collection of max-linear projections of X, that is:

$$ \bigvee_{k=1}^{d} t_{k} X_{k} := \max_{1\leq k\leq d} t_{k} X_{k}, \qquad t_{1},\ldots,t_{d} > 0. $$

It turns out, somewhat surprisingly, that the distributions of such projections also characterize multivariate distributions, provided these distributions are concentrated on the nonnegative orthant. This is the focus of our first result, which does not seem to have been shown in the literature so far.

Proposition 1.1.

We have, for arbitrary random vectors X = (X1,…, Xd) and Y = (Y1,…, Yd) with nonnegative components:

$$ \boldsymbol{X}\overset{d}{=}\boldsymbol{Y} \Leftrightarrow \forall t_{1},\ldots,t_{d} > 0, \ \bigvee_{k=1}^{d} t_{k} X_{k} \overset{d}{=} \bigvee_{k=1}^{d} t_{k} Y_{k}. $$

Proof.

Since, for any t1,…, td > 0,

$$ \mathbb{P}\left( \bigvee_{k=1}^{d} t_{k} X_{k} \leq 1 \right) = \mathbb{P}\left( X_{1}\leq 1/t_{1},\ldots, X_{d}\leq 1/t_{d} \right), $$

the knowledge of the distribution of the max-linear projections of X is equivalent to the knowledge of the distribution function (df) F of X at any point \(\boldsymbol {x} \in (0,\infty )^{d}\). By right-continuity of F, this is equivalent to the knowledge of F on \([0,\infty )^{d}\). Since X is concentrated on \([0,\infty )^{d}\), the result follows. □

That the distribution of a componentwise nonnegative rv is characterized by the collection of distributions of its max-linear projections is nicely linked to the notion of max-characteristic function (max-CF), introduced by Falk and Stupfler (2017): if X = (X1,…, Xd) has nonnegative and integrable components, then the knowledge of the mapping

$$ \begin{array}{@{}rcl@{}} \varphi_{\boldsymbol{X}}(\boldsymbol{t}) &=& \mathbb{E}(\max(1,t_{1} X_{1},\ldots,t_{d} X_{d}))\\ &=& \mathbb{E}\left( \max\left( 1, \bigvee_{k=1}^{d} t_{k} X_{k} \right) \right), \quad t_{1},\ldots,t_{d}>0, \end{array} $$

characterizes the distribution of the rv X. This notion of max-characteristic function is particularly interesting when considering standard extreme value distributions such as the Generalized Pareto distribution, for which it has a simple closed form, although the standard characteristic function based on taking a Fourier transform does not (see Falk and Stupfler 2017). However, this notion requires the integrability of the components of X; its generalization to random vectors without sign constraints, suggested by Falk and Stupfler (2019), even requires an exponential moment. This is of course a serious restriction.

The motivation for this work resides in combining this last remark with the following observation. For d = 2 and any t1, t2 > 0, one clearly has

$$ \max(t_{1} X_{1}, t_{2} X_{2}) = t_{1} X_{1} + t_{2} X_{2} - \min(t_{1} X_{1},t_{2} X_{2}). $$

By Proposition 1.1, we know that the distribution of (X1, X2) is characterized by the collection of distributions of the max-linear projections \(\max \limits (t_{1} X_{1}, t_{2} X_{2})\) when t1 and t2 vary. We also know, by the Cramér-Wold theorem, that it is characterized by the collection of distributions of the one-dimensional projections t1X1 + t2X2. We may therefore ask whether the distribution of (X1, X2) is determined by the collection of distributions of \(\min \limits (t_{1} X_{1},t_{2} X_{2})\), when t1, t2 range over \((0,\infty )\). More generally, we may ask if the distribution of a d-dimensional rv X is characterized by the min-linear projections

$$ \bigwedge_{k=1}^{d} t_{k} X_{k} := \min_{1\leq k\leq d} t_{k} X_{k}, \ t_{1},\ldots,t_{d} > 0. $$

By analogy with the notion of max-CF, this would then suggest to define the following notion of min-characteristic function:

$$ \begin{array}{@{}rcl@{}} \psi_{\boldsymbol{X}}(\boldsymbol{t}) &=& \mathbb{E}(\min(1,t_{1} X_{1},\ldots,t_{d} X_{d}))\\ &=& \mathbb{E}\left( \min\left( 1, \bigwedge_{k=1}^{d} t_{k} X_{k} \right) \right), \ t_{1},\ldots,t_{d} > 0, \end{array} $$

which, unlike the max-CF, does not require any integrability on the components of X, since

$$ \forall t_{1},\ldots,t_{d}>0, \ 0\leq \min(1,t_{1} X_{1},\ldots,t_{d} X_{d}) \leq 1 \ \text{ almost surely.} $$

If we require the distribution of X to be concentrated on \((0,\infty )^{d}\), then it is indeed characterized by its min-linear projections, as our next result shows. We denote throughout by \(\mathbb {X}_{d}\) the set of all rvs \(\boldsymbol {X}=(X_{1},\dots ,X_{d})\) on \(\mathbb {R}^{d}\) with almost surely positive components (i.e. Xi > 0 for any i).

Proposition 1.2.

Let X = (X1,…, Xd) and Y = (Y1,…, Yd) be two rvs in \(\mathbb {X}_{d}\). Then

$$ \boldsymbol{X}\overset{d}{=}\boldsymbol{Y} \Leftrightarrow \forall t_{1},\ldots,t_{d} > 0, \ \bigwedge_{k=1}^{d} t_{k} X_{k} \overset{d}{=} \bigwedge_{k=1}^{d} t_{k} Y_{k}. $$

Proof.

Note that, for any t1,…, td > 0,

$$ \mathbb{P}\left( \bigwedge_{k=1}^{d} t_{k} X_{k} > 1 \right) = \mathbb{P}\left( X_{1}> 1/t_{1},\ldots, X_{d}> 1/t_{d} \right). $$

By right-continuity of a multivariate df, the knowledge of the distribution of the min-linear projections of X is therefore equivalent to the knowledge of the probabilities \(\mathbb {P}(X_{1}> x_{1},\ldots , X_{d}> x_{d})\), for any x1,…, xd ≥ 0. Since all components of X are assumed to be strictly positive, we obtain that, for any k ∈{1,…, d}, all indices i1 < … < ik in {1,…, d} and \(x_{i_{1}},\ldots ,x_{i_{k}} \geq 0\), the probabilities \(\mathbb {P}(X_{i_{1}}> x_{i_{1}},\ldots , X_{i_{k}}> x_{i_{k}} )\) are also determined. The result now follows by writing

$$ \mathbb{P}(X_{1}\leq x_{1},\ldots, X_{d}\leq x_{d}) = 1-\mathbb{P}\left( \bigcup_{k=1}^{d} \{ X_{k}>x_{k} \} \right) $$

and using the inclusion-exclusion principle. □

In measure-theoretic terms, unlike max-linear projections, min-linear projections cannot in general characterize a distribution which puts mass on \([0,\infty )^{d} \setminus (0,\infty )^{d}\), because the class of quadrants

$$ \left\{ {\prod}_{k=1}^{d} (a_{k},\infty), \ a_{1},\ldots,a_{d}\geq 0 \right\} $$

is an intersection-stable system of open sets but only generates the Borel σ-algebra on \((0,\infty )^{d}\). A simple illustrative example is the following: if X has a Bernoulli distribution with parameter p ∈ (0,1), then clearly \(\min \limits (t_{1} X, t_{2} (1-X)) = 0\) for all t1, t2 > 0, but (X,1 − X) does not have the same distribution as the degenerate vector (0,0).

The present work builds on Proposition 1.2. The paper is organized as follows: in Section 2 we show that the function

$$ \psi_{\boldsymbol{X}} : \boldsymbol{t} = (t_{1},\ldots,t_{d}) \in [0,\infty)^{d} \mapsto \mathbb{E}(\min(1,t_{1} X_{1},\ldots,t_{d} X_{d})) $$
(1)

indeed characterizes the distribution of any rv \(\boldsymbol {X} \in \mathbb {X}_{d}\). Referring to this function as the min-characteristic function (min-CF) of X, we derive basic properties of the min-CF, including an inversion formula. One of the most intriguing results we find is that the min-CF induces a continuous and concave df. In Section 3.1 we examine the sequential behavior of min-CFs with respect to convergence in distribution, and we consider the asymptotic properties of the empirical min-CF (that is, the random min-CF generated by the empirical df of a sample of independent and identically distributed rvs) in Section 3.2. Some insight into the structure of the set of min-CFs, such as its convexity, is provided in Section 4. Finally, in Section 5, we initiate the study of an extension of the min-CF to arbitrary, not necessarily componentwise positive rvs X, by using transformations of the components of X.

2 The Min-characteristic Function for Positive Random Vectors

The fundamental result of this paper, stated below, is that the mapping in Eq. 1 characterizes the distribution of any rv \(\boldsymbol {X} \in \mathbb {X}_{d}\). Here and throughout, any operation on vectors such as +,≥,>,… is meant componentwise.

Theorem 2.1.

Let X = (X1,…, Xd) and Y = (Y1,…, Yd) be two rvs in \(\mathbb {X}_{d}\). Then

$$ \boldsymbol{X}\overset{d}{=}\boldsymbol{Y} \Leftrightarrow \forall \boldsymbol{t}=(t_{1},\ldots,t_{d}) > \boldsymbol{0}, \ \psi_{\boldsymbol{X}}(\boldsymbol{t}) = \psi_{\boldsymbol{Y}}(\boldsymbol{t}). $$

Proof.

By 1-homogeneity of the \(\min \limits \) operator, we have

$$ \begin{array}{@{}rcl@{}} \forall t_{1},\ldots,t_{d} > 0, \ \mathbb{E}(\min(1,t_{1} X_{1},\ldots,t_{d} X_{d})) &=& \mathbb{E}(\min(1,t_{1} Y_{1},\ldots,t_{d} Y_{d})) \\ \Leftrightarrow \forall x,t_{1},\ldots,t_{d} > 0, \ \mathbb{E}(\min(x,t_{1} X_{1},\ldots,t_{d} X_{d})) &=& \mathbb{E}(\min(x,t_{1} Y_{1},\ldots,t_{d} Y_{d})). \end{array} $$

Now, for any positive rv Z and any x > 0,

$$ \mathbb{E}(\min(x,Z)) = {\int}_{0}^{\infty} \mathbb{P}(\min(x,Z) > u) du = {{\int}_{0}^{x}} \mathbb{P}(Z > u) du. $$

Differentiating from the right with respect to x shows that the knowledge of \(\mathbb {E}(\min \limits (x,Z))\), for any x > 0, entails that of \(\mathbb {P}(Z > x)\) for any x > 0 and thus of the distribution of the positive rv Z. Applying this to the rv \(Z=\min \limits (t_{1} X_{1},\ldots ,t_{d} X_{d})\) for arbitrary t1,…, td > 0 and using Proposition 1.2 concludes the proof. □

Definition 2.2.

The min-characteristic function (min-CF) of \(\boldsymbol {X}=(X_{1},\dots ,\) \(X_{d})\in \mathbb {X}_{d}\) is the function ψX on \([0,\infty )^{d}\) defined by

$$ \forall \boldsymbol{t}=(t_{1},\ldots,t_{d}) \in [0,\infty)^{d}, \ \psi_{\boldsymbol{X}}(\boldsymbol{t}) = \mathbb{E}(\min(1,t_{1} X_{1},\ldots,t_{d} X_{d})). $$

One immediate benefit of using the min-CF rather than the max-CF is that it does not require any integrability assumption on the components of X. At the same time, it can be calculated in much the same way and thus generally applies to the same kind of distributions the max-CF is well-suited to, thanks to the following basic formula.

Lemma 2.3.

We have, for \(\boldsymbol {X}=(X_{1},\ldots ,X_{d})\in \mathbb {X}_{d}\), the identity

$$ \psi_{\boldsymbol{X}}(\boldsymbol{t}) = {{\int}_{0}^{1}} \mathbb{P}(X_{1}>u/t_{1},\ldots, X_{d}>u/t_{d}) du, \qquad \boldsymbol{t}=(t_{1},\ldots,t_{d}) \in (0,\infty)^{d}. $$

This formula is indeed similar in spirit to the identity

$$ \mathbb{E}(\max(1,t_{1} X_{1},\ldots,t_{d} X_{d})) = 1+{\int}_{1}^{\infty} \left[ 1-\mathbb{P}(X_{1}\leq u/t_{1},\ldots, X_{d}\leq u/t_{d}) \right] du $$

making it possible to calculate the max-CF of a componentwise nonnegative and integrable rv (see Falk and Stupfler 2017).

Proof.

Use the identity

$$ \mathbb{E}(\min(1,Z)) = {{\int}_{0}^{1}} \mathbb{P}(Z > u) du $$

valid for any positive rv Z, with \(Z=\min \limits (t_{1} X_{1},\ldots ,t_{d} X_{d})\). □

We give a short list of examples next.

Example 2.1 (Exponential distribution).

The min-CF of a rv X having the exponential distribution with mean 1/λ, λ > 0, is given by

$$ \forall t>0, \ \psi_{\lambda}(t) = {{\int}_{0}^{1}} e^{-\lambda u/t} du = \frac{t}{\lambda}\left[ 1-e^{-\lambda/t} \right]. $$
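As a quick numerical sanity check, this closed form can be compared with a direct quadrature of the survival function appearing in Lemma 2.3; the following Python sketch is an illustration of ours (the libraries and parameter values are not part of the example above).

```python
# Minimal sketch (ours): check the closed-form exponential min-CF of
# Example 2.1 against a direct quadrature of the survival function (Lemma 2.3).
import numpy as np
from scipy.integrate import quad

def psi_exp_closed_form(t, lam):
    # psi_lambda(t) = (t / lambda) * (1 - exp(-lambda / t))
    return (t / lam) * (1.0 - np.exp(-lam / t))

def psi_exp_quadrature(t, lam):
    # Lemma 2.3: psi(t) = int_0^1 P(X > u/t) du, with P(X > x) = exp(-lambda * x)
    value, _ = quad(lambda u: np.exp(-lam * u / t), 0.0, 1.0)
    return value

lam = 2.0  # illustrative value of lambda
for t in (0.1, 0.5, 1.0, 3.0):
    print(t, psi_exp_closed_form(t, lam), psi_exp_quadrature(t, lam))
```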

Example 2.2 (Pareto distribution).

The min-CF of a rv X having the Pareto distribution with tail index γ > 0, namely, with df \(\mathbb {P}(X\leq x)=1-x^{-1/\gamma }\), x ≥ 1, is given by

$$ \forall t>0, \ \psi_{\gamma}(t) = {{\int}_{0}^{1}} \mathbb{P}(X>u/t)\, du = {{\int}_{0}^{\min(t,1)}} du + {{\int}_{\min(t,1)}^{1}} \left( \frac{u}{t} \right)^{-1/\gamma} du. $$

Consequently,

$$ \begin{array}{@{}rcl@{}} \psi_{1}(t) &=& \left\{\begin{array}{ll} 1 & \text{if } t\geq 1, \\ t(1-\log t) & \text{if } t\in (0,1) \end{array}\right. \text{ and } \\ \forall \gamma\neq 1, \ \psi_{\gamma}(t) &=& \left\{\begin{array}{ll} 1 & \text{if } t\geq 1, \\ \frac{t-\gamma t^{1/\gamma}}{1-\gamma} & \text{if } t\in (0,1). \end{array}\right. \end{array} $$

Example 2.3 (Generalized Pareto distribution).

The min-CF of a rv X having the Generalized Pareto distribution with location parameter μ ≥ 0, scale parameter σ > 0 and tail index ξ > 0, namely, with df

$$ \mathbb{P}(X\leq x)=1-\left( 1+\xi\frac{x-\mu}{\sigma} \right)^{-1/\xi}, \ x\geq \mu, $$

is given by

$$ \psi_{(\mu,\sigma,\xi)}(t) = {{\int}_{0}^{1}} \mathbb{P}(X>u/t)\, du = \left\{\begin{array}{ll} 1 & \text{if } t\geq 1/\mu, \\ \mu t + {\int}_{\mu t}^{1} \left( 1+\xi\frac{u/t-\mu}{\sigma} \right)^{-1/\xi} du & \text{if } t<1/\mu. \end{array}\right. $$

Consequently,

$$ \psi_{(\mu,\sigma,1)}(t) = \left\{\begin{array}{ll} 1 & \text{if } t\geq 1/\mu, \\ t\left( \mu + \sigma \log \left[ 1+\frac{1-\mu t}{\sigma t} \right] \right) & \text{if } t<1/\mu, \end{array}\right. $$

and for any ξ≠ 1,

$$ \psi_{(\mu,\sigma,\xi)}(t) = \left\{\begin{array}{ll} 1 & \text{if } t\geq 1/\mu, \\ t\left( \mu + \frac{\sigma}{1-\xi} - \frac{\sigma}{1-\xi} \left[ 1+\xi\frac{1-\mu t}{\sigma t} \right]^{1-1/\xi} \right) & \text{if } t<1/\mu. \end{array}\right. $$

This is readily seen to agree with the max-CF calculation in Example 1.3 of Falk and Stupfler (2017), for the case ξ ∈ (0,1), thanks to the identity

$$ \psi_{(\mu,\sigma,\xi)}(t) = \mathbb{E}(\min(1,t X)) = 1 + t\mathbb{E}(X) - \mathbb{E}(\max(1,t X)) $$

valid in this case where \(\mathbb {E}(X) = \mu + \sigma /(1-\xi )<\infty \).

Example 2.4 (Unit Fréchet distribution).

The min-CF of a rv X having a unit Fréchet distribution, namely, with df \(\mathbb {P}(X\leq x)=e^{-1/x}\), x > 0, is given by

$$ \forall t>0, \ \psi(t) = {{\int}_{0}^{1}} \left[ 1-e^{-t/u} \right] du = 1 - {\int}_{1}^{\infty} \frac{e^{-tv}}{v^{2}} dv =: 1-E_{2}(t) $$

in the notation of Abramowitz and Stegun (1972, Formula 5.1.4 p.228).

Example 2.5 (Independent unit Fréchet variables).

The min-CF of a rv X = (X1,…, Xd), whose components are independent unit Fréchet distributed, is

$$ \begin{array}{@{}rcl@{}} \psi(\boldsymbol{t}) &=& {{\int}_{0}^{1}} {\prod}_{k=1}^{d} \left[ 1-e^{-t_{k}/u} \right] du \\ &=& 1-{{\int}_{0}^{1}} {\sum}_{k=1}^{d} (-1)^{k-1} {\sum}_{1\leq i_{1}<\cdots<i_{k}\leq d} \exp\left( -\frac{1}{u} [t_{i_{1}}+\cdots+t_{i_{k}}] \right) du \\ &=& 1-{\sum}_{k=1}^{d} (-1)^{k-1} {\sum}_{1\leq i_{1}<\cdots<i_{k}\leq d} E_{2}(t_{i_{1}}+\cdots+t_{i_{k}}), \end{array} $$

for any \(\boldsymbol {t}=(t_{1},\ldots ,t_{d}) \in (0,\infty )^{d}\), with the notation of Example 2.4.
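This inclusion-exclusion formula is easy to check numerically, since E2 is available as scipy.special.expn(2, ·); the sketch below (a check of ours, with an arbitrary illustrative t in dimension d = 3) compares it with a direct quadrature of the integral on the first line.

```python
# Sketch (ours): verify the inclusion-exclusion formula of Example 2.5 for
# independent unit Frechet components, using E_2 = scipy.special.expn(2, .).
from itertools import combinations
import numpy as np
from scipy.integrate import quad
from scipy.special import expn

def psi_quadrature(t):
    # direct evaluation of int_0^1 prod_k (1 - exp(-t_k / u)) du
    integrand = lambda u: np.prod(1.0 - np.exp(-np.asarray(t) / u))
    return quad(integrand, 0.0, 1.0)[0]

def psi_inclusion_exclusion(t):
    d = len(t)
    total = 1.0
    for k in range(1, d + 1):
        for idx in combinations(range(d), k):
            total -= (-1) ** (k - 1) * expn(2, sum(t[i] for i in idx))
    return total

t = (0.7, 1.3, 2.1)  # illustrative value of t in dimension d = 3
print(psi_quadrature(t), psi_inclusion_exclusion(t))
```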

It is already apparent from the definition of a min-CF that it is a componentwise nondecreasing function on \([0,\infty )^{d}\). A further list of elementary properties of the min-CF is given in our next result.

Proposition 2.4.

Choose \(\boldsymbol {X}=(X_{1},\ldots ,X_{d})\in \mathbb {X}_{d}\) and let ψX be its min-CF. We have, for \(\boldsymbol {t}=(t_{1},\ldots ,t_{d}) \in [0,\infty )^{d}\):

  • (i) 0 ≤ ψX(t) ≤ 1.

  • (ii) ψX(t) = 0 if and only if ti = 0 for some i ∈{1,…, d}.

  • (iii) ψX(t) → 1 as \(\min \limits (t_{1},\ldots ,t_{d})\to \infty \).

  • (iv) \(\boldsymbol{X}\ge\boldsymbol{c}>\boldsymbol{0}\) almost surely if and only if ψX(t) = 1 for all \(\boldsymbol{t}\ge \boldsymbol{1}/\boldsymbol{c}\).

  • (v) If Y is another rv in \(\mathbb {X}_{d}\), then

    $$ \sup_{\boldsymbol{t}\geq \boldsymbol{0}} |\psi_{\boldsymbol{X}}(\boldsymbol{t})-\psi_{\boldsymbol{Y}}(\boldsymbol{t})| \leq \sup_{\boldsymbol{t}\geq \boldsymbol{0}} |\mathbb{P}(\boldsymbol{X} > \boldsymbol{t}) - \mathbb{P}(\boldsymbol{Y} > \boldsymbol{t})|. $$
  • (vi) The function ψX is a continuous and concave df on \([0,\infty )^{d}\).

  • (vii) The function \(\boldsymbol{t}\mapsto\psi_{\boldsymbol{X}}(\boldsymbol{1}/\boldsymbol{t})\), \(\boldsymbol {t}>\boldsymbol {0}\in \mathbb {R}^{d}\), is a (continuous) survival function, in the sense that there exists a rv \(\boldsymbol {Y}\in \mathbb {X}_{d}\) with \(\psi _{\boldsymbol {X}}(\boldsymbol {1}/\boldsymbol {t})=\mathbb {P}(\boldsymbol {Y}>\boldsymbol {t})\).

The most interesting result here is probably Proposition 2.4(vi): combined with Theorem 2.1, it shows that the df of any rv in \(\mathbb {X}_{d}\), no matter how irregular, is characterized by an associated continuous and concave df, which is its min-CF.

Proof.

Assertions (i)–(iv) are elementary. Assertion (v) is a straightforward consequence of Lemma 2.3. We prove (vi). The function ψX is clearly continuous. Its concavity follows from that of the function \((x_{1},\ldots ,x_{d})\mapsto {\min \limits } (1,x_{1},\ldots ,x_{d})\) on \([0,\infty )^{d}\). It only remains to prove that ψX is a df. Since ψX(0) = 0 and ψX(t) → 1 as \(\min \limits (t_{1},\ldots ,t_{d})\to \infty \), it is sufficient to prove that ψX is Δ-monotone (see Reiss 1989, Equation (2.2.19)), i.e., for any \(\boldsymbol {0}\le \boldsymbol {a}\le \boldsymbol {b}\in \mathbb {R}^{d}\),

$$ {\sum}_{T\subset \left\{1,\dots,d\right\}}(-1)^{d-|T|}\, \psi_{\boldsymbol{X}}\left( b_{i}, i\in T; \ a_{j}, j\notin T\right) = \mathbb{E}\left( {\sum}_{T\subset \left\{1,\dots,d\right\}}(-1)^{d-|T|} \min\left\{1; \ b_{i}X_{i}, i\in T; \ a_{j}X_{j}, j\notin T\right\} \right) \geq 0. $$

To show this it suffices to establish that the integrand in the above expectation is always nonnegative. Let U be a random variable which follows the uniform distribution on [0,1]. Using repeatedly the identity

$$ \mathbb{P} (U\in (s,t], U\leq u) = \mathbb{P}(U\leq t, U\leq u) - \mathbb{P} (U\leq s, U\leq u) $$

valid for any st and u, we find that for any \(\boldsymbol {0}\le \boldsymbol {c}\le \boldsymbol {d}\in \mathbb {R}^{d}\), we have,

$$ \begin{array}{@{}rcl@{}} 0 &\le& \mathbb{P}(U\in (\min(1,c_{i}),\min(1,d_{i})], 1\le i\le d)\\ &=& {\sum}_{T\subset \left\{1,\dots,d\right\}}(-1)^{d-|T|} \mathbb{P}\left( U\le \min(1,d_{i}),i\in T; U\le \min(1,c_{j}),j\notin T\right)\\ &=& {\sum}_{T\subset \left\{1,\dots,d\right\}}(-1)^{d-|T|}\min\left\{1; d_{i},i\in T; c_{j},j\notin T\right\}. \end{array} $$

With c = (a1X1,…, adXd) and d = (b1X1,…, bdXd) this yields the desired inequality. Finally, part (vii) is an immediate consequence of (vi): ψX is a (continuous) df of some rv \(\boldsymbol {Z}\in \mathbb {X}_{d}\), i.e. \(\psi _{\boldsymbol {X}}(\boldsymbol {t})=\mathbb {P}(\boldsymbol {Z}\le \boldsymbol {t})\), \(\boldsymbol {t}>\boldsymbol {0}\in \mathbb {R}^{d}\). Then \(\psi _{\boldsymbol {X}}(\boldsymbol {1}/\boldsymbol {t})=\mathbb {P}(\boldsymbol {Z}\le \boldsymbol {1}/\boldsymbol {t})=\mathbb {P}(\boldsymbol {Y}\ge \boldsymbol {t})=\mathbb {P}(\boldsymbol {Y}>\boldsymbol {t})\), with Y := 1/Z. □

Of course, since the min-CF identifies distributions concentrated in the positive orthant of \(\mathbb {R}^{d}\), it is important to find the inversion formula making it possible to go from a min-CF to its pertaining distribution. Since, by Lemma 2.3, computing the min-CF essentially consists in integrating the survival function, it makes sense to expect that a survival function can be recovered by differentiating the pertaining min-CF in a suitable way. Making this intuition rigorous is the focus of the next result.

Theorem 2.5.

For \(\boldsymbol {X}=(X_{1},\ldots ,X_{d})\in \mathbb {X}_{d}\) with min-CF ψX, we have

$$ \begin{array}{@{}rcl@{}} \forall \boldsymbol{x}&=&(x_{1},\ldots,x_{d})\in (0,\infty)^{d}, \\ \mathbb{P}(X_{1}>x_{1},\ldots, X_{d}>x_{d}) &=& \frac{\partial_{+}}{\partial t} \left\{ t \psi_{\boldsymbol{X}}\left( \frac{1}{t\boldsymbol{x}} \right) \right\} |_{{t=1}} \end{array} $$

where ∂+/t denotes differentiation from the right with respect to t.

Proof.

By Lemma 2.3, we find, for any t > 0 and \((x_{1},\ldots ,x_{d})\in (0,\infty )^{d}\),

$$ \begin{array}{@{}rcl@{}} t \psi_{\boldsymbol{X}}\left( \frac{1}{t\boldsymbol{x}} \right) &=& t{{\int}_{0}^{1}} \mathbb{P}(X_{1}>u t x_{1},\ldots, X_{d}>u t x_{d}) du \\ &=& {{\int}_{0}^{t}} \mathbb{P}(X_{1}>v x_{1},\ldots, X_{d}>v x_{d}) dv. \end{array} $$

Conclude by differentiating from the right with respect to t and taking t = 1. □
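As a numerical illustration of the inversion formula, the right derivative can be approximated by a forward difference quotient; the sketch below (ours, in dimension d = 1, using the Pareto min-CF of Example 2.2 with γ = 1) recovers the survival function in this way.

```python
# Sketch (ours): numerical illustration of the inversion formula of Theorem 2.5
# in dimension d = 1, for the Pareto distribution with tail index gamma = 1,
# whose survival function is P(X > x) = 1/x for x >= 1 (Example 2.2).
import numpy as np

def psi_pareto(t):
    # min-CF of the Pareto(gamma = 1) distribution
    return 1.0 if t >= 1.0 else t * (1.0 - np.log(t))

def survival_from_min_cf(psi, x, h=1e-6):
    # right derivative of t -> t * psi(1 / (t * x)) at t = 1 (forward difference)
    g = lambda t: t * psi(1.0 / (t * x))
    return (g(1.0 + h) - g(1.0)) / h

for x in (1.5, 2.0, 5.0):
    print(x, survival_from_min_cf(psi_pareto, x), 1.0 / x)
```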

It should be apparent from this result that, while a max-CF is adapted to working with the joint df (see Falk and Stupfler 2017, Proposition 2.15), the min-CF is rather adapted to working with the joint survival function. The next example illustrates this point nicely.

Example 2.6 (Exponential distribution in several dimensions).

Let X be a rv in \(\mathbb {R}^{d}\) which follows a min-stable distribution with standard exponential margins \(\mathbb {P}(X_{i}>x)=\exp (-x)\), x ≥ 0. This is equivalent to assuming that there exists a D-norm ||⋅||D on \(\mathbb {R}^{d}\) such that

$$ \mathbb{P}(\boldsymbol{X} > \boldsymbol{x}) =\exp\left( -\left\Vert\boldsymbol{x}\right\Vert_{D}\right), \ \boldsymbol{x}\ge\boldsymbol{0}\in\mathbb{R}^{d}; $$

see Falk (2019, Equation (2.27)). Then we have, for \(\boldsymbol {t}>\boldsymbol {0}\in \mathbb {R}^{d}\),

$$ \begin{array}{@{}rcl@{}} \psi_{\boldsymbol{X}}(\boldsymbol{t})= {{\int}_{0}^{1}} \mathbb{P}\left( \boldsymbol{X}>\frac{u}{\boldsymbol{t}} \right) du&=& {{\int}_{0}^{1}} \exp\left( -u\left\Vert\frac 1{\boldsymbol{t}}\right\Vert_{D} \right) du\\ &=& \frac 1{\left\Vert1/\boldsymbol{t}\right\Vert_{D}}\left[1-\exp\left( - \left\Vert\frac 1{\boldsymbol{t}}\right\Vert_{D}\right) \right]. \end{array} $$

This generalizes Example 2.1 to an arbitrary dimension d ≥ 1.
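In the special case where ||⋅||D is the ℓ1-norm, that is, when the components of X are independent standard exponential rvs, the closed form above is easily confirmed by simulation, as in the following sketch of ours (the sample size and the value of t are illustrative).

```python
# Sketch (ours): Monte Carlo check of Example 2.6 in the special case where
# ||.||_D is the l1-norm, i.e. independent standard exponential components.
import numpy as np

rng = np.random.default_rng(0)
d, n = 3, 200_000                        # illustrative dimension and sample size
X = rng.standard_exponential(size=(n, d))

t = np.array([0.8, 1.5, 2.5])            # illustrative value of t
empirical = np.mean(np.minimum(1.0, (t * X).min(axis=1)))

s = np.sum(1.0 / t)                      # ||1/t||_D for the l1-norm
closed_form = (1.0 - np.exp(-s)) / s     # formula of Example 2.6
print(empirical, closed_form)
```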

We may now provide an application of our results to the theory of D-norms. Recall that a D-norm on \(\mathbb {R}^{d}\), d ≥ 2, is a norm of the form

$$ \left\Vert\boldsymbol{x}\right\Vert_{D}:= \mathbb{E}(\max(\left\vert x_{1}\right\vert Z_{1},\ldots,\left\vert x_{d}\right\vert Z_{d}) ) $$

where \(\boldsymbol {Z}=(Z_{1},\dots ,Z_{d})\) is a componentwise nonnegative rv such that E(Zi) = 1, 1 ≤ i ≤ d, called the generator of \(\left \Vert \cdot \right \Vert _{D}\). The concept of D-norms has come to prominence recently for its importance in multivariate extreme value theory, not least because it allows for a simple characterization of max-stable dfs (see Theorem 2.3.3 in Falk 2019). Attached to a D-norm ||⋅||D is the concept of dual D-norm function

$$ \wr{\kern-2.5pt}\wr \boldsymbol{x} \wr{\kern-2.5pt}\wr_{D}:= \mathbb{E}(\min(\left\vert x_{1}\right\vert Z_{1},\ldots,\left\vert x_{d}\right\vert Z_{d}) ) $$

which has recently found applications in the analysis of multivariate records (Dombry et al. 2019; Dombry and Zott, 2018). It is known that the mapping

$$ \left\Vert\cdot\right\Vert_{D} \mapsto \wr\wr \cdot \wr\wr_{D} $$

is indeed well-defined, in the sense that any two generators of the same D-norm generate the same dual D-norm function, but this mapping is not one-to-one (see Section 1.6 of Falk 2019). The next result shows that if we restrict this mapping to D-norms admitting componentwise positive generators, it becomes one-to-one.

Proposition 2.6.

Let Z(1), Z(2) be componentwise positive generators of two D-norms \(\left \Vert \cdot \right \Vert _{D_{1}}\) and \(\left \Vert \cdot \right \Vert _{D_{2}}\). Then

$$ \left\Vert\cdot\right\Vert_{D_{1}} = \left\Vert\cdot\right\Vert_{D_{2}} \Leftrightarrow \wr{\kern-2.5pt}\wr \cdot \wr{\kern-2.5pt}\wr_{D_{1}} = \wr{\kern-2.5pt}\wr \cdot \wr{\kern-2.5pt}\wr_{D_{2}}. $$

The proof rests on the following lemma.

Lemma 2.7.

Any D-norm with a generator \(\boldsymbol {Z}\in \mathbb {X}_{d}\) also has a generator \(\boldsymbol {Z}^{*}\in \mathbb {X}_{d}\) with \(Z^{*}_{1}=1\).

Proof.

That there is a generator \(\boldsymbol{Z}^{*}\) with \(Z^{*}_{1}=1\) follows from Lemma 2.10 in Falk and Stupfler (2019). We need only show that \(\boldsymbol {Z}^{*}\in \mathbb {X}_{d}\), translating into \(\mathbb {P}(Z_{i}^{*}>0)\! =\! 1\) for any i ∈{2,…, d}. For any x > 0, \( \mathbb {E}(\max \limits (Z_{1}, x Z_{i}) )\) \(= \mathbb {E}(\max \limits (1, x Z_{i}^{*}) ), \) and thus \( \mathbb {E}(\min \limits (Z_{1}, x Z_{i}) ) = \mathbb {E}(\min \limits (1, x Z_{i}^{*}) ) \) by the identity \(\max \limits (a,b)+\min \limits (a,b)=a+b\) and the fact that all the Zj and \(Z_{j}^{*}\) have expectation 1. Letting \(x\uparrow \infty \) and using the dominated convergence theorem entails \(1=\mathbb {P}(Z_{i}^{*}>0)\), as required. □

Proof of Proposition 2.6.

We only need to show that the equality of the dual D-norm functions implies that of the original D-norms. By Lemma 2.7, we may assume that the first element of each generator is equal to 1: in particular,

$$ \mathbb{E}(\min(\left\vert x_{1}\right\vert, \left\vert x_{2}\right\vert Z_{2}^{(1)},\ldots,\left\vert x_{d}\right\vert Z_{d}^{(1)}) ) = \mathbb{E}(\min(\left\vert x_{1}\right\vert, \left\vert x_{2}\right\vert Z_{2}^{(2)},\ldots,\left\vert x_{d}\right\vert Z_{d}^{(2)}) ) $$

for any \(\boldsymbol {x}\in \mathbb {R}^{d}\). The random vectors \((Z_{2}^{(1)},\ldots ,Z_{d}^{(1)})\) and \((Z_{2}^{(2)},\ldots ,Z_{d}^{(2)})\) then have the same distribution, by Theorem 2.1. The result follows. □

We conclude this section by discussing an interesting example of interplay between max-stability, min-stability and the notion of min-CF. Recall that a copula C is said to be in the domain of attraction of a standard max-stable df G if

$$ \lim_{n\to\infty} C^{n}\left( 1+\frac{\boldsymbol{x}}{n}\right) = G(\boldsymbol{x}), \ \boldsymbol{x}\le\boldsymbol{0}\in\mathbb{R}^{d}. $$

In this context, it is a consequence of Falk (2019, Theorem 2.3.3) that \(G(\boldsymbol {x})=\exp (-\left \Vert \boldsymbol {x}\right \Vert _{D})\), \(\boldsymbol {x}\le \boldsymbol {0}\in \mathbb {R}^{d}\), for some D-norm ||⋅||D which, in this case, describes the extremal dependence within the copula C. We then have the following result.

Proposition 2.8.

Let \(\boldsymbol {X}=(X_{1},\dots ,X_{d})\) be a rv that follows a copula C in the domain of attraction of the standard max-stable df \(G(\boldsymbol {x})=\exp (-\left \Vert \boldsymbol {x}\right \Vert _{D})\), \(\boldsymbol {x}\le \boldsymbol {0}\in \mathbb {R}^{d}\). Let (by Proposition 2.4 (vii)) \(\boldsymbol {Y}\in \mathbb {X}_{d}\) be a rv with survival function \(\boldsymbol {t}\mapsto \psi _{-\log \boldsymbol {X}}(\boldsymbol {1}/\boldsymbol {t})\), \(\boldsymbol {t}>\boldsymbol {0}\in \mathbb {R}^{d}\). Then Y is asymptotically min-stable, in the sense that

$$ \lim_{n\to\infty} \mathbb{P}\left( \frac n2 \min_{1\le i\le n}\boldsymbol{Y}^{(i)}>\boldsymbol{x} \right) = \exp(-\left\Vert\boldsymbol{x}\right\Vert_{D}), \ \boldsymbol{x}>\boldsymbol{0}\in\mathbb{R}^{d}, $$

where \(\boldsymbol {Y}^{(1)},\boldsymbol {Y}^{(2)},\dots \) are independent copies of Y.

Proof.

The domain of attraction assumption on C is equivalent to the expansion

$$ C(\boldsymbol{u})=1-\left\Vert\boldsymbol{1}-\boldsymbol{u}\right\Vert_{D} + \operatorname{o}(\left\Vert\boldsymbol{1}-\boldsymbol{u}\right\Vert) $$
(2)

as \(\boldsymbol{u}\to\boldsymbol{1}\), uniformly for \(\boldsymbol{u}\in[0,1]^{d}\) (see Proposition 3.1.5 in Falk 2019), in the sense that

$$ \forall \varepsilon>0, \ \exists \delta>0, \ \boldsymbol{u}\in [1-\delta,1]^{d} \Rightarrow \frac{\left| C(\boldsymbol{u}) - (1-\left\Vert\boldsymbol{1}-\boldsymbol{u}\right\Vert_{D})\right|}{\left\Vert\boldsymbol{1}-\boldsymbol{u}\right\Vert} \leq \varepsilon. $$

Note then that, from Lemma 2.3,

$$ \psi_{-\log \boldsymbol{X}}\left( \frac{\boldsymbol{1}}{s\boldsymbol{x}} \right) = {{\int}_{0}^{1}} C(\exp(-st \boldsymbol{x})) dt $$

and thus, combining a Taylor expansion of the exponential function around 0 and Eq. 2, the min-CF of \(-\log (\boldsymbol {X})\) satisfies, for \(\boldsymbol {x}>\boldsymbol {0}\in \mathbb {R}^{d}\),

$$ \lim_{s\downarrow 0}\frac 2s \left( 1- \psi_{-\log \boldsymbol{X}}\left( \frac{\boldsymbol{1}}{s\boldsymbol{x}} \right) \right)=\left\Vert\boldsymbol{x}\right\Vert_{D}. $$
(3)

In other words, since \(\boldsymbol {Y}\in \mathbb {X}_{d}\) has survival function \(\boldsymbol {t}\mapsto \psi _{-\log \boldsymbol {X}}(\boldsymbol {1}/\boldsymbol {t})\), \(\boldsymbol {t}>\boldsymbol {0}\in \mathbb {R}^{d}\), we have

$$ \mathbb{P}\left( \frac{1}{2} \boldsymbol{Y} > s\boldsymbol{x} \right) = 1- s\left\Vert\boldsymbol{x}\right\Vert_{D}+\operatorname{o}(s) $$

as \(s\downarrow 0\) for \(\boldsymbol {x}>\boldsymbol {0}\in \mathbb {R}^{d}\). For independent copies \(\boldsymbol {Y}^{(1)},\boldsymbol {Y}^{(2)},\dots \) of Y, this yields

$$ \mathbb{P}\left( \frac n2 \min_{1\le i\le n}\boldsymbol{Y}^{(i)} > \boldsymbol{x} \right) = \left[ \mathbb{P}\left( \frac{1}{2} \boldsymbol{Y} > \frac{1}{n}\boldsymbol{x} \right) \right]^{n} \to \exp(-\left\Vert\boldsymbol{x}\right\Vert_{D}), \ \boldsymbol{x}>\boldsymbol{0}\in\mathbb{R}^{d}, $$

completing the proof. □

We highlight the following consequence of Proposition 2.8, which follows from Eqs. 2 and 3 in its proof. It can be used to suggest estimators of a D-norm as done in Example 3.1 below.

Proposition 2.9.

Let \(\boldsymbol {X}=(X_{1},\dots ,X_{d})\) follow a copula C. If C is in the domain of attraction of a standard max-stable df \(G(\boldsymbol {x})=\exp (-\left \Vert \boldsymbol {x}\right \Vert _{D})\), \(\boldsymbol {x}\le \boldsymbol {0}\in \mathbb {R}^{d}\), then, for all \(\boldsymbol {x}\ge \boldsymbol {0}\in \mathbb {R}^{d}\), the limit

$$ \ell(\boldsymbol{x}) := \lim_{s\downarrow 0}\frac 2s \left( 1- \mathbb{E}\left( \min\left( 1,\frac{-\log(X_{1})}{sx_{1}},\dots, \frac{-\log(X_{d})}{sx_{d}} \right)\right)\right) $$

exists, and \(\ell (\boldsymbol {x})=\left \Vert \boldsymbol {x}\right \Vert _{D}\).

When considering notions of characteristic functions, such as the Fourier transform, the Laplace transform, the moment-generating function, or the max-CF, it is important to examine the connection between convergence of a sequence of characteristic functions and convergence in distribution of the associated rvs. This is the focus of the next section.

3 Sequential Behavior of the Min-characteristic Function

3.1 With Respect to Convergence in Distribution

An important result regarding the max-CF is that the pointwise convergence of a sequence of max-CFs to a max-CF is equivalent to the convergence of the pertaining distributions in the Wasserstein metric

$$ d_{W}(P,Q) := \inf\{\mathbb{E}(\left\Vert\boldsymbol{X}-\boldsymbol{Y}\right\Vert_{1})\!: \boldsymbol{X}\mathrm{\ has\ distribution\ }P, \ \boldsymbol{Y} \mathrm{\ has\ distribution\ }Q\}. $$

Convergence in this metric is nothing but convergence in distribution plus convergence of first moments, according to Villani (2009, Definition 6.8 and Theorem 6.9). Of course, the use of min-CFs does not require any integrability assumption, so one cannot hope that a similar theorem would link pointwise convergence of min-CFs to convergence in the metric dW, but we may still anticipate convergence in distribution of the pertaining rvs. This is precisely the content of our next result.

Theorem 3.1.

Let X(n), X be rvs in \(\mathbb {X}_{d}\). Then

$$ \boldsymbol{X}^{(n)} \overset{d}{\longrightarrow} \boldsymbol{X} \Leftrightarrow \psi_{\boldsymbol{X}^{(n)}} \to \psi_{\boldsymbol{X}} \ \text{ pointwise}. $$

Proof of Theorem 3.1.

Suppose that \(\boldsymbol {X}^{(n)} \overset {d}{\to } \boldsymbol {X}\). For any \(\boldsymbol {t}=(t_{1},\ldots ,t_{d}) \in (0,\infty )^{d}\), the function h on \(\mathbb {R}^{d}\) defined by

$$ h(x_{1},\ldots,x_{d}) := \min(1,t_{1} x_{1},\ldots,t_{d} x_{d}) \ \text{ if } x_{1},\ldots,x_{d}>0 \text{ and } 0 \text{ otherwise} $$

is continuous and bounded. Consequently

$$ \psi_{\boldsymbol{X}^{(n)}}(\boldsymbol{t}) = \mathbb{E}(h(\boldsymbol{X}^{(n)})) \to \mathbb{E}(h(\boldsymbol{X})) = \psi_{\boldsymbol{X}}(\boldsymbol{t}) $$

as required. Suppose conversely that \(\psi _{\boldsymbol {X}^{(n)}} \to \psi _{\boldsymbol {X}}\) pointwise. We show that \(-\boldsymbol {X}^{(n)} \overset {d}{\longrightarrow } -\boldsymbol {X}\), or equivalently that

$$ G^{(n)}(\boldsymbol{x}) := \mathbb{P}(-\boldsymbol{X}^{(n)}\leq \boldsymbol{x}) \to \mathbb{P}(-\boldsymbol{X}\leq \boldsymbol{x}) =: G(\boldsymbol{x}) $$

at every point of continuity x0 of G. Let

$$ \overline{G}^{(n)}(\boldsymbol{x}) := \mathbb{P}(\boldsymbol{X}^{(n)}\geq \boldsymbol{x}) \ \text{ and } \ \overline{G}(\boldsymbol{x}) := \mathbb{P}(\boldsymbol{X}\geq \boldsymbol{x}) $$

so that \(\overline {G}^{(n)}(\boldsymbol {x}) = G^{(n)}(-\boldsymbol {x})\) and \(\overline {G}(\boldsymbol {x}) = G(-\boldsymbol {x})\). From the proof of Theorem 2.5 we know that, for any x > 0 and s, t > 0,

$$ t \psi_{\boldsymbol{X}^{(n)}}\left( \frac{1}{t\boldsymbol{x}} \right) - s \psi_{\boldsymbol{X}^{(n)}}\left( \frac{1}{s\boldsymbol{x}} \right) = {{\int}_{s}^{t}} \mathbb{P}(X_{1}^{(n)}>v x_{1},\ldots, X_{d}^{(n)}>v x_{d}) dv. $$

Using the fact that the distributions of \(X_{1}^{(n)},\ldots ,X_{d}^{(n)}\) have at most countably many atoms, we get

$$ \begin{array}{@{}rcl@{}} {{\int}_{s}^{t}} \overline{G}^{(n)}(v\boldsymbol{x}) dv &=& t \psi_{\boldsymbol{X}^{(n)}}\left( \frac{1}{t\boldsymbol{x}} \right) - s \psi_{\boldsymbol{X}^{(n)}}\left( \frac{1}{s\boldsymbol{x}} \right) \\ &\to & t \psi_{\boldsymbol{X}}\left( \frac{1}{t\boldsymbol{x}} \right) - s \psi_{\boldsymbol{X}}\left( \frac{1}{s\boldsymbol{x}} \right) \\ &=& {{\int}_{s}^{t}} \overline{G}(v\boldsymbol{x}) dv. \end{array} $$
(4)

Let x > 0 be a point of continuity of \(\overline {G}\). If

$$ \limsup_{n\to\infty} \overline{G}^{(n)}(\boldsymbol{x}) > \overline{G}(\boldsymbol{x}) \ \text{ or } \ \liminf_{n\to\infty} \overline{G}^{(n)}(\boldsymbol{x}) < \overline{G}(\boldsymbol{x}) $$

then, by exploiting the monotonicity properties of \(\overline {G}^{(n)}\) and the continuity of \(\overline {G}\) at x, Eq. 4 readily produces a contradiction by putting s = 1 and t = 1 + ε or t = 1 and s = 1 − ε with a small ε > 0. This gives \(\overline {G}^{(n)}(\boldsymbol {x})\to \overline {G}(\boldsymbol {x})\) at any point of continuity x > 0 of \(\overline {G}\), or equivalently

$$ G^{(n)}(\boldsymbol{x}) \to G(\boldsymbol{x}) $$
(5)

at every point of continuity x < 0 of G. To show that this convergence also holds at the points of continuity x of G with one or several components equal to zero, we fix one such point, and we note that it is enough to prove that every subsequence G(m(n))(x) of G(n)(x) has a further subsequence that converges to G(x) (a result known as Cantor’s lemma). From Helly’s selection theorem, we can take a subsequence G(k(m(n))) of the sequence G(m(n)) which converges to some finite measure-generating function G* on \(\mathbb {R}^{d}\), at all points of continuity of G*: in other words, we can find a measure μ* on \(\mathbb {R}^{d}\) with \(G^{(k(m(n)))}(\boldsymbol {t}) \to G^{*}(\boldsymbol {t}) := \mu ^{*}((-\boldsymbol {\infty },\boldsymbol {t}])\) at every point of continuity t of the limit.

We claim that actually G* = G on \((-\infty ,0]^{d}\) irrespective of the choice of the subsequence, which will obviously imply G(k(m(n)))(x) → G*(x) = G(x) as required. We prove this claim as follows. Clearly G* = G on \((-\infty ,0)^{d}\) wherever G and G* are both continuous, by Eq. 5. The set of such points is dense in \((-\infty ,0)^{d}\), since G and G* are finite measure-generating functions. Right-continuity of G and G* then implies that G* = G everywhere on \((-\infty ,0)^{d}\). Besides, the monotonicity of G together with the fact that \(\mathbb {P}(-\boldsymbol {X}<\boldsymbol {0}) = 1\) implies that

$$ \begin{array}{@{}rcl@{}} 1 = G(\boldsymbol{0}) = \lim_{\boldsymbol{\varepsilon}\downarrow\boldsymbol{0}} G(\boldsymbol{0}-\boldsymbol{\varepsilon})= \lim_{\boldsymbol{\varepsilon}\downarrow\boldsymbol{0}} G^{*}(\boldsymbol{0}-\boldsymbol{\varepsilon}) &=& \mu^{*}((-\infty,0)^{d}) \\ &\leq & \mu^{*}((-\infty,0]^{d}) = G^{*}(\boldsymbol{0})\le 1. \end{array} $$

Thus each of the \(E_{j}:=\{ \boldsymbol {y}\in (-\infty ,0]^{d} | y_{j}=0\}\) satisfies μ*(Ej) = 0. Conclude by letting T be the set of indices for which xi < 0 and by writing

$$ \begin{array}{@{}rcl@{}} G^{*}(\boldsymbol{x}) = \mu^{*}((-\boldsymbol{\infty},\boldsymbol{x}]) &=& \mu^{*}(\{ \boldsymbol{y}\in \mathbb{R}^{d} | y_{i}\leq x_{i}, i\in T, y_{j}\leq 0, j\notin T\}) \\ &=& \mu^{*}(\{ \boldsymbol{y}\in \mathbb{R}^{d} | y_{i}\leq x_{i}, i\in T, y_{j}<0, j\notin T\}) \\ &=& \lim_{\varepsilon\downarrow 0} \mu^{*}(\{ \boldsymbol{y}\in \mathbb{R}^{d} | y_{i}\leq x_{i}, i\in T, y_{j}\leq -\varepsilon, j\notin T\}) \\ &=& \lim_{\varepsilon\downarrow 0} \mathbb{P}(-X_{i}\leq x_{i}, i\in T, -X_{j}\leq -\varepsilon, j\notin T) \\ &=& \mathbb{P}(-X_{i}\leq x_{i}, i\in T, -X_{j}<0, j\notin T) \\ &=& \mathbb{P}(-X_{i}\leq x_{i}, i\in T, -X_{j}\leq 0, j\notin T) = G(\boldsymbol{x}). \end{array} $$

Complete the proof by noting that G*(0) = 1 = G(0) and thus G* = G on \((-\infty ,0]^{d}\). □

Remark 3.1.

The pointwise convergence in Theorem 3.1 can be strengthened to uniform convergence on \([0,\infty )^{d}\). Note that \(\psi _{\boldsymbol {X}^{(n)}}\), \(n\in \mathbb {N}\), is a sequence of dfs which converges pointwise to the continuous df ψX. But this means weak convergence of a sequence of rvs Y(n), having df \(\psi _{\boldsymbol {X}^{(n)}}\), to a rv Y having df ψX. Since the limiting df ψX is continuous, this implies uniform convergence of \(\psi _{\boldsymbol {X}^{(n)}}\) to ψX, see, e.g., Billingsley (1968, Problem 3, Section 3). We thus have

$$ \boldsymbol{X}^{(n)} \overset{d}{\longrightarrow} \boldsymbol{X} \Leftrightarrow \sup_{\boldsymbol{t}\ge\boldsymbol{0}} \left\vert \psi_{\boldsymbol{X}^{(n)}}(\boldsymbol{t}) - \psi_{\boldsymbol{X}}(\boldsymbol{t}) \right\vert \to 0. $$

3.2 The Empirical min-CF

The fact that the min-CF identifies convergence in distribution suggests that it may also be used in estimation settings. We briefly explore this context here from the asymptotic point of view. Let \(\boldsymbol {X}^{(1)},\dots ,\boldsymbol {X}^{(n)}\) be independent copies of a rv \(\boldsymbol {X}\in \mathbb {X}_{d}\). The (random) min-CF induced by the empirical measure \(\widehat {P}_{n}:= n^{-1} {\sum }_{i=1}^{n} \delta _{\boldsymbol {X}^{(i)}}\) is

$$ \widehat{\psi}_{\boldsymbol{X}}^{(n)}(\boldsymbol{t}) = \frac{1}{n} {\sum}_{i=1}^{n}\min\left( 1,t_{1} X_{1}^{(i)},\dots, t_{d} X_{d}^{(i)}\right). $$

By the law of large numbers, we have, for any \(\boldsymbol {t}=(t_{1},\ldots ,t_{d}) \in [0,\infty )^{d}\), that almost surely:

$$ \widehat{\psi}_{\boldsymbol{X}}^{(n)}(\boldsymbol{t}) \to \mathbb{E}(\min(1, t_{1} X_{1},\dots,t_{d} X_{d})) = \psi_{\boldsymbol{X}}(\boldsymbol{t}) \ \text{ as } \ n\to\infty. $$

Since \(\widehat {\psi }_{\boldsymbol {X}}^{(n)}\) is a df with probability 1, we also have uniform almost sure convergence of this estimator, by the same argument as in Remark 3.1:

$$ \sup_{\boldsymbol{t}\ge\boldsymbol{0}}\left\vert \widehat{\psi}_{\boldsymbol{X}}^{(n)}(\boldsymbol{t}) -\psi_{\boldsymbol{X}}(\boldsymbol{t}) \right\vert \to 0 \ \text{ almost surely.} $$
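The following sketch (a minimal illustration of ours) computes the empirical min-CF of a Pareto sample with tail index γ = 1, a case with infinite mean for which the max-CF is not defined, and reports its maximal deviation from the closed form of Example 2.2 over a grid of values of t.

```python
# Sketch (ours): empirical min-CF of a univariate Pareto sample with tail index
# gamma = 1 (infinite mean), compared with psi_1(t) = t * (1 - log t) of Example 2.2.
import numpy as np

rng = np.random.default_rng(1)
n = 50_000
X = 1.0 / rng.uniform(size=n)            # Pareto distribution with gamma = 1

def psi_true(t):
    return np.where(t >= 1.0, 1.0, t * (1.0 - np.log(t)))

def psi_hat(t):
    # empirical min-CF: average of min(1, t * X_i) over the sample
    return np.minimum(1.0, t[:, None] * X[None, :]).mean(axis=1)

grid = np.linspace(0.01, 3.0, 50)
print("max deviation on the grid:", np.max(np.abs(psi_hat(grid) - psi_true(grid))))
```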

Example 3.1.

Our results so far open a way to estimate a D-norm by using the empirical min-CF. Let \(\boldsymbol {X}^{(1)},\dots ,\boldsymbol {X}^{(n)}\) be independent copies of a rv \(\boldsymbol {X}\in \mathbb {X}_{d}\) following a copula C in the domain of attraction of a standard max-stable df G. From Proposition 2.9 we obtain that the min-CF of \(-\log (\boldsymbol {X})\) satisfies

$$ \lim_{s\downarrow 0}\frac 2s \left( 1- \psi_{-\log \boldsymbol{X}}\left( \frac{\boldsymbol{1}}{s\boldsymbol{x}} \right) \right)=\left\Vert\boldsymbol{x}\right\Vert_{D}. $$

This suggests estimating \(\left \Vert \boldsymbol {x}\right \Vert _{D}\) by

$$ \begin{array}{@{}rcl@{}} \widehat{\left\Vert\boldsymbol{x}\right\Vert_{D}} &=& \frac{2}{s_{n}} \left( 1 - \widehat{\psi}_{-\log \boldsymbol{X}}^{(n)}\left( \frac{1}{s_{n}\boldsymbol{x}} \right) \right) \\ &=& \frac{2}{s_{n}} \left( 1 - \frac{1}{n} {\sum}_{i=1}^{n} \min\left( 1,\frac{-\log(X_{1}^{(i)})}{s_{n} x_{1}},\dots, \frac{-\log(X_{d}^{(i)})}{s_{n} x_{d}} \right) \right) \end{array} $$

where (sn) is a positive sequence converging to 0. Although the study of this estimator is outside the scope of this paper, it offers a potentially interesting alternative to existing techniques for the estimation of an extremal dependence structure, such as the classical tail dependence estimators developed by Drees and Huang (1998), Schmidt and Stadtmüller (2006) and Einmahl et al. (2008), among others.
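As a minimal numerical sketch of this estimation idea (ours, not a simulation study from this paper), one may take C to be the independence copula, whose attractor D-norm is the ℓ1-norm, so that the target value is \(\left\Vert\boldsymbol{x}\right\Vert_{D}=x_{1}+\cdots+x_{d}\) for x ≥ 0.

```python
# Sketch (ours): D-norm estimation via the empirical min-CF (Example 3.1) for a
# sample from the independence copula, whose attractor D-norm is the l1-norm.
import numpy as np

rng = np.random.default_rng(2)
d, n = 2, 500_000
U = rng.uniform(size=(n, d))             # sample from the independence copula

def d_norm_estimate(U, x, s):
    # (2/s) * (1 - empirical min-CF of -log(U) evaluated at 1/(s*x))
    ratios = -np.log(U) / (s * x)
    return 2.0 / s * (1.0 - np.mean(np.minimum(1.0, ratios.min(axis=1))))

x = np.array([1.0, 2.0])                 # target: ||x||_1 = 3
for s in (0.2, 0.05, 0.01):              # the bias vanishes as s decreases
    print(s, d_norm_estimate(U, x, s))
```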

Turning to rates of convergence, the central limit theorem implies that \(\widehat {\psi }_{\boldsymbol {X}}^{(n)}(\boldsymbol {t})\) is a \(\sqrt {n}-\)consistent estimator of ψX(t). The above local uniform convergence then naturally raises the question of the weak convergence of the process

$$ S_{n} = (S_{n}(\boldsymbol{t}))_{\boldsymbol{t}\ge\boldsymbol{0}}:= \sqrt{n} \left( \widehat{\psi}_{\boldsymbol{X}}^{(n)}(\boldsymbol{t}) -\psi_{\boldsymbol{X}}(\boldsymbol{t}) \right)_{\boldsymbol{t}\ge\boldsymbol{0}} $$

on \([0,\infty )^{d}\). This stochastic process has continuous sample paths and satisfies Sn(0) = 0. For ease of exposition, we state a result on the weak convergence of this process in the case d = 1.

Theorem 3.2.

Let \(X^{(1)},\dots ,X^{(n)}\) be independent copies of a univariate rv X > 0 with df F. For any t0 > 0, we have

$$ S_{n}(t) := \sqrt{n} \left( \widehat{\psi}_{X}^{(n)}(t)-\psi_{X}(t) \right) \to S(t) := t{\int}_{0}^{1/t} W \circ F(u) du $$

weakly in the space C[0, t0] of continuous functions over [0, t0], where W is a standard Brownian bridge on [0,1]. The limiting process S, which should be read as 0 when t = 0, is a Gaussian process with covariance structure

$$ \operatorname{Cov}(S(t_{1}), S(t_{2}) ) = \iint_{[0,1]^{2}} \left[ F\left( \min\left\{ \frac{x}{t_{1}}, \frac{y}{t_{2}} \right\} \right) - F\left( \frac{x}{t_{1}} \right) F\left( \frac{y}{t_{2}} \right) \right] dx dy. $$

Proof.

We adapt the proof of Theorem 3.4 in Falk and Stupfler (2019). By Theorem 1, p.93 of Shorack and Wellner (1986), we can construct, on a common probability space, a triangular array of rowwise independent, standard uniform rvs (U(n,1),…, U(n, n))n≥ 1, and a Brownian bridge \(\widetilde {W}\) such that

$$ \sup_{0\leq t\leq 1} \left\vert \mathbb{W}_{n}(t) - \widetilde{W}(t) \right\vert \to 0 \ \text{ almost surely, where } \ \mathbb{W}_{n}(t) := \sqrt{n}\left( \frac{1}{n}{\sum}_{i=1}^{n} \mathbf{1}\left\{U^{(n,i)}\leq t\right\} - t \right), \ t\in[0,1]. $$

Furthermore, if we denote by q the quantile function of X (i.e. the left-continuous inverse of F) and by \(\widetilde {X}^{(n,i)} := q(U^{(n,i)})\), we have, for any n ≥ 1,

$$ S_{n}(t) \overset{d}{=} \widetilde{S}_{n}(t) := \frac{1}{\sqrt{n}} {\sum}_{i=1}^{n} \left[ \min\left( 1,t \widetilde{X}^{(n,i)}\right) - \mathbb{E}(\min(1,t X)) \right], $$

as processes in C[0, t0]. Besides, we have for any t > 0:

$$ \widetilde{S}_{n}(t) = \frac{1}{\sqrt{n}} {\sum}_{i=1}^{n} \left[ t{{\int}_{0}^{1/t}} \mathbf{1}\left\{\widetilde{X}^{(n,i)}>u\right\} du - t{{\int}_{0}^{1/t}} \mathbb{P}(X>u)\, du \right] = -t{{\int}_{0}^{1/t}} \sqrt{n}\left( \frac{1}{n}{\sum}_{i=1}^{n} \mathbf{1}\left\{\widetilde{X}^{(n,i)}\leq u\right\} - F(u) \right) du. $$

Since \(\widetilde {X}^{(n,i)} \leq u\Leftrightarrow U^{(n,i)}\leq F(u)\), this yields

$$ \widetilde{S}_{n}(0)=0 \ \text{ and } \ \forall t>0, \ \widetilde{S}_{n}(t) = -t{\int}_{0}^{1/t} \mathbb{W}_{n} \circ F(u) du. $$

Defining a process \(\widetilde {S}\) by \(\widetilde {S}(0)=0\) and \(\widetilde {S}(t) = -t{\int \limits }_{0}^{1/t} \widetilde {W} \circ F(u) du\) for t > 0, we get

$$ \sup_{0\leq t\leq t_{0}} \left| \widetilde{S}_{n}(t) - \widetilde{S}(t) \right| \leq \sup_{0\leq t\leq 1} \left| \mathbb{W}_{n}(t) - \widetilde{W}(t) \right| \to 0 $$

almost surely. The process \(\widetilde {S}\) is then almost surely continuous, as it is the almost sure uniform limit of the sequence of continuous processes \((\widetilde {S}_{n})\). By symmetry of the standard Brownian bridge, we conclude that, as processes in C[0, t0],

$$ S_{n}(t) \overset{d}{=} \widetilde{S}_{n}(t) \overset{\mathrm{a.s.}}{\longrightarrow} \widetilde{S}(t) \overset{d}{=} S(t). $$

This shows the desired weak convergence; the assertion on the covariance structure of the limiting process follows from a simple calculation using the well-known covariance properties of the Brownian bridge. □
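The covariance formula can be checked by simulation in this simple setting; the sketch below (an illustration of ours, with X uniformly distributed on (0,1) and illustrative choices of t1, t2, n and the number of replications) compares an empirical covariance of Sn with the double integral above.

```python
# Sketch (ours): Monte Carlo check of the covariance formula of Theorem 3.2,
# with X uniform on (0,1), so that F(u) = min(1, u) for u >= 0.
import numpy as np
from scipy.integrate import dblquad

rng = np.random.default_rng(3)
t1, t2, n, reps = 1.5, 3.0, 2_000, 4_000   # illustrative values

F = lambda u: np.minimum(1.0, u)
psi = lambda t: 1.0 - 1.0 / (2.0 * t)      # min-CF of U(0,1) for t >= 1

S1, S2 = [], []
for _ in range(reps):
    X = rng.uniform(size=n)
    S1.append(np.sqrt(n) * (np.minimum(1.0, t1 * X).mean() - psi(t1)))
    S2.append(np.sqrt(n) * (np.minimum(1.0, t2 * X).mean() - psi(t2)))
print("empirical covariance:  ", np.cov(S1, S2)[0, 1])

integrand = lambda y, x: F(min(x / t1, y / t2)) - F(x / t1) * F(y / t2)
print("theoretical covariance:", dblquad(integrand, 0.0, 1.0, 0.0, 1.0)[0])
```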

In the case d > 1, and under regularity conditions (e.g. those of Massart 1989), a similar proof can be written to show an analogue of Theorem 3.2, giving the convergence of the process Sn, in a space of continuous functions over compact subsets of \([0,\infty )^{d}\), to a d-dimensional Gaussian process S with covariance structure

$$ \operatorname{Cov}(S(\boldsymbol{t}_{1}), S(\boldsymbol{t}_{2}) ) = \iint_{[0,1]^{2d}} \left[ F\left( \min\left\{ \frac{\boldsymbol{x}}{\boldsymbol{t}_{1}}, \frac{\boldsymbol{y}}{\boldsymbol{t}_{2}} \right\} \right) -\! F\left( \frac{\boldsymbol{x}}{\boldsymbol{t}_{1}} \right) F\left( \frac{\boldsymbol{y}}{\boldsymbol{t}_{2}} \right) \right] d\boldsymbol{x} d\boldsymbol{y}. $$

Note that the asymptotic distribution in Theorem 3.2 bears some similarity to the asymptotic distribution of the empirical max-CF process

$$ \sqrt{n}\left( \widehat{\varphi}_{X}^{(n)}(t) - \varphi_{X}(t) \right) = \frac{1}{\sqrt{n}} {\sum}_{i=1}^{n} \left[ \max\left( 1, t X^{(i)} \right) - \mathbb{E}\left( \max\left( 1, t X \right) \right) \right] $$

when \(\mathbb {E}(X^{2})<\infty \), which is obtained as a particular case of Theorem 3.4 of Falk and Stupfler (2019).

4 On the Structure of the Set of Min-characteristic Functions

Theorem 3.1 shows that the pointwise convergence of a sequence of min-CFs to a min-CF is equivalent to the convergence in distribution of the pertaining rvs. The requirement that the limit be a min-CF is necessary: if Xn = n almost surely (\(n\in \mathbb {N}\)), then the corresponding sequence of min-CFs satisfies

$$ \forall x>0, \ \psi_{X_{n}}(x) = \mathbb{E}(\min(1,nx)) \to 1 \ \text{ as } \ n\to\infty, $$

but the function ψ(x) = 1, if x > 0, and ψ(0) = 0, is not a min-CF because it is not continuous at zero. In other words, the set of min-CFs is not closed in the topology of pointwise convergence. This is certainly not specific to the notion of min-CF; the set of Fourier transforms is not closed either (think for example of the sequence of normal distributions with mean 0 and variance \(n^{2}\)). It is nonetheless interesting to get some further understanding of the structure of the set of min-CFs and the elements it contains: this is the focus of the present section. We start by noting that the set of min-CFs is convex.

Lemma 4.1.

The convex combination of two min-CFs is again a min-CF.

Proof.

Let X(1), X(2) be two rvs in \(\mathbb {X}_{d}\) and λ ∈ (0,1). Let \(Z\in \left \{1,2\right \}\) be a rv that is independent of X(1), X(2), with \(\mathbb {P}(Z=1)=\lambda =1-\mathbb {P}(Z=2)\). Then X = X(Z) is a rv in \(\mathbb {X}_{d}\) with

$$ \begin{array}{@{}rcl@{}} \psi_{\boldsymbol{X}}(\boldsymbol{t}) &=& \mathbb{E}\left( \min\left( 1,t_{1}X_{1}^{(Z)},\dots,t_{d}X_{d}^{(Z)} \right) \right)\\ &=&\lambda \mathbb{E}\left( \min\left( 1,t_{1}X_{1}^{(1)},\dots,t_{d}X_{d}^{(1)} \right)\right)\\ &&+ (1-\lambda) \mathbb{E}\left( \min\left( 1,t_{1}X_{1}^{(2)},\dots,t_{d}X_{d}^{(2)} \right)\right)\\ &=& \lambda \psi_{\boldsymbol{X}^{(1)}}(\boldsymbol{t}) + (1-\lambda)\psi_{\boldsymbol{X}^{(2)}}(\boldsymbol{t}). \end{array} $$

This shows that the convex combination \(\lambda \psi _{\boldsymbol {X}^{(1)}} + (1-\lambda )\psi _{\boldsymbol {X}^{(2)}}\) of the min-CFs \(\psi _{\boldsymbol {X}^{(1)}}\) and \(\psi _{\boldsymbol {X}^{(2)}}\) is a min-CF again. □

Our next result informally states that the set of min-CFs is relatively compact in the space of concave pseudo-distribution functions on \([0,\infty )^{d}\).

Proposition 4.2.

Any sequence of min-CFs (ψn) on \(\mathbb {R}^{d}\) has a subsequence that converges pointwise to a concave function ψ (at each of its points of continuity) such that ψ = 0 outside of \([0,\infty )^{d}\) and \(\psi (\boldsymbol {t})\to \psi _{\infty }\in [0,1]\) as \(\min \limits (t_{1},\ldots ,t_{d})\to \infty \).

Proof.

Combine Proposition 2.4(vi) with Helly’s selection theorem. □

We now give some examples of functions which are (or not) min-CFs. Our first example focuses on copula functions. Recall that a copula on \(\mathbb {R}^{d}\) is a d-dimensional df with standard uniform marginal distributions.

Proposition 4.3.

The only copula which is also a min-CF is the completely dependent copula

$$ C(\boldsymbol{u})=\min(u_{1},\dots,u_{d}), \ \boldsymbol{u} = (u_{1},\ldots,u_{d})\in[0,1]^{d}, $$

corresponding to the constant rv \(\boldsymbol {X}=(1,\ldots ,1) \in \mathbb {R}^{d}\).

Proof.

Let C be a copula function which is also a min-CF. In other words, there is a vector U with df C (and in particular, standard uniform marginals) and \(\boldsymbol {X}\in \mathbb {X}_{d}\) such that

$$ \forall (t_{1},\ldots,t_{d}) \in [0,\infty)^{d}, \ \mathbb{P}(U_{1}\leq t_{1},\ldots, U_{d}\leq t_{d}) = \mathbb{E}(\min(1,t_{1} X_{1},\ldots,t_{d} X_{d})). $$

Letting, for any i, all tj except ti tend to infinity, we obtain, by the dominated convergence theorem,

$$ \forall t\geq 0, \ \mathbb{E}(\min(1,t)) = \min(1,t) = \mathbb{P}(U_{i}\leq t) = \mathbb{E}(\min(1,t X_{i})). $$

This implies that Xi and the constant 1 have the same min-CF, and thus, by Theorem 2.1, Xi = 1 almost surely. Then

$$ C(\boldsymbol{u}) = \mathbb{P}(U_{1}\leq u_{1},\ldots, U_{d}\leq u_{d}) = \mathbb{E}(\min(1,u_{1},\ldots,u_{d})) = \min(u_{1},\ldots,u_{d}) $$

for any u = (u1,…, ud) ∈ [0,1]d, completing the proof. □

This result implies that the df of the uniform distribution on [0,1] is also a min-CF. It is straightforward to show (using Proposition 2.4) that actually, a necessary and sufficient condition for the uniform df on [a, b] to be a min-CF is that a = 0, corresponding to the min-CF of the constant rv X = 1/b.

Proposition 4.3 shows that, although a min-CF is always a df by Proposition 2.4(vi), it can have a rather different structure from the df of its generating rv. We elaborate on this observation in our next result, which shows that the min-CF transformation has no fixed point.

Proposition 4.4.

There is no rv \(\boldsymbol {X}\in \mathbb {X}_{d}\) such that its df F satisfies ψX = F.

Proof.

Suppose indeed that there were such a rv \(\boldsymbol {X}=(X_{1},\ldots ,X_{d})\in \mathbb {X}_{d}\). Writing ψX(t) = F(t) for any t = (t1,…, td) ≥0 and letting \(t_{2},\ldots ,t_{d}\to \infty \), we find

$$ \forall t_{1}\geq 0, \ \psi_{X_{1}}(t_{1}) = \mathbb{E}(\min(1,t_{1} X_{1})) = \mathbb{P}(X_{1}\leq t_{1}). $$

It is thus enough to show that no univariate positive rv X = X1 can satisfy this identity. If this were the case then, by Proposition 2.4(vi), F would be continuous on \([0,\infty )\). Using the identity

$$ \forall t>0, \ t F\left( \frac{1}{t} \right) = t \psi_{X}\left( \frac{1}{t} \right) = {{\int}_{0}^{t}} \mathbb{P}(X_{1}>v) dv = {{\int}_{0}^{t}} [1-F(v)] dv $$

shows that F is actually continuously (and even infinitely) differentiable on \((0,\infty )\). By Theorem 2.5, we get

$$ \forall x>0, \ 1-F\left( \frac{1}{x} \right) = \frac{\partial}{\partial t} \left\{ t F\left( \frac{x}{t} \right) \right\} |_{{t=1}} = F(x) - x F^{\prime}(x). $$
(6)

Replacing x with 1/x in this identity immediately entails

$$ \forall x>0, \ \frac{1}{x} F^{\prime}\left( \frac{1}{x} \right) = x F^{\prime}(x) $$

and therefore

$$ \frac{d}{dx} \left[ F(x) + F\left( \frac{1}{x} \right) \right] = F^{\prime}(x) - \frac{1}{x^{2}} F^{\prime}\left( \frac{1}{x} \right) = 0 \ \text{ on } (0,\infty). $$

There is then a constant c such that F(x) + F(1/x) = c on \((0,\infty )\). Letting \(x\to \infty \) gives F(x) + F(1/x) = 1 on \((0,\infty )\). Plugging this back in Eq. 6 entails

$$ \forall x>0, \ x F^{\prime}(x) = F(x) + F\left( \frac{1}{x} \right) - 1 = 0 $$

and therefore \(F^{\prime } \equiv 0\) on \((0,\infty )\), which finally yields that F is constant on \((0,\infty )\) and thus necessarily equal to 1 on this interval. But F is also right-continuous at 0 with F(0) = 0, which is an obvious contradiction. □

The above result raises the following question: can we compare the df F of a rv \(\boldsymbol {X}\in \mathbb {X}_{d}\) and its min-CF ψX? In other words, although we know that ψXF, can we write that F is in general greater or less than ψX? Our next result examines this question if F is a copula.

Lemma 4.5.

Let X follow a copula. Then the copula \(C_{\psi}\) corresponding to the df \(\psi_{\boldsymbol{X}}\) satisfies

$$ C_{\psi}(\boldsymbol{u})\ge \psi_{\boldsymbol{X}}(\boldsymbol{u}), \ \boldsymbol{u}\in[0,1]^{d}. $$

Proof.

First of all, the univariate margins ψi of ψX are identical and given by

$$ \psi(t)= \psi_{i}(t) = \mathbb{E}\left( \min(1,t X_{i})\right) = \!{{\int}_{0}^{1}} \mathbb{P}\left( X_{i}>\frac{s}{t}\right) ds = \left\{\!\begin{array}{ll} t/2 ,&t\in[0,1],\\ 1-1/(2t),& t\ge 1. \end{array}\right. $$

The corresponding quantile function is

$$ \psi^{-1}(u) = \psi_{i}^{-1}(u) = \left\{\begin{array}{ll} 2u ,&u\in[0,1/2],\\ 1/[2(1-u)],& u\in[1/2,1). \end{array}\right. $$

Note that ψ− 1(u) ≥ u, u ∈ [0,1). The copula Cψ is then given by

$$ \begin{array}{@{}rcl@{}} C_{\psi}(\boldsymbol{u})&=& \psi_{\boldsymbol{X}}\left( \psi^{-1}(u_{1}),\dots,\psi^{-1}(u_{d}) \right)\\ &=& \mathbb{E}\left( \min\left( 1,\psi^{-1}(u_{1})X_{1},\dots,\psi^{-1}(u_{d})X_{d}\right) \right)\\ &\ge& \mathbb{E}\left( \min\left( 1,u_{1}X_{1},\dots,u_{d}X_{d}\right) \right)\\ &=& \psi_{\boldsymbol{X}}(\boldsymbol{u}), \ \boldsymbol{u}\in[0,1)^{d}, \end{array} $$

which is the result. □

The above proof shows that Lemma 4.5 is actually true for each rv X whose min-CF satisfies \(\psi _{i}^{-1}(u)\ge u\), which is equivalent to ψi(u) ≤ u, for each u ∈ (0,1) and 1 ≤ i ≤ d. This is for instance the case if \(\mathbb {E}(X_{i})\le 1\), 1 ≤ i ≤ d, since then

$$ \forall t\geq 0, \ \psi_{i}(t) = \mathbb{E}\left( \min(1,t X_{i})\right) \leq t\,\mathbb{E}(X_{i}) \leq t. $$
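The inequality of Lemma 4.5 is also easy to check numerically; the sketch below (ours) evaluates both sides for X with independent standard uniform components, whose common margin is precisely the ψi computed above.

```python
# Sketch (ours): numerical check of Lemma 4.5 for X = (U1, U2) with independent
# standard uniform components; its margins psi_i are the ones computed above.
import numpy as np
from scipy.integrate import quad

def psi_X(t1, t2):
    # Lemma 2.3 with independent U(0,1) components: P(U > s/t) = max(0, 1 - s/t)
    integrand = lambda s: max(0.0, 1.0 - s / t1) * max(0.0, 1.0 - s / t2)
    return quad(integrand, 0.0, 1.0)[0]

def psi_inv(u):
    # quantile function of the common margin psi(t) = t/2 on [0,1], 1 - 1/(2t) beyond
    return 2.0 * u if u <= 0.5 else 1.0 / (2.0 * (1.0 - u))

for u1, u2 in [(0.2, 0.7), (0.5, 0.5), (0.9, 0.3)]:
    C_psi = psi_X(psi_inv(u1), psi_inv(u2))   # copula of psi_X at (u1, u2)
    print((u1, u2), C_psi, ">=", psi_X(u1, u2))
```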

5 Min-characteristic Functions for Arbitrary Random Vectors

The concept of min-CF as we defined it can be extended to a rv \(\boldsymbol {X}=(X_{1},\dots ,X_{d})\) with not necessarily strictly positive components by applying a continuous one-to-one transformation T mapping \(\mathbb {R}\) onto \((0,\infty )\), and considering

$$ (t_{1},\ldots,t_{d})\mapsto \mathbb{E}(\min(1,t_{1}T(X_{1}),\dots,t_{d}T(X_{d}))). $$

The assumptions on T ensure that, by Theorem 2.1, such a mapping identifies the distribution of X. The purpose of this section is to show an example of such a construction and explore some of its properties.

A particularly simple and convenient transformation T is the exponential function \(T(x)=\exp (x)\), \(x\in \mathbb {R}\). For an arbitrary rv X, this leads us to consider the mapping

$$ \psi_{\exp(\boldsymbol{X})}(\boldsymbol{t}) \!:=\! \mathbb{E}(\min(1,t_{1}\exp(X_{1}),\dots,t_{d}\exp(X_{d}))),\! \ \boldsymbol{t} = (t_{1},\dots,t_{d}) \in [0,\infty)^{d}. $$

Example 5.1.

Let (U, V ) be a bivariate rv which follows a copula, say C. Then the corresponding Kendall df is

$$ K(s):= \mathbb{P}(C(U,V)\le s), \ s\in[0,1]. $$

This function was introduced in Genest and Rivest (1993) in the context of the class of Archimedean copulas

$$ C(u,v)=\varphi^{[-1]}(\varphi(u)+\varphi(v)), $$

where \(\varphi :[0,1]\to [0,\infty ]\) is a convex, continuous and strictly decreasing function with φ(1) = 0, and

$$ \varphi^{[-1]}(t)=\left\{\begin{array}{ll} \varphi^{-1}(t),&0\le t\le \varphi(0),\\ 0,&\varphi(0)< t\le \infty; \end{array}\right. $$

see Theorem 4.1.4 in Nelsen (2006). Such an Archimedean copula has Kendall df

$$ K(s)= s-\frac{\varphi(s)}{\varphi^{\prime}(s)}, \ s\in[0,1], $$

and the Kendall df characterizes the generator φ; see Genest and Rivest (1993).

Choose for example φp(s) = (1 − s)p, s ∈ [0,1], with \(p\in [1,\infty )\). The pertaining Archimedean copula is given by

$$ C_{p}(u,v)=\max\left( 0,1-\left\Vert(1-u,1-v)\right\Vert_{p}\right), \ u,v\in[0,1], $$

and Kendall’s df is

$$ K(s)=\frac 1p + s\left( 1-\frac 1p \right), \ s\in[0,1]. $$

Thus K(0) = 1/p > 0, which means that the rv Cp(U, V ) has an atom at 0 and therefore the ordinary min-CF cannot be used here. We then transform the rv Cp(U, V ) by the exponential function and obtain, for t ∈ (0,1] and p ≥ 1,

$$ \begin{array}{@{}rcl@{}} \psi_{\exp(K)}(t)&=& \mathbb{E}\left( \min\left( 1,t\exp(C_{p}(U,V))\right)\right)\\ &=& {{\int}_{0}^{1}} \mathbb{P}\left( \exp(C_{p}(U,V))>\frac ut\right) du\\ &=&\left\{\begin{array}{ll} t\left( 1+\left( 1-\dfrac 1p\right)(\exp(1)-2)\right),&0<t\le\exp(-1),\\ 2\!\left( \!1 - \dfrac 1p\right)+\left( \dfrac 2p - 1\! \right)t + \left( \!1 - \dfrac 1p\right)\log(t),&\exp(-1)\le t\le 1. \end{array}\right. \end{array} $$

This is itself a df having density

$$ \psi_{\exp(K)}^{\prime}(t)=\left\{\begin{array}{ll} 1+\left( 1-\dfrac 1p\right)(\exp(1)-2) ,&0<t\le\exp(-1),\\ \dfrac 2p-1 + \left( 1-\dfrac 1p\right)\dfrac 1t, &\exp(-1)<t\le 1. \end{array}\right. $$

The particular case p = 1 yields \(\psi _{\exp (K)}(t)=t\), 0 ≤ t ≤ 1, i.e., the df of the uniform distribution on [0,1].
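These closed forms are easy to confirm by simulation: by the Kendall df above, the rv Cp(U, V) equals 0 with probability 1/p and is uniformly distributed on (0,1) otherwise, so one can sample from this mixture directly, as in the following sketch of ours (the values of p and t are illustrative).

```python
# Sketch (ours): Monte Carlo check of the log-min-CF of Example 5.1. By the
# Kendall df K, the rv W = C_p(U,V) is 0 with probability 1/p and uniform on
# (0,1) otherwise, so we can sample W from this mixture directly.
import numpy as np

rng = np.random.default_rng(4)
p, n, t = 3.0, 400_000, 0.6              # illustrative values; exp(-1) <= t <= 1
W = np.where(rng.uniform(size=n) < 1.0 / p, 0.0, rng.uniform(size=n))

empirical = np.mean(np.minimum(1.0, t * np.exp(W)))
closed_form = (2.0 * (1.0 - 1.0 / p) + (2.0 / p - 1.0) * t
               + (1.0 - 1.0 / p) * np.log(t))
print(empirical, closed_form)
```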

In general, we clearly have, by monotonicity of the exponential function, that for \(t_{1},\dots ,t_{d}>0\):

$$ \psi_{\exp(\boldsymbol{X})}(\boldsymbol{t}) = \mathbb{E}(\exp(\min(0,X_{1}+\log(t_{1}),\dots,X_{d}+\log(t_{d})) ) ). $$

Replacing \(\log (t_{i})\) in the above formula by \(x_{i}\in \mathbb {R}\), 1 ≤ id, leads to the following definition.

Definition 5.1.

The log-min-CF of \(\boldsymbol {X}=(X_{1},\dots ,X_{d})\) is the function \(\psi ^{\exp }_{\boldsymbol {X}}\) on \(\mathbb {R}^{d}\) defined by

$$ \forall \boldsymbol{x}=(x_{1},\dots,x_{d})\in\mathbb{R}^{d}, \ \psi^{\exp}_{\boldsymbol{X}}(\boldsymbol{x}) = \mathbb{E}(\exp(\min(0,X_{1}+x_{1},\dots,X_{d}+x_{d}) ) ). $$

Note that obviously \(\psi ^{\exp }_{\boldsymbol {X}+\boldsymbol {a}}(\boldsymbol {x})=\psi ^{\exp }_{\boldsymbol {X}}(\boldsymbol {x}+\boldsymbol {a})\), for any \(\boldsymbol {a},\boldsymbol {x}\in \mathbb {R}^{d}\). The following two results are immediate consequences of Theorems 2.1 and 3.1.

Corollary 5.2.

Let \(\boldsymbol {X}=(X_{1},\dots ,X_{d})\) and \(\boldsymbol {Y}=(Y_{1},\dots ,Y_{d})\) be two rvs. Then

$$ \boldsymbol{X}\overset{d}{=}\boldsymbol{Y} \Leftrightarrow \forall \boldsymbol{x}\in\mathbb{R}^{d}, \ \psi^{\exp}_{\boldsymbol{X}}(\boldsymbol{x})=\psi^{\exp}_{\boldsymbol{Y}}(\boldsymbol{x}). $$

Corollary 5.3.

Let X(n), X be rvs. Then

$$ \boldsymbol{X}^{(n)}\overset{d}{\longrightarrow}\boldsymbol{X}\Leftrightarrow \psi^{\exp}_{\boldsymbol{X}^{(n)}} \to \psi^{\exp}_{\boldsymbol{X}} \ \text{ pointwise}. $$
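The following short sketch (an added illustration with arbitrary choices, assuming only numpy) exemplifies Corollary 5.3 in dimension one: if X(n) is the maximum of n standard exponential rvs centred by \(\log n\), then X(n) converges in distribution to a standard Gumbel rv, and the empirical log-min-CF at a fixed point approaches that of the Gumbel limit.

```python
import numpy as np

rng = np.random.default_rng(2)

def log_min_cf(sample, x):
    """Empirical log-min-CF of a univariate sample, evaluated at the point x."""
    return np.exp(np.minimum(0.0, sample + x)).mean()

x, m = 0.3, 10**6
# X^{(n)}: maximum of n standard exponentials, centred by log(n); simulated by inverting
# its df (1 - exp(-z))^n. The limit distribution is standard Gumbel.
for n in (5, 50, 500):
    u = rng.random(m)
    xn = -np.log(1.0 - u ** (1.0 / n)) - np.log(n)
    print(n, log_min_cf(xn, x))
print("Gumbel limit:", log_min_cf(rng.gumbel(size=m), x))
```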

We now give two examples of explicit calculation of the log-min-CF.

Example 5.2.

Let \(B=(B_{t})_{t\ge 0}\) be a standard Brownian motion, choose t > 0 and put

$$ X := B_{t}-\frac t2. $$

Then \(\exp (X)\) follows a log-normal distribution with mean one. From Falk (2019, Lemma 1.10.6) we find that, for any x,

$$ \mathbb{E}(\max(1,\exp(X+x))) = {\Phi}\left( \frac{\sqrt t}2- \frac x{\sqrt t} \right) +\exp(x) {\Phi}\left( \frac{\sqrt t}2+\frac x{\sqrt t} \right) $$

where Φ denotes the df of the univariate standard normal distribution. From the identity \(\min \limits (a,b)=a+b-\max \limits (a,b)\), \(a,b\in \mathbb {R}\), we obtain the log-min-CF of X = Btt/2 as:

$$ \begin{array}{@{}rcl@{}} \psi_{X}^{\exp}(x)&=& \mathbb{E}(\min(1,\exp(X+x)))\\ &=& 1- {\Phi}\left( \frac{\sqrt t}2- \frac x{\sqrt t} \right) + \exp(x)\left( 1- {\Phi}\left( \frac{\sqrt t}2+\frac x{\sqrt t} \right) \right), \ x\in\mathbb{R}, \end{array} $$

and, thus, that of Bt:

$$ \psi_{B_{t}}^{\exp}(x)=\psi_{X}^{\exp}(x+t/2)= 1- {\Phi}\left( - \frac x{\sqrt t} \right) + \exp\left( x + \frac t2\right) \left( 1 - {\Phi}\left( \sqrt t+\frac x{\sqrt t} \right) \right). $$
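A minimal Monte Carlo check of this closed form (an added sketch assuming numpy and scipy, with illustrative values of t and x) simulates \(B_{t}\sim N(0,t)\) directly and compares the empirical log-min-CF with the expression above.

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(3)

def psi_bt_closed(x, t):
    """Closed form of the log-min-CF of B_t obtained above."""
    return (1.0 - norm.cdf(-x / np.sqrt(t))
            + np.exp(x + t / 2.0) * (1.0 - norm.cdf(np.sqrt(t) + x / np.sqrt(t))))

t, x = 2.0, -0.4
bt = rng.normal(0.0, np.sqrt(t), size=10**6)            # B_t ~ N(0, t)
mc = np.exp(np.minimum(0.0, bt + x)).mean()             # Monte Carlo log-min-CF of B_t at x
print(mc, psi_bt_closed(x, t))                          # the two values should nearly agree
```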

Example 5.3.

Let \(\boldsymbol {\eta }=(\eta _{1},\dots ,\eta _{d})\) follow a max-stable distribution with standard negative exponential margins, i.e. there exists a D-norm ||⋅||D on \(\mathbb {R}^{d}\) such that \(\mathbb {P}(\boldsymbol {\eta }\le \boldsymbol {x})=\exp (-||{\boldsymbol {x}}||_{D})\), \(\boldsymbol {x}\le \boldsymbol {0}\in \mathbb {R}^{d}\). The df of each ηi is \(\mathbb {P}(\eta _{i}\le x)=\exp (x)\), x ≤ 0. The log-min-CF of η is

$$ \begin{array}{@{}rcl@{}} \psi_{\boldsymbol{\eta}}^{\exp}(\boldsymbol{x}) &=& \mathbb{E}\left( \min\left( 1,\exp(\eta_{1}+x_{1}),\dots,\exp(\eta_{d}+x_{d})\right)\right)\\ &=:&\mathbb{E}\left( \min\left( 1,\exp(x_{1})U_{1},\dots,\exp(x_{d})U_{d}\right)\right), \end{array} $$

where \(\boldsymbol {U}=(U_{1},\dots ,U_{d})\) follows an extreme value copula on [0,1]d:

$$ \mathbb{P}(\boldsymbol{U}\le\boldsymbol{u})=\exp\left( -\left\Vert\left( \log(u_{1}),\dots,\log(u_{d})\right)\right\Vert_{D} \right), \ \boldsymbol{u}=(u_{1},\dots,u_{d})\in(0,1]^{d}. $$

Every extreme value copula has this representation; see Falk (2019, Equation (3.10)).
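For a concrete numerical sketch (an assumed special case, added purely for illustration and using only numpy), take the D-norm to be the L1-norm: the components of η are then independent standard negative exponential rvs, and the \(U_{i}=\exp (\eta _{i})\) are independent uniform rvs on (0,1] (the independence copula, which is an extreme value copula). The two representations of the log-min-CF above can then be evaluated by Monte Carlo.

```python
import numpy as np

rng = np.random.default_rng(4)

# Assumed special case: D-norm = L1-norm, so eta_1, ..., eta_d are independent standard
# negative exponential rvs and U_i = exp(eta_i) are independent Uniform(0, 1] rvs.
d, m = 3, 10**6
x = np.array([-0.2, 0.0, 0.5])
eta = -rng.exponential(size=(m, d))                          # standard negative exponential margins
lhs = np.exp(np.minimum(0.0, (eta + x).min(axis=1))).mean()  # E(exp(min(0, eta_i + x_i)))
rhs = np.minimum(1.0, (np.exp(x) * np.exp(eta)).min(axis=1)).mean()  # E(min(1, exp(x_i) U_i))
print(lhs, rhs)                                              # identical up to floating point error
```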

We conclude this section with an observation regarding the log-min-CF of a sum of independent rvs. To this end, let X and Y be two independent rvs in \(\mathbb {R}^{d}\). The distribution of the sum X + Y is then characterized by its log-min-CF, which we view as the product of the log-min-CFs of X and Y, defined for \(\boldsymbol {s}\in \mathbb {R}^{d}\) by

$$ \begin{array}{@{}rcl@{}} \left( \psi_{\boldsymbol{X}}^{\exp}*\psi_{\boldsymbol{Y}}^{\exp}\right)(\boldsymbol{s})\! &:=&\psi_{\boldsymbol{X}+\boldsymbol{Y}}^{\exp}(\boldsymbol{s})\\ &=&\mathbb{E}\left( \min\left( 1, \exp(X_{1} + Y_{1} + s_{1}),\dots,\exp(X_{d} + Y_{d}+s_{d})\right)\right). \end{array} $$

This multiplication operation can be extended to finitely many rvs in an obvious way. In particular, we can establish a lower bound for this product in the case of univariate rvs. This is the content of the next result.

Proposition 5.4.

Let \(X_{1},X_{2},\dots ,X_{n}\) be independent rvs in \(\mathbb {R}\). Then we have for \(s\in \mathbb {R}\) and \(\lambda _{1},\dots ,\lambda _{n}\ge 0\), \({\sum }_{i=1}^{n}\lambda _{i}=1\),

$$ \left( \psi_{X_{1}}^{\exp}*\dots*\psi_{X_{n}}^{\exp} \right)(s) \ge {\prod}_{i=1}^{n} \psi_{X_{i}}^{\exp}(\lambda_{i} s). $$

Proof.

We show the case n = 2 first. We have, for arbitrary numbers a, b ≥ 0,

$$ \min(1,ab)\ge \min(1,a)\min(1,b). $$

Suppose that X and Y are independent. This yields

$$ \begin{array}{@{}rcl@{}} \left( \psi_{X}^{\exp}*\psi_{Y}^{\exp}\right)(s)&=& \mathbb{E}\left( \min\left( 1,\exp(X+Y+s)\right)\right)\\ &=& \mathbb{E}\left( \min\left( 1,\exp(X+\lambda s)\exp(Y+(1-\lambda)s) \right) \right)\\ &\ge& \mathbb{E}\left( \min\left( 1,\exp(X+\lambda s)\right)\min\left( 1,\exp(Y+(1-\lambda)s) \right) \right)\\ &=& \mathbb{E}\left( \min\left( 1,\exp(X + \lambda s)\right)\right) \mathbb{E}\left( \min\left( 1,\exp(Y+(1 - \lambda) s)\right)\right)\\ &=& \psi_{X}^{\exp}(\lambda s)\psi_{Y}^{\exp}((1-\lambda)s), \ \lambda\in[0,1], s\in\mathbb{R}. \end{array} $$

This proves the result for n = 2; the general case follows by a straightforward induction. □

In particular we obtain, for independent copies \(X_{1},X_{2},\dots ,X_{n}\) of X, the lower bound

$$ \left( \psi_{X}^{\exp}\right)^{*n}(s)\ge \left( \psi_{X}^{\exp}\left( \frac sn\right) \right)^{n}. $$
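The following minimal sketch (an added illustration using numpy, with X standard normal as an assumed choice and \(\lambda _{i}=1/n\)) checks this lower bound by Monte Carlo.

```python
import numpy as np

rng = np.random.default_rng(5)

def log_min_cf(sample, s):
    """Empirical log-min-CF of a univariate sample, evaluated at the point s."""
    return np.exp(np.minimum(0.0, sample + s)).mean()

# Illustrative check of the bound for n i.i.d. standard normal rvs and lambda_i = 1/n.
n, s, m = 4, 1.5, 10**6
x = rng.standard_normal((m, n))
lhs = log_min_cf(x.sum(axis=1), s)       # log-min-CF of X_1 + ... + X_n at s
rhs = log_min_cf(x[:, 0], s / n) ** n    # lower bound (psi_X^exp(s/n))^n
print(lhs, rhs, lhs >= rhs)
```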

6 Conclusion and Perspectives

This paper introduces the concept of min-CF as a way to identify probability distributions concentrated on \((0,\infty )^{d}\). The min-CF is in fact a continuous and concave df. We have studied several aspects of this notion, including a theorem linking convergence in distribution to pointwise convergence of min-CFs, the functional convergence of the sample min-CF for independent and identically distributed random variables, and a construction of the min-CF for arbitrary rvs.

It is natural to think about the probabilistic and statistical applications of the notion of min-CF. In this paper, we use this concept to contribute to the theory of D-norms by showing that the canonical mapping from the set of D-norms to the set of dual D-norms is one-to-one when restricted to D-norms generated by componentwise positive generators (Proposition 2.6). We further suggest an estimator of a D-norm based on the empirical min-CF, building on our Proposition 2.9, in Example 3.1. D-norms form the skeleton of multivariate extreme value theory (Falk 2019), which, as the framework adapted to the simultaneous analysis of extremal events, is part of the toolbox for risk management. The estimator suggested by the min-CF thus provides an alternative to existing methods in multivariate extreme value theory; the investigation of its theoretical and numerical properties appears to be an interesting avenue of research.

Another potential application of the notion of min-CF is goodness-of-fit testing. Theorem 3.2 provides the functional convergence of the sample min-CF to its population counterpart. Paired with explicit calculations of min-CFs for parametric families, such as those in Examples 2.1–2.5, this makes it natural to define goodness-of-fit testing procedures that measure the gap between the empirical min-CF and the min-CF of the hypothesized distribution. Of course, such procedures already exist using standard CFs such as the Fourier or Laplace transforms; however, a goodness-of-fit procedure based on the min-CF is likely to be most interesting in cases such as that of the Generalized Pareto distribution, for which the Fourier transform does not have a simple closed form although the min-CF does (see Example 2.3). An even more relevant development would be the assessment of the performance of such a goodness-of-fit procedure in the context of Peaks-Over-Threshold modeling, which is a major part of univariate extreme value analysis. In this framework, the Generalized Pareto distribution naturally arises as an approximation of the distribution of exceedances over a high threshold (see Beirlant et al. 2004), and examining the performance of a goodness-of-fit testing procedure based on the min-CF in this context appears to be a stimulating problem, not least because standard post-inference model checking largely seems to rely on graphical tools such as QQ-plots.