Mean Values: A Multicriterial Analysis

Podinovski, Vladislav V.; Nelyubin, Andrey P.

doi:10.1007/978-3-031-31654-8_17


Part of the book series: Springer Optimization and Its Applications (SOIA, volume 202)


Abstract

We introduce new notions of mean values based on ideas of multicriteria optimization. The distances from the current point to all points in the sample are regarded as elements of a vector estimate. We introduce preference relations on the set of all such vectors, based on the information about the preferences of the decision maker, who could be a statistician, analyst or researcher. Such preference relations reflect the distances between points, including the case in which all distances are equally important. We define the mean values as the points whose corresponding vector estimates are nondominated with respect to the defined preference relation, and investigate their properties. Such mean values turn out to be multi-valued. We further explore the relationship between the new notions of mean values and their conventional definitions, and suggest computational approaches to the calculation of the suggested new means.


1 Introduction

Mean values are widely used in management, economics, sociology, engineering and other areas of theory and practice. In statistics (see, for example, [8, 22]), mean values are aggregate representations of the varying characteristics of a group of homogeneous objects. Mean values cancel out random variations of a particular characteristic and tend to represent the effect caused by the main factors affecting it. Mean values allow us to compare the levels of the same characteristic in different groups of objects and to investigate the causes of such differences.

It is known that it is impossible to define a universally applicable notion of the mean value which satisfies all desirable properties [1, 8]. Instead, different notions of the mean value are required for different problems and situations. However, in some applications, it may be unclear which of the known mean values should be used, and different means may point to different conclusions. Policy recommendations in such situations may become problematic [6, 8, 10, 12, 16].

Grabisch et al. [7] regarded mean values as idempotent aggregation functions and concluded that the class of such functions “is huge, making the problem of choosing the right function (or family) for a given application a difficult one”.

In this paper, we consider new approaches to the definition of the mean value based on the ideas and methods of multicriteria optimization. Such means turn out to be multi-valued, i.e., represented by sets of points. These allow two interpretations, either as the range of possible mean values in some specific situations (characterized by scale properties, such as equal importance or ordinality, and/or transfer principles), or as whole sets for the given sample.

2 Definition of Mean Values as Nondominated Points

Let X be a set of n ≥ 2 real numbers, referred to as data or points. These elements are typically obtained by measuring some characteristic:

$$ X=\left\{{x}_1,{x}_2,\dots, {x}_n\right\}. $$

(1)

These data are assumed homogeneous in the sense that they are obtained by utilizing the same scale of measurement [21, 23]. We assume that the data (1) are quantitative, i.e., the measurement is performed either on the interval scale or on the ratio scale [15].

The elements of the set (1) can be arranged in non-decreasing or non-increasing order:

$$ {X}_{\uparrow }=\langle {x}_{(1)},{x}_{(2)},\dots, {x}_{(n)}\rangle ;\quad {X}_{\downarrow }=\langle {x}_{\left[1\right]},{x}_{\left[2\right]},\dots, {x}_{\left[n\right]}\rangle , $$

(2)

where x₍₁₎ ≤ x₍₂₎ ≤ … ≤ x_(n) and x_[1] ≥ x_[2] ≥ … ≥ x_[n]. In statistics, the set (1) is typically referred to as a sample and its non-decreasing sequence X_↑ as a variational series.

Let x be an arbitrary fixed number (a point in Re). Its distance from any point x_i from X is given by y_i = |x - x_i|. Then the distance from x to the dataset X can be characterized by the vector y = (y₁, y₂, …, y_n). We can view this vector as the value of the vector criterion f(x) = (f₁(x), f₂(x), …, f_n(x)), where f_i(x) = |x - x_i|, which is an element of the nonnegative orthant $ {\operatorname{Re}}_{+}^n={\left[0,+\infty \right)}^n $.

Let P^Γ be a preference relation (strict partial order) on $ {\operatorname{Re}}_{+}^n $, where Γ is information about the preferences with respect to distance: if yP^Γy′, where y = f(x) and y′ = f(x′), then the point x is closer to the dataset X than x′.

The relation P^Γ generates the corresponding relation P_Γ on the numeric axis Re: xP_Γx′ ⇔ f(x)P^Γf(x′).

Therefore, any candidate that we may choose as the closest to X and representing the set X must be nondominated under P_Γ. If the set G^Γ(X) of points nondominated under P_Γ is externally stable, we refer to all such points as pn-means (principal new means) and, more specifically (reflecting the information Γ), as the means with respect to P_Γ.

If there is no further information about the preferences of the decision maker (DM) on $ {\operatorname{Re}}_{+}^n $, we obtain the Pareto relation P^∅ defined as follows:

$$ {yP}^{\varnothing }z\iff {y}_i\le {z}_i,i=1,2,\dots, n;y\ne z. $$

Relation P^∅ generates the Pareto relation P_∅ on Re: xP_∅x′ ⇔ f(x)P^∅f(x′).

Theorem 1

The set of all means of the dataset (1) with respect to P_∅ is the segment G^∅(X) = $ \overline{X} $ = [x₍₁₎, x_(n)], where x₍₁₎ = min_i∈N x_i and x_(n) = max_i∈N x_i, N = {1, 2, …,n}. This set is externally stable.

Therefore, the notion of the means with respect to P_∅ is equivalent to the means in the sense of Cauchy.

Proofs of this and the following theorems can be found in [19, 20].

Let us note that, if the function φ is increasing on Re₊, then replacing the original criteria f_i(x) = |x - x_i| by φ(f_i(x)) does not change the set G^∅(X). For example, one can use “smooth” criteria f_i(x) = (x - x_i)². Therefore, the original use of the formula f_i(x) = |x - x_i| as a measure of distance is not essential and is not a limiting assumption for the suggested approach.

3 Mean Values for Equally Important Criteria

In this section, we assume that all criteria are equally important [17] and denote this information E. In this case, the distance from the point x to the dataset X is represented by the preference relation P_E on Re, which is defined by the two equivalent decision rules [17], where f_i(x) = |x - x_i|:

$$ {xP}_E{x}^{\prime}\iff \left({f}_{(1)}(x)\le {f}_{(1)}\left({x}^{\prime}\right),{f}_{(2)}(x)\le {f}_{(2)}\left({x}^{\prime}\right),\dots, {f}_{(n)}(x)\le {f}_{(n)}\left({x}^{\prime}\right)\right), $$

and at least one of these inequalities is strict;

$$ {xP}_E{x}^{\prime}\iff \left({f}_{\left[1\right]}(x)\le {f}_{\left[1\right]}\left({x}^{\prime}\right),{f}_{\left[2\right]}(x)\le {f}_{\left[2\right]}\left({x}^{\prime}\right),\dots, {f}_{\left[n\right]}(x)\le {f}_{\left[n\right]}\left({x}^{\prime}\right)\right), $$

and at least one of these inequalities is strict.

In this case, the pn-means with respect to P_E, i.e., the elements of the set G^E(X), are the points on the numerical axis which are nondominated under P_E.

Theorem 2

We have G^E(X) ⊆ G^∅(X) = $ \overline{X} $, and the set G^E(X) is externally stable.

Note that, if the function φ is increasing on Re₊, then replacing the original criteria f_i(x) by the criteria φ(f_i(x)) changes neither the relation P_E nor the set G^E(X).

Let us consider examples of sets G^E(X) constructed according to the methods described in Sect. 5.

Example 1

Let n = 3 and X = {1, 2, 5}. In this example, G^E(X) = [1.5, 3].

In the above example, the set G^E(X) is a single line segment. However, for large n, this set may be the union of several segments, excluding their endpoints.

Example 2

For n = 6 and different sets X, we have:

$$ {G}^E\left(\left\{10,11,15,61,107,110\right\}\right)=\left[10.5,83\right)\cup \left(83.5,85\right)\cup \left(106.5,108\right); $$

$$ {G}^E\left(\left\{10,11,40,55,70,110\right\}\right)=\left[10.5,18\right)\cup \left(18,67.5\right)\cup \left(68,75\right); $$

$$ {G}^E\left(\left\{10,57,61,64,109,110\right\}\right)=\left(56.5,57.5\right)\cup \left(58.5,88.5\right)\cup \left(108,109.5\right]. $$

Examples 1 and 2 also illustrate the following result.

Theorem 3

Let the distance between two adjacent elements x_(i) and x_(i + 1) of the variational series (2) be the smallest among all other pairs of adjacent elements of this series, and let these two elements be uniquely defined. Then the midpoint x^c = ½(x_(i) + x_(i + 1)) is an element of G^E(X). Moreover, if x_(i) is x₍₁₎ or if x_(i + 1) is x_(n), then x^c is the left or, respectively, right, endpoint of the set G^E(X).

If x₍₁₎ ≠ x_(n) and (x₍₁₎, x_(n)) ⊄ G^E(X), then, for some values of the parameter s, the power mean

$$ {g}^s(X)={\left(\frac{1}{n}\sum \limits_{i=1}^n{\left({x}_i\right)}^s\right)}^{1/s},s\ne 0, $$

is not a mean with respect to P_E. This is because, as s increases on Re, the function g^s(X), extended to preserve continuity, passes through all values from the interval (x₍₁₎, x_(n)) [8]. However, we have the following result:

Theorem 4

The arithmetic mean is a mean with respect to P_E, i.e., g¹(X)∈G^E(X).

Example 3

According to Example 2, for X = {10, 57, 61, 64, 109, 110} we have: G^E(X) = (56.5, 57.5) ∪ (58.5, 88.5) ∪ (108, 109.5]. In this example, the geometric mean g⁰(X) = 54.66∉G^E(X) and harmonic mean g⁻¹(X) = 35.75∉G^E(X), and g¹(X) = 68.5∈G^E(X). In Example 1, for X = {1, 2, 5}, we have G^E(X) = [1.5, 3]. Here, the quadratic mean g²(X) = 3.162∉G^E(X), but g¹(X) = 2.67∈G^E(X).
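The numerical values quoted in Example 3 can be reproduced with a short sketch. The helper name `power_mean` is our own (it does not appear in the chapter), and we assume that the case s = 0 is handled as the geometric mean, i.e., as the continuous extension of g^s mentioned above.

```python
import math

def power_mean(data, s):
    """Power mean g^s of positive data.

    Assumption: s = 0 is treated as the geometric mean,
    the limit of g^s as s tends to 0.
    """
    n = len(data)
    if s == 0:
        return math.exp(sum(math.log(x) for x in data) / n)
    return (sum(x ** s for x in data) / n) ** (1.0 / s)

X = [10, 57, 61, 64, 109, 110]
for s in (-1, 0, 1):
    # harmonic, geometric and arithmetic means of X
    print(s, round(power_mean(X, s), 2))

# quadratic mean of the dataset from Example 1
print(2, round(power_mean([1, 2, 5], 2), 3))
```

Membership in G^E(X) is then a matter of comparing each value with the intervals listed in Example 2.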

Theorem 5

The median is a mean with respect to P_E, i.e., if n is an odd integer and the median is unique, we have $ \mu (X)={x}_{\left(\frac{n+1}{2}\right)}\in {G}^E(X) $. If n is an even number, the median is not unique and we have $ \mu (X)=\left[{x}_{\left(\frac{n}{2}\right)},{x}_{\left(\frac{n}{2}+1\right)}\right]\subseteq {G}^E(X) $.

Examples 1 and 2 provide illustrations to the above theorem.

The following peculiarity of the means with respect to P_E should be noted: if the points x_i∈X and x_j∈X, x_i < x_j, are included in G^E(X), then a point x_k∈X such that x_i < x_k < x_j may not belong to G^E(X).

Example 4

For n = 7 and different sets X, we have:

$$ {X}^{\prime }=\left\{1,2,3,6,8,9,11\right\},{G}^E\left({X}^{\prime}\right)=\left[2,3\right)\cup \left(3,8.5\right]; $$

$$ {X}^{{\prime\prime} }=\left\{1,2,3,7,8,10,11\right\},{G}^E\left({X}^{{\prime\prime}}\right)=\left[2,3\right)\cup \left(3.5,9\right]. $$

Here x₍₂₎, x₍₄₎∈G^E(X), whereas x₍₃₎∉G^E(X) for both sets X = X′ and X = X″. Moreover, the point x₍₃₎ = 3 is a punctured point of the set G^E(X′) (it is dominated under P_E by the point x₍₅₎ = 8, and in its arbitrarily small neighborhood there are points nondominated under P_E).

This feature clearly violates the very principle of constructing means as points closest to points from X, and is not consistent with the intuitive concept of a mean value. Therefore, the presence of this feature can be considered as a paradox of means with respect to P_E.

4 Mean Values for Equally Important Criteria Measured on the First Ordered Metric Scale

Let y be any vector estimate such that y_i > y_j. Consider any δ > 0 such that y_i – δ ≥ y_j + δ. Define the vector estimate z by replacing the component y_i by y_i − δ and y_j by y_j + δ. Moving from y to z reduces the larger deviation y_i from one point in the sample and increases the smaller deviation y_j from a different point, by the same amount δ. The resulting set of distances becomes closer to the ideal set of minimally possible equal deviations. Assume that, for any y and δ described above, the vector estimate z is preferred to the original vector estimate y, in the sense that z is “closer” to X than y and is therefore more suitable for the definition of the mean. Denote by Δ the information about the described principle. Such an approach is an analogue of the Pigou-Dalton principle of transfers for income distribution [2, 5]. This means that the equally important criteria have a common first ordered metric scale [4]. The preference relation P_EΔ, generated on Reⁿ by the joint information E and Δ, is defined by the following decision rule [14, 18]:

$$ \begin{aligned} {xP}_{E\Delta}{x}^{\prime}\iff {} & {f}_{\left[1\right]}(x)\le {f}_{\left[1\right]}\left({x}^{\prime}\right),\\ & {f}_{\left[1\right]}(x)+{f}_{\left[2\right]}(x)\le {f}_{\left[1\right]}\left({x}^{\prime}\right)+{f}_{\left[2\right]}\left({x}^{\prime}\right),\ \dots ,\\ & {f}_{\left[1\right]}(x)+{f}_{\left[2\right]}(x)+\dots +{f}_{\left[n\right]}(x)\le {f}_{\left[1\right]}\left({x}^{\prime}\right)+{f}_{\left[2\right]}\left({x}^{\prime}\right)+\dots +{f}_{\left[n\right]}\left({x}^{\prime}\right), \end{aligned} $$

and at least one of these inequalities is strict. In this case, the pn-means are the points that are nondominated under P_EΔ. Because P_EΔ ⊃ P_E, we have G^E(X) ⊇ G^EΔ(X).
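To make the difference between the two decision rules concrete, the following minimal sketch tests dominance under P_E (componentwise comparison of the sorted distances) and under P_EΔ (comparison of the partial sums of the distances sorted in non-increasing order). The function names are hypothetical helpers introduced only for illustration.

```python
from itertools import accumulate

def distances(x, data):
    """Vector estimate f(x) = (|x - x_1|, ..., |x - x_n|)."""
    return [abs(x - xi) for xi in data]

def dominates_E(x, x_prime, data):
    """x P_E x': sorted distances are componentwise <=, with one strict."""
    a = sorted(distances(x, data))
    b = sorted(distances(x_prime, data))
    return a != b and all(ai <= bi for ai, bi in zip(a, b))

def dominates_E_delta(x, x_prime, data):
    """x P_EDelta x': partial sums of the non-increasingly ordered
    distances are componentwise <=, with at least one strict."""
    a = list(accumulate(sorted(distances(x, data), reverse=True)))
    b = list(accumulate(sorted(distances(x_prime, data), reverse=True)))
    return a != b and all(ai <= bi for ai, bi in zip(a, b))

X = [1, 2, 3]
print(dominates_E(2, 1.5, X))        # False: 1.5 is not dominated by 2 under P_E
print(dominates_E_delta(2, 1.5, X))  # True: 1.5 is dominated by 2 under P_EDelta
```

For X = {1, 2, 3}, the point 1.5 is not dominated by 2 under P_E but is dominated under the richer relation P_EΔ, which illustrates the inclusion G^E(X) ⊇ G^EΔ(X).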

Theorem 6

The arithmetic mean is a mean with respect to P_EΔ, i.e., g¹(X)∈G^EΔ(X).

Theorem 7

If n is odd, the median (which is uniquely defined), is a mean with respect to P_EΔ, i.e., μ(X) ∈ G^EΔ(X). If n is even and the median is not uniquely defined, we only have μ(X) ∩ G^EΔ(X) ≠ ∅.

Example 5

If n = 5 and X = {1, 2, 3, 5, 11}, we have G^EΔ(X) = [3, 6], μ(X) = 3 and g¹(X) = 4.4. If n = 4 and X = {10, 11, 12, 110}, we have G^EΔ(X) = [11.5, 60], μ(X) = [11, 12] and g¹(X) = 35.75. If X = {10, 11, 20, 110}, we have G^EΔ(X) = [15.5, 60], μ(X) = [11, 20] and g¹(X) = 37.75.

Let us define the set H = {1, 2, …, h}, where h = ⌊(n + 1)/2⌋ is the integer part of (n + 1)/2.

Theorem 8

The set G^EΔ(X) is externally stable and coincides with the segment [α, β], where

$$ \alpha =\frac{1}{2}{\min}_{p\in H}\left({x}_{(p)}+{x}_{\left(n+1-p\right)}\right),\beta =\frac{1}{2}{\max}_{p\in H}\left({x}_{(p)}+{x}_{\left(n+1-p\right)}\right) $$

(3)

Example 6

For n = 5, we have h = ⌊(n + 1)/2⌋ = 3 and H = {1, 2, 3}. For X = {1, 2, 7, 8, 11}, using Theorem 8, we have:

$$ \begin{aligned} \alpha &=\frac{1}{2}\ \min \left\{{x}_{(1)}+{x}_{(5)},{x}_{(2)}+{x}_{(4)},{x}_{(3)}+{x}_{(3)}\right\}=\frac{1}{2}\ \min \left\{1+11,2+8,7+7\right\}\\ &=\frac{1}{2}\ \min \left\{12,10,14\right\}=5; \end{aligned} $$

$$ \beta =\frac{1}{2}\ \max \left\{{x}_{(1)}+{x}_{(5)},{x}_{(2)}+{x}_{(4)},{x}_{(3)}+{x}_{(3)}\right\}=\frac{1}{2}\ \max \left\{12,10,14\right\}=7. $$

Therefore, G^EΔ(X) = [α, β] = [5, 7].
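Formulae (3) translate directly into a few lines of code. The sketch below is one possible implementation; the function name `e_delta_segment` is our own, and exact rational arithmetic (`fractions.Fraction`) is used only to avoid rounding in the half-sums.

```python
from fractions import Fraction

def e_delta_segment(data):
    """Endpoints (alpha, beta) of G^{E-Delta}(X) according to formulae (3)."""
    xs = sorted(Fraction(v) for v in data)   # variational series x_(1) <= ... <= x_(n)
    n = len(xs)
    h = (n + 1) // 2                         # integer part of (n + 1) / 2
    # sums x_(p) + x_(n+1-p) for p = 1, ..., h (indices shifted to 0-based)
    sums = [xs[p - 1] + xs[n - p] for p in range(1, h + 1)]
    return min(sums) / 2, max(sums) / 2

alpha, beta = e_delta_segment([1, 2, 7, 8, 11])
print(alpha, beta)
```

Sorting dominates the cost, so the segment is obtained in O(n log n) time.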

5 On the Construction of Sets of Mean Values

For the construction of the set G^E(X), we can use known methods of multicriteria optimization developed for the construction of the sets of nondominated variants [17]. Such methods utilize families of functions that are increasing (decreasing), or at least non-decreasing (non-increasing) with respect to P_E. For example, we can solve a parametric program which minimizes the function of a single variable ψ(f(x)|c) = min_π∈Π max_i∈N {f_π(i)(x) – c_i} on the set X, by varying the vector parameter $ c\in f\left(\overline{X}\right) $. However, even if n is not very large, the number n! of terms of this function (with respect to which the maximization is performed) turns out to be unacceptably large.

Taking into account that the set X is one-dimensional, we can utilize a different approach. Namely, we can consider a dense grid with a small step h which covers the set X, and identify the nondominated (with respect to P_E) points of this grid by simple enumeration [9]. The step h depends on the required precision and can decrease in the process of calculation of the set G^E(X). We used this approach for the construction of the set G^E(X) in Examples 1 and 2.

Example 7

Let us demonstrate the construction of the set G^E(X) for X = {1, 2, 5, 9, 11}. Using a computer for the calculations, while reducing the step length h, we obtain the following results:

h = 1:	[2, 7] ∪ [9, 9].
h = 0.1:	[1.5, 7.4] ∪ [8.6, 9.4].
h = 0.01:	[1.50, 7.49] ∪ [8.51, 9.49].
h = 0.001:	[1.500, 7.499] ∪ [8.501, 9.499].
h = 0.0001:	[1.5000, 7.4999] ∪ [8.5001, 9.4999]

Using the enumeration approach with h = 0.01, we found out that the point 4.5 dominates the points 7.5 and 8.5. Similarly, the point 2.5 dominates the point 9.5. Therefore, by Theorem 2, we have G^E(X) = [1.5, 7.5) ∪ (8.5, 9.5).
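The enumeration over a grid used in Example 7 can be sketched as follows. This is a minimal illustration rather than the authors' program: the function names are hypothetical, the grid is assumed to start at x₍₁₎ and to cover the segment [x₍₁₎, x_(n)], and exact rational arithmetic is used so that comparisons of distances are not affected by rounding.

```python
from fractions import Fraction

def sorted_distances(x, data):
    """Distances |x - x_i| arranged in non-decreasing order."""
    return sorted(abs(x - xi) for xi in data)

def dominates(a, b):
    """Componentwise a <= b with at least one strict inequality."""
    return a != b and all(ai <= bi for ai, bi in zip(a, b))

def grid_means_E(data, step):
    """Grid points on [min, max] that no other grid point dominates under P_E."""
    data = [Fraction(v) for v in data]
    step = Fraction(step)
    lo, hi = min(data), max(data)
    n_steps = int((hi - lo) / step)
    grid = [lo + k * step for k in range(n_steps + 1)]
    estimates = [sorted_distances(x, data) for x in grid]
    return [x for x, fx in zip(grid, estimates)
            if not any(dominates(fy, fx) for fy in estimates)]

# Step h = 1 for the data of Example 7
print([int(x) for x in grid_means_E([1, 2, 5, 9, 11], 1)])
```

With m grid points the pairwise comparison costs O(m²n) operations after sorting, which is the price of the simple enumeration; the grid result only approximates G^E(X) unless the conditions of a result such as Theorem 9 hold.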

Let us highlight another result that may be useful in the construction of the set G^E(X).

Theorem 9

Let vector estimates of all x∈X be located at the points of some uniform grid covering X. Then, in order to test if any grid point is a mean with respect to P_E, it suffices to compare its vector estimate only with the vector estimates of all the other points of the grid.

Let us note that the uniform grid required by the conditions of Theorem 9 can always be constructed if all points in X are rational numbers. In practical applications, these would typically be integer numbers or decimal fractions.

It is worth noting that it is easier to construct the set of means G^EΔ(X) than the set G^E(X). According to Theorem 8, the set G^EΔ(X) is easily found by calculating the endpoints α and β of the segment [α, β] using formulae (3) – see Example 6.

6 On Comparing Multi-valued Means

In practice, it is important that we can compare the mean values measured on the same scale. For the means that are uniquely defined, this is a simple task of comparing the two numerical values. In the case of multi-valued means, in statistics, it is common to substitute such means by a single number, e.g., in the case of a median when n is an even number.

Suppose that the set G^Γ(X) consists of l intervals with the endpoints x¹, x²; x³, x⁴; …; x^2l-1, x^2l, and that these intervals do not intersect with each other. Define the length D^Γ(X) of the set G^Γ(X) as the sum of the lengths of all these intervals: $ {D}^{\Gamma}(X)=\sum_{k=1}^l\left|{x}^{2k}-{x}^{2k-1}\right| $. Furthermore, define $ {D}_x^{\Gamma}(X) $ as the length of the part of the set G^Γ(X) that is located to the right of the point x. It includes (part of) one interval and all the other intervals located to the right of x. The relative length $ {d}_x^{\Gamma}(X) $ is defined as the ratio $ {d}_x^{\Gamma}(X) $ = $ {D}_x^{\Gamma}(X) $ / D^Γ(X).

Because none of the points of the set G^Γ(X) has any advantages (in the sense of representing the sample) compared to its other points, any of them may be regarded as an equally valid candidate for the choice of the mean. This is analogous to the principle of insufficient reason for decision making under ignorance [13]. Using first-order stochastic dominance [11], we say that the mean G^Γ(X′) is not less than the mean G^Γ(X″) and state this as G^Γ(X′) ≽ G^Γ(X″), if $ {d}_x^{\Gamma}\left({X}^{\prime}\right) $ ≥ $ {d}_x^{\Gamma}\left({X}^{{\prime\prime}}\right) $ for each x∈Re. If the latter inequality is strict for at least one x∈Re, the former mean is greater than the latter. This relationship between the means (“is not less than”) is a partial quasi-order. The corresponding relation “is greater than” is denoted ≻ and is a partial strict order (it is irreflexive and transitive). This strict relation is essentially a probabilistic dominance relation, or a strict first-order stochastic dominance relation [11]. Note that we have $ {d}_x^{\varGamma }(X)=1-F(x) $, where F(x) is the cumulative distribution function corresponding to the uniform distribution with the density equal to 1 / D^Γ(X) on G^Γ(X) and equal to zero outside G^Γ(X).

It is clear that the relation ≽ is weak in the sense that it would typically not result in a definitive comparison of the means. Relation ≽ can be extended using the ideas of second-order stochastic dominance, but this approach does not appear to be sufficiently effective in practice either.

Another approach would be to “compress” the means that are not uniquely defined to single-valued means. However, this would lead to a loss of information, and the results of comparison would be approximate. For example, let the mean G^Γ(X) consist of several non-intersecting intervals defined by the endpoints x¹, x²; x³, x⁴; …; x^2l-1, x^2l. We can represent this mean by its centre of mass x^Γ(X) and refer to it as the centroid mean.

Example 8

Let G^E(X′) = [1, 2) ∪ (5, 8) and G^E(X″) = [1.5, 4.5] ∪ (8, 9]. We have:

$$ {x}^E\left({X}^{\prime}\right)=\left(1.5\cdot 1+6.5\cdot 3\right)/4=5.25;{x}^E\left({X}^{{\prime\prime}}\right)=\left(3\cdot 3+8.5\cdot 1\right)/4=4.375. $$

Because 5.25 > 4.375, we can accept that the mean G^E(X′) is greater than G^E(X″).
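The quantities used in this section are simple to compute once a multi-valued mean is stored as a list of disjoint intervals. The sketch below is an illustration under that assumption; the function names are hypothetical, and whether an endpoint is included or excluded is ignored because it does not affect lengths.

```python
def total_length(intervals):
    """D(X): sum of the lengths of the disjoint intervals (a, b)."""
    return sum(b - a for a, b in intervals)

def relative_length_right(intervals, x):
    """d_x(X): share of the total length located to the right of x."""
    right = sum(max(0.0, b - max(a, x)) for a, b in intervals)
    return right / total_length(intervals)

def centroid_mean(intervals):
    """Centre of mass of the union of intervals under uniform density."""
    weighted = sum((a + b) / 2 * (b - a) for a, b in intervals)
    return weighted / total_length(intervals)

G1 = [(1, 2), (5, 8)]       # G^E(X') from Example 8
G2 = [(1.5, 4.5), (8, 9)]   # G^E(X'') from Example 8
print(centroid_mean(G1), centroid_mean(G2))
print(relative_length_right(G1, 5), relative_length_right(G2, 5))
print(relative_length_right(G1, 1.2), relative_length_right(G2, 1.2))
```

Evaluating the relative lengths at x = 5 and x = 1.2 gives inequalities in opposite directions, so in this example the two sets are not comparable by the dominance relation, while their centroid means are.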

It is useful to note that, if G^Γ(X′) ≻ G^Γ(X″), then x^Γ(X′) > x^Γ(X″) [11].

It is worth noting that it is easier to compare the means G^EΔ(X′) and G^EΔ(X″) than the means G^E(X′) and G^E(X″), because the former are the segments [α′, β′] and [α″, β″], respectively. Because the graph of the function $ {d}_x^{E\Delta}(X) $ is a broken line which decreases from 1 to 0 on the single segment [α, β], G^EΔ(X′) ≽ G^EΔ(X″) is true if and only if α′ ≥ α″ and β′ ≥ β″.

For the simplified application of the mean G^EΔ, we can represent the segment [α, β] by its midpoint γ = ½ (α + β), which can be referred to as the centroid mean (with respect to P_EΔ).

Example 9

The means of the real GDP per capita in Europe calculated based on the data from Eurostat [3] are shown in Table 1 and Fig. 1.

Table 1 Mean real GDP per capita in Europe (in Euro)

Fig. 1 Means of the GDP per capita (in 1000 euro) from 2012 to 2019. The chart shows five lines, from bottom to top: α, μ, m, γ and β. The line for β dips in 2017, and the line for μ decreases until 2014; all other lines increase.

Table 1 shows that the GDP per capita m is increasing in the period from 2012 to 2019, but the median GDP per capita μ is decreasing until 2014 and is increasing afterwards. Therefore, it is impossible to make a definite conclusion about the GDP growth in the given period. However, the mean G^EΔ (defined by its boundaries α and β) is increasing (there is only an insignificant decrease of β in 2017), and the centroid mean γ is increasing over the whole period. This observation supports the conclusion that the GDP per capita in Europe has been increasing in the given period.

7 On Stability of Pn-Means

The question of stability of the means with respect to small perturbations of the data (1) is important from both theoretical and practical points of view.

Because G^∅(X) = $ \overline{X} $ = [x₍₁₎, x_(n)], a small change of the values x_i may lead only to small changes of x₍₁₎ and x_(n). Therefore, the set of the means G^∅(X) is stable.

However, the mean with respect to P_E may not be stable in the sense that a very small perturbation of a single point in X may lead to a noticeable change of the set G^E(X). The following examples illustrate this possibility.

Example 10

For X = {1, 2, 3}, we have G^E(X) = [1.5, 2.5]. However, for X^ε = {1, 2 − ε, 3}, where ε > 0 is very small, we have G^E(X^ε) = [1.5 – 0.5ε, 2]. The right endpoint of the set of the means with respect to P_E has changed by 0.5.

Example 11

For X = {10, 25, 40, 110}, we have G^E(X) = [25, 60]. However, for X^ε = {10, 25, 40 + ε, 110}, where ε > 0 is very small, we have G^E(X^ε) = [17.5, 60]. It is interesting that, although only one point in X has increased by a very small ε, the left endpoint of the set of the means (with respect to P_E) has decreased by 7.5.

Let us now consider the issue of stability of the mean with respect to P_EΔ.

Example 12

In the setting of Example 10, we have G^EΔ(X) = {2} and G^EΔ(X^ε) = [2 − ε, 2]. Here, a change of one of the data points in X by ε leads to the change of one of the endpoints of the set of the means with respect to P_EΔ by the same ε.

Example 13

Under the conditions of Example 11, we have G^EΔ(X) = [32.5, 60] and G^EΔ(X^ε) = [32.5 + 0.5ε, 60]. In this example, a change of one of the data points in X by ε results in the change of one of the endpoints of the set of the means by 0.5ε.

Consider the general case. Suppose that the dataset X stated by (1) has changed to the set X^ε = {x₁ + ε₁, x₂ + ε₂, …, x_n + ε_n}, where ε₁, ε₂, …, ε_n are arbitrary numbers.

Theorem 10

The mean with respect to P_EΔ is stable in the following sense: If X is changed to X^ε, the endpoints of the set of the means G^EΔ(X) = [α, β] do not change by more than the following value:

max{|ε₁|, |ε₂|, …, |ε_n|}.

Therefore, the means with respect to P_∅ and P_EΔ are stable with respect to small perturbations of the dataset (1), while the means with respect to P_E may be noticeably unstable.
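The bound of Theorem 10 can be checked numerically on the data of Examples 12 and 13. The sketch below recomputes the endpoints from formulae (3) before and after a perturbation; the function names are hypothetical, and the specific value ε = 1/100 is chosen only for illustration.

```python
from fractions import Fraction

def e_delta_segment(data):
    """Endpoints (alpha, beta) of G^{E-Delta}(X) according to formulae (3)."""
    xs = sorted(Fraction(v) for v in data)
    n = len(xs)
    h = (n + 1) // 2
    sums = [xs[p - 1] + xs[n - p] for p in range(1, h + 1)]
    return min(sums) / 2, max(sums) / 2

def endpoint_shift(data, eps):
    """Absolute changes of alpha and beta when x_i becomes x_i + eps_i."""
    a0, b0 = e_delta_segment(data)
    perturbed = [Fraction(x) + Fraction(e) for x, e in zip(data, eps)]
    a1, b1 = e_delta_segment(perturbed)
    return abs(a1 - a0), abs(b1 - b0)

eps = Fraction(1, 100)  # illustrative perturbation size
for data, e in (([10, 25, 40, 110], [0, 0, eps, 0]),   # setting of Example 13
                ([1, 2, 3], [0, -eps, 0])):            # setting of Example 12
    shift = endpoint_shift(data, e)
    print(shift, max(shift) <= max(abs(v) for v in e))
```

In both settings the shifts of the endpoints stay within max|ε_i|, in agreement with the statement of the theorem; this is a spot check on two datasets, not a proof.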

8 The Case of Data with Repetitions

Assume that the dataset allows repetitions, i.e., the point x₁ occurs β₁ times, x₂ occurs β₂ times, …, x_n occurs β_n times. In this case, the dataset (1) is replaced by Table 2.

Table 2 Data with repetitions


In statistics, the numbers β_i are referred to as weights or (absolute) frequencies, and they are used for the calculation of weighted means.

All our results obtained above, starting with the definition of pn-means, are extended to the described more general case. For this, we consider the dataset consisting of the repeating values x₁ (β₁ times), x₂ (β₂ times) and so on, i.e., we restate the data in Table 2 as follows:

$$ \left(\underset{\upbeta_1}{\underbrace{x_1,\dots, {x}_1}},\underset{\upbeta_2}{\underbrace{x_2,\dots, {x}_2}},\dots, \underset{\upbeta_n}{\underbrace{x_n,\dots, {x}_n}}\right). $$

The use of the described methods of construction of pn-means in this case may be computationally demanding, as the dimension of the problem becomes very large for large values of β_i. To overcome this problem, we may use decision rules developed in the theory of qualitative criteria importance measured on a continuous scale [18]. In this approach, we treat the integer numbers β_i as quantitative coefficients reflecting the importance of criteria and use the notation P^β and P^βΔ to denote the corresponding relations instead of P^E and P^EΔ.

To state the relevant decision rules for the vector estimates y and z, define the following set and values:

$$ \begin{aligned} W\left(y,z\right) & =\left\{{y}_1\right\}\cup \left\{{y}_2\right\}\cup \dots \cup \left\{{y}_n\right\}\cup \left\{{z}_1\right\}\cup \left\{{z}_2\right\}\cup \dots \cup \left\{{z}_n\right\} \\ & =\left\{{w}_1,{w}_2,\dots, {w}_q\right\},{w}_1>{w}_2>\dots >{w}_q; \end{aligned} $$

$$ {b}_k(y)={\sum}_{i:{y}_i\ge {w}_k}{\upbeta}_i,\kern0.5em {b}_k(z)={\sum}_{i:{z}_i\ge {w}_k}{\upbeta}_i,k=1,2,\dots, q-1; $$

$$ d_k(y)=\sum_{j=1}^k{b}_j(y)\left({w}_j-{w}_{j+1}\right),k=1,2,\dots, q-1. $$

Decision rule for P^β:

$$ yP^{\beta} z\iff {b}_k(y)\le {b}_k(z),k=1,2,\dots, q-1, $$

(4)

and at least one of these inequalities is strict.

Decision rule for P^βΔ:

$$ {yP}^{\beta \Delta}z\iff {d}_k(y)\le {d}_k(z),k=1,2,\dots, q-1, $$

(5)

and at least one of these inequalities is strict.

Example 14

Consider the dataset in Table 3.

Table 3 Data in Example 14

Using decision rules (4) and (5), let us compare points 5 and 3 which have the following vector estimates: y = f(5) = (4, 3, 1, 0, 2, 4, 6) and z = f(3) = (2, 1, 1, 2, 4, 6, 8). In this case, W = (8, 6, 4, 3, 2, 1, 0). Therefore, q = 7. We have:

b(y) = (b₁(y), b₂(y), …, b₆(y)) = (0, 1, 6, 7, 9, 13);
b(z) = (b₁(z), b₂(z), …, b₆(z)) = (1, 4, 6, 6, 9, 14);
d(y) = (d₁(y), d₂(y), …, d₆(y)) = (0, 2, 8, 15, 24, 37);
d(z) = (d₁(z), d₂(z), …, d₆(z)) = (2, 10, 16, 22, 31, 45).

Note that b₁(y) = 0 < b₁(z) = 1 but b₄(y) = 7 > b₄(z) = 6. According to (4), neither yP^βz nor zP^βy is true. However, because all 6 inequalities (5) are true and at least one of them is strict, we have yP^βΔz.
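The calculations of Example 14 can be reproduced with the following sketch of decision rules (4) and (5). The function names are hypothetical. The points and their frequencies are not stated explicitly in the text: here they are inferred from the vector estimates f(5) and f(3) above and from the ordered values used in Example 15, and should be read as an assumption.

```python
def level_set(y, z):
    """W(y, z): distinct components of y and z, w_1 > w_2 > ... > w_q."""
    return sorted(set(y) | set(z), reverse=True)

def b_vector(y, weights, w):
    """b_k(y): total weight of the components y_i >= w_k, k = 1..q-1."""
    return [sum(bi for yi, bi in zip(y, weights) if yi >= wk) for wk in w[:-1]]

def d_vector(y, weights, w):
    """d_k(y) = sum over j <= k of b_j(y) * (w_j - w_{j+1}), k = 1..q-1."""
    d, total = [], 0
    for j, bj in enumerate(b_vector(y, weights, w)):
        total += bj * (w[j] - w[j + 1])
        d.append(total)
    return d

def dominates(u, v):
    """Componentwise u <= v with at least one strict inequality."""
    return u != v and all(ui <= vi for ui, vi in zip(u, v))

points = [1, 2, 4, 5, 7, 9, 11]    # assumption: inferred from f(5) and f(3)
weights = [2, 1, 4, 1, 2, 3, 1]    # assumption: inferred from Example 15
y = [abs(5 - x) for x in points]   # vector estimate of the point 5
z = [abs(3 - x) for x in points]   # vector estimate of the point 3
w = level_set(y, z)

print(b_vector(y, weights, w), b_vector(z, weights, w))
print(d_vector(y, weights, w), d_vector(z, weights, w))
# Rule (4): neither vector dominates; rule (5): y dominates z
print(dominates(b_vector(y, weights, w), b_vector(z, weights, w)))
print(dominates(d_vector(y, weights, w), d_vector(z, weights, w)))
```

The cost of a comparison depends on the number q of distinct distance values rather than on the sum of the frequencies, which is the reason for preferring these rules to an explicit expansion of the repeated data.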

Note that formula (3) is easier to use if we first rearrange data with repetitions in the form (1).

Example 15

Consider the data from Table 3 of Example 14. We can rearrange these data as in Table 4 in which we specify the ordinal number i for each point and the corresponding value x_(i).

Table 4 Data for Example 15

Using formulae (3), we consecutively calculate:

$$ \begin{aligned} \alpha &=\frac{1}{2}\ \min \left\{{x}_{(1)}+{x}_{(14)},{x}_{(2)}+{x}_{(13)},{x}_{(3)}+{x}_{(12)},{x}_{(4)}+{x}_{(11)},{x}_{(5)}+{x}_{(10)},{x}_{(6)}+{x}_{(9)},{x}_{(7)}+{x}_{(8)}\right\}\\ &=\frac{1}{2}\ \min \left\{1+11,1+9,2+9,4+9,4+7,4+7,4+5\right\}\\ &=\frac{1}{2}\ \min \left\{12,10,11,13,11,11,9\right\}=4.5; \end{aligned} $$

$$ \begin{aligned} \beta &=\frac{1}{2}\ \max \left\{{x}_{(1)}+{x}_{(14)},{x}_{(2)}+{x}_{(13)},{x}_{(3)}+{x}_{(12)},{x}_{(4)}+{x}_{(11)},{x}_{(5)}+{x}_{(10)},{x}_{(6)}+{x}_{(9)},{x}_{(7)}+{x}_{(8)}\right\}\\ &=\frac{1}{2}\ \max \left\{12,10,11,13,11,11,9\right\}=6.5. \end{aligned} $$

Therefore, G^βΔ(X) = [4.5, 6.5].

9 Conclusion

In this paper, we introduced new notions of the means based on unifying ideas of multicriteria optimization. These notions do not require certain properties of the means, which are typically assumed by the conventional approaches in statistics and which can sometimes complicate the choice of a suitable mean in some problems [8]. Instead, our approach utilizes the distance from a current point to each point of the dataset. The proximity of a current point to all points in the dataset is characterized by the vector whose components are the distances between the current point and each point of the dataset. The means are defined as the points which are nondominated with respect to the preference relation among the vectors of distances characterized by scale properties, such as equal importance or ordinality, and/or transfer principles.

It turns out that such means are typically not unique and that their sets may have a complex structure. This potentially complicates the calculation of such means for large samples. However, the advances in computer and software technologies make this computational issue less problematic.

The suggested means allow two different interpretations, either as the range of possible mean values in some specific situations characterized by scale properties, or as whole sets that characterize the chosen sample.

Among the new means introduced in this paper, the means defined with respect to the relation P_EΔ should be of the most practical interest. The set G^EΔ(X) of such means has a simple structure (it is a segment [α, β]), and it is stable with respect to small perturbations of the dataset. Furthermore, there exists a simple exact method for the calculation of the set G^EΔ(X). Namely, we have suggested analytical formulae for the calculation of the endpoints α and β.

In applications, the comparison of different multi-valued means developed in our paper may be uninteresting because they usually turn out to be incomparable under the corresponding partial preference relation. However, in some problems, the described multi-valued approach has advantages over the use of known means (see, e.g., Example 9). If, instead of the set of pn-means, we consider their corresponding centres of mass, then such centroid means are uniquely defined. The latter are as operational as the conventional means but are less informative than the original pn-means. For example, instead of the mean G^EΔ(X) = [α, β], we may use the corresponding centroid mean (with respect to P_EΔ) γ = ½ (α + β).

The suggested new means are a useful complement to the range of conventional means used in statistics. Among further research avenues arising from our paper, we note the development of new pn-means under different assumptions about the properties of the scales of measurement, together with the corresponding computational methods.

References

1. Beliakov, G., Pradera, A., Calvo, T.: Aggregation Functions: A Guide for Practitioners. Springer, Berlin (2007)
2. Dalton, H.: The measurement of the inequality of incomes. Economic J. 30, 348–361 (1920)
3. Eurostat: Real GDP per capita. https://ec.europa.eu/eurostat/databrowser/view/sdg_08_10/default/table?lang=en (2020)
4. Fishburn, P.C.: Decision and Value Theory. Wiley, New York (1964)
5. Fishburn, P.C., Willig, R.D.: Transfer principles in income redistribution. J. Public Econom. 25, 323–328 (1984)
6. Foster, C.: Being mean about the mean. Math. Sch. 43, 32–33 (2014)
7. Grabisch, M., Marichal, J.-L., Mesiar, R., Pap, E.: Aggregation functions: means. Inf. Sci. 181, 1–22 (2011)
8. Gini, C.: Le Medie. Ulet, Torino (1957)
9. Knuth, D.E.: The Art of Computer Programming, Vol. 3: Sorting and Searching, 2nd edn. Addison-Wesley, New York (1998)
10. Kricheff, R.S.: Means move – analyze the averages. In: That Doesn’t Work Anymore – Retooling Investment Economics in the Age of Disruption, pp. 37–42. De Gruyter, Berlin (2018)
11. Levy, H.: Stochastic dominance and expected utility: survey and analysis. Manag. Sci. 38, 555–593 (1992)
12. Lewontin, R., Levins, R.: The politics of averages. Capital. Nat. Social. 11, 111–114 (2000)
13. Luce, R.D., Raiffa, H.: Games and Decisions. Wiley, New York (1957)
14. Marshall, A.W., Olkin, I.: Inequalities: Theory of Majorization and its Applications. Academic Press, New York (1979)
15. Mirkin, B.G.: Group Choice. Wiley, New York (1979)
16. Nelson, L.S.: Some notes on averages. J. Quality Technology 30, 100–101 (1998)
17. Podinovskii, V.V.: Multicriterial problems with uniform equivalent criteria. USSR Comp. Math. Math. Physics 15, 47–60 (1975)
18. Podinovski, V.V.: On the use of importance information in MCDA problems with criteria measured on the first ordered metric scale. J. Multi-Crit. Decis. Anal. 15, 163–174 (2009)
19. Podinovski, V.V., Nelyubin, A.P.: Mean quantities: a multicriteria approach. Control Sciences 5, 3–16 (2020) (In Russian)
20. Podinovski, V.V., Nelyubin, A.P.: Mean quantities: a multicriteria approach. II. Control Sciences 2, 33–41 (2021) (In Russian)
21. Roberts, F.S.: Measurement Theory: with Applications to Decisionmaking, Utility, and Social Sciences (Encyclopedia of Mathematics and its Applications). Cambridge University Press, Cambridge (1984)
22. Smith, M.J.: Statistical Analysis Handbook: A Comprehensive Handbook of Statistical Concepts, Techniques and Software Tools. The Winchelsea Press, Edinburgh (2018)
23. Stevens, S.S.: On the theory of scales of measurement. Science, New Series 103, 677–680 (1946)

Acknowledgments

This work is an output of a research project implemented as part of the Basic Research Program at the National Research University Higher School of Economics (HSE University).

Author information

Authors and Affiliations

National Research University Higher School of Economics, Moscow, Russian Federation
Vladislav V. Podinovski
Mechanical Engineering Research Institute RAS, Moscow, Russian Federation
Andrey P. Nelyubin

Corresponding author

Correspondence to Andrey P. Nelyubin.

Editor information

Editors and Affiliations

Department of Mathematics, New Uzbekistan University, Tashkent, Uzbekistan
Boris Goldengorin
Faculty of Computer Science, National Research University Higher School of Economics, Moscow, Russia
Sergei Kuznetsov

About this chapter

Cite this chapter

Podinovski, V.V., Nelyubin, A.P. (2023). Mean Values: A Multicriterial Analysis. In: Goldengorin, B., Kuznetsov, S. (eds) Data Analysis and Optimization. Springer Optimization and Its Applications, vol 202. Springer, Cham. https://doi.org/10.1007/978-3-031-31654-8_17

DOI: https://doi.org/10.1007/978-3-031-31654-8_17
Published: 24 September 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-31653-1
Online ISBN: 978-3-031-31654-8
eBook Packages: Mathematics and Statistics, Mathematics and Statistics (R0)
