Keywords

1 Introduction

Mean values are widely used in management, economics, sociology, engineering and other areas of theory and practice. In statistics (see, for example, [8, 22]), mean values are aggregate representations of the varying characteristics of a group of homogeneous objects. Mean values cancel out random variations of a particular characteristic and tend to represent the effect caused by the main factors affecting it. Mean values allow us to compare the levels of the same characteristic in different groups of objects and to investigate the causes of such differences.

It is known that it is impossible to define a universally applicable notion of the mean value which satisfies all desirable properties [1, 8]. Instead, different notions of the mean value are required for different problems and situations. However, in some applications, it may be unclear which of the known mean values should be used, and different means may point to different conclusions. Policy recommendations in such situations may become problematic [6, 8, 10, 12, 16].

Grabisch et al. [7] regarded mean values as idempotent aggregation functions and concluded that the class of such functions “is huge, making the problem of choosing the right function (or family) for a given application a difficult one”.

In this paper, we consider new approaches to the definition of the mean value based on the ideas and methods of multicriteria optimization. Such means turn out to be multi-valued, i.e., represented by sets of points. These allow two interpretations, either as the range of possible mean values in some specific situations (characterized by scale properties, such as equal importance or ordinality, and/or transfer principles), or as whole sets for the given sample.

2 Definition of Mean Values as Nondominated Points

Let X be the set of real numbers consisting of at least n ≥ 2 elements referred to as data or points. These elements are typically obtained as a result of measurement of some characteristic:

$$ X=\left\{{x}_1,{x}_2,\dots, {x}_n\right\}. $$
(1)

These data are assumed homogeneous in the sense that they are obtained by utilizing the same scale of measurement [21, 23]. We assume that the data (1) are quantitative, i.e., the measurement is performed either on the interval scale or on the ratio scale [15].

The elements of the set (1) can be ranked in the non-decreasing and non-increasing order.

$$ {X}_{\uparrow }=<{x}_{(1)},{x}_{(2)},\dots, {x}_{(n)}>;{X}_{\downarrow }=<{x}_{\left[1\right]},{x}_{\left[2\right]},\dots, {x}_{\left[n\right]}>, $$
(2)

where x(1) ≤ x(2) ≤ … ≤ x(n) and x[1] ≥ x[2] ≥ … ≥ x[n]. In statistics, the set (1) is typically referred to as a sample and its non-decreasing sequence X as a variational series.

Let х be an arbitrary fixed number (a point in Re). Its distance from any point xi from X is given by yi = |x - xi|. Then the distance from х to the dataset X can be characterized by the vector y = (y1, y2, …, yn). We can view this vector as the value of the vector criterion f(x) = (f1(x), f2(x), …, fn(x)), where fi(x) = |x - xi|, which is an element of the nonnegative quadrant \( {\operatorname{Re}}_{+}^n={\left[0,+\infty \right)}^n \).

Let PΓ be a preference relation (strict partial order) on \( {\operatorname{Re}}_{+}^n \), where Γ is information about the preferences with respect to distance: if yPΓy′, where y = f(x) and y′ = f(x′), then the point x is closer to the dataset X than x′.

The relation PΓ generates the corresponding relation PΓ on the numeric axis Re: xPΓx′ ⇔ f(x)PΓf(x′).

Therefore, any candidate that we may choose as the closest to X and representing the set X must be nondominated under PΓ. If the set GΓ(X) of nondominated under PΓ points is externally stable, we refer to all such points as pn-means (principal new means) and, more specifically (reflecting the information Γ), as the means with respect to PΓ.

If there is no further information about the preferences of the DM on \( {\operatorname{Re}}_{+}^n \), we obtain the Pareto relation P defined as follows:

$$ {yP}^{\varnothing }z\iff {y}_i\le {z}_i,i=1,2,\dots, n;y\ne z. $$

Relation P generates the Pareto relation P on Re: xPx′ ⇔ f(x)Pf(x′).

Theorem 1

The set of all means of the dataset (1) with respect to P is the segment G(X) = \( \overline{X} \) = [x(1), x(n)], where x(1) = miniN xi and x(n) = maxiN xi, N = {1, 2, …,n}. This set is externally stable.

Therefore, the notion of the means with respect to P is equivalent to the means in the sense of Cauchy.

Proofs of this and the following theorems can be found in [19, 20].

Let us note that, if the function φ is increasing on Re+, then changing the original criteria fi(x) = |x - xi| by φ(fi(x)) does not change the set G(X). For example, one can use “smooth” criteria fi(x) = (x - xi)2. Therefore, the original use of formula fi(x) = |x - xi| as a measure of distance is not essential and is not a limiting assumption for the suggested approach.

3 Mean Values for Equally Important Criteria

In this section, we assume that all criteria are equally important [17] and denote this information E. In this case, the distance from the point x to the dataset X is represented by the preference relation PE on Re, which is defined by the two equivalent decision rules [17], where fi(x) = |x - xi|:

$$ {xP}_E{x}^{\prime}\iff \left({f}_{(1)}(x)\le {f}_{(1)}\left({x}^{\prime}\right),{f}_{(2)}(x)\le {f}_{(2)}\left({x}^{\prime}\right),\dots, {f}_{(n)}(x)\le {f}_{(n)}\left({x}^{\prime}\right)\right), $$

and at least one of these inequalities is strict;

$$ {xP}_E{x}^{\prime}\iff \left({f}_{\left[1\right]}(x)\le {f}_{\left[1\right]}\left({x}^{\prime}\right),{f}_{\left[2\right]}(x)\le {f}_{\left[2\right]}\left({x}^{\prime}\right),\dots, {f}_{\left[n\right]}(x)\le {f}_{\left[n\right]}\left({x}^{\prime}\right)\right), $$

and at least one of these inequalities is strict.

In this case, the pn-means (with respect to PE) (elements of the set GE(X)) are the points on the numerical axis which are nondominated under PE.

Theorem 2

We have GE(X) ⊆ G(X) = \( \overline{X} \), and the set GE(X) is externally stable.

Note that, if the function φ is increasing on Re+, then changing the original criteria fi(x) to criteria φ(fi(x)) does not change the relation PE and the set GE(X).

Let us consider examples of sets GE(X) constructed according to the methods described in Sect. 5.

Example 1

Let n = 3 and X = {1, 2, 5}. In this example, GE(X) = [1.5, 3].

In the above example, the set GE(X) is a single line segment. However, for large n, this set may be the union of several segments, excluding their endpoints.

Example 2

For n = 6 and different sets X, we have:

$$ {G}^E\left(\left\{10,11,15,61,107,110\right\}\right)=\left[10.5,83\right)\cup \left(83.5,85\right)\cup \left(106.5,108\right);\vspace*{-2pc} $$
$$ {G}^E\left(\left\{10,11,40,55,70,110\right\}\right)=\left[10.5,18\right)\cup \left(18;67.5\right)\cup \left(68,75\right); \vspace*{-2pc} $$
$$ {G}^E\left(\left\{10,57,61,64,109,110\right\}\right)=\left(56.5,57.5\right)\cup \left(58.5,88.5\right)\cup \left(108,109.5\right]. $$

Examples 1 and 2 also illustrate the following result.

Theorem 3

Let the distance between two adjacent elements x(i) and x(i + 1) of the variational series (2) be the smallest among all other pairs of adjacent elements of this series, and let these two elements be uniquely defined. Then the midpoint xc = ½(x(i) + x(i + 1)) is an element of GE(X). Moreover, if x(i) is x(1) or if x(i + 1) is x(n), then xc is the left or, respectively, right, endpoint of the set GE(X).

If x(1) ≠ x(n) and (x(1), x(n)) ⊄ GE(X), for some values of parameter s, the power mean.

$$ {g}^s(X)={\left(\frac{1}{n}\sum \limits_{i=1}^n{\left({x}_i\right)}^s\right)}^{1\backslash s},s\ne 0. $$

is not the mean with respect to PE. This is because, as s increases on Re, the function gs(X), extended to preserve continuity, passes through all values from the interval (x(1), x(n)) [8]. However, we have the following result:

Theorem 4

The arithmetic mean is a mean with respect to PE, i.e., g1(X)∈GE(X).

Example 3

According to Example 2, for X = {10, 57, 61, 64, 109, 110} we have: GE(X) = (56.5, 57.5) ∪ (58.5, 88.5) ∪ (108, 109.5]. In this example, the geometric mean g0(X) = 54.66∉GE(X) and harmonic mean g−1(X) = 35.75∉GE(X), and g1(X) = 68.5∈GE(X). In Example 1, for X = {1, 2, 5}, we have GE(X) = [1.5, 3]. Here, the quadratic mean g2(X) = 3.162∉GE(X), but g1(X) = 2.67∈GE(X).

Theorem 5

The median is a mean with respect to PE, i.e., if n is an odd integer and the median is unique, we have \( \mu (X)={x}_{\left(\frac{n+1}{2}\right)}\in {G}^E(X) \). If n is an even number, the median is not unique and we have \( \mu (X)=\left[{x}_{\left(\frac{n}{2}\right)},{x}_{\left(\frac{n}{2}+1\right)}\right]\subseteq {G}^E(X) \).

Examples 1 and 2 provide illustrations to the above theorem.

It should be noted the following peculiarity of the means with respect to PE: if the points xiX and xjX, xi < xj, are included in GE(X), then the point xkX, such that xi < xk < xj, may not belong to GE(X)!

Example 4

For n = 7 and different sets X, we have:

$$ {X}^{\prime }=\left\{1,2,3,6,8,9,11\right\},{G}^E\left({X}^{\prime}\right)=\left[2;3\right)\cup \left(3;8.5\right]; \vspace*{-1.5pc} $$
$$ {X}^{{\prime\prime} }=\left\{1,2,3,7,8,10,11\right\},{G}^E\left({X}^{{\prime\prime}}\right)=\left[2;3\right)\cup \left(3.5;9\right]. $$

Here x(2), x(4)GE(X), whereas x(3)GE(X) for both sets X = X′ and X = X″. Moreover, the point x(3) = 3 is a punctured point of the set GE(X′) (it is dominated under PE by the point x(5) = 8, and in its arbitrarily small neighborhood there are points nondominated under PE).

This feature clearly violates the very principle of constructing means as points closest to points from X, and is not consistent with the intuitive concept of a mean value. Therefore, the presence of this feature can be considered as a paradox of means with respect to PE.

4 Mean Values for Equally Important Criteria Measured on the First Ordered Metric Scale

Let y be any vector estimate such that yi > yj. Consider any δ > 0 such that yiδ ≥ yj + δ. Define the vector estimate z by replacing component yi by yiδ and yj by yj + δ, but yiδ ≥ yj + δ. Moving from y to z reduces the larger deviation yi from one point in the sample and increases a smaller deviation yj from a different point, by the same amounts δ. The resulting set of distances becomes closer to the ideal set of minimally possible equal deviations. Assume that, for any y and δ described above, the vector estimate z is preferred to the original vector estimate y, in the sense that z is “closer” to X than y and is therefore more suitable for the definition of the mean. Denote Δ the information about the described principle. Such approach is an analogue of Pigou-Dalton’s principle of transfer for income distribution [2, 5]. This means that the equally important criteria have a common first ordered metric scale [4]. The preference relation PEΔ, generated on Ren by the joint information E and Δ, is defined by the following decision rule [14, 18]:

$$ {\displaystyle \begin{array}{l}{xP}_{E\Delta}{x}^{\prime}\iff {f}_{\left[1\right]}(x)\le {f}_{\left[1\right]}\left({x}^{\prime}\right),{f}_{\left[1\right]}(x)+{f}_{\left[2\right]}(x)\le {f}_{\left[1\right]}\left({x}^{\prime}\right)+{f}_{\left[2\right]}\left({x}^{\prime}\right),\dots \\ {}\kern3.919997em \dots {f}_{\left[1\right]}(x)+{f}_{\left[2\right]}(x)+\dots +{f}_{\left[n\right]}(x)\!\le\! {f}_{\left[1\right]}\left({x}^{\prime}\right){+}{f}_{\left[2\right]}\left({x}^{\prime}\right){+}\dots {f}_{\left[n\right]}\left({x}^{\prime}\right),\end{array}} $$

and at least one of these inequalities is strict. In this case, the pn-means are the points that are nondominated under PEΔ. Because PEΔ ⊃ PE, we have GE(X) ⊇ GEΔ(X).

Theorem 6

The arithmetic mean is a mean with respect to PEΔ, i.e., g1(X)∈GEΔ(X).

Theorem 7

If n is odd, the median (which is uniquely defined), is a mean with respect to PEΔ, i.e., μ(X) ∈ GEΔ(X). If n is even and the median is not uniquely defined, we only have μ(X) ∩ GEΔ(X) ≠ ∅.

Example 5

If n = 5 and X = {1, 2, 3, 5, 11}, we have GEΔ(X) = [3, 6], μ(X) = 3 and g1(X) = 4.4. If n = 4 and X = {10, 11, 12, 110}, we have GEΔ(X) = [11.5, 60], μ(X) = [11, 12] and g1(X) = 35.75. If X = {10, 11, 20, 110}, we have GEΔ(X) = [15.5, 60], μ(X) = [11, 20] and g1(X) = 37.75.

Let us define the set H = {1, 2, …, h}, where h = ⌊(n + 1)/2⌋ is the integer part of (n + 1)/2.

Theorem 8

The set GEΔ(X) is externally stable and coincides with the segment [α, β], where

$$ \alpha =\frac{1}{2}{\min}_{p\in H}\left({x}_{(p)}+{x}_{\left(n+1-p\right)}\right),\beta =\frac{1}{2}{\max}_{p\in H}\left({x}_{(p)}+{x}_{\left(n+1-p\right)}\right) $$
(3)

Example 6

For n = 5, we have h = ⌊(n + 1)/2⌋ = 3 and H = {1, 2, 3}. For X = {1, 2, 7, 8, 11}, using Theorem 8, we have:

$$ \alpha =\frac{1}{2}\ \min \left\{{x}_{(1)}+{x}_{(5)},{x}_{(2)}+{x}_{(4)},{x}_{(3)}+{x}_{(3)}\right\}=\frac{1}{2}\ \min \left\{1+11,2+8,7+7\right\}\\ =\frac{1}{2}\ \min \left\{12,10,14\right\}=5; $$
$$ \beta =\frac{1}{2}\ \max \left\{{x}_{(1)}+{x}_{(5)},{x}_{(2)}+{x}_{(4)},{x}_{(3)}+{x}_{(3)}\right\}=\frac{1}{2}\ \max \left\{12,10,14\right\}=7; $$

Therefore, GEΔ(X) = [α, β] = [5, 7].

5 On the Construction of Sets of Mean Values

For the construction of the set GE(X), we can use known methods of multicriteria optimization developed for the construction of the sets of nondominated variants [17]. Such methods utilize families of functions that are increasing (decreasing), or at least non-decreasing (non-increasing) with respect to PE. For example, we can solve a parametric program which minimizes the function of single variable ψ(f(x)|c) = minπ∈Π maxiN {fπ(i)(x) – ci} on the set X, by varying the vector parameter \( c\in f\left(\overline{X}\right) \). However, even if n is not very large, the number n! of terms of this function (with respect to which the maximization is performed) turns out unacceptably large.

Taking into account that the set X is one-dimensional, we can utilize a different approach. Namely, we can consider a dense grid with the small step h which covers the set X, and identify the nondominated (with respect to PE) points of this grid by simple enumeration [9]. The step h depends on the required precision and can decrease in the process of calculations of the set GE(X). We used this approach for the construction of the set GE(X) in Examples 1 и 2.

Example 7

Let us demonstrate the construction of the set GE(X) for X = {1, 2, 5, 9, 11}. Using computer for the calculations, while reducing the step length h, we obtain the following results:

h = 1:

[2, 7] ∪ [9, 9].

h = 0.1:

[1.5, 7.4] ∪ [8.6, 9.4].

h = 0.01:

[1.50, 7.49] ∪ [8.51, 9.49].

h = 0.001:

[1.500, 7.499] ∪ [8.501, 9.499].

h = 0.0001:

[1.5000, 7.4999] ∪ [8.5001, 9.4999]

Using the enumeration approach with h = 0.01, we found out that the point 4.5 dominates the points 7.5 and 8.5. Similarly, the point 2.5 dominates the point 9.5. Therefore, by Theorem 2, we have GE(X) = [1.5, 7.5) ∪ (8.5, 9.5).

Let us highlight another result that may be useful in the construction of the set GE(X).

Theorem 9

Let vector estimates of all xX be located at the points of some uniform grid covering X. Then, in order to test if any grid point is a mean with respect to PE, it suffices to compare its vector estimate only with the vector estimates of all the other points of the grid.

Let us note that the uniform grid required by the conditions of Theorem 9 can always be constructed if all points in X are rational numbers. In practical applications, these would typically be integer numbers or decimal fractions.

It is worth noting that it is easier to construct the set of means GEΔ(X) than the set GE(X). According to Theorem 8, the set GEΔ(X) is easily found by calculating the endpoints α and β of the segment [α, β] using formulae (3) – see Example 6.

6 On Comparing Multi-valued Means

In practice, it is important that we can compare the mean values measured on the same scale. For the means that are uniquely defined, this is a simple task of comparing the two numerical values. In the case of multi-valued means, in statistics, it is common to substitute such means by a single number, e.g., in the case of a median when n is an even number.

The set GΓ(X) consists of l intervals with the endpoints x1, x2; x3, x4; …; x2l-1, x2l, and these intervals do not intersect with each other. Define the length DΓ(X) of the set GΓ(X) as the sum of the lengths of all these intervals: \( {D}^{\Gamma}(X)=\sum_{k=1}^l\left|{x}^{2k}-{x}^{2k-1}\right| \). Furthermore, define \( {D}_x^{\Gamma}(X) \) the length of the part of the set GΓ(X) that is located to the right of the point x. It includes the (part) of one interval and all the other intervals located to the right of x. The relative length \( {d}_x^{\Gamma}(X) \) is defined as the ratio \( {d}_x^{\Gamma}(X) \) = \( {D}_x^{\Gamma}(X) \): DΓ(X).

Because none of the points of the set GΓ(X) has any advantages (in the sense of representing the sample) compared to its other points, any of them may be regarded as an equally valid candidate for the choice of the mean. This is analogous to the principle of insufficient reason for decision making under ignorance [13]. Using first-order stochastic dominance [11], we say that the mean GΓ(X′) is not less than the mean GΓ(X″) and state this as GΓ(X′)) ≿ GΓ(X″), if \( {d}_x^{\Gamma}\left({X}^{\prime}\right) \)\( {d}_x^{\Gamma}\left({X}^{{\prime\prime}}\right) \) for each x∈Re. If the latter inequality is strict for at least one x∈Re, the former mean is greater than the latter. This relationship between the means (“is not less than”) is a partial quasi-order. The corresponding relation “is greater than” is denoted ≻ and is a partial strict order (it is irreflexive and transitive). This strict relation is essentially a probabilistic dominance relation, or a strict first-order stochastic dominance relation [11]. Note that we have \( {d}_x^{\varGamma }(X)=1-F(x) \), where F(x) is the cumulative distribution function corresponding to the uniform distribution with the density equal to 1 / DΓ(X) on GΓ(X) and equal to zero outside GΓ(X).

It is clear that the relation ≽ is weak in the sense that it would typically not result in a definitive comparison of the means. Relation ≽ can be extended using the ideas of second-order stochastic dominance, but this approach does not appear to be sufficiently effective in practice either.

Another approach would be to “compress” the means that are not uniquely defined to single-valued means. However, this would lead to a loss of information, and the results of comparison would be approximate. For example, let the mean GΓ(X) consist of several not intersecting intervals defined by the endpoints x1, x2; x3, x4; …; x2l-1, x2l. We can represent this mean by its the centre of mass xΓ(X) and refer to it as the centroid mean.

Example 8

Let GE(X′) = [1, 2) ∪ (5, 8) and GE(X″) = [1.5, 4.5] ∪ (8, 9]. We have:

$$ {x}^E\left({X}^{\prime}\right)=\left(1.5\cdot 1+6.5\cdot 3\right)/4=5.25;{x}^E\left({X}^{{\prime\prime}}\right)=\left(3\cdot 3+8.5\cdot 1\right)/4=4.375. $$

Because 5.25 > 4.375, we can accept that the mean GE(X′) is greater than GE(X″).

It is useful to note that, if GΓ(X′) ≻ GΓ(X″), then xΓ(X′) > xΓ(X″) [11].

It is worth noting that it easier to compare the means GEΔ(X′) and GEΔ(X″) than the means GE(X′) and GE(X″), because the former are the segments [α′, β′] and [α″, β″] respectively. Because the graph of the function \( {d}_x^{E\Delta}(X) \) is a broken line consisting of the single segment [α, β] on which it decreases from 1 to 0, GEΔ(X′) GEΔ(X″) is true if and only if α′ ≥ α″ and β′ ≥ β″.

For the simplified application of the mean GEΔ, we can represent the segment [α, β] by its midpoint γ = ½ (α + β), which can be referred to as the centroid mean (with respect to PEΔ).

Example 9

The means of the real GDP per capita in Europe calculated based on the data from Eurostat [3] are shown in Table 1 and Fig. 1.

Table 1 Mean real GDP per capita in Europe (in Euro)
Fig. 1
A multi-line chart represents the means of the G D P per capita in 1000 euro from 2012 to 2019. It has 5 lines for alpha, Mu, m, gamma, and beta, from bottom to top. The line beta dips in 2017, and the line mu is decreasing until 2014. All other lines increase.

Means of the GDP per capita in Europe

Table 1 shows that the GDP per capita m is increasing in the period from 2012 to 2019, but the median GDP per capita μ is decreasing until 2014 and is increasing afterwards. Therefore, it is impossible to make a definite conclusion about the GDP growth in the given period. However, the mean GEΔ (defined by its boundaries α and β) is increasing (there is only an insignificant decrease of β in 2017), and the condensed mean γ is increasing over the whole period. This observation supports the conclusion that the GDP per capita in Europe has been increasing in the given period.

7 On Stability of Pn-Means

The question of stability of the means with respect to small perturbations of the data (1) is important from both theoretical and practical points of view.

Because G(X) = \( \overline{X} \) = [x(1); x(n)], a small change of the values xi may lead only to small changes of x(1) and x(n). Therefore, the set of the means G(X) is stable.

However, the mean with respect to PE may not be stable in the sense that a very small perturbation of a single point in X may lead to a noticeable change of the set GE(X). The following examples illustrate this possibility.

Example 10

For X = {1, 2, 3}, we have GE(X) = [1.5, 2.5]. However, for Xε = {1, 2 − ε, 3}, where ε > 0 is very small, we have GE(Xε) = [1.5 – 0.5ε, 2]. The right endpoint of the set of the means with respect to PE has changed by 0.5.

Example 11

For X = {10, 25, 40, 110}, we have GE(X) = [25, 60]. However, for Xε = {10, 25, 40 + ε, 110}, where ε > 0 is very small, we have GE(Xε) = [17.5, 60]. It is interesting that, although only one point in X has increased by a very small ε, the left endpoint of the set of the means (with respect to PE) has decreased by 7.5.

Let us now consider the issue of stability of the mean with respect to PEΔ.

Example 12

In the setting of Example 10, we have GEΔ(X) = {2} and GEΔ(Xε) = [2 − ε, 2]. Here, a change of one of the data points in X by ε leads to the change of one of the endpoints of the set of the means with respect to PEΔ by the same ε.

Example 13

Under the conditions of Example 11, we have GEΔ(X) = [32.5, 60] и GEΔ(Xε) = [32.5 + 0.5ε, 60]. In this example, a change of one of the data points in X by ε results in the change of one of the endpoints for the set of the means by 0.5ε.

Consider the general case. Suppose that the dataset X stated by (1) has changed to the set Xε = {x1 + ε1, x2 + ε2, …, xn + εn}, where ε1, ε2, …, εn are arbitrary numbers.

Theorem 10

The mean with respect to PEΔ is stable in the following sense: If X is changed to Xε, the endpoints of the set of the means GEΔ(X) = [α, β] do not change by more than the following value:

max{|ε1|, |ε2|, …, |εn|}.

Therefore, the means with respect to P and PEΔ are stable with respect to small perturbations of the dataset (1), while the means with respect to PE may be noticeably unstable.

8 The Case of Data with Repetitions

Assume that the dataset allows repetitions, i.e., the point x1 occurs β1 times, x2 occurs β2 times, …, xn occurs βn times. In this case, the dataset (1) is replaced by Table 2.

Table 2 Data with repetitions

In statistics, the numbers βi are referred to as weights or (absolute) frequencies, and they are used for the calculation of weighted means.

All our results obtained above, starting with the definition of pn-means, are extended to the described more general case. For this, we consider the dataset consisting of the repeating values x1 (β1 times), x2 (β2 times) and so on, i.e., we restate the data in Table 2 as follows:

$$ \left(\underset{\upbeta_1}{\underbrace{x_1,\dots, {x}_1}},\underset{\upbeta_2}{\underbrace{x_2,\dots, {x}_2}},\dots, \underset{\upbeta_n}{\underbrace{x_n,\dots, {x}_n}}\right). $$

The use of the described methods of construction of pn-means in this case may be computationally demanding as the dimension of the problem becomes very large for large values βi. To overcome this problem, we may use decision rules developed in theory of qualitative criteria importance measured on continuous scale [18]. In this approach, we treat the integer numbers βi as quantitative coefficients reflecting the importance of criteria and use notation Pβ and PβΔ to denote the corresponding relations instead of PE and PEΔ.

To state the relevant decision rules for the vector estimates y and z, define the following set and values:

$$ \begin{aligned} W\left(y,z\right) & =\left\{{y}_1\right\}\cup \left\{{y}_2\right\}\cup \dots \cup \left\{{y}_m\right\}\cup \left\{{z}_1\right\}\cup \left\{{z}_2\right\}\cup \dots \cup \left\{{z}_m\right\} \\ & =\left\{{w}_1,{w}_2,\dots, {w}_q\right\},{w}_1>{w}_2>\dots >{w}_q; \end{aligned} $$
$$ \begin{aligned} & ={\sum}_{i:{y}_i\ge {w}_k}{\upbeta}_i,\kern0.5em {b}_k(z)={\sum}_{i:{z}_i\ge {w}_k}{\upbeta}_i\ {b}_k(y)={\sum}_{i:{y}_i\ge {w}_k}{\upbeta}_i\\ & = {\sum}_{i:{z}_i\ge {w}_k}{\upbeta}_i,k=1,2,\dots, q-1; \end{aligned} $$
$$ d_k(y)=\sum_{j=1}^k{b}_j(y)\left({w}_j-{w}_{j+1}\right),k=1,2,\dots, q-1. $$

Decision rule for Pβ:

$$ yP^{\beta} z\iff {b}_k(y)\le {b}_k(z),k=1,2,\dots, q-1, $$
(4)

and at least one of these inequalities is strict.

Decision rule for PβΔ:

$$ {yP}^{\beta \Delta}z\iff {d}_k(y)\le {d}_k(z),k=1,2,\dots, q\hbox{--} 1, $$
(5)

and at least one of these inequalities is strict.

Example 14

Consider the dataset in Table 3.

Table. 3 Data in Example 14

Using decision rules (4) and (5), let us compare points 5 and 3 which have the following vector estimates: y = f(5) = (4, 3, 1, 0, 2, 4, 6) and z = f(3) = (2, 1, 1, 2, 4, 6, 8). In this case, W = (8, 6, 4, 3, 2, 1, 0). Therefore, q = 7. We have:

  • b(y) = (b1(y), b2(y), …, b6(y)) = (0, 1, 6, 7, 9, 13);

  • b(z) = (b1(z), b2(z), …, b6(z)) = (1, 4, 6, 6, 9, 14);

  • d(y) = (d1(y), d2(y), …, d6(y)) = (0, 2, 8, 15, 24, 37);

  • d(z) = (d1(z), d2(z), …, d6(z)) = (2, 10, 16, 22, 31, 45).

Note that b1(y) = 0 < b1(z) = 1 but b4(y) = 7 > b1(z) = 6. According to (4), neither yPβz nor zPβy is true. However, because all 6 inequalities (5) are true and at least one of them is strict, we have yPβΔz.

Note that formula (3) is easier to use if we first rearrange data with repetitions in the form (1).

Example 15

Consider the data from Table 3 of Example 14. We can rearrange these data as in Table 4 in which we specify the ordinal number i for each point and the corresponding value x(i).

Table. 4 Data for Example 15

Using formulae (3), we consecutively calculate:

$$ {\displaystyle \begin{array}{l}\alpha =\frac{1}{2}\ \min \left\{{x}_{(1)}+{x}_{(14)},{x}_{(2)}+{x}_{(13)},{x}_{(3)}+{x}_{(12)},{x}_{(4)}+{x}_{(11)},{x}_{(5)}+{x}_{(10)},\right. \\ \left. \qquad \qquad \qquad {x}_{(6)}+{x}_{(9)},{x}_{(7)}+{x}_{(8)}\right\}=\\ {}\ \ \ =\frac{1}{2}\ \min \left\{1+11,1+9,2+9,4+9,4+7,4+7,4+5\right\}\\ {}\ \ \ =\frac{1}{2}\ \min \left\{12,10,11,13,11,9\right\}=4.5;\end{array}} \vspace*{-1pc} $$
$$ {\displaystyle \begin{array}{l}\beta =\frac{1}{2}\ \max \left\{{x}_{(1)}+{x}_{(14)},{x}_{(2)}+{x}_{(13)},{x}_{(3)}+{x}_{(12)},{x}_{(4)}+{x}_{(11)},{x}_{(5)}+{x}_{(10)}, \right. \\ {} \qquad \qquad \qquad \left. {x}_{(6)}+{x}_{(9)},{x}_{(7)}+{x}_{(8)}\right\}=\\ {}\kern0.96em =\frac{1}{2}\ \max \left\{12,10,11,13,11,11,9\right\}=6.5.\end{array}} $$

Therefore, GβΔ(X) = [4.5, 6.5].

9 Conclusion

In this paper, we introduced new notions of the means based on unifying ideas of multicriteria optimization. These notions do not require certain properties of the means, which are typically assumed by the conventional approaches in statistics and which can sometimes complicate the choice of a suitable mean in some problems [8]. Instead, our approach utilizes the distance from a current point to each point of the dataset. The proximity from a current point to all points in the dataset is characterized by the vector components of which are the distances between the current point and each point of the dataset. The means are defined as the points which are nondominated with respect to the preference relation among the vectors of distances characterized by scale properties, such as equal importance or ordinality, and/or transfer principles.

It turns out that such means are typically not unique and that their sets may have a complex structure. This potentially complicates the calculation of such means for large samples. However, the advances in computer and software technologies make this computational issue less problematic.

The suggested means allow two different interpretations, either as the range of possible mean values in some specific situations characterized by scale properties, or as whole sets that characterize the chosen sample.

Among the new means introduced in this paper, the means defined with respect to relation PEΔ should be of the most practical interest. The set GEΔ(X) of such means has a simple structure (it is a segment [α, β]), and it is stable with respect to small perturbations of the dataset. Furthermore, there exists a simple exact method for the calculation of the set GEΔ(X). Namely, we have suggested analytical formulae for the calculation of the endpoints α и β.

In applications, the comparison of different multi-valued means developed in our paper may be uninteresting because they usually turn out to be incomparable under the corresponding partial preference relation. However, in some problems, the described multi-valued approach has advantages over the use of known means (see, e.g., Example 9). If, instead of the set of pn-means, we consider their corresponding centres of mass, then such centroid means are uniquely defined. The latter are equally operational as the conventional means and but are less informative than the original pn-means. For example, instead of the mean GEΔ(X) = [α, β], we may use the corresponding centroid mean (with respect to PEΔ) γ = ½ (α + β).

The suggested new means are a useful complement to the range of conventional means used in statistics. Among further research avenues arising from our paper, let us note development of new pn-means under different assumptions about the properties of the scales of measurement and corresponding computational methods.