1 Introduction

Multiple criteria decision aiding (MCDA) [for an exhaustive collection of state-of-the-art surveys see Figueira et al. (2005)] concerns a set of alternatives \(A=\left\{ a,b,c,\ldots \right\} \) evaluated on a set of m criteria \(G=\left\{ g_{1},\ldots ,g_m\right\} \), and deal with three main types of problems: ranking, sorting and choice taking into account preferences of a Decision Maker (DM). To handle these problems, three different approaches are commonly used:

  • assigning to each alternative a value through a value function, i.e., a real number reflecting its degree of desirability,

  • comparing the evaluations of alternatives on the considered criteria using a binary relation,

  • using a set of “if..., then...” decision rules induced from the DM’s preference information.

In the first case, the multi-attribute value theory (MAVT) (Keeney and Raiffa 1993) is most frequently used; MAVT provides a methodology for building a value function. In the second case, the most popular are the methods which build an outranking relation on the set of alternatives to compare them pairwise (Roy 1996). In the third case, the decision rules are derived from the DM’s preference information structured by the dominance-based rough set approach (DRSA) (Greco et al. 2001; Słowiński et al. 2009).

In the context of MAVT, one often uses additive value functions, that is functions obtained by adding up marginal value functions representing the degree of preference on the corresponding evaluation criteria. In order to use this approach, one needs to construct marginal value functions for all considered criteria. The construction requires some preference information elicited by the DM. An analyst can obtain it in one of two ways: asking the DM to provide this information directly, or indirectly. As direct definition of marginal value functions requires too big cognitive effort from the DM, indirect elicitation of preference information has been proposed and widely used in MCDA (see, e.g., Jacquet-Lagreze and Siskos 2001). Indirect preference information is expressed by the DM in terms of decision examples, e.g., holistic pairwise comparisons of some reference alternatives. When looking for an additive value function which is compatible with the decision examples provided by the DM, i.e., which reproduces the DM’s decisions, one can find, in general, many compatible instances of such value function, and each of these instances can give a different recommendation in the considered decision context. For this reason, robust ordinal regression (ROR) (Greco et al. 2008) was proposed (for a survey on ROR see Corrente et al. 2013), that takes into consideration all compatible instances of the value function simultaneously. In the context of ROR, possible and necessary preference relations are built for each pair of alternatives a and b, such that the first one is true if a is at least as good as b for at least one compatible instance of the value function, and the second is true when a is at least as good as b for all compatible instances of the value function.

In this paper, we extend ROR on a new important issue: imprecise evaluations of alternatives. In many real world problems, alternatives are imprecisely evaluated on the considered criteria; this is due to several reasons: inexact definition of criteria, uncertainty or imprecision of data used for calculation of performances of alternatives on particular criteria, or subjective assessment of the performances. Different types of imprecise information on weights, value functions and probabilities have been dealt with in the literature (Park and Kim 1997; Weber 1987). Moreover, imprecise evaluation of alternatives on criteria has been considered in different studies. Among them, let us remember an adaptation of the DRSA to the case of multiple criteria sorting problems with imprecise evaluations and assignments (Dembczynski et al. 2009).

A review of literature on handling imprecise evaluations of alternatives in the MCDA context leads us to conclusion that there are three main approaches to this issue:

  • considering imprecise evaluations by means of probability distributions, as in the decision under uncertainty (Moskowitz et al. 1993),

  • stochastic multicriteria acceptability analysis (SMAA) (Lahdelma et al. 1998) which considers probability distributions on the space of evaluations as well as on the space of weights and computes for each alternative the probability of getting a given ranking position or the frequency with which it is preferred to another one,

  • application of fuzzy numbers for modeling imprecise evaluations of alternatives (Zadeh 1975), so that a membership function assigns to each performance a value ranging from 0 (in case of certain non-membership) to 1 (in case of certain membership), like in the adaptation of outranking methods to fuzzy evaluations of alternatives proposed by Czyżak and Słowiński (1997) [for a survey of applications of fuzzy set theory to MCDA see, e.g., Dubois (2011); handling of imprecision due to verbal evaluation of alternatives has been considered by Dong and Herrera-Viedma (2015) and surveyed by Herrera et al. (2009)].

In this paper, we approach the issue of imprecise evaluations of alternatives in a different way than above. The basic idea is that the DM, or experts advising the DM, can specify an imprecise evaluation not only in terms of the interval of possible values, but also in terms of several gradually embedded subintervals, such that each subinterval contained in a larger subinterval includes more plausible values than the larger one. Suppose, for example, that three experts assessed an investment alternative with respect to a “profit” it may bring. The first one estimates the profit could vary between 2000 and 5000, the second estimates it between 2500 and 6000, and the third one estimates it between 4000 and 8000. If the DM would like to summarize these estimates, (s)he could say that the range of the possible profit (confirmed values by at least one expert) is equal to the interval [2000, 8000], the subinterval of more plausible values (confirmed by at least two experts) is [2500, 6000], and the subinterval of the most plausible values (confirmed by all three experts) is [4000, 5000]. This type of information can be modeled by means of the n-point interval recently proposed by Ozturk et al. (2011). In this example, the information related to the profit of the investment alternative can be represented by the 6-point interval [2000, 2500, 4000, 5000, 6000, 8000]. Handling imprecise evaluations on considered criteria by n-point intervals has several advantages over other models. These benefits are listed below:

  • n-point intervals permit the DM to express imprecision in a quite easy way by using few meaningful reference values, that does not require a great cognitive effort;

  • using n-point intervals, the DM can give a finer information than just intervals of possible performances of an alternative on the considered criteria;

  • using n-point intervals, the DM is not obliged to give any exact probability distribution on imprecise performances of alternatives which, in general, is an information not available to her/him;

  • the use of n-point intervals avoids the adoption of the linear interpolation typically considered in triangular and trapezoidal fuzzy numbers, that is an arbitrary assumption to some extent.

All above points have been detailed in Sect. 3, where we compared the n-point intervals with other ways of handling imprecise evaluations, such as probabilities, stochastic multiobjective acceptability analysis and fuzzy numbers.

The paper is structured in the following way: Sect. 2 describes basic concepts of imprecise evaluations; in Sect. 3 we compare the proposed method with other ways of handling imprecision in multiple criteria evaluations; Sect. 4 provides definitions of dominance relations and their properties; Sect. 5 shows how ROR can handle imprecise evaluations; Sect. 6 provides some properties of necessary and possible preference relations; Sect. 7 presents a didactic example, and conclusions are gathered in Sect. 8; all proofs and some further results are deferred to the Appendix.

2 Imprecise evaluations—intuition behind the model and basic definitions

We are considering a decision problem in which a finite set of alternatives, denoted by \(A=\left\{ a,b,c,\ldots \right\} \), can have imprecise evaluations with respect to a set of m evaluation criteria \(G=\left\{ g_{1},\ldots ,g_m\right\} \). In case of precise evaluations, a criterion \(g_{j}\), \(j\,{\in }\,J\,{=}\,\left\{ 1,\ldots ,m\right\} \), is a function \(g_{j}:A\rightarrow X_{j}\), where \(X_{j}\) is the set of all possible evaluations (cardinal or ordinal, depending on the evaluation scale of criterion \(g_{j}\)) that an alternative could assume on criterion \(g_{j}\). In case of n-point intervals we assume that each criterion is a function \(g_{j}\,{:}\,A\,{\rightarrow }\,\mathcal{I}_{j}\), where \(\mathcal{I}_{j}=\left\{ (x_{1},\ldots ,x_{n})\,{:}\,x_{l+1}\succsim _{j}x_{l},l=1,\ldots ,n-1\right\} \subseteq {X_{j}}^{n}\), and \(\succsim _{j}\) is a complete preorder (transitive and complete binary relation) on \(\mathcal{I}_j\), representing preferences with respect to criterion \(g_j\). \(\succsim _j\) coincides with the binary relation \(\ge \) on \(\mathbb {R}\) if criterion \(g_j\) has a quantitative-numerical scale, however, it is defined differently if criterion \(g_{j}\) has a qualitative-nominal scale. To explain this difference, let us consider the evaluation of a student regarding a certain subject denoted by \(g_{j}\). If the evaluation is expressed on a quantitative-numerical scale (for example the evaluation of the student can vary between 2 and 10), then \(\succsim _{j}\) coincides with \(\ge \) because to state that student a is not worse than student b on criterion \(g_j\), it is sufficient to check if \(g_{j}(a)\ge g_j(b)\). Now, let us suppose that the evaluations with respect to this subject are expressed on the following qualitative-nominal scale: “very bad”, “bad”, “medium”, “good” and “very good”. Then, one needs to define an ordering \(\succsim _{j}\) of these nominal terms that is obviously different from the inequality \(\ge \) between two real numbers, that is “very good” \(\succsim _{j}\) “good” \(\succsim _{j}\) “medium” \(\succsim _{j}\) “bad” \(\succsim _{j}\) “very bad”.

In the description of the proposed methodology, for the sake of simplicity and without loss of generality, we shall assume the following:

  • each criterion has a quantitative scale; therefore \(X_{j}\subseteq \mathbb {R}\), and thus \(\succsim _{j}\) coincides with \(\ge \) (indeed, if the scale \(X_{j}\) of criterion \(g_j\) is ordinal, one can always encode it in numerical terms, such that for all \(x_j, y_j \in X_j\)

    $$\begin{aligned} x_j \succsim _j y_j \Leftrightarrow \overline{x}_j \ge \overline{y}_j, \end{aligned}$$

    with \(\overline{x}_j\) and \(\overline{y}_j\) being number codes of \(x_j\) and \(y_j\));

  • the greater \(g_{j}^{i}(a), a\in A\), the better is alternative a on indicator \(g_{j}^{i}\), \(j=1,\ldots ,m\), \(i=1,\ldots ,n\) (in the opposite case, we can take as indicator \(-g_{j}^{i}\) and we come back to the previous case).

We represent the imprecise evaluation of alternative a on criterion g by means of an n-point interval \([g^1(a),\ldots ,g^n(a)]\), where \(g^1(a) \le \ldots \le g^n(a)\). The n-points \(g^{1}(a),\ldots ,g^{n}(a)\) define a sequence of c nested intervals, where \(c=\frac{n}{2}\) if n is even and \(c=\frac{n+1}{2}\) if n is odd,

related to c increasing levels of plausibility \(L_1, \ldots , L_c\), such that the evaluation of alternative a on criterion g belongs to the interval \(\left[ g^1(a),g^n(a)\right] \) with level of plausibility \(L_1\), while the evaluation of a on g belongs to the interval \(\left[ g^{c}(a),g^{c+1}(a)\right] \) (or \(\left[ g^{c}(a),g^{c}(a)\right] \)) with level of plausibility \(L_c\). In general, the evaluation of a on g belongs to the interval \([g^r(a),g^{n-r+1}(a)]\) with level of plausibility \(L_{r}\), \(r=1,\ldots ,c\).

For example, consider an imprecise evaluation of an investment a on criterion “profit”, denoted \(g_1\), expressed by the following 6-point interval

[2000, 2500, 4000, 5000, 6000, 8000] with the corresponding plausibility levels

\(L_1=\) “possible”, \(L_2=\) “fairly plausible”, \(L_3=\) “very plausible”. This means that:

  • values in the interval \([2000,8000]=[g_1^1(a),g_1^6(a)]\) are possible, that is their plausibility is \(L_1\),

  • values in the interval \([2500,6000]=[g_1^2(a),g_1^5(a)]\) are fairly plausible, that is their plausibility is \(L_2\),

  • values in the interval \([4000,5000]=[g_1^3(a),g_1^4(a)]\) are very plausible, that is their plausibility is \(L_3\).

In the following, considering the same number n of indicators for each considered criterion, the evaluations of a will be represented by the following vector:

$$\begin{aligned} g(a)=\left( \left[ g_{1}^{1}(a),\ldots ,g_{1}^{n}(a)\right] ,\ldots ,\left[ g_{j}^{1}(a),\ldots ,g_{j}^{n}(a)\right] ,\ldots ,\left[ g_{m}^{1}(a),\ldots ,g_{m}^{n}(a)\right] \right) . \end{aligned}$$

Even if in this paper we consider ROR, n-point intervals can be used with any MCDA method. Indeed, from this point of view, each one of the indicators \(g_j^i(a)\), \(g_j \in G\) and \(i=1,\ldots ,n\), can be seen as a specific criterion in a reformulation of the original MCDA problem, where the original set of criteria \(G=\{g_1,\ldots ,g_m\}\) is replaced by the new set of criteria \(\overline{G}=\{g_1^1,\ldots ,g_1^n,\ldots ,g_m^1,\ldots ,g_m^n\}\). For example, if a weighted sum would be the utility model, then for each alternative \(a \in A\) the following overall value of a would be defined as:

$$\begin{aligned} U(a)=\sum _{j=1}^m \sum _{i=1}^n w_j^i g_j^i(a) \end{aligned}$$

with \(w_j^i\) representing the weight given to the indicator \(g_j^i\), such that \(w_j^i \ge 0,\) \(j =1, \ldots ,m\), \(i=1, \ldots ,n,\) and \(\displaystyle {\sum \limits _{j=1}^m \sum \limits _{i=1}^n w_j^i=1}\).

3 Comparison with other ways of handling imprecision in multiple criteria evaluations

In this section, we compare the proposed way of handling imprecise evaluations with other methods known from the literature: decision under uncertainty, SMAA, and fuzzy numbers.

3.1 Decision under uncertainty

Let us begin this section observing that the values \(g^1(a)\),...,\(g^n(a)\) of the n-point interval \([g^1(a),\ldots ,g^n(a)]\) can be interpreted as qualitative counterparts of the Hurwicz criterion (Hurwicz 1951). In case of decision under uncertainty the Hurwicz criterion suggests to evaluate the payoff of an act by the value

$$\begin{aligned} \alpha M + (1 - \alpha ) m, \end{aligned}$$

where m and M are the minimum and the maximum outcomes, and \(\alpha \in [0,1]\) is a coefficient measuring the optimism of the DM. Indeed, one can imagine that each indicator \(g^1(a)\),...,\(g^n(a)\) is related to an increasing degree of optimism, such that \(g^1(a)\) is the most pessimistic evaluation, \(g^n(a)\) is the most optimistic evaluation, and, in general, \(g^r(a)\), \(r=1,\ldots ,n,\) are evaluations such that the greater is r the more optimistic they are.

Let us observe that our approach can easily be applied also in case of probabilistic evaluations on criteria \(g_j \in G\) (see, e.g., Moskowitz et al. 1993) in the sense that imprecision is related to some probability distribution on the set of values that alternatives from A can assume on criteria \(g_j\in G\). In this case, for each \(a \in A\) and \(g_j \in G\), one can associate to each interval \([g_j^r(a),g_j^{n-r}(a)]\) when \(r\ne c\), and to the value \(g_j^r(a)\), when \(r = c\), the probability levels \(p_j^r, r=1,\ldots ,c,\) where

$$\begin{aligned} 0 \le p_j^1 \le p_j^2 \le \ldots \le p_j^c \le 1, \end{aligned}$$

such that there is a probability \(p_j^r\) that \(g_j(a)\) is not smaller than \(g_j^r(a)\), and there is an analogous probability \(p_j^r\) that \(g_j(a)\) is not greater than \(g_j^{n-r}(a)\). For example, if one knows the probability distribution \(\mathbf P \) on values of the profit for the investment considered in the previous section, and supposing that the DM is focusing (her)his attention on probability levels 10, 25 and 40 %, the 6-point interval [2000, 2500, 4000, 5000, 6000, 8000] can be interpreted as follows:

  • there is a probability of 10 % that the profit is smaller than \(g^{1}(a)=2000\), and there is the same probability of 10 % that the profit is greater \(g^{6}(a)=8000\),

  • there is a probability of 25 % that the profit is smaller than \(g^{2}(a)=2500\), and there is the same probability of 25 % that the profit is greater than \(g^{5}(a)=6000\),

  • there is a probability of 40 % that the profit is smaller than \(g^{3}(a)=4000\), and there is the same probability of 40 % that the profit is greater than \(g^{4}(a)=5000\).

In our opinion, evaluations expressed as n-point intervals have some advantages over probabilistic evaluations for the following reasons:

  • even if the probability distributions of the values taken by alternatives \(a\in A\) on criteria \(g_j \in G\) are perfectly known, n-point intervals permit to focus on the probability levels most important for the DM, so that the MCDA procedure becomes more controllable;

  • if the probability distributions of the values taken by alternatives \(a\in A\) on criteria \(g_j \in G\) are not perfectly known, n-point intervals permit to use this imperfect information taking as indicators \(g_j^r\) the values corresponding to the probability levels for which there is some information;

  • n-point intervals can also represent qualitative probability distributions by considering plausibility levels \(L_1, \ldots , L_c\) with a probabilistic meaning such as

    • \(L_1=\) “at least weakly probable”,

    • \(L_2=\) “at least fairly probable”,

    • \(L_3=\) “very probable”.

Due to the verbal ordinal scale of plausibility \(\mathcal {L}=\{L_1,\ldots ,L_c\}\), which is intuitively understandable for the DM, expressing the imprecision in terms of n-point intervals is relatively easy for the DM. Consequently, the use of n-point intervals is the best way of getting a reliable information about imprecise evaluations of alternatives.

3.2 Stochastic multiobjective acceptability analysis (SMAA) and fuzzy numbers

Handling of imprecision by n-point interval evaluations is related to SMAA (Lahdelma et al. 1998) and to the approach based on the use of fuzzy numbers.

Let us consider an interval characterized by two extreme values only, that is, the most pessimistic and the most optimistic evaluation of an alternative on the considered criterion. This means that the evaluation of alternative a on criterion g is represented by the 2-point interval \([g^{1}(a),g^{2}(a)]\), such that a could get whichever evaluation between \(g^{1}(a)\) (the most pessimistic evaluation) and \(g^{2}(a)\) (the most optimistic evaluation). This is typical for handling imprecise evaluation of alternatives on particular criteria in SMAA. However, differently from SMAA, we do not assume any probability distribution of the evaluations in the interval \([g^{1}(a),g^{2}(a)]\). In fact, within SMAA, a uniform probability distribution is considered, even if, in general, any probability distribution could be assumed. It is not easy, however, to select the proper probability distribution. Is it enough to justify the use of a uniform distribution by saying that it is the simplest one? It is rather more reasonable and methodologically correct to avoid consideration of any probability distribution (which is almost always the case in multiple criteria decision making), unless one has a strong evidence in favour of one specific probability distribution. In this sense, we retain the most stable and robust part of the information given by the interval \([g^{1}(a),g^{2}(a)]\), that is the two extreme values, and we do not assume any probability distribution.

Now, consider the case where each interval is characterized not only by the two extreme values but also by another point between them; in this way \(g(a)=[g^{1}(a),g^{2}(a),g^{3}(a)]\), and this 3-point interval indicates that, with respect to criterion g, alternative a can assume whatever evaluation between \(g^{1}(a)\) and \(g^{3}(a)\), but, it is very plausible that the evaluation assumed by a on criterion g is around \(g^{2}(a)\). This interpretation is coherent with an evaluation expressed by a linguistic variable represented by means of a triangular fuzzy number. If a linguistic variable v is represented by a triangular fuzzy number \(\widetilde{v}=(v_1,v_2,v_3)\), then the possible values of v are between \(v_1\) and \(v_3\), such that other values have a null membership and the maximum membership equal to 1 is assumed in \(v_2\). Moreover, the membership between \(v_1\) and \(v_2\), as well as that one between \(v_2\) and \(v_3\), is supposed to follow a linear interpolation. Formally, we have that the membership function \(\mu _v: \mathbb {R}\rightarrow [0,1]\) assigns to each \(x \in \mathbb {R}\) a value as shown in Fig. 1.

Fig. 1
figure 1

Membership function of a triangular fuzzy number

Observe, however, that the information represented by the 3-point interval, is less arbitrary than the one represented by a triangular fuzzy number. Indeed, in the first case, the information specifies only the minimum and maximum values, \(v_1\) and \(v_3\), respectively, and the most plausible value, \(v_2\). We avoid to assign a value of membership to all the values between \(v_1\) and \(v_3\), which would be arbitrary. The question is: why a linear interpolation and not some other interpolating function? Even if one would assume another interpolating function, how to verify that the values taken by the membership function are correct? This doubt makes our proposal more trustworthy again.

Finally, let us consider an imprecise evaluation represented by a 4-point interval \([g^{1}(a),g^{2}(a),g^{3}(a),g^{4}(a)]\), which means that the possible evaluation of a is included between \(g^{1}(a)\) and \(g^{4}(a)\), but the most plausible evaluation is between \(g^{2}(a)\) and \(g^{3}(a)\). This interpretation is coherent with an evaluation expressed by a linguistic variable represented by a trapezoidal fuzzy number. If a linguistic variable v is represented by a trapezoidal fuzzy number \(\widetilde{v}=(v_1,v_2,v_3,v_4)\), then the possible values of v are between \(v_1\) and \(v_4\), such that other values have a null membership and the maximum membership equal to 1 is assumed between \(v_2\) and \(v_3\). Moreover, the membership between \(v_1\) and \(v_2\), as well as that one between \(v_3\) and \(v_4\), is following a linear interpolation. Formally we have that the membership function \(\mu _v: \mathbb {R}\rightarrow [0,1]\) assigns to each \(x \in \mathbb {R}\) a value as shown in Fig. 2.

Fig. 2
figure 2

Membership function of a trapezoidal fuzzy number

The analogy between n-point intervals and fuzzy numbers could be continued for n greater than 4, however, the argument about the sensitivity of the end result on the choice of the interpolating function would be still in favor of our proposal.

4 Definitions of dominance relations and their properties

Various types of dominance relations stem from the formulation of multiple criteria decision problem without considering preferences. The following two concepts of dominance arise naturally in the context of imprecise evaluations of alternatives:

Definition 4.1

Given alternatives \(a,b\in A\), and \(i,k\in \left\{ 1,\ldots ,n\right\} \), we say that “a (ik)-dominates b”, denoted by \(a\varDelta ^{(i,k)}b\), if \(g_{j}^{i}(a)\ge g_{j}^{k}(b),\) \(\forall j=1,\ldots ,m\).

Definition 4.2

Given alternatives \(a,b\in A,\) we say that “a normally dominates b”, denoted by \(a\varDelta b\), if \(g_{j}^{i}(a)\ge g_{j}^{i}(b)\), \(\forall j=1,\ldots ,m\), and \(\forall i=1,\ldots ,n.\)

Equivalently, we can say that “a normally dominates b” if a (ii)-dominates b, \(\forall i=1,\ldots ,n.\)

Let us explain the meaning of the two above concepts of dominance taking into consideration alternatives abd that get 3-point evaluations on criteria \(g_1\), \(g_2\) and \(g_3\), where \(L_1\) = “possible” and \(L_2\) = “very plausible”, as follows:

  • \(\big [g_1^1(a),g_{1{}_{\phantom {\frac{1}{1}}}}^2(a),g_1^3(a)\big ]=[25, 35, 40]\), \(\big [g_2^1(a),g_2^2(a),g_2^3(a)\big ]=[40, 55, 65]\),

    \(\big [g_{3{}_{\phantom {\frac{1}{1}}}}^1(a),g_3^2(a),g_3^3(a)\big ]=[25, 50, 55],\)

  • \([g_{1{}_{\phantom {\frac{1}{1}}}}^1(b),g_1^2(b),g_1^3(b)\big ]=[10, 20, 45]\), \(\big [g_2^1(b),g_2^2(b),g_2^3(b)\big ]=[35, 50, 60]\),

    \(\big [g_{3{}_{\phantom {\frac{1}{1}}}}^1(b),g_3^2(b),g_3^3(b)\big ]=[20, 45, 60],\)

  • \(\big [g_{1{}_{\phantom {\frac{1}{1}}}}^1(d),g_1^2(d),g_1^3(d)\big ]=[10, 15, 20]\), \(\big [g_2^1(d),g_2^2(d),g_2^3(d)\big ]=[30, 50, 55]\),

    \(\big [g_3^1(d),g_3^2(d),g_3^3(d)\big ]=[20, 45, 55].\)

Observe that, according to indicator \(g_j^1\), \(j=1,2,3\), alternative a is at least as good as alternative b: indeed \(g_1^1(a) \ge g_1^1(b)\), \(g_2^1(a) \ge g_2^1(b)\) and \(g_3^1(a) \ge g_3^1(b)\). Thus, considering the most pessimistic evaluation, a is not worse than b on all considered criteria. Therefore, one can say that “a (1, 1)-dominates b”, denoted by \(a\varDelta ^{(1,1)}b\). Analogously, according to indicator \(g_j^2\), \(j=1,2,3,\) a is at least as good as b. This means that considering the most plausible evaluation, a is not worse than b on all considered criteria. Instead, it is not true that according to indicators \(g_j^3\), \(j=1,2,3,\) a is at least as good as b, because \(g_3^3(a) < g_3^3(b)\). This means that considering the most optimistic evaluation, a is not at least as good as b on all considered criteria. Observe, instead, that \(a\varDelta ^{(1,1)}d\), \(a\varDelta ^{(2,2)}d\) and \(a\varDelta ^{(3,3)}d\), so one can conclude that \(a\varDelta d\), which means that a is not worse than d on all criteria considering pessimistic evaluations, the most plausible evaluations and the most optimistic evaluations on all three criteria.

Observe also that for some criterion g, one could take into account the most optimistic evaluation for a and the most plausible evaluation for b, i.e., one could consider indicator \(g^3\) for a and indicator \(g^2\) for b. This is consistent with preference representation by interval orders (Fishburn 1985). An interval order is a binary relation R on a set X which is reflexive and Ferrers transitive (i.e., for all \(x,y,w,z \in X, xRy\) and wRz imply xRz or wRy). If X is finite, then a binary relation R on X is an interval order if and only if there exists \(u^+:X\rightarrow \mathbb {R}\) and \(u^-:X\rightarrow \mathbb {R}\) with \(u^+(z) \ge u^-(z)\) for all \(z \in X\), such that for all \(x,y \in X\)

$$\begin{aligned} xRy \Leftrightarrow u^+(x) \ge u^-(y). \end{aligned}$$

Suppose that the interval order R is a weak preference relation on X, such that for each \(x,y \in X, xRy\) means that x is at least as good as y. In this case, \(u^+(z)\) and \(u^-(z)\) can be interpreted as the optimistic and the pessimistic evaluation of \(z \in X\) and, consequently, one can say that x is weakly preferred to y, i.e., xRy, if the optimistic evaluation of x, i.e., \(u^+(x)\), is not worse than the pessimistic evaluation of y, i.e., \(u^-(y)\). Observe moreover that \(R^d\), being the dual of R, i.e., the complement of the inverse of R, such that for all \(x,y \in X\), \(xR^d y\) iff not(yRx),  can be interpreted as a strong preference relation on X. Thus, we have that for all \(x,y \in X\),

$$\begin{aligned} xR^dy \Leftrightarrow u^-(x) > u^+(y), \end{aligned}$$

which can be interpreted as “x is strongly preferred to y if and only if the pessimistic evaluation of x is better than the optimistic evaluation of y”.

The idea of considering the pessimistic and the optimistic evaluations of alternatives to define preference relations with respect to criterion \(g_j\in G\) can be easily extended to the case of n-point intervals considering indicator \(g_{j}^{i}\) for alternative x and indicator \(g_{j}^{k}\) for alternative y, which permit to say that x is (ik)-preferred to y on criterion \(g_j \in G\) (denoted by \(x \succsim _j^{(i,k)} y\)), if \(g_j^{i}(x) \ge g_j^{k}(y)\). Considering the above example, we have \(a \succsim _3^{(3,2)} b\) because \(g_3^3(a) \ge g_3^2(b)\), which means that with respect to criterion \(g_3\) the most optimistic evaluation of a is not worse than the most plausible evaluation of b.

Using preference relations \(\succsim _j^{(i,k)}\) one can say that for all \(a,b \in A\), \(a \varDelta ^{(i,k)} b\) if and only if \(a \succsim _j^{(i,k)} b\) for all \(g_j \in G\). For example, \(a \varDelta ^{(3,2)} b\), because \(g_1^3(a) \ge g_1^2(b)\), \(g_2^3(a) \ge g_2^2(b)\) and \(g_3^3(a) \ge g_3^2(b)\), which means that on all three criteria the optimistic evaluation of a is at least as good as the most plausible evaluation of b.

It is also worth noting that among all the considered pairs (ik), an important place has to be given to the pairs \((i,n-i+1)\) with \(i \le c\) Footnote 1 because \(a \succsim _j^{(i,n-i+1)} b\) means that all the evaluations of \(a\in A\) on \(g_j \in G\) with plausibility \(L_i\), i.e., \([g_j^i(a),g_j^{n-i+1}(a)]\), are not worse than all evaluations of \(b \in A\) on \(g_j\) with the same plausibility \(L_i\), \([g_j^i(b),g_j^{n-i+1}(b)]\). Looking at the above example, \(a \succsim _1^{(1,3)} d\), are not worse than all values with plausibility \(L_1\) for d, i.e. \([g_1^1(d),g_1^3(d)]=[10,20]\).

The following proposition provides some basic properties of the dominance relations introduced above for n-point interval evaluations. These are the properties of reflexivity and transitivity that, if satisfied conjointly, characterize the structure of a partial preorder.

Proposition 4.1

  1. 1.

    If \(i\ge k,\) \(i,k\in \left\{ 1,\ldots ,n\right\} \), then \(\varDelta ^{(i,k)}\) is reflexive,

  2. 2.

    If \(i\le k,\) \(i,k\in \left\{ 1,\ldots ,n\right\} \), then \(\varDelta ^{(i,k)}\) is transitive,

  3. 3.

    For each \(i\in \left\{ 1,\ldots ,n\right\} ,\) \(\varDelta ^{(i,i)}\) is a partial preorder,

  4. 4.

    If \(r\ge i\) and \(s\le k,\) \(i,k,r,s\in \left\{ 1,\ldots ,n\right\} \), then \(\varDelta ^{(i,k)}\;\subseteq \;\varDelta ^{(r,s)}\),

  5. 5.

    Given alternatives \(a,b,c\in A\), if \(a\varDelta ^{(i,k)}b,\) \(b\varDelta ^{(i_1,k_1)}c,\) and \(k\ge i_1\), \(i,k,i_1,k_1\in \left\{ 1,\ldots ,n\right\} \), then \(a\varDelta ^{(r,s)}c\) with \(r,s\in \left\{ 1,\ldots ,n\right\} \), such that \(r\ge i\) and \(s\le k_1,\)

  6. 6.

    \(\varDelta \) is a partial preorder,

  7. 7.

    Given alternatives \(a,b,c\in A\), if \(a\varDelta ^{(i,k)}b,\) \(b\varDelta c,\) \(i,k\in \left\{ 1,\ldots ,n\right\} \), then \(a\varDelta ^{(s,t)}c\) with \(s,t\in \left\{ 1,\ldots ,n\right\} \), such that \(s\ge i\) and \(t\le k\),

  8. 8.

    Given alternatives \(a,b,c\in A\), if \(a\varDelta b,\) \(b\varDelta ^{(i,k)} c,\) \(i,k\in \left\{ 1,\ldots ,n\right\} \), then \(a\varDelta ^{(s,t)}c\) with \(s,t\in \left\{ 1,\ldots ,n\right\} \), such that \(s\ge i\) and \(t\le k\).

Note 4.1

In the following, we shall call strong dominance, and we shall denote it by \(\varDelta ^{S}\), the dominance relation \(\varDelta ^{(1,n)}\). Similarly, we shall call weak dominance, and we shall denote it by \(\varDelta ^{W}\), the dominance relation \(\varDelta ^{(n,1)}\). When \(n=2\), then the dominance relation \(\varDelta ^{(i,k)}\) boils down to strong dominance \(\varDelta ^{(1,2)}\), weak dominance \(\varDelta ^{(2,1)}\) and to the dominance relations \(\varDelta ^{(1,1)}\) and \(\varDelta ^{(2,2)}\) comparing two alternatives considering only their best values or their worst values. Using Proposition 4.1, we can state that weak dominance and normal dominance are reflexive relations, normal dominance and strong dominance are transitive relations, and so on.

Considering the strong and the weak dominance relations simultaneously, we can state the following proposition:

Proposition 4.2

  1. 1.

    \(\varDelta ^{(1,n)} \;\subseteq \; \varDelta \;\subseteq \; \varDelta ^{(n,1)},\)

  2. 2.

    For \(i,k=1,\ldots ,n\), \(\varDelta ^{(1,n)} \;\subseteq \; \varDelta ^{(i,k)} \;\subseteq \; \varDelta ^{(n,1)}.\)

Note 4.2

Proposition 4.2 shows how important are the weak and the strong dominance relations, as they are the only two relations that can be compared directly with \(\varDelta \). Indeed, in general, for any \((i,k)\in \left\{ 1,\ldots ,n\right\} \times \left\{ 1,\ldots ,n\right\} \), with \((i,k)\ne (1,n)\) and \((i,k)\ne (n,1)\), we can have \(\varDelta ^{(i,k)}\not \subseteq \varDelta \) and \(\varDelta \not \subseteq \varDelta ^{(i,k)}\).

5 Robust ordinal regression for imprecise evaluations—description of the methodology

The dominance relation, which is the only objective information that comes with the statement of a multiple criteria decision problem, is very poor. For this reason, in order to handle DM’s preferences in one of the three typical multiple criteria decision problems, we are using the MAVT (Keeney and Raiffa 1993). MAVT considers value functions \(U(g_1(a), \ldots , g_m(a))\) where

$$\begin{aligned} \mathbb {R}^m \rightarrow \mathbb {R}\end{aligned}$$

such that:

$$\begin{aligned} ``a \,\,\text {is at least as good as}\,\, b{\text {''}} \ \Leftrightarrow \ U(g_1(a), \ldots , g_m(a)) \ge U(g_1(b), \ldots , g_m(b)), \end{aligned}$$

taking into account the evaluations of alternatives on the m considered criteria. In case of imprecise evaluations, we are considering for each criterion \(g_{j}, j\in J,\) n indicators \(g_{j}^{i}:A\rightarrow X_j\), \(i=1,\ldots ,n\), assigning to each alternative \(a\in A\) the i-th evaluation from interval \(g_{j}(a)\). Using this notation, we can distinguish different types of value functions:

  • i-th sub-marginal value function referring to the i-th indicator of criterion \(g_{j}\), \(u_{j}^{i}(g_{j}^{i}(a)):X_{j}\rightarrow \mathbb {R}\), for all \(j\in J\), and \(i=1,\ldots ,n,\)

  • marginal value function referring to criterion \(g_{j}\), such that \(U_j([g_{j}^{1}(a),\ldots , g_{j}^{n}(a)]):\mathcal{I}_{j}\rightarrow \mathbb {R}\), and

    $$\begin{aligned} U_{j}\left( \left[ g_{j}^{1}(a),\ldots ,g_{j}^{n}(a)\right] \right) =u_{j}^{1}\left( g_{j}^{1}(a)\right) +\ldots +u_{j}^{n}\left( g_{j}^{n}(a)\right) . \end{aligned}$$

    The marginal value of alternative a with respect to criterion \(g_{j}\) depends on all the n indicators \(g_{j}^{i}\) because each of them takes part in the evaluation of a on criterion \(g_{j}\) with a different level of plausibility, which is represented by the corresponding sub-marginal value function \(u_{j}^i\). In the paper, we admit that the indicators of each criterion are preferentially independent, which permits an additive aggregation of sub-marginal value functions (Keeney and Raiffa 1993). An analogous assumption is adopted with respect to aggregation of marginal value functions.

  • total additive value function

    $$\begin{aligned} U\left( \left[ g_{1}^{1}(a)\ldots ,g_{1}^{n}(a)\right] ,\ldots ,\left[ g_{m}^{1}(a),\ldots ,g_{m}^{n}(a)\right] \right) :\mathcal{I}_{1}\times \cdots \times \mathcal{I}_{m}\rightarrow \mathbb {R}\end{aligned}$$

    such that

    $$\begin{aligned}&U\left( \left[ g_{1}^{1}(a)\ldots ,g_{1}^{n}(a)\right] ,\ldots ,\left[ g_{m}^{1}(a),\ldots ,g_{m}^{n}(a)\right] \right) = \nonumber \\&\quad =\sum _{j=1}^{m}U_{j}\left( \left[ g_{j}^{1}(a),\ldots ,g_{j}^{n}(a)\right] \right) =\sum _{j=1}^{m}\left[ \sum _{i=1}^{n}u_{j}^{i}\left( g_{j}^{i}(a)\right) \right] . \end{aligned}$$
    (1)

In the following, for the sake of simplicity, for each \(j\in J\) we write \(U_{j}(a)\) instead of \(U_j([g_{j}^{1}(a),\ldots ,g_{j}^{n}(a)])\), and U(a) instead of \(U\left( \left[ g_{1}^{1}(a),\ldots ,g_{1}^{n}(a)\right] ,\ldots ,\left[ g_{m}^{1}(a),\ldots ,g_{m}^{n}(a)\right] \right) \).

In order to take into account the imprecise nature of evaluations, we consider for each alternative \(a\in A\), n fictitious alternatives \(a^{(i)}\), having precise evaluations on all criteria, equal to the i-th point of interval \(g_{j}(a)\), for each \(j\in J\), i.e., \(g_{j}^1\left( a^{(i)}\right) =\cdots =g_{j}^n\left( a^{(i)}\right) =g_{j}^i(a)\), for each \(j\in J\).

For example, in case of three point intervals, \(a^{(1)}\), \(a^{(2)}\) and \(a^{(3)}\) represent the “most pessimistic”, the “most plausible” and the “most optimistic” realizations of alternative a, because these fictitious alternatives take the worst, the average and the best evaluations on all considered criteria, respectively. Note that given \(a\in A\), a value function U assigns to the corresponding alternatives \(a^{(i)}\) the value:

$$\begin{aligned} U\left( a^{(i)}\right) =u_{1}^{1}\left( g_{1}^{i}(a)\right) +\cdots +u_{1}^{n}\left( g_{1}^{i}(a)\right) +\ldots +u_{m}^{1}\left( g_{m}^{i}(a)\right) +\cdots +u_{m}^{n}\left( g_{m}^{i}(a)\right) . \end{aligned}$$
(2)

Proposition 5.1 describes the relationship between the value functions of the fictitious alternatives \(a^{(i)}\), and the relationship between the total additive value function U(a) and the value obtained in correspondence to the most pessimistic and the most optimistic realizations of alternative a.

Proposition 5.1

  1. 1.

    For each \(a\in A,\) if \(i\ge k\), \(i,k\in \left\{ 1,\ldots ,n\right\} \), then \(U(a^{(i)})\ge U(a^{(k)}),\)

  2. 2.

    For each \(a\in A,\) \(U(a^{(1)})\le U(a)\le U(a^{(n)}).\)

Let us now discuss elicitation of preference information and application of ROR to model (1).

In order to assign to each alternative a real number representing its degree of desirability, we need to know the sub-marginal value functions \(u_{j}^{i}(\cdot ),\) for all \(j\in J\) and for all \(i\in \left\{ 1,\ldots ,n\right\} \). They can be obtained in two different ways: asking directly the DM which is the analytical expression of functions \(u_{j}^{i}(\cdot )\), or inducing them from indirect preference information elicited by the DM on a set \(A^{R}\subseteq A\) of alternatives called reference alternatives. The reference alternatives will be marked with a dash, like \(\overline{a}\). We propose to use the second method, and thus the DM is asked to provide some preference information regarding pairs of alternatives or intensity of preference for quadruples of alternatives, such that, for all \(\overline{a}, \overline{b}, \overline{c}, \overline{d} \in A^R\),

  • \(\overline{a}\succsim \overline{b}\) iff \(\overline{a}\) is at least as good as \(\overline{b}\),

  • \(\overline{a}\succsim _j \overline{b}\) iff \(\overline{a}\) is at least as good as \(\overline{b}\) on criterion \(g_j\),

  • \((\overline{a},\overline{b}) \succsim ^{*} (\overline{c}, \overline{d})\) iff \(\overline{a}\) is preferred to \(\overline{b}\) at least as much as \(\overline{c}\) is preferred to \(\overline{d}\),

  • \((\overline{a},\overline{b})\succsim _{j}^{*} (\overline{c},\overline{d})\) iff, on criterion \(g_j\), \(\overline{a}\) is preferred to \(\overline{b}\) at least as much as \(\overline{c}\) is preferred to \(\overline{d}\).

Let us notice that \(\sim \), \(\sim _{j},\) \(\sim ^*\) and \(\sim ^*_{j}\) are the symmetric parts of \(\succsim \), \(\succsim _{j},\) \(\succsim ^*\) and \(\succsim ^*_{j}\) while \(\succ \), \(\succ _{j},\) \(\succ ^*\) and \(\succ ^*_{j}\) are the asymmetric parts of \(\succsim \), \(\succsim _{j},\) \(\succsim ^*\) and \(\succsim ^*_{j}\). That is, for example, \(\overline{a}\sim \overline{b}\) iff \(\overline{a}\succsim \overline{b}\) and \(\overline{b}\succsim \overline{a}\), while \(\overline{a}\succ \overline{b}\) iff \(\overline{a}\succsim \overline{b}\) and not(\(\overline{b}\succsim \overline{a}\)).

All this preference information can be translated into inequality constraints on the values as follows:

  • \(U(\overline{a})\ge \left[ >\right] U(\overline{b})\) iff \(\overline{a}\succsim \left[ \succ \right] \overline{b}\),

  • \(U_j(\overline{a})\ge \left[ >\right] U_j(\overline{b})\) iff \(\overline{a}\succsim _j \left[ \succ _j\right] \overline{b}\),

  • \(U(\overline{a})-U(\overline{b})\ge \left[ >\right] U(\overline{c})-U(\overline{d})\ge 0\) iff \((\overline{a},\overline{b})\succsim ^*\left[ \succ ^*\right] (\overline{c},\overline{d})\),

  • \(U_j(\overline{a})-U_j(\overline{b})\ge \left[ >\right] U_j(\overline{c})-U_j(\overline{d})\ge 0\) iff \((\overline{a},\overline{b})\succsim _j^*\left[ \succ _j^*\right] (\overline{c},\overline{d})\).

Strict inequality constraints are translated into weak inequality constraints by using an auxiliary positive variable \(\varepsilon \) such that, for example, \(U(\overline{a})>U(\overline{b})\) becomes \(U(\overline{a})\ge U(\overline{b})+\varepsilon \). In the following, \(E^{DM}\) denotes the set of inequality constraints translating the preference information provided by the DM.

We shall call compatible a value function satisfying the set of constraints in \(E^{DM}\), as well as some monotonicity and normalization constraints:

$$\begin{aligned} \left. \begin{array}{l} \;u_{j}^{i}(x_{j}^{k})-u_{j}^{i}(x_{j}^{k-1})\ge 0, \; \text{ for } \text{ each }\; j\in J,\;k=2, \ldots , m_{j}(A), \;i=1,\ldots ,n\\ \;u_{j}^{i}(x_{j}^{1})=0, \; \text{ for } \text{ each }\; j\in J,i=1,\ldots ,n\\ \;\displaystyle \sum _{\begin{array}{c} j\in J \\ i=1,\ldots ,n \end{array}}u_{j}^{i}\left( x_{j}^{m_{j}(A)}\right) =1 \; \\ \end{array} \right\} E^{MN} \end{aligned}$$

where, for each \(j\in J\), \(m_{j}(A)=\left| \left\{ g_{j}^{i}(a), \;i=1,\ldots ,n, \;a\in A\right\} \right| \), \(x_{j}^{m_{j}(A)}=\displaystyle \max _{a\in A} g_{j}^{n}(a)\), \(\displaystyle x_{j}^{1}=\min _{a\in A} g_{j}^{1}(a),\) and the values \(x_{j}^{k}, k=1, \ldots ,m_{j}(A),\) are ordered in an increasing way, i.e., \(x_{j}^{1}<x_{j}^{2}< \ldots<x_{j}^{m_{j}(A)-1}<x_{j}^{m_{j}(A)}.\)

Denoting by \(E^{A^{R}}=\left\{ E^{DM}\cup E^{MN}\right\} \) the whole set of constraints, to check the existence of at least one compatible value function, one has to solve the following optimization problem

$$\begin{aligned} \begin{array}{l} \;\varepsilon ^*=\max \varepsilon , \\ \;\text{ subject } \text{ to } \;E^{A^R} \end{array} \end{aligned}$$

where the variables are \(u_{j}^{i}(x_{j}^{k})\), \(j\in J\), \(i\in \left\{ 1,\ldots ,n\right\} \), \(k=1,\ldots ,m_{j}(A)\), and \(\varepsilon .\) If \(E^{A^R}\) is feasible and \(\varepsilon ^*> 0\), then there exists at least one compatible value function \(U(\cdot )\); conversely, there does not exist any compatible value function \(U(\cdot )\).

Supposing that more than one compatible value function exists, we indicate by \(\mathcal{U}\) the set of all compatible value functions; in general, each of these functions will produce a different ranking on the set A of alternatives. This is why ROR methods take into account all compatible value functions instead of only one.

Definition 5.1

Given two alternatives \(a,b\in A\) and the set \(\mathcal{U}\) of compatible value functions on \(A^{R}\subseteq A,\) we say that a is possibly preferred to b, if a is at least as good as b for at least one compatible value function:

$$\begin{aligned} a\succsim ^{P}b \Leftrightarrow \;\text{ there } \text{ exists }\; U\in \mathcal{U}: U(a)\ge U(b). \end{aligned}$$

Definition 5.2

Given two alternatives \(a,b\in A\) and the set \(\mathcal{U}\) of compatible value functions on \(A^{R}\subseteq A,\) we say that a is necessarily preferred to b, if a is at least as good as b for all compatible value functions:

$$\begin{aligned} a\succsim ^{N}b \Leftrightarrow U(a)\ge U(b), \;\text{ for } \text{ all }\; U\in \mathcal{U}. \end{aligned}$$

Following the description of the \(\varDelta ^{(i,k)}\) dominance relation defined in Sect. 4, it is meaningful from the DM’s point of view to compare \(U(a^{(i)})\) and \(U(b^{(k)})\) for all compatible value functions \(U\in \mathcal{U}\) and for all pairs of indicators (ik). In fact, even if \(not(a\succsim ^{N}b)\), that is a is not at least as good as b for all compatible value functions, it is interesting to check if there exists some pair of indicators (ik) such that \(U(a^{(i)})\ge U(b^{(k)})\) for all compatible value functions in order to understand which is the degree of plausibility of the necessary preference of a over b. For example, considering two alternatives \(a,b\in A\) evaluated by means of 3-point intervals, as done in Sect. 4, if one discovers that \(U(a^{(2)})\ge U(b^{(3)})\) for all \(U\in \mathcal{U}\) this means that, considering the most plausible evaluations for a (i.e., \(g_{j}^{2}(a)\) for all j) and the most optimistic evaluations for b (i.e., \(g_{j}^{3}(b)\) for all j), a is at least as good as b for all compatible value functions.

Analogously, it is interesting to understand if there exists some pair of indicators (ik) such that \(U(a^{(i)})\ge U(b^{(k)})\) for at least one value function compatible with the preferences provided by the DM. Going back to the previous example, if one gets that there does not exist any value function such that \(U(b^{(3)})\ge U(a^{(2)})\), this means that even considering the best evaluations of b (i.e., \(g_{j}^{(3)}(b)\) for all j) and the most plausible evaluations for a (i.e., \(g_{j}^{(2)}(a)\) for all j), there is no compatible value function for which b is at least as good as a.

In consequence of these considerations, the following two types of necessary and possible preference relations can be considered:

Definition 5.3

Given two alternatives \(a,b\in A\), the set \(\mathcal{U}\) of compatible value functions on \(A^{R}\subseteq A,\) and \(i,k\in \left\{ 1,\ldots ,n\right\} \), we say that a is (ik)-possibly preferred to b, if \(a^{(i)}\) is at least as good as \(b^{(k)}\) for at least one compatible value function:

$$\begin{aligned} a\succsim _{(i,k)}^{P}b \Leftrightarrow \;\text{ there } \text{ exists }\; U\in \mathcal{U}: U\left( a^{(i)}\right) \ge U\left( b^{(k)}\right) . \end{aligned}$$

Definition 5.4

Given two alternatives \(a,b\in A\), the set \(\mathcal{U}\) of compatible value functions on \(A^{R}\subseteq A,\) and \(i,k\in \left\{ 1,\ldots ,n\right\} \), we say that a is (ik)-necessarily preferred to b, if \(a^{(i)}\) is at least as good as \(b^{(k)}\) for all compatible value functions:

$$\begin{aligned} a\succsim _{(i,k)}^{N}b \Leftrightarrow U\left( a^{(i)}\right) \ge U\left( b^{(k)}\right) , \;\text{ for } \text{ all }\; U\in \mathcal{U}. \end{aligned}$$

For all \(a,b\in A\), and for all \(i,k\in \left\{ 1,\ldots ,n\right\} \), we have that:

  • \(a\succsim ^{P}b\) if \(E^{P}(a,b)=E^{A^{R}}\cup \left\{ U(a)\ge U(b)\right\} \) is feasible and \(\varepsilon ^{P}(a,b)>0,\) where \(\varepsilon ^{P}(a,b)=\max \varepsilon \),    s.t. constraints \(E^{P}(a,b)\),

  • \(a\succsim ^{N}b\) if \(E^{N}(a,b)=E^{A^{R}}\cup \left\{ U(b)\ge U(a)+\varepsilon \right\} \) is infeasible or \(\varepsilon ^{N}(a,b)\le 0,\) where \(\varepsilon ^{N}(a,b)=\max \varepsilon \),    s.t. constraints \(E^{N}(a,b)\),

  • \(a\succsim _{(i,k)}^{P}b\) if \(E_{(i,k)}^{P}(a,b)=E^{A^R}\cup \left\{ U(a^{(i)})\ge U(b^{(k)})\right\} \) is feasible and \(\varepsilon _{(i,k)}^{P}(a,b)>0,\) where \(\varepsilon _{(i,k)}^{P}(a,b)=\max \varepsilon \),    s.t. constraints \(E_{(i,k)}^{P}(a,b)\),

  • \(a\succsim _{(i,k)}^{N}b\) if \(E_{(i,k)}^{N}(a,b)=E^{A^R}\cup \left\{ U(b^{(k)})\ge U(a^{(i)})+\varepsilon \right\} \) is infeasible or \(\varepsilon _{(i,k)}^{N}(a,b)\le 0,\) where \(\varepsilon _{(i,k)}^{N}(a,b)=\max \varepsilon \),    s.t. constraints \(E_{(i,k)}^{N}(a,b)\).

6 Properties of necessary and possible preference relations

As to the basic properties of the classical necessary and possible preference relations, they were discussed in Greco et al. (2008) and Giarlotta and Greco (2013). Let us recall that the necessary relation is included in the possible preference relation (\(\succsim ^N\subseteq \succsim ^P\)), that the necessary preference relation is a partial preorder on A, and that the possible preference relation is strongly complete and negatively transitive.

Proposition 6.1 describes the relationship between the classical dominance and necessary preference relations, and then the relationship between the dominance and necessary preference relations in case of imprecise evaluations.

Proposition 6.1

  1. 1.

    \(\varDelta \;\subseteq \;\succsim ^{N}\),

  2. 2.

    For all \(i,k\in \left\{ 1,\ldots ,n\right\} \), \(\varDelta ^{(i,k)}\;\subseteq \;\succsim _{(i,k)}^{N}.\)

Proposition 6.2 gives some properties of the necessary and possible preference relations in case of imprecise evaluations. Moreover, it provides an inclusion property for \(\succsim ^{N}_{(i,k)}\) and \(\succsim ^{P}_{(i,k)}\), and a completeness property for \(\succsim ^{N}_{(i,k)}\) and \(\succsim ^{P}_{(k,i)}\).

Proposition 6.2

  1. 1.

    For all \(i,k\in \left\{ 1,\ldots ,n\right\} \), \(\succsim _{(i,k)}^{N}\;\subseteq \;\succsim _{(i,k)}^{P}\),

  2. 2.

    If \(i\ge k\), \(i,k\in \left\{ 1,\ldots ,n\right\} \), then \(\succsim _{(i,k)}^{N}\) is reflexive,

  3. 3.

    If \(i\le k\), \(i,k\in \left\{ 1,\ldots ,n\right\} \), then \(\succsim _{(i,k)}^{N}\) is transitive,

  4. 4.

    For all \(a,b\in A\), for all \(i,k\in \left\{ 1,\ldots ,n\right\} \), we have \(a\succsim _{(i,k)}^{N}b\) or \(b\succsim _{(k,i)}^{P}a\),

  5. 5.

    If \(i\ge k\), \(i,k\in \left\{ 1,\ldots ,n\right\} \), then \(\succsim _{(i,k)}^{P}\) is strongly complete and negatively transitive.

Note 6.1

Let us observe that by points 1 and 3 of Proposition 6.2, \(\succsim ^{N}_{(i,i)}\) is a partial preorder for all \(i=1,\ldots ,n\).

Proposition 6.3 specifies the inclusion between different necessary and possible preference relations in case of imprecise evaluations. Moreover, it links the preference relations for the case of imprecise evaluations with the preference relations for the case of precise evaluations.

Proposition 6.3

  1. 1.

    If \(i_1\ge i\) and \(k_1\le k\), \(i,k,i_1,k_1\in \left\{ 1,\dots ,n\right\} \), then \(\succsim _{(i,k)}^{N}\;\subseteq \;\succsim _{(i_1,k_1)}^{N}\),

  2. 2.

    If \(i_1\ge i\) and \(k_1\le k\), \(i,k,i_1,k_1\in \left\{ 1,\dots ,n\right\} \), then \(\succsim _{(i,k)}^{P}\;\subseteq \;\succsim _{(i_1,k_1)}^{P}\),

  3. 3.

    \(\succsim _{(1,n)}^{N}\;\subseteq \;\succsim ^{N}\;\subseteq \;\succsim _{(n,1)}^{N},\)

  4. 4.

    \(\succsim _{(1,n)}^{P}\;\subseteq \;\succsim ^{P}\;\subseteq \;\succsim _{(n,1)}^{P}.\)

Some propositions specifying more properties of the necessary and possible preference relations in case of imprecise evaluations are given in the Appendix.

Note 6.2

Since the necessary and possible preference relations \(\succsim ^{N}_{(1,n)}\), \(\succsim ^{N}_{(n,1)}\), \(\succsim ^{P}_{(1,n)}\) and \(\succsim ^{P}_{(n,1)}\), are the only preference relations for the case of imprecise evaluations that can be linked to the necessary and possible preference relations for the case of precise evaluations, we shall use also the following notation:

  • “strongly necessary preference relation”, to indicate the necessary preference relation \(\succsim _{(1,n)}^{N}\) (denoted by \(\succsim ^{SN}\)),

  • “strongly possible preference relation”, to indicate the possible preference relation \(\succsim _{(1,n)}^{P}\) (denoted by \(\succsim ^{SP}\)),

  • “weakly necessary preference relation”, to indicate the necessary preference relation \(\succsim _{(n,1)}^{N}\) (denoted by \(\succsim ^{WN}\)), and

  • “weakly possible preference relation”, to indicate the possible preference relation \(\succsim _{(n,1)}^{P}\) (denoted by \(\succsim ^{WP}\)).

Considering the weak, normal and strong dominance, necessary and possible preference relations, as a straightforward consequence of Propositions 4.2, 5.1, 6.1 and 6.2, and of the inclusion \(\succsim ^{N}\subseteq \succsim ^P\), we obtain the set of relationships shown in Fig. 3.

Fig. 3
figure 3

Relationships between all kinds of dominance relations and preference relations

7 A simple example

Let us imagine that the dean of a high school intends to give a scholarship to a good student; for this reason, she has to choose a laureate among 10 students of the school considered to be the best candidates. In order to manage this situation, the dean decides to use an MCDA approach taking into account evaluations of the student on three subjects: Mathematics (Mat), Physics (Phy) and Computer Science (Com). Each subject is thus an evaluation criterion with an ordinal scale composed of five levels ordered from the worst to the best: Very Bad (VB), Bad (B), Medium (M), Good (G), and Very Good (VG). Differently from previous cases, the dean has to face a new problem because some students got imprecise evaluations on some criteria. The students’ evaluations are shown in Table 1.

Table 1 Evaluations of students on three criteria
Fig. 4
figure 4

Dominance relations in the set of students obtained for imprecise evaluations shown in Table 1 and according to definitions introduced in Sect. 2; (a) dominance relation \(\varDelta \); (b) weak dominance relation \(\varDelta ^{(3,1)}\); (c) strong dominance relation \(\varDelta ^{(1,3)}\); (d) dominance relation for evaluations with the highest level of plausibility \(\varDelta ^{(2,2)}\)

Fig. 5
figure 5

Necessary preference relations in the set of students obtained after including the preference information provided by the dean and computed according to definitions given in Sect. 5. Full arrows represent the corresponding dominance relation; dotted arrows represent preference information provided by the dean; bold arrows represent new necessary preferences got in consequence of the added preference information. Note that by point 6.2 of Proposition 6.2, the strong necessary preference relation \(\succsim ^{N}_{(1,3)}\) is transitive while the weak necessary preference relation \(\succsim ^{N}_{(3,1)}\) is not; (a) necessary preference relation \(\succsim ^{N}\); (b) weak necessary preference relation \(\succsim ^N_{3,1}\); (c) strong necessary preference relation \(\succsim ^N_{(1,3)}\); (d) necessary preference relation having the highest level of plausibility \(\succsim ^N_{(2,2)}\)

One can see that in the evaluation table there are either crisp evaluations or 3-point intervals. In order to apply our method we need to consider them all as 3-point intervals. For 3-point intervals we have nested subintervals, \(\left[ g^{1}(a),g^{3}(a)\right] \) and \(\left[ g^{2}(a),g^{2}(a)\right] \), with the corresponding levels of plausibility \(L_{1}\) and \(L_{2}\), where \(L_{1}<L_{2}\). Remark that a crisp evaluation g(a) is, formally, a 1-point interval and it can be represented by the 3-point interval \(\left[ g(a),g(a),g(a)\right] \) with equal subintervals \(\left[ g(a),g(a)\right] \), \(\left[ g(a),g(a)\right] \). In our example this would mean that the evaluation of student \(\mathbf {A}\) on Mat is represented by the 3-point interval \(\left[ M,M,M\right] \). As to the levels of plausibility, we have to distinguish between 3-point intervals with three different evaluations and 3-point intervals with two different evaluations only. On one hand, \(Mat(\mathbf {F})=[VB,M,G]\) means that the evaluation of \(\mathbf {F}\) with respect to Mat belongs to the interval \(\left[ VB,G\right] \) with plausibility \(L_{1}\), and it can be equal to M with level of plausibility \(L_2\). On the other hand, \(Com(\mathbf {C})=\left[ M,M,G\right] \) and \(Com(\mathbf {D})=\left[ M,G,G\right] \) mean that the evaluation of \(\mathbf {C}\) and \(\mathbf {D}\) on Com belongs to the interval \(\left[ M,G\right] \) with level of plausibility \(L_{1}\), but the level of plausibility \(L_{2}\) is assigned to the evaluation M for student \(\mathbf {C}\) and to the evaluation G for student \(\mathbf {D}\).

The only preference relation that stems from the evaluations of the students is the normal dominance relation shown in Fig. 4a. Therefore, taking into account all different indicators, one can observe that five of the nine students dominate \(\mathbf {L}\). Referring to the weak and strong dominance relations shown in Fig. 4b, c, the first impression about the bad quality of \(\mathbf {L}\) is confirmed. Indeed, considering the best evaluations of \(\mathbf {L}\) and the worst evaluations for all others students, \(\mathbf {L}\) weakly dominates \(\mathbf {F}\) only (see Fig. 4b), while four students strongly dominate \(\mathbf {L}\) (see Fig. 4c). Since two different levels of plausibility are considered in case of 3-point intervals, it is interesting to see what happens when the evaluations have the highest level of plausibility for all students. This result is shown in Fig. 4d. One can observe that \(\mathbf {L}\) is dominated there by all other students but \(\mathbf {F}\), and that \(\mathbf {D}\) and \(\mathbf {H}\) dominate four other students each. Remark that due to point 2 of Proposition 4.1, the strong dominance relation \(\varDelta ^{(1,3)}\) and the dominance relation \(\varDelta ^{(2,2)}\) shown in Fig. 4c, d are transitive, while the weak dominance relation shown in Fig. 4b is not transitive.

In order to get a more conclusive recommendation, the dean has expressed her preferences through three pieces of the following preference information:

  • “student \(\mathbf {M}\) is preferred to student \(\mathbf {D}\)”, which is translated into the constraint \(U(\mathbf {M})>U(\mathbf {D})\);

  • “Student \(\mathbf {M}\) is preferred to student \(\mathbf {I}\) more than student \(\mathbf {C}\) is preferred to student \(\mathbf {H}\)”, which is translated into the constraint \(U(\mathbf {M})-U(\mathbf {I})>U(\mathbf {C})-U(\mathbf {H})\);

  • “Student \(\mathbf {C}\) and student \(\mathbf {M}\) are indifferent”, which is translated into the constraint \(U(\mathbf {C})=U(\mathbf {M})\).

Taking into account this preference information, we compute the necessary preference relation \(\succsim ^{N}\), the weak necessary preference relation \(\succsim _{(3,1)}^{N}\), the strong necessary preference relation \(\succsim ^{N}_{(1,3)}\) and the necessary preference relation \(\succsim ^{N}_{(2,2)}\) obtained for evaluations with the highest level of plausibility. These necessary preference relations are shown in Fig. 5.

One can see that the preference information provided by the dean enriched considerably the dominance preference relations. Without any preference information, \(\mathbf {A}\) dominated students \(\mathbf {L}\) and \(\mathbf {M}\) only, while after the preference information was added, \(\mathbf {A}\) is necessarily preferred to eight other students and neither necessarily nor weakly necessarily preferred only to \(\mathbf {E}\) (Fig. 5a, b). Looking at Fig. 5c, d, one can observe two interesting things: \(\mathbf {A}\) is strongly necessarily preferred to the largest number (3) of students, and \(\mathbf {H}\) is necessarily preferred to the largest number of students (5) considering the necessary preference relation computed for evaluations with the highest level of plausibility. This means that even taking into account the worst possible evaluations, \(\mathbf {A}\) is a good student, however for evaluations with the highest level of plausibility, \(\mathbf {H}\) is better than \(\mathbf {A}\).

The above discussion shows that the proposed method permits to get a useful insight into the multiple criteria choice problem that became more complicated by the fact of imprecise evaluations of alternatives. Based on minimum available information about the range and tipicality of imprecise evaluations, and using few examples of exhibited preferences, the method sheds light on the preference relations in the set of alternatives and facilitates a conscious decision making.

8 Conclusions

In this paper, we dealt with one of the most important issues of multiple criteria decision aiding (MCDA), that is the imprecise evaluations of alternatives. The possible sources of this imprecision are, for example, lack of data, imprecise measurement or intangible criteria. Many authors have studied different types of imprecision regarding weights of criteria, utility functions or probabilities about the different states of the world. In our approach, we are supposing that evaluations of the alternatives with respect to the different criteria can be imprecise and expressed by n-point intervals. These intervals are characterized not only by the largest interval of possible evaluations, but also by its subintervals sequentially nested one in another. To each of these subintervals is associated an increasing level of plausibility such that the plausibility of a subinterval is not lower than the plausibility of the subinterval containing it. Due to this way of representing the imprecision, our approach permits fine modelling of imprecise multiple criteria evaluations, taking into account a whole spectrum of attitudes ranging from an extremely pessimistic one to an extremely optimistic one in the evaluations. Moreover, differently from other ways of dealing with imprecision, such as stochastic multiobjective acceptability analysis (SMAA) or fuzzy numbers, n-point intervals take into account only the most stable, robust and meaningful information carried by imprecise evaluations. n-point intervals can be applied to any MCDA method but, in this paper, we focused on additive value functions and, in order to take into account the whole set of value functions compatible with the preference information provided by the DM, we adapted robust ordinal regression (ROR). In result of applying ROR, one gets necessary and possible preference relations for all realizations of the imprecise evaluations.

The methodology proposed in this paper follows the constructivist approach (Roy 1993). This means that MCDA methods do not assume that there pre-exist some preference system in the DM’s mind that need to be discovered, but the DM’s preferences have to be built step by step in the course of an interaction between the DM and the analyst responsible for mathematical modeling. In other words, MCDA methods should be seen as tools for going deeper into the decision problem, for exploring various possibilities, interpreting them, debating and arguing, rather than tools able to make the decision. As a consequence, the performances of the MCDA methods cannot be tested on some benchmarks, like, for example, machine learning methods (Corrente et al. 2013). Instead, the MCDA methods are acceptable if they possess some practical and theoretical properties judged as desirable in the actual decision context (Keeney and Raiffa 1993; Roy and Słowiński 2013). Unfortunately, many researchers are tempted to compare different MCDA methods by basing their conclusions on comparison of end results obtained by these methods. As argued in Roy and Słowiński (2013) such a comparison is ill-founded.

The presented methodology can be extended in several directions that are shortlisted below:

  • consideration of preference models in the form of outranking relations instead of value functions (Roy 1996);

  • consideration of the decision rule preference model composed of “if..., then...” decision rules induced from the DM’s preference information structured by dominance-based rough set approach (Greco et al. 2001; Słowiński et al. 2009);

  • consideration of the hierarchy of criteria using the multiple criteria hierarchy process (Corrente et al. 2012).