1 Introduction

The Human Development Index (HDI) is a summary measure that reflects average achievements in three basic dimensions of human development: a long and healthy life, access to knowledge, and a decent standard of living. Since 1990 the HDI has been widely used as an indicator of human well-being and progress closely related to the idea of human capabilities proposed by Sen (1985), and in broader terms than exclusively income-based progress. Despite its simplicity and limitations, the HDI has aroused great interest among researchers, practitioners and policy makers. For more than 20 years, the HDI has permitted evaluating the well-being of citizens from the perspective of human development across countries and over time, and has helped recommend policies which might lead to improve the lives of people and to enhance their choices and capabilities throughout the world, particularly in developing countries.

At the same time, since its launch, the HDI has generated in the academic field an extensive literature providing numerous critiques and proposing a number of potential improvements. The critiques have mainly focused on two major areas: (1) how to define human development and how to observe and measure its components, concentrating on topics such as the choice of indicators to reflect the human development idea accurately, and the intrinsic high correlation of all the HDI components with each other as well as with the HDI; (2) how to aggregate the different indicators to obtain a commonly acceptable composite index, centered mainly on the functional form of the HDI and aspects such as the substitutability assumptions, the normalization of indicators, the asymmetric treatment of income, and the choice of weights.Footnote 1

In 2010, coinciding with the twentieth anniversary of the first global Human Development Report (HDR), the HDI was revised and several major changes were introduced in UNDP (2010). Though this is not the first time that the HDI was modified, it was the first time that major changes were simultaneously made to its components and to the functional form used. While life expectancy at birth was maintained as the indicator of a long and healthy life, the indicators used to measure progress in knowledge and standard of living were modified. In the knowledge dimension, mean years of schooling replaced literacy and gross enrolment was recast as expected years of schooling. To measure the standard of living, gross national income (GNI) per capita replaced gross domestic product (GDP) per capita. In addition, in each dimension the maximum values were shifted to the observed maximum, rather than having a predefined cut-off beyond which achievements were ignored. In relation to the functional form, one key modification was the replacement of the arithmetic mean of country-level attainments in the three basic dimensions for the geometric mean as the aggregation formula. In fact, the main reason given for introducing the new HDI was to avoid the past assumption of perfect substitutability between the HDI components.Footnote 2

Many of the problems pointed out by critics were tackled with the changes made to the manner in which the new HDI is calculated, although some authors consider that serious drawbacks still persist, such as some flaws in the design of the index from a purely theoretical viewpoint, the troubling tradeoffs across core dimensions of human development, and the excessively static nature of the HDI (see Ravallion 2010, 2011, 2012; Klugman et al. 2011a, b; Chakravarty 2011; Tofallis 2012; Herrero et al. 2012; Bilbao-Ubillos 2013; among others).

In general terms, many real-life problems involve tackling several criteria which are in conflict. In these problems both the criteria and the constraints that determine the feasible set of alternatives can be expressed mathematically by functions. Many methods have been developed over time. Reference point-based methods constitute an important group of techniques in this field, whose popularity and usefulness are unquestionable nowadays (see e.g. Miettinen 1999; Luque et al. 2009; and references therein). In this context, the decision maker can provide a reference value for each objective function that is considered acceptable. The important mathematical properties of these techniques—each solution obtained is efficient, and it is possible to generate every efficient solution, even the so-called non-supported solutions—have led to new proposals for more flexible models. Concretely, Wierzbicki et al. (2000) proposes the idea of the double reference point scheme, where, for each criterion, two values are introduced: an aspiration value, a level that is desirable to reach; and a reservation value, a level under which the objective function is not considered acceptable. These levels are frequently set by experts according to objective criteria, or, in their absence, in relative terms following statistical criteria. Recently, Bilbao-Terol et al. (2014) propose, among other things, using the TOPSIS methodology (Hwang and Yoon 1981) to measure the HDI. TOPSIS is also a multi-criteria technique used to rank a finite set of alternatives (countries in our case), based on the minimization of Euclidean distance from an ideal point (vector with the maximum values) and the maximization of Euclidean distance from a nadir point (vector with the minimum values in the efficient set). Ideal and nadir points provide maximum and minimum values, respectively, of the objective functions for all the efficient solutions, that is, non-dominated solutions.

In accordance with the human development paradigm, this paper introduces the multi-criteria approach in the debate about the revision of the HDI. This new methodological perspective proposes alternative normalization and aggregation formulas for the HDI, based on the double reference point methodology. In particular, for each component, the value of each country is normalized in the range [−1, 2] by means of two reference values (aspiration and reservation values) using an achievement scalarizing function which is piecewise linear. In the absence of absolute values widely accepted by the international community that can be used as desirable and reservation thresholds for each HDI component, these levels are calculated in a relative manner, taking into account for each component the situation of any country with respect to others. Aggregating the new normalized values of the components (values of the achievement scalarizing functions), we calculate three indices with different degrees of substitutability: (1) a weak index that allows full compensation between the components; (2) a strong index that measures the state of the worst component and allows no compensation; (3) and a mixed index that is a linear combination of the first two.

This double reference point methodology goes beyond the concepts of maximum and minimum value of the indicators, and introduces a new perspective that is more in line with the human development paradigm and the capability approach. Concretely, this methodology introduces desirable and acceptable thresholds to be achieved by each component, and allows us to assess the progress in the measurement of human development depending on whether the value of the component is below the reservation level, between the reservation and aspiration levels, or above the aspiration level. This more flexible normalization allows determining s which values of the different indicators are equivalent.Footnote 3 In addition, the variety of indices proposed (strong, weak and mixed) offers the possibility of choosing the desired degree of substitutability. The minimization of Euclidean distance to the ideal point proposed in Bilbao-Terol et al. (2014) is in accordance with our weak index (if instead of using the Euclidean distance we use the distance L1, both calculations would be equivalent) and the maximization of Euclidean distance to the nadir point proposed is in line with our strong index (a bad value for an indicator implies a smaller distance to the nadir point and a bad value of the strong index).

We apply the new indices proposed to 187 countries worldwide. They are calculated under the self-imposed restriction of using the data in the 2011 HDR in order to ensure the homogeneity and availability of all the values and to be able to make comparisons.

The remainder of the paper is as follows. Section 2 briefly presents some notes on the calculation of the HDI. Section 3 states our methodological approach. Section 4 displays the results for data from 2011 HDR. The final section presents our conclusions.

2 Calculating the HDI

The HDI is a summary composite index that measures a country’s average achievements in the three core dimensions of human developmentFootnote 4: (1) A long and healthy life, by using life expectancy at birth (LE) as an indicator (this is the only component that was not changed in 2010); (2) access to knowledge, measured as mean years of schooling (MS) and expected years of schooling (ES), the latter defined as the years of schooling that a child can expect to receive given current enrolment rates. These indicators have replaced literacy and gross enrolment rate. Both new indicators are summarized by using the geometric mean (S); and (3) a decent standard of living, measured as the natural log of GNI per capita at purchasing-power parity (PPP) (Y). As stated above, GNI has replaced GDP, also at PPP and logged.Footnote 5

Following UNDP (2011), there are two steps to calculate the HDI. Firstly, the dimension indices are created. To this end, minimum and maximum values (goalposts) are set in order to transform the indicators into indices between 0 and 1. The maximums are the highest observed values in the time series (1980–2011) and the minimum values can be appropriately conceived as subsistence values. In particular, in the 2011 HDR the minimum values are set at 20 years for life expectancy, at 0 years for both education variables and at $100 for per capita GNI.Footnote 6

$$I_{LE} = \frac{{LE - LE^{\hbox{min} } }}{{LE^{\hbox{max} } - LE^{\hbox{min} } }}$$
(1)
$$I_{S} = \frac{{S - S^{\hbox{min} } }}{{S^{\hbox{max} } - S^{\hbox{min} } }}$$
(2)
$$I_{Y} = \frac{{\ln Y - \ln Y^{\hbox{min} } }}{{\ln Y^{\hbox{max} } - \ln Y^{\hbox{min} } }}$$
(3)

Having defined the minimum and maximum values and calculated the normalized dimension indices in the zero to one range, these are aggregated to produce the HDI as the geometric mean of the three dimension indices, instead of the arithmetic mean considered in the old aggregation formula. This way, in a multiplicative setting, the weights are applied by raising each variable to a power. Equal weights continue to be taken.Footnote 7 Thus, the HDI is calculated through the geometric mean of normalized indices measuring achievements in each core dimension.

$${\text{HDI}} = I_{LE}^{1/3} \cdot I_{S}^{1/3} \cdot I_{Y}^{1/3}$$
(4)

As can be inferred from the foregoing, the trade-offs between components is not so troubling in this way, even though the implementation of the new functional form continues to involve implicit trade-offs. In this context, Herrero et al. (2012) highlight that although the choice of the geometric mean is certainly an important improvement, UNDP (2010) does not provide any theoretical justification of the new aggregation method. In order to justify the choice of the aggregation formula, they suggest an elementary characterization of the geometric mean following the axiomatic method, and identify the proposed indicator with a unique set of intuitive properties: neutrality, scale, and ratio consistency.

Although it is obvious that any composite index of this sort will entail potentially troubling trade-offs, as Ravallion (2010, 2012) recognizes, he highlights that the new multiplicative form appears to generate highly problematic trade-offs from the standpoint of assessing human development. In particular, he shows that the new HDI has greatly reduced its implicit weight on longevity in poor countries, and the valuations of extra schooling as a whole seem high. Ravallion (2010, 2012) and Chakravarty (2011) agree that the troubling trade-offs found in the new HDI could have been avoided to a large extent by using an alternative aggregation function from the literature, namely the generalized form of the old HDI proposed by Chakravarty (2003).

In the literature there are other significant contributions proposing alternative approaches to measuring human development. For instance, Bilbao-Ubillos (2013) propose an index called ‘Composite, Dynamic Human Development Index’, in order to palliate limitations of the HDI. This index incorporates eight different components that are significant in the context of the current concept of human development (the three basic dimensions of human development already covered in the HDI, plus five other dimensions: inequality, poverty, gender situation, environmental sustainability, and personal safety) and a dynamic factor that distinguishes between countries on the basis of achievements attained. They adopt the simple additive weighting method to determine the weights subjectively and use the arithmetic mean for the aggregation of the eight components, without considering the problem of substitutability.

To the best of our knowledge, this is the first time that human development is treated by applying a multi-criteria approach based on the double reference point methodology, introducing a new perspective highly consistent with the human development paradigm. Likewise, the problem of substitutability between HDI components is tackled, offering a range of indices with different degrees of substitutability.

3 Methodology

3.1 Multi-criteria Approach

Many real life problems involve dealing with optimization problems, in which multiple objective functions are maximized or minimized simultaneously within a feasible set of solutions or alternatives. The general form of a multi-objective optimization problem (MOP) can be represented by:

$$\begin{array}{*{20}l} {\hbox{max}\,\,f\left( {\mathbf{x}} \right) = \left( {f_{1} \left( {\mathbf{x}} \right), \ldots ,f_{k} \left( {\mathbf{x}} \right)} \right)} \hfill \\ {s.t.:\quad \quad {\mathbf{x}} \in X } \hfill \\ \end{array}$$
(5)

where \(\varvec{x} = \left( {x_{1} , \ldots ,x_{n} } \right)^{T}\) is an n-dimensional vector of decision variables, \(X \subset {\mathbb{R}}^{n}\) is the feasible region, Z = f(X) is the feasible objective space, and \(\varvec{z} = f(\varvec{x})\) an objective vector where \(\varvec{z} \in Z\) if \(\varvec{x} \in X\) exists. The purpose is to simultaneously maximize all the k (k ≥ 2) objective functions. All the objective functions can be considered in the same sense (all maximizing or all minimizing), since minimizing an objective function is equivalent to maximizing the opposite one.

In multi-objective optimization, which generally lacks a feasible solution to simultaneously maximize all objective functions, there appears another concept of optimal where none of the components can be improved without deteriorating at least one of the others. A decision vector \(\varvec{x}^{{\prime }} \in X\) is called efficient or Pareto optimal of the problem MOP if there does not exist another \(\varvec{x} \in X\) such as \(f_{i} (\varvec{x}^{{\prime }})\,\le\,f_{i} (\varvec{x})\) for all i = 1, …, k and \(f_{j} \left( {\varvec{x}^{{\prime }} } \right) <\,f_{j} (\varvec{x})\) for at least one index j. In this case, \(\varvec{z}^{{\prime }} = f(\varvec{x}^{{\prime }} )\) is called nondominated objective vector. The efficient set is denoted by E and f(E) is the nondominated objective set. A decision vector \(\varvec{x}^{{\prime }} \in X\) is called weakly efficient or weakly Pareto optimal if there does not exist another \(\varvec{x} \in X\) such as \(f_{i} (\varvec{x}^{{\prime }} )\,<\,f_{i} (\varvec{x})\) for all i = 1, …, k. The corresponding objective vectors are called weakly nondominated objective vectors. Note that the set of efficient solutions is a subset of weakly efficient solutions.

Since the set of non-dominated objective vectors contains more than one vector, it is useful to know the bounds for the objective vectors in the non-dominated set. Upper bounds are given by the ideal values \({\mathbf{z}}^{\varvec{*}} = \left( {z_{1}^{*} , \ldots ,z_{k}^{*} } \right)\), easily obtained by maximizing each objective function separately \(z_{i}^{*} = \mathop {\hbox{max}}\limits_{{{\mathbf{x}} \in E}}\,f_{i} ({\mathbf{x}}) = \mathop {\hbox{max} }\limits_{{{\mathbf{x}} \in X}} f_{i} ({\mathbf{x}})\) for all i = 1, …, k. However, nadir vector \({\mathbf{z}}^{nad} = \left( {z_{1}^{nad} , \ldots ,z_{k}^{nad} } \right)\), which gives lower bounds for the objective vectors in the non-dominated set (\(z_{i}^{nad} = \mathop {\hbox{min} }\limits_{{{\mathbf{x}} \in E}}\,f_{i} ({\mathbf{x}})\) for all \({\text{i}} = 1, \ldots ,{\text{k}}\)), is usually difficult to obtain (see Miettinen 1999 and references therein).

A very common way to express preferences about the efficient solutions is given by the so-called reference point \({\mathbf{q}} = \left( {q_{1} , \ldots ,q_{k} } \right)^{T}\), which consists of reference values for the objective functions. The multi-objective MOP problem and the reference point are combined in an achievement scalarizing function (ASF), which is optimized to generate (weakly) efficient solutions.

One of the most commonly used ASFs was proposed by Wierzbicki (1980):

$$s\left( {{\mathbf{q}}, f\left( {\mathbf{x}} \right),\mu } \right) = \mathop {\hbox{min} }\limits_{i = 1, \ldots ,k} \left\{ {\mu_{i} (f_{i} \left( {\mathbf{x}} \right) - q_{i} )} \right\} + \rho \mathop \sum \limits_{i = 1}^{k} (f_{i} \left( {\mathbf{x}} \right) - q_{i} )$$
(6)

which must be maximized in the feasible region:

$$\begin{array}{*{20}l} {\hbox{max}\,s \left( {{\mathbf{q}}, f\left( {\mathbf{x}} \right),\mu } \right)} \hfill \\ {s.t.:\quad {\mathbf{x}} \in X } \hfill \\ \end{array}$$
(7)

The parameter ρ > 0 is the so-called augmentation term, which must be a small value, and which assures the efficiency of the solutions generated. If the second term is not used, then only the weak efficiency of the solution is assured. The vector μ = (μ 1, …, μ k )T with μ i  > 0 for all \(i=1, \ldots, k\) is formed by the weights assigned to reach the reference values, which can range from a purely normalizing coefficient to a preferential parameter (Ruiz et al. 2009; Luque et al. 2009).

Another achievement scalarizing function (Wierzbicki et al. 2000), used in both continuous and discrete programming, normalizes the objective functions (or indicators in our case) in a very appropriate way, taking into account two types of values of reference for each objective function. This type of ASF, called the double reference point (aspiration and reservation values) scheme, is based on considering an aspiration value z a i for each objective function f i (it being desirable to reach that value) and a reservation value z r i (level under which the objective function is not considered acceptable). This type of methodology is also used in Luque et al. (2010) but for convex multiobjective problems.

In our case, let us consider the achievement scalarizing function:

$$s\left( {f\left( {\mathbf{x}} \right), {\mathbf{z}}^{a} , {\mathbf{z}}^{r} } \right) = \mathop {\hbox{min} }\limits_{i = 1, \ldots ,k} \left\{ {s_{i} \left( {f_{i} \left( {\mathbf{x}} \right), z_{i}^{a} , z_{i}^{r} } \right)} \right\} + \rho \mathop \sum \limits_{i = 1}^{k} s_{i} \left( {f_{i} \left( {\mathbf{x}} \right), z_{i}^{a} , z_{i}^{r} } \right)$$
(8)

where s i for all i = 1, …, k are the individual achievement scalarizing functions:

$$s_{i} \left( {f_{i} \left( {\mathbf{x}} \right), z_{i}^{a} , z_{i}^{r} } \right) = \left\{ {\begin{array}{*{20}l} {1 + \frac{{f_{i} \left( {\mathbf{x}} \right) - z_{i}^{a} }}{{z_{i}^{max} - z_{i}^{a} }}} \hfill & {{\text{if}}\quad z_{i}^{a} \le f_{i} \left( {\mathbf{x}} \right) \le z_{i}^{max} } \hfill \\ {\frac{{f_{i} \left( {\mathbf{x}} \right) - z_{i}^{r} }}{{z_{i}^{a} - z_{i}^{r} }}} \hfill & {{\text{if}}\quad z_{i}^{r} \le f_{i} \left( {\mathbf{x}} \right) \le z_{i}^{a} } \hfill \\ {\frac{{f_{i} \left( {\mathbf{x}} \right) - z_{i}^{r} }}{{z_{i}^{r} - z_{i}^{min} }}} \hfill & {{\text{if}}\quad z_{i}^{min} \le f_{i} \left( {\mathbf{x}} \right) \le z_{i}^{r} } \hfill \\ \end{array} } \right.$$
(9)

The values z max i and z min i are upper and lower bounds for each objective function f i in the feasible region or even in the efficient set, if possible. z max i  = \(z_{i}^{*}\) and, z min i  = z nad i can be considered if available. Two parameters of the original formulation have been considered equal to 1. For more details about this ASF, see Wierzbicki et al. (2000).

This kind of ASF allows scaling all indicators in the interval [−1, 2], so that different interpretations are given on the basis of the aspiration and reservation values. Although in the continuous case the ASF function must be maximized in the feasible region, as mentioned previously, in the discrete case (our case) it allows us to establish a ranking for the different alternatives. For our purposes, the values of the objective functions \(f_{i} \left( {\mathbf{x}} \right)\) are substituted by the values of the indicators in the different alternatives (countries). It is convenient to emphasize that in our case we are not maximizing the alternatives (countries), but ranking them.

3.2 Application to Measure the HDI

Let us consider a total of N I indicators and N C the number of alternatives (countries). In our case, N I  = 3 (life expectancy at birth, combined education index, GNI per capita) and N C is the number of countries considered (N C  = 187). Let us denote by y ij (i = 1, …, N C and j = 1, …, N I ) the value of the country i and the indicator j. For each indicator it is necessary to determine whether it is of the type “more is better” (equivalent to maximizing in the continuous case) or “less is better” (equivalent to minimizing in the continuous case); in our case, the three are of type “more is better”.

For each indicator j, we have to calculate the maximum and minimum values:

$$y_{j}^{max} \mathop { = \hbox{max} }\limits_{{i = 1, \ldots ,N_{C} }} y_{ij}$$
(10)
$$y_{j}^{min} = \mathop {\hbox{min} }\limits_{{i = 1, \ldots ,N_{C} }} y_{ij}$$
(11)

However, these values can be modified by other values considered more appropriate.

The values of the aspiration and reservation levels, denoted by y a j and y r j respectively, are key to interpreting and analyzing the results. In the next section, we will explain which values are considered in our study.

Taking into account all the previous values calculated for each indicator j, let us consider the value given by the individual achievement scalarizing function in each alternative i (country):

$$z_{ij} = s_{j} \left( {y_{ij} , y_{j}^{a} , y_{j}^{r} } \right) = \left\{ {\begin{array}{*{20}l} {1 + \frac{{y_{ij} - y_{j}^{a} }}{{y_{j}^{max} - y_{j}^{a} }}} \hfill & {{\text{if}}\quad y_{j}^{a} \le y_{ij} \le y_{j}^{max} } \hfill \\ {\frac{{y_{ij} - y_{j}^{r} }}{{y_{j}^{a} - y_{j}^{r} }}} \hfill & {{\text{if}}\quad y_{j}^{r} \le y_{ij} \le y_{j}^{a} } \hfill \\ {\frac{{y_{ij} - y_{j}^{r} }}{{y_{j}^{r} - y_{j}^{min} }}} \hfill & {{\text{if}}\quad y_{j}^{min} \le y_{ij} \le y_{j}^{r} } \hfill \\ \end{array} } \right.$$
(12)

Given a country i and an indicator j, if z ij is between −1 and 0, it means that the value of the indicator for this country is under the reservation value; between 0 and 1, that it is between reservation and aspiration values; and between 1 and 2, that it is over the aspiration value.

For each country i, let us define the weak index (W i ) as the arithmetic mean of the N I values of the indicators and the strong index (S i ) as the minimum of all, that is, the worst one:

$$W_{i} = \frac{1}{{N_{I} }}\mathop \sum \limits_{j = 1}^{{N_{I} }} z_{ij}$$
(13)
$$S_{i} = \mathop {\hbox{min} }\limits_{{j = 1, \ldots ,N_{I} }} z_{ij}$$
(14)

While the weak index allows compensation among different indicators (substitutability), the strong index does not allow any compensation since it represents the worst value. In case we want to assign different weights to the indicators, let \(\omega_{j} \,{\text{with}}\,\,j = 1, \ldots ,N_{I}\) be the weight values, which have to be strictly positive (\(\omega_{j} > 0\quad \forall \,j = 1, \ldots ,N_{I} )\). The weak index is calculated directly:

$$W_{i} = \mathop \sum \limits_{j = 1}^{{N_{I} }} \bar{\omega }_{j} z_{ij}$$
(15)

where \(\bar{\omega }_{j}\) is the normalized weight (\(\bar{\omega }_{j} = \frac{{\bar{\omega }_{j} }}{{\mathop \sum \nolimits_{k = 1}^{{N_{I} }} \bar{\omega }_{k} }} \quad \forall j = 1, \ldots ,N_{I} )\). However, for the strong index, it is necessary to make some changes to avoid unwanted effects. Specifically, let us consider the following weights normalized by its maximum value:

$$\bar{\omega }_{j} = \frac{{\bar{\omega }_{j} }}{{\mathop {\hbox{max} }\limits_{{k = 1, \ldots ,N_{I} }} \bar{\omega }_{k} }} \quad \forall j = 1, \ldots ,N_{I}$$
(16)

and for each country i, we define the following values:

$$\bar{z}_{ij} = z_{ij} - C_{i} \quad \forall j = 1, \ldots ,N_{I} \,{\text{with}}\,C_{i} = \left[ {\mathop {\hbox{min} }\limits_{{j = 1, \ldots ,N_{I} }} z_{ij} } \right] + 1$$
(17)

where [ ] is the integer part of a real number. Then, the strong index is given by:

$$S_{i} = C_{i} + \mathop {\hbox{min} }\limits_{{j = 1, \ldots ,N_{I} }} \bar{\omega }_{j} \bar{z}_{ij}$$
(18)

The strong index indicates that if its value is below 0, at least one indicator is under 0 (at least one indicator does not reach its corresponding reservation value). If the strong index is above 1, it means that all the indicators improve their corresponding aspiration values.

As a combination of both we propose a mixed indicator (MI i ), which is a linear combination of the previous ones:

$$MI_{i} = \lambda W_{i} + (1 - \lambda )S_{i} \,\,{\text{with}}\,\,0 < \lambda < 1$$
(19)

and reflects an intermediate state between total substitutability (weak index) and no substitutability (strong index).

3.3 Calculation of Aspiration and Reservation Levels

As mentioned previously, in order to apply the proposed normalization, an aspiration level and reservation level have to be defined for each component. Let us recall that a component’s level of aspiration is the desirable level to be achieved by said component, whereas the level of reservation is the value below which all values are considered unacceptable (Table 1).

Table 1 Goalposts for the Human Development Index in 2011 HDR

In our methodological proposal, these levels are key for normalization. As far as we are concerned, these values should be defined objectively in an absolute manner. However, at present there are no values widely accepted by the international community that can be used as desirable and reservation thresholds for each HDI component. This issue, which is beyond the scope of this paper, clearly constitutes a challenging and promising extension of this study. Thus, in this paper the aspiration and reservation levels have been calculated in a relative manner, considering the situation of a country with respect to the others for each component.Footnote 8 In particular, we use two different statistical criteria to calculate these values:

  1. 1.

    Criterion I: Weighted mean of first and third group countries. The UNDP (2011) classifies countries as Very High Human Development, High Human Development, Medium Human Development and Low Human Development. To do so, it divides the countries listed according to their HDI level into 4 equal parts. Similarly, we ranked countries according to the values of the respective components, taking as level of aspiration the corresponding mean values weighted by population of the countries with Very High levels for the component in question. On the other hand, as level of reservation we used the mean weighted values of the group of countries with Medium levels for the respective components. The figures for each component are shown in Table 2.

    Table 2 Maximum and minimum and reference values
  2. 2.

    Criterion II: First and third quartile. The second criterion takes as level of aspiration the first quartile (value below which 75 % of the countries -127 countries- appear for each component), according to the order of the list of countries mentioned above for each component. We consider the third quartile as the reservation value; in other words, the value below which 25 % of the countries -42 countries- appear for each component. The figures are shown in Table 2.

Both criteria give us the relative position of a country with respect to the others. A priori no criterion is preferred to any otheri, and we work with both in parallel.

4 Results

4.1 Calculation of Normalized Components

In addition to the aspiration and reservation values, for the normalization of the components, we need a maximum and a minimum value for each indicator. For our calculations, we consider the maximum and minimum goalposts for each indicator used to calculate the official HDI. The respective maximum and minimum values for each component are shown in Table 2.

After calculating the necessary parameters, we obtained the normalized components by applying Eq. (12). Since we are working with two criteria to calculate aspiration and reservation values, we obviously obtained two different results. In Table 3 we show the components normalized for a selection of countriesFootnote 9 according to reference values calculated by means of criterion I and criterion II.

Table 3 Normalized components

Table 3 shows, for example, that Japan’s normalized life expectancy value is 2. This means that its value for this component coincides with the maximum considered. Indeed, let us recall that in this case the maximum considered for this component was taken from the value registered in Japan in 2011 (see Table 1). In respect to the other results, normalized values above 1 mean that countries are above the aspiration level for that component, whereas values below 0 indicate that they are below the reservation value. If we look at Table 3—Criterion I, we can see, for example, that the United States is above the level of aspiration in attainments in education and income, although in respect to life expectancy it is below said level, indicating a very high, though slightly imbalanced, level of human development. Nigeria, on the other hand, shows values below the level of reservation for all the components in both tables, thus indicating low but relatively balanced levels of human development in the three core dimensions.

4.2 Calculation of Strong, Weak and Mixed Indices

Let us calculate the weak and strong indices using the Eqs. (1318). Although our methodology allows establishing different weights for each component, in this paper we have considered the same weightsFootnote 10 for all components, in line with the position followed by the UNDP in calculating the official HDI. To calculate the mixed index with the Eq. (19), we use a value of λ = 0.5.

We show the values of these indices for the group of countries selected using values of reference I and II (Table 4), denoted as DRP-WI (Double Reference Point—Weak Index), DRP-SI (Double Reference Point—Strong Index) and DRP-MI (Double Reference Point—Mixed Index).

Table 4 DRP-WI, DRP-SI and DRP-MI

Table 4 shows that the DRP-SI -which measures the state of the worst component and allows no compensation between components- is much stricter than the DRP-WI -which allows compensation among components-, and is lower in all cases. The DRP-MI, in turn, always presents values between the other two indices.

4.3 Rankings and Differences in Respect to HDI Rank

If we take into account country rankings for each new index calculated (DRP-WI, DRP-SI, DRP-MI), we should analyze the consistency of those rankings with HDI country rankings, and the differences in the number of positions between said rankings for each country. The results for the group of countries selected obtained using criteria I and II appear in Table 5 (the position in the corresponding ranking appears in parenthesis). To be more precise, the HDI values (ranks in parentheses) are displayed in the second column; the weak, strong and mixed indices in the third, fourth and fifth columns, respectively, whilst the differences of rank between the official HDI and our indices are shown in the last three columns. The new indices are calculated using both criteria I and II.

Table 5 Rankings and differences

In general, the correlation between HDI’s ranking of countries and the rankings according to DRP-WI, DRP-SI and DRP-MI is obviously quite high, in particular if we bear in mind that we are only working with three components and are analyzing a relatively large sample (N = 187). Specifically, using criterion I for the reference values, Spearman’s correlation coefficients rho (ρ) are 0.997, 0.976 and 0.994, respectively, and using criterion II, they are 0.997, 0.974 and 0.992. On the other hand, Kendall’s correlation coefficients tau (τ) are 0.958, 0.874 and 0.939 (criterion I) and 0.958, 0.867 and 0.93 (criterion II).

For example, Table 5—Criterion I shows how the United States descends 13 positions in the DRP-SI country ranking as compared to the HDI rank, influenced by its worse relative situation in terms of life expectancy; whereas Nigeria, with less inequality between components than other countries with similar levels of human development, rises 12 positions. Likewise, the cases of the Russian Federation and Indonesia are interesting. With respect to both criteria, the former loses 20 and 29 positions respectively in the DRP-SI ranking as compared to the HDI ranking, also affected by its relatively low attainments in health, whereas the latter rises 16 and 17 positions.Footnote 11

In order to facilitate the analysis of the results, the differences between the HDI and DRP-WI and DRP-WI country ranks have been represented graphically (Fig. 1). As can be seen, the distribution in Fig. 1a, b, on one hand, and Fig. 1c, d, on the other, is similar, with no significant differences in respect to country rankings between the two criteria used to determine the values of reference.Footnote 12 In this sense, in order to avoid reiterations and since that the correlation between the results of both criteria is very high, hereafter we focus our analysis on the results obtained with the values of reference calculated using criterion I (the results for the other criterion may be obtained from the authors upon request).

Fig. 1
figure 1

a Differences between HDI rank and DRP-WI rank (I). b Differences between HDI rank and DRP-WI rank (II). c Differences between HDI rank and DRP-SI rank (I). d Differences between HDI rank and DRP-SI rank (II)

In respect to the differences in country positions between the HDI and DRP-WI rankings, it should be noted that the maximum variation of positions is (+13), whereas it reaches (−60) if we compare HDI and DRP-SI rankings. Furthermore, in the first case 80 % of countries change positions in the range of [−5, 5], whereas in the second case only 40 % of countries are in that range. This clearly shows that the difference of positions of the HDI ranking in respect to DRP-SI’s are greater than in respect to DRP-WI’s, which seems to confirm that the valuations of the level of human development stemming from the official HDI are more similar to the values of our weak index than to those of our strong index.

4.4 Analyzing Differences Between DRP-WI Rank and DRP-SI Rank

In order to compare a composite index with total substitutability and one that does not allow any substitutability between components, the country ranks resulting from DRP-WI and DRP-SI should also be compared. Figure 2 shows the differences in country rank positions between the alternative indicators using reference values I.

Fig. 2
figure 2

Differences between DRP-WI rank and DRP-SI rank (I)

The graph shows, first of all, that 6 countries maintain their position in both rankings: Eritrea, Iran, Lesotho, Mali, Mauritania and Norway (with first place in the HDI, it maintains that position in all the indices calculated). Likewise, note should be taken of the countries which lose more than 20 positions with DRP-SI as compared to DRP-WI, and in some cases up to 71 positions (left side of Fig. 2). These countries are Qatar, Kuwait, Cuba, Oman, Bhutan, Botswana, South Africa, Andorra, Kazakhstan and Georgia, all having very imbalanced levels in respect to health, education and income achievements. On the other hand, the maximum number of positions a country has gained with the DRP-SI is 28, the following four having improved more than 20 positions: Saint Vincent and the Grenadines, Indonesia, Azerbaijan and Kiribati (right side of Fig. 2). For these countries, the three components show highly similar levels, meaning that their human development is very balanced.

In order to assess the degree of imbalance amongst components of countries on the basis of their level of income, Fig. 3 relates the difference of positions in the DRP-WI and DRP-SI rankings to the level of the countries’ GNI per capita. In general, differences of rank increase as country income level rises, both in positive and negative terms, this difference tending to be solely negative in countries with a high level of income. In this sense, 3 areas of the graph are noteworthy: the upper area, where we find positive differences of more than 20 positions which begin at a level that corresponds to a GNI per capita of $3,140 (constant 2005 PPP$), and which includes Kiribati, Indonesia, Azerbaijan and Saint Vincent and the Grenadines; a lower middle area, where we find the first negative differences of more than 20 positions, which occur starting at a GNI per capita of $4,780 (constant 2005 PPP$), with Georgia, Bhutan, Cuba, South Africa, Kazakhstan, Botswana, Oman and Andorra; and lastly, a striking lower area to the right, where starting at a level of GNI per capita of $39,924 (constant 2005 PPP$) all the countries (except Norway, with a difference of 0) show negative rank differences, namely Switzerland, United States, Hong Kong, China (SAR), Brunei Darussalam, Norway, Kuwait, Luxembourg, Singapore, United Arab Emirates, Liechtenstein and Qatar.

Fig. 3
figure 3

Relation between level of income and DRP-WI rank—DRP-SI rank differences (I)

To underscore the imbalances existing between the different core dimensions in countries with negative rank differences of more than 20 positions between DRP-WI and DRP-SI, Table 6 shows the normalized components, highlighting in bold for each case the one which is in a worse situation. Countries are ordered according to their rank differences, also specifying the income group to which they belong pursuant to the World Bank classification (2013), which distinguishes 4 categories: low-income, lower-middle-income, upper-middle-income, and high-income.

Table 6 Differences DRP-WI rank—DRP-SI rank and normalized components

If we observe the situation of the different components in the countries with the greatest imbalance amongst core dimensions, note should be taken, first of all, of countries such as Qatar and Kuwait, which are well ranked by DRP-WI basically due to their high value of GNI per capita, descending 71 and 62 positions, respectively, in the DRP-SI ranking. In both cases, the other components do not reach the aspiration value, with Qatar very close to the reservation value in education and Kuwait even below said value. Other countries with significant imbalances where education is the component at the lowest level are Oman, Bhutan and Andorra. On the other hand, countries whose health component is in relatively more unfavorable situation include Botswana, South Africa and Kazakhstan. Note should also be taken of countries such as Cuba and Georgia, in which the results for the income component are clearly the worst.

In general terms, it is worth pointing out that out of 32 countries with negative differences of more than 10 positions, 29 are high-income (19) or upper middle-income (10) countries and 23 have a very high (11) or high (12) HDI. Meanwhile, out of 32 countries with positive differences of more than 10 positions, 27 are upper middle-income (16) and lower middle-income (11) countries, although they present diverse HDI levels (7 low, 8 middle, 13 high, 4 very high). Both groups of countries are located across different continents, without any particular pattern being detected in respect thereto.

5 Conclusions

The problem of trade-offs between components is present to a greater or lesser extent in the majority of composite indices. One of major critiques of the original HDI was the perfect substitutability between different dimensions of well-being. The new HDI has replaced the arithmetic mean with the geometric mean as the aggregation formula of country-level attainments in health, education and income, reaching a compromise between the extremes of perfect substitutability and no substitutability. However, some leading authors, such as Ravallion (2010, 2012), point out that there are still considerable troubling trade-offs that might involve inappropriate implications in terms of development policy.

This paper introduces a multi-criteria approach in the debate about the measurement of human development, providing two major innovations for the computation of the HDI: a new normalization more in line with the human development paradigm, and alternative aggregation formulas that provide a variety of composite indices with different degrees of substitutability. Concretely, we implement a multi-criteria approach based on the double reference point methodology (aspiration and reservation). In order to be able to compare, we use the same indicators as the new HDI and data from the 2011 HDR. Nevertheless, we take advantage of a different normalization: for each indicator the value of each country is normalized by means of two reference values (aspiration and reservation values) by using an achievement scalarizing function which is piecewise linear. In the absence of commonly agreed objective reference values, we use two relative reference values for each indicator, taking into account that the reservation value can be interpreted as a minimal level of acceptable achievement for an indicator across countries, and the aspiration value as a minimal level of desirable achievement.

As in the official HDI, each core dimension has been equally weighted, although our methodology would enable adopting any alternative weighting scheme. Aggregating the normalized values of the achievement scalarizing functions, we calculate three composite indicators: (1) a weak index that allows total substitutability; (2) a strong index that measures the state of the worst component and allows no substitutability; (3) and a mixed index that is a linear combination of the first two and would allow different degrees of substitutability. In contrast with the conventional range [0, 1] between maximum and minimum values of the HDI, these indices range between −1 and 2, so that if their values are between −1 and 0, it means that the value of the indicator for this country is under the reservation value; between 0 and 1, that they are between reservation and aspiration values; and between 1 and 2, that they are over the aspiration value. This normalization could be moved to the range [0, 1] by means of a linear transformation and the results would be identical (the same ranks for the different countries). However, the interpretation of said results, taking into account whether the corresponding value is in [−1, 0], [0, 1] or [1, 2], would not be as intuitive.

As an application of this new approach, we have calculated these indices for 2011 and analyzed and compared the resulting country ranks with the official HDI. We observe that the results of the HDI are closer to the results of the weak index than to those of the strong index. On comparing the country ranks of the strong index with those of HDI and the weak index, we note that when a country moves down in the strong index ranking with respect to the HDI ranking or weak index ranking, it is due to imbalances between HDI dimensions; that is, at least one component is significantly worse than the others. The countries that have the biggest imbalances between components include Qatar, Kuwait, Cuba, Oman, Bhutan, Botswana, South Africa, Andorra, Kazakhstan and Georgia. They are all placed in the upper-middle- and high-income groups. On the contrary, when a country moves up in the strong index ranking, it is because its normalized values are very similar; that is, its human development is very balanced. This category includes, amongst others, Saint Vincent and the Grenadines, Indonesia, Azerbaijan and Kiribati, in addition to Norway, which maintains the first position with all the indices analyzed.

We should not forget that in 1990 the first global HDR already addressed the importance of seeking a balance in priorities across dimensions of human development (UNDP 1990). If every core dimension has the same significance in terms of human development, it should be desirable to achieve balanced development across dimensions. From the point of view of development policy, it seems rational that the worse the deprivation is in a particular core dimension, the more urgent should be the efforts to improve achievements in that dimension. Our proposal would strengthen Mahbub ul Haq’s point that the HDI ‘emphasizes sufficiency rather than satiety’ (ul Haq 1995, p. 49). Therefore, in addition to the progress of human development as a whole, it is worth taking into consideration the state and evolution of the weakest dimension for each country.

This new methodological perspective is more in line with the human development paradigm and the capability approach, opening the door to whole new possibilities for future research. In this context, beyond the concepts of maximum and minimum value of the indicators, objective aspiration and reservation values would enrich the debate on the HDI. The international community should tend to set objective aspiration and reservation values worldwide, not only according to an empirical justification, but also according to ethical and political justifications.