1 Introduction

GDP per capita has long been used as a proxy measure of well-being. However, it is now widely recognized that GDP data provide a partial perspective on the array of factors that affect people’s lives. The consensus on the deficiency of per capita income as a measure of well-being has led to a search for alternative measures of well-being. Research over the last two decades has considerably improved our understanding of them. The past research and unresolved issues are summarized in the recommendation of the Commission on the Measurement of Economic Performance and Social Progress, appointed by President Nicholas Sarkozy of France (Stiglitz et al. 2009).

The OECD has actively led the research seeking an alternative measure of well-being. On the occasion of the OECD’s 50th Anniversary, held under the theme ‘Better Policies for Better Lives’, the organization launched the OECD Better Life Initiative. Drawing upon the recommendations of the commission (Stiglitz et al. 2009), the OECD identified 11 dimensions as being essential to well-being. The dimensions cover material living conditions, such as income and wealth, as well as quality of life (QOL), such as community, environment and work–life balance. These dimensions are explored and analysed in detail in Organization for Economic Cooperation and Development (2011). The web-based tool Your Better Life Index, a key instrument of the OECD Better Life Initiative, profiles the 34 OECD member countries and two non-member countries across the 11 topics of well-being (known as OECD Better Life Index).Footnote 1 However, evaluating the overall well-being by summarizing the 11 individual indicators is left to users of the statistics.Footnote 2

Since each individual well-being indicator may trend in different directions, the set of 11 individual indicators itself is not enough to compare overall well-being of people across countries. A composite indicator is formed by compiling the underlying individual indicators into a single number. Composite indicators can measure multidimensional concepts, which are characterized by multiple individual indicators. Thus, they are used to analyse a wide range of concepts such as competitiveness, single market integration, sustainability and well-being. A composite indicator of well-being based on the 11 individual indicators also enables a comparison of the overall well-being of people across countries.

There exist many composite indicators and, more specifically, composite indicators of well-being. Despite their increasing use, identifying the ideal composite indicators remains controversial. Much criticism associated with indicator construction stems from the fact that the weights assigned to individual indicators are often fixed, and that individual indicators of all countries are aggregated under these fixed weights (Cherchye et al. 2007). Weight plays a crucial role in the construction of a composite indicator; it determines the trade-offs between underlying individual indicators. Moreover, by assigning zero to the other individual indicators, it selects the set of individual indicators actually incorporated. Thus, weights used to construct the composite indicator of well-being reflect a particular value judgement on how ‘good’ a life is. Schokkaert (2007) reports much interpersonal variation in people’s opinions on the ‘good life’. Therefore, no matter how elaborately weights are chosen, people across countries, and even within a society, are very likely to disagree on them. In the context of international comparison of well-being, certain countries (i.e. their authorities or experts) are likely to consider the predetermined fixed weights as unfair because undue importance is attached to individual indicators on which they underperform.Footnote 3

Among a number of construction techniques of the composite indicator, the ‘benefit of the doubt’ approach (BOD), which has received increasing attention from researchers, avoids subjectivity in the determination of weights (Mahlberg and Obersteiner 2001; Cherchye et al. 2004, 2007, Despotis 2005a, b and Organization for Economic Cooperation and Development 2008). Under BOD, the weights are country-specific and endogenously determined such that they maximize the value of each country’s resulting composite indicator. Thus, larger weights are given to the individual indicators (topics of well-being) on which each country performs well. The core idea is that a good relative score of a country on an individual indicator shows that it considers the individual indicator as relatively important. Therefore, a country cannot attribute the lower score of its composite indicator to a harmful or unfair weighting scheme under the international comparison based on BOD.

BOD is rooted in Data envelopment analysis (DEA), which is designed to compute efficiency indexes. DEA is an established technique to measure the relative efficiency of decision-making units based on inputs and outputs of units in a sample. It measures the efficiency of each unit by its distance from the production frontier, which is represented by the best practice units. BOD is formally tantamount to the input-oriented DEA model, with all individual indicators as outputs and a ‘dummy input’ equal to one for all countries.Footnote 4 As Dasgupta (2001) and Arrow et al. (2004) advocate, it is widely believed that the productive base of economies, which consists of a variety of capital assets and a social infrastructure, determines the well-being of people. Thus, countries’ productive bases are reasonably considered as inputs to achieve greater well-being indicators. There is a wide difference in the productive bases among countries. Thus, BOD’s assumption that inputs are invariant across units is inappropriate when computing composite indicators of well-being.

The specification of inputs and outputs enables us to apply the original DEA to evaluating countries’ performances in terms of efficiency. Since DEA specifies the best practice countries more appropriately than does BOD, the frontier represented by them correctly describes the benchmark technology of providing well-being. We call this the ‘World Better Life Frontier’. This provides an unbiased estimate of efficiency, and thus an accurate measure of countries’ performances. The efficiency score on DEA is also the weighted sum of 11 individual indicators, and it defines the DEA composite indicator of well-being. In the present paper, we apply BOD and DEA to compute the composite indicator aggregating 11 individual well-being indicators. A series of studies by the World Bank (Kunte et al. 1998 and World Bank 1997, 2006, 2011) estimates the comprehensive wealth of many countries, which corresponds to their productive bases. Our application of DEA relies on the most recent of these studies (World Bank 2011).

Mahlberg and Obersteiner (2001) was the first study to apply BOD to the construction of a composite indicator of well-being. They revisited the construction procedure of the UN’s Human Development Index (HDI) (United Nations Development Program 1990). The existing HDI is a weighted sum of three individual indicators—income, longevity and educational attainment—using the fixed weight. Mahlberg and Obersteiner (2001) propose an alternative HDI based on BOD by endogenizing the weight assigned to the three individual indicators. However, as they admit, the application of BOD does not allow them to distinguish between resources, which differentiate countries’ conditions of providing human development or well-being.

Zaim et al. (2001) is an exception that differentiates countries’ inputs used for providing well-being to their people.Footnote 5 The production process they consider transforms two inputs (produced capital and labour) into one private good (real GDP) and four social goods (infant survival rate, life expectancy at birth, primary school enrolment rate and secondary school enrolment rate). Although Zaim et al. (2001) construct the frontier more appropriately than does BOD, the constructed frontier is not utilized to evaluate countries’ performance in terms of efficiency. Instead, it proposes a measure of QOL called ‘achievement index’, which aggregates only four social goods and is a quantity index formulated by the frontier. The quantity index is a theoretically well-founded aggregation tool that captures the radial distance between the social goods of a country and the frontier. However, the set of inputs and outputs presented by Zaim et al. (2001) is severely limited relative to the present paper.Footnote 6 Specifically, their index, which aggregates only social goods, fails to capture overall well-being because private goods such as income, wealth and dwellings also affect people’s well-being.

There are different approaches to comparing overall well-being across countries, which are also well-founded in economic theory. Fleurbaey and Gaulier (2009) and Jones and Klenow (2010) aggregate several dimensions of well-being by expressing them in monetary units.Footnote 7 The calculation of the former is based on an income equivalent, while that of the latter is based on a consumption equivalent. Both approaches impute the individual preferential attitude to several dimensions of well-being, explicitly using the expected utility function. While they consider several dimensions of well-being as goods being consumed and propose composite indicators (summary statistics in their words) based on consumption theory, we consider them as outputs and propose composite indicators based on production theory.

How would the scores and rankings of countries under our composite indicators based on BOD and DEA differ from those in the previously mentioned studies? Our indicators compare countries’ performance by incorporating more dimensions of well-being, including key dimensions such as environmental equality and health status. Thus, countries showing higher well-being in the dimensions previously missed would be evaluated higher. For example, while countries known for their high environmental quality, such as Luxembourg and the UK, rank relatively low among OECD countries under HDI, Mahlberg and Obersteiner (2001) and Zaim et al. (2001), they might be underestimated and would rank higher under the composite indicator based on BOD.Footnote 8 On the other hand, countries with lower scores in many dimensions of well-being are likely to be economically poor and have a limited productive base. Furthermore, since countries with a smaller productive base are considered to be at a disadvantage in providing well-being to their people, their performance would be more highly valued in terms of efficiency as compared with if inputs are not differentiated. Thus, countries ranked extremely low under HDI or BOD would be valued higher under DEA.

The paper unfolds as follows. Section 2 discusses two approaches to construct a composite indicator. Section 3 explains data of well-being indicators and comprehensive wealth. Section 4 computes composite indicators under different cases and compares them across countries. Section 5 concludes.

2 Methodology

The present paper aggregates 34 countries’ 11 individual well-being indicators into composite indicators and compares countries’ performance in terms of well-being. We adopt two approaches to construct composite indicators. Since they are sufficiently versatile to be applicable to a variety of problems and situations, we explain them below in a more general setting independent of the number of countries and underlying individual indicators.

We consider that there are K countries and that the well-being of people in a country k is characterized by a set of M individual indicators \(\varvec{y}_{k} = \left( {y_{1k} , \ldots ,y_{Mk} } \right),\) with \(y_{mk}\) representing the value of the mth individual indicator of country k. BOD aggregates these individual indicators using their weighted average. Denoting \(\mu_{mc}\) as the weight of the mth individual indicator of country c, the composite indicator based on BOD for country c, \(CI_{BOD,c} ,\) is formulated as follows:

$$CI_{BOD,c} = \mathop {\hbox{max} }\limits_{{\mu_{1c} , \ldots ,\mu_{Mc} }} \left\{ {\sum\limits_{m = 1}^{M} {\mu_{mc} y_{mc} :} \sum\limits_{m = 1}^{M} {\mu_{mc} y_{mk} } \le 1 {\quad\text{for }}k = 1, \ldots ,K;\mu_{mc} \ge 0\,{\quad\text{for }}m = 1, \ldots ,M} \right\}$$
(1)

For an international comparison, the above procedure is repeated for each country in our sample. The weight \(\left( {\mu_{1c} , \ldots ,\mu_{Mc} } \right)\) is endogenously determined to maximize the value of the composite indicator for country c. Thus, a larger weight is assigned to an individual indicator on which the country performs well. In this procedure, the good performance of country c on an individual indicator is considered to indicate that the country prioritizes it. Therefore, countries cannot argue that their poor performance is due to an unfair weighting scheme, because any weight other than that used for their evaluation would not improve their position.

There are two constraints in (1). The first is that every country in a sample has a resulting composite indicator smaller than one when applying the most favourable weights for the evaluated country c. This guarantees that the resulting composite indicator for country c will be below one. The second constraint limits the weights to be non-negative. This means that the composite indicator is a non-decreasing function of individual indicators. Thus, each individual indicator is necessarily formed so that the larger score reflects an improvement in a specific aspect.

As Mahlberg and Obersteiner (2001) graphically illustrate, an alternative interpretation of \(CI_{BOD,c}\) is possible. Considering individual indicators \(\varvec{y}\) as outputs and a dummy input equal to one for all countries, \(CI_{BOD,c}\) is considered as evaluating the performance of country c in terms of its efficiency. The dummy input can be considered as a helmsman in each country, which intended to provide the people a better life; this is reflected by the values of individual well-being indicators.Footnote 9

Countries’ input and output sample data enable us to construct the production set \(\varPsi_{BOD} ,\) which includes achievable individual well-being indicators, through the minimal extrapolation principle. There, \(\varPsi_{BOD}\) is constructed as the smallest subset of \({\mathbb{R}}_{ + }^{M}\) covering individual indicators \(\varvec{y}\) for all countries and satisfying free disposability and convexity.Footnote 10 The boundary of \(\varPsi_{BOD}\) is the production frontier and represents the optimal practices to produce well-being, which is the benchmark to evaluate countries’ performances. Countries with individual indicators on the frontier are considered the best performers and are ranked highest under BOD. The farther from the frontier and closer to the origin the individual indicators of a country are, the lower its performance is evaluated. Thus, the distance between individual indicators of a country and the frontier measures its performance. We can relate the composite indicator to the production frontier as follows:

$$CI_{BOD,c} = \hbox{min} \left\{ {\theta :\varvec{y}_{c} /\theta \in \varPsi_{BOD} } \right\}$$
(2)

The right hand side of (2) is the smallest proportional contraction of \(\varvec{y}_{c}\) to the best practice frontier, indicating the distance from \(\varvec{y}_{c}\) to the best practice frontier. For countries with individual indicators on the frontier, no contraction is necessary, and thus \(\theta\) equals one. For countries with individual indicators located farther below the frontier, individual indicators must be well expanded to reach the frontier, and thus \(\theta\) is less than one.

Figure 1 illustrates what \(CI_{BOD}\) measures in the case of two individual indicators—\(y_{1}\) and \(y_{2}\)—and four countries—A, B, C and D. Each dot represents the underlying individual well-being indicator of each country. Here the line connecting A, B and C constitutes the best practice frontier. A, B and C on the frontier are classified as the best performing, while D, which lies below the frontier, is classified as the worst performing. To expand D to reach the frontier, its individual indicators need to be divided by a number less than one. Therefore, the composite indicator for country D is also less than one. This leads to \(CI_{BOC,A} = CI_{BOC,B} = CI_{BOC,C} = 1\) and \(CI_{BOD,D} = 0D/0D^{'} < 1\).

Fig. 1
figure 1

Frontier constructed by BOD

Here we highlight the assumption of invariant input associated with BOD. This assumption dismisses the fact that each country faces different conditions for providing well-being to its people. As Dasgupta (2001) and Arrow et al. (2004) advocate, it is widely believed that the productive base of economies, which consists of a variety of assets and a social infrastructure, determines people’s well-being. When considering individual well-being indicators as outputs and the productive base as corresponding inputs, is a composite indicator based on BOD still a reasonable performance measure in terms of well-being?

Figure 2 depicts how the inappropriate assumption of invariant input by BOD distorts the calculation of the composite indicators. Suppose a situation where country B is endowed with a productive base two times greater than the other three countries, and that the productive bases of the other three countries are equal to each other. As mentioned before, BOD forms the best practice frontier connecting A, B and C, and classifies the three countries on the frontier as the best performing. However, the production frontier in output space coincides with a set of outputs that is attainable from the given input. Hence, once we assume constant returns to scale, individual well-being indicators of country B which are attainable from the same level of productive base as the other countries, are indicated by \(\tilde{B}\). Thus, when we appropriately incorporate differences in the conditions that affect countries’ abilities to provide well-being to their people, the best practice frontier is formed by connecting A, D and C as drawn in Fig. 2. Thus, while the three countries on the frontier are classified as being the best performing, B is classified as being the worst performing. This leads to \(CI_{BOC,A} = CI_{BOC,C} = CI_{BOC,D} = 1\) and \(CI_{BOD,B} = 0\tilde{B}/0\tilde{B}^{'} < 1\). It suggests that neglecting the differences among the productive bases distorts estimation of the production frontier, leading to a biased performance measure of countries.

Fig. 2
figure 2

Frontier constructed by DEA

Countries endowed with a larger productive base have an advantage in providing well-being to their people compared with those endowed with a smaller one. Since BOD neglects differences in the size of the productive bases, it overestimates countries having a larger productive base. Reasonable performance measures in terms of well-being should evaluate countries’ capabilities of converting the productive base to well-being indicators. Therefore, countries’ performances must be evaluated by comparing the combination of inputs and outputs across countries.

BOD’s estimation procedure for composite indicators or efficiency is rooted in DEA (Charnes et al. 1978). The original DEA is the established tool to measure the efficiency of decision-making units based on the comparison of combinations of inputs and outputs across units in a sample. An efficiency score obtained under DEA is the weighted average of outputs of the evaluated country, for which it defines the composite indicator. By applying DEA to the aggregate of individual well-being indicators along with the productive bases, we can more accurately estimate the production frontier. This leads to a more appropriate performance measure for each country. DEA itself is an established tool; however, we describe it in further detail below to contrast it with BOD in terms of calculating the composite indicator.

We consider the general case that each country k is bestowed with a productive base characterized by a set of N types of wealth \(\varvec{x}_{k} = \left( {x_{1k} , \ldots ,x_{Nk} } \right),\) with \(x_{nk}\) the value of nth wealth in country k.Footnote 11 Countries’ individual well-being indicators are considered as being produced from this wealth vector. As with BOD, DEA constructs the production set \(\varPsi_{DEA}\) through the minimal extrapolation principle. However, unlike \(\varPsi_{BOD} ,\,\varPsi_{DEA}\) deals with input and output pairs. While the minimal extrapolation principle requires \(\varPsi_{BOD}\) to include individual indicators \(\varvec{y}\) for all countries, it requires \(\varPsi_{DEA}\) to include pairs \(\left( {\varvec{x},\varvec{y}} \right)\) for the same.Footnote 12 Inclusion of the productive base changes the shape of the production set, leading to a different frontier. Efficiency is calculated as follows:

$$CI_{DEA,c} = \hbox{min} \left\{ {\theta :\varvec{y}_{c} /\theta \in \varPsi_{DEA} } \right\}$$
(3)

As formulated in (3), \(CI_{DEA,c}\) radially measures the distance from individual well-being indicators of country c, \(\varvec{y}_{c} ,\) to the benchmark production frontier. An alternative formulation of \(CI_{DEA,c}\) is given below to compare \(CI_{DEA,c}\) and \(CI_{BOD,c}\) from the viewpoint of an endogenous weighting scheme.

$$CI_{DEA,c} = \mathop {\hbox{max} }\limits_{{\mu_{1c} , \ldots ,\mu_{Mc} ,\nu_{1c} , \ldots ,\nu_{Nc} }} \left\{ {\sum\limits_{m = 1}^{M} {\mu_{mc} y_{mc} } :\sum\limits_{m = 1}^{M} {\mu_{mc} y_{mk} } \le \sum\limits_{n = 1}^{N} {\nu_{nc} x_{nk} } \,{\quad\text{for }}k = 1, \ldots ,K;\sum\limits_{n = 1}^{N} {\nu_{nc} x_{nc} } = 1 ;\mu_{mc} \ge 0 \,{\quad\text{for }}m = 1, \ldots ,M ;\nu_{nc} \ge 0 \,{\quad\text{for }}n = 1, \ldots ,N} \right\}$$
(4)

Both DEA and BOD select country-specific weights that maximize the composite indicator for each country under evaluation. However, we find a difference in constraint between DEA and BOD in (1) and (4). The constraints in (1) and (4) state that when we apply favourable weights for the evaluated country c, the resulting composite indicators of other countries are under the upper bounds. While (1) sets one as the upper bound across countries, (4) sets \(\sum\nolimits_{n = 1}^{N} {\nu_{nc} x_{nk} }\) as the upper bound for each country k, which varies across countries. Weights assigned to the wealth vector \(\left( {\nu_{1c} , \ldots ,\nu_{Nc} } \right)\) are optimally chosen for the evaluated country c under the assumption that \(\sum\nolimits_{n = 1}^{N} {\nu_{nc} x_{nc} } = 1\). Thus, if the evaluated country c has a relatively small productive base, the upper bound of the constraint for many countries becomes greater than one so that \(\sum\nolimits_{n = 1}^{N} {\nu_{nc} x_{nk} } > 1\), relaxing the constraint of the optimization problem and making the composite indicator of the evaluated country c greater than 1. It indicates that between two countries with the same well-being indicators, the one with the smaller productive base will be more highly evaluated. Generally, countries with smaller productive bases raise their ranking when changing from BOD to DEA. These are the properties that we believe a reasonable composite indicator should satisfy.

3 Data

3.1 OECD Better Life Index

In the midst of growing concerns about identifying an alternative approach to measuring well-being, in 2011, the OECD launched the Better Life Initiative and released a set of 11 well-being indicators covering the 34 OECD member countries, comprising advanced and emerging economies. The data have been updated in 2012 to include the latest figures and add more dimensions to calculate indicators. Moreover, the country coverage has been expanded beyond the OECD to include Brazil and Russia. We use the most recent data covering individual indicators drawn from Your Better Life Index. Unfortunately, since the data of the productive base, which are necessary for computing the composite indicator in the present paper, are missing for Estonia and Slovenia, we focus on the remaining 34 countries.

The 11 individual well-being indicators evaluate topics that the OECD considers essential to people’s well-being. Each individual indicator corresponding to each topic is based on one to four secondary indicators. These underlying secondary indicators are expressed in different units such as dollars, years or number of people. To compare and aggregate values expressed in different units, the values are normalized. The normalization is done according to a standard formula which converts the original values of the individual indicators into numbers between 0 and 10 as follows: \(\frac{{{\text{value to convert}} - {\text{minimum value}}}}{{{\text{maximum value}} - {\text{minimum value}}}} \times 10\). Within each topic, the secondary indicators are averaged with equal weight. For example, while the topic of environment is constructed using two secondary indicators, water quality and air pollution, their scores are first normalized in a range between 0 and 10. They are then aggregated as follows: \(\frac{{{\text{water quality score}} + {\text{air pollution score }}}}{2}\). The 11 individual indicators and their corresponding 24 secondary indicators are shown below.

  • (1) Income (Household income; Household financial wealth)

  • (2) Jobs (Employment rate; Personal earnings; Job security; Long-term unemployment rate)

  • (3) Housing (Rooms per person; Housing expenditure; Dwellings with basic facilities)

  • (4) Work–life balance (Employees working very long hours; Time devoted to leisure and personal care)

  • (5) Health (Life expectancy; Self-reported health)

  • (6) Education (Educational attainment; Years in education; Students’ skills)

  • (7) Community (Social network)

  • (8) Civic engagement (Consultation on rule-making; Voter turnout)

  • (9) Environment (Water quality; Air pollution)

  • (10) Safety (Homicide rate; Assault rate)

  • (11) Life Satisfaction (Life Satisfaction)

Among the 11 individual indicators, the first three topics are categorized under material living conditions and the remaining eight are categorized as QOL. According to the file released by the OECD Better Life Initiative, the data years of the underlying detailed indicators range from 2005 to 2011. Averaging them with each topic equally weighted suggests a year close to 2009. Thus, we consider that the 11 indicators of each country measure the socioeconomic situation of people around 2009.

Table 1 summarizes the statistics of the 11 indicators. As Organization for Economic Cooperation and Development (2011) finds, these tables show that while life is good in many dimensions in countries such as Australia, Canada, Sweden, New Zealand, Norway and Denmark, it is significantly less so in countries such as Turkey, Mexico, Chile, Estonia, Portugal and Hungary. While the latter countries are characterized by their lower per capita income, the former are not necessarily the richest countries, that is, those with the highest per capita income.

Table 1 Descriptive statistics, OECD well-being indicators

From here onwards, we group countries based on per capita GDP to consider the link between well-being and economic development, which is well reflected in per capita GDP. The grouping is as follows: nine high-income countries with per capita GDP greater than USD 40,000, 18 middle-income countries with per capita GDP between USD 20,000 and 40,000 and seven low-income countries with per capita GDP below USD 20,000. Table 1 suggests that people’s well-being improves as income grows in every aspect apart from work–life balance. In this respect, on average, people in middle-income countries enjoy a better life than do those in high-income countries.

3.2 Comprehensive Wealth Accounts

Acknowledging the deficiency of GDP as a measure for tracking sustainable development, the World Bank has been developing comprehensive wealth accounts along with statistics addressing genuine saving. Their wealth accounts are reported in a series of publications (World Bank 1997, 2006, 2011). The concept of comprehensive wealth in its wealth account is broad, and it includes produced capital, natural capital, human capital, social capital and institutional capital. It corresponds to the productive base as a source of well-being of both the present and future generations. Among their publications, we use the data in the most recent publication (World Bank 2011), which covers more than 120 countries. We use the estimates of comprehensive wealth and its subcomponents by country for 2005.Footnote 13 We use the following primary subcomponents.

  • Comprehensive wealth = produced capital + natural capital + intangible capital.

  • Produced capital, comprising machinery, structures and equipment.

  • Natural capital, comprising agricultural land, protected areas, forests and subsoil assets.Footnote 14

  • Intangible capital, comprising human capital, social capital and institutional capital. The rule of law and government, which contribute to an efficient economy, are also included in institutional capital.

Comprehensive wealth is estimated as the present value of future consumption. Produced capital is derived from historical investment data using a perpetual inventory method. The stock value of natural capital is based on country data of physical stocks and estimates of natural resource rents based on prices and costs data. Intangible capital is measured as the difference between comprehensive wealth, produced capital and natural capital. Thus, since all the measurement errors associated with comprehensive wealth, produced capital and natural capital are drawn into the measure of intangible capital, intangible capital data must be cautiously interpreted.

All estimates are as per the US dollar in 2005. Table 2 summarizes the statistics of comprehensive wealth and its subcomponents. World Bank (2011) finds that intangible capital represents the largest share of the wealth of all countries, and that its share increases as per capita income grows. In addition, the important role of natural capital is highlighted. In the early stage of development, countries are relatively dependent on natural capital such as agricultural land, subsoil assets and forests. Countries use these to build produced capital and intangible capital in the course of economic development.

Table 2 Descriptive statistics, World Bank comprehensive wealth accounts

Table 2 reveals a similar pattern between wealth and development even among our sample, which have relatively higher incomes compared with other countries included in World Bank (2011). Intangible capital has the largest share in all the country groups. Natural capital has the smallest share among the three types of capital; however, natural capital of low-income countries is 11 % on average, much higher than that of the other groups.Footnote 15

4 Result

We compute composite indicators based on BOD formulated by (1) \(\left( {CI_{BOD} } \right)\) and DEA formulated by (4) \(\left( {CI_{DEA} } \right)\).Footnote 16 \(CI_{DEA}\) measures countries’ performances by incorporating the individual indicators of well-being as well as the productive bases for countries in a sample. Although the productive base of each country consists of its produced, natural and intangible capital, we compute \(CI_{DEA}\) under three compositions of the productive base. The comparison of the composite indicator calculated under the three different conditions reveals the impact of including each asset into the productive base for composite indicator construction. Denoting produced, natural and intangible capital by \(x_{p} , x_{n} \,{\text{and }}x_{in} ,\) respectively, we consider the following cases of the input vector \(\varvec{x}\), which corresponds to the productive base:

  • Case 1: \(\varvec{x} = \left( {x_{p} } \right)\)

  • Case 2: \(\varvec{x} = \left( {x_{p} + x_{n} } \right)\)

  • Case 3: \(\varvec{x} = \left( {x_{p} + x_{n} ,x_{in} } \right)\)

Case 3 is ideal for using the comprehensive set of produced, natural and intangible capital. In cases 1 and 2, as the second best, the productive bases are approximated by part of the productive bases, such as produced capital or tangible capital. In case 3, we differentiate intangible capital from produced and natural capital, rather than summing up all three forms of capital into a single measure of the productive base.Footnote 17 This is because intangible capital differs considerably in character from produced and natural capital.

Table 3 presents the full empirical result, containing the scores of composite indicators and corresponding rankings along with existing HDI and GDP per capita, which is denoted by income in the table. To ensure comparability with composite indicators, we rescale HDI score so that its maximum value is one. It is shown that composite indicators and HDI fail to completely discriminate among all the countries in our sample, and that some countries are ranked equally. While HDI indicates ties among at most three countries a between the lowest score zero and highest score one, a larger number of countries are equally ranked highest in BOD and DEA. However, comparing the number of countries equally ranked (19 out of 34 for BOD, six for DEA case 1, eight for DEA case 2, 10 for DEA case 3), we note the better discriminating power of \(CI_{DEA}\) than \(CI_{BOD}\). Incorporating the information about the productive base as well as individual well-being indicators allows us to distinguish countries’ performances in greater detail.

Table 3 Composite indicator with HDI and per capita income

As expected, \(CI_{BOD}\) is evaluated higher than the previous studies for countries that achieve high scores in dimensions of well-being other than income, education and health. For example, the scores of Luxembourg and the UK are one, which is the highest under \(CI_{BOD} ,\) while they are ranked relatively low among OECD countries by HDI, Mahlberg and Obersteiner (2001) and Zaim et al. (2001). The well-being of people in the UK is close to the highest level in multiple dimensions such as community, environment and safety. Luxembourg records decent scores of well-being in all dimensions except education, leading to a lower evaluation under HDI. On the other hand, countries that lose their ranking largely because of switching from HDI to \(CI_{BOD}\) have relatively low scores in dimensions of QOL other than education and health.

We compare the distribution of \(CI_{BOD} ,\,CI_{DEA}\) and HDI among countries. According to Table 4, the mean, the variation characterized by the standard deviation and the range of the distribution characterized by the difference between the maximum and minimum scores are similar for \(CI_{BOD}\) and HDI. In contrast, \(CI_{DEA}\) is distributed in a wider range and has a higher mean than \(CI_{BOD}\) and HDI, which improves discriminating power by reducing the number of equally ranked countries, which has already been mentioned.

Table 4 Descriptive statistics, composite indicator with HDI and per capita income

Table 5 shows the correlation of composite indicators with HDI score and GDP per capita. It shows that \(CI_{BOD}\) and HDI, which share a similar pattern of distribution, are highly correlated with each other. The quest for an alternative welfare measure stems from an acknowledgement of the limitations of GDP per capita as a welfare measure. We now consider the relationship between composite indicators and GDP per capita. While \(CI_{BOD}\) is positively correlated with GDP per capita, \(CI_{DEA}\) is negatively correlated with it. Table 1 suggests that GDP per capita and the scores of individual well-being indicators are directly proportional. Larger values of individual indicators raise the composite indicator \(CI_{BOD} ,\) of which they are components. Therefore, the positive correlation between \(CI_{BOD}\) and GDP per capita is straightforward. While the correlations between \(CI_{BOD}\) and HDI and between \(CI_{BOD}\) and GDP per capita are 0.7859 and 0.6314, respectively, the corresponding correlations of the composite indicator in Mahlberg and Obersteiner (2001) are 0.893 and 0.565. Incorporating more dimensions of well-being than HDI and Mahlberg and Obersteiner (2001), \(CI_{BOD}\) further deviates from HDI. Consequently, \(CI_{BOD}\) more closely approaches GDP per capita.

Table 5 Correlation among composite indicators

Interpreting the negative correlation between \(CI_{DEA}\) and GDP per capita is not as intuitive. It exhibits a marked contrast to the previous studies. Strikingly, all the countries in the low-income group, except Hungary, are ranked highest under \(CI_{DEA}\). Before we continue the detailed analysis of \(CI_{DEA} ,\) we compare the computed scores of \(CI_{DEA}\) under the three different cases. In Table 5, the largest correlation coefficient is approximately 0.95, which is found between cases 2 and 3 of \(CI_{DEA}\) in terms of score values as well as ranking. The most difficult aspect of implementing \(CI_{DEA}\) is preparing the productive bases of countries, especially the intangible capital. The similarity between cases 2 and 3 suggests that integrating the natural capital into the productive base is the most critical step in calculating \(CI_{DEA}\). Thus, even when estimates of intangible capital are unavailable, by using produced and natural capital, we approximate the result that should be obtained under the most appropriate definition, which states that the productive base consists of produced, natural and intangible capital.

The source of this negative correlation between \(CI_{DEA}\) and GDP per capita can be inferred from Table 6.Footnote 18 The productive base strongly correlates negatively with \(CI_{DEA}\) and positively with GDP per capita, leading to negative correlation between \(CI_{DEA}\) and GDP per capita. High-income countries can afford to invest a variety of assets that are accumulated into their large productive base. However, these countries are likely to fail in providing the level of well-being appropriate for their large productive base. This leads to larger correlations between the productive base and \(CI_{DEA}\) and between the productive base and GDP per capita in absolute terms than the correlation between \(CI_{DEA}\) and GDP per capita.

Table 6 Correlation among composite indicators and underlying statistics

The large negative correlation between \(CI_{DEA}\) and the productive base does not necessarily imply that their country rankings are in reverse order. The US and Canada raise their \(CI_{DEA}\) ranking compared with their ranking based on the productive base. In particular, the US is ranked highest, with countries having the lowest GDP per capita and the smallest productive base. Although the large productive bases of the US and Canada are advantageous to these countries, their people enjoy a better life than is explained by their large productive base, leading to a higher evaluation of their performance. Similarly, Belgium and the UK, whose productive bases are comparably large, are also ranked relatively high under \(CI_{DEA}\) among the high-income countries, indicating the significant well-being their people enjoy, which is balanced with their large productive bases. On the other hand, countries that lose their country ranking are not always endowed with a large productive base. The productive bases of Greece, New Zealand and Spain are approximately USD 40,000, below the average of USD 45,000. However, their country rankings based on \(CI_{DEA}\) are even lower than those based on the productive base. This indicates that the well-being of their people is even lower than that expected from their relatively modest productive base.

\(CI_{BOD}\) and \(CI_{DEA}\) choose the country-specific weights that maximize the resulting composite indicator score of each country under evaluation. Therefore, higher weights are assigned to the individual indicators on which each country performs well. Therefore, it is possible to conclude that the composite indicators reflect the policy priority of each country on country-specific weights. Table 7 shows the average of the weight for each individual indicator in Eqs. (1) and (4). Widely distributed weights indicate that policy priority encompasses a spectrum covering 11 dimensions, revealing the importance of increasing dimensions in measuring countries’ performance. A clear distinction in relative weights is found between \(CI_{BOD}\) and \(CI_{DEA} ,\) indicating that neglecting the productive base distorts the inference of policy priorities of countries. Three individual indicators characterizing material living conditions—housing, income, and jobs—have relative priority in \(CI_{BOD}\). Especially, the components of health and safety have the largest weights. On the other hand, the remaining individual indicators characterizing QOL have relative priority in \(CI_{DEA}\). Especially, the largest weight is attached to a component of income. Civic engagement retains a constant role in every composite indicator. This might reflect that institutional characteristics in the productive base which affect civic engagement are not captured by capital assets considered in this paper.

Table 7 Average weight for composite indicators

5 Conclusion

Well-being is a multidimensional concept. The OECD recently specified 11 topics that are essential to people’s well-being. However, the fact that the OECD leaves their aggregation to the user motivates our research. We adopt two approaches, BOD and DEA, to construct a composite indicator from 11 individual indicators of well-being. Unlike HDI, both approaches aggregate individual indicators by investigating country-specific weights that maximize the composite indicator of each country.

The composite indicator based on BOD is distributed similarly to and is highly correlated with the existing HDI. It is also positively correlated with GDP per capita. On the other hand, the composite indicator based on DEA is negatively correlated with HDI as well as GDP per capita. Especially, the group of countries with the least GDP per capita is now ranked highest under DEA. Although these countries have very low well-being indicator scores, the volume of their productive base is further limited. Thus, once we consider that they find it difficult to provide well-being to their people, even though scores of their individual indicators of well-being are relatively low among a sample of countries, their performances are highly appreciated. Many countries with a larger productive base fail to provide an adequate level of well-being and are devaluated under DEA, leading to a significantly negative correlation between the DEA composite indicator and the productive base. Nevertheless, there are exceptional countries. Although the US and UK have high GDP per capita and larger productive bases, they successfully ensure a sufficiently high level of well-being for their people. Thus, they are highly ranked even under DEA. On the other hand, while the productive bases of Greece, New Zealand and Spain are relatively modest in size, they are ranked very low under DEA. This reflects a significantly low level of well-being that is unjustifiable considering their productive base.

This study has certain limitations. First, we consider an aggregation across 11 individual indicators, but we do not address how to construct each individual indicator. There are between one and four secondary indicators underlying each individual indicator. By merely utilizing indicators released by OECD, we do not consider the aggregation of these secondary indicators into a single individual indicator. The resulting composite indicators, which aggregate the 11 individual indicators, are easily influenced in this preliminary stage of aggregation. It is important to verify that the construction process of OECD’s 11 individual indicators is appropriate. Second, the 11 individual indicators of each country measure national average well-being in a specific aspect of people’s lives. Thus, the composite indicators we propose do not incorporate the distribution of well-being within each country in any dimension. When comparing two countries with the same national average of well-being, the one with a more equitable distribution of well-being people is likely to be evaluated higher. These shortcomings invite further investigation to establish a more comprehensive framework for international comparison of well-being that covers all the secondary indicators as well as distributional measures such as income inequality and leisure inequality.

While our analysis is preoccupied with comparing countries’ current performances, how their performance changes over periods is also important. There is significant DEA literature that measures the change in efficiency and divides it into several components. Once individual well-being indicators become available for multiple periods, they can easily be used to consider the change in countries’ abilities to ensure their people’s well-being based on their productive base. We leave this exercise to future research.