Introduction

Categorization of the countries and their ranking as per the development status has been a major focus for the institutions like the World Bank, IMF, and UNDP as it could facilitate better policymaking and efficient use of resources. For example, the release of funds for development activities (Mylevaganam, 2017). However, this classification of countries according to the level of development has been challenging, as the process of development is multidimensional. According to Nielsen (2011), “when it comes to classifying countries according to their levels of development, there is no criterion (either grounded in theory or based on an objective benchmark) that is generally accepted”. The major difficulties are on two grounds. The first is to arrive at a unique and widely acceptable measure of development that could capture the overall development process. And the second relates to its methodological difficulties in choosing an appropriate technique to classify the countries based on the chosen measure of development.

The measurement issue could be tackled by choosing an existing measure of development and using it as the criteria for classification. For example, some of the popular indicators of development could be, GDP measuring level of income, Level of Living Index (Drewnowski & Scott, 1966), Socio-Economic Development Index (UNRISD, 1970), Physical Quality of Life Index (Morris, 1979), Human Development Index (UNDP, 1990), Happy Planet Index (NEF, 2006), Multidimensional Poverty Index (Alkire & Santos, 2010) and Social Progress Index (Social Progress Imperative,Footnote 1 2010). Examples for classification based on the existing measures could be the ones by international organizations such as the World Bank, the OECD, the UNDP, and UNCTAD. Since 1978, the World Bank has been classifying the countries based on per capita Gross National Income (GNI) as it considers per capita income as the simple and appropriate indicator of economic progress and human well-being (World Bank, 2012). Accordingly, the World Development Reports by the World Bank classifies countries into four income groups. These are: (1) Low-Income Economies (LIE) with GNI of $1025 or less. (2) Lower-Middle Income Economies (LMIE) with GNI between $1026 and $3995. (3) Upper-Middle Income Economies (UMIE) with GNI between $3996 and $12,375. (4) High-Income Economies (HIE) with GNI of $12,376 or more.

The classification by OECD’s Development Assistance Committee (DAC), uses the World Bank’s income classification and categorizes countries into two groups namely, “developed” and “developing” where all the countries belonging to LIE, LMIE, and UMIE are classified as developing countries and HIE countries to developed countries. Though these classifications are widely accepted, it suffers from the drawbacks of using GDP as the underlying measure of development and its classification.Footnote 2 While the OECD DAC and the World Bank use income as the criteria, UNDP takes an alternative route and classifies countries based on the HDI where the countries are classified into four groups viz., (1) Very high human development countries, with HDI greater than 0.800 in 2019. (2) High human development countries, with HDI between 0.700 and 0.799. (3) Medium human development countries, with HDI between 0.550 and 0.699. (4) Low human development countries, with HDI less than 0.550. Even though the use of HDI is a definite improvement over the earlier classifications based on income,Footnote 3 the former is criticized by many as development in recent times implies far more than just literacy, good health and quality of living could capture (Decancq & Lugo, 2009; Desai, 1991; Lind, 2004; Mylevaganam, 2017; Noorbaksh, 1998; Ravallion, 1997; Santos & Santos, 2014; Srinivasan, 1994; Streeten, 1995).

In short, a major drawback in choosing GDP or HDI as an indicator of development is that they only capture a few aspects of development and fail to represent the overall development that is a multi-dimensional process (Basel et al., 2020). As observed by Todaro and Smith (2011), development should encompass all the economic, social, institutional, and political mechanisms that could enrich human life. Moreover, in recent times with emphasis on the sustainability of development, we believe that an appropriate measure of development should also account for variables related to environment and energy use. This would require going beyond the already existing measures and arriving at a new, broader indicator of development. This is easier said than done, as there are a lot of difficulties involved right from selecting variables to choosing an appropriate methodology to aggregate the variables.

Once the appropriate measure is identified, the next difficulty is in selecting a methodology to classify the countries according to the state of development. The classifications by the World Bank, OECD, and UNDP mentioned above follow the ordinal criterion methodology while grouping the countries (Vázquez & Sumner, 2012). The major limitation in such classification is that it fails to include an appropriate number of groups of countries and more importantly, the “development thresholds” that separate the groups are subjective (Nielsen, 2011). This gap can be bridged through cluster analysis, a statistical technique that provides a more accurate and objective classification of countries instead of mere ordering or ranking based on a given development indicator. Further, it also allows to include a large number of indicators that could reflect the multidimensionality of the development process. The existing panel clustering methods are based purely on multivariate statistical theory (Liu et al., 2021). For instance, Bayesian approach (Aßmann & Boysen-Hogrefe, 2011), artificial bee colony (Banharnsakun, 2018), data mining (Alizadeh et al., 2008), Density-Based Spatial Clustering of Applications with Noise (Azzalini & Menardi, 2016) and Gaussian Mixture Model (Malsiner-Walli et al., 2016). While these methods are a significant improvement over the univariate methods like the ordinal criterion techniques, a major limitation with the above-mentioned methods is that they mainly focus on the improvement and design of the clustering algorithm, rather than examining the multiple dynamic information and features present in the panel data (Liu et al., 2021). The dynamic relationship between the variables across the panel is important when one speaks about development, its measurement, and classification. Ignoring these dynamics may lead to underutilization of the available information that could turn significant while forming clusters thereby resulting in clustering errors (Cameron & Miller, 2015).

In this context, this paper has two broad objectives. The first is to capture the broader dimensions of development through various development indicators such as economic, social, sustainability, and institutional factors for 102 countries starting from the year 1996 through 2015. To this end, the paper attempts to construct four indices for measuring development namely, Economic Index (EI), Social Index (SI), Sustainability Index (SUI), and Institutional Index (II). The variables under each of the indices are aggregated using the weights obtained from the PCA analysis. Having constructed the indices, the second objective is to use them for classifying the countries through the clustering analysis. Using K-means clustering algorithms we try to categorize the countries as per the above-mentioned development indices. However, before proceeding with K-means clustering, we apply grey relational analysis (GRA) as it will explain the partial information and uncertainty related to the development trend for the chosen countries. Using the grey relational analysis and then proceeding with K-means clustering will allow us to incorporate all the available information and panel dynamics. Apart from constructing four indices of development and integrating the GRA and K-means methodology, a major contribution of our study is its analytical framework. As noted, the clustering exercise is based on four indicators of development rather than taking one composite measure of development. This, according to us, will help capture the development profile of each cluster with a clear understanding of those factors that have led to its development or underdevelopment.

The sections in the paper following this discussion are organized as follows: “Literature survey: a review of the methodologies” section  presents the literature review on the construction of development index, grey relational analysis, and K-means clustering; “Data and methodology” section covers the explanation of the data and outlines the methodologies; “Empirical analysis” section  discusses the empirical findings; and, finally, “Summary and conclusion” section concludes the paper.

Literature survey: a review of the methodologies

In this section, we discuss various studies undertaken related to the construction of development indices, grey relational analysis, and K-means clustering.

Construction of development indexFootnote 4

In the recent past, an argument has been that GDP as a measure of development suffers from many limitations as this narrow definition of development fails to encompass the broader aspects of development which have a considerable effect on individual and social well-being (Goossens et al., 2007; Schepelmann et al., 2010; Sen, 1983; Stiglitz et al., 2009; Wilkinson et al., 2010). There exists extensive research on the limitation of and criticism for the use of income alone as a parameter of the overall development of the country. The general opinion is that the use of GDP as an indicator of development may be misleading because it merely gives a monetary measure of the level of production (Costanza et al., 2009; McGranahan et al., 1972; Victor, 2010). This argument is valid as the growth in GDP fails to account for the satisfaction of basic needs and also the distribution pattern of income in the society (Van den Bergh & Antal, 2014). It is in this context that Sen (1983) argues, “supplementing data on GNP per capita by income distributional information is quite inadequate to meet the challenges of development analysis”.

These limitations and criticisms with income as a general measure of development led to a paradigm shift in the way by which economists viewed the development process. The popular view was that economic development is a much broader concept than just the growth of income (Goossens et al., 2007; Nordhaus & Tobin, 1973). This led to redefining the concept of economic development in a way that emphasized the inclusion of economic, social, political, and institutional mechanisms that could bring about rapid and large-scale improvement in the standard of living of the people (Todaro, 1989). Thus, in the post-1970s the emphasis was on measuring development as a multidimensional concept involving various indicators of development which goes beyond just income and its related measures (Booysen, 2002; Greco et al., 2016). The initial steps towards the construction of a comprehensive measure for development were led by institutions such as UNESCO (1974, 1976), UN (1975), UNRISD (1978, 1979), OECD (1973, 1977). The pioneer in this field was the UN Research Institute for Social Development (UNRISD) which had taken several initiatives for the formulation of composite development indices. For example, ‘Level of Living Index’ (Drewnowski & Scott, 1966), ‘Socio-Economic Development Index’ (McGranahan et al., 1972) and Physical Quality of Life Index (Morris, 1979).

Later the World Bank (World Development Report, 1991) presented a more comprehensive view of development which asserted that the development should encompass better education, good health, less poverty, cleaner environment e.t.c. Around the same time, Mahbub ul Haq devised the Human Development Index (HDI) which has been recognized as the measure of economic development by the United Nations Development Programme (UNDP). However, there are several criticisms concerning the construction and components of HDI. Among others relating to construction of the index, a major conceptual criticism was that it fails to capture broader dimensions of the development (Decancq & Lugo, 2009; Desai, 1991; Lind, 2004; Noorbaksh, 1998; Ravallion, 1997; Santos & Santos, 2014; Srinivasan, 1994; Streeten, 1995). Post-2000’s witnessed a surge in the construction of various measures of societal well-being comprising wider indicators of development. Some of the major indices of this period are the Happy Planet Index (2006), Multidimensional Poverty Index (2010), Social Progress Index (2010), and Gross Happiness Index (2012).

The major challenge and greater difficulty in constructing a broader measure of development are in choosing an appropriate methodology to aggregate them into a single composite index. Common practice is to aggregate the variables into a composite index by assigning equal weights. However, assigning equal weights would lead to biasness as it fails to distinguish between more important and less important variables (Greco et al., 2018). Hence, for our study, we have considered PCA to derive weights. PCA is advantageous while dealing with ‘double counting’ by correcting overlapping information in two or more variables (OECD, 2008). Among others, Ram (1982), Noorbakhsh (1996), Lai (2003), Guptha and Rao (2018) and Basel et al. (2020) have used PCA for construction of various developmental indices.

Grey relational analysis (GRA) and K-means clustering

Grey System Theory was developed by Deng in 1982 for understanding the process where information is partial, unclear, and uncertain (Deng, 1989). This system is mainly used for modeling uncertainty, analyze system relations, establish models and make forecasts and decisions (Tsai et al., 2005). This system has been widely used in various fields of social science and natural sciences (Huang et al., 2006; Shi et al., 2008; Wang et al., 2010). In economics, this methodology has been used in different areas such as supply chain management, decision-making process, financial performance evaluation, credit risk, energy consumption, forecasting, etc.

For instance, based on the Grey system theory, Sahu et al. (2014) have developed a multi-criteria group decision-making method and multi-objective optimization ratio analysis, which facilitate multi-criteria group decision-making (MCGDM) problems under uncertain environments. His analysis also provides an appropriate compromise ranking order for available possible alternatives. For the evaluation of financial performance, Wu et al. (2010) have used the grey relational analysis for a performance evaluation of one of Taiwan’s banks through a combined analytical hierarchy process (AHP) and GRA. Similarly, using the GRA, Jin et al. (2012) have analyzed the credit risk of a commercial bank in China and have recognized the implementation of differentiated credit policy to promote the development of the banking and financial sector. For understanding the relationship between energy consumption and economic growth, Kose et al. (2013) have analyzed the development pattern of Turkey through GRA and concluded that oil, renewable energy sources, and GDP are the major drivers of development in Turkey. Likewise, Huang and Wang (2013) constructed and evaluated an index system for the industrial economy-ecology-coordinated development in China.

The studies covering the grey forecasting models include that of Zhang and Chen (2014) where the model GM (1, 1) and the metabolic GM (1, 1) have been used for predicting the monetization ratio to determine the facts that are responsible for the financing difficulty of the small and medium-sized enterprises in China. On similar lines, Tai et al. (2011) propose both a grey decision and prediction model based on the GM (1, 1) model for easing the decision-making process and enabling an effective forecasting approach. A study by Huang (2014) used a grey forecasting method to predict the development of the logistics industry in the Henan province. It forecasts the trends and factors leading to industrial development in Henan. Further, the studies that have also used the grey systems theory in different fields of economics include maximizing bidder’s profit in online auctions (Lim et al., 2012), dynamics and chaos control (Hu & Xia, 2012), analysis of consumer income elasticity and predicting its trend (Luo & Song, 2012), evaluation of innovation competency in the aviation industry (Zhu et al., 2012), ranking the advanced manufacturing systems (Goyal & Grover, 2012), examining the qualitative relationship among the innovative capability of the manufacturing industry and local government behavior (Bi et al., 2011) and a real estate marketing investment model (Shui, 2014).

While GRA was used for analyzing a variety of economic problems, the use of K-means in economics was largely confined to clustering analysis. An example of such an exercise in context to our study could be one by Mylevaganam (2017) where he clustered countries into various groups using the K-means clustering algorithm with a modified HDI through PCA as a measure of development. Recently, Mallikarjuna and Rao (2020) have used K-means clustering to classify stock markets based on financial factors. Their study considered forty-five stock markets for classification and they clustered the financial markets by using data mining techniques, viz. K-Means, Hierarchical, and Fuzzy C-Means. Apart from economics, K-means has been used in other fields of studies too. Some examples could be, Zhao et al. (2006), where they applied a K-means Clustering algorithm using data detection and symbol-timing recovery for a burst-mode optical receiver. Barca et al. (2008), using a modified K-means algorithm, have removed noise in multicolor motion Capture Image Sequences. K-means clustering was used by Roy and Sharma (2010) to identify several gene-expression benchmarks. However, despite its use in analyzing various issues across the fields, studies using the K-means cluster algorithm have remained subdued.

Considering studies that incorporate the fusion of GRA and K-means clustering, despite its advantage over individual exercises using either K-means or GRA, we notice that studies of such nature are rather rare. Using the mix of GRA and K-means would allow us to integrate and efficiently use the available information while accounting for panel dynamics. Emami and Razi (2014) applied a hybrid grey relational analysis and K-means to cluster and measure the performance of the banking system for Semnan, Iran. They also attempts to re-examine the implementation of this method for a newly established bank in the city of Tehran, Iran. A similar study was conducted by Zarandi et al. (2014) measuring the relative efficiency of different banks using a hybrid of K-means and grey relational analysis.

With this, we conclude our literature survey on the various issues that are important while we attempt to cluster countries based on their development status. As seen in the aforesaid discussion, (1) the need for an appropriate measure for development and (2) the methodology to cluster the countries without losing the information and dynamics involved in the panel are two important issues that need to be addressed while one attempts to group countries.

Data and methodology

In this section, at first, we present details of the data and variables that are used in the construction of the development indices that are used in our study. Then we proceed with explaining the methodologies, PCA, grey relational analysis, and K-means clustering.

Data and variables

The data has been obtained from official sources such as UNESCO and the World Bank covering 102 countries from 1996 to 2015. The variables used to construct social, economic, sustainability, and institutional development indices are given along with notation in Table 1.

Table 1 Variables, notations, and sources

The social index comprises the variables which encompass the aspect of education, health, and access to drinking water and sanitation facilities. Better education and good health contribute to better human capital and enhances productivity. These factors play a crucial role in improving the standard of living as well as the way of living and thus contribute to the overall development of the country. The variables for the construction of the sustainablility index include consumption of renewable and non-renewable energy sources, forest cover, and emission of CO2. Forest cover along with the use of energy sources are taken as a proxy for the availability of natural resources. Thus, a higher availability of these resources at present could imply sustainable development in the later phases of the development process. The variables used for the construction of the economic index show the overall macro-economic performance of the country. High levels of GDP per capita, productivity, and capital stock can help to achieve higher levels of economic growth. Whereas, lower inflation is desirable for a stable economy. Finally, the governance attributes captured by the rule of law and voice and accountability are proxies for the institutional index.

Methodology: construction and classification

The methodologies employed for the construction and classification of the development indices are discussed in the following section.

Principal component analysis

In the construction of Economic Index (EI), Social Index (SI), Sustainablility Index (SUI), and Institutional Index (II), we employed the statistical technique of factor analysis to obtain the weights of these indices. We identify the factor weights by using the principal components analysis (PCA). PCA is a technique of analyzing and identifying patterns in data and expressing the data in condensed form to highlight their similarities and differences. It transforms a large number of correlated variables into a smaller number of uncorrelated variables but retains the information in a large set. These uncorrelated variables which are extracted from original set’s variables using their correlation matrix are called principal components.

Let \({X}_{i}\) (i = 1, 2, 3, …, n) be the set of n variables that are transformed into a new set of m variables.

Let \({P}_{j}\) (j = 1, 2, 3, …, m) (m << n) be the principal component which is represented as the linear combination of the Xs i.e.,

$${PC}_{k}={a}_{k1}{X}_{1}+{a}_{k2}{X}_{2}+\dots +{a}_{kn}{X}_{n}$$
(1)

where \({PC}_{k}\) is the kth principal component. \({X}_{1}+{X}_{2}+\dots +{X}_{n}\) are the original set of variables. \({a}_{k1}+{a}_{k2}+\dots +{a}_{kn}\) are loading or scores of respective \({X}_{i}\) in the kth principal component. The component loadings are the weights showing the variance contribution of principal components to variables. The variance of the principal components is given by eigenvector (x) and eigenvalues (λ) of the variance–covariance matrix (A) as,

$$Ax=\lambda$$
(2)
$$\left|A-\lambda I\right|=0$$
(3)

where ‘I’ is an identity matrix.

Grey relational analysis

The categorization of the countries based on development is challenging as it often encounters uncertainty, lack of information, and complexity of data. To overcome this, we choose to employ grey relational analysis. The steps involved in this method are as follows:

  1. 1.

    The formation of Decision Matrix: The first step is to transform the panel data into the decision matrix or behavior matrix. For this, we first remove the dimension effect of different indices by averaging the original dataset. The decision matrix is given as,

    $${X}_{i}=\left[\begin{array}{cccc}{x}_{1}(1)& {x}_{1}(2)& \cdots & {x}_{1}(n)\\ {x}_{2}(1)& {x}_{2}(2)& \dots & {x}_{2}(n)\\ \vdots & \vdots & \ddots & \vdots \\ {x}_{n}(1)& {x}_{n}(2)& \cdots & {x}_{n}(n)\end{array}\right]\quad {\mathrm{where}}\,\,{\text{i}}=1, 2, \dots ,\mathrm{n}$$
    (4)
  2. 2.

    The formation of reference series: Let \({X}_{0}\) be the reference series which is taken as the maximum values within the dataset.

    $${X}_{0}=\{{x}_{0}\left(1\right),{x}_{0}\left(2\right),\dots ,{x}_{0}\left(n\right)\}$$
    (5)
  3. 3.

    The formation of Absolute value: The absolute value is obtained as,

    $${\Delta X}_{0i}\left(j\right)=\left|{x}_{i}\left(j\right)-{x}_{0}\left(j\right)\right|=\left[\begin{array}{cccc}{\Delta x}_{01}(1)& {\Delta x}_{01}(2)& \cdots & {\Delta x}_{01}(n)\\ {\Delta x}_{02}(1)& {\Delta x}_{02}(2)& \dots & {\Delta x}_{02}(n)\\ \vdots & \vdots & \ddots & \vdots \\ {\Delta x}_{0n}(1)& {\Delta x}_{0n}(2)& \cdots & {\Delta x}_{0n}(n)\end{array}\right]$$
    (6)
  4. 4.

    The formation of grey relational coefficient matrix: The grey relational coefficient is computed as follows,

    $${\gamma }_{0i}\left(j\right)=\frac{\Delta min+\xi \Delta max}{{\Delta X}_{0i}\left(j\right)+\xi \Delta max}$$
    (7)

    where \(\Delta min={min}_{i}{min}_{j}{\Delta X}_{0i}\left(j\right)\) and \(\Delta max={max}_{i}{max}_{j}{\Delta X}_{0i}\left(j\right)\).

  5. 5.

    The calculation of grey relational degree: The grey relational degree is calculated through the formula as,

    $${\theta }_{0i}=\frac{1}{n}\sum_{j=1}^{n}{\gamma }_{0i}\left(j\right)$$
    (8)

    If the weights (\({W}_{i})\) of the data are to be taken into consideration then we have,

    $${\uptheta }_{0\mathrm{i}}=\sum_{\mathrm{j}=1}^{\mathrm{n}}\left[{\mathrm{W}}_{\mathrm{i}}(\mathrm{j}){\upgamma }_{0\mathrm{i}}\left(\mathrm{j}\right)\right].$$
    (9)

K-means clustering

K means clustering, developed by MacQueen (1967) is the most popular and effective clustering technique which is widely used for dividing data sets into clusters (Aggarwal, 2003; Han et al., 2011; Jain & Dubes, 1988; Jain & Flynn, 1996). K means clustering algorithms aim at minimizing the variation in the inter-clustering data point while maximizing the variation between the clusters. This is done by minimizing the objective function J, which is defined as:

$$J=\sum_{i=1}^{m}\sum_{k=1}^{n}{\gamma }_{ik}{\Vert {x}_{i}-{\mu }_{k}\Vert }^{2}$$
(10)

where \({\gamma }_{ik}\) = 1 for the data point \({x}_{i}\) if it belongs to cluster k; otherwise \({\gamma }_{ik}\) = 0 and \({\mu }_{k}\) is the centroid of \({x}_{i}\) cluster.

Now, the objective function \(J\) is minimized into two ways, first w.r.t \({\gamma }_{ik}\) treating \({\mu }_{k}\) as constant and second w.r.t \({\mu }_{k}\) treating \({\gamma }_{ik}\) as constant. Thus, we have,

$$\frac{\partial J}{\partial {\gamma }_{ik}}=\sum_{i=1}^{m}\sum_{k=1}^{n}{\Vert {x}_{i}-{\mu }_{k}\Vert }^{2}$$
(11)
$$\begin{aligned}\underset{.}{\Rightarrow }{\gamma }_{ik}=\left\{\begin{array}{ll}1,& if\, k={argmin}_{j}{\Vert {x}_{i}-{\mu }_{k}\Vert }^{2}\\ 0,& otherwise\end{array}\right.\end{aligned}$$
(12)

And,

$$\frac{\partial J}{\partial {\mu }_{k}}=2\sum_{i=1}^{m}{\gamma }_{ik}({x}_{i}-{\mu }_{k})=0$$
(13)
$$\underset{.}{\Rightarrow }{\mu }_{k}=\frac{\sum_{i=1}^{m}{\gamma }_{ik}{\mu }_{k}}{\sum_{i=1}^{m}{\gamma }_{ik}}$$
(14)

The steps involved in the k-means algorithm steps are:

  1. 1.

    Select the number of clusters (k).

  2. 2.

    Select the cluster centers, \({\mu }_{k}\).

  3. 3.

    Assign the data points \({x}_{i}\) to the nearest cluster.

  4. 4.

    Calculate the centroid or mean of all objects in each cluster.

  5. 5.

    Repeat steps 2, 3, and 4 until J is invariant (variance \(<\varepsilon\)).

Empirical analysis

Having explained the data and methodology, in this section we turn into empirical analysis.

Principal component analysis

In the construction of the indices, we employed the statistical technique of factor analysis to obtain the weights for each of the variables that are mentioned in Table 1 earlier. We identify the factor weights by using the principal components analysis (PCA). To ensure the suitability of PCA, the Kaiser–Meyer–Olkin (KMO) test is conducted to measure the sampling adequacy. If the K–M–O measure is between 0.5 and 1.0, then the factorability is assumed. The result of the adequacy test is given in Table 2.

Table 2 Results of sample adequacy test

From the above table, we can see that all the proposed indices satisfy the condition for KMO. In Table 3, we present factor loading or scores for social, sustainablility, economic, and institution indices.

Table 3 Results of rotated factor loadings

The selection of principal components is based on Eigenvalues, where the minimum value is expected to be more than one. In our case, principal component one satisfies this criterion for all the indices and thus we have used only the first components for calculating the weights.

The weights are obtained by squaring each of the factor loadings representing the proportion of the total unit variance of the indicator explained by the factors.Footnote 5 For instance, the weight of EYS is 0.1669 which is calculated as (0.9964/(0.9964 + 0.9988 + 0.9919 + 0.9963 + 0.9974 + 0.9900)). Similarly, the weights of the other variables are calculated using explained variation and are presented in Table 4. These weights are then aggregated to determine the various indices for each of the countries using the following formula:

$${SI}_{t}^{(j)}=\frac{\sum {{W}_{i}z}_{ti}^{(j)}}{\sum {W}_{i}}$$
(15)
$${SUI}_{t}^{(j)}=\frac{\sum {W}_{i}{z}_{ti}^{(j)}}{\sum {W}_{i}}$$
(16)
$${EI}_{t}^{(j)}=\frac{\sum {W}_{i}{z}_{ti}^{(j)}}{\sum {W}_{i}}$$
(17)
$${II}_{t}^{(j)}=\frac{\sum {W}_{i}{z}_{ti}^{(j)}}{\sum {W}_{i}}$$
(18)

where \({SI}_{t}^{(j)}, {SUI}_{t}^{(j)}, {EI}_{t}^{(j)}\,\,{\mathrm{and}}\,\, {II}_{t}^{(j)}\) are the social, sustainability, economic, and institution indices for \({j}^{th}\) country at period t. \({z}_{ti}^{(j)}\) are the normalized variables taken for the study.

Table 4 Squared factor loadings and weights of the variables (scaled to unity sum)

Grey relational analysis

Considering the spatio-temporal feature of panel data, and the uncertainty and variation of the development process across the countries during the period of study we employ grey relational analysis before proceeding further for cluster analysis. At first we average the panel data indices over the period and then form a decision matrix. We then select the reference series as the maximum value of each index across the 102 countries as

$${X}_{0}=\{0.618, 0.622, 0.649, 0.668\}$$
(19)

After selecting the reference series, we calculated the absolute value of each index as,

$${\Delta X}_{0i}\left(SI\right)=\left|{x}_{i}\left(SI\right)-0.618\right|$$
(20)
$${\Delta X}_{0i}\left(SUI\right)=\left|{x}_{i}\left(SI\right)-0.622\right|$$
(21)
$${\Delta X}_{0i}\left(EI\right)=\left|{x}_{i}\left(SI\right)-0.649\right|$$
(22)
$${\Delta X}_{0i}\left(II\right)=\left|{x}_{i}\left(SI\right)-0.668\right|$$
(23)

Now, for the calculation of the grey relational coefficient we have,

$$\Delta min={max}_{i}{max}_{j}{\Delta X}_{0i}\left(j\right)= 0.405$$
(24)
$$\Delta max={min}_{i}{min}_{j}{\Delta X}_{0i}\left(j\right)=0$$
(25)

Thus,

$${\gamma }_{0i}\left(j\right)=\frac{\Delta min+\xi \Delta max}{{\Delta X}_{0i}\left(j\right)+\xi \Delta max}=\frac{0+0.5*0.405}{{\Delta X}_{0i}\left(j\right)+0.5*0.405}$$
(26)

The calculation result of the grey relational coefficient for all the 102 countries is presented in “Appendix 1”.

K-means clustering

Based on the grey relational coefficient obtained through Eq. (26) we now proceed to cluster the countries on the basis of various development indices. For this purpose, we use K-means clustering and the cluster visualization plot that is presented in Fig. 1. For clear understanding, the countries are presented in Table 5 and also in pictorial form in Fig. 2.

Fig. 1
figure 1

Cluster plot with K-means method

Table 5 Countries with respective clusters
Fig. 2
figure 2

Cluster plot of the countries

Before analyzing the characteristics of the four clusters obtained in the analysis, at first, we should examine if the four indices of development are influential in discriminating these four groups of countries. This is important as it allows us to recognize whether the groups of developing countries are statistically distinguishable. In other words, it explains whether the cluster exhibits significantly different means in the development indices. For this, we perform a one-way ANOVA analysis to calculate the cluster centroids and see if there exists any significant difference between the clusters. The result of the ANOVA analysis is given in Table 6. The higher and significant value of the F statistic indicates that the relationship between the overall between-cluster variation and the overall within-cluster variation is good. It reflects that the designed indices for identifying groups of countries are significant.

Table 6 ANOVA output

With this result, we now proceed to discuss the characteristics of the four clusters obtained in the study. We compare the classification of the countries made by the World Bank in terms of per-capita GNI, classification of UNDP based on HDI, and the clusters that we obtained from the four indices of development. Table 7 provides an overview of the clusters along with the development taxonomies used by the World Bank and UNDP.

Table 7 Classification of countries

We observe that the first cluster comprises 12 countries, the second cluster consists of 50 countries, while there are 27 and 13 countries each in the third and fourth clusters respectively. Out of 12 countries in the first cluster (C1), 11 countries belong to high-income economies (HIE) and a very high HDI group. In this cluster Thailand belongs to upper-middle-income economies with high HDI. The second cluster (C2) consists of a diverse group of countries where 6 countries belong to low-income economies (LIE), 16 to lower-middle-income economies (LMIE), 21 to upper-middle-income economies (UMIE) and 7 countries are from high-income economies. Comparing with the UNDP classification of countries, the second cluster has 4 countries with low HDI, 14 with middle HDI, 18 with high HDI, and 14 having very high HDI. The third cluster (C3) comprises 2 LIE, 6 LMIE, 9 UMIE, and 10 HIE. The HDI classification for this cluster turns out to be 3 with low HDI, 2 with medium HDI, 11 having high HDI, and 11 with very high HDI. Finally, the fourth cluster (C4) consists of 10 HIE, 2 LMIE, and 1 LIE. However, it has 10 very high HDI countries, 1 high, 1 medium, and 1 low HDI country.

From our clusters, C1 and C4 include a relatively major share of “developed” countries as per the World Economic Situation and Progress (WESP, 2019), except four countries namely, El Salvador, Thailand, Tunisia, and Yemen. However, Thailand and Tunisia are considered as high HDI countries by UNDP. Moreover, all the G7 countries (except Germany) are found in C1 and C4.

Cluster 2 contains almost all the “developing” economies of the world as per the classification made by WESP.Footnote 6 This cluster also includes nine emerging market economies.Footnote 7 Most importantly, the BRICS countries are present in this cluster. Cluster 3 consists of both developed and developing economies. Even though C3 consists of developing countries in terms of income, the developing countries in this cluster are characterized by high and very high HDI. As explained above, 22 out of 27 belong to the very high and high HDI group. This shows that the countries belonging to cluster 3 are more developed in terms of HDI in comparison to the developing countries present in cluster 2.

However, we observe that to some extent our development taxonomy differs from the usual income and HDI classifications as some of the countries labeled as “developed” are not in the developed economies cluster while some developing countries are found to be clustered with developed economies. The basic rationale behind such diverse result is that the clusters in this study are based not only on one dimension of development but includes various facets of well-being that leads to the overall development of the country. We also note that each cluster in the study has its own and characteristic development issues. There is no group of countries with the best (or worst) indicators in all development dimensions. Therefore, it would not be appropriate to classify one cluster as the developed countries cluster and the other as the least developed economies cluster. Needless to say, it is the framework of our analysis considering each of the dimensions of development independently that allowed us to make such conclusions.

Further, assessing the cluster centroids i.e. the average value of each index of all countries in a certain cluster, we outline a more precise interpretation of the four clusters obtained in the analysis. This comparative procedure enables us to have a clear understanding of each cluster as per the development indices. The outcome is presented in Table 8.

Table 8 Cluster centroid

From Table 8, the four development clusters can be described as follows:

Cluster 1 (C1): Developed with High Social and Institutional development. Based on the average values of the indices, C1 stands first among the four clusters. Besides this, the average value of the social index and institution index for the members in C1 is also higher than the other clusters. This reflects that the countries belonging to cluster 1 have excelled in their social and institution dimension of development when compared to C2, C3, and C4. Moreover, the other dimensions of development namely sustainablility and economic in this cluster are also better than C2 and C3.

Cluster 2 (C2): Moderate development: The average value of the indices in this cluster is lower than all the other clusters and thus could be taken as moderately developed when compared to other clusters. However, as mentioned earlier, a lesser overall average value cannot tag them as underdeveloped. The social development in this cluster is found to be better compared to C4 and on par with C3.

Cluster 3 (C3): Moderately high development: C3 appears to be more developed than C2 as all the indices values of C3 are higher than that of C2, yet lower than C1. This cluster also excels in social and institutional development when compared to C2 and C4. Moreover, C3 also stands second in the average institution index value, next to cluster 1. Finally, based on the average of indices, C3 stands third, marginally lesser than C4.

Cluster 4 (C4): Developed with High Economic and Sustainable development but low Social development. Cluster 4 exhibits a higher value of economic and sustainable development but shows a lower average value in social development in comparison with the other three clusters. Besides this, the institutional development in this cluster is also found lacking when compared to C3. However, based on the average of indices, C4 stands second. As one could see, C4 is driven by excellent economic performance backed with the sustainability of development.

Summary and conclusion

The concept of development has grown multi-dimensional which includes several facets such as economic, social, sustainability, and institutional aspects. However, the traditional way of looking at economic development stressed only on the growth of national income as its indicator is inadequate. Therefore, have we attempted to construct four indices of development namely social, sustainability, economic, and institution index which could capture a wider aspect of development covering 102 countries during the period 1996 to 2015. Further, using these indices the study categorises countries using grey relational analysis and K-means clustering.

The results show that the countries taken for the study can be classified into four clusters with twelve countries in the first cluster, fifty in the second, twenty-seven and thirteen countries in the third and fourth clusters respectively. Further, based on the average value of the four indices, we have also labeled the four clusters (C1, C2, C3, and C4) as developed with high social and institutional development, moderately developed, moderately high developed, and developed with high economic and sustainable development respectively. We observe that the average value of the social and institution index is high for the countries belonging to the first cluster and the average value of the economic and sustainability index is more for the countries in cluster four. While cluster 3 flairs fairly well in social and institutional development, cluster 2 is marginally better in social development.