1 Introduction

Nowadays, sustainability is an inseparable topic for companies and organizations, especially in their survival and competition. Poor sustainability performance (SP) deprives managers of the stability they need for decision-making and disruption creation (Ding et al., 2016). This causes the organization to be unbalanced during a recession and takes it out of normalcy. Predetermined and anticipated decisions of managers to deal with such an atmosphere in a competitive world is of great importance. Organizational and management decisions play important roles in the overall performance of engineering, procurement, construction, and installation systems (Toutounchian et al., 2018). The ability to recover and return quickly after downturns is associated with the ability to cope with vulnerabilities. This ability will be enhanced if decision-making tools are used to deal with the effects of negative shocks.

Businesses need to identify the factors which affect their SP. More importantly, companies need to identify which elements to prioritize and accordingly manage their limited resources. Studying the key performance indicators (KPIs) of an organization in the past can help examine the SP. On the other hand, it will enable predictions about the organizational system behavior by assessing the past SP of an organization (Dev et al., 2020). Moreover, the past performance of the organization can also indicate its attention to KPIs. Based on an organization's past performance, it is possible to obtain comprehensive information about programs, identify challenges and make plans for the future. Hence, an organization’s performance in the past plays a significant role in its current and future performance.

Since the strategic definition for individual KPIs may disrupt the organization or have high costs, they can be clustered. The combination of KPIs also helps develop strategies to improve the organization's SP and inspire new thinking toward the organization's goals. Accordingly, practical operational, tactical, and strategic planning can be associated with each cluster instead of planning for each KPI. Therefore, it is essential to propose an integrated approach to continuously assess and improve the SP based on its previous performance. For this purpose, the identification and investigation of KPIs affecting the improvement of the SP should be attained. To specify a set of factors for measuring SP, their equilibrium to ensure various dimensions is necessary.

This paper aims to assess the SP of an organization through associated KPIs to improve its functional operating level and return to its normal condition during probable social, economic, and environmental downturns and crises. Dealing with sustainability under social, economic, and environmental dimensions, the research objectives are to evaluate annual SP ranking and to realize the importance degree of factors to be monitored.

With this background in mind, this paper examines these research questions:

  1. 1.

    How can an integrated multivariate quantitative approach be proposed to assess the SP of an organization using annual historical data?

  2. 2.

    What is the importance degree of effective KPIs in SP?

  3. 3.

    What is the relationship between sustainability dimensions in terms of KPIs' classification?

This paper has addressed the aforementioned issue by introducing a simple-to-follow quantitative methodology with five phases and demonstrated a use case for an upstream oil and gas company. In the first phase, KPIs are determined concerning the sustainability dimensions. The second phase implies data gathering and scaling. A unique procedure is proposed for the monetary factors based on annual inflation rates (AIRs) and calculating consumer price indices (CPIs). Phase three develops principal component analysis (PCA) to achieve annual SP ranking and importance degree of KPIs simultaneously. In phase four, numerical taxonomy (NT) is developed to verify and validate the results of the developed PCA. Finally, phase five analyzes the results. Two quantitative methodologies are presented, for which the description of steps of data preparation, computation, and inference are provided. The main contributions of this paper can be highlighted and summarized as follows:

  • Proposing a simple-to-follow multivariate quantitative approach considering sustainability dimensions by synthesizing PCA, NT, clustering, and statistical analysis to assess SP;

  • Proposing a unique procedure to commensurate the monetary factors based on AIRs and CPIs;

  • Developing PCA which is capable of ranking annual SP and KPIs by defining utility functions;

  • Developing NT which can examine both positive and negative ideal values and integrate them for simultaneously evaluating the annual SP and KPIs;

  • Validating and implementing the proposed approach through application in a real-world case study.

The remainder of this paper is organized as follows. Section 2 reviews the related literature. Section 3 describes the proposed methodology. Section 4 deals with the case study and computational results. Verification and validation of the results are investigated in Sect. 5. Section 6 provides discussion and managerial insights. The conclusion is summarized in Sect. 7.

2 Literature review

Although many studies and efforts have been made to improve the SP of organizations, adverse events still occur due to recession and various crises (Azadeh et al., 2017). In times of crisis, organizations face significant threats to their financial, competitive, and survival potential (Pal et al., 2014). Azadeh et al. (2017) studied the performance of a production plant in crises considering management decisions using data envelopment analysis. Chen et al. (2019) proposed a mathematical formulation to capture favorable economic outcomes and environmental performance. Battisti et al. (2019) proposed an integrated approach to specify the role of learning in the performance of small enterprises using datasets in the period of financial crises. According to Alnajem et al. (2021), circular economy research exponentially increases with a 47.1% growth rate per year, and the sustainability topic has been one of the hotspots in recent years.

Many of the studies have tended to specify the organizational KPIs to assess SP. Hayami et al. (2015) found that the suppliers with fewer waste products (WP) desire to have better SP. Khan and Qianli (2017) examined the relationships between environmental and economic factors of the UK using time series data. Sabogal-De La Pava et al. (2021) considered the market value-added (VA) as a criterion to maximize the SP. Shan et al. (2021) studied the effect of economic and energy KPIs in limiting carbon emissions to evaluate SP. Pashapour et al. (2019) specified resilience-based factors to improve the performance of a petrochemical plant in terms of economics. Schwab et al. (2019) examined the effect of economic KPIs on the financial sustainability of a company in Swiss within business growth using quantitative and qualitative data. Ahmad, Muslija, et al. (2021) investigated the relationship between environmental sustainability and economic prosperity in 11 developing economies. D’Inverno et al. (2020) evaluated the performance of water utilities in Italy considering financial and economic factors, service quality, and environmental sustainability. D'Ambra et al. (2020) assessed the economic and social effects of carbon dioxide on sustainability. Singh and Misra (2021) proposed a framework to evaluate the safety performance of laborers by identifying and ranking key safety indicators in the Indian construction industry. Ahmad, Chandio, et al. (2021) examined dynamic interactions among sustainable development, air pollution, and energy investment in china using panel data in 27 provinces of China.

Some scholars have used annual data to evaluate the SP. Ahmad, Jan, et al. (2021) provided a model accompanying sustainability challenges by considering financial-based factors as inputs of production growth during 1997–2017. Brandenburg (2016) evaluated the trends of cost efficiency and SP using 8 years of data in European automotive industries. Ganda and Milondzo (2018) investigated the effect of carbon emissions on the SP of some African organizations. Salmanzadeh-Meydani and Fatemi Ghomi (2019) examined the causalities between economic growth and capital stock in Iran by the vector autoregression model. Forteza et al. (2017) examined the relationship among economic performance, accident rate, and site risk of Spanish companies using annual data. Hussain et al. (2021) evaluated financial development and economic openness on SP using annual data of Pakistan and performing time series methods.

Various studies have assessed the SP in the supply chain. Huo et al. (2018) investigated an organization's ability to improve economic and operational performance through the flexibility of the supply chain. Sabbaghnia et al. (2019) investigated reducing carbon emissions on the members’ profit in a supply chain. Das (2018) examined all three sustainability dimensions to determine their impacts on organizational performance in India. Arıkan et al. (2014) investigated the causalities between economic and environmental performance. Furthermore, most previous studies have focused on the economic dimension, while the social and environmental dimensions have still received less attention (Elfarouk et al., 2021). Moreover, the knowledge system that shapes companies' engineering, procurement, construction, and installation and encompasses their broad concepts and numerous complimentary ingredients have been neglected (Tseng et al., 2021). Even though there are many studies in terms of SP evaluation in the supply chain, the intra-organizational studies regarding the assessment of SP are limited and require more attention.

The literature determines that multiple-criteria decision-making method (MCDM) techniques considerably help to address and assess SP. Moreover, the application of MCDMs is closely interlinked to enable the managers of organizations to make better decisions. Solangi et al. (2020) applied Delphi and fuzzy analytical hierarchy process methods to evaluate the SP of electricity generation in Turkey. Although the application of PCA and NT exists in the literature (Azadeh et al., 2009, 2011; Pashapour et al., 2019; Shirali et al., 2013, 2016), the development to simultaneously ranking the alternatives and determining importance degree of variables has not taken adequate attention. Therefore, this paper contributes to the existing literature by developing PCA, and NT methodologies for the assessment in the context of SP, analyzing the results by utilizing statistical and clustering, and the application in a real-world case study.

3 Methodology description

Figure 1 demonstrates the methodology proposed in this paper.

Fig. 1
figure 1

Proposed methodology

The nomenclatures for sets and indices are defined as follows:

  • \(I\) Set of input factors indexed by \(i \in \left\{ {1, 2, \ldots , a} \right\}\).

  • \(O\) Set of output factors indexed by \(o \in \left\{ {1, 2, \ldots , b} \right\}\).

  • \(F\) Set of financial-based factors indexed by \(f \in \left\{ {1, 2, \ldots , c} \right\}\).

  • \(U\) Set of non-financial-based factors indexed by \(u \in \left\{ {1, 2, \ldots , d} \right\}\).

  • \(V\) Set of all variables including input and output factors indexed by \( v \in \left\{ {1, 2, \ldots , m} \right\}\) where \(V = I \cup O = U \cup F\) and \(m = a + b = c + d\).

  • \(L\) Set of the reduced dimension of principal components (PCs) indexed by \( l \in \left\{ {1, 2, \ldots , p} \right\}\).

  • \(T\) Set of annual records indexed by \(t \in \left\{ {1, 2, \ldots , n} \right\}\).

3.1 Phase 1: Determining factors

The factors are categorized based on the sustainability dimensions. For the social dimension, the factors affecting the performance of personnel are considered. For the economic dimension, the outcome-based factors are specified, which mainly affect the sales and income. For the environmental dimension, pollution-based factors like different emissions are investigated.

Table 1 provides the extracted factors and their related dimensions in sustainability. Also, their input- and output-oriented nature and unit of measurements are given in this table. Output-oriented factors are positive criteria where the great values are favorable. In contrast, input-oriented factors are negative criteria where the small values are favorable.

Table 1 Extracted factors

3.2 Phase 2: Collecting and scaling data

The steps of this phase are as follows.

Step 1 Collecting annual data.

Raw annual data are collected from the studied organization as matrix \(D = \left[ {d_{tv} } \right]_{n \times m}\).

Step 2 Investigating commensurability of factors.

If factors are on the same scale, they are recorded without any changes. Otherwise, they are transformed to the same scale values. The unit of measurement for IV, EV, and VA are reported as million IRR. Therefore, the time value of money is considered, which is necessitated by the requirement of importance degree calculation.

Step 3 Transforming non-scale factors.

The AIR and CPI are applied for transforming monetary factors. CPI is a tool to measure the financial level of products and services to examine price changes and calculates as Eq. (1).

$$ {\text{CPI}}_{t} = {\text{CPI}}_{t - 1} \left( {1 + {\text{AIR}}_{t} } \right)\quad \forall t \in T $$
(1)

To apply Eq. (1), the CPI is considered 100 for the last period. Concerning a backward computation, Eq. (2) calculates the CPI for all periods.

$$ {\text{CPI}}_{t} = \left\{ {\begin{array}{*{20}l} {\frac{{{\text{CPI}}_{t + 1} }}{{\left( {1 + {\text{AIR}}_{t + 1} } \right)}}} \hfill & {\forall t \in T - \left\{ n \right\}} \hfill \\ {100} \hfill & {\forall t = n} \hfill \\ \end{array} } \right. $$
(2)

Step 4 Determining commensurate matrix.

Equation (3) calculates the fixed value of financial factor \(f\) in period \(t\) (\({\text{FV}}_{tf}\)) based on its original value in period \(t\) (\({\text{OV}}_{{{\text{tf}}}}\)). Matrix \(D\) is transformed to the commensurate matrix \(X = \left[ {x_{tv} } \right]_{n \times m}\) by Eq. (4).

$$ {\text{FV}}_{tf} = \frac{{{\text{CPI}}_{n} }}{{{\text{CPI}}_{t} }}{\text{OV}}_{{{\text{tf}}}} \quad \forall t \in T, \,f \in F $$
(3)
$$ x_{tv} = \left\{ {\begin{array}{*{20}l} {d_{tu} } \hfill & {\forall t \in T,\,u \in U} \hfill \\ {\frac{{{\text{CPI}}_{n} }}{{{\text{CPI}}_{t} }}d_{{{\text{if}}}} } \hfill & {\forall t \in T, \,f \in F} \hfill \\ \end{array} } \right. $$
(4)

3.3 Phase 3: Developing PCA

In PCA, multidimensional factors are summarized into non-correlated components, linear combinations of the main factors. It has the following steps.

Step 1 Normalizing annual dataset.

Since the unit of measurement for various factors is different, they are normalized. Equations (5) and (6) apply for normalizing the input- and output-oriented factors, respectively. Matrix \(R = \left[ {r_{tv} } \right]_{n \times m}\) is obtained as a normalized matrix.

$$ r_{ti} = \frac{{\frac{1}{{x_{ti} }}}}{{\mathop {\max }\limits_{t \in T} \left\{ {\frac{1}{{x_{ti} }}} \right\}}} = \frac{{\mathop {\min }\limits_{i \in I} x_{ti} }}{{x_{ti} }}\quad \forall t \in T, \,i \in I $$
(5)
$$ r_{to} = \frac{{x_{to} }}{{\mathop {\max }\limits_{t \in T} \left\{ {x_{to} } \right\}}}\quad \forall t \in T,\, o \in O $$
(6)

Step 2 Standardizing annual dataset.

For homogenizing factors, the standard data are considered with mean zero and unit variance as Eq. (7). Matrix \(S = \left[ {s_{tv} } \right]_{n \times m} = \left[ {S_{1} ,S_{2} , \ldots , S_{m} } \right]\) is a standardized matrix.

$$ s_{tv} = \frac{{r_{tv} - \mathop {{\text{mean}}}\limits_{v \in V} \left\{ {r_{tv} } \right\}}}{{\mathop {{\text{StDev}}}\limits_{v \in V} \left\{ {r_{tv} } \right\}}}\quad \forall t \in T, \,v \in V $$
(7)

Step 3 Computing correlation coefficient matrix.

The covariance between factors \(j\) and \(k\) is calculated as Eq. (8). Covariance depends on the unit of measurements, while the correlation coefficient is a unit-free index. The Pearson's correlation coefficient is calculated as Eq. (9). Since \(S_{1}\), \(S_{2}\), …, \(S_{m}\) are standard, we have Eq. (10). Pearson's correlation coefficient matrix is asymmetrical as Eq. (11). The main diagonal of the matrix is \(+ 1\) because \({\text{Corr}}\left( {S_{j} ,S_{j} } \right) = E\left[ {S_{j} S_{j} } \right] = E\left[ {S_{j}^{2} } \right] = {\text{Var}}\left( {S_{j} } \right) = 1\).

$$ {\text{Cov}}\left( {S_{j} ,S_{k} } \right) = E\left[ {\left( {S_{j} - E\left[ {S_{j} } \right]} \right)\left( {S_{k} - E\left[ {S_{k} } \right]} \right)} \right] = E\left[ {S_{j} S_{k} } \right] - E\left[ {S_{j} } \right]E\left[ {S_{k} } \right]\quad \forall j,\,k \in V $$
(8)
$$ {\text{Corr}}\left( {S_{j} ,S_{k} } \right) = \frac{{{\text{Cov}}\left( {S_{j} ,S_{k} } \right)}}{{\left[ {{\text{Var}}\left( {S_{j} } \right){\text{Var}}\left( {S_{k} } \right)} \right]^{1/2} }}\quad \forall j,k \in V $$
(9)
$$ {\text{Cov}}\left( {S_{j} ,S_{k} } \right) = {\text{Corr}}\left( {S_{j} ,S_{k} } \right) = E\left[ {S_{j} S_{k} } \right]\quad \forall j,\,k \in V $$
(10)
$$ C = \left[ {c_{jk} } \right]_{m \times m} = \left[ {{\text{Corr}}\left( {S_{j} ,S_{k} } \right)} \right]_{m \times m} = \left[ {\begin{array}{*{20}l} { + 1} \hfill & {E\left[ {S_{1} S_{2} } \right]} \hfill & \cdots \hfill & {E\left[ {S_{1} S_{m} } \right]} \hfill \\ {E\left[ {S_{2} S_{1} } \right]} \hfill & { + 1} \hfill & \cdots \hfill & {E\left[ {S_{2} S_{m} } \right]} \hfill \\ \vdots \hfill & \vdots \hfill & \ddots \hfill & \vdots \hfill \\ {E\left[ {S_{m} S_{1} } \right]} \hfill & {E\left[ {S_{m} S_{2} } \right]} \hfill & \cdots \hfill & { + 1} \hfill \\ \end{array} } \right] $$
(11)

Step 4 Calculating eigenvalues.

Equation (12) calculates eigenvalues from the matrix \(C\). Solving Eq. (12), \(m\) eigenvalues are obtained. They are arranged in decreasing order, \(\lambda_{1} , \lambda_{2} , \ldots , \lambda_{m}\) so that \(\sum\nolimits_{v = 1}^{m} {\lambda_{v} = m}\).

$$ \det \left[ {C - \lambda I} \right] = \left| {\begin{array}{*{20}l} {1 - \lambda } \hfill & {E\left[ {S_{1} S_{2} } \right]} \hfill & \cdots \hfill & {E\left[ {S_{1} S_{m} } \right]} \hfill \\ {E\left[ {S_{2} S_{1} } \right]} \hfill & {1 - \lambda } \hfill & \cdots \hfill & {E\left[ {S_{2} S_{m} } \right]} \hfill \\ \vdots \hfill & \vdots \hfill & \ddots \hfill & \vdots \hfill \\ {E\left[ {S_{m} S_{1} } \right]} \hfill & {E\left[ {S_{m} S_{2} } \right]} \hfill & \cdots \hfill & {1 - \lambda } \hfill \\ \end{array} } \right| = 0 $$
(12)

Step 5 Calculating eigenvectors and PCs.

Equation (13) calculates eigenvectors in which \(E_{v}\) is an \(m \times 1\) matrix as Eq. (14)

To avoid multiple solutions, a unit vector is considered by Euclidean norm as Eq. (15). These unit vectors are PCs in this methodology: \({\text{PC}}_{1} , \,{\text{PC}}_{2} , \ldots , \,{\text{PC}}_{m}\).

$$ {\text{CE}}_{v} = \lambda_{v} E_{v} \quad \forall v \in V $$
(13)
$$ E = \left[ {e_{jk} } \right]_{m \times m} = \left[ {\begin{array}{*{20}c} {e_{11} } & {e_{12} } & \ldots & {e_{1m} } \\ {e_{21} } & {e_{22} } & \ldots & {e_{2m} } \\ \vdots & \vdots & \ddots & \vdots \\ {e_{m1} } & {e_{m2} } & \ldots & {e_{mm} } \\ \end{array} } \right] = \left[ {E_{1} , E_{2} , \ldots , E_{m} } \right] $$
(14)
$$ E_{v} = E_{v}^{T} E_{v} = e_{1v}^{2} + e_{2v}^{2} + \cdots + e_{mv}^{2} = 1\quad \forall v \in V $$
(15)

The first extracted PC accounts for the enormous possible variance in the dataset. The correlation between the two PCs is zero, and they are orthogonal. Generally, Eq. (16) is established among the coefficients of PCs.

$$ E_{j}^{T} E_{k} = e_{1j} e_{1k} + e_{2j} e_{2k} + \cdots + e_{mj} e_{mk} = 0\quad \forall j,k \in V\quad {\text{and}}\quad j \ne k $$
(16)

Step 6 Reducing the number of PCs.

The variance explained by each PC is calculated as Eq. (17), where \(w_{v}\) can be considered as a weight criterion of the \(v\) th PC.

$$ w_{v} = \frac{{\lambda_{v} }}{{\mathop \sum \nolimits_{v = 1}^{m} \lambda_{v} }} = \frac{{\lambda_{v} }}{m} \forall v \in V $$
(17)

Three criteria are applied to reduce the number of PCs as follows.

  • Scree test: The eigenvalues are plotted in decreasing order of their values. The point at which the last significant drop or break occurs determines the number of PCs (Ledesma et al., 2015).

  • Kaiser rule: Those PCs whose eigenvalues are greater than one are considered.

  • Explained variance: The Pareto principle or 20/80 rule considers \(p\) first PCs to define 80 percent of variance.

Considering these criteria, the number of PCs is reduced to \(p\) where \(p < m\).

Step 7 Determining performance values.

Equation (18) calculates the performance value of record \(t\) for \({\text{PC}}_{l}\). \(n\) annual records regarding \(p\) PCs constitute matrix \(Y = \left[ {y_{tl} } \right]_{n \times p}\).

$$ y_{tl} = \mathop \sum \limits_{v \in V} s_{tv} e_{vl} \quad \forall t \in T,\,l \in L $$
(18)

Step 8 Scores and annual ranking.

The total weighted performance of each annual record denotes the score of that record and is calculated as Eq. (19). The higher the \(z_{t}^{{{\text{PCA}}}}\), the better the annual record is. The ranking of annual records is made via these values.

$$ z_{t}^{{{\text{PCA}}}} = \mathop \sum \limits_{l \in L} w_{l} y_{tl} \quad \forall t \in T $$
(19)

Step 9 Importance degree of factors.

The influential factors are those that lead to the most score for the annual records and the most standard value. Therefore, the aggregation of record scores and standard values is applied for comparative measurement of factors as Eq. (20).

$$ A_{v}^{{{\text{PCA}}}} = \mathop \sum \limits_{t \in T} z_{t}^{PCA} s_{tv} \quad \forall v \in V $$
(20)

\(A_{v}^{{{\text{PCA}}}}\) is linearly normalized between − 1 and + 1 as Eq. (21), named the utility of each factor. The higher magnitude of the \(U_{v}^{{{\text{PCA}}}}\), the more influential the factor is.

$$ U_{v}^{{{\text{PCA}}}} = \frac{{\left( {A_{v}^{{{\text{PCA}}}} - \mathop {\max }\limits_{v \in V} \left\{ {A_{v}^{{{\text{PCA}}}} } \right\}} \right) + \left( {A_{v}^{{{\text{PCA}}}} - \mathop {\min }\limits_{v \in V} \left\{ {A_{v}^{{{\text{PCA}}}} } \right\}} \right)}}{{\mathop {\max }\limits_{v \in V} \left\{ {A_{v}^{{{\text{PCA}}}} } \right\} - \mathop {\min }\limits_{v \in V} \left\{ {A_{v}^{{{\text{PCA}}}} } \right\}}}\quad \forall v \in V $$
(21)

3.4 Phase 4: Developing NT analysis

NT method is able to identify homogeneous states. In this method, factors are applied the same as the PCA. It has the following steps.

Step 1 Normalizing annual dataset.

Similar to step 1 of PCA analysis.

Step 2 Standardizing annual dataset.

Similar to step 2 of PCA analysis.

Step 3 Computing distance matrix.

The distance of every two annual records \(g\) and \(h\) for factor \(v\) is \(\left| {s_{gv} - s_{hv} } \right|\). Therefore, Euclidean distance of annual record \(g\) from \(h\) can be computed as Eq. (22). This is done to homogenize the annual records. Computing pairwise distances, the symmetrical distance matrix is obtained as \(Q = \left[ {q_{gh} } \right]_{n \times n}\).

$$ q_{gh} = \sqrt {\mathop \sum \limits_{vV} \left( {s_{gv} - s_{hv} } \right)^{2} } \quad \forall g,\,h \in T\quad {\text{and}}\quad g \ne h $$
(22)

Step 4 Calculating upper and lower bounds.

To calculate lower and upper bounds (\({\text{LB}},\,{\text{UB}}\)), vector \(q = \left[ {q_{t} } \right]_{n \times 1}\) is extracted from matrix \(Q\), where \(q_{t}\) is the minimum amount of the \(t\) th row in matrix \(Q\) as Eq. (23).

$$ q_{t} = \mathop {\min }\limits_{h \in T} \left\{ {q_{th} } \right\} = \mathop {\min }\limits_{g \in T} \left\{ {q_{gt} } \right\}\quad \forall t \in T\quad {\text{ and}}\quad g,h \ne t $$
(23)

Equation (24) calculates the bounds. If all \(q_{t}\) are within the bounds, homogeneity is attained, and the next step is performed. Otherwise, cluster analysis should be made until all annual records become homogeneous. To do so, the records outside the bounds are removed. Again, the standard matrix is formed with these records removed. It repeats until the homogeneity is achieved.

$$ \left\{ {\begin{array}{*{20}c} {{\text{LB}} = \mathop {{\text{mean}}}\limits_{t \in T} \left\{ {q_{t} } \right\} - 2\mathop {{\text{StDev}}}\limits_{t \in T} \left\{ {q_{t} } \right\}} \\ {{\text{UB}} = \mathop {{\text{mean}}}\limits_{t \in T} \left\{ {q_{t} } \right\} + 2\mathop {{\text{StDev}}}\limits_{t \in T} \left\{ {q_{t} } \right\}} \\ \end{array} } \right. $$
(24)

Step 5 Positive and negative ideal values.

The positive and negative ideal values for each factor are as Eqs. (25) and (26), respectively.

$$ s_{v}^{{{\text{PIV}}}} = \mathop {\max }\limits_{t \in T} \left\{ {s_{{{\text{tv}}}} } \right\}\quad \forall v \in V $$
(25)
$$ s_{v}^{{{\text{NIV}}}} = \mathop {\min }\limits_{t \in T} \left\{ {s_{{{\text{tv}}}} } \right\}\quad \forall v \in V $$
(26)

Step 6 Distance from positive and negative ideal values.

The positive and negative distances of factor \(v\) for annual record \(t\) in the standard matrix are \(s_{v}^{{{\text{PIV}}}} - s_{tv}\) and \(s_{tv} - s_{v}^{{{\text{NIV}}}}\). Therefore, Euclidean distance of annual record \(t\) for all factors from positive and negative ideal values is calculated as Eqs. (27) and (28), respectively.

$$ c_{t}^{{{\text{PIV}}}} = \sqrt {\mathop \sum \limits_{vV} \left( {s_{v}^{{{\text{PIV}}}} - s_{tv} } \right)^{2} } \quad \forall t \in T $$
(27)
$$ c_{t}^{{{\text{NIV}}}} = \sqrt {\mathop \sum \limits_{vV} \left( {s_{tv} - s_{v}^{{{\text{NIV}}}} } \right)^{2} } \quad \forall t \in T $$
(28)

Step 7 Positive and negative growth levels.

Each annual record's positive and negative growth levels are calculated as Eqs. (29) and (30), respectively.

$$ \begin{aligned} f_{t}^{{{\text{PIV}}}} = & \frac{{c_{t}^{{{\text{PIV}}}} }}{{c^{{\text{PIV*}}} }}\quad \forall t \in T \\ {\text{where}}\quad c^{{\text{PIV*}}} = & \mathop {{\text{mean}}}\limits_{t \in T} \left\{ {c_{t}^{{{\text{PIV}}}} } \right\} + 2\mathop {{\text{StDev}}}\limits_{t \in T} \left\{ {c_{t}^{{{\text{PIV}}}} } \right\} \\ \end{aligned} $$
(29)
$$ \begin{aligned} f_{t}^{NIV} = & \frac{{c_{t}^{{{\text{NIV}}}} }}{{c^{{\text{NIV*}}} }} \forall t \in T \\ where\quad c^{{\text{NIV*}}} = & \mathop {{\text{mean}}}\limits_{t \in T} \left\{ {c_{t}^{{{\text{NIV}}}} } \right\} + 2\mathop {{\text{StDev}}}\limits_{t \in T} \left\{ {c_{t}^{{{\text{NIV}}}} } \right\} \\ \end{aligned} $$
(30)

Since \(c_{t}^{{{\text{PIV}}}} ,c_{t}^{{{\text{NIV}}}} \ge 0\), \(\mathop {{\text{mean}}}\limits_{t \in T} \left\{ {c_{t}^{{{\text{PIV}}}} } \right\},\mathop {{\text{mean}}}\limits_{t \in T} \left\{ {c_{t}^{{{\text{NIV}}}} } \right\} \ge 0\). Also, \(\mathop {{\text{StDev}}}\limits_{t \in T} \left\{ {c_{t}^{{{\text{PIV}}}} } \right\},\mathop {{\text{StDev}}}\limits_{t \in T} \left\{ {c_{t}^{{{\text{NIV}}}} } \right\} \ge 0\). Hence, \(c^{{\text{PIV*}}} ,\,c^{{\text{NIV*}}} \ge 0\). Therefore, \(f_{t}^{{{\text{PIV}}}} ,\,f_{t}^{{{\text{NIV}}}} \ge 0\). \(f_{t}^{{{\text{PIV}}}}\) and \(f_{t}^{{{\text{NIV}}}}\) indicate the distance of annual record \(t\) from the best and the worst possible one, respectively. Therefore, the lower \(f_{t}^{{{\text{PIV}}}}\) and the higher \(f_{t}^{{{\text{NIV}}}}\) are desirable.

Step 8 Scores and annual ranking.

The scores of annual records are calculated as Eq. (31). The higher the \(z_{t}^{{{\text{NT}}}}\), the better the annual record is. The ranking of annual records can be obtained via these values.

$$ z_{t}^{{{\text{NT}}}} = \frac{{f_{t}^{{{\text{NIV}}}} }}{{f_{t}^{{{\text{PIV}}}} + f_{t}^{{{\text{NIV}}}} }} , \quad 0 \le z_{t}^{{{\text{NT}}}} \le 1\quad \forall t \in T $$
(31)

Step 9 Importance degree of factors.

The aggregated comparative measurement and the utility of each factor in NT are obtained as Eqs. (32) and (33), respectively. The higher the magnitude of the \(U_{v}^{{{\text{NT}}}}\), the more influential the factor is.

$$ A_{v}^{{{\text{NT}}}} = \mathop \sum \limits_{t \in T} z_{t}^{{{\text{NT}}}} s_{tv} \quad \forall v \in V $$
(32)
$$ U_{v}^{{{\text{NT}}}} = \frac{{\left( {A_{v}^{{{\text{NT}}}} - \mathop {\max }\limits_{v \in V} \left\{ {A_{v}^{{{\text{NT}}}} } \right\}} \right) + \left( {A_{v}^{{{\text{NT}}}} - \mathop {\min }\limits_{v \in V} \left\{ {A_{v}^{{{\text{NT}}}} } \right\}} \right)}}{{\mathop {\max }\limits_{v \in V} \left\{ {A_{v}^{{{\text{NT}}}} } \right\} - \mathop {\min }\limits_{v \in V} \left\{ {A_{v}^{{{\text{NT}}}} } \right\}}}\quad \forall v \in V $$
(33)

3.5 Phase 5: Analyzing, validating, and verifying the results

The results are discussed in four categories:

  • First, the impact of dimension reduction in the proposed PCA is examined.

  • Second, extracted factors are clustered in different partitions.

  • Third, the annual SP is studied.

  • And finally, the importance degree of factors is evaluated.

To verify and validate the results of PCA, the annual ranking and importance degree of factors of the PCA and NT are compared. For this purpose, first, the existence of differences in the results is examined by a pairwise comparison test. If there is no significant difference, then the compatibility of the results is carried out. If it is high, it is concluded that the results of PCA are valid.

3.5.1 Pairwise comparison test

The statistical hypothesis is as follows:

$$ \left\{ {\begin{array}{*{20}c} {H_{0} :\mu_{{{\text{PCA}}}} = \mu_{{{\text{NT}}}} } \\ {H_{1} :{\text{Otherwise}}} \\ \end{array} } \right. $$
(34)

where \(\mu_{{{\text{PCA}}}}\) and \(\mu_{{{\text{NT}}}}\) are the mean of annual ranking by PCA and NT, respectively. Since the scales obtained from PCA and NT methods for annual records are different, they cannot be used directly for pairwise comparison. Therefore, the annual ranking of records is evaluated. The Wilcoxon signed-rank nonparametric statistical test is applied with the following steps.

Step 1 Calculate \(\left| {R_{t}^{{{\text{PCA}}}} - R_{t}^{{{\text{NT}}}} } \right|\) and \({\text{sgn}} \left( {R_{t}^{{{\text{PCA}}}} - R_{t}^{{{\text{NT}}}} } \right) \) for \(t = 1, \ldots ,n\), where \(R_{t}^{{{\text{PCA}}}}\) and \(R_{t}^{{{\text{NT}}}}\) are ranking of PCA and NT for annual record \(t\) and \({\text{sgn}}\) indicates sign function.

Step 2 Omit \(\left| {R_{t}^{{{\text{PCA}}}} - R_{t}^{{{\text{NT}}}} } \right| = 0\) annual records. Let \(n_{r}\) the number of the reduced sample size.

Step 3 Order nonzero \(\left| {R_{t}^{{{\text{PCA}}}} - R_{t}^{{{\text{NT}}}} } \right|\) from smallest to largest.

Step 4 Rank so that the smallest \(\left| {R_{t}^{{{\text{PCA}}}} - R_{t}^{{{\text{NT}}}} } \right|\) is 1. For equal values, insert average rank. Let \(R_{t}^{W}\) denote the ranking.

Step 5 Calculate Wilcoxon test statistic as follows which is the sum of signed ranks:

$$ W = \mathop \sum \limits_{t = 1}^{{n_{r} }} \left[ {{\text{sgn}} \left( {R_{t}^{{{\text{PCA}}}} - R_{t}^{{{\text{NT}}}} } \right) \times R_{t}^{W} } \right] $$
(35)

Step 6 Compare \(W\) with the critical value of the Wilcoxon signed-rank test (Wilcoxon et al., 1970). For a two-tailed test, \(H_{0}\) rejects if \(\left| W \right| > W_{{\alpha ,n_{r} }}\).

3.5.2 Compatibility test

To evaluate the compatibility of PCA and NT results, Spearman's rank correlation coefficient is examined as Eq. (36).

$$ \rho = 1 - \frac{{6\mathop \sum \nolimits_{t \in T} \left( {R_{t}^{{{\text{PCA}}}} - R_{t}^{{{\text{NT}}}} } \right)^{2} }}{{n\left( {n^{2} - 1} \right)}} $$
(36)

4 Case study

The case study is an upstream oil and gas company in Iran. The main activities of such companies are contracting in engineering, procurement, construction, and installation in various projects for large offshore structures.

4.1 Data description

The data are reported based on Solar Hijri (SH) calendar in the organization. They cover 15 years from the first of 1382 SH (equals to March 21, 2003) to the end of 1396 SH (equals to March 20, 2018). Let \({\text{AR}}_{t}\) denote annual record \(t\). It should be noted that the total number of incidents is categorized into the NOI and the NOA. NOA implies intense incidents, and personnel needs more than four days' absence and even for several months.

To commensurate IV, EV, and VA, the time value of money was considered. The AIR of Iran is applied according to the report of the central bank of Iran.Footnote 1 Table 2 gives the AIR and calculated CPI for 15 periods. Thereafter, normalizing and standardizing are performed, but the corresponding values are not reported because of brevity.

Table 2 Annual AIR and CPI for considered periods

4.2 Dimension reduction

Table 3 gives the correlation coefficient matrix. The eigenvalues of the correlation matrix in relation to PCA are as shown in Table 4. The higher the eigenvalue, the higher the explained variance of each PC is. Figure 2 illustrates the scree plot of eigenvalues. The scree plot is able to display how much each PC captures from the dataset. According to Table 4 and Fig. 2, the first four PCs comprise about 80 percent of data variability. The magnitudes of eigenvalues in these PCs are more than one. Therefore, the other 10 PCs are ignored according to the three criteria described in the proposed methodology. Table 5 introduces the coefficients of PCs.

Table 3 Correlation coefficient matrix
Table 4 Eigenvalue and weight of PCs
Fig. 2
figure 2

Scree plot of eigenvalues

Table 5 Coefficients of PCs

Figure 3 demonstrates the 2D loading plot of factors with PC1 and PC2. The value of a factor on each PC illustrates how much weight it has on each PC. EV and VA have the highest positive values and NOP, and IV have the highest negative values on PC1. The other point from the 2D loading plot is the angles between factors that specify how a factor is correlated. The factors are categorized into two groups according to their dispersion, schematically considering the first two PCs. Figure 4 displays the 3D loading plot of factors with PC1, PC2, and PC3. Like the 2D loading plot, schematically, considering the first three PCs, the factors are categorized into four groups according to their dispersion.

Fig. 3
figure 3

2D loading plot

Fig. 4
figure 4

3D loading plot

4.3 Performance values, annual ranking, and importance degree

Table 6 provides the performance values, scores, and annual ranking by PCA. Performance values of each annual record are obtained by multiplying standardized values of the dataset by eigenvectors of each PC. Consequently, the scores of PCA for each annual record are attained by multiplying weights of reduced PCs by performance values as Eq. (37). According to these scores, the annual records are ranked. Eventually, the utility and importance degree of factors are gained, as shown in Table 7.

$$ z_{t}^{{{\text{PCA}}}} = 0.3595y_{t1} + 0.1648y_{t2} + 0.1538y_{t3} + 0.1145y_{t4} \quad \forall t \in \left\{ {1,2, \ldots ,15} \right\} $$
(37)
Table 6 Performance values, scores, and annual ranking by PCA
Table 7 Utilities and importance degree of factors by PCA

4.4 Results of NT

Table 8 computes the distance matrix. The distance vector is the minimum distance of annual records given in the last row of the table. Figure 5 illustrates the line chart of the distance vector. Since all of the values of this vector are within the bounds, the homogeneity is confirmed.

Table 8 Distance matrix
Fig. 5
figure 5

Line chart of distance vector

Positive and negative ideal values are extracted, and subsequently, the Euclidean distance from these values and positive and negative growth levels are calculated for each annual record. Table 9 reports the results of the annual ranking from the NT. Eventually, Table 10 gives the utility and importance degree of factors for NT.

Table 9 Results of annual ranking from the NT
Table 10 Utilities and importance degree of factors by NT

5 Verification and validation

The verification and validation of results are performed in different aspects:

  1. 1.

    Categorizing factors in different groups,

  2. 2.

    Investigating the annual SP,

  3. 3.

    Assessing the importance degree of factors,

  4. 4.

    Comparing the results of developed NT with traditional NT.

5.1 Cluster analysis

Based on PCA results, the factors were categorized into two and four partitions. In support of this, cluster analysis is applied. Table 11 shows the results of cluster analysis for both two and four partitions. The nearest neighbor method and the Euclidean distance have been utilized in cluster analysis. Figure 6 shows the dendrogram plot of this analysis. In this figure, line (a) and line (b) identify two and four partitions in clustering, respectively. Two partitions of cluster analysis are the same as the two extracted groups of Fig. 3. In addition, four partitions of cluster analysis are the same as the four determined groups of Fig. 4.

Table 11 Cluster analysis results
Fig. 6
figure 6

Dendrogram plot of cluster analysis

5.2 Comparison of results

It was shown that the first four PCs explain about 80 percent of data variability in PCA. Here, the effect of considering more PCs on the results is investigated. To do so, 90%, 99%, and 100% of data variability, which forms PC1–PC6, PC1–PC10, and PC1–PC14, respectively, are examined. Therefore, the annual ranking and the importance degree of factors by PCA are studied with different variabilities. On the other hand, the NT was performed in two modes: developed and traditional. Tables 12 and 13 give the annual ranking and importance degree of factors with different analyses. These findings express the results by PCA for 90%, 99%, and 100% variabilities are the same. Therefore, one of them is taken for pairwise comparison and compatibility tests. Figures 7 and 8 schematically compare the annual ranking and importance degree of factors by different analyses.

Fig. 7
figure 7

Comparison of annual ranking by different analyses

Fig. 8
figure 8

Comparison of importance degree of factors by different analyses

To statistically compare the results of different analyses, differences between the results are examined. In this paper, the level of significance is considered 95%. Table 14 contains the nonparametric stepwise Wilcoxon signed-rank test for annual ranking results by PCA with PC1-PC4 and developed NT. The Wilcoxon statistic is less than the critical value. Therefore, statistically, the null hypothesis is accepted.

Table 15 summarizes the results of all pairwise comparison tests by Wilcoxon signed-rank and Spearman's rank correlation. It is concluded that there is a direct relationship between the results of all pairwise analyses.

6 Discussion and managerial insights

Two quantitative methodologies are presented, for which the description of steps of data preparation, computation, and inference are provided. Comparable results on the importance degree obtained from the two methodologies increase the confidence in the results of the importance and rankings. Hypothesis testing procedures conduct the comparison of the results of the two methods. After dimensionality reduction to only the important ones, a company's performance is studied based on the clustered factors and ranks of the annual records. Concerning three introduced criteria to reduce the dimension of PCA, instead of 14 factors, the first four PCs can be used as new variables. The results of the Wilcoxon signed-rank test showed no statistically significant difference between the results of PCA with the first 4 PCs and those with more PCs. Spearman's results also showed a high correlation between the results. Therefore, the introduced criteria can reduce the computational complexity and ensure that dimensional reduction is desirable.

2D loading plot and 3D loading plot of PCA schematically categorized the factors in two and four different groups. Cluster analysis for two and four partitions showed precisely the same results. The first partition includes NOP, ECO, ESO, and EN, and the second one comprises NOA, NOL, and ENO, which are related to social and environmental dimensions of sustainability. Reducing workplace pollution can affect personnel performance. Workplace-related factors not only endanger the health of operators but also avoid them being active for several months. Therefore, the SP of the organization can be improved via environmental aspects and providing facilities for personnel. The third partition contains NOI, IV, and PR factors related to the social and economic dimensions of sustainability. NOI causes short-term interruption for personnel and production fluctuations. Thus, a short-term plan for personnel rest seems necessary for appropriate production. In addition, IV and PR are both related to production. Therefore, balancing imports and adapting with PR reduces costs such as inventory and improves the SP of the organization. Finally, the fourth partition encompasses WP, EV, ROI, and VA factors primarily related to the economic dimension of sustainability. All these factors are in the direction of the organization's outcomes and can guarantee the competitiveness and resiliency of the organization in the future. Increasing exports and reducing WP will improve VA and ROI. Therefore, improving these factors should be one of the priorities of the organization.

The annual ranking specified poor SP of organization in recent years. The management should make more efforts to improve it and also learn from lessons from partitioning the factors. EV and VA were identified as the most influential factors in the SP. Both of these factors are related to the economic dimension. Data on these factors indicated interconnected causality. The following remarks can help management to improve SP.

  • Proper marketing expedites the organization to improve EV.

  • Having a plan to reduce WP and using modern technology cause to increase PR.

  • Focusing on EV and IV along with NOP lead to balancing work-related NOI and NOA.

  • Paying attention to PR along with WP accelerates the VA and ROI.

7 Conclusion

This paper dealt with sustainability performance (SP) assessment via a multivariate quantitative approach. To do so, a five-phase methodology was proposed. Related factors were identified by considering sustainability dimensions. For monetary factors, a unique procedure was proposed to transform them. Principal component analysis (PCA) was developed so that it can simultaneously measure the annual SP and the importance degree of the factors. The utility function was defined to recognize the influential factors. Then, the traditional numerical taxonomy (NT) was expanded. The developed NT considers the distances of both positive and negative ideal values by integrating them for ranking. Moreover, it can rank annual records besides identifying the importance degree of factors by proposing a suitable utility function.

An upstream oil and gas company was investigated as a case study. The results derived from the two developed NT and PCA analyses were statistically compared. These analyses were made from four different aspects: (1) the impact of dimension reduction, (2) clustering of factors in different partitions, (3) the assessment of annual SP, and (4) the evaluation of the importance degree of factors.

Cluster analysis showed that the environmental factors affect only social factors related to the personnel. In addition to the environmental dimension, social-based factors influence the economic dimension, especially in production and imports. Also, economic factors have a high interaction in the financial output of the organization and create value-added. The correlation of developed NT with PCA is higher than that of traditional NT and PCA. Regarding the case of the oil and gas company, it can be concluded that the company SP in recent years has deteriorated. Exports value and value-added were identified as the most influential factors.

The paper's contributions are twofold: First, the methodology can be easily scaled to larger data sets which may consider a more significant number of performance factors than those considered in this paper. Second, it can be extended to different industries. The approach of this paper can be applied to other industry organizations to improve their SP. The impact of other factors can be investigated as other topics for future researches. Furthermore, the approach of this paper is extended simultaneous recognition of the importance of variables and alternatives. The knowledge in this paper will be helpful for further academic investigations into this topic and immediate industrial practice.