Introduction

Manufacturing sector is generating a huge amount of unstructured data due to the use of various digital machines, electronic devices, and sensors on shop floors and production lines (Zhong et al. 2015). Managing this unstructured huge data is becoming a herculean task for industry professionals. Brinch et al. (2017) have observed that big data analytics can help in streamlining of this huge data for decision making and planning of operations. Gong et al. (2018) have found that BDA applications are getting popular across different functions of the supply chain. BDA is a new age technology to manage and integrate data to improve manufacturing performance (Bi and Cochran 2014). In addition, BDA can help plant automation in the era of the fourth industrial revolution (Telukdarie et al. 2018; Tseng et al. 2019). The term BDA refers to the use of advanced analytic techniques such as applied mathematical analysis, predictive analytics, data processing, etc.. These techniques provide better insight about the processes that help in timely and right decision making and improving business processes. In the context of BDA, 5 V stands for the following characteristics of data: volume (dimension of data), velocity (flow rate of data), variety (different formats of data), veracity (uncertainty of data), and value (quality of data)” (Tao et al. 2018). In general, BDA is associated with typical data analysis and mining approaches, employed on a big volume of data. Data usually comes from various sources and needs to go through a sequence of procedures for meaningful analysis (Chen et al. 2014). BDA may be applied across the different functions of the supply chain, such as sourcing, manufacturing, distribution, and marketing (Sanders 2016).

BDA plays a crucial role in improving business benefits and operational efficiency. A business benefit is a quantifiable outcome of an activity or decision that helps meet business goals. For example: cost savings, revenue growth, improved quality, risk reduction, etc. Application of BDA in the manufacturing sector reduces processing defects to get better quality of items produced in a manufacturing firm, saves time and money, as it aids in better and real-time decision making. The real-time data capture and instant processing through advanced algorithms in BDA will help in better and real-time decision making. It has the ability to analyze equipment failures, production bottlenecks, supply chain deficiencies, etc., that enables better decision making. BDA also gives manufacturers a better insight by identifying patterns, measuring impact, and predicting outcomes. BDA is helpful to improve product lifecycle management (Li et al. 2015). In addition, BDA is useful to other areas such as performance management, health care, and governmental services (Elgendy and Elragal 2014). BDA provides improved data-driven decision making, which lead to deeper insight into business operations and as a result better performance management. In the health industry, a large amount of data is generated to control and monitor the various processes of treatment, protection, and management of patient’s medical records, and regulatory requirements. BDA helps in health care as online patient scheduling, to prescribe treatments, and make clinical decisions with greater accuracy. BDA assists the government in fighting crime, enhancing transparency, improving transportation, etc. by applying predictive analytics and machine learning to big data and providing real-time access. In addition to this, with the high volume of data from sensors and satellite images, government can forecast in advance and quickly take action to minimize losses due to natural disaster. Higher productivity goals may be also achieved if predictive analytic tools are employed for monitoring and forecasting of workers’ performance. Zhong et al. (2016) has observed that BDA plays a significant role in the efficient management of the health care industry and the digitization of records. Here, the digitalization of record means all the available data in a text-searchable format to retrieve the content easily for specific information. The data can be shared instantaneously to all the relevant parties, without managing a large volume of paper documents.

BDA reduces the processing time of structured and unstructured data (Barlow 2013). Overall business performance is getting impacted by BDA as BDA-enabled organizational activities assist to lower operating costs, improve product quality, and improve product delivery (Lin et al. 2018). It helps organizations for the better forecast; as a result, organizations realize operational process benefits in the form of cost reductions, better operations planning, lower inventory levels, and elimination of waste. Thus it improves the overall performance of the organization, i.e., profitability, productivity, efficiency, etc. (Gunasekaran et al. 2017). Wamba et al. (2020) have observed that BDA can improve supply chain agility, adaptability, and operational excellence. For a manufacturing organization, BDA increases comparative advantage by strengthening the decision-making process making faster and better decisions, and increasing the capability of organizations (Dubey et al. 2016). It promotes innovation by providing the valuable information to improve the manufacturing operations and offers a means to manage environmental uncertainty; thus, it improves the overall performance of the organization.

Big data predictive analytics also facilitates in the sustainability of supply chains (Gandomi and Haider 2015). Predictive analytics tools of BDA help to get insights from data about what can happen in the future from available big data by discovering meaningful patterns of data using data mining tool like machine learning, artificial intelligence, and pattern repository system. Artificial intelligence and machine learning assist to identify inefficiencies, automate decision making, and improve the customer buying experience, which will improve sustainability of supply chain. Thus, big data predictive analysis assists in identifying and prioritizing the most important environmental and social challenges for improving the sustainability supply chain. According to Dubey et al. (2016), manufacturing organizations should adopt BDA for sustainable manufacturing operations. Continuous monitoring and analysis of operational data in real time helps in removing bottlenecks. Big data analytics helps in defect detection, predict machine failure, minimize risks, improve performance, and reduce downtime. BDA could help manufacturing firms to apply sustainable practices in more effective manner to enhance productivity. Big data support and improve sustainability measure in various operations by better management of sustainability practices of reduce, reuse and recycle. BDA also ensures lean and green manufacturing with resources optimization (Gunasekaran et al. 2017; Ji-fan Ren et al. 2017; Doolun et al. 2018). Sustainable manufacturing operations improve overall firm performance in terms of resources saving and reducing negative environmental impact (Braganza et al. 2017). Bag et al. (2020) have observed in context to the South Africa mining industry that big data analytics significantly contribute to green product development and sustainable supply chain outcomes. The environmental impact of manufacturing processes can influence the reputation of the organization in the long term (Wood et al. 2016).

The manufacturing organizations realize operational benefits in the form of cost reductions, better operations planning, lower inventory levels, better organization of the labor force and elimination of waste, while they leverage improvements in operations effectiveness and customer service as a result of BDA implementation. Popovič et al. (2018) established that the implementation of BDA reduces the consumption of materials (10–15%), energy (about 5%), scrap and rework (about 15%), and manual labor (about 20%). Also, their study revealed that implementation of BDA reduced the maintenance and waste costs for about 12.5% on a year-to-year basis. On the other hand, particularly concerning the delivery of goods to the customer, it increases the satisfaction of customers. Companies incorporate effective decision-making process on the basis of meaningful information derived through data analytics which help them to run smarter, more agile and efficient businesses (Demirkan and Delen 2013).

Belhadi et al. (2019) have observed that researchers have not significantly explored the capabilities of BDA for sustainable manufacturing processes. Apart from it, studies on application of BDA in manufacturing sector in context to devloping countries like India are limited. Many organizastions are still working on silos and not automating their processes as per philosophy of Industry 4.0. This provides a research gap for the problem of current study, i.e., to investigate the strategic factors in implementation of I4.0 technologies such as BDA to achieve sustainable operations goal. The findings of this study will positively impact the successful implementation of BDA in manufacturing operations and will motivate the industry professionals to prioritize their resources to invest in BDA. Manufacturing organizations are facing severe challenges to adopt new technology for sustainable operations (Singh et al. 2019). Therefore, this study investigates strategic factors for implementing BDA in the manufacturing sector to make manufacturing operations more sustainable. The main objectives of this study are as follows:

  • To justify the applications of BDA to ensure sustainable manufacturing operations.

  • To identify strategic factors and rank them for implementing BDA in the manufacturing sector.

  • To categories strategic factors in causes and effects from strategic perspective.

The structure of this paper is as follows: “Literature review” section deals with the literature review on big data applications for sustainable manufacturing operations, and the strategic factors for its implementation. In “Research methodology” section, the research methodology is discussed. “Results and discussion” section presents the obtained results and discussion. Finally, the conclusion, limitations, future scope, and managerial implications of the study are provided in Conclusion, limitations, future scope, and managerial implications” section.

Literature review

In this section, literature review is presented on BDA applications for sustainable manufacturing operations and identification of strategic factors for BDA implementation in the manufacturing sector.

BDA applications for sustainable manufacturing operations

In the prevailing business environment of economic slowdown, manufacturers are aiming to reduce waste and improve value. Customers are looking for high-quality and low-cost products (Dubey et al. 2016; Fercoq et al. 2016). Therefore, there is a challenge in front of manufacturing organizations to meet these expectations. BDA may be a possible solution to this problem due to its many benefits. In this subsection, the authors identify the various benefits of BDA applications in the manufacturing sector from a sustainability perspective from the existing literature which are summarized in Table 1. The main sustainability benefits of BDA application in manufacturing operations are enhanced production recovery/reuse, energy-efficient and safe processes, improved customer satisfaction, improvement in profit margin, waste minimization, resource optimization, developing sustainable capabilities. Here, developing sustainable capabilities means the ability of firms to respond to their short-term financial objectives as well as future goals for profound changes in the current production systems. These benefits are further classified into three aspects, i.e., economic, social, and environmental. The benefit “Enhanced production recovery/reuse” refers to the increase in the production rate of a manufacturing system that is achieved through the effective implementation of various techniques such as lean, Kaizen, six sigma, cloud-based enterprise resource planning, etc., with the help of BDA. Reuse infers that things are utilized by the second user without prior operations or as originally designed. The energy consumption, impact on the environment, and cost, would be reduced by accurate and timely decisions taken with the support of data analytics (Hazen et al. 2016; Raut et al. 2019). “Improved customer satisfaction” is the continuous change due to optimal forecast to meet organization targets and customer requirements. Utilizing BDA, the customer may be effectively involved with green purchasing practices, cleaner production, eco-labelling, and eco-design feedback (Raut et al. 2019). For instance, optimization and machine learning have been used to select suppliers with low carbon emissions and supply chain carbon maps are generated using BDA for identifying hot spots of carbon emission so that they can be reduced (Acquaye et al. 2014; Singh et al. 2018). Additionally, customer loyalty can be improved with BDA analysis of sentiment (Dubey et al. 2016). The firm should have a BDA-based data offering structure to its customers.

Table 1 Big data analytics sustainability benefits for manufacturing organizations

Tseng et al. (2019) stated that to build successful sustainable manufacturing operations, firms should upgrade the synchronization of financial-related decisions, obtain cost information, focus on service and quality of the product, and ensure improved customer satisfaction. BDA helps in reducing costs of manufacturing processes along with final costs of products and components. Using predictive analytics, the manufacturer can schedule predictive maintenance that allows manufacturers to prevent costly asset breakdowns and avoid unexpected downtime which lead to reduce operational costs.

Sustainable manufacturing focuses on resource optimization without compromising with the productivity or effectiveness of manufacturing operations. Resource optimization refers to the optimal usage of available resources and to reduce carbon dioxide emissions, harmful materials, etc., from different manufacturing processes (Piyathanavong et al. 2019). Sustainable natural resource management requires complete thought of different factors with the goal that available resources may meet the requirement of contemporary society along with future generations (Mustapha et al. 2017). Sustainable capability is the ability of firms to respond to their short-term financial objectives as well as future goals. The capacity to integrate intricate resources in a successful way to accomplish sustainable goals, convey sustained values to its stakeholders, and gain sustainable competitive advantage constitute a firm's sustainable capabilities. Subsequently, by coordinating green human resources management, and green supply chain management, etc., firms can develop such sustainable capabilities that lead to enhancing performance at the ecological, environmental and social levels (Amui et al. 2017). Singh and El-Kassar (2019) have observed that organizations should have an environmental policy in place and their management should support implementing such environment-friendly practices.

Table 1 categorizes these sustainability benefits from three perspectives of the triple bottom line approach, i.e., social, economical and environmental. In manufacturing organization, minimization of manufacturing time and increasing reuse of components belong to social aspects of sustainability. Economic aspects include reduction of manufacturing cost, maintenance, and recycling. Environmental aspects refer to reduction in carbon dioxide emission, electric consumption, component packaging, and component weight (Raut et al. 2019).

Identification of strategic factors for big data analytics application

In this subsection, authors have reviewed the literature to identify strategic factors of BDA application for sustainable manufacturing. Strategic factors are defined as the attributes required to ensure overall success for an enterprise. In other words, strategic factors include issues vital to an organization's current activities. Based on the review of the literature, several strategic factors are identified. These strategic factors are listed in Table 2 with their brief description. Many organizations in developing countries are constrained with limited access to technology, finances, infrastructure and skilled manpower. However, this is not the case with most of the organizations located in the developed countries (Kumar et al. 2020). Therefore, the stratetegic factors may be different for developing countries from the developed countries. The literature review reveals that a lot of work has been reported on the application of BDA particularly in developed countries (Mathias Kalema and Mokgadi 2017). Many organizations in developing countries are still struggling to leverage the benefits of BDA application for improving their performance from a sustainability perspective. Manufacturing organizations are showing reluctance for technological changes happening in the market with the same pace.

Table 2 Strategic factors for big data analytics application

Research methodology

This study comprises four phases. The first phase is the identification of benefits and strategic factors for BDA application in the manufacturing sector. The second phase is the justification of BDA application in manufacturing, the third phase is the ranking of strategic factors, and the fourth phase is on the categorization of strategic factors into cause and effect. The research framework for all phases of the study is shown in Fig. 1. Based on the literature review, the benefits of BDA application and strategic factors for its implementation are identified. AHP is used for justification of the BDA application. Fuzzy TOPSIS and DEMATEL approaches are applied for the ranking and categorization of strategic factors for BDA application.

Fig.1
figure 1

Proposed research framework

Analytic hierarchy process (AHP)

AHP-based methodology is proposed for justification of BDA application for sustainable manufacturing operations. It is a multi-criteria decision making (MCDM) approach used to solve complex decision making problems. The AHP approach was developed in 1972 (Saaty 1980). This is selected as it is easy to use and has high applicability in MCDM procedures. MCDM procedure is a methodology for decision making where various alternates are ranked on the basis of different criteria. Popular MCDM tools are AHP, TOPSIS,VIKOR, etc. AHP was used for decision making for flexible manufacturing system supply chain justification in Small and medium enterprises (SMEs), prioritizing the factors for coordinated supply chain and microalgae cultivation systems (Singh 2012, 2013; Tan et al. 2017).

As AHP is a hierarchical process, 3 levels are considered for this work. The goal of problem, benefits, and alternatives, i.e., big data-enabled manufacturing (BDM) and without big data-enabled manufacturing (WBDM). These are placed at the first, second, and third level of the hierarchy, respectively. The solution procedure passes through structural hierarchy development, construction and development of comparative judgments, and synthesis of priorities and consistency calculation.

In structural hierarchy development, the authors built analytic hierarchy model for a given problem as shown in Fig. 2. The goal of this problem is to justify the application of BDA in manufacturing sector, and it is placed at top level. The main factors in the present context are benefits of BDA application in manufacturing sector are placed on second level of hierarchy. The justification of BDA is analyzed on the basis of its benefits. At level 3 which is the last level of hierarchy, two alternatives namely big data-enabled manufacturing (BDM) and without big data-enabled manufacturing (WBDM) are positioned as these are the final outcome.

Fig. 2
figure 2

Schematic of the analytic hierarchy process

In the construction and development of comparative judgments, the priorities of elements are determined at every level. The pairwise comparisons are done in terms of which benefit dominates the other. For this purpose, responses from experts are obtained in linguistic terms and converted into crip values. A pairwise comparison matrix (nxn), P1 for all benefits of BDA is constructed on the basis of Satty’s nine-point scale (1994) as given in Appendix 1, and it is expressed as: \(P_{1} = \left[ {a_{ij} } \right]_{nxn}\), where the matrix element \(a_{ij}\) represents the relative importance \(i{\text{th}}\) factor w.r.t. \(j{\text{th}}\) factor and aij*aji = 1 when \(i \ne j\)(symmetric elements are reciprocal) and aij = 1 when \(i \equiv j\)(diagonal elements are reciprocal).

Next, the matrix P1 is normalized by dividing each entry in a column by the sum of column entries. Let bij be the normalized value of ith criteria with respect to jth criteria, and it is expressed as: \(b_{ij} = \frac{{a_{ij} }}{{\sum\nolimits_{i = 1}^{n} {a_{ij} } }}\) for all j = 1 to n. Further, the normalized matrix (n X n) is used to obtain a priority vector matrix (principal matrix), P2 (n X 1) by taking the average of each row elements. The matrix, P2 is expressed as: \(P_{2} = [c_{ij} ]_{nx1}\) where \(c_{ij} = \frac{{\sum\nolimits_{j = 1}^{n} {b_{ij} } }}{n}\) for all i = 1–n. The matrix P2 is a column vector where the element indicates the weight of each benefit.

The consistency ratio (CR) of the pairwise comparison matrix is determined to check whether the expert responses are consistent. To find the consistency ratio, P3 and P4 column vectors are evaluated using the matrices P1 and P2 as per Eq. (1) (Singh 2012).

$$ P_{{3}} \, = \,P_{{1}} {*}P_{{{2} }} {\text{ and }}P_{{{4}\,}} = \,P_{{3}} /P_{{2}} $$
(1)

Next, \(\lambda_{\max }\) is evaluated by taking average of the P4 vector, i.e., \(\lambda_{\max } = \frac{{\sum\limits_{j = 1}^{n} {p_{j} } }}{n}\) where \(p_{j}\) is the element of column vector P4. Finally, consistency index (CI) is calculated as:

$$ {\text{CI}}\; = \;\frac{\lambda \max - n}{{n - 1}} $$

where n is the size of the matrix.

The ratio of consistency index (CI) to random consistency index (RCI) is known as the consistency ratio which is expressed as:

$$ {\text{CR}}\, = \,{\text{CI}}/{\text{RCI}} $$
(3)

where RCI is random consistency index and RCI value taken as per Appendix 2.

If CI is less than 0.1 decisions are considered as consistent. For CR value more than 0.1, the nature of decisions ought to be revised till CR value reaches in a consistent range.

The acceptable CR depends on the size of the matrix, and it is 0.1 for matrix size 4 × 4 and larger (Saaty 2000). If the value of consistency ratio is equal to or less than the permissible value, it suggests that the assessment within the matrix is satisfactory or shows a good level of consistency in the relative decisions. Similar procedure is followed for last hierarchy for computing the weights of BDM and WBDM for each benefit. The list of variables/symbols used in the expressions/ equations in this work is given in Appendix 9.

Fuzzy TOPSIS approach

Fuzzy TOPSIS is used for the ranking of strategic factors for BDA application for sustainable manufacturing operations. Fuzzy TOPSIS was employed for analyzing disposition strategies in reverse supply chains, disposition decision in reverse logistics, prioritizing strategic factors for reverse logistics, and environmental sustainability (Agrawal et al. 2016; Singh and Agrawal 2018; Samaie et al. 2020). This approach was also implemented to support outsourcing of logistics service, evaluation of 3PL in a supply chain, and supplier selection (Kumar and Singh 2012; Lima Junior et al. 2014).

In the fuzzy TOPSIS method, the data is collected in linguistic terms (later converted on a fuzzy scale) for selected alternatives for the chosen criteria, unlike the TOPSIS approach where crisp values are gathered. Fuzzy TOPSIS is a simple, realistic form of modeling and compensatory method which includes and excludes alternative solutions based on hard cutoff (Singh and Agrawal 2018). Additionally, it is a computation process that can be easily programmed into a spreadsheet that contains a data on a scalar value that represents both the best and worst alternatives at the same time, a sound logic that represents the rationale of human choice and the performance measures of all alternatives can be visualized on polyhedron (Kim et al. 1997). Moreover, the integration of fuzzy will further increases its strength as it is capable of handling vague and uncertain information (Zimmermann 1985). These benefits of fuzzy TOPSIS make it a better choice among MCDM approaches. Therefore, this approach is employed for this work, and the step-by-step procedure of fuzzy TOPSIS is detailed below:

Step 1 Collect the data through the survey method in the form of linguistics terms. The experts should be asked to select the best option. The options are expressed in linguistic terms for a given question. A 5-point scale with the linguistic terms low (L), fairly low (FL), medium (M), fairly high (FH), and high (H) is generally used in the questionnaire. Once the data is collected in linguistic terms, the same is converted into fuzzy numbers.

Step 2 A fuzzy decision matrix is derived based on the data collected in step 1 and converted into triangular fuzzy numbers.

$$ D = \left[ {\begin{array}{*{20}c} {Y_{11} } & {Y_{12} } & \ldots & {Y_{1j} } & \ldots & {Y_{1n} } \\ {Y_{21} } & {Y_{22} } & \ldots & {Y_{2j} } & \ldots & {Y_{21} } \\ \ldots & \ldots & \ldots & \ldots & \ldots & \ldots \\ {Y_{i1} } & {Y_{i2} } & \ldots & {Y_{ij} } & \ldots & {Y_{in} } \\ \ldots & \ldots & \ldots & \ldots & \ldots & \ldots \\ {Y_{m1} } & {Y_{m2} } & \ldots & {Y_{mj} } & \ldots & {Y_{mn} } \\ \end{array} } \right] $$

where Yij = (dij, eij, fij) is a triangular fuzzy number for the linguistic term allocated by the ith respondent to the jth factor. i = 1, 2, …, m are the number of respondents ,and j = 1, 2, …, n are the number of factors (Strategic factors). Table 3 shows the scale in terms of triangular fuzzy numbers used for each linguistic term.

Table 3 Scale for Linguistic terms

Step 3 A fuzzy decision matrix (D) is converted into a fuzzy un-weighted matrix (R) using the following relationship (Singh and Agrawal 2018).

$$ R = \left[ {r_{ij} } \right]_{m \times n;} = \left( {\begin{array}{*{20}c} {\begin{array}{*{20}c} {\frac{{d_{ij} }}{{c_{{j^{*} }}^{*} }},} & {\frac{{e_{ij} }}{{c_{j}^{*} }},} & {\frac{{f_{ij} }}{{c_{j}^{*} }}} \\ \end{array} } \\ \end{array} } \right)\;{\text{and}}\;R = \left[ {r_{ij} } \right]_{m \times n;} = \left( {\begin{array}{*{20}c} {\begin{array}{*{20}c} {\frac{{d_{j}^{ - } }}{{d_{ij} }},} & {\frac{{d_{j}^{ - } }}{{e_{ij} }},} & {\frac{{d_{j}^{ - } }}{{f_{ij} }}} \\ \end{array} } \\ \end{array} } \right) $$
(4)

For benefit criteria, \(c_{{j^{*} }}^{*}\) = \(\mathop {\max }\limits_{i} c\)\(and\) for cost criteria, \(d_{j}^{ - }\) = \(\mathop {\min d_{ij} }\limits_{i}\).

Step 4 Evaluate the weighted normalized decision matrix (V) using Eq. 5 (Singh and Agrawal 2018).

$$ {\text{V}}\, = \,{\text{R}}*{\text{W}} $$
(5)

where W is the weight of vector criteria as evaluated with AHP and \(V_{{}} = \left[ {v_{ij} } \right]_{m \times n;}\) i = 1, 2,… m; j = 1,2,…n

Step 5 Generate the ideal and negative ideal solution for the strategic factors using Eq. 6 (Singh and Agrawal 2018).

$$ {\text{A}}^{ + \,} = \,\left\{ {{\text{V}}_{{{\text{w1}}}}^{*} ,{\text{ V}}_{{{\text{w2}}}}^{*} \ldots {\text{V}}_{{{\text{wn}}}}^{*} } \right\}{\text{ and A}}^{ - \,} = \,\left\{ {{\text{V}}_{{{\text{w1}}}}^{ - } ,{\text{ V}}_{{{\text{w2}}}}^{ - } \ldots {\text{V}}_{{{\text{wn}}}}^{ - } } \right\} $$
(6)

The values as per Eq. 7 are considered for the ideal and negative ideal solution.

$$ {\text{V}}^{*} = \, \left( {{1},{ 1},{ 1}} \right){\text{ and V}}^{ - } = \, \left( {0, \, 0, \, 0} \right) $$
(7)

Step 6 Compute the total of distances from fuzzy positive ideal solution (FPIS) and fuzzy negative ideal solution (FNIS) for each factor using Eq. 8 (Singh and Agrawal 2018).

$$ D^{ + } = \frac{{\sum\limits_{i = 1}^{m} {d\left( {V_{{}} - V^{ + } } \right)} }}{m} $$
(8)

\(d\left( {V_{{}} - V_{{}}^{*} } \right)\) is the distance between two fuzzy numbers which is determined using Eq. 9

$$d({V}_{1}\hspace{0.17em}-\hspace{0.17em}{V}_{2})\hspace{0.17em}=\hspace{0.17em}\sqrt{\frac{1}{3}[{\left({g}_{2}-{g}_{1}\right)}^{2}+{\left({h}_{2}-{h}_{1}\right)}^{2}+{\left({i}_{2}-{i}_{1}\right)}^{2}]}$$
(9)

On the similar lines, the distance from the negative ideal solution is evaluated using Eq. 10

$$ D_{{}}^{ - } = \frac{{\sum\limits_{i = 1}^{m} {d\left( {v_{{}} - v^{ - } } \right)} }}{m} $$
(10)

Step 7 Compute the relative closeness to the ideal solution using Eq. 11

$$ C = D^{ - } /D^{ + } + D^{ - } $$
(11)

Step 8 Rank the strategic factors based on the order of the values of C.

DEMATEL approach

DEMATEL method was applied for evaluating direct and indirect influences among criteria in multi-criteria decision scenarios (Gandhi et al. 2015). DEMATEL was applied for prioritizing of investment projects portfolio, agri-food supply chains for sustainable initiatives, and analysis of cloud adoption determinants (Hidayanto et al. 2015; Altuntas and Dereli 2015; Mangla et al. 2018). This method was also applied to evaluate green supply chain management practices and enablers in solar power developments (Lin 2013; Luthra et al. 2016). Singh et al. (2019) have used DEMATEL for analyzing the application of ICT in SMEs from the food industry. The steps used in AHP, Fuzzy TOPSIS and DEMATEL approach are described in the following subsection. The step by step process of DEMATEL approach is given below:

Step 1 Collect expert response and evaluate their average to obtain average matrix Z.

Consider ‘m’ experts and ‘n’ factors for the analysis. Expert opinion is taken based on pairwise comparison to get the degree of direct influence between two factors. xij denotes the degree of influence of factor ion factor j as per expert view. The integer scores of 0, 1, 2, 3, and 4 denote no influence, low influence, medium influence, high influence, and very high influence, respectively. An n x n nonnegative matrix, Xk = [\(x_{ij}^{k}\)] is obtained from each expert. The average matrix Z = [zij] is obtained as per Eq. 12 and represents the aggregate of all responses.

$${Z}_{ij}=\frac{1}{m}{\sum }_{i=1}^{m}{x}_{ij}^{k}$$
(12)

Step 2 Generate the normalized initial direct-relation matrix, N

The matrix, N = [nij], where the value of each element in matrix N is ranged between [0, 1], and it is evaluated using Eq. 13.

$$ {\text{N}} = \lambda *{\text{Z}},{\text{ or}} \left[ {n_{ij} } \right]_{{{\text{nxn}}}} = \lambda \left[ {z_{ij} } \right]_{{{\text{nxn}}}} $$
(13)

where \(\lambda \; = \;Min\left[ {\frac{1}{{\max 1 \le i \le n\sum\nolimits_{j = 1}^{m} {\left| {Z_{ij} } \right|} }},\frac{1}{{\max 1 \le i \le n\sum\nolimits_{j = 1}^{n} {\left| {Z_{ij} } \right|} }}} \right]\).

Step 3 Develop the total relation matrix Y.

Total relation matrix Y is derived using Eq. 14 and its individual element represents the indirect effect of factor i on factor j. Matrix Y shows the total relationship between each pair of strategic factors.

$$ Y\; = \;N\left( {I - N} \right)^{ - 1} $$
(14)

where I is the Identity matrix.

Step 4: Determine the sums of rows and columns of Total relation matrix Y.

The sums of rows and columns of matrix Y are denoted by vectors SR and SC, and these are evaluated using Eq. 15.

$$ S_{R} = \left[ {r_{i} } \right]_{nx1} = \left( {\sum\nolimits_{j = 1}^{n} {y_{ij} } } \right)_{nx1} \;{\text{and}}\;S_{C} = \left[ {c_{j} } \right]_{1xn}^{^{\prime}} = \left( {\sum\nolimits_{j = 1}^{n} {y_{ij} } } \right)_{1xn}^{^{\prime}} $$
(15)

The values of SR and SC indicate the total given and received both, directly and indirectly, effects that factor i has on the other factors.

Step 5 Develop a cause and effect relationship.

The cause and effect diagram is constructed in a coordinate plane using the values of SR + SC and SR—SC as abscissa and ordinate, respectively. Interrelationships among system factors are established using the cause and effect diagram. Based on the values of (SR—SC), strategic factors are classified into cause and effect group. If the score of (SR—SC) is positive, strategic factors fall in cause group and directly affect other strategic factors. On the other hand, if the score of (SR—SC), is negative, such strategic factors belong to the effect group, and these are influenced by the other strategic factors.

Results and discussion

As discussed in the research methodology framework (Fig. 1), this study is carried out in four phases. The first phase of identification of BDA benefits and strategic factors for its implementation has been done through literature review. The remaining three phases are analyzed based on using different MCDM techniques as discussed in the preceding section. AHP method is used for the justification of the application of BDA. Fuzzy TOPSIS approach is employed for the ranking of strategic factors, and the DEMATEL tool is utilized for the analysis of cause and effect of strategic factors. This study is applicable to the Indian manufacturing sector. The findings of this work will help the managers to take decision for BDA application for sustainable manufacturing operations in different fields under manufacturing sector. Therefore, the study will motivate to industry professionals to invest in BDA applications across different functions to achieve the goal of sustainable manufacturing operations. Results are discussed in the following subsections.

Justification of BDA application for sustainable manufacturing operations

This subsection evaluates the priority vector in terms of relative weights for the applications identified and the global desirability index (GDI) for two alternatives. The first alternative is big data-enabled manufacturing (BDM), and the second alternative is without big data-enabled manufacturing (WBDM). A higher value of GDI indicates a better alternative. Justification of BDA in the manufacturing sector is analyzed based on the framework developed in “Analytic hierarchy process” section (Fig. 2) using AHP. Initially, a pairwise comparison matrix (P1) for seven applications of BDA at level 2 of the AHP model is developed as shown in Table 4. Each element of this matrix signifies the relative importance. For example, p23 = 5 signifies benefit at second row (Energy efficient and safe processes) has strong importance over benefit at the third column (Improved customer satisfaction). Element p32 is the reciprocal of p23 and interpreted accordingly as per the Saaty scale (refer to Appendix 1). Further, the priority vector is determined as per the procedure in “Analytic hierarchy process” section for all seven benefits and it signifies the relative weight of each benefit. The priority vector is shown in the last column of Table 4. Also, the CR is evaluated following the procedure given in “Analytic hierarchy process” section to examine the degree of consistency in the pairwise comparison of seven applications (Appendix 3), Results of the pairwise comparison are shown in Table 4. The evaluated value of CR is 0.0923 which is less than 0.1. This signifies the good level of consistency in the relative decision about applications.

Table 4 Pairwise comparison matrix of sustainability benefits (Level 2)

Subsequently, priority vector is evaluated for both the alternatives, i.e,. big data-enabled manufacturing (BDM) and without big data-enabled manufacturing (WBDM) for each sustainability benefit. The results are shown in Table 5. For example, for benefit, PF the value of PV is 0.889 and 0.111 for BDM and WBDM, respectively. A higher value of PV in the case of BDM shows that big data-enabled manufacturing is justified when performance forecasting benefit is considered. It is observed from results (Table 5) that in terms of all seven benefits, manufacturing organizations with BDA have more priority vector value in comparison with manufacturing organizations without BDA.

Table 5 Pairwise comparison judgment matrices

Global weight for each alternative is calculated by multiplying the local weight of each alternative to the weight of each benefit, and these evaluated global weights for each alternative are shown in Table 6. Subsequently, the global desirability index (GDI) is obtained by summing the all values of corresponding alternatives. The GDI value for BDM and WBDM is 0.8811 and 0.1189, respectively, as given in Table 6. A higher value of GDI justifies the application of BDA for sustainable manufacturing operations.

Table 6 Weights of applications for alternatives and global desirability index

Ranking of strategic factors for the application of BDA

Strategic factors for BDA application in the manufacturing sector are identified through literature. After discussion with experts, fifteen factors are finalized for their ranking from a strategic perspective. These factors are listed in Table 2. A questionnaire-based survey was conducted in the context of big data analytics application for sustainable manufacturing operations in the Indian manufacturing sector, and the experts were selected from industry and academia. The experts from industries and academia were requested to give the responses in the questionnaire designed for this study. The experts team comprises two production managers, one marketing manager, one operation engineer, one logistics manager, and two academicians. The experts from industries have more than ten years of experience in their domain, whereas the experts from academia have more than fifteen years of experience in teaching and research. In total, seven experts were requested to provide their responses for rating of all 15 strategic factors in linguistic terms. A 5-point scale having the linguistic terms as very low (VL), low (L), medium (M), high (H), and very high (VH) was used for this purpose. The procedure described in “Fuzzy TOPSIS approach” section is applied here for ranking. The authors collected the responses from the experts in linguistic terms and converted them into crisp values by referring scale given in Table 3. Thus, the matrix so obtained is called fuzzy decision matrix D, and it is shown in Appendix 4. Then matrix D is converted into an un-weighted fuzzy matrix, R using Eq. 4, and the same is given in Appendix 5. Further the weighted normalized matrix is evaluated using Eq. 5. This evolves the product of un-weighted fuzzy decision matrix R (Appendix 5) and PV value for sustainability benefits given in Table 4. The same is shown in Table 7.

Table 7 Weighted normalized fuzzy matrix

Next, the distance of the rating of each factor from a positive ideal solution is evaluated using Eq. 8. This is given in Appendix 6. Similarly, the distance of the rating of each factor from a negative ideal solution is evaluated using Eq. 10 and shown in Appendix 7. Further, the total distance of each factor is calculated from the positive and negative ideal solution. These are represented by D+ and D, and the same is given in Appendices 6 and 7. Subsequently, the relative closeness with respect to ideal solution A+ is evaluated using Eq. 11, and the same is used in the ranking of performance. The biggest value of closeness is ranked “1” and the lowest value of closeness is ranked “15.” By following this closeness value, all the strategic factors are ranked and tabulated in Table 8. Commitment and engagement of top management, strategy development for BDA, and development of capability for handling big data are prioritized as 1st, 2nd, and 3rd in their relative importance, and these are very crucial for BDA implementation. Without commitment and support from top management, such kind of high-cost initiatives cannot be successful. Management should also develop a trained workforce to manage huge data through BDA. Responsive information sharing framework and development of contract agreement among all stakeholders are ranked 14th and 15th, respectively, and these factors have relatively less impact on the implementation of BDA.

Table 8 Closeness coefficient matrix and ranking of strategic factors

Categorization of strategic factors in terms of cause and effect

As mentioned earlier, the DEMATEL approach is applied to categorize the strategic factors into two classes, viz. cause and effect. This is implemented by the evaluation of direct and indirect influences among strategic factors. Following the procedure given in “DEMATEL approach” section, the influence matrix data is recorded in terms of the 15 strategic factors from 7 experts and an average influence matrix, Z, is constructed using Eq. 12 (Appendices 3, 4, 5, 6, 7, and 8). Further, a normalized initial direct-influence matrix, N, is derived using Eq. 13. Average influence matrix, Z, and normalized initial direct-influence matrix, N, are given in Table 9. The steps of the DEMATEL approach were evaluated according to the above-mentioned procedure in “DEMATEL approach” section.

Table 9 Average direct-influence matrix, Z, and the normalized initial direct-influence matrix, N

Subsequently in the DEMATEL approach the total influence matrix, Y, is calculated using Eq. 14, and results are shown in Table 10. Further, the rows sum vector (SR), columns sum vector (SC), SR + SC vector, and SR—SC vector of matrix Y are calculated using Eq. 15 and these are shown in Table 11. Finally, the ranking of strategic factors is obtained on the basis of their (SR–SC) values and this is also presented in Table 11. Further, the results of the DEMATEL approach are also summarized in Fig. 3 in cause and effect groups.

Table 10 Total direct-influence matrix (Y)
Table 11 Categorization of strategic factors into the cause and effect groups
Fig. 3
figure 3

The Causal diagram for strategic factors

Following the criteria of the positive score of (SR–SC), eight strategic factors falling in the category of cause group are identified, which directly affect other strategic factors in effect group. These cause group factors are: development of contract agreement among all stakeholders, commitment and engagement of top management, development of capacity for handling big data, problems identification and solving capabilities, strategy development for BDA, availability of quality and reliable data, knowledgeable and capable decision-makers and integrating customers’ requirements with performance framework. These cause group strategic factors are treated as independent factors and these have direct influence on the organization. These strategic factors require more attention for the successful implementation of BDA in the manufacturing sector.

The availability of quality and reliable big data has the highest (SR–SC) score, and it is the most important cause that has the highest direct impact on the other strategic factors. Based on (SR–SC) score development of contract agreement among all stakeholders and Commitment and engagement of top management are placed at the second and third highest positions in cause group strategic factors. This suggests that the development of contract agreement among all stakeholders and the engagement of top management are necessary for the implementation of BDA in manufacturing organizations. Problems identified and solving capabilities, with (SR–SC) score of 1.262 have fourth position pointing its importance on BDA in the manufacturing sector. Further, the knowledgeable and capable decision-makers with (SR–SC) score of 1.2016 are an important factor that will help in taking the right decisions timely for the organization. Next, strategy development for BDA, with (SR–SC) score of 0.3199 will aid in development of strategy. Development of capability for handling big data with (SR–SC) score of 0.2464 is another crucial factor that will ensure data handling. Integrating customer requirements with performance framework has the eighth rank with the smallest (SR–SC) score of (0.14178).

There is similarity up to a certain extent in drawing inferences from Fuzzy TOPSIS and DEMATEL approaches. The top five strategic factors as ranked by Fuzzy TOPSIS (refer to Table 8) are also falling in cause group as identified by the DEMATEL approach. Therefore, management should give more attention to these strategic independent factors as they have a crucial role in implementing BDA for manufacturing organizations.

Further, based on the negative (SR–SC) values seven strategic factors fall in the effect group. There effect group factors are: robust cybersecurity system, coordination among big data stakeholders, process integration and institutionalization, flexible digital infrastructure, data-driven organization culture, process monitoring and control, and responsive information sharing framework. These strategic factors were most affected by the other strategic factors.

Conclusion, limitations, future scope, and managerial implications

In the present uncertain business environment, organizations are exploring application of emerging technologies for decision making and efficient production planning to ensure sustainable operations. BDA can help organization in managing unstructured huge data generated from digitization of operations. So far, the application of BDA for sustainable operations in the manufacturing sector is not extensively researched specifically in context to developing countries like India. Apart from it, the successful application of BDA in manufacturing processes is another big challenge for most of organizations. Major contribution of present study is that it has explored the application of BDA in manufacturing from sustainability perspective. From the literature, it has been observed that BDA application improves process and energy efficiency, product recovery, resource optimization, waste minimization, sustainable capabilities, etc. In spite of many benefits, several manufacturing organizations are still reluctant to integrate BDA with their processes due to heavy investment and risk of failure. Therefore, firstly based on industry inputs, authors justify BDA applications in manufacturing functions by using AHP. It has been observed from the results that BDA integrated manufacturing processes are more sustainable. This result is in line with the findings of other studies done in different context. Researchers have observed that BDA reduces manufacturing waste (Lee et al. 2013), increases equipment availability for the manufacturing process (Munirathinam and Ramadoss 2014), and improves insights into identification of faulty products and thus prevents returns and rework (Lavalle et al. 2011).

Further, strategic factors for BDA implementation are ranked by fuzzy TOPSIS. It has been found that commitment and engagement of top management, development of capability for handling big data, strategy development for BDA, knowledgeable and capable decision-makers, availability of quality and reliable data, process integration, and institutionalization are the major strategic factors for successful implementation of BDA in manufacturing. Commitment and engagement of top management is the most important factor as the top management plays an important role in implementation of BDA and other supporting technologies that may ensure the benefits identified in the study.

Finally, the DEMATEL approach is used to categorize strategic factors in terms of cause and effect. Availability of quality and reliable big data, commitment, and engagement of top management, development of contract agreement among all stakeholders are major factors in the cause category. This information of cause factors will help managers to prioritize the actions for the implementation of BDA. Although findings of the study will be highly motivating to industry professionals to invest in BDA applications to achieve the goal of sustainable operations, but results cannot be generalized. For further validation, empirical study along with few case studies can be carried out to validate the findings. The importance of these factors may vary as per sector and country, so in the future studies may be carried out to compare the findings in different contexts. Generally, industries have operational and business data about their inventories, products, human resources, and distributors. Companies may use big data analytics to capitalize on this and many other sources of data to optimize manufacturing operations. Therefore, companies should formulate their policies and channelize the resources as per priority of strategic factor for a smooth implementation of BDA in manufacturing sector.