1 Introduction

Production systems rely heavily on traditional fossil fuels, mainly coal and oil (Wang et al. 2023). It is estimated that industrial sectors account for over 50% of global energy consumption (Safarian 2023). Almost all coal is composed of dead plant material. As a result of accumulated plant material being buried under anoxic conditions for millions of years, and being exposed to high temperatures and pressures over that time, coal was formed (Australian Government 2022). Coal is the world’s largest source of energy for electricity generation and the production of steel, cement, and paper (U.S. Energy Information Administration (EIA), 2021). About 75% of coal is found in only 5 countries (USA, Russia, Australia, China, and India), while the biggest coal consumers are China (54%), India (18%), USA (6%), Japan (3%), and South Africa (2.3%) (Phengsaart et al. 2023). According to Notes from Poland (2022), Poland ranks 9th in the world in coal consumption to generate 70% of electricity, by far the highest figure in Europe. In terms of production, China tops the list supplying about 50% of global coal demand. Other key players in the global coal trade include India (9.9%), Indonesia (7.5%), USA (6.4%), Australia (5.9%), Russia (5.3%) and Poland (1.3%) (Phengsaart et al. 2023).

Moreover, coal-related SCs represent one of the major concerns for stakeholders (Mehmood et al. 2015) since these industries constitute a significant proportion of carbon dioxide (CO2) emissions (Sun and Yang 2021). Iron and steel manufacturing, for instance, emitted about 2,600 million tons of carbon in 2019. This number is expected to rise to 2,700 million tons by 2050 if no sustainable development scenario is applied (U.S. Energy Information Administration (EIA), 2022). As society becomes more aware of the value of the environment, waste disposal (imperfect quality items) and carbon dioxide emissions must become leading indicators of coal SC assessment. According to the European Union's Joint Research Centre, China is the largest emitter of CO2 in the world, with 11,680 Mt (11.680 GT) of carbon dioxide emissions in 2020. This is just over 32% of the world’s total 2020 emissions. The United States and India released the second- and third-highest amount of carbon emissions at 4.535 and 2.411 GT (or roughly 12.6% and 6% of total global emissions). Moreover, Japan and Iran are the 5th and 6th CO2-emitting countries in the world. It should be mentioned that China, the USA, and India are also three of the most populous countries on Earth. In general, developed countries and major emerging markets lead in total carbon dioxide emissions.

Various countries worldwide have set impressive emission-cut goals in the outlook to tackle climate change and the function of sustainable development (Malladi and Sowlati 2020; Sun and Yang 2021). In this effort, environmental administrations around the globe agree that pricing carbon emissions is the most inexpensive and successful means to achieve their emission reduction goals (Environment and Climate Change Canada 2018). The primary carbon pricing strategies are carbon tax, carbon cap, carbon offset, and carbon trade (Malladi and Sowlati 2020), whereas each approach has different costs and carbon reductions. The benefits of applying each carbon emission policy are not equal for companies involved in coal SC. While some carbon policies are more environmentally friendly, others are more economically beneficial.

Moreover, emerging Industry 4.0 technologies and concerns about global warming show that decision-makers need to change their point of view in assessing the SC's performance (Roozbeh Nia et al. 2020). Shifting from traditional assessment methods to novel and more sustainable methods is one of the critical aspects of the fourth industrial revolution. Extended Exergy Accounting is an innovative method that can help SCs become more sustainable (Aghbashlo et al. 2018). This method integrates the effect of non-energetic manufacturing features into the complete loss assessment (Jawad et al. 2018; Sciubba 2011). The primary benefit of employing the extended exergy accounting method in the production system is that this method states all outcomes in Joules (instead of dollars); therefore, acceptable assessments among different products can be achieved (Naderi et al. 2021b; Jawad et al. 2018). Moreover, energy (in terms of Joules) is essential to operate all manufacturing and SC processes (Jawad et al. 2015).

It is true that the energy market (natural gas, oil, and coal) today tends to be maturing and unbalanced, characterized by increasing demand and fluctuating supply (Roozbeh Nia et al. 2021). There are tangible signs to verify that demand and price are not predetermined and can influence a broad collection of market influences and customer behaviors. While some scholars have focused on the direct issues, there are also unforeseen issues such as the economic environment, business events, and global politics (Su et al. 2021). For example, oil and gas prices have risen to their highest levels in nearly a decade because of Russia's unprovoked invasion of Ukraine. As a result, many countries have re-evaluated their energy sources. The fact is that uncertainties in demand and energy consumption significantly affect the total SC cost as the penalty cost of unsatisfied demand increases (Priyan et al. 2022). In response to this issue, Zadeh (1965) proposed "fuzzy set theory (FST)," which translates "ill-defined" data into mathematical terms.

Considering these issues, we can present the main research questions of this study as follows:

  • Q1. Is it possible to assess the sustainability of coal SC under a carbon reduction policy in terms of Joules rather than money to benefit the economy and the environment?

  • Q2. Generally speaking, coal SC in developing countries, or even China, has the lowest overall cost; however, considering sustainability aspects (social, economic, and environmental characteristics) in Joules, does this assumption still hold true?

  • Q3. Which country has the most sustainable coal SC in terms of Joules?

  • Q4. What is the best percentage of exergy components (social, economic, environmental characteristics) to achieve the most significant saving wherever coal SCs are working?

Consequently, the first goal of this study is to find the optimum total exergy of coal SC in different developed and developing countries under carbon trade policy in an uncertain environment (for carbon trade price and customer demand). The second objective is comparing the sustainability of coal SC in eight countries in terms of Joules rather than money. Finally, this research aims to improve the sustainability of coal SC by performing a sensitivity analysis on the three exergy parameters of sustainability (economic, environmental, and social) in the extended exergy accounting method.

2 Literature review

The leading publications related to carbon policies in coal-based industries such as cement, steel, etc. are shown in Table 1. Based on this table, there is no study that employs the extended exergy accounting method or considers uncertain environment for carbon. Therefore, in this section, the literature related to our study is reviewed in two categories: exergy analysis concepts, and the extended exergy accounting method. After that, research gaps and our contributions in this research compared to existing studies are presented.

Table 1 A short review of studies on carbon policies in heavy industries

2.1 Exergy analysis concepts

Although Rant (1956) first introduced the name "exergy," parallel denotations had previously been defined by other researchers. Exergy is the capability to produce work or adequate energy or a quantity of work (Liu et al. 2020). Jaber et al. (2004) tried to connect thermodynamics with inventory management and showed the pertinency of the first and second laws of thermodynamics to manufacture systems through the economic order (production) quantity (EOQ/EPQ) model. Later, Jaber et al. (2006) supposed that the performance of the production systems is like physical systems. Their results showed that the order quantity strategy is to order in more oversized lots less often than when the entropy cost is omitted considering entropy cost. Moreover, Jaber et al. (2009) established Jaber et al. (2004) 's research paper by extending an entropic mathematical model for deciding batch sizes for deteriorating goods. The outcomes of the entropy EOQ model indicated ordering in larger quantities than recommended by the traditional model. Later, Jaber et al. (2011) presented the notion of exergy (valuable energy) cost. The authors added exergy and entropy costs to the EOQ model and established it in a simple reverse logistics system. They supposed forward and backward product streams to be cost-related, and consequently, a revenue method is accepted.

In another study, Jawad and Jaber (2015) proposed using exergy-economics and exergetic costing when developing inventory models. The authors encourage that employing the suggested inventory modeling may be more effective for other sustainable industries. Additionally, Jaber et al. (2017) developed the traditional models of the economical manufacture quantity (EMQ) and Just-in-time (JIT) by comprising other issues. Their outcomes indicated that JIT, which produces items in small quantities more often, experiences lower costs than the EMQ model once associated stress and entropy costs were not counted. Afterward, Jawad et al. (2018) studied the chief issues that can impact the entire cost of an SC, for example, emissions, labor, energy, social effects of shipping, and entropy. The outcomes presented that optimizing the exergetic cost function grows the money significantly to society for a slight extra rise in cost on the section of the SC.

Moreover, in an industrial bread SC in the Netherlands, Banasik et al. (2017a) studied a multi-objective mixed-integer linear programming model to evaluate the collection of eco-efficient solutions relating to manufacturing planning decisions. The authors employed exergy analysis to state environmental performance of the SC. Their outcomes approve the results from the literature that avoidance is the most acceptable waste management policy from an ecological viewpoint. In another study for a mushroom SC, Banasik et al. (2017b) investigated a multi-objective mixed-integer linear programming model to calculate interchanges among financial and ecological gauges and investigate quantitatively substitute recycling tools. The total exergy loss is used in this study as a single metric gauge for environmental performance. They discovered that accepting closing loop tools in modern mushroom manufacture can grow both the overall productivity of the SC and the environmental functioning. Naderi et al. (2021a) presented a mathematical model for enhancing sustainability involving the cost of exergy demolition (entropy) for a coal SC in Iran. The authors employed exergy analysis for a model that involves economic and wasted exergy costs. Their outcomes showed an extra-economic cost, but it will support managers to measure this added cost which is essential for other decisions.

2.2 Extended exergy accounting method

It was Sciubba (1998) who developed the traditional analysis of exergy and later introduced the “Extended Exergy Accounting” method (Sciubba 2003a, 2003b). The extended exergy accounting is expressed as the quantity of the main exergy aggregately exploited to manufacture and discard actual products or services (Song et al. 2019). This method contains energy and material's main aggregate exergy subject and cost corresponding to economic externality (labor and capital) and ecological externality (environmental remediation). The extended exergy accounting connects production systems' processes with surrounding systems (Song et al. 2019). Regarding the method, to the best of the authors' knowledge, only three studies employed this method for inventory management or SC. For example, Jawad et al. (2015) employed the notions of the extended exergy accounting method in inventory management for three factories in the USA, China, and Germany to involve the three aspects of sustainability: financial, ecological, and social. The outcomes presented that the order quantity in the companies is different since the corresponding exergy of money, labor and environment costs are not the same in each company. Later, Jawad et al. (2016) extended the traditional EPQ model by employing the extended exergy accounting method and thermodynamics laws to determine the degree of sustainability of a manufacture-inventory model. The outcomes revealed that an item’s cost has a crucial function in diminishing the model's entropy creation (exergy lost). Moreover, for a conventional cement production SC in China, Song et al. (2019) utilized the extended exergy accounting method to estimate the cumulative exergy consumption (CExC), labor and money exergy, and ecological remediation exergy. They measured cement manufacture's environmental costs and the segments with exergy deficiencies. Finally, Naderi et al. (2021b) studied the utilized exergy for a sustainable SC through an extended exergy accounting method for a food SC in Iran. They suggested a hybrid global- and local-search metaheuristic algorithm to solve the model. Their findings revealed that exergy minimization substantially reduces the cost for society as different from raising the cost in some sections of the SC. For example, the recommended method delivers 4.48% savings in the utilized exergy of the SC through undertaking added economic costs.

To explore more about exergy components, exergy analysis and the extended exergy accounting method in detail, we suggest Arango-Miranda et al. (2018), Dincer and Rosen (2013), and Ehyaei et al. (2019) to interested readers. Additionally, a brief review of papers that used exergy analysis and the extended exergy accounting method (comparing with our proposed model) is available in Table 2. Based on this table, for example, no study considers carbon policy with the extended exergy accounting method.

Table 2 A brief review of research works in exergy analysis of supply chain

2.3 Research gaps and our contributions

Regarding literature review, Tables 1 and 2, there are still several research gaps, including G1. There is a lack of research that assess a SC under carbon policy within an uncertain environment, for example, fuzzy carbon price or customer demand. G2. It is rare to find studies that assess a SC in terms of Joules instead of money (as traditional performance measures) and simultaneously evaluate all sustainability aspects, such as economic, labour, and environmental. G3. There is a lack of examinations that employ the extended exergy accounting method to assess a SC under any carbon reduction policy. As a matter of fact, no exergy analysis method in the literature takes into consideration carbon emission policy. G4. There is a scarcity of studies that compare the sustainability of coal SCs between developed and developing countries under carbon trade policy with the extended exergy accounting method. G5. There is a deficiency of investigation to find the best percentage of exergy components (social, economic, environmental aspects) in the extended exergy accounting method for a SC. G6. In addition, some real-world issues are ignored, such as considering the inventory turnover ratio for SC models, defective quality products discarded into the environment, shipping charges on the whole of coal SC (mining, railway transportation and steel making), vendor managed inventory (VMI) policy for coordinating SC, and the costs of loan/investment for budget limitation. In brief, the three contributions of this study to the literature are as follows:

  • Improving the sustainability of coal SCs in terms of Joules (total exergy rather than traditional monetary objectives) in developed and developing countries under carbon trade policy and the uncertain environment by employing the extended exergy accounting method.

  • Comparing the sustainability of coal SC in eight countries to determine which country has the most sustainable coal SC in terms of Joules.

  • Finding the best value of exergy components (social, economic, environmental characteristics) for coal SC in both developed and developing countries which creates the highest sustainability.

The remainder of the study is structured as follows. In Sect. 3, the problem is outlined, the suppositions are stated, and the problem is mathematically expressed into a fuzzy nonlinear integer-programming model under emission trade policy. In Sect. 4, exergy modeling of fuzzy optimization using extended exergy accounting is presented. The proposed solution method is presented in Sect. 5 to solve the problem. Section 6 presents computational test problems and sensitivity analysis of exergy values to reveal the recommended solution methods' relevance. Finally, conclusions and potential studies are offered in Sect. 7.

3 Problem description and model formulation

3.1 Problem description

Elevated energy market uncertainties (e.g., price and demand), disruptions (e.g., COVID-19 and global warming), and competition (e.g., global market and customer satisfaction) over current years have produced variations (negative and positive) to coal SC administration (Teerasoponpong and Sopadang 2022). It is true that coal is a low-cost and plentiful resource, but carbon dioxide (CO2) from coal usage in industries such as power plant, cement, steel and paper is responsible for about 40% of global greenhouse gas (GHG) emissions. Therefore, it is the responsibility of legislations and coal SC decision making to invest and innovate for cutting their carbon emissions.

This paper is inspired by the studies of Jawad et al. (2016) and Naderi et al. (2021a) and uses them to develop a multi-product multi-limitation EPQ model with backorder for a coal SC in eight countries under the fuzzy environment. Moreover, a VMI contract is employed for a single supplier and multi-buyer to coordinate the coal SC. The extended exergy accounting method with Mega-Joules (MJ) as a universal unit of measure is used to find the total exergy of the model. Besides, the buyers' demand, purchasing price per unit of product, cost of goods sold per unit of product, and carbon price of each unit of carbon are considered fuzzy. A famous carbon reduction policy, called carbon trade, is used to compare the model's performance as a sustainability measure and control the produced carbon emission of SC enterprises. Moreover, three recent metaheuristic algorithms are exercised to obtain a near-optimum solution of the developed exergy fuzzy nonlinear integer programming (EFNIP) to diminish the fuzzy total exergy of a coal SC. Additionally, ten numerical examples, including an actual case study in coal SC in Iran, were presented to display the pertinency of the proposed model. Likewise, the results are compared with the exact method (GAMS) to confirm the outcomes. Finally, a sensitivity analysis with changing the percentage of exergy parameters, including the capital, labor, and environmental remediation, has been done with seven different exergy sets of percentages (A-G) in eight developed and developing countries. Sensitivity analysis aims to find the best exergy values (capital, labour, and environmental remediation) of the extended exergy accounting method that create the highest sustainability for coal SC of Iran, Australia, China, India, Japan, Poland, the USA, and Zimbabwe.

3.2 Assumptions

Considering the purpose of this research to develop the sustainability of coal SC by integrating carbon trade policy and the extended exergy accounting method, we consider the succeeding assumptions for the mathematical preparation. More sophisticated assumptions are considered for future research in Sect. 7. There is a single supplier, multi-buyer coal SC with n products (different grades of coal) when stockout is permitted in the type of backorder for all products. The supplier’s production rate for all products is fixed and known (EPQ model). In this model, quantity discount is not permitted, and the supplier pays the shipping cost whereas the setup and keeping costs are known. There are constraints on the capacity of the buyer’s warehouse, budget and order quantity of a product and the total number of orderings. Additionally, all transportation between supplier and buyers are done by the railway system when distance between them is fixed and known. Moreover,

  1. (a)

    Buyer’s demand for the entire product, the price for all products and the price of carbon trade are fuzzy (trapezoidal fuzzy number).

  2. (b)

    The linear backorder cost per unit per time unit is known for the entire products while the time-independent fixed backorder cost per unit is supposed to be zero.

  3. (c)

    Orders are supposed to be immediate (lead time = 0).

  4. (d)

    Coal Mining (supplier), shipping, and utilizing coal in the steel companies (buyers) produce carbon emission and waste (defective quality products) disposal to the environment.

4 Notations

The indices, factors, and decision variables of the SC model are described in Table 3.

Table 3 Notations

The following subsections will develop a non-exergy mathematical model (a basic model) of the coal SC for carbon trade policy (Sect. 3.4). Then it has converted to a fuzzy model in Sect. 3.5.

4.1 A non-exergy modeling of coal SC under carbon trade policy

4.1.1 Objective function

Carbon trade integrates government regulations and market methods in a flexible policy that the Kyoto Protocol plans. With this policy, companies' carbon emissions are restricted (see Eq. 9); consequently, if a company generates carbon dioxide further than the launched cap, it must purchase extra carbon credits (\({e}^{+}\)). In contrast, the company could sell its carbon credits (\({e}^{-}\)) to other companies on the carbon market (Jiang et al. 2016), whereas the carbon price (\({C}_{trade}\)) is determined by supply and demand in this market (Li et al. 2020). Although the price of carbon is considered known and fixed in the literature, this study considers it fuzzy (see Sect. 3.5). The trading strategy provides businesses with a great motivation to save money by reducing emissions in the most economical methods. This policy is employed in the European Union, Quebec province in Canada, California in the United States of America, and seven areas in China (Haites 2018). Consequently, the carbon trade cost is

$${Z}_{1}=\sum_{j}^{m}{C}_{trade}\times \left({e}_{j}^{+}-{e}_{j}^{-}\right)$$
(1)

The shipping costs accounted for about 40% of the entire delivered cost of coal in 2019 (U.S. Energy Information Administration (EIA), 2019). Transportation costs are also impacted by road distance, accessibility of shipping mode and supply source alternatives, and the competition among coal and other goods for shipping. Therefore, the total transportation cost of coal includes constant (\({t}_{f}\)) and variable (\({t}_{v}\)) costs, along with the cost of loading/unloading coal (\({t}_{L}\)) in/from railcars and cost of equipment (\({t}_{M}\)) is

$${Z}_{2}=\sum_{i}^{n}\sum_{j}^{m}\left[\left(\frac{{D}_{ij}}{{Q}_{ij}}.{t}_{f}\right)+\left({Q}_{ij}.{t}_{v}\right)+\left(\frac{{D}_{ij}}{{Q}_{ij}}.\left(Lo+Un\right).\left({t}_{L}+{t}_{M}\right)\right)\right]$$
(2)

where (\(Lo,Un\)) are the loading/unloading time of coal in/from a railcar. The vendor-managed inventory (VMI) strategy is the regular inventory management in SC in which the upstream company completely controls the inventory at the downstream company’s location (Giovanni 2021). In the VMI system, the determinations about scheduling and amount of buyer’s replenishment are decided by the supplier that is assumed to have comprehensive information concerning the customers’ requirements, to prevent stockouts (Çomez-Dolgan et al. 2021; Maio and Lagana 2020). Therefore, it is expected that the supplier gives the ordering, shipping, and keeping costs rather than the buyer as a part of the stated contract; the buyer gives no cost (Mateen et al. 2014; Yao et al. 2007; Razmi et al. 2010; Pasandideh et al. 2011; Roozbeh Nia et al. 2014, 2015). Furthermore, in an EPQ model with defective quality items and stockout as a backorder that utilizes the VMI strategy, the coal SC’s total inventory cost is established by calculating the ordering/setup (\({TC}_{{O}_{ij}}\)), keeping (\({TC}_{{H}_{ij}}\)), stockout (\({TC}_{{S}_{ij}}\)), and purchasing (\({TC}_{{P}_{ij}}\)) costs as (Pasandideh et al. 2010, 2011)

$${Z}_{3}={TC}_{{O}_{ij}}+{TC}_{{H}_{ij}}+{TC}_{{S}_{ij}}+{TC}_{{P}_{ij}}$$
(3)

where,

$${TC}_{{O}_{ij}}=\sum_{i}^{n}\sum_{j}^{m}\frac{{D}_{ij}}{{Q}_{ij}}\left({K}_{i,s}+{K}_{ij,b}\right)$$
(4)
$${TC}_{{H}_{ij}}=\sum_{i}^{n}\sum_{j}^{m}\frac{{h}_{ij}}{2{Q}_{ij}\left(1-\frac{{D}_{ij}}{{P}_{i}}\right)}{\left({Q}_{ij}(1-{\delta }_{m})\left(1-\frac{{D}_{ij}}{{P}_{i}}\right)-{b}_{ij}\right)}^{2}$$
(5)
$${TC}_{{S}_{ij}}=\sum_{i}^{n}\sum_{j}^{m}\left(\frac{{s}_{1}.{b}_{ij}^{2}}{2{Q}_{ij}\left(1-\frac{{D}_{ij}}{{P}_{i}}\right)}+\frac{{s}_{2}.{b}_{ij}.{D}_{ij}}{{Q}_{ij}\left(1-\frac{{D}_{ij}}{{P}_{i}}\right)}\right)$$
(6)
$${TC}_{{P}_{ij}}=\sum_{i}^{n}\sum_{j}^{m}{C}_{i}{.D}_{ij}$$
(7)

where (\({D}_{ij}, {Q}_{ij},{h}_{ij}\)) are the demand rate, order quantity and holding cost per unit of coal i for buyer j, respectively. As mentioned previously, the existing budget of each buyer could be deposited in a bank account or invested in other projects to get profits. Now, we take into account a real-world balanced limitation (see Sect. 3.4.2) where the total amount of the existing budget for each buyer is restricted (see Eq. 8). To the best of the authors' knowledge, this type of objective function and limitation, have not been studied yet. On the one hand, each buyer's under-achievement budget (\({x}_{j}^{+}\) as a decision variable) is regarded as the benefit. It means this amount of money (\({x}_{j}^{+}\)) may be invested in a new project with an actual interest rate (\({int}^{+}\)) and make a profit (as a \({int}^{+}\times {x}_{j}^{+}\)) for the buyer. On the other hand, the over-achievement budget (\({x}_{j}^{-}\) as a decision variable) is regarded as the cost. It means the buyer must get a loan with the amount of (\({x}_{j}^{-}\)) and an interest rate of (\({int}^{-}\)). After All, the buyer should pay this loan as well as the interest rate (\({x}_{j}^{-}+\left[{int}^{-}\times {x}_{j}^{-}\right]\)) at the end of the period. Therefore, the total cost/benefit associated with the budget of all buyers is

$${Z}_{4}=\sum_{j}^{m}\left[{x}_{j}^{-}+\left({int}^{-}\times {x}_{j}^{-}\right)-\left({int}^{+}\times {x}_{j}^{+}\right)\right]$$
(8)

wherever in Eq. (8), the first two components are linked to the cost functions, and the last part with a negative symbol is related to the benefit obtained. Moreover, under and over-achievement budgets (\({x}_{j}^{+},{x}_{j}^{-}\)) are not known parameters and are considered decision variables. Hence, the non-exergy total cost of coal SC under the carbon trade policy is the summation of \({TC}_{trade}={Z}_{1}+{Z}_{2}+{Z}_{3}+{Z}_{4}\).

4.1.2 The constraints

The constraints of this model are as follows:

$$\frac{\sum_{i}^{n}\sum_{j}^{m}{C}_{0}.{D}_{ij}}{\sum_{i}^{n}\sum_{j}^{m}\frac{{C}_{0}.{\left({Q}_{ij}(1-{\delta }_{m})\left(1-\frac{{D}_{ij}}{{P}_{i}}\right)-{b}_{ij}\right)}^{2}}{2{Q}_{ij}\left(1-\frac{{D}_{ij}}{{P}_{i}}\right)} }\ge {ITR }_{j}$$
(9)
$$\sum_{i}^{n}\left[\left({Q}_{ij}.{\theta }_{m}\right)+\left(\frac{{D}_{ij}}{{Q}_{ij}}.{L}_{j}.{\theta }_{t}\right)+\left({Q}_{ij}.{D}_{ij}.{\theta }_{k}\right)\right]+\left({e}_{j}^{-}-{e}_{j}^{+}\right)={E}_{j}$$
(10)
$$\sum_{i}^{n}\sum_{j}^{m}\left[\left({Q}_{ij}.{\delta }_{m}\right)+\left({Q}_{ij}.\left(1-{\delta }_{m}\right).{\delta }_{t}\right)+\left({Q}_{ij}\left(1-{\delta }_{m}\right).\left(1-{\delta }_{t}\right).{\delta }_{k}\right)\right]\le F$$
(11)
$$\sum_{i}^{n}\left[{Q}_{ij}(1-{\delta }_{m})\left(1-\frac{{D}_{ij}}{{P}_{i}}\right)-{b}_{ij}\right]\le {W}_{j}$$
(12)
$$\sum_{i}^{n}\left[{C}_{i}.{Q}_{ij}(1-{\delta }_{m})\right]+\left({x}_{j}^{+}-{x}_{j}^{-}\right)={X}_{j}$$
(13)
$$\sum_{i}^{n}\sum_{j}^{m}\frac{{D}_{ij}}{{Q}_{ij}}\le {N}_{Max}$$
(14)
$${Q}_{ij}\le {Q}_{Max}$$
(15)
$${b}_{ij}\le {Q}_{ij}$$
(16)

Equation (9) is an inventory turnover ratio (\({ITR }_{j}\)) limitation. To the best of the authors' knowledge, this limitation has not been presented in SC literature before. The inventory turnover ratio is applied as a comparative measure of inventory performance between competitors and is crucial to control inventory (Beklari et al. 2018). This proportion is an economic index that merges the cost of goods sold with average inventories at cost (Kwak 2019). The inventory turnover ratio shows how often inventories are turned over a period. For Eq. (10), as mentioned before, with the policy of carbon trade, each buyer inside coal SC can only produce within an offered cap (\({E}_{j}\)) of emission. If this actual emission amount goes above the emission limit, the company must purchase carbon credits (\({e}^{+}\)). The company can vend these extra emission credits (\({e}^{-}\)) if the actual emission amount runs under the emission limit (Li et al. 2020). Hence, with the emission trade policy, a new emission restriction is included in the model where Eq. (10) corresponds to the total generated carbon in mining, shipping, and steelmaking processes. In Eq. (10), \(({\theta }_{m}, {\theta }_{t},{\theta }_{k})\) are emissions factors in mining, transportation, and steel manufacturer processes, respectively. Additionally, \({L}_{j}\) is the distance between the coal vendor and buyer j. Equation (11) aims to make the model green since it considers a limitation (\(F\)) on total defective products (waste) disposal to the environment by all processes in coal SC. In this equation, \(({\delta }_{m},{\delta }_{t},{\delta }_{k})\) are the proportions of imperfect quality items in mining, transportation, and steel manufacturer processes, respectively. Furthermore, Eq. (12) expresses that the warehouse space of each buyer (\({W}_{j}\)) is restricted, where (\({b}_{ij}\)) is the backorder amount of coal i for buyer j in a cycle (a decision variable).

As shown before, a real-world contractual agreement grants balanced constraints (Eq. 13) for the existing budget of each buyer (\({X}_{j}\)). To the best of the authors' knowledge, this type of limitation has not been given in SC literature in the past. Where Eq. (13) indicates that, on the one hand, if the total paid-out money of a buyer is below the existing budget (\(\sum_{i}^{n}{C}_{i}.{Q}_{ij}(1-{\delta }_{m})<{X}_{j}\)), the buyer saves an amount of (\({x}_{j}^{+}>0\)). It is possible the company invests this amount in a new project and makes a profit (see Eq. 8). On the other hand, if the total paid out money of a buyer is more than the existing budget (\(\sum_{i}^{n}{C}_{i}.{Q}_{ij}(1-{\delta }_{m})>{X}_{j}\)), so the buyer demands to get a loan with the amount of (\({x}_{j}^{-}>0\)). The total cost/benefit linked to this balanced limitation is expressed in Eq. (8). In addition, Eq. (14) is related to the limitation on the total number of orders (\({N}_{Max}\)) by all buyers. Additionally, there is a constraint for the shipping system (railway) while the Max. of shipping capacity (\({Q}_{Max}\)) for each order quantity is stated in Eq. (15). Finally, based on Eq. (16), the quantity of backorder of product i for \(j^{th}\) buyer (\({b}_{ij}\)) in a cycle should be fewer than or equal to its order amount (\({Q}_{ij}\)). It should be mentioned that intending to simplify the mathematical model; we ignore the cost of purchasing (Eq. 7) in our model. Regarding Eqs. (1)–(16) and under carbon trade policy, the non-exergy crisp model of “multi-product” balanced limitations single-vendor multi-buyer (SVMB) EPQ can be easily achieved as

$$\begin{gathered} TC_{trade} = \mathop \sum \limits_{i}^{n} \mathop \sum \limits_{j}^{m} \left[ {\frac{{D_{ij} }}{{Q_{ij} }}\left( {K_{i,s} + K_{ij,b} } \right) + \frac{{h_{ij} }}{{2Q_{ij} \left( {1 - \frac{{D_{ij} }}{{P_{i} }}} \right)}}\left( {Q_{ij} \left( {1 - \delta_{m} } \right)\left( {1 - \frac{{D_{ij} }}{{P_{i} }}} \right) - b_{ij} } \right)^{2} + \left( {\frac{{s_{1} .b_{ij}^{2} }}{{2Q_{ij} \left( {1 - \frac{{D_{ij} }}{{P_{i} }}} \right)}} + \frac{{s_{2} .b_{ij} .D_{ij} }}{{Q_{ij} \left( {1 - \frac{{D_{ij} }}{{P_{i} }}} \right)}}} \right)} \right] + \mathop \sum \limits_{j}^{m} C_{trade} \times \left( {e_{j}^{ + } - e_{j}^{ - } } \right) \hfill \\ + \mathop \sum \limits_{j}^{m} \left[ {x_{j}^{ - } + \left( {int^{ - } \times x_{j}^{ - } } \right) - \left( {int^{ + } \times x_{j}^{ + } } \right)} \right] + \mathop \sum \limits_{i}^{n} \mathop \sum \limits_{j}^{m} \left[ {\left( {\frac{{D_{ij} }}{{Q_{ij} }}.t_{f} } \right) + \left( {Q_{ij} .t_{v} } \right) + \left( {\frac{{D_{ij} }}{{Q_{ij} }}.\left( {Lo + Un} \right).\left( {t_{L} + t_{M} } \right)} \right)} \right] \hfill \\ s.t. \hfill \\ \frac{{\mathop \sum \nolimits_{i}^{n} \mathop \sum \nolimits_{j}^{m} C_{0} .D_{ij} }}{{\mathop \sum \nolimits_{i}^{n} \mathop \sum \nolimits_{j}^{m} \frac{{C_{0} .\left( {Q_{ij} \left( {1 - \delta_{m} } \right)\left( {1 - \frac{{D_{ij} }}{{P_{i} }}} \right) - b_{ij} } \right)^{2} }}{{2Q_{ij} \left( {1 - \frac{{D_{ij} }}{{P_{i} }}} \right)}} }} \ge ITR_{j} \hfill \\ \mathop \sum \limits_{i}^{n} \left[ {\left( {Q_{ij} .\theta_{m} } \right) + \left( {\frac{{D_{ij} }}{{Q_{ij} }}.L_{j} .\theta_{t} } \right) + \left( {Q_{ij} .D_{ij} .\theta_{k} } \right)} \right] + \left( {e_{j}^{ - } - e_{j}^{ + } } \right) = E_{j} \hfill \\ \mathop \sum \limits_{i}^{n} \mathop \sum \limits_{j}^{m} \left[ {\left( {Q_{ij} .\delta_{m} } \right) + \left( {Q_{ij} .\left( {1 - \delta_{m} } \right).\delta_{t} } \right) + \left( {Q_{ij} \left( {1 - \delta_{m} } \right).\left( {1 - \delta_{t} } \right).\delta_{k} } \right)} \right] \le F \hfill \\ \mathop \sum \limits_{i}^{n} \left( {Q_{ij} \left( {1 - \delta_{m} } \right)\left( {1 - \frac{{D_{ij} }}{{P_{i} }}} \right) - b_{ij} } \right) \le W_{j} \hfill \\ \mathop \sum \limits_{i}^{n} C_{i} .Q_{ij} \left( {1 - \delta_{m} } \right) + \left( {x_{j}^{ + } - x_{j}^{ - } } \right) = X_{j} \hfill \\ \mathop \sum \limits_{i}^{n} \mathop \sum \limits_{j}^{m} \frac{{D_{ij} }}{{Q_{ij} }} \le N_{Max} \hfill \\ Q_{ij} \le Q_{Max} b_{ij} \le Q_{ij} \hfill \\ Q_{ij} > 0, integer, i = 1,2, \ldots ,n \hfill \\ b_{ij} \ge 0, integer,j = 1,2, \ldots ,m \hfill \\ x_{j}^{ + } ,x_{j}^{ - } ,e_{j}^{ + } ,e_{j}^{ - } \ge 0, \hfill \\ \end{gathered}$$
(17)

In this non-exergy sustainable model, we are looking to optimize four objectives simultaneously: (a) the total inventory cost, (b) the entire cost associated with the additional required budget of all buyers, (c) the total coal transportation cost among SC members, (d) and the cost of produced carbon emission by all processes. Consequently, we have six decision variables, for example, the amount of required loan/investment for each buyer (\({x}_{j}^{-}\), \({x}_{j}^{+}\)), the carbon credits for each buyer (\({e}_{j}^{+},{e}_{j}^{-}\)), the order quantity of each item for each buyer (\({Q}_{ij}\)), and the amount of backorder of each item for each buyer (\({b}_{ij}\)). The following subsection considers uncertainty to the non-exergy model in Eq. (17).

4.2 The inventory model in fuzzy environment

Stochastic modelling methods can solve the inventory model with sufficient historical data for ambiguous parameters (Aka and Akyüz, 2021). Despite this, it is problematic to have actual and exact random distributions because of the unavailability of historical data on the coal SC in Iran. Moreover, in the real coal SC business world, the market environments are full of ambiguities in a non-stochastic sense (Panja and Mondal 2019). Therefore, most inventory models in the literature consider an impractical assumption; all the inventory settings occur in a deterministic and particular condition. To cope with this unrealistic assumption, Zadeh (1965) proposed “fuzzy set theory (FST),” which converts “ill-defined” data to mathematical terminologies. Accordingly, the problem considered in this study is a fuzzy EPQ SVMB multi-product SC. As discussed in Hanss (2005), different types of fuzzy numbers exist, for example, triangular, trapezoidal, and Gaussian fuzzy numbers. Trapezoidal numbers are usually used to express ambiguous or uncertain information since they can deal with the ambiguity or uncertainty of complex fuzzy information (Wan et al. 2021). Moreover, the trapezoidal fuzzy number is a commonly used representation of uncertain information in real applications (He et al. 2018). Therefore, in this study, the buyer's demands, the unit price of products, the cost of goods sold per unit of product, and the carbon trade price are considered ill-defined and trapezoidal fuzzy numbers.

4.2.1 Graded mean integration representation technique

To figure out and employ the consequent responses from fuzzy SC, the results should be relevant for the top management of the companies. Therefore, defuzzification is necessary (Shekarian et al. 2017). As several techniques for the defuzzification of fuzzy numbers can be applied, one of the most employed, the “graded mean integration” technique (Chen and Hseih 1998), is used in this paper. In most circumstances employing the extension rule to get the membership function of the fuzzy total cost function is not easy. Because the membership function does not alter with fuzzy arithmetic procedures, it is probable to estimate the defuzzified amount immediately through the graded mean integration technique through arithmetic procedures (Mahata and Goswami 2013). Chen and Hseih (1998) method is helpful since it scores each point of support set of fuzzy numbers, and it is probable to determine the level of resemblance among fuzzy numbers concerning graded mean integration amounts. Suppose \(\widetilde{A}=({a}_{1},{a}_{2},{a}_{3},{a}_{4})\) is a trapezoidal fuzzy number and \({L}^{-1}\), \({R}^{-1}\) are correspondingly the inverse functions of L and R. Describe the graded mean h-level amount of \(\widetilde{A}\) as \(\frac{h\left[{L}^{-1}\left(h\right)+{R}^{-1}(h)\right]}{2}\) (Mahata and Goswami 2013). So, the graded mean integration description of fuzzy number \(\widetilde{A}\) can be calculated as

$$P\left(\widetilde{A}\right)=\frac{{\int }_{0}^{1}\frac{h\left[{L}^{-1}\left(h\right)+{R}^{-1}(h)\right]}{2}dh}{{\int }_{0}^{1}h.dh}={\int }_{0}^{1}h\left[{L}^{-1}\left(h\right)+{R}^{-1}(h)\right].dh$$
(18)

For trapezoidal fuzzy number \(\widetilde{A}=({a}_{1},{a}_{2},{a}_{3},{a}_{4})\), \({L}^{-1}\left(h\right)={a}_{1}+\left({a}_{2}-{a}_{1}\right)h\) and \({R}^{-1}\left(h\right)={a}_{4}+\left({a}_{4}-{a}_{3}\right)h\). Afterward, the graded mean integration depiction of trapezoidal fuzzy number \(\widetilde{A}=({a}_{1},{a}_{2},{a}_{3},{a}_{4})\) by Eq. (18) is given by

$$P(\widetilde{A})=\frac{{a}_{1}+2{a}_{2}+2{a}_{3}+{a}_{4}}{6}$$
(19)

Therefore, in this study the buyers’ demand (\(\widetilde{{D}_{ij}}\)), purchasing price per unit of product i (\(\widetilde{{C}_{i}}\)), cost of goods sold per unit of product i (\(\widetilde{{C}_{0}}\)), and trade price of each unit of carbon (\(\widetilde{{C}_{trade}}\)) are considered trapezoidal fuzzy numbers i.e. \(\widetilde{{D}_{ij}}=\left({D}_{ij,1},{D}_{ij,2},{D}_{ij,3},{D}_{ij,4}\right)\), \({\widetilde{C}}_{i}=\left({C}_{i,1},{C}_{i,2},{C}_{i,3},{C}_{i,4}\right)\), \({\widetilde{C}}_{0}=\left({C}_{\mathrm{0,1}},{C}_{\mathrm{0,2}},{C}_{\mathrm{0,3}},{C}_{\mathrm{0,4}}\right)\), and \(\widetilde{{C}_{trade}}=\left({C}_{t,1},{C}_{t,2},{C}_{t,3},{C}_{t,4}\right)\).

5 Exergy modeling of fuzzy optimization of multi-buyer coal SC

The earlier section presents a fuzzy monetary sustainable EPQ model (minimum Dollar or Euro) for a coal SC under a carbon trade policy. In this section, we consider three factors of hidden cost in a coal SC such as capital (Cap), labor (L), and environment (Env.) remediation by employing the extended exergy accounting method and then convert the monetary model (Eq. 17) to the equivalent exergy model.

5.1 Extended exergy accounting

Extended exergy accounting is the quantity of initial exergy (in Joules; J) aggregate consumed in the manufacture, operation, and discarding procedure of certain goods or services. This method delivers more information than an entirely financial method, which cannot support any suggestion about utilizing global resources (Jawad et al. 2016). The initial aggregate exergy includes material (M), and energy (E), corresponding exergy of labor (L), money (Cap.), and ecological (Env.) remediation costs, of which the last three components are counted as the cost correspondence of economic externality and ecological externality (Song et al. 2019). It can be expressed as (Naderi et al. 2021b)

$$EEA={ee}_{M}+{ee}_{E}+{ee}_{Cap}+{ee}_{L}+{ee}_{Env}$$
(20)

where (\({ee}_{M}+{ee}_{E}\)) are the exergy of raw materials and energy flows, used in producing a product. The summation of these two exergies (\({ee}_{M}+{ee}_{E}\)) could be determined by transforming the summation of purchasing costs (\(\sum_{i}^{n}\sum_{j}^{m}{C}_{i}{D}_{ij}\)) in the inventory model to the exergy equivalents (Jawad et al. 2015). As mentioned in Sect. 3.4.2, for simplifying the mathematical model, we ignore the purchasing costs (and therefore exergy equivalents: \({ee}_{M}+{ee}_{E}\)) since it does not affect the model’s order quantity (\({Q}_{ij}\) as decision variable). All related costs should be transformed into comparable exergetic amounts to employ the extended exergy accounting method in an inventory model. The setup (K), buying (C), and keeping (h) costs can be categorized into the summation of three exergetic amounts of capital, labor, and environment \(\left({ee}_{Cap,i}+{ee}_{L,i}+{ee}_{Env,i}\right)\), respectively (Jawad et al. 2018),

$${ee}_{Cap,i}={\left({i}_{Cap}\right)\times ee}_{Cap}$$
(21)
$${ee}_{L,i}={i}_{L}\times {ee}_{L}/Labor\; cost$$
(22)
$${ee}_{Env,i}={{i}_{Env}\times ee}_{Env}$$
(23)

where \(i=K, C, or h\) are calculated in J/order, J/unit, and J/unit/year, respectively. Concerning Eq. (23) for the exergy of environment characteristic, we accept the approach of Chen and Chen (2009), who respected (\({ee}_{Env}={ee}_{Cap}\)). Consequently, Eq. (23) is switched to (\({ee}_{Env,i}={{i}_{Env}\times ee}_{Cap}\)). It comprises any cost paid to get labor, capital, material, and other items used to reduce the damaging environmental effect of manufacturing a product, operating a SC, or delivering some other service (Jawad et al. 2015). Moreover (Jawad et al. 2015, 2018; Sciubba 2011; Naderi et al. 2021b),

$${ee}_{Cap}=\alpha .\beta \left(\frac{{Ex}_{in}}{{M}_{2}}\right)$$
(24)
$${ee}_{L}=\frac{{\alpha .Ex}_{in}}{{(NWH)}_{total}}$$
(25)

where (\({ee}_{Cap},{ee}_{L}\)) are the specific exergy equivalent of one monetary unit (€, $, £, ¥) and the unit equivalent exergy of labor, respectively. Additionally, (\({Ex}_{in}\)) is the total incoming exergy fluctuation (J/yr), can be defined based on the energy budget of the country under investigation. Based on Sciubba (2011), the extended exergy accounting method determines the exergy corresponding to Labour, Money, and Ecological remediation (Eqs. 24 and 25) in goods or services by elements of “α” and “β” and some financial factors like GDP. These aspects are highly inspired by population, labor statistics, regular and international income, and normal workload. The stated aspects and exergy counterparts were examined and figured out by Sciubba (2011) for some developed and developing countries. For example, if setup cost (\(K =30 Euro\)) and we consider the percentages of money, labor, and ecological remediation denote the order cost, e.g., 60%, 30%, and 10%, therefore, \(K_{Cap} = 0.6 \times 30 =18 Euro\), \(K_{L} = 0.3 \times 30 =9 Euro\) and \(K_{Env} = 0.1 \times 30 =3 Euro\). Considering Eqs. (21)–(25), one can calculate the three exergetic values of capital, labor, and environment \(\left({ee}_{Cap,K}+{ee}_{L,K}+{ee}_{Env,K}\right)\) related to setup/order cost to achieve exergy \({K}_{(x)}\).

5.2 Applying extended exergy accounting to fuzzy optimization of multi-buyer coal SC

Under the carbon trade policy, the exergy equivalent of the total cost is (\({{{{TC}_{(x)}=Z}_{(x)1}+Z}_{(x)2}+Z}_{(x)3}+{Z}_{\left(x\right)4}\)), These equivalents can be done with the following formulas (Jawad et al. 2015)

$${K}_{(x)i,s}=\left({ee}_{Cap,K(i,s)}+{ee}_{L,K(i,s)}+{ee}_{Env,K(i,s)}\right)$$
(26)
$${K}_{(x)ij,b}=\left({ee}_{Cap,K(ij,b)}+{ee}_{L,K(ij,b)}+{ee}_{Env,K(ij,b)}\right)$$
(27)
$${h}_{(x)ij}=\left({ee}_{Cap,h(ij)}+{ee}_{L,h(ij)}+{ee}_{Env,h(ij)}\right)$$
(28)
$${s}_{(x)1}={s}_{1}\times \left({ee}_{Cap}\right)$$
(29)
$${s}_{\left(x\right)2}={s}_{2}\times \left({ee}_{Cap}\right)$$
(30)
$${t}_{\left(x\right)f}={t}_{f}\times \left({ee}_{Cap}\right)$$
(31)
$${t}_{(x)v}={t}_{v}\times \left({ee}_{Cap}\right)$$
(32)
$${t}_{(x)L}={t}_{L}\times \left({ee}_{Cap}\right)$$
(33)
$${t}_{(x)M}={t}_{M}\times \left({ee}_{Cap}\right)$$
(34)
$$\widetilde{{C}_{(x)i}}=\left({ee}_{Cap,\widetilde{C\left(i\right)}}+{ee}_{L,\widetilde{C\left(i\right)}}+{ee}_{Env,\widetilde{C\left(i\right)}}\right)$$
(35)
$$\widetilde{{C}_{(x)trade}}=\widetilde{{C}_{trade}}\times \left({ee}_{Cap}\right)$$
(36)
$${X}_{\left(x\right)j}={X}_{j}\times \left({ee}_{Cap}\right)$$
(37)

Therefore, by using the above formulas to the objective functions and limitations of the model in Eq. (17), it is converted to a fuzzy exergy model as follows:

5.3 A fuzzy exergy modeling of coal SC with carbon trade policy

$$\begin{gathered} TC_{{\left( x \right)trade}} = \mathop \sum \limits_{i}^{n} \mathop \sum \limits_{j}^{m} \left[ {\frac{{\widetilde{{D_{{ij}} }}}}{{Q_{{ij}} }}\left( {K_{{\left( x \right)i,s}} + K_{{\left( x \right)ij,b}} } \right) + \frac{{h_{{\left( x \right)ij}} }}{{2Q_{{ij}} \left( {1 - \frac{{\widetilde{{D_{{ij}} }}}}{{P_{i} }}} \right)}}\left( {Q_{{ij}} \left( {1 - \delta _{m} } \right)\left( {1 - \frac{{\widetilde{{D_{{ij}} }}}}{{P_{i} }}} \right) - b_{{ij}} } \right)^{2} + \left( {\frac{{s_{{\left( x \right)1}} .b_{{ij}}^{2} }}{{2Q_{{ij}} \left( {1 - \frac{{\widetilde{{D_{{ij}} }}}}{{P_{i} }}} \right)}} + \frac{{s_{{\left( x \right)2}} .b_{{ij}} .\widetilde{{D_{{ij}} }}}}{{Q_{{ij}} \left( {1 - \frac{{\widetilde{{D_{{ij}} }}}}{{P_{i} }}} \right)}}} \right)} \right]\mathop \sum \limits_{j}^{m} \widetilde{{C_{{\left( x \right)trade}} }} \times \left( {e_{j}^{ + } - e_{j}^{ - } } \right) \hfill \\ + \mathop \sum \limits_{j}^{m} \left[ {x_{{\left( x \right)j}}^{ - } + \left( {int^{ - } \times x_{{\left( x \right)j}}^{ - } } \right) - \left( {int^{ + } \times x_{{\left( x \right)j}}^{ + } } \right)} \right] + \mathop \sum \limits_{i}^{n} \mathop \sum \limits_{j}^{m} \left[ {\left( {\frac{{\widetilde{{D_{{ij}} }}}}{{Q_{{ij}} }}.t_{{\left( x \right)f}} } \right) + \left( {Q_{{ij}} .t_{{\left( x \right)v}} } \right) + \left( {\frac{{\widetilde{{D_{{ij}} }}}}{{Q_{{ij}} }}.\left( {Lo + Un} \right).\left( {t_{{\left( x \right)L}} + t_{{\left( x \right)M}} } \right)} \right)} \right] \hfill \\ s.t. \hfill \\ \frac{{\mathop \sum \nolimits_{i}^{n} \mathop \sum \nolimits_{j}^{m} C_{{\left( x \right)0}} .\widetilde{{D_{{ij}} }}}}{{\mathop \sum \nolimits_{i}^{n} \mathop \sum \nolimits_{j}^{m} \frac{{C_{{\left( x \right)0}} .\left( {Q_{{ij}} \left( {1 - \delta _{m} } \right)\left( {1 - \frac{{\widetilde{{D_{{ij}} }}}}{{P_{i} }}} \right) - b_{{ij}} } \right)^{2} }}{{2Q_{{ij}} \left( {1 - \frac{{\widetilde{{D_{{ij}} }}}}{{P_{i} }}} \right)}}~}} \ge ITR_{j} \hfill \\ \mathop \sum \limits_{i}^{n} \left[ {\left( {Q_{{ij}} .\theta _{m} } \right) + \left( {\frac{{\widetilde{{D_{{ij}} }}}}{{Q_{{ij}} }}.L_{j} .\theta _{t} } \right) + \left( {Q_{{ij}} .\widetilde{{D_{{ij}} }}.\theta _{k} } \right)} \right] + \left( {e_{j}^{ - } - e_{j}^{ + } } \right) \le E_{j} \hfill \\ \mathop \sum \limits_{i}^{n} \mathop \sum \limits_{j}^{m} \left[ {\left( {Q_{{ij}} .\delta _{m} } \right) + \left( {Q_{{ij}} .\left( {1 - \delta _{m} } \right).\delta _{t} } \right) + \left( {Q_{{ij}} \left( {1 - \delta _{m} } \right).\left( {1 - \delta _{t} } \right).\delta _{k} } \right)} \right] \le F \hfill \\ \mathop \sum \limits_{i}^{n} \left( {Q_{{ij}} \left( {1 - \delta _{m} } \right)\left( {1 - \frac{{\widetilde{{D_{{ij}} }}}}{{P_{i} }}} \right) - b_{{ij}} } \right) \le W_{j} \hfill \\ \mathop \sum \limits_{i}^{n} \left( {\widetilde{{C_{{\left( x \right)i}} }}.Q_{{ij}} \left( {1 - \delta _{m} } \right)} \right) + \left( {x_{{\left( x \right)j}}^{ + } - x_{{\left( x \right)j}}^{ - } } \right) = X_{{\left( x \right)j}} \hfill \\ \mathop \sum \limits_{i}^{n} \mathop \sum \limits_{j}^{m} \frac{{\widetilde{{D_{{ij}} }}}}{{Q_{{ij}} }} \le N_{{Max}} \hfill \\ Q_{{ij}} \le Q_{{Max}} \hfill \\ b_{{ij}} \le Q_{{ij}} \hfill \\ Q_{{ij}} > 0,~integer,~i = 1,2, \ldots ,n \hfill \\ b_{{ij}} \ge 0,~integer,j = 1,2, \ldots ,m \hfill \\ x_{{\left( x \right)j}}^{ + } ,x_{{\left( x \right)j}}^{ - } ,e_{j}^{ + } ,e_{j}^{ - } \ge 0, \hfill \\ \end{gathered}$$
(38)

Under the extended exergy accounting technique, the following section suggests three recent metaheuristic algorithms to solve the fuzzy exergy model in Eq. (38).

6 A solution algorithm

In general, for solving optimization models like Eq. (38), there are three solution search methods such as exact (complete), heuristic, and metaheuristic (Shokouhifar and Jalali 2017). The weakness of “Exact” approaches, for instance, LINGO, CPLEX, and GAMS are primarily on demanded CPU running time, particularly in real-size problems (Diabat 2014; Zahedi et al. 2016), while “heuristics” approaches do not explore the search space effectively (Naderi et al. 2021b). In contrast, “metaheuristics” algorithms have enhanced the global search implementation slightly (Yan et al. 2021; Guo et al. 2020) and have the most precision results with a reasonable CPU running time (Stojanovic et al. 2017). Since the model in Eq. (38) is “nonlinear integer-programming (NIP)” and “NP-complete,” finding an “analytical solution” (if any) is demanding (Diabat 2014; Gen and Cheng 1997; Peng et al. 1998). The fact is that the objective function has a non-derivative arrangement, and the decision variables are integers (Roozbeh Nia et al. 2014). Optimization with metaheuristic algorithms is an influential and well-known method utilized in several engineering and real-world problems (Islam et al. 2021; Maier et al. 2019). These algorithms focus on improved reliability, enhanced system performance, efficient resources, superior system response, profit intensification, error, and cost reduction. (Maier et al. 2019).

Metaheuristic algorithms employ a stochastic manner for the optimization process created on random operators (Islam et al. 2021). Moreover, natural or biological phenomena have stimulated metaheuristic algorithms based on swarm intelligence and evolution (Abdullah and Ahmed 2020; Islam and Ahmed 2020) and applied them to various models (Wang et al. 2020a). Many researchers have successfully employed traditional swarm intelligence and evolutionary algorithms, for instance, ant colony optimization (ACO), particle swarm optimization (PSO), and genetic algorithm (GA) (Roozbeh Nia et al. 2017a, b). Despite these algorithms, there are some modern and attractive examples involving the Horse herd Optimization Algorithm (HOA) (MiarNaeimi et al. 2021; Moldovan 2020), Whale Optimization Algorithm (WOA) (Mirjalili and Lewis 2016; Islam et al. 2021; Yan et al. 2021; Zhang and Wen 2021; Wang et al. 2021b), Lion Optimization Algorithm (LOA) (Yazdani and Jolai 2016; Varshney et al. 2021; Selvi and Ramakrishnan 2020; Wang et al. 2020b; Gope et al. 2019), Ant Lion Optimizer (ALO) (Mirjalili 2015; Wang et al. 2020a; Bekakra et al. 2021; Singh et al. 2021; Chen et al. 2020a; Pradhan et al. 2020), and Grey wolf optimizer (GWO) (Mirjalili et al. 2014; Padhy and Panda 2021; Bekakra et al. 2021; Wang et al. 2021a; Liu et al. 2021; Tütüncü et al. 2021).

The GA and ACO presents a high risk of falling into local optimal, accordingly might lead to an inconsistent result thus needed more iteration to find the optimal solutions (Varshney et al. 2021). Moreover, GA, ACO and PSO have many factors, and it is complicated to decide on correct parameters (Shinoda and Miyata 2019). In this study we consider three recent metaheuristic algorithms: ALO, LOA, and WOA, to solve the “exergy fuzzy NIP (EFNIP) problem” modeled in Eq. (38). The reasons for selecting these three modern algorithms are as follows:

  • ALO has been demonstrated as an efficient optimization algorithm in many areas (Mirjalili, 2015; Dubey et al. 2016; Ali et al. 2017; Wang et al. 2020a) and has a good performance in deciding global optimum (Pradhan et al. 2020; Mirjalili, 2015). The crucial aspect of selecting ALO is by reason of its efficient search space employing random walk and choice of search agents by chance. (Pradhan et al. 2020). ALO has drawn extensive interest because of its relatively adequate efficiency, flexibility, and simplicity (Wang et al. 2020a).

  • In most circumstances, the outcomes achieved by LOA deliver outstanding solutions in fast convergence and global optima accomplishment (Yazdani and Jolai, 2016). This approach uses the local as well as global optima and thus gives the optimal solution with minimum cost (fitness function) and takes less iteration (Varshney et al. 2021).

  • WOA demands no added modification parameters to come to an outstanding balance between its exploration and exploitation (Aala Kalananda and Komanapalli, 2021). Study findings present that WOA is outstanding to other optimization methods, for instance, PSO, ACO, GA, differential evolution (DE), and gravitational search for solution precision and convergence speed (Chen et al. 2020c; Kaur and Arora, 2018; Mohammed et al. 2019, Jahromi et al. 2018). Since the benefits of effortless assumption, simple operation, straightforward application, few modification parameters, and strong robustness, the WOA algorithm has received widespread interest and has achieved many significant research outcomes (Du et al. 2021; Zhang et al. 2021; Long et al. 2020).

Based on the literature, metaheuristic algorithms' parameters substantially impact outcome quality and running time (Yang et al. 2009; Kao and Zahara 2008). Consequently, the algorithm’s parameters employed are based on a pilot study, and the algorithm’s results will be validated with GAMS output in small-size problems. In the following subsections, short explanations primarily supported three metaheuristic algorithms. Interested readers are encouraged to see referred studies about these algorithms in detail. Afterward, the phases concerned in the proposed solutions are described.

6.1 The ant lion optimization algorithm (ALO)

The Ant Lion Optimization algorithm, which Mirjalili (2015) proposed, is one of the nature-stimulated optimization procedures for solving one-dimensional and multidimensional optimization models (Pradhan et al. 2020). The algorithm is stimulated by the hunting behavior of antlions that catch their prey, ants, by digging a pit in the sand (Singh et al. 2021; Mirjalili 2015). A larva of an ant lion builds a conical-formed hole by going along a spherical route in the sand and putting the sand with its enormous jaw. After excavating the hole, larvae conceal at the bottom, stopping for ants to be stuck in the hole. When an ant has been stuck in the hole, the ant lion drops sand towards the outside, so it falls its target into the hole. Once an ant is stuck into the jaw, the ant lion draws the prey toward itself and eats (Mirjalili 2015; Chen et al. 2020a). Six main processes were planned in ALO to replicate communication between the ant and the ant lion in the hole, comprising of random walk of ants, getting caught in the ant lion’s trap, the construction of a hole, descending ants towards the ant lion, sticking prey, re-construction of the hole, and elitism, respectively (Mirjalili 2015; Wang et al. 2020a). An analysis of all prior studies on ALO is newly offered by (Heidari et al. 2020; Abualigah et al. 2020; Abderazek et al. 2020). Moreover, Appendix Fig. 9. Presents pseudo-code of the ALO algorithm.

Fig. 1
figure 1

Flow chart of the proposed solving procedure

6.2 The lion optimization algorithm (LOA)

Yazdani and Jolai (2016) suggested Lion Optimization Algorithm (LOA) as a population-based metaheuristic approach. It is an optimization naturally motivated by the attributes of lions. It replicates lions' social and hunting performance, for instance, prey capturing, roaming, mating, and defence (Selvi and Ramakrishnan 2020). The lion has specific social behavior; hence it is the most powerful mammal globally. Lions have two forms of social behavior: inhabitants and travelers, and lions can switch over them. Inhabitants live in parties known as pride, in which resident females and males appear to give birth. The second structural behavior is so-called travelers, who occasionally move about in pairs or singularly. A detailed explanation of all LOA steps is presented in Yazdani and Jolai (2016). Moreover, Appendix Fig. 10. presents pseudo-code of the LOA algorithm.

Fig. 2
figure 2

An example of the chromosomes for the numerical example with four products and ten buyers

6.3 The whale optimization algorithm (WOA)

The whale optimization algorithm (WOA) is a recent swarm intelligence optimization method suggested by Mirjalili and Lewis (2016). The WOA algorithm is motivated by the hunting method of humpback whales. Their predation process is called the bubble-net attacking method, and it has been seen that it is done by producing unique bubbles along a circle (Goldbogen et al. 2013). The hunting behavior primarily includes three stages: search for prey, diminishing encircling, and spiral revising location (Mirjalili and Lewis 2016; Wang et al. 2021b; Chen et al. 2020b, 2020c; Lee and Lu 2020). The WOA uses three operators that simulate these phases. Among them, the operator replicating the bubble-net hunting behavior of humpback whales is an essential process in WOA (Li et al. 2021). In WOA, the location of each humpback whale stands for a search agent. During the search process, the whales progressively acquire the proper location of the prey by encircling, twisting, and capturing it at the end (Zhang et al. 2021). The WOA obtains the best solution to the global optimization problem by continuously revising the search agent (Yan et al. 2021). Moreover, WOA depends on a linearly declining vector whose value reduces from 2 to 0 as the repetitions develop (Aala Kalananda and Komanapalli 2021). Appendix Fig. 11. Presents pseudo-code of the WOA algorithm.

Fig. 3
figure 3

The total fuzzy exergy comparisons of algorithms in large size test problems (step 1)

At the end of this section, the main steps in the recommended solution process of Eq. (38) under carbon trade emission policy and uncertain environment are presented in Fig. 1. Moreover, an illustration of the chromosomes related to the order quantity (\(Q\)) and backorder amount (\(b\)) of a numerical example with one supplier and ten buyers who have four products are presented in Fig. 2, correspondingly.

7 Numerical examples

This section gives numerical test problems, including one real-world coal SC case study in Iran and nine arbitrary cases related to it. We are looking to optimize sustainability in a coal SC by considering the indirect (hidden) costs in Joules and including all three factors simultaneously (using the extended exergy accounting method) under a carbon trade policy in an uncertain environment. Based on the recommended solution steps in Fig. 1, we are examining to get the optimum value of six decision variables, such as the amount of required loan/investment for each buyer (\({x}_{j}^{-}\), \({x}_{j}^{+}\)), the required carbon credits for each buyer (\({e}_{j}^{+},{e}_{j}^{-}\)), order quantity of each product for each buyer (\({Q}_{ij}\)), and amount of backorder of each product for each buyer (\({b}_{ij}\)). Moreover, a sensitivity analysis considers different percentages for exergy costs in coal SC of eight countries: Iran, India, China, Australia, Japan, Poland, the USA, and Zimbabwe to find the best exergy values that great the highest sustainability in each country. These countries are ranked in the top 20 countries with the most coal consumption globally (Statista 2020).

7.1 Case study in Iran

The real-world case study includes one supplier and ten buyers of coal products in an SC in Iran. Tabas Parvadeh Coal Company (TPCCO), located in Tabas city, is the biggest coal producer in Iran. Consistent with the statistics printed by the Iranian Mines and Mining Industries Development and Renovation Organization (IMIDRO), TPCCO extracted 1.232 million tons of coal from March 21, 2019, to January 20, 2020. With about 1.15 billion tons of reserves, Iranian coal mines can deliver up to three million tons of coal concentrate yearly (IEA, clean coal center 2020). From another point of view, the production of steel in Iran is highly dependent on coal since metallurgical coal, or coking coal, is an essential part of steel-making operations. TPCCO produces four diverse types (grades) of coal, and this company has ten key customers (steel producers) in different cities in Iran. TPCCO and all buyers use the public rail transport system to transport coal orders. Since demand of each steel producer (buyer) for each type of coal, coal purchasing price, and carbon emission price is not stated precisely, we consider them trapezoidal fuzzy numbers (see Appendix Tables 10 & 11). Moreover, the initial data of the test problems (parameters and resource values) and their equivalent exergy parameters are presented in Appendix Tables 12, 13, 14, 15, 16, 17, 18 and 19, respectively. Moreover, all inventory costs and their equivalent exergy cost related to real case study in Iran are presented in Table 4. Consistent with the informed values in Sciubba (2011) as the only reference study for the extended exergy accounting method in the literature, we take equivalent exergy parameters of Egypt due to the resemblances between Iran and Egypt regarding economic development, population, religion, and culture. Therefore, exergy parameters of Iran and selected countries are presented in Table 5.

Table 4 Inventory costs and their equivalent exergy based on capital (30%), labor (60%), and environment (10%) values (Test with four products)
Table 5 The exergy parameters of selected countries (sensitivity analysis)

After consulting with SC managers of TPCCO, it was estimated that each cost of \({{K}_{i,S},K}_{ij,b}, {h}_{ij} and {C}_{i}\) can be divided to Cap = 30% for capital, L = 60% for labor, and Env = 10% for ecological remediation. In Sect. 4.1, we described the method of extended exergy accounting and related formulas that we applied to our model. For example, in Table 4, the cost of \({K}_{i,S}\) is assumed €20 for the first product which includes €6 (\(20\times 0.30\)), €12 (\(20\times 0.60\)) and €2 (\(20\times 0.10\)) (monetary values) for capital (Cap = 30%), labor (L = 60%) and environmental (Env = 10%) remediation, respectively. Moreover, these three numbers are converted to the exergy values of 34.08, 3.56, and 11.36 MJ, respectively (in total \({K}_{(x)i,S}=\) 49 MJ). To show better the performance of our suggested modern metaheuristic algorithms in solving big-size test problems, besides the actual case study, we considered nine arbitrary numerical examples related to it with 10, 20, 40, 80, 160, 320, 640, 1280, and 2560 products in a sustainable coal SC in Iran with one supplier and 15 buyers. The initial data of all numerical examples are shown in Appendix Tables 12, 13, 14, 15, 16, 17, 18 and 19, respectively. As noted previously, a pilot study is used for the parameter tuning of all suggested metaheuristic algorithms, and the test problems are solved on a PC with an Intel Core i7-7500U CPU with 2.70 GHz and 8.00 GB RAM in Windows 10. The “MATLAB” 2017a software is also employed for coding all the algorithms.

7.2 Solving phases and related results

7.2.1 Step one—Metaheuristic algorithms

Based on solving procedure (Fig. 1), at the first step, all suggested metaheuristic algorithms are executed 15 times for the fuzzy exergy model with carbon trade policy (Eq. 38). The outputs of algorithms include the lowest fuzzy total exergy (MJ), and the CPU times (seconds) are presented in Tables 6 and 7, respectively. Based on the results, the superior metaheuristic algorithm for the smallest fuzzy total exergy (MJ) and running times (seconds) could be found for the model (Eq. 38).

Table 6 The fuzzy total exergy (MJ) observed by the algorithms under carbon trade policy in Iran (Eq. 38)
Table 7 The CPU times (Sec.) of solving numerical examples by the algorithms under carbon trade policy in Iran (Eq. 38)

Concerning the fuzzy total exergy and in line with the fallouts shown in Table 6, ALO is the best algorithm (with 32,753,094.69 and 122,319,654.35 MJ) in the actual case study in Iran with four products as well as the numerical test with ten products, while for test problems from 20 to 2560 products, WOA is the best. For our large size test problems (640, 1280 & 2560 products), WOA gets the lowest fuzzy total exergy cost (3,964,974,414.68; 8,490,424,760.63 & 20,715,326,512.04 MJ) followed by LOA, and ALO, respectively (see Fig. 3). Regarding Appendix Fig. 12, performance improvement between top two algorithms from 20 to 80 p test problems, are less since the results of them are very close together. But in large-size test problems the average performance enhancement between the results of WOA and LOA is about 90%, which means the results of WOA are outstanding. In opposition, ALO has the highest fuzzy total exergy (MJ) results in our medium and large-size test problems.

Fig. 4
figure 4

The CPU time comparisons of all algorithms (step 1)

Considering the CPU time (Sec.), WOA is absolutely the best algorithm with the lowest running time in all test problems (see Fig. 4). For example, in our large-size test problems (640, 1280 & 2560 products), the WOA CPU times were 49.23, 78.89, and 154.47 (Sec.), respectively (see Table 7). Moreover, in large-size test problems, the average of WOA’s performance improvement (%) with the second-best algorithm is about 700% which means WOA solves the models fast (see Appendix Fig. 13). Conversely, ALO has the highest CPU time among other algorithms in all test problems except for 1285 products, where LOA (with 829.7169 Sec.) is the worse algorithm (see Table 7). In Fig. 5, we presented some convergence diagrams of the smallest fuzzy total exergy by the proposed algorithms.

Fig. 5
figure 5

The convergence diagram of the total fuzzy exergy by the proposed algorithms (step 1)

7.2.2 Step two—Exact method

A solution may be compared with an “exact method” to validate the results by suggested algorithms. Exact optimizer software, for example, “GAMS” or an optimization library in “Python,” can find the “exact result.” In this research, the proposed mathematical model (Eq. 38) under carbon trade strategy is solved in small size (test with four products) by GAMS. A contrast with the best metaheuristic algorithm is made in Table 8. Taking into account Eq. (38) for the 4-product test problem, the exact result for the fuzzy total exergy is 31,537,292.44 (MJ), while the outcome of the best metaheuristic algorithm (ALO) for this test is 32,753,094.69 (MJ). Therefore, the percentage penalty between the exact method and ALO is 3.86% (see Table 8). Because the percentage penalty is minor, suggesting the excellent dominance of the solutions got by the best-suggested algorithm (Cárdenas-Barrón et al. 2012) since it is remarkably close to the exact method (see Fig. 6). Concerning CPU running time and Table 8, the distinction between exact method and ALO is 1.21 (Sec.), but the percentage penalty is 39.48%. It shows that the metaheuristic algorithm (ALO) solved the carbon trade model more rapidly (see Appendix Fig. 14).

Table 8 Comparing the results of the exact method (GAMS) with the best algorithm (ALO)
Fig. 6
figure 6

Comparison of the total fuzzy exergy between exact method and the best metaheuristic algorithm for test problem with four products (step 2)

7.2.3 Step three- sensitivity analysis

In the earlier subsections, we studied the optimization of a sustainable fuzzy EPQ model of coal SC in Iran by taking into account different objectives simultaneously: the costs of the inventory system, an additional required budget of each buyer, coal transportation cost among SC members, and carbon emission cost. All goals in the models and related limitations under the emission trade strategy are in MJ in place of monetary values. This step tries to balance economic and sustainable advantages for coal SC companies. Considering that our proposed model is sustainable, we modify the exergy percentage for capital, labor, and environmental remediation by a sensitivity analysis to find the best values of exergy components that improve the sustainability of coal SC more than before. Additionally, to gain further insight into this adjustment, we evaluate sustainable coal SC in Iran as well as seven selected developing and developed countries with the world's most significant coal consumption. They are India, China, Australia, Japan, Poland, the USA, and Zimbabwe (Statista 2020). We assumed the same coal SC and products for all these countries to make a comparative analysis. In the previous section, we mentioned that in our numerical examples, it was assumed that each cost of \({{K}_{i,S},K}_{ij,b}, {h}_{ij} and {C}_{i}\) can be allocated to Cap = 30% for capital, L = 60% for labor, and Env = 10% for ecological remediation (consider it as exergy Set A). In this section, to get more insight, we have changed these percentages to make seven different exergy sets (see Appendix Fig. 15), including A (30-60-10), B (60-20-20), C (20-50-30), D (20-40-40), E (20-30-50), F (30-10-60) and G (33-33-33). Considering each exergy set, we computed the fuzzy total exergy for a 4-item test problem under carbon trade policy for all countries by GAMS (see Table 9). For example, we consider coal SC in the USA and exergy Set C (Cap = 20%, L = 50%, and Env = 30%), then employing extended exergy accounting method to convert all monetary costs of \({{K}_{i,S},K}_{ij,b}, {h}_{ij} and {C}_{i}\) to equivalent (MJ). After that, we run model Eq. (38) with four product test problems using the Exact method (GAMS). Likewise, the same process was done for other exergy Sets (A-G) and considering other countries' coal SC. Finally, all results are presented in Table 9. In the following we explain the results in detail.

Fig. 7
figure 7

Sensitivity analysis for each country—Min. & Max. of the total fuzzy exergy (step 3)

Table 9 Sensitivity analysis of different percentages for exergy elements (example with four products)
7.2.3.1 Analysis of each country

Considering Table 9 and Fig. 7, for coal SC in each country, we have:

  • Australia: For coal SC under the carbon trade policy in this country, the top exergy components are Set F (30-10-60) since more exergy percentage is assumed for Environment (60%) and less for Labor (10%). It created the minimum fuzzy total exergy of 24,251,604.43 (MJ) for coal SC. Besides, the worst exergy components are Set A (30-60-10) since Labor has 60% while Environment has only 10%, which created the highest fuzzy total exergy with 37,386,644.58 (MJ).

  • China: The best exergy components are Set C (20-50-30) when Labor has 50% weight, followed by Environment (30%) and Capital (20%), respectively. It created the minimum fuzzy total exergy of 83,731,242.82 (MJ) for coal SC. Likewise, the weakest exergy components are Set F (30-10-60) when more exergy percentage is assumed for Environment (60%) and only 10% for Labor, which generated the greatest fuzzy total exergy of 128,734,240.79 (MJ).

  • India: Like China, the finest exergy components in India are Set C (20-50-30), when Labor has 50% weight, while Environment and Capital are 30% and 20%, respectively. It produced the minimum fuzzy total exergy of 24,826,136.13 (MJ) for coal SC. Moreover, the unpleasant exergy components are Set B (60-20-20) when more weight is expected for Capital (60%) and the same weights (20%) for Labor and Environment, which formed the maximum fuzzy total exergy of 56,664,303.08 (MJ).

  • Iran: For coal SC in this country, the top exergy components are Set A (30-60-10) as Labor has 60% while Environment has only 10%. It made the minimum fuzzy total exergy of 31,537,292.44 (MJ). Like India, the unhealthiest exergy components in Iran are Set B (60-20-20) when more weight is assigned to Capital (60%) and the same weights for Labor and Environment (20%), which generated the maximum fuzzy total exergy of 50,042,180.33 (MJ).

  • Japan: Like Australia, the best exergy components in Japan are Set F (30-10-60), while more exergy percentage is given to Environment (60%) and less to Labor (10%). It established the least amount of fuzzy total exergy with 22,873,547.02 (MJ) for coal SC. Furthermore, the unhealthiest exergy components are Set B (60-20-20) when more weight is provided to Capital (60%) and the same weights for Labor and Environment (20%), which generated the highest fuzzy total exergy of 40,279,208.50 (MJ).

  • Poland: Like India and China, the excellent exergy components in Poland are Set C (20-50-30), when Labor has 50% weight, followed by Environment (30%) and Capital (20%), respectively. It created the least possible fuzzy total exergy of 86,131,627.76 (MJ) for coal SC. Besides, the worst exergy components are Set F (30-10-60), when more exergy percentage is offered to Environment (60%) and less on Labor (10%), which created the maximum fuzzy total exergy of 123,315,602.00 (MJ).

  • The USA: Like Australia and Japan, the superior exergy components in the USA are Set F (30-10-60) as more exergy percentage is assumed to Environment (60%) and less on Labor (10%). It generated the minimum fuzzy total exergy of 19,675,609.14 (MJ) for coal SC. Additionally, the harmful exergy components are Set A (30-60-10) since Labor has 60% while Environment has only 10%, which established the highest fuzzy total exergy of 31,673,757.27 (MJ).

  • Zimbabwe: Like Australia, Japan and the USA, the first-rate exergy components in Zimbabwe are Set F (30-10-60) because more exergy percentage is assumed to Environment (60%) and less on Labor (10%). It crafted the minimum fuzzy total exergy of 22,873,547.02 (MJ) for coal SC. Additionally, the weakest exergy components are Set A (30-60-10) since Labor has 60% while Environment has only 10%, which generated the greatest fuzzy total exergy of 31,803,458.12 (MJ).

  • Considering Table 9, the best total exergy (MJ) in each country is as follow: Australia (24,251,604.43), China (83,731,242.82), India (24,826,136.13), Iran (31,537,292.44), Japan (22,873,547.02), Poland (86,131,627.76), the USA (19,675,609.14) and Zimbabwe (22,873,547.02).

  • Among all presented countries, the coal SC in the USA has the smallest total exergy (19,675,609.14 MJ), followed by Japan, Zimbabwe, Australia, India, Iran, China, and Poland, respectively (see Fig. 7).

  • Moreover, coal SC in China creates the highest total exergy for all exergy sets except for Set B (60-20-20) and Set C (20-50-30) related to Poland (see Fig. 8).

Fig. 8
figure 8

Sensitivity analysis for each set—Min. & Max. of the total fuzzy exergy (step 3)

7.2.3.2 Analysis of each exergy set

Considering Table 9, Fig. 8, and Appendix Fig. 15, for each exergy set, we have:

  • Exergy Set A (30%-60%-10%): This exergy set has 60% for Labor, while for Environment, it is only 10%. Although this set works well for coal SC in Iran, with the minimum total exergy of 31,537,292.44 (MJ), in China, it is 121,884,457.74 (MJ).

  • Exergy Set B (60%-20%-20%): In this set, more weight is assumed for Capital (60%) and the same for Labor and Environment (20%). Despite coal SC in Poland (110,155,055.08 MJ), exergy set B operates well in the USA with 22,604,564.59 (MJ).

  • Exergy Set C (20%-50%-30%): In this set, Labor has 50% weight, followed by Environment (30%) and Capital (20%), respectively. Exergy set C performs well in coal SC in India (24,826,136.13 MJ), even though in Poland, the total exergy is 86,131,627.76 (MJ).

  • Exergy Set D (20%-40%-40%): In this set, Capital has only 20% while 40% is for both Labor and Environment. In spite of the high result in China with 94,201,685.52 (MJ), exergy set D runs well in Zimbabwe with 25,762,854.83 (MJ).

  • Exergy Set E (20%-30%-50%): In this set, 50% is assigned to Environment and 20% and 30% to Capital and Labor, respectively. Exergy set E operates well in the USA with 25,320,951.45 (MJ), although the result is high in China (111,411,481.62 MJ).

  • Exergy Set F (30%-10%-60%): In this set, 60% is allocated to Environment and only 10% Labor. Exergy set F performs well in the USA (19,675,609.14 MJ), despite the fact that the result is not healthy in China (128,734,240.79 MJ).

  • Exergy Set G (33%-33%-33%): In this set, all three exergy components have equal 33% weight. Even though exergy set G does not perform well in China with 121,351,102.11 (MJ), it runs well in Zimbabwe with 24,146,338.65 (MJ).

  • Moreover, exergy Sets B (30-60-10), E (20-30-50) and F (30-10-60) created the minimum total exergy for coal SC in the USA, while all exergy sets except Set B (30-60-10) and Set C (20-50-30) created the highest total exergy in China (see Fig. 8).

8 Conclusions and future work

According to the literature review, there is a lack of studies that assess a coal SC under a carbon trade policy with ambiguous parameters such as carbon price and customer demand. Likewise, it is scarce to obtain research that assesses a SC in terms of Joules (in place of traditional monetary measures of performance) and simultaneously evaluates all sustainability characteristics, such as economic, labour, and environmental. Similarly, to the best of the authors' knowledge, no exergy analysis method like the extended exergy accounting in the literature considers carbon policy in SC. Therefore, this study develops the work in the papers by Jawad et al. (2016) and Naderi et al. (2021a) to a multi-product multi-limitation inventory (EPQ) model with backorder for a coal SC in Iran under an uncertain environment. By applying the extended exergy accounting technique and Mega-Joules (MJ) as a universal unit of measure, the total exergy of the coal SC can be calculated. Moreover, a well-known carbon reduction strategy (carbon trade) is employed to evaluate the sustainability performance of the model. In this study, we presented four research questions (in Sect. 1) and attempted to answer them.

  • Q1. Is it possible to assess the sustainability of coal SC under a carbon reduction policy in terms of Joules rather than money, to benefit both the economy and the environment?

In Sect. 3.4, we developed a non-exergy mathematical model of the coal SC for carbon trade policy. Then the model has converted to a fuzzy model in Sect. 3.5, and finally, a new SC assessment method called the extended exergy accounting (in terms of Joules) was employed in Sect. 4. This method contains energy and material's main aggregate exergy subject and costs corresponding to economic externality (labor and capital) and ecological externality (environmental remediation). Therefore, employing this method could benefit both the economy and the environment. After that, three recent metaheuristic algorithms (ALO, LOA, and WOA) are utilized. When contrasting the best algorithm outcomes in small-size test problems (four products) with the exact method (GAMS), there is a small percentage error (3.86%) under the carbon trade policy between them. Therefore, it could validate the results of metaheuristic algorithms in this study.

  • Q2. Generally speaking, coal SC in developing countries, or even China, has the lowest overall cost; however, considering sustainability aspects (social, economic, and environmental characteristics) in terms of Joules, does this assumption still hold true?

Regarding the sensitivity analysis in Sect. 6.2.3, we compared the sustainability of coal SC in eight developed and developing countries, such as Iran, India, China, Australia, Japan, Poland, the USA, and Zimbabwe (see Table 9). They are the world's most significant coal-consuming countries (Statista 2020). It was observed that, Poland and China have the highest fuzzy total exergy of a sustainable coal SC (86,131,627.76 and 83,731,242.82 MJ, respectively) among eight selected countries. The reason behind this issue is that traditional assessment methods consider economic measures. In contrast, the method of extended exergy accounting (as mentioned in Sect. 4) considers all three aspects of sustainability (Labour, Money, and Ecological remediation) in goods or services. It determines the exergy corresponding to them (in terms of Joules) by some elements significantly affected by population, normal workload, labor statistics, and local and international wages in each country. Therefore, the extended exergy accounting results show the total number of Joules that coal SC utilized in Labour, Money, and Ecological aspects.

  • Q3. Which country has the most sustainable coal SC in terms of Joules?

Based on Table 9, the lowest total exergy of a sustainable coal SC among all eight countries belongs to the USA (19,675,609.14 MJ) under the carbon trade policy. It means sustainable coal mining and related processes in the USA have economic and environmental advantages compared to China or developing countries such as Iran and Zimbabwe. Moreover, Japan, Zimbabwe, Australia, India, Iran, China, and Poland followed the USA (see Fig. 7).

Q4. What is the best percentage of exergy components (social, economic, environmental characteristics) to achieve the greatest saving wherever coal SCs are working?

Considering Sect. 6.2.3 and Table 9, it is observed that under carbon trade policy, exergy Set F (30-10-60) percentages created the minimum fuzzy total exergy (highest carbon and exergy reduction) in coal SC of the countries such as Australia, Japan, the USA, and Zimbabwe. Set F (30-10-60) is given more weight (60%) to Environment, 30% to Capital and only 10% to Labor. Likewise, for coal SC in China, India, and Poland, exergy Set C (20-50-30) generated the least amount of fuzzy total exergy (83,731,242.82; 24,826,136.13 & 86,131,627.76 MJ respectively). In Set C (20-50-30), the emphasis is on Labor with 50% weight, while Capital and Environment have 20% and 30%, respectively. Finally, for coal SC in Iran, Set A (30-60-10) has the best exergy component with the minimum fuzzy total exergy of (31,537,292.44 MJ). Labor with 60% is the first weight in Set A (30-60-10), while only 10% was assigned to Environment and 30% to Capital.

Moreover, the theoretical and managerial implications of this work are presented as follows:

  • It is important to note that the EEA method has the advantage of enabling meaningful comparisons between coal SCs in different countries that produce the same coal type. By comparing the amount of exergy consumed in the coal production process and related SC processes, it becomes easier to determine where a coal SC business should be located. Due to this, selecting a product from a country with low wages, such as China or India, may not always be beneficial as more exergy is required for its production. Using the EEA method provides an indication of the sustainability impact of coal SCs in an era when climate change concerns are increasing prevalent.

  • The exergy equations in Sect. 4 (for instance, Eqs. 2637) show that all exergy parameters in Table 5 are directly related to the cost elements of inventory models (such as setup, purchasing, and holding), and affect the exergy functioning of the coal SC in a significant way. It is therefore critical to decrease the cost elements of a coal SC's inventory model to improve sustainability. The managers could use stock classification and shorter order cycles, reducing the lead time of suppliers, eliminating obsolete inventory, implementing a Just-in-Time inventory system, and monitoring key performance indicators.

  • Unlike conventional financial and commercial models, the results of our study found that despite assumptions that inventory parameters in coal SC are unchanged for all eight countries, more savings could be achieved through the tuning of exergy's inflows and outflows in each country. It means that no fixed amount of exergy components (Capital, Labor and Environment) can deliver the highest sustainability in all countries. According to our results in Table 9, set F (30-10-60) with 60% weight allocated to the environment and only 10% to labor generates the greatest sustainability for the USA (19,675,609.14 MJ) as well as the most unpleasant sustainability for China (128,734,240.79 MJ). Hence, finding the most appropriate values of the exergy components of the SC would be another task for decision makers.

  • Another point is that, considering Table 5, one can conclude that the exergy parameter of Capital (\({ee}_{Cap}=2.85 \mathrm{MJ}/\mathrm{Euro}\)) in the USA is less than the other countries. In contrast, China and Poland have the highest exergy parameter of Capital (\({ee}_{Cap}=14.01 \& 14.02 \mathrm{MJ}/\mathrm{Euro}\)) among other countries. This would be one of the reasons why the USA has the most sustainable coal SC in terms of Joules whereas China and Poland are the least sustainable. Therefore, a way to increase sustainability in each country is to find ways to decrease exergy parameters. If we look at Eqs. (24) and (25), exergy parameters of (\({ee}_{Cap},{ee}_{L}\)) are dependent on two econometric coefficients (\({\alpha }_{x},{\beta }_{x}\)) as well as (\({Ex}_{in}\)). Section 4.1 explains that these are influenced by the type of societal organization, the historical period, the technological level, the pro-capital resource consumption, and the geographical location of the country (Sciubba, 2011). All shareholders, governments, individuals, societies, business organisations, scientists, etc., need to contribute significantly to adjusting parameters, if possible. An example is controlling the import and export of goods from and to the country or extracting ores and minerals. Promoting locally made goods can be a way for individuals, societies, and business organizations to support this cause. As a result, there would be more jobs available in the country, and increasing the labor force rate (Jawad et al. 2018). Additionally, effective productivity growth (output per hour worked) can boost a country's income and GDP per capita. For more information, readers are encouraged to consult Sciubba (2011).

  • In addition, decision-makers should find ways to improve the sustainability of their coal SC by reducing waste, labor, material, and pollution, which will reduce the damaging effects of coal SC. When calculating energy costs, managers of SC would have more flexibility since they could use available resources rather than just capital to calculate the quantity. Furthermore, this research will also guide managers of international coal mining companies who wish to decide which country has more sustainable conditions for their business and investments.

Furthermore, the EEA method in this study is subject to some limitations, including the following:

  • When EEA is employed to a coal SC, the precision of the outcomes is dependent upon the assumptions made.

  • It is possible that the EEA method in coal SCs may have limitations when more than one country is involved in the SC processes (international companies).

  • Insufficient data regarding a country's total exergy input, the quantity of exergy represented in the workforce, the exergy of raw materials and energy consumed to supply a coal.

Finally, the following avenues for future research are suggested for consideration:

  1. (a)

    A coal production system.

  2. (b)

    An international coal SC model that works in more than one country at the same time.

  3. (c)

    Comparing a global coal SC with a national one.

  4. (d)

    A model with multi-objective (integrating inventory measures).

  5. (e)

    The strategy of increasing carbon price with increasing the amount of carbon (price dependent on amount) by each company.

  6. (f)

    The SC of coal power plants.

  7. (g)

    Quantity discounts in cost per unit of products can be allowed.

  8. (h)

    Multi-echelon SCs, for example, single-buyer multi-supplier and multi-buyer multi-supplier SCs, can be investigated.

  9. (i)

    Lead times can be included.