Introduction

India is one of the global leaders in coal production sector; more specifically India is third largest coal producer globally, with 715.13 Mt of coal production in the year 2018–2019 as reported by Ministry of Coal, Government of India [1]. Opencast mining technique in India extremely dominates the underground mining; as reported by Indian Bureau of Mines [2] in 2017–2018 only 6.3% of annual coal production is accomplished through underground mining method. The complex geo-mining condition and restricted implementation of cutting-edge technologies is considered to be the main hindrance in underground mining in India. According to Ghose. M.K. [3] with higher depletion of the seams near the surface, concerns related to environmental pollution and degradation are forcing the government to think for some alternatives to opencast mining; one of the most feasible solution to this is large scale implementation of cutting edge underground mining technologies; such as Continuous Miner (CM) and Longwall technology.

Following Table 1 depicts the percentage of coal produced from underground mining out of total coal produced annually by major coal producing countries.

Table 1 Contribution of underground mining in overall coal production for different countries

Table 1 delineates that except India other countries are far ahead in contribution of underground mining for coal production; this is basically feasible with implementation of cutting-edge technologies as stated above.

This is evident that the CM technology is effective with those major coal producers globally, though the scenario may be different in Indian geo-mining conditions; this opens a huge avenue for research in pre-commissioning and operational stage of these machines.

Government of India has already adopted this technology in some of the selective underground coal mines of the country; few of which are working satisfactorily but majority of them demands further research and observations.

This paper actually deals with the evaluation of Overall Equipment Effectiveness (OEE) of a CM-based project in southern India for a period of six months; to understand the scenario of effectiveness and utilization. From OEE calculation, an alarming situation was recognized; which encouraged for conducting further in-depth analysis of reliability of each sub-system with the purpose of identification of potentially vulnerable sub-systems. Further, FMEA technique was used on identified vulnerable sub-systems to describe significant failure modes or causes of failure and further few actions with controls were suggested on significant failure modes to mitigate their criticality.

Mine Description

The mine site is located in the southern part of the country and working with continuous miner for development purpose using room and pillar method of working for coal extraction. The geo-mining condition of the mine site is depicted in the following Table 2.

Table 2 Geo-mining condition of the mine site under study

The geo-mining condition depicts some adversity for deploying Continuous Miner such as very high gradient of 1 in 4. Rhombus-shaped pillar design is adopted to partially overcome the problem arising from high gradient. This create another problem of higher exposed roof area at the junctions, inducing high stress, demanding furthermore dense roof support at the junction for safety purpose. It is also worthy to mention that the Rock Mass Rating (RMR) value of the panel was evaluated as 37 by one of the statutory bodies in the country. This indicates really poor characteristics of immediate roof and makes it unsuitable to work out with Continuous Miner.

Other adverse conditions are low gallery width and high seam thickness. Therefore, unexplored strata induce enormous production loss and results in low return on investment. In addition, very high gradient (around 1 in 12) in strike direction poses an unusual and adverse condition for the operation of shuttle cars.

Methodology

figure a

Mine site with a Continuous Miner (CM) working was selected for development of the coal panel. Working time and downtime related data were collected for a period of six months, for CM and allied machines. A digital stopwatch was used to record the time related data. Using this downtime, working time and production data the system availability and performance were evaluated, which were used to calculate the OEE of the system. OEE of the CM machine was found alarming for a period of six months of study. Subsequently, reliability analysis of different sub-systems of CM was performed to identify potentially vulnerable sub-systems. Further, FMEA was applied to those potentially vulnerable sub-systems to recognize different failure modes based on field observations and discussions. Afterward, these failure modes were categorized as significant or not using Pareto chart. Finally, few actions were suggested to alleviate those significant failure modes.

Overall Equipment Effectiveness (OEE) Analysis

Overall Equipment Effectiveness (OEE) is an important technique for evaluation of the effectiveness of a production system. The article “Calculate OEE” depicts that OEE is actually a combination of three individual parameters, these are: availability, performance and quality [8, 9].

The quantification of these three parameters for CM-based underground coal production system is elaborated as follows:

Availability

It is basically the ratio of the time for which the machine is actually available (Actual Operation Time) to perform its desired task to the Net Available Time [9]. The Net Available Time is the time remaining after deduction of scheduled maintenance and planned stoppages from the overall planned production time.

Therefore, the Availability of a machine is given by; \({\text{Availability}} = { }\frac{{\text{Actual Operation Time}}}{{\text{Net Available Time}}}\).

Availability of CM based production system is calculated similarly.

Performance

It is the ratio of the actual output to the planned output [9].

The calculation of performance for CM based production system is done by evaluating the ratio of number of shuttle car load actually produced in a day to the number of shuttle cars load targeted to be produced in that same day under same condition.

Quality

Quality is the ratio of the number or quantity of product qualified in the quality benchmarking to the number or quantity of products actually produced [9].

For factory-based products, it is easy to determine how many products are actually defective within the total number of products produced in a specified time domain, from which the quality can be evaluated as a ratio of number good quality products and total number of products produced.

However, for coal mining this is very difficult to quantify good quality and inferior quality in a similar way like factory made products. Therefore, as suggested by Elevli and Elevli [10] that the bucket filling factor of the hauling or loading machine should be considered as the quality parameter for coal mining industries, as a result, the average bucket filling factor of the shuttle cars is considered as the quality parameter in this study, which is also supported by a paper by Banerjee. [9].

Bucket Filling Factor is basically the ratio of actual load carried by a hauling machine to maximum carrying capacity of that machine. Here average bucket filling factor is considered as quality parameter.

Overall Equipment Effectiveness (OEE) of machine can be calculated as follows:

OEE = Availability × Performance× Quality

From Table 3, the calculated OEE for CM system for a period of six months can be seen, which depicts alarmingly low value as compared to maximum possible value of 1. It is a potential reason to further research into the vulnerable sub-systems.

Table 3 OEE of CM for a period of six months

Reliability Analysis

Reliability of a system or sub-system describes the probability of a machine that; it will perform its desired task after a specified time since inception of working. This helps in identification of vulnerable sub-systems; contributing to lower equipment availability and OEE.

Here the overall CM package is divided in few sub-systems for the purpose of reliability analysis; these sub-systems are namely: electrical, cutter, gathering, traction, hydraulic, chassis, feeder breaker, mine conveyor, CM conveyor, and ram car.

There are few steps involved in reliability analysis of any sub-system; these are described as follows:

Estimation of Time Between Failures of Each Sub-Systems

For this purpose, the breakdown details were recorded for a period of six months and time between failures were calculated between each failure for every sub-system within the entire study period.

Check for Independence of Data

Time between failure (TBF) data are a prerequisite for evaluation of reliability of any system and sub-system. These TBF data are fitted to a suitable probability distribution. However, the TBF data must be free from any trend and correlation to qualify for getting fitted to any theoretical probability distribution.

As Vagenas et al. mention, there are basically two effective techniques utilized to evaluate the independence of the data set; these are trend test and serial correlation test [11].

Trend test is the scatter plot between the cumulative time between failure and cumulative failure number. The TBF data, which are free from any trend, depict linear plot. Here, the trend test plot for electrical sub-system is depicted in Figure 1, where the plot is depicting a linear trend to conclude that the TBF data for electrical sub-system are free from any trend.

Fig. 1
figure 1

Trend test plot for electrical sub-system

Similarly, the trend test graphs for each sub-system were plotted and show almost linear trend in every case, which delineates that the TBF data set for all the sub-systems is free from any trend.

Serial correlation plot is the scatter one between (i-1)th TBF and ith TBF, where ith TBF is the TBF at any instance and (i-1)th is the just previous TBF to any corresponding ith TBF. For data sets free from any correlation, this scatter plot becomes random and does not show any specific trend.

The serial correlation plot of electrical sub-system is depicted in Figure 2, which is showing a random plot; hence it is free from any correlation. Similarly, the serial correlation plots for each sub-system were plotted and in every case, randomness was observed. This delineates that TBF data set for all the sub-systems is free from any correlation.

Fig. 2
figure 2

Serial correlation plot of electrical sub-system

As TBF data for each sub-system are free from any trend and correlation, hence they qualify to be fitted to theoretical probability distributions.

Goodness-of-Fit Test to a Suitable Probability Distribution

For reliability analysis, there are three probability distributions, which are mostly accepted and widely used as mentioned by Vagenas et al. [11]; these are: Weibull distribution, log-normal distribution and exponential distribution.

Goodness-of-fit test is essential to identify the best fit distribution for TBF data of any specific sub-system.

There are two types of goodness-of-fit test techniques popular for this purpose; these are: Chi-square test and Kolmogorov–Smirnov test (K–S test). However, Vagenas et al. furthermore mention that K–S test is better suited for TBF data, as TBF is not discrete variable and K–S test is a versatile test technique [11]. Chi-square test is basically most suited to discrete variables. Therefore, K–S test technique is being used to evaluate the best fit probability distribution for TBF dataset of each sub-system.

Few assumptions for K–S test of this study are mentioned hereunder:

H0 = Data follow the specific distribution (this is null hypothesis).

Ha = Data do not follow the specific distribution (this is alternate hypothesis).

α is significance level (generally considered 5% or 0.05).

K–S test goodness of fit gives a p value, which along with α is considered for judgement of acceptance or rejection of the null hypothesis.

For every case where p < α the null hypothesis is rejected.

However, as per the selected null and alternate hypothesis as mentioned above; for fitting a distribution the null hypothesis should not be rejected and p > α is desirable.

The following Table 4 depicts result of K–S goodness-of-fit test as well as best fit probability distribution for TBF data of each sub-system. This goodness-of-fit test is performed using XLSTAT, which is a statistical add-in to MS-Excel.

Table 4 K–S test goodness-of-fit result and best fit distributions for each sub-system

Fitting a Suitable Distribution to TBF Data

TBF data of each sub-system are fitted to relevant distribution indicated by goodness-of-fit test. This is performed using distribution fitter application of MATLAB R2021a. During the distribution fitting in MATLAB, it provides information related to relevant parameters with that distribution, which is helpful in calculation of reliability at any instance theoretically.

The following Table 5 depicts the best fit probability distribution for each sub-system, which is also indicated in the above-mentioned Table 4. Table 5 also provides values of all relevant parameters of those distributions under the effect of TBF data of assigned sub-system.

Table 5 Best fit probability distributions and their relevant parameters

Evaluation of the Reliability of Each Sub-System Based on Best Fit Distribution

As mentioned earlier, the TBF data are fitted to relevant best fit distribution through MATLAB R2021a and evaluated by selecting survivor function to assess reliability. The entire reliability analysis was performed for a time period of fifteen thousand minutes starting from zero minute with interim time gap of one thousand minutes. The identified potentially vulnerable sub-systems require further analysis for performance improvement through Failure Modes and Effects Analysis (FMEA).

Here, the reliability is evaluated using survivor function option of MATLAB distribution fitter. However, reliability of any sub-system can be calculated using the relevant formulae for the associated distribution using the parameter values mentioned in Table 5.

The reliability calculation formulae for different probability distributions are depicted in the following expressions [11, 12]:

$${\text{Reliability }}\left( {{\text{Weibull}}} \right) = e^{{ - \left( {t/A} \right)^{B} }}$$
$${\text{Reliability}}\left( {{\text{Exponential}}} \right) = e^{{ - \left( {t/\mu } \right)}}$$
$${\text{Reliability}}\left( {{\text{Lognormal}}} \right) = \left[ {1 - \Phi \left( {\frac{\ln \left( t \right) - \mu }{\sigma }} \right)} \right]$$

Here, Φ represents normal cumulative distribution function.

And t = time in minutes for all above expressions.

The reliability result for each sub-system as obtained from MATLAB has been depicted in Table 6.

Table 6 Reliability result for each sub-system

Table 6 delineates the reliability of any sub-system at any point of time, and it is really a handy tool to identify the sub-systems with rapid rate of reliability depletion; those sub-systems are basically considered vulnerable and contributing to lower availability trend of the system. Consequently, it reduces the Overall Equipment Effectiveness of the system.

Reliability trend of mine conveyor sub-system is depicted in Figure 3. However, the reliability trend of all the sub-systems showing similar trend of depletion of reliability, having increase in probability of failure with time is quite natural for any working machinery. The sub-systems depicting unusually fast reliability depletion trend when compared with other sub-systems of the same system can only be considered as vulnerable sub-systems.

Fig. 3
figure 3

Reliability trend of mine conveyor sub-system

Table 6 helps to easily identify the sub-systems reaching almost zero reliability after fifteen thousand minutes of operation; these sub-systems are electrical, cutter, ram car, mine conveyor and feeder breaker, respectively. These sub-systems can be considered potentially vulnerable, but their ranking can be determined from their reliability depletion trend. For the purpose of understanding the reliability depletion trend, the reliability trend of above mentioned potentially vulnerable sub-systems is plotted in a single plot (Figure 4). This Figure 4 helps to identify the ranking of vulnerable sub-systems ranging from most vulnerable to the next vulnerable sub-systems in descending order with mine conveyor being the most vulnerable sub-system, followed by ram car, electrical, cutter and feeder breaker respectively.

Fig. 4
figure 4

Reliability depletion trend of vulnerable sub-systems

Failure Modes and Effects Analysis of these vulnerable sub-systems is important to identify most challenging failure modes; mitigating which may have positive impact on sub-systems as well as system availability and OEE.

Failure Modes and Effects Analysis (FMEA)

It is a technique to identify the failure modes or potential causes of failure of critical systems and sub-systems contributing towards overall system failure. This technique basically depends on evaluation of Risk Priority Number (RPN) for each failure modes. On identifying the critical or significant failure modes, some suitable controls are suggested to minimize the RPN value of significant failure modes.

The RPN value depends on three factors; namely: Occurrence (O), Severity (S) and Detectability (D). The RPN is calculated as follows:

RPN = Occurrence × Severity × Detectability.

Higher RPN number signifies higher criticality for corresponding failure mode; the target is to mitigate the criticality by reducing this RPN number through application of adequate engineering controls.

There are standard scales available for Occurrence (O), Severity (S) and Detectability (D) values as shown by Zuniga et al. [13], which are shown in Table 7. The ratings for all the parameters corresponding to each failure modes are selected from this table suitably.

Table 7 Occurrence, Severity and Detectability rating table

This technique is applied on potentially vulnerable sub-systems as identified by reliability analysis. The following Table 8 depicts important failure modes associated with those sub-systems and their corresponding Occurrence, Severity and Detectability ratings as well as Calculated RPN values. Finally, rankings are provided to each failure modes based on RPN value in descending order (highest RPN assigned rank 1) and then these failure modes are rearranged based on ranking. These ranking details are given in the following Table 8.

Table 8 Failure modes; their corresponding RPN calculation and Ranking details

From Table 8, it can be seen that RPN values for all the failure modes are not significantly high and thus can be excluded on taking special care. A Pareto chart as shown in Figure 5 is prepared to evaluate the significant failure modes.

Fig. 5
figure 5

Pareto chart to depict significant failure modes

The above Pareto chart in Figure 5 depicts one horizontal axis and two vertical axes. The horizontal axis depicts the rank of corresponding failure mode, left vertical axis depicts the RPN value for each failure modes, whereas right vertical axis depicts the cumulative contribution of each failure modes within total RPN.

Pareto chart depicted there are fifteen significant failure modes (rank 1 to rank 15) contributing to eighty percent of total RPN.

The concerned engineer along with marketing personnel and other experienced persons were involved in getting proper suggestions to mitigate RPN values of the identified failure modes. The related suggestions vis-à-vis fifteen failure modes are provided in Table 9. These may alleviate the criticality and improve the overall system reliability and effectiveness.

Table 9 Failure modes and corresponding suggestions

Discussion

The reliability depletion trend of the vulnerable sub-systems was plotted in a single graph depicted in Figure 4, for the purpose of their ranking based on reliability depletion trend. The mine conveyor with highest rate of reliability depletion was found most vulnerable, followed by ram car, electrical, cutter and feeder breaker.

Further, FMEA was applied on these vulnerable sub-systems to identify relevant failure modes. Subsequently, Pareto chart was used to identify the significant failure modes contributing to eighty percent of total RPN value.

In the Pareto chart depicted in Figure 5, a horizontal gridline passes through 80% mark intersecting the cumulative percentage curve at a point, from that intersection point passing a vertical gridline meeting the horizontal axis. There are fifteen failure modes which are at left to that vertical gridline, contribute to eighty percent of total calculated RPN and can be termed as “significant failure modes”. These significant failure modes were considered for special controls to mitigate the RPN value and rest of the failure modes were excluded from further studies.

Further, FMEA analysis can be carried out after considerable time from implementation of recommended controls; to verify the improvement of system and sub-system performance as well as effectiveness.

Conclusion

This study identified fifteen potential failure modes from five sub-systems, contributing to overall system failure. A discussion among group of experienced personnel and engineers was conducted to find suitable solutions to these failure modes. The actions or controls suggested out of the discussion to reduce the criticality of failure modes are believed to be effective in improving the machine performance as well as Overall Equipment Effectiveness (OEE).

Evaluation of OEE, reliability analysis and FMEA of the system and sub-systems must be carried out in regular predefined interval to mitigate their further criticality.

Finally, it can be concluded that this paper may be a handy guide for stepwise procedures of reliability analysis and Failure Modes and Effects Analysis of Continuous Miner-based underground coal production system, which can be applicable to other machines as well.