Introduction

The efficient usage of capital-intensive equipment in the work environment results in the accomplishment of the anticipated degree of production and productivity. In the present competitive business environment, every industry is regularly searching for improvement in their day-to-day production levels to survive with a noble reputation in society [1]. In the underground mining segment, utilized equipment for transportation purpose, i.e., load haul dumper (LHD), assumes an indispensable job in the accomplishment of the desired level of production rate. The recorded underground metal mine’s production in India is not at a satisfactory level from the last few decades. Unavailability of the equipment in the working phase is only the prominent reason for the fall of production levels in the industry [2]. The best probable utilization of equipment can be possible when the probability of equipment is readily available; the consequence of this should lead to an increase in the production levels of equipment. Enhancement of machinery availability can be possible through a reduction in the downtime hours.

Assessment and prediction of the reliability of intricate repairable assemblies play an important role in the estimation of overall performance. In general, complicated system performance usually relies on the usage of equipment, working ambiance, and adequacy of upkeep, operational techniques, and specialized expertise of the administrators. Dependability or reliability forecast helps to manage the activities of system operation and maintenance condition [3]. Dependability assessment is one of the key techniques to estimate the required outcomes. This assessment can be helpful to highlight the features of production flow in the industry and its reputation [4]. Untrustworthy components in the system assembly direct to unexpected stoppage of equipment operation. Restore as well as replacement activity provides a benchmark for disappointment segments to identify their leftover functional time [5, 6]. The enhancement of the required reliability level of percentage can be possible by undertaking suitable maintenance practices.

The activity of maintenance can be conducted in two different directions: One of the ways is performing maintenance activity in every scheduled time known as scheduled maintenance or preventive maintenance. In this activity, most of the failed parts/components can be repaired in the work environment as well as workshop premises and after successful completion of the repair action the parts can be restored in their original position. On the other hand, corrective maintenance (CM) can be performed to the parts or components that are unable to repair in the maintenance action. These parts can be replaced with new modified designs. The failures that cannot be possible to repair at the time of PM are called as censored failures. These failures can lead to an increase in both maintenance and operational costs. Statistical-based reliability methods can provide additional insight for the machinery during the estimation of the reliability, maintainability, and availability [7]. Keeping this in view, this research work mainly focuses on reorganization of frequent failures, identification of potential causes for the occurrence of these uncertainties, estimation of each subassembly reliability percentage, and identification remedial actions to control the influencing factors of reliability. The summary of the literature for the present research work is systematically arranged in Table 1.

Table 1 Summary of the literature review

Course of Action

The required information/data can be gathered either from constant checking of tests or from the presence of previous chronicled disappointment information which was put away in the support records. The arrangement of the information should be possible as per the kind of disappointment (Table 2). This information is helpful to evaluate the time between disappointment (TBF) and time to fix (TTR) [8, 9]. In this present study, one financial year failure data of LHDs were collected during the operation of the vehicle. These are classified according to the type of failure mode. TBF and TTR were estimated corresponding to the failure and repair data. These data sets were validated for the identification of independent and identically distributed (IID) nature using the trend and serial correlation tests [10, 11]. After validation of the IID assumption, the data sets were considered for the estimation of best-fit approximation by Kolmogorov–Smirnov (K–S) test [12]. Best-fit approximation models are important in the reliability forecasting of each subcomponent [13]. The procedure of reliability analysis of a complex repairable system has explained in a flow chart (Fig. 1) as follows [14]:

Table 2 Subsystems classification of LHDs
Fig. 1
figure 1

Source: Ascher H et al. 1984

Reliability analysis of a complex repairable system procedure.

Case Study

In this research, five numbers of LHD machines deployed in an Indian underground lead and zinc mine. The considered machines for the present analysis are made from M/s The Sandvick Company Limited with 17 tonne bucket capacity and named as LHD1, LHD2, LHD3, LHD4, and LHD5. The LHD is treated as the main workhorse intends for transportation in underground mining operations. The drilling and blasting approaches are utilized to extract the ore. The extracted ore is transported from the mined-out area to the primary belt conveyor point through an intermediate mechanized system called the LHD machine. A typical LHD machine at the workshop and during the repair action is shown in Fig. 2a and b.

Fig. 2
figure 2

(a) & (b). A typical LHD machine at the workshop and during the repair action. Before performing the reliability analysis, each machinery must be categorized into several subsystems for identifying potential failure modes [15]. Theses categorizations were made based on past historical records like day-to-day worksheets maintained by maintenance personnel about the breakdowns. In the current study, LHD was classified into seven subsystems/subassemblies (Table 2)

The two years (2015–2016 and 2016–2017) of the breakdown were collected for all the LHD systems of LHD1 to LHD5 to carry out the required analysis. This breakdown information is in the form of spreadsheets prepared by maintenance personnel and computerized soft copies of day-to-day failures. This information was comprised of three metrics such as the failure frequency (FF), the time between failure (TBF), and the time to repair (TTR). Collected data of various LHDs from a field visit are given in Table 3, and the failure frequency of each subsystem is shown in Fig. 3.

Table 3 Collected data of various LHDs from a field visit
Fig. 3
figure 3

Failure frequency of various subsystems

Results and Discussion

Key Performance Indicators (KPIs)

The reorganization of the status of the equipment can be helpful as a guideline for carrying out further analysis. The performance of the equipment can be projected by computing the key performance indicators (KPI) such as availability percentage (AP) and utilization percentage (UP). The AP is defined as the percentage of equipment which is readily available to perform the specified task in its work environment known as AP. It can be computed with the ratio of machine available hours (MAH) to the scheduled working hours (SWH) (Eq 1). The idle time of equipment with less than 15 min is need not be considered while calculating the percentage of available time.

$${\text{AP}} = \frac{{\text{MAH}}}{{\text{SWH}}} \times 100$$
(1)

UP can be defined as the ratio of available machine working hours or utilized hours to the shift scheduled hours. Depending upon the denominator value, the quantity of UP can be varied. In a study, available machine hours (AMH) are all the time smaller than the scheduled shift hours (SSH) (Eq 2). The computed values of AP and UP are given in Table 4, and the percentage difference is shown in Fig. 4.

$${\text{UP}} = \frac{{\text{AMH}}}{{\text{SSH}}} \times 100$$
(2)
Table 4 AP and UP of various LHDs from a field visit
Fig. 4
figure 4

Percentage difference of AP and UP

Trend as Well as Serial Correlation Checking

These examinations are utilized to check the breakdown information of every individual subsystem; to decide the IID attributes. The graphical analysis between cumulative failure frequencies (CTFF) in opposition to cumulative time between failures (CTBF) determines the trend of existence or not in the collected field failure data. The statement data sets are free from the presence of a trend that can be said when the failure data set points are not in a straight line manner [16, 17]. The presence of correlation between the data sets was determined by performing the graphical analysis with the ith estimation of TBF and (i − 1)th estimation of TBF. Scatter plots of informational indexes between the ith estimation of TBF and (i − 1)th estimation of TBF show the connection among the two qualities [18]. From the graphical analysis (Figs. 5, 6, 7, 8 and 9), it was observed that the data set points are not passing through the straight line and conclude that there is no existence of a trend in the data sets. Because of the serial correlation test, the focuses are dissipated haphazardly, which displayed no relationship. The aftereffect of these tests demonstrates that the informational collections of the considerable number of subsystems were found as pattern-free and the focuses are indistinguishably dispersed. Consequently, the IID supposition for the informational collections was not denied for every subsystem.

Fig. 5
figure 5

(a) Trend test of LHD1. (b) Serial correlation test of LHD1

Fig. 6
figure 6

(a) Trend test of LHD2. (b) Serial correlation test of LHD2

Fig. 7
figure 7

(a) Trend test of LHD3. (b) Serial correlation test of LHD3

Fig. 8
figure 8

(a) Trend test of LHD4. (b) Serial correlation test of LHD4

Fig. 9
figure 9

(a) Trend test of LHD5. (b) Serial correlation test of LHD5

The Best-Fit Approximation

The estimation of best-fit approximation of the data sets is necessary for identification of the maximum likelihood estimation (MLE) parameters such as scale, shape, and location parameters. These parameters are estimated using ‘Isograph Reliability Workbench 13.0 (IRW)’ software and are used at the time of reliability percentage estimation. The theoretical probability distribution functions of exponential, 1-parameter Weibull, 2-parameter Weibull, and 3-parameter Weibull, were considered for comparison of best-fit distribution functions. From the outcomes (Table 5) of best-fit approximations, it was found the 3-parameter Weibull distribution function was best fitted for the data sets of LHDs. Distinguishing proof of the best fit for the informational indexes has made using the Kolmogorov–Smirnov (K–S) test. Least estimation of the degree of significance (ε) in the K–S test was treated as a better fitment. The assessed parameters for best-fit circulation work with MLE are introduced in Table 6.

Table 5 Estimated best-fit approximations of LHD machines
Table 6 Probability distribution parameters from MLE

Reliability and Maintainability Analysis

The term reliability is stated as the likelihood of an item or product to perform its intended task before undergoing a failure. The reliability function of a 3-parameter Weibull distribution (Eq  3) is given as

$$R(t) = e^{{ - (\frac{t - \gamma }{\eta })^{\beta } }}$$
(3)

The failure rate is (FR) an important measure in system performance and is defined as the number of failures happened in a product for a particular period. FR can also be called as the hazard function of a system (Eq 4).

$$h(t) = \frac{\beta }{\eta }\left( {\frac{t - \gamma }{\eta }} \right)^{\beta - 1}$$
(4)

The probability density function of 3-parameter Weibull distribution (Eq 5) is given as

$$PDF = f(t) = \frac{\beta }{\eta }\left( {\frac{t - \gamma }{\eta }} \right)^{\beta - 1} e^{{ - \left( {\frac{t - \gamma }{\eta }} \right)^{\beta } }}$$
(5)
$$CFD = F(t) = 1 - e^{{ - \left( {\frac{t - \gamma }{\eta }} \right)^{\beta } }}$$
(6)

The mean time to failure (MTTF) or mean time between failure (MTBF) is defined as the average life of failure-free operation of equipment up to a consequent occurrence of a failure. The Weibull PDF function of MTTF or MTBF is specified as

$$MTBF = MTTF = \frac{1}{\lambda }$$
(7)

where \(\lambda = h(t)\) and λ = failure rate.

According to the allocation of best-fit distribution, the renewal process was implemented as a reliability modeling technique for the estimation of each subsystem’s reliability. The percentage of reliability was computed for TBF data sets of each system using Eq 3 and is illustrated in Table 7. The maintainability (Eq 8) and failure rate (Eq 9) metrics were estimated with a mean time between failure (MTBF) and mean time to repair (MTTR) values. The value of MTBF was computed with the ratio of CTBF and the total number of failures. Similarly, MTTR was computed with the ratio of CTTR with a total number of failures.

$$\textit{Mantainability}\, \left( {Mw} \right) = 1 - e^{{ - \left( {\frac{MTTR}{\eta }} \right)^{\beta } }}$$
(8)
$$\textit{Failure~rate} \left( { \leftthreetimes } \right) = \frac{1}{MTBF}$$
(9)
Table 7 Percentage of reliability and unreliability of subsystems

Estimation of Overall System Reliability

Reliability is a statement of probability, so complex systems are analyzed with logical statements and logical arithmetic. The reliability-wise relationship of components in a system can be represented graphically with a reliability block diagram (RBD) [19]. Reliability block diagram (RBD) is a deductive method utilized to estimate the overall system reliability. Keep in mind that the RBD is not the same as connectivity, system, or physical configuration diagram. It is also important to note that the value of reliability mainly depends upon the variation in the periods, that is, reliability of 0.1 means a component has a 90% probability of failure during a specified operational period. The estimation of overall system reliability is only possible by performing the analysis either in series or parallel configuration system dependency, that is, if one component fails, does it affect the reliability of the others.

In this analysis, reliability-wise relationship of the components was identified as all the subsystems were connected in the series configuration for all the equipment (Table 8). An example of RBD for the LHD1 system is shown in Fig. 10. Therefore, the reliability of each system was estimated with series configuration calculations (Table 9). The following empirical Eq 12 was utilized to estimate the overall system reliability:

$$\begin{aligned} R_{LHD1} \left( t \right) & = e^{{ - \left( {\frac{{t_{SSE} - \gamma_{SSE} }}{{\eta_{SSE} }}} \right)^{{\beta_{SSE} }} }} + e^{{ - \left( {\frac{{t_{SSBr} - \gamma_{SSBr} }}{{\eta_{SSBr} }}} \right)^{{\beta_{SSBr} }} }} \\ & \quad + \;e^{{ - \left( {\frac{{t_{SSTy} - \gamma_{SSTy} }}{{\eta_{SSTy} }}} \right)^{{\beta_{SSTy} }} }} + e^{{ - \left( {\frac{{t_{SSH} - \gamma_{SSH} }}{{\eta_{SSH} }}} \right)^{{\beta_{SSH} }} }} \\ & \quad + \;e^{{ - \left( {\frac{{t_{SSEl} - \gamma_{SSEl} }}{{\eta_{SSEl} }}} \right)^{{\beta_{SSEl} }} }} + e^{{ - \left( {\frac{{t_{SSTr} - \gamma_{SSTr} }}{{\eta_{SSTr} }}} \right)^{{\beta_{SSTr} }} }} + e^{{ - \left( {\frac{{t_{SSM} - \gamma_{SSM} }}{{\eta_{SSM} }}} \right)^{{\beta_{SSM} }} }} \\ \end{aligned}$$
(10)
$$R_{LHD1} \left( t \right) = R_{SSE} + R_{SSBr} + R_{SSTy} + R_{SSH} + R_{SSEl} + R_{SSTr} + R_{SSM}$$
(11)
$$Rs = \mathop \prod \limits_{i = 1}^{n} Ri \times 100$$
(12)

where Rs denotes the overall system reliability, i indicates the number of subsystems, i.e., 1,2,3… n, and R indicates the reliability of each subsystem. The variation in predicted values from ‘Isograph Reliability Workbench 13.0’ of the percentage of reliability of each subsystem and system is shown in Figs. 11 and 12.

Table 8 Availability and maintainability results
Fig. 10
figure 10

Reliability Block Diagram (RBD) of LHD1

Table 9 Reliability results (R in %) of each subsystem and system
Fig. 11
figure 11

Percentage of reliability of each subsystem

Fig. 12
figure 12

System-wise percentage of reliability

Reliability-Based Preventive Maintenance (PM) Time Schedules

Forecasting of preventive maintenance (PM) time intervals is very essential for the improvement in the reliability as well as reducing the failure rate of any kind of system or subsystem [20]. The calculated results of reliability-based preventive maintenance time intervals for the expected rate of reliability levels are presented in Table 10. It was understood that if the desired reliability for LHD is 90%, then PM must perform for every 538 h. Similarly, for LHD2 to LHD5 it is 367, 349, 620, and 288 h, respectively. PM can be also be defined as the actions executed to hold the machinery in an indicated state by giving the well-organized evaluation, reorganization, and furthermore avoidance from claiming early failure [21].

Table 10 Each LHD system’s PM time schedules

Conclusion

Reliability assessment techniques have been gradually accepted as standard tools during the planning and operation of simple to complex engineering systems for the past six decades. In this paper, a case study describing the reliability investigation for a fleet of LHDs in the underground mining industry was performed. Primarily, the performance of equipment was calculated, and it is noticed that from the results of Table 5, the least value of AP has identified for LHD3 (70.10%) and the highest value is for LHD1 (79.23%). Availability is the measure of maintainability and reliability. The required levels of a generation of productivity can be possible only when the equipment is readily available to perform its intended task. From the corresponding values of UP, it was found that the utilization of all the equipment is unsatisfactory. The utilization percentage can be improved by minimizing the idle times of the machine. This includes insufficient availability of ore to transport, shift changing of the personnel, harsh environmental conditions, and traffic in the underground. This can be controlled by undertaking better managerial and operational practices.

As the reliability investigation is one of the well-sophisticated techniques to forecast the life of the machinery, the present study has drawn the results of the reliability percentage of each subsystem and system of every LHD machine. From the results of reliability (Table 9), the very least value of the reliability was noticed for SSEl (16.00%), SSTy (20.56%), and SSE (21.73%). It was concluded that these subsystems are the most critical as compared with the others and assessed that more concentration needs to be kept improving their life. Similarly, overall system reliability (Rs) of each LHD machine was performed by considering all the subsystems as connected in series (RBD). The highest level of Rs was obtained for LHD1 (69.11%) and level obtained for LHD3 (56.77%) as compared with other systems. The achievement of (Fig. 12) least percentage of Rs is due to happening of frequent failures with fewer TBFs. Therefore, it is suggested that the poor efficiency equipment must be maintained at an adequate level by designing the optimal maintenance practices.

Anticipating of reliability-based PM time intervals will be used as a technical base for performing scheduled maintenance activity. The remaining useful life of the machine can be accomplished by performing the PM from time to time. From the determined consequences of PM time intervals, if the prerequisite of reliability is 90%, at that point the PM should conduct in every 539 h for LHD1, and others are given in Table 10. This examination observed that because of divergent operational and environmental conditions, diverse LHD machines ought to require distinctive maintenance strategies. For efficient maintenance planning and organization, each equipment’s reliability requirements are estimated individually. The present analysis provides a base for maintenance personnel in the industry to mitigate or control the uncertainties present in the equipment for the enhancement of equipment reliability.