Keywords

1 Introduction

Reliability analysis is one of system characteristics, that has to be taken into account as failures (system breakdown) can have fatal consequences. Important step in the system reliability is constructing of mathematical model of system [3, 19, 35]. The system mathematical model is constructed depending on the specifics of system analysis and properties of the investigated system. Depending on number of system performance levels, mathematical models can be divided into two types depending on the detail of the analysis [19]. The first of them is Binary-state systems. This model can be used to analyse system behaviour on two performance levels - system is working or system fails. This model is effectively used to analyse system failures. The other type of systems are Multi-State Systems (MSS). Using this model, we can analyse system behavior on more than two performance levels. This is used to analyse gradual degradation of system performance, however this model is more complex to analyse as binary-state.

Except system type based on number of states, there are different mathematical models depending on mathematical methods used to analyse this system. One of these mathematical methods is Boolean algebra [9]. In this case, structure function is used as mathematical model. Structure function clearly maps system components performance level to system performance [25, 33]. Advantages of using structure function as mathematical model are for example uniform approach to analyse system using Boolean or multiple-valued logic [33]. However in case of real systems, structure function can have large dimension. Therefore it is necessary to represent it in efficient way, such as Multi-Valued Decision Diagram (MDD) [22]. MDD is graphical orthogonal and canonical form for representation of logical function of large dimension. Usage of these diagrams for MSS structure function was considered in [2, 31].

In order to create structure function, full information about system behavior is required. The problem is, in many real systems, there is lack of such information [3, 32]. Therefore different methods have been developed for reliability evaluation based on incompletely specified data, for example, graph databases based methods [8, 27] or datamining based methods [26, 32]. In case of the use of datamining based methods the structure function is interpreted as classification structure that should be created based on incompletely specified [32]. In this paper methods of datamining, specifically decision trees will be used in order to handle this problem [1, 5, 20, 26]. In this case the structure function is represented in form of decision tree that is transformed into MDD [22].

Created mathematical model in form of MDD is in next step of reliability analysis used to calculate system reliability indices and measures, such as availability [13, 35], importance measures [14], minimal cuts and paths set [15] etc. These characteristics can be divided into two types - topological and probabilistic. Topological analysis does not require information about state probabilities and can be used to analyse system behavior based on its topology. Typical example of this characteristic is structure importance [11]. The probabilistic analysis takes into account also probabilities of individual component performance. Typical example of probabilistic importance measure can be Birnbaum’s importance. In most of investigations reliability indices and measures are considered for coherent system. But there are special group of non-coherent systems in reliability analysis. The degradation of some of components in such systems does not always lead to a degradation or failure of the system. This fact causes specific in reliability analysis of these system because the structure function of such system is not monotone. There are some investigations for analysis of non-coherent Binary-state systems [4, 10, 34], but the evaluation of MSS non-coherent systems is not investigated sufficient. In this paper the initial investigation for non-coherent MSS is considered based on calculation of Important measures.

Organization of this paper is following. Section 2 contains detailed information about used mathematical model - structure function together with its representation in form of multi-valued decision diagram. Section 3 contains description of system characteristics and their calculus and calculus of structure importance for multi-state non-coherent systems. These methods are used in Sect. 4 to analyse Bike crashes dataset and finally Sect. 5 concludes previous methods together with plans for next work.

2 Reliability Analysis

Reliability analysis of system consist of 2 essential steps:

  1. 1.

    Creation of mathematical model

  2. 2.

    Quantitative analysis - i.e. calculation of system characteristics and different types of reliability indices. In this paper importance measures will be used, with focus on topological analysis.

There are different mathematical models in reliability analysis. The creation of the mathematical model is caused by system properties and specifics of reliability analysis. For any mathematical model is defined:

  • number of system states

  • type of mathematical model.

The number of system states or system performance levels is caused by analysis detail. There are two groups of mathematical models:

  • Binary-State System (BSS) is modelled in two performance levels - i.e. system is either working or not. This approach is used, if system is binary-state from its nature [6, 16], or we are analysing consequences of system failure [35].

  • Multi-State System (MSS) is modelled in more than two performance levels. This approach allows us to describe gradual degradation of system performance from fully working to fully broken [18, 19, 25].

MSS allows the system analysis in more detail but computational complexity of this analysis increases and special methods and algorithms should be developed for quantitative analysis of MSS. The methods of quantitative analysis of MSS and other system associate with the types of mathematical model. The types correlates with background methods used in quantitative analysis. As a rule stochastic methods, methods of Boolean logic or algebra logic are used as background methods in reliability analysis [3]. The type of mathematical model correlated with algebra logic in reliability analysis is named structure function. This mathematical model can be used for representation of system of any topological complexity. It has favorable rules for the construction. But the computational complexity of the structure function analysis increases depending on this function dimensional. The development of special methods and algorithm for the structure function analysis allows creating effective approaches for reliability analysis of system.

2.1 Structure Function

Dependency between system performance and performance of system components can be expressed using Structure Function. It can be defined as following [11]:

$$\begin{aligned} \begin{aligned} \phi (x_1, x_2, ..., x_n) = \phi (\varvec{x}) : \{0, 1, ..., m_1-1\} \times \{0, 1, ..., m_2-1\} \\ \times ... \times \,\{0, 1, ..., m_n-1\} \rightarrow \{0, 1, ..., m-1\} \end{aligned} \end{aligned}$$
(1)

where \(x_i\) is i-th system component state, \(i \in \{1, 2, ..., n\}\), m is number of system performance levels and \(m_i\) is number of states of component i. In case \(m = m_i = 2\) system is Binary-state (BSS).

Structure function can be used in topological analysis of system. But in case probabilistic analysis is required, we need to know not only structure function, but also state probabilities of individual system components. The i-th component state probabilities will be denoted as following [11, 30]:

$$\begin{aligned} \begin{aligned} p_{i1}&=\text {Pr}\left\{ x_i=1\right\} , ..., \\ p_{i,j}&=\text {Pr}\left\{ x_i=j\right\} , \\ q_i&=\text {Pr}\left\{ x_i=0\right\} \end{aligned} \end{aligned}$$
(2)

Depending on system behavior, systems can be divided into two types - coherent and non-coherent. In non-coherent systems there are cases, when degradation of system component leads to increase of system performance level. This is not possible for coherent systems. From structure function point of view according to [11] system is coherent, if system meets 2 conditions:

  1. 1.

    Structure function is non-decreasing - This condition implies, that improving of state of any component does not degrade the performance of system and vice versa.

  2. 2.

    Each system component is relevant - System component is relevant, if there exists at least one case in that the state of component dictates the state of the system.

If at least one of these conditions are not met, the system is non-coherent. Majority of technical systems are coherent in general. Examples of non-coherent systems can be systems including human factor. Case study presented in this paper is also non-coherent system.

The main problem with traditional methods in reliability analysis is the fact, we need all information about system behaviour to create mathematical model. In many real systems, there is lack of such information, i.e. we either don’t know all system components or we don’t know system behavior based on these components behavior in some cases [3]. The other problem can be ambiguous data. This can be caused by many factors, for example inaccuracy of measurement [26].

This problem can be handled in different ways, for example usage of graph databases [8, 27]. In this paper, we will be used methods of datamining to create decision trees [1, 5, 20]. This tree will be then reduced into multi-valued decision diagrams using reduction rules described in [21, 24, 28].

2.2 Multi-valued Decision Diagram

Structure function can be represented in multiple ways such as truth table, reliability block diagram, fault tree, etc. One, effective representation of structure function is Multi-valued Decision Diagram (MDD). According to publications [21, 23], MDD is rooted acyclic graph that meets two conditions:

  1. 1.

    graph is canonical - the representation is unique for a particular variable ordering

  2. 2.

    graph is compact - any other graph representation contains more nodes.

There are two type of nodes in MDD - sink and non-sink. Sink nodes, labeled by numbers from 0 to \(m-1\), express system performance levels. There are exactly one sink node for each system state. Non-sink nodes represents system components. Node representing component \(x_i\) has exactly \(m_i\) outgoing edges expressed state of representing component.

Thanks to fact, MDD is orthogonal form of Structure Function [22], it can be used for the probabilistic analysis. In this case this graph is edge-weighted. Weight of j-th outgoing edge of node representing component \(x_i\) is equal to probability \(p_{i,j}\), that component \(x_i\) is in state j.

3 Quantitative Analysis

Created mathematical model can be in the next step used to calculate system characteristics such as availability and unavailability.

Availability and Unavailability. Availability \(\text {A}^{\ge j}\) can be defined as probability system is at least in j-th performance level. Unavailability can be defined similarly - i.e. probability system performance level is lower than j [18, 19, 25]:

$$\begin{aligned} \begin{aligned} A^{\ge j}(\varvec{p})&= Pr\{\phi (\varvec{x}) \ge j\},\\ U^{\ge j}(\varvec{p})&= Pr\{\phi (\varvec{x}) < j\},\\ U^{\ge j}(\varvec{p})&= 1 - A^{\ge j}(\varvec{p}) \end{aligned} \end{aligned}$$
(3)

These are characteristics of whole system but give us no information about how system performance in influenced by performance of its components. For this purpose importance measures has to be calculated. There are 2 types of importance measures - topological and probabilistic. This paper will focus on topological analysis, therefore structure importance will be used.

3.1 Structure Importance

Structure importance can be used to express influence of component to system performance in topological point of view.

$$\begin{aligned} \text {SI}^\downarrow _{i\downarrow } = \text {TD}\left( \frac{\partial \phi (1 \rightarrow 0)}{\partial x_i(1 \rightarrow 0)}\right) , \end{aligned}$$
(4)

where TD(.) stands for Truth Density, i.e. function described as relatively number of cases function (.) gets value 1 to all possible cases [15]. Function \(\frac{\partial \phi (1 \rightarrow 0)}{\partial x_i(1 \rightarrow 0)}\) is partial Boolean derivation and can be defined as following [29]:

$$\begin{aligned} \frac{\partial \phi (1 \rightarrow 0)}{\partial x_i(1 \rightarrow 0)} = \phi (x_1, ..., 1, ... x_n) \wedge \overline{\phi (x_1, ..., 0, ... x_n)}. \end{aligned}$$
(5)

Equation 4 suppose analysed system is coherent. Therefore it is necessary to investigate only cases, when system component failure leads to system failures. There are multiple structure importance for non-coherent systems. The next one takes into account cases, when system component failure leads to system repair. This can be defined using following equation:

$$\begin{aligned} \text {SI}^\downarrow _{i\uparrow } = \text {TD}\left( \frac{\partial \phi (1 \rightarrow 0)}{\partial x_i(0 \rightarrow 1)}\right) \end{aligned}$$
(6)

The total influence of system component can be calculated as a sum of Eqs. 4 and 6 [12]:

$$\begin{aligned} \text {SI}^\downarrow _{i} = \text {SI}^\downarrow _{i\downarrow } + \text {SI}^\downarrow _{i\uparrow } = \text {TD}\left( \partial \phi (\varvec{x} / \partial x_i \right) \end{aligned}$$
(7)

All of previous equations apply for binary-state systems. Structure importance for multi-state systems can be generalized as following:

$$\begin{aligned} \text {SI}^\downarrow _{i} = \text {TD}\left( \frac{\partial \phi (j \rightarrow k)}{\partial x_i(r \rightarrow s)}\right) , \end{aligned}$$
(8)

where \(\phi (j \rightarrow k)\) means system change state from state j to state k; \(j, k < m\). Expression \(x(r \rightarrow s)\) means system component change state from r to state s; \(r, s < m_i\). \(m_i\) is number of i-th system component states.

Formula 8 can be used for both - coherent and non-coherent systems. In case of coherent systems, condition \(j> k \wedge r > s\) has to be met.

For non-coherent systems, there are also multiple structure importance. \(\text {SI}^\downarrow _{i\downarrow }\) is common with previous case. For the next one, \(\text {SI}^\downarrow _{i\uparrow }\) investigating decrease of system performance caused by increase of system component performance, one of condition \(j > k \wedge r < s\) or \(j < k \wedge r > s\) has to be met. Total topological influence of component can be calculated as sum of these two values.

4 Case Study

In this section we will demonstrate application of described methods in dataset obtained from [7]. This dataset contains information about bicycle crashes such as driver age, whether ambulance was called or not, what was the weather like etc., together with information if bicycle driver was killed, injured or without injure. Whole dataset consists of 11 attributes and 162 records. In order to simplify manipulation with data, each attribute value was mapped into numeric value. Complete list of attributes together with possible values and their mapping can be seen in Table 1.

Table 1. Details of Bike Crashes Dataset

This dataset was analysed using methods of datamining presented in [17, 32]. Using these methods, decision tree was created. This tree was in the next step reduced into MDD using methods described in paper [21, 24, 28]. Resulting MDD can be seen in Fig. 1. Created mathematical model can be used to perform quantitative analysis as described in previous section.

Fig. 1.
figure 1

MDD of Bike Crashes Dataset

Fig. 2.
figure 2

Results of \(\text {SI}^\downarrow _{i\downarrow }\) for Bike Crashes Dataset

In the first step, structure importance for each component was calculated according to Eq. 4. This contains only cases, when decrease of system component state leads to decrease of system performance. Results of this calculation can be seen in Fig. 2. The x axis contains individual component and system changes. On the y axis there are values of \(\text {SI}^\downarrow _{i\downarrow }\) for every component. So for example values in column “\(2\rightarrow 0|1\rightarrow 0\)” is equal to value of Structure Importance index, when system component state changes from 2 to 0 and system changes from 1 to 0.

From these results we can identify most crucial components according to system structure. These can be seen in Table 2.

Similarly, second structure importance was calculated according to Eq. 6. This contains only cases, when system component performance decrease leads to case system increase its state. Results of this calculation can be seen in Fig. 3. The most crucial components together with value of its structure importance can be seen in Table 3.

Table 2. Attributes with highest values of \(\text {SI}^\downarrow _{i\downarrow }\) for Bike Crashes Dataset
Table 3. Attributes with highest values of \(\text {SI}^\downarrow _{i\uparrow }\) for Bike Crashes Dataset
Fig. 3.
figure 3

Results of \(\text {SI}^\downarrow _{i\uparrow }\) for Bike Crashes Dataset

Finally, structure importance according to Eq. 7 was calculated. This includes both cases - when decrease of system component leads to decrease of system state and decrease of system component leads to increase of system state. Results of this case can be seen in Fig. 4. The components with highest structure importance can be seen in Table 4.

Fig. 4.
figure 4

Results of \(\text {SI}^\downarrow _{i}\) for Bike Crashes Dataset

Table 4. Attributes with highest values of \(\text {SI}^\downarrow _i\) for Bike Crashes Dataset

According to Fig. 4, the most crucial attribute of this system is biker alcohol. This can be interpreted as the fact, biker drank alcohol or not is the most significant factor that makes difference between his death or injury. The next crucial attribute is ambulance. The fact ambulance was called to accident has significant impact to result of the accident. The next one is biker sex. The dataset implies, that women has less deaths than men. Significant role in biker survival is caused also by driver speed, where change from 51–55 mps to 31–40 or from 41–50 to 31–40 leads to the fact biker will be alive.

On the other hand, factors as weather or bike direction (whether it is with traffic it) has no impact on result of the accident. However it should be noted, that accuracy of these results depends on amount of available data.

5 Conclusion

This paper contains reliability analysis calculus of multi-state systems based on incompletely specified data. This analysis consists of two steps. The first one is creation of mathematical model. For this purpose we used structure function as it can be used for analysing system behavior based on behavior of its components. Structure function can be represented in multiple ways. In this paper it was represented in form of MDD and it is constructed through the induction of decision tree based on incompletely specified data. MDD can be used for both - topological an probabilistic analysis and can be easily created from decision tree.

The next step in reliability analysis is quantitative analysis using this model. In this paper, we investigate topological analysis of multi-state non-coherent systems, specifically calculus of structure importance of these systems. In difference of other investigations in this paper the non-coherent MSS is considered and mathematical background for calculation of Importance measures is presented.

Mentioned methods was demonstrated on dataset that contains information about bike crashes. Using methods of datamining this dataset was analysed and decision tree was created. This tree was in the next step reduced into MDD from which topological analysis was calculated.

In our future work, we will investigate also probabilistic analysis of multi-state non-coherent systems. This analysis will be based on structure function represented using multi-valued decision diagrams.