Keywords

1 Introduction

Patients are placed in an intensive care unit (ICU) following major surgery. Continuous monitoring is essential to the daily care of ICU patients. Ongoing and continuous monitoring is achieved by complex devices that require special training and experience to operate. If certain physiologic limits are exceeded, the devices set to generate alarms. Those alarms need to be investigated by following strict protocols. The measured observations are highly correlated for a given patient and the condition of the patient is assessed with individual observations. It is imperative to maintain overall variability of all the observations with respect to the accepted variability (target variance-covariance matrix) in order to achieve stable medical condition for the patient.

In this paper, Multivariate Exponentially Weighted Moving Variance (MEWMV) chart [1] has been used as the variance shift identifier for individual observations. The advantage of MEWMV chart is its ability to monitor variance while stabilizing mean of the process. The challenge in deploying multivariate control chart [2] is to identify variables responsible for the out-of-control (OOC) signals (fault diagnoses). Several machine learning approaches such as Artificial Neural Networks (ANN) and Support Vector Machines (SVM) have been proposed [3,4,5,6,7,8,9,10,11,12] for fault diagnoses. Hsu et al. [13] proposed an improved artificial neural networks called ANNIGMA approach. The method incorporates weights analysis based heuristic to direct the search in the wrapper model and allows effective feature selection for ANN.

This paper for the first time presents new strategy for fault diagnosis of variance shifts in multivariate clinical processes. The proposed approach integrates the deployment of MEWMV chart with ANNIGMA.

2 MEWMV Chart

MEWMV chart has the ability to monitor multivariate process variability while stabilizing the mean. Huwang et al. [1] has introduced the chart by estimating optimal control limits for up to 3 variables. Gunaratne et al. [14] have extended the charts ability to monitor high dimension multivariate processes by estimating control limits for up to 15 variables and introducing a mathematical model to estimate control limits beyond 15 variables.

Let a = (a1, …ap)1 be a random vector that represents p correlated quality characteristics from a multivariate process. Consider a transformation of a, \( \mathrm{X}={\sum}_0^{-1/2}\left(a-{\mu}_0\right) \) such that X is distributed as N(μ, Σ), where \( \mu ={\sum}_0^{-1/2}\left(\mu -{\mu}_0\right) \) and \( \sum ={\sum}_0^{-1/2}\sum u{\sum}_0^{-1/2} \). Consequently, when the process is in-control, X is distributed as N (0, Ip), where Ip is p*p identity matrix.

The MEWMV statistic Vt [1] is defined by

$$ {\mathbf{V}}_{\mathbf{t}}=\upomega \left({\mathbf{X}}_t-{\mathbf{Y}}_t\right){\left({\mathbf{X}}_t-{\mathbf{Y}}_t\right)}^{\prime }+\left(1-\upomega \right){\mathbf{V}}_{\mathbf{t}-1} $$
(40.1)

Where Yt is the predicted mean shift at sampling point t and its value is obtained by the multivariate exponentially moving average of Xt

$$ {\mathbf{Y}}_{\mathbf{t}}=\uplambda\ {\mathbf{X}}_{\mathbf{t}}+\left(1-\uplambda \right){\mathbf{Y}}_{\mathbf{t}-1} $$
(40.2)

They have shown that for 0 < ω<1, 0 < λ < 1 and t ≤ p, the matrix Vt is positive definite with probability 1.

Defining It as the t*t identity matrix and X, Y, D and N as follows

$$ {\mathbf{X}}_{\mathbf{t}}={\left({\mathbf{X}}_1,{\mathbf{X}}_2,\dots \dots {\mathbf{X}}_{\mathbf{t}}\right)}^{\prime } $$
$$ {\mathbf{Y}}_{\mathbf{t}}\kern0.5em ={\left({\mathbf{Y}}_1,{\mathbf{Y}}_2,\dots \dots {\mathbf{Y}}_{\mathbf{t}}\right)}^{\prime } $$
$$ \mathrm{D}=\operatorname{diag}\ \left({\mathrm{D}}_1,{\mathrm{D}}_2,.\dots \dots {\mathrm{D}}_{\mathrm{t}}\right) $$
$$ \mathbf{N}\kern0.5em =\kern0.75em \left(\begin{array}{c}\lambda \kern2.5em 0\kern3.5em \dots \kern4.5em 0\\ {}\lambda \left(1-\lambda \right)\lambda \kern3.5em \dots \kern3.75em 0\\ {}\kern1.5em \vdots \kern2.75em \vdots \kern6.75em \ddots \kern1.5em \vdots \\ {}\lambda {\left(1-\lambda \right)}^{t-1}\kern0.5em \dots \kern0.75em \lambda \left(1-\lambda \right)\kern1.5em \lambda \end{array}\right) $$

Huwang et al. [1] showed that Vt can be written as

$$ {\mathbf{V}}_{\mathbf{t}}={\left(\mathbf{X}-\mathbf{Y}\right)}^{\prime} \ \mathrm{D}\ \left(\mathbf{X}-\mathbf{Y}\right) $$
$$ =\kern0.5em {\mathbf{X}}^{\prime }{\left({\mathbf{I}}_t-\mathbf{N}\right)}^{\prime }\ \mathrm{D}\left({\mathbf{I}}_t-\mathbf{N}\right)\mathbf{X}\vspace*{-8pt} $$
$$ {\mathbf{V}}_{\mathbf{t}}={\mathbf{X}}^{\prime} \ \mathbf{RX}\vspace*{-8pt} $$
$$ \mathbf{E}\left[\boldsymbol{tr}\left({\mathbf{V}}_{\boldsymbol{t}}\right)\right]={\boldsymbol{p}}^{\ast} tr\left(\mathbf{R}\right) $$
(40.3)
$$ \mathrm{Var}\left[ tr\left({\mathbf{V}}_{\mathbf{t}}\right)\right]\kern0.5em =\kern0.5em 2\ p\ {\sum}_{i=1}^t{\sum}_{j=1}^t{r}_{ij}^2 $$
(40.4)
Fig. 40.1
figure 1

Time series plot for 9 variables

Therefore, the control limits for MEWMV chart are given by

$$ {\displaystyle \begin{array}{l}\mathrm{E}\ \left[ tr\left({\mathbf{V}}_{\mathbf{t}}\right)\right]\kern0.5em \pm \kern0.5em \mathrm{L}\sqrt{\mathrm{Var}\left[ tr\left({\mathbf{V}}_{\mathbf{t}}\right)\right]}\\[12pt] {}=\kern0.5em p\ tr\left(\mathbf{R}\right)\pm \kern0.5em \mathrm{L}\ \sqrt{2p{\sum}_{i=1}^t{\sum}_{j=1}^t{r}_{ij}^2\ }\end{array}} $$
(40.5)

The constant L depends on p, ω and λ. The value of L can be estimated by time consuming Monte Carlo simulations [15]. However, Gunaratne et al. [14] have developed predictive models to obtain the control limits for any high dimension multivariate process.

3 Annigma Approach

Hsu et al. [13] proposed a weights analysis based wrapper heuristic called ANNIGMA. It is a feature ranking approach which is mathematically derived from the back propagation training formulation of ANN. ANNIGMA also ranks the quality characteristics by the relevance which is based on the weight associated with the quality characteristic in a neural network based wrapper approach. Therefore, it is not necessary to train m neural nets for each branching point. Consequently, the speed of the fault diagnostic task using ANNIGMA would substantially increase.

Fig. 40.2
figure 2

Out-of-controls for MEWMV chart

For a two-layer neural network, if i, j and k are the input, hidden and output layer and F is a logistic activation linear function F(x) = 1/(1 +  exp (‐x)) then output of the network is given by Eq. (40.6).

$$ {O}_k={\sum}_jF\ \left({\sum}_i{A}_i\times {W}_{ij}\right)\times {W}_{jk} $$
(40.6)

where Wij and Wjk are the network weights. The local gain can be defined as:

$$ {LG}_{ik}=\left|\frac{\Delta\ {O}_k}{\Delta\ {A}_i}\right| $$
(40.7)

According to [13], the local gain LGik can be defined in terms of network weights by

$$ {LG}_{ik}=\kern0.5em {\sum}_j\ \left|{W}_{ij}\times {W}_{jk}\right|\kern0.5em $$
(40.8)

The ANNIGMA score for ith input and kth node is defined as [13]

$$ {ANNIGMA}_{ik}=\kern0.5em \frac{LG_{ik}}{\max\;\left({LG}_k\right)} $$
(40.9)

Neural Network weights give an estimate of the relative importance of input features.

4 Case Study: Application of the Proposed Approach in Monitoring ICU Patients

Authors have used observations from 9 correlated variables that play a substantial role in assessing the condition of a patient after open heart surgery. This is usually achieved by screening Systolic Blood Pressure (SBP), Diastolic Blood Pressure (DBP), Mean Arterial Pressure (MAP), Systolic Pulmonary Artery Pressure (SPAP), Diastolic Pulmonary Artery Pressure (DPAP), Mean Pulmonary Artery Pressure (MPAP), Pulmonary Capillary Wedge Pressure (PCWP), Central Venous Pressure (CVP) and Heart Rate (HR). The characteristics indicate highly significant correlation (P-value = 0.05) among them. A total of 600 samples of size one with 9 characteristics in each sample were available. The data recording times were recommended by the medical practitioners. The study has used all the collected data.

The Time Series plots for 9 variables (Fig. 40.1) suggest that the most independent variables are CVP, DPAP and MAP. The Correlation analysis also confirms this.

4.1 MEWMV Chart for Monitoring the Variability of the Condition of a Patient After Heart Surgery

MEWMV chart has been used to monitor large variance shifts (ω = 0.9) while mean is stabilized for small shift (λ = 0.1). Stabilizing mean is essential while monitoring variance of the process to avoid capturing mean changes. The control limits for the MEWMV charts are obtained using the predictive models developed by Gunaratne et al. [14]. Figure 40.2 shows 22 OOCs for MEWMV chart (out of 600 observations).

4.2 Fault Diagnostics to Identify the Responsible Characteristics Based on ANNIGMA Approach

The OOCs and in-control samples from MEWMV chart have been injected to ANNIGMA and other commonly used machine learning approaches to identify the group of variables responsible for in-controls and OOC signals and estimate the percentage of their individual contribution to the signals. The accuracy of ANNIGMA results have been investigated by deploying univariate Moving Range (UMR) charts.

Table 40.1 shows the ANNIGMA approach is superior to other commonly used approaches in selecting the minimum number of variables with higher accuracy. The accuracy computation starts with full variables set. The lowest ranked variable has been removed by utilizing Backword Elimination (BE) process. BE iteration continues until last variable in the subset. The subset with highest accuracy with minimum number of variables has been selected as the best subset. The highest accuracy (97%) achieved for ANNIGMA by monitoring only 3 characteristics, i.e., 97% of the total variability of the patient condition is due to Mean Arterial Pressure (MAP), Diastolic Pulmonary Artery Pressure (DPAP) and Central Venous Pressure (CVP).

Table 40.1 Average accuracies for large variance shifts (ω = 0.9) while mean is stabilized for small shifts (λ = 0.1)

ANNIGMA also ranked the percentage contribution of the significant variables as shown in Table 40.2. The highest contributing variable for the OOC signals is MAP (37.96%) followed by DPAP (31.99%) and CVP (30.05%).

Table 40.2 Percentage contribution of variables selected by ANNIGMA to fault signals

5 Fault Diagnosis Using Univariate Moving Range (UMR) Charts

Traditional statistical approach for identifying variables responsible for the variance shift in multivariate chart is to deploy UMR charts for each individual variable and select the variables that produce OOC signals similar to MEWMV chart. However, this approach disregards the correlation among variables and may misdiagnosis the cause of the OOC signals (samples that are in control in multivariate chart may be OOC in UMR charts or vice versa).

UMR charts for SBP, DBP (Fig. 40.3a, b) and HR do not produce any OOC observations that matches OCCs produced by MEWMV chart. ANNIGMA removes SBP, DBP and HR variables as insignificant variables to process faults at early stage. The following UMR charts (Fig. 40.3c, d) show OOC signals for two of the responsible variables DPAP and CVP which have been identified by ANNIGMA. The OOC signals almost match those produced by the MEWMV chart.

Fig. 40.3
figure 3

UMR charts. (a) UMR charts for SBP (b) UMR charts for DBP (c) UMR charts show OOC signals for DPAP (d) UMR charts show OOC signals for CVP

Accuracy has been calculated for the variables selected by ANNIGMA (MAP, DPAP and CVP) using UMR charts. Table 40.3 shows the calculated accuracies are approximately close to 97% as indicated by ANNIGMA. Percentage contribution (around 33%) also in line with the ANNIGMA results. Therefore, UMR charts results further confirm the precision of results given by ANNIGMA.

Table 40.3 Accuracy and percentage contribution based on UMR charts

6 Conclusion

Fault diagnostic in multivariate control chart can be performed either by deploying univariate control chart for each individual characteristics or deploying machine learning approaches such as ANNs, SVMs and ANNIGMA. However, univariate charts not only ignore the correlation among the variables but also are time consuming as the number of variables to be monitored increases. This paper for the first time deployed ANNIGMA to identify and rank the characteristics responsible for the multivariate variance shifts when monitoring ICU patients. Average classification accuracy has been used to select the most significant subset of variables. The comparison results show that ANNIGMA approach select the responsible variables more effectively than other commonly used machine learning approaches. Information on the extent of the responsible variables’ abnormal rates and rhythms to the patient condition can effectively aid the medical practitioners in their early diagnostics intervention tasks.