Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

Computer support systems supporting medical diagnosis have become increasingly more popular worldwide (cf. [14]). Therefore, the tool called Copernicus [11] for classification of patients with mental disorders screened with the MMPI test has been developed. The Minnesota Multiphasic Personality Inventory (MMPI) test (cf. [6, 15]) delivering psychometric data on patients with selected mental disorders is one of the most frequently used personality tests in clinical mental health as well as psychopathology (mental and behavioral disorders). In years 1998–1999, a team of researchers, consisting of W. Duch, T. Kucharski, J. Gomuła, R. Adamczak, created two independent rule systems, devised for the nosological diagnosis of persons, that may be screened with the MMPI-WISKAD test [7]. The MMPI-WISKAD personality inventory is a Polish adaptation of the American inventory (see [3, 16]). The Copernicus system developed by us is the continuation and expansion of that research.

In the Copernicus system, different quantitative groups of methods supporting differential inter-profile diagnosis have been selected and implemented. However, rule-based classification of patients is one of the most important parts of the tool. Knowledge representation in the form of rules is the closest method to human activity and reasoning, among others, in making a medical diagnosis. In the most generic format, medical diagnosis rules are conditional statements of the form:

$$\begin{aligned} \text{ IF } conditions (symptoms), \text{ THEN } decision (diagnosis). \end{aligned}$$

The rule expresses the relationship between symptoms determined on the basis of examination and diagnosis which should be taken for these symptoms before the treatment. In our case, symptoms are determined on the basis of results of a patient’s examination using the MMPI test. In the Copernicus system, the number of rule sets generated by different data mining and machine learning algorithms (for example: the RSES system [1], the WEKA system [17]) is included. However, the rule base can be extended to new rule sets delivered by the user. In the case of multiple sources of rules, we obtain a combination of classifiers. In classifier combining, predictions of classifiers should be aggregated into a single prediction in order to improve the classification quality. The Copernicus system delivers a number of aggregation functions described in the remaining part of this chapter.

The Copernicus system supports the idea that visualization plays an important role in professional decision support. Some pictures often represent data better than expressions or numbers. Visualization is very important in dedicated and specialized software tools used in different (e.g., medical) communities. In the Copernicus system, a special attention has been paid to the visualization of analysis of MMPI data for making a diagnosis decision easier. A unique visualization of classification rules in the form of stripes put on profiles as well as visualization of results of aggregated classification have been designed and implemented.

2 MMPI Data

In the case of the MMPI test, each case (patient) \(x\) is described by a data vector \(a(x)\) consisting of thirteen descriptive attributes: \(a(x)=[a_{1}(x), a_{2}(x), \ldots , a_{13}(x)]\). If we have training data, then to each case \(x\) we additionally add one decision attribute \(d\)—a class to which a patient is classified.

Table 1 An input data for Copernicus (fragment)

The validity part of the profile consists of three scales: \(L\) (laying), \(F\) (atypical and deviational answers), \(K\) (self defensive mechanisms). The clinical part of the profile consists of ten scales: 1.\(Hp\) (Hypochondriasis), 2.\(D\) (Depression), 3.\(Hy\) (Hysteria), 4.\(Ps\) (Psychopathic Deviate), 5.\(Mf\) (Masculinity/Femininity), 6.\(Pa\) (Paranoia), 7.\(Pt\) (Psychasthenia), 8.\(Sc\) (Schizophrenia), 9.\(Ma\) (Hypomania), 0.\(It\) (Social introversion). The clinical scales have numbers attributed so that a profile can be encoded to avoid negative connotations connected with the names of scales. Values of attributes are expressed by the so-called T-scores. The T-scores scale, which is traditionally attributed to MMPI, represents the following parameters: offset ranging from 0 to 100 T-scores, average equal to 50 T-scores, standard deviation equal to 10 T-scores.

For our research, we have obtained input data which has nineteen nosological classes and the reference class (norm) assigned to patients by specialists. Each class corresponds to one of psychiatric nosological types: neurosis (neur), psychopathy (psych), organic (org), schizophrenia (schiz), delusion syndrome (del.s), reactive psychosis (re.psy), paranoia (paran), sub-manic state (man.st), criminality (crim), alcoholism (alcoh), drug addiction (drug), simulation (simu), dissimulation (dissimu), and six deviational answering styles (dev1, dev2, dev3, dev4, dev5, dev6). The data set examined in the Copernicus system was collected by T. Kucharski and J. Gomuła from the Psychological Outpatient Clinic.

Data vectors can be represented in a graphical form as the so-called MMPI profiles. The profile always has a fixed and invariable order of its constituents (attributes, scales). Let a patient \(x\) be described by the data vector:

$$\begin{aligned} a(x)=[56, 78, 55, 60, 59, 54, 67, 52, 77, 56, 60, 68, 63]. \end{aligned}$$

Its profile is shown in Fig. 1.

Fig. 1
figure 1

MMPI profile of a patient (example); suppressors +0.5K, +0.4K, +1K, +0.2K—a correction value from raw results of scale K added to raw results of selected clinical scales

A basic profile can be extended by additional indexes or systems of indexes (cf. [13]). Different combinations of scales constitute diagnostically important indexes (e.g., Gough’s, Goldberg’s, Watson-Thomas’s, L’Abate’s, Lovell’s indexes—see [6]) and systems of indexes (e.g., Diamond’s [5], Leary’s, Eichmann’s, Petersen’s, Taulbee-Sisson’s, Butcher’s [2], Pancheri’s). They have been determined on the basis of clinical and statistical analysis of many patients’ profiles. All mentioned and some additional (e.g., the nosological difference-configuration Gough-Płużek’s system) indexes have been implemented in the Copernicus system. It enables the user to extend the basic profile even to 100 attributes (13 scales plus 87 indexes).

3 Rule-Based Classification in Copernicus

In this section, we describe step-by-step functionality of the Copernicus system concerning rule-based classification of patients screened with the MMPI test.

3.1 General Selection of Rule Sets

For classification purposes, the user can select a number of rule sets included in the tool (see Fig. 2). Such rule sets have been generated by different data mining and machine learning algorithms (for example: the RSES system [1], the WEKA system [17]). Moreover, the user can select its own rule set. The problem of selecting suitable sets of rules for classification of MMPI profiles has been considered in our previous chapters (see [810, 12, 13]).

Fig. 2
figure 2

The rule selection window in the Copernicus system

 

3.2 Specific Selection of Rules

After general selection of rule sets, the user can determine more precisely which rules will be used in the classification process.

Each rule \(R\) in the Copernicus system has the form:

$$\begin{aligned} \text{ IF } a_{i1}(x) \in [x^{l}_{i1}, x^{r}_{i1}] \text{ AND } \ldots \text{ AND } a_{ik}(x) \in [x^{l}_{ik}, x^{r}_{ik}], \text{ THEN } d(x) = d_m, \end{aligned}$$
(1)

where \(a_{i1},\ldots , a_{ik}\) are selected scales (validity and clinical) or specialized indexes, \(x^{l}_{i1}\), \(x^{r}_{i1}\),..., \(x^{l}_{ik}\), \(x^{r}_{ik}\) are the left and right endpoints of intervals, respectively, \(d\) is a diagnosis, \(d_m\) is one of nosological classes proposed for the diagnosis. Each \(a_{ik}(x) \in [x^{l}_{ik}, x^{r}_{ik}]\) is called an elementary condition of \(R\). For each patient \(x\), its profile is said to be matched to a rule \(R\) if and only if \(a_{i1}(x) \in [x^{l}_{i1}, x^{r}_{i1}]\) and \(\ldots \) and \(a_{ik}(x) \in [x^{l}_{ik}, x^{r}_{ik}]\). This fact is denoted by \(x |= R\).

Each rule set included in the Copernicus system has been extracted from a proper set of cases called a training set. Each rule \(R\) in the form of 3.1 can be characterized by the following factors:

  • the accuracy factor \(acc(R)=\frac{n_C}{n_{CD}}\),

  • the total support factor \(supp_t(R)=\frac{n_C}{n}\),

  • the class support factor \(supp_c(R)=\frac{n_C}{n_D}\),

  • the quality factor \(qual(R)= acc(R)supp(R)\),

  • the length factor \(length(R)=card({a_{i1}, ..., a_{ik}})\), i.e., a number of elementary conditions of \(R\),

where \(card\) denotes the cardinality of a given set, \(n\) is the size (a number of cases) of the training set, \(n_C\) is a number of cases in the training set which are matched to \(R\), \(n_{CD}\) is a number of cases in the training set which are matched to \(R\) and which have additionally assigned the class \(d_m\), \(n_D\) is a number of cases in the training set which have assigned the class \(d_m\).

Exemplary classification rules obtained from data consisting of profiles using the WEKA system are shown in Table 2. The accuracy factor has been assigned to each rule.

Table 2 Exemplary rules obtained after transformation of a decision tree generated for all scales excluding scale 5

The user can set, among others, that each rule \(R\) used for classification a given case \(x\) satisfies the following conditions:

  • a number of elementary conditions of \(R\) for which \(x\) does not match \(R\) is equal to \(0\) (exact matching) or more (approximate matching),

  • \(x\) is matched to \(R\) with a certain degree (tolerance), i.e., for each elementary condition \(a_{ik}(x) \in [x^{l}_{ik}-t, x^{r}_{ik}+t]\), where \(t\) is a tolerance value,

  • \(acc(R)\) is greater or equal to a given threshold,

  • \(supp(R)\) is greater or equal to a given threshold,

  • \(qual(R)\) is greater or equal to a given threshold,

  • the interval \([x^{l}_{ik}, x^{r}_{ik}]\) of each elementary condition of \(R\) has the property that \(x^{r}_{ik}-x^{r}_{ik}\) is greater or equal to a given threshold (i.e., a rule with tight conditions can be omitted),

  • \(length(R)\) is included in a given interval (i.e., a rule with too short or too long condition part can be omitted),

  • indicated scales in elementary conditions of \(R\) are omitted—the so called scale excluding (for example diagnosticians’ experience shows that the scale 5.\(Mf\) is weak and it should be omitted).

For elementary conditions in the form of intervals, we sometimes obtain lower and upper bounds that are, for example, \(-\infty \) and \(\infty \), respectively. Such values cannot be rationally interpreted from the clinical point of view. Therefore, ranges of classification rule conditions can be restricted. We can replace \(\infty \) by:

  • a maximal value of a given scale occurring for a given class in our sample,

  • a maximal value of a given scale for all twenty classes,

  • a maximal value of a given scale for a normalizing group (i.e., a group of women, for which norms of validity and clinical scales have been determined),

  • a maximal value for all scales for a normalizing group, i.e., 120 T-scores.

A procedure for restricting ranges of classification rule conditions with the value \(-\infty \) is carried out similarly, but we take into consideration minimal values. A minimal value for all scales of normalizing group of women is 28 T-scores. Specialized indexes are linear combinations of scales. Therefore, restricting ranges of rule conditions for indexes is also possible and simple. Copernicus enables us to select the way of restricting ranges. Rule conditions are automatically restricted to the form readable for the diagnostician-clinician.

3.3 Visualization of Profiles and Rules

The rule \(R\) can be graphically presented as a set of stripes placed in the profile space. Each condition part \(a_{ij}(x) \in [x^{l}_{ij}, x^{r}_{ij}]\), where \(j=1,...,k\), of the rule \(R\) is represented as a vertical stripe on the line corresponding to the scale \(a_{ij}\). This stripe is restricted from both the bottom and the top by values \(x^{l}_{ij}\), \(x^{r}_{ij}\), respectively. Such visualization enables the user to easily determine which rule matches a given profile (cf. Fig. 3).

Fig. 3
figure 3

Visualization of a rule in the profile space

3.4 Classification Results

On the basis of rules a proper diagnostic decision for the case \(x\) can be made. Aggregation factors implemented in the Copernicus system enable selecting only one main decision from decisions provided by rules used for the classification of \(x\). For each class \(d\) for which cases can be classified, a number of different aggregation factors can be calculated. The first aggregation factor is the simplest one. It expresses the relative number of rules denoting the class \(d\) in the set of all rules matched by \(x\) in the selected sense (see Sect. 3.2):

$$\begin{aligned} aggr_1(d)=\frac{card(\{R: x |= R \text{ and } class(R)= d\})}{card(\{R: x |= R\})}. \end{aligned}$$

Another three aggregation factors take also into consideration the maximal value of quality factors of rules from indicating the class \(d\). These factors differ on the weights of components:

$$ \begin{aligned} aggr_{2} (d) = & 0.8\max (\{ qual(R):x| = R{\text{ and }}class(R) = d\} ) \\ & + 0.2\frac{{card(\{ R:x| = R{\text{ and }}class(R) = d\} )}}{{card(\{ R:x| = R\} )}}, \\ \end{aligned} $$
$$ \begin{aligned} aggr_{3} (d) = & 0.5\max (\{ qual(R):x| = R{\text{ and }}class(R) = d\} ) \\ & + 0.5\frac{{card(\{ R:x| = R{\text{ and }}class(R) = d\} )}}{{card(\{ R:x| = R\} )}}, \\ \end{aligned} $$
$$ \begin{aligned} aggr_{4} (d) = & 0.67\,\max (\{ qual(R):x| = R\;{\text{and}}\;class(R) = d\} ) \\ & + 0.33\frac{{card(\{ R:x| = R\;{\text{and}}\;class(R) = d\} )}}{{card(\{ R:x| = R\} )}}. \\ \end{aligned} $$

The last two aggregation factors additionally take into consideration the average length of rules from the \(d\) class. In this case, the smaller the average length is, the better the set of rules. These factors differ on the weights of components:

$$ \begin{aligned} aggr_{5} (d) =\, & 0.6\,\max (\{ qual(R):x| = R\;{\text{and}}\;class(R) = d\} ) \\ & + 0.2{\text{avg}}(\{ 1 - length(R):x| = R\;{\text{and}}\;class(R) = d\} ) \\ & + 0.2\frac{{card(\{ R:x| = R\;{\text{and}}\;class(R) = d\} )}}{{card(\{ R:x| = R\} )}}, \\ \end{aligned} $$
$$ \begin{aligned} aggr_{6} (d) = \,& 0.4\,\max (\{ qual(R):x| = R\;{\text{and}}\;class(R) = d\} ) \\ & + 0.4{\text{avg}}(\{ 1 - length(R):x| = R\;{\text{and}}\;class(R) = d\} ) \\ & + 0.2\frac{{card(\{ R:x| = R\;{\text{and}}\;class(R) = d\} )}}{{card(\{ R:x| = R\} )}}. \\ \end{aligned} $$

In formulas of aggregation factors, \(max\) denotes the maximum value, \(avg\) denotes the arithmetic average value, and \(class(R)\) denotes the class indicated by the rule \(R\). To calculate the quality factor of a rule, we can use either the total support factor or the class support factor.

If a given aggregation factor \(aggr(d)\) is calculated for each class \(d\) to which cases can be classified, then weighted maximum value is determined:

$$\begin{aligned} \max (\{w_1aggr(d_1), w_2aggr(d_2), \ldots , w_maggr(d_m)\}), \end{aligned}$$

where \(m\) is a number of all possible classes and weights (\(w_1\), \(w_2, \ldots , w_m\)) can be set by the user between 0 and 1, for each class separately (see Fig. 4).

Fig. 4
figure 4

Selection of aggregation factors in the Copernicus system

The main differential diagnosis for a given case \(x\) is set as \(d_m\) if \(w_maggr(d_m)\) is the maximum value and \(w_maggr(d_m)>0.67\). The supplementary differential diagnosis for a given case \(x\) is set as \(d_s\) if \(w_saggr(d_s)\) is the next maximum value and \(w_saggr(d_s)>0.33\).

Classification results for each case are visualized in the form of the so-called classification star (see Fig. 5) or in the form of the so-called classification column chart (see Fig. 6).

Fig. 5
figure 5

An exemplary classification star in the Copernicus system

Fig. 6
figure 6

An exemplary classification column chart in the Copernicus system

Table 3 A quality of classification of cases for described aggregation factors

Aggregation factors have been validated by experiments carried out on a data set with over 1,000 MMPI profiles of women. The quality of classification of cases for described aggregation factors is shown in Table 3. This quality is calculated as a ratio of a number of cases for which a class assigned by a diagnostician is the same as a class indicated by the classification system to a number of all cases.

4 Conclusions

In this chapter, we have described the Copernicus system—for computer-aided diagnosis of mental disorders based on personality inventories. The main attention has been focused on rule-based classification. This part of the tool has been presented more precisely. Our main goal of research is to deliver to diagnosticians and clinicians an integrated tool supporting the comprehensive diagnosis of patients. The Copernicus system is flexible and it can also be diversified into supporting differential diagnosis of profiles of patients examined by means of other professional multidimensional personality inventories.