Introduction

Dementia with Lewy bodies (DLB) is a degenerative dementia that is pathologically characterized by the development of abnormal α-synuclein cytoplasmic inclusions, called Lewy bodies, throughout the brain in cortical, subcortical and brainstem structures. As the number of patients with dementia has increased, so the number of patients with DLB, the second most frequent cause of degenerative dementia in elderly adults, has also increased. DLB is estimated to account for about 20% of cases of dementia [1, 2]. In 1996, a consortium of scientists initially proposed guidelines for the clinical and pathological diagnosis of dementia with Lewy bodies [3]; these were later revised [4], and the guidelines have been gradually adopted in clinical practice. Central features of DLB include progressive cognitive decline and include recurrent visual hallucinations, fluctuating cognition and parkinsonism. Functional brain images support its diagnosis by demonstrating occipital metabolic or perfusion reduction in addition to the parietotemporal and posterior cingulate metabolic and perfusion reduction which is observed in Alzheimer’s disease (AD). From a prognostic and therapeutic standpoint, it is important to discriminate DLB patients from AD patients because the former have a favourable response to cholinesterase inhibitors and symptomatic treatments, and the treatment for the two diseases is not the same owing to their differing pathogenesis.

Recently, we developed a fully automatic diagnosis system for early AD [5] using the NEUROSTAT program [6]. As the next step, in this study we developed this into a fully automatic diagnosis system for differentiation between DLB and AD. We evaluated the differential diagnostic value compared with conventional inspection of FDG-PET images in mild DLB and mild AD patients.

Materials and methods

Subjects and PET procedure

The concept of this method is based on our previous method [5]. In order to establish an appropriate prototype ROI template for differentiating DLB from AD patients, first ten patients with mild DLB and 20 patients with mild AD were recruited. The AD group consisted of the subjects used for the previous fully automatic system for AD diagnosis [5]. The mean age ± standard deviation (SD) of the mild DLB group and the AD group was 71.2 ± 4.3 and 70.9 ± 2.7 years, respectively (Table 1, “first group”). Two further groups consisting of 16 mild DLB patients and 25 mild AD patients were recruited for diagnostic evaluation between DLB and AD using this method. The mean age ± SD of the mild DLB group and the mild AD group were 74.3 ± 4.9 and 71.7 ± 2.1, respectively (Table 1, “second group”). The mean scores of the MMSE for DLB and AD in the first group and for DLB and AD in the second group were 23.0 ± 3.2, 22.8 ± 1.5, 21.7 ± 1.9 and 23.1 ± 2.1, respectively (Table 1). Both groups of DLB patients fulfilled the criteria of the First International Workshop of the Consortium on Dementia with Lewy Bodies [3]. Both groups of AD patients fulfilled the NINCDS/ADRDA criteria for probable AD [7]. All patients were examined by both neuropsychologists and psychiatrists. Electroencephalography, laboratory tests and neuropsychological tests, MRI and MR angiography of the neck and head were also performed.

Table 1 Patient characteristics: number, age and MMSE score

The MR imaging and PET scanning methods are described in detail elsewhere [8]. In all patients, MR examinations for diagnosis were performed before PET scanning. All subjects fasted for 4 h before PET. After a transmission scan, a 12-min emission scan was started 60 min after intravenous injection of 185–296 MBq of FDG. PET images were obtained using a Headtome IV scanner (Shimadzu Corp., Kyoto, Japan). Data were collected in 128 × 128 matrices. The slice interval was 6.5 mm when the z-motion mode was used. Written informed consent was obtained from all subjects before the examination.

Automatic diagnosis system procedure

  1. 1.

    Creation of a prototype ROI map

    First, stereotactic anatomical standardization was performed. The original FDG-PET image data were transformed into a binary format. They were transformed into a standard Talairach space using the NEUROSTAT program [6], which contains all three-dimensional stereotactic surface projection (3D-SSP) programs (Dr. Minoshima, Department of Radiology and Bioengineering, University of Washington, Seattle, USA). Differences in size between the individual brain and the standard template were removed by linear scaling. Regional anatomical differences between the individual and the standard template were minimized by automated non-linear warping. Next, peak cortical activity in the brain was subjected to a three-dimensional search with a predefined vector for each stereotactic surface pixel after anatomical standardization. The peak value was projected back and assigned to the originating surface pixel. This procedure was continued on a pixel-by-pixel basis covering the whole cortex of the brain. Pixel values of an individual’s image set were normalized to the whole brain, the thalamus, the pons, the cerebellum and the sensorimotor value with the 3D-SSP program in NEUROSTAT [6]. The sensorimotor cortices were defined as reference regions for normalization [9, 10]. Pixels with significantly decreased metabolism in the DLB group (p < 0.05), which were obtained from a comparison between the AD group and the DLB group (the first group), were plotted as a prototype DLB ROI map (Fig. 1). The AD group had no regions with significantly lower metabolism compared with the DLB group (the first group).

  2. 2.

    Acquisition of threshold Z scores in the prototype ROI maps for diagnosis.

    For each individual surface projection image, a Z score was calculated for each pixel—Z score=([normal mean]−[individual value])/(normal SD)—using a normal database produced in the previous study [5], and the scores were shown as a Z score map. The averaged Z scores in each area of the prototype DLB ROI map were calculated. Two threshold values were tested, mean+2SD and mean+3SD, these values being derived from the AD patients in the first group. The mean and SD and threshold values in each region of the first group are shown in Table 2.

  3. 3.

    Clinical adoption of the system.

    In the same way, the Z score images of the second group were obtained and the averaged Z scores were also calculated with the ROI map. We diagnosed as DLB any subject with more than one area in the bilateral medial and lateral occipital lobes where averaged Z scores exceeded the thresholds for DLB in this system.

Fig. 1
figure 1

Prototype ROI map for mild DLB, which demonstrates significantly lower metabolism in DLB than in AD. The automatic diagnosis system uses this ROI map

Table 2 The averaged Z scores in each area of the prototype DLB ROI map calculated in the first group, and obtained thresholds

Comparison of the diagnostic performance of conventional FDG-PET imaging and the fully automatic method

We compared the diagnostic performance of visual inspection with that of the automatic method. Each image of the second group was displayed on a colour monitor. Five observers, three experienced and two beginners, reviewed the conventional FDG-PET images. All of them knew the criteria for abnormality but were blinded to the patients’ clinical data. The conventional FDG-PET images were classified as follows: (1) definite DLB, (2) probable DLB, (3) indeterminate, (4) probable AD or (5) definite AD. In the automatic method, each patient’s average for each region was obtained in reference to the prototype DLB map, and the diagnosis of DLB was made if the averaged Z score exceeded the determined value.

We performed a receiver operating characteristic (ROC) analysis to compare the automatic system with visual reading. For ROC analysis of the fully automatic diagnosis system, the averaged Z scores in the prototype ROI map of each subject in the second group were used. The areas under the curves were calculated using ROCKIT software 0.9B (Dr. Metz, Department of Radiology, The University of Chicago).

Results

The averaged Z scores in each area of the ROI map obtained in the second group are shown in Table 3. All the averaged Z scores in each region for DLB were significantly larger than those for AD. The fully automatic diagnosis system demonstrated an accuracy of 73%, a sensitivity of 75% and a specificity of 72% for the mild DLB group when the threshold was set at the mean+2SD. When using the threshold of the mean+3SD, an accuracy of 61%, a sensitivity of 38% and a specificity of 76% were demonstrated.

Table 3 The averaged Z scores in each area of the prototype DLB ROI map calculated in the second group

Figure 2 shows each observer’s area under the ROC curve (Az) for visual inspection of conventional FDG-PET images (Az = 0.83–0.58) and for the fully automatic diagnosis system (Az = 0.77), using the threshold of mean+2SD. There was no significant difference in the area under the ROC curve between the automatic diagnosis system and the visual inspections. The diagnostic value of the automatic system was compared with that of the observers (Table 4).

Fig. 2
figure 2

ROC curves for DLB patients versus AD patients obtained with visual inspection by experts or beginners or by use of the automatic diagnosis system. The area under the ROC curve of beginner B is significantly smaller than that of other observers

Table 4 Area under ROC curves for the automatic diagnosis system and individual observers

Representative cases

Figure 3 shows the conventional FDG-PET images (a) and Z score map (b) of a 74-year-old male patient with DLB. His MMSE score was 26. All the observers could easily diagnose this case as DLB by visual inspection of the conventional FDG-PET images because of a typical metabolic reduction pattern. The automatic diagnosis system also diagnosed it as DLB. Figure 4 shows the conventional FDG-PET images (a) and Z score map (b) of a 71-year-old female with AD. All the observers and the automatic system could diagnose this case as AD. Figure 5 shows the findings in a 74-year-old male DLB patient with an MMSE score of 21; two experts and one beginner diagnosed this case as DLB, while the other expert considered it equivocal and the other beginner diagnosed it as AD. The automatic diagnosis system could not diagnose this case as DLB.

Fig. 3
figure 3

Conventional FDG PET images (a) and Z score images (b) of a 74-year-old patient with DLB (MMSE 26). It is easy to detect decreased glucose metabolism in the bilateral occipital lobe by visual inspection and to diagnose the case as DLB

Fig. 4
figure 4

Conventional FDG PET images (a) and Z score images (b) of a 71-year-old patient with AD (MMSE 22). Decreased glucose metabolism in the bilateral frontal and parietotemporal association cortices can be seen. All the readers and the automatic diagnosis system could diagnose this case as AD

Fig. 5
figure 5

Conventional FDG PET images (a) and Z score images (b) of a 74-year-old patient with DLB (MMSE 21). Heterogeneously decreased glucose metabolism can be seen. The Z score image failed to demonstrate significant reduction

Discussion

The First International Workshop of the Consortium on Dementia with Lewy Bodies drew up guidelines for the clinical and pathologic diagnosis of DLB in 1996 [3]. Since that publication, a number of reports in the literature have concluded that the criteria have low sensitivity and high specificity. Several factors, such as poor reliability of fluctuating cognition and the lack of parkinsonism and visual hallucinations, probably contribute to the low sensitivity [3, 1115]. A sensitivity of 75% and a specificity of 79% were documented in a preliminary validation study of the criteria [11], and a new validation study using the revised criteria [4] is needed. The overlap between the clinical symptoms and pathology of DLB with those of AD can lead to misdiagnosis. The low sensitivity of the aforementioned criteria indicates that DLB is likely to be misdiagnosed particularly at the early stage. It is very important to diagnose early DLB accurately and to distinguish it from other dementia diseases because the treatment and prognosis differ. We have previously reported an automatic system using 3D-SSP for the diagnosis of early AD [5]. In the present study, we assessed the usefulness of this system, which is independent of operator skill, in distinguishing mild DLB from mild AD.

Diagnostic imaging methods, such as MRI or nuclear medicine methods, are helpful in diagnosing DLB accurately. The revised consortium proposed that imaging investigation can also be helpful in supporting a diagnosis [4]. In DLB, the volume of the hippocampal and the medial temporal lobe is preserved and atrophy of the putamen is demonstrated by MRI volumetry. However, it is not easy to demonstrate the morphological difference between mild DLB and mild AD. Occipital hypoperfusion on single-photon emission computed tomography (SPECT) and hypometabolism on PET have been shown without occipital lobe atrophy on MRI [1619]. The metabolism and perfusion of the medial and lateral occipital lobe are characteristically affected in DLB, and these are the features that discriminate it from AD [20]. Dopaminergic and cholinergic impairment is considered to be the cause of the hypometabolism and hypoperfusion in the occipital lobe [3, 16, 21, 22]. In this study we tried to diagnose mild DLB with occipital hypometabolism using FDG-PET and 3D-SSP automatically. We used the NEUROSTAT program for 3D-SSP, which is able to transform atrophied brain to standard brain more correctly than the statistical parametric mapping (SPM) method [23] and to compensate for inappropriate anatomical standardization [6]. An important step in this statistical method was the selection of the reference area for the normalization: we chose the sensorimotor area for this system because of the relative preservation of metabolism in the sensorimotor cortex in DLB patients [10]. The fully automatic diagnosis system for differentiation of mild DLB from mild AD was able to perform at a similar diagnostic level to visual inspection of conventional axial FDG-PET images by expert readers.

As shown in our study, although visual inspection by the readers provided good diagnostic accuracy, the area under the ROC curve was not as large as we had expected (mean Az = 0.7145). It can be difficult to make a diagnosis of mild DLB because occipital glucose metabolism is not always decreased more severely in mild DLB than in mild AD. The difference in the metabolic reduction between mild DLB and AD was smaller than we had anticipated, and this may be the reason why even experts had difficulty in diagnosing some of the mild DLB cases in this study. The area under the ROC curve was accordingly small, and the same results were obtained using our automatic diagnosis system. It is also to be borne in mind that while DLB accounts for about 20% of dementia cases, in the sample in this study the percentage was 40%. Since DLB patients also show the patterns of AD on FDG-PET images, this high prevalence of DLB may have led to underestimation of the diagnostic performance of the readers and of the automatic method as well. While our diagnostic system performed similarly to the experts in diagnosing mild DLB, the ROC curve of the automatic system was shifted slightly to the right; this indicates that the diagnostic performance in respect of AD patients varied more widely than that in respect of DLB patients (i.e. there were fewer misdiagnoses in DLB patients), and in clinical use this system might have the advantage of having a high sensitivity for identification of early DLB.

Neuroimaging for assessment of nitrostriatal dopaminergic function is also useful [24]. In DLB there is a 40–70% loss of striatal dopamine, and the loss of dopaminergic cells is accompanied by dopamine transporter loss [25]. Ligand imaging for the dopamine transporter provides a marker for presynaptic neuronal degeneration. When using 123I-radiolabelled 2β-carbomethoxy-3β-(4-iodophenyl)-N-(3-fluoropropyl) nortropane (123I-FP-CIT) SPECT, areas under the ROC curves were greater than 0.92, demonstrating good power in discriminating between DLB and AD [26]. Demonstration of reduction in the uptake of 18F-fluorodopa in the caudate and putamen using PET distinguishes between DLB and AD with high sensitivity and specificity [27]. On the other hand, FDG-PET is an accepted technique for the detection of AD, especially when the diagnosis is uncertain. Consequently we would suggest that the proposed automatic diagnosis system and FDG-PET could first be employed to investigate the possible presence of DLB in patients suspected of having AD, with the use of FP-CIT scan for further investigation when necessary. In future we are going to apply this method to SPECT perfusion studies which are more widely used than FDG-PET. Now that we have developed a fully automatic diagnosis system for AD and DLB, we also wish to expand this system to other degenerative dementias.

Conclusion

We have developed a fully automatic diagnosis system that can be employed to distinguish mild DLB from mild AD using 3D-SSP. Its diagnostic value is considered to be comparable to expert reading of conventional FDG-PET images. This method could be applied to detect cases of mild DLB in routine clinical examinations.