Introduction

Single-photon emission computed tomography (SPECT) imaging with the 123I-labelled raclopride analogue iodobenzamide (IBZM) allows assessment of the availability of postsynaptic dopamine D2 receptors [13]. Common clinical indications are differential diagnosis of parkinsonian syndromes [47], assessment of D2 blockade by neuroleptics [811], Huntington’s disease [12] and Wilson’s disease [13]. In addition, [123I]IBZM has been used successfully for the assessment of endogenous dopamine release [14, 15] and for the evaluation of D2 occupancy by endogenous dopamine [16]. [123I]IBZM SPECT has also been shown to have the potential to become a biological marker for the risk of relapse in alcoholic patients during recovery [17]. [123I]IBZM is known to have suboptimal kinetic properties that are less favourable than those of the PET tracer [11C]raclopride, for example. However, [123I]IBZM is commercially available in Europe and widely used.

Conventional evaluation of [123I]IBZM brain SPECT scans in clinical routine often relies on manual reorientation, visual reading of tomographic slices and semi-quantitative analysis of striatal IBZM uptake using manually defined volumes of interest (VOIs). These steps are time consuming, subjective and prone to error [18]. In addition, a large variety of methods of semi-quantitative analysis are in use, not all of which provide sufficient statistical power [19]. Thus, automatic anatomical standardisation and VOI analysis using standardised predefined VOIs, as are increasingly used for the evaluation of [18F]fluorodeoxyglucose (FDG) PET brain scans, might be useful for the evaluation of [123I]IBZM brain SPECT scans [20].

Radau et al. described a three-dimensional automated technique for the evaluation of [123I]IBZM SPECT scans [21, 22]. The method focusses on the semi-quantification and voxel-based statistical analysis of striatal IBZM uptake. Visual analysis plays a minor role. However, visual analysis of tomographic slices is essential, not only to check results of the semi-quantitative analysis for plausibility (even in fully automated techniques, errors cannot be ruled out), but also to assess regional distribution of alterations in IBZM uptake.

The present paper proposes a new software tool for fully automated processing, evaluation and documentation of [123I]IBZM SPECT scans with special emphasis on adequate support of visual evaluation. The tool is routinely used to support diagnostic decisions in the departments of nuclear medicine at the University Medical Center, Hamburg and the University School of Medicine in Hannover. Currently it is being implemented at a private nuclear medicine practice in Hamburg.

This study complied with the current German laws, inclusive of ethics approval.

Materials and methods

SPECT imaging procedures used in Hamburg

Prior to imaging, the thyroid gland was blocked by oral administration of perchlorate (600 mg). Then 160–220 MBq [123I]IBZM (GE Healthcare, Braunschweig, Germany) was injected intravenously as a slow bolus (30 s). SPECT imaging was performed about 120 min post injection [23, 24] using a four-head SPECT camera dedicated to brain imaging (Nucline X-Ring/4R, Mediso, Hungary) [25]. The heads of the camera were equipped with low-energy, high-resolution collimators. Patients were positioned on the camera table as comfortably as possible. The head was slightly fixed in a head holder by an elastic gauze bandage around the forehead. A dynamic scan with three consecutive frames (640 s each) was acquired. For each frame, 128 projections, i.e. 32 projections per head, each of 20 s duration, were recorded on a 64×64 matrix with a voxel size of 4.1×4.1×4.1 mm3. Total acquisition time was 32 min. Total number of counts was about 500 MBq−1 min−1. Transaxial images were reconstructed iteratively using the ordered subsets-expectation maximisation (OSEM) algorithm of the camera software (16 subsets, 2 iterations). No pre-filtering of the projection data was performed. Reconstructed images were post-filtered using a 3D Butterworth filter (order 5, cut-off 40% of the Nyquist frequency). Spatial resolution of the final images was about 19 mm full-width at half-maximum (FWHM). Attenuation correction was performed using Chang’s method with an attenuation coefficient of 0.12 cm−1. No scatter correction was applied. For further processing the format of the reconstructed images was converted from the scanner-specific DIAG format to ANALYZE 7.5 format (Biomedical Imaging Resource, Mayo Foundation) using in-house-developed software.

The three frames were realigned using the Realign tool of the Statistical Parametric Mapping software package (SPM99, Welcome Department of Cognitive Neurology, Institute of Neurology, University College London, London, UK) [26]. The final uptake image was obtained by summation of the realigned frames.

IBZM template

Twenty consecutive patients from clinical routine in whom [123I]IBZM SPECT had been performed for differential diagnosis of parkinsonian syndromes were recruited retrospectively for the creation of an IBZM template. No specific inclusion or exclusion criteria were applied.In a first step, each individual IBZM uptake image was stereotactically normalised to the perfusion SPECT template of SPM99 using the Normalise tool of SPM99. In addition to affine transformations, a restricted set of non-linear deformations (warping) was admitted (parameter values for warping: 4×4×4 non-linear basis functions, 12 non-linear iterations, medium non-linear regularisation, default brain mask, don’t mask object). Transformed images were written with a voxel size of 2×2×2 mm3 using bilinear interpolation. All images were scaled to the same global scale using proportional scaling of SPM99. A preliminary IBZM template was obtained by averaging all images for which normalisation had been successful (n=12) (visual assessment using the Check Register Tool of SPM99).Stereotactic normalisation of all individual IBZM uptake images was repeated, but now using the preliminary IBZM template. An improved preliminary IBZM template was obtained by averaging all images for which this second normalisation had been successful (n=17). This procedure was repeated iteratively until stereotactic normalisation was successful in all subjects.The resulting template was mirrored at the median sagittal plane. Then the original template and the mirrored template were averaged to end up with the final, symmetrical IBZM template (Fig. 1) [27].

Fig. 1
figure 1

Transverse slices of the IBZM template (top) and the reference region (bottom)

Stereotactic normalisation

The uptake image of each individual patient was stereotactically normalised to the IBZM template using the Normalise tool of SPM99 (4×4×4 non-linear basis functions, 12 non-linear iterations, medium non-linear regularisation, default brain mask, don’t mask object). Transformed images were written with a voxel size of 2×2×2 mm3 using bilinear interpolation.

Scaling and striatal VOIs

A reference region was obtained from the SPM99 brainmask by generously cutting the striatum in both hemispheres (Fig. 1), i.e. the reference region encompassed all voxels of the brainmask with the exclusion of those of the striatal cavities. The striatal cavities (VOIs) consisted in 13,492 voxels in total, i.e. 6,746 voxels corresponding to about 54 ml on each side.Scaled uptake images were obtained by dividing each voxel value by the 75th percentile of the voxel values within the reference region.

Semi-quantification

Semi-quantification was performed by hottest voxel analysis [28] of the striata. More precisely, the striatum-to-reference ratio (R) was estimated separately for each striatum by averaging the scaled uptake values over the 1,875 voxels (15 ml) with the highest scaled uptake within the corresponding striatal VOI.

Normal and pathological range of striatum-to-reference ratio

Since in Hamburg no IBZM SPECT studies in healthy volunteers were available, the normal and pathological range of the striatum-to-reference ratios R were obtained in patients who presumably had a normal R. More precisely, R was obtained in 90 striata of 45 patients (17 females, 28 males, age 61.2±11.1 years) in whom within 2 weeks both 123I-IBZM SPECT and SPECT with the dopamine transporter ligand 123I-FP-CIT (DaTSCAN, GE Healthcare, Braunschweig, Germany) [29] had been performed for diagnosis or exclusion of a parkinsonian syndrome at the University Medical Center, Hamburg. 123I-FP-CIT SPECT images had been evaluated visually (because of the excellent kinetic properties of 123I-FP-CIT, visual evaluation can be considered adequate for the evaluation of 123I-FP-CIT SPECT images).Assuming that normal FP-CIT uptake in the majority of cases implies normal IBZM uptake, an upper threshold R P for “pathological” R and a lower threshold R N for “normal” R were derived from the distribution of R values in those striata in which FP-CIT uptake had been classified as normal visually (n=27). R P was chosen to be the 10th percentile of this distribution, and R N was chosen to be the lower bound of the 95% confidence limit. This procedure resulted in R P=1.40 and R N=1.46, i.e. R<1.40 was assumed to be pathological and R > 1.46 was assumed to be normal. The range 1.40≤R≤1.46 was considered as a “grey area”, i.e. inconclusive.

Display (hard copy)

Six transverse slices through the striatum were averaged to one transverse slice of 12 mm thickness. This slice was printed on a black background using a modified continuous spectrum colour table (Fig. 2). A fixed range of scaled uptake values (0.0–1.9) was mapped to the colour table. Uptake values >1.9 were displayed in white (corresponding to uptake =1.9). The upper threshold (1.9) of the domain of the colour table was determined empirically from the mean frequency histogram of scaled uptake values in the visually normal striata that had been used for definition of the normal range of R (Fig. 3). In order to support visual interpretation, typical examples of the same slice in the following findings were displayed on the same page: normal uptake, mildly reduced uptake, severely reduced uptake, and increased uptake.

Fig. 2
figure 2

Documentation page with 12-mm thick slice through the striata. Patient with MSA, confirmed by clinical follow-up (female, 62 years, duration of disease 3 years, Hoehn & Yahr score 3)

Fig. 3
figure 3

Histogram of mean frequency of scaled voxel values in visually normal striata in Hamburg patients (cf subsection “Normal and pathological range of striatum-to-reference ratio” in “Materials and methods”)

In addition, a full set of transverse slices at a distance of 4 mm were printed on a separate page using the same colour table.

IBZM tool

Processing (motion correction, stereotactic normalisation, scaling, VOI analysis) and display as described above were integrated into an easy-to-use SPM toolbox (“IBZM tool”).

Hamburg subjects

Sixty-three patients (26 females, 37 males, age 63±11 years) in whom [123I]IBZM SPECT had been performed in Hamburg for differential diagnosis of parkinsonian syndromes were evaluated retrospectively using the IBZM tool.

Visual evaluation versus semi-quantitative evaluation in Hamburg subjects

The images created by the IBZM tool in the 63 Hamburg patients were evaluated by a nuclear medicine technician unaware of the diagnosis and without experience in the evaluation of IBZM SPECT. Using only the documentation page with the thick slice through the striatum (Fig. 2), the technician classified striatal uptake according to the typical examples, i.e. as normal, mildly reduced, severely reduced or increased. The two striata were classified separately. Visual evaluation did not make use of any VOI analysis. Visual evaluation was compared with semi-quantitative evaluation using Spearman correlation analysis (increased=−1, normal=0, mildly reduced=1, severely reduced=2; two-sided test).

Transfer of the IBZM tool to Hannover for external evaluation

Determination of the parameters of the IBZM tool (thresholds for pathological and normal R, colour table) as described above was somewhat empirical. Therefore, the IBZM tool was transferred to the Department of Nuclear Medicine of the University School of Medicine Hannover for external confirmation/evaluation. There are many years of experience with IBZM SPECT in Hannover [3032].

SPECT imaging procedures used in Hannover

At the University Medical School, Hannover, IBZM SPECT imaging had been performed according to standard protocols described previously [30, 32]. In short, for data acquisition a Multispect 3 (Siemens, Erlangen, Germany) camera equipped with medium-energy collimators was used. One hundred and eighty projections (60 per head) were acquired using a 1.23 acquisition zoom and a matrix size of 128×128. The resulting voxel size was 2.9×2.9×2.9 mm3. Transaxial tomograms were reconstructed using the standard ICON software (Siemens, Erlangen, Germany). Filtered backprojection was performed with a Butterworth filter (cut-off 0.35, order 10). Attenuation correction was done according to Chang assuming a homogeneous attenuation coefficient of μ=0.12 cm−1 within an ellipse drawn around the brain. The relatively low μ value was chosen for implicit scatter correction. Images were then converted to ANALYZE format using the MPI tool (Advanced Tomo Vision ATV, Kerpen, Germany). Reconstructed spatial resolution was about 15 mm.

Hannover subjects

Twenty-seven subjects who had been studied in Hannover previously were included retrospectively: (a) eight patients with probable multi-system atrophy (MSA, six females, two males, age 62±5 years, duration of disease at time of SPECT 2.6±1.6 years, Hoehn-Yahr score 3.3±0.7), (b) 12 patients with advanced (probable) idiopathic Parkinson’s disease (PD, five females, seven males, age 58±8 years, duration of disease at time of SPECT 12.5±5.5 years, Hoehn-Yahr score on/off 2.2±0.9/3.8±0.7) and (c) seven control subjects without neurological disease (NC, six females, one male, age 43±17 years). The clinical differential diagnosis of Parkinson syndromes had been fixed by neurologists, specialised in movement disorders, according to established clinical criteria [33, 34]. Medications potentially affecting the dopamine D2 receptor availability were withdrawn in due time prior to IBZM SPECT according to the guidelines of the European Association of Nuclear Medicine [20].

Adjustment of spatial resolution

Reconstructed spatial resolution of the Hannover images was about 15 mm FWHM (see subsection “SPECT imaging procedures used in Hannover”). In order to adjust the spatial resolution of the Hannover images to the spatial resolution of 19 mm FWHM of the Hamburg images, which is the resolution required for the application of the IBZM tool, the Hannover images were smoothed by convolution with an 3D isotropic Gaussian kernel with 12 mm FWHM using the Smooth tool of SPM. No further adjustment was performed.In general, the required width of the smoothing kernel is given by sqrt(19×19–x×x), where x is the FWHM of the original image in mm.

Evaluation of the IBZM tool in Hannover subjects

The IBZM tool was applied to the 27 Hannover patients.Statistical analysis of the striatum-to-reference ratio R was performed as follows. First, one-way analysis of variance (ANOVA) and post hoc tests for multiple comparisons were used to compare the groups with respect to R. Scheffé’s or Tamhane’s post hoc test was applied according to the result of Levene’s test of homogeneity of variances. These computations were performed using SPSS 10.0 for MS Windows. In the second step, sensitivity, specificity, positive predictive value and negative predictive value for the differentiation of MSA patients and controls were computed. Two different thresholds were used: the lower limit R P and the upper limit R N of the grey area.In addition, comparison of visual evaluation and semi-quantitative analysis was performed as described in the subsection “Visual evaluation versus semi-quantitative evaluation in Hamburg subjects”.

Manual VOI analysis

In order to further evaluate the potential of the IBZM tool, a non-automatic VOI analysis with manual positioning of fixed VOIs was performed in the 27 Hannover patients.The original Hannover images, i.e. before adjustment of the spatial resolution for application of the IBZM tool, were manually resliced according to the orbitomeatal line. The four slices with the highest striatal IBZM uptake were summed, resulting in a slice of 11.6 mm thickness. Standard VOIs were positioned on each of both striata and on the occipital cortex on the summed image (Fig. 4). The standard VOIs for the right striatum and the occipital cortex had been manually defined previously in the summed image of one of the Hamburg subjects who had been used for the definition of the normal and pathological range of the striatum-to-reference ratio R. The standard VOI for the left striatum had been obtained by reflecting the right striatum VOI at the mid-sagittal line. The standard VOIs were intended to anatomically delineate the striata and the occipital cortex, and, therefore, were irregular in shape. The surface area of the VOI of one striatum was 5.0 cm2; the surface area of the occipital VOI was 19.0 cm2. In consideration of the slice thickness of 11.6 mm, the VOIs represented a volume of 5.8 ml and 22.0 ml, respectively. The standard VOIs were used in all subjects without changes in size and shape. Only the position was adjusted individually by manually shifting and rotating the VOIs until the position appeared optimal visually. The striatum-to-reference ratio was estimated for each striatum separately by the ratio R m of the average uptake in the striatum VOI and the average uptake in the occipital ROI (“m” indicates manual analysis).

Fig. 4
figure 4

Standard VOIs used in the manual VOI analysis placed in the summed slice of a control subject from the Hannover sample

Statistical analysis of group differences in R m was performed as described in the previous subsection for the IBZM tool’s R. R m and R were compared by Spearman’s correlation analysis. In order to check for potential differences in statistical stability between R m and R, Pitman’s test for equality of variances was performed within each of the three groups of subjects [35]. Pitman’s test included two steps. First, both R and R m were scaled to a mean value of 1, separately within each group. Second, a potential correlation between the difference and the sum of scaled R and scaled R m was analysed by Spearman’s correlation test. Significant correlation between difference and sum indicates a significant difference of the coefficient of variance between R m and R.

Results

Automatic stereotactic normalisation was successful in all patients as assessed visually.

The results of the comparison of visual evaluation and semi-quantitative evaluation in 63 subjects from Hamburg are given in the scatter plot in Fig. 5. There was “perfect” correlation, i.e. (a) no striatum which visually was classified as having normal (or increased) uptake had a pathological R<1.40, and (b) no striatum which visually was classified as having reduced (mildly or severely) uptake had a normal R>1.46. This was quantitatively confirmed by the Spearman correlation analysis, which yielded a high correlation with very high statistical significance (correlation coefficient=−0.831, p=0.000, n=126).

Fig. 5
figure 5

Striatum-to-reference ratio R versus visual evaluation in 63 Hamburg patients (126 striata) in whom [123I]IBZM SPECT had been performed for differential diagnosis of parkinsonian syndromes

The semi-quantitative results of the IBZM tool in the well-documented Hannover patients are given in Fig. 6. Striatal-to-reference uptake ratio R was 1.36±0.09 (n=16), 1.52±0.06 (n=14) and 1.43±0.06 (n=24) in the MSA group, the control group and the PD group, respectively. The 10th percentile and the lower bound of the 95% confidence limit of R in the control group were 1.44 and 1.49, respectively. Differences between the groups were highly significant according to ANOVA (df=53, F=22.165, p=0.000). Post hoc testing revealed that all groups differed significantly from each other (Levene test: p=0.277; thus Scheffé test: MSA vs controls: p=0.000, MSA vs PD: p=0.008, controls vs PD: p=0.001). None of the MSA patients had normal uptake ratio R, and none of the controls had pathological R (Table 1). Sensitivity, specificity, positive predictive value and negative predictive value for the differentiation of MSA patients from the controls were 63%, 100%, 100% and 70%, respectively, when the lower bound of the grey area was used as the threshold (MSA if R<R P=1.40), and 100%, 86%, 89% and 100%, respectively, when the upper bound of the grey area was used as the threshold (MSA if RR P=1.46). Visual analysis showed almost perfect correlation with semi-quantitative evaluation (Fig. 7). Only one striatum (in a control subject) with normal R (1.48) was classified as having mildly reduced uptake visually.

Fig. 6
figure 6

Striatum-to-reference ratio R in Hannover subjects: eight patients (16 striata) with probable MSA, seven patients (14 striata) without any evidence of neurological disease (controls) and 12 patients (24 striata) with probable idiopathic PD

Table 1 Semi-quantitative analysis in the Hannover patients (on the basis of striata)
Fig. 7
figure 7

Striatum-to-reference ratio R versus visual evaluation in the 27 Hannover patients (54 striata)

The results of the manual VOI analysis in the Hannover subjects are shown in Fig. 8. The striatum-to-reference ratio R m was 1.51±0.14 (n=16), 1.71±0.09 (n=14) and 1.63±0.09 (n=24) in the MSA, control and PD groups, respectively. R m showed a highly significant, strong correlation with the IBZM tool’s striatum-to-reference ratio R (Spearman correlation coefficient=0.725, p=0.000, n=54). The Bland-Altman plot [36] of the difference R mR versus the average 0.5×(R m+R) given in Fig. 9 revealed (a) that the manual VOI analysis yielded systematically higher values than the IBZM tool (paired t test: df=53, T=17.976, p=0.000) and (b) that the difference increased with increasing striatum-to-reference ratio (Spearman correlation coefficient=0.490, p=0.000, n=54). The group effects of R m were highly significant according to ANOVA (df=53, F=14.214, p=0.000). Post hoc testing revealed that all groups differed significantly from each other (Levene test: p=0.040; thus Tamhane test: MSA vs controls: p=0.000, MSA vs PD: p=0.012, controls vs PD: p=0.047). The scaled striatum-to-reference ratios (scaled to mean value 1 within each group) had standard deviations 0.063 (R)/0.094 (R m), 0.038/0.055 and 0.043/0.053 in the MSA, control and PD groups, respectively. While Pitman’s test did not reveal a significant difference in the coefficient of variation between the IBZM tool’s R and the manual R m in the PD group (correlation coefficient=0.262, p=0.217), there were significant effects in the MSA and control groups (MSA: correlation coefficient=0.676, p=0.004; controls: correlation coefficient=0.569, p=0.034).

Fig. 8
figure 8

Manual VOI analysis: Striatum-to-reference ratio R m in Hannover subjects: eight patients (16 striata) with probable MSA, seven patients (14 striata) without any evidence of neurological disease (controls) and 12 patients (24 striata) with probable idiopathic PD

Fig. 9
figure 9

Bland-Altman plot of the striatum-to-reference ratio estimates of manual VOI analysis (R m) and fully automated IBZM tool (R) in the 27 Hannover patients (54 striata): difference versus average

Discussion

In the present paper a new software tool for fully automated standardised processing and documentation of [123I]IBZM SPECT scans is presented (IBZM tool). The IBZM tool includes (a) realignment and summation of multiple frames (motion correction), (b) stereotactic normalisation, (c) scaling, (d) VOI analysis and (e) display.

Dynamic SPECT acquisition allows correction of head movement to some extent, most easily by realignment and summation of the single frames (a more sophisticated method for motion correction of dynamic [123I]IBZM SPECT scans using principle component analysis has been described by Acton et al. [37]). This simple method assumes that movement mainly occurs between frames, and, therefore, in principle works the better the shorter the frames are. However, reduction of frame duration is limited by count statistics. A frame duration of 5 min might be a reasonable compromise (multiple head system). In the present study, frame duration was 10.7 min for reasons of practicability, since the SPECT system (Nucline X-Ring/4R, Mediso, Hungary) did not provide true dynamic acquisition (dynamic acquisition was simulated by performing static scans one after the other).

Spatial resolution of the reconstructed images was about 19 mm FWHM, which appears rather smooth at first sight. However, in a preparatory study (unpublished) we had compared different reconstruction algorithms and filter methods with FWHM between 10 and 20 mm by visual evaluation of the reconstructed images. An FWHM of 19 mm appeared to provide a good compromise between spatial resolution and statistical noise. Smoothing generally reduces the risk of false positive findings, but at the expense of reduced sensitivity. However, a low rate of false positive findings is considered to be useful in a clinical patient care setting.

The spatial resolution of 19 mm FWHM in reconstructed images raises the question of the use of all-purpose (AP) collimators for the acquisition instead of high-resolution (HR) or even ultra-high-resolution (UHR) collimators. The system spatial resolution of an AP collimator is typically about 9 mm FWHM as compared to 7–8 mm for an HR and 6 mm for a UHR collimator, and this difference might not be relevant for a reconstructed resolution of 19 mm. On the other hand, there is a large improvement in sensitivity, by about 50%, when using an AP collimator compared with an HR collimator. This sensitivity gain might be used to reduce the acquisition time, which is an important factor, particularly in patients with movement disorders and/or dementia. Increased scatter fraction of an AP collimator might be compensated by scatter correction [38]. The effect of different collimators or, more generally, different acquisition and reconstruction protocols, should be addressed more rigorously in further studies.

For stereotactic normalisation an IBZM template was used, since a tracer-specific template in general provides better normalisation results than the standard perfusion template of SPM, for example [3941]. The IBZM template was created from 20 [123I]IBZM SPECT scans that had been performed for differential diagnosis of parkinsonian syndromes in clinical patient care. No specific inclusion or exclusion criteria were applied. In particular, it was not required that the striatal IBZM uptake was normal in these subjects. The template was intended to be representative of the population of patients referred to our department for IBZM SPECT, in order to avoid unwanted bias by the stereotactic normalisation.

Scaling was performed by dividing each voxel value by the 75th percentile of the voxel values within a predefined mask which basically included the whole brain except the striata (Fig. 1), i.e. the whole brain without the striata was used as the reference region. Other groups have suggested the use of rather small reference regions, the occipital cortex [6], the frontal cortex [12] or the cerebellum [42], for example. The rationale for using these regions was that they might be least affected by neurodegenerative parkinsonian syndromes (in addition to having a very low D2 density compared with the striatum). However, small reference regions might cause both higher statistical variance and lower reproducibility than large ones. In subjects with neurodegenerative parkinsonian syndrome, IBZM uptake might also be reduced in some extrastriatal brain regions. Scaling using the “whole brain except striata” reference region causes overestimation of the striatum-to-reference ratio R in these subjects. However, this effect is expected to be rather small as long as the affected extrastriatal brain regions do not account for a major part of the whole brain. The situation here is very similar to the situation in brain FDG PET. In brain FDG PET the conventional proportional scaling approach of SPM, which essentially also uses the whole brain or even the whole head as a reference region, has been very successful in the voxel-based statistical evaluation [43]. The IBZM tool attempts to further reduce the effect of putative reduction of IBZM uptake in some extrastriatal brain regions by using the 75th percentile instead of the mean value to characterise IBZM uptake in the reference region. The 75th percentile might be less susceptible to localised extrastriatal alterations of IBZM uptake than the mean.

Striatal IBZM uptake was semi-quantified by hottest voxel analysis within large VOIs predefined in the template such that the whole striata were definitely included (Fig. 1). Hottest voxel analysis within large VOIs might be less sensitive to residual anatomical variations (after stereotactic normalisation) than averaging over all voxels within anatomically delineated VOIs. The number of hottest voxels to be averaged in each one-sided VOI was chosen to be 1,875, corresponding to a total volume of 15 ml, in order to sample roughly the whole tracer uptake in the striatum. The actual volume of the striatum is about 9 ml and 10 ml in females and males, respectively [44]. The total volume of the hottest voxels was chosen to be slightly larger than the actual volume in order to account for (a) the spatial resolution of the IBZM SPECT images, and (b) the fact that the perfusion SPECT template of SPM and, therefore, also the IBZM template of the IBZM tool, is slightly larger than the average brain. Sampling the whole striatum is important as a neurodegenerative parkinsonian syndrome might not affect the whole striatum. In this case, hottest voxel analysis with a too small number of voxels would be false negative. Acton et al. proposed fuzzy clustering for the automatic segmentation of [123I]IBZM SPECT scans [45]. However, this method requires full dynamic imaging and, therefore, is not applicable in clinical patient care, in which usually only a late static scan is performed.

Striatum-to-reference ratio R ranged from 1.04 to 1.80 in the subjects described in the present paper. This range might appear rather small. However, the standard deviation of R in the Hannover controls was rather small, too (SD=0.06). This suggests that R as estimated by the IBZM tool nevertheless provides significant statistical power for the detection of reduced D2 availability. This was confirmed by the fact that ANOVA and post hoc testing revealed highly significant R differences between the three Hannover groups, i.e. MSA patients, controls and PD patients.

For visual evaluation the IBZM tool displays a 12-mm-thick transverse slice through the striata using a standardised colour table fixed to a predefined range of scaled uptake values. This ensures that the cortical IBZM uptake is mapped roughly to the 50% value of the colour bar in all subjects, independent of striatal tracer uptake, i.e. the cortex is always displayed with about the same colour. As a consequence, the colour of the striata varies depending on striatal tracer uptake. This is in contrast to the most commonly used individual scaling of the colour bar to the whole range of values within a given study, which causes the striatum, usually the area with the highest uptake, to be always displayed with the maximum colour. Use of the standardised fixed colour table simplifies visual evaluation of [123I]IBZM SPECT scans, as indicated by the “perfect” correlation between semi-quantitative analysis and visual evaluation by an inexperienced interpreter in 63 subjects (126 striata). Thus, semi-quantification is not the only way to objectively assess striatal D2 availability and it can be vitally supported and critically verified by visual interpretation using an optimised scaling as laid out above.

Parameters of the IBZM tool (thresholds for pathological and normal R, colour table) were fixed somewhat empirically using IBZM SPECT images from Hamburg patients without firmly established diagnoses. For fixing the normal range of R, for example, it was assumed that normal FP-CIT uptake implies normal IBZM uptake. It is well known that this is true in the majority of cases, but certainly not always. Cases with normal FP-CIT binding but massively impaired IBZM binding have been reported in vascular parkinsonism, for example [46]. Therefore, in order to evaluate the tool and particularly to check the validity of the chosen parameter values, the IBZM tool was transferred to the Department of Nuclear Medicine of the University Medical School, Hannover. In Hannover the IBZM tool was tested rather rigorously using IBZM studies of well-documented patients with clinically confirmed diagnoses and control subjects without evidence of a neurological disease. The good results of this external evaluation confirmed the choice of the parameters of the IBZM tool. The range of normal R and the range of pathological R, for example, were confirmed not only by the high accuracy for the differentiation of MSA subjects and controls, but also by the analysis of the Hannover controls. In fact, applying to the well-documented Hannover controls the same procedure as was used to derive the lower bound R N of the range of normal R and the upper bound R P of the range of pathological R from the presumably normal striata in the Hamburg patients yielded values very close to the Hamburg values: R N(Hannover)=1.49 versus R N(Hamburg)=1.46, and R P(Hannover)=1.44 versus R P(Hamburg)=1.40. The fact that the Hannover values were slightly larger than the Hamburg values was most likely due to the difference in age: the Hannover controls were on average 14 years younger than the Hamburg controls (43±17 years vs 57±13 years, unpaired t test: p=0.042). Taking into account this age difference and the fact that R is expected to show an age-related decline of about 2.5% per decade in healthy subjects [12], the agreement between the Hannover values and the Hamburg values was excellent. The external confirmation of the parameters of the IBZM tool in the well-documented Hannover sample does provide better evidence for their validity than could have been obtained by additional Hamburg subjects with firmly established diagnosis.

The Hannover SPECT studies had been performed using a different camera, a different acquisition protocol and a different reconstruction algorithm than had been used in Hamburg. The only adaptation applied to the Hannover images was smoothing for adjustment of spatial resolution. The sensitivity, specificity, positive predictive value and negative predictive value of the IBZM tool for the detection of MSA in the Hannover subjects were in good agreement with previous studies. This suggests the validity of the IBZM tool. Simple adjustment of spatial resolution appears sufficient to adapt IBZM scans acquired and processed according to different protocols for analysis with the IBZM tool. In particular, no local adjustment of the bounds of the pathological range and the normal range is required. This simplifies use of the IBZM tool at different institutions.

Limitations of the Hannover patient groups used for the evaluation of the IBZM tool include that (a) control subjects were significantly younger than both MSA and PD subjects (43±17 years vs 62±5 and 58±8 years) and (b) duration of disease was exceptionally long in PD subjects—in particular, it was significantly longer in PD subjects than in MSA subjects (12.5±5.5 years vs 2.6±1.6 years, p=0.000).

Concerning effects of age, the striatum-to-reference ratio R is expected to show an age-related decline of about 2.5% per decade in healthy subjects [12]. This effect is rather small and hard to detect in diseased subjects [47]. Therefore, the IBZM tool does not perform an age correction of R. In our departments age is usually used only in difficult cases with inconclusive uncorrected R, i.e. 1.40≤R≤1.46 might be diagnosed as normal in “old” patients (≥70 years) and as pathological in “young” patients (≤40 years).

Concerning the duration of disease, it is well known that there might be compensatory D2 upregulation in the striatum of patients with early unmedicated PD [42, 48]. PD patients with a long disease duration might also show reduction of postsynaptic D2-availability [49]. The Hannover PD patients with a long disease duration were included in the present study for the purpose of confirming the latter possibility, and in fact the IBZM tool revealed pathological R in four of these patients, supporting the hypothesis. This result is a by-product of the present study. It does not allow conclusions to be drawn regarding the potential of the IBZM tool to differentiate between MSA and PD in clinical patient care. Patients referred to IBZM SPECT for differentiation between MSA and PD typically have a much shorter duration of disease than the PD subjects of the Hannover sample included in the present study. The accuracy of the clinical diagnosis increases with the duration of disease, so that nuclear medicine imaging is rarely required in patients with a long disease duration in routine clinical patient care. No patients with a short disease duration and confirmed PD were available in the present study. Therefore, the IBZM tool’s potential to differentiate between short-duration PD and MSA could not be evaluated explicitly. However, it is generally accepted that PD patients with a short disease duration show normal or even slightly increased D2 availability (cf. above). Therefore, the IBZM tool’s potential to differentiate between short-duration PD and MSA is expected to be the same, or even slightly better, than its potential to differentiate between MSA patients and healthy controls.

The IBZM tool was compared with a standard manual VOI analysis. Although both methods were able to differentiate MSA, control and PD groups of the Hannover sample, the IBZM tool appeared to provide higher levels of statistical significance. The R m difference of the manual analysis between PD and controls, for example, only just passed the level of statistical significance (p=0.047), whereas the R difference of the IBZM tool was highly significant (p=0.001). An additional indication of improved statistical power of the IBZM tool compared with manual analysis was provided by Pitman’s test, which demonstrated that the IBZM tool R had a significantly smaller coefficient of variance than the manual R m, at least in the MSA and control groups.

The IBZM tool is implemented as an SPM toolbox, i.e. the MATLAB (The MathWorks, Inc., Natick, MA, USA)/SPM environment is needed to run the IBZM tool. The IBZM tool can be downloaded free of charge via anonymous ftp ( ftp://www.ftp.uke.uni-hamburg.de/pub/nuklearmedizin/). A link will also be given at the SPM homepage (Extensions). The IBZM tool is available for both SPM99 and SPM2. The IBZM tool is easily adapted to local requirements (logo, labelling). The tool requires only that (a) reconstructed IBZM images are given in ANALYZE format and use radiological conventions and (b) spatial resolution is 19 mm FWHM. Detailed online help is provided.

Finally, the IBZM tool is easily adapted to other tracers (SPECT and PET) for the investigation of striatal receptor status. In our departments an analogous tool is used in [123I]FP-CIT SPECT.

In conclusion, the new IBZM tool allows reliable and accurate evaluation of [123I]IBZM SPECT simply by visual interpretation. Based on this new tool, semi-quantification may no longer be the only way to objectively assess striatal D2 receptor binding. Visual interpretation using optimised scaling can be an excellent supplementary method. This will clearly reduce the reliance of [123I]IBZM SPECT on semi-quantification alone. Even for the experienced interpreter, the IBZM tool might simplify and speed up evaluation of difficult cases. Furthermore, the standardised documentation will assist in presenting easily comprehensible findings to referring physicians.