Introduction

Imaging of dopamine transporters using single-photon emission computed tomography (SPECT) has been introduced as a valuable tool to evaluate patients with Parkinson’s disease (PD) [13]. Even if visual assessment of the images can be considered sufficient in many diagnostic situations [4], a quantitative approach is extremely important in novel promising perspectives, such as the measurement of disease progression and the monitoring of neuroprotective treatments. Quantitative analysis is based on calculation of the binding potential (BP) resulting from the ratio of specific (striatum, caudate, putamen) to non-specific (occipital lobe or cerebellum) activity at steady state.

Accuracy of region of interest (ROI) activity estimation is hampered by attenuation and scatter effects and by the poor spatial resolution of nuclear imaging techniques. The importance of attenuation and scatter corrections in improving the accuracy of quantitative BP measurements has already been pointed out [5, 6]. However, even with these corrections, accurate estimates of activity in ROIs that are often small with respect to the spatial resolution of the tomographic systems are particularly challenging. Indeed, activity is systematically underestimated for objects smaller than 2.5 times the full-width at half-maximum (FWHM) of the point spread function (PSF) [79] when their activity is higher than that in the surrounding regions. Conversely, low-activity objects are overestimated owing to the effect of spill-in from high-activity adjacent regions. The loss of signal due to the limited spatial resolution is called the “partial volume effect” (PVE) and the factor that expresses this loss of quantification is called the “recovery coefficient”. These considerations are relevant to BP estimates with SPECT, in which radioactivity concentration in the striata can be underestimated by more than 50% if no correction for PVE is applied [5, 10]. In spite of its high spatial resolution and sensitivity, PET is also markedly affected by the PVE [11, 12], and an underestimate of around 50% of the true activity can still be expected when BP measurements are done [13].

In recent years, physical [1416] or simulated [17] phantom studies have frequently been used to characterise the PVE. Nowadays, the availability of high-resolution structural imaging (X-ray CT or MRI) facilitates the implementation of sophisticated PVE correction procedures [1820], and several publications have addressed this issue [2124]. Anyway, all these techniques require some non-trivial procedures for the determination of corrections which are applied, a posteriori, to the ROI activity obtained by summing the values of the individual reconstructed pixels in the volume of interest.

A method with an inherent capability of PVE correction (LS-ROI, see below for the acronym) for the estimation of region activity directly from projections was previously developed by our group [25]. This method does not require an image reconstruction step and allows unbiased regional concentration estimates to be obtained. Its application in dynamic SPECT has previously been evaluated in a study simulating kidney dynamics of radiohippuran [26].

In the present study we applied the LS-ROI method to prevent PVE and to obtain reliable quantitative estimates of the BP in 123I-FP-CIT brain studies. LS-ROI results were compared with the values obtained when activity concentrations were estimated by overlapping ROIs over conventionally reconstructed images without PVE correction. Phantom studies were performed in order to compare the two methods. Results obtained in seven FP-CIT (DATSCAN, GE Healthcare, Amersham, UK) patient studies are also presented.

Materials and methods

LS-ROI method

The LS-ROI method [25] falls among the “natural” or “generalised” voxel techniques. This class of algorithms is derived from the early work of Huesman [27], extended later to include not only scatter but also spatial resolution effects, both in the heart [28] and in tumour imaging [29]. An alternative formulation assuming Poisson statistics was suggested by Carson [30] and, more recently, a technique called “template projection-reconstruction” [31] was used successfully to quantify target regions in tomographic images.

In such approaches, the object is supposed to be constituted of a few homogeneous ROIs of arbitrary shape. These ROIs are called “natural voxels”, since their homogeneous concentration is “naturally” induced by organ structure or tracer distribution. After their delineation, these natural voxels constitute the basic image volume element and the tomographic problem reduces to the determination of their value, just as if they were the standard cubic voxels. Owing to the reduced number of unknowns, the solution can be computed numerically from raw projection data by the direct inversion of the tomographic matrix. In the specific case of the LS-ROI method, the direct solution is evaluated in a least squares sense, hence the name of the method that we propose. The mathematical details of the LS-ROI method are reported in the Appendix. The main feature of quantification with LS-ROI is that conventional image reconstruction is not needed, except for the purely geometrical step of ROI contour drawing, when structural coregistered images are not available.

The hypothesis of region homogeneity required by the method sometimes may be a quite raw approximation of reality. Therefore, its validity has to be assessed every time, and the accuracy of ROI activity estimation increases if this hypothesis is well satisfied. In order to satisfy the homogeneity requirement, ROIs may be subdivided, but increasing the number of ROIs will lead to worsening of noise propagation and ill-conditioning. Thus, a trade-off always has to be made between accuracy and precision of estimates. However, in some kinds of study (such as those with FP-CIT), the tracer really distributes in a few regions, inside any of which the concentration is quite homogeneous. Thus, the application of these methods in dopamine receptor studies seems to be adequate.

System PSF model

In both EM-SPECT and LS-ROI algorithms the depth-dependent system response model described in [32] has been introduced through tomographic coefficients. The increase in the FWHM with the distance between the source and the collimator is modelled in the transaxial planes, while the longitudinal spread is assumed to be that of a voxel in the centre of the reconstruction volume and constant for each source position [33]. The intrinsic camera resolution has also been added to the collimator resolution in order to obtain the total system resolution.

Phantom study

In order to evaluate the quantitative capabilities of the two methods, we performed two Alderson striatal phantom (Radiological Support Devices Inc., Long Beach, CA, USA) acquisitions. The phantom consists of five anthropomorphic compartments, corresponding to left and right caudate nuclei, left and right putamen, and the remaining brain volume. The compartments are separated by thin polyurethane walls. The chambers are surrounded by an artificial skull that includes both the soft tissue substitute (polyurethane, modified for tissue equivalence) and the skull.

A high-resolution CT image of the phantom was acquired with the brain shell cavity filled with water and the four striatal compartments left empty. Two cross-sectional views of the phantom through the striata are shown in Fig. 1. In the CT scan, 73 slices of 512 × 512 pixels were acquired (pixel size = 0.49 mm, slice thickness = 2 mm).

Fig. 1
figure 1

CT transaxial views of the Alderson phantom through the striata

The SPECT acquisitions were performed with a triple-headed camera (IRIX with UHR collimators). Projections were acquired in a matrix of 128 × 128 bins, bin size = 2.33 mm. The radius of rotation was 15 cm. In both scans, 120 angles were acquired at 120 s each, for a total acquisition time of 80 min.

In the first experiment (Acq 1), the four striatal compartments were filled with the same solution of 123I-iodide. In the remaining part of the brain shell, a more diluted solution was used. Concentrations were measured in a well counter. A 6.11:1 striatum-to-background ratio was measured. In the second experiment (Acq 2), the left and right striatal compartments were filled with two different solutions of 99mTc. The striata-to-background ratios measured in a well counter were 9.15:1 and 27.87:1 for the right and the left striatum, respectively. Total counts were approximately 2.5 and 1.75 million in the first and the second experiment, respectively.

In order to register the CT and the SPECT images, the tomographic data sets underwent a first EM reconstruction. The CT images were processed in order to manually remove the skull external contribution that is not visible on the SPECT images. Then, the CT images were realigned to the SPECT ones by using the NEUROSTAT software [34]. After the CT images had been coregistered to the SPECT ones, the attenuation map, consisting of the skull and the inner brain compartment, was used for the final EM reconstruction (120 iterations) and for the calculation of the ROIs’ sinograms. A homogeneous μ = 0.20 cm−1 attenuation coefficient was used for Chang attenuation correction.

Region segmentation was performed on the realigned CT images with the MRIcro [35] region segmentation tool. The volumes of the segmented striata and of the remaining brain shell were 8.4 and 1,320 ml, respectively. Background concentration was evaluated with a cylindrical ROI with an elliptical section of 247 voxels extending over ten slices in which striata were also present and it was kept sufficiently far from them to avoid spillover effects. The volume of this reference region was 31.4 ml.

Patient studies

The DATSCAN studies of seven patients were analysed with both methods. A summary of patient symptoms and diagnosis is shown in Table 1. Criteria for possible diagnosis of PD are the presence of at least two of the following features: resting tremor, bradykinesia, rigidity, asymmetrical onset (at least one of these must be tremor or bradykinesia); criteria for probable diagnosis of PD are the presence of at least three of the above-mentioned features [36]. None of the patients was taking interfering drugs.

Table 1 Age, diagnosis and symptoms of the seven analysed patients

The SPECT scan was acquired 3 h post injection to obtain a conventional BP estimate [2]: FP-CIT injected dose was 185 MBq and 120 projection angles were acquired with the same camera and the same projection matrix as in the phantom study. The acquisition time/projection was 60 s. The radius of rotation was settled to the minimum possible value (15.5 ± 1.3 cm). Total counts were, on average, about 2 million.

No scatter correction was applied to the data sets. Chang attenuation correction (μ = 0.20 cm−1) was applied. The attenuation mask was evaluated on a four-iteration EM reconstruction by an automated contour detection software based on a thresholding algorithm. Then it was employed both in the final EM reconstruction of the tomographic acquisitions (40 iterations) and in the production of the sinograms.

The ROIs were segmented from the reconstructed 3D image obtained using the EM reconstruction algorithm. The two striatal ROIs were obtained with a thresholding algorithm while cerebellum ROI was obtained by manual segmentation. In order to better satisfy the requirement of region homogeneity needed by the LS-ROI method, the background region was split into two ROIs, one completely void of receptors and the other showing weak binding of the tracer due to the presence of non-specific receptors (like serotonin receptors) that have a certain affinity to DAT. The volumes of striatal and cerebellar ROIs are reported in Table 3.

Results

Phantom study

Conventional ratio estimates obtained by superimposing the ROIs on EM reconstructed images are shown in Fig. 2 as a function of iterations performed. The EM algorithm was used instead of the fastest OSEM version to study the quality and quantification capability of images as “continuously” as possible. A maximum of 120 iterations was performed; however, as shown in Fig. 3, image quality excessively degrades after 40 iterations because of the semiconvergence of iterative algorithms [37]. Therefore, results obtained with 40 iterations will be discussed, since this is a more usual clinical setting. Moreover, ratios obtained after 120 iterations would have produced a reduction of the bias of less than 5% with respect to 40-iteration results.

Fig. 2
figure 2

Ratio estimates on EM images as a function of the iteration number (iter) in the phantom studies. Actual striatal/background ratios (str/bkg) were: a 6.11:1 left and right; b 27.87:1 left and 9.15:1 right

Fig. 3
figure 3

EM reconstruction of the 6.11:1 filled phantom using 20, 40 and 60 iterations

In Table 2, EM-SPECT ratio values are compared with the LS-ROI estimates. The error in the LS-ROI estimates has been evaluated by propagating the error of the striatal and reference concentrations given by Eq. 10 in the Appendix. On the other hand, estimation of the uncertainty that affects the concentration obtained from reconstructed images is quite a difficult and controversial matter and has not been performed.

Table 2 Conventional and LS-ROI estimates of the striatum-to-background ratios in the phantom study

Patient studies

Table 3 shows the ratios between activity in striata and cerebellum (R=(C st C cer )/C cer ) evaluated on patients at 3 h post injection with the EM-SPECT and the LS-ROI method.

Table 3 Ratios R=(C st C cer )/C cer obtained in the patient studies with conventional EM-SPECT and LS-ROI methods. Striatal and cerebellar segmented volumes are also shown

A correlation plot between the EM-SPECT and LS-ROI ratio estimates is shown in Fig. 4. The values of R evaluated in the phantom study have also been included in the graph. Linear regression analysis was performed.

Fig. 4
figure 4

Correlation between LS-ROI and EM-SPECT estimates: 14 points have been obtained from patient data (white circles) and four points from the phantom studies (black circles). The regression line is shown in the figure (continuous line); the dashed line represents identity

Discussion

Phantom study

The importance of SPECT quantification has been increasing since the implementation of molecular imaging techniques like FP-CIT SPECT and, as a consequence, normative data are also becoming available [38, 39].

Absolute activity quantification with SPECT is hampered by several factors, such as attenuation, scatter and PVE [40, 41]. Among these effects, the geometrical response of the tomographic system is one of the most important [5, 42]. This effect can be compensated for by means of iterative reconstruction algorithms which include in the reconstruction kernel the geometrical response model [32, 33].

Our results with EM-SPECT reconstruction are one example of this approach. In the absence of noise, such algorithms would give a perfect image restoration, provided the system response is correctly and properly modelled and the number of iterations is adequate. However, the spatial response effect mainly affects the high-frequency components of the image and, consequently, the compensation of spatial resolution is very sensitive to noise. Indeed, our results in the phantom study demonstrate that, in the presence of noise and in spite of a thorough compensation for system spatially variable response, quantification with the EM-SPECT method is less than effective, with a mean underestimation of 42%. In other words, when the algorithm reconstructs individual pixels with a high spatial sampling, SPECT activity quantification is affected by errors even with an advanced reconstruction approach in which the spatial response of the SPECT system is compensated.

The determination of the contents of a few “generalised” pixels is a different task by comparison with the determination of thousands of pixels: the high frequencies implied by the high spatial sampling of the images are not required in the ROI evaluation problem and propagation of noise on ROI values is less critical. Moreover, in the ROI formulation of the tomographic problem, we have introduced reasonable a priori information about the object, since we have stated that the activity is constant over the ROIs. This information actually changes the original problem (Eq. 4 in the Appendix) into a new one (Eq. 8) which is stable if the number of ROIs is low. Thus, in typical terms of the “inverse problems” jargon, the use of ROIs is a “regularisation technique” and, as a consequence, the LS-ROI method shows improved performance, since it deals with a new problem which is less “ill-conditioned” with respect to the original one.

A similar phantom study was reported by Meyer et al. [43], in which data were acquired with both the triple-head camera MultiSPECT 3 (Siemens Medical Systems, Inc., Hoffman Estates, IL, USA) and the dedicated brain SPECT system Ceraspect (Digital Scintigraphics, Waltham, MA, USA) and reconstructed with filtered back projection (FBP). In that study, an underestimate similar to that in our results was found when the ratios were obtained from images acquired with the triple-head system. On the other hand, ratios estimated from images acquired with the dedicated system were better recovered. Those results can be explained in terms of “ill-conditioning” as well, taking into account that the sensitivity of a dedicated scanner may be 60% higher [43] than that of a triple-head system. Thus, with dedicated systems, the effects of noise are mitigated and the ill-conditioning of the reconstruction problem is reduced.

Patient studies

In the patients, as in the phantom study, image-based EM-SPECT estimates of BP were lower than the LS-ROI estimates. The linear correlation between the two estimates (Fig. 4) suggests a systematic effect where differences in ratio values measured with the LS-ROI method (ΔLS-ROI) turn to be compressed by a factor 0.625 when measured with EM-SPECT (ΔEM-SPECT): ΔEM-SPECT = 0.625 ΔLS-ROI. Indeed the R range 0–26 obtained with the LS-ROI is reduced to 0–16 with the image-based EM-SPECT method.

A first point to emerge from our results is that the values estimated at 3 h with EM-SPECT are higher than those that can be found in the literature for BP in normal controls. In fact, Tsuchida et al. report [39] a mean BP value of 3.7 ± 0.5 in ten normal controls, both with a single outcome measure at 3 h post injection and with a multilinear regression technique applied to data acquired up to 6 h post injection. Even lower BP values were reported for normal cases in other studies [2, 38, 44]. In all the references cited above, no PVE correction was applied. Our higher estimates in patients and with EM images perhaps can be explained by the fact that the results reported in the literature have often been obtained with images reconstructed by the FBP algorithm and without a geometrical model of the system PSF and, therefore, with lower recovery capabilities. Additionally, it is worth noting that normative data are also camera dependent [10, 43].

A second point to consider is that BP estimates found with the LS-ROI method are even higher than those found with EM-SPECT. If we take into account that an underestimation of up to 50% can be expected when the BP value is measured with images without PVE correction [5], the LS-ROI estimates in patients may be realistic. Accordingly, in [43], normal subjects acquired with the brain dedicated system indeed gave BP values of around 9.

BP has also been measured with 18F-labelled FP-CIT and PET [12]. In this reference, the acquisition of data ended at 90 min post injection, when the BP with graphical analysis was approximately 3.5. The BP value at 3 h was not measured, but it is not feasible that values as high as ours could have been obtained at that time. A possible reason is that PET measurement, too, was not corrected for PVE.

An interesting comparison can be done with the results in [24], where the same tracer and SPECT system as in our study were used. In [24], the effects of PVE correction were studied in Alzheimer’s disease (AD) and dementia with Lewy bodies (DLB). In AD patients, striata can be thought unaffected. For these patients, the PVE correction increased the putamen BP value from 2.9 ± 0.4 to 8.6 ± 1.5.

Limitations of the study

As far as the phantom study is concerned, the choice of Chang’s attenuation correction method (with separate contours for each slice) instead of a variable attenuation mask derived from the CT image was motivated by the fact that, in patient studies, CT scans are not always available in a routine context. For instance, CT images were not available for the seven patients analysed in this study, and we wanted to perform the same analysis on both phantom and patient data. Obviously, when an anatomical image is available, the more accurate CT-derived variable attenuation map should be used. In this regard, the increasing diffusion of SPECT/CT systems will help in obtaining more accurate results. However, the use of Chang’s correction, a method widely available on commercial SPECT systems, makes our results easy to reproduce and apply in clinical everyday practice.

As far as the patient analysis is concerned, even if our study yielded interesting results, a more accurate study should be performed using CT or MR images for ROI segmentation and accurate definition of the attenuation mask. In the present study both the region segmentation and the (homogeneous) attenuation map definition were based on SPECT images and were therefore prone to errors due to the limited spatial resolution.

PVE correction

Let us now make some comments on PVE correction methods that have to be implemented when image reconstruction is used for quantitative purposes. One of the most popular methods has been proposed by Rousset et al. for PET [13]. In this approach, the mean concentration value \(c_{\alpha } \) (α=1, ..,N R ) measured on the images in each of the ROIs is expressed as a weighted sum of the true activity concentrations in each of the functionally distinct tissue components, \(C_{\beta } \) (β=1, ..,N R ):

$$ c_{\alpha } = {\sum\limits_{\beta = 1}^{N_{R} } {\omega _{{\alpha \beta }} } }C_{\beta } $$
(1)

where the weighting factors ωαβ are evaluated starting from the PSF of the PET system. This linear system can be rewritten in matrix form as:

$$ c = \Omega C $$
(2)

where the square N R ×N R matrix Ω is called the geometric transfer matrix (GTM) and the N R ×1 vectors c and C are, respectively, the vectors of measured and true concentrations in the ROIs. Equation 2 can be immediately solved for C by matrix inversion.

As we can immediately understand, writing Eq. 1 consists in switching from the ill-conditioned image-reconstruction problem, which has already been solved, to the corresponding ROI problem. In other words, even in Rousset’s approach, PVE correction is not due to the (re-)introduction of the geometrical model through the definition of the GTM, as the information about the system response has already been completely used in the image reconstruction step. The activity recovery is due to the introduction of an ROI model for the activity distribution. In comparison with that, with the LS-ROI method, by starting directly with a formulation of the problem in terms of ROIs, we can avoid a second correction step. This explains what we mean by inherent compensation for the PVE in the LS-ROI method.

In order to further compare our approach with the one proposed by Rousset, Eq. 9 in the Appendix can be rewritten in this way:

$$ G^{T} p = {\left( {G^{T} G} \right)}X. $$
(3)

Equation 3 can be directly compared with Eq. 2. The true ROI contents are given by vector X and can be estimated from the backprojected image G T p once the GTM (G T G) has been computed starting from the sinograms that contain all the geometric information.

A difference between the two methods is in the way the GTM is used: in the original approach suggested by Rousset, the GTM relates two “objects”, c and C, while in the LS-ROI approach we relate an “object” X, to “projections” p. Therefore, in the LS-ROI method, we just need to model the system PSF in the projection space, while in Rousset approach, the spreading of information has to be described in the object space. The spatial resolution in the object space can be accurately described as a consequence of the geometrical system response in the projection space, in which actually the physical step of radiation detection takes place. In this case, the PSF defined in the projection space is used to project the ROIs and then the GTM is evaluated by reconstructing the ROI sinograms with the same parameters as those used for SPECT or PET data. Otherwise, especially for PET, where modelling the PSF in the projection space is even more complex than for SPECT, the GTM can be evaluated by convolution of the region support with some simplified version of an experimentally measured spatially-variant resolution in the object space. A comparison of these two approaches can be found in [21].

It is also worth to point out that only a PSF model in the projection space allows to directly estimate the regional activity without image reconstruction. This basic feature of the LS-ROI algorithm is a step forward to optimal integration of structural imaging (CT) in molecular imaging (PET/SPECT) as currently offered by hybrid tomographic systems (PET/SPECT-CT).

Conclusion

In this paper, by means of a phantom study, we have demonstrated that the LS-ROI method yields more accurate estimates of tracer concentration in the ROIs than are obtained with the conventional method of summing the content of the pixels of the reconstructed images. The improvement in the accuracy derives from (1) the inclusion of the geometrical system response in the algorithm and (2) the reduced number of unknowns. This approach decreases the ill-conditioning of the problem and allows better compensation for the PVE, once the system response and the statistics are known.

Our phantom study showed that the bias resulting from the conventional method of measuring BP involves a substantial underestimation. The better performance of LS-ROI in the “anthropomorphic” phantom study suggests that the LS-ROI estimates are probably more accurate than the EM-SPECT estimates also in the patients, in whom the true activity concentration is unknown, with the consequence that it is impossible to express a “technical” preference for either of the two reconstruction methods applied.

In patients, EM-SPECT BP values are about 60% of the LS-ROI BP estimates. The measured ranges of BP values are 2-7 and 3-12 with EM-SPECT and LS-ROI, respectively. In other words, a range of 5 BP units, measured with EM-SPECT, becomes a range of 9 BP units with the LS-ROI method. This suggests that the difference between an abnormal and a normal BP could be greater when measured with LS-ROI than when measured with EM-SPECT. This expansion of diagnostic sensitivity could be strategic in view of the use of DAT SPECT in early PD cases and in “at-risk” subjects as well.

The potential of the LS-ROI method in discriminating between normal control and PD patients has not been assessed thoroughly in this work. The application of the method to a larger group of normal controls and PD patients will constitute the subject of future work. However, these promising results suggest that, from the clinical standpoint, use of the LS-ROI method in BP measurements might enhance the role of receptor molecular imaging in clinical practice.