Keywords

1 Introduction

Organ-dedicated PET systems [1] present some advantages in comparison with conventional or Whole-Body (WB) scanners, i.e., a better spatial resolution and contrast recovery, a higher sensitivity by placing the detectors closer to the organs and a lower cost. The doses of radio-tracer agents injected to the patient is reduced, this makes possible to minimize the radiation exposure, one critical aspect associated to Nuclear Medicine (NM).

In addition, PET scanner explorations tend to put stress on patients (especially those suffering from claustrophobia) [2], who must remain still during the entire scanning process. For these reasons, a dedicated specific open system with modular and optimized geometry that maximizes the angular coverage of the dedicated organ is highly desirable.

Some example of this type of system can be found, e.g. ClearPEM PET [3] or MAMMI PET [4] for breast cancer or ProsPET for prostate cancer [5]; yet, not very common. In this work we present an open PET geometry configuration for an organ-dedicated, where the patient has freedom of movement.

A motion correction strategy based in a framing method present some advantages when compared to other methods. Specifically, joint reconstruction and system matrix modelling present a high computational load in comparison to that of the frame-based methods. In other cases, the system matrix is not available, and the model cannot be applied. Lastly, event rebinning does not allow us to have non-deformable motions such as heartbeat or breathing.

However, despite frame-based motion correction methods presenting a considerable amount of advantages, they also have deficiencies. One of them is the presence of noise in the reconstructed images when motion occurs inside the time duration of a given frame (intra-frame motion). Another example is the image degradation as a consequence of low count statistics that some of the frames can present when the motion is sudden. These deficiencies can affect the clinical practice, where it is necessary to have frames with high statistics and low noise to improve lesion detectability.

In order to overcome the greatest deficiencies of the frame-based algorithms, a novel frame-based motion correction algorithm is proposed in this study. This method is called EMAF (enhanced multiple acquisition frames). The main differences from the original MAF algorithm (multiple acquisition frames developed by Piccard [13]) is that EMAF use of an adaptive threshold in order to divide the original dataset in frames when some prior knowledge of the size of the lesion is present, allowing one to reduce the dose administered to the patient while providing a good, motion-free estimation. It also allows for grouping motions that share similar features under the same frame, thereby increasing the statistics on each frame and reducing the intra-frame motion artefacts. Once the frames are defined, multilevel spatio-temporal registration is performed to ensure a good signal-to-noise ratio.

To summarize, the proposed open PET system poses several challenges to image reconstruction, including (1) a limited angle problem, (2) a motion correction problem, and (3) a sensitivity correction problem.

The paper presents a reconstruction and registration framework to correct rigid and respiratory motion by respiratory gating in list-mode data using time of flight (TOF) information based on a multi-frame algorithm approach based on EMAF algorithm.

2 Materials and Methods

2.1 PET System and GATE Simulation

The PET system was designed [6] with a modular geometry consisting of two panels of 32 \(\times \) 530 \(\times \) 353 millimetres with 180 centimetres of separation between them (see Fig. 1). Each panel was formed by 24 modules of 64 LYSO crystals. In order to cover enough angular information, the panels rotates 90 degrees in the longitudinal axis every 5 s.

A 20 s simulation was performed in GATE [7,8,9] with the XCAT voxelized image [10]. GATE simulation was done with back-to-back gamma particles using a 5 nanoseconds of coincidence window, 10% of blurring resolution and an 350–650 keV energy window.

Fig. 1.
figure 1

PET system and a representation of the simulated motion

2.2 XCAT Phantom

XCAT phantom [10] was used to simulate a 18F-FGD scan of a total-body of 1.75 m of height with 1.2 Bq/mm3 organs activities concentration. The torso region was cropped afterwards. A lung lesion of 20 millimetres diameter was inserted in the upper lobe of the left lung. The proportion between the lesion to lung contrast was set 4.5:1.

2.3 EMAF Algorithm

Respiratory motion [11] was simulated with a magnitude of 12 millimetres in the anterior-posterior direction and 20 millimetres in the superior-inferior direction. Four cycles of five seconds were reproduced. Additionally, a oscillatory rigid torso-motion in the lateral axis (X) was simulated with the equation \(x = A sin(2\pi ft)\) with amplitude (A) of 10 centimetres, and frequency (f) of 0.2 Hz (see Fig. 1).

The presented EMAF algorithm [12] (shown in Fig. 2) is an improved modification based on Picard’s original Multiple Acquisition Frames (MAF) Algorithm [13] in order to overcome their most important limitations, such a prefixed number of frames and a short frames with low statistics.

Fig. 2.
figure 2

Proposed algorithm workflow

This method described in detail in [14] consists of three phases. First, the coincidence data given by the PET scanner are divided into different frames following the next steps:

1.- The motion is parametrized by the chosen tracking method. This allows have a 3D motion parametrization in the three directions and angles.

2.- Starting from the previous step, a region of interest (ROI) around the source is defined. This region needs to be wide enough to fit any position recorded by the tracking method. This region will be the grid T.

3.- The ROI is rebinned into an equidistributed and equispaced grid in which the size of each bin (what it defines as voxel) is a free parameter.

4.- Each of the voxels that make up the grid is numbered with an index associated with a certain position in the 3D space. The source motion each moment is stored. This is achieved through two parameters, the time in which the source travels from one voxel to another (called cut-off time) and the index of the voxel (cut-off rates).

5.- In order to apply a time filter, the total time that the source spends in each voxel is calculated. These steps are only followed for those voxels with a total time higher than half the average total voxel time. In this way, using voxels with small statistics that can add noise to the final image is avoided.

6.- For every voxel that passes the aforementioned filter, a new list-mode file is created using the cut-off time and cut-off rate. Once the process has finished, voxels that are completely empty in the grid are discarded.

Coming up next (in the second part of the algorithm), each of those frames are reconstructed separately, and, finally, an spatio-temporal image registration method is performed to align and register the frames into the final reconstructed image.

The approach exposed has two clear advantages: (1) it allows one to group in the same frame motions occurring in the same spatial region, which increases the statistical information; (2) it does not need to prefix a priori the number of possible frames, which allows one to obtain a greater or lesser accuracy depending on the other voxel size of implementation being used.

2.4 Parameter Settings

Once the experimental design carried out has been shown and the steps that make up the movement correction algorithm have been explained, the parameters used in the experimental setting are detailed below.

The algorithm applied for the reconstruction was an iterative Maximum Likelihood expectation maximization [15] with ordered subsets (MLEM-OS).

All the images had a size of 256 \(\times \) 256 \(\times \) 120 with pixel size of 1 mm. Five iterations with three subsets were employed for the reconstruction and no corrections were applied.

The motion correction registration EMAF step is a purpose- built subroutine written in C++ and based on the Insight Toolkit (ITK) architecture [16, 17]. Spatio-temporal B-spline is applied in a multilevel registration [18], we used a mean square difference (MSD) as the cost function.

The cost function was minimized using gradient descent with 0.01 mm spatial resolution. The optimization is stopped when either a minimal incremental improvement of 0.05 root mean square error (RMSE) value in the cost function is achieved or upon completing 500 rigid iterations plus 300 affine iterations it has been realized (early stop criterion).

One of the points that is taken into account in the development of the registration part is the problem of tissue compression and the partial volume effect (PVE) since they lead to intensity modulations at the same points (pixels) of different reconstructed frames. This effect is usually more visible in thin structures. In order to avoid image defects caused by intensity variation, we consider the mass preservation property of PET images.

We justify this, taking into account that, when the data are divided into frames, all of them are formed in the same time interval, that is, the entire acquisition time. In other words, in any frame, it is assumed that no radioactivity can be lost or added, apart from some minor changes to the edges of the field of view. For mass preservation, the variational algorithm for mass-preserving image registration (VAMPIRE) algorithm is used [19,20,21,22,23].

Finally, optimization of the multilevel registration algorithm is incorporated. Of first, the algorithm tries to solve discrete optimization at a very coarse level (with a small number of unknowns).

At the time, the approximation transformation is interpolated to the next finer level and used as an initial estimate for minimization of the next objective function.

The procedure is repeated until the transformation is of the desired resolution (user-selected threshold). Consequently, multiple versions of the original images are constructed by iterative downsampling and smoothing of the data.

This procedural scheme has several advantages. First, the likelihood of ending in a local minimum is reduced, as only the main characteristics are taken into account at a certain level. Second, numeric methods, such as that of Gauss–Newton, converge faster for an initial estimate close to the minimum. Third, solving the problems on the coarse grid is computationally more efficient, and, in most cases, only small corrections are required at finer levels.

2.5 Performance Metrics

To validate our corrections two quantitative metrics were used. On the one hand, the Peak Signal to Noise Ratio (PSNR) measured the power of corrupting noise that affects the fidelity of an image representation. Higher values represent a high- quality image.

On the other hand, an Intensity Matching Precision (IMP) test was applied. It measured how deviated the intensities of the moving and fixed images are. A higher IMP value implied a better image matching.

The two metrics have the following expressions:

$$\begin{aligned} PSNR_{X,F} = 20log_{10}\left( {f_{peak} \over RMSE_{X,F}}\right) \end{aligned}$$
(1)
$$\begin{aligned} IMP_{X,F} = \left( 1 - {RMSE_{X,F} \over \sqrt{E[F^2]}} \right) \end{aligned}$$
(2)

where X, F are the moving and fixed images respectively. \(f_{peak}\) is the maximum pixel value in the fixed image, and \(E[F^2] = {1 \over 2}\sum ^n_{i=1}f_i^2\), is the second moment of the fixed image intensity distribution.

3 Results and Discussion

It is proposed a motion correction method of respiratory- gated PET images together with a non-limited patient motion in an open geometry PET system. The experimental results show that spatio-temporal registration has the potential to yield an accurate motion artefacts reduction.

Figure 3 clearly shows that the motion corrected image displays a higher spatial resolution than the uncorrected one. The uncorrected image appears totally blurred, while the lung lesion is conserved in the corrected image. Slight blurring defects were unavoidably found in the image due to lack of lateral information described to the design of our PET system.

The image quality is substantially improved after motion correction using our algorithm, compared to the uncorrected image. The values of PSNR were 30 dB versus 28 dB between static-corrected and static-uncorrected images respectively. The IMP test displayed the same behaviour, with 89% versus 78% between static-corrected and static-uncorrected.

Comparing different experimental designs performed by the proposed motion correction algorithm (EMAF) [14] in closed geometry in comparation with an open planar geometry as described in this manuscript, it can conclude that the results for the Cardio-PET system proposed are relatively good. The results of the EMAF algorithm carried out for closed geometry (ring) give us PSNR values between 37–40 dB, while for CardioPET system, the PSNR result is situated at 30 dB.

If it compared the values of the IMP metric, obviously the values obtained with the application of the motion correction algorithm EMAF are slightly higher for a closed PET geometry (about 90% on average) than for open geometry(89%), due in large part to the loss of angular information mentioned above that the planar PET system shown in the manuscript presents.

Fig. 3.
figure 3

Reconstruction of the torso-body image (torso with a spherical lesion in the lung). Left to right: static, corrected and uncorrected image. The view is exactly in the same body position of Fig. 1

4 Conclusions

The paper presents a novel motion correction algorithm composed of three phases (motion estimation, image reconstruction, and motion correction). The approach is based on image registration and frame acquisition of the data. Firstly, the list-mode data obtained from the scanner were sorted into quasi-static frames through a method of cutting into frames consisting of several steps (Fig. 2).

Secondly, each frame was reconstructed using a built in-house MLEM-OS algorithm. Finally, all of the reconstructed frames were registered in order to obtain a final reconstructed image, taking into account that the registration had to be hyperelastic, nonlinear multilevel and spatio-temporal.

Although the number of frames that EMAF produces is minimal, when large spatial motion ranges are present or the structures under study are very small, it produces a high number of frames, which leads to an increase in the computational load.

Despite this limitation, the EMAF algorithm is a robust enhancement that solves the major problems of the original frame-based algorithm and improves the performance of motion correction in a low-intensity range both in closed and opened PET systems.

5 Acronyms

The following acronyms are used in this manuscript displayed in alphabetical order:

EMAF:

Enhanced Multiple Acquisition Frames

IMP:

Intensity Matching Precision

MLEM-OS:

Maximum Likelihood Expectation Maximization with Ordered Subsets

MSD:

Mean Square Difference

MAF:

Multiple Acquisition Frames

NM:

Nuclear Medicine

PSNR:

Peak Signal to Noise Ratio

PET:

Positron Emission Tomography

RMSE:

Root Mean Square Error

ROI:

Region Of Interest

TOF:

Time of flight

XCAT:

4D Extended Cardiac-Torso Phantom

VAMPIRE:

Variational Algorithm for Mass-Preserving Image REgistration

WB:

Whole-Body