Introduction

For patients who undergo radiation therapy (RT) for lung cancer treatment, computed tomography (CT) scans are acquired regularly to assess changes in tumor size and normal tissue reaction to treatment. Comparison of lung CT images over time is complicated by differences in patient positioning and respiratory phase between scans. Deformable image registration may thus be used to identify spatially corresponding locations between scans despite these differences. Several deformable registration algorithms have demonstrated success in lung CT scan co-registration, with a high degree of registration accuracy (e.g., <1 mm average registration error) between scans [1, 2]. Past experience with the Plastimatch demons deformable registration algorithm [3] showed that average registration errors of less than 0.5 mm were present between co-registered serial CT scans derived from healthy patients [4]. For these studies, however, only physical changes such as differences in respiratory phase or technical image acquisition parameters existed between scans. The registration process is further complicated when pathologic changes between serial scans are also present, potentially leading to lower registration accuracy. For example, Palma et al. [5] observed average registration errors >4 mm when they used B-spline registration to register post-RT lung CT scans containing fibrotic changes with scans acquired before RT.

This study evaluated the registration accuracy of pre- and post-RT thoracic CT scans for lung cancer patients. Specifically, the registration accuracy achieved when normal lung tissue damage exists in post-RT scans was compared with the accuracy achieved in the absence of changes between scans. Two deformable image registration algorithms (Plastimatch v. 1.5.12-beta demons and Fraunhofer MEVIS v. 1.1 Fast CT lung registration [6]) were investigated. The hypothesis of this study was that both algorithms would exhibit reduced registration accuracy when radiologic abnormalities developed due to the increased complexity of the registration task.

Materials and Methods

Patient Database

Twenty-five patients who underwent curative-intent RT for lung cancer at The University of Chicago Medicine between January 2007 and September 2011 were retrospectively identified under IRB approval. All patients received curative levels of radiation dose (≥60 Gy) delivered using 6 MV or 6 and 18 MV photon beams while immobilized using custom alpha cradles. Treatment planning was performed under the supervision of a radiation oncologist, with dose calculations corrected for heterogenous tissue density. For each patient, two diagnostic quality thoracic CT scans acquired (1) less than 6 months before RT and (2) less than 12 weeks following RT were collected. Additionally, each patient’s RT treatment planning CT scan with an associated dose map calculated in Pinnacle® 7.6, 8.0, or 9.0 (Philips Systems, Andover, MA) was collected. Dose maps were aligned with planning scans using tri-linear interpolation in CERR v. 4.0 Beta [7]. For all scans, semi-automated lung segmentation was performed by an experienced thoracic researcher (AC) using Pinnacle® 9.0 model-based segmentation. One patient was eliminated from the study due to poor scan quality degraded by motion and large differences in patient positioning between scans that prevented accurate evaluation of registration accuracy. Patient demographic information and technical imaging parameters for the remaining 24 patients are summarized in Table 1.

Table 1 Patient demographic information, treatment details, and CT scan parameters

CT Scan Assessment of Radiologic Change

An attending radiologist with over 10 years of experience (CS) compared each of the post-RT scans with the corresponding pre-RT scan for the presence of mild, moderate, or severe normal lung tissue abnormalities that had developed following RT (Fig. 1) [8]. Analysis was constrained to 60 32 × 32-pixel (approximately 2 × 2 cm) regions of interest (ROIs) that were automatically placed within the normal lung tissue of each post-RT scan. Specifically, these ROIs were randomly placed within each of four dose regions (<10, 10–30, 31–50, and >50 Gy) of the lungs in the treatment planning scan dose map and were prevented from overlapping with the planned target volume (PTV). This process ensured that ROIs captured normal tissue damage rather than residual tumor. To associate ROIs in the planning scan dose map with the post-RT scan, each patient’s treatment planning CT scan (and thus, the associated dose map) was automatically registered to their post-RT scan using the Plastimatch deformable registration algorithm [9]. Following radiologist review of all ROIs in all scans, patients were identified as having normal tissue abnormalities in the post-RT scan if at least one ROI had been categorized by the radiologist as containing moderate or severe interval abnormalities.

Fig. 1
figure 1

Randomly selected ROIs (yellow box) in each patient’s post-RT scan (left) were compared with the pre-RT scan and categorized as containing no, mild, moderate, or severe interval abnormality

Manual Landmark Matching

A set of 100 landmark points was automatically identified in the lungs of each pre-RT scan. The algorithm used to select these landmarks (iX v.1.2.0.0) [10] ensured that points (1) were located in high pixel-value-gradient regions and (2) were evenly distributed throughout the lungs. Points were also prevented from overlapping with the tumor as defined by the PTV in the planning scan, which had been registered to the pre-RT scan using affine registration. PTVs were excluded from the landmark matching process because large degrees of tumor shrinkage in the post-RT scan limit the ability to accurately identify matched landmark points manually within the PTV. Two researchers (JJ and BW) who received previous training in thoracic anatomy and landmark matching manually identified matched landmark points in each patient’s post-RT scan that corresponded with the identified landmarks in the patient’s pre-RT scan. If a matched landmark point could not be confidently identified in the post-RT scan, it was not included for further analysis. All matched landmark points were reviewed and edited if necessary by an experienced thoracic researcher (AC).

Automated Landmark Matching

To obtain point-to-point anatomic comparison between pre- and post-RT scans, each patient’s post-RT CT scan was registered to the pre-RT scan using both the Plastimatch and the Fraunhofer MEVIS Fast deformable registration algorithms. For both algorithms, registration proceeded as a multi-stage process, with registration at low resolution and using simple transformations occurring before high-resolution stages with additional degrees of freedom in image motion. Parameters used for registration with the Fraunhofer MEVIS algorithm were optimized by the developers for fast, accurate deformable registration using the publically available DIR-Lab dataset [11, 12]. The registration parameters used for demons registration with Plastimatch (Table 2) were selected because they demonstrated high registration accuracy in an independent patient database consisting of healthy thoracic diagnostic CT scans.1 Based on the displacement vector field output by each algorithm, the identified landmarks in each pre-RT scan were automatically mapped to the corresponding location in the post-RT scan. For both algorithms, the Euclidean distance between manually and automatically identified matched landmark points in the post-RT scan (d E) was calculated (Fig. 2). Regression modeling was used to model d E as a function of the registration algorithm (Algorithm j where jϵ{1,2}) and whether radiologic changes were present (Presence of Change k where kϵ{1,2}), while accounting for random patient-specific differences in d E (Patient i where iϵ{1,2,…,24}), according to the following formula:

$$ {\left\{{d}_{\mathrm{E}}\right\}}_{ijkl}={\mathrm{Patient}}_i+{\mathrm{Algorithm}}_j+\mathrm{Presence}\ \mathrm{of}\ {\mathrm{Change}}_k+{\epsilon}_{ijkl} $$

where ϵ ijkl represents the residual error in the model fit. Analysis of variance (ANOVA) was then performed to determine whether the registration algorithm and/or the presence of radiologic change significantly (p < 0.05) impacted d E.

Table 2 Parameters for multi-stage demons deformable registration using Plastimatch
Fig. 2
figure 2

Method used to calculate the Euclidean distance between manually and automatically placed matched landmark points (d E) in a post-RT scan

Results

Between 66 and 93 (median 85), matched landmark points were manually identified for each patient, for a total of 1977 points. The point location was edited by the experienced thoracic researcher for 2–29 (median 15) points per patient. The radiologist identified eight patients as having radiologic evidence of normal lung tissue damage in their post-RT scan, with between 1 and 31 (median 3) ROIs per patient identified as containing moderate or severe abnormalities. For both algorithms, average d E across patients was smaller in the absence of abnormalities (Table 3). Seven of the eight patients with abnormalities had smaller average d E using the Fraunhofer MEVIS algorithm than using the Plastimatch algorithm, compared with 11 of the 16 patients without abnormalities (Fig. 3). Regression modeling showed that d E increased significantly (p < 0.05) due to the presence of abnormalities and the use of the Plastimatch algorithm.

Table 3 Average Euclidean distance (dE) between manually and automatically identified matched landmark points
Fig. 3
figure 3

Average Euclidean distance (d E) between manually and automatically identified matched landmark points for each patient. Shown in brackets: number of ROIs categorized as containing moderate or severe changes

Discussion

This study demonstrated that radiologic changes between lung CT scans have a significant effect on the registration accuracy of deformable registration. Both the Plastimatch and Fraunhofer MEVIS algorithms demonstrated inferior CT scan alignment accuracy in the presence of interval normal lung tissue abnormalities. While the presence of abnormalities increased the registration error by 2.2 mm, on average, using the Plastimatch algorithm, average registration error was increased by 1.2 mm using the Fraunhofer MEVIS algorithm (Table 3). The superior registration accuracy observed using the Fraunhofer MEVIS algorithm may be due to the fact that this algorithm is highly constrained to prevent overfitting through unrealistic deformations. For example, one of the parameters in the objective function used to determine the deformation penalizes for highly curved displacement vectors, which represent unrealistic deformations. These constraints are especially important when structural changes exist between scans, complicating the registration process. The parameters used for registration with the Plastimatch and Fraunhofer MEVIS algorithms were selected due to the high registration accuracy achieved in independent databases; it is possible, however, that an alternative choice of parameters (or even a unique set of parameters for each patient) may be more appropriate for use with the current database, resulting in further improvements in the accuracy of image registration.

Some potential weaknesses may impact the findings of this study. First, manual identification of matched landmark points in the post-RT scans could have resulted in incorrect placement of some points. Furthermore, because the landmark matching task was divided between two researchers, differences in the accuracy of point placement may exist between the researchers. These errors were reduced in several ways. First, prior to beginning the study, the researchers received training on a common test case; no significant difference in the placement of matched landmark points was observed between the two researchers. Second, during landmark matching, all points were visualized using all three planar views, facilitating easy comparison between regions in pre- and post-RT scans (Fig. 4). Third, all manually placed matched landmark points were reviewed by a third researcher (AC), thus preserving consistency among scans annotated by the two researchers.

Fig. 4
figure 4

Display used in iX v.1.2.0.0 for manual placement of matched landmark points in each post-RT scan

Although the majority of patients with larger registration errors using Plastimatch had abnormalities in their post-RT scans, there were some patients who exhibited large registration errors despite the absence of radiologist-identified abnormalities within the 60 ROIs. If the selected ROIs did not overlap with regions containing damage, our methods would fail to identify some patients with abnormalities. Thirty of the 60 ROIs in each scan were placed in high-dose (≥30 Gy) regions where normal tissue damage was most likely to be observed [13, 14]. It is thus unlikely that normal tissue damage in these regions went uncategorized due to the small volume (15 % of total lung volume on average) of these dose regions. Furthermore, this method of classifying radiation damage allowed the radiologist to examine changes on a small regional basis, facilitating careful classification of both subtle and obvious radiation-induced damage. Large registration errors in the absence of abnormalities are also unlikely due to differences in patient positioning or scan appearance, as all CT scans were acquired on Philips CT scanners with 1-mm slice thickness and sub-millimeter pixel spacing (Table 1) and were reconstructed using identical lung convolution and lung smoothing kernels. During scan acquisition, all patients were supine with arms raised above the head, and patients were instructed to inspire and hold their breath. Instead, registration errors when abnormalities were absent may be due to notable patient improvement between pre- and post-RT scans. For example, patients #5 and #6 experienced a substantial decrease in tumor size following RT and patient #11 experienced lung volume expansion, which may explain the low registration accuracy achieved using Plastimatch (Fig. 3). Future studies should aim to characterize the degree of improvement following RT (e.g., decrease in lesion size) and the subsequent effects on registration accuracy.

While the Fraunhofer MEVIS algorithm was well suited for CT scan co-registration in the presence of normal lung tissue changes, future studies should investigate whether successful registration can be performed in the presence of other changes that can occur during lung cancer treatment. Additional treatments that may alter the visual appearance of the lungs on CT scans include chemotherapy and surgery. While the focus of the current study was to evaluate the accuracy with which normal lung tissue could be registered between CT scans acquired before and after RT, future studies are needed to determine the accuracy of deformable registration in tumorous portions of the lung where large degrees of tumor shrinkage are often present. Accurate registration of tumor tissue could allow for planned RT dose to the target volume to be mapped from treatment planning to diagnostic scans. The current study focused exclusively on normal lung tissue deformable registration because of its potential future utility to evaluate radiation-induced normal lung tissue damage. Specifically, regions of pre- and post-RT diagnostic CT scans that have been identified through registration could be compared quantitatively to evaluate changes following radiation delivery. Patients can also develop chronic fibrotic changes that may differ from the acute normal tissue damage investigated here. Additional studies would be needed to determine whether deformable registration could be used throughout the entire course of lung cancer treatment to assess regional changes between serial scans.

Conclusions

The accuracy of two registration algorithms was evaluated in the presence of normal lung tissue changes that had developed between pre- and post-RT CT scans. Both algorithms experienced decreased registration accuracy when changes were present, with the Fraunhofer MEVIS Fast algorithm demonstrating significantly improved registration accuracy compared with the Plastimatch algorithm. Deformable registration could be used in future studies to identify corresponding anatomic locations between serial CT scans, thus facilitating evaluation of image-based change in normal lung tissue following RT.