Metal implants can cause substantial artifacts on CT imaging due to the attenuation of X-rays as they pass through metallic substances [1], which is detrimental for diagnostic image quality. Numerous methods have been applied to reduce the metal artifacts with none producing results with artifact-free image. A recent innovation using iterative reconstruction methods has shown to reduce the severity of metallic artifacts. Iterative reconstruction and metal artifact reduction (MAR) can be performed with either single energy or spectral (dual energy) techniques. Previous research has shown spectral MAR to be partially effective in the case of hip prostheses [2], gold fiducials in the abdomen [3], and iodine-125 seeds in the liver [4], whereas single energy MAR has proven effective for vascular occlusion coils in the abdomen [1, 6], dental implants [5], and hip prostheses [6].

To date, there has only been one comparison study [7] about the relative efficacies of these two artifact reduction techniques for prosthetic hip implants. Our goal was to evaluate two iterative reconstruction algorithms that use single energy MAR vs. spectral MAR with a wider variety of metallic vascular implants for the ability to reduce metal artifacts and improve overall image quality in a controlled environment of a phantom model.

Materials and methods

A CT phantom was constructed in-house with three different densities (water, saline solution, and ultrasound gel) and placed atop a commercial CT head calibration phantom (Mindways Software, Austin, TX, USA). The phantom container was approximately 17 cm by 17 cm, and filled with water to a depth of 4 cm (Figure 1). The phantom was scanned with the following parameters: 120 kVp, CTDIvol = 21 mGy, SFOV = 40 cm, DFOV = 19.3 cm, with single energy (0.5 s tube rotation, 380 mAs tube current, 3 mm slice thickness, axial scanning mode) and spectral imaging (0.6 s tube rotation, 640 mAs tube current, 2.5 mm slice thickness, helical scanning mode) to serve as a base for comparison for underlying densities. Parameters were set first on the dual energy scanner, and then settings were adjusted on the single energy scanner to match CTDIvol on the dual energy scanner. A metal implant was then inserted into the phantom and the phantom was scanned with single energy and spectral CT scanners with and without MAR algorithms applied. The process was repeated for each type of metal implant: Amplatzer vascular plug 4 (St. Jude Medical, St. Paul, Minnesota, USA), 15 mm embolization coil (Penumbra, Alameda, California, USA), 8 × 40 mm expandable stent (Wallstent Endoprosthesis, Boston Scientific, Boston, Massachusetts, USA), 6 × 40 mm biliary stent (E-Luminexx, Bard Peripheral Vascular, Tempe, Arizona, USA), gold fiducial seed (Censis Technologies, Franklin, Tennessee, USA).

Fig. 1
figure 1

In-house constructed phantom, consistenting of 17 cm × 17 cm plastic container with water, a saline solution filled bag, and ultrasound gel. This is placed on top of a commercial CT head calibration phantom (Mindways, Austin, TX USA)

Single energy CT with single energy metal artifact reduction (SEMAR) was performed with a 320 multidetector CT scanner (Aquilion One, Toshiba Medical Systems, Otawara, Japan) and images with and without SEMAR were reconstructed on a Vitrea Workstation (Toshiba Medical Systems, Otawara Japan). Spectral CT with metal artifact reduction software (MARS) was performed with 64 multidetector CT scanner (Discovery CT750 HD, General Electric Healthcare, Milwaukee, Wisconsin, USA). Reconstructions were made with 30% statistical iterative reconstruction. A dedicated workstation (GE Advantage Workstation 4.5, GE Healthcare) was used to reconstruct images at 77 keV without MARS and 77 keV with MARS. The 77 keV level reconstruction level was chosen to be equivalent to 120 kVp and is standard for clinical abdominal CT angiography examinations at our institution, as higher keV levels would reduce the level of contrast enhancement. Scanners are calibrated daily to ensure Hounsfield Unit accuracy of 0 ± 3 HU for the spectral energy scanner and 0 ± 5 HU for the single energy scanner according to vendor specifications. Images were evaluated on Centricity Picture Archiving and Communication System (GE Healthcare) with readers free to adjust window levels as they saw fit.

Both SEMAR and MARS use iterative reconstruction techniques to reduce metal artifacts. The artifact reduction process starts with segmenting out metallic artifacts in the forward projection data. Data from the areas surrounding the artifacts are used to interpolate corrected projection data which are used to replace the artifact data. The corrected data are further segmented to remove metallic traces then forward projected and reconstituted with the original data [1]. MARS also uses dual energy techniques by leveraging the ability of the Discovery CT750 scanner to switch between 80 and 140 keV every 0.25 ms during scan acquisition. Materials that show similar attenuations at one voltage may show different attenuations at another voltage, thus dual energy CT may be better able to distinguish features and materials including metals and metal-induced artifacts [7].

Objective evaluation

Three identical regions of interest (ROI) of minimum size of 1000 pixels were obtained: (1) from image with metal implant with worst artifact without MAR algorithm; (2) corresponding ROI at the same level and in-slice location with MAR algorithm applied; and (3) baseline images without metal implant and therefore without any artifacts and without application of MAR algorithms. One triplet set was obtained with single energy MAR and the other one with spectral MAR.

ROIs were obtained adjacent to different types of metal implants causing artifacts: Amplatzer plug (n = 1), embolization coil (n = 2), gold fiducials (n = 7), or stents (n = 10). This produced 20 triplet sets of ROIs, each for single energy and spectral MAR algorithms.

Mean standard deviations of HU across ROI were compared with the Mann–Whitney test with significant cut-off value set at p < 0.05. Histograms of select ROI were used to aid visual analysis of the data.

Subjective evaluation

Images featuring metal implants were evaluated with and without the application of single energy and spectral MAR. The images were presented randomly and blindly to the method that they were obtained (with or without metal artifact reduction algorithm, which type) to avoid bias. The images were independently evaluated by a fellowship-trained interventional and abdominal radiologist with 14 years of experience and a radiology research fellow with 1 year of imaging experience who had received specific training from the attending radiologist prior to evaluation. Each image was given an overall grade as well as sub-categorical grades based on artifact location (near or far field) and artifact type (bloom, shadow, and streak). In addition, images were evaluated and graded for additional artifacts introduced by the MAR algorithms.

All subjective grades were assigned a score of 0 (unimpaired readability and no artifact), 1 (artifact present but no significant impairment), 2 (mildly impaired readability), 3 (moderately impaired image readability), and 4 (severely impaired readability).

Interobserver agreement was assessed with kappa statistics. Statistical comparisons of grades within each paired set were made with t test (significance level p < 0.05).

Results

Objective evaluation

Our objective metric is focused on noise reduction within the ROI. Objective evaluation of spectral MAR and single energy MAR techniques showed significant differences in the standard deviation of density in ROI adjacent to the metal implant between images at the baseline, with and without artifact reduction. These differences are depicted on histograms of individual ROI which show the distribution of pixel count vs. pixel value (HU) for baseline, with reduction and without reduction (Figures 2, 3, 4). Baseline pixel distributions are tightly centered around the mean pixel value, resulting in a single narrow, peaked histogram shape. Pixel distributions without reduction are more scattered, resulting in wider, lower peaks. After artifact reduction, the shape of the distribution is more similar to baseline, with the exception of instances in which the reduction algorithm itself introduced new artifacts (Figure 3).

Fig. 2
figure 2

A Spectral CT scan of fiducial seed embedded into ultrasound gel produces severe near-field shadow and bloom artifacts. Moderate to mild streak artifacts extend into the far fields. ROI used for objective measurement is marked by white rectangle. B Post-processing with spectral MAR effectively eliminates near-field bloom and reduces near-field shadow to a ring around the fiducial seed. Far-field streak artifacts are more prominent due to the introduction of new artifacts. C Histogram showing pixel count (Y-axis) vs. pixel HU (X-axis). Distribution curve within ROI without spectral MAR (green) has a lower, wider peak whereas distribution curve with spectral MAR (red) much more closely resembles baseline distribution curve (blue) in height, shape, and location

Fig. 3
figure 3

A Spectral CT scan of stent placed in water with ROI (white ellipse) drawn in near-field saline solution. Bloom and shadow artifacts are most severe near either end of the stent, due to the placement of radiopaque markers. Mild streak artifacts are present in the ROI. B Post-processing with spectral MAR introduces a severe amount of streak artifacts originating from the ends of the stent and extending to the far field. New bloom and shadow artifact are also visible near the ends of the stent. C Histogram that shows distribution curve without spectral MAR (green) is very similar to baseline distribution curve (blue). Application of spectral MAR causes distribution curve (red) to be much more scattered and widely distributed without a singular peak due to the introduction of artifacts

Fig. 4
figure 4

A Single energy CT scan of stent placed in water with ROI (white ellipse) drawn in near-field saline solution. Bloom and shadow artifacts are most severe near either end of the stent, and dark streak artifacts extend vertically into the ROI and far-field areas. B Post-processing with single energy MAR effectively reduces near-field shadow and bloom artifacts and far-field streak artifacts. No additional artifacts are introduced in the image by the application of single energy MAR. C Histogram distribution curve for the ROI with single energy MAR (red) is slightly more peaked than without single energy MAR (green), therefore more closely resembling baseline (blue)

Application of spectral MAR reduced the standard deviation of the density measurement from 63.1 to 25.9 HU (Table 1). Single energy MAR brought standard deviations of density measurements from 52.1 to 12.3 HU for all metal implants collectively (Table 1).

Table 1 For objective evaluation, standard deviation measurements (HU) were taken from identical regions of interest of at least 1000 pixels in size adjacent to the metal implant, without and with the application of MAR algorithms, and with comparison to the same area in the phantom scanned prior to incorporation of metal implants

Mann–Whitney–Wilcoxon test showed no significant differences (p = 0.73) in the standard deviations of uncorrected spectral energy images with implant (median 63.1 HU) vs. uncorrected single energy images with implant (median 52.1 HU), therefore the amount of noise is similar in uncorrected images with the two techniques. Standard deviation in the post-correction single energy MAR images (median 12.3 HU), however, was significantly lower (p = 0.0011) than that of the post-correction spectral MAR images (median 25.9 HU), therefore single energy MAR appears to be better for artifact reduction than spectral MAR.

Subjective evaluation

There was no significant subjective improvement in metal artifacts produced by spectral MAR in overall image quality, near-field bloom, near-field streak, far-field bloom, far-field shadow, and far-field streak. However, there was significant improvement in near-field shadow artifact with spectral MAR (p = 0.03) (Table 1).

Single energy MAR showed statistically significant improvements in overall image quality, near-field streak, near-field shadow, and no change in near-field bloom, far-field streak, far-field shadow, and far-field bloom (Table 1).

Spectral MAR also consistently introduced new artifacts (Table 2; Figure 3). In 10 out of 11 images (91%) with spectral MAR, streak artifacts in the near and far fields were introduced, each one with severity of grade 2 or above and one with severity of grade 4. Overall across all 11 images performed with spectral MAR, the average severity score of additional artifacts was 2.36 ± 1.03 (Table 2). In contrast (Figure 4), single energy MAR introduced additional artifact on 4 out of 12 images (33%), none with severity greater than grade 2, and with an overall grade for additional artifact across 12 images of 0.42 ± 0.67 (Table 2).

Table 2 Types and grades of additional artifacts introduced by MAR algorithms

Interobserver agreement for subjective evaluation of the presence of artifacts had kappa value of 0.75. In the cases of disagreement between two readers, a consensus was reached following further discussion.

Discussion

Our study shows that both single energy and spectral MAR significantly reduce the standard deviation of HU in ROI with metal-induced artifacts. Standard deviations with single energy MAR were significantly lower than standard deviations with spectral MAR, suggesting that artifacts are more effectively reduced in severity by single energy MAR. Single energy MAR was able to produce significantly improved subjective grades in the overall image quality compared to the unreduced image, whereas spectral MAR showed no significant improvement in the overall image quality. Spectral MAR also consistently introduced additional artifacts of mild to moderate severity.

We have confirmed that MAR algorithms with iterative reconstructions are effective, as seen by decrease in the standard deviation measurements adjacent to the metal artifacts after the application of MAR. This is in accordance with prior studies which show that single energy MAR [1, 5] and spectral MAR [3, 4] reduce the standard deviations of HU when applied to ROI afflicted by artifact. Single energy MAR produced significantly lower standard deviations than spectral MAR. Artifact ROI on images acquired on the single energy CT scanner vs. the spectral CT scanner by different manufacturers with their respective MAR algorithms off were not significantly different, making it less likely that hardware differences between the two scanners are a factor in the performance gap.

In the subjective evaluation there is some discordance between our findings and previously published research. In our evaluation, spectral MAR failed to produce statistically significant improvement in overall readability, which was not the case in other studies [24]. Additional artifacts caused by spectral MAR were also more common here than in previous studies [3]. This may be due to the difference in implants being evaluated, as these previous studies examined prosthetic hips [2], gold fiducial markers [3], and radio-iodide seeds [4] rather than stents and coils as we did in our study. This behavior of spectral MAR was not entirely unknown, as previous studies [3, 8] showed what was called the “dark star” far-field artifact when applying the algorithm to gold fiducials. As we have shown here, however, this phenomenon is not limited to gold fiducials and has appeared with nearly every type of metal implants we studied. This phenomenon appeared in 10 out of 11 total images of our study (Table 2). The additional artifacts were most severe with one stent, although a different type of stent in the same spatial orientation did not display additional artifacts at all. This could be due to the different metal densities of balloon expandable vs. self-expanding stents. Single energy MAR introduced relatively mild and less frequent additional artifacts (Table 2) when applied to the same implants.

An earlier study indicated that these additional artifacts could be endemic to artifact reduction algorithms which rely on replacing artifact-afflicted signal with interpolated data from surrounding areas [9, 10]. There are alternative techniques centered on artifact signal deletion rather than replacement [1113], but these methods have the potential drawback of blurring and loss of resolution around the metal implant due to loss of data.

The limitations of our study stem from small sample size and a limited variety of spatial orientations, including the fact that we only evaluated images in the axial plane. Whereas prior research was done in vivo [1, 3, 5, 9, 11], we elected to use a phantom. This allowed us to select the type and orientation of metal implants thus creating an identical benchmark for comparison between single energy and spectral MAR, but this is not a perfect analog for clinical application in a live patient. Although we could quantify and qualify the severity and nature of artifacts before and after the application of MAR algorithms, we could not evaluate how these artifacts and MAR techniques would have affected diagnostic accuracy in live patients with actual pathologies. Additionally, our phantom is not anthropomorphic and is much more homogenous than the inside of a human body which may account for some of the discrepancies in results. For example, it was hypothesized that the metal implant’s proximity to bone may cause some of the additional artifact [3] but this was not something we could investigate with our current phantom. We did not evaluate the differences in artifact severity between different implants, either different types of stents, or stents vs. coils vs. fiducial seeds, as we were primarily focused on the differences between the two MAR algorithms.

In conclusion, iterative reconstruction algorithms are able to reduce the severity of metallic artifacts. Given our phantom and selection of metal implants, we found the performance of single energy MAR to be superior to that of spectral MAR. Although the spectral MAR algorithm still shows improvement, single energy MAR performed better than spectral MAR in our phantom study with relatively small sample size. Further investigation is warranted to determine if single energy MAR’s apparent advantage holds true under clinical conditions with patients.