Introduction

Hip fracture is an important and common public health issue that increases morbidity and mortality in older adults [1]. Surgery using orthopedic hardware such as arthroplasty and internal fixation is recommended to improve outcomes in patients with hip fractures [2]. Although radiographs are the first-line modality for evaluation of the orthopedic hardware at the hips, computed tomography (CT) provides excellent evaluation of both bone and soft tissue pathology [3]. However, the metal orthopedic hardware is problematic in the follow-up CT scans because it causes beam hardening that decreases the imaging quality and loss of information about underlying structures [4, 5]. To improve diagnostic accuracy, the reduction of metal artifacts has become an important issue in modern CT systems [5, 6].

Traditionally, adjusting acquisition parameters such as increased tube current and voltage are used but the effect of reducing metal artifacts is limited [4, 6]. After the invention of dual-energy CT techniques, virtual monochromatic images, especially at high energy levels, can provide better imaging quality with less beam hardening [5, 7, 8]. However, dual-energy CT is expensive and there is less tissue contrast in the virtual monochromatic images at high energy levels [9, 10]. Recently, because of the advancement of computers in complex calculations, several projection-based metal artifact reduction algorithms that can be applied on the single-energy CT are introduced by CT vendors are able to provide better imaging quality than the traditional methods [5, 6, 11]. The single-energy metal artifact reduction (SEMAR) algorithm is one such algorithm and is reported to dramatically reduce metal artifacts and improve imaging quality have been reported [12,13,14,15]. Nevertheless, the reliability of the measurement of CT densities at the underlying structures, which is important in the evaluation of soft tissue pathology, has not been well defined. Furthermore, previous studies have focused on the reduction effect for hip arthroplasty with large metal compartments, which caused unresolvable artifacts using the traditional methods [13,14,15]. Thorough investigation of the effects of the SEMAR algorithm in the reduction of metal artifacts from different metal hardware is lacking.

In this study, we compared the image quality of CT with iterative reconstruction (IR) alone and in association with a SEMAR algorithm in postoperative patients with different orthopedic hardware at the hips. The study purpose was to evaluate the performance of the SEMAR algorithm in patients with different orthopedic hardware at the hips and the reliability of measured densities at specific periarticular soft tissue structures.

Materials and methods

Patient population

Patients with hip instrumentation who had undergone a routine CT, including the pelvic region, at a single institution between February 2015 and October 2019 (n = 153) were enrolled in this retrospective study. The inclusion criteria were adults with complete routine CT using helical scans with the adaptive iterative dose reduction (AIDR) 3D algorithm for the entire study region and volume scans with the SEMAR algorithm for the hip instrumentation region. The exclusion criteria were significant motion artifacts in any one of the scans. Based on the different types of orthopedic hardware, patients were then divided into 5 groups: bilateral prostheses, unilateral prosthesis, bilateral internal osteosynthesis fixation (IF), unilateral IF, and prosthesis + IF (prosthesis at one hip and IF at another hip).

Ethical considerations

This retrospective study was approved by the local institutional review board of our hospital. Because the images were deidentified and patients remained anonymous, the requirement to obtain informed consent was waived.

CT image acquisition

All included patients were scanned using a 640-MDCT scanner (Aquilion One, Version V6.0, Canon Medical Systems, Otawara, Tochigi, Japan). The scan parameters of the helical scan with the AIDR 3D algorithm (Canon Medical Systems, Otawara, Tochigi, Japan) were helical acquisition; detector collimation, 80 rows × 0.5 mm; tube rotation time, 0.5 s; tube voltage, 120 kV; tube current, 300–550 mA (automatic exposure control); and helical pitch factor, 0.813. The scan parameters of the volume scan with the SEMAR algorithm (Canon Medical Systems, Otawara, Tochigi, Japan) were sequential acquisition; detector collimation, 280–320 rows × 0.5 mm; tube rotation time, 0.5 s; tube voltage, 120 kV; and tube current, 300–550 mA (automatic exposure control). The volume CT dose index (CTDIvol) of each scan was recorded.

Image reconstruction

The CT images were reconstructed as 5.0 mm sections at 0.0 mm intervals for the helical and volume scans. The helical scans were reconstructed by using an AIDR 3D algorithm and the volume scans by using an AIDR 3D with the SEMAR algorithm. A standard soft tissue kernel (FC08) was used for all reconstructions.

Preparation for image evaluation

Before being reviewed, all images were randomly assigned a new identification number, and the original identification numbers, patients’ names, and study descriptions were erased by a radiographer with 10 years of experience. To evaluate the influence of the metallic devices, axial images with the most obvious metallic artifacts (images with artifacts) were selected for qualitative and quantitative evaluations by one radiographer with 12 years of experience. Axial images 1 cm above the top of orthopedic hardware were selected as reference images by the same radiologist.

Quantitative image analysis

The following regions of interest (ROIs) were drawn 3 times in AIDR 3D and SEMAR images on the images with artifacts and the reference images, including the common femoral vein and subcutaneous fat at the lateral aspect of the hip ipsilateral to the hardware and the urinary bladder. In patients with hardware at the bilateral hip, the ROIs were drawn at the side with more artifacts or at the right hip region if the artifacts were about the same. The average mean attenuations (HU) in the ROI were analyzed to evaluate reliability. The average standard deviation (SD) of the attenuation in the ROI was set as the image noise.

Qualitative image analysis

The images with artifacts were evaluated independently by two radiologists with 21 and 13 years of experience, respectively, who were blinded to the reconstruction methods. To prevent recall bias, the radiologists reviewed axial images of the different sequences at separate times at least 1 month apart. The images were reviewed using the standard picture archiving and communication system (EBM-viewer, EBM, Taiwan) on the standard viewer stations (dual 5 million-pixel monitors) in the study hospital. The windows width and level were fixed at 350 and 50 Hounsfield units (HU), respectively.

To evaluate overall quality, a 5-point scale was used to grade images by the severity of metal artifacts, as follows: 1, severe artifacts causing poor recognition of pelvic organs and muscles; 2, moderate artifacts causing partially obscured pelvic organ (urinary bladder or prostate) (< 50%) and muscles (< 50%); 3, some artifacts causing faint recognition of pelvic organ and poor recognition of periarticular soft tissue; 4, mild artifacts with good recognition of pelvic organs and periarticular soft tissue; and 5, minimal to no artifacts. To evaluate visibility of the pelvic organs, muscles, and common femoral vessels, a 5-point scale was used to grade images, as follows: 1, severely obscured (> 50%); 2, partially obscured (20–50%); 3, slightly obscured pelvic organs (< 20%); 4, visible with some heterogeneous densities; and 5, nearly normal imaging. Under consideration of good recognition of both pelvic organs and periarticular soft tissue, scales 4 and 5 were considered as diagnostic images.

To evaluate the image quality of SEMAR in the reference images, the readers compared the AIDR 3D images and SEMAR images side by side and graded them using a 5-point scale, as follows: − 2, significantly better quality on AIDR 3D images than SEMAR images; − 1, mildly better quality on AIDR 3D images; 0, similar quality between AIRDR 3D and SEMAR images; 1, mildly poorer quality on AIDR 3D images; and 2, significantly poorer quality on AIDR 3D images.

Statistical analysis

Statistical analysis was performed by using the Statistical Package for Social Science (SPSS) version 20 (IBM SPSS Statistics, Chicago, IL, USA). Distributions of qualitative and quantitative evaluation for image quality of the different hardware types were analyzed. Differences between the images with artifacts and the reference images of the AIDR 3D and SEMAR algorithm were evaluated using the paired t-test and P < 0.0125 (0.05/4) was considered statistically significant. Side-by-side comparisons between AIDR 3D images and SEMAR images were evaluated using a one-sample t-test. P < 0.05 was considered statistically significant in the analysis of all images. Differences in the qualitative evaluation between the different hardware types were evaluated using one-way ANOVA with Tukey’s post-tests and P < 0.01 (0.05/5) was considered statistically significant. The inter-rater correlation coefficients were examined using the kappa statistic. P < 0.05 was considered statistically significant. Poor, fair, moderate, good, and excellent agreement corresponded to the following ranges of kappa values: 0.00–0.20, 0.21–0.40, 0.41–0.60, 0.61–0.80, and 0.81–1.00, respectively.

Results

A total of 153 patients (93 females; mean age 75.5, range 32–97 years) with hardware at the hip joints were enrolled in this study. Table 1 shows patients’ demographic data and distribution of the different types of hardware. Mean CTDIvol value of the SEMAR images was significantly lower than that of the AIDR 3D images in all patients (P = 0.013) and in the unilateral IF group (P = 0.002).

Table 1 Patient and implant characteristics

Quantitative image analysis

In the images with artifacts, the urinary bladder was measured in 113 patients. The mean attenuation achieved with AIDR 3D images (− 10.12 ± 43.93 HU) was significantly lower than that achieved with SEMAR images (5.92 ± 12.86 HU) (P < 0.001). Data of the veins and fat were measured in images with artifacts and reference images in all patients and only 39 patients had measurable data at the urinary bladder in both artifacts and reference images. The mean attenuations of the urinary bladder, veins, and fat in the AIDR 3D images with artifacts were significantly lower than those in the AIDR 3D reference images (P = 0.008, P < 0.001, P < 0.001), in SEMAR images with artifacts (P = 0.006, P < 0.001, P < 0.001), and in SEMAR reference images (P = 0.007, P < 0.001, P < 0.001) (Table 2). No significant differences were seen in the mean attenuations of the urinary bladder, veins, and fat between the SEMAR images with artifacts and the SEMAR reference images (all P > 0.05). Figure 1 shows the distribution of the attenuations of the urinary bladder, veins, and fat in patients with different types of hardware.

Table 2 Mean attenuation and noise of the images with artifacts and the reference images of AIDR 3D and SEMAR reconstruction
Fig. 1
figure 1

Mean and standard deviation of attenuations at a the urinary bladder, b common femoral vein, and c subcutaneous fat were recorded by different types of hardware

In images with artifacts, the mean noise of the urinary bladder in the AIDR 3D images (− 46.93 ± 57.41 HU) was significantly lower than that in the SEMAR images (14.41 ± 4.45 HU) (P < 0.001). Comparison of images with artifacts and reference images revealed that the mean noise of the urinary bladder in AIDR 3D images with artifacts was significantly higher than those in AIDR 3D reference images (P < 0.001), in SEMAR images with artifacts (P < 0.001), and in SEMAR reference images (P < 0.001) (Table 2). The mean noises of the urinary bladder in SEMAR images with artifacts and SEMAR reference images were significantly higher than that in AIDR 3D reference images (P < 0.001; P < 0.001). Figure 2 shows the distribution of the noises of the urinary bladder, veins, and fat in the patients with different types of hardware.

Fig. 2
figure 2

Mean and standard deviation of noises at a the urinary bladder, b common femoral vein, and c subcutaneous fat were recorded by different types of hardware

Qualitative image analysis

The detailed results of qualitative image analyses in the images with artifacts are shown in Table 3. All AIDR 3D images were non-diagnostic (overall quality ≤ 3) and about one-sixth of the SEMAR images were non-diagnostic (15.7–17.6%), mainly in patients with prostheses, including bilateral prostheses, unilateral prostheses, and IF + prostheses [reader 1: 91.7% (22/24); reader 2: 92.6% (25/27)] (Fig. 3). Scores of the overall quality and the visibilities of pelvic organ muscles and vessels in the SEMAR images were significantly higher than those in the AIDR 3D images in all patients and subgroups with different types of hardware (all P < 0.001) (Fig. 4). Non-diagnostic images (overall quality ≤ 3) in SEMAR images were noted in the subgroups of unilateral IF [reader 1: 4.8% (2/42); reader 2: 4.8% (2/42)], bilateral prostheses [reader 1: 44.0% (11/25); reader 2: 48.0% (12/25)], unilateral prostheses [reader 1: 13.4% (9/67); reader 2: 16.4% (11/67)], and prosthesis + IF [reader 1: 18.2% (2/11); reader 2: 18.2% (2/11)] (Fig. 5). In analyzing differences between different types of hardware (Fig. 6), scores of the overall quality, visibility of pelvic organs, and visibility of vessels were significantly higher in the unilateral IF group than in those of the bilateral prostheses, unilateral prosthesis, and prosthesis + IF groups (all P < 0.01 for two readers). The mean score of the visibility of muscles of the unilateral IF group was also significantly higher than those of the bilateral prostheses and unilateral prosthesis groups (all P < 0.01 for two readers). However, no significant differences were found in scores between the other different types of hardware.

Table 3 Subjective analysis of image quality of the AIDR 3D and SEMAR reconstructions
Fig. 3
figure 3

A 54-year-old man with bilateral total hip arthroplasties. a Axial CT image reconstructed using AIDR 3D algorithm showed severe metal artifacts that masked seminal vesicles, posterior aspect of urinary bladder, anterior aspect of rectum, and adjacent muscles. b Axial CT image reconstructed using SEMAR algorithm improved the visibility of seminal vesicles, urinary bladder, rectum, and adjacent muscles and only mild streak artifacts were noted

Fig. 4
figure 4

A 92-year-old female with internal fixation at bilateral proximal femur. a Axial CT image reconstructed using AIDR 3D algorithm showed severe metal artifacts that masked urinary bladder and adjacent muscles and the overall quality scales were 1 by reader 1 and 2 by reader 2. b Axial CT image reconstructed using SEMAR algorithm improved the visibility of urinary bladder, rectum, and adjacent muscles and the overall quality of the scales was 5 by both readers

Fig. 5
figure 5

A 94-year-old female with left total hip arthroplasty. a Axial CT image reconstructed using AIDR 3D algorithm showed severe metal artifacts that masked posterior aspect of urinary bladder, anterior aspect of rectum, and adjacent muscles and the overall quality of scales was 2 by both readers. b Axial CT image reconstructed using SEMAR algorithm improved the visibility of seminal vesicles, urinary bladder, rectum, and adjacent muscles and only mild streak artifacts were noted

Fig. 6
figure 6

Image quality scores of the images with artifacts with AIDR 3D and SEMAR algorithm: a overall quality; b visibility of pelvic organ; c visibility of muscles; d visibility of vessels

The inter-rater correlation coefficients, indicators of inter-rater reliability, were 0.86 (95% CI: 0.79 to 0.93, P < 0.001) and 0.86 (95% CI: 0.79 to 0.93, P < 0.001) for the overall quality scores of the AIDR 3D images and SEMAR images, respectively; 0.80 (95% CI: 0.72 to 0.87, P < 0.001) and 0.92 (95% CI: 0.86 to 0.98, P < 0.001) for the visibility of pelvic organs in the AIDR 3D images and SEMAR images, respectively; 0.83 (95% CI: 0.75 to 0.91, P < 0.001) and 0.81 (95% CI: 0.73 to 0.89, P < 0.001) for the visibility of muscles in the AIDR 3D images and SEMAR images, respectively; and 0.81 (95% CI: 0.74 to 0.88, P < 0.001) and 0.85 (95% CI: 0.76 to 0.94, P < 0.001) for the visibility of vessels in the AIDR 3D images and SEMAR images, respectively.

In side-by-side comparisons of the reference images between the AIDR 3D and SEMAR algorithms, most SEMAR images were the same as the AIDR 3D images [reader 1: 95.4% (146/153), reader 2: 92.2% (141/153); K: 0.61 (95% CI: 0.35 to 0.87), P < 0.001]. Only a few SEMAR images presented slightly less quality [reader 1: 3.9% (6/153), reader 2: 6.5% (10/153)] and the peristalsis-related streak artifact was the major issue (Fig. 7). The inter-rater correlation coefficient was 0.61 (95% CI: 0.35 to 0.87).

Fig. 7
figure 7

An 80-year-old female with implants at left proximal femur. The CTDIvol of AIDR 3D image (a) was 9.9 mGy and SEMAR image (b) was 8.7 mGy. The SEMAR image (b) presented increased image noise and tiny peristalsis-related streak artifact (arrow) at left anterior abdominal wall and adjacent bowel loops

Discussion

Results of the present study showed that the SEMAR algorithm significantly reduced metal artifacts and provided diagnostic quality images in more than 80% of the included patients. Most SEMAR images with poor quality were noted in the patients with prostheses. In the SEMAR images, no significant differences were found in the mean attenuation of soft tissues around the hardware compared with the reference images. These findings suggest that the measurements provided by the SEMAR images are reliable. To the best of our knowledge, this is the first study to evaluate the reliability of attenuation and the influence of different types of hardware in SEMAR images.

Previous studies, both phantom and clinical studies, have shown that the SEMAR algorithm significantly reduces metal artifacts [11, 14,15,16,17]. In the present study, the metal artifacts derived from hardware led to non-diagnostic quality in all images with the AIDR 3D algorithm, and the SEMAR algorithm rescued more than 80% of the images with metal artifacts in the diagnosis of soft tissues. These findings were similar to those of previous studies with pelvic CT [12,13,14,15, 18]. However, these previous studies investigated the reduction of metal artifacts in patients with hip prostheses only, and the present study investigated metal artifact reduction in patients not only with hip prostheses but also with internal fixation. The present study found that the imaging quality in patients with internal fixation only, especially unilateral fixation, was better than that in patients with prostheses, which was upheld for both AIDR 3D and SEMAR algorithms.

Although previous SEMAR studies showed not only excellent effects in reducing metal artifacts due to hip prostheses but also less variation in the CT attenuations of the bladder and muscle [13], the reliability of CT density measurement at adjacent soft tissues was not well studied or discussed. The reliability of CT density measurement is important, especially in soft tissue pathology such as the differentiation between effusion, muscle, and hematoma that may influence clinical treatment decisions. In the present study, no statistically significant differences were shown between the mean densities of the urinary bladder, veins, and fat in the SEMAR images with artifacts and those in the reference images with the AIDR 3D algorithm. The results supported the good reliability of the SEMAR algorithm in CT density of the soft tissues around the metal hardware and the measured densities can be applied to the traditional evaluation criteria as well as in morphology evaluation. However, the image noises of the SEMAR images with artifacts were statistically significantly higher than those of the reference images with the AIDR 3D algorithm. These findings suggest that a region of interest should be drawn as large as possible to avoid the influence of image noise.

To evaluate the reliability of the SEMAR algorithm in the images without metal artifacts, the imaging qualities of the images with AIDR 3D and SEMAR algorithms were evaluated through side-by-side comparison, demonstrating the same quality as in most patients (92.2–95.4%). We also found no statistically significant differences when comparing the differences between the mean densities and noises of the urinary bladder, veins, and fat between the reference images of AIDR 3D and SEMAR algorithms. These results suggest that the SEMAR algorithm is reliable not only in images with metal artifacts but also in the images without metal artifacts.

In the present study, the CTDIvol of the images with SEMAR algorithm was statistically significantly lower in patients with unilateral fixation than in those with the AIDR 3D algorithm, and no significant differences were found between the AIDR 3D and SEMAR algorithms in patients with prostheses or bilateral fixation. In our CT protocol setting, the tube current (mA) was determined by automatic exposure control (AEC) and the reconstruction algorithm was one of the influencing factors. We supposed that the better reduction of metal artifacts caused by unilateral fixation may be the major cause of a lower tube current. Although no similar study has compared the radiation dose between these two algorithms, previous SEMAR study has shown that the radiation doses in the patients with bilateral prostheses were higher than those in the patients with unilateral prosthesis [15]. According to these findings, we supposed that the reduction ability for metal artifacts contributes not only to the image quality but also to the radiation dose.

The present study has a few limitations that warrant discussion. First, the study was retrospective and discussed the imaging quality of the SEMAR algorithm in patients with metallic devices. However, we lacked specific information about the materials of the metallic devices, such as titanium, steel, or other metals, and could not verify the influence of these different materials. Second, the sample size was small, especially in groups of bilateral fixation and arthrography + fixation. Third, the enrolled subjects had no significant disease of the pelvic and periarticular organs, so the diagnostic accuracy of AIDR 3D and SEMAR algorithms could not be evaluated.

In conclusion, the SEMAR algorithm provides significant reduction of metal artifacts, more in patients with internal fixations than in patients with prostheses. The attenuation of soft tissues in CT with the SEMAR algorithm is reliable not only in the images with metal artifacts but also in the images without metal artifacts.