Computed tomography (CT) imaging is fast, reliable, and widely available, and thus is routinely used for diagnosis, staging, treatment response assessment, and surveillance in oncology [13]. As usage of CT imaging has become more prevalent, radiation exposure has increased and dose-decreasing strategies are becoming more warranted [4, 5]. While attempts to reduce radiation exposure are generally commendable, this should not come at the risk of decreasing image quality and diagnostic accuracy [6]. Several dose reduction techniques have been introduced in the last decade, such as helical acquisition, adaptive section collimation, tube current modulation, automatic exposure control, peak kilovoltage adjustment, and adaptive noise reduction filters [7, 8], which are routinely implemented on state-of-the-art CT equipment.

A newer approach is to tackle the reconstruction process of the image data. Filtered back projection (FBP) had been the most frequently used technique and is considered the reference standard. However, it relies on the relationship between the measured attenuation and the pixel values in the image and assumes that these measurements are truly reflective of the actual data values [9]. In reality, the data are noisy and the noise is subsequently amplified [9]. Newer technologies involve the use of iterative reconstruction (IR) which statistically models the object, noise, and CT imaging system itself in the projection data with forward (projection data generated from the images) and backward (images reconstructed from the projected data) reconstruction steps [10, 11]. Adaptive statistical iterative reconstruction (ASIR) was developed as one type of first generation iterative reconstruction algorithm, capable of dose reduction over a wide range. In clinical practice, ASIR is utilized as a hybrid technique with FBP and images are blended to obtain combined images [11] with vendor recommended dose reduction ranging between 30 and 50% [12].

A next generation full IR algorithm, model-based iterative reconstruction (MBIR), also takes into account system statistics and system optics and further reduces image noise at reduced dose settings [11]. As a full IR technique, MBIR performs multiple backwards and forwards reconstructions, without the need to blend with FBP [11]. Advantages are that it produces better image quality than FBP at very low signal-to-noise ratio settings [9] and low-radiation dose settings while maintaining high spatial resolution [13].

Previous studies have compared image quality of low-dose datasets reconstructed by FBP, ASIR, or MBIR obtained from various organs such as the liver [3, 10, 14, 15], small bowel [16], colon [17], and lungs [1823]. They showed that low-dose techniques with IR are feasible, thereby allowing decreased radiation dose while gaining more contrast to noise, less noise, and improved lesion or structural conspicuity. However, there is little experience regarding objective and subjective image quality of CT images reconstructed by MBIR versus the current standard of care reconstructions (FBP/FBP–ASIR blend) in an intra-individual prospective comparison study in cancer patients [24]. The cancer patient population is a good surrogate to compare the effects of radiation dose reduction on image quality as patients undergo multiple exams.

Therefore, the purpose of this intra-individual prospective comparison study was to compare image quality and lesion conspicuity of reduced dose (RD) CT with MBIR compared to standard dose (SD) CT in patients undergoing oncological follow-up imaging.

Materials and methods

Patients

This HIPPA compliant prospective study was approved by the Institutional Review Board of our institution and written consent was obtained from all participating patients. One radiologist reviewed the daily list of patients scheduled for CT imaging between a 6-month period. Patients over 18 years old with a known diagnosis of cancer who presented for a routine scheduled follow-up single contrast phase chest, abdomen, and pelvis CT examination were screened for possible inclusion. Only those patients with at least one routine single contrast phase chest, abdomen, and pelvis CT examination performed in our institution within the last 12 months were considered for possible inclusion. Consecutive eligible patients were selected randomly with a maximum of two patients included daily to avoid delays in clinical workflow due to the long reconstruction times of MBIR [15, 25, 26].

Patients who were unable to provide written informed consent or could not obtain intravenous contrast due to major contrast agent allergy or compromised renal function defined as a creatinine of greater than 1.5 were excluded. Seventy-six patients met the study inclusion criteria. Thirty-two patients (41%) were excluded (Fig. 1). The final study population consisted of 44 patients (19 women and 25 men; mean age, 58.6 years; range, 33–85 years). A subset of the included patients, 35 in total, had a total of 138 target lesions included for analysis of lesion size and lesion conspicuity.

Fig. 1
figure 1

Flowchart of the patient selection process

For each patient, weight (mean, 77 kg; range, 45–122 kg) was recorded. Prior weight was retrospectively obtained from the patient chart (mean, 79 kg; range, 45–122 kg). This difference in patients’ weights at the two time points was small but statistically significant (median, 0.7 kg; 95% CI 0–2.3 kg; P = 0.03). The average time interval between previous SD CT and RD CT scans was 106 days (range, 28–365 days).

The majority of patients carried a diagnosis of colorectal carcinoma (n = 12), lymphoma (n = 7), lung cancer (n = 6), or breast cancer (n = 5). Other cancer indications included gastric cancer (n = 3), pancreatic cancer (n = 3), small bowel cancer (n = 2), head and neck cancers (n = 2), ovarian cancer (n = 2), esophageal cancer (n = 1), and prostate cancer (n = 1).

Standard dose routine multi-detector row CT imaging technique

All patients had previously undergone a routine standard dose (SD), single-phase chest, abdomen, and pelvis CT on one of six multi-detector row CT scanners at our institution. Reconstruction technique, scanning parameters, and number of patients scanned on each of these CT scanners are listed in Table 1. Automated tube current modulation was used on all acquisitions: CarekV and CareDose 4D (Siemens) were used with a Reference tube voltage of 120 kV and Quality Reference mAs of 200. Smart mA (GE) technique was used with the noise index (NI) adjusted to patient weight and for a reconstruction slice thickness of 0.625 mm. Due to differences in automated exposure control software versions, the numerical values of NI tables varied across different GE scanners based on three weight categories. The NI values ranged from 46 for patients greater than 91 kg, 37 for patients between 68 and 91 kg, and 30 for patients less than 68 kg on the Discovery HD750 (GE) to 37, 29, and 24, respectively, for the Discovery 690 (GE) and Lightspeed VCT (GE). For chest imaging, the “B45f” convolution kernel (Siemens) and the “Bone” convolution kernel (GE) was used. For abdomen and pelvis imaging, the “B20f” (Siemens) kernel and the “Soft” convolution kernel (GE) was. All datasets were reformatted to a transverse slice thickness of 1.0 mm (Siemens), or 1.25 mm (GE). The transverse images were subsequently used for both quantitative and qualitative image analysis.

Table 1 Summary of prior standard dose CT scan parameters

Reduced dose multi-detector row CT imaging technique

All patients were scanned on an outpatient clinic 64-detector row CT scanner (Discovery 750HD, GE Healthcare, Milwaukee, WI) using the following parameters: 0.625-mm collimation, 100 kilovolt peak (kVp), and automated milliamperage modulation (Smart mA, GE Healthcare). The following parameters varied based on established institutional guideline tables followed by the technologists to optimize image quality based on patient body weight: pitch (either 0.5156 or 1.375) and gantry rotation time (mean, 0.8 s; range, 0.4–1.0 s). All patients were assigned one of three NI values based on their weight. To target a 50% dose reduction at 0.625-mm slice thickness, a NI value of 72 was selected for patients over 91 kg, a value of 51 for patients between 68 and 91 kg, and a value of 43 for patients less than 68 kg. Axial images at 0.625 mm were acquired and image reconstruction was performed with MBIR (Veo 1.0, GE Healthcare). These were further reformatted to obtain axial 1.0–1.3 mm slices of the chest, abdomen, and pelvis. These images were subsequently used for quantitative and qualitative image analysis (Fig. 2).

Fig. 2
figure 2

A 51-year-old female with rectal carcinoma scanned with SD and RD CT scans, 119 days apart. Transverse contrast-enhanced SD with 40% ASIR (A) and RD with MBIR (B) CT scans at the level of the left renal vein show the four tissue types examined for quantitative noise measurements: right hepatic lobe, aorta, right paraspinal muscle, and anterior subcutaneous fat

For both SD and RD CT scans, a weight-based amount of iodinated contrast medium, at 1.6 mL/kg at 350–370 mgI/mL, was injected over 40 s using a power injector (Empower CTA, E-Z-EM Inc., Westbury, NY, USA). Timing for portal venous phase was obtained by use of an empirical 65-s delay after the start of the contrast injection.

Quantitative image analysis

Quantitative image analysis of all CT datasets (SD and RD data) was performed by one board certified, body fellowship-trained radiologist with 6 years of experience in oncologic CT imaging. Quantitative imaging data analysis included recording of the radiation dose, quantitative assessment of image noise, and measurement of the maximal diameter of up to 5 target lesions according to response evaluation criteria in solid tumors (RECIST) 1.1 [27, 28].

Radiation dose

The CTDIvol values for the SD and RD CT scans recorded for all acquisitions were retrieved from the dose reports generated by the CT scanners and archived on the picture archiving and communications system (PACS, GE Healthcare). The average percentage dose reduction per patient was calculated based on the difference between the CTDIvol of the SD versus the RD CT scan [(RD MBIR CTDIvol − SD CTDIvol)/SD CTDIvol) × 100%].

Image noise

Noise values (standard deviations in Hounsfield units) were measured by drawing a circular region of interest (ROI) measuring 50 mm2 on transverse images of both the SD and RD CT scans in the following four predetermined anatomical sites as described [3, 26, 29, 30], all obtained at the level of the left renal vein: (1), in the anterior abdominal wall fat; (2), in the lumen of the abdominal aorta; (3), in the right hepatic lobe parenchyma; and (4), in the right paraspinal musculature. Gross visible tissue inhomogeneities such as vessels or dilated bile ducts in the liver parenchyma, atherosclerotic plaques of the aorta, or striations of fat in the paraspinal musculature, if present, were avoided in the ROI.

Maximal lesion diameter measurements

Up to five index lesions were found in patients with target lesions and the maximum diameter was measured on transverse images on both SD and RD CT scans. All lesions were re-measured at least 3 weeks later by the same reader to assess reproducibility of index lesion measurements.

Qualitative image analysis

Qualitative image analysis was performed by three board-certified, fellowship-trained body radiologists (reader A, 22 years; reader B, 14 years; and reader C, 6 years of experience in CT cancer imaging). All readers were blinded to patient demographic information, history, diagnosis, and imaging findings. After an initial training session with three randomly selected MBIR studies which were not part of this study in order to familiarize all readers with the criteria used for qualitative image analysis, all readers independently analyzed the transverse SD and RD CT scans in randomized fashion on a dedicated two-monitor PACS workstation (Coronis 3 MP (MDCG 3120), Barco, Inc., Barco NV, Belgium). Transverse images were initially presented in standard soft tissue windows (window width, 320; window length, 50) and readers were encouraged to change window settings and zoom images as they deemed necessary. Scoring sheets were provided to the readers to grade the following parameters by using Likert grading scales: Image contrast, subjective image noise, artifacts, overall subjective image quality, and lesion conspicuity.

Image contrast

Image contrast scoring of grade 1 (excellent), 2 (good), 3 (moderate), 4 (suboptimal), and 5 (poor) was based upon the ability to visualize small anatomic structures on transverse images. Within the chest, images were evaluated to determine whether pulmonary vessels were visible within 1 cm of the pleura and if both internal mammary arteries were visualized in at least 75% of their course [12, 19, 31]. Within the abdomen and pelvis, visibility of the following three anatomical structures was assessed: subsegmental portal vein branches to within 2 cm of the capsule, entire course of the common bile duct, and at least 75% of the course of the non-opacified ureters [32, 33]. If all structures were seen, this was scored as grade 1, excellent. If one structure was lacking, this would be scored as grade 2, good; if two structures were lacking, this would be scored as grade 3, moderate. A grade 4, suboptimal and a grade 5, poor were reserved for studies with three or four or more structures not visible, respectively.

Subjective image noise

Image noise was graded on a five-point scale based on the overall perception of quantum mottling in the images [3], defined as the readers’ perceptions of the graininess to the images compared to what the reader is used to on average CT scans. This was graded as 1, minimal; 2, less than average; 3, average; 4, above average; and 5, unacceptable.

Artifacts

Artifacts were graded on a four-point scale: grade 1, no artifacts; grade 2, some artifact present not affecting interpretation; grade 3, artifacts present affecting interpretation but still diagnostic; and grade 4, artifacts present affecting interpretation and not diagnostic. Readers were also asked to list the type of artifacts noticed as follows: streak artifact (from hardware or vessels), pixelation (a step-wise artifact at tissue interfaces), or beam hardening [3, 25, 31].

Overall subjective image quality

Overall subjective image quality was scored on a four-point scale: grade 1, “better than what I am used to and diagnostic”; grade 2, “similar to what I am used to and diagnostic”; grade 3, “worse than what I am used to but still diagnostic”; and grade 4, “worse than what I am used to and not diagnostic.”

Lesion conspicuity

Lesion conspicuity was scored using a three-point Likert scale. Lesions that completely resolved in the interval or patients without index lesions were not included for analysis. The preselected lesions were marked and bookmarked on PACS by the same reader who measured the diameter of the target lesions. All readers were provided a list of accession numbers and instructed to choose a numbered bookmark that contained the preselected target lesions. Conspicuity of each lesion was evaluated using the following grading scale: grade 1, “lesion delineation against the surrounding tissue is better than what I am used to”; grade 2, “lesion delineation against the surrounding tissue is similar to what I am used to”; and grade 3, “lesion delineation against the surrounding tissue is worse than what I am used to.” This was performed for both the SD and RD CT scans.

Statistical analysis

The sample size of 44 patients was chosen for the study to have 80% power for detecting a mean difference of 0.45 standard deviation with the paired Wilcoxon rank sum test at the significance level of 0.05 and we hypothesized that the differences in relevant parameters of RD CT with MBIR and the standard of care SD CT are greater than 0.45 standard deviation.

Radiation dose reductions were summarized with median levels and 95% confidence intervals (CI). Quantitative noise reduction and Hounsfield Unit attenuation were summarized with their median levels and 95% CI by different tissue types. The P values between SD and RD CT were obtained with non-parametric Wilcoxon test. The trend test was performed to test the association between the dose reduction and baseline weight. The reproducibility of maximal diameter measurements of target lesions on both SD and RD CT images were measured with the estimated intraclass correlation coefficients (ICC) and the corresponding 95% CI based on bootstrap method. For comparing qualitative image measurements including image contrast, image noise, overall image quality, artifacts, and lesion conspicuity, the non-parametric paired sign test was performed for each reader separately. All statistical tests were two-sided with a significance level of 0.05. The statistical analysis was performed using R. 3.1.3 (R foundations for statistical computing).

Results

Quantitative image analysis

Radiation dose reduction

Radiation dose was significantly reduced (P < 0.001) by a median of 46.2% (95% CI 40.7–49.2%; range, 6.3–68.8%) with RD compared to SD imaging (CTDIvol: 7.3 ± 3.4 vs. 13.0 ± 4.5 mGy, respectively, DLP: 520.5 ± 240.2 vs. 864.5 ± 334.8 mGy cm). The median dose reduction was 38.7% (95% CI 23.5–68.8%) in patients >91 kg, 46.1% (95% CI 39.2–49.2%) in patients between 68 and 91 kg, and 47.8% (95% CI 36.4–56.1%) in patients <68 kg (Table 2). There was no statistically significant difference (P = 0.624) in dose reduction among the three different weight classes.

Table 2 Median radiation dose reduction among the three patient weight categories

Quantitative image noise

Median image noise was significantly lower on RD versus SD CT images across all tissues measured (P < 0.001; Table 3). Whereas the median reduction of image noise obtained on RD CT compared to SD CT was approximately 50% in the abdominal aorta (54.5%, 95% CI 49.6–58.1%), right hepatic lobe (49.4%, 95% CI 44.8–55.8%), and paraspinal musculature (50.2%, 95% CI 40.7–55.1%), the noise reduction was less in anterior abdominal fat with a median reduction of 33.1% (95% CI 24.2–41.7%); however, noise reduction was still statistically significant (P < 0.001) in abdominal fat. Hounsfield Unit attenuation per tissue type also varied significantly (P < 0.001) across all tissue types when comparing RD images to SD images with the Hounsfield Unit attenuation measuring higher on average in the aorta, liver, and paraspinal musculature and lower in fat (Table 4).

Table 3 Median noise and percent noise reduction per tissue type
Table 4 Median attenuation measurements per tissue type

Reproducibility of maximal lesion diameter measurements

Of the 138 target lesions included for analysis that were subsequently evaluated on both RD and SD CT imaging, there were 70 lymph nodes (range 3–23 mm), 31 lung lesions (range 3–88 mm), 13 liver lesions (range 4–65 mm), 8 abdominopelvic implants (range 8–36 mm), 6 adrenal lesions (range 13–50 mm), 3 bowel masses (range 13–48 mm), 3 pancreatic lesions (range 17–30 mm), 3 muscle lesions (range 13–38 mm), and 1 osseous lesion (range 19–25 mm).

There was excellent reproducibility of maximal diameter measurements of the analyzed lesions on both RD and SD CT images. The intraclass correlation coefficient was 0.982 (95% CI 0.971, 0.989) for measurements obtained on RD and 0.987 (95% CI 0.979, 0.992) on SD CT images.

Qualitative image analysis

Image contrast

Overall, all three readers noted that the ability to visualize the small anatomic structures of interest was good to excellent on both RD and SD CT images and there was no statistically significant difference (P = 0.59) between RD and SD CT images in terms of image contrast (Table 5).

Table 5 Summary of subjective image quality numerical scores per reader for all examined categories of qualitative image analysis comparing standard dose and reduced dose CT

Subjective image noise

Overall, there was noted to be significantly (P < 0.01) lower noise on RD compared to SD CT images (Table 5) although one reader (Reader B) favored SD CT over RD CT (P < 0.01).

Artifacts

Artifacts were significantly more often noted on RD compared to SD CT (P < 0.01; Table 5). The vast majority of artifacts were graded as “some artifact present not affecting interpretation” (a grade of 2).

Overall subjective image quality

Overall subjective image quality was significantly higher (P < 0.01) on RD compared to SD CT images when taken in total (Table 5). Reader A graded the quality between SD CT and RD CT as fairly comparable with most RD CT cases and SD CT cases graded as “similar to what I am used to and diagnostic.” Although Reader B also graded most cases as “similar to what I am used to and diagnostic” for both RD CT and SD CT, there was significantly better (P < 0.01) overall subjective image quality in favor of SD CT. Reader C, however, graded overall image quality of RD CT images significantly higher than SD CT images (P < 0.01).

Lesion conspicuity

Overall, lesion conspicuity was graded significantly (P < 0.01) higher on RD CT images compared to SD CT images (Table 5), although one reader (Reader C) graded RD CT and SD CT similarly (P = 0.32). In general, conspicuity of most target lesions was graded as grade 2 (“lesion delineation against the surrounding tissue is similar to what I am used to”) on RD CT images.

Discussion

This study showed that RD CT with MBIR maintains image quality compared to our institution’s current standard of care SD CT while reducing the median dose by nearly half in follow-up oncology patients. For three independent readers, subjective image quality assessments were graded substantially higher and image noise substantially lower on RD compared to SD CT exams. Also, dose reduction did not compromise reproducibility of consecutive target lesion size measurements, suggesting that this technique can be used for routine follow-up imaging in cancer patients.

The NI values used in this study were selected based on our previous experience with in house phantom testing and an independent group of 30 consecutive patients (data not shown and not part of this study) for which radiation dose at acceptable image quality was optimized for the three weight classes for routine clinical purposes in our institution. The chosen NI values were selected to target a dose that was approximately half of what the expected CTDIvol would have been for a SD CT exam of patients in these three weight classes. A target of ~50% dose reduction was considered realistic as previous studies reported that image quality using MBIR at this dose reduction level was still acceptable [3, 34]. Further reduction of dose resulted in perceived non-diagnostic image quality of the kidneys and retroperitoneum in a different study analyzing imaging quality of MBIR reconstructed excretory phase CT datasets at different dose reduction levels [34]. Compared to other studies, in our study we did not use a fixed NI for RD CT scans for all patients, but rather introduced a strategy with three noise values associated with different weight categories. With this strategy we were accepting higher NI for more obese patients, thereby also allowing substantial dose reduction in that patient group despite automatic upregulation of the tube current by the CT scanner. Consequently, there was no significant difference in dose reduction among the three patient weight groups.

Previous studies assessed objective and subjective image quality criteria on RD datasets reconstructed with different reconstruction algorithms such as FBP, ASIR, and MBIR in several body parts [3, 10, 1423]. In those studies, MBIR outperformed FBP and ASIR in terms of diagnostic image quality when applied to RD CT datasets due to its more complex and accurate reconstruction algorithm. Our study expands previous reports by using a different study design exploring the performance of MBIR in an intra-individual comparison study of consecutive cancer patients scanned with RD CT and using their previous SD CT scans as intra-individual control exams. The results of our study confirmed our hypothesis that for the purpose of oncological follow-up exams, MBIR allows substantially reduced radiation dose of chest, abdomen, and pelvic CT scans while maintaining diagnostic imaging quality. Interestingly, although readers noted that MBIR reconstructed datasets often appeared “blurry” in our study, also sometimes referred to as “pixelated blotchy artifact” [35], all three readers, in particular reader C, rated overall subjective image quality of MBIR datasets equal to or higher than standard of care SD CT scans. While there are several factors influencing radiologists’ perceptions of image quality, based on our experience with MBIR, some of this is related to becoming accustomed to the different looks of CT scans over time and appreciating the objectively decreased noise following iterative reconstruction of the CT datasets.

We acknowledge the following limitations of our study. First, this study did not address how accurate the interpretation of the CT exam was regarding cancer detection and staging. Rather, this study addressed whether image quality on RD CT scans was sufficiently high in patients with known cancer to monitor their disease. More studies are warranted to address how the use of MBIR in RD CT exams may affect other aspects of diagnostic imaging such as in characterizing focal lesions. In fact, our study showed that there were substantial differences in HU attenuation measurements in the tissues measured between the RD and SD images, confirming previous reports [3, 32, 36] and indicating that the use of MBIR may affect interpretation of entities based on absolute HU value measurements or analysis of other quantitative CT features [37]. Second, the nature of our study design introduces the potential limitation that lesion conspicuity may change in patients undergoing cancer treatment due to the course of treatment or course of disease rather that CT reconstruction technique as the lesions grew or got smaller in the time interval. Lesion conspicuity could both increase or decrease in our population and both progression and improvement could occur. However, it is unlikely that this limits the generalizability of these results. Third, the prospectively implemented dose reduction protocol using MBIR in this study was compared with various standard dose routine protocols implemented on a total of six different CT scanners available in our institution at the time of the study. Automatic exposure control dependence on patient size varied among different scanner models, therefore, although we reconciled dose performance and image quality on different scanners for an average adult patient size, the actual dose and resultant image quality varied due to different automatic exposure control behaviors. However, we chose this study design to reflect a real-life setting of an institution running several types of CT scanners with different protocols which improves generalizability of our results. Finally, since reconstruction times of the MBIR technique used in our study were long (averaging ~120 min, though this was not systematically measured as part of our study), this limited the inclusion of patients to a maximum two per day to not interfere with routine clinical workflow and may have biased patient inclusion in our study. MBIR incorporates the geometry of the whole imaging chain, including the finite size of X-ray focal spot size, cone-beam shape, detector elements, and three-dimensional reconstruction voxels [38]. Both photon statistics and electronic noise are modeled in MBIR. The extensive modeling of forward projection process is part of the reason that MBIR takes substantially longer reconstruction time than previous generations of IR techniques, such as ASIR. Further improvements of MBIR and alternative next generation iterative reconstruction algorithms may provide faster data reconstruction allowing larger scale implementation of low-dose strategies in the clinic [39, 40].

In conclusion, our study suggests that RD CT imaging with MBIR allows radiation dose reduction by almost 50% compared to a spectrum of current standard of care SD CT protocols while not interfering with diagnostic image quality. Reduced dose CT protocols using modern reconstruction algorithms may soon become the standard of care and could be expanded to other CT indications beyond cancer imaging.