Introduction

Assessment of bone density from multidetector computed tomography (MDCT) images using Hounsfield units (HUs) is considered an objective, accurate, and stable measurement [1, 2]. In imaging of dental implant sites, positive correlations have been reported between various measures of primary implant stability and bone density measurements recorded from MDCT [36] and quantitative CT [79]. Furthermore, due to the accuracy in thresholding and segmentation of MDCT data, it has been found to be one of the most accurate modalities in production of 3D models of the jaws [1013], which is significant when MDCT is used in the production of surgical stents for dental implant surgery.

However, the increasing use of MDCT has been cited as a cause for the increasing collective dose of ionizing radiation to populations [14], and cone beam CT (CBCT) has been advocated as imparting lower radiation doses to patients, and is increasingly becoming more available to oral implantologists. MDCT, though, is still utilized by many practitioners, and the position statement of the American Academy of Oral and Maxillofacial Radiology on selection criteria for the use of radiology in dental implantology recognizes the continued use and need for MDCT in oral implantology, but stresses that dose-sparing protocols should be used whenever possible without adversely affecting diagnostic accuracy [15]. The remarkable progress in MDCT technology has produced devices with more detectors, allowing faster scanning times and low-dose exposures [16, 17], and the reported doses from ultra-low-dose MDCT protocols [18] are lower than those reported for several dentomaxillofacial CBCT devices [19, 20]. Further, the use of various iterative reconstruction techniques (IRTs), such as adaptive statistical iterative reconstruction (ASIR) and model-based iterative reconstruction (MBIR) has shown improvements in MDCT image quality with reduced radiation doses, when compared with the traditionally used filtered backprojection technique (FBP) [18, 21]. However, the resultant images reportedly demonstrate an "oversmoothing" effect [18, 22], of which the effect on density and thresholding must be considered. Also, the HU scale may vary when the MDCT exposure parameters are lowered, especially x-ray beam energy (kVp) [23, 24].

At the time of writing, to our knowledge, there is no information in the published literature regarding the effect of considerable dose reductions in combination with various IRTs on the combined HU and noise levels in dental implant site imaging. The variations within these objective measurements needs to be investigated to understand how the use of low-dose IRT protocols may affect analysis of bone density of implant sites and computer guided surgery. Therefore, this study aimed to compare the objective density and noise measurements recorded from MDCT images of dental implant sites using ultra-low doses combined with FBP, ASIR and MBIR with those from a standard dose/FBP protocol, and to investigate the effect of dose and reconstruction technique on the measurements.

Materials and methods

Cadaver selection

Two cadaveric heads (one edentulous and one partially dentulous) with intact soft tissues were used in the study. The bodies were donated by people who had given their informed consent for their use for scientific and educational purposes prior to death. The study fulfilled all requirements necessary for studies on human cadavers according to the regulations of the Division of Clinical and Functional Anatomy, Medical University of Innsbruck [25, 26]. All cadavers were preserved using an arterial injection of an alcohol-glycerin solution and immersion in phenolic acid in water for 1 to 3 months [27].

Imaging of the cadavers

The cadaver heads including the mandible were scanned using a 64-multi-slice CT scanner (Discovery CT750 HD, GE Healthcare, Vienna, Austria). The scan range included the entire skull and mandible. Each cadaver was exposed to a high-resolution reference protocol and four low-dose protocols (Table 1). The pitch factor for all protocols was 0.5. There was no movement of the cadavers during and between the five different exposures. In addition to the FBP standard reconstruction all images were reconstructed using the following IRTs: ASIR-50, ASIR-100, and MBIR. Adaptive statistical iterative reconstruction (GE Healthcare) uses information obtained from the FBP algorithm but integrates a comparison of the pixel values with an ideal value to selectively identify and then subtract noise from an image at adaptive blend levels which are freely selectable, typically from 10 % to 100 %. [22] The ASIR reconstructions used in the present study were ASIR-50 (50 % FBP and 50 % ASIR) and ASIR-100 (100 % ASIR). A bone convolution kernel was used in the images of cadaver 1 (except with MBIR for which only a standard convolution kernel was available). A standard reconstruction kernel was used in all the images of cadaver 2. The volume CT dose index (CTDIvol) and dose length product (DLP) were obtained from the digital imaging and communication in medicine (DICOM) tags. In addition, effective doses were calculated using CT-Expo V 2.1 (Medical University Hannover, Germany), an MS Excel application for assessing the radiation doses delivered to patients undergoing CT examinations based on the scan parameters of the used scanner type. A protocol summary can be found in Table 1.

Table 1 List of exposure parameters and reconstruction techniques used to produce the various combinations of MDCT image datasets. Dose length product (DLP) and effective dose include the scan of the entire skull and mandible

Processing and viewing of 3D datasets

Each of the 40 MDCT data sets was converted to DICOM format and uploaded onto a shared Google drive. The DICOM data sets were then imported into a 3D image reformatting software (OnDemand Software, version 1.0, Cybermed Inc., Seoul, South Korea). The images were viewed on a 22" flat panel LCD colour monitor in landscape mode (Dell P2210, Round Rock, TX, USA). Specifications of the monitor were as follows: aspect ratio: 16:10; screen resolution 1680 × 1050 (highest, recommended; calculated pixel size: 0.282 mm); colour resolution: 32 bit; luminance 250 Cd/m2; contrast ratio (static): 1000:1. The window width/level of the MDCT images was set to 3000/650, and the images were viewed and measurements recorded in a dimly lit room.

Sample selection

The maxillary and mandibular edentulous ridges of both the imaged cadavers provided the study sample sites. The sample site inclusion criteria were all edentulous spaces within the jaws of the cadavers, at 5-mm intervals. The exclusion criteria for the edentulous spaces were the following: presence of foreign objects or artificial defects at the residual ridge, and areas of residual ridges where the standardized position of the region of interest (ROI) included non-osseous tissues. Based upon the inclusion and exclusion criteria, 41 sample sites were obtained (25 sites from cadaver 1 and 16 sites from cadaver 2). All the sample sites were in areas of the data sets which were not degraded by artifacts from foreign bodies or dental restorations.

Reformatting of sample sites

One oral and maxillofacial radiologist (A.A.) with 10 years experience in CT image processing and analysis performed the reformatting of all the datasets to obtain transverse cross-sectional images of the sample sites. The 3D module of the OnDemand software was used to produce each cross-section individually. The text overlay information was turned off in order to mask the mA, and kVp. Since the cadaver heads were not moved between the MDCT examinations, the default position and orientation of the orthogonal sectional planes relative to the jaws were consistent in all the MDCT datasets of each cadaver. Thus, standardization of the site and orientation of the reformatted sample sites was achievable through measured shift and angulation of the orthogonal sectional planes. The contrast and density and zoom level were also standardized amongst all images. Each reformatted site was saved as a bookmark on the master database of the reformatting software (on the hard drive of the computer) such that the examiners can access the reformatted sites using the OnDemand software.

Recording the test measurements

Two independent examiners who were blinded to the exposure parameters and reconstruction protocols recorded the test measurements. The examiners were OMF radiologists with 10 and 5 years experience in MDCT image processing and analysis. The ROI tool of the software was used to select a square area 4 × 4 mm in size which was positioned at the intersection of the axial and para-sagittal reformatting lines at the sample sites. Tables 2 and 3 show sample sites, with ROIs, obtained with each of the 20 combinations of dose protocols and reconstruction techniques using a bone kernel and standard kernel, respectively. The mean values for the HUs within each ROI were used to analyze density, and the standard deviation within each ROI was used to analyze the noise levels.

Table 2 Images of a sample site obtained using a bone kernel (except with MBIR) with the various combinations of dose protocols and reconstructions techniques. Mean HU and mean SD values (noise) at region of interest (marked by square) for each dataset are noted above the image. All images were compared to the reference dose protocol with filtered backprojection (red frame). n.s.: no significant difference (p = 0.05)
Table 3 Images of a sample site obtained using a standard kernel with the various combinations of exposure protocols and reconstructions techniques. Mean HU and mean SD values (noise) at region of interest (marked by square) for each dataset are noted above the image. All images were compared to the reference dose protocol with filtered backprojection (red frame). n.s.: no significant difference (p = 0.05)

The first examiner recorded all the measurements once. For reliability testing, 100 sample sites were selected to be equally distributed amongst the imaging protocols; five samples were randomly selected from each protocol (three sites randomly selected from cadaver 1 and two sites randomly selected from cadaver 2). An online random number generator was used for random selection of sites (http://stattrek.com/statistics/random-number-generator.aspx). The first and second examiners recorded the measurements from the 100 sites independently, with at least a 1-week interval between the first and second recordings by the first examiner.

Statistical analysis

The recorded measurements were analyzed with the statistical program: SPSS Versions 22 and 24 (IBM, Armonk, NY, USA). Intra- and inter-examiner reliability of the HU and noise measurements were analyzed by calculating the intraclass correlation coefficient. Bland–Altman plots comparing the examiners' readings were also obtained, and linear regression was used to test for proportional bias between the examiners' measurements. The Kolmogorov–Smirnov and Shapiro–Wilk tests of normality were performed for the subgroups imaged with a standard kernel, and indicated normal distribution of the data in these groups. Therefore, paired t tests were used to compare the HU and noise measurements recorded from each of the test combinations of protocols with those from the reference dose/FBP technique protocol. Two-way ANOVA with repeated measures was used to analyze the effect of dose and reconstruction technique on the measurements. Level of significance was set at p = 0.05.

Results

Intra- and inter-examiner reliability of the mean HU were 0.996 and 0.987, respectively (p < 0.001). For the noise measurements, they were 0.991 and 0.974, respectively (p < 0.001). Bland–Altman plots with linear regression (see electronic Supplementary Material) did not demonstrate systematic variation between the examiners' measurements for either HU or noise (p > 0.5).

Mean HU

A statistically significant difference in the mean HUs was found between the reference dose/FBP protocol and almost all the test combinations (p< 0.021), regardless of the kernel used. Using bone kernel exceptions were those protocols using the reference dose with ASIR 50 (p = 0.533), ASIR 100 (p = 0.096), and MBIR (P = 0.840; Table 2). MBIR was only possible with standard kernel, but was compared with the reference dose/FBP protocol bone kernel as well. Using the standard kernel, exceptions were the reference dose with ASIR 100 (p = 0.836) and low-dose protocol 1 with MBIR (p = 0.403; see Table 3). Overall, the maximum mean differences in HU were 178.35 HU using the bone kernel and 273.74 HU using the standard kernel.

When a bone kernel was used (except with MBIR), there was a significant effect for the interaction between the dose and reconstruction techniques on the mean of the HU (p = 0.007). When the dose and reconstruction technique were considered separately, there was a significant difference in mean HU between the different dose protocols (p < 0.001) with an overall pattern of increase in HU with decreasing dose, except at the lowest-dose protocols. No significant difference in mean HU was found between the reconstruction techniques (p = 0.353).

However, when a standard kernel was used with all reconstruction techniques, there was no significant effect for the interaction between the dose and reconstruction techniques on the mean of the HU (p = 0.318), although there was a significant difference in mean HU between the different dose protocols (p < 0.001) and between the reconstruction techniques (p < 0.001). Interestingly, using the standard kernel, use of MBIR decreased the mean HU when compared with the corresponding FBP and ASIR images. The maximum difference of the mean HU obtained from the reference dose/FBP protocol and the corresponding low-dose IV protocol was 162.48 HU using FBP but only 69.43 HU using MBIR.

Noise

A statistically significant difference in noise measurement (mean SD of HU measurements in bone) was found between the reference dose/FBP protocol and almost all the test combinations (p< 0.033). Using bone kernel exceptions were those protocols using the reference dose with ASIR 50 (p = 0.053), ASIR 100 (p = 0.400), and MBIR (P = 0.300; see Table 2). Using a standard kernel, exceptions were the reference dose with ASIR 100 (p = 0.705) and MBIR (p = 0.111; see Table 3). Overall, the maximum mean differences in noise were 243.73 using a bone kernel and 153.88 using a standard kernel.

There was a significant effect for the interaction between the dose and reconstruction techniques on the noise measurements when either a bone or standard kernel was used (p = 0.037 and 0.014, respectively). When the dose and reconstruction techniques were considered separately, there was a significant difference in noise between the dose protocols when either a bone or standard kernel was used (p < 0.001), with progressively increasing noise as the dose decreased. However, there was no significant difference in noise levels between the lowest dose protocols.

When a bone kernel was used there was also a significant difference in noise between the reconstruction techniques (p = 0.006), with FBP and MBIR showing the highest and lowest noise levels, respectively, and no difference was found between ASIR 50 and ASIR 100. When a standard kernel was used there was no significant difference in noise between the reconstruction techniques (p = 0.197).

Discussion

The present study aimed to investigate the effect of using ultra-low doses in combination with various reconstruction techniques on the objective density and noise measurements of MDCT images of dental implant sites. The investigated protocols allowed for dose savings of about 89–98 % as compared with a current reference dose protocol for implant planning and guided surgery. Reduction of radiation exposure to ALARA “as low as reasonably achievable” levels is a major task in radiology, driven by both legislative authorities and medical societies. And although CBCT is becoming more widely utilized for implant site analysis, ultra-low-dose MDCT may have an advantage over CBCT for analysis of multiple implant sites and if the CT examination is to be used for CT-guided production of surgical guides.

The lowest-dose protocols applied in the present study utilized the lowest mA and kVp possible with the MDCT device used. The calculated effective doses ranged from 100–500 microSv. However, the scans included the entire head and mandible; so, limiting the field of view (FOV) to one jaw only is expected to substantially lower the effective doses. Homolka et al. [28] showed that low-dose MDCT examination of the maxilla may impart an effective dose of 22 microSv, while that of a mandible may impart 123 microSv, which is comparable to a full-mouth survey with intra-oral films (150 microSv). Similar results were published by Cordasco et al. [16] who calculated mean absorbed dose using an ultra-low-dose MDCT protocol of CTDIvol of 2.5 mGy (comparable to the dose imparted by low-dose protocol 2 in the present study), which is similar to that related to conventional radiographic exams (lateral cephalogram + posterior-anterior cephalogram + panoramic), with a difference of about 0.06 mGy.

In order to standardize the area of interest and confine it to implant sites only, the size of the ROI selected was the largest possible size which would fit within the bone boundaries of all the sample sites. The largest possible ROI size was used in order to improve reproducibility of the measurements. The very high intra- and inter-examiner reliability obtained with the present technique indicates that positioning of the ROI was reproducible. Registration-based analysis is an alternative method which may also be used to ensure reproducible placement of the ROI.

The findings of the present study indicate that there was a significant difference in mean HU and noise between most of the test protocols and the reference dose/FBP protocol. The increase in mean HU with decreasing doses is expected considering the known effect of exposure parameters on the HU scale [24]. The lack of a significantly increasing mean HU observed between the lowest dose protocols in the present study may be due to the fact that the kVp, which is known to have an effect on the HU scale [29], was constant at the lowest-dose protocols. In dentomaxillofacial radiology where images are usually reconstructed using bone kernels, which are known to show higher accuracy compared to standard kernel for linear measurements of the jaws [30], additional reconstruction using a standard kernel and MBIR may potentially reduce HU variability for bone density evaluation in low-dose images.

The significant effect of the interaction between the dose and reconstruction techniques on mean HU found in the present study is in agreement with the findings of Herin et al. (2015) who reported higher HUs with a 70 % reduction in dose in combination with MBIR or ASIR 50 [31]. Furthermore, the lack of a significant difference in HU between reconstruction techniques when a bone kernel was used is in agreement with the findings of previous studies which found no difference in HU values of coronary artery plaque [32] or within the lungs [33] when FBP, ASIR, or MBIR were used. However, the significantly lower mean HU obtained with MBIR and standard kernel, compared to ASIR, in the present study is in contrast to the findings of Botsikas et al. (2014) who reported significantly higher HU numbers for urinary tract stones with MBIR, compared to ASIR [34].

In general, standard kernels show less noise than bone kernels [35]. Interestingly, in the present study, the reconstruction technique had no significant effect on noise levels when a standard kernel was used and with bone kernel a significant difference in noise between ASIR 50 and ASIR 100 was not observed. These findings are in contrast with the findings of previous studies which reported that increasing the percentage of ASIR led to a decrease in image noise [33, 36], and that MBIR images demonstrated less noise than ASIR [34, 37, 38]. The differences may possibly be explained by the fact that the noise measurements in the present study were confined to bone, whereas the previous studies evaluated noise as standard deviation of HU measures in soft tissue such as the posterior fossa, lung, or fat, or within air, such as the maxillary sinus. Similar to the results of the present study, Schulz et al. showed that the effect of IRT on noise reduction is more pronounced with bone (hard) kernels compared to soft (standard) kernels [37].

The clinical significance of the variations in HU and noise on implant treatment planning and surgery is still not clear. Standard-dose FBP protocols have been found to provide accurate 3D models of the jaws [1013]. Furthermore, ultra-low-dose protocols combined with IRT, utilizing dose reductions of more than 90 % compared to standard protocols, did not significantly interfere with subjective image quality of 3D models [38]. The accuracy of thresholding with these protocols is relevant to the CAD/CAM production of surgical stents for dental implant surgery; so, the effect of increased density and noise on thresholding accuracy needs to be investigated for the various combinations of ultra-low-dose protocols. Other implant imaging tasks which may possibly be affected by image noise are identification of anatomic margins and trabecular bone changes. Relevant studies performed with the various dose/reconstruction technique combinations may include evaluation of contrast noise ratios and morphometric analysis [39, 40].

The impact of the observed differences in noise and density between the various protocols on objective bone quality evaluation during implant site analysis is also still not clear. For previous investigators have related HU to bone density using HU values acquired with standard dose protocols and FBP. Although a wide range of HUs may be seen within each of the bone quality types described by Lekholm and Zarb (1985) [41], Norton and Gamble (2001) [42] published quantitative ranges for these bone quality types. Type 1 bone had an HU greater than +850. Type 2 and Type 3 bone were found to have overlapping HUs, mostly ranging from +500 to +800 HU. Type 4 bone had an HU range of less than 0 to +500.

An important aim of the present study was to demonstrate whether HU measurements remain stable with the currently available ultra-low-dose technology and IRTs. Different dose/reconstruction protocols were found to have different effects on HU. Although the observed maximum mean differences in HU of 178.35 (bone kernel) and 273.74 (standard kernel) obtained with low-dose protocols (compared with the standard protocol) may provide a false classification of bone quality, MBIR showed a lower discrepancy in HUs with ultra-low doses than the other reconstruction techniques. The difference of the mean HU obtained from the reference dose/FBP protocol and the low-dose protocol IV was 162.48 HU using FBP but only 69.43 HU using MBIR. Clinicians should be aware that MBIR may provide lower variability in HUs than other reconstruction techniques, but that HU values are not absolute values.

Furthermore, although modification of implant treatment planning on the basis of bone density has been advocated by some authors [43, 44], and primary implant stability has been correlated with bone density measurements recorded from MDCT [36], the effect of the combinations of the ultra-low doses and various reconstruction techniques on the determinants of implant stability still needs to be investigated. There is still no reference standard relating MDCT HU values to any of the determinants of implant site treatment planning and/or success [45]. In order to develop reference standards, reliable density values must be correlated with the various measures of implant stability and/or success for each combination of MDCT dose/reconstruction technique [46].