Introduction

X-ray micro-computed tomography (micro-CT) enables non-destructive visualization of the internal structure of an object. By using projection images obtained from different angles around the object, virtual cross-sections (slices) are reconstructed through the sample, from which a 3D visualization can be created. In bone research, micro-CT is used to study pathologies, such as osteoporosis [1], osteoarthritis [2], rheumatoid arthritis [3], as well as bone tumors. It has also increasingly found its way in orthopedic research to assess biomaterials [4] and bone implants [5].

For in vivo small animal micro-CT imaging, it is important to minimize scanning time and the radiation dose given to the animal, in particular, if repeated scans are required. One way to achieve this is by reducing the number of projection images during scanning. For example, by acquiring only half or a quarter of the projection images, potentially half or a quarter of the dose is given to the animal. However, this may introduce errors in the reconstructed cross-section image when utilizing traditional analytical algorithms such as that described by Feldkamp et al. [6], known as the filtered back-projection (FBP) method. Alternative reconstruction methods are the iterative algorithms [7], which consider the image reconstruction process as the optimization of a discrete representation of the object function in order to satisfy a system of equations that describes the imaging modality. They can provide reconstructions of higher quality compared to analytic methods, when only a limited number of projections are available [7]. Among these there is the discrete algebraic reconstruction technique (DART), which exploits a priori knowledge on the reconstructed grey levels. DART can provide high-quality reconstructions even with a small number of projections, as low as 10 [8]. Another advantage of discrete tomography is that the reconstruction directly computes the segmented (binarized) images necessary for the quantitative morphometric analysis, avoiding an additional segmentation step and selecting its threshold value. The DART algorithm has shown great potential in fields such as electron microscopy [9,10,11]. However, its application in X-ray micro-CT so far has been limited [12, 13]. In particular, it has never been explored in the field of in vivo micro-CT.

The aim of this manuscript is (1) to apply DART to repeated in vivo micro-CT scans performed on a rat model of osteoporosis, to explore DART’s capability in detecting bone loss over time compared to the standard FBP technique; (2) to apply FBP and DART on the same scan data, however, by using progressively fewer projection images (1/4 and 1/6 of the original number of projections), and assess the respective capability of the two techniques in detecting bone loss.

Materials and Methods

Experimental setup

Animals

We used a subset of the micro-CT projection image datasets taken from a previously published study on female Sprague–Dawley rats by Perilli et al. [1]. Rats were 8 weeks of age at the beginning of the study (baseline). Two subgroups of rats were examined: a group with induced bone loss due to ovariectomy (OVX, n = 3) and a sham-operated group (SHAM, n = 3) that served as a control. The animals had been micro-CT scanned in vivo at five time points, i.e., at time of surgery (baseline) and at 2, 4, 8 and 12 weeks after surgery.

X-ray micro-CT acquisition

The projection image acquisition was performed in the previous study by using an in vivo X-ray micro-CT system (SkyScan 1076, SkyScan-Bruker, Kontich, Belgium), using a standard protocol for in vivo small animal studies [1]. Briefly, while under anesthesia, the right hind limb of each animal was placed into a cylindrical plastic holder to position the limb at the scanning midline and to prevent movement. The X-ray source voltage was set to 74 kVp/100μA and a 1-mm-thick aluminum filter was used for beam hardening reduction. An isotropic pixel size of 8.7 μm was obtained for a 4000 × 2096 CCD detector array, resulting in a total field of view of 35 × 18 mm. The rotation step was 0.8° over 180°, leading to 248 projection images per scan (manufacturer default setting for in vivo scanning). The exposure time was 4.7 s and the total scan time per animal was 20 min at each time point [1].

Image reconstruction

All FBP and DART cross-section image reconstructions were performed first using the entire set of projection images, which was used as the ground truth, then 1/4 and then 1/6 of them (248, 62 and 42 projection images, respectively).

Image reconstruction using filtered back-projection (FBP)

For the standard FBP algorithm, the implementation of the SkyScan NRecon software was used to reconstruct a stack of 1200 cross-section images of 4000 × 4000 pixels from each scan. The FBP algorithm reconstructs the cross-section images to a set of grey values, saved as 8-bit images (256 grey levels).

Pre-processing of the projection images was performed for beam hardening reduction, ring artifact correction, Gaussian smoothing and post-alignment (NRecon software, SkyScan). The NRecon software provides the option to export the projection images after pre-processing, so that these can be used by other reconstruction algorithms, in this case, DART. This way, both reconstructions (FBP and DART) used the same pre-processed projection images.

Image reconstruction using the discrete algebraic reconstruction technique (DART)

A 3D DART implementation was created in MATLAB based on the ASTRA Toolbox (University of Antwerp, Belgium), which provides the crucial projection operations, hardware accelerated with NVIDIA CUDA (NVIDIA Corporation, Santa Clara, CA, USA) [14,15,16].

The DART algorithm utilizes a priori knowledge of the grey levels (attenuation coefficients of the materials) present in the reconstructions, to achieve accurately segmented cross-section images. In the present study, from a radiological perspective, the scanning field of view contained mainly three types of materials: background (air), soft tissue and bone. Hence, the algorithm was set to reconstruct the cross-section images containing three grey values representing these three materials. These grey level values were first estimated from a standard FBP reconstruction of one rat at one time point at the end of the study, using the full projection dataset; these were then applied in all DART reconstructions. The estimation was done using a projection distance minimization method, which automatically finds the optimal threshold values of the (three) different materials in a sample using the projection images [17].

Morphological parameter calculations

The reconstructed datasets were then analyzed (software “CT-Analyser”, Version 1.13.2.1 64-bit, SkyScan–Bruker microCT). To perform trabecular morphometric analysis, typically two steps need to be applied on the reconstructed dataset: the selection of the trabecular volume of interest (VOI) and the binarisation of the grey-level images. For both FBP and DART datasets, the trabecular VOI was identical to the one used previously [1], containing only trabecular bone, starting at a distance of 1 mm from the lower end of the growth plate and extended distally for 350 cross-sections (3 mm in height). Binarisation was done on the FBP VOI datasets only (as the DART images are already thresholded), using global (uniform) thresholding [1]. A despeckle procedure was performed to remove isolated voxels (all objects smaller than 10 voxels, default value for CT-Analyser software). Over the binarised VOI datasets, the following histomorphometric parameters were calculated: bone volume fraction (BV/TV), trabecular thickness, separation and number (Tb.Th, Tb.Sp, Tb.N), and structure model index (SMI), a topological parameter. The number of objects (Obj.N) in 3D present in each VOI was also calculated (CT-Analyser).

Statistical analysis

The study sought: (1) For each reconstruction method, to compare each morphometric variable measured at a certain time point with the corresponding baseline, and to compare each variable measured at a given time point with the previous time point. For this, a Friedman test over the structural parameters was performed. If the p values for a given variable were found significant, a Mann–Whitney test was used to investigate time-related changes. (2) To assess the capability of each reconstruction method to distinguish the two animal groups for a given number of projections; for this, the two groups were compared to each other at each time point (Mann–Whitney test). (3) To determine if within an animal group there were differences in the measured values that were due to the number of projections (all, 1/4, or 1/6 of the projections); a Friedman, followed by a Mann–Whitney test was performed at each time point. The alpha level for statistical significance was set at 0.05 for all the comparisons. Statistical analysis was performed using the Real Statistics Resource Pack software (Release 4.3; http://www.real-statistics.com/) for Microsoft Excel [18].

Results

Structural parameters describing the trabecular bone changes were calculated over the VOIs and compared for the different time points (Tables 1, 2). Figures 1 and 2 show transaxial cross-section images of a rat limb and the trabecular bone region of interest, respectively, in a SHAM rat reconstructed with the FBP and DART algorithms, at a decreasing number of projections. In Fig. 3, a 3D micro-CT rendering of the analyzed trabecular VOI of a rat from the SHAM and OVX group is shown at baseline and week 12, for the different reconstruction methods and numbers of projections.

Table 1 Standard filtered back-projection (FBP): overview of the bone structural parameters calculated for each group over time. Comparisons are made between time points: the difference at each time point compared to its baseline and to the previous time point, respectively
Table 2 Discrete tomography (DART): Overview of the bone structural parameters calculated over time for each group. Comparisons are made between the time points: the difference at each time point compared to its baseline and to the previous time point, respectively
Fig. 1
figure 1

Axial micro-CT cross-section image of a rat limb in a sham-operated rat at week 12. Top row: standard filtered back-projection reconstructions (grey level image) using all projections (FBP A), ¼ (FBP ¼) and 1/6 (FBP 1/6). Middle row: FBP after thresholding, bone in white colour, background in black. Bottom row discrete tomography reconstructions with all projections (DART A), ¼ (DART ¼) and 1/6 (DART 1/6), where bone is white, background black and soft tissue grey

Fig. 2
figure 2

Same axial micro-CT cross-section image as in Fig. 1, but now showing only the trabecular bone region of interest used for analysis. Top row: standard filtered back-projection reconstructions (grey level image) using all projections (FBP A), ¼ (FBP ¼) and 1/6 (FBP 1/6). Middle row: FBP after thresholding and application of a noise despeckle filter (10 voxels), bone in white colour, background in black. Bottom row: discrete tomography reconstructions with all projections (DART A), ¼ (DART ¼) and 1/6 (DART 1/6), after application of a noise despeckle filter (10 voxels), bone in white colour, background in black

Fig. 3
figure 3

3D micro-CT models of the trabecular bone VOI for a sham-operated and an OVX rat at baseline and week 12, using all projections (FBP A or DART A), ¼ (FBP ¼ or DART ¼) and 1/6 (FBP 1/6 or DART 1/6)

In each animal group, the overall trend of the structural parameters over time was independent of the chosen reconstruction algorithm. For both reconstruction methods, in the OVX group, the BV/TV and Tb.N decreased significantly over time (p < 0.05), while the Tb.Sp and SMI increased (Fig. 4; Tables 1, 2). Instead, in the SHAM group, BV/TV, Tb.N and Tb.Th increased, whereas SMI decreased. Although the trend over time of the morphometric parameters in each group was independent of the reconstruction technique, the actual values differed between techniques. The BV/TV, Tb.Th and Tb.N obtained by DART were consistently higher than by FBP (for example, BV/TV was almost doubled in all DART reconstructions compared to FBP, Fig. 4).

Fig. 4
figure 4

Trabecular bone morphometric parameters (BV/TV, Tb.Th, Tb.Sp, Tb.N and SMI) monitored over time, for FBP (left column) and DART (right column) algorithm. A for SHAM, A’ for OVX group: p < 0.05 compared to their baseline, for all projections (Mann–Whitney test). ¼ for SHAM, ¼’ for OVX group: p < 0.05 compared to their baseline, for ¼ of the projections (Mann–Whitney test). 1/6 for SHAM, 1/6’ for OVX group: p < 0.05 compared to their baseline, for 1/6 of the projections (Mann–Whitney test)

Significant differences in morphometric parameters between the SHAM and OVX animals were detected from week 2 onwards, independent of the reconstruction method (Table 3). These differences between groups were, in general, preserved also with decreasing number of projections, for both reconstruction techniques. However, FBP reconstructions were more affected than DART, as follows (Table 4). For the FBP, when the number of projections was reduced, BV/TV was increased compared to the full projection dataset (in SHAM, the increase range was 6–23% or 19–53% when reduced to 1/4 or 1/6, respectively; in OVX, the increase was 10–23 or 30–58%), statistically significant at baseline (SHAM 1/6) and week 2 (OVX 1/6). For DART, these projection-related differences were much smaller (3–5 or 5–7% in SHAM at 1/4 or 1/6, 0–5 or 0–8% in OVX, respectively) and not statistically significant at any time point. Similarly, other parameters, such as Tb.Sp and Tb.N, showed significant differences in the reduced compared to the full dataset; again, these were found mainly for FBP reconstructions (Table 4). For example, in OVX, when reducing the projections for the FBP, the Tb.Sp progressively decreased by up to 68% of its original value (from 1078 to 685 μm or 348 μm, for all, 1/4 or 1/6 of the projections), approaching values close to the range of the SHAM group (231, 176 μm or 143 μm). Correspondingly, the Tb.N increased with fewer projections, with highest increases found particularly in the OVX group, up to +79% of its full-projection values. In DART, these differences were much lower and, in general, not statistically significant both for Tb.Sp (max 18% SHAM or 15% OVX, when reducing to 1/6 of the projections) and Tb.N (max 14% SHAM, 16% OVX). Tb.Th showed significant increases over time for the SHAM and OVX group, for both methods. Reducing projection images to 1/4 or 1/6 significantly decreased Tb.Th for FBP (in SHAM up to −16% or −20%; in OVX −15% or −25%), whereas it increased in DART (in SHAM, up to 14 or 24%; in OVX, 15 or 25%). SMI decreased over time in SHAM and increased in OVX, for both methods. With fewer projections, SMI increased for both methods and animal groups, reaching particularly high (and improbable) values for FBP 1/6 OVX at weeks 8 and 12 (SMI = 3.2 and 3.4, respectively), compared to DART 1/6 OVX (SMI = 2.5 and 2.6, respectively).

Table 3 OVX vs. SHAM groups, statistical significances in morphometric parameters at each time point (Mann–Whitney test)
Table 4 Percent differences in bone morphometric parameters due to a decreasing number of projection images: ¼ and 1/6 of the projection images vs. all projection images

The Obj.N, when reducing the number of projection images, was significantly increased in the FBP reconstructions at all time points (up to a maximum of 1300% increase, 1/6 FBP vs. all FBP projections, OVX week 12); this was not so in the DART reconstructions (Fig. 5). This increase in amount of objects is also visible in the reconstructed cross-section images (Figs. 1, 2) and in the 3D renderings (Fig. 3).

Fig. 5
figure 5

The object number present in the 3D trabecular VOI calculated for the FBP (left graph) and DART (right graph), at different time points and number of projections. With decreasing number of projections (all, ¼, 1/6), the object number is much more affected for FBP where it significantly increases up to over 1300% of its value (all projections vs. 1/6 for FBP, week 12, OVX), than for DART. a for SHAM, a′ for OVX group: p < 0.05 between all projections and ¼ of the projections, for either FBP or DART (Mann–Whitney test). b for SHAM, b′ for OVX group: p < 0.05 between all projections and 1/6 of the projections, for either FBP or DART (Mann–Whitney test). c for SHAM, c′ for OVX group: p < 0.05 between ¼ of the projections and 1/6 of the projections, for either FBP or DART (Mann–Whitney test). s statistical significances in morphometric parameters at each time point (Mann–Whitney test)

Discussion

When studying bone changes in an in vivo animal experiment it is important to keep the dose and scan time as low as possible without compromising the image quality and the morphological parameter calculations. One way of achieving this is by reducing the amount of projection images. However, by doing so, the data available for calculating an accurate reconstruction image using the standard FBP reconstruction techniques might not be sufficient. In this study, DART is presented as a valuable iterative reconstruction technique which is less sensitive to the decrease in projection images compared to FBP.

When comparing bone changes over time between the OVX and SHAM group by using FBP or DART reconstructions, the general trend observed in the morphometric parameters was independent of the reconstruction method. The two animal groups showed statistically significant between-group differences as well as time-related differences when compared to their own baseline (natural bone growth in SHAM, bone loss for OVX), although the actual values differed between the two reconstruction techniques (e.g., DART showed higher BV/TV values than FBP, for both rat groups).

Reduction of the number of projections had statistically significant effects, mainly for FBP. With a decreasing amount of projection images, there was an increase in noise speckles in the FBP reconstructions (Figs. 1, 2, 3), which led to an increase in Obj.N (Fig. 5). On the binary images, new random speckles appeared (noise) as well as some trabeculae became (artificially) broken. As a result, there were increases in BV/TV (between 6 and 58%, depending on the time point and animal group), statistically significant at 2 out of 5 time points (with more animals per group these would likely be significant at each time point). Other micro-architectural parameters, such as Tb.Sp, Tb.N and SMI, were significantly affected by the reduction of the projection images from the full dataset to 1/6 in FBP. The DART algorithm instead did not show this behavior, neither for BV/TV, nor for Tb.Sp, Tb.N and SMI, suggesting it to be more robust when reducing the number of projection images. Also the average Tb.Th decreased for FBP as a consequence of the appearance of small noise speckles in the images.

In the OVX group, a major amount of trabecular structure is lost by week 12. The increased noise due to reduced numbers of projections (Figs. 2, 3), particularly for FBP, apart from increasing BV/TV, drastically decreased the Tb.Sp, as the free space between adjacent trabeculae became smaller due to new speckles appearing, thus increasing also Tb.N and SMI (Fig. 4). SMI is a topological parameter, with values of 0, 3 and 4 indicating an ideal plate-like, rod-like and sphere-like structure, respectively [19]. An increase in SMI as seen for FBP, from SMI = 2.6 (merely a rod-like structure containing a few plates) to SMI = 3.4 when reducing all the projection images to 1/6, means that for the latter, there is the artificial presence of sphere-like structures in the examined VOI, namely noise and disrupted trabeculae; indeed, an SMI value above 3 for trabecular bone is rather unrealistic. In contrast, for the DART algorithm, with a decrease in projection images, a substantially smaller spread in all the morphometric parameters is seen, with smaller and not significant differences compared to the full projection dataset. In fact, for DART in the OVX group, when reducing the full projection dataset to 1/6, the SMI increased from 2.0 to 2.1, a considerably smaller change compared to the FBP technique, maintaining also realistic values indicating a merely rod-like structure containing a few plates.

In this manuscript, DART was successfully applied to an in vivo micro-CT dataset of a rat model of osteoporosis and compared to FBP reconstructions, which is the standard method typically used in the micro-CT community. To the best of the authors’ knowledge, this is the first time that DART has been applied to in vivo micro-CT scans. Our results suggest that DART can be a viable alternative method for quantitative morphometric studies. With reduced numbers of projections, DART performed at least as well and, in some instances, better than FBP, particularly when the projection numbers were considerably reduced (from full projections to 1/6, that is 248 to 42 images), with the morphometric parameters less affected by an increase in noise speckles. Future studies might investigate DART in animal models of different pathologies and with different acquisition conditions.

Whereas DART has shown great potential in the field of electron microscopy [11], its application in X-ray micro-CT so far has been limited to merely proof of concepts studies [12, 13]. Iterative algorithms pose a high computational burden, a reason why they might not have yet been widely used in X-ray micro-CT [8]. However, recent developments in high-performance computing have led to the acceleration of iterative techniques, such as using several GPUs [20]. In the present study, all reconstructions were performed on a computer containing an Intel Xeon E5-2630 CPU @2.30 GHz, 128 GB of memory and two NVIDIA Tesla K20Xm GPU’s. We used the standard algorithm available at the Vision Lab, which usually handles smaller datasets and is not optimized for big datasets as these. Nonetheless, we attempted a comparison in computation time. A DART reconstruction (100 iterations) using all, 1/4 or 1/6 projection images required 51, 21 and 17 h to compute, respectively, which, compared to 7 min by using FBP with the GPU Reconserver (1.6.6), might seem a rather long time. However, the DART algorithm can be significantly optimized by moving several parts of the algorithm to the GPU, which would drastically reduce the final reconstruction time. This is part of future work in our group.

The strength of discrete tomography is that a priori knowledge on the attenuation values of the sample is included in the reconstruction algorithm. The downside is that this prior knowledge has to be available, which, in practice, might not always be the case. DART has been reported to improve reconstructions over FBP in specimens containing a known amount of materials, or in challenging cases with non-equiangular sampling, or when some projection data are missing, or with reduced number of projections [8]. Reducing the number of projection images reduces scanning time and related exposure to X-rays. If a typical scan lasts 20 min, a scan with 1/4 or 1/6 of the files requires 5 min or less than 4 min, respectively. The reduced radiation is an advantage ethically and experimentally, whereas the reduced scanning time is also an obvious practical advantage, particularly if many animals have to be scanned.

The limitation of this study is the low number of animals per group (three). For this reason, a non-parametric Friedman test was used instead of a parametric repeated measurement ANOVA statistical test. A Wilcoxon signed rank test (non-parametric analog to the paired t test) was not used for comparing time-related changes, or changes due to reduced projection images, as that statistical test would not give meaningful results for such small cohorts; a Mann–Whitney test was used instead.

DART relies on specific constraints of the grey levels in the reconstructed image, namely that the grey level intensity of each voxel corresponds to a specific class of attenuation values (and, hence, to a specific class of materials or tissues). However, in practice, physical effects present in X-ray micro-CT, such as noise, partial volume effect, beam hardening or scatter artefacts, which have an impact on the otherwise discrete grey levels, may call for less strict enforcement of these constraints in order to improve the accuracy of DART reconstructions. Future work might also consider the introduction of fuzzy classification into the DART algorithm in which classification parameters will be optimized by studying the residual reconstructed error [21]. Another further development of DART could be to account for the energy dependency of the (discrete) attenuation coefficients, which would enable a better assignment of each voxel to a specific attenuation class.

It could be questioned whether DART can be used for bone mineral density (BMD) analysis on a voxel scale (direct conversion of grey levels to BMD); since only a discrete amount of grey values are used when applying DART, it is not possible to look at density differences on a voxel-by-voxel basis. However, it could be feasible to assess for mean density differences between different bone samples. The grey level values used in DART are estimated using a projection distance minimization method [17], which automatically finds the optimal threshold values of the different materials in a sample; if different bone samples have a different mean density, this can be detected by using this method.

In conclusion, DART and FBP, when applied in in vivo micro-CT scans of a rat model of osteoporosis, showed similar changes in morphometric parameters over time, although the actual values were systematically overestimated (or underestimated) by DART. The DART algorithm was, however, more robust when using fewer projection images (62 or 42), where the standard FBP reconstruction was more prone to noise, showing a significantly bigger deviation from the morphometric parameters obtained using all projection images (248). This study supports the use of DART in X-ray micro-CT animal studies, in particular, when the number of projections has to be drastically minimized, which directly reduces scanning time and dose or when projections are missing.