Introduction

Over 400,000 spinal fusion procedures are performed annually in the USA [1], with a 56.4 % increase between 2003 and 2012 [2]. Spinal fusion surgery is the standard of care procedure for treating various spinal conditions involving scoliosis, spinal stenosis, degenerative disk disease and spondylolisthesis [1]. During surgery, pedicle screws are used to fix metal plates and rods to support the spine. However, the procedure is challenging and associated with potentially high neurological, vascular or mechanical complication risks [3,4,5]. Image-guided surgery (IGS) systems demonstrated a significant decrease in breach and screw malpositioning rates [6,7,8] by providing spatial positions of tracked surgical instruments in the operative field. During navigation, a rigid registration procedure is performed to align the patient’s anatomy to preoperative images. In addition, a dynamic reference object (DRO) is rigidly attached to a vertebra and serves as a reference coordinate frame to account for patient position and motion during surgery.

In a standard clinical procedure, registration is achieved with landmark-based technique, by manually identifying homologous anatomical landmarks on both the images and the patient. The procedure can take 10–15 min for each vertebra [9, 10]. This approach is tedious, extends the operating time and is subject to operator variability. Commercial navigation systems for spine surgery, e.g., the O-arm (Medtronic inc., Minneapolis, MN), enable acquisition of 2D fluoroscopy or 3D computed tomography (CT) intra-operative images. This provides updates of the anatomy and instrument locations during surgery [5, 8, 11], but introduces risks of harmful radiation exposure for both the patient and the operating room (OR) personnel [12, 13]. Moreover, the setup time is \(\sim \) 15 min [14] and extra personnel is required for equipment manipulation, which further extends the surgical procedure time and costs.

Another important issue in spine IGS is the loss of accuracy as the surgery duration increases [15]. Patient movement, surgical actions such as drilling or tapping, and inadvertent contact with the DRO may invalidate the registration. While intra-operative ultrasound (iUS) provides the ability to acquire images that can be used to correct for misregistration during surgery, the overall procedure is time demanding in the surgical workflow. Current iUS-based registration techniques require 2–5 min per vertebra [16,17,18,19], precluding frequent registration corrections. In addition, a manual landmark-based registration is commonly performed to initialize the registration transform. Our goal is to achieve frequent and accurate re-registration with minimal or no impact on the overall surgery time. In a preliminary work [20], we presented a framework for a fast rigid CT-to-iUS image registration of a single vertebra, designed for spinal fusion surgery. This paper provides further analysis and validation to explore the effects of iUS acquisition variability on the registration accuracy.

Related work

CT-to-iUS registration

A common approach to register CT-to-iUS images for spine surgery is to use the posterior surface of the vertebra. Yan et al. [9, 16] proposed to maximize the intensity cross-correlation of the posterior vertebra surface. The reported median target registration error (TRE) ranged between 1.65 and 2.31 mm on porcine cadavers. A slice-to-volume variant of the approach proposed in [17], in which the registration is performed without iUS volume reconstruction to accelerate the computations, achieved comparable accuracy with registration time around 120 s per vertebra. Koo et al. [18] extended the approach by including an additional intensity-based rigid registration step performed between the vertebra surface on the CT image and the original iUS image. The average TRE reported was 2.18 mm ± 0.82 mm (ranging between 0.89 and 4.45 mm) on a porcine cadaver. The registration was achieved in \(\sim \) 100 s per vertebra.

In order to avoid errors related to iUS surface extraction, Brendel et al. [21] proposed to use a surface-to-volume registration, by maximizing the sum of the iUS gray levels at the CT vertebral surface locations. The approach was used to successfully register the lumbar segment of an ex vivo human spine preparation. Winter et al. [22] used a similar approach with additional pre-processing of iUS images to enhance the bone surface. The accuracy was reported in terms of points-to-surface distances and showed good results under small initial misalignments (below 10 mm translation and \(12^\circ \) rotation), achieving the best success rate of 88.4 % under 1 mm distance.

To account for spine curvature, Gill et al. [23] used a biomechanical model to represent coherent intervertebral structure displacement between independently registered vertebrae. Alignment of each vertebra is obtained by performing an intensity-based rigid registration of an ultrasound-simulated image calculated from the CT image with the real iUS image. The reported average TRE ranged from 0.62 to 2.26 mm for an average computation time of 43 min. In order to reduce the computation time, the same group [24] proposed a spring-based biomechanical model to account for intervertebral disk motion. The model acts as an intervertebral spring force used as a regularization term in a point-based registration [25]. The average accuracy was 2.2 mm on lumbar segment of a sheep cadaver for 15 min of computation time.

Nagpal et al. [19] proposed a multi-vertebrae CT-to-iUS registration framework. First, a point-based registration is achieved using the vertebra surface points extracted from iUS and CT images. To account for the spine curvature over multiple vertebrae, an additional group-wise vertebra registration is performed, in which intervertebral points are manually added to prevent physically incoherent transformations. Because the study was conducted on clinical data of human subjects, a ground truth registration was not available. The authors used manual landmark registration combined with the proposed method to serve as ground truth registration. They reported average TRE of anatomical landmarks ranging from 0.71 to 1.70 mm and a computation time ranging from 50 to 185 s, excluding the time required by the operator to manually set intervertebral points.

Clinical limitations

Fig. 1
figure 1

Flowchart of the proposed registration framework

A common limitation of the aforementioned approaches is the need of an initial alignment, required under the assumption of a small initial misalignment. This is expected to be achieved prior to the registration with a landmark-based manual registration. Although Nagpal et al. [19] prevented this step by assuming the same anatomical structures (i.e., the same number of vertebrae) to be imaged in both iUS and CT scans, the initial alignment is obtained by applying an intensity-based registration, which may be computationally expensive. Another clinical issue often neglected is the variability and the duration of the iUS acquisition procedure during surgery. As the acquisition is performed intra-operatively, pre-processing, such as volume reconstruction and vertebra surface extraction, needs to be performed shortly after the acquisition. However, the computation time of the pre-processing tasks has not been reported in the aforementioned works. While an accurate bone extraction method takes approximately 9 s per image [26], a straightforward ray tracing approach would be less accurate but much faster. In this paper, a compromise between accuracy and computation time is considered while designing our registration approach.

Image registration

An overview of our registration framework is shown in Fig. 1. The preoperative stage involves two steps: extract the posterior surface of the vertebra in CT images (1), and extract CT anatomy orientation points (2). During the surgery, five intra-operative steps are performed: extract probe’s trajectory points (3), extract the posterior surface of the vertebra on iUS images (4), create an iUS compounded volume from iUS acquisition slices (5), estimate the initial alignment using the iUS scan trajectory and the CT anatomy points (6), and perform a multi-scale gradient alignment of the vertebra surfaces of CT and iUS images (7). Note that the posterior vertebra surfaces on CT were extracted using the forward tracing method [9]. A similar approach was used on 2D iUS images. However, surface points on iUS correspond to the position where the maximum value was encountered on the tracing. In addition, because we use gradient information in the final alignment step, the resolution of the iUS compounded volume is to be taken into account as a too sparse volume precludes capturing inter-slice gradient information. This will be discussed in the experimental validation.

Fig. 2
figure 2

Ultrasound acquisition procedure: a illustration of the iUS acquisition sweep direction, and b extraction of anatomy points from the iUS scan trajectory (red circles represent image centers)

Intra-operative ultrasound image acquisition

Ultrasound images were acquired using a locally developed IGS system composed of an optical tracking camera (Polaris, Northern Digital Inc., ON, Canada), an ultrasound machine with a tracked phased array probe (HDI 5000/P4-7, Philips, Amsterdam, Netherlands) and a tracked tool used as a DRO. A saline solution (0.9 % NaCl) is poured into the cavity to facilitate ultrasound propagation. Isotonic saline solutions are commonly used in open surgeries to irrigate, rinse or improve image quality. The Intra-operative Brain Imaging System (IBIS) [27] open-source platform was used to calibrate the probe, visualize 3D data and record the iUS acquisitions. Similar to an image-guided surgery procedure, the spatial position and orientation of tracked tools and intra-operative images were recorded relative to the DRO coordinate system. In order to reduce computation time, the iUS acquisition was sub-sampled so that a minimum distance \(d \in \mathbb {R}_{\ge 0}\) separates the center point of successive frames. A high value of d yields a sparse volume and fast computations, while a value of zero does not modify the acquisition. The frames satisfying the distance criterion are processed in the next steps.

Ultrasound volume compounding

The ultrasound frames are combined into a single volume by aggregating the ultrasound slices to form a resampled volume, called the compounded volume. This avoids a full volume reconstruction by interpolating in-between slice voxels, as proposed in [17]. Because the relationship between the spatial positions of the ultrasound slices is fixed, registering the compounded volume to the CT volume is analogous to simultaneously optimizing for a slice-to-volume rigid body registration of each individual iUS slice to the CT volume. In our implementation, each iUS pixel intensity is resampled in its corresponding 3D location in the compounded volume, and the intensities are averaged for overlapping pixels. It is important to consider the spatial resolution of the resampled compounded volume. While a fine resolution results in a large but highly sparse volume, a coarse resolution results in a small but dense volume. Note that because we use gradient information in the final alignment step, a too sparse volume precludes capturing inter-slice gradient information.

Initial alignment

In order to guess the initial alignment, we define a simple sweep procedure to limit the variability in the translational and angular positioning of the iUS probe during the acquisition. The procedure consists in a single axial sweep along the caudo-cranial direction, starting from the inferior extremity up to the superior extremity of the vertebra, with the probe orientation roughly normal to the coronal plane (Fig. 2a).

Similar to the approach proposed by Nagpal et al. [19], we consider that both CT and iUS images represent the same vertebra. In clinic, the vertebral level is identified before the surgical opening with palpation and confirmed by 2D fluoroscopy. Acquiring iUS images of the required vertebra is left to the surgeon’s expertise. Although automatic methods to identify the vertebra extents on iUS images would be desired, this is beyond the scope of this paper. For now, we assume that the iUS scan is performed such that the same number of vertebrae appears in iUS and CT images. Thus, the center of mass of the iUS frame roughly corresponds to the center of the CT image. Note that although this center point, on both CT and iUS images, does not necessarily correspond to the anatomical center of the vertebra, the constrained probe orientation provides a roughly good estimate of the initial rotation. The proposed acquisition procedure constrains the scan trajectory to be approximately linear along the inferior to superior axis. Moreover, on the iUS image plane, the proximal to distal axis from the probe’s transducers corresponds to the posterior to anterior axis on the vertebra. Based on these properties, the caudo-cranial direction of the iUS scan trajectory is estimated using a linear regression of the 3D coordinates of image centers. Then, three anatomical points can be created on the physical space (see Fig. 2b): a center of mass \(\mathbf {p}_{\text {mass}}^{\text {US}}\), a superior point \(\mathbf {p}_{\text {sup}}^{\text {US}}\) at a 10 mm distance from \(\mathbf {p}_{\text {mass}}^{\text {US}}\) toward the superior direction, and a distal point \(\mathbf {p}_{\text {distal}}^{\text {US}}\) at a 10 mm distance from \(\mathbf {p}_{\text {mass}}^{\text {US}}\) toward the anterior direction. Similarly, three homologous points \(\mathbf {p}_{\text {mass}}^{\text {CT}}\), \(\mathbf {p}_{\text {sup}}^{\text {CT}}\) and \(\mathbf {p}_{\text {distal}}^{\text {CT}}\) are created on the CT image, preoperatively. Finally, the initial alignment transform \(\mathbf {T}^{\text {init}}\) is obtained by minimizing the least-square distances between the CT and the iUS points.

Gradient alignment registration

The initial alignment approach roughly registers the CT-to-iUS images, based on the acquisition procedure described in Sect. 3.3. To refine the registration, we perform a gradient alignment registration [28]. The approach consists in using the orientation of the image intensity gradient vectors to estimate the similarity between two images. The similarity metric is given by the inner product between normalized gradients:

$$\begin{aligned} S(\nabla I_\text {US}(\mathbf {x}), \nabla I_\text {CT}(\mathbf {x})) = \left\langle \frac{\nabla I_\text {US}(\mathbf {x})}{|\nabla I_\text {US}(\mathbf {x})|}, \frac{\nabla I_\text {CT}(\mathbf {x})}{|\nabla I_\text {CT}(\mathbf {x})|} \right\rangle ^n, \end{aligned}$$
(1)

where \(\mathbf {x}\) is the image coordinate vector, \(\nabla I_\text {US}\) and \(\nabla I_\text {CT}\) correspond to the gradient of the fixed iUS compounded volume and the moving CT volume of the extracted posterior surface, respectively, and \(n \in \mathbb {N}\) is a free parameter which characterizes the matching criterion and was set to \(n=64\) (as suggested in [29]). Gradient images \(\nabla I_\text {US}\) and \(\nabla I_\text {CT}\) are computed on GPU by convolving the image with a Gaussian derivative operator. Evaluation of the similarity metric in Eq. 1 is also performed on a GPU to accelerate the processing.

Table 1 CT sub-volumes and iUS acquisition details

Covariance matrix adaptation evolution strategy (CMA-ES) [30] has been shown to provide good results for rigid vertebra registration [22, 23] and was used to optimize for the final registration transform, given by:

$$\begin{aligned} T^\text {reg} = \arg \max _T S\Big (\nabla I_\text {US}(\mathbf {x}), \nabla I_\text {CT}(T(\mathbf {x}))\Big ). \end{aligned}$$
(2)

We modified the algorithm to take into account the vertebra surface on the intra-operative images. In the original approach, the metric is computed on a subset of points sampled from the image with high gradient magnitude, which has been shown to correspond to points with the lowest uncertainty [29]. In our approach, the points are sampled from a 2-mm-thick region around the iUS extracted vertebra surface. Then, candidates satisfying the gradient magnitude criterion (i.e., with low uncertainty) were selected among the vertebra surface points to be used in Eq. 2. Our hypothesis is that gradient selectivity reduces the effect of surface extraction errors on the registration accuracy. The gradient orientation alignment metric implies that candidate gradients among the vertebra surface have a high gradient magnitude. Consequently, candidates with low gradient magnitude (high uncertainty) are discarded, which reduces risks of evaluating the metric on homogeneous regions that tend to be non-surface voxels.

Finally, we perform the registration using a multi-scale approach. Two different scales are used (\(K=2\)). In the first pass, the images are smoothed using a Gaussian filter with \(\sigma =2\,\text {mm}\) to capture large structures, e.g., thicker surface of the vertebra. A second registration pass is performed on images filtered with \(\sigma =1\,\text {mm}\).

Evaluation

Data

In this study, we used the same data described in [16]. The dataset consists in lumbosacral section of a porcine cadaver that underwent a CT and iUS scans. A PQ6000 CT scanner (Picker International Inc., Cleveland, OH, USA) was used to acquire images of vertebrae T15 to L6. The scan was performed from anatomical superior to inferior direction with the specimen placed in supine position. The CT images have a resolution of \(0.35 \times 0.35\,\text {mm}^2\) and a slice thickness of 2 mm. The entire CT volume was divided into sub-volumes, each one containing a single vertebra (Table 1). Using fiducial markers, a ground truth registration transform was obtained for each vertebra (see [16] for details).

Measurements

Because the fiducials were placed far from the vertebra surface, a small misalignment of the fiducial points may result in a large TRE at the vertebra surface. Therefore, the fiducial registration error (FRE), computed on fiducial locations, may not be representative of the registration accuracy. To evaluate the quality of the registration, TRE was computed on seven landmarks manually identified on the surface of each vertebra on the CT images. The anatomical landmarks correspond to: a point on the apex of the spinous process, two points on the left and right laminae, two points on the left and right superior articular processes and two points on the left and right inferior articular processes. The TRE of each vertebra is obtained by:

$$\begin{aligned} \text {TRE}_v = \sqrt{\frac{1}{7} \sum _{i=1}^7 |T^\text {gt} \mathbf {p}_i - T^\text {reg} \mathbf {p}_i|^2}, \end{aligned}$$
(3)

where v is the vertebral level, \(T^{\text {gt}}\) is the ground truth registration transform obtained from fiducial point-based registration, \(T^\text {reg}\) is the computed registration transform and \(\mathbf {p}_i\) is the \(i\text {th}\) landmark point manually positioned on the vertebra surface. As determined by the consensus report on spine IGS [31], we fix the acceptance threshold for successful registrations to 2 mm. We report the success rate (in %) as the fraction of registration trials achieving a TRE below 2 mm.

In addition to registration accuracy, we measured the computation time required to perform the registration of each vertebra. The computations involve three main tasks: extracting the vertebra surface (backward tracing), compounding the iUS volume and aligning CT-to-iUS volumes (i.e., solving Eq. 2). We also report the computation time required to perform the initial alignment, although it can be neglected due to its small contribution to the overall registration time. Note that the iUS acquisition time, i.e., the time required to manipulate the iUS probe and perform the sweep, is not reported in this study, but takes approximately 20 s. All computations were performed using an \(\hbox {Intel}^{\copyright }\) CoreTM i7-3820 CPU at \(3.6\,\text {GHz} \times 8\) station and a NVIDIA GeForce GTX 670 graphics card with \(4\,\text {Gb}\) of memory.

Fig. 3
figure 3

Qualitative registration results at various vertebral levels: a alignment of iUS image (gray), iUS extracted posterior surface of vertebra (blue) and CT extracted posterior surface of vertebra (green); b alignment of iUS image (heat map), CT image (gray) and CT extracted posterior surface of vertebra (green); c 3D views of iUS slice aligned with CT volume

Experiments

Two experiments were carried out with the following goals: (1) to assess the overall registration accuracy and the effect of the iUS acquisition sweep on the registration and (2) to assess the effect of the iUS volume compounding and acquisition frame rate on the registration accuracy and computation time.

Experiment 1: sensitivity to iUS acquisition

We use three iUS acquisitions that satisfy the requirements of the procedure described in Sect. 3.1. Each acquisition consists in a single axial sweep at approximately 3 cm posterior to the apex of the spinous process, such that: A1 is centered on the spinous process (ultrasound sweep No. 1 in [16]), A2 is 1 cm to the left of the spinous process (ultrasound sweep No. 2 in [16]), and A3 is 1 cm to the right of the spinous process (ultrasound sweep No. 3 in [16]). Note that for consistency the sweep directions were inverted to match the inferior to posterior direction.

For each vertebra, 100 registration trials were performed to account for the stochastic nature of the CMA-ES optimizer. The vertebra surface extraction on the iUS images and the volume compounding were performed every 10 trials to measure the computation time. We set \(d=0.5\) mm for acquisition sub-sampling (see Sect. 3.1). The resolution of the iUS compounded volume is set to \(1.5\times 1.5 \times 1.5\) \(\hbox {mm}^3\), to produce sufficiently dense volumes. In total, 100 registrations \(\times \) 7 vertebral levels \(\times \) 3 iUS acquisitions = 2100 trials were performed.

Experiment 2: sensitivity to parameter variation

To evaluate the sensitivity of our registration approach to the iUS compounded volume resolution parameter R and the sub-sampling parameter d, we experimented the effect of different values on the registration outcomes. Using the axial centered iUS acquisition A1, we varied the parameter R to take values in (\( 0.5\times 0.5 \times 0.5\) \(\hbox {mm}^3, 1\times 1 \times 1\) \(\hbox {mm}^3, 1.5\times 1.5 \times 1.5\) \(\hbox {mm}^3\)) and the parameter d to take values in (\(0\,\text {mm}, 0.5\,\text {mm}, 1\,\text {mm}\)). Similar to experiment 1, 100 registrations were performed for each vertebra. The vertebra surface extraction on the iUS images and the volume compounding were performed every 20 trials to measure the computation time. In total, 100 registrations \(\times \) 7 vertebral levels \(\times \) 3 sub-sampling rates \(\times \) 3 iUS volume compounding resolutions = 6300 trials were performed in this experiment. Note that the reported results obtained with the downsampling parameter \(d=0.5\) mm and the iUS compounded volume resolution \(R = 1.5\times 1.5 \times 1.5\) \(\hbox {mm}^3\) have already been evaluated in experiment 1.

Results and discussion

Figure 3 shows qualitative registration results obtained for different vertebrae with the proposed framework. The iUS images appear to be aligned with the surface of the vertebra on the CT image. The registration can be visually assessed based on the apex of the spinous process and the inferior articular processes.

Results of experiment 1

Accuracy and computation time results for the three iUS acquisitions are summarized in Table 2. The overall TRE for each acquisition is slightly better than the results reported in [16]. The axial centered acquisition A1 yielded the best results with a median TRE of 1.48 mm (IQR 0.68 mm) which is below the acceptance threshold of 2 mm with a success rate of \(84.42\%\).

Notice that the results obtained at L4 seem to be the worst, with a median TRE of 2.03 mm, 2.07 mm and 2.36 mm with A1, A2 and A3, respectively. We suspect the reason behind this large error to be related to the large FRE of 0.593 mm induced when the ground truth registration was generated at L4. In fact, since the fiducials were intentionally placed far from the vertebra, in order to preclude interference with the iUS scans, a small value of the FRE would lead to a large displacement at the vertebra surface where the TRE is being computed. Therefore, the assessment of the accuracy at L4 is rendered less reliable. Considering the established 2-mm increments breach grading system (intrapedicular, 0–2 mm breach, 2–4 mm breach, > 4 mm breach) [32, 33], we define the threshold value of 4 mm to represent a major registration failure. The error range reported at L4 is between 1.25 mm and 3.14 mm, preventing major failures. Over the 2,100 trials, we note 7 cases (\(< 0.004\,\%\)) of major failures.

Table 2 Registration accuracy and computation time results for each iUS acquisition

The number of selected frames and the computation time for each vertebra registration of the axial centered acquisition A1 are detailed in Table 3. The average overall registration time is \(0.742\,\text {s} \pm 0.037\,\text {s}\) per vertebra. This includes both the initial alignment and the gradient alignment processing times. Including the reconstruction and the registration, the total procedure time is \(10.79\,\text {s} \pm 1.27\,\text {s}\), which is practical in the OR. In fact, this is significantly lower than the 2 min reported by Yan et al. [17] and the 100 s reported by Koo et al. [18] per each vertebra registration. The registration time ranging between 50 and 185 s reported by Nagpal et al. [19] involved multiple vertebrae registration, precluding a direct comparison. It should be noted that all the aforementioned works did not include the iUS volume reconstruction time, which is expected to be performed after acquiring the iUS images during the surgery. In our approach, the computational bottleneck is associated with the reconstruction step with a \(93\,\%\) contribution to the overall registration time. Further improvements to reduce the computation time for the posterior surface of vertebra extraction and the iUS volume compounding could be the subject of future work.

Table 3 Computation time results for the axial centered acquisition

During the iUS acquisition, two types of variations can occur: (1) small variations of the probe positions (noise) along every direction caused by inadvertent hand motions, and (2) accumulation of variations that cause a drift of the acquisition trajectory. Our method aims at finding the linear trajectory of the probe. Therefore, compared to the average length of the probe’s trajectory of 4.2 cm (see Table 1), the residual noise is assumed to be significantly smaller and normally distributed along each direction. On the other hand, acquisition drifts are more likely to occur along the posterior–anterior axis then in the left–right axis, due to the superior-anterior cavity opening. As an effect, the linear regression would yield a rotational misalignment around the left–right axis. In our experiment, the iUS scans were acquired in a realistic scenario, i.e., in a standard open-back spine surgery on a porcine cadaver. The left–right as well as posterior–anterior variations are taken into consideration, as the acquisitions were manually performed. The results obtained demonstrate that our registration approach allows to recover from the variations in the 21 evaluated acquisitions.

Fig. 4
figure 4

Results of experiment 2 on the sensitivity to the sub-sampling (frame rate) parameter d and the iUS volume resolution parameter R: a computation time results, and b accuracy results

The stochastic outcome of the CMA-ES optimization method yields a variability in the TRE results. However, the interquartile ranges of 0.68 mm, 0.63 mm and 0.78 mm obtained using the centered axial, the left axial and the right axial acquisitions, respectively, are smaller than those reported in [16] with different initial starts. In a clinical context, the robustness of the method is critical for the registration to be practical in IGS. While major registration failures (TRE > 4 mm) are visually identifiable during surgery, allowing appropriate decisions to be taken (e.g., re-scanning the vertebra or re-running the registration), registration failures resulting in a TRE ranging between 2 mm and 4 mm are harder to be detected visually. Our registration approach showed a good statistical robustness to CMA-ES variability with an average success rate of 84.42 % over the 700 trials when using the centered axial acquisition.

Results of experiment 2

Figure 4 shows the effect of the frame rate sub-sampling and the compounded volume resolution on the registration accuracy and computation time. A repeated measure analysis of variance (ANOVA) test showed that reducing the number of acquisition frames allows to reduce the reconstruction time without a significant effect on the accuracy (\(F_{2,12} = 1.10, p=0.36\)). On the other hand, although the computation time decreases with a coarser resolution, the registration accuracy is affected (\(F_{2,12} = 12.01, p=0.001\)).

In a clinical context, the trade-off between speed and accuracy is important. While the overall accuracy showed good robustness to coarse resolutions (overall median TRE of 1.56 mm and IQR = 0.67 mm), the registration time varied significantly from 40.25 s for fine resolutions to 5.61 s for coarse resolutions. A compromise between speed and accuracy suggests a sub-sampling rate of \(d=0.5\,\text {mm}\) and an iUS compounding volume resolution between \(R=1\times 1 \times 1\,\text {mm}^3\) and \(R=1.5\times 1.5 \times 1.5\,\text {mm}^3\).

Conclusion

Rapid and accurate CT-to-iUS registration of the vertebrae is crucial for spine surgical navigation procedures. During the intervention, patient movement, bone removal and drilling instrumentations or contact with the tracked reference may invalidate the registration. In this case, the registration procedure has to be recomputed efficiently. This paper presents a framework to rigidly align preoperative CT with the intra-operative ultrasound images of a single vertebra. Experiments conducted with multiple iUS acquisitions on a lumbosacral section of a porcine cadaver yielded an overall median accuracy of 1.48 mm with \(\sim \) 11 s of computation time when using the proposed iUS sweeping procedure. The method is robust, as shown with off-center iUS sweep accuracies of 1.69 mm and 1.70 mm. With a clinically acceptable accuracy below 2 mm and a straightforward iUS acquisition procedure, the approach can be easily integrated into the surgical workflow. The entire registration procedure is expected to be completed in less than a minute, which includes preparing the cavity with a saline solution, acquiring iUS images and performing the registration. This would significantly reduce the typical 15–30 min required for intra-operative CT imaging (time includes: draping the patient, adjusting patient table, positioning the scanner, requiring the staff to leave the OR for radiation safety, acquiring the images and removing the scanner), without additional exposure to radiation.

Rigid registration is adapted to the bony structure of the vertebra. However, because of the spine curvature flexibility, this statement becomes less valid as the number of vertebrae in a single scan increases. Abdominal respiratory motions or changes between the preoperative supine posture and intra-operative prone posture of the patient may induce deformations of the spinal column. Therefore, it is important to account for the spine curvature, especially in the cervical section, where the intervertebral displacement can be large. Moreover, anatomical structure of the spine differs in the cervical, thoracic and lumbar sections; thus, the accuracy required in fusion surgery varies accordingly. Our future work will focus on investigating effects of the anatomy on rigid registration in the thoracic and cervical sections.