Introduction

Spine needle injections are commonly used for analgesia to relieve pain and anesthesia to remove nerve sensation. An example is an injection into the facet joint, which is used to treat chronic lower back pain. Between 25 and 45 % of chronic lower back is attributed to the facet joint [3]. Injections into this region are particularly challenging due to the deep location, proximity to nerves and the narrow joint space. These challenges make it difficult to provide accurate injection into the joint when the procedure is performed without guidance. The current standard to guide the injection is fluoroscopy or computed tomography (CT). However, there are several drawbacks to these image modalities, including the risks posed by ionizing radiation to the patient and the physician.

Another common percutaneous spinal intervention is epidural injection. It is used before surgery as an alternative to general anesthesia, as well as in obstetrics during delivery. Fluoroscopy may only be used in non-obstetric cases because of the ionizing radiation. Palpation can be used as an alternative; however, this can be challenging in obese patients or those with a diseased spine [1]. In these cases, a different guidance modality is required.

Ultrasound (US) guidance for spinal needle insertions is gaining interest, since US is a non-ionizing and more accessible imaging modality compared to fluoroscopy or CT [2]. US has not become the standard of care in spinal injections due to the difficulty in the interpretation of anatomy. There are many ultrasound artifacts limiting the visibility of facet joints or intervertebral spaces. These include attenuation and scattering due to the thick muscles and ligaments around the spine, and acoustic shadows from the bony processes of vertebrae. To enhance the quality of US images, three-dimensional (3D) anatomical information (e.g., from a preoperative CT) can be fused with the intraoperative US images through image registration.

To move toward a clinically acceptable US-guided system for percutaneous spine interventions, the objective of this paper is to register the preoperative diagnostic CT to the intraoperative US using in vivo patient data. Over the past two decades, several research groups have attempted to tackle this challenging problem  [68, 11, 12, 19, 22, 23]. To the best of our knowledge, these approaches have been validated either on single vertebra registration or on phantom and cadaver experiments. Robust and clinically feasible registration of multiple vertebrae with clinical data has not been reported to date. The preliminary results of our approach on five subjects were presented in [14]. In this paper, we evaluate the approach on 10 subjects, provide a detailed description of the methodology and perform extensive experiments to evaluate the role of the various components of our method on the final CT to US registration accuracy.

Accurate registration of CT and US data will enable intraoperative US needle guidance, and potentially reduce or eliminate the need for intraoperative fluoroscopy or CT which restrict current spinal interventions to specialized facilities. In contrast, US scanners have no radiation risks and are accessible and portable including new pocket-sized scanners. Reliable US guidance for spinal interventions would also improve the accuracy and accessibility of such procedures, and lower the associated costs.

Materials and methods

Data

Data from the lumbar spine of 10 subjects are used to validate the proposed registration algorithm. The registration accuracy is highly dependent on the quality of the acquired US images. Therefore, a protocol is created to set guidelines for US imaging parameters and freehand US data acquisition. The aim of the protocol is to minimize variability in US image quality between subjects, operators and imaging centers. The protocol also ensures that the setup time is minimal to adhere to the current clinical practice. The demographics of the 10 subjects recruited are shown in Table 1.

Table 1 Demographics of subjects recruited for US imaging

CT data acquisition

Only subjects with previous CT scans were recruited to avoid any additional radiation exposure. Approval was obtained from Institutional Research Ethics Boards (IREB). Informed consent was acquired from the subjects who participated in the study. Preoperative CT data were collected at St. Paul’s Hospital, Vancouver, BC, Canada, and at Kingston General Hospital, Kingston, ON, Canada. The subjects were in the supine position during CT data acquisition, and the CT images were provided as anonymized DICOM files from the hospitals’ picture archiving and communication systems. Some subjects had the entire lumbar vertebrae visible in their CT, while others had only a portion of the vertebrae visible. Table 2 lists the lumbar vertebrae visible, and hence used for registration, for each subject.

Table 2 Lumbar vertebrae in the CT that are registered for each of the 10 subjects

Tracked US data acquisition

The hardware used to acquire the US images and associated tracking data is composed of a SonixTouch US scanner (Analogic, MA, USA) equipped with a Guidance Positioning System (GPS) extension (Ascension DriveBay EM position tracker, VT, USA), a C5-2 GPS curvilinear US transducer (Analogic, MA, USA) and a 800 EM tracking sensor (Ascension, VT, USA) used as the subject coordinate reference. The reference sensor is affixed on the subject’s skin above the T12 vertebra. The GPS extension has an adjustable arm that attaches to the US machine. This allows the position of the EM transmitter to be as close to the reference sensor as possible. The US transducer is tracked by the EM transmitter through an embedded pose sensor. A schematic of the hardware setup for the data acquisition system for tracked US is seen in Fig. 1. Subjects are set in the prone position for US acquisition. To help minimize changes in curvature between the CT acquired in the supine position and the US acquired in the prone position, a medium-sized pillow is placed under the subject’s stomach during the US data acquisition. The imaging and subject setup time is minimal (around five minutes) in order to adhere to the clinical environment and avoid any changes to the current clinical workflow.

Fig. 1
figure 1

Data acquisition system for tracked US. SonixTouch (Ultrasonix, Richmond, BC) ultrasound scanner; GPS extension (Ascension DriveBay electromagnetic tracker); tracked C5-2 curvilinear transducer (Ultrasonix); 3D position sensors affixed to the transducer and subject. This figure is modified from [21]

The sonographer scans the subject and adjusts the US imaging parameters, such as depth, starting with preset suggestions. The preset values were created alongside the sonographer based on US images of volunteers. Only minor manual adjustments to the imaging parameters for each subject are needed which reduces intraoperative imaging time. The sonographer then landmarks the T12 vertebra and sacrum as well as right and left extremes of the L1 transverse processes to determine the US scanning region. This ensures that the entire lumbar region of the spine is scanned. The reference holder is attached approximately 3 cm above the T12 vertebra landmark. For data acquisition, the US transducer moves slowly and smoothly, while keeping complete contact between the subject’s skin and the ultrasound transducer (minimum of twenty seconds per acquisition). By moving slowly, a dense set of 2D US images are acquired during the US scan. This decreases the degree of interpolation needed during the 3D volume reconstruction of the 2D US images and consequently improves the US volume’s image quality for CT to US registration. From the ultrasound scans of volunteers, we also determined a scanning style that produced reliable US signals across different individuals with various spinal curvatures. We refer to this style as the sagittal zigzag scan where the US transducer starts at the left L1 transverse process and moves across to the right L1 transverse process. It then moves down and across in the opposite direction to acquire US image of the entire lumbar spine.

The imaging software comprises of the freely available and open-source PLUS toolkit [10]. PLUS enables the synchronization of the US image data from the US machine and tracking data from the EM tracker. Prior to US data acquisition, we use the fCal application in PLUS with a double N-wire phantom to calibrate the US transducer for depths between 6 and 9 cm. Once an appropriate depth of the US transducer is selected by a trained sonographer to visualize the spine anatomy, PLUS connects to the hardware using the configuration file associated with the depth chosen. Following data acquisition, an US volume is reconstructed using the PLUS toolkit taking into account the physical pixel size. Only the portion of the lumbar spine visible in preoperative CT is included in the final US volume.

Preoperative CT to intraoperative US registration

The registration pipeline involves both intensity-based and point-based registration of the bone surfaces to harness the advantages of each method. The aim of the registration workflow is to maximize the similarity using both voxel information and features (points) extracted from the input volumes to find the closest alignment between the CT and US. The general overview of the registration pipeline is illustrated in Fig. 2.

Fig. 2
figure 2

General overview of the CT to US registration workflow

The approach involves three major steps: (i) automatic global intensity-based registration to align the CT and US using voxel intensity information (step 1 in Fig. 2); (ii) automatic global point-based registration using point set correspondences (step 2 in Fig. 2); and (iii) automatic multi-vertebrae point-based registration to account for possible curvature change of each vertebra along the lumbar spine step 3 in Fig. 2). Preprocessing of the US and CT volumes is required prior to registration as enhancement of bone surfaces and elimination of the variability of intensity values in the CT and US volumes will improve the final registration results. Details of the registration are presented below.

Data preprocessing

CT To automatically enhance the CT bone surface, we filter the CT data in the frequency domain using local phase image processing. In the frequency domain, the local amplitude describes the local strength of the image signal and the local phase represents the local structural features, such as discontinuities in the image. Typically, access to these local properties in 2D and 3D is accomplished through the construction of a quadrature pair of oriented band-pass filters. These filters have a \(90^\circ \) phase shift with respect to each other [17]. The Log-Gabor filter is used as the band-pass quadrature filter on the CT slices [7]. As a result of CT local phase filtering, a single feature is extracted, namely the step edge corresponding to a sharp change in the intensity of an image. Step edges occur at soft tissue to bone interfaces. A simple raycasting is done following phase filtering in the posterior to anterior direction of a CT slice such that the first bone pixel encountered for each column is saved as bone and anything below that pixel is saved as background. The raycasting helps to remove the bright intensity values that exist in CT, but do not exist in US, since US signals cannot propagate through the first proximal bone surface. A sagittal CT slice from two different subjects overlaid with the local phase bone surface enhanced from the CT followed by the raycasting is seen in Fig. 3. Finally, to improve the capture range of registration, an inverse Euclidean distance map is calculated on the enhanced bone surface CT image by computing the Euclidean distance between each pixel and the nearest nonzero pixel.

Fig. 3
figure 3

Sagittal CT slice overlaid in yellow with the phase-filtered and raycasted bone surface. a CT slice from subject 3 with its corresponding raycasted bone surface. b CT slice from subject 5 with its corresponding raycasted bone surface

For multi-vertebrae registration (step 3 of our approach), the vertebrae are segmented from the CT data preoperatively. We use the method of Rasoulian et al. [20] for this purpose.

US Local phase filtering of the US volume is performed to automatically enhance the bone surface relative to the soft tissue. The US local phase filtering differs from the CT phase filtering in that multiple edge features (step edge, line, corner, junction) have to be extracted due to the complex shape of the vertebraes appearance in US. Rather than assuming one orientation as is done in the CT local phase filtering, the US local phase filtering allows for the simultaneous estimation of the orientation and local phase information. Details of the description and implementation of this approach are provided in [7]. Subsequently, raycasting is performed from the anterior to posterior direction in order to remove any soft tissue visible above the bone signals. A sagittal US slice of two of the subjects with enhanced bone surface overlaid is shown in Fig. 4.

Fig. 4
figure 4

Sagittal US slice. The phase-filtered and raycasted bone surface is overlaid on the US slice and is shown in yellow. a US slice from subject 3. b US slice from subject 5

The bone surface of the US data has to be further segmented prior to the second step of our registration approach. We modified an algorithm originally proposed by Foroughi et al. [5]. In our approach, we use the phase-filtered US data and identify a single pixel bone surface. The bone surface pixels are enhanced by a combination of two main bone features: high acoustic impedance and acoustic shadowing. Pixels with low intensities (shadows under bones) are therefore expected below a pixel of high intensity if the pixel of high intensity is a part of the bone surface. Continuity and smoothness of the bone surface are established by minimizing a cost function using dynamic programming [5]. Our modified algorithm shows fewer false positive bone surface pixels detected, which is critical for accurate point-based registration. An US slice is overlaid with the single pixel US bone surface using the original algorithm and our modified algorithm, respectively, in Fig. 5.

Fig. 5
figure 5

Sagittal US slice overlaid in yellow with the single pixel bone surface. a US slice using the modified algorithm; b US slice using the original algorithm

Automatic global intensity-based registration of CT and US

The open-source medical imaging software 3D Slicer is used to automatically register a preprocessed CT to a preprocessed US using an intensity-based approach [15, 16]. The General Registration (BRAINS) module within 3D Slicer version 4.2 that employs the BRAINSFit algorithm is used [9]. Although the original approach was developed for multi-modality registration for the brain, it is used successfully for a variety of multi-modality applications [4, 9]. In the BRAINSFit method, mutual information is used as the similarity metric for registration. Optimal alignment is achieved when the pixel intensity information each input image contains about the other is maximized. The registration is initialized given the transformation of the center of geometry of a preprocessed CT volume to the center of geometry of a preprocessed US volume. This assumes that the center of geometry in both image modalities represents similar structures. A rigid intensity-based registration is performed using the default parameters of the module.

Automatic global point-based registration using point set correspondences

An existing probabilistic point-based registration algorithm, Coherent Point Drift (CPD) [13], is used. One of the benefits for this choice is that it has a closed-form solution. CPD aligns the CT and US point sets using Gaussian mixture models and expectation maximization optimization.

Automatic multi-vertebrae point-based registration

Vertebrae are rigid bodies; however, the intervertebral disks are deformable. CT images are taken in the supine position, whereas the US images are taken in the prone position. To correct for the possible curvature differences between CT and US data, we perform a multi-vertebrae rigid registration. The CPD rigid registration algorithm is modified to define a novel multibody approach, where at each iteration, vertebrae are transformed individually. As a result, it is possible that the vertebrae can be transformed into a pose of the lumbar spine that is not physically possible. To overcome this challenge, points are preoperatively chosen manually between two adjacent vertebrae. Each point is then duplicated where one point belongs to the superior vertebra and the other to the inferior vertebra. These points act as a spring to constrain the pose of the vertebrae during registration. When the registration first begins, the distance between two points that represent a spring is zero. To incorporate the springs in the multi-vertebrae point-based registration, the energy of the springs is added as a regulatory term to the optimization function in the registration pipeline.

To elaborate, in our approach, each vertebra is transformed by an independent rigid transformation, \(T_n\). We assume that point sets of the nth vertebra model are represented by \(X^n\). Since our algorithm does not use any a priori knowledge on the division of ultrasound points to individual vertebrae, the ensemble of target points on the ultrasound is represented by Y. In the absence of any regularization model, the traditional CPD algorithm optimizes the following cost function:

$$\begin{aligned} \sum _n {E(X^n,Y,T_n)}, \end{aligned}$$
(1)

finding the appropriate \(T_n\) for the nth vertebrae that best matches each vertebrae model to the target point set.

The cost function E has the form of \(\sum {p\times \Vert X^n \!-\!T_n(Y)\Vert ^2}\) [13], where p is the posterior probability.

In order to incorporate a biomechanical model, one can add springs between each two adjacent vertebrae. The springs, S, are represented by two ends, between any two adjacent vertebrae. For easier representation, we assume that \(S^n\) are the spring end points belonging to the nth vertebra. Given that springs are only established between adjacent vertebrae, we add R as a regularization term as described below:

$$\begin{aligned} R(T_1, \dots , T_n) = \alpha \sum _{n=1}^{N-1} {\sum _{i=1}^I\Vert T_n(S^n_i)-T_{n+1}(S^{n+1}_i)\Vert ^2}, \end{aligned}$$
(2)

where I is the number of springs on each vertebra. Note that the springs that end on the nth vertebrae are transformed by \(T_n\). Combining this regularization term with the registration term will result in:

$$\begin{aligned} \sum _{n=1}^N {E(X^n,Y,T_n)} + R(T_1, \dots , T_n). \end{aligned}$$
(3)

CPD is an iterative algorithm, which updates the transformation as the certainty of the registration increases. In our proposed method, the cost function is split into terms associated with each vertebra that are optimized separately. In other words, the cost function is optimized with respect to each transformation, \(T_n\), separately and consecutively. Removing terms that are not a function of \(T_n\), the cost function for the nth transformation can be written as:

$$\begin{aligned}&E(X^n,Y,T_n)+\alpha {\sum _{i=1}^I\Vert T_n(S^n_i)-T_{n+1}(S^{n+1}_i)\Vert ^2}\nonumber \\&\quad +\,\alpha {\sum _{i=1}^I\Vert T_n(S^n_i)-T_{n-1}(S^{n-1}_i)\Vert ^2}. \end{aligned}$$
(4)

In order to minimize Eq. 4 using the CPD closed-form solution, we combine the regularization term with the registration term, where the variable p is substituted with the constant \(\alpha \) for all spring points, i. In each iteration of the CPD algorithm, \(T_1,\dots ,T_N\) are updated consecutively. The term \(\alpha \) determines the contribution of the springs. Values of \(\alpha \) between \(2^{-3}\) and \(2^{7}\) were tested; a value of \(2^{5}\) provided the most accurate registration for all clinical datasets and was chosen.

Validation

A gold standard alignment is not possible for the clinical data used in this work as the recruited subjects have pre-existing CT, and fiducial markers that are visible in both CT and US cannot be used. In place of a gold standard, anatomical landmarks on the lamina of each vertebra are identified on the US images. Two operators choose these anatomical landmarks: one orthopedic surgeon and one physician with spine anatomy expertise. The landmarks chosen by the two operators are pooled together. In an F-test performed for one subject, interoperator variability failed to show significance (F \(=\) 0.98).

In the absence of fiducial markers, we assume the CT and US have the optimal alignment following registration. The landmark points chosen in US are assumed to correspond to the same landmarks in the CT and are visually confirmed. To determine the accuracy and precision of the registration method, the CT and the points representing the lamina landmarks are perturbed by a transformation selected randomly from a uniform distribution of 5\(^\circ \) rotation about each axis and 5 mm translation along each axis. The transformation is applied to the entire lumbar spine that is visible in the CT. The initial misalignment is determined by calculating the target registration error (TRE) between the original position of the lamina landmark points and the position of the landmarks after the initial perturbation. To determine the capture range for the registration pipeline, 20 tests are performed with misalignment errors randomly generated within the range 0–25 mm. Registration is then performed, and the final TRE is calculated as the root mean square between the transformed lamina landmark points and their original positions. A test run for one subject is shown as an example in Fig. 6 to demonstrate how this validation works pictorially.

Fig. 6
figure 6

One case of quantitative validation of the CT to US alignment using lamina landmarks placed on the US. Original positions of landmarks are shown in yellow after the alignment of the CT to the US and the transformed positions are shown in blue. a CT aligned to original position of US lamina landmarks; b CT and lamina landmarks transformed by initial perturbation; c CT and lamina landmarks transformed by CT to US transformation found through registration

A qualitative clinical validation is also investigated. Here, a point is added on the posterior dura between two adjacent vertebrae in the US images by both operators. This is where clinicians aim their needle for spinal anesthesia and thus provide a clinically relevant validation. If the points selected are in the correct region, as seen in Fig. 7, after registering the CT to the US, the registration is potentially suitable for spinal interventions.

Fig. 7
figure 7

One case of qualitative validation of the CT to US alignment using posterior dura landmarks on the US signifying where a needle’s target would be for spinal anesthesia. Landmarks are shown in yellow

Results and discussion

As mentioned before, Table 2 lists the lumbar vertebrae included in the registration for each subject. The runtime for each of the main registration components of the pipeline is shown in Table 3. The registration was performed on a Lenovo ThinkCenter, with Intel i5-3570 quad-core CPU and 16 GB of RAM.

Table 3 Runtime range in seconds for each of the registration components in the pipeline

As mentioned above, to report quantitative registration accuracies, capture range experiments were performed for all 10 subject datasets. This involved perturbing CT data that are aligned to US by 20 random transformations and re-registering the two volumes. From capture range experiments, an average of 97 % success rate is achieved. Rasoulian et al. [18] defines 2–4 mm as a clinically acceptable accuracy for epidural injections. The space within a facet joint is reported to be between 2 and 4  mm [19]. Success is defined as achieving a mean TRE of 2 mm or less to provide a conservative estimate of clinical acceptability. The mean TRE, maximum point distance and total success rate for each subject are depicted in Table 4. From the results, it is evident that the mean TRE is well below 2 mm.

Table 4 Mean TRE (mm), maximum point distance (mm) and total success rate from the CT to US registration for the 10 subjects using the full registration pipeline

The qualitative validation of registration involves overlaying contours of the CT on the sagittal, transverse and coronal planes of the US. As an example, the qualitative validation for Subject 7 is illustrated in Fig. 8. A 3D rendering of the CT is also shown in this figure. In Fig. 8, the points on the posterior dura between each two adjacent vertebrae represent the target areas for spinal anesthesia. As mentioned, the posterior dura points are placed on the US images by two operators to clinically validate the CT alignment after registration. From Fig. 8, the CT contours align in all three planes and the points on the posterior dura are all within the target area for spinal anesthesia. Points placed on the posterior dura between adjacent vertebrae may be on different US slices, and therefore, all of the points may not be visible in the coronal slice shown. Qualitative validation provides a clinically relevant result to support the quantitative validation.

Fig. 8
figure 8

3D rendering of CT vertebrae with points on the posterior dura between two vertebrae (represented in yellow) on the top right and sagittal (bottom left), transverse (top left) and coronal (bottom right) planes showing US slices with the CT contours overlaid for Subject 7

The registration pipeline is extensive in that it involves global intensity-based, global point-based and multi-vertebrae point-based registration. We evaluate the effect of each of the three main components of the registration approach (steps 1–3, Fig. 2) on the accuracy of the final alignment of CT and US data. For this purpose, results for the following capture range experiments are outlined:

  1. 1.

    Full registration pipeline.

  2. 2.

    Registration pipeline without the global intensity-based step (i.e., step 1).

  3. 3.

    Registration pipeline without the global point-based step (i.e., step 2).

  4. 4.

    Registration without the multi-vertebrae point-based step (i.e., step 3).

To determine if there is a significant difference in the TRE using the proposed pipeline compared to a modified version, p values are calculated using the Wilcoxon signed-rank test, where \(p<0.01\) is considered significant.

The effect of the intensity-based registration step

CT to US registration results are evaluated without the global intensity-based registration (step 1) and shown in Table 5. The global intensity-based registration involves aligning the bone surfaces in the CT and US that are automatically enhanced using local phase filtering. From the capture range experiments, Subject 1, Subject 4 and Subject 5 failed to show a significant difference in the TRE without the intensity-based registration (\(p = 0.02\), \(p = 0.03\), \(p = 0.39\), respectively). Subjects 2 and 3, and Subjects 6–10 did show a significant difference with an increased TRE (\(p<0.001\), \(p<0.001\), \(p<0.001\), \(p<0.001\), \(p<0.001\), \(p<0.001\), \(p<0.001\), respectively). Removing the intensity information and using solely point information is problematic for these subjects as there is limited US bone visibility in these datasets. Table 5 shows that only Subjects 4 and 5 have a mean TRE less than 2 mm. Subject 1’s mean TRE is negatively affected by three runs of the capture range experiment. These three misalignments have a large rotation about both the z and y axes that the pipeline could not correct without intensity information. It is therefore more robust to include the global intensity-based registration as the first step to provide a coarse initial registration between the CT and US.

Table 5 Mean TRE (mm) and maximum point distance (mm) from the CT to US registration for the 10 subjects using a modified registration pipeline, where the intensity-based registration step is not performed

The effect of the global point-based registration step

Following the global intensity-based registration to attain a coarse estimate of the alignment between the CT and US, global point-based registration is performed in our proposed approach. Next, we consider whether the global point-based registration step is necessary. From the results of capture range experiments (Table 6), although Subject 6 did show a significant difference in the accuracy (\(p < 0.001\)), the mean TRE for that subject is still below the clinical threshold of 2 mm. Subject 4 was the only subject that failed to show a significant difference in the accuracy (\(p = 0.052\)). Table 6 shows that the mean TRE for most subjects are too high for clinical use in percutaneous spinal interventions. Subject 4 had 6/20 runs that were unsuccessful and negatively affected the mean TRE. In these cases, the intensity-based registration incorrectly aligned the CT vertebral bodies to the lamina of the US. The global point-based registration corrects instances where the global intensity-based step does not provide a close initial alignment between the CT and US. Since the final registration step is constrained by the springs, movement occurs between adjacent vertebrae. A close global alignment is required as there is limited global movement of vertebrae in the multi-vertebrae registration.

Table 6 Mean TRE (mm) and maximum point distance (mm) from the CT to US registration for the 10 subjects using a modified registration pipeline, where the global point-based registration step is not performed

The effect of multi-vertebrae point-based registration step

In the last step of registration, springs are used to constrain the multi-vertebrae point-based registration. As vertebrae are the rigid bodies transformed individually at each iteration of the registration, without the springs, registration can result in a pose of the lumbar spine that is not physically possible. This includes two vertebrae intersecting (collision). An example from one run of Subject 10 is shown in Fig. 9. We considered the effect of removing multi-vertebrae point-based registration from the proposed registration pipeline. From the capture range experiments (Table 7), all subjects had a significant loss in accuracy when multi-vertebrae registration is not performed in the pipeline (\(p = 0.001\) for Subject 5 and \(p < 0.001\) for all other subjects). Only Subject 4 has a clinically acceptable mean TRE value.

Fig. 9
figure 9

Registration result after one run for Subject 10 showing a spine pose that is not physically possible. The left panel shows a superior to inferior view of the lumbar spine; the right panel shows a posterior to anterior view of the lumbar spine. Here, L1 and L2 are colliding and that L2, L3, L4, L5 have moved away from each other more than what is physically possible

Table 7 Mean TRE (mm) and maximum point distance (mm) from the CT to US registration for the 10 subjects using a modified registration pipeline, where the springs in the piecewise point-based registration are not used

Conclusion and future work

We presented a novel registration pipeline for the lumbar spine that accurately aligns preoperative CT with intraoperative US, and validated it using 10 clinical datasets. By aligning the CT with the US, anatomical information that is not visible in US is provided to the clinician to guide spine needle interventions without having to expose the physician or patient to more radiation intraoperatively. To the best of our knowledge, this is the first work where multiple vertebrae are registered between CT and US using clinical data.

The proposed registration pipeline shows great promise for guiding percutaneous spine needle procedures, but further improvements are needed for its clinical use. The location of the springs that constrain multi-vertebrae registration is chosen manually preoperatively; however, improvements can be made to automate spring selection for a clinical workflow. In addition, in the multi-vertebrae point-based registration step, the weight \(\alpha \) is constant for all spring points; this could be adaptive depending on where the spring points are between the vertebrae. Finally, the registration pipeline is extensive so it would be useful if steps were only included if the registration accuracy was not clinically acceptable with a simpler version of the pipeline. The registration pipeline needs to be executed on a larger number of patient datasets to further demonstrate its robustness.