Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

Intervertebral disc (IVD) degeneration is a major cause for chronic back pain and function incapacity [1]. Magnetic Resonance Imaging (MRI) has become one of the key investigative tools in clinical practice to image the spine with IVD degeneration not only because MRI is non-invasive and does not use ionizing radiation, but more importantly because it offers good soft tissue contrast which allows visualization of the disc’s internal structure [2].

MRI quantification has great potential as a tool for the diagnosis of disc pathology but before quantifying disc information, the IVDs need to be extracted from the MRI data. IVD extraction from MRI data comprises two key steps. Firstly, all IVDs have to be detected from the images and secondly, the regions belonging to IVDs have to be segmented. Manual extraction methods [3, 4] as well as automated extraction methods [511] have been presented before. Since manual extraction is a tedious and time-consuming process which lacks repeatability, automated methods are preferred.

There are different approaches for automatizing the extraction of IVDs from medical images such as graphical model [5], probabilistic model [6], Random Forest regression and classification [12, 13], watershed algorithm [7], atlas registration [8], statistic shape model [10], graph cuts [9], and anisotropic oriented flux [11]. But stable and accurate IVD segmentation remains a challenge.

In this paper we propose an automated 3D lumbar IVD extraction method with minimal user interaction from MRI data sets. The main contribution of our method is a combination of graphical model-based spine column localization with a multi-kernel diffeomorphic registration based segmentation. The 3D IVDs are extracted with a two-step procedure where we first identify the spine column structure and then carry out the IVD segmentation. The motivation behind this two-step procedure can be explained as follows. The IVD geometries are highly constrained by the geometry of the spine column. If the geometrical parameters of the spine column and each individual vertebral body can be estimated accurately from the observed images, then they can provide both geometrical and appearance information about the intervertebral discs, which helps to improve the accuracy and robustness of the IVD segmentation.

The work flow of the proposed algorithm consists of following three steps

  • Initialization. Two user supplied landmarks are required to indicate the centers of L1 and L5 vertebral bodies.

  • Lumbar spine column identification and modeling. Starting from the user initialization, the 3D geometry of the lumbar spine column is automatically extracted from the 3D data sets. The outputs are the 3D geometric information of each individual vertebral body of L1-L5 and a soft-tube model that fits the outer surface of the lumbar spine column.

  • Lumbar disc segmentation. Based on the prior information of the extracted lumbar spine column, the disc segmentation is achieved as a multi-kernel diffeomorphic registration between a disc template and the observed data.

2 Methods

2.1 Data Sets

All datasets used in this paper were generated from a 1.5 Tesla MRI scanner (Siemens, Erlangen, Germany). Dixon protocol was used to reconstruct four aligned high-resolution 3D volumes during one data acquisition: in-phase, opposed-phase, water and fat images, as shown in Fig. 1. Each volume has a resolution of \(2\times 1.25\times 1.25\) mm\(^3\) and the data set size is \(40\times 512\times 512\). The advantage of working with such datasets is that different channels provide complementary information for our disc segmentation task. In our proposed segmentation strategy, we always first extract either intensity or feature information about different tissues on each channel and then combine the 4 channel data into a single dataset.

Fig. 1
figure 1

The four aligned channels of a patient data (for visualization purpose, we only show the middle sagittal (mid-sagittal) slice of each channel.)

Fig. 2
figure 2

Initialization and 2D lumbar spine column detection. a User initialization by picking two landmarks indicating the centers of L1 and L5 in the middle sagittal slice. b Probability assignment (displayed as grey values) of the bone tissue in the mid-sagittal slice for 2D lumbar spine column detection. c 2D lumbar spine column detection result using the graphical model based detection algorithm, blue and green rectangles representing the vertebral bodies and IVDs respectively. d Graphical model of 2D lumbar column detection (Color figure online)

2.2 Lumbar Spine Column Identification

On the mid-sagittal slice, two landmarks are picked to indicate the centers of L1 and L5 vertebral bodies as shown in Fig. 2a. Starting from the initialization, we first carry out a 2D vertebral body and disc identification to localize vertebrae L1-L5 and the 5 target discs from the mid-sagittal slice. The geometrical information of the 2D identification is then used to guide a further 3D lumbar spine column modeling.

2.2.1 2D Vertebral Body and Disc Identification

Solutions for spine location and disc labeling include feature-based bottom-up methods, statistical model-based methods and graphical model-based solutions. For a detailed review of the existing methods, we refer to [14]. In this paper, the 2D vertebral body and disc identification is achieved using a graphical model based strategy introduced in [14]. Compared with the graphical models in [5, 6], the advantage of the graphical model in [14] is that both the low level image observation model and the high level vertebra context potentials need not to be learned from training data. Instead they are capable of self-learning from the image data during the inference procedure. For completeness, here we describe the key components of the method that we previously introduced [14].

  1. 1.

    The graphical model: The graphical model is given in Fig. 2d. Each node \(V_{i}\) represents a connected disc-vertebrae-disc chain of the spine, whose geometrical parameters are given by \(X_{i}\). We define

    • The component observation model. \(p(I|X_{i})\) of a single component \(V_{i}\) representing the probability that the configuration \(X_{i}\) of the node \(V_{i}\) match the observed images \(I\).

    • The potentials. \(p(X_{i},X_{j})\) between neighboring components \(V_{i}\) and \(V_{j}\) encoding the geometrical constraints between components which are defined by the anatomical structure of the spine column.

    The identification of the spine column from the mid-sagittal slice can then be formalized as to find the optimal configurations of \(\{V_{i}\}\), \(X=\{X_{i}\}\) that maximize

    $$\begin{aligned} P(X|I)\propto \varPi _{i}p(I|X_{i})\varPi _{i,j}p(X_{i},X_{j}) \end{aligned}$$
    (1)

    with

    $$\begin{aligned} p(I|X_{i})=p_{I}(I|X_{i})p_{G}(I|X_{i}) \end{aligned}$$
    (2)

    and

    $$\begin{aligned} p(X_{i},X_{j})=p_{S}(X_{i},X_{j})p_{O}(X_{i},X_{j})p_{D}(X_{i},X_{j}) \end{aligned}$$
    (3)

    \(p_{I}(I|X_{i})\) and \(p_{G}(I|X_{i})\) stand for the probabilities that the observed image intensity and image gradient distributions match the geometrical parameters \(X_{i}\) respectively. \(p_{S}(X_{i},X_{j})\), \(p_{O}(X_{i},X_{j})\) and \(p_{D}(X_{i},X_{j})\) are the geometrical constraints on the sizes, orientations and distances between neighboring components. All the observation models and constraints can be designed according to the observed data and prior anatomical knowledge of the spine structure. For detailed formulation of these terms, we refer to [14].

  2. 2.

    Optimization: The optimization is achieved as an inference on the graphical model. The method introduced in [14], which is a particle based nonparametric belief propagation on the graphical model, is used here to carry out the inference.

Figure 2b shows an example of the bone tissue probability assignment on the mid-sagittal slice, which is computed from the user supplied 2 landmarks (Fig. 2a) and 4 channel volume data (see Fig. 1 for an example) by a Gaussian distribution modeling and an equally weighted combination of the intensity distributions of the bony tissue in the 4 channels. This image is used for the computation of the intensity observation model \(p_{I}(I|X_{i})\) during the 2D lumbar column detection. Figure 2c gives the 2D lumbar column detection result. It can be observed that the centers, sizes and orientations of the vertebral bodies and IVDs are correctly identified.

2.2.2 3D Lumbar Spine Column Modeling

We model each lumbar vertebral body as an elliptical cylinder and the lumbar spine column as a variable-radius soft tube. Details of the modeling procedure are described as follows:

  • 3D modeling of each vertebral body: From the 2D vertebral body identification results, the position, hight, radius and orientation of each vertebral body and the image intensity distribution of the bone region can be estimated by modeling the vertebral body as a cylinder. Accordingly for each voxel in the neighbourhood of the estimated cylinder, we can assign the probability that it belongs to the bony tissue. To refine the 3D modeling of the vertebral bodies, we then further model the vertebral body as an elliptical cylinder, a least-squares geometric fitting to the voxels assigned with a high probability (\(>\)0.8) of belonging to the bony tissue can extract the 3D geometry of each vertebral body, including the center, height, orientation and the major radius and minor radius of the elliptical cylinder model.

  • 3D modeling of the spine column: The lumbar column can be modeled as a variable-radius soft tube that contains all the extracted vertebral bodies. Given the 3D models of L1-L5 vertebral bodies, the central axis and the variable-radius of the soft tube can be obtained by a linear interpolation on the centers and radii of the extracted 3D models of vertebral bodies. This results in a 3D variable-radius soft-tube spine column model as shown in Fig. 3a.

Given the 3D soft-tube lumbar spine column model, the spine column region can be extracted from the observed data sets (Fig. 3b). By further eliminating the bony tissue region using the 3D models of vertebral bodies, the candidate region for each target disc can be localized as shown in Fig. 3c–f. The following 3D IVD segmentation is then carried out on the extracted candidate IVD regions.

Fig. 3
figure 3

3D lumbar spine column detection and modeling. a the 3D soft-tube model of the lumbar spine column; b segmented lumbar spine column image; cd segmented disc candidate regions in sagittal slices; ef segmented disc candidate regions in coronal slices. Although all tasks are conducted in 3D, here we show the results in 2D slices for visualization purpose

2.3 3D Disc Segmentation

We solve the 3D disc segmentation as a template based registration between a geometrical disc template and the observed data.

  • The IVD template is set as a thin elliptical cylinder. Considering the anatomical structure of the spine column, i.e., each IVD must fall between its neighbouring vertebral bodies, the initial geometries (center, radii, orientation, hight) of the IVD cylinder template can be estimated using the 3D spine column model and the geometries of its neighboring vertebral bodies, which are all available from the previous 3D lumbar spine column modeling procedure.

  • For the segmentation of a specific IVD, the correspondent observed data to be matched is just the extracted candidate IVD region as shown in Fig. 3c–f

  • The registration algorithm we choose is a multi-kernel diffeomorphic image matching in the Large Deformation Diffeomorphic Metric Mapping (LDDMM) framework as described in [15] and related literatures [1618].

2.3.1 Multi-kernel LDDMM Registration

LDDMM framework [17] is one of the two main computational frameworks in computational anatomy [16]. Existing works show that LDDMM is a general solution for nonrigid image registration with a high flexibility and accuracy. In [15] multi-kernel LDDMM registration algorithms were investigated. Compared with the LDDMM registration with a single kernel, multi-kernel LDDMM has the capability to optimize the deformation in multiple spatial scales [15].

Following the general idea of LDDMM framework, we formalize the multi-kernel image registration between two images \(I_{0}\) and \(I_{1}\) as an optimization problem to find the optimal time dependent velocity field \(v(t)\) that minimizes the sum of a similarity and a deformation energy formalized as

$$\begin{aligned} \fancyscript{E}({v^{\alpha }(t)})&=\frac{1}{2}\sum _{i}w^{i}\int \limits _{0}^{1}\Vert v^{i}(t)\Vert _{V^{i}}^{2}dt + \Vert I_{0}\circ \phi _{v}^{-1}(1)-I_{1}\Vert _{L^{2}}^{2} \end{aligned}$$
(4a)
$$\begin{aligned} \frac{\partial }{\partial t}\phi _{v}(t)&=v(t)\circ \phi _{v}(t) \end{aligned}$$
(4b)
$$\begin{aligned} v(t)&= \sum _{i}v^{i}(t) \end{aligned}$$
(4c)
$$\begin{aligned} \phi _{v}(0)&= Id \end{aligned}$$
(4d)

where \(\Vert v^{i}(t)\Vert _{V^{i}}=<v^{i}(t),v^{i}(t)>_{V^{i}}^{\frac{1}{2}}\) is the norm induced by the inner product \(<u,v>_{V^{i}}=<L_{V^{i}}u,L_{V^{i}}v>_{L^{2}}\), and \(\{K_{V^{i}}=(L_{V^{i}}^{+}L_{V^{i}})^{-1}\}\) are the kernels. The \(\phi _{v}(t)\) is the time-dependent deformation computed as the integration of the velocity field \(v(t)\) and \(I_{0}\circ \phi _{v}^{-1}(t)\) is the transformed image of \(I_{0}\) by the deformation \(\phi _{v}(t)\).

Using the optimal control based approach introduced in [19, 20], we get the Euler-Poincare equation (EPDiff) for the multi-kernel LDDMM registration algorithm as

$$\begin{aligned} \dot{I}(t)&= -\nabla I(t)\cdot \sum _{i}v^{i}(t) \end{aligned}$$
(5a)
$$\begin{aligned} \dot{P}(t)&= -\nabla (P(t)\cdot \sum _{i}v^{i}(t)) \end{aligned}$$
(5b)
$$\begin{aligned} v^{i}(t)&= -(w^{i})^{-1}K_{V^{i}}\star (P(t)\nabla I(t)) \end{aligned}$$
(5c)
$$\begin{aligned} P(1)&= -(I(1)-I_{1}) \end{aligned}$$
(5d)
$$\begin{aligned} I(0)&= I_{0} \end{aligned}$$
(5e)

The registration can then be carried out by updating the deformation velocity fields \(v^{i}(t)\) iteratively from an initial value of \(v^{i}(t)\) using Eqs. (5a)–(5e). For more details on the computation routine, we refer to [17, 19, 20].

2.3.2 Disc Segmentation by Diffeomorphic Registration

The IVD segmentation is achieved as a template based registration between the thin cylinder IVD template and the correspondent candidate disc region as shown in Fig. 3.

In order to explore both intensity and feature information to enhance the accuracy and robustness of the segmentation, we consider a simultaneous registration of two pairs of images, \(I_{0}^{I}/I_{1}^{I}\) and \(I_{0}^{E}/I_{1}^{E}\), which stand for the image intensity and edge information template/observation pairs respectively. Accordingly in the cost function of the LDDMM registration (4a), the image similarity term includes two components \(\Vert I_{0}^{I}\circ \phi _{v}^{-1}(1)-I_{1}^{I}\Vert _{L^{2}}^{2} + \beta \Vert I_{0}^{E}\circ \phi _{v}^{-1}(1)-I_{1}^{E}\Vert _{L^{2}}^{2}\) (Fig. 4).

Fig. 4
figure 4

Determination of the high confidence disc region using the spine column model. Left to right: The spine column region extracted using the spine column model shown in a sagittal and a coronal slice; The central region of the spine column obtained by shrinking the radius of the spine column model by a factor 0.5 shown in the same two slices; The detected high confidence disc regions by further cutting out the bone tissue using the spine column model

\(I_{0}^{I}/I_{1}^{I}\) and \(I_{0}^{E}/I_{1}^{E}\) can be computed from the initialized IVD template and the extracted IVD candidate regions. Details about how to compute them are omitted due to limited space. An example of the template images and correspondent target images and the time dependent registration procedure is shown in Fig. 5.

The final segmented IVD can then be obtained as the deformed template achieved by the multi-kernel LDDMM registration.

Fig. 5
figure 5

3D IVD segmentation by multi-kernel LDDMM registration Left side: The data used in diffeomorphic registration based 3D lumbar disc segmentation. In the target images, the bone tissue regions are extracted using the spine column model. ab: 3 sagittal/coronal slices of the candidate disc region (disc L4-L5 in Fig. 2); cd: the intensity disc template in 3 sagittal/coronal slices; ef: intensity information extracted from MRI data sets in 3 sagittal/coronal slices; gh: the edge disc template in 3 sagital/coronal slices; ij:edge information computed from MRI data sets in 3 sagittal/coronal slices; Right side: The time-dependent deformation of the disc template during the multi-kernel diffeomorphic registration for a L4-L5 disc segmentation. left to right: the deformations of the template at 6 time slots t \(=\) 0, 0.2, 0.4, 0.6, 0.8, 1. t \(=\) 0 means the initial template and t \(=\) 1 gives the final registration results; from top row to bottom row: the evolution of the template visualized in 6 different slices

3 Experiments

The proposed algorithms are verified on MRI datasets of 15 patients obtained with the Dixon protocol. In all the data sets, based on the two landmarks obtained from the initialization step, both the 2D lumbar spine column and the 3D spine column models are correctly extracted. Examples of the disc segmentation results on 4 patient data sets are shown in Fig. 6.

Fig. 6
figure 6

3D intervertebral disc segmentation results on 4 patients. For visualization purpose, we display the results on 2D slices. For each image, the left three columns are sagittal slices and the right three are coronal slices

We also carried out quantitative evaluation of our algorithm. To do this, we manually segmented all datasets (we only need to segment one channel for each patient as all four channel volumes are aligned according to Dixon imaging protocol) and took the binary volumes from the manual segmentation as the ground truth to verify the accuracy of the present algorithm. We computed the Dice coefficient \(D\) which is usually used to measure the overlap between two binary images:

$$\begin{aligned} D =\frac{2\times |A\cap B|}{|A|+|B|}\times 100 \end{aligned}$$
(6)

Table 1 shows the average dice coefficients of the 5 discs on all 15 patients when the automated segmentation was compared to the manual segmentation. The highest average dice coefficient was found for patient #8 (87.9 %) and the lowest average dice coefficient was found for patient #9 (80.5 %). We also computed the average dice coefficients for all discs and the results are presented in Table 2. We note that Neubert et al. [10] reported a mean Dice of 76–80 % in their 3D IVD segmentation paper.

Table 1 Average Dice coefficients (%) of the 5 discs between the manual segmentation and the proposed algorithm on different patients
Table 2 Average Dice coefficients (%) between the manual segmentation and the proposed algorithm on different discs on all 15 data sets

4 Conclusions

In this paper we proposed an automated lumbar intervertebral disc segmentation strategy, whose key components include a graphical model based spine column identification algorithm and a multi-kernel LDDMM registration algorithm to achieve the disc segmentation. By identifying the lumbar spine column structure before carrying out the segmentation, we acquire geometrical and appearance information about the spine column. These information can be used to accurately locate the candidate disc region and provide constraints to enhance the performance of the disc segmentation. By converting the segmentation problem as a template based diffeomorphic registration, we can explore both the intensity and edge information of the observed data while keeping a smooth deformation of the template so that the final segmented discs will possess smooth surfaces. The experiments on 15 patient data sets verified the robustness and accuracy of our method.

We also noticed that for abnormal cases, such as with missing/additional vertebrae or the scoliosis case, the automated lumbar column identification may not be reliable although the graphical model can handle the unknown vertebra number as shown in [14]. A possible solution for these extreme cases is to ask the user to indicate the center of each vertebra body during the initialization step. Once the centers are known, the particle filtering-based inference can then achieve a reliable 2D lumbar column identification and the following up 3D lumbar column modeling and disc segmentation.