Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

Accurate 3D spinal vertebrae image segmentation techniques are important tools to assist the diagnosis and treatment of spinal disorders such as spine trauma  [10, 14]. Severe spine injury can result in life threatening and chronological problems unless treated promptly and properly. In any spinal injury, the possibility of spinal fracture must be examined immediately. Image segmentation of spinal vertebrae in 3D allows detection, measurement, and monitoring of the fracture(s), and facilitates biomechanics analysis of the spinal column.

Despite an increasing interest in spinal vertebrae segmentation in recent years, accurate 3D segmentation methods for diseased or fractured vertebrae are still lacking. There are some existing works in the literature for 2D or 3D segmentations, however they often require user intervention or fall short in achieving high accuracy [4, 68, 1113, 15]. Segmentation of diseased or fractured vertebrae has been attempted recently [5, 19, 20], however, these are either in 2D or focused only on vertebral body rather than the whole 3D spinal vertebrae.

Traumatic injury of the spine often correlates with morphometric features in images. Segmentation of the whole vertebra in 3D would facilitate the detection of fractured vertebra and the assessment of the severity of the fracture. For example, the highlighted volumetric region of vertebrae could assist physicians in performing visual inspection of vertebral fractures, determining its stability and measuring quantitatively the fractured vertebrae.

This work extends the spinal vertebrae segmentation method presented in [9] to segment fractured vertebrae. In this case, high variability of fractured vertebral shape is largely captured by the embedded Willmore flow, while prior shape energy comes into action only when encountering inhomogeneous image intensity distribution.

2 Segmentation Framework

It is well-known that level set methods have advantages such as flexibility in dealing with topological change, easy extension into higher dimensions, as well as easy integration of prior knowledge and region statistic. The segmentation framework presented here has made use of these properties. The framework combines the kernel density estimation technique and Willmore flow to incorporate prior shape knowledge and local geometrical features from images into the level set method. Whilst the prior shape model provides much needed prior knowledge when information is missing from the image, the edge-mounted Willmore flow helps to capture the local geometry and smoothes the evolving level set surface.

The level set method embeds an interface in a higher dimensional function \(\phi \) (the signed distance function) as a level set \(\phi =0\) [16]. The evolution of the level set function \(\phi (t)\) is governed by \(\frac{\partial \phi }{\partial t}+ F|\nabla \phi |=0\), where \(F\) is the speed function. Based on the variational framework, an energy function \(E(\phi )\) is defined in relation to the the speed function. The minimization of such energy generates the Euler-Lagrange equation, and the evolution of the equation is through calculus of variation:

$$\begin{aligned} \frac{\partial \phi }{\partial t} = -\frac{\partial E(\phi )}{\partial \phi }. \end{aligned}$$

In this work, the fusion of energies whereby a shape prior distribution estimator \(E_s\) and an edge-mounted Willmore energy \(E_{\textit{w}_0}\) is employed:

$$ E(\phi )= \lambda E_s + E_{\textit{w}_0}, $$

where \(\lambda \) \((0<\lambda \le 1)\) is the weight parameter, which is tuned to suit the segmentation of normal and abnormal spinal vertebrae.

2.1 Computing Prior Shape Energy via Kernel Density Estimation

Kernel density estimation (KDE) is a nonparametric approach for estimating the probability density function of a random variable. Without assuming the prior shapes are Gaussian distributed, KDE presents advantage in estimating the shape distribution even with a small number of training set, in addition to modeling shapes with high complexity and structure. In this study, we adopted the prior shape energy formulation discussed by Cremers et al. [2].

The density estimation is formulated as a sum of Gaussian of shape dissimilarity measures \(d^2(\phi , \phi _i)\), \(i=1, 2, \dots , N\):

$$\begin{aligned} P(\phi )\propto \frac{1}{N} \sum _{i=1}^Ne^{-\frac{d^2(\phi ,\phi _i)}{2\sigma ^2}}, \end{aligned}$$

where the shape dissimilarity measure \(d^2(\phi , \phi _i)\) is defined as

$$\begin{aligned} d^2(\phi ,\phi _i)&=\int _\Omega \frac{1}{2}\left( H(\phi ) - H(\phi _m)\right) ^2 dx,\\ \sigma ^2&= \frac{1}{N} \sum _{i=1}^N\min _{j\ne i}d^2(\phi _i,\phi _j), \end{aligned}$$

and \(H(\phi )\) is the Heaviside function. By maximizing the conditional probability

$$\begin{aligned} P(\phi |I) = \frac{P(I|\phi )P(\phi )}{P(I)}, \end{aligned}$$

and considering the shape energy as

$$\begin{aligned} E_s(\phi ) = -\log P(\phi |I), \end{aligned}$$

the variational with respect to \(\phi \) becomes

$$\begin{aligned} \frac{\partial E_s}{\partial \phi }=&\frac{\sum _{i=1}^N \alpha _i \frac{\partial }{\partial \phi } d^2(\phi , \phi _i)}{2\sigma ^2 \sum _{i=1}^N \alpha _i}\\ =&\sum _{i=1}^N \frac{e^{-\frac{d^2(\phi , \phi _i)}{2\sigma ^2}}}{2\sigma ^2 \sum _{i=1}^N \alpha _i} \Bigg ( 2\delta (\phi ) \bigg [H(\phi )-H(\phi _i(x-\mu _\phi ))\bigg ]\\&\quad + \int \bigg [H(\phi (\xi )-H(\phi _i(\xi -\mu _\phi ))\bigg ] \delta \phi (\xi ) \frac{(x-\mu _\phi )^T\nabla \phi (\xi ) }{\int H\phi dx} d\xi \Bigg ), \end{aligned}$$

where \(\mu _{\phi }\) is the centroid of \(\phi \) and \(\alpha _i=\exp \left( -\frac{1}{2\sigma ^2}d^2(\phi ,\phi _i)\right) \) is the weight factor for \(i = 1,2,\dots ,N\).

2.2 Computing Local Geometry Energy via Willmore Flow

Willmore energy is a function of mean curvature, which is a quantitative measure of how much a given surface deviates from a sphere. It is formulated as

$$ E_\textit{w} = \frac{1}{2}\int _M h^2 dA, $$

where \(M\) is a \(d\)-dimensional surface embedded in \(\mathbb R^{d+1}\) and \(h\) the mean curvature on \(M\) [18]. For image segmentation, the Willmore energy provides an internal energy that gives a useful description of a region, where the effect of edge indicator is not significant. In these regions, smoothness of the shape of the curve should be maintained and extended, which can be regarded as a weak form of inpainting [3].

As a geometric functional, the Willmore energy is defined on the geometric representation of a collection of level sets. Its gradient flow can be well represented by defining a suitable metric, the Frobenius norm, on the space of the level sets. Frobenius norm is a convenient choice as it is equivalent to the \(l^2\)-norm of a matrix and more importantly it is computationally attainable. As Frobenius norm is an inner-product norm, the optimization in the variational method comes naturally.

Based on the formulation by Droske and Rumpf [3], the Willmore flow or the variational form for the Willmore energy with respect to \(\phi \) is

$$ \frac{\partial E_{\textit{w}}}{\partial \phi } = -\Vert \nabla \phi \Vert \left( \Delta _M h +h(t) \left( \Vert S(t)\Vert _2^2-\frac{1}{2}h(t)^2\right) \right) , $$

where \(\Delta _M h = \Delta h -h\frac{\partial h}{\partial n} - \frac{\partial ^2h}{\partial n^2}\) is the Laplacian Beltrami operator on \(h\) with \(n=\frac{\nabla \phi }{\Vert \nabla \phi \Vert }\), \(S = (I-n\otimes n)(\nabla \times \nabla )\phi \) is the shape operator on \(\phi \) and \(\Vert S\Vert _2\) is the Frobenius norm of \(S\).

In order to ensure the smoothing effect work successfully around the constructed surface and not affecting the desired edge of vertebrae, the Willmore flow is coupled with the edge indicator function \(g(I)= \frac{1}{1+|\nabla G_\sigma *I|^2}\), where \(G_\sigma \) is the Gaussian filter with standard deviation \(\sigma \):

$$\begin{aligned} \frac{\partial E_{\textit{w}_0}}{\partial \phi }= g(I)\frac{\partial E_{\textit{w}}}{\partial \phi }. \end{aligned}$$

3 Experiments and Results

Experiments have been conducted on CT images of spinal vertebrae for 2D and 3D segmentation. The dataset consists of 20 CT images of normal and 4 CT images of fractured spinal vertebrae images of patients aged 18 to 66 years. The images were acquired from various CT scanners such as a 32-detector row Siemens definition, 64-detector row Philips Brilliance and 320-detector row Toshiba Aquilion. The in plane resolutions for these sagittal images range from 0.88 to 1.14 mm, with consistent slice thickness of 2 mm. Original images for these images have fixed sizes of \(512\times 512\), with number of slices varying from 45 to 98. For 3D segmentation, a torus is set manually surrounding the spinal canal as the initial contour. The level set method is then implemented using a narrow band scheme [1] with a re-initialization algorithm [17].

Table 1 Average DSC (\(\%\)) and HD (mm) with standard deviation for segmentations of normal lumbar vertebrae (L1 to L5) using Chan-Vese (CV), Chan-Vese with prior shape (CV + S), Caselles (Ca), Caselles with prior shape (Ca + S), edge-mounted Willmore (\(W_0\)), edge-mounted Willmore with prior shape (\(W_0\) + S) energies, region growing (RG) and graph cut (GC) approach
Fig. 1
figure 1

An example of patient CT image with fractures on lumbar vertebrae L2 and L3

It has been reported that the 3D segmentations of spinal vertebrae clearly outperform the other methods such as region growing, graph cut, the classical level set methods such as Chan-Vese and Caselles models as well as their combinations with shape priors [9]. While some methods can perform relatively well in the 2D segmentation of spinal vertebrae, the majority of them fail badly when extended to the 3D segmentation of an individual vertebra due to the highly complex shape and connected structure as well as the nonuniform image intensity distribution in the posterior column of vertebra. Table 1 summarized 3D segmentation results on normal lumbar vertebrae using various approach, evaluated with ground truth. It is worth noted the effectiveness of our segmentation framework, with an overall accuracy of \(89.32\pm 1.70\,\%\) and \(14.03\pm 1.40\) mm based on Dice similarity coefficient and Hausdorff distance respectively, whilst the inter- and intra-observer variation agreements were \(92.11\pm 1.97\,\%\), \(94.94\pm 1.69\,\%\), \(3.32\pm 0.46\) mm and \(3.80\pm 0.56\) mm. Segmentation results depend highly on available dataset. The intra- and inter-observer variation estimations were performed to verify the difficulty of manual delineation in 3D using our dataset. We have shown that our results present no significant statistical differences (\(p > 0.05\)) when compared with these observer estimations. The robustness of the proposed segmentation framework is demonstrated on the CT image of a patient with fractures on lumbar vertebrae L2 and L3 as seen in Fig. 1. As shown in Fig. 2, the segmentation framework manages to capture the 3D shape of fractured vertebrae L2 and L3, despite the inhomogeneity, noise and missing edges appeared on these fractured vertebra images. It enables individual segmentation of vertebrae without leaking into the nearby connected vertebra. Segmentation results on fractured vertebrae were evaluated via visual inspection by radiologist.

Fig. 2
figure 2

Different views of 3D segmentation for fractured lumbar vertebrae L2 and L3 as well as their projections in 2D (red) compared with manual segmentation (yellow) on axial, sagittal and coronal views

4 Discussion and Conclusion

An accurate level set segmentation framework for segmenting spinal vertebrae in 2D and 3D is presented in this study. The robustness of the framework is demonstrated on CT images of fractured vertebrae. The framework combines the kernel density estimation technique and Willmore flow to incorporate prior shape knowledge and local geometrical features from images into the level set method. It is worth noted that fusion of these energies effectively translate the prior shape knowledge and local geometrical feature of spinal vertebrae into the level set segmentation framework. The Willmore flow driven level set segmentation demonstrates better regularization than the widely used mean curvature flow in level set segmentation. Unlike minimizing surface area by mean curvature flow in regularization, Willmore flow minimizes the bending energy when performing surface smoothing, which is more suitable for object with complex shape and structure. The segmentation algorithm was performed directly in a 3D volumetric manner instead of sequentially to the slices of a 3D image. This allows the volumetric tissue connectivity be taken into consideration and hence, enables more meaningful representation of 3D anatomical shape and structure. Moreover, it forms a continuous, smooth 3D surface and without the post processing redundancy posed by the slice by slice segmentation approach. More samples of fractured vertebrae are needed to perform further evaluation on the segmentation framework. Future work will integrate the algorithm into a pathological vertebrae characterization framework to yield an efficient computer aided diagnosis platform for quantitative analysis of spinal vertebrae fracture and related problems.