Vertebrae Segmentation in 3D CT Images Based on a Variational Framework

Hammernik, Kerstin; Ebner, Thomas; Stern, Darko; Urschler, Martin; Pock, Thomas

doi:10.1007/978-3-319-14148-0_20

Kerstin Hammernik⁷,
Thomas Ebner⁷,
Darko Stern⁷,
Martin Urschler⁸ &
…
Thomas Pock⁷

Part of the book series: Lecture Notes in Computational Vision and Biomechanics ((LNCVB,volume 20))

1430 Accesses
14 Citations

Abstract

Automatic segmentation of 3D vertebrae is a challenging task in medical imaging. In this paper, we introduce a total variation (TV) based framework that incorporates an a priori model, i.e., a vertebral mean shape, image intensity and edge information. The algorithm was evaluated using leave-one-out cross validation on a data set containing ten computed tomography scans and ground truth segmentations provided for the CSI MICCAI 2014 spine and vertebrae segmentation challenge. We achieve promising results in terms of the Dice Similarity Coefficient (DSC) of $0.93 \pm 0.04$ averaged over the whole data set.

Access provided by Autonomous University of Puebla. Download chapter PDF

Vertebral Column Localization, Labeling, and Segmentation

Vertebra segmentation based on two-step refinement

Article Open access 26 July 2016

Multi-object Model-Based Multi-atlas Segmentation Constrained Grid Cut for Automatic Segmentation of Lumbar Vertebrae from CT Images

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

Due to reduced physical activity and modern office jobs that require prolonged sitting during work hours, pathological conditions affecting the spine have become a growing problem of modern society. As most spinal pathologies are related to vertebrae conditions, the development of methods for accurate and objective vertebrae segmentation in medical images represents an important and challenging research area. While manual segmentation of vertebrae is tedious and too time consuming to be used in clinical practice, automatic segmentation may provide means for a fast and objective analysis of vertebral condition. A current state of the art method for detecting, identifying and segmenting vertebrae in computed tomography (CT) images is proposed by Klinder et al. [5]. The method is based on a complex and computationally demanding alignment of statistical shape models to the vertebrae in the image. Using the deformable surface model and training an edge detector to bone structure, Ma et al. [6] segment and identify the thoracic vertebrae in CT images. In the work of Kadoury et al. [4], the global shape representation of individual vertebrae in the image is captured with a non-linear low-dimensional manifold of its mesh representation, while local vertebral appearance is captured from neighborhoods in the manifold once the overall representation converges during the segmentation process. Ibragimov et al. [3] used transportation theory to build their landmark-based shape representations of vertebrae and game theory to align the model to a specific vertebra in 3D CT images. In this paper, we propose a method for vertebrae segmentation in 3D CT images based on a convex variational framework. In contrast to the previously proposed methods that use sophisticated vertebral models, our segmentation method incorporates only a mean shape model of vertebrae initialized in the center of the vertebral body.

2 Methods

Our vertebrae segmentation algorithm is based on two representations of a priori information, a mean shape model and a bone probability map obtained from intensity information of the input vertebra image. The main steps of our algorithm are illustrated in Fig. 1. Firstly, an intensity based prior map of the bone is estimated by comparing the intensity values to trained bone and soft tissue histograms. This resembles our learned bone prior. The vertebral mean shape is then registered to the thresholded bone prior map to obtain the orientation of the individual vertebrae. This information is used to formulate a total variation (TV) based active contour segmentation problem, which combines the registered mean shape and the bone prior, and additionally incorporates edge information.

2.1 Mean Shape Model

The vertebral mean shape model $f_s$ is calculated separately for three groups of vertebrae to account for variation in shape along the spine: T01–T06, T07–T12, L01–L05. Ground truth segmentations of vertebrae are registered to an arbitrary reference vertebra using an intensity-based registration with a similarity transformation and normalized cross correlation as similarity measure. The vertebral mean shape model is obtained by averaging the registered binary images of the ground truth segmented vertebrae. This step leads to a voxelwise probability for being part of the mean shape. To meet the requirements of the TV optimization framework [10], the obtained values in the probability image are inverted such that negative values represent the mean shape vertebral region and values close to one the non-vertebral region.

2.2 Bone Prior Map

The bone prior map $f_b(x) = { log}\left( \frac{p_{ bg}(x)}{p_{ fg}(x)}\right) $ is calculated as the log likelihood ratio between the probability that a voxel $x$ belongs to the bone distribution $p_{ fg}$ and the probability that it belongs to the soft tissue distribution $p_{ bg}$. The bone and soft tissue distributions are obtained from the training data set by estimating normalized mean foreground and background histograms of the intensity values using the ground truth segmentations. A coarse segmentation of the bone in the input image is achieved by thresholding the inverted bone map. We select a threshold value of $-0.5$ to ensure that trabecular bones are included in the segmented bone region, since their image intensities might be close to soft tissue.

2.3 Total Variation Segmentation

To obtain the segmented vertebra $u$, the following non-smooth energy functional $ E_{ seg}(u)$ is minimized using the first order primal-dual algorithm from [1]:

$$\begin{aligned} \min \limits _{u\in [0,1]} E_{ seg}(u) = \min \limits _{u\in [0,1]} {{\mathrm{TV}}}_ { g,\,aniso}\;+\;\lambda _1 \int \limits _{\varOmega }uf_s \, dx+ \lambda _2 \int \limits _{\varOmega }uf_b \, dx \end{aligned}$$

(1)

where $\varOmega $ denotes the image domain. The trade-off between the vertebral mean shape model, bone prior map and image edge influence is regularized by the parameters $\lambda _1$ and $\lambda _2$. The term ${{\mathrm{TV}}}_{ g,\,aniso}(u)$ is the anisotropic $g$-weighted TV norm [7], using a structure tensor $D^{\frac{1}{2}}(x)$ as proposed by [9], incorporating both edge magnitude and edge direction to be able to segment elongated structures:

$$\begin{aligned} {{\mathrm{TV}}}_{ g,\,aniso}&= \int \limits _{\varOmega }|D^{\frac{1}{2}}(x)\nabla u | \, dx = \int \limits _{\varOmega }\sqrt{\nabla u^T D(x) \nabla u} \, dx\end{aligned}$$

(2)

$$\begin{aligned} D^{\frac{1}{2}}(x)&= g(x){n}{n}^T + {n_0}{n_0}^T + {n_1}{n_1}^T. \end{aligned}$$

(3)

Here, $n=\frac{\nabla I}{||\nabla I ||}$ is the normalized image gradient, $n_0$ denotes an arbitrary vector in the tangent plane defined by $n$, and $n_1$ is the cross product between $n$ and $n_0$. The edge function $g(x)$ is defined as

$$\begin{aligned} g (x)= e^{-\alpha ||\nabla I(x) ||^{\beta }},\, \alpha , \beta \in \mathbb {R}^+. \end{aligned}$$

(4)

During minimization of the energy $E_{ seg}$, the segmentation $u$ tends to be foreground, if $f_b,f_s < 0$ and background, if $f_b,f_s >0$. If $f_b,f_s$ equal zero, the pure TV energy is minimized, thus seeking for a segmentation surface with minimal surface area. The final segmentation is achieved by thresholding the segmentation $u$ between 0 and 1.

3 Experimental Setup

We evaluated our method on the volumetric CT data sets provided for the CSI spine and vertebrae segmentation challenge [11]. The data consists of ten training images and the corresponding ground truth segmentations. The performance of our algorithm was evaluated by a leave-one-out cross validation, i.e., we report average performance over ten experiments.

We implemented the registration as well as the segmentation algorithm on the GPU to exploit hardware parallelization of our algorithms using Nvidia CUDA. For edge detection, we chose the parameters as $\alpha =20$ and $\beta =0.55$. The regularization parameters are set to $\lambda _1=0.04$ and $\lambda _2=0.005$ for all experiments. We achieved the binary segmentations by thresholding the segmentation $u$ with a value of 0.2.

4 Results

For quantitive evaluation, we used the Dice Similarity Coefficient (DSC) to evaluate our segmentation algorithm. We achieved an average DSC of 0.93 $\pm $ 0.04 over all vertebrae from the leave-one-out experiment. Our algorithm performs well on lumbar vertebrae (0.96 $\pm $ 0.02) and lower thoracic vertebrae T07–T12 (0.95 $\pm $ 0.02). The DSC for thoracic vertebrae T01–T06 is 0.89 $\pm $ 0.05, which can be explained by the influence of ribs and small intervertebral discs that are connected to the vertebrae. The algorithm did not perform well on case 6 in terms of registration errors due to confusing trabecular bone intensities with soft tissue. All estimated DSC are depicted in Table 1 and 2. A qualitative result of a correctly segmented fifth lumbar vertebra is illustrated in Fig. 2. In contrast to that, Fig. 3 shows an example for the sixth thoracic vertebra where the segmentation is influenced by connected ribs.

Table 1 Mean values and standard deviations for each vertebra resulting from the leave-one-out cross validation experiment

Full size table

Table 2 Evaluation in terms of mean values and standard deviations for all vertebrae in the individual data sets

Full size table

5 Discussion

Our proposed method for vertebrae segmentation based on a variational framework has been successfully applied to ten volumetric CT data sets provided for the CSI spine and vertebrae segmentation challenge. A common problem in vertebrae segmentation is that edges of the vertebrae are not clearly defined. Furthermore, trabecular bone intensities sometimes resemble soft tissue. These limitations could be easily overcome with our approach by adding manual user constraints as proposed in [8]. We see this as a great benefit of using our variational framework in vertebrae segmentation. In contrast to other methods that do not guarantee the convergence to an optimal solution, our proposed TV energy minimization (1) is convex, hence, it yields a globally optimal solution given the successfully registered mean shape. Initialized with a single point in the center of the vertebra, the proposed algorithm can be considered fully automatic since various methods have already been proposed for automatic detection and labeling of the center of the vertebral body [2]. While other methods depend on the sophisticated shape models, whose generation usually requires a great amount of time consuming manual interaction, in our proposed method we used coarse mean shape models built separately for all upper thoracic, lower thoracic and lumbar vertebrae. The vertebral mean shape model is registered to the thresholded bone prior map, which may lead to wrong alignment, if ribs are present or if trabecular bone intensities are low, i.e., close to soft tissue values. The overall result of $0.93\,\pm \,0.04$ in terms of the DSC is similar to other published methods. The result in lumbar region ($0.96\,\pm \,0.02$) is better than 0.95 presented by Kadoury et al. [4] and the result of Ibragimov et al. [3] evaluated only on the lumbar vertebrae ($0.93\,\pm \,0.02$). In the lower thoracic part of the spine, our result of $0.95\,\pm \,0.02$ exceeds the overall result for thoracic vertebrae (i.e. 0.93) presented by Kadoury et al. [4]. However, these methods were evaluated on different data sets, so results are not fully comparable. The lower DSC values for thoracic vertebrae T01–T06 can be explained by the influence of ribs and small intervertebral discs that mislead the segmentation.

6 Conclusion

In this work, a fully automatic system for vertebrae segmentation from CT was shown. It builds upon a TV based convex active contour segmentation that incorporates shape and intensity priors learned from training data and combines this prior information with image edges to achieve a minimal surface segmentation. Our results on the data of the MICCAI 2014 CSI challenge are promising and comparable to state of the art methods.

References

Chambolle, A., Pock, T.: A first-order Primal-Dual algorithm for convex problems with applications to imaging. J. Math. Imaging Vis. 40(1), 120–145 (2011)
Article MATH MathSciNet Google Scholar
Glocker, B., Zikic, D., Konukoglu, E., Haynor, D.R., Criminisi, A.: Vertebrae localization in pathological spine CT via dense classification from sparse annotations. Medical Image Computing and Computer-Assisted Intervention—MICCAI 2013. Lecture Notes in Computer Science, vol. 8150, pp. 262–270. Springer, Berlin (2013)
Google Scholar
Ibragimov, B., Likar, B., Pernus, F., Vrtovec, T.: Shape representation for efficient landmark-based segmentation in 3D. IEEE Trans. Med. Imaging 33(4), 861–874 (2014)
Article Google Scholar
Kadoury, S., Labelle, H., Paragios, N.: Automatic inference of articulated spine models in CT images using high-order Markov Random Fields. Med. Image Anal. 15(4), 426–437 (2011)
Article Google Scholar
Klinder, T., Ostermann, J., Ehm, M., Franz, A., Kneser, R., Lorenz, C.: Automated model-based vertebra detection, identification, and segmentation in CT images. Med. Image Anal. 13(3), 471–482 (2009)
Article Google Scholar
Ma, J., Lu, L., Zhan, Y., Zhou, X., Salganicoff, M., Krishnan, A.: Hierarchical segmentation and identification of thoracic vertebra using learning-based edge detection and coarse-to-fine deformable model. Medical Image Computing and Computer-Assisted Intervention—MICCAI 2010. Lecture Notes in Computer Science, vol. 6361, pp. 19–27. Springer, Berlin (2010)
Google Scholar
Reinbacher, C., Pock, T., Bauer, C., Bischof, H.: Variational Segmentation of Elongated Volumetric Structures. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3177–3184 (2010)
Google Scholar
Unger, M., Pock, T., Trobin, W., Cremers, D., Bischof, H.: TVSeg—interactive total variation based image segmentation. In: Proceedings of the British Machine Vision Conference (2008)
Google Scholar
Weickert, J.: Anisotropic Diffusion in Image Processing. Teubner, Stuttgart (1998)
MATH Google Scholar
Werlberger, M., Pock, T., Unger, M., Bischof, H.: A variational model for interactive shape prior segmentation and real-time tracking. Scale Space and Variational Methods in Computer Vision. Lecture Notes in Computer Science, vol. 5567, pp. 200–211. Springer, Berlin (2009)
Google Scholar
Yao, J., Burns, J.E., Munoz, H., Summers, R.M.: Detection of vertebral body fractures based on cortical shell unwrapping. Medical Image Computing and Computer-Assisted Intervention—MICCAI 2012. Lecture Notes in Computer Science, vol. 7512, pp. 509–516. Springer, Berlin (2012)
Google Scholar

Download references

Acknowledgments

This work was supported by the Austrian Science Fund (FWF) under the START project BIVISION, No. Y729, by the city of Graz (A16-21628/2013), and a European Community FP7 Marie Curie Intra European Fellowship (331239).

Author information

Authors and Affiliations

Institute for Computer Graphics and Vision, BioTechMed, Graz University of Technology, Graz, Austria
Kerstin Hammernik, Thomas Ebner, Darko Stern & Thomas Pock
Ludwig Boltzmann Institute for Clinical Forensic Imaging, Graz, Austria
Martin Urschler

Authors

Kerstin Hammernik
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Ebner
View author publications
You can also search for this author in PubMed Google Scholar
Darko Stern
View author publications
You can also search for this author in PubMed Google Scholar
Martin Urschler
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Pock
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kerstin Hammernik .

Editor information

Editors and Affiliations

Clinical Center, National Institutes of Health, Bethesda, Maryland, USA
Jianhua Yao
Department of Computing, Imperial College, London, United Kingdom
Ben Glocker
Innovative Technologies, Philips Research, Hamburg, Germany
Tobias Klinder
GE Healthcare and University of Western Ontario, London, Ontario, Canada
Shuo Li

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Hammernik, K., Ebner, T., Stern, D., Urschler, M., Pock, T. (2015). Vertebrae Segmentation in 3D CT Images Based on a Variational Framework. In: Yao, J., Glocker, B., Klinder, T., Li, S. (eds) Recent Advances in Computational Methods and Clinical Applications for Spine Imaging. Lecture Notes in Computational Vision and Biomechanics, vol 20. Springer, Cham. https://doi.org/10.1007/978-3-319-14148-0_20

Download citation

DOI: https://doi.org/10.1007/978-3-319-14148-0_20
Published: 10 February 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-14147-3
Online ISBN: 978-3-319-14148-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics