Abstract
Multi-plane, 2D TEE images constitute the clinical standard of care for assessment of left ventricle function, as well as for guiding various minimally invasive procedures that rely on intra-operative imaging for real-time visualization. We propose a framework that enables automatic, rapid and accurate endocardial left ventricle feature identification and blood-pool segmentation using a combination of image filtering, graph cut, non-rigid registration-based motion extraction, and 3D LV geometry reconstruction techniques applied to the TEE image series. We evaluate our proposed framework using several retrospective patient tri-plane TEE image sequences and demonstrate comparable results to those achieved by expert manual segmentation using clinical software.
Keywords
- Left Ventricle
- Target Registration Error
- Volume Reconstruction
- Left Ventricle Volume
- Active Contour Method
1 Introduction
Over the past two to three decades, ultrasound (US) imaging has evolved into the preferred, standard-of-care imaging modality for the diagnosis, screening, monitoring, and real-time guidance of several conditions. Specifically, thanks to its real-time capabilities, relatively low cost (compared to other modalities), and lack of ionizing radiation exposure, US imaging has become the “first-line” modality for patient screening, diagnosis, and cardiac function assessment.
Trans-esophageal echocardiography (TEE) enables heart imaging while minimizing signal attenuation and optimizing field-of-view. As such, TEE is not only used for screening and diagnosis, but also for intra-operative therapy monitoring and/or image-guided cardiac interventions. Since the mid-2000s, TEE technology has accommodated 3D image acquisition and visualization of the cardiac anatomy in lieu of simple 2D renderings. However, despite the added benefit of 3D and 4D (3D + time) displays, the inherent trade-off between frame rate and extent of anatomy covered has led clinicians to resort to the acquisition and visualization of multi-planar (orthogonal bi-plane or tri-plane) images to estimate the parameters required to assess cardiac function (e.g., ejection fraction) or to identify critical features for image-guided therapy.
Despite their high frame rate, 2D US images are hampered by several well-known limitations: challenging interpretation and uncertainty in identifying structures of interest due to inherent specular appearance. Several approaches for LV segmentation in echocardiography [1] have been formulated as a contour finding problem, with the active contour method [2, 3] being extensively used. Given its edge-based energy approach, the active contour method often produces many local minima and is also sensitive to the initialization. Inspired by active contours, the level set method [4, 5] uses both edge- and region-based energy, making it more robust and less sensitive to initialization.
Active shape [6] and active appearance models [7] incorporate knowledge of the LV shape and appearance from manually segmented training sets, but assume a Gaussian distribution of the shape and appearance derived from the training sets, requiring an initial approximation close to the final solution. On the other hand, database-guided segmentation [8] overcomes the initialization problem by implicitly encoding prior knowledge from the expert-annotated databases, yet at the expense of a highly complex search process. Other supervised learning techniques, such as artificial neural networks [9], have been used to detect endocardial border pixels using expert-annotated training sets, but require large training sets and are unable to handle cases well outside of the training set.
In this work we propose the implementation and clinical validation of an automatic workflow that encompasses well-evaluated filtering, segmentation, registration, and volume reconstruction techniques as a means to provide a rapid, robust and accurate framework for feature tracking from multi-plane ultrasound image sequences. The proposed computational framework was developed in close collaboration with our echocardiography colleagues, motivated by the need to reduce user-dependent and user-induced bias and reduce the uncertainty associated with the process of manually identifying features from US image sequences. The impact and contribution of the proposed work is the integration of several image processing techniques (i.e., phase-based filtering, segmentation, registration and volume reconstruction) into a streamlined workflow that utilizes traditional standard of care images and fits seamlessly within the current workflows associated with both cardiac function assessment and intra-operative cardiac intervention guidance and monitoring.
2 Methodology
Speckle noise and signal dropouts inherent in US images render intensity-based approaches unreliable; rather, local-phase-based approaches [10], theoretically invariant to the intensity magnitude, have been preferred for detecting the endocardium. Here we exploit the robustness of phase-based feature detection and combine it with the power of graph cut-based techniques [11] that use both region and boundary regularization, to obtain a rapid, automatic, piecewise smooth segmentation of the LV blood pool and muscle regions. In addition, we conducted a preliminary study using retrospective clinical patient data consisting of tri-plane (60\(^\circ \) to one another) TEE image sequences through the cardiac cycle to validate the proposed tools and demonstrate their clinical utility and performance against commercial, clinical-grade, clinician-operated software.
The proposed methodology encompasses three steps: (1) endocardial left ventricle (LV) feature extraction and blood-pool segmentation from the raw 2D multi-plane image sequences, (2) frame-to-frame feature tracking and propagation through the cardiac cycle using non-rigid image registration, and (3) 3D reconstruction of the LV blood pool geometry at the desired cardiac phases using spline-based interpolation and convex hull fitting.
2.1 LV Feature Extraction and Blood-Pool Segmentation
Image Pre-processing via Monogenic Filtering: Whereas intensity-based edge detection algorithms are ineffective at identifying features from US images, intensity-invariant local phase-based techniques have shown promising results [10], where a local phase of \(\pm \pi /2\) signifies high symmetry, while a local phase of 0 or \(\pi \) signifies high asymmetry [12]. The local phase computation of a 1D signal uses a complex analytic signal comprised of the original signal as the real part and its corresponding Hilbert transform as the imaginary part. However, since the Hilbert transform is mathematically restricted to 1D with no straightforward extension to 2D and 3D, we used the method described in [13] to extend the concept of the analytic signal to higher dimensions using a monogenic signal. The higher dimension monogenic signal is generated by combining a bandpass Gaussian-derivative filter with a vector-valued odd filter (i.e., a Riesz filter). The low frequency variations in the local phase are extracted using a high spread (\(\sigma \)) Gaussian-derivative filter, while the high frequency components are extracted using a low spread (\(\sigma \)) Gaussian-derivative filter. The described monogenic filtering sequence is used to transform each of the three tri-plane 2D US images into corresponding “cartoon” images in which the blood pool and myocardial wall appear enhanced, facilitating their segmentation in the subsequent step.
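To make the 1D starting point concrete, the following is a minimal sketch of local phase computed from the analytic signal (the original signal as the real part and its Hilbert transform as the imaginary part). It is not the monogenic filter used in this work: it handles only 1D signals, omits the band-pass Gaussian-derivative stage, and relies on a naive O(N²) DFT so that it depends only on the Python standard library. All function names are illustrative.

```python
import cmath
import math


def dft(values, inverse=False):
    """Naive discrete Fourier transform (adequate for short signals only)."""
    n = len(values)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t, x in enumerate(values):
            acc += x * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def analytic_signal(signal):
    """Return signal + i * Hilbert(signal) by zeroing negative frequencies."""
    n = len(signal)
    spectrum = dft(signal)
    gains = [0.0] * n
    gains[0] = 1.0                      # keep the DC term unchanged
    if n % 2 == 0:
        gains[n // 2] = 1.0             # keep the Nyquist term unchanged
        upper = n // 2
    else:
        upper = (n + 1) // 2
    for k in range(1, upper):
        gains[k] = 2.0                  # double the positive frequencies
    return dft([g * s for g, s in zip(gains, spectrum)], inverse=True)


def local_phase(signal):
    """Local phase (radians) at every sample of a 1D signal."""
    return [cmath.phase(z) for z in analytic_signal(signal)]
```

For a pure cosine, the local phase returned at each sample is simply the argument of the cosine, which is a convenient sanity check before moving to band-passed image data.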
Graph Cut-Based Segmentation: The resulting “cartoon” image is used to construct a four-neighborhood graph structure in which each pixel is connected to its east, west, north and south neighbors. Three special nodes called terminals are added, which represent three classes (labels): background, blood pool and myocardium. The segmentation can be formulated as an energy minimization problem to find the labeling f, such that it minimizes the energy:
where the first term represents the smoothness energy, which forces pixels p and q, defined by the set of interacting pairs \(\mathcal {N}\), towards the same label. The second term represents the data energy that reduces the disagreement between the labeling f and the observed data \(i_p\). The links between each pixel and the terminals (i.e., t-links) are formulated as the negative logarithm of the normal distribution [14]:
where \(\mu \) and \(\sigma \) are the mean and standard deviation for the three classes obtained from the image. The links between neighboring pixels, called n-links, are weighted according to their similarity to formulate the smoothness energy:
where \(T(\cdot )\) is 1 if its argument is true and 0 otherwise, K is a constant, and C is an intensity threshold that forces the neighboring pixels within the threshold towards the same label. The minimum cut, equivalent to the maximum flow, is obtained via the expansion algorithm in [11], yielding the segmentation of background, blood-pool, and myocardium (Fig. 1e).
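The display equations referred to in this formulation are not reproduced in this text. One form consistent with the definitions given here, assuming the intensity-dependent Potts interaction of [11], is sketched below. The symbols \(E\), \(V_{p,q}\), \(D_p\) and \(\mathcal{P}\) (the set of pixels), the factor of 2, and the direction of the inequality on C are assumptions made for illustration and may differ from the original equations.

```latex
\begin{align}
E(f) &= \sum_{\{p,q\} \in \mathcal{N}} V_{p,q}(f_p, f_q)
      + \sum_{p \in \mathcal{P}} D_p(f_p), \\
D_p(f_p) &= -\ln \left[ \frac{1}{\sigma_{f_p}\sqrt{2\pi}}
      \exp\left( -\frac{(i_p - \mu_{f_p})^2}{2\sigma_{f_p}^2} \right) \right]
      = \frac{(i_p - \mu_{f_p})^2}{2\sigma_{f_p}^2}
      + \ln\left( \sigma_{f_p}\sqrt{2\pi} \right), \\
V_{p,q}(f_p, f_q) &=
      \begin{cases}
        2K \cdot T(f_p \neq f_q) & \text{if } |i_p - i_q| \le C, \\
        K \cdot T(f_p \neq f_q)  & \text{if } |i_p - i_q| > C.
      \end{cases}
\end{align}
```

Under this reading, neighboring pixels with similar intensities pay a larger penalty for receiving different labels, which matches the stated role of C.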
2.2 Frame-to-frame Feature Tracking and Propagation
Image Pre-processing: Once a single-phase image is segmented using the procedure outlined in Sect. 2.1, the extracted features are tracked and propagated throughout the cardiac cycle using non-rigid registration (Fig. 2). Prior to registration, each “2D + time” image sequence corresponding to each of the tri-plane views is first “prepared” by identifying a region of interest-based “bounding box” centered on the features that belong to the LV. To ensure the chosen “bounding box” spans the entire LV including blood-pool, myocardium, and surrounding region, this window is selected based on the high confidence blood pool mask obtained after the application of the high spread Gaussian-derivative filter employed in Sect. 2.1 to the entire image sequence, followed by an isotropic dilation to ensure full coverage beyond the LV myocardial boundary. Moreover, the mitral valve region is “trimmed” using a straight line joining the leaflet hinges.
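The region-of-interest selection can be sketched as follows, assuming the high-confidence blood-pool mask is available as a list of rows of 0/1 values. Padding the tight bounding box by a fixed margin is used here as a simple stand-in for the isotropic dilation; the function name and the `margin` parameter are illustrative and do not come from the original implementation.

```python
def padded_bounding_box(mask, margin):
    """Bounding box of the non-zero mask pixels, grown by `margin` pixels.

    Returns (row_min, row_max, col_min, col_max), all inclusive and clipped
    to the image extent, or None when the mask contains no foreground pixel.
    Assumes `mask` is a non-empty list of equally long rows.
    """
    n_rows, n_cols = len(mask), len(mask[0])
    rows = [r for r, row in enumerate(mask) if any(row)]
    if not rows:
        return None
    cols = [c for c in range(n_cols) if any(row[c] for row in mask)]
    return (
        max(rows[0] - margin, 0),
        min(rows[-1] + margin, n_rows - 1),
        max(cols[0] - margin, 0),
        min(cols[-1] + margin, n_cols - 1),
    )
```

The same box would then be applied to every frame of the corresponding view so that all frames entering the registration share one window.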
Non-rigid Registration Algorithm: The employed registration algorithm is a modified version of the biomechanics-based algorithm proposed by Lamash et al. [15]. The LV anatomy is modeled as a two-compartment model consisting of muscle (linear elastic, isotropic, and incompressible) and blood-pool, with prescribed smoothness constraints to allow rapid motion of the endocardial contour. We initialize the algorithm by first discretizing the endocardial and epicardial contours, then constructing a mesh of the blood-pool and myocardium. Rather than resorting to a rectangular grid, we account for the local curvature of the endocardial border using a finite-element-like mesh defined via linear shape functions. The algorithm deforms the mesh by estimating the required deforming forces that minimize the sum of squared differences between the initial and target images (Fig. 3). To avoid large deformations and ensure a smooth displacement field, a linear elastic regularization approach [16] is utilized.
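The similarity term alone can be illustrated with a deliberately simplified sketch. It evaluates the sum of squared differences (SSD) between 1D intensity profiles and searches integer shifts by brute force. This is not the biomechanics-based mesh registration of [15]: there is no mesh, no deforming force estimation, and no elastic regularization, and all names are illustrative.

```python
def ssd(moving, target):
    """Sum of squared intensity differences between two equal-length profiles."""
    return sum((m - t) ** 2 for m, t in zip(moving, target))


def shift(profile, offset, fill=0.0):
    """Translate a profile by an integer offset, padding with `fill`."""
    n = len(profile)
    return [profile[i - offset] if 0 <= i - offset < n else fill
            for i in range(n)]


def best_shift(moving, target, max_offset):
    """Integer offset in [-max_offset, max_offset] that minimizes the SSD."""
    candidates = range(-max_offset, max_offset + 1)
    return min(candidates, key=lambda d: ssd(shift(moving, d), target))
```

In the actual framework the single offset is replaced by a displacement at every mesh node, and the regularization term penalizes displacement fields that are not smooth.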
2.3 3D LV Volume Reconstruction
Following the segmentation of each of the tri-plane views at end-diastole using the technique in Sect. 2.1 and their propagation throughout the cardiac cycle, the resulting images are re-inserted into a pseudo-3D image volume along the same orientation at which they were originally acquired (i.e., 60\(^\circ \) apart) corresponding to each cardiac phase. The boundary points of each segmented contour at the same elevation are then fitted using the parametric variational cubic spline technique in [17]. The spline interpolated data is used to generate a convex hull using the algorithm proposed in [18] (Fig. 4).
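The re-insertion step can be sketched as a rotation of each in-plane contour about a shared axis. The sketch assumes that the three planes intersect along a common axis (taken as z), that each contour point is given as a signed lateral distance from that axis together with a depth, and that the planes sit at 0, 60 and 120 degrees. These coordinate conventions and function names are illustrative assumptions; the spline fitting [17] and convex hull [18] steps are not shown.

```python
import math


def plane_to_3d(contour_2d, angle_deg):
    """Map (lateral, depth) points of one view to (x, y, z) coordinates."""
    theta = math.radians(angle_deg)
    return [(r * math.cos(theta), r * math.sin(theta), z)
            for r, z in contour_2d]


def assemble_triplane(contours, angles_deg=(0.0, 60.0, 120.0)):
    """Collect the contour points of all views into one 3D point list."""
    points = []
    for contour, angle in zip(contours, angles_deg):
        points.extend(plane_to_3d(contour, angle))
    return points
```

Points that share the same z value across the three views form the per-elevation sets to which a closed curve would be fitted.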
3 Evaluation and Results
We conducted a preliminary study using retrospective tri-plane time series data spanning multiple cardiac cycles from patients who underwent TEE imaging for cardiac function assessment. Since the proposed framework encompasses three different components (automatic extraction of endocardial features, registration-based feature tracking and propagation, and volume reconstruction), we assessed the performance of each component against the ground truth, which consists of the blood-pool representation annotated manually by the expert clinician using the GE EchoPac PC clinical software. In addition, we also evaluated the performance at each stage of our application running in MATLAB on an Intel\(^\circledR \) Xeon\(^\circledR \) 3.60 GHz PC with 32 GB RAM.
Automatic Direct Frame Endocardial Feature Extraction Evaluation: We first evaluated the accuracy of our automatic, direct frame endocardial feature extraction component against expert manual annotation of the same features from the same frames performed by a cardiologist using the GE EchoPac PC clinical software. Table 1 summarizes the blood-pool area measurements annotated by the expert (Ground Truth) and the area obtained via A — automatic feature detection from individual frames; B — single phase automated feature detection + registration-based propagation; and C — single phase expert manual annotation + registration-based propagation. Measurements are evaluated at two cardiac phases — end-diastole (ED) and end-systole (ES) — and averaged across all views and multiple cardiac cycles spanned by the acquired sequences. Our automatic blood-pool extraction technique required 26.5 s to segment a “2D + time” 15 frame TEE tri-plane sequence.
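One way to obtain a blood-pool area from a closed contour is the shoelace formula, sketched below together with a signed percentage difference relative to a reference area. Whether the reported areas were computed from contours or by counting mask pixels is not stated here, so this is an assumption, and the function names are illustrative.

```python
def polygon_area(contour):
    """Area enclosed by a closed contour given as ordered (x, y) vertices."""
    n = len(contour)
    twice_area = 0.0
    for i in range(n):
        x0, y0 = contour[i]
        x1, y1 = contour[(i + 1) % n]   # wrap around to close the polygon
        twice_area += x0 * y1 - x1 * y0
    return abs(twice_area) / 2.0


def percent_area_difference(area, reference_area):
    """Signed difference of `area` from `reference_area`, in percent."""
    return 100.0 * (area - reference_area) / reference_area
```

Areas computed in pixel units would need to be scaled by the squared pixel spacing before being compared with measurements exported from clinical software.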
Registration-Based Blood-Pool Tracking and Propagation Evaluation: To evaluate the accuracy with which the non-rigid registration algorithm propagates the extracted features throughout the cardiac cycle, we employed several metrics, including the Dice similarity coefficient, Hausdorff distance, mean absolute distance error and endocardial target registration error (TRE) computed between the ground truth blood-pool manually annotated by the expert and the blood-pool depicted via the three other methods under consideration (Table 2).
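The overlap and distance metrics can be sketched as follows, assuming masks are given as collections of pixel coordinates and contours as lists of points. The mean absolute distance is taken here as the symmetric average of nearest-neighbor distances, which is one common definition and may differ from the exact variant used in this evaluation. Function names are illustrative.

```python
import math


def dice(mask_a, mask_b):
    """Dice coefficient between two masks given as iterables of pixel coords."""
    a, b = set(mask_a), set(mask_b)
    if not a and not b:
        return 1.0
    return 2.0 * len(a & b) / (len(a) + len(b))


def directed_distances(points_a, points_b):
    """Distance from every point in A to its nearest point in B."""
    return [min(math.dist(p, q) for q in points_b) for p in points_a]


def hausdorff(points_a, points_b):
    """Symmetric Hausdorff distance between two point sets."""
    return max(max(directed_distances(points_a, points_b)),
               max(directed_distances(points_b, points_a)))


def mean_absolute_distance(points_a, points_b):
    """Average of the two directed mean nearest-neighbor distances."""
    d_ab = directed_distances(points_a, points_b)
    d_ba = directed_distances(points_b, points_a)
    return 0.5 * (sum(d_ab) / len(d_ab) + sum(d_ba) / len(d_ba))
```

The Hausdorff distance reports the worst local disagreement between two contours, while the mean absolute distance summarizes the typical disagreement, so the two are usually read together.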
Figure 5 visually compares the ground truth blood-pool annotation performed by the expert clinician to that extracted via direct frame feature identification, as well as registration-based propagation of the single-frame blood-pool annotated either manually by the expert or automatically using the first component of our proposed framework. The segmentation propagation technique required 162 s to run through a 15 frame tri-plane TEE sequence.
3D Volume Reconstruction and Ejection Fraction Evaluation: Lastly, we assessed the accuracy of the 3D LV reconstruction procedure by comparing the reconstructed LV volume to that estimated by the GE EchoPac PC clinical software following expert manual segmentation. The end-diastolic and end-systolic volume measurements are summarized in Table 3, along with the corresponding ejection fraction measurements. Performance-wise, the LV volume reconstruction from a tri-plane sequence requires 11.6 s.
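For reference, ejection fraction follows from the two volumes by the standard definition EF = (EDV - ESV) / EDV, expressed as a percentage. A minimal sketch is given below; the function names are illustrative and the volumes used in any call are hypothetical inputs, not measurements from this study.

```python
def stroke_volume(edv, esv):
    """Stroke volume: end-diastolic minus end-systolic volume (same units)."""
    return edv - esv


def ejection_fraction(edv, esv):
    """Ejection fraction in percent from end-diastolic and end-systolic volumes."""
    if edv <= 0:
        raise ValueError("end-diastolic volume must be positive")
    return 100.0 * stroke_volume(edv, esv) / edv
```

Because EF is a ratio, a consistent bias that scales both volumes by the same factor cancels out, whereas errors that affect only one phase do not.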
4 Discussion
We described the implementation and clinical data evaluation of an automatic framework that encompasses well-evaluated filtering, segmentation, registration, and volume reconstruction techniques as a means to provide rapid, robust and accurate feature tracking from multi-plane ultrasound image sequences. All components of the proposed technique (segmentation, registration-based feature tracking and propagation, and 3D blood-pool volume reconstruction) were assessed against expert manual segmentation at both the systolic and diastolic cardiac phases and demonstrated accurate and consistent performance, while reducing user-induced variability. Furthermore, unlike other techniques that operate on 3D datasets, this technique enables rapid and consistent analysis of multi-plane, 2D US image sequences, the standard format for acquisition, interpretation, and analysis of cardiac US images.
As the proposed workflow integrates multiple algorithms, the influence of different parameters on the segmentation result is an important consideration. The frequency specific to the monogenic filter operates over a wide range of values and yields a good quality “cartoon image” for further segmentation. Similarly, for the graph cut algorithm, the mean and standard deviations for the blood pool, muscle and background regions are adaptively extracted from the image content, while the threshold C that constrains the pixels towards the same label can span a sufficiently wide range without significantly affecting the segmentation result. Furthermore, Lamash et al. [15] have thoroughly studied the effects of various regularization parameters in the biomechanics-based registration; for our purpose we selected the optimal parameters suggested in [15]. In summary, the proposed workflow yields a consistent segmentation result over a wide range of parameter values.
Unlike expert manual segmentation that is highly sensitive to intra- and inter-observer variability, the proposed technique provides a consistent result for each dataset, which can be reviewed and improved, if needed, by expert clinicians. The single-phase feature extraction, followed by tracking and propagation via registration further reduces uncertainty, avoiding the need to segment each frame independently by using the a priori frame information along with the image sequence to achieve optimal segmentation. Hence, should the expert clinician choose to perform any adjustments to the single-phase segmentation, their precise tracking and propagation throughout the cardiac cycle is guaranteed by the registration-based implementation.
5 Summary and Future Work
The impact and contribution of the proposed work is the integration of several image processing techniques (i.e., phase-based filtering, segmentation, registration and volume reconstruction) into a streamlined workflow that utilizes traditional standard of care images and fits seamlessly within the current workflows associated with both cardiac function assessment and intra-operative cardiac intervention guidance and monitoring.
Ongoing and future efforts include further evaluation and demonstration of how the proposed technique can cater to dynamically reconstructing 3D endocardial LV representations that facilitate computer-assisted assessment of stroke volume and ejection fraction, as well as employing intra-operative multi-plane 2D TEE data to dynamically update and animate CT and/or MRI anatomy depicted pre-operatively to better represent the intra-operative conditions. Lastly, although we believe the most meaningful assessment is still against the expert clinician's analysis of the same input data, we acknowledge the importance of assessing the output of our proposed framework against the output of other techniques and of extending the analysis to a large dataset of multi-plane image sequences acquired across multiple cardiac cycles.
Besides its direct application to computer-aided cardiac function assessment, the proposed framework is readily adaptable to the guidance and monitoring of image-guided cardiac interventions, most of which involve the use of real-time ultrasound imaging, the clinical standard of care for cardiac procedures.
References
1. Noble, J., Boukerroui, D.: Ultrasound image segmentation: a survey. IEEE Trans. Med. Imaging 25(8), 987–1010 (2006)
2. Mishra, A., Dutta, P., Ghosh, M.: A GA based approach for boundary detection of left ventricle with echocardiographic image sequences. Image Vis. Comput. 21(11), 967–976 (2003)
3. Mignotte, M., Meunier, J.: A multiscale optimization approach for the dynamic contour-based boundary detection issue. Comput. Med. Imaging Graph. 25(3), 265–275 (2001)
4. Bernard, O., Friboulet, D., Thevenaz, P., Unser, M.: Variational B-spline level-set: a linear filtering approach for fast deformable model evolution. IEEE Trans. Image Process. 18(6), 1179–1191 (2009)
5. Cremers, D., Osher, S.J., Soatto, S.: Kernel density estimation and intrinsic alignment for shape priors in level set segmentation. Int. J. Comput. Vis. 69(3), 335–351 (2006)
6. Cootes, T.F., Taylor, C.J., Cooper, D.H., Graham, J.: Active shape models-their training and application. Comput. Vis. Image Underst. 61(1), 38–59 (1995)
7. Bosch, J., Mitchell, S., Lelieveldt, B., Nijland, F., Kamp, O., Sonka, M., Reiber, J.: Automatic segmentation of echocardiographic sequences by active appearance motion models. IEEE Trans. Med. Imaging 21(11), 1374–1383 (2002)
8. Georgescu, B., Zhou, X., Comaniciu, D., Gupta, A.: Database-guided segmentation of anatomical structures with complex appearance. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, vol. 2, pp. 429–436 (2005)
9. Jyh Herng Wu, E., De Andrade, M.L., Nicolosi, D.E., Pontes Jr., S.C.: Artificial neural network: border detection in echocardiography. Med. Biol. Eng. Comput. 46(9), 841–848 (2008)
10. Mulet-Parada, M., Noble, J.A.: 2D + T acoustic boundary detection in echocardiography. Med. Image Anal. 4, 21–30 (2000)
11. Boykov, Y., Veksler, O., Zabih, R.: Fast approximate energy minimization via graph cuts. IEEE Trans. Pattern Anal. Mach. Intell. 23, 1222–1239 (2001)
12. Kovesi, P.: Symmetry and asymmetry from local phase. In: Proceedings of 10th Australian Joint Conference Artificial Intelligence, pp. 2–4 (1997)
13. Rajpoot, K., Grau, V., Noble, J.A.: Local-phase based 3D boundary detection using monogenic signal and its application to real-time 3-D echocardiography images. In: Proceedings of IEEE International Symposium Biomedical Imaging, pp. 783–786 (2009)
14. Uzkent, B., Hoffman, M.J., Cherry, E., Cahill, N.: In: Proceedings of IEEE Western NY Image Signal Processing Workshop, pp. 47–51 (2014)
15. Lamash, Y., Fischer, A., Carasso, S., Lessick, J.: Strain analysis from 4D cardiac CT image data. IEEE Trans. Biomed. Eng. 62, 511–521 (2015)
16. Zitova, B., Flusser, J.: Image registration methods: a survey. Image Vis. Comput. 21, 977–1000 (2003)
17. Lee, E.T.Y.: Choosing nodes in parametric curve interpolation. Comput. Aided Des. 21, 363–370 (1989)
18. Barber, C.B., Dobkin, D.P., Huhdanpaa, H.: The quickhull algorithm for convex hulls. ACM Trans. Math. Softw. 22, 469–483 (1996)
Acknowledgments
The authors would like to acknowledge Dr. Nathan Cahill for sharing his technical expertise and Aditya Daryanani for his help with image segmentation. In addition, we acknowledge funding support from the Kate Gleason Research Fund and the RIT College of Engineering Faculty Development Grant.
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Dangi, S., Ben-Zikri, Y.K., Lamash, Y., Schwarz, K.Q., Linte, C.A. (2015). Automatic LV Feature Detection and Blood-Pool Tracking from Multi-plane TEE Time Series. In: van Assen, H., Bovendeerd, P., Delhaas, T. (eds) Functional Imaging and Modeling of the Heart. FIMH 2015. Lecture Notes in Computer Science(), vol 9126. Springer, Cham. https://doi.org/10.1007/978-3-319-20309-6_4
DOI: https://doi.org/10.1007/978-3-319-20309-6_4
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-20308-9
Online ISBN: 978-3-319-20309-6
eBook Packages: Computer Science (R0)