Combating Uncertainty with Novel Losses for Automatic Left Atrium Segmentation

Yang, Xin; Wang, Na; Wang, Yi; Wang, Xu; Nezafat, Reza; Ni, Dong; Heng, Pheng-Ann

doi:10.1007/978-3-030-12029-0_27

Xin Yang^20,23,
Na Wang^21,22,
Yi Wang^21,22,
Xu Wang^21,22,
Reza Nezafat²³,
Dong Ni^21,22 &
…
Pheng-Ann Heng²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11395))

Included in the following conference series:

International Workshop on Statistical Atlases and Computational Models of the Heart

2142 Accesses
15 Citations

Abstract

Segmenting left atrium in MR volume holds great potentials in promoting the treatment of atrial fibrillation. However, the varying anatomies, artifacts and low contrasts among tissues hinder the advance of both manual and automated solutions. In this paper, we propose a fully-automated framework to segment left atrium in gadolinium-enhanced MR volumes. The region of left atrium is firstly automatically localized by a detection module. Our framework then originates with a customized 3D deep neural network to fully explore the spatial dependency in the region for segmentation. To alleviate the risk of low training efficiency and potential overfitting, we enhance our deep network with the transfer learning and deep supervision strategy. Main contribution of our network design lies in the composite loss function to combat the boundary ambiguity and hard examples. We firstly adopt the Overlap loss to encourage network reduce the overlap between the foreground and background and thus sharpen the predictions on boundary. We then propose a novel Focal Positive loss to guide the learning of voxel-specific threshold and emphasize the foreground to improve classification sensitivity. Further improvement is obtained with an recursive training scheme. With ablation studies, all the introduced modules prove to be effective. The proposed framework achieves an average Dice of $92.24\%$ in segmenting left atrium with pulmonary veins on 20 testing volumes.

X. Yang and N. Wang—Authors contribute equally to this work.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Segmentation and visualization of left atrium through a unified deep learning framework

Article 26 February 2020

A Benchmarking Study of Deep Learning Approaches for Bi-Atrial Segmentation on Late Gadolinium-Enhanced MRIs

3D shallow deep neural network for fast and precise segmentation of left atrium

Article 27 March 2021

1 Introduction

Cardiovascular diseases are keeping as the leading causes of death in the world. About half of the diagnosed cases suffer from the atrial fibrillation caused stroke [8]. MR, especially the Gadolinium Enhancement MR (GE-MR), is a dominant imaging modality to localize scars and provide guidance for ablation therapy [6]. Segmenting the left atrium with associated pulmonary veins and reconstructing its patient-specific anatomy are beneficial to optimize the therapy plan and reduce the risk of intervention. However, manual delineation tends to be time-consuming and presents low inter- and intra-expert reproducibilities.

Automatically segmenting left atrium with pulmonary veins is a nontrivial task. As shown in Fig. 1, left atrium and pulmonary veins vary greatly in shape and size across subjects. Affected by the artifacts and differences in gadolinium dose, appearance of atrium and pulmonary veins can change dramatically. Due to the thin myocardial wall of left atrium and the low contrast between left atrium and other surroundings, the boundary of left atrium and pulmonary veins are hard and ambiguous for computer to recognize. Conquering the complex anatomical variance and boundary ambiguity are the main concerns of this work.

Cardiovascular image segmentation is an active research area. Deformable models were proposed to segment chambers and vessels [16]. Other popular streams are the multi-atlas [1] and non-rigid registration based methods [17]. However, those methods bear the difficulties in designing boundary descriptors and generalizing the model trained on limited data to unseen deformable cases [10]. Deep neural network rapidly emerges and pushes the upper-bound of cardiovascular image segmentation [3, 7, 14]. Whereas, limited by computational resources, most methods exploit 2D architectures which ignore the global spatial information in volume and the segmentation consistency across slices. Also, few works paid attention to the importance of loss function design. Hybrid loss function combining Dice loss and weighted cross entropy loss can address class-imbalance and preserve both concrete and branchy structures in segmentation [15]. Loss function enforcing the low overlap between foreground and background, denoted as Overlap loss, presents potential in promoting segmentation [13].

In this paper, we propose a fully-automated framework to segment left atrium and pulmonary veins in gadolinium-enhanced MR volumes. We firstly deploy a detection module to accurately localize the left atrium region in the raw volume. We then propose our customized segmentation network in 3D fashion. Transfer learning and dense deep supervision strategy are involved to alleviate the risk of low training efficiency and potential overfitting. To effectively guide our network combat the boundary ambiguity and hard examples, we introduce a composite loss function which consists of two complementary components: (1) the Overlap loss to encourage network reduce the intersection between foreground and background probability maps and thus make the predictions on boundary less fuzzy, (2) a novel Focal Positive loss to guide the learning of voxel-specific threshold and emphasize the foreground to improve classification sensitivity. Based on the design, we obtain further improvement by fine-tuning our network with a recursive scheme. With ablation studies, all the introduced modules prove to be effective. The proposed framework achieves an average Dice of $92.24\%$ in segmenting left atrium with pulmonary veins on 20 testing volumes.

2 Methodology

Figure 2 is the schematic illustration of our proposed framework. The localized left atrium ROI serves as the input of our customized 3D network. Our network generates an intermediate segmentation and also a single channel to represent the foreground probability map. The foreground map then merges with original ROI and goes through the next refinement level. The last recursive level outputs the final segmentation of left atrium and pulmonary veins.

2.1 Localization of Left Atrium Region

To exclude the noise from background and narrow down the searching area, we propose to exploit a Faster-RCNN [9] network to localize the ROI of atrium and pulmonary veins. We train our Faster-RCNN with 2D slices and bounding boxes. Testing MR volume is firstly split into slices along a canonical axis and all the localized ROI in these slices are then merged into a 3D bounding box. Our ROI detection module achieves $100\%$ accuracy in hitting the atrium area.

2.2 Enhance the Training of 3D Network

Shown as Fig. 2, we customize a 3D U-net from [4] and take it as the workhorse. To fit our proposed loss function design, we discard the last softmax layer and only output a single channel of foreground probability. We enhance the training of our network from the following three aspects.

Transfer Learning in 3D Fashion: Leveraging knowledges of well-trained model can improve the generalization ability of networks [2]. Equipped with 3D convolutions to extract spatial-temporal features, C3D model proposed in [11] is proper for the transfer learning in 3D networks. Therefore, we apply the parameters of layers conv1, conv2, conv3a, conv3b, conv4a and conv4b in C3D model to initialize the downsampling path of our customized 3D U-net. All the layers are then fine-tuned for our atrium segmentation task.

Dense Deep Supervision (DDS): Introducing deep supervision mechanism is effective in addressing the gradient vanishing problem faced by deep networks [5]. As shown in Fig. 2, deep supervision adds auxiliary side-paths and thus exposes shallow layers to extra supervisions. Let $\mathcal {X}^{w\times h\times d}$ be the ROI input, W be the parameters of main network, $w = (w^1, w^2, ..., w^{S})$ be the parameters of side-paths and $w^s$ denotes the parameters of the $s^{th}$ side-path. $\tilde{\mathcal {L}}$ and $\mathcal {L}_s$ are main loss function and loss function in $s^{th}$ side-path, respectively. Components of $\tilde{\mathcal {L}}$ and $\mathcal {L}_s$ are further explained in following sections. In this work, we extend the deep supervision in a dense form, that is, we attach auxiliary side-paths in both down- and up-sampling branches and finally 6 losses in total. The final loss function $\mathcal {L}$ for our 3D U-net with dense deep supervision is elaborated in Eq. 1, where $\beta _s$ ($s\in (1,2, ..., 6)$) is the weight of different side-paths.

$$\begin{aligned} \mathcal {L}(\mathcal {X};\mathcal {W},w) = \tilde{\mathcal {L}}(\mathcal {X};\mathcal {W}) + \sum _{s \in S}{\beta _s\mathcal {L}_s(\mathcal {X};\mathcal {W},w^s)} + \lambda (||\mathcal {W}||^2 + \sum _{s \in S}{||w^s||^2}) \end{aligned}$$

(1)

Dice Loss to Address Class Imbalance (DCL): Class imbalance can bias the traditional loss function and thus make network ignore minor classes [15]. Dice coefficient based loss is becoming a promising choice in addressing the problem and present more clean predictions. In this work, we adopt Dice loss as a basic component for main loss and all auxiliary losses.

2.3 Composite Loss Against Classification Uncertainty

Overlap Loss (OVL): Boundary ambiguity raises uncertainties for background-foreground classification. The uncertainty can be observed from the fuzzy areas in predicted probability maps. Enlarging the gap between background and foreground predictions can suppress this kind of uncertainty. In this work, we adopt the Overlap loss (OVL) [13] to measure this kind of gap. OVL loss is a basic component of main loss and all auxiliary losses. OVL loss is defined as follows,

$$\begin{aligned} \mathcal {L}_{ovl}(\mathcal {W},w^{s})=\sum _{i=1}^{\left| \mathcal {X}\right| }(P(y_i=1|\mathcal {X};\mathcal {W},w^{s}) * P(y_i=0|\mathcal {X};\mathcal {W},w^{s})), \end{aligned}$$

(2)

where P is the predicted probability maps for foreground and background, $*$ is basic multiplication. By minimizing the Overlap loss, our network is pushed to learn more discriminative features to distinguish background and foreground regions and thus gain confidence in recognizing ambiguous boundary locations.

Focal Positive Loss (FPL): OVL loss focuses on enlarging the gap between foreground and background predictions. Whereas, accurately extracting foreground object, i.e. atrium and pulmonary veins, is our final task. Thus, emphasizing the foreground should be further considered. To this regard, we add a threshold map (TM) layer after probability map at the end of the main network to adaptively regularize the foreground probability map. Our network can learn to tune the TM layer and obtain voxel-specific thresholds. Finally, the TM layer can suppress weak predictions and only preserve strong positive predictions. To train the TM layer, we introduce a novel loss function, i.e. Focal Positive loss, which is derived in a differentiable form as follows:

$$\begin{aligned} \mathcal {L}_{fpl}(\mathcal {W},w^{s})=1-\frac{2* \left| Mask(y_i=1|\mathcal {X};\mathcal {W},w^{(t)}) * Y \right| }{\left| Mask(y_i;\mathcal {W},w^{s})\right| +\left| Y \right| }, \end{aligned}$$

(3)

$$\begin{aligned} Mask(y_i;\mathcal {W},w^{s})=1/(1+e^{-tmp}), \end{aligned}$$

(4)

$$\begin{aligned} tmp=\left\{ \begin{array} {rcl} &{}P(y_i=1|\mathcal {X};\mathcal {W},w^{s}),&{}{P(y_i)>Threshold\_Map(y_i)} \\ &{}-\infty ,&{}else \end{array} \right. . \end{aligned}$$

(5)

By minimizing $\mathcal {L}_{fpl}$, our network can learn to enforce strong positive predictions in foreground areas defined by ground truth. $\mathcal {L}_{fpl}$ is only attached in main network. In summary, our network is trained with DCL and OVL losses existing in main network and auxiliary paths, and also the FPL loss in main network.

2.4 Recursive Refinement Scheme (RRS)

Probability map contains more explicit context information for segmentation than the raw MR volume. Revisiting the probability map to explore context cues for refinement is a classical strategy, like Auto-Context [12]. The core idea of Auto-Context is to stack a series of models in a way that, the model at level k not only utilizes the appearance features in intensity image, but also the contextual features extracted from the prediction map generated by the model at level $k-1$. The general recursive process of an Auto-Context scheme is $\hat{y}^{k} = \mathcal {F}^k(\mathcal {J}(x, \hat{y}^{k-1}))$, where $\mathcal {F}^{k}$ is the model at level k, x and $\hat{y}^{k}$ are the intensity image and prediction map from level $k-1$, respectively. $\mathcal {J}$ is a join operator to combine information from x and $\hat{y}^{k}$. Generally, $\mathcal {J}$ is a concatenation operator. In this work, we modify $\mathcal {J}$ as a element-wise summation operator. Summation $\mathcal {J}$ enables us to reuse all the parameters of level $k-1$ in level k for fine-tune. In addition, summarizing intensity image with prediction map is intuitive to highlight the target anatomical structures and also suppress the irrelevant background noise.

Table 1. Quantitative evaluation of our proposed method

Full size table

3 Experimental Results

Experiment Materials: We evaluated our method on the Atrial Segmentation Challenge 2018 dataset. We split the whole dataset (100 samples with ground truth annotation) into training set (80 volumes) and testing set (20 volumes). All the training and testing samples are normalized as zero mean and unit variance before inputting into network. We augmented the training dataset with random flipping, rotation and 3D elastic deform.

Implementation Details: We implemented our framework in Tensorflow, using 4 NVIDIA GeForce GTX TITAN Xp GPUs. We update the parameters of network with an Adam optimizer (batch size=1, initial learning rate is 0.001). Randomly cropped $144\times 64\times 144$ sub-volumes serve as input to our network. To avoid shallow layers being over-tuned during fine-tuning, we set smaller initial learning rate for conv1, conv2, conv3a, conv3b, conv4a and conv4b as 1e−6, 1e−6, 1e−5, 1e−5, 1e−4, 1e−4. We adopt sliding window with proper overlap ratio and overlap-tiling stitching strategies to generate predictions for the whole volume, and remove small isolated connected components in final segmentation result.

Quantitative and Qualitative Analysis: We use 5 metrics to evaluate the proposed framework, including Dice, Conform, Jaccard, Average Distance of Boundaries (Adb) and Hausdorff Distance of Boundaries (Hdb). We take the customized 3D U-net with basic cross entropy loss (CEL) as a baseline. We conduct intensive ablation study on our introduced modules, including DCL, DDS, OVL, FPL and RRS. All of the compared methods share the same basic 3DU-net architecture. Table 1 illustrate the detailed quantitative comparisons among different module combinations. For simplicity, all compared methods are denoted with the main module names. Both the DCL and DDS brings improvement over the traditional CEL. Significant improvements firstly occurs as we inject the Overlap loss. FPL further boosts the segmentation about 2% in Dice. Improvements brought by OVL and FPL are also verified as we visualize the foreground probability maps in Fig. 3. The foreground probability map becomes more sharp and noise-free as OVL and FPL involves. Mining context cues in probability maps with RRS contributes about 1% in Dice as two RRS levels (RRS2) are utilized. We only adopt two RRS levels to balance the performance gain and computation burden. Finally, DCL-DDS-OVL-FPL-RRS2 achieves the best performance in almost all metrics. We also visualize the segmentation results with ground truth in Fig. 4. Our method conquers complex variance of left atrium and pulmonary veins and achieves promising performance.

4 Conclusion

In this paper, we present a fully automatic framework for left atrium segmentation and GE-MR volumes. Originating with a network in 3D fashion to better tackle complex variances in shape and size of left atrium, we present our main contributions in introducing and verifying the composite loss which is effective in combating the boundary ambiguity and hard examples. We also propose a modified recursive scheme for successive refinement. As extensively validated on a large dataset, our proposed framework and modules prove to be promising for the left atrium segmentation in GE-MR volumes.

References

Bai, W., Shi, W., Ledig, C., Rueckert, D.: Multi-atlas segmentation with augmented features for cardiac MR images. Med. Image Anal. 19(1), 98–109 (2015)
Article Google Scholar
Chen, H., Ni, D., Qin, J., et al.: Standard plane localization in fetal ultrasound via domain transferred deep neural networks. IEEE JBHI 19(5), 1627–1636 (2015)
Google Scholar
Chen, J., Yang, G., Gao, Z., et al.: Multiview two-task recursive attention model for left atrium and atrial scars segmentation. arXiv preprint arXiv:1806.04597 (2018)
Çiçek, Ö., Abdulkadir, A., et al.: 3d u-net: Learning dense volumetric segmentation from sparse annotation. arXiv preprint arXiv:1606.06650 (2016)
Dou, Q., Yu, L., et al.: 3D deeply supervised network for automated segmentation of volumetric medical images. Med. Image Anal. 41, 40–54 (2017)
Article Google Scholar
McGann, C., Akoum, N., et al.: Atrial fibrillation ablation outcome is predicted by left atrial remodeling on MRI. Circ. Arrhythm. Electrophysiol. 7, 23–30 (2013)
Article Google Scholar
Mortazi, A., Karim, R., Rhode, K., Burt, J., Bagci, U.: CardiacNET: segmentation of left atrium and proximal pulmonary veins from MRI using multi-view CNN. In: Descoteaux, M., Maier-Hein, L., Franz, A., Jannin, P., Collins, D.L., Duchesne, S. (eds.) MICCAI 2017, Part II. LNCS, vol. 10434, pp. 377–385. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-66185-8_43
Chapter Google Scholar
Peng, P., Lekadir, K., et al.: A review of heart chamber segmentation for structural and functional analysis using cardiac magnetic resonance imaging. Magn. Reson. Mater. Phys. Biol. Med. 29(2), 155–195 (2016)
Article Google Scholar
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: NIPS, pp. 91–99 (2015)
Google Scholar
Tobon-Gomez, C., Geers, A.J., et al.: Benchmark for algorithms segmenting the left atrium from 3D CT and MRI datasets. IEEE TMI 34(7), 1460–1473 (2015)
Google Scholar
Tran, D., Bourdev, L., Fergus, R., Torresani, L., Paluri, M.: Learning spatiotemporal features with 3D convolutional networks. In: ICCV, pp. 4489–4497 (2015)
Google Scholar
Tu, Z., Bai, X.: Auto-context and its application to high-level vision tasks and 3D brain image segmentation. IEEE TPAMI 32(10), 1744–1757 (2010)
Article Google Scholar
Wang, Z., et al.: Automated detection of clinically significant prostate cancer in mp-MRI images based on an end-to-end deep neural network. IEEE TMI 37, 1127–1139 (2018)
Google Scholar
Yang, G., Zhuang, X., Khan, H., et al.: A fully automatic deep learning method for atrial scarring segmentation from late gadolinium-enhanced MRI images. In: ISBI 2017, pp. 844–848. IEEE (2017)
Google Scholar
Yang, X., Bian, C., Yu, L., Ni, D., Heng, P.-A.: Hybrid loss guided convolutional networks for whole heart parsing. In: Pop, M., et al. (eds.) STACOM 2017. LNCS, vol. 10663, pp. 215–223. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-75541-0_23
Chapter Google Scholar
Zheng, Y., et al.: Multi-part modeling and segmentation of left atrium in C-arm CT for image-guided ablation of atrial fibrillation. IEEE TMI 33(2), 318–331 (2014)
Google Scholar
Zhuang, X., et al.: A registration-based propagation framework for automatic whole heart segmentation of cardiac MRI. IEEE TMI 29(9), 1612–1625 (2010)
Google Scholar

Download references

Acknowledgments

The work in this paper was supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region (Project no. GRF 14225616), a grant from National Natural Science Foundation of China under Grant 61571304, Grant 81771598, and Shenzhen Peacock Plan under Grant KQTD2016053112051497.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China
Xin Yang & Pheng-Ann Heng
National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen, China
Na Wang, Yi Wang, Xu Wang & Dong Ni
Medical UltraSound Image Computing (MUSIC) Lab, Shenzhen, China
Na Wang, Yi Wang, Xu Wang & Dong Ni
Department of Medicine (Cardiovascular Division), Beth Israel Deaconess Medical Center and Harvard Medical School, Boston, USA
Xin Yang & Reza Nezafat

Authors

Xin Yang
View author publications
You can also search for this author in PubMed Google Scholar
Na Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yi Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Reza Nezafat
View author publications
You can also search for this author in PubMed Google Scholar
Dong Ni
View author publications
You can also search for this author in PubMed Google Scholar
Pheng-Ann Heng
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xin Yang .

Editor information

Editors and Affiliations

University of Toronto, Toronto, ON, Canada
Mihaela Pop
Inria, Epione Group, Sophia-Antipolis, France
Maxime Sermesant
Auckland University, Auckland, New Zealand
Jichao Zhao
University of Western Ontario, London, ON, Canada
Shuo Li
GE Healthcare, Oslo, Norway
Kristin McLeod
King’s College London, London, UK
Alistair Young
King’s College London, London, UK
Kawal Rhode
Siemens Medical Solutions USA, Inc., Princeton, NJ, USA
Tommaso Mansi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yang, X. et al. (2019). Combating Uncertainty with Novel Losses for Automatic Left Atrium Segmentation. In: Pop, M., et al. Statistical Atlases and Computational Models of the Heart. Atrial Segmentation and LV Quantification Challenges. STACOM 2018. Lecture Notes in Computer Science(), vol 11395. Springer, Cham. https://doi.org/10.1007/978-3-030-12029-0_27

Download citation

DOI: https://doi.org/10.1007/978-3-030-12029-0_27
Published: 14 February 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-12028-3
Online ISBN: 978-3-030-12029-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us