Fig. 1.

Illustration of the anatomical variability of the fetal brain across gestational ages and diagnoses. 1: Control (22 weeks); 2: Control (26 weeks); 3: Control (29 weeks); 4: Spina bifida (19 weeks); 5: Spina bifida (26 weeks); 6: Spina bifida (32 weeks); 7: Dandy-Walker malformation with corpus callosum abnormality (23 weeks); 8: Dandy-Walker malformation with ventriculomegaly and periventricular nodular heterotopia (27 weeks); 9: Aqueductal stenosis (34 weeks).

1 Introduction

The segmentation of fetal brain tissues in MRI is essential for the study of abnormal fetal brain development [2]. Segmentation of fetal brain structures could also support the evaluation and prediction of surgical outcome for open spina bifida [1, 4, 16, 21, 22]. Accurate and automatic methods for fetal brain segmentation are necessary because manual segmentation is very time-consuming and suffers from high inter- and intra-rater variability. Recently, deep neural network-based methods for fetal brain T2w MRI segmentation have been proposed [7, 8, 15, 18, 19]. On average, deep learning currently achieves state-of-the-art segmentation performance. However, these studies do not specifically evaluate generalization and robustness when the models are applied to fetuses with a pathological central nervous system.

Datasets used to train deep neural networks typically contain underrepresented subsets of cases. These subsets are not specifically accounted for by the training algorithms currently used for deep neural networks. This problem has been referred to as hidden stratification [17]. Hidden stratification has been shown to lead to deep learning models with good average performance but poor performance on some clinically relevant subsets of the population [17]. While it uncovers the issue, the study of [17], which is limited to classification, does not investigate its cause or propose a method to mitigate it. Cases with abnormal fetal brain development are likely to suffer from hidden stratification effects for two reasons: 1) the presence of abnormalities exacerbates the anatomical variability of the fetal brain between 18 and 38 weeks of gestation, as illustrated in Fig. 1; and 2) the prevalence of those diseases is typically below 1/1000 births [1].

In this work, we study the problem of hidden stratification in fetal brain MRI segmentation with deep learning. We claim that the methodology currently used to train deep neural networks, namely maximizing the average performance across the training volumes, is at the root of the hidden stratification problem. Instead of the average empirical risk, training safe and robust deep learning models requires an asymmetric measure of risk that gives higher weight to the cases on which the algorithm fails (hard examples). The percentile, also known as the value-at-risk, is such a measure of risk and has even been adopted in industry regulations [13]. Given an algorithm and a per-volume fetal brain MRI segmentation metric such as the Dice score, the percentile at \(5\%\) is the value of the score below which \(5\%\) of the cases fall, i.e. perform worse than the percentile. The percentile relates to hidden stratification effects as it informs us of how badly the worst-case examples perform. Our contributions are four-fold. 1) We empirically show that the state-of-the-art deep learning pipeline nnU-Net [14] trained by maximizing the average segmentation performance leads to clinically significant failures for fetal brain MRI segmentation. 2) We propose to use percentiles of the Dice score on clinically relevant subpopulations as a measure of hidden stratification effects. 3) We propose to train a deep learning network to minimize a percentile of the per-volume loss function. 4) We propose a relaxation of this optimization problem based on distributionally robust optimization that can be solved efficiently in practice. We evaluate the proposed methodology for the automatic segmentation of white matter, ventricles, and cerebellum in fetal brain 3D T2w MRI. We used a total of 368 fetal brain 3D MRIs, including anatomically normal fetuses, fetuses with open spina bifida, and fetuses with other central nervous system pathologies, for gestational ages ranging from 19 weeks to 39 weeks. Our empirical results suggest that the proposed training method based on distributionally robust optimization leads to better percentile values for abnormal fetuses. In addition, our qualitative results show that distributionally robust optimization reduces the number of clinically relevant failures of nnU-Net.

2 Minimization of a Percentile Loss Using Distributionally Robust Optimization

In this section, we study how a deep neural network can be trained to minimize percentiles of the loss function using a distributionally robust optimization (DRO) approach [10].

Standard deep learning training consists of optimizing the parameters \({\boldsymbol{\theta }}\) of a deep neural network \(f(\cdot ;{\boldsymbol{\theta }})\) by minimizing the average per-example loss \({{\,\mathrm{\mathcal {L}}\,}}\)

$$\begin{aligned} \min _{{\boldsymbol{\theta }}} \frac{1}{n} \sum _{i=1}^n {{\,\mathrm{\mathcal {L}}\,}}\left( f({\boldsymbol{x}}_i;{\boldsymbol{\theta }}), {\boldsymbol{y}}_i\right) \end{aligned}$$
(1)

Within this empirical risk minimization framework, \(f(\cdot ;{\boldsymbol{\theta }})\) is typically a Convolutional Neural Network (CNN), \({{\,\mathrm{\mathcal {L}}\,}}\) is a smooth per-volume loss function, and \(\left\{ ({\boldsymbol{x}}_i, {\boldsymbol{y}}_i)\right\} _{i=1}^n\) is the training dataset.

In our case, \({\boldsymbol{x}}_i\) are the input 3D fetal brain T2w MRI volumes and \({\boldsymbol{y}}_i\) are the ground-truth manual segmentations. This approach is the one used to train state-of-the-art deep learning methods for segmentation using stochastic gradient descent [14]. Due to the scarcity and the higher anatomical variability of abnormal cases illustrated in Fig. 1, we cannot assume that the set of all possible fetal brain anatomies is sampled uniformly in the training dataset. However, in (1), all brain volumes are given the same weight equal to \(\frac{1}{n}\).
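For reference, the following is a minimal, illustrative PyTorch sketch of this standard training loop (not the actual nnU-Net code); the uniform shuffling of training volumes is what gives every case the same weight \(\frac{1}{n}\) as in (1). The tensors and the tiny network are toy placeholders.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Toy stand-ins for the fetal brain volumes x_i and manual segmentations y_i.
n, c, d = 16, 1, 8
volumes = torch.randn(n, c, d, d, d)
labels = torch.randint(0, 2, (n, d, d, d))

model = torch.nn.Conv3d(c, 2, kernel_size=3, padding=1)  # placeholder for the CNN f(.; theta)
optimizer = torch.optim.SGD(model.parameters(), lr=1e-2)
loss_fn = torch.nn.CrossEntropyLoss()  # placeholder for the per-volume loss L

# Uniform sampling: every volume is drawn with probability 1/n,
# which is exactly the weighting in Eq. (1).
loader = DataLoader(TensorDataset(volumes, labels), batch_size=4, shuffle=True)

for epoch in range(2):
    for x, y in loader:
        optimizer.zero_grad()
        loss = loss_fn(model(x), y)  # average per-example loss over the batch
        loss.backward()
        optimizer.step()
```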

Instead of the average per-volume loss, for robust and safe segmentation, we argue that it might be more interesting to minimize the percentile \(l_{\alpha }\) at \(\alpha \) (e.g. 5%) of the per-volume loss function. Formally, this corresponds to the minimization problem

$$\begin{aligned} \min _{{\boldsymbol{\theta }},\, l_{\alpha }} \quad l_{\alpha } \qquad \text {such that} \qquad \mathbb {P}\left( {{\,\mathrm{\mathcal {L}}\,}}\left( f({\text {x}};{\boldsymbol{\theta }}), {\text {y}}\right) \ge l_{\alpha } \right) \le \alpha \end{aligned}$$
(2)

where \(\mathbb {P}\) is the empirical distribution defined by the training dataset. In other words, if \(\alpha =0.05\), the optimal \(l_{\alpha }^*({\boldsymbol{\theta }})\) of (2) for a given set of parameters \({\boldsymbol{\theta }}\) is the value of the loss such that the per-volume loss function is worse than \(l_{\alpha }^*({\boldsymbol{\theta }})\) \(5\%\) of the time. As a result, training the deep neural network using (2) corresponds to minimizing the percentile of the per-volume loss function \(l_{\alpha }^*({\boldsymbol{\theta }})\).
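To make the definition concrete, the short NumPy sketch below (illustrative only, not part of the proposed method) computes the empirical value of \(l_{\alpha }\) for a given set of per-volume losses: it is the \((1-\alpha )\) quantile of the loss distribution, i.e. the loss value exceeded by a fraction \(\alpha \) of the cases.

```python
import numpy as np

def loss_percentile(per_volume_losses, alpha=0.05):
    """Empirical l_alpha: the loss value such that a fraction alpha
    of the volumes have a loss greater than or equal to it."""
    losses = np.asarray(per_volume_losses)
    # (1 - alpha) quantile of the per-volume loss distribution.
    return np.quantile(losses, 1.0 - alpha)

# Example: 100 per-volume losses; the 5% hardest cases lie above l_alpha.
rng = np.random.default_rng(0)
losses = rng.normal(loc=0.2, scale=0.05, size=100)
print(loss_percentile(losses, alpha=0.05))
```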

Unfortunately, the minimization problem (2) cannot be solved directly using stochastic gradient descent to train a deep neural network. We now propose a tractable upper bound for \(l_{\alpha }^*({\boldsymbol{\theta }})\) and show that it can be solved in practice using distributionally robust optimization [10].

The Chernoff bound [3] applied to the per-volume loss function and the empirical training data distribution states that for all \(l_{\alpha }\) and \({\beta }>0\)

$$\begin{aligned} \mathbb {P}\left( {{\,\mathrm{\mathcal {L}}\,}}\left( f({\text {x}};{\boldsymbol{\theta }}), {\text {y}}\right) \ge l_{\alpha } \right) \le \frac{\exp \left( -{\beta }l_{\alpha }\right) }{n} \sum _{i=1}^n \exp \left( {\beta }{{\,\mathrm{\mathcal {L}}\,}}\left( f({\boldsymbol{x}}_i;{\boldsymbol{\theta }}), {\boldsymbol{y}}_i\right) \right) \end{aligned}$$
(3)

To link this inequality to the minimization problem (2), we set \({\beta }\) such that

$$\begin{aligned} \alpha&= \frac{\exp \left( -{\beta }\hat{l}_{\alpha }({\boldsymbol{\theta }})\right) }{n} \sum _{i=1}^n \exp \left( {\beta }{{\,\mathrm{\mathcal {L}}\,}}\left( f({\boldsymbol{x}}_i;{\boldsymbol{\theta }}), {\boldsymbol{y}}_i\right) \right) \end{aligned}$$
(4)
$$\begin{aligned} \iff \hat{l}_{\alpha }({\boldsymbol{\theta }})&= \frac{1}{{\beta }} \log \left( \frac{1}{\alpha n} \sum _{i=1}^n \exp \left( {\beta }{{\,\mathrm{\mathcal {L}}\,}}\left( f({\boldsymbol{x}}_i;{\boldsymbol{\theta }}), {\boldsymbol{y}}_i\right) \right) \right) \end{aligned}$$
(5)

\(\hat{l}_{\alpha }({\boldsymbol{\theta }})\) is therefore an upper bound for \(l^*_{\alpha }({\boldsymbol{\theta }})\), independently of the value of \({\boldsymbol{\theta }}\). We propose to relax the minimization problem (2) into

$$\begin{aligned} \min _{{\boldsymbol{\theta }}} \frac{1}{{\beta }} \log \left( \sum _{i=1}^n \exp \left( {\beta }{{\,\mathrm{\mathcal {L}}\,}}\left( f({\boldsymbol{x}}_i;{\boldsymbol{\theta }}), {\boldsymbol{y}}_i\right) \right) \right) \end{aligned}$$
(6)

where \({\beta }>0\) is a hyperparameter, and where the term \(\frac{1}{{\beta }} \log \left( \frac{1}{\alpha n}\right) \) was dropped as it is independent of \({\boldsymbol{\theta }}\). While \(\alpha \) no longer appears directly in the optimization problem (6), \({\beta }\) essentially acts as a substitute for \(\alpha \). The higher the value of \({\beta }\), the more weight the per-volume losses with a high value carry in (6).
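For illustration, the relaxed objective (6) can be evaluated with a numerically stable log-sum-exp over the per-volume losses, as in the sketch below (PyTorch, a toy example rather than the released implementation). As \({\beta }\) grows, the objective is increasingly dominated by the hardest volume.

```python
import torch

def relaxed_percentile_loss(per_volume_losses, beta=100.0):
    """Objective (6): (1/beta) * log sum_i exp(beta * L_i).
    torch.logsumexp keeps the computation stable for large beta."""
    losses = torch.as_tensor(per_volume_losses, dtype=torch.float64)
    return torch.logsumexp(beta * losses, dim=0) / beta

losses = torch.tensor([0.10, 0.15, 0.60])  # one hard example among three volumes
for beta in (1.0, 10.0, 100.0):
    print(beta, relaxed_percentile_loss(losses, beta).item())
# As beta grows, the objective approaches the worst-case loss (0.60) from above,
# whereas the plain average over the three volumes is 0.283.
```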

We give a proof in the supplementary material that (6) is equivalent to solving the distributionally robust optimization problem

$$\begin{aligned} \min _{{\boldsymbol{\theta }}}\, \max _{{\boldsymbol{q}}\in \varDelta _n} \left( \sum _{i=1}^n q_i {{\,\mathrm{\mathcal {L}}\,}}\left( f({\boldsymbol{x}}_i; {\boldsymbol{\theta }}), {\boldsymbol{y}}_i\right) - \frac{1}{{\beta }} D_{KL}\left( {\boldsymbol{q}}\, \biggr \Vert \, \frac{1}{n}\mathbf {1}\right) \right) \end{aligned}$$
(7)

where a new unknown probability vector parameter \({\boldsymbol{q}}\) is introduced, \(\frac{1}{n}\mathbf {1}\) denotes the uniform probability vector \(\left( \frac{1}{n}, \ldots , \frac{1}{n}\right) \), \(D_{KL}\) is the Kullback-Leibler divergence, \(\varDelta _n\) is the unit n-simplex, and \({\beta }> 0\) is a hyperparameter. \(D_{KL}\) measures the dissimilarity between \({\boldsymbol{q}}\) and the uniform probability vector \(\frac{1}{n}\mathbf {1}\), which corresponds to assigning the same weight \(\frac{1}{n}\) to each sample. Therefore, \({\beta }\) controls how much the samples with a relatively high loss value (hard examples) are weighted.

Recently, hardness weighted sampling [10] was introduced as a principled hard example mining method to solve (7). Here, we have shown that it can be used to minimize the proposed relaxation (6) of the percentile loss problem.
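The following NumPy sketch gives a small numerical check of this equivalence (the formal proof remains the one given in the supplementary material): the inner maximization in (7) admits the closed-form solution \(q_i \propto \exp \left( {\beta }{{\,\mathrm{\mathcal {L}}\,}}_i\right) \), and substituting it back recovers the log-sum-exp objective (6) up to a constant that does not depend on \({\boldsymbol{\theta }}\).

```python
import numpy as np

def dro_objective(losses, q, beta):
    """Inner objective of (7): weighted loss minus (1/beta) * KL(q || uniform)."""
    n = len(losses)
    kl = np.sum(q * np.log(q * n + 1e-12))
    return np.dot(q, losses) - kl / beta

beta = 10.0
losses = np.array([0.10, 0.15, 0.60])
n = len(losses)

# Closed-form maximizer of the inner problem: q_i proportional to exp(beta * L_i).
q_star = np.exp(beta * losses)
q_star /= q_star.sum()

# The inner maximum equals the relaxed objective (6) up to the constant -(1/beta)*log(n).
inner_max = dro_objective(losses, q_star, beta)
logsumexp = np.log(np.sum(np.exp(beta * losses))) / beta
print(inner_max, logsumexp - np.log(n) / beta)  # the two values coincide

# Any other q in the simplex gives a smaller value, e.g. the uniform weights.
q_uniform = np.full(n, 1.0 / n)
print(dro_objective(losses, q_uniform, beta) <= inner_max)  # True
```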

3 Anatomically Abnormal Fetal Brain T2w MRI Dataset

Table 1. Training and testing dataset details. Other Abn: other brain structural abnormalities. There is no overlap of subjects between training and testing.

In this section, we give details about the fetal brain 3D MRI data, the labelling protocol, and the pre-processing used in our experiments.

Public Fetal Brain Datasets. We used the 18 control fetal brain 3D MRI volumes of the spatio-temporal fetal brain atlas [12] for gestational ages ranging from 21 weeks to 38 weeks. We also used 80 volumes from the publicly available FeTA MICCAI challenge dataset [18]. For the 40 MIAL 3D MRIs, corrections of the segmentations were performed by authors MA, LF, and PD to reduce the variability with respect to the segmentation guidelines released with the FeTA dataset [18]. Those corrections were performed as part of our previous work [8] and are publicly available. Brain masks for the FeTA data were obtained via affine registration using two fetal brain atlases [11, 12].

Image Acquisition and Preprocessing for the Private Dataset. All images in the private dataset were part of routine clinical care and were acquired at UHL and MUV due to congenital malformations seen on ultrasound.

In total, 93 cases with open spina bifida, 35 cases with other central nervous system pathologies, and 142 cases with other malformations but a normal brain, referred to as controls, were included. The gestational age at MRI ranged from 19 weeks to 40 weeks. We have started to make fetal brain T2w 3D MRIs publicly available. For each study, at least three orthogonal T2-weighted HASTE series of the fetal brain were collected on a 1.5T scanner using an echo time of 133 ms, a repetition time of 1000 ms, no slice overlap or gap, a pixel size of 0.39 mm to 1.48 mm, and a slice thickness of 2.50 mm to 4.40 mm. A radiologist attended all the acquisitions for quality control.

The reconstructed fetal brain 3D MRIs were obtained using NiftyMIC [6], a state-of-the-art super-resolution reconstruction algorithm. The volumes were all reconstructed to a resolution of 0.8 mm isotropic and registered to a fetal brain atlas [12]. Our pre-processing improves the resolution and removes motion between neighboring slices as well as motion artefacts present in the original 2D slices [6]. We used volumetric brain masks to mask the tissues outside the fetal brain. Those brain masks were obtained using the automatic segmentation method described in [6, 20].

Labelling Protocol. The labelling protocol used for white matter, ventricles and cerebellum is the same as in [18]. The three tissue types were segmented for our private dataset by a trained obstetrician and medical students under the supervision of a paediatric radiologist specialized in fetal brain anatomy, who quality controlled and corrected all manual segmentations.

Separation of the Data into Training and Testing. A summary of the number of fetal brain 3D MRIs used at training and testing for each central nervous system condition can be found in Table 1. The training dataset contains a total of 177 cases with a majority of 139 controls and only 38 abnormal cases, which is typical of clinical datasets. Five controls from the FeTA dataset were added to the training dataset because we found in preliminary experiments that nnU-Net [14] fails on most of the FeTA data at testing when it is trained using only data from UHL and MUV and the fetal brain atlas [12]. The testing dataset contains 193 volumes with a majority of abnormal cases, which is necessary to cover the anatomical variability of abnormal cases in our evaluation.

Fig. 2.

Qualitative results. a) Fetus with aqueductal stenosis (34 weeks). b) Fetus with open spina bifida (27 weeks). For these two cases, nnU-Net [14] completely misses the cerebellum and achieves poor segmentation of the white matter and the ventricles. Our nnU-Net-DRO achieves satisfactory segmentation of the cerebellum for both cases, and of all tissue types for the aqueductal stenosis case.

Table 2. Evaluation of distributional robustness with respect to the pathology (193 3D MRIs) for the white matter, the ventricles, and the cerebellum. \(\mathbf{p} _{X}\): \(X^{\text {th}}\) percentile of the Dice score distribution in percentage. Best values are in bold.

4 Experiments

Common Deep Learning Pipeline. We used nnU-Net [14], a generic deep learning pipeline for medical image segmentation that has been shown to outperform other deep learning pipelines on 23 public datasets without the need to tune the loss function or the deep neural network architecture. Specifically, we used nnU-Net version 2 in 3D-full-resolution mode, which is the recommended mode for isotropic 3D MRI data. nnU-Net automatically splits the training data into 5 folds (\(80\%\) training / \(20\%\) validation) that are used to train 5 networks for each method. The predicted class probability maps of the 5 models are averaged at inference to improve robustness [14]. We used NVIDIA Tesla V100 GPUs with 16 GB of memory. Training each network took from 4 to 6 days.
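As an illustration of this ensembling step (a minimal sketch with random toy probability maps, not the nnU-Net implementation), the five predicted class probability maps are averaged voxel-wise before taking the argmax:

```python
import torch

# Hypothetical predicted class probability maps from the 5 per-fold models:
# shape (num_models, num_classes, depth, height, width).
prob_maps = torch.softmax(torch.randn(5, 4, 16, 16, 16), dim=1)

mean_probs = prob_maps.mean(dim=0)       # voxel-wise average over the 5 models
segmentation = mean_probs.argmax(dim=0)  # final label map
print(segmentation.shape)                # torch.Size([16, 16, 16])
```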

Specificities of Each Method. The baseline is nnU-Net [14] without any modification. Our method, nnU-Net-DRO, also uses nnU-Net. The only difference is that we changed the sampling strategy to use the hardness weighted sampler for DRO [10]. We used the default hyper-parameter values of the hardness weighted sampler, i.e. \(\beta =100\) with importance sampling and clipping values \(w_{min}=0.1\) and \(w_{max}=10\), as described in [10]. No other values were tested. Our implementation of the nnU-Net-DRO training procedure is publicly available at https://github.com/LucasFidon/HardnessWeightedSampler. It provides an implementation of the hardness weighted sampler described in [10].
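The sketch below outlines the idea of the hardness weighted sampler with the hyper-parameter values reported above; it is a simplified illustration under our own assumptions (in particular the exact form of the importance weights and of the loss updates), not the released implementation linked above.

```python
import numpy as np

class HardnessWeightedSampler:
    """Simplified sketch: samples training volumes with probability proportional
    to exp(beta * L_i), where L_i is the latest recorded per-volume loss, and
    clips the importance-sampling weights to [w_min, w_max]."""

    def __init__(self, num_samples, beta=100.0, w_min=0.1, w_max=10.0, init_loss=1.0):
        self.beta = beta
        self.w_min = w_min
        self.w_max = w_max
        self.losses = np.full(num_samples, init_loss, dtype=np.float64)

    def probabilities(self):
        logits = self.beta * self.losses
        logits -= logits.max()  # numerical stability
        p = np.exp(logits)
        return p / p.sum()

    def sample(self, batch_size, rng):
        p = self.probabilities()
        idx = rng.choice(len(self.losses), size=batch_size, replace=False, p=p)
        # Importance-sampling weights correct for the non-uniform sampling and
        # are clipped to [w_min, w_max] to limit the variance of the gradients.
        weights = np.clip(1.0 / (len(self.losses) * p[idx]), self.w_min, self.w_max)
        return idx, weights

    def update(self, idx, new_losses):
        # Refresh the stored per-volume losses with the latest values.
        self.losses[idx] = new_losses

rng = np.random.default_rng(0)
sampler = HardnessWeightedSampler(num_samples=20)
idx, w = sampler.sample(batch_size=4, rng=rng)
sampler.update(idx, new_losses=rng.uniform(0.1, 0.9, size=4))
```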

Evaluation Method. We evaluate the quality of the automatic fetal brain MRI segmentations using the Dice score [5, 9]. We are particularly interested in measuring the statistical risk of the results as a way to evaluate the robustness of the different methods. To this end, in addition to the mean and standard deviation, we also report the percentiles of the Dice score at \(50\%\), \(25\%\), \(10\%\), and \(5\%\). In Table 2, we report those quantities for the Dice scores of the three tissue types: white matter, ventricular system, and cerebellum.
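A minimal NumPy sketch of this evaluation protocol (illustrative only, with random toy masks standing in for one tissue type) is given below: a per-volume Dice score is computed for each binary mask, and the scores are summarized by the statistics reported in Table 2.

```python
import numpy as np

def dice_score(pred, gt):
    """Dice score between two binary segmentation masks."""
    pred, gt = pred.astype(bool), gt.astype(bool)
    intersection = np.logical_and(pred, gt).sum()
    denom = pred.sum() + gt.sum()
    return 2.0 * intersection / denom if denom > 0 else 1.0

def summarize(dice_scores):
    scores = np.asarray(dice_scores) * 100.0  # in percent, as in Table 2
    return {
        "mean": scores.mean(),
        "std": scores.std(),
        "p50": np.percentile(scores, 50),
        "p25": np.percentile(scores, 25),
        "p10": np.percentile(scores, 10),
        "p5": np.percentile(scores, 5),
    }

rng = np.random.default_rng(0)
scores = [dice_score(rng.integers(0, 2, (8, 8, 8)), rng.integers(0, 2, (8, 8, 8)))
          for _ in range(50)]
print(summarize(scores))
```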

For each method, nnU-Net is trained 5 times using different train/validation splits and different random initializations. The same 5 splits, computed randomly, are used for both methods. The results in Table 2 are for the ensemble of the 5 3D U-Nets. Ensembling is known to increase the robustness of deep learning methods for segmentation [14]. It also makes the evaluation less sensitive to the random initialization and to the stochastic optimization.

Evaluation of nnU-Net and nnU-Net-DRO. Quantitative evaluation of nnU-Net and nnU-Net-DRO for the three different central nervous system conditions control, spina bifida, and other abnormalities can be found in Table 2.

For spina bifida and other brain abnormalities, the proposed nnU-Net-DRO achieves the same or higher mean Dice scores and lower standard deviations than nnU-Net [14] for the three tissue types. For controls, the mean Dice scores and standard deviations of nnU-Net-DRO and nnU-Net differ by less than 0.1 percentage points (pp) for the three tissue types.

The comparison of the percentiles of the Dice score allows us to compare methods at the tail of the Dice score distribution, where segmentation methods reach their worst-case performance. For spina bifida, nnU-Net-DRO achieves higher percentile values than nnU-Net for the white matter (\(+0.6\)pp for \(\mathbf{p} _{10}\)), for the ventricular system (\(+1.0\)pp for \(\mathbf{p} _{5}\)), and for the cerebellum (\(+26.5\)pp for \(\mathbf{p} _{10}\)). For other brain abnormalities, nnU-Net-DRO achieves higher percentile values than nnU-Net for the white matter (\(+1.9\)pp for \(\mathbf{p} _{5}\)), for the ventricular system (\(+1.5\)pp for \(\mathbf{p} _{5}\) and \(+2.7\)pp for \(\mathbf{p} _{10}\)), and for the cerebellum (\(+1.3\)pp for \(\mathbf{p} _{5}\)). All the other percentile values differ by less than 0.5pp of Dice score between the two methods. This suggests that nnU-Net-DRO achieves better worst-case performance than nnU-Net for abnormal cases.

It is worth noting that the Dice scores for the white matter and the cerebellum decrease between controls and the spina bifida and other abnormal cases. This was expected due to the higher anatomical variability in pathological cases. However, the Dice scores for the ventricular system tend to be higher for abnormal cases than for controls. This can be attributed to the large proportion of pathological cases with enlarged ventricles, because Dice score values tend to be higher for larger regions of interest.

As can be seen in the qualitative results of Fig. 2, there are cases for which nnU-Net predicts an empty cerebellum segmentation while nnU-Net-DRO achieves satisfactory cerebellum segmentation. There were no cases for which the converse was true. Robust segmentation of the cerebellum for spina bifida is particularly relevant for the evaluation of fetal brain surgery for open spina bifida [1, 4, 21]. Additional qualitative results in the supplementary material illustrate 5 other cases for which nnU-Net-DRO outperforms nnU-Net.

5 Conclusion

The high anatomical variability of the developing fetal brain across gestational ages and pathologies hampers the robustness of deep neural networks trained by maximizing the average per-volume performance. Specifically, it limits the generalization of deep neural networks to abnormal cases for which few cases are available during training. In this paper, we propose to mitigate this problem by training deep neural networks to minimize a percentile of the per-volume performance rather than the average. To make this possible in practice, we propose to train deep neural networks with Distributionally Robust Optimization (DRO) and we show that the DRO objective is a relaxation of the per-volume loss percentile. We have validated the proposed training method on a multi-centric dataset of 368 fetal brain T2w 3D MRIs with various diagnoses. nnU-Net trained with DRO achieved improved segmentation results for pathological cases as compared to the unmodified nnU-Net, while achieving similar segmentation performance for the neurotypical cases. Our results suggest that nnU-Net trained with DRO is more robust to anatomical variability than the original nnU-Net.