1 Introduction

The brain midline can be viewed as a line on axial and coronal projections of diverse imaging modalities (Fig. 1, left). As the human brain is approximately symmetrical, the midline is straight in healthy subjects. However, various pathological conditions, such as traumatic brain injuries (TBI), stroke and brain tumors, may break this symmetry and lead to midline shift (MLS) [8].

Fig. 1. Left: an axial slice from an MRI image with the corresponding midline (red) and a hypothetical normal midline (blue, dashed). Center: the midline shift. Right: a dubious case with an ill-defined midline (red, dashed). (Color figure online)

A number of studies show that MLS has prognostic value for outcome prediction in various brain pathologies: level of consciousness in patients with acute intracranial hematoma [16], median survival in patients with glioblastoma multiforme [3], the outcome in patients with TBI [5]. Overall, early identification of patients with severe midline shift would assist patient management [14].

However, definitions of significant MLS vary across studies. While the 5 millimeters (mm) threshold is frequently used, other approaches are common: for example, significant MLS was identified as larger than 9 mm in [14], while the 5 mm threshold was not explicitly justified in [5]. Such diversity is partly explained by the absence of a robust, objective methodology for MLS estimation. A recent study [13] suggests that inter-rater variability of MLS estimation is rather high (intraclass correlation coefficients 0.72–0.89).

The importance of MLS estimation and the need for its automation were recently highlighted by The American College of Radiology Data Science Institute [10], and some promising results have already been achieved in this area (Sect. 3). In this paper we propose a novel deep learning-based approach for the MLS detection task. We show that combining a standard segmentation approach with task-specific structural knowledge yields results that are more accurate than those of straightforward CNNs for regression, and also interpretable, since the key part of the method is the midline localization. Moreover, we show that our method generalizes well to highly heterogeneous data and provides a natural way of estimating its confidence.

2 Problem

We define the midline on an axial slice as a vertical curve that separates the brain hemispheres (Fig. 1, left). The midline shift for an axial slice is then defined as the maximal distance between the midline (which might be deformed) and a hypothetical normal midline (Fig. 1, center). Finally, the midline shift for a whole brain is the maximal midline shift across all axial slices where the midline is present. The task is to determine, for a given brain image, the midline shift as well as the corresponding axial slice on which it is manifested.
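Stated formally (our notation, anticipating Sect. 4: \(I\) is the set of y-coordinates on which the midline is defined, \(\mathbf {midline}_y\) is the midline’s x-coordinate at y, and \(\mathbf {normal}_y\) is the hypothetical normal midline):

$$\begin{aligned} \text {MLS}_{\text {slice}} = \max _{y \in I} \left| \mathbf {midline}_y - \mathbf {normal}_y\right| , \qquad \text {MLS} = \max _{\text {slices}} \text {MLS}_{\text {slice}}. \end{aligned}$$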

It is worth noting that in some complicated cases even professional radiologists cannot confidently localize the midline (Fig. 1, right). Taking such dubious cases into account, it is also desirable for an MLS detection method to have a means of estimating its own confidence.

3 Related Work

Most of the methods for automatic MLS estimation are computer vision (CV) based and rely on keypoint detection. The proposed approaches often have a lot of “moving parts”, which makes them hard to implement and fine-tune. For example, in [9] the authors use a four-step pipeline (edge detection, morphological filtering, line detection, rule-based filtering) just to detect the cerebral falx. Another drawback of keypoint-based methods is that they require various important regions to be present on the image; e.g., many methods can be applied only to slices that contain ventricles [1], which makes them inapplicable to cases where the midline shift is manifested on lower or higher slices.

There are also a few papers that propose deep learning methods. In [2] the authors trained an adapted version of ResNet to classify whether there is a significant midline shift on a given slice. Another interesting approach that combines deep learning with classical CV is described in [6]: the authors use a U-Net [15] architecture for brain extraction and for segmentation of cisterns and acute intracranial lesions, while MLS detection is based on keypoints.

4 Method

A straightforward deep learning approach is to directly predict the MLS via a convolutional neural network. Following the authors of [2], we tested a ResNet-based [4] network which predicted the MLS for each axial slice of a given image. The final prediction was obtained as the maximal MLS among only the slices that contained an annotated midline. However, even in such a simplified design (the model did not need to filter out the slices for which the MLS was undefined), this method yields poor results, as we show in Sect. 7.

Our intuition behind this is that the midline shift is a very high-level concept: the network needs to learn to detect several keypoints located very far from each other (Fig. 1), as well as take into account their relative positions. The latter is a particularly difficult task for convolutional neural networks due to their invariance to translation.

Fig. 2. The binary masks of the regions where the midline is defined (red). Note the rightmost image, for which the midline is undefined everywhere. (Color figure online)

On the contrary, the midline has visual features, like continuity and local symmetry, that are distinguishable on a smaller scale. This brings us to the idea of reducing the task of MLS prediction to the task of midline estimation: for a given slice we localize the midline while exploiting the structural knowledge about the target, then we derive the MLS from the predicted curve based on the definition given in Sect. 2. The normal midline is estimated as the straight line connecting the endpoints of the predicted curve.

The key structural facts are: (1) for each coordinate y there is at most one x-coordinate, which we refer to as \(\mathbf{midline} _y\), such that the pixel \((\mathbf{midline} _y, y)\) is situated on the midline; (2) \(\mathbf{midline} _y\) exists only for y-coordinates within a certain interval I on the Oy axis, whose binary mask we refer to as limits (Fig. 2).

These facts imply that our method must be capable of solving the regression problem of midline estimation and the classification problem of limits prediction. To solve these tasks, we propose a two-headed convolutional neural network with shared input layers (Fig. 3). As the loss function, we optimize a weighted combination of standard losses for regression and classification:

$$\begin{aligned} L = \lambda _1 \cdot \frac{1}{|I|} \sum _{y \in I} (\mathbf {midline}_y - \mathbf {midline}_y^{\text {pred}})^2 + \lambda _2 \cdot \text {BCE}(\mathbf {limits}, \mathbf {limits}^{\text {pred}}), \end{aligned}$$

where \(\mathbf {midline}_y^{\text {pred}}\) and \(\mathbf {limits}^{\text {pred}}\) are the network’s predictions, BCE is binary cross-entropy.
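A minimal PyTorch-style sketch of this combined loss (our own illustration, not the authors’ code; tensor shapes and names such as midline_pred and limits_pred are assumptions):

```python
import torch
import torch.nn.functional as F

def combined_loss(midline_pred, midline_gt, limits_pred, limits_gt,
                  lambda_1=1.0, lambda_2=1.0):
    """Weighted sum of a masked MSE (midline regression) and BCE (limits classification).

    midline_pred, midline_gt: (batch, H) predicted/true x-coordinate for every y
    limits_pred, limits_gt:   (batch, H) predicted probability / binary mask of the
                              y-coordinates where the midline is defined
    """
    mask = limits_gt.bool()
    # the regression term is averaged only over y-coordinates inside the interval I
    mse = F.mse_loss(midline_pred[mask], midline_gt[mask])
    bce = F.binary_cross_entropy(limits_pred, limits_gt.float())
    return lambda_1 * mse + lambda_2 * bce
```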

4.1 Midline Estimation

In order to estimate the midline we adapt a segmentation approach. In a standard setting (with sigmoid activation and binary cross-entropy loss) the output can be interpreted as the “independent” probability of a particular pixel being situated on the midline. In this case the midline is obtained by applying argmax along the Ox axis.

However, as we show in Sect. 7, significantly better results can be achieved by imposing the following constraint on the output probability map:

$$\begin{aligned} \sum _x \mathbf {output}_{xy}^{\text {midline}} = 1, \end{aligned}$$
(1)

which follows from the structural fact (1). Next, taking into account that for any given y-coordinate the head’s output represents a probability distribution, we propose to predict the midline as its expected value:

$$\begin{aligned} \mathbf {midline}_y^{\text {pred}} = \sum _x x \cdot \mathbf {output}_{xy}^{\text {midline}}. \end{aligned}$$
Fig. 3. Schematic representation of the proposed architecture.

The overall architecture for midline estimation is shown in Fig. 3 (top). For our experiments we chose a U-Net-based [15] architecture as the de facto standard for medical image segmentation. We replaced plain convolutional layers with residual blocks [4], which are considered to improve performance, as suggested in [11]. Also, during feature map concatenation we use linear interpolation to make the output’s shape equal to the input’s shape. Finally, we apply a softmax nonlinearity to the network’s output along the Ox axis (instead of a sigmoid), which ensures that the constraint from (1) is respected. Note that because the head’s output represents a probability distribution, at inference time we can calculate various statistics based on this distribution, e.g. percentiles, which are needed to estimate confidence intervals. This is a very important aspect of our approach, which gives us a natural means of estimating the model’s uncertainty.
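As an illustration, a NumPy sketch of this inference step (expected value and percentile-based confidence interval derived from the per-row distributions; the array names and the percentile choice are ours, not taken from the paper):

```python
import numpy as np

def midline_from_logits(logits, lower=2.5, upper=97.5):
    """Derive the midline and a confidence interval from the midline head's output.

    logits: (H, W) raw network output for one axial slice
            (rows index y-coordinates, columns index x-coordinates).
    """
    # softmax along the Ox axis -> one probability distribution per y-coordinate
    probs = np.exp(logits - logits.max(axis=1, keepdims=True))
    probs /= probs.sum(axis=1, keepdims=True)

    xs = np.arange(logits.shape[1])
    midline = (probs * xs).sum(axis=1)  # expected x-coordinate per row

    # approximate percentiles of the same per-row distributions via their CDF
    cdf = probs.cumsum(axis=1)
    low = (cdf < lower / 100).sum(axis=1)
    high = (cdf < upper / 100).sum(axis=1)
    return midline, low, high
```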

4.2 Limits Prediction

Since the proposed midline estimation approach yields \(\mathbf {midline}_y^{\text {pred}}\) for all y-coordinates, we need to filter out the predicted values for the regions where the midline is not defined (Fig. 2, hatched). The corresponding limits are obtained by thresholding the second head’s output (\(\mathbf {limits}^{\text {pred}}\)) and taking the convex hull.

The architecture of the second head is shown in Fig. 3 (bottom). It has the same input layers as the midline estimation network, which are followed by two residual blocks [4]. Next, global max pooling is applied along the Ox axis in order to reduce the dimensionality of the 2D feature maps to 1D. Finally, we apply two 1D convolutions followed by the sigmoid activation function.
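To make the post-processing concrete, here is a small sketch (our own; the helper name and the hard 0.5 threshold are assumptions) of how the limits, their convex hull on the Oy axis, and the final per-slice MLS could be derived:

```python
import numpy as np

def slice_mls(midline_pred, limits_prob, threshold=0.5, spacing=0.5):
    """Compute the midline shift (in mm) for one axial slice.

    midline_pred: (H,) expected x-coordinate of the midline per y-coordinate
    limits_prob:  (H,) sigmoid output of the limits head
    spacing:      pixel spacing in mm along the Ox axis
    """
    mask = limits_prob >= threshold
    if not mask.any():
        return 0.0  # midline not present on this slice
    # convex hull of a 1D mask = the interval between its first and last positive entries
    ys = np.where(mask)[0]
    y_min, y_max = ys[0], ys[-1]
    y = np.arange(y_min, y_max + 1)

    # hypothetical normal midline: straight line between the predicted curve's endpoints
    normal = np.interp(y, [y_min, y_max], [midline_pred[y_min], midline_pred[y_max]])

    return float(np.abs(midline_pred[y] - normal).max() * spacing)
```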

5 Experimental Setup

At train time, in all of our experiments, we used the Adam optimizer [7] with default parameters (\(\beta _1 = 0.9, \beta _2 = 0.999\)) and a learning rate of \(10^{-3}\), which showed the best results on the validation set. We used equal weights (\(\lambda _1 = \lambda _2 = 1\)) in the final loss, as we did not notice any loss imbalance at train time.

Also, we applied simple preprocessing in order to reduce the data variability: resampling the axial slices to a \(0.5 \times 0.5\) mm pixel spacing, background removal by Otsu thresholding [12], and intensity normalization to zero mean and unit variance. Additionally, at train time we used random flips along the Ox axis as a cheap data augmentation technique.
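A minimal sketch of this preprocessing (the SciPy/scikit-image tooling and the choice to normalize over the foreground are our assumptions, not specified in the paper):

```python
import numpy as np
from scipy.ndimage import zoom
from skimage.filters import threshold_otsu

def preprocess_slice(image, pixel_spacing, target_spacing=0.5):
    """Resample to 0.5 x 0.5 mm spacing, remove background, normalize intensities."""
    # resample the axial slice to the target pixel spacing
    factors = (pixel_spacing[0] / target_spacing, pixel_spacing[1] / target_spacing)
    image = zoom(image, factors, order=1)

    # background removal via Otsu thresholding
    foreground = image > threshold_otsu(image)
    image = image * foreground

    # intensity normalization to zero mean and unit variance
    mean, std = image[foreground].mean(), image[foreground].std()
    return (image - mean) / (std + 1e-8)
```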

The training was performed on batches of size 40 (which was simply determined by the amount of available GPU memory), until the validation scores reached a plateau, which happened at approx. 32000 batches. For this reason we used 32000 iterations for all our experiments.

6 Data

In our experiments we used data from two sources.

The first dataset (DS1) consists of 352 MRI series that come from a neurosurgery hospital and belong to patients with severe brain damage caused by tumors: 64% of the images have a significant midline shift (\(\ge \)5 mm), the mean MLS is 7.8 ± 5.0 mm. The dataset was labeled by an experienced neuroradiologist (exp1) and three specialists with limited background in neuroradiology (exp2-4). Their inter- and intra-expert variability is shown in Table 2. We split this dataset using 5-fold cross-validation. For each fold, we additionally leave 8 images out of the training set to form a validation set.

The second dataset (DS2) comes from an out-patient clinic and represents a homogeneous sample of 203 MRI series acquired in routine clinical practice. For this dataset only the MLS is available but not the midline itself; only 8% of images have a large MLS (\(\ge \)5 mm), the mean MLS is 2.9 ± 1.5 mm. We use this dataset only for final models’ quality assessment in a prospective fashion.

The series from both sources contain only axial slices but have various voxel spacings, ranging from \(0.2\times 0.2\times 1\) mm to \(1\times 1\times 5\) mm, and various modalities: T1 (25%), T2 (68%) and FLAIR (7%). The images were collected using scanners from GE/Siemens and Toshiba/Siemens for DS1 and DS2, respectively.

7 Results

7.1 Midline Shift Detection

We compare the proposed method with direct MLS regression via ResNet [4] on two tasks: (1) MLS prediction; (2) significant MLS (\(\ge \)5 mm) detection. In order to evaluate the quality of both methods we use the mean absolute error (MAE) for task 1 and the area under the ROC curve (ROC AUC) for task 2. The ROC curve was obtained by thresholding the predicted MLS at different values (from 0 to the maximal MLS magnitude). The results are presented in Table 1.
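In other words, the predicted MLS itself serves as the detection score. A short scikit-learn sketch of this evaluation (our own illustration; variable names are assumptions):

```python
import numpy as np
from sklearn.metrics import mean_absolute_error, roc_auc_score

def evaluate(mls_true, mls_pred, threshold=5.0):
    """Task 1: MAE of the predicted MLS; task 2: ROC AUC for significant-MLS
    detection, using the predicted MLS as a continuous detection score."""
    mae = mean_absolute_error(mls_true, mls_pred)
    auc = roc_auc_score(np.asarray(mls_true) >= threshold, mls_pred)
    return mae, auc
```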

Table 1. Midline shift detection scores for various models (± std) calculated on 5-fold cross-validation.

7.2 Midline Estimation

In order to assess the midline estimation performance we use root-mean-square error (RMSE) as well as maximal error (MAX):

$$\begin{aligned} \text {RMSE}(\mathbf {midline}_y, \mathbf {midline}_y^{\text {pred}}) = \sqrt{|I|^{-1}\sum \nolimits _{y \in I} (\mathbf {midline}_y - \mathbf {midline}_y^{\text {pred}})^2}, \end{aligned}$$
$$\begin{aligned} \text {MAX}(\mathbf {midline}_y, \mathbf {midline}_y^{\text {pred}}) = \max \limits _{y \in I} |\mathbf {midline}_y - \mathbf {midline}_y^{\text {pred}}|. \end{aligned}$$

These metrics, averaged over individual axial slices (MAXs, RMSEs) as well as over entire brain images (MAX, RMSE), are shown in Table 2.

Table 2. Top: midline estimation metrics (± std) calculated on 5-fold cross-validation for DS1. Bottom: neuroradiologist (exp1) variability on DS1.

We compare our method with the naïve segmentation approach mentioned in Sect. 4.1. Note that plain segmentation performs significantly worse in terms of maximal error, which is the more important characteristic for MLS detection.

8 Discussion

Fig. 4. Ground-truth (red) and predicted (yellow, dashed) midlines with their 95% confidence intervals for 2 random samples (left) and 2 typical examples from the set of cases with the largest errors (right). (Color figure online)

Figure 4 (right) shows several examples on which our method performs poorly. Our analysis of such examples suggests that the main source of errors is really complicated cases with which even professional radiologists have doubts, e.g. images on which the tumor is located directly in the middle of the brain, or incorrect cases with an extracerebral tumor located in the medial longitudinal fissure (e.g. falx meningioma). Note how in the areas of greatest error the model’s uncertainty is much higher.

Our preliminary experiments with CT images show that the proposed method can be easily adapted to work with CT; however, a larger dataset is required to support this claim, which might be the subject of future work.