A Multi-scale Multiple Sclerosis Lesion Change Detection in a Multi-sequence MRI

Cheng, Myra; Galimzianova, Alfiia; Lesjak, Žiga; Špiclin, Žiga; Lock, Christopher B.; Rubin, Daniel L.

doi:10.1007/978-3-030-00889-5_40

Myra Cheng³⁶,
Alfiia Galimzianova³⁶,
Žiga Lesjak³⁷,
Žiga Špiclin³⁷,
Christopher B. Lock³⁶ &
…
Daniel L. Rubin³⁶

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11045))

Included in the following conference series:

7761 Accesses
2 Citations

Abstract

Multiple sclerosis (MS) is a disease characterized by demyelinating lesions in the brain and spinal cord. Quantification of the amount of change in MS lesions in magnetic resonance imaging (MRI) over time is important for evaluation of drug effectiveness in clinical trials. Manual analysis of such longitudinal datasets is time- and cost prohibitive, and also prone to intra- and inter-rater variability. Accurate automated change detection methods would be highly desirable. We propose a new MS lesion change detection method that integrates a voxel’s multi-sequence MR intensity with its immediate neighborhood context and the texture of the extended neighborhood in a machine learning framework. On our dataset of 15 patients, the proposed method had higher performance (median AUC-ROC = 0.97, AUC-PR = 0.43, Wilcoxon’s signed rank test, p < 0.001) than implemented baseline methods. As such, the proposed method has potential clinical applications as an efficient, low-cost algorithm to capture and quantify local lesion change and growth.

You have full access to this open access chapter, Download conference paper PDF

Microstructural characterization of multiple sclerosis lesion phenotypes using multiparametric longitudinal analysis

Article Open access 13 July 2024

Unsupervised Framework for Consistent Longitudinal MS Lesion Segmentation

A Generative Model for Automatic Detection of Resolving Multiple Sclerosis Lesions

Keywords

1 Introduction

Multiple sclerosis (MS) is a disease of the central nervous system (CNS) that affects over 400,000 people in the U.S. and 2.5 million people worldwide. It is one of the leading causes of non-traumatic disability among young and middle-aged adults [1]. Currently, MS has no cure, although there is an ongoing research in search for improved treatment and management of the disease. The success of such research depends on clinical trials, in which the response to treatment and change in disease status must be quantified in an accurate and consistent manner.

Multi-sequence magnetic resonance imaging (MRI) is the standard imaging exam performed to analyze the white-matter lesions for diagnosis and follow-up evaluation of MS. Quantitative evaluation of the changes in the MS lesions appearance requires annotation of the corresponding areas in the brain, which when done manually is time-consuming and subjective. To address these challenges, reliable automated methods are needed.

The strategies for automated change detection can be categorized as longitudinal volumetric analysis, deformable image registration, and longitudinal analysis of MR intensity [5]. Longitudinal volumetric analysis relies on segmentation of MS lesions at each imaging timepoint independently and can only provide global measures of the lesion change, such as the count of new lesions and the total lesion volume difference. Deformable image registration relies on deformation fields obtained during non-rigid alignment of the MR images at two timepoints and can quantify local changes through analysis of enlarging and shrinking lesions, while detection of new or disappearing lesions is limited. The longitudinal analysis can address the issues of the previous two strategies through rigid of affine registration being followed by intensity analysis at matching anatomical sites, thus allowing for local quantification of all types of lesion change [4]. The computational core of such analysis is detection of lesion change for each voxel of the brain MR image set from two imaging timepoints. Although single-timepoint MS lesion segmentation approaches have incorporated multiple scales of spatial information for context [6], longitudinal MS lesion analysis has been limited to change detection using voxel intensities independently [4]. Texture descriptors specifically can aid in a compact representation of a local context of the multi-sequence MR images and, moreover, have been successfully applied to MS lesion segmentation [11].

In this work, we propose a change detection method that incorporates the local context of the multi-sequence MR images at three scales. The multi-scale descriptors are extracted from the intensity information of the voxels immediate neighborhood, and the intensity and texture information of a larger surrounding image patch. Our experiments demonstrated that incorporating the contextual information to change detection improved the performance at each scale and the proposed method statistically significantly outperformed the baseline state-of-the-art approach [8].

2 Materials and Methods

2.1 Dataset

We used anonymized imaging data from 15 MS patients with two imaging exams, each with three MRI sequences: T1-weighted (T1), T2-weighted (T2), and fluid-attenuated inversion recovery (FLAIR), acquired at 1.5T. Pre-processing included resampling to the common spatial resolution of $1\times 1\times 3$ mm$^3$, inhomogeneity correction on all sequences using N4 bias correction [10], registration of all sequences at both timepoints to a common space [2], and intracranial volume extraction from the T1 sequences using BET 2 [7]. The reference lesion change labels were acquired as a consensus segmentation of the corresponding regions by two neuroradiologists.

2.2 MS Lesion Change Detection

For each patient, we consider six three-dimensional volumes: T1, T2, and FLAIR at two time points. We denote intensity of a voxel v from the intracranial volume mask for patient i, from the imaging study conducted at time $t_j$ $(j=1,2)$ by $M_{iv}^{t_j }$, where $M\in \{T1,T2,FLAIR\}$. For common interpretations of voxel intensities, each volume was normalized by computing the z-scores of the intracranial volume intensities:

$$\begin{aligned} \widetilde{M}_{iv}^{t_j}=\frac{M_{iv}^{t_j}-\mu _{i,M}^{t_j}}{\sigma _{i,M}^{t_j}} \end{aligned}$$

where the mean $\mu _{i,M}^{t_j}$ and standard deviation $\sigma _{i,M}^{t_j}$ are computed as sample statistics across the voxels in the intracranial volume mask for patient i at time $t_j$. For each patient, dissimilarity maps were extracted by voxel-wise subtraction of the normalized image between two time points for each imaging sequence M as $\varDelta M_{iv}=\widetilde{M}_{iv}^{t_2}-\widetilde{M}_{iv}^{t_1}$.

Lesion Change Model and Voxel-Level Descriptors. We model the presence of change in a set of preregistered multi-sequence MR images as a function of descriptors extracted from $\widetilde{M}_{iv}^{t_1}$ and $\varDelta M_{iv}$ for each imaging sequence M. With the voxel-level lesion change represented by a random variable R, the probabilities for each test patient i at voxel v are modeled as a logistic regression:

$$\begin{aligned} logit[P\{R_{iv}=1\}]=\alpha _0+\varSigma _{x=1}^6 \alpha _x I_{iv}^x \end{aligned}$$

(1)

where $I^x\in \{\widetilde{FLAIR}^{t_1}, \varDelta FLAIR,\widetilde{T1}^{t_1}, \varDelta T1, \widetilde{T2}^{t_1}, \varDelta T2\}$ constitute the six input image volumes for patient i.

Incorporating Neighborhood Information. The first scale of the context we incorporate into the model in Eq. (1) is the immediate neighborhood of a voxel in the form of $\varDelta FLAIR$ values over a $K\times K$ neighborhood of each considered voxel. With x representing the indices of the neighborhood voxels, and these values were used as additional descriptors to learn additional coefficients $\beta $:

$$\begin{aligned} logit[P\{R_{iv}=1\}]=\alpha _0+\varSigma _{x=1}^6 \alpha _x I_{iv}^x+\varSigma _{x=1}^8 \beta _x \varDelta FLAIR_{iv}^x \end{aligned}$$

(2)

Incorporating Local Texture Descriptors. To incorporate a wider context, we extracted texture descriptors from a larger $L\times L$ neighborhood of a voxel on a $\varDelta FLAIR$ sequence. From voxel intensities within the brain mask, descriptors representing intensity statistics, such as mean, standard deviation, and kurtosis, were generated. Texture-based descriptors were extracted from the gray-level co-occurrence matrix (GLCM) of each patch using Haralick descriptors [3] and the gray-length run-length matrix (GLRLM) descriptors [9]. This resulted in 31-dimensional vectors, which were normalized to their z-scores to obtain $T_{iv}^x$, $x=1,\dots ,31$ for each selected voxel v. The logit model using these 31 descriptors is denoted as:

$$\begin{aligned} logit[P\{R_{iv}=1\}]=\gamma _0+\varSigma _{x=1}^{31} \gamma _x T_{iv}^x \end{aligned}$$

(3)

Multi-scale Method. The joint multi-scale model combines the multi-sequence voxel intensities with the immediate neighborhood and the local texture descriptors. This descriptor was used as input to fit the logistic regression function:

$$\begin{aligned} logit[P\{R_{iv}=1\}]=\alpha _0+\varSigma _{x=1}^6 \alpha _x I_{iv}^x+\varSigma _{x=1}^8 \beta _x \varDelta FLAIR_{iv}^x+\varSigma _{x=1}^{31} \gamma _x T_{iv}^x \end{aligned}$$

(4)

The logistic regression model was first learned using the balanced dataset over a selection mask, computed at voxel v for patient i as:

$$\begin{aligned} S_{iv}={\left\{ \begin{array}{ll} 1 &{} \varDelta FLAIR_{iv}>\sigma _{\varDelta }{ FLAIR_i} \\ 0 &{} \text {otherwise} \end{array}\right. }. \end{aligned}$$

In testing, the learned set of coefficients ${\alpha _0,\dots \alpha _6, \beta _1,\dots \beta _8, \gamma _1,\dots \gamma _{31} }$ was applied to infer the lesion change probability maps for the whole intracranial volume of a test subject.

3 Experiments and Results

The proposed method (Eq. (4)) was compared to an implementation of the existing state-of-the-art method [8] that considers only the multi-sequence voxel intensities (Eq. (1), referred here as Baseline 1); a method using the intensities as well as the immediate neighborhood context information from a $3\times 3$ neighborhood (Eq. (2), referred to as Baseline 2); and a method using texture-based descriptors (Eq. (3), referred to as Baseline 3).

A three-fold cross-validation scheme was used. To minimize the effect of random variation on training and testing subsets within folds, each three-fold experiment was run three times, resulting in 45 patient-level ROC and PR curves for each method. Table 1 and Fig. 2 depict comparisons of AUC-ROC and AUC-PR for the four methods. The higher performance of the Multi-Scale Method compared to other methods was statistically significant ($p < 0.001$) both in terms of AUC-ROC and AUC-PR.

Table 1. Median ± median absolute deviation values for patient-level AUC-ROC and AUC-PR generated across 45 independent three-fold cross-validated trials.

Full size table

3.1 Importance of Texture-Based Descriptors

Comparing Baseline 2 and the Multi-Scale Method, texture-based radiomic features clearly contribute to the latters higher performance. Here we investigate the discriminative power of the set of texture descriptors. Figure 3 shows the differences in descriptor values between voxels with lesion change and voxels with no change. Since the descriptors were normalized for each patients data, the learned coefficients $\gamma _x$ as in Eq. (4) indicate the significance of the corresponding descriptors in differentiating lesion change detection. Coefficients in Eq. (4) that are consistently large in magnitude suggest that they are especially important to generating the output lesion change probability map.

To study the importance of specific texture-based descriptors, a large set of coefficients were generated using a Monte-Carlo simulation. With repeated random sampling of image data from ten patients, 32 sets of coefficients were generated. A one-sample Wilcoxon signed-rank test was used to evaluate which coefficients consistently differ most from zero (null hypothesis that the median value is zero). The top ten descriptors ($p < 0.01$) were found to be homogeneity, sum average, sum variance, standard deviation, contrast, dissimilarity, difference variance, difference entropy, short run-length emphasis (RE), and long run high gray-level emphasis (GE).

4 Discussion

We proposed a multi-scale MS lesion change detection method, which had the highest performance when compared to three baseline methods that employed the multi-sequence MR images at fewer scales. The proposed method differs from others by extracting descriptors from not only the voxel itself, as in Ref. [8], which corresponded to Baseline 1, but also at two additional spatial scales, i.e. at immediate and extended neighborhoods. Since a radiologist annotates image voxels not in isolation but in the context of its surrounding voxels, it is intuitive that multi-scale context information improves lesion change detection.

Comparing Baseline 2 and the Multi-Scale Method, texture descriptors clearly contribute to the latter’s higher performance. The Multi-Scale Method and Base-line 3 had higher AUC-PR than Baseline 1 and 2, suggesting that texture-based descriptors improve the algorithms performance in the task of differentiating lesion change areas from non-lesion change areas. The top ten descriptors reported in the Results section (e.g. homogeneity, sum average) were especially critical in distinguishing between areas with and without lesion change, since their learned weights differ most from a median value of zero across multiple experiments. Extracting only these descriptors resulted in comparable performance (median AUC-ROC = 0.97, AUC-PR = 0.40) to using all 31 texture-based descriptors (median AUC-ROC = 0.97, AUC-PR = 0.43), which suggests that they may be sufficient for less computationally expensive use in a clinical setting.

While radiomic descriptors have made progress in tumor detection and segmentation for other diseases, this work is one of the first to use these descriptors in MS lesion change detection and highlights the importance of specific descriptors for this purpose. As expected, despite the significant contributions of texture descriptors, the intensities of the central voxel and its immediate neighborhood nonetheless contributed significantly to the performance. As per our experiments, the proposed Multi-Scale Method outperformed Baseline 3 due to these additional descriptors. This once again emphasizes the importance of a multi-scale approach. As a reference point, we also evaluated the coefficients directly provided by Ref. [8]. It resulted in a median AUC-ROC of 0.82, which was significantly lower than the results from training on our dataset as presented in Table 1, but the result is high enough to confirm the generalizability of such an approach, which can be attributed to the standard image preprocessing routines applied in both studies.

Quantitative longitudinal MS lesion analysis could provide insight into disease progression that occurs subtler and earlier than clinical markers like deterioration of physical movement. In the current clinical practice, lesion analysis is limited to recording lesion count and location. Manual annotation of the MS lesion change is challenging due to the number of sequences (T1, T2 and FLAIR in our study) that need to be taken into account when performing the labeling of each potential new lesion area. Moreover, the task becomes even more time and cost expensive in a setting of a large-scale clinical trials, where, due to the larger number of imaging timepoints of interests, it is crucial to have a consistent annotation in order to reliably evaluate the effectiveness of the tested treatment. Reliable automated methods such as our approach can be used to expedite and assist the radiologists daunting task of labeling lesion change for many patients and provide objective evaluations in a consistent and efficient manner.

To conclude, we proposed a multi-scale MS lesion change detection method, which incorporates not only information at voxel level, but also information from neighborhood and texture-based descriptors from the larger patch surrounding each voxel. The method statistically significantly improved over the state-of-the-art method. We also showed the importance of texture-based descriptors to effective lesion change detection, which to the best of our knowledge has not been explored in previous works.

References

NIH fact sheets - multiple sclerosis. https://report.nih.gov/nihfactsheets/ViewFactSheet.aspx?csid=103
Avants, B.B., Tustison, N.J., Stauffer, M., Song, G., Wu, B., Gee, J.C.: The insight ToolKit image registration framework. Front. Neuroinformatics 8, 44 (2014). https://doi.org/10.3389/fninf.2014.00044
Article Google Scholar
Haralick, R.M.: Statistical and structural approaches to texture. Proc. IEEE 67(5), 786–804 (1979). https://doi.org/10.1109/PROC.1979.11328
Article Google Scholar
Lesjak, Ž., Pernuš, F., Likar, B., Špiclin, Ž.: Validation of white-matter lesion change detection methods on a novel publicly available MRI image database. Neuroinformatics 14(4), 403–420 (2016). https://doi.org/10.1007/s12021-016-9301-1
Article Google Scholar
Lladó, X., et al.: Automated detection of multiple sclerosis lesions in serial brain MRI. Neuroradiology 54(8), 787–807 (2012). https://doi.org/10.1007/s00234-011-0992-6
Article Google Scholar
Mechrez, R., Goldberger, J., Greenspan, H.: Patch-based segmentation with spatial consistency: Application to MS lesions in brain MRI. https://doi.org/10.1155/2016/7952541
Smith, S.M.: Fast robust automated brain extraction. Hum. Brain Mapp. 17(3), 143–155 (2002). https://doi.org/10.1002/hbm.10062
Article Google Scholar
Sweeney, E.M., Shinohara, R.T., Shea, C.D., Reich, D.S., Crainiceanu, C.M.: Automatic lesion incidence estimation and detection in multiple sclerosis using multisequence longitudinal MRI. Am. J. Neuroradiol. 34(1), 68–73 (2012). https://doi.org/10.3174/ajnr.A3172
Article Google Scholar
Tang, X.: Texture information in run-length matrices. IEEE Trans. Image Process. 7(11), 1602–1609 (1998). https://doi.org/10.1109/83.725367
Article Google Scholar
Tustison, N.J., et al.: N4itk: improved n3 bias correction. IEEE Trans. Med. Imaging 29(6), 1310–1320 (2010). https://doi.org/10.1109/TMI.2010.2046908
Article Google Scholar
Zhang, Y.: MRI texture analysis in multiple sclerosis. https://doi.org/10.1155/2012/762804

Download references

Acknowledgments

This material is based upon work supported by Philips Healthcare.

Author information

Authors and Affiliations

Stanford University, Stanford, CA, 94305, USA
Myra Cheng, Alfiia Galimzianova, Christopher B. Lock & Daniel L. Rubin
University of Ljubljana, 1000, Ljubljana, Slovenia
Žiga Lesjak & Žiga Špiclin

Authors

Myra Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Alfiia Galimzianova
View author publications
You can also search for this author in PubMed Google Scholar
Žiga Lesjak
View author publications
You can also search for this author in PubMed Google Scholar
Žiga Špiclin
View author publications
You can also search for this author in PubMed Google Scholar
Christopher B. Lock
View author publications
You can also search for this author in PubMed Google Scholar
Daniel L. Rubin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alfiia Galimzianova .

Editor information

Editors and Affiliations

University College London, London, UK
Danail Stoyanov
University of Leeds, Leeds, UK
Zeike Taylor
University of Adelaide, Adelaide, SA, Australia
Gustavo Carneiro
IBM Research – Almaden, San Jose, CA, USA
Tanveer Syeda-Mahmood
Sunnybrook Health Science Centre, Toronto, ON, Canada
Anne Martel
Deutsches Krebsforschungszentrum (DKFZ), Heidelberg, Germany
Lena Maier-Hein
University of Porto, Porto, Portugal
João Manuel R.S. Tavares
Queensland University of Technology, Brisbane, QLD, Australia
Andrew Bradley
Universidade Estadual Paulista, Bauru, São Paulo, Brazil
João Paulo Papa
OSRAM (Germany), Garching b. München, Germany
Vasileios Belagiannis
University of Lisbon, Lisboa, Portugal
Jacinto C. Nascimento
ReFUEL4, Singapore, Singapore
Zhi Lu
German Center for Neurodegenerative Diseases (DZNE), Munich, Germany
Sailesh Conjeti
IBM Research – Almaden, San Jose, CA, USA
Mehdi Moradi
Tel Aviv University, Tel Aviv, Israel
Hayit Greenspan
Case Western Reserve University, Cleveland, OH, USA
Anant Madabhushi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cheng, M., Galimzianova, A., Lesjak, Ž., Špiclin, Ž., Lock, C.B., Rubin, D.L. (2018). A Multi-scale Multiple Sclerosis Lesion Change Detection in a Multi-sequence MRI. In: Stoyanov, D., et al. Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support. DLMIA ML-CDS 2018 2018. Lecture Notes in Computer Science(), vol 11045. Springer, Cham. https://doi.org/10.1007/978-3-030-00889-5_40

Download citation

DOI: https://doi.org/10.1007/978-3-030-00889-5_40
Published: 20 September 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00888-8
Online ISBN: 978-3-030-00889-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics