Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

Diffusion magnetic resonance imaging (dMRI) [1] is a modality that provides non-invasive images of the white matter of the brain. DMRI measures the local the diffusion process of the water molecules in each voxel. That process represents structural information of neuronal axons. From dMRI data, tracking algorithms [9, 14] allow to reconstruct the 3D pathways of axons within the white matter of the brain as a set of streamlines, called tractography. A streamline is a 3D polyline representing thousands of neuronal axons in that region of the brain, and a tractography is a large set streamlines, usually \(\approx 3 \times 10^5\).

Current neuroscientific analyses of white matter tractography data are limited to qualitative intra-subject comparisons. Thus, it is quite difficult to use the information for direct inter-subject comparisons [2, 5]. This leads to the need of initial alignment, or registration, of tractographies via some methods before doing further study. Registration is most often performed by applying the transformation resulting from the registration of other images, such as T1 or fractional anisotropy (FA), to tractography [5, 6, 12]. Recently, [10] proposed group-wise registration using the trajectory data of the streamlines. The idea to work on deterministic tractography rather than other images is quite innovative. And, it may be advantageous to directly align the streamlines because the result would be closely related to the final goal of registration.

Similar to [10], in this work, we explore the idea of working on deterministic tractography rather than other images. However, in contrast to all current tractography registration methods, which are based on rigid or non-rigid shape transformation of one tractography into another, our approach tries to find which streamline of one tractography corresponds to which streamline in the other tractography, without transformations. This correspondence is a mapping from one tractography to the other.

In this work we propose to solve the problem of finding the mapping between two tractographies through a graph-based approach similar to that of the well-known graph matching problem [3, 13]. In the graph matching problem the aim is to find which node of one graph corresponds to which node of another graph, under the assumption that graphs have the same number of nodes and that the correspondence is one-to-one.

Given a tractography of N streamlines \(T = \{s_1,\ldots ,s_N\}\) and a distance function d between streamlines, we can create an undirected weighted graph by considering each streamline as a vertex and the edge connecting vertex \(s_i\) and \(s_j\) as the distance between the two streamlines, \(d(s_i,s_j)\). Then, intuitively, the problem of tractography mapping becomes very similar to that of graph matching, but with some key differences. Firstly, the size of the two tractographies/graphs is in general not the same. Global differences in the anatomy of the brains, e.g. different volume, motivates this difference. Secondly, in general there is not a one-to-one correspondence between the streamlines/nodes but a many-to-one correspondence. This is anatomically likely if we consider that a given anatomical structure (tract), e.g. the cortico-spinal tract (CST), whose streamlines should have direct correspondence across subjects, may have different thickness, i.e. different number of streamlines. In this case, for example, multiple streamlines of one CST would correspond to a single streamline in the other CST. Because of these differences, it is generally not possible to directly apply efficient graph matching algorithms to the problem of mapping tractographies.

In the following we formally describe the tractography mapping problem starting from the graph matching problem and define the details of the optimization problem to solve. We provide a preliminary algorithmic solution, based on simulated annealing, to minimize the proposed loss function. Then, we apply our proposed solution to a tractography segmentation task in order to compare a standard registration-based method to our proposed method on a fair ground. We conclude the paper with a brief discussion of the preliminary encouraging results.

2 Methods

An undirected weighted graph \(G=(V,E)\) of size N is a finite set of vertices \(V=\{1,\ldots ,N\}\) and edges \(E \subset V \times V\). The graph matching problem can be described as follows. Given two graphs \(G_A\) to \(G_B\) with the same number of vertices N, the problem of matching \(G_A\) and \(G_B\) is to find the correspondence between vertices of \(G_A\) and vertices of \(G_B\), which allows to align, or register, \(G_A\) and \(G_B\) in some optimal way. The correspondence between vertices of \(G_A\) and of \(G_B\) is defined as a permutation P of the N vertices, i.e. there a one-to-one correspondence between the two set of vertices. P is usually represented as a binary \(N\times N\) matrix where \(P_{ij}\) is equal to 1, if the ith vertex of \(G_A\) is matched to the jth vertex of \(G_B\), otherwise 0. Given A and B, i.e. the \(N\times N\) adjacency matrices of the two graphs, the quality of the matching is assessed by the discrepancy, or loss, between the graphs after matching as:

$$\begin{aligned} L(P) = \Vert A - P B P^\top \Vert _2 \end{aligned}$$
(1)

where \(\Vert A \Vert _2 = \sqrt{\sum _{ij}^N A_{ij}^2}\) is the Frobenius norm. Therefore, the graph matching problem becomes the problem of finding \(P^*\) that minimize L over the set of permutation matrices \(\mathcal {P}\):

(2)

which is a combinatorial optimization problem. The exact solution to this problem is NP-complete and only approximate solutions are available in practical cases [3, 13].

Let \(T_A = \{s^A_1,\ldots ,s^A_N\}\) and \(T_B = \{s^B_1,\ldots ,s^B_M\}\), where \(s = \{x_1,\ldots ,x_{n_s}\}\) is a streamline and \(x \in \mathbb {R}^3\), be the tractographies of two subjects. Let d be a distance function between streamlines. We define two graphs \(G_A\) and \(G_B\) with adjacency matrix \(A \in \mathbb {R}^{N \times N}\) and \(B \in \mathbb {R}^{M \times M}\) where \(A_{ij} = d(s^A_i,s^A_j)\) and \(B_{ij} = d(s^B_i,s^B_j)\). Our current choice of d is discussed in Sect. 3, however any common streamline distance from the literature can be used.

The loss function of a mapping Q from \(T_A\) to \(T_B\) is then:

$$\begin{aligned} L(Q) = \Vert A - Q B Q^\top \Vert _2 \end{aligned}$$
(3)

where the mapping Q is a binary \(N\times M\) matrix and \(Q_{ij}\) is equal to 1, if \(s_i^A\) of \(T_A\) is mapped to \(s_j^B\) of \(T_B\) and 0 otherwise. Note that, in general, Q is not a permutation matrix, because multiple streamlines can be mapped into the same one. In order to find the optimal mapping \(Q^*\), we minimize L so that \(T_B\) is most similar to \(T_A\):

(4)

where \(\mathcal {Q}\) is the set of all possible mappings. Because in general \(N \ne M\) and because Q is a mapping and not just a permutation, the tractography mapping problem has a larger search space than the graph matching problem, i.e. \(|\mathcal {Q}| = M^N \gg N! = |\mathcal {P}|\) when \(M \approx N\), is much larger than \(\mathcal {P}\). As a consequence, the efficient solutions available in the literature of graph matching, e.g. [13], are not applicable, because they heavily rely on the assumptions that we violate here. In Sect. 3 we implemented a simple preliminary solution to the combinatorial optimization problem by means of the Simulated Annealing meta-heuristic [8].

2.1 Comparison

In order to compare the proposed method against a standard registration procedure on a fair ground, we cannot rely on the value of the loss function L, because it is defined only in the case of mapping. For this reason, we compared the two approaches on the practical task of automatic tractography segmentation, i.e. finding a given tract of interest in \(T_B\) given its segmentation in \(T_A\). Our hypothesis is that reducing L leads to better overlap between tractographies, which is important for practical applications like segmentation. In Sect. 3 we describe an experiment to test this hypothesis and provide the necessary details. Here we introduce the metric that we use for comparing registration and mapping. As proposed in [5], we compare the set of voxels crossed by the streamlines of each tractography after mapping or after registration. As measure of the overlap between \(T_A\) and \(Q(T_B)\) Footnote 1, we adopt the Jaccard index:

$$\begin{aligned} J(T_A, T_B|Q) = \frac{|T_{A} \cap Q(T_B)|}{ \min \{|T_A|, |Q(T_B)|\}} \end{aligned}$$
(5)

Note that in the above equation, |T| is the volume computed as number of voxels that any streamline \(s \in T\) goes through, and \(|T_A \cap Q(T_B)|\) indicates the number of voxels in common between \(T_A\) and \(Q(T_B)\).

3 Experiments

We designed an experiment to provide empirical evidence that reducing the loss in Eq. 3 is related to an increase of the Jaccard index, i.e. of the overlap between tractographies.

The dataset used for the experiment is based on dMRI data recorded with a 3T scanner at Utah Brain Institute, 65 gradients (64 + b0); b-value = 1000; anatomical scan (\(2\times 2\times 2\) \(\mathrm{mm}^3\)). The tractography was reconstructed with the EuDX algorithm [4] using the dipyFootnote 2 toolbox. We considered 4 healthy subjects and focused the analysis on the corticospinal tract (CST). CST is a set of streamlines projecting from the lateral medial cortex associated with the motor homunculus. This tract is of main interest for the characterization of neurodegenerative diseases, like the amyotrophic lateral sclerosis (ALS). The CST tracts were segmented by the expert neuroanatomists using a toolbox [11] that supports an interactive selection of streamlines. The size of the segmented tracts is reported in Table 1 (see column size).

The reference method, against which we compared mapping, is the affine registration of the tractographies in a common MNI space using the voxel-based FLIRT method [7]. The registration is defined as follows: First, FA images were registered to the MNI-FMRIB-58 FA template, then the affine transformation was applied to the tractographies. The Jaccard index computed between the \(CST_A\) and \(CST_B\) in common space is reported in Table 1 (see column FLIRT).

We then used mapping to compute the same quantity. The first step was encoding the tractographies as graphs, which required to define a distance between streamlines. We refer to the commonly used Mean Average Minimum distance (MAM) [14], based on the Hausdorff distance:

$$\begin{aligned} d_{MAM}(s,s') = \frac{1}{2} (D(s,s') + D(s',s)) \end{aligned}$$
(6)

where \(D(s,s') = \frac{1}{n_s} \sum _{i=1}^{n_s} d(x_i,s')\), and \(d(x, s') = \min _{j = 1,\ldots ,n_{s'}}||x-x'_j||_2\).

Mapping a tract such as the CST, which usually comprises \(10^2\) streamlines, to an entire tractography \(T_B\), which usually consist of \(10^7\) streamlines, is computationally extremely expensive because the space of all possible mappings \(\mathcal {Q}\) has size \(|T_B|^{|CST|}\). For this reason, we introduced a heuristic to retain some of the streamlines in \(T_B\). The intuitive idea was to define a superset of streamlines of the CST for subject B, denoted \(CST_B^+\). The heuristic is in two steps: first, we computed the medoid \(s_m\) of \(CST_B\), and the radius \(r = \max \{d(s_m,s_i), \forall s_i \in CST_B\}\). Second, we filtered the streamlines in \(T_B\) such that \(CST_{B}^+ = \{s_j \in T_B | d(s_{m}, s_j) \le \alpha \cdot r\} \), where \(\alpha = 3\). See Table 1, column \(CST_B?\), for the actual sizes of the supersets.

Computing the optimal mapping \(Q^*\) requires to solve, even in an approximate way, the minimization problem of Eq. 4. As a preliminary strategy to approximate the optimal mapping \(Q^*\), we implemented the simulated annealing (SA) [8] meta-heuristic, a reference method for combinatorial optimization. SA requires the definition of a function to move from the current state, i.e. the current mapping Q, to a (potentially better) neighbouring one. As transition function we used a stochastic greedy one where, given the current mapping Q, one streamline of \(CST_A\) is selected at random and then it is greedily re-mapped to the streamline in \(CST_B^+\) providing the greatest reduction in the loss of Eq. 3. As starting point of the annealing process, we used the 1-nearest neighbour of \(CST_A\) with respect to \(CST_{B}^+\) after the registration of \(T_A\) and \(T_B\). We ran the simulated annealing for 1000 iterations, which required a few minutes on a standard computerFootnote 3.

The results reported in Fig. 1 show the behaviour of the loss during the optimization process for the mapping of \(CST_A\) (subject ID 205), with respect to the tractography of three other subjects (subject IDs 204, 206 and 212). In all cases, as the number of iterations increases, the value of loss function decreases. In Fig. 2 we show an example of experiment with the outcome of FLIRT registration and mapping which refers to subjects 204 and 206. In subfigure A, the source tract \(CST_A\) is shown in blue, in subfigure B the target tract \(CST_B\) is show in green and the related superset of streamlines \(CST_B^+\) in red. In subfigure C, the result of FLIRT registration is presented, both with respect to the superset \(CST_B^+\) on the left and with respect to the target tract \(CST_B\) on the right. On the right side, it is illustrated the set of streamlines (blue) from the source tract \(CST_A\) associated to streamlines of target tract \(CST_B\). The association between streamlines of \(CST_A\) and \(CST_B\) is computed as nearest neighbour after the FLIRT registration. The ratio between blue and green streamlines represents the portion of target tract correctly detected. On the left side of subfigure C, blue streamlines represents the portion of source tract \(CST_A\) not associated to target tract \(CST_B\). In subfigure D, the result of mapping is presented, with the same strategy of presentation of subfigure C. On the right side the visualization shows a greater amount of (blue) streamlines correctly mapped into target tract. Even on the left side the amount of (blue) streamlines erroneously mapped is greater. The sum of blue streamlines on the left and right side represents the portion of streamlines projected from the source to the target. The registration based on FLIRT doesn’t preserve after the alignment the same amount of streamlines from the source tract.

Fig. 1.
figure 1

Plots of the normalized loss (\(L_{norm} = \frac{L}{|CST_A|} \)) as a function of number of iterations with simulated annealing, when mapping the CST of subject 205 to those of subjects 204, 206 and 212.

Fig. 2.
figure 2

Example of registration vs. mapping of the corticospinal tract (CST). From the left, the tract to be mapped (subfigure A, \(CST_A\) in blue), the second tract with its superset (subfigure B, \(CST_B\) in green, \(CST^+_B\) in red), the result of FLIRT affine registration (subfigure C) and of mapping (subfigure D). (Color figure online)

Table 1. Comparison of registration vs. mapping. The subject IDs of \(CST_A\) and \(CST_B\) are reported in the first two columns. Their sizes, together with that of \(CST_B^+\), are in columns three to five. The last four columns report the overlap between \(CST_A\) and \(CST_B\) in terms of Jaccard index (higher is better), for FLIRT registration (6th column) and for mapping with simulated annealing at a different number of iterations (SA-0, SA-100, SA-1000 columns).

In Table 1 are reported the results of the comparison between registration and mapping methods, measured by the Jaccard index. The overlap between \(CST_A\) and \(CST_B\) provided by FLIRT registration is generally quite poor. This is partly expected because even after the registration of \(T_A\) and \(T_B\), \(CST_A\) and \(CST_B\) may have a systematic displacement due to the variability of anatomy across subjects. The results of mapping at different iterations of the optimization process shows a remarkable global increase in the Jaccard index and a general trend of improved alignment when more iterations are computed.

4 Discussion and Conclusion

In this work we addressed the challenge of finding an alignment between the tractographies of two subjects. We recast the question as a problem of mapping between two sets of streamlines and we provided the formulation of the corresponding minimization problem. Preliminary results show that this approach is promising despite some limitations. The computational complexity represents a major issue that may prevent to scale up to whole tractography.