1 Introduction

Diffusion tensor imaging (DTI) is a relatively new MRI modality capable of generating contrasts that are sensitive to fiber orientations. It carries rich information about intra-white-matter axonal anatomy, which cannot be seen in conventional MRI. DTI-based segmentation, where regions of interest are delineated, is necessary for performing subsequent quantitative analysis and qualitative visualization. While scalar image segmentation has been studied extensively and different algorithms have been developed over the last decades, DTI-based segmentation is a relatively new and challenging task [4].

The use of DTI properties to perform the clustering of brain tissue was initially described by Pierpaoli et al. [32]. More recently, DTI-based clustering strategies have been described [3, 25, 29, 41, 42, 44, 45] and have been applied in many studies, including studies of the thalamic nuclei [11, 17, 18, 43, 45], the thalamocortical projections [7] and the fetal brain [26]. Although the strategies are based on well-known segmentation algorithms, such as level-sets, region growing and graph-cuts, they were actually developed exclusively for DTI. Significant modifications in the original algorithms, conceived to work with scalar images, had to be done in order to deal with tensorial information.

Instead of defining a new segmentation method, the aim of this study is to explore the transformation of diffusion tensor data into scalar maps and the use of these scalar maps, together with the watershed transform, to segment diffusion tensor fields. Preliminary investigation of watershed segmentation of diffusion scalar maps has been done [35], and this paper explores new scalar maps and analyzes the performance of the segmentation with them.

The watershed transform is a region-based segmentation approach and has been successfully applied to the solution of many medical imaging problems, such as the segmentation of blood cells [12], neuromorphometry [13] and characterization of human cortex [9, 33]. It was chosen over other segmentation techniques because it is simple and efficient, and does not require previous knowledge of the structures to be segmented, surface initialization or manual seed placement.

A crucial step of the watershed-based segmentation is the computation of a gradient of the image to be segmented. The magnitude of a gradient can be seen as a scalar map that contains edge information. In the diffusion tensor field context, it is desirable that the gradient helps to find edges between regions with distinct diffusion characteristics. In other words, the gradient should be strong where diffusion changes are significant and weak where diffusion changes are imperceptible.

Several scalar maps based on diffusion measures can be found in the literature, including intra- and intervoxel measures. The mean diffusivity (MD), the fractional anisotropy (FA) and the volume ratio (VR) are examples of intravoxel measures. Other measures, such as the lattice index (LI) and the coherence index (CI), are classified as intervoxel measures. It is important to point out that none of the above scalar maps were conceived for segmentation purposes and, therefore, they do not necessarily contain enough information to distinguish regions with distinct diffusion characteristics. Recently, we published a technique, called the tensorial morphological gradient (TMG), that was design specifically to be used in the context of segmentation. The TMG transforms the diffusion tensor image into a scalar map with meaningful values to detect borders between brain structures [35]. The tensorial morphological gradient uses diffusion intervoxel measures and combines them to compute a gradient using concepts from mathematical morphology.

The contributions of this work are threefold: first, new scalar maps are proposed by combining mathematical morphology operators and existing measures. Second, we analyze the scalar maps, taking into account their suitability for the segmentation task. Third, segmentation experiments using the watershed transform illustrate the main differences between scalar maps and illuminate which are optimal for segmentation.

This paper is organized as follows: while Sect. 2 presents existing intra- and intervoxel measures, Sect. 3 describes a new class of intervoxel measures being proposed: gradients based on mathematical morphology. The experiments are described in Sect. 4 and results are presented in Sect. 5. Finally, Sect. 6 discusses the obtained results and Sect. 7 concludes the paper.

2 Existing Scalar Measures in Diffusion MRI

2.1 Intravoxel Diffusion Measures

Several properties of the diffusion tensor are rotationally invariant and useful in deriving quantitative information from the diffusion tensor and in comparing different tensors and ellipsoids. The most simple invariant measures obtained from DTI is the mean diffusivity (MD), which is one third of the trace of the tensor T:

$$ \mathit{MD}=\frac{\text{Trace}( \mathsf {{T}})}{3}=\frac{\lambda_{1}+\lambda_{2}+\lambda_{3}}{3}. $$
(1)

Another simple kind of rotationally invariant measure is the “diffusion anisotropy index” or DAI, that gives an idea of the degree of anisotropy of a given diffusion tensor. The common DAIs range from 0 to 1 and do not require eigenvalue sorting. The simplest anisotropy measure is the variance of the three eigenvalues about their mean. The variance alone, however, needs to be normalized to account for regional differences in the overall magnitude of diffusivity. The most used DAI based on this logic is the fractional anisotropy (FA) [5] given by:

$$ \mathit{FA} = \sqrt{\frac{3}{2}}\frac{\sqrt{(\lambda_{1}-\mathit {MD})^{2}+(\lambda_{2}-\mathit{MD})^{2}+(\lambda_{3}-\mathit {MD})^{2}}}{\sqrt{\lambda_{1}^{2}+\lambda_{2}^{2}+\lambda_{3}^{2}}}. $$
(2)

It normalizes the variance by the magnitude of the tensor as a whole. In other words, FA measures the fraction of the tensor that can be assigned to anisotropic diffusion. The FA measure is appropriately normalized so that it takes values from zero (isotropic diffusion) to one (anisotropic diffusion).

There are many other ways to represent the anisotropy, such as the scaled relative anisotropy (sRA):

$$ \mathit{sRA} = \frac{\sqrt{(\lambda_{1}-\mathit{MD})^{2}+(\lambda_{2}-\mathit{MD})^{2}+(\lambda_{3}-\mathit{MD})^{2}}}{{\sqrt{6}}\mathit {MD}}, $$
(3)

and the volume fraction (VF),

$$ \mathit{VF} = 1 - \mathit{VR} = 1 - \frac{\lambda_{1}\lambda_{2}\lambda_{3}}{\mathit{MD}^3}, $$
(4)

which is obtained by subtracting the volume ratio (VR) from unity, in order to scale the measure from 0 to 1 [21].

2.2 Intervoxel Diffusion Measures

The rotationally invariant properties mentioned in the previous section are usually computed for a single voxel and therefore, can be interpreted as an intravoxel measure. Further interesting information can be obtained by looking at how the tensor parameters estimated in a given voxel compare with those in neighboring voxels. The simplest way to do this is to use measures that quantify the distance (or the difference) between two neighboring tensors. Another approach is to compute the differences (or coherences) within a neighborhood of a tensor and combine all the obtained quantities into a single measure, that can be interpreted as a homogeneity (or inhomogeneity) measure.

2.2.1 Diffusion Tensor Distances

Distance between tensors can also be viewed as intervoxel measures. Given two tensors T i and T j , the most simple comparison between two tensor quantities, used by Ziyan et al. [45], is the dot product (DP) between the principal eigenvector directions:

$$ d_1( \mathsf {{T}}_{i}, \mathsf {{T}}_{j})=|e_{1,i} \cdot e_{1,j}|, $$
(5)

where e 1,i and e 1,j are the principal eigenvectors of tensors T i and T j , respectively. Another simple similarity measure, presented by Pierpaoli et al. [32] as an intervoxel anisotropy measure, is the tensor dot product (TDP):

$$ d_2( \mathsf {{T}}_{i}, \mathsf {{T}}_{j})= \mathsf {{T}}_{i}: \mathsf {{T}}_{j}= \sum _{k=1}^{3}\sum_{l=1}^{3}( \sqrt{\lambda_{k,i}}\sqrt{\lambda_{l,j}}e_{k,i}\cdot e_{l,j})^{2}. $$
(6)

Alexander et al. [1] proposed a number of tensor similarity measures. Their purpose was to match pairs of diffusion tensor images and the proposed measures were based on the diffusion tensor itself and indices derived from the diffusion tensor. One of the similarity measures proposed by them was obtained by negating the following Euclidean distance metric:

$$ d_3( \mathsf {{T}}_{i}, \mathsf {{T}}_{j})= \sqrt{\text{Trace}\bigl(( \mathsf {{T}}_{i}- \mathsf {{T}}_{j})^2 \bigr)}. $$
(7)

This similarity metric was also explored in other DTI studies under different names, such as generalized tensor dot product and Frobenius norm [19, 43, 45]. As affine invariance is a desirable property for segmentation purposes and the Frobenius norm (FN) is not invariant to affine transformations, Wang and Vemuri [41] proposed a novel definition of diffusion tensor “distance”, as the square root of the J-divergence (Jdiv) of the corresponding Gaussian distributions, i.e.,

$$ d_4( \mathsf {{T}}_{i}, \mathsf {{T}}_{j})= \frac{1}{2}\sqrt{\text{Trace}\bigl( \mathsf {{T}}_{i}^{-1} \mathsf {{T}}_{j}- \mathsf {{T}}_{j}^{-1} \mathsf {{T}}_{i} \bigr)-2n}, $$
(8)

where n is the matrix size that represents the tensor. Equation 8 is not a true distance since it violates the triangle inequality, but it is in fact a computationally efficient approximation of Rao’s distance [41]. More recently, a new approach for calculating tensor similarity has been adopted in DTI studies: the Log-Euclidean distances. Among the similarity metrics proposed by Arsigny et al. [2], there is a metric very closely related to the Frobenius norm, called the similarity-invariant Log-Euclidean distance (LogE), defined as :

$$ d_5( \mathsf {{T}}_{i}, \mathsf {{T}}_{j})= \sqrt{\text{Trace}\bigl(\bigl(\log( \mathsf {{T}}_{i})-\log( \mathsf {{T}}_{j})\bigr)^2\bigr)}. $$
(9)

Contrary to the classical Euclidean framework on tensors, one can see from Eq. 9 that symmetric matrices with null or negative eigenvalues are at an infinite distance from any tensor. To overcome this problem, in this paper we replace log(T i ) by log(100∗T i +1) to avoid the computation of the logarithm of null values.

Another affine invariant metric for statistical analysis and image processing of diffusion tensor data based on the Riemannian geometry was introduced independently by different authors [6, 31]:

$$ d_6( \mathsf {{T}}_{i}, \mathsf {{T}}_{j})= \sqrt{\text{Trace}\bigl(\log( \mathsf {{D}}_{ij})^2\bigr)}, $$
(10)

where D ij is equal to \(\mathsf {{T}}_{i}^{-\frac{1}{2}} \mathsf {{T}}_{j} \mathsf {{T}}_{i}^{-\frac{1}{2}}\).

It is important to notice that the mentioned intervoxel distance measures are not the only ones proposed in the literature. They were chosen to be part of this study because they come from different approaches and privilege specific aspects of the tensor. Peeters et al. [30] presented a classification of the intervoxel distance measures based on their derivation: measures based on scalar indices; measures that make use of the angles between eigenvectors; measures based on linear algebra; measures based on Riemannian geometry; measures considering the tensors as a representation of a probability density function and measures that combine different measures from the previous classes. Hence, whereas the dot product is an angular difference, the tensor dot product and the Frobenius norm come from linear algebra, the Log-Euclidean distance and the affine-invariant Riemannian metric (Riem) are based on Riemannian geometry and the J-divergence derives from statistical considerations.

2.2.2 Neighborhood Anisotropy Measures

Intervoxel anisotropy measures were introduced by Pierpaoli and Basser [32] and extended by Skare et al. [38] to reduce the effect of noise. The principle is to perform spatial averaging of indices based on both eigenvalues and eigenvectors.

One example of such a diffusion anisotropy measure is the A dd , that measures how much the directions of the tensor eigenvectors differ between a given voxel and its neighbors, and is defined as:

$$ A_{dd} = \frac{\sum_{1}^{8}{a_{N}A_{dd}^{N}}}{\sum{a_{N}}}, $$
(11)

where

$$ A_{dd}^{N} = \frac{ \hat {\mathsf {T}}_i: \hat {\mathsf {T}}_j}{ \mathsf {{T}}_{i}: \mathsf {{T}}_{j}} $$
(12)

and the tensor dot product between the anisotropic parts of the tensor \(\hat {\mathsf {T}}_{i}: \hat {\mathsf {T}}_{j}\) is given by:

$$ \hat {\mathsf {T}}_{i}: \hat {\mathsf {T}}_{j}= \mathsf {{T}}_{i}: \mathsf {{T}}_{j} - \frac {1}{3}\text{Trace}( \mathsf {{T}}_i)\text{Trace}( \mathsf {{T}}_j). $$
(13)

Another measure, first described by Klingberg et al. [23], is the coherence index (CI). The CI in a voxel is the mean dot-product of the primary eigenvector of the reference voxel and the primary eigenvector of each one of its eight neighboring voxels. When the eigenvectors are of unit length, the dot product is the cosine of the angle between the primary eigenvectors of both tensors and has a value between 0 and 1.

2.2.3 Tensorial Morphological Gradient (TMG)

The tensorial morphological gradient (TMG) is an intervoxel diffusion measure based on mathematical morphology. It was first applied to segment tensorial images representing color images [36, 37] and its concept was then extended and applied to synthetic diffusion tensor images [34]. It is based on mathematical morphology and intervoxel distances between neighboring tensors.

While the classical morphological gradient at each point of a scalar image yields the difference between the maximum and the minimum values over the neighborhood at the point determined by a structuring element, the tensorial morphological gradient (TMG) [36] of a tensorial image is defined by:

$$ \nabla^{ \mathsf {{T}}}_{B}(f) (x) = \textstyle{ \bigvee_{y,z\in B_{x}}} d_{n}( \mathsf {{T}}_{y}, \mathsf {{T}}_{z}), $$
(14)

xE, where d n represents any of the intervoxel distances presented in Sect. 2.2.1, BE is a structuring element centered at the origin of E, T y is the tensor that represents the diffusion in y, and T z is the tensor that represents the diffusion in z (y and z are in the neighborhood of x, defined by B x ). \(\nabla^{ \mathsf {{T}}}_{B}\) is the proposed TMG. Because the intervoxel measures are already comparisons between neighbors, the proposed gradient is not the difference between the maximum and the minimum values, but only the maximum value.

3 Gradients: A New Intervoxel Diffusion Measures Class

In previous works, a tensorial morphological gradient (TMG) was conceived to allow the segmentation of diffusion tensor fields by the watershed transform. Experiments depicted that the TMG is able to preserve borders while transforming the diffusion tensor filed in a scalar map, a desirable requisite of gradients to be used in segmentation tasks. Inspired by the good results obtained by the tensorial morphological gradient, a new class of intervoxel diffusion measures is being proposed: gradients based on mathematical morphology. This new class of diffusion measures are, therefore, composed by the tensorial morphological gradient (TMG) of diffusion intervoxel measures and the morphological gradient (MG) of diffusion intravoxel measures.

The morphological gradient (MG) is the most common gradient based on mathematical morphology used in image processing [16], and depends on the size and shape of the chosen structuring element. Let E be the set of all points in the image. Using a flat structuring element at each point, the morphological gradient (MG) yields the difference between the maximum and the minimum values over the neighborhood at the point determined by the flat structuring element:

$$ \nabla^{I}_{B}(f) = \textstyle{\bigvee _{y \in B_{x}}}\textbf {I}_{y} - {\bigwedge _{y \in B_{x}}}\textbf {I}_{y}, $$
(15)

where BE is a structured element centered at the origin of E and I y is the intensity value in y (y is in the neighborhood of x, defined by B x ).

Because the diffusion tensor image does not have a single intensity value to be considered in the morphological gradient computation, the proposed solution is to compute the morphological gradient of any intravoxel measure presented in Sect. 2.1, obtaining a new class of intervoxel measures. As an example, by taking the fractional anisotropy FA y as the I y , it is possible to compute the morphological gradient of the fractional anisotropy (MG-FA):

$$ \nabla^{\mathit{FA}}_{B}(f) = \textstyle{\bigvee _{y \in B_{x}}}\mathit{FA}_{y} - {\bigwedge _{y \in B_{x}}}\mathit{FA}_{y}. $$
(16)

Although the FA is an intravoxel measure, by computing its morphological gradient, information of the neighborhood is incorporated, resulting in an intervoxel measure. As with the TMG, the neighborhood intervoxel measures presented in Sect. 2.2.2 (A dd and CI) result from comparison between tensors in the neighborhood using intervoxel measures. The main difference between them and the TMG is that the TMG takes the maximum between the computed intervoxel measurements while the A dd and the CI take the average of the measurements (weighted or not by the distance of each neighbor).

Hence, it is possible to extend the definition of the TMG to A dd (TMG-A dd ), for example, defining a new measure:

$$ \nabla^{A_{dd}}_{B}(f) (x) = \textstyle{\bigvee _{y,z\in B_{x}}} A_{dd}( \mathsf {{T}}_{y}, \mathsf {{T}}_{z}). $$
(17)

By using the same reasoning, it is easy to conclude that the extension of the Coherence Index (CI), originally computed as the mean value of the dot product within a defined neighborhood, is the TMG based on the dot product (TMG-DP). Furthermore, whereas the neighborhood in the CI computation is fixed as a square around the voxel (8 neighbors), the neighborhood in the TMG-DP computation is defined by the structuring element and can be chosen accordingly to the application.

Tables 1 and 2 summarize all intravoxel and intervoxel measures presented in Sects. 2 and 3. Table 1 lists the existing voxelwise measures and points out whether the intravoxel and intervoxel measures are rotationally invariant or not, based on the eigenvalues and/or eigenvectors. Table 2 lists each neighborhood measure and identifies which measure it is based on, and whether it is an average of the measure within a neighborhood (mean), the biggest difference between neighbors (max−min) or it is the maximum measure found in the neighborhood (max). It is valuable to note that the measures listed in the last column of Table 2, with the exception of CI and A dd , are the ones based on mathematical morphology and proposed by the authors.

Table 1 Summary of existing intra and intervoxel measures, their rotationally invariance and their dependence on eigenvalues and eigenvectors
Table 2 Summary of proposed neighborhood measures, concerning their computation method: mean, maximum or difference between the maximum and the minimum value within the neighborhood

4 Methods

4.1 Acquisition of Human Brain Data

Real data were acquired on a Siemens 3T Trio MR scanner with N=30 diffusion encoding directions, 63 slices, 2.0 mm×2.0 mm×2.0 mm, TE=95 ms, TR=8.7 s and b=1000 s/mm2. Fifteen human subjects were scanned. The diffusion data was first linearly interpolated before tensor estimation and the negative eigenvalues were replaced with zero [24]. The MINC toolsFootnote 1 from the Montreal Neurological Institute, McGill University, were used for all diffusion image preprocessing.

4.2 Segmentation Experiments

In order to compare the performance of distinct diffusion scalar measures when used together with the watershed in DTI-based segmentation of white matter structures we studied the corpus callosum. The corpus callosum is the major white matter structure and conduit for information transfer between the cerebral hemispheres and plays an integral role in relaying sensory, motor and cognitive information between homologous cortical regions.

The original image was cropped around the corpus callosum to a subvolume of size 33×44×21 voxels (xyz orientation). Although the scalar maps would not differ if calculated for the entire image (brain) or only for a cropped version of it, the watershed results would suffer considerably if run for the entire brain. In order to detect all the borders of the corpus callosum, the number of regions chosen for the hierarchical watershed would have to be increased tremendously, since several markers would appear outside the corpus callosum. In other words, the method would attempt to detect all borders inside the brain, even the ones not related to the corpus callosum, and would have less precision in the corpus callosum region. In the other hand, when the image is cropped around the corpus callosum, the watershed transform concentrates its efforts on distinguishing the corpus callosum from the background.

Intravoxel and intervoxel maps were computed and visualized. All morphological gradients were computed using a 3×3 diamond structuring element and the measures based on the average used the 8-connected two-dimensional neighborhood of the reference tensor. The 8-connected neighborhood was used because of the original definition of CI and A dd , even though a 6-connected three-dimensional neighborhood is expected to result in a better scalar map.

The diffusion scalar maps were analyzed in the context of the segmentation task. After computation of the scalar maps, the hierarchical segmentation was achieved by the application of the 3D watershed from markers [8, 10, 14, 40] and extinction values computation [15, 27, 28, 39]. In other words, after calculating the scalar map of the original image, the n structures in the image which had the greatest volume extinction values were automatically selected. The n markers assigned to these regions were then used in the watershed transform, which segmented the scalar map into n regions.

The main goal of the segmentation experiments was to identify which scalar maps were able to successfully segment the corpus callosum for all fifteen sets of real data. A segmentation was considered successful if it was able to segment the corpus callosum as only one connected component.

The results obtained by the watershed using the scalar measures able to segment the corpus callosum successfully for all datasets, i.e., the MG-FA, A dd and the TMG-A dd (max), were compared to manual segmentation results, in order to evaluate the performance of the segmentation. Since there was no segmentation ground truth for comparison, the definition of a segmentation standard was based on manual segmentations made by specialists, as described by Kaus et al. [20]. Three specialists delineated the corpus callosum boundaries in the FA map and the standard segmentation in each subject image was defined as those voxels labeled as corpus callosum by at least two specialists. All other voxels were labeled as background.

Two different metrics were used to evaluate the obtained segmentation in comparison with the standard: kappa and overlap [22]. The kappa coefficient measures the agreement of the segmentations, taking into account the probability of random agreement. The overlap metric is defined as the percentage of voxels classified as corpus callosum by both the method and the standard with respect to the number of voxels classified as corpus callosum by at least one of them.

Other metrics, such as accuracy and specificity, were not used to compare the obtained segmentations because their values take into account the voxels labeled as background by both the method and the standard. As the background area is significantly larger than the corpus callosum, high values do not necessarily reflect good performance of the proposed method.

5 Results

5.1 Segmentation Experiments

An example of a real diffusion tensor image is presented in Fig. 1. The original image is a DT image of the brain. The eigenvalues from the DTI (λ 1, λ 2 and λ 3) are shown in Figs. 1(a)–(c), the principal eigenvector direction (PED) (color-coded map) in Fig. 1(d) and the manual segmentation in Fig. 1(e).

Fig. 1
figure 1

DTI of the corpus callosum—each eigenvalue (λ 1, λ 2 and λ 3) shown separately, the principal eigenvector direction (PED) and the result of manual segmentation. Small λ 2 and λ 3 and uniform color of the PED confirm that the corpus callosum is a highly oriented structure

Although the original image is volumetric (33×44×21 voxels), only one slice is presented, in order to facilitate visualization. All scalar measures cited in Sect. 2 were computed for the original DT image and presented in Figs. 2, 3, and 4. Intravoxel measures can be seen in Figs. 2(a)–(d), intervoxel measures based on the morphological gradient (max–min) of intravoxel measures are presented in Figs. 2(e)–(h) and their respective segmentation by the hierarchical watershed in Figs. 2(i)–(l). Figures 3(a)–(e) and 3(k)–(l) show the tensorial morphological gradients based on intervoxel measures (Jdiv, TDP, FN, Riem, LogE, DP and \(A_{dd}^{N}\)) and Figs. 3(f)–(j) and 3(m)–(n) present their respective watershed-based segmentation. Finally, Figs. 4(a)–(b) present neighborhood anisotropy measures based on average, CI and A dd , and Figs. 4(c)–(d) their respective watershed-based segmentation.

Fig. 2
figure 2

Intravoxel measures—MD, FA, VF and sRA, respective morphological gradient (MG) and resulting watershed segmentation. The MGs were computed using a 6-connected neighborhood as structuring element

Fig. 3
figure 3

TMGs computed using each intervoxel measure presented in Sect. 2.2.1 and the respective obtained watershed-based segmentation

Fig. 4
figure 4

Neighborhood anisotropy measures computed for the corpus callosum region

In the example shown in Fig. 2, it is possible to confirm that transitions between regions of low and high anisotropy are smooth in the FA map (Fig. 2(b)) and more abrupt in the VF map (Fig. 2(c)), due to its lack of sensitivity in less anisotropic regions. The corpus callosum, therefore, seems bigger in the FA map than in the VF map and different segmentations result (Figs. 2(j) and 2(k)). On the other hand, although the MG-FA from Fig. 2(f) is much more noisy outside the corpus callosum than the MG-VF from Fig. 2(g), the segmentation result did not suffer distortions due to this noise and looks more similar to the ground truth (Fig. 1(e)).

By using the same intervoxel measures from Sect. 2 in the TMG computation, it is possible to verify the adequacy of a scalar map to detect borders of the corpus callosum. The DP and the TDP, unable to detect differences between distinct tensors with the same orientation, were able to preserve important borders in the corpus callosum experiment, as shown in Figs. 3(k) and 3(b). The FN, which is not affine invariant, also provided a useful scalar map (Fig. 3(c)), when used to compute the TMG.

The gradients based on Jdiv (Fig. 3(a)), on TDP (Fig. 3(b)) and on Riem (Fig. 3(d)) presented smoother borders, and consequently the segmentation result provided by them are inferior (Figs. 3(f), 3(g) and 3(i)). The borders in the TMG using FN (Fig. 3(c)), LogE (Fig. 3(e)) and A dd (Fig. 3(l)) were sharper and provide better results in the application of the watershed technique. Although the TMG using DP (Fig. 3(k)) presents strong borders, its obtained segmentation was not good due to the discontinuity in its upper border.

The neighborhood intervoxel measures presented in Figs. 4(a) and 4(b) seem able to preserve the borders of corpus callosum, especially the A dd . However, its counterpart, presented in Figs. 3(k) and 3(l), accomplish the segmentation task with superiority when compared to the ground truth, thanks not only to the replacement of the mean by the maximum (see Eqs. 11 and 17, for example), but also due to the neighborhood being considered in their computation.

Figure 5(a) shows the principal eigenvector direction (PED) of the corpus callosum, with a detail selected on it. A better solution to help the scalar map CI to preserve borders would be to use a 3D neighborhood in its computation. But even when a 3D neighborhood (6-connected neighbors) is considered instead of a 2D neighborhood (8-connected neighbors), the CI result (Fig. 5(b)) is considerably inferior, when compared to the TMG (Fig. 5(c)). Similar conclusions can be drawn about A dd , since the neighbors used in its computation are, by definition, also from a single plane.

Fig. 5
figure 5

Comparison of the coherence index (CI) and the CI-TMG computed using the dot product as similarity measure. Both were obtained using a 6-connected three-dimensional neighborhood and are showed only in the detail marked by the rectangle in the original image

Not all scalar maps presented similar segmentation performance for all 15 subjects. For some subjects, the TMGs were able to segment the corpus callosum as a single connected component (Figs. 6(a) and 6(b)), even for a small number of regions (n=25). For other subjects, in which the corpus callosum presented a thinner profile, the TMGs were not able to distinguish it from the background (Figs. 6(c) and Fig. 6(d)), even for a large number of regions (n=100). Some morphological gradients, MG-sRA and MG-VF, were not able to segment only one connected component representing the corpus callosum for some subjects, no matter the value assigned for n (Fig. 7). Other scalar maps, such as the MG-FA, the A dd and the TMG-A dd (max), were able to segment all datasets, depending only on a good choice of the number n of regions (Fig. 8).

Fig. 6
figure 6

Watershed segmentation results based on the TMG-FN for two subjects. For the first subject, where the watershed was able to segment the corpus callosum, the chosen number of regions n was 25. For the second subject, even with n=100, the watershed was not able to segment the corpus callosum

Fig. 7
figure 7

Segmentation using MG-VF computed for the corpus callosum region, varying the number of regions (10, 20, 50 and 100)

Fig. 8
figure 8

Segmentation using MG-FA computed for the corpus callosum region, varying the number of regions (10, 20 and 50)

The summary of segmentation results can be seen in Fig. 9. While the MG-FA, A dd e TMG-A dd (max) were able to segment all 15 subjects, the MG-sRA and the MG-VF succeeded for 14 subject and failed to segment the corpus callosum as one connected component for only 1 subject. The TMG-FN and the TMG-LogE failed to detect all the corpus callosum borders for 9 and 11 subjects, respectively.

Fig. 9
figure 9

Number of successful and unsuccessful segmentations within the dataset using several scalar measures: correctly segmented (green); the corpus callosum as 2 connected components (yellow) and missing borders of the corpus callosum (red) (Color figure online)

The scalar maps which were able to segment the corpus callosum as one connected component for all fifteen sets of real data had their segmentation results quantitatively assessed. The minimum, maximum and mean overlap of the manual segmentation versus the standard were computed separately for each subject (Fig. 10). Also the overlap of the watershed segmentation using all three measures in each subject were calculated and plotted (Fig. 10).

Fig. 10
figure 10

Overlap of manual and automatic segmentation results compared to the gold standard for all 15 subjects

The segmentation obtained using the morphological gradient of FA (MG-FA) presented higher overlap and kappa coefficients, when compared to the other two measures, A dd and TMG-A dd (max). And although it was inferior to the overlap and kappa coefficients of the mean of the manual segmentation, it presented lower variation. The TMG-A dd (max) segmentation performed better then the A dd , when compared to the standard (Table 3).

Table 3 Mean overlap and kappa metrics with their respective standard deviations, computed for segmentation results of all subjects (manual segmentation, watershed segmentation using MG-FA, A dd and TMG-A dd (max))

The volumetric segmentation of the corpus callosum can also be visualized and compared to the manual segmentation. The segmentation result shown in Fig. 11 (red) was obtained using the MG-FA, which has shown the best quantitative results, and overlaid to the manual segmentation done by a specialist (green). Most of the voxels assigned as corpus callosum by the specialist are also identified as corpus callosum using the proposed method. As the volumetric boundaries of the corpus callosum are not well defined, as they are in the midsagittal slice, the watershed segmentation can go beyond the specialist segmentation because it is difficult, even for a specialist, to tell exactly where the corpus callosum ends.

Fig. 11
figure 11

Automatic (red) segmentation result obtained by the watershed transform and the MG-FA overlaid on manual (green) segmentation (Color figure online)

6 Discussion

DTI is an important MRI technique that allows the investigation and characterization of brain connectivity, and some DTI-based segmentation methods have been studied in the last decade, mostly analyzing white matter structures, due to its high anisotropy [3, 25, 29].

In previous work, a DTI-based segmentation method using the watershed transform and a tensorial morphological gradient was proposed and applied to segment brain structures [35]. Since a crucial step in the proposed method is the computation of the gradient, the present study analyzes the existing diffusion scalar maps as gradient candidates for the watershed method.

Several diffusion scalar maps have been proposed and used mostly in clinical studies, to infer microstructural characteristics of normal tissue and pathological changes in tissue microstructure [5]. This work is the first time that several diffusion scalar maps are analyzed in the context of watershed segmentation.

The visualization of intervoxel maps of the corpus callosum showed that, although it has a mean diffusivity and an anisotropy very similar to its neighbor (the cingulum bundle), all morphological gradients (MGs) computed based on the intervoxel DAIs preserved the borders. The watershed transform based on any of the computed morphological gradients, except the MG-MD, was able to segment the corpus callosum. It is separable because of a strip of artifactually low anisotropy between it and the cingulum bundle, due to partial volume averaging of fibres from these two pathways.

Since the diffusion tensors in the corpus callosum are strongly oriented and their orientation differs considerably from that of their neighbors, measures that take into account tensor orientation also performed well in detecting its borders.

Segmentation experiments using the watershed transform showed that the segmentation of the corpus callosum is possible using almost all discussed scalar maps. Nevertheless, when dealing with the variety of shapes and sizes contained within the dataset, only a few measures are able to recover the borders of the corpus callosum.

The scalar maps can be divided in three groups, in terms of the segmentation results obtained. The first group is composed by measures that are not able to segment successfully any data: MG-MD, TMG-DP, TMG-TDP and TMG-Riem. The second group is composed by measures that are not able to segment the whole dataset, but only a subset of it. These measures are the morphological gradients, MG-sRA and MG-VF and both tensorial morphological gradients, TMG-FN and TMG-LogE. They fail in segmenting thin profiles, where voxels from the corpus callosum and from the background are used simultaneously in the TMG computation of a single voxel. In such cases, these metrics would obtain a better segmentation if computed in interpolated images.

Comparing the TMGs with the DAIs and with the MGs, it is evident that the TMGs are richer in detail, since they take into account much more information. The DAIs are the most simple measures, taking into account only the eigenvalues of each voxel. The MGs are an intervoxel version of the intravoxel DAIs.

Despite being simpler, the MGs were the only ones to segment correctly the entire dataset. The neighborhood is also included in the computation of the tensorial morphological gradients (TMGs), but while the MGs are based on intravoxel measures (i.e., consider only the eigenvalues), the TMGs are based on intervoxel measures, and therefore include information from the eigenvectors in addition to the eigenvalues. Because the eigenvalues carry the most relevant difference between the corpus callosum and the background, measures that take into account the eigenvectors only dilute the important information and result in gradients with weaker borders.

Even though the best segmentation results were obtained by using the MGs, segmentation resulting from the watershed combined with the MG-sRA and the MG-VF were not satisfactory. Such scalar maps tend to make thin white matter structures even thinner, thanks to their decreased sensitivity in the high anisotropy range. Therefore, even using the external morphological gradient and a high number of regions n, the corpus callosum was not obtained as one single connected component for all subjects.

The comparison of the segmentation results obtained from the watershed with the gold standard constructed from the manual segmentation showed that the MG-FA was able to segment two thirds of the dataset as well as the specialists. When looking at the volumetric segmentation results, the MG-FA was also the scalar map that performed best. But since the majority of the voxels considered as corpus callosum by the specialist do not present a significant change in the direction of the fibers, the result could be enhanced if the fractional anisotropy map were weighted by the component of the main eigenvector in the most common direction observed. The choice of markers to represent specifically the corpus callosum and the background, not the local minima of the image, could help to reduce the leak of the watershed regions and to perform the segmentation automatically.

7 Conclusions

In this work, tensor-derived maps, usually used in quantitative analysis of DTI, were studied in the context of segmentation. Existing intravoxel and intervoxel maps were discussed, and new scalar maps were proposed, based on concepts of mathematical morphology. All scalar maps were used together with the watershed transform to segment the corpus callosum. The MG-FA, the A dd and TMG-A dd (max) preserve relevant information from diffusion tensor images useful for the segmentation task, and therefore are able to segment successfully the corpus callosum for all 15 subjects of the dataset. Other studied scalar maps, such the tensorial morphological gradients TMG-FN and TMG-LogE and the morphological gradients MG-VF and MG-sRA), have some difficulty to detect thin structures and, therefore, segmented the corpus callosum as a single connected component only for some subjects. The remaining scalar maps were not able to successfully segment the corpus callosum for any subject.