Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

Diffusion magnetic resonance imaging (dMRI) has been used to noninvasively reconstruct fiber tracts by imaging the anisotropy of water diffusion in tissue [11]. A major topic in dMRI is the estimation of fiber orientations (FOs) , especially in situations where fibers cross. For example, constrained spherical deconvolution [17], q-ball reconstruction [6], multi-tensor models [3, 10, 20], and spherical ridgelet models [13] have been proposed to estimate crossing FOs.

Successful resolution of crossing FOs may require a large number of diffusion gradient directions, which takes a long acquisition time and limits the use in clinical practice [4]. To reduce the required number of gradient directions, with the assumption that the number of FOs in a voxel is small, methods have been proposed to model the diffusion signals using a basis and solve the FO estimation with sparsity regularization [5, 10, 12, 13, 15, 20]. For example, the basis can be diffusion tensors [5, 10, 15, 20], spherical ridgelets [13], or spherical polar Fourier basis [12].

Besides sparsity assumption, it is also important to consider spatial coherence of FOs to reduce the effect of noise and improve FO estimation. For example, in [2] the diffusion weighted images (DWIs) are smoothed before FO estimation whereas in [16] smoothing of FOs is performed after FO estimation. Several works have placed spatial regularization of tensors on the multi-tensor model to estimate FOs, but the sparsity assumption was not used [7, 14]. There are also methods that have combined spatial continuity with sparsity and seek to simultaneously estimate and smooth FOs. In [13], the TV-norm of DWIs is incorporated in the objective function for FO estimation. In [15] and [20], spatial consistency of FOs is enforced by adding the smoothness of the mixture fraction of each basis tensor as regularization terms in the FO estimation. However, in [15] and [20], the FO coherence is ensured in a relatively indirect way in the sense that directional information in FOs is not explicitly modeled in the objective functions. FO estimation incorporating both sparsity and spatial coherence is still an open problem.

In this work, we propose a method of FO estimation using both sparsity assumption and neighborhood information. The diffusion signal is modeled by a fixed tensor basis. In contrast to previous works, we directly incorporate the directional information in the neighborhood into the objective function to encode spatial coherence. Spatial coherence and sparsity are enforced in weighted 1-norm regularization terms, which contain the interaction of directional information between neighbor voxels. Data fidelity is ensured by the agreement between raw and reconstructed diffusion weighted signals. The resulting objective function is solved using a block coordinate descent algorithm. Experiments were performed on a digital crossing phantom, ex vivo tongue dMRI data, and in vivo brain dMRI data for evaluation.

2 Methods

2.1 Background: Multi-Tensor Model with a Fixed Tensor Basis

As discussed in [8], the diffusion weighted signal at each voxel can be modeled using a unified framework,

$$\displaystyle\begin{array}{rcl} S({\boldsymbol q}) = S_{0}\int \limits _{\mathcal{M}}f(x)R({\boldsymbol q},x)\mathrm{d}x,& &{}\end{array}$$
(1)

where \(\mathcal{M}\) is a smooth manifold, x is a point on \(\mathcal{M}\), \(S({\boldsymbol q})\) is the diffusion weighted signal with the diffusion gradient \({\boldsymbol q}\), S 0 is the signal without diffusion weighting, \(R({\boldsymbol q},x)\) is a kernel function, and f(x) is a probability density function. As in [10] and [20], we use a fixed tensor basis to represent diffusion signals, which has the advantage of explicit relationship between the basis and FOs. In this work, the basis comprises N = 289 prolate tensors D i whose primary eigenvectors (PEVs) \({\boldsymbol v}_{i}\) are approximately evenly oriented over the sphere. Each D i represents an FO given by its PEV \({\boldsymbol v}_{i}\). The eigenvalues (\(\lambda _{1} \geq \lambda _{2} \geq \lambda _{3}> 0\)) determine the shape of the basis tensor, and they are determined by examining the diffusion tensors of a noncrossing fiber tract [10].

With this tensor basis, we have \(\mathcal{M} = \mathcal{S}^{2}\) (a unit sphere), \(x ={\boldsymbol v}\) (a unit vector), \(f({\boldsymbol v}) = f_{i}\delta ({\boldsymbol v};{\boldsymbol v}_{i})\), and \(R({\boldsymbol q},{\boldsymbol v}_{i}) = e^{-{\boldsymbol q}^{T}\mathbf{D}_{ i}{\boldsymbol q}}\) [8]. If we normalize the diffusion gradient as \(\tilde{{\boldsymbol q}} ={\boldsymbol q}/\vert {\boldsymbol q}\vert\), then \(\tilde{{\boldsymbol q}}\) is associated with a constant b determined by the imaging sequence. Then, taking noise \(n({\boldsymbol q})\) into account, (1) becomes [10]

$$\displaystyle\begin{array}{rcl} S({\boldsymbol q}) = S_{0}\sum \limits _{i=1}^{N}f_{ i}e^{-b\tilde{{\boldsymbol q}}^{T}\mathbf{D}_{ i}\tilde{{\boldsymbol q}}} + n({\boldsymbol q}),& &{}\end{array}$$
(2)

where f i (\(\sum _{i}f_{i} = 1\)) is the unknown nonnegative mixture fraction (MF) for D i .

By defining \(y({\boldsymbol q}) = S({\boldsymbol q})/S_{0}\) and \(\eta ({\boldsymbol q}) = n({\boldsymbol q})/S_{0}\), (2) can be written as

$$\displaystyle\begin{array}{rcl} {\boldsymbol y} = \mathbf{G}{\boldsymbol f}+\boldsymbol{\eta },& &{}\end{array}$$
(3)

where \({\boldsymbol y} = (y({\boldsymbol q}_{1}),y({\boldsymbol q}_{2}),\ldots,y({\boldsymbol q}_{K}))^{T}\) (K is the number of DWIs), G is a K × N matrix comprising the attenuation terms \(G_{ki} = e^{-b_{k}\tilde{{\boldsymbol q}}_{k}^{T}\mathbf{D}_{ i}\tilde{{\boldsymbol q}}_{k}}\), \({\boldsymbol f} = (f_{1},f_{2},\ldots,f_{N})^{T}\), and \({\boldsymbol \eta }= (\eta ({\boldsymbol q}_{1}),\eta ({\boldsymbol q}_{2}),\ldots,\eta ({\boldsymbol q}_{K}))^{T}\). Because the number of FOs in each voxel is small, it makes sense to estimate the MFs using sparse reconstruction:

$$\displaystyle\begin{array}{rcl} \hat{{\boldsymbol f}} =\mathop{ \mathrm{arg\,min}}\limits \limits _{{\boldsymbol f}\geq {\boldsymbol 0},\vert \vert {\boldsymbol f}\vert \vert _{1}=1}\vert \vert \mathbf{G}{\boldsymbol f} -{\boldsymbol y}\vert \vert _{2}^{2} +\beta \vert \vert {\boldsymbol f}\vert \vert _{ 0}.& &{}\end{array}$$
(4)

By relaxing the constraint of \(\sum _{i=1}^{N}f_{i} = 1\) and replacing the 0-norm with the 1-norm [10, 20], we have

$$\displaystyle\begin{array}{rcl} \hat{{\boldsymbol f}} =\mathop{ \mathrm{arg\,min}}\limits \limits _{{\boldsymbol f}\geq {\boldsymbol 0}}\vert \vert \mathbf{G}{\boldsymbol f} -{\boldsymbol y}\vert \vert _{2}^{2} +\beta \vert \vert {\boldsymbol f}\vert \vert _{ 1},& &{}\end{array}$$
(5)

Then, the estimated \({\boldsymbol f}\) is projected onto the unit sphere \(\vert \vert {\boldsymbol f}\vert \vert _{1} = 1\) by normalization [10]. Basis directions with nonzero MFs are interpreted as estimated FOs. Thus, we will use FO estimation and MF estimation interchangeably.

2.2 FO Estimation Using Neighborhood Information

Because of image noise, it is important to incorporate spatial coherence of FOs to improve FO estimation [13]. An intuitive way of incorporating neighborhood information can be based on the smoothness of MFs [15, 20]. But establishing smooth MFs does not mean that the FO angles are smooth. For example, let \({\boldsymbol f}_{\mathrm{a}} = (1,0,\ldots,0)^{T}\), \({\boldsymbol f}_{\mathrm{b}} = (0,1,0,\ldots,0)^{T}\), and \({\boldsymbol f}_{\mathrm{c}} = (0,0,1,0,\ldots,0)^{T}\). The difference between \({\boldsymbol f}_{\mathrm{a}}\) and \({\boldsymbol f}_{\mathrm{b}}\) (\(\vert \vert {\boldsymbol f}_{\mathrm{a}} -{\boldsymbol f}_{\mathrm{b}}\vert \vert\)) is the same as that between \({\boldsymbol f}_{\mathrm{a}}\) and \({\boldsymbol f}_{\mathrm{c}}\) (\(\vert \vert {\boldsymbol f}_{\mathrm{a}} -{\boldsymbol f}_{\mathrm{c}}\vert \vert\)), while the desired difference is clearly related to the basis directions represented by the nonzero entries in the MFs. In this work, we seek to explicitly incorporate the directional information from neighbor voxels into FO estimation.

FO Estimation with Known Neighborhood Information

First we consider a simplified case of estimating FOs in a single voxel with known neighbor information. Let the MFs at voxel m be \({\boldsymbol f}_{m}\). A voxel n is in the neighborhood N m of m and has FOs \(\{{\boldsymbol v}_{n,j}\}_{j=1}^{V _{n}}\), where V n is the number of FOs at n. Suppose \(\{{\boldsymbol v}_{n,j}\}_{j=1}^{V _{n}}\) were known, and we want to estimate \({\boldsymbol f}_{m}\) given the neighbor FOs.

We assume that a majority of neighbor voxels n ∈ N m have similar FO patterns as the desired one at m. Then, a set of likely FO \(\{{\boldsymbol u}_{m,p}\}_{p=1}^{U_{m}}\) for m can be obtained from the neighbor voxels (details will be introduced later), where U m is the number of likely FOs at m. Motivated by Ye et al. [18], where fixed pre-determined prior directions at each voxel are encoded in the sparse reconstruction of FOs from dMRI, a weighted 1-norm regularized least squares problem derived from a Bayesian perspective can be solved to encode the information of likely FOs:

$$\displaystyle\begin{array}{rcl} \hat{{\boldsymbol f}}_{m} =\mathop{ \mathrm{arg\,min}}\limits \limits _{{\boldsymbol f}_{m}\geq {\boldsymbol 0}}\vert \vert \mathbf{G}{\boldsymbol f}_{m} -{\boldsymbol y}_{m}\vert \vert _{2}^{2} +\beta \vert \vert \mathbf{C}_{ m}{\boldsymbol f}_{m}\vert \vert _{1}.& &{}\end{array}$$
(6)

Here, C m is a diagonal weighting matrix encoding likely FOs. The basis directions closer to likely FOs are weighted less in the weighted 1-norm, and thus they are less penalized in the objective function. We set the diagonal entries as

$$\displaystyle\begin{array}{rcl} C_{m:ii} = \left (1 -\alpha \max \limits _{p}\vert {\boldsymbol v}_{i} \cdot {\boldsymbol u}_{m,p}\vert \right )\Big/\min \limits _{q}\left (1 -\alpha \max \limits _{p}\vert {\boldsymbol v}_{q} \cdot {\boldsymbol u}_{m,p}\vert \right ),& &{}\end{array}$$
(7)

where α ∈ [0, 1) is a constant. Since \({\boldsymbol v}_{i}\) and \({\boldsymbol u}_{m,p}\) are unit vectors, \(0 \leq \vert {\boldsymbol v}_{i} \cdot {\boldsymbol u}_{m,p}\vert \leq 1\) and C m: ii is positive. In this way, the penalty for basis directions that are closest to likely FOs is the same as that when no information on likely FOs is used.

Note that our application of FO estimation with spatial coherence is fundamentally different than [18] in two aspects: (1) likely FOs are computed based on neighbor FOs while in [18] an anatomical atlas is needed to provide prior directions specified at each voxel; (2) since the FOs in the neighbors are also to be estimated, voxelwise FO estimation in (6) is inappropriate. The proposed approach to likely FO computation and FO estimation is introduced below.

Likely FO Computation

At voxel m we first consider one neighbor voxel n. Between each basis direction \({\boldsymbol v}_{i}\) and the neighbor n, we define a basis-neighbor similarity \(r_{m}(i,n) = w_{m,n}\max _{j}\vert {\boldsymbol v}_{i} \cdot {\boldsymbol v}_{n,j}\vert\). Here, w m, n is a weighting coefficient, which represents the similarity between m and n. It is defined as \(w_{m,n} = e^{-\mu d^{2}(\mathbf{D}_{ m},\mathbf{D}_{n})}\) (μ is a constant). D m and D n are the diffusion tensors fitted from DWIs at m and n, respectively, and \(d(\mathbf{D}_{m},\mathbf{D}_{n})\) is a distance metric for tensors D m and D n : \(d(\mathbf{D}_{m},\mathbf{D}_{n}) = \sqrt{\mathrm{Trace }(\{\log (\mathbf{D} _{m } ) -\log (\mathbf{D} _{n } )\}^{2 } )}\) [1]. For each \({\boldsymbol v}_{i}\), the \(\max\) function in r m (i, n) measures the similarity between \({\boldsymbol v}_{i}\) and its closest FO in n, and this similarity is further weighted by the voxel similarity w m, n . In this way, given one neighbor n, we can measure how similar \({\boldsymbol v}_{i}\) is to the FOs at n.

To consider all neighbor voxels, an aggregate basis-neighbor similarity for \({\boldsymbol v}_{i}\) at voxel m is defined as \(R_{m}(i) =\sum _{n\in N_{m}}r_{m}(i,n)\). We can extract likely FOs for m by finding the basis directions with local maximal R m values:

$$\displaystyle\begin{array}{rcl} \{{\boldsymbol u}_{m,p}\}_{p=1}^{U_{m} } =\{{\boldsymbol v}_{i}\vert R_{m}(i) \geq R_{m}(j),\forall \:j\neq i:\arccos (\vert {\boldsymbol v}_{i} \cdot {\boldsymbol v}_{j}\vert ) \leq \frac{\theta \pi } {180^{\circ }}\}.& &{}\end{array}$$
(8)

Here, \(\theta\) is a threshold and we empirically choose \(\theta = 20^{\circ }\).

FO Estimation for All Voxels

With the likely FOs, the weighting matrix can then be obtained. Note that we have assumed known neighbor information to obtain (6). However, the FOs in the neighbors are also to be estimated. Thus the FOs in all the voxels should be estimated simultaneously. For a total number of M voxels of interest, where the MFs \({\boldsymbol f} = ({\boldsymbol f}_{1}^{T},{\boldsymbol f}_{2}^{T},\ldots,{\boldsymbol f}_{M}^{T})^{T}\) (and thus the FOs) are unknown, the FO estimation can be achieved as

$$\displaystyle\begin{array}{rcl} \hat{{\boldsymbol f}} =\mathop{ \mathrm{arg\,min}}\limits \limits _{{\boldsymbol f}\geq {\boldsymbol 0}}E({\boldsymbol f}) =\mathop{ \mathrm{arg\,min}}\limits \limits _{{\boldsymbol f}_{1},\,{\boldsymbol f}_{2},\ldots,\,{\boldsymbol f}_{M}\geq {\boldsymbol 0}}\sum \limits _{m=1}^{M}\vert \vert \mathbf{G}{\boldsymbol f}_{ m} -{\boldsymbol y}_{m}\vert \vert _{2}^{2} +\beta \vert \vert \mathbf{C}_{ m}{\boldsymbol f}_{m}\vert \vert _{1}.& &{}\end{array}$$
(9)

Note that C m contains the interaction between neighbors and it is also dependent on α. Greater α leads to more influence from neighbors. β controls the sparsity. In this work, α and β were chosen empirically.

2.3 Minimization of the Objective Function

Because each voxel m is coupled with its neighbors in C m in (9), we use a block coordinate descent (BCD) optimization strategy. At iteration k + 1,

$$\displaystyle\begin{array}{rcl} \hat{{\boldsymbol f}}_{m}^{k+1}& =& \mathop{\mathrm{arg\,min}}\limits \limits _{{\boldsymbol f}_{m}\geq {\boldsymbol 0}}E(\hat{{\boldsymbol f}}_{1}^{k+1},\ldots,\hat{{\boldsymbol f}}_{ m-1}^{k+1},{\boldsymbol f}_{ m},\hat{{\boldsymbol f}}_{m+1}^{k},\ldots,\hat{{\boldsymbol f}}_{ M}^{k}) \\ & =& \mathop{\mathrm{arg\,min}}\limits \limits _{{\boldsymbol f}_{m}\geq {\boldsymbol 0}}\vert \vert \mathbf{G}{\boldsymbol f}_{m} -{\boldsymbol y}_{m}\vert \vert _{2}^{2} +\beta \vert \vert \mathbf{C}_{ m}^{k+1}{\boldsymbol f}_{ m}\vert \vert _{1}. {}\end{array}$$
(10)

Using (8), the likely FOs \({\boldsymbol u}_{m,p}^{k+1}\) are obtained by finding the local maxima from \(R_{m}^{k+1}(i) =\sum _{n\in N_{m}}w_{m,n}\max _{j}\vert {\boldsymbol v}_{i} \cdot {\boldsymbol v}_{n,j}^{k+\mathbf{1}_{n<m}}\vert\), where \(\mathbf{1}\) is an indicator function. Then, \(\mathbf{C}_{m}^{k+1}\) can be determined with \({\boldsymbol u}_{m,p}^{k+1}\) using (7).

To solve (10), we define \({\boldsymbol g}_{m}^{k+1} = \mathbf{C}_{m}^{k+1}{\boldsymbol f}_{m}\). Since \(\mathbf{C}_{m}^{k+1}\) is diagonal and \(C_{m:ii}^{k+1}> 0\), \(\mathbf{C}_{m}^{k+1}\) is invertible and \({\boldsymbol f}_{m} = (\mathbf{C}_{m}^{k+1})^{-1}{\boldsymbol g}_{m}^{k+1}\). By defining \(\tilde{\mathbf{G}}_{m}^{k+1} = \mathbf{G}(\mathbf{C}_{m}^{k+1})^{-1}\),

$$\displaystyle\begin{array}{rcl} \hat{{\boldsymbol g}}_{m}^{k+1} =\mathop{ \mathrm{arg\,min}}\limits \limits _{{\boldsymbol g}_{m}^{k+1}\geq {\boldsymbol 0}}\vert \vert \tilde{\mathbf{G}}_{m}^{k+1}{\boldsymbol g}_{ m}^{k+1} -{\boldsymbol y}_{ m}\vert \vert _{2}^{2} +\beta \vert \vert {\boldsymbol g}_{ m}^{k+1}\vert \vert _{ 1}.& &{}\end{array}$$
(11)

We find \(\hat{{\boldsymbol g}}_{m}^{k+1}\) using the method in [9] and the MFs are estimated as

$$\displaystyle\begin{array}{rcl} \hat{{\boldsymbol f}}_{m}^{k+1} = (\mathbf{C}_{ m}^{k+1})^{-1}\hat{{\boldsymbol g}}_{ m}^{k+1}.& &{}\end{array}$$
(12)

Then, we project \(\hat{{\boldsymbol f}}_{m}^{k+1}\) back onto the unit sphere by normalization: \(\tilde{f}_{m,i}^{k+1} =\hat{ f}_{m,i}^{k+1}/\sum _{j}\hat{f}_{m,j}^{k+1}\), and the FOs at m at iteration k + 1 are the basis directions with \(\tilde{f}_{m,i}^{k+1}> t\) (t = 0. 1 in this work), because FOs with small MFs are interpreted as components of isotropic diffusion [10, 18].

The FOs are initialized by Landman et al. [10]. Iterative update is terminated if the FO difference between successive iterations is small or the maximum iteration is reached.

3 Experiments

3.1 3D Digital Crossing Phantom

A 3D digital crossing phantom was generated to simulate two tracts crossing at 90, where one b0 image and 30 gradient directions (\(b = 700\,\mathrm{s}/\mathrm{mm}^{2}\)) were used. Rician noise (\(\sigma /S_{0}\)=0.05) was added to the DWIs.

The proposed method with (α, β, μ) = (0. 4, 0. 5, 1. 0) was compared with BEDPOSTX [3] and CFARI [10], which are commonly used with around 30 gradient directions. The results are overlaid on the fractional anisotropy (FA) map in Fig. 1. Compared with BEDPOSTX and CFARI, the proposed method produces smoother FOs and better identifies crossing FOs. We then compared the results quantitatively by using the two error measures proposed in [18], where the first measure (e 1) represents how close each estimated FO is to its ground truth FO, and the second one (e 2) measures how accurately each ground truth FO is represented. The results are listed in Table 1. The proposed method estimates FOs more accurately in both noncrossing and crossing regions.

Fig. 1
figure 1

FO estimation overlaid on the FA map of the 3D crossing phantom

Table 1 The mean errors of FO estimation (with standard deviations in parentheses)

3.2 Real Data

Ex Vivo Tongue dMRI

Nine b0 images and 64 gradient directions (\(b = 2000\,\mathrm{s}/\mathrm{mm}^{2}\)) were acquired on a 3T MRI scanner (Magnetom Trio, Siemens, Erlangen, Germany). The resolution is 2 mm isotropic.

The proposed method with (α, β, μ) = (0. 4, 0. 5, 3. 0) was compared with q-ball imaging (QBI) using spherical harmonics based transformation [6] and CFARI [10]. The eigenvalues of the basis tensors are \(\lambda _{1} = 7.0 \times 10^{-4}\,\mathrm{mm}^{2}/\mathrm{s}\) and \(\lambda _{2} =\lambda _{3} = 3.0 \times 10^{-4}\,\mathrm{mm}^{2}/\mathrm{s}\). We focus on the crossing region of the genioglossus (GG) and transverse (T) muscle in Fig. 2, where the proposed method better estimates the crossing FOs and produces smoother FOs than QBI and CFARI.

Fig. 2
figure 2

FO estimation on the ex vivo tongue in the coronal view, which is focused on the crossing of the GG and T muscle (the tongue area is indicated by the white mask). A high resolution structural image (left) is shown for location reference

In Vivo Brain dMRI

Two b-values (\(b = 1000\,\mathrm{s}/\mathrm{mm}^{2}\) and 2000 s∕mm2) were used in the acquisition. Each b-value is associated with 30 gradient directions and each DWI has two repeated scans. Twelve b0 images were also acquired. The images were acquired on a 3T MRI scanner (Magnetom Trio, Siemens, Erlangen, Germany). The resolution is 2.7 mm isotropic.

The proposed method with (α, β, μ) = (0. 4, 0. 5, 1. 0) was compared with CFARI [10] and generalized q-sampling imaging (GQI) [19]. GQI is a generalization of QBI [6] and can reconstruct FOs from multi-shell (multiple b-values) dMRI. The eigenvalues of the basis tensors are \(\lambda _{1} = 2.0 \times 10^{-3}\,\mathrm{mm}^{2}/\mathrm{s}\) and \(\lambda _{2} =\lambda _{3} = 5.0 \times 10^{-4}\,\mathrm{mm}^{2}/\mathrm{s}\). We highlight two regions for evaluation: the crossing region of the corpus callosum (CC) and the corticospinal tract (CST) and the crossing region of CC and the superior longitudinal fasciculus (SLF), which are shown in Fig. 3. It can be seen that the proposed method is able to better reconstruct the crossing FOs and has smoother results than CFARI and GQI.

Fig. 3
figure 3

FO estimation on brain dMRI (overlaid on the FA map), which is focused on the crossing of (a) CC and CST (coronal view) and (b) CC and SLF (axial view)

4 Conclusion

We have proposed an FO estimation algorithm using neighborhood information. The diffusion signal is modeled by a tensor basis. Directional information in neighbors is modeled in weighted 1-norm regularization terms to ensure spatial coherence. FO estimation is achieved using a BCD strategy. The proposed method was applied to simulated and real dMRI data. The results indicate that the proposed method improves FO estimation by using neighborhood information.