Keywords

1 Introduction

Diffusion tensor imaging enables the reconstruction of information revealing the shape, the coherence and the integrity of brain tissue microstructure which can be indirectly analyzed through the assessment of motion of water molecules [1]. This technique has been applied widely in vivo analysis of white matter architecture and has recently been applied to the study of gray matter. Moreover, diffusion tensor imaging has been used to study a large range of neurological disease, or to quantify other causes of tissue degradations such as epilepsy and malformations of cortical development [2]. It has been shown that at a voxel resolution of around 2–3 mm, a simple tensor model used to track the major white matter pathways in the human brain [3]. Despite its interesting properties, diffusion tensor imaging is an inherently low signal-to-noise ratio (SNR) technique and yields to relatively poor spatial resolution [4]. Besides, the tensor model fails to accurately track through regions with more complex fiber arrangements such as crossing, fanning and branching.

It has been shown that the limited spatial and angular resolution introduces a bias in diffusion parameter estimation [4]. Therefore, to improve the sensitivity and robustness of studies based on diffusion tensor imaging, high spatial and angular resolution (HSAR) diffusion MRI with high SNR has to be considered. Such HSAR diffusion MRI with high SNR could provide a better sensitivity for the brain microstructure. However the acquisition of such HSAR diffusion MRI remains a challenging problem in clinical conditions since the improvement in HSAR is obtained at the cost of either lower SNR, longer acquisition time or both [5]. For example, to resolve the crossing fiber direction one can apply more sophisticated local models like the DSI or HARDI, a low spatial resolution of about 3 mm is used to achieve sufficient SNR to enable the acquisition of a high number of diffusion directions and multiple b-values, thereby resolving crossing fiber directions [6]. However, this improvement of high angular resolution with high SNR is obtained at the cost of the longer acquisition time and the spatial resolution.

To enable acquisition of high spatial diffusion MRI without long acquisition times, super-resolution (SR) acquisition techniques have been investigated in the past. Some possible strategies consist in fusing several anisotropic acquisitions with a high in-plane resolution only along one axis [710]. In contrast to SR acquisition techniques that require specific acquisitions protocols of multiple LR images, there exists also a category of single image SR methods [4]. Since single image SR techniques are pure post-processing methods and thus are totally independent from the acquisition protocol. The main idea is to use the image content to reconstruct information at higher-resolution. Besides, another SR method for diffusion MRI has been proposed. The main idea is to use an HR image to drive the reconstruction of another modality (e.g. using \(\mathbf {B_0}\) image) [4]. However, this type of method is built on assumption that the two modalities have the similar image structures. Another problem in these methods is that few of them have considered the angular resolution. Only increasing the spatial resolution cannot break through the limitations of the tensor model. Besides, the ignorance of angular resolution also means the ignorance of the relationship between the diffusion images of different directions.

In this paper, we investigate the possibility to increase both the spatial and the angular resolution of diffusion imaging using single image SR method and compressed sensing. We proposed a unified approach to reconstruct HSAR image using the single dMRI set which are undersampled in k-space and q-space. Then we acquire the corresponding probabilistic orientation function (ODF) through the HSAR dMRI. To quantitatively assess the performance of the proposed method, we use the simulated dMRI. Some other super spatial dMRI methods are also evaluated for these data for comparing the high spatial reconstruction performance. We should note that, to the best of our knowledge, this is a first instance of using the single diffusion dMRI set to reconstruct HSAR dMRI.

The contributions of this work are twofold.

  1. (1)

    The proposition of a new SSR-CS method to reconstruct the HSAR dMRI.

  2. (2)

    The introduction of an efficient optimization algorithm for solving the SSR-CS model.

The remainder of the paper is organized as follows. Section 2 briefly reviews the background of the approach. Section 3 presents the proposed SSR-CS model. The experimental results are demonstrated in Sect. 4 in comparing with the state-of-the-art SR methods. Section 5 concludes the paper.

2 Background

In this section, we provide a brief background on diffusion tensor imaging and spherical ridgelets which will be used subsequently in our proposed SSR-CS algorithm.

2.1 3D Diffusion MRI

In diffusion MRI, measurements are commonly made using the pulsed gradient spin echo (PGSE) method, which samples the Fourier transform of the ensemble average diffusion propagator (EAP) \(P(\mathbf {r})\), two magnetic field gradient pulses of duration \(\delta \) and separation \(\varDelta \) are introduced to the simple spin-echo sequence [11, 12]. Assuming rectangular pulse profiles, the associated diffusion direction is \(\mathbf {q}=\gamma \delta \mathbf {g}\), where \(\mathbf {g}\) is the component of the gradient in the direction of the fixed field \({\mathbf {B}}_{0}\) and \(\gamma \) is the gyromagnetic ratio. The diffusion MRI measurement \(S(\mathbf {q};\mathbf {r})\)is defined at each location \(\mathbf {r}\) of a finite regular image grid in three-dimensional space and depends on the gradient direction \(\mathbf {q}\). For each vector \({\mathbf {q}_{k}}\), the diffusion MRI measurement \(S(\mathbf {q}_{k})\) is a 3D DWI. The normalized \({S^{*}(\mathbf {q}_{k})}\) is given by

$$\begin{aligned} S^{*}(\mathbf {q}_{k})=(S(\mathbf {0}))^{-1}S(\mathbf {q}_{k}) \end{aligned}$$
(1)

\(S(\mathbf {0})\)denotes the diffusion signal obtained in the absence of diffusion encoding (i.e.,the so-called “\(\mathbf {B}_{0}\)-image”). when the \(\varDelta ^{-1}\delta \) is negligible,

$$\begin{aligned} \ S^{*}(\mathbf {q}_{k})=\int _{\mathbf {r}\in {\mathbb {R}^{3}}}P(\mathbf {r})exp(i2\pi \mathbf {q}_{k}\cdot \mathbf {r})d\mathbf {r} \end{aligned}$$
(2)

2.2 Spherical Ridgelets

Spherical ridgelets are constructed by following the fundamental principles of wavelet theory [13, 14]. Specially let \(x\in \mathbb {R}_{+}\) and \(\rho \in (0,1)\) be a positive scaling parameter. Further, let \(\kappa (x)=exp\{-\rho x(x+1)\}\) be a Gaussian function, which we subject to a range of dyadic scaling which result in

$$\begin{aligned} \kappa (x)=\kappa (2^{-j}x)=exp\{{-\rho {\frac{x}{2^{j}}}(\frac{x}{2^{j}}+1)}\} \end{aligned}$$
(3)

with \(j\in \mathbb {N}:=\{0,1,2,...\}\).

The semi-discrete frame \(\mathbb {U}\) of spherical ridgelets can be defined as

$$\begin{aligned} \mathbb {U}:=\{\psi _{j,v}|v\in \mathbb {S}^2,j=-1,0,1,2,...\} \end{aligned}$$
(4)

where the spherical ridgelet functions \(\psi _{j,v}\) at resolution \(j\in \mathbb {N}\) and orientation \(v\in {\mathbb {S}}^{2}\) is

$$\begin{aligned} \psi _{j,v}=\frac{1}{2\pi }\sum \limits _{n=0}^{\infty }\frac{2n+1}{4\pi }\lambda _n(\kappa _{j+1}(n)-\kappa _{j}(n))P_n(\mathbf {u}\cdot \mathbf {v}), \forall \mathbf {u}\in \mathbb {S}^{2} \end{aligned}$$
(5)

where \(P_{n}\) denotes the Legendre polynomial of order n and \(\kappa _{-1}\)(n)=0, \(\forall \) \(n\)

$$\begin{aligned} \lambda _n=\left\{ \begin{array}{cc} 2\pi (-1)^{\frac{n}{2}\cdot \frac{1\cdot 3\cdot \cdot \cdot (n-1)}{2\cdot 4\cdot \cdot \cdot n}}&{}\text {if}\;n\;\text {is}\;\text {even}\\ 0 &{}\text {if}\;n\;\text {is}\;\text {odd}\\ \end{array} \right. \end{aligned}$$
(6)

when the \(n=0\), \(\lambda _n=2\pi \).

3 Proposed Model

3.1 3D Non-local Similarity Regularization

To improve the spatial resolution of the diffusion MR imaging, we adopt a 3D NLMs filter in [4, 15] to capture the nonlocal similarity in each diffusion MRI. Different from the method in [4], We capture the 3D nonlocal similarity of the normalized diffusion MRI which directly determine the diffusion of water rather than the diffusion MRI. Based upon the philosophy of the NLMs, each target voxel \(S(\mathbf {q}_{k};\mathbf {r}_{i})\) in the reconstructed high spatial resolution diffusion MRI of the kth direction can be represented as the weighted average of the voxels within its similarity neighborhoods, i.e.,

$$\begin{aligned} {\hat{S}^{*}(\mathbf {q}_{k};\mathbf {r}_{i})}=\sum \limits _{j\in V_i} w_{ij}^{k}{S^{*}(\mathbf {q}_{k};\mathbf {r}_{j})} \end{aligned}$$
(7)

where \({\hat{S}^{*}(\mathbf {q}_{k};\mathbf {r}_{i})}\) is the current estimation of \({S^{*}(\mathbf {q}_{k};\mathbf {r}_{i}})\), \(w_{ij}^{k}\) is the NLMs weights. For the kth 3D diffusion MRI, we could rewrite the Eq. (7) in a brief

$$\begin{aligned} {\hat{S}^{*}_{i}}=\sum \limits _{j\in V_i} w_{ij}{S^{*}_{j}} \end{aligned}$$
(8)

\(w_{ij}\) is defined as

$$\begin{aligned} w_{ij}=\frac{1}{Z_i}exp\{{-\frac{\Vert N_{3D}({S^{*}_{i}})-N_{3D}({S^{*}_{j}})\Vert ^2_2}{h^2}}\} \end{aligned}$$
(9)

where the constant \(Z_{i}\) ensures the sum of the weights is equal to 1. The \(N_{3D}({S^{*}_{i}})\) and \(N_{3D}({S^{*}_{j}})\) represents the 3D image patches around \(S^{*}_{i}\) and \(S^{*}_{j}\), in practical we use \(3\times 3\times 3\) voxels 3D windows. The similarity between patches is estimated within a restricted nonlocal volume \(V_i\). With the nonlocal similarity in diffusion MRI, we require the estimation error if the weighted average and the upsampled image X as small as possible. Thus for each 3D diffusion MRI, the corresponding regularization term can be written as:

$$\begin{aligned} E_{nlms}(S^{*})=\sum \limits _{i\in \varOmega }\parallel S^{*}_i-W_iS^{*}\parallel ^2_2 \end{aligned}$$
(10)

where \(\varOmega \) is the image grid of \(S^{*}\), \(W_i\) is a row vector formed by the NLMs weights, which is defined as

$$\begin{aligned} W_i(j)=\left\{ \begin{array}{cc} w_{ij}, &{} j\in V_j \\ 0, &{} \text {otherwise} \end{array}\right. \end{aligned}$$
(11)

We further rewrite (10) into the following concise form:

$$\begin{aligned} E_{nlms}(S^{*})=\parallel (I-W)S^{*}\parallel ^2_2 \end{aligned}$$
(12)

where I is an identity matrix and W is the NLMs similar weight matrix defined by \(W_i\) in (11).

3.2 Sparse Representation Regularization

For the fixed \(\mathbf {r}_0\), in q-space the \(S^*(\mathbf {q};\mathbf {r}_0)\) is the normalized diffusion signal at a b-shell along the direction \(\mathbf {q}\in \mathbb {S}^2\). In practical settings, the \(\mathbf {q}\) are discretized and restricted to a discrete set of orientations \(\{{\mathbf {q}_k}\}^K_{k=1}\) which prescribes the acquisition of diffusion data in the form of K diffusion-encoded images \(\{S^{*}_k\}_{k=1}^{K}\), which each \(S^{*}_k:\mathbb {R}^3\rightarrow \mathbb {R}^+\) corresponding to a given \(\mathbf {q}_k\). In this case, for a fixed \(\mathbf {r}_0\), the vector \([S^{*}_1(\mathbf {r}_0),S^{*}_2(\mathbf {r}_0),...,S^{*}_K(\mathbf {r}_0)]^T\in \mathbb {R}^K\) represents a discretization of \(S^{*}(\mathbf {q}|\mathbf {r}_0)\). For this purpose, we let \(s(\mathbf {r}_0)\in R^K\) denotes the vector of diffusion signal whose kth entry is equal to \(S^{*}_k(\mathbf {r}_0)\) and let the values of \(\psi _m\) for \(m=1,...,M\) at the K acquisition locations be stored in a \(K\times M\) matrix A defined as

$$\begin{aligned} A=\left[ \begin{array}{cccc} \psi _1(u_1) &{} \psi _2(u_1) &{} \cdots &{} \psi _M(u_1) \\ \psi _1(u_2) &{} \psi _2(u_2) &{} \cdots &{}\psi _M(u_2)\\ \vdots &{} \vdots &{} \cdots &{} \vdots \\ \psi _1(u_K)&{} \psi _2(u_K) &{} \cdots &{}\psi _M(u_K)\\ \end{array} \right] \end{aligned}$$
(13)

Each column of A is normalized through its \(\mathbb {L}_2\)-norm, to make all the spherical ridgelets have a unit norm. Thus the measurement s can be represented as

$$\begin{aligned} s(\mathbf {r}_0)=Ac(\mathbf {r}_0)+e(\mathbf {r}_0) \end{aligned}$$
(14)

where \(c(\mathbf {r}_0)\) denotes the vector of representation coefficients and \(e(\mathbf {r}_0)\) denotes the measurement noise. Since our main intension is to recover the coefficients c using as few diffusion-encoding gradients as possible (implying \(K\ll \) M), there is an infinite number of solutions which would fit the constraint \({\Vert A\{c\}-s\Vert }_{2}\ll \epsilon \). For each \(\mathbf {r}\in \varOmega _d\), the vector of representation coefficients \(c(\mathbf {r})\) is sparse, we may write a compressed sensing criterion for the estimation of \(c(\mathbf {r})\) as follows

$$\begin{aligned} {\min \limits _{c(\mathbf {r})}}\Vert c(\mathbf {r})\Vert _1 \end{aligned}$$
(15)
$$\begin{aligned} s.t. \Vert A\{c(\mathbf {r})\}-s(\mathbf {r})\Vert _2\le \epsilon . \end{aligned}$$
(16)

independently at each \(\mathbf {r}\in \varOmega \). This setup has been successfully used in [14] to reconstruct the high angular resolution dMRI signals.

3.3 Our Proposed SSR-CS Approach

To gain the high spatial and angular resolution diffusion MRI, we should not only consider the similarity regularization of single 3D diffusion MRI but also the sparse representation regularization of the different direction 3D diffusion MRI. For reconstructing a collection of different directions image volumes \(S={\{S^{*}_k\}}^{K}_{k=1}\) with high spatial resolution, a unified SSR-CS approach is proposed. The super spatial and angular resolution problem can be deemed as

$$\begin{aligned} \begin{aligned}&\hat{S}=\arg \min \limits _{V>0,S}\Vert DHS-L\Vert _2^2+\lambda _1\sum \limits _{i=1}^K\Vert (I-W)S\Vert _2^2+\lambda _2\Vert V\Vert _1\\ {}&s.t.\; S=AV \end{aligned} \end{aligned}$$
(17)

where L is a collection of low spatial resolution different directions images \(L={\{L^{*}_k\}}^{K}_{k=1}\); H and D stand for the blurring and down-sampling operations; A is the basis matrix introduced in compressed sensing section and each column vector of V is the representation coefficients at the corresponding voxel; the positive \(\lambda _1\) and \(\lambda _2\) determine the relative importance of data fitting terms versus the non-local similarity and sparse representation regularization terms. Note that it is not easy to directly solve the above problem because of the compound nature of the regularization it involves. we could rewrite the (18) into the following constrained optimization problem as

$$\begin{aligned} \begin{aligned} \hat{S}=\arg \min \limits _{V>0,S}\Vert DHS-L\Vert _2^2+\lambda _1\sum \limits _{i=1}^K\Vert (I-W)S\Vert _2^2+\lambda _2\Vert V\Vert _1+\lambda _3\Vert S-AV-q^t\Vert _2^2 \end{aligned} \end{aligned}$$
(18)
$$\begin{aligned} q^{t+1}=q^{t}+{AV^{t+1}-r^{t+1}} \end{aligned}$$
(19)

The split Bregman approach [16] allows one to reduce (18) to a simpler form, the minimization can now be performed by sequentially minimizing with respect to S and V separately. The resulting iteration steps are

$$\begin{aligned} \begin{aligned} Step\;1{:}\;S^{t+1}=\arg \min \limits _{S}\Vert DHS-L\Vert _2^2+\lambda _1\Vert S-\bar{S}\Vert _2^2+\lambda _3\Vert S-AV^t-q^t\Vert _2^2 \end{aligned} \end{aligned}$$
(20)
$$\begin{aligned} (\bar{S_i})^{t+1}=\sum _{j\in V_i}w(\bar{S_i}^{t},\bar{S_j}^{t})\bar{S_j}^{t} \end{aligned}$$
(21)
$$\begin{aligned} Step\;2{:}\;V^{t+1}=\arg \min \limits _{V>0}{\lambda _3\Vert AV-(S^t-q^t)\Vert _2^2+\lambda _2\Vert V\Vert _1} \end{aligned}$$
(22)

After we acquired the sparse coefficients, we can further compute the corresponding ODF image [13].

4 Experimental Results and Analysis

4.1 Construction of the Gold Standard

To validate the effectiveness of the proposed method, we conduct experiments on the simulated data sets proposed in [17]. In real clinical cases, diffusion MRI are often contaminated by rician noise. To validate the effectiveness of our model in high level noise condition and low level noise condition respectively, the simulated diffusion-encoded images were contaminated by two different levels of rician noise, giving rise to SNR of 40 db and 24 db. We use the following model in [14] to generate corresponding diffusion-encode images \(\{S_k\}_{k=1}^{K}\) for a range of different values of K

$$\begin{aligned} S(\mathbf {q};\mathbf {r})=S(\mathbf {0};\mathbf {r})\sum \limits _{i=1}^{M(\mathbf {r})}\alpha _i(\mathbf {r})exp\{-b(\mathbf {q}^{T}D_{i}(\mathbf {r})\mathbf {q})\} \end{aligned}$$
(23)

where \(\alpha _i(\mathbf {r})>0\) are the positive weights obeying \(\sum \limits _{i=1}^{M(\mathbf {r})}\alpha _i(\mathbf {r})=1\), b is defined as a function of shape and amplitude of diffusion-encoding gradients,and \(D_i(\mathbf {r})_{i=1}^{M(\mathbf {r})}\) are \(3\times 3\) diffusion tensors associated with \(M(\mathbf {r})\) neural fiber tracts passing through the \(\mathbf {r}\) coordinates.

The simulated set had a spatial dimension of 31\(\times \)31\(\times \)31 voxels with 90 gradients directions in a quasi-uniform manner which is straightforward to adapt for sampling of the “northern” hemisphere. The set are consisted of some “fibres” crossing. \(b=2000\,\text {s}/\text {mm}^2\) were used for data generation. The diffusion tensors \(D_i(\mathbf {r})\) in (24) respectively were: \(D_1=diag([y,x,z]), D_2=diag([x,z,y]), D_3=diag([0,0,1]),D_4=diag([1/\sqrt{3},1/\sqrt{3},1/\sqrt{3}])\). The orientations of diffusion flows (ODF) are shown in Fig. 1. The ODF are computed by the method in [18, 19]. Note that the number of the diffusion components \(M(\mathbf {r})\)varied between 1 and 3. One of the diffusion encoding images was shown in Fig. 2.

Fig. 1.
figure 1

The orientations of the original high angular diffusion flows

Fig. 2.
figure 2

Simulated 3D diffusion MRI and SNR = \({\infty }\) dB

To again the low resolution diffusion image, we firstly blur and downsample the diffusion encoding image. A downsampling factor of 2 along each axis was used. Then we choose \(K=24\) of the original set of 90 diffusion gradients. Within the subset of the 24 diffusion gradients, their corresponding points in q-space were also in a quasi-uniform coverage of the northern hemisphere. Finally the chosen 24 simulated diffusion-encoded images were contaminated by two level of Rician noise, giving rise to SNR of 24 dB and 40 dB diffusion encoding LR images as shown in Figs. 3 and 4 to reconstruct the high spatial and angular resolution images.

Fig. 3.
figure 3

Simulated downsampling 3D diffusion MRI and SNR = 24 dB

Fig. 4.
figure 4

Simulated downsampling 3D diffusion MRI and SNR = 40 dB

4.2 Experiment Results

To evaluate the quality of image reconstruction, we compare the proposed approach with cubic interpolation, B-spline interpolation. At the same time, the approach will also be compared to previously proposed patch-based superresolution methods (PBSR), the original PBSR method described in [15] used to reconstruct the normalized diffusion MRI instead of MRI was included in the comparison.

Table 1. PSNR estimated between the gold standard and the reconstructed images from 24 dB data
Table 2. PSNR estimated between the gold standard and the reconstructed images from 40 dB data

To evaluate the quality of reconstruction two different metrics were used, the usual Peak Signal-to-Noise Ratio (PSNR) and the percentage of generalized fractional anisotropy (GFA) were computed. The quality measures were estimated between the reconstructed HR images and the gold standard. The PSNR index was used to measure the spatial resolution quality and the GFA difference was used to measure the angular resolution quality. The experiment results were shown as follows.

Firstly we show the PSNR index in Tables 1 and 2 of our proposed method and the other methods compared when the LR data were in 24 dB and 40 dB. Then we should consider the angular index, i.e., GFA difference, see Tables 3 and 4.

From the results, we could see the proposed method has obvious advantage over the other methods when the LR data was in low SNR. When the LR data was in high SNR, the proposed method is still best. Besides, the GFA difference index shows the proposed method is better than PBSR when the LR data was in 24 dB data and 40 dB data.

Table 3. GFA difference estimated between the gold standard and the reconstructed images from 24 dB data
Table 4. GFA difference estimated between the gold standard and the reconstructed images from 40 dB data
Fig. 5.
figure 5

3D DWI reconstruction using compared methods

Fig. 6.
figure 6

ODF using compared methods

The reconstructed 3D diffusion MRI from the 40 dB data using PBSR and our proposed method are shown in Fig. 5.

The reconstructed ODF from the 40 dB data using PBSR and our proposed method are shown in Fig. 6.

5 Conclusion

In this work, we investigated the possibility to increase diffusion MRI spatial resolution and angular resolution using a new method named SSR-CS. We combined the single image spatial superresolution and CS to propose a practical diffusion acquisition and reconstruction scheme that allows for obtaining HSAR diffusion MRI. The proposed technique is independent of scanner type and can be implemented on any clinically feasible scan time. We found that the proposed method could effectively reconstruct the high spatial and angular resolution diffusion MRI from the compared results.

We also note some limitations of the proposed method. When the LR diffusion MRI was in 40 dB, the proposed algorithm just improve 0.13 dB compared to the PBSR in PSNR. The advantage of the proposed method is not obvious. In the future we will focus on this aspect of the algorithm. At the same time, we should consider the speed of the algorithm because the clinical diffusion MRI were actually 4D data. The high dimension of the data will limit the speed of the super-resolution algorithm.