Introduction

Magnet resonance imaging (MRI) of the liver is an increasingly common examination performed for the detection and characterization of focal liver lesions, in addition to the assessment of diffuse liver disease [1,2,3]. Fat-suppressed T2-weighted imaging (T2-FS) is a critical component of clinical liver MRI, providing a high contrast-to-noise ratio (CNR) to allow for the detection of focal abnormalities [4]. However, traditional T2-FS imaging often requires several minutes of acquisition time to obtain high-quality images, and even with the additional techniques of signal averaging, ordered phase encoding, and gradient moment nulling, and these sequences can be wrought with motion artifacts [5]. More recently, multi-shot 2D fast spin-echo pulse sequences (T2 FSE) with frequency-selective fat suppression have been used for sequence acquisition [6,7,8]. In addition to allowing for quicker acquisition times, T2 FSE techniques provide greater differences in signal intensity between solid and nonsolid lesions than on conventional SE images, likely as a result of greater magnetization-transfer contrast [9]. Even with these reductions in acquisition time, current T2 FSE protocols employ multi-shot acquisitions susceptible to respiratory motion, prompting the need for techniques, such as multi-breath-hold acquisitions, prospective respiratory triggering (RT), or navigator-based triggering (PACE) to acquire diagnostically acceptable images [10, 11]. Despite these additional steps, T2-FS images remain motion sensitive with associated low-quality images, diagnostically unacceptable blurring, and motion artifacts, often prompting repeat acquisitions, reportedly occurring as frequently as 54.7% at one institution [12].

Several methods have been explored to improve image quality, including 3D fast spin-echo pulse sequences (SPACE [Siemens Healthcare, Erlangen, Germany], CUBE [General Electric Healthcare, Milwaukee, WI, USA], VISTA [Philips Healthcare, Best, the Netherlands]), and periodically rotated overlapping parallel lines with enhanced reconstruction (PROPELLER [General Electric Healthcare, Milwaukee, WI, USA], BLADE [Siemens Healthcare, Erlangen, Germany], MultiVane [Philips Healthcare, Best, the Netherlands]), but at the cost of increased overall acquisition times [13, 14].

Single-shot T2-weighted imaging has also been suggested as an approach for reducing motion artifacts, given that these sequences have faster acquisition times and are, at baseline, more motion robust. However, traditional single-shot techniques have lower image contrast than T2 FSE, owing to the need to balance echo time and duration during the acquisition of single-shot images, given that longer echo train durations lead to increased T2 blurring. To attain the desired echo time, single-shot FSE sequences often use short echo spacings and partial Fourier acquisition, resulting in decreased scan times and motion artifacts, but at the expense of decreased spatial resolution, signal-to-noise ratio (SNR), and focal lesion conspicuity [15,16,17,18]. In addition, these single-shot techniques often lack fat suppression, which limits the detection and characterization of focal lesions due to increased ghosting artifacts from high signal fat and decreased liver-to-lesion contrast [19,20,21]. However, the addition of fat suppression techniques may result in inhomogeneous signal suppression near the body-array coil and increases overall scan time [20]. Consequently, there remains a need for the development of a shorter and more motion robust T2-weighted fat-suppressed MRI sequence for clinical MRI liver imaging.

In order to obtain sufficient image contrast comparable to traditional T2 FSE sequences, a single-shot sequence would require a high acceleration while still maintaining the SNR, which Deep-Learning (DL)-based image reconstructions are able to provide using an iterative procedure consisting of k-space data consistency requirements and image regularization, trained through an optimization process on representative images [22,23,24,25]. Recently, a single-shot T2-FS HASTE sequence utilizing deep-learning reconstruction has been employed on clinical 1.5 T MRI liver examinations at one institution, demonstrating decreased imaging times with improved overall image quality [26]. However, these techniques have yet to be implemented on 3 T clinical MRI liver examinations, which carry their own associated challenges due to increased magnetic field strength. Although 3 T examinations demonstrate increased SNR, allowing for improved spatial resolution, in comparison with 1.5 T examinations, they often exhibit exacerbation of artifacts and have specific absorption rate (SAR) limitations [27, 28]. Therefore, we aim to compare the image quality of a single-shot T2-weighted MRI prototype with deep-learning-based image reconstruction (DL HASTE-FS) with a standard T2-FS sequence for routine clinical liver MRI at 3 T.

Methods

Patients

This prospective HIPAA-compliant study was performed following institutional review board approval with waived informed consent. A total of 41 consecutive patients (male = 27, female = 14, mean age = 58, range = 25–83 years) underwent clinically indicated abdominal MRI examinations at one of our institution’s outpatient facilities between May 6, 2020 and November 23, 2020. Clinical indications for the MRIs were as follows: cirrhosis (n = 18), focal lesion (n = 7), viral hepatitis (n = 4), abnormal liver enzymes (n = 4), fatty liver disease (n = 3), follow-up after liver transplantation (n = 3), abdominal pain (n = 1), and splenomegaly (n = 1).

MR-imaging protocol

MR-imaging studies were performed on clinical 3 T MR imaging systems (MAGNETOM Skyra and Prisma; Siemens Healthcare, Erlangen, Germany), which had the novel prototype DL HASTE-FS sequence available. Traditional fat-suppressed two-dimensional (2D) T2-weighted turbo spin-echo (T2-FS TSE) and DL HASTE-FS sequences were performed as part of the routine 3 T liver MRI protocol. An 18-channel body-array coil and an 8-channel posterior spine coil were used for the imaging study.

Conventional T2-FS sequence acquisition

2D T2-FS acquisition is routinely performed as part of our institution’s routine clinical liver MRI protocol and was performed with the following parameters: TR/TE = 3800 ms/105 ms; FA = 132 degrees; spectral-attenuated inversion recovery (SPAIR) was used for fat suppression; FOV = 375 × 375 mm2; matrix = 256 × 205 (matched with DL HASTE-FS acquisition); section thickness = 5 mm (matched with DL HASTE-FS acquisition); resolution (interpolated) = 0.68 × 0.68 × 5 mm3; receiver bandwidth = 305 Hz/Px; PAT factor = 3; number of axial sections = 40; and number of breath holds = 3 (20 s each).

DL HASTE-FS sequence acquisition

The prototypical DL HASTE-FS sequence is based on the standard HASTE sequence, with the following alterations: During the echo train, no calibration data for the estimation of coil-sensitivity maps are acquired, but instead the calibration data are acquired in a second echo train following about 50 ms after the image data are acquired, which maintains the motion robustness and even reduces SAR, since lower flip angles can be used for calibration data. In order to avoid crosstalk and magnetization transfer effects, the slice increment of consecutively acquired slices is increased, which obviates artificial delays between the acquisitions of subsequent slices and allows for a repetition time of 500 ms without noticeable change of the image contrast. Conventional parallel imaging sampling patterns are used for the acceleration. Spectral-attenuated inversion recovery (SPAIR) was used for fat suppression [29]. Imaging parameters were as follows: TR/TE = 582 ms/118 ms; FA = 140 degrees; FOV = 375 × 375 mm2; matrix = 256 × 205 (matched with traditional 2D T2-FS acquisition); section thickness = 5 mm (matched with traditional 2D T2-FS acquisition); resolution (interpolated) = 0.68 × 0.68 × 5 mm3; receiver bandwidth = 723 Hz/Px; PAT factor = 3; ETL = 64 (for reference, ETL = 82 for conventional HASTE liver imaging); number of axial sections = 30; and number of breath holds = 1. Given that the HASTE sequence is a single-shot acquisition, the term TR is used differently for this type of sequence and refers to the duration between sequentially acquired slices. In addition, the TE provided is truly an “effective TE,” since there is no specific time point when it is acquired, instead, data are acquired at multiple TE’s and the “effective TE” is placed at the center of K-space to achieve the desired contrast.

DL HASTE-FS reconstruction

DL HASTE-FS sequence reconstruction employs an unrolled iterative reconstruction network which was previously used on 1.5 T MRI liver examinations [26]. This architecture shares some similarities with variational networks introduced previously [22, 24]. Initially, separately acquired calibration data are used to estimate step coil-sensitivity maps. K-space data, bias-field correction, and coil-sensitivity maps are then inserted into the variational network for reconstruction, which uses two types of iterations, both of them with trainable (Nesterov-type) extrapolation steps [30]. No regularization is applied for the first 22 iterations and the network focuses on parallel imaging. Subsequently, a regularization based on residual dense U-net is applied for the following 12 iterations. The empirical finding that initial steps in the variational network focus on signal recovery of missing data near the k-space center allows this approach to perform acquisitions without integrated calibration and using flexible k-space sampling.

Ground-truth images, acquired with parallel imaging, were obtained for the supervised training. Moderate parallel imaging with conventional HASTE protocols was used to acquire training data on volunteers, with the training based on further retrospective down sampling of the acquired data. For example, a typical protocol parameter consisted of a parallel imaging acceleration of 2 in the actual acquisition and an acceleration of 4 in the retrospective down sampling. Approximately 10,000 slices were acquired for training on clinical 1.5 T and 3 T MR scanners (MAGNETOM scanners, Siemens Healthcare, Erlangen, Germany), and the training was implemented in PyTorch and performed on a NVIDIA Tesla V100 (32 GB of memory) GPU.

Subsequently, the trained network was integrated into the scanner reconstruction pipeline by converting into a C +  + implemented inference framework. Inference required about 2 s per slice using the given protocol settings for the CPU-only reconstruction on a clinical MRI scanner.

Image analysis

Traditional T2-FS and DL HASTE-FS data were anonymized in Digital Imaging and Communications in Medicine (DICOM) format and randomly assorted. Three board-certified radiologists (with 2, 3, and 4 years of clinical experience in interpretation of MR examinations, respectively) underwent a short training session before image evaluation, then subsequently reviewed the cases and performed image analysis independently. No data other than the anonymized MRI images were available to the readers.

Using a five-point Likert scale (1–5) (Table 1), readers assessed sharpness of the liver margin, sharpness of intra-hepatic vessels, sharpness of the pancreatic duct, homogeneity of fat suppression, strength of fat suppression, in-plane and through-plane motion artifacts, other ghosting artifacts, and overall image quality. In addition, readers evaluated the cases for the presence or absence of focal hepatic lesions, which, if present, were assessed for lesion conspicuity and sharpness of the lesion edge, using the same five-point scale.

Table 1 Likert scale scoring (1–5) for each imaging quality measure

Statistical analysis

Acquisition times between traditional T2-FS and DL HASTE-FS were compared using the paired Wilcoxon signed-rank test for each case. Likert scores for each imaging quality metric were tabulated, and descriptive statistics (mean, standard deviation, median, first and third quartiles, and range) were computed for each reader and overall. In addition, the paired Wilcoxon signed-rank test was used to compare imaging quality metrics between sequences. For the subset of cases with focal liver lesions, the corresponding analysis of scores relating to lesion conspicuity and sharpness of lesion edge was performed. Reader score consistency was assessed using intra-class correlation coefficient (ICC) analysis, where < 0.4 = poor agreement, 0.4–0.59 = fair agreement, 0.6–0.74 = good agreement, and 0.75–1 = excellent agreement. All p values were two sided and considered statistically significant when less than 0.05. Analysis was performed using Microsoft Excel (Microsoft, Redmond, Washington) and MedCalc for Windows (MedCalc Software, Ostend, Belgium).

Prior studies

Recently, a single-shot T2-FS HASTE sequence utilizing deep-learning reconstruction has been employed on clinical 1.5 T MRI liver examinations at our institution, demonstrating decreased imaging times with improved overall image quality [26]. This study utilized a separate patient cohort with MR liver examinations performed at 1.5 T but employed a similar network for image reconstruction and a similar method for reader image analysis.

Statements and declarations

Two authors (DN and SA) are Siemens Healthcare employees, who provided technical assistance, but were not involved in the data acquisition or evaluation, nor did they have direct control of the data.

Results

Acquisition time

Acquisition time for DL HASTE-FS was 54.93 ± 16.69 s, which was significantly (p < 0.001) shorter than the acquisition time for traditional T2-FS (114.00 ± 32.98 s) (Table 2).

Table 2 Acquisition time comparison between DL HASTE-FS and T2-FS

Image quality and artifact assessment

The results of the subjective assessment of image quality for the DL HASTE-FS and traditional T2-FS sequences by the three readers using the five-point Likert scale are described in Table 3. DL HASTE-FS received significantly higher scores than standard T2-FS for sharpness of liver margin (mean 4.3 vs 3.3; p < 0.001), hepatic vessel margin (4.2 vs 3.3; p < 0.001), pancreatic duct margin (4.0 vs 1.9; p < 0.001); in-plane (4.0 vs 3.2; p < 0.001) and through-plane (3.9 vs 3.4; p < 0.001) motion artifacts; other ghosting artifacts (4.3 vs 2.9; p < 0.001); and overall image quality (4.0 vs 2.9; p < 0.001), in addition to receiving a higher score for homogeneity of fat suppression (3.7 vs 3.4; p = 0.04) and liver-fat contrast (p = 0.03), without receiving a significant difference in scores for strength of fat suppression (p = 0.22). A comparison of example cases between T2-FS and DL HASTE-FS is provided in Fig. 1.

Table 3 Image quality mean (± SD) score comparison between DL HASTE-FS and T2-FS
Fig. 1
figure 1

Example cases comparison between T2-FS and DL HASTE-FS

Focal liver lesions

A total of 12 focal liver lesions were seen by each reader on both sequences, with the individual reader breakdown as follows: 16 lesions on DL HASTE-FS and 15 on T2-FS for reader 1, 15 lesions on DL HASTE-FS and 12 on T2-FS for reader 2, and 20 lesions on DL HASTE-FS, and 20 on T2-FS for reader 3. For lesions seen by each reader on both sequences, DL HASTE-FS received a significantly higher score than standard T2-FS for sharpness of lesion margin (4.4 vs 3.7; p = 0.03), without receiving a significant difference in scores for liver-lesion contrast (p = 0.68) (Table 4).

Table 4 Overall image quality mean (± SD) score comparison between DL HASTE-FS and T2-FS for focal lesions seen by each reader on both sequences (N = 12)

Inter-reader agreement

Intra-class correlation coefficient (ICC) among the three readers for the overall image quality was fair for DL HASTE-FS (ICC = 0.51, range = 0.31–0.69) and good for T2-FS (ICC = 0.60, range = 0.53–0.70).

Discussion

In our study, DL HASTE-FS received significantly higher scores than standard T2-FS for sharpness of liver margin, hepatic vessel margin, pancreatic duct margin; in-plane and through-plane motion artifacts; other ghosting artifacts; and overall image quality, in addition to receiving a higher score for homogeneity of fat suppression and liver-fat contrast. Strength of fat suppression was comparable for both DL HASTE-FS and T2-FS. For lesions seen by each reader on both sequences, DL HASTE-FS received significantly higher scores than standard T2-FS for sharpness of lesion margin, with comparable scores for liver-lesion contrast. The total acquisition time for the DL HASTE-FS sequence was significantly less than the conventional T2-FS sequence. These results are similar to results seen at 1.5 T utilizing a similar sequence, which demonstrates the generalizability of deep-learning applications across different magnetic field strengths.

Our novel single-shot T2-weighted MRI with deep-learning-based image reconstruction used regular undersampling without integrated reference data to shorten the echo train duration for the image data and to lower the specific absorption rate, instead acquiring reference data right after the image data with a lower flip angle. In addition, the approach utilized pre-iterations with trainable extrapolations and gradient steps to allow for data reconstruction with an undersampled k-space center. In order to apply the homodyne filter on the network predictions, the network reconstructed complex-value images.

Not only does MRI provide significant information about the background liver parenchyma, biliary tree, and hepatic vasculature, but it often allows for the definitive characterization of various solid and cystic hepatic lesions. Traditional T2-FS images of the liver are essential for this purpose but require a long scan time and often can be wrought with motion artifacts. Even with breath-hold techniques, respiratory motion artifacts often prompt repeat sequence acquisition, increasing overall scan time, and therefore, decreasing patient throughput. Previously, modified acquisition techniques, such as 3D fast spin-echo pulse sequences and periodically rotated overlapping parallel lines with enhanced reconstruction, have been employed to mitigate this and have demonstrated improved overall image quality, but at the cost of increased overall acquisition times. Alternatively, single-shot T2 FSE has been used to increase motion robustness but demonstrates increased overall image noise, decreased signal-to-noise ratio (SNR) of the liver, and decreased liver-lesion contrast-to-noise ratio (CNR) when compared with traditional T2 FSE, limiting the clinical utility of the technique [31]. Deep-learning (DL) techniques have demonstrated the ability to improve image quality by identifying and mitigating artifacts in accelerated acquisitions, allowing for reduced overall imaging time [23]. This novel single-shot T2-weighted MRI with deep-learning-based image reconstruction sequence offers a potential solution to these previously encountered issues.

One limitation of our study was the number of patients included, which was limited by the number of patients receiving both traditional T2-FS and DL HASTE-FS sequences at the time of the study. Although the results of this study are promising for future clinical application of the sequence, and potential replacement of the traditional T2-FS sequence at our institution, a larger study will be required to compare the two sequences’ utility in lesion detection and characterization. Additionally, only outpatients were included in this study due to the limited availability of the prototype sequence on two MRI systems at an outpatient facility. Finally, we did not analyze the discrepancy in lesions detection rate both between readers for each sequence and for the first two readers between sequences. The discrepancy between readers on each sequence may have been secondary to each author’s interpretation of what constitutes a notable lesion. However, for readers 1 and 2, more lesions were seen on DL HASTE-FS than on T2-FS, for which the clinical significance of this discrepancy is unclear and beyond the scope of this study.

In conclusion, our novel single-shot T2-weighted MRI with deep-learning-based image reconstruction demonstrated superior image quality compared with the standard T2-FS sequence for routine clinical liver MRI at 3 T, while being acquired in less than half the time. Therefore, DL HASTE-FS is a potential alternative to traditional T2-FS in routine clinical liver MRI.