1 Introduction

Magnetic Resonance Imaging (MRI) has been widely used to examine almost any part of the body, since it can depict structures inside the human body non-invasively and produce high-contrast images. Notably, cardiac MRI (CMR), which assesses cardiac structure and function, plays a key role in evidence-based diagnostic and therapeutic pathways in cardiovascular disease [13], including the assessment of myocardial ischemia, cardiomyopathies, myocarditis, and congenital heart disease [14]. However, acquiring high-resolution CMR is time-consuming and costly because it is sensitive to changes in the cardiac cycle length and respiratory position [23], which limits its clinical applicability.

To address this issue, the single image super-resolution (SISR) technique, which aims at reconstructing a high-resolution (HR) image from a low-resolution (LR) one, holds great promise, as it requires no change to the hardware or scanning protocol. Most MRI SISR approaches [3, 21, 24] are based on deep learning methods [5, 16], which learn the LR-HR mapping from extensive LR-HR paired data. On the other hand, several previous studies [11, 31] adapt the self-similarity-based SISR algorithm [8], which needs no external HR data for training. However, straightforwardly applying the aforementioned methods to CMR video reconstruction is inappropriate, since the relationship among consecutive frames in a CMR video is not well considered. Therefore, we adopt the video super-resolution (VSR) technique, which properly leverages temporal information and has been applied in numerous works [7, 10, 22, 27, 30], to perform CMR video reconstruction.

Fig. 1.

We present an efficient post-processing method to facilitate the acquisition of high-quality cardiac MRI (CMR), which is conventionally time-consuming, costly, and sensitive to changes in the cardiac cycle length and respiratory position [23]. Specifically, we utilize domain knowledge and iteratively enhance low-resolution CMR with a neural network, which can reduce the scan time and cost without changing the hardware or scanning protocol.

In this work, we propose an end-to-end trainable network to address the CMR VSR problem. To properly exploit temporal information, we choose ConvLSTM [28], which has proven effective [6, 9], as our backbone. Moreover, we introduce domain knowledge (i.e., the cardiac phase), which has been shown to be important for measuring the stroke volume [15] and for disease diagnosis [29], to provide direct guidance about the temporal relationship within a cardiac cycle. Combined with the proposed phase fusion module, the model can better utilize temporal information. Last but not least, we devise the residual of residual learning, inspired by the iterative error feedback mechanism [2, 19], to guide the model to iteratively recover the lost details. Different from purely feed-forward approaches [10, 18, 22, 27, 30], our iterative learning strategy makes it easier for the model to represent the LR-HR mapping with fewer parameters.

We evaluate our model and multiple state-of-the-art baselines on two synthetic datasets established by mimicking the acquisition of MRI [4, 31] from two public datasets [1, 26]. It is worth noting that one of them is used entirely for external evaluation. To properly assess model performance, we introduce cardiac metrics based on PSNR and SSIM. The experimental results show that the proposed network outperforms existing methods even on the large-scale external dataset, which indicates that our model generalizes well. To the best of our knowledge, this work is the first to address the CMR VSR problem and provides a benchmark to facilitate development in this domain.

Fig. 2.

Model overview. The bidirectional ConvLSTM [28] utilizes the temporal information from forward and backward directions. The phase fusion module exploits the informative phase code to leverage the bidirectional features. With the residual of residual learning, the network recovers the results in a coarse-to-fine fashion. Auxiliary paths are adopted for stabilizing the training procedure.

2 Proposed Approach

Let \(I_{LR}^t\) \(\in \mathbb {R}^{H \times W}\) denote the t-th LR frame obtained by down-sampling the original HR frame \(I_{HR}^t\) \(\in \mathbb {R}^{rH \times rW}\) with the scale factor r. Given a sequence of LR frames denoted as {\(I_{LR}^t\)}, the proposed end-to-end trainable model aims to estimate the corresponding high-quality results {\(I_{SR}^t\)} that approximate the ground truth frames {\(I_{HR}^t\)}. Besides, \(\oplus \) refers to the element-wise addition.

2.1 Overall Architecture

Our proposed network is illustrated in Fig. 2. It consists of a feature extractor, a bidirectional ConvLSTM [28], a phase fusion module, and an up-sampler. The feature extractor (FE) first processes the frame \(I_{LR}^t\) to obtain the low-frequency feature \(L^t\). Subsequently, the bidirectional ConvLSTM [28], comprising a forward ConvLSTM (\(ConvLSTM_F\)) and a backward ConvLSTM (\(ConvLSTM_B\)), makes use of the low-frequency feature \(L^t\) to generate the high-frequency features \(H^t_F, H^t_B\). With the help of its memory mechanism, the bidirectional ConvLSTM can fully utilize the temporal relationship among consecutive frames in both directions. In addition, thanks to the cyclic nature of cardiac videos, we can update the memory cells in the bidirectional ConvLSTM in advance instead of starting from empty states. This is done by feeding the n consecutive frames preceding and following the input sequence {\(I^t_{LR}\)} to the network.
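The memory warm-up described above can be sketched as a simple sequence-padding step; the function name and list-based representation are illustrative, not from the released code:

```python
def warm_start_sequence(frames, n):
    """Exploit the cyclic cardiac video: prepend the last n frames and
    append the first n, so the forward and backward ConvLSTM memory
    cells are updated before the target clip is processed (Sect. 2.1).

    `frames` is a list of frames covering one cardiac cycle.
    """
    return frames[-n:] + frames + frames[:n]
```

The padded sequence is fed through the network, and only the outputs corresponding to the original clip are kept.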

Furthermore, to completely integrate the bidirectional features, the designed phase fusion module (PF) applies the cardiac knowledge of the \(2N+1\) successive frames from \(t-N\) to \(t+N\) in the form of the phase code \(P^{[t-N:t+N]}\), which can be formulated as \(H_P^t = PF(H^{[t-N:t+N]}_F, H^{[t-N:t+N]}_B, P^{[t-N:t+N]})\), where \(H_P^t\) represents the fused high-frequency feature. After that, the fused high-frequency feature \(H_P^t\), combined with the low-frequency feature \(L^t\) through the global skip connection, is up-scaled by the up-sampler (Up) into the super-resolved image \(I^t_{SR} = Up(H_P^t \oplus L^t)\). We further define the sub-network (\(Net_{sub}\)) as the combination of \(ConvLSTM_F, ConvLSTM_B\), and PF; its purpose is to recover the high-frequency residual \(H_P^t = Net_{sub}(L^t)\). Besides, we employ the deep supervision technique [17] to provide an additional gradient signal and stabilize the training process by adding two auxiliary paths, namely \(I^t_{SR, F} = Up(H_F^t \oplus L^t)\) and \(I^t_{SR, B} = Up(H_B^t \oplus L^t)\). Finally, we propose the residual of residual learning, which progressively restores the residual that has yet to be recovered at each refinement stage \(\omega \). To simplify the notation, \(\omega \) is omitted when it equals 0, e.g., \(L^t_F\) means the low-frequency feature of the t-th frame at the 0-th stage \(L^{t, 0}_F\).

Fig. 3.

Proposed components. (a) Phase code, formulated as a periodic function, contains domain knowledge (i.e., cardiac phase). (b) Phase fusion module can recognize the phase of the current sequence with the cardiac knowledge to thoroughly integrate the bidirectional features. (c) Residual of residual learning aims at directing the model to reconstruct the results in a coarse-to-fine manner.

2.2 Phase Fusion Module

The cardiac cycle is a cyclic sequence of events occurring as the heart beats, consisting of the systole and diastole processes. Identifying the end-systole (ES) and end-diastole (ED) in a cardiac cycle has proved critical in several applications, such as measuring the ejection fraction and stroke volume [15] and diagnosing disease [29]. Hence, we embed the physical meaning of the input frames into our model via an informative phase code, generated by projecting the cardiac cycle onto a periodic cosine function as depicted in Fig. 3a. Specifically, we map the systole and the diastole processes to half cosine periods separately:

$$\begin{aligned} P^t = {\left\{ \begin{array}{ll} Cos(\pi \times \frac{t-ED}{ES-ED}), &{} \text {if } \; \text {ED} < t \le \text {ES}\\ Cos(\pi \times (1+\frac{(t-ES)\%T}{T-(ES-ED)})), &{} \text {otherwise} \end{array}\right. } \end{aligned}$$
(1)

where % denotes the modulo operation and T is the number of frames in a cardiac cycle.
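Eq. (1) can be computed directly per frame index; the following sketch assumes integer frame indices with \(ED < ES < T\):

```python
import numpy as np

def phase_code(t, ed, es, T):
    """Map frame index t to a phase value in [-1, 1] (Eq. 1).

    Systole (ED < t <= ES) covers the first half cosine period, going
    from 1 down to -1; diastole covers the remaining T - (ES - ED)
    frames of the cycle, rising back from -1 to 1.
    """
    if ed < t <= es:
        return np.cos(np.pi * (t - ed) / (es - ed))
    return np.cos(np.pi * (1 + ((t - es) % T) / (T - (es - ed))))
```

For example, with ED = 0, ES = 10, and T = 30, the code equals 1 at ED, -1 at ES, and 0 halfway through systole.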

The overview of the proposed phase fusion module is shown in Fig. 3b. The features from the bidirectional ConvLSTM, concatenated with the corresponding phase codes, are fed into the fusion module. With the help of the \(2N+1\) consecutive phase codes, it can link frames at the same phase position across different periods (inter-period). Besides, it can recognize whether the heart is relaxing or contracting, as the phase code is respectively increasing or decreasing (intra-period).
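The input assembly of the module can be sketched as follows. This is a minimal illustration of the concatenation step only; `fuse` stands in for the learned fusion convolutions, and the broadcast of the scalar phase code into a feature plane is our assumption about how the code is injected:

```python
import numpy as np

def phase_fusion(h_f, h_b, p, fuse):
    """Assemble the phase fusion module input (Sect. 2.2).

    h_f, h_b: lists of 2N+1 forward/backward feature maps of shape
    (C, H, W); p: list of 2N+1 scalar phase codes, each broadcast to
    a (1, H, W) plane. All channels are concatenated and passed to the
    learned fusion network `fuse`.
    """
    planes = []
    for f, b, code in zip(h_f, h_b, p):
        _, H, W = f.shape
        planes += [f, b, np.full((1, H, W), code)]
    return fuse(np.concatenate(planes, axis=0))
```

With N = 1 (three frames) and 4-channel features, the fused input has 3 × (4 + 4 + 1) = 27 channels.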

2.3 Residual of Residual Learning

In the computer vision field, the iterative error-correcting mechanism plays an essential role in several topics, such as reinforcement learning [19], scene reconstruction [20], and human pose estimation [2]. Inspired by this mechanism, we propose the residual of residual learning, which decomposes the reconstruction process into multiple stages, as shown in Fig. 3c. At each stage, the sub-network (\(Net_{sub}\)) in our model estimates the high-frequency residual based on the current low-frequency feature, and then the input low-frequency feature is updated for the next refinement stage. Let \(L^{t, 0}\) be the initial feature from the feature extractor (FE) and \(L^{t, \omega }\) denote the updated feature at iteration \(\omega \); the residual of residual learning for \(\varOmega \) stages can be described in the recursive format:

$$\begin{aligned} L^{t, \omega } = {\left\{ \begin{array}{ll} FE(I^t_{LR}), &{} \text {when }\omega = 0 \\ L^{t, \omega -1} \oplus Net_{sub}(L^{t, \omega -1}), &{} \text {if } 0 < \omega \le \varOmega \end{array}\right. } \end{aligned}$$
(2)

Then, the network generates the super-resolution result \(I^{t, \omega }_{SR}\) based on the current reconstructed feature \(L^{t, \omega }\), which can be written as:

$$\begin{aligned} I^{t, \omega }_{SR} = Up(L^{t, \omega } \oplus Net_{sub}(L^{t, \omega })) \end{aligned}$$
(3)

The model progressively restores the residual that has yet to be recovered at each refinement stage, hence the name residual of residual learning. Compared to other one-step approaches [10, 18, 22, 27, 30], the proposed mechanism breaks the ill-posed problem down into several easier sub-problems in a divide-and-conquer manner. Most notably, it can dynamically adjust the number of iterations depending on the problem difficulty without any additional parameters.
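The control flow of Eqs. (2) and (3) can be sketched as follows. Here `net_sub` and `upsample` are placeholders for the sub-network (ConvLSTMs plus phase fusion) and the up-sampler; any callables on arrays work, since only the iterative structure is illustrated:

```python
import numpy as np

def residual_of_residual(feat_init, net_sub, upsample, num_stages):
    """Iteratively refine the low-frequency feature (Eq. 2) and decode
    an SR estimate at every stage (Eq. 3).

    Note that the same residual Net_sub(L^{t,omega}) is used both to
    form the stage-omega output and to produce the next feature, so
    each stage only needs one sub-network pass.
    """
    L = feat_init                        # L^{t,0} = FE(I_LR^t)
    outputs = []
    for _ in range(num_stages + 1):      # stages omega = 0 .. Omega
        H = net_sub(L)                   # high-frequency residual
        outputs.append(upsample(L + H))  # I_SR^{t,omega} (Eq. 3)
        L = L + H                        # L^{t,omega+1} (Eq. 2)
    return outputs
```

Because the same `net_sub` weights are reused at every stage, increasing \(\varOmega \) adds computation but no parameters, matching the claim above.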

2.4 Loss Function

In this section, we elaborate on the mathematical formulation of our cost function. At each refinement stage \(\omega \), the super-resolved frames {\(I^{t, \omega }_{SR}\)} are supervised by the ground-truth HR video {\(I^t_{HR}\)}, which can be formulated as \(\mathcal {L}^\omega = \frac{1}{\tilde{T}}\sum _{t=1}^{\tilde{T}}\ \parallel I^{t, \omega }_{SR} - I^t_{HR} \parallel _1\), where \(\tilde{T}\) indicates the length of the video sequence fed into the network. We choose the L1 loss as the cost function since previous works have demonstrated that it provides better convergence than the widely used L2 loss [18, 32]. Besides, we apply the deep supervision technique as described in Sect. 2.1 by adding two auxiliary losses \(\mathcal {L}_F^\omega = \frac{1}{\tilde{T}}\sum _{t=1}^{\tilde{T}}\ \parallel I^{t, \omega }_{SR, F} - I^t_{HR} \parallel _1\) and \(\mathcal {L}_B^\omega = \frac{1}{\tilde{T}}\sum _{t=1}^{\tilde{T}}\ \parallel I^{t, \omega }_{SR, B} - I^t_{HR} \parallel _1\). Hence, the total loss function can be summarized as \(\mathcal {L} = \sum _{\omega =0}^{\varOmega } (\mathcal {L}^\omega + \mathcal {L}_F^\omega + \mathcal {L}_B^\omega )\), where \(\varOmega \) denotes the total number of refinement stages.
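The total objective can be sketched as below; the argument names and list-of-stages layout are illustrative, not from the released code:

```python
import numpy as np

def total_loss(sr_stages, sr_f_stages, sr_b_stages, hr_frames):
    """L1 training objective with deep supervision (Sect. 2.4).

    Each *_stages argument is a list over refinement stages
    omega = 0 .. Omega, where each entry is an array of shape
    (T, H, W); `hr_frames` has shape (T, H, W).
    """
    def l1(sr):
        # mean over frames of the per-frame L1 norm ||SR - HR||_1
        T = len(hr_frames)
        return sum(np.abs(sr[t] - hr_frames[t]).sum() for t in range(T)) / T

    loss = 0.0
    for sr, sr_f, sr_b in zip(sr_stages, sr_f_stages, sr_b_stages):
        loss += l1(sr) + l1(sr_f) + l1(sr_b)  # main + two auxiliary paths
    return loss
```
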

Table 1. Quantitative results. The red and blue indicate the best and the second-best performance, respectively. We adopt CardiacPSNR/CardiacSSIM to fairly assess the reconstruction quality of the heart region. It is worth noting that the large-scale DSB15SR dataset is entirely for external evaluation.

3 Experiment

3.1 Experimental Settings

Data Preparation. To the best of our knowledge, there is no publicly available CMR dataset for the VSR problem. Hence, we create two datasets, named ACDCSR and DSB15SR, based on public MRI datasets. One is the Automated Cardiac Diagnosis Challenge dataset [1], which contains four-dimensional MRI scans of 150 patients in total. The other is the large-scale Second Annual Data Science Bowl Challenge dataset [26], composed of 2D cine MRI videos containing 30 images across the cardiac cycle per sequence. We use its testing set, comprising 440 patients, for external assessment to verify the robustness and generalization of the algorithms. To accurately mimic the acquisition of LR MRI scans [4, 31], we project the HR MRI videos into the frequency domain by the Fourier transform and filter out the high-frequency information. After that, we apply the inverse Fourier transform to project the videos back into the spatial domain and further downsample them by bicubic interpolation with scale factors of 2, 3, and 4.
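The degradation pipeline can be sketched as follows. The centered rectangular k-space mask and the cubic spline resampling (as a stand-in for bicubic interpolation) are our assumptions; the exact filter shape and interpolation kernel in the paper may differ:

```python
import numpy as np
from scipy.ndimage import zoom

def simulate_lr(hr_frame, scale):
    """Mimic LR MRI acquisition: low-pass filter in k-space, then
    downsample in the spatial domain (Sect. 3.1).
    """
    H, W = hr_frame.shape
    k = np.fft.fftshift(np.fft.fft2(hr_frame))   # to k-space, DC centered
    mask = np.zeros_like(k)
    h, w = H // (2 * scale), W // (2 * scale)
    mask[H // 2 - h:H // 2 + h, W // 2 - w:W // 2 + w] = 1  # keep low freqs
    filtered = np.real(np.fft.ifft2(np.fft.ifftshift(k * mask)))
    return zoom(filtered, 1.0 / scale, order=3)  # cubic-spline downsampling
```

Applying the filter before downsampling, rather than plain decimation, is what distinguishes this MRI-aware degradation from the bicubic-only protocol common in natural-image SR.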

Evaluation Metrics. The PSNR and SSIM criteria have been widely used in previous studies to evaluate SR algorithms. However, the considerable disparity between the proportions of the cardiac region and the background region in MRI images makes the results heavily biased towards the insignificant background region. Therefore, we introduce CardiacPSNR and CardiacSSIM to assess the performance more impartially and objectively. Specifically, we employ a heart ROI detection method similar to [25] to crop the cardiac region and calculate PSNR and SSIM within this region. This reduces the influence of the background region and more accurately reflects the reconstruction quality of the heart region.
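CardiacPSNR then reduces to standard PSNR on the cropped region. The sketch below assumes the ROI bounding box has already been produced by a heart detector similar to [25], which is out of scope here:

```python
import numpy as np

def cardiac_psnr(sr, hr, roi, data_range=1.0):
    """PSNR restricted to a heart ROI (CardiacPSNR).

    `roi` is a (top, bottom, left, right) crop from a heart ROI
    detector; `data_range` is the intensity range of the images.
    """
    t, b, l, r = roi
    diff = sr[t:b, l:r] - hr[t:b, l:r]
    mse = np.mean(diff ** 2)
    if mse == 0:
        return float("inf")
    return 10 * np.log10(data_range ** 2 / mse)
```

CardiacSSIM is computed analogously by evaluating SSIM on the same crop.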

Training Details. For training, we randomly crop LR clips of \(\tilde{T} = 7\) consecutive frames of size \(32\times 32\), together with the corresponding HR clips. We experimentally choose \(n = 6\) and \(\varOmega = 2\) as detailed in Sect. 3.3, while \(N = 2\) in the phase fusion module. We use the Adam optimizer [12] with a learning rate of \(10^{-4}\) and set the batch size to 16. For the other baselines, we generally follow their original settings, except for necessary modifications, and train them from scratch.

Fig. 4.

Experimental analysis. (a) Our network outperforms other baselines with fewer parameters and higher FPS. (b) The performance is progressively enhanced as n increases, which indicates that the prior sequence can provide useful information. (c) The performance can be improved with \(\varOmega \) increasing.

Table 2. Ablation study. Memory: the memory cells in the ConvLSTM [28] are activated; Updated memory: the memory cells are updated by feeding n consecutive frames; Bidirection: bidirectional ConvLSTM is adopted; Phase fusion module and Residual of residual learning: the proposed components are adopted.

3.2 Experimental Results

To confirm the superiority of the proposed approach, we compare our network with multiple state-of-the-art methods, namely EDSR [18], DUF [10], EDVR [27], RBPN [7], TOFlow [30], and FRVSR [22]. We present the quantitative and qualitative results in Table 1 and Fig. 5, respectively. Our approach outperforms almost all existing methods by a large margin at all scales in terms of CardiacPSNR and CardiacSSIM. In addition, our method yields clearer and more photo-realistic SR results that are subjectively closer to the ground truths. Moreover, the results on the external DSB15SR dataset are sufficiently convincing to validate the generalization of the proposed approach. On the other hand, the comparison of model parameters, FPS, and image quality in the cardiac region plotted in Fig. 4a demonstrates that our method strikes the best balance between efficiency and reconstruction performance.

Fig. 5.

Qualitative results. Zoom in to see better visualization.

3.3 Ablation Study

We adopt the unidirectional ConvLSTM as the simplest baseline. As shown in Table 2, temporal information is important, since model performance degrades when the memory cells in the ConvLSTM are disabled. As cardiac MRI videos are cyclic, we can refresh the memory by feeding n successive frames. Accordingly, we analyze the relation between n and model performance. The result in Fig. 4b shows that the network improves significantly as the number of updated frames increases. Moreover, the forward and backward information proves useful and complementary for recovering the lost details.

In Sect. 2.2, we exploit the knowledge of the cardiac phase to better fuse the bidirectional information. The result in Table 2 reveals that the phase fusion module leverages the bidirectional temporal features more effectively. Besides, we explore the influence of the total number of refinement stages \(\varOmega \) in the residual of residual learning. It can be observed from Fig. 4c that the reconstruction performance improves as the number of refinement stages increases. The likely reason for the saturation or degradation of the overall performance when \(\varOmega \) equals 3 or 4 is overfitting (violating Occam's razor).

4 Conclusion

In this work, we define the cyclic cardiac MRI video super-resolution problem, which, to the best of our knowledge, has not yet been addressed. To tackle this issue, we bring cardiac knowledge into our network and employ the residual of residual learning to train in a progressive refinement manner, which enables the model to generate sharper results with fewer parameters. In addition, we build large-scale datasets and introduce cardiac metrics for this problem. Through extensive experiments, we demonstrate that our network outperforms state-of-the-art baselines both qualitatively and quantitatively. Most notably, we carry out an external evaluation, which indicates that our model exhibits good generalization. We believe our approach can be seamlessly applied to other modalities such as computed tomography angiography and echocardiography.