
1 Introduction

Deep learning-based methods have made great progress in the field of medical imaging [4,5,6,7]. Although they have achieved superior performance over traditional methods, their reliability has always been questioned because they are data-driven models that operate as black boxes. Once the testing data fall outside the distribution of the training data, the output can degrade. Therefore, quantifying uncertainty during the inference of deep learning-based methods is of vital importance. In clinical practice especially, uncertainty information can effectively improve the accuracy of diagnosis. Thiagarajan et al. [27] measured uncertainty for the classification of breast histopathological images and verified that high-uncertainty data have lower classification accuracy and need human intervention. Hao et al. [11] showed that adding uncertainty information into a graph attention network can improve the classification performance for parapneumonic effusion diagnosis.

In positron emission tomography (PET) image denoising tasks, without prior information from the raw data, it is not easy for denoising models to distinguish noise from small tumors, as the two have visually similar structures. Small tumors may be eliminated as noise, while some noise may be identified as a tumor and retained. If we can quantify the uncertainty of the denoising process, the uncertainty map can provide an additional reference when radiologists view the denoised PET images. Uncertainty has been well studied in the classic PET literature, particularly after Jeffrey Fessler's seminal work [9] in 1997. However, traditional methods [9, 21] are not accurate and fast enough. Nowadays, several deep learning-based methods for uncertainty estimation have been proposed, such as Bayesian neural networks [18, 19, 24, 26], ensemble methods [17], dropout methods [2, 10], and uncertainty-aware neural networks [3, 20]. For PET image denoising, Sudarshan et al. [26] proposed a robust suDNN to map low-dose PET images to standard-dose PET images and estimated the uncertainty through a Bayesian neural network framework. Cui et al. [8] used a novel neural network, the Nouveau variational autoencoder (NVAE) [28], to generate the denoised image and uncertainty map based on Monte Carlo sampling.

In this paper, we developed an efficient and effective model for simultaneous PET image denoising and uncertainty estimation. We also utilized NVAE as our network structure due to its superior performance over the traditional VAE model. Compared with Reference [8], our work has two main contributions:

  • Quantile regression loss [1, 13, 16] was introduced as the reconstruction loss to avoid the variance shrinkage problem of VAE. In a VAE, once the conditional mean network is well trained (i.e., there is no reconstruction error), continuing to maximize the log-likelihood drives the estimated variance toward zero [1], yielding an artificially narrow output distribution that limits the model's generalization ability and reduces sample diversity. Replacing the log-likelihood with the quantile regression loss avoids this problem.

  • The variance map was directly calculated from the estimated quantiles, so Monte Carlo sampling is no longer needed, saving substantial time and computing resources.

2 Related Work

2.1 Variational Autoencoder (VAE)

VAE [14] is a generative model trained to represent the distribution of a signal \(\boldsymbol{x}\) given latent variables \(\boldsymbol{z}\) with a known prior distribution \(p(\boldsymbol{z})\). \(p(\boldsymbol{x},\boldsymbol{z})=p(\boldsymbol{z})p(\boldsymbol{x}|\boldsymbol{z})\) represents the true joint distribution of \(\boldsymbol{x}\) and \(\boldsymbol{z}\), where \(p(\boldsymbol{x}|\boldsymbol{z})\) is the likelihood function or decoder. Let \(q(\boldsymbol{x})\) be the distribution of \(\boldsymbol{x}\) and \(q(\boldsymbol{z}|\boldsymbol{x})\) be the approximate posterior distribution or encoder; then \(q(\boldsymbol{x},\boldsymbol{z})=q(\boldsymbol{x})q(\boldsymbol{z}|\boldsymbol{x})\) serves as an estimate of \(p(\boldsymbol{x},\boldsymbol{z})\). VAE is trained to make \(q(\boldsymbol{x},\boldsymbol{z})\) close to the true joint distribution \(p(\boldsymbol{x},\boldsymbol{z})\) by minimizing their KL divergence.
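
Minimizing this joint KL is equivalent (up to the fixed entropy of the data distribution) to maximizing the evidence lower bound (ELBO). The toy PyTorch sketch below is our own illustration, not code from [14], and all layer sizes are arbitrary; it minimizes the negative ELBO, i.e., a reconstruction term plus the closed-form KL between the Gaussian encoder and the standard Normal prior:

```python
import torch
import torch.nn as nn

class ToyVAE(nn.Module):
    """Minimal VAE with a Gaussian encoder q(z|x) and a decoder p(x|z)."""

    def __init__(self, x_dim: int = 784, z_dim: int = 16):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(x_dim, 256), nn.ReLU())
        self.fc_mu = nn.Linear(256, z_dim)
        self.fc_logvar = nn.Linear(256, z_dim)
        self.dec = nn.Sequential(nn.Linear(z_dim, 256), nn.ReLU(),
                                 nn.Linear(256, x_dim))

    def loss(self, x: torch.Tensor) -> torch.Tensor:
        h = self.enc(x)
        mu, logvar = self.fc_mu(h), self.fc_logvar(h)
        # Reparameterization trick: sample z ~ q(z|x).
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)
        x_hat = self.dec(z)
        # Reconstruction term (Gaussian log-likelihood up to constants).
        rec = ((x_hat - x) ** 2).sum(dim=1)
        # Closed-form KL(q(z|x) || N(0, I)) for diagonal Gaussians.
        kl = 0.5 * (mu.pow(2) + logvar.exp() - 1.0 - logvar).sum(dim=1)
        return (rec + kl).mean()  # negative ELBO
```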

2.2 Nouveau Variational Autoencoder (NVAE)

NVAE [28] is a model recently proposed by NVIDIA that can generate high-quality face images as large as \(256\times 256\). Through several novel architectural designs, NVAE greatly improved generative performance for complex signal distributions. A hierarchical multi-scale structure is added between the encoder and the generative model to boost the expressiveness of the approximate posterior \(q(\boldsymbol{z}|\boldsymbol{x})\) and the prior \(p(\boldsymbol{z})\). In this hierarchical multi-scale structure, the latent variables of NVAE are separated into L disjoint groups, \({\boldsymbol{z}} = \left\{ {{\boldsymbol{z}_1},{\boldsymbol{z}_2}, \ldots ,{\boldsymbol{z}_L}} \right\} \). Thus, the prior can be represented as \(p({\boldsymbol{z}_1},{\boldsymbol{z}_2}, \ldots ,{\boldsymbol{z}_L}) = \prod \nolimits _{l = 1}^L {p\left( {{\boldsymbol{z}_l}|{\boldsymbol{z}_{ < l}}} \right) } \) and the approximate posterior is \(q({\boldsymbol{z}_1},{\boldsymbol{z}_2}, \ldots ,{\boldsymbol{z}_L}|\boldsymbol{x}) = \prod \nolimits _{l = 1}^L {q\left( {{\boldsymbol{z}_l}|\boldsymbol{x},{\boldsymbol{z}_{ < l}}} \right) } \). Since the large number of latent variable groups would make the network hard to optimize and unstable, NVAE proposed residual Normal distributions that parameterize the approximate posterior \({{q(z}}_l^i|{\boldsymbol{z}_{< l}},\boldsymbol{x}): = { \mathcal {N}}({\mu _i}\left( {\boldsymbol{z}_{< l}} \right) + \varDelta {\mu _i}\left( {\boldsymbol{z}_{< l},\boldsymbol{x}} \right) ,{\sigma _i} \left( {\boldsymbol{z}_{< l}} \right) \cdot \varDelta {\sigma _i}\left( {\boldsymbol{z}_{ < l},\boldsymbol{x}} \right) )\) relative to the prior \({{p(z}}_l^i|{\boldsymbol{z}_{< l}}): = { \mathcal {N}}({\mu _i}\left( {{\boldsymbol{z}_{< l}}} \right) ,{\sigma _i}\left( {{\boldsymbol{z}_{ < l}}} \right) )\), where \(z_l^i\) is the ith variable in \(\boldsymbol{z}_l\). In addition, spectral regularization [30], inverse autoregressive flow [15], depthwise convolutions, squeeze-and-excitation [12], batch normalization, and the swish activation [22] were added either to better model long-range correlations or to stabilize training.
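
The residual Normal parameterization admits a compact implementation. The sketch below is our own simplification for a single latent group: the top-down prior path supplies \((\mu, \sigma)\), while the encoder path, conditioned on \(\boldsymbol{x}\), supplies only the offsets \((\varDelta \mu, \varDelta \sigma)\):

```python
import torch

def sample_residual_posterior(mu_p: torch.Tensor, log_sig_p: torch.Tensor,
                              d_mu: torch.Tensor, d_log_sig: torch.Tensor):
    """Residual Normal of NVAE: q = N(mu_p + d_mu, sig_p * d_sig),
    where (mu_p, sig_p) come from the top-down prior path and
    (d_mu, d_sig) come from the encoder path conditioned on x."""
    mu_q = mu_p + d_mu
    log_sig_q = log_sig_p + d_log_sig  # log(sig_p * d_sig)
    # Reparameterized sample from the posterior.
    z = mu_q + torch.randn_like(mu_q) * log_sig_q.exp()
    return z
```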

3 Methods

3.1 Overview

Figure 1 shows the diagram of the proposed simultaneous PET image denoising and uncertainty estimation framework. The NVAE network was trained with supervision by minimizing the quantile regression loss and the Kullback-Leibler (KL) divergence term. The output distribution was assumed to be a Logistic distribution. After training, the location parameter \(\mu \) and the scale parameter s were acquired from the estimated quantiles. Thus, the mean and variance of the Logistic distribution can be calculated.

Fig. 1. Diagram of the proposed framework.

3.2 PET Image Denoising

For the PET image denoising task, given a group of noisy PET images \(\boldsymbol{x}'\) and clean PET images \(\boldsymbol{x}\), the conditional log-likelihood function is \(\log p(\boldsymbol{x}|\boldsymbol{x}')\). Based on the conditional VAE [25], the variational lower bound of this log-likelihood can be written as:

$$\begin{aligned} \log p(\boldsymbol{x}|\boldsymbol{x}') \ge {{\mathbb {E}}_{q({\boldsymbol{z}}|\boldsymbol{x}')}}\log p(\boldsymbol{x}|\boldsymbol{x}',\boldsymbol{z}) - KL(q(\boldsymbol{z}|\boldsymbol{x}')||p(\boldsymbol{z}|\boldsymbol{x}')). \end{aligned}$$
(1)

In this work, we assumed \(p(\boldsymbol{z}|\boldsymbol{x}')=p(\boldsymbol{z})\), which follows a Normal distribution. Skip connections were added between the encoder and the generative model to provide information from the input \(\boldsymbol{x}'\) to the likelihood function \(p(\boldsymbol{x}|\boldsymbol{x}',\boldsymbol{z})\). Thus, the loss can be written as:

$$\begin{aligned} \mathcal {L}(\boldsymbol{x})=-{{\mathbb {E}}_{q({\boldsymbol{z}}|\boldsymbol{x}')}}\log p(\boldsymbol{x}|\boldsymbol{x}',\boldsymbol{z}) + KL(q(\boldsymbol{z}|\boldsymbol{x}')||p(\boldsymbol{z})). \end{aligned}$$
(2)

The first term of Eq. 2 represents the reconstruction loss \(\mathcal {L}_{REC}(\boldsymbol{x})\) and the second term is the regularization term \(\mathcal {L}_{KL}(\boldsymbol{x})\).

3.3 Quantile Regression Loss

Quantile regression (QR) [16] was proposed by Koenker et al. in 1978; it estimates the nth \((0<n<1)\) quantile \(Q_n\) by solving the minimization problem

$$\begin{aligned} \mathop {\min }\limits _\theta \sum \limits _t {{\rho _n}\left( {{x_t} - {f_\theta }({x'_t})} \right) }, \end{aligned}$$
(3)

where \(x'_t\) and \(x_t\) are the tth input and response, respectively, and \({f_\theta }({x'_t})\) represents the NVAE model with parameters \(\theta \), which approximates the quantile \(Q_n\) after optimization. \({\rho _n}\) is the check function:

$$\begin{aligned} {\rho _n}(u) = u(nI(u > 0) - (1 - n)I(u < 0)). \end{aligned}$$
(4)
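
Eq. 4 is the standard pinball (check) loss; a direct PyTorch implementation might look like the following sketch (the elementwise values would be summed or averaged over voxels in practice):

```python
import torch

def check_loss(u: torch.Tensor, n: float) -> torch.Tensor:
    """Check function rho_n(u) of Eq. 4, applied to u = x - f_theta(x').
    Under-prediction (u > 0) is weighted by n, over-prediction by (1 - n),
    so the minimizer of its expectation is the nth conditional quantile."""
    return torch.where(u > 0, n * u, (n - 1.0) * u)

# For n = 0.5 the check loss reduces to half the absolute error:
u = torch.tensor([-2.0, 0.0, 1.0])
print(check_loss(u, 0.5))  # tensor([1.0000, 0.0000, 0.5000])
```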

To avoid the variance shrinkage problem, we replaced the reconstruction loss with the quantile regression loss and calculated the mean and variance of the output distribution from the estimated quantiles. We assumed that the output distribution follows the Logistic distribution, whose cumulative distribution function (CDF) is:

$$\begin{aligned} F(y;\mu ,s) = \frac{1}{{1 + {e^{ - (y - \mu )/s}}}} \end{aligned}$$
(5)

where \(\mu \) is the mean and s is the scale parameter, which is related to the variance by \({\sigma }^2 = \frac{{{s^2}{\pi ^2}}}{3}\). Setting the CDF equal to n, we can obtain two quantiles:

$$\begin{aligned} {Q_n} = \left\{ {\begin{array}{*{20}{c}} {\mu ,n = 0.5}\\ {\mu - s,n = \frac{1}{{1 + e}} \approx 0.269} \end{array}} \right. \end{aligned}$$
(6)
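
Consequently, the denoised image and the uncertainty map follow from the two estimated quantile maps by simple arithmetic; the sketch below (function and variable names are ours) makes this explicit:

```python
import math
import torch

def logistic_moments(q_050: torch.Tensor, q_0269: torch.Tensor):
    """Recover the Logistic parameters from the estimated quantiles
    (Eq. 6): Q_0.5 = mu and Q_0.269 = mu - s."""
    mu = q_050                            # mean, i.e. the denoised PET image
    s = q_050 - q_0269                    # scale parameter
    var = s.pow(2) * math.pi ** 2 / 3.0   # variance map, no MC sampling needed
    return mu, s, var
```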

The QR loss aims to estimate the 0.5th quantile and the 0.269th quantile by:

$$\begin{aligned} \mathcal {L}_{QR}(\boldsymbol{x}) = \sum \limits _t {{\rho _{0.5}}\left( {{x_t} - {f_{{\theta _{0.5}}}}({{x'}_t})} \right) } + \sum \limits _t {{\rho _{0.269}}\left( {{x_t} - {f_{{\theta _{0.269}}}}({{x'}_t})} \right) } \end{aligned}$$
(7)

Thus, the total training loss for the proposed work is

$$\begin{aligned} \mathcal {L}(\boldsymbol{x})=\mathcal {L}_{QR}(\boldsymbol{x})+\mathcal {L}_{KL}(\boldsymbol{x}). \end{aligned}$$
(8)
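
Assembling Eqs. 7 and 8 in code, and reusing `check_loss` from the sketch above, one training objective could read as follows; `model` is a hypothetical wrapper assumed to return the two quantile predictions together with the KL term accumulated over all latent groups:

```python
def total_loss(model, x_noisy: torch.Tensor, x_clean: torch.Tensor):
    """L = L_QR + L_KL (Eq. 8); `model` is assumed to output the 0.5th
    and 0.269th quantile predictions plus the summed KL divergence."""
    q_050, q_0269, kl = model(x_noisy)
    l_qr = (check_loss(x_clean - q_050, 0.5).mean()
            + check_loss(x_clean - q_0269, 0.269).mean())  # Eq. 7
    return l_qr + kl  # Eq. 8
```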

4 Experiment

4.1 Dataset

The proposed method was validated on real \(^{11}\)C-DASB datasets containing 26 subjects. Each subject was scanned on the HRRT-PET system (Siemens Medical Solutions, Knoxville, TN, USA) with an administered dose of 577.6 ± 41.0 MBq. PET images acquired at 60 min post-injection (matrix size, \(256 \times 256 \times 207\); voxel size, \(1.21875\,\textrm{mm} \times 1.21875\,\textrm{mm} \times 1.21875\,\textrm{mm}\)) were reconstructed by the 3D ordinary Poisson ordered subset expectation maximization (OP-OSEM3D) algorithm with attenuation correction. Each subject also has a T1-weighted image acquired on an ultra-high-field 7T MRI scanner. We generated the low-quality PET images by down-sampling the list-mode data to a quarter of the original counts. In this dataset, 20 subjects were used for training, 3 for validation, and 3 for testing.

4.2 Data Analysis

Supervised denoising using a Unet as the network structure was chosen as the baseline against which to compare the denoising performance of the proposed method (NVAE-QR). We also compared with the work in Reference [8], which used NVAE for PET image denoising and uncertainty estimation based on Monte Carlo sampling (NVAE-MC). We evaluated the denoising performance by calculating the peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM) [29] using the high-quality PET image as the reference. We used the Logistic distribution as the output distribution for NVAE-QR, while NVAE-MC used the mixture of discretized Logistic distributions [23].
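
For reference, both metrics are available in scikit-image; a minimal evaluation sketch (assuming the denoised and reference volumes are co-registered NumPy arrays on the same intensity scale) is:

```python
import numpy as np
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

def evaluate(denoised: np.ndarray, reference: np.ndarray):
    """PSNR and SSIM of a denoised volume against the full-dose reference."""
    rng = float(reference.max() - reference.min())
    psnr = peak_signal_noise_ratio(reference, denoised, data_range=rng)
    ssim = structural_similarity(reference, denoised, data_range=rng)
    return psnr, ssim
```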

The network was trained on 2.5D patches of size \(3 \times 256 \times 256\), with a batch size of 20, 200 training epochs, and a learning rate of 0.01. All experiments were performed in PyTorch 1.9.1. The number of Monte Carlo samples for NVAE-MC was 500.
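
These hyperparameters map onto a conventional PyTorch training loop, sketched below under the assumption that `model`, `loader` (yielding the 2.5D patches), and `total_loss` from Sect. 3.3 are defined; the optimizer choice is our assumption, as the paper does not state it:

```python
import torch

EPOCHS, LR = 200, 0.01  # as stated above; the batch size 20 is set in `loader`

optimizer = torch.optim.SGD(model.parameters(), lr=LR)  # optimizer assumed
for epoch in range(EPOCHS):
    for x_noisy, x_clean in loader:  # 2.5D patches of size 3 x 256 x 256
        optimizer.zero_grad()
        loss = total_loss(model, x_noisy, x_clean)  # Eq. 8
        loss.backward()
        optimizer.step()
```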

Table 1. The PSNR and SSIM of the noisy PET image and the denoised results of Unet, NVAE-MC, and NVAE-QR.
Fig. 2. The output of our NVAE-QR method. From left to right: the estimated 0.269th quantile, the estimated 0.5th quantile, the scale parameter s, and the variance calculated from s. The color bar shows the normalized intensity value. (Color figure online)

5 Results

Figure 2 shows one axial slice from the output of our NVAE-QR method. The network directly estimated the 0.5th and 0.269th quantiles, which are equal to \(\mu \) and \(\mu -s\), respectively. The scale parameter s is obtained from the difference image of the two quantiles, from which the variance image is then calculated. Figure 3 displays the denoising results for Unet, NVAE-MC, and NVAE-QR. All the neural network denoised results significantly improved the quality of the low-dose image and even look better than the full-dose image. Among them, our NVAE-QR shows the clearest cortex structure (as shown in the zoomed-in subfigures). As \(^{11}\)C has a short half-life and decays fast, \(^{11}\)C-DASB PET images usually have a higher noise level than \(^{18}\)F-FDG images. Thus, the full-dose PET image shown in Fig. 3 is also noisy and cannot serve as a sufficient reference for cortical regions; the corresponding MR image is displayed to provide an additional reference. We also plotted the profiles of the denoised results of the different methods in Fig. 4, where NVAE-QR matches the high-intensity peak better than the other two methods. The PSNR and SSIM of the different denoising methods are listed in Table 1. The proposed NVAE-QR method achieves the highest mean (±SD) PSNR (40.64 ± 5.71) and SSIM (0.9807 ± 0.0063) compared with Unet (PSNR, 36.18 ± 5.55; SSIM, 0.9614 ± 0.0121) and NVAE-MC (PSNR, 37.00 ± 5.35; SSIM, 0.9671 ± 0.0095). The variance maps of NVAE-MC and NVAE-QR are shown in Fig. 5. The variance of NVAE-MC is lower than that of NVAE-QR due to the variance shrinkage problem. As an ablation study, we also compared the performance of NVAE-MC using the Logistic distribution and NVAE-QR using the Normal distribution. The PSNR and SSIM in Table 2 show that NVAE-QR with the Logistic distribution performs best.

Fig. 3. The denoising results of different methods. From left to right: the corresponding MR image, the full-dose PET image, the low-dose PET image, and the denoised images using Unet, NVAE-MC, and NVAE-QR. The region in the red box is zoomed in for each image. The color bar shows the normalized intensity value. (Color figure online)

Fig. 4. The profiles along the black line for the full-dose PET image and the denoised images using Unet, NVAE-MC, and NVAE-QR.

6 Discussion

In this work, we used the quantile regression loss to replace the original reconstruction probability loss of the NVAE model to avoid the variance shrinkage problem [1]. The mean value of the variance map of NVAE-QR is 0.00142, which is 1.58 times that of NVAE-MC (0.000898), showing that NVAE-QR does avoid variance shrinkage. From Table 2 we can see that using quantile regression significantly improves the PSNR and SSIM of the denoised images.

In addition to its superior denoising capability, another advantage of NVAE-QR is that it saves substantial sampling time. NVAE-MC needed 35 min to generate the result for one subject due to the repetitive sampling process, while NVAE-QR only needed 4 s. There is little difference between the 1-sample and 500-sample NVAE-QR results (the comparison is provided in the supplementary materials). Thus, NVAE-QR takes only about 1/500 of the time of NVAE-MC, which is much more efficient.

Fig. 5. The denoised PET images and variance maps of NVAE-MC and NVAE-QR using different output distributions. From left to right: the full-dose PET image, NVAE-MC based on the Logistic distribution, NVAE-MC based on the mixture of discretized Logistic distributions, NVAE-QR based on the Normal distribution, and NVAE-QR based on the Logistic distribution. The color bar shows the normalized intensity value. (Color figure online)

Table 2. The PSNR and SSIM of NVAE-MC and NVAE-QR using different output distributions.

We evaluated the performance of different output distribution models (as shown in Fig. 5 and Table 2). We found that the Normal distribution cannot fit PET data well (it usually yields a low SSIM), while both the Logistic distribution and the mixture of discretized Logistic distributions [23] work well. However, the mean and variance of the mixture of discretized Logistic distributions [23] are hard to calculate, as the mixture parameters are learned by the network. Thus, we assumed that the output follows the Logistic distribution. We also evaluated the proposed method on whole-body \(^{18}\)F-FDG datasets. The results show that NVAE-QR achieved higher PSNR, SSIM, and contrast-to-noise ratio (CNR) improvement than Unet and NVAE-MC. The results on the brain and whole-body datasets show that our model works well under this output assumption and has good generalization ability.

7 Conclusion

In this paper, we proposed an NVAE model using quantile regression loss for simultaneous PET image denoising and uncertainty estimation. With the quantile regression loss, the variance shrinkage problem caused by the reconstruction probability loss is avoided and the sampling time is saved. Experiments on a real \(^{11}\)C-DASB HRRT-PET dataset verified that the proposed NVAE-QR method outperforms Unet-based denoising and NVAE-based Monte Carlo sampling. Our future work will focus on more clinical evaluations.