Abstract
Purpose
This study aims to develop deep learning techniques on total-body PET to bolster the feasibility of sedation-free pediatric PET imaging.
Methods
A deformable 3D U-Net was developed based on 245 adult subjects with standard total-body PET imaging for the quality enhancement of simulated rapid imaging. The developed method was first tested on 16 children receiving total-body [18F]FDG PET scans with standard 300-s acquisition time with sedation. Sixteen rapid scans (acquisition time about 3 s, 6 s, 15 s, 30 s, and 75 s) were retrospectively simulated by selecting the reconstruction time window. In the end, the developed methodology was prospectively tested on five children without sedation to prove the routine feasibility.
Results
The approach significantly improved the subjective image quality and lesion conspicuity in abdominal and pelvic regions of the generated 6-s data. In the first test set, the proposed method enhanced the objective image quality metrics of 6-s data, such as PSNR (from 29.13 to 37.09, p < 0.01) and SSIM (from 0.906 to 0.921, p < 0.01). Furthermore, the errors of mean standardized uptake values (SUVmean) for lesions between 300-s data and 6-s data were reduced from 12.9 to 4.1% (p < 0.01), and the errors of max SUV (SUVmax) were reduced from 17.4 to 6.2% (p < 0.01). In the prospective test, radiologists reached a high degree of consistency on the clinical feasibility of the enhanced PET images.
Conclusion
The proposed method can effectively enhance the image quality of total-body PET scanning with ultrafast acquisition time, leading to meeting clinical diagnostic requirements of lesion detectability and quantification in abdominal and pelvic regions. It has much potential to solve the dilemma of the use of sedation and long acquisition time that influence the health of pediatric patients.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
Over the past several decades, [18F]fluorodeoxyglucose positron emission tomography ([18F]FDG PET) imaging has been widely used in both clinical practice and research [1,2,3]. Recently, the application of PET in pediatric diseases has gained significant interest, proving to be a beneficial diagnostic instrument for evaluating both pediatric tumors and non-tumor diseases [3, 4]. [18F]FDG PET has been employed in the diagnosis of pediatric solid tumors, hematological tumors, post-transplant lymphoproliferative disease (PTLD), and fever of unknown origin (FUO) [5,6,7]. It is routinely used for various purposes such as staging, evaluating treatment response, monitoring disease progression, and detecting recurrence in numerous malignancies [8].
However, compared with adult PET imaging, the process of pediatric PET examination is more intricate [9, 10]. Immobilization in pediatric patients can be accomplished using body wraps or specialized holding devices. Parents are often allowed to stay with the child throughout the study to provide emotional support [11]. Maintaining stillness during the examination is crucial in pediatric imaging, as minor movements can lead to artifacts. Sedation is typically recommended to inhibit movement during the examination and ensure satisfactory image quality [12]. However, sedation comes with certain drawbacks: (1) It may potentially affect the neurodevelopment of children [13]; (2) despite sedation, some children may not cooperate or may exhibit involuntary movements during the examination; and (3) the potential effects of sedation on the biodistribution of the FDG tracer must be taken into account [11, 14]. Consequently, research efforts are increasingly focusing on expedited PET scanning and accurate diagnosis without the need for sedation.
Various methods were employed to compensate or correct for movement during PET imaging, such as the use of external devices that have proven successful in detecting motion [15]. However, the routine clinical application of these devices was impeded by challenges including limited and complex integration with the scanner, additional patient setup time, patient discomfort, and a relatively high failure rate in some cases. An alternative approach involved extracting motion information directly from the PET data by reconstructing a time series of short-duration PET images [16]. It was worth noting that the limitation of current data-driven methods for PET was their reliance on the CT image used for attenuation and scatter correction.
Ultrafast total-body PET imaging offered a balance between acquisition time and image reconstruction quality [17]. Total-body PET/CT enabled simultaneous PET acquisition of the entire human body with a single bed. This scanner boasted superior sensitivity of 174 kcps/MBq and commendable spatial resolution capabilities for human imaging (\(\le\) 3.0 mm FWHM near the center of the AFOV). As a result, the sensitivity of the PET scanner was greatly improved, and the acquisition time is significantly shortened [18]. In the study of [19], despite the noticeable visual distinctions evident in total-body PET images with 30-s and 300-s acquisition time, there existed no substantive variance in diagnostic accuracy between them.
Deep learning–based frameworks emerged as powerful tools, providing solutions to various tasks in medical imaging, including disease diagnosis, tumor segmentation, and image post-reconstruction [20,21,22,23]. Several generative models [9], e.g., employing convolutional neural networks (CNN) or generative adversarial networks (GAN), drew researchers’ attention due to their superior results in enhancing the quality of PET images compared to traditional approaches, all while maintaining low computational costs. However, to our best knowledge, a great number of deep learning–based methods focused on brain PET images instead of total-body PET data of pediatric patients [3, 9]. Additionally, popular GAN-based frameworks usually suffered from instability and model collapse during the training process, as obtaining the Nash equilibrium point was challenging and the training process in GANs was complex [24].
In this prospective study, we developed a deformable 3D U-Net on total-body PET to bolster the feasibility of sedation-free pediatric PET imaging (acquisition time durations of about 6 s, \(\ge\) 50-fold reduced scan duration compared to standard clinical routine). The proposed model was trained on 245 adult subjects and initially tested on 16 pediatric patients under sedation (weight 4.7–17.0 kg, age 0.4–3.5 years). Five rapid scans (acquisition times of approximately 3 s, 6 s, 15 s, 30 s, and 75 s) were retrospectively simulated by selecting the reconstruction time window. A variety of metrics, including quantitative comparisons of PET signal recovery and clinical reading scores, were adopted to compare the physical image quality, the preservation of clinical information, and lesion detectability between synthesized PET images and ground truth, i.e., full-time PET scans (300-s acquisition time). In the end, two experienced radiologists evaluated the ultrafast PET images of five children without sedation (weight 10.0–16.8 kg, age 1.2–2.9 years, 6-s scan time) enhanced by the proposed approach, which validated the detectability and clinical feasibility of synthesized PET images. The experimental results also demonstrated that the proposed method, based on ultrafast PET scanning, has the potential to mitigate the known side effects associated with the use of sedation in pediatric patients.
Materials and methods
Patients
This study was a prospective single-center imaging investigation that encompasses three distinct datasets. The first dataset comprised a substantial cohort of 245 adult subjects for the training and hyperparameter tuning of the proposed deep neural network. The second dataset (named test set 1) consisted of a smaller, independent group involving 16 pediatric subjects who underwent sedation due to suspected tumors, serving as the primary test set. The third dataset, denoted as test set 2, encompasses a prospective group of five children without sedation. All the patients, who underwent total-body PET/CT at the Department of Nuclear Medicine, Ren Ji Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China, were enrolled for tumor diagnosis staging from January 2022 to January 2023. The selected pediatric dataset had the following inclusion criteria: (1) age under 4 years, (2) underwent total-body PET scan, and (3) had a definite surgical pathology or follow-up diagnosis. Among them, 16 children were orally sedated 20 min before the PET scan while five children did not take any sedation during the PET scan. The study was approved by the institutional review board of Ren Ji Hospital, Shanghai Jiao Tong University School of Medicine, and the informed consent was obtained from all patients’ legal guardians. The characteristics of all patients are presented in Table 1.
PET imaging protocol
All adult patients were required to fast for at least 6 h prior to [18F]FDG administration. A total-body PET/CT scanner with an AFOV (axial transverse field of view) of 194 cm (uEXPLORER, United Imaging Healthcare, Shanghai, China) was performed 1 h after [18F]FDG injection. As shown in Fig. 1, the acquisition time of list-mode PET data was 300 s, and all the PET images, including test sets 1 and 2, were reconstructed 75 s, 30 s, 15 s, 6 s, and 3 s by adopting OSEM (ordered-subset expectation maximization) with the following parameters: time of flight (TOF) and point spread function (PSF) modeling; two iterations and 20 subsets; matrix 256 × 256; slice thickness of 2.89 mm; and pixel size, 2.34 × 2.34 × 2.89 mm3 with a Gaussian post-filter (3 mm). Meanwhile, some critical correction approaches, including attenuation and scatter correction, were implemented. All images were assessed on a commercial medical image processing workstation (uWS-MI, United Imaging Healthcare) to ensure a better comparison.
Data processing
In this study, the adult dataset with 245 subjects was utilized exclusively during the entire training process, where the primary focus was to train the proposed model and optimize parameters to achieve superior image reconstruction quality. During the training stage, the adult dataset was divided into tenfolds, with each fold containing approximately 10% of PET images. The proposed approach underwent a tenfold cross-validation procedure, which involved ten iterations. In each iteration, onefold was designated as the validation set, while another fold was used as the test set, and the remaining eightfolds were used for model training. The above procedure was repeated ten times to ensure stable and average performance. Once the optimal training strategy and hyperparameter groups were found, the training and tuning process was halted. Subsequently, all subjects in the adult dataset were adopted to train the final version of the proposed model based on the optimized hyperparameter groups. Finally, the whole subjects from two pediatric datasets were fed into the well-trained neural network model to generate the synthesized PET images. This was done to assess the effectiveness and the routine feasibility of intelligent ultrafast total-body PET for sedation-free pediatric [18F]FDG imaging.
Deep neural network model
In this study, a deep convolutional neural network named deformable 3D U-Net was devised to synthesize ultrafast total-body PET images for sedation-free pediatric [18F]FDG imaging. Inspired by DeepPET [25], the proposed framework adopted a U-Net-like architecture, consisting of two parts: the encoding path and the decoding path, as shown in Fig. 2. In the encoding path, we designed a novel spatial deformable aggregation block (SDAB) with residual projection to adaptively extract spatial information from a larger receptive field. In the decoding path, we also employed SDAB to reduce the adverse effects of image noise, reconstruct textures and boundaries, and enhance the brightness and contrast of total-body [18F]FDG PET images. Meanwhile, we proposed a novel multi-scale self-attention module (MSSAM) in the decoding path to exploit multi-scale self-similarity prior, capture long-range dependency, and break scale constraint, producing more prosperous and more faithful details. The model was trained by designing the pairs of adult PET images for ultrafast scan (3 s, 6 s, 15 s, 30 s, and 75 s) and standard reference (300 s) and then tested on the datasets of pediatric patients with/without sedation. More information about the architecture of proposed deformable 3D U-Net and the training processing was attached in the Supplementary material.
Evaluation metrics and statistical analysis
For qualitative assessments, the quality of all synthesized PET images with different acquisition times was evaluated by two experienced nuclear radiologists (i.e., Ruohua Chen with 8 years of experience and Xiang Zhou with 20 years of experience in PET-related diagnosis). The opinion scores were given by them to assess the clinical feasibility, e.g., metabolic details at different body regions. The images in test sets 1 and 2 were presented to the two radiologists for independent reading in a randomized order to minimize potential bias. They were also blinded to the patient history and the acquisition time. According to the widely recognized reading standard in many recent studies [3, 26, 27], the radiologists independently assigned an image quality score on a five-point Likert scale, and the evaluation standards are shown in Table 2. The five-point Likert scale was used to evaluate three aspects: (1) the conspicuity of the organ anatomical structures, (2) the conspicuity of the major suspected malignant lesions, and (3) the image noise. Paired sample t-tests were also utilized to evaluate the statistical significance of opinion scores between two radiologists.
For quantitative evaluation, three classical quantitative metrics were included in this study to assess the reconstruction performance of these [18F]FDG PET images with different acquisition times: peak signal-to-noise ratio (PSNR), structural similarity (SSIM), and feature similarity (FSIM). The PSNR was defined as
where V was the total amounts of voxels and R represented the range of the intensity of the PET image with 300-s acquisition times, and \({\Vert y-x\Vert }_{2}^{2}\) computed the mean squared error between it (\(x\)) and the reconstructed PET image (\(y\)). The pixel-wise quantities were easily calculated and compared, which also had straightforward interpretations. However, they did not correspond well with the sort of errors that humans perceived, particularly blurring and smearing artifacts. Additional measures that more accurately reflected perceived image quality were therefore desirable. The SSIM was defined as
where \({\mu }_{x}\) and \({\mu }_{y}\) were the averages of images \(x\) (i.e., the PET image with 300-s acquisition times) and \(y\) (i.e., the corresponding reconstructed PET image), and \({\sigma }_{x}\) and \({\sigma }_{y}\) were their standard deviations, respectively. \({C}_{1}\) and \({C}_{2}\) were two positive constants to avoid a null denominator and they were usually fixed at \(1\times {10}^{-6}\) and \(3\times {10}^{-6}\). Analogously, the FSIM was defined as
where \(x\) represented the PET image with 300-s acquisition times, \(y\) represented the corresponding reconstructed PET image, \(G(x)\) represented the gradient magnitude of image \(x\), \(G(y)\) represented the gradient magnitude of image \(y\), and \(S\left(x\right)\) and \(S\left(y\right)\) represented the contrast information of image \(x\) and image \(y\), respectively. This formula’s numerator was the weighted sum of the product of the gradient and contrast information from the two images. Conversely, the denominator of this formula was the weighted sum of the product of the squared sum of gradient information from the two images. Theoretically, images with higher PSNR and values closer to 1 for SSIM and FSIM were considered to be of higher quality in the context of image synthesis and reconstruction.
For lesion detectability and clinical feasibility of the proposed method, imaging metrics, i.e., the mean standardized uptake value (SUVmean) and the maximum standardized uptake value (SUVmax), were employed as tracer measures in the selected lesions. These lesions were manually delineated and subjected to review by a nuclear medicine physician under the supervision of a radiologist. To delineate the statistical significance, paired t-tests were used to compare the objective image values (SUVmean and SUVmax) and the quality metric values (PSNR, SSIM, and FSIM) between the ground truth (established as the standard reference, i.e., PET images acquired with a 300-s acquisition time) and the outcomes derived from the proposed deformable 3D U-Net. A p value of < 0.05 was considered to indicate statistical significance.
Results
Objective image quality
The quantitative and qualitative results of test set 1 are depicted in Fig. 3 and Tables 3 and 4. As shown in Fig. 3, the synthesized PET images were of high visual image quality (less noise) and present significant improvements in comparison to the original total-body PET images. Here, [18F]FDG PET images with 300-s acquisition time were established as the ground truth. As delineated in Table 3, as the acquisition time decreased to 30 s, 15 s, 6 s, and 3 s, the PSNR of original PET images diminished to \(35.71\pm 7.63\), \(32.24\pm 7.65\), \(29.13\pm 7.81\), and \(28.92\pm 7.70\), respectively. Correspondingly, SSIM decreased to \(0.917\pm 0.07\), \(0.913\pm 0.07\), \(0.906\pm 0.07\), and \(0.888\pm 0.07\), while FSIM decreased to \(0.925\pm 0.06\), \(0.918\pm 0.06\), \(0.911\pm 0.06\), and \(0.895\pm 0.05\). These trends revealed a positive correlation between quantitative values of PSNR, SSIM, and FSIM and the length of acquisition time. The results were presented in the form of a ± b, where a and b represented the average and standard deviation values of each metric based on tenfold cross-validation experiments.
The quantitative results of the corresponding synthesized PET images, including the average PSNR, SSIM, and FSIM, are presented in Table 4. The p-values, calculated by comparing ultrafast PET images with varying acquisition times against their corresponding synthesized PET images (e.g., Table 3 vs. Table 4), were also included in the analysis. It was observed that the proposed deformable 3D U-Net yields substantial improvements, enhancing PSNR by 23.06%, 27.33%, 18.08%, and 11.65%; SSIM by 2.48%, 1.66%, 1.53%, and 1.64%; and FSIM by 2.57%, 1.87%, 1.31%, and 1.62%, when compared to the original ultrafast PET images. These improvements were statistically significant (p < 0.01 in t-tests). It was noteworthy that, for ultrafast PET images with longer acquisition times (e.g., 75 s), the increment ratios were not as pronounced as those observed with shorter acquisition time (e.g., 3 s or 6 s). These findings underscored the considerable potential of the proposed model in the context of ultrafast PET reconstruction scenarios.
Figure 5 illustrates visualized results of synthesized PET images based on short-term scans with various acquisition times (3 s, 6 s, 15 s, 30 s, and 75 s) and the original full-time PET images. The patient depicted in this figure was the same as that in Fig. 3. The proposed method generated images that preserved tumors (highlighted by red numbers) and displayed tissue structure. Local regions of interest were magnified using red boxes. For each PET image, the two radiologists independently assigned an image quality score on a five-point Likert scale, as we described in Table 2. According to the independent scoring results, for all sub-images in Fig. 5, the two radiologists provided consistent image quality scores; hence, we directly displayed the scores for each sub-image. The middle row of Fig. 5 depicted the axial views of the abdominal lesions with different acquisition times in terms of PSNR, SSIM, and FSIM, indicating that the benefits (PSNR of 36.37, SSIM of 0.960, and FSIM of 0.964) from the proposed method were obvious under shorter-term scan conditions (i.e., 6-s acquisition time) for diagnosis.
For lesion detection, Table 7 provided the mean standardized uptake values (SUVmean) errors and max standardized uptake value (SUVmax) errors which were calculated from the ultrafast PET images and the synthesized scans relative to the full-time scan images for the same patient in Fig. 5. We could observe that after enhancement by the proposed approach, the average SUVmean error ratios for ultrafast PET images with 75 s, 30 s, 15 s, 6 s, and 3 s were reduced from 2.0 to 0.3% (p > 0.05), 4.7 to 0.9% (p > 0.05), 13.6 to 3.42% (p < 0.05), 12.9 to 4.1% (p < 0.05), and 19.3 to 8.1% (p < 0.05), respectively. Similarly, the average SUVmax error ratios for ultrafast PET images with 75 s, 30 s, 15 s, 6 s, and 3 s were reduced from 11.2 to 5.6% (p > 0.05), 8.7 to 2.8% (p > 0.05), 12.2 to 4.4% (p < 0.05), 17.4 to 6.2% (p < 0.05), and 20.1 to 6.6% (p < 0.05), respectively. It was evident that the synthesized lesion areas based on ultrafast PET images (e.g., 3 s, 6 s, or 15 s) showed significantly improved error ratios in SUVmean and SUVmax (p < 0.05). However, the SUVmean and SUVmax error ratios of the lesion regions synthesized by the proposed framework did not exhibit significant differences compared to the original full-time PET images in the 30 s and above PET imaging strategies. These results demonstrated the strong reconstruction ability of the proposed framework, particularly for PET images acquired with very short scanning times (e.g., less than 15 s), highlighting its adaptability to ultrafast PET image reconstruction.
Subjective image quality
Table 5 presents the average opinion scores of synthesized total-body PET images with various acquisition times. The images with 3 s (\(1.8\pm 0.4\)) and 6 s (\(3.2\pm 0.4\)) had significantly lower scores than those with 75 s (\(4.6\pm 0.7\); p < \(1\times {10}^{-3}\)). However, the generated PET images with 15 s (\(3.6\pm 0.4\)) and 30 s (\(3.9\pm 0.4\)) received higher scores. It was noteworthy that only five cases of 6-s acquisition time and two cases of 15-s acquisition time were scored 2. These subjective image quality scores highlighted the great potential of the proposed model in ultrafast PET reconstruction scenarios. Specifically, the proposed deformable 3D U-Net may significantly improve opinion scores for ultrafast PET images with an acquisition time of 6 s or more. Figure 5 further illustrates the corresponding opinion scores for the brain, abdomen, and pelvic cavity, offering a more detailed understanding of the clinical reading. The brain images showed in the top row in Fig. 5 were further preprocessed and registered to the MNI brain atlas for more accurate quantitative analysis. The opinion scores of these synthesized PET scans with 3 s, 6 s, 15 s, 30 s, and 75 s were 1, 2, 3, 4, and 5, respectively. The results meant that the generated ultrafast brain PET images with 6 s were difficult to meet the clinical diagnostic requirements.
Sedation-free synthesis
Figure 4 depicts the visualized results of the five original [18F]FDG PET images (6-s acquisition time) and the corresponding synthesized PET images. These subfigures were derived from test set 2. The pediatric subjects in our study, who underwent imaging without sedation, were gently guided into a state of sleep with their parents’ presence. Following this, they were subjected to a rapid PET scan, with a total acquisition time of 300 s, under the careful assistance of technologists and nurses. It was evident that our proposed deformable 3D U-Net consistently achieved high-quality reconstruction regardless of the PET imaging acquisition time. Upon closer examination of the regional details in the synthesized PET images, such as the liver region in Figs. 3 and 4, it became apparent that the reconstruction quality of longer acquisition time PET images (i.e., 6 s) surpassed that of shorter acquisition time PET images (i.e., 3 s). This disparity arose from the inherent limitation of ultrafast PET images with shorter acquisition times (e.g., 3 s), which inherently contained significantly less metabolic detail across the entire body, thereby presenting considerable challenges for the reconstruction process. Two radiologists reached a high degree of consistency on the clinical feasibility of the 6-s synthesized PET images.
Discussion
Total-body PET imaging was known for its high sensitivity and had the potential to reduce examination time. However, rapid total-body PET scanning with shorter acquisition time often resulted in compromised image quality, leading to a significantly reduced signal-to-noise ratio and an increased risk of missing small tumors. In previous works [28,29,30], ultrafast total-body PET imaging adopted GANs to achieve satisfactory performance based on 30-s acquisition time. For instance, Hosch et al. [31] proposed a modified pix2pixHD deep-learning network, which was trained on the data from 387 patients who underwent ultra-low-count FDG PET/CT scans (whole-body acquisition time approximately 30 s) and tested on data from 200 patients, to generate synthesized full-dose PET images based on a digital PET/CT scanner. Additionally, Zhang et al. [19] investigated the diagnostic value of the images obtained through the total-body PET/CT scans (88 oncology patients) with 30-s acquisition time and adopted the post-surgical pathological diagnosis to evaluate the experimental results. To the best of our knowledge, no previous studies specifically focused on intelligent ultrafast total-body PET for sedation-free pediatric [18F]FDG Imaging, particularly involving brain function analysis, abdominal lesion detection, and pelvic tumor classification.
Evidence from prior research [32,33,34] suggested that sedation might alter the accurate distribution patterns of radioactive tracers, like FDG, in brain PET imaging. Such alterations have the potential to result in diagnostic inaccuracies or errors. Furthermore, sedation can induce respiratory depression in children, as all sedative drugs suppress the central nervous system in a dose-dependent manner, potentially resulting in the loss of airway control. Allergic reactions to light oral sedation, while rare, can also occur, presenting symptoms such as skin rash, itching, and swelling of the face, tongue, or lips. Moreover, after using sedation, pediatric patients are typically monitored until they are near their baseline level of consciousness and are no longer at increased risk for cardiorespiratory depression. Some children may require extended monitoring for up to 12 h post-sedation. Finally, in the context of sedation procedures, the role of the sedation nurse is critical. They ensure the availability and proper functioning of necessary monitoring equipment, verify that informed consent has been obtained prior to sedation administration, and confirm that the patient will be accompanied by a responsible adult upon discharge. These measures are integral to maintaining patient safety and the integrity of the sedation process. These risks underscore the importance of careful consideration and medical supervision when administering sedation, particularly in children. Due to emotional or stressful reactions, some children do not cooperate with doctors and parents to use sedation, and involuntary movement may still occur post-sedation. Thus, ultrafast imaging based on 30-s acquisition time via the total-body PET/CT could not be suitable for pediatric patients. Therefore, it is urgent to explore the limitation of acquisition time for ultra-fast scanning without sedation. A sedation-free approach for ultrafast PET reconstruction would potentially reduce the risks associated with sedation, such as respiratory depression and allergic reactions, and could also reduce costs and recovery time.
In this study, we attempted to use an artificial intelligence framework to investigate the feasibility of acquiring maximum-speed ultrafast PET images for sedation-free pediatric patients, while ensuring image quality for subsequent lesion detection and analysis. We proposed a novel deformable 3D U-Net framework to generate high quality total-body PET images based on ultrafast PET scans with 50-fold reduced acquisition time (6 s), obtained using the uEXPLORER PET/CT system. Our approach offered three main advantages over the standard 3D U-Net, as identified in our preliminary performance comparison tests: (1) adaptive convolutional layers: The deformable convolutional layers in our framework can adaptively modify the shape of the convolutional kernel. This feature was crucial for synthesizing total-body PET images due to the high variability of anatomical structures. (2) Enhanced feature capture: Deformable convolutions were more proficient at capturing complex spatial relationships and morphological variations, leading to more accurate and robust feature extraction than standard convolutional layers. (3) Superior performance in quantitative metrics: Our results suggested that the deformable 3D U-Net significantly outperformed the standard 3D U-Net, particularly in cases with substantial anatomical variation among different patients. To our knowledge, only a few previous approaches [35,36,37] that used GANs or U-Nets enhanced pediatric [18F]FDG ultrafast scans using total-body PET imaging. Some previously published approaches were restricted to abdominal or pelvic PET scanning, which was more accessible due to the limited anatomical variance compared to whole-body images [38]. We aimed to evaluate the qualitative and clinical results of this study in the context of ultrafast total-body imaging for pediatric patients without sedation.
The quantitative and semiquantitative results presented that the quality of the synthesized PET images gradually increases with increasing scanning time, and the quantitative (SSIM, PSNR, FSIM) and semiquantitative (SUVmax, SUVmean) values of the images synthesized by the proposed deformable 3D U-Net were the closest to the full-time reference images for the same short-term scan duration. Significant improvements in quantitative metrics were observed for 6-s and 15-s acquisition time (Table 4 and Fig. 5). For the reconstruction of ultrafast brain PET images, it was found that the acquisition time could be shortened to 15 s to achieve satisfactory qualitative and quantitative results (Figs. 5 and 6). In the case of abdominal and pelvic diseases, an extremely short scanning time of 6 s was sufficient to produce images of diagnostic quality (Figs. 5 and 6). Therefore, the proposed method demonstrated its ability to compensate for the reduced signal-to-noise ratio caused by rapid scanning, thereby enhancing image quality. This is particularly beneficial for the investigation of body tumors in children, as more pediatric patients can successfully complete a PET examination without the need for sedation.
The clinical reading results of the two radiologists for all the synthesized PET images generated by our proposed framework for the brain, abdomen, and pelvic cavity were analyzed, as depicted in Fig. 6 and Table 6. The opinion scores were denoted by the manner of mean \(\pm\) SD. As shown in Fig. 6a–c and Table 6, most of the opinion scores given by the two radiologists exhibited high agreement (p > 0.05 in all paired t-tests). Significant differences between raters in opinion scores were only found in synthesized PET images from 3 s at the brain region (p < 0.05). The opinion scores of synthesized PET images from 3 s, 6 s, 15 s, 30 s, and 75 s progressively increased (Table 7). The generated image based on 6-s scans can meet clinical needs for the diagnosis of both body diseases. All the synthesized PET images of pediatric patients with/without sedation from two test datasets were used for further focal segmentation, which also demonstrated that a 6-s ultrafast PET image was feasible in the primary investigation for body diseases. Regarding specific regions, the opinion scores for brain significantly decreased, indicating the relatively greater reconstruction difficulty for brain region. Therefore, for some uncooperative young children, the examination time can be further shortened to 6 s with our method assistance.
This study had several limitations worth mentioning. First, our analysis encompassed a limited cohort of pediatric patients who were pathologically diagnosed, with only five of them undergoing PET scans without sedation. The small sample size could potentially introduce bias into the results and restrict the applicability of the findings. Second, due to ethical considerations, we were unable to expose the pediatric patients to both ultrafast imaging without sedation and conventional imaging with sedation, which facilitated for a more accurate comparison of the two scan images. This lack of direct comparison may constrain the robustness of the conclusions derived from the study. Additionally, this study primarily concentrated on examinations for body diseases, with a limited number of cases focusing on diagnosing brain function. Consequently, additional research was required to enhance ultrafast imaging techniques specifically for evaluating brain function.
Conclusion
This paper introduces a novel deep neural network framework designed to evaluate the feasibility of significantly reducing the time required for pediatric [18F]FDG ultrafast scans using total-body PET/CT without sedation. The visualized results and quantitative analysis demonstrate that our approach achieves excellent reconstruction performance on commonly used metrics, such as PSNR, SSIM, and FSIM, for total-body, brain, abdomen, and pelvic cavity scans. Compared to reconstructed PET images with longer time durations (e.g., 75 s), our approach exhibits a marked improvement in ultrafast PET reconstruction scenarios (e.g., 3 s), with increasing ratios of 23.06%, 2.48%, and 2.57% for PSNR, SSIM, and FSIM, respectively. The clinical reading results by two radiologists also support the clinical feasibility and better visual discrimination of the synthesized full-time PET images using our proposed framework. In conclusion, this study provides a sufficient demonstration for the design of a better framework for pediatric ultrafast sedation-free whole-body PET imaging and enhances clinical decision-making.
Data availability
The data could be obtained from the corresponding authors upon request.
References
Peng L, Liao Y, Zhou R, Zhong Y, Jiang H, Wang J, et al. [18F] FDG PET/MRI combined with chest HRCT in early cancer detection: a retrospective study of 3020 asymptomatic subjects. Eur J Nucl Med Mol Imaging. 2023;50:3723–34.
Chételat G, Arbizu J, Barthel H, Garibotto V, Law I, Morbelli S, et al. Amyloid-PET and 18F-FDG-PET in the diagnostic investigation of Alzheimer’s disease and other dementias. The Lancet Neurology. 2020;19:951–62.
Fu Y, Dong S, Niu M, Xue L, Guo H, Huang Y, et al. AIGAN: attention–encoding integrated generative adversarial network for the reconstruction of low-dose CT and low-dose PET images. Med Image Anal. 2023;86: 102787.
Alavi A, Houshmand S, Werner TJ, Zaidi H. Potential applications of PET-based novel quantitative techniques in pediatric diseases and disorders. PET clinics. 2020;15:281–4.
Xu Y-F, Yang J-G. Roles of F-18-fluoro-2-deoxy-glucose PET/computed tomography scans in the management of post-transplant lymphoproliferative disease in pediatric patient. PET clinics. 2020;15:309–19.
Nygaard U, Larsen LV, Vissing NH, von Linstow ML, Myrup C, Berthelsen AK, et al. Unexplained fever in children—benefits and challenges of FDG-PET/CT. Acta Paediatr. 2022;111:2203–9.
Lyra V, Chatziioannou S, Kallergi M. Clinical perspectives for 18F-FDG PET imaging in pediatric oncology: μetabolic tumor volume and radiomics. Metabolites. 2022;12:217.
Masselli G, De Angelis C, Sollaku S, Casciani E, Gualdi G. PET/CT in pediatric oncology. Am J Nucl Med Mol Imaging. 2020;10:83.
Fu Y, Dong S, Liao Y, Xue L, Xu Y, Li F, et al. A resource-efficient deep learning framework for low-dose brain PET image reconstruction and analysis. 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI): IEEE. Kolkata, India, 2022;1–5.
Chugani HT, Shewmon DA, Khanna S, Phelps ME. Interictal and postictal focal hypermetabolism on positron emission tomography. Pediatr Neurol. 1993;9:10–5.
Jadvar H, Alavi A, Mavi A, Shulkin BL. PET in pediatric diseases. Radiologic. Clinics. 2005;43:135–52.
Mandell GA, Cooper JA, Majd M, Shalaby-Rana EI, Gordon I. Procedure guideline for pediatric sedation in nuclear medicine. J Nucl Med. 1997;38:1640–2.
Parad RB. Non-sedation of the neonate for radiologic procedures. Pediatr Radiol. 2018;48:524–30.
Lee K-H, Ko B-H, Paik J-Y, Jung K-H, Choe YS, Choi Y, et al. Effects of anesthetic agents and fasting duration on 18F-FDG biodistribution and insulin levels in tumor-bearing mice. J Nucl Med. 2005;46:1531–6.
Olesen OV, Paulsen RR, Hojgaard L, Roed B, Larsen R. Motion tracking for medical imaging: a nonvisible structured light tracking approach. IEEE Trans Med Imaging. 2011;31:79–87.
Kesner AL, Kuntner C. A new fast and fully automated software based algorithm for extracting respiratory signal from raw PET data and its comparison to other methods. Med Phys. 2010;37:5550–9.
Aide N, Lasnon C, Desmonts C, Armstrong IS, Walker MD, McGowan DR. Advances in PET/CT technology: an update. Semin Nucl Med. 2022;53:286–301.
Cherry SR, Jones T, Karp JS, Qi J, Moses WW, Badawi RD. Total-body PET: maximizing sensitivity to create new opportunities for clinical research and patient care. J Nucl Med. 2018;59:3–12.
Zhang Y, Hu P, He Y, Yu H, Tan H, Liu G, et al. Ultrafast 30-s total-body PET/CT scan: a preliminary study. Eur J Nucl Med Mol Imaging. 2022;49:2504–13.
Dong S, Pan Z, Fu Y, Yang Q, Gao Y, Yu T, et al. DeU-Net 2.0: enhanced deformable U-Net for 3D cardiac cine MRI segmentation. Medical Image Analysis. 2022;78:102389.
Dong S, Yang Q, Fu Y, Tian M, Zhuo C. RCoNet: deformable mutual information maximization and high-order uncertainty-aware learning for robust COVID-19 detection. IEEE Trans Neural Netw Learn Syst. 2021;32:3401–11.
Dong S, Pan Z, Fu Y, Xu D, Shi K, Yang Q, et al. Partial unbalanced feature transport for cross-modality cardiac image segmentation. IEEE Trans Med Imaging. 2023;42:1758-1773.
Dong S, Zhao J, Zhang M, Shi Z, Deng J, Shi Y, et al. DeU-Net: deformable U-Net for 3D cardiac MRI video segmentation. Medical Image Computing and Computer Assisted Intervention–MICCAI 2020: 23rd International Conference, Lima, Peru, October 4–8, 2020, Proceedings, Part IV 23: Springer. 2020;98–107.
Liu K, Tang W, Zhou F, Qiu G. Spectral regularization for combating mode collapse in gans. Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019;6382–90.
Reader AJ, Corda G, Mehranian A, da Costa-Luis C, Ellis S, Schnabel JA. Deep learning for PET image reconstruction. IEEE Trans Radiat Plasma Med Sci. 2020;5:1–25.
Chen KT, Gong E, de Carvalho Macruz FB, Xu J, Boumis A, Khalighi M, et al. Ultra–low-dose 18F-florbetaben amyloid PET imaging using deep learning with multi-contrast MRI inputs. Radiology. 2019;290:649–56.
Chen KT, Toueg TN, Koran MEI, Davidzon G, Zeineh M, Holley D, et al. True ultra-low-dose amyloid PET/MRI enhanced with deep learning for clinical interpretation. Eur J Nucl Med Mol Imaging. 2021;48:2416–25.
Sanaat A, Shiri I, Arabi H, Mainta I, Nkoulou R, Zaidi H. Deep learning-assisted ultra-fast/low-dose whole-body PET/CT imaging. Eur J Nucl Med Mol Imaging. 2021;48:2405–15.
Chaudhari AS, Mittra E, Davidzon GA, Gulaka P, Gandhi H, Brown A, et al. Low-count whole-body PET with deep learning in a multicenter and externally validated study. NPJ digital medicine. 2021;4:127.
Lei Y, Dong X, Wang T, Higgins K, Liu T, Curran WJ, et al. Whole-body PET estimation from low count statistics using cycle-consistent generative adversarial networks. Phys Med Biol. 2019;64: 215017.
Hosch R, Weber M, Sraieb M, Flaschel N, Haubold J, Kim M-S, et al. Artificial intelligence guided enhancement of digital PET: scans as fast as CT? Eur J Nucl Med Mol Imaging. 2022;49:4503–15.
Vinson AE, Houck CS. Neurotoxicity of anesthesia in children: prevention and treatment. Curr Treat Options Neurol. 2018;20:1–10.
Kessler M, Mamach M, Beutelmann R, Bankstahl J, Bengel F, Klump G, et al. Activation in the auditory pathway of the gerbil studied with 18 F-FDG PET: effects of anesthesia. Brain Struct Funct. 2018;223:4293–305.
Liu X, Ji J, Zhao G-Q. General anesthesia affecting on developing brain: evidence from animal to clinical research. J Anesth. 2020;34:765–72.
Vali R, Alessio A, Balza R, Borgwardt L, Bar-Sever Z, Czachowski M, et al. SNMMI procedure standard/EANM practice guideline on pediatric 18F-FDG PET/CT for oncology 1.0. J Nucl Med. 2021;62:99–110.
Alessio AM, Sammer M, Phillips GS, Manchanda V, Mohr BC, Parisi MT. Evaluation of optimal acquisition duration or injected activity for pediatric 18F-FDG PET/CT. J Nucl Med. 2011;52:1028–34.
Reichkendler M, Andersen FL, Borgwardt L, Nygaard U, Albrecht-Beste E, Andersen KF, et al. A long axial field of view enables PET/CT in toddler without sedation. J Nucl Med. 2022;63;1962
Rufini V, Garganese G, Ieria FP, Pasciuto T, Fragomeni SM, Gui B, et al. Diagnostic performance of preoperative [18 F] FDG-PET/CT for lymph node staging in vulvar cancer: a large single-centre study. Eur J Nucl Med Mol Imaging. 2021;48:3303–14.
Funding
This work was supported by in part by the National Key Research and Development Program of China (Grant No. 2020YFA0909000 and 2021YFA0910000), the National Natural Science Foundation of China (Grant No. 81571710, 82001878, and 82171972), and the Shanghai Rising-Star Program (Grant No. 20QA1406100).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Ethical approval
All procedures were performed in accordance with the principles of the 1964 Declaration of Helsinki and its later amendments or comparable ethical standards. The study design and exemption from informed consent were approved by the Institutional Review Board of the Ren Ji Hospital.
Consent to participate
The informed consent was waived.
Consent for publication
Not applicable.
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhou, X., Fu, Y., Dong, S. et al. Intelligent ultrafast total-body PET for sedation-free pediatric [18F]FDG imaging. Eur J Nucl Med Mol Imaging 51, 2353–2366 (2024). https://doi.org/10.1007/s00259-024-06649-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00259-024-06649-2