Introduction

[18F]FDG PET imaging has been widely used in clinical and research applications over the past few decades. The application of PET in pediatric diseases has also gradually attracted attention and has become an essential diagnostic tool in the imaging evaluation of pediatric tumors and non-tumor diseases [1]. [18F]FDG PET is used in the diagnosis of pediatric solid tumors, hematological tumors, post-transplant lymphoproliferative disease (PTLD), and fever of unknown origin (FUO) [2,3,4], and is routinely used for staging, assessing response to treatment, monitoring the course of the disease, and detecting recurrence in many malignancies [5].

Pediatric PET imaging is often performed when patients are sedated to achieve good-quality PET imaging [6]. Because keeping the child still during the examination is not easy, and the motion during scanning would corrupt reconstructions, leaving artifacts in the final image. However, sedation has the following disadvantages: (1) It will have a potential impact on the neurodevelopment of children [7]; (2) Some children will not cooperate to take sedatives, or after taking sedatives, some children have involuntary movements during the examination. (3) It is also essential to consider the potential effects of sedatives on FDG tracer bio-distribution [8,9,10] or peripheral-type benzodiazepine receptor imaging [11, 12], especially in early or dynamic PET imaging.

Therefore, some physicians performed clinical studies on PET imaging in children without sedation using long axial field-of-view (FOV) PET/CT [13,14,15]. However, in the non-sedation state, children will be prone to move during the PET scan period and produce motion artifacts in acquired PET images. Many studies focus on sedation-free pediatric PET imaging, developing techniques for correcting motion-corrupted PET images. Some tracked the motion with external devices [16], but the complex setup might hammer the adoption into clinical routine. Other studies have attempted to extract motion signals directly from raw PET data [17], which may not accurately convey the actual motion amplitude. The limitation of current data-driven methods for PET relates to the CT image used for attenuation and scatter correction (ASC), particularly evident in cases of respiratory motion, which can lead to scatter or attenuation artifacts affecting PET quantification.

Ultra-fast total-body (TB) PET imaging may balance acquisition time and image reconstruction quality from the raw data for signal extraction back to image-based methods [18]. The advent of TB PET made it possible for ultra-fast PET scan [19]. Shi et al. reported that no significant difference were observed in the diagnostic accuracy of adult tumors between scanning 300 s and 30 s (P = ns) [20]. Additionally, the motion also presents a problem during the ASC of PET images, especially in the case of ultra-fast PET scans. Emission-based AC is the one possible solution for that. Xue et al. [21] employed a simple way to integrate domain knowledge in deep learning (DL) for computed tomography (CT)-free PET imaging.

Our study explored the value of ultra-fast TB PET and DL-based ASC for sedation-free pediatric PET imaging. We established the feasibility of PET imaging through TB PET rapid scan and PET ASC without sedation.

Methods

Patient cohorts

This study analyzed the data of 40 patients with suspected tumors that were referred to the Department of Nuclear Medicine to undergo [18F]FDG PET/CT for tumor diagnosis and staging from January 2022 to January 2023. The inclusion criteria were as follows: (1) under 4 years old (2) undergo TB PET scan (3) definite surgical pathology or follow-up data. Among them, 35 children were orally sedated 20 min before the PET scan. 5 children did not take any sedatives during the PET scan. The study was approved by the institutional review board of Renji hospital, and the informed consent was obtained from all patients’ legal guardians.

General imaging protocols

All patients fasted for at least 6 h before [18F]FDG administration. The list-mode PET data were acquired using a TB PET/CT scanner with 194-cm-long axial FOV (uEXPLORER, United Imaging Healthcare, Shanghai, China) 60 min after [18F]FDG injection 3.7MBq/kg [0.1 mCi/kg] activity per body weight). The acquisition time was 300 s. All PET imaging was reconstructed with the same parameters: time of flight [22] and point spread function modeling; ordered subset expectation maximization algorithm with 2 iterations and 20 subsets; matrix, 192 × 192; slice thickness, 2.89 mm; FOV, 600 mm (pixel size, 3.125 × 3.125 × 2.89 mm) with a Gaussian post-filter (3 mm). Low-dose CT scans of the whole body were obtained by uEXPLORER (tube current 10 mA, voltage 100 kV, rotation time 0.5s, pitch 1.0125, collimation 80 × 0.5 mm) were reconstructed in a 512 × 512 matrix for ASC.

Imaging protocols for patients under sedation

We define “fast-scan-time” (FST) as a time window featuring randomly sampled full-count data, simulating the fast imaging conditions relevant to practical scenarios. PET images of 35 sedated children (administered chloral hydrate orally at 30–50 mg/kg) were reconstructed from FST data captured at various intervals (300s, 60s, 30s, 15s, 10s, 5s). Prior to PET data acquisition, a CT scan was conducted for ASC. For each FST interval, both CT-ASC PET images and non-corrected (NASC) PET images were obtained; for instance, FST-300s CT-ASC represents PET imaging reconstructed from 300 FST data with CT-based ASC applied.

Imaging protocols for sedation-free patients

Five children were initially subjected to PET/CT scans while non-sedated, followed by subsequent CT-free PET scans under sedation. The standard protocol for PET scans is set at 300 s in a non-sedated context. However, the scanning process is subject to interruption should the patient display excessive movement or an inability to maintain cooperation. The minimum scan duration achieved among these non-sedated children was 60 s. PET imaging frames were reconstructed based on 5-second intervals of counts. The PMOD software was utilized to identify a period of three consecutive frames (15 s) where the child remained motionless, then the reconstruction was performed with the counts from the identified motionless time window. Reconstructed NASC PET imaging was collected at FST-15s and FST-300s.

Deep learning-based correction

We leveraged the framework introduced in our previous work on DL-based AC within the realm of ultra-fast PET imaging [21]. This method offers a straightforward means of incorporating domain knowledge into DL for CT-free PET imaging. Particularly, as depicted in Fig. 1, we customized the generative adversarial network to estimate the anatomy-dependent correction map, calculated from the FST-300s CT-ASC PET image, with the input being the non-corrected ultra-fast FST-15s NASC PET image. The selection of FST-15s for this process was based on our thorough evaluation of ultra-fast PET imaging. Our training phase involved a dataset of 30 sedated patients, chosen randomly from a total cohort of 35. We then validated our model on the remaining 5 patients from the sedated group, and further tested on the 5 non-sedated patients.

Fig. 1
figure 1

General protocol of our customized deep learning method for attenuation and scattering correction. We employed a modified generative adversarial network to derive an anatomy-specific correction map from a standard full-time PET image (300s), denoted as FST-300s CT-based attenuation and scatter corrected (CT-ASC) PET, alongside FST-300s NASC PET. The input to the network was the non-corrected ultra-fast PET image (15s), specifically FST-15s NASC

Evaluation

Qualitative evaluation

The quality of PET images was independently assessed by two board-certified nuclear medicine physicians with over a decade of experience (X.Z. and R. C.). Employing a 5-point scale (Supplementary Table S1), they evaluated the overall image quality. Images receiving a score of 2 or more were deemed diagnostically adequate. During the evaluation, the physicians were blinded to patient history and acquisition time, and the order of image presentation was randomized to minimize potential bias.

Quantitative evaluation

To evaluate the efficacy of ultra-fast PET imaging, two board-certified nuclear medicine physicians (X.Z. and R. C.) utilized PMOD software to manually identify circular regions of interest (ROIs) within the aorta lumen and the normal parenchyma of the right liver lobe, each approximately 2.0 cm in diameter (ranging from 1.5 to 2.5 cm). Lesions were also identified and outlined. This process facilitated the measurement of Standardized Uptake Values (SUV) and the subsequent statistical analysis of imaging parameters, including SUVmax, SUVmean, and the tumor-to-background ratio (TBR). TBR was derived by dividing the lesion SUVmax by the liver SUVmean. Lesions with SUVmax > 2.5 or TBR > 1 were identified as suspected lesions. Furthermore, global physical metrics were calculated in comparison to FST-300s CT-ASC PET imaging. These metrics included structural similarity index measurement (SSIM), signal-to-noise ratio (SNR), and voxel-wise root mean squared error (RMSE).

Statistical analysis

Statistical analyses were conducted using SPSS 16.0 Statistics. The inter-reader agreement was evaluated through linearly weighted kappa coefficient analysis. A paired samples t-test was applied for objective measurement comparisons, while a chi-squared test was utilized to compare lesion detection rates across groups. P < 0.05 was considered statistically significant.

Results

Patient cohorts

Table 1 summarizes characteristics of the pediatric patient cohort, comprising 40 individuals (13 males, 27 females) with a mean age of 28 months (range: 0.5 to 3.9 years). The cohort included 16 patients with neuroblastoma, 9 with PTLD, 8 patients with sarcoma, 4 with lymphomas and 3 patients with leukaemia.

Table 1 Demographic information of patients’ data

Qualitative evaluation of ultra-fast PET image quality

All imaging used to evaluate ultra-fast image quality was CT-ASC PET imaging. For reference, the FST-300s group achieved an average score of 5. Groups with FST exceeding 15 s (FST-60s, FST-30s, and FST-15s) satisfied clinical standards, scoring over 2, whereas the other groups (FST-10s and FST-5s) exhibited substandard quality (Table 2). Exemplary MIP and tomographic images, showcasing liver lesions and lymph node metastases, are presented in Figs. 2 and 3, respectively.

Table 2 Scores of qualitative assessments of ultra-fast PET image quality
Fig. 2
figure 2

Maximum intensity projection imaging of a 1-year-old child, reconstructed from fast-scan-time at different levels (300s, 60s, 30s, 15s, 10s, 5s)

Fig. 3
figure 3

Tomographic images of lesions in a 22-month-old child with neuroblastoma, reconstructed from fast-scan-time at different levels (300s, 60s, 30s, 15s, 10s, 5s). (A) Arrows indicate liver lesions. The long diameter of liver lesion is 51 mm. SUVmax (SUVmean) of 300, 60, 30, 15,10,5, 2s are 13.1(4.7), 13.4(4.7), 13.1(4.6), 13.8(4.7), 13.0(4.7), 13.6(4.7), 17.3(4.7), respectively. (B) Arrows indicate the left para-aortic lymph node. The long diameter of the lymph node is 5 mm. SUVmax (SUVmean) of 300, 60, 30, 15,10,5, 2s are 2.8(2.6), 2.8(2.6), 2.8(2.5), 3.0(2.5), 3.1(2.6), 3.7(3.0), 3.1(2.8), respectively

Additionally, image quality was also qualitatively assessed based on lesion detectability. In the FST-300s group, 74 suspicious lesions (diagnosis based on 18F-FDG PET/CT imaging) were identified in 30 sedated patients, while the FST-15s group identified 72, with discrepancies in 2 lesions. These two lesions were either smaller than 1 cm or had an SUVmax below 2.5, and both originated from the liver (Fig. 4). According to the follow-up and postoperative pathological results, 94 lesions were found in 30 patients, 27 benign, and 67 malignant. Calculation of the diagnostic performance of the 2 groups is based on the lesions. The FST-300s group achieved a sensitivity of 92.5%, specificity of 55.6%, and overall accuracy of 81.9%, compared to the FST-15s group’s 89.6% sensitivity, 55.6% specificity, and 79.8% accuracy. No significant statistical differences were noted between these groups (P = ns) (Supplementary Table S2).

Fig. 4
figure 4

A 32-month-old child with posttransplant lymphoproliferative disorders (PTLD) after liver transplantation. The images showed lesions in the liver for the 300s, 30s, 15s, 10s. We can diagnose the liver lesion that arrow indicate in 300s, but depiction of liver lesion was arguable in 10–30 s

Quantitative evaluation of ultra-fast PET image quality

Table 3 presents the quantitative evaluation of ultra-fast PET image quality, focusing on clinical features including SUVmax, SUVmean and TBR. The analysis reveals that groups with FST over 15 s (FST-60s, FST-30s, and FST-15s) matched the reference FST-300s group in SUVmax, SUVmean (for mediastinum, liver and lesions), and TBR (for lesions) with no significant differences (P = ns). Conversely, shorter FST groups (FST-10s and FST-5s) showed significantly different results, especially in SUVmax for mediastinum and liver (P < 0.001). Evaluations involving physical metrics (SSIM, SNR and RMSE) are detailed in Supplementary Table S3.

Table 3 Quantitative evaluation of ultra-fast PET image quality with clinical features

Evaluation of CT-free PET image quality on sedated patients

Figure 5. A-B demonstrates that the SUV values derived from lesions in five test subjects exhibit no significant disparities between CT-ASC and DL-ASC images (SUVmax: P = 0.21, SUVmean: P = 0.44). Supplementary Figure S1 details the quantitative accuracy of these images, as assessed using established physical metrics.

Fig. 5
figure 5

Comparisons of lesion SUVmean and SUVmax under no attenuation and scatter correction (NASC), CT attenuation and scatter correction (CT-ASC), and deep learning attenuation and scatter correction (DL-ASC), on sedated and non-sedated test dataset (S/NS). Comparison of the lesion SUVmax (A) and SUVmean (B) with 15s FST on sedated patients, and SUVmax (C) and SUVmean (D) on non-sedated patients. DL: deep learning; CT: computerized tomography

In terms of qualitative image quality, DL-ASC images demonstrate significantly superior clarity and detail compared to CT-ASC images for visual assessment (P = ns), achieving an average score of 3.5, as documented in Supplementary Table S4. Additionally, Supplementary Figure S1 provides representative results of CT-ASC versus DL-ASC PET imaging in a sedated pediatric patient.

Evaluation of CT-free PET image quality on non-sedated patients

Figure 5. C-D demonstrates that lesions identified from NASC PET imaging show no significant differences in SUV values, whether the patient is sedated (S/NASC) or not (NS/NASC) (P = 0.97 for SUVmax and P = 0.08 for SUVmean). Meanwhile, quantification of lesions from sedated CT-ASC and non-sedated DL-ASC images reveals no significant differences. Figure 6 presents the comparison of CT-ASC and DL-ASC PET imaging in a non-sedated pediatric patient. Notably, Lesion 1, located in the abdominal cavity, is more discernible in the DL-ASC image due to enhanced contrast, unlike its appearance in the NASC image. Conversely, Lesion 2, located at the thoracic vertebra, is detectable in both NASC and DL-ASC images. However, there is a significant discrepancy in the SUV values between these two methods, with the DL-ASC imaging aligning more closely with the reference values from the FST-300 CT-ASC sedated case. Comprehensive quantitative evaluation of mediastinum, liver and lesions are shown in Table 4. Detailed results of the qualitative evaluation are available in Supplementary Table S6.

Fig. 6
figure 6

15s ultra-fast DL PET image under non-anesthesia compared to the CTC PET image under anesthesia. A 23-month-old female with multiple metastases of neuroblastoma. In the non-anesthetic PET scan images (three columns on the left ), the figure showed the total body MPR images without attenuation correction for 15s, full-time scan (180s) without attenuation correction, and artificial intelligence-assisted attenuation correction. The PET images of typical lesions 1 and lesion 2 were shown in the transversectional position. Lesion 1 was a large primary tumor in the abdominal cavity, about 20 × 17 mm in size. Lesion 2 is a bone metastatic tumor, about 12 × 6 mm in size. as well as SUVmax/SUVmean of mediastinal, liver, and typical lesions were shown in this figure. PET scan images at the anesthesia state were shown in three columns on the right. From left to right, the images were 15s CTC, 15s DL, and full-time (300s) CTC PET

Table 4 Quantitative evaluation of CT-free PET image quality on non-sedated patients

Discussion

Fast imaging has extensive demand in PET examination of many types of patients [23], such as uncooperative children, cancer patients who cannot tolerate pain, and patients with mental or neurological abnormalities that cannot control their behavior. This is particularly true for pediatric patients, where the common practice of sedation poses certain disadvantages. Our study investigated the minimum feasible scanning time for children, employing DL-based AC to facilitate sedation-free pediatric [18F]FDG imaging in total body PET/CT.

Our qualitative and quantitative evaluation of image quality across different FSTs from 5 s to 300 s revealed that a minimum of 15 s is required to meet basic clinical standards. This finding aligns with other studies, such as Zhang Y et al. [20] and Ying-Ying Hu et al. [24], which also support the adequacy of a 15-second scan for basic image quality in pediatric ultra-fast TB PET imaging.

Our examination of the detectability of suspicious lesions indicated that there was no notable difference between the FST-15s and the standard FST-300s modalities. This was consistent except in cases of lesions smaller than 1 cm in diameter or with an SUVmax below 2.5, all of which were in the liver. In clinical settings, the diagnosis of liver lesions is typically corroborated using MRI or ultrasound, ensuring that these lesions are not overlooked. A Total-body PET acquisition time of 3 to 5 min is recommended to facilitate the detection of small liver lesions when suspicious lesions are found on MRI. When no liver lesion is detected in MRI or US, we can shorten the collection time to achieve an ultrafast scan.

Our study also explored the feasibility of 15-second non-sedated PET imaging in children. Challenges include child’s unpredictable movement during the scan and the positional discrepancies between CT and PET scans. To address this, we engaged five children in continuous scans ranging from 60 to 300 s without sedation. Analysis via PMOD software confirmed that all children could maintain a static position for at least 15 s. Future research will focus on developing AI software to automatically identify and extract PET data from static positions.

Accurate quantification of PET imaging necessitates ASC, to compensate for photon pair losses due to scattering and photoelectric absorption. However, discrepancies between CT and PET scans can result in inaccurate corrections, especially for non-sedated patients. To mitigate this issue, our study employed a decomposition-based method [21] designed to extract low-frequency, anatomy-specific information from NASC PET data, which can be applied for correction. Particularly in this ultra-fast imaging scenario, we were able to leverage the richer information from FST-300s scans, to guide the model to estimate the correction map from FST-15s NASC PET image, thereby enhancing the generated image quality (Supplementary Table S7 & Figure S3).

Our research utilized a model trained on a homogeneous and limited amount of dataset using a single scanner and tracer, which may not be ideal for robust DL development. However, our preliminary findings validated the potential of our initial hypothesis. This proof-of-concept underscores the importance of designing more realistic studies in the future, incorporating larger and more diverse datasets that encompass a range of centers, scanners, tracers, diseases, and body regions. Limitations of the study are the amount and variety data included this study. Future research should broaden the scope to include larger dataset, diverse diseases, and other radiotracers like 68Ga and 11C for ultra-fast PET imaging in non-sedated children. Furthermore, deep learning-based dose reduction techniques [25] will also be incorporated to reduce the injected dose for ultra-fast PET imaging, or further reduce the acquisition time.

Conclusion

This research establishes the feasibility of 15-second ultra-fast TB PET scans for pediatric oncology diagnosis. It demonstrates that pediatric PET imaging can be performed without sedation, utilizing ultra-fast imaging techniques combined with a DL-based AC algorithm. This pioneering effort in sedation-free ultra-fast PET imaging paves the way for its wider clinical application and fosters further advancements in PET imaging technology.