Introduction

Respiratory motion affects positron emission tomography/computed tomography (PET/CT) image quality, because the acquisition time of PET (2–3 min/bed position) is much longer than the respiratory cycle time (3–6 s) [1]. The common problems are reduced image resolution, underestimation of the standardized uptake value (SUV), overestimation of tumor volume, and deformation or inaccurate localization of lesions [2].

Although respiratory gating can increase lesion SUV [3], decrease metabolic tumor volume (MTV) [4], reduce image noise, and improve diagnostic performance [5], it is generally not used in the clinic, because of the long imaging time required to match the count and image quality to non-gated imaging [6, 7]. The extended acquisition time is poorly tolerated by patients, and the time taken for the transfer and reconstruction of large data reduces working efficiency.

The advent of PET scanners with a long axial field of view (AFOV) such as the uEXPLORER (AFOV: 194 cm; United Imaging Healthcare) [8] and the Quadra (AFOV: 106 cm, Siemens Healthineers) [8], has significantly improved detector sensitivity and PET image quality. These scanners have enabled PET imaging with reduced acquisition time [9, 10], low tracer activity [9, 11], and delayed imaging time [12]. We hypothesized that the improved efficiency in count detection and the long AFOV would improve efficiency in respiratory-gated imaging too. In this study, we aimed to investigate whether respiratory-gated total-body PET imaging using the uEXPLORER could minimize the effect of respiratory motion and reduce acquisition time. The hardware-based technique was used since it is still the standard method for respiratory gating.

Materials and methods

This study was approved by the Institutional Review Board of Zhongshan Hospital, Fudan University (B2021-329). Written informed consent was obtained from all patients.

Patients enrollment

From September to December 2021, a total of 145 patients underwent respiratory-gating total-body [18F]FDG PET/CT for suspected malignancies in the thorax and upper abdomen. After an initial review of PET/CT images, 49 patients were excluded (15 had no lesions identified, 22 had lesions with invisible [18F]FDG uptake compared to the surrounding background, 7 had tumors that were not in the thorax and upper abdomen). Another five patients were excluded because of failure in lesion segmentation on PET images, including lesions that were close to the myocardium (n = 1) or the stomach (n = 1) which had obvious physiological uptake of [18F]FDG, and lesions that were too close to the segment from each other (n = 3). Then, the remaining 96 patients were followed up, of which 25 patients without a pathologically confirmed diagnosis were excluded (Fig. 1). Finally, a total of 71 patients with pathologically confirmed malignancies (n = 68) and benign lesions (n = 3) were included.

Fig. 1
figure 1

Flowchart of patient enrollment

Total-body PET/CT acquisition

Patients were fasted for at least 6 h before injection of 18F-FDG (4.4 MBq/kg). The mean glycemia of patients at the time of [18F]FDG injection was 6.1 (range: 3.9–10.7) mmol/L. Total-body PET/CT using the uEXPLORER scanner was performed after the patient had rested for about 60 min. The respiratory motion signal was recorded using a Vital Signal Monitor Gating belt (United Imaging Healthcare). Free-breathing CT acquisitions were performed using the following parameters: tube current, 140 mAs; tube voltage, 120 kV; collimation, 64 × 0.5 mm; pitch, 1.0; and matrix, 512 × 512.

Respiratory gating and PET image reconstruction

Each respiratory cycle was evenly divided into three bins with the second being taken to reconstruct respiratory-gated images, corresponding to the end-expiratory phase. In other words, approximately one-third of coincidences were extracted for a gated reconstruction. For comparison, four datasets of PET were reconstructed: gated and ungated reconstruction using the entire 15-min data (Gated-15 and Ungated-15), ungated reconstruction using the first 5 min (Ungated-5), and gated reconstruction using the first 6 min (Gated-6). Ungated-5 was reconstructed mainly for count-matched comparison with Gated-15. Gated-6 was reconstructed to investigate the performance of time-reduced respiratory-gated imaging, took into account an effective count equal to the standard protocol of 2 min/bed position recommended in the European guideline [13]. All PET image reconstructions were performed using the list-mode ordered-subsets expectation-maximization algorithm, incorporating time-of-flight and point-spread function with three iterations, 20 subsets, an image matrix of 192 × 192, and a slice thickness of 1.443 mm.

Objective assessment of image quality and lesion demonstration

An objective assessment of image quality was performed by measuring the [18F]FDG uptake in the background liver and the blood pool. Briefly, spherical volumes of interest (VOI) with diameters of 3 cm were placed in the middle area of the right lobe of the liver, and spherical VOIs as large as possible were drawn in the descending aorta without touching its wall. From each VOI, the maximum, mean, and standard deviation of standardized uptake value (SUVmax, SUVmean, and SUVSD, respectively), corrected by the body weight, were estimated. The signal-to-noise ratio (SNR) of the liver and blood pool were calculated by dividing SUVmean with SUVSD. Identical VOIs were copied among the four image sets of each patient.

Lesions detectability was assessed by counting the number of lesions on the four image sets and comparing the number with the lesion count on chest high-resolution CT, enhanced abdominal CT, or MRI performed within 1 week before or after PET. For measuring the [18F]FDG uptake in lesions, an adaptive threshold segmentation method was used to segment each lesion [14]. This segmentation method is a combination of the threshold algorithm and the region-growing algorithm, with the former finding an optimal threshold through iteration for lesion segmentation and the latter connecting the seed pixels and those pixels whose intensity values are greater than the optimal, to finally generate the lesion region [14, 15]. Making full use of the information from signal to background, this method showed better performance in lesion delineation and is independent to image properties, scanner type, reconstruction, and imaging noise, compared to the commonly used relative or absolute fixed SUV threshold [14, 15]. MTV, peak SUV (SUVpeak), SUVmax, and SUVmean of the lesions were recorded. SUV90% was determined by averaging the SUV of voxels with SUV > 90% of SUVmax. The tumor-to-liver ratio (TLR) and tumor-to-blood ratio (TBR) were calculated by dividing tumor SUVmax with SUVmean of background liver and blood pool, respectively. All lesions were measured if there were fewer than then; otherwise, three lesions with the highest [18F]FDG uptake were measured. Two operators (S.C. and G.L.) performed the quantitative measurements independently for testing inter-operator agreements, with one (S.C.) performing the measurements twice for testing intra-operator agreements.

Subjective assessment of image quality

With the four data sets arranged side by side, two nuclear medicine physicians (Y.H. and G.L., with 5 and 6 years of working experience, respectively), blinded to the reconstruction information, independently evaluated the image quality using a slightly modified version of a method described previously [16]. First, overall image quality was scored as follows: 0, non-diagnostic; 1, numerous significant heterogeneities; 2, satisfactory, some significant heterogeneities of varying size and magnitude; 3, good, small heterogeneities visible throughout; 4, very good, subtle, tiny heterogeneities; or 5, excellent, no/minimal heterogeneities. Figure 2 shows typical images of scores 1–5. Second, the noise in the liver and the bone marrow was also scored visually using the same criteria.

Fig. 2
figure 2

Coronal PET and fused PET/CT slices on an SUV grayscale of 0–6 from two representative cases illustrating subjective assessment of image qualities. The overall score is shown below each image

Statistical analysis

Statistical analysis was performed using the SPSS 20 (IBM Corp.). The weighted kappa was calculated to evaluate inter-rater agreement in subjective image analysis, while intra-class coefficients (ICC) were calculated to assess inter- and intra-operator agreements regarding quantitative measurements. Comparisons of quantitative indices among the four PET sets were performed using one-way ANOVA with repeated measures if the Kolmogorov–Smirnov test confirmed normality; otherwise, the Freidman test was used. The Cochran Q test was performed to compare lesion detectability among the four PET sets. For pairwise post hoc comparisons, the Bonferroni corrections were applied. Lesions were divided into two size groups (by median MTV), and two location groups (esophageal/pancreatic and others). Relative differences of lesions’ quantitative indices between Gated-15 and Ungated-15/5 were calculated as \( \frac{\mid {Gated}_{15}-{Ungated}_{15/5}\mid }{Gated_{15}}\times 100\% \), and were compared between the two size and location groups, using the Mann–Whitney U test. All tests were two-sided, with p < 0.05 indicating statistical significance.

Results

Patient characteristics

Table 1 summarizes the characteristics of the patients. All patients had confirmed the pathological diagnosis: 41 (57.7%) from biopsy and 30 (42.3%) from surgery. While 68 patients had malignant lesions, 3 had benign lesions.

Table 1 Basic and clinical information of the 71 patients included in this study

Respiratory gating, image reconstruction, and prompt counts

An average of 223.8 respiratory cycles were recorded for the 15-min gated imaging; 18.0 ± 16.8 cycles failed for image reconstruction, leaving 223.8 ± 57.6 (about 92.4%) cycles effective for respiratory-gating reconstruction (Table 2). The prompt counts of each image set were estimated as follows: count rates × imaging time × percentage of bins (1/3) × percentage of effective respiratory cycles. Counts (including true, random, and scatter) were significantly lower in Gated-15 than in Ungated-15 and Ungated-5 (Table 2). However, the differences were not significant between Gated-15 and Gated-6. Interestingly, the proportion of effective counts was significantly higher in Gated-15 than in Ungated-15 (41.55% vs. 39.87%, p < 0.001) and in Ungated-5 (41.55% vs. 41.08%, p < 0.001). The proportion of effective counts was even higher in Gated-6 than in Gated-15 (41.95% vs. 41.55%, p < 0.001).

Table 2 Summary statistics of the estimated counts and respiratory cycles among the four image sets

Objective assessments of background [18F]FDG uptake

Both the inter- and intra-operator agreements regarding SUV measurements of background [18F]FDG uptake were excellent (ICC > 0.85, Table S1). The mean SUVs in the background liver and blood pool were close among the four image sets, although most of the intergroup differences were statistically significant (Table 3). Liver SUVSD and blood-pool SUVSD were significantly higher in Gated-15 than in Ungated-15. This resulted in significantly lower SNR in the liver and blood pool in Gated-15 than in Ungated-15 and Ungated-5. Even worse results were seen in Gated-6 which, compared to Gated-15, had significantly higher liver SUVSD (0.24 vs. 0.17, p < 0.001) and blood-pool SUVSD (0.16 vs. 0.10, p < 0.001), and significantly lower liver SNR (9.79 vs. 14.20, p < 0.001) and blood-pool SNR (10.80 vs. 15.87, p < 0.001).

Table 3 Comparisons of objective measurements of background uptake among the four PET image sets

Lesion detectability and [18F]FDG uptake

A total of 390 lesions were found in the thorax and upper abdomen (Table 4). Lesion detectability in gated-15 was significantly higher than in ungated-15 (97.2% vs. 93.6%, p = 0.001) and Ungated-5 (97.2% vs. 92.3%, p = 0.001), but comparable to that in Gated-6 (97.2% vs. 95.9%, p = 0.993).

Table 4 Comparisons of objective measurements of lesions among the four PET image sets

A total of 131 lesions were measured for quantitative analysis, with excellent inter- and intra-operator agreements (ICC > 0.90, Table S1). Lesions in Gated-15 presented significantly higher SUVmax, SUV90%, SUVmean, SUVpeak, TLR, and TBR, and significantly smaller MTV compared to Ungated-15 and Ungated-5 (Table 4). These differences remained even when only lung or liver lesions were considered (Table S2). In the subgroup analysis by lesion size (Fig. 3), the differences in the various quantifications between Gated-15 and Ungated-15 or Ungated-5 were more obvious among relatively smaller lesions (MTV ≤ 4326.2 mL) than among relatively larger lesions (MTV > 4326.2 mL) of this study. Negative correlations had been found between the relative differences of the various quantifications with lesion volumes (Table S3). In the subgroup analysis by lesion location, lesions in the esophagus and pancreas presented relatively smaller differences in the quantifications between Gated-15 and Ungated-15 or Ungated-5, although none of the differences had reached statistically significant (Fig. 3). However, none of these indices was statistically different between Gated-15 and Gated-6 (p3 in Table 4).

Fig. 3
figure 3

Comparisons of the gated-to-ungated differences of lesion quantifications between lesions of MTV > 4326.2 mL and ≤ 4326.2 mL (A and B), and between lesions from the esophagus/pancreas and those from other places (C and D). A and C, Differences between Gated-15 and Ungated-15. C and D, Differences between Gated-15 and Ungated-5

Subjective assessments of image quality

Inter-rater consistency in assessing the image quality was good (weighted kappa > 0.75; Table 5). Obviously, higher overall scores were identified in gated images than in ungated images, but no significant difference was found between Gated-15 and Gated-6 (4.77 vs. 4.62, p > 0.999). Figure 4 and Figure 5 show typical cases of better presentation in gated images than in ungated images of lung and liver lesions. Although the scores for liver noise and bone-marrow noise were significantly lower in Gated-15 (4.15 and 4.38, respectively) than in Ungated-15 (4.93 and 4.92, respectively), they were comparable to the scores in Ungated-5 (4.30 and 4.43, respectively) (Table 5). However, liver noise (3.16 vs. 4.15, p < 0.001) and bone-marrow noise (3.57 vs. 4.38, p < 0.001) were significantly higher in Gated-6 than in Gated-15. Figure 6 shows a typical case of the bone-marrow noise and liver noise in the four reconstructed PET images.

Table 5 Subjective assessment of the image qualities and the inter-rater consistency among the four PET image sets
Fig. 4
figure 4

Coronal slices of PET and fused PET/CT (SUV grayscale, 0–6) showing [18F]FDG-avid liver metastasis (arrows), which are easier to visualize on gated reconstructions but not definitely visible on Ungated-15 and Ungated-5 images. The Gated-15 and Gated-6 images received a score of 5 and 4, respectively, while Ungated-15 and Ungated-5 both received an overall score of 1

Fig. 5
figure 5

Coronal slices of PET, fused PET/CT and breath-hold high-resolution CT (HRCT: upper coronal; lower, sagittal) showing an [18F]FDG-avid lung cancer with obvious deformation on ungated images. This lesion can be easily misdiagnosed as two lesions on Ungated-15 and Ungated-5, which received an overall score of 2 and 1, respectively. The deformation is markedly less on Gated-15 and Gated-6 images, which received overall scores of 5 and 4, respectively. The PET images are on an SUV grayscale of 0–6

Fig. 6
figure 6

Coronal (upper row) and axial (lower row) slices of PET showing bone-marrow noise and liver noise, respectively. The bone-marrow noise is comparable among the four image sets, and thus all received a subjective score of 5. The liver noise is close among Ungated-5, Gated-15, and Gated-6, which is slightly inferior to Ungated-15; thus, subjective scores of 5, 4, 4, and 4 were given to these four image sets, respectively. The images are on an SUV grayscale of 0–6

Discussion

In this study, we investigated the performance of respiratory-gated total-body PET/CT in a patient cohort with miscellaneous suspect malignancies. The 15-min gated PET imaging outperformed the 15-min ungated and the 5-min “count-matched” ungated PET imaging, in lesion detectability, presentation, and quantification, with comparable overall image quality. Despite the reduced acquisition time of 6 min, respiratory-gated imaging performed as well as 15-min gated imaging.

The results of this study show significance to the clinic. According to the European guideline, for traditional full-activity imaging (~3.7 MBq/kg), a PET system with short AFOV of 15–25 cm and an imaging protocol with a bed overlap of > 30% and an acquisition time of 2 min/bed for the body and 5 min/bed for the brain, would require an acquisition time over 15 min for a whole-body PET imaging [13]. If respiratory gating were to be applied for the two bed positions in the upper abdomen and lower thorax, the acquisition time would be tripled with the use of hardware-based gating [17] or doubled with the use of data-driven gating [7]; a total time of 19–23 min would be required. This is not a suitable protocol for oncological imaging, especially in seriously ill patients who cannot tolerate prolonged immobilization. The reduced time of 6 min for respiratory-gated total-body PET imaging, as identified in our study, would be more practical in the clinic. This result is consistent with our previous experience in ungated total-body PET imaging as the Gated-6 just matched the count of the 2-min protocol recommended for a full-activity ungated imaging [12].

A cost of respiratory gating is the reduced counts for image reconstruction with increased image noise. In our study, increased noise in the liver and blood pool leading to reduced SNR of image were identified in gated images compared to ungated images. In addition, the subjective scores of liver and bone marrow noise were lower in Gated-15 than in Ungated-15. The image SNR and subjective scores of noise were even lower in Gated-6. However, SNR of background liver in both Gated-15 (14.20 ± 1.98) and Gated-6 (9.79 ± 1.37) were relative high when compared to the SNR in PET images (full-activity, 2 min/bed) with clinically acceptable qualities (liver SNR: 8.3 ± 1.2) obtained using conventional PET scanner with the state-of-art configurations [18]. Furthermore, when compared with total-body PET images in our previous studies, the SNR of background liver and blood pool, and the subjective scores of liver noise in both Gated-15 and Gated-6, exceeded the corresponding results of 2-min full-activity images and fulfilled the requirement for clinical diagnosis [19, 20]. One method to reduce noise is to increase the injected activity to gain better counting statistics for compensating the count loss caused by failure of respiration capture. This is why we used a relatively higher dose of [18F]FDG (4.4 MBq/kg instead of 3.7 MBq/kg) [13]. Another method to reduce image noise is to reduce the number of iterations to two; we are exploring this possibility also.

Another important factor that hinders the clinical application of respiratory gating is the strict requirement for regular respiration. In our study cohort, approximately 7.6% of respiratory cycles were unavailable for use in image reconstruction. The proportion would be much higher for seriously ill oncological patients whose respiration can be very disordered.

As for image counts, it is reported that at least 10 million true counts are required for the accurate quantification of lung lesions on PET images [21]. Schaefferkoetter et al found that for obtaining image quality and lesion detection equal to that with full-count imaging, the minimum requirement was 5 million effective counts [22]. In our study, the mean true counts of Gated-15 and Gated-6 were 410 and 290 million, respectively, and the mean effective counts were 650 and 480 million, respectively, which were both significantly higher than these requirements. Interestingly, the gated images presented significantly higher percentages of effective counts than in ungated images in our study (Table 2). This might have compensated for the reduced total counts and led to the comparable subjective liver- and bone-marrow-noise scores in Gated-15 and Ungated-5. Given the dramatically high amount of counts detected, it is likely that the acquisition time can be reduced further if the data-driven-gating technique is used [16].

In PET/CT imaging, malignant lesions commonly show higher tracer uptake than adjacent areas. Therefore, for small lesions, the respiratory motion would cause the measured prompts to be lower as a larger amount of adjacent normal areas would be included than would be the case with large lesions. In our study, more obvious underestimations of quantification in ungated imaging were found in relatively small lesions; this is consistent with previous studies [23, 24]. Another factor affecting the efficiency of respiratory gating is the lesion location. For example, lesions in the mediastinum or retroperitoneum are relatively invulnerable to respiration motion compared to lesions in other areas; this was confirmed in our study.

Several limitations of this study should be mentioned. First, we did not investigate how much further the acquisition time of respiratory-gated imaging could be reduced without compromising image quality. Nevertheless, a total time of 6 min for a total-body respiratory-gating PET imaging is dramatic. Second, the slightly higher dose of tracer used in this study increases radiation to the patient. However, the increase of tracer activity from 3.7 to 4.4 MBq/kg would only bring about an increase of 0.0133 mSv/kg of radiation, i.e., accounting for about 0.8 mSv for a 60-kg patient as calculated by the method suggested in the ICRP Publication 106 [25]; this slight increase of radiation may not be a major concern for patients with malignancies. Third, we did not include patients without pathologically confirmed diagnosis aiming to confine the study subjects to be a patient set with definitive diagnosis in order to guarantee a robust result that could be likely extrapolated to similar patients from other institutions using similar PET/CT scanner. Finally, we did not apply the data-driven-gating technique, although its great potential in total-body PET imaging has been indicated in recent studies [16, 26].

To conclude, [18F]FDG PET/CT imaging using the total-body PET scanner allows for respiratory-gated imaging with dramatically reduced scanning time while providing image quality superior to that obtained with ungated time- or count-matched imaging. Therefore, there should be increased use of respiratory gating in clinical PET imaging.