Introduction

Primary bone tumors make up 5–6% of childhood and adolescent malignancies, with the majority being osteogenic sarcoma (OS) and Ewing sarcoma (ES) [1].

Important considerations for determining prognosis and subsequent management include tumor histology and grade, the location of the primary tumor, the extent of loco-regional and metastatic disease, and response to chemotherapy [2]. The extent of loco-regional disease is usually defined on MRI [3]; however, these tumors, particularly in the Ewing group, demonstrate a variable degree of adjacent soft tissue abnormality that can make assessment of the extent of the primary tumor difficult. PET/CT imaging using the glucose analogue 18F-fluoro-2-deoxy-d-glucose (FDG) may provide relevant ancillary information to conventional imaging (CI) and identify more accurately the extent of disease at diagnosis.

Response of the primary tumor to chemotherapy is not assessed accurately by MRI [4]. FDG PET/CT has the potential to provide a non-invasive, in vivo imaging technique to evaluate tumor necrosis and provide an opportunity to identify children likely to benefit from further adjuvant therapy before surgery.

Studies in adults with primary bone and soft tissue sarcomas confirm the accuracy of FDG PET/CT in the detection of malignant lesions [5, 6], assessing histological response to chemotherapy [7] and predicting survival [8, 9]. Studies have also been published that incorporate children with bone and soft tissue sarcomas and show FDG PET/CT to be clinically useful, provide additional information to CI and alter management [10, 11], but evidence from childhood studies is lacking.

Our aims were to evaluate the performance of FDG PET/CT compared to CI in detecting malignant lesions, predicting histological response to chemotherapy and diagnosing physeal involvement in children with primary bone tumors.

Materials and methods

Subjects

The electronic database of all PET/CT scans performed at the Children’s Hospital at Westmead, Sydney, Australia during the period June 2006 to December 2008 was searched to identify all children with OS and ES. The histologic diagnoses were confirmed and imaging reports and histopathological results following surgery were analyzed retrospectively. To be eligible for inclusion, correlative CI (CT, US, MRI and/or bone scintigraphy) of lesions was required within 4 weeks of PET/CT, and clinical follow-up for at least 6 months after PET/CT.

PET/CT technique and analysis

PET/CT scans were carried out using a Siemens Biograph16 HiRez PET/CT system (Siemens Medical Solutions, Erlangen, Germany). All children fasted for 4–6 h before injection of 370 MBq (scaled to weight) of FDG and had blood glucose levels within the normal range. During the uptake phase, the children rested in a warm, dimly lit room lying down with minimal stimulation. Acquisition time was 2 min per 16 cm axial field of view for whole-body imaging.

PET/CT scans at our institution are reported after review by two pediatric nuclear medicine physicians trained in PET/CT interpretation. All relevant CI and clinical information are available at the time of reporting. Lesions reported as positive, suggestive, or equivocal for malignancy were considered positive for the analysis.

For the purpose of our study, if the reported findings were ambiguous, then PET/CT scans were reviewed by a pediatric nuclear medicine physician (R.H.G., 35 years of experience or K.L., 4 years of experience) to determine the final classification of lesions.

CI evaluation and analysis

CI was performed according to the patient’s clinical requirement and consisted of CT, US, MRI, and/or bone scintigraphy. Studies reported as positive, suggestive, or equivocal for malignancy were considered positive for the analysis.

If the reported findings were ambiguous, then CI scans were reviewed by a pediatric radiologist (E.O., 21 years of experience) to determine the final classification of lesions.

Comparison of PET/CT and CI

A lesion-based analysis was used to determine the performance of PET/CT and CI in detecting malignant lesions. Imaging performed at diagnosis, at completion of treatment and during routine surveillance was included. Findings for PET/CT and CI were each compared to a defined reference standard. The reference standard was histopathological evaluation, if available. If lesion histopathology was not performed, further imaging combined with clinical follow up for at least 6 months was used to determine the lesion reference standard classification. For example, in the case of a child entering remission, lesions that resolved on follow-up imaging were considered malignant and lesions that remained stable or progressed were considered non-malignant. In the case of a child experiencing disease progression, lesions that progressed were considered malignant and lesions that resolved or remained stable on follow-up imaging were considered non-malignant.

PET/CT and MRI performed at diagnosis and following neo-adjuvant chemotherapy before surgery were analyzed to determine the ability of these modalities to predict histological response of the primary tumor. PET/CT was aimed to be performed at least 3 weeks after the last dose of chemotherapy. Semi-quantitative assessment of primary tumor FDG uptake was performed by calculation of maximum standardised uptake value (SUVmax) using a volumetric analysis tool (Siemens ESOFT workstation, Erlangen, Germany). An ellipsoid volume of interest was placed manually encompassing the primary tumor and the SUVmax at diagnosis and following chemotherapy were recorded. On MRI in the coronal plane, the maximal cranio-caudal diameter (diameterCC) and maximal transverse diameter (diameterT) of the primary tumor was recorded. If this data was not available on the scan reports, the MRI was reviewed by a pediatric radiologist (E.O.) and the tumor measurements were taken. The maximal cross sectional area (CSAmax) at diagnosis and following chemotherapy was calculated according to the formula CSAmax = diameterCC × diameterT.

SUVmax at diagnosis, and the change in SUVmax and CSAmax following chemotherapy were compared to histopathologic evaluation of viable tumor after en block primary tumor resection after chemotherapy. The change in SUVmax and CSAmax were expressed as a ratio of these parameters after chemotherapy to before chemotherapy. Less than 10% residual viable tumor was used to classify the response as “good” and greater than or equal to 10% residual viable tumor was used to classify the response as “poor”.

The children with primary tumors involving metaphyses were identified and the reported involvement of the physis on PET/CT and MRI was recorded. If physeal involvement was not commented on specifically in the imaging reports, the scans were reviewed (PET/CT by R.H.G., MRI by E.O.) for final classification. Imaging findings were compared to histopathologic evaluation of physeal involvement following en block resection of the primary tumor after chemotherapy.

Statistical analysis

Standard 2 × 2 tables were constructed of the relevant data. Sensitivity, specificity, accuracy and positive likelihood ratio (PLR) were calculated; 95% confidence intervals for each parameter were calculated using an online calculator adhering to standard statistical methods [12]. A non-parametric test for statistical difference (Mann-Whitney-U test) with alpha = 0.05 was used to compare predictors of treatment response (SUVmax data and CSAmax data) between FDG PET/CT and CI. Statistical computer software was used for the analysis (SPSS version 19, IBM). No statistical analysis was performed on the data for predicting physeal involvement due to the very small number of children included in this evaluation.

Ethical approval for the study was obtained from The Human Research Ethics Committee of The Children’s Hospital at Westmead.

Results

A total of 155 PET/CT scans in 41 children (26 male) with OS (n = 20) and ES (n = 21) with an average age at the time of the scan of 12.7 years (range, 1.8 years–18.7 years), had been performed during the review period. In total, 616 lesions were identified on PET/CT and/or CI; 16 scans were performed at diagnosis, 2 at relapse, 24 during therapy, 14 at the end of therapy and 99 during routine surveillance. In total there were 12 different primary tumor sites (Table 1). Eight children had metastatic disease at time of diagnosis: four with lung metastases, one with a bone metastasis and three with both lung and bone metastases.

Table 1 Primary tumor site and histopathological type. OS Osteogenic sarcoma, ES Ewing sarcoma

In all, 49 scans (248 lesions) in eight children were excluded due to follow-up of less than 6 months (42 scans) or no corresponding CI within 4 weeks of the PET/CT (7 scans). There remained 106 scans (368 lesions) in 33 children eligible for inclusion in the study, of which 86 scans (314 lesions) in 33 children were used for assessing diagnostic accuracy (performed at diagnosis, at completion of treatment and during routine surveillance).

Diagnostic accuracy

The individual lesion-based results are shown in Table 2. The sensitivity, specificity, accuracy and PLRs with corresponding 95% confidence intervals for PET/CT and CI are presented in Table 3.

Table 2 Lesion-based 18F-fluoro-2-deoxy-d-glucose (FDG)-PET/CT and conventional imaging (CI) in the diagnosis of malignant lesions. TP True positive, TN true negative, FP false positive, FN false negative
Table 3 Diagnostic performance of FDG-PET/CT and CI in detecting malignant lesions. Values are expressed as percentages with 95% confidence intervals. PLR Positive likelihood ratio

Overall, 14/86 (16%) of the scans had at least one lesion with discordant PET/CT and CI findings and 22/314 (7.0%) of the total lesions detected had discordant PET/CT and CI findings. PET/CT was correct in 15 of these 22 (68%). The lesions diagnosed incorrectly on PET/CT and/or CI are described in Table 4 (the false negative findings) and Table 5 (the false positive findings) and in Figs. 1, 2 and 3.

Table 4 Lesion-based false negative findings at FDG-PET/CT and CI
Fig. 1
figure 1

Example of a discordant conventional imagine (CI) false negative lesion. a Staging 18F-fluoro-2-deoxy-d-glucose (FDG) PET, (b) corresponding low dose attenuation CT, and (c) fused PET/CT of a 13-year-old boy with osteogenic sarcoma (OS). Intense FDG uptake with disrupted bone cortex and soft tissue is seen involving in the primary tumor site in the right distal femur (arrow). There is also a small intense focus of increased FDG uptake in the right proximal tibial epiphysis medially (arrowhead), without abnormality seen on the low dose attenuation CT scan. No abnormality was seen on the MRI scan in the right proximal tibia (image not shown). A metastasis in the right proximal tibial epiphysis was confirmed by histopathology following surgery

Fig. 2
figure 2

Example of a concordant PET/CT and CI false negative lesion. Cerebral T2 weighted MRI (upper row), FDG PET (middle row) and fused co-registered PET/MRI (lower row) of a 17-year-old boy performed at two time points following therapy for pulmonary relapse of right tibial osteogenic sarcoma. The child had a transient encephalopathy attributed to a side effect of chemotherapy prompting the cerebral imaging. a (left panels) Initial imaging showed a high signal abnormality in the para-sagittal anterior left frontal lobe (arrow) thought to represent an area of gliosis and atrophy associated with previous therapy. The FDG PET and fused co-registered PET/MRI show the lesion to be relatively hypometabolic (arrow). b (right panels) Follow up imaging performed 6 months later shows the development of a large hypermetabolic lesion in the left frontal lobe. There were also pulmonary and multiple skeletal metastases (images not shown)

Fig. 3
figure 3

Example of a CI false positive PET true negative lesion in a 15-year-old girl with previous Ewing sarcoma (ES) of the right 4th rib treated with rib excision, chemotherapy and radiotherapy. a During surveillance, an MRI scan showed a 2.5 cm rim-enhancing fluid-containing lesion anterior to the subscapularis muscle (arrow) thought to potentially represent relapse. b At the same time, FDG PET/CT showed the lesion to have very low level tracer uptake (arrowhead)—not greater than mediastinal blood pool. The lesion resolved on further imaging without cancer-directed therapy and was attributed to a trauma-related seroma

Table 5 Description of the lesion-based false positive findings at FDG-PET/CT and C)

Tumor response to chemotherapy

Seventeen children had PET/CT for disease staging at diagnosis or relapse before therapy. SUVmax at initial diagnosis ranged from 5.3 to 42.8 (mean 12.2, median 8.2).

For the evaluation of PET/CT parameters in predicting histological response of the primary tumour, nine children were excluded: two for restaging of lung relapse, two without PET/CT performed after chemotherapy and five with histopathology not available or not performed (one base of skull tumor, two pelvic tumors, one rib relapse, one who refused surgery). The eight children remaining were included in the analysis. Of these eight, six had good and two poor response (Table 6). Examples of PET/CT and MRI imaging of good and poor response are shown in Figs. 4 and 5, respectively.

Table 6 Children included in the analysis of tumor response to chemotherapy. SUV max 1 Maximum standardized uptake value at diagnosis, SUV max 2 maximum standardized uptake value after chemotherapy, Diameter CC 1 maximal cranio-caudal diameter at diagnosis, Diameter T 1 maximal transverse diameter at diagnosis, CSA max 1 maximal cross sectional area at diagnosis, Diameter CC 2 maximal cranio-caudal diameter after chemotherapy, Diameter T 2 maximal transverse diameter after chemotherapy, CSA max 2 maximal cross sectional area after chemotherapy
Fig. 4
figure 4

Good histological tumor response in a 13-year-old male with OS involving the right proximal tibia. a Prior to therapy the PET/CT shows a markedly hypermetabolic tumor involving the right proximal tibia with SUVmax1 = 17.5. b The corresponding MRI at diagnosis showed an expansile, exophytic lesion in the lateral right proximal tibial metaphysis. c After chemotherapy, the PET/CT shows a marked reduction in tracer uptake with SUVmax2 = 2.1 and SUVmax2 :SUVmax1 = 0.12 ,indicating a very good response. d MRI after chemotherapy showed a slight reduction in overall size, and gadolinium enhancement suggesting poor response. Histopathology revealed <2% viable cells confirming a good histological response to chemotherapy

Fig. 5
figure 5

Poor histological tumor response in an 11-year-old male with OS involving the distal left femur. a The staging PET/CT prior to therapy shows the tumor, demonstrating heterogeneous increased FDG uptake with SUVmax1 = 10.6. b The corresponding MRI shows a large soft tissue mass centred in the left vastus medialis muscle with periosteal reaction in the adjacent femur. It extends from the mid femur to the distal metaphyseal region with bone marrow involvement. c After chemotherapy, the PET/CT shows a significant reduction in the tumor extent and a persisting focal hypermetabolic region medially with SUVmax2 = 6.1 and SUVmax2 :SUVmax1 = 0.58 indicating a poor response. d The MRI shows significant reduction in overall size and gadolinium enhancement of the tumor, suggesting good response. Histopathology after chemotherapy showed 25% viable malignant cells, confirming a poor histological response

SUVmax values at diagnosis and after chemotherapy were available for all eight children. There was a trend for SUVmax at diagnosis to be higher in those tumors that responded well (median, 14.5) than in those with a poor response (median, 8.0; P = 0.096). In all children, SUVmax was reduced following chemotherapy. However, tumors with good response appeared to have a greater reduction in SUVmax with a median ratio post/pre of 0.24, compared to 0.55 in the poorly responding tumors (P = 0.18). The largest SUVmax decrease was found in the lesion with the highest initial SUVmax (Table 6).

CSAmax values were not available for one of the well-responding tumors due to MRI scans being performed at another institution. Seven children were included in the MRI analysis, five with good response and two with poor response to chemotherapy. There was no consistent pattern of change in size of the primary tumor seen on MRI between the response groups. Two tumors with good response increased in size and three reduced in size. Both tumors with poor response reduced in size (Table 6).

Assessment of physeal involvement

Assessment of physeal involvement (Table 7), revealed that six children had a primary tumor in an extremity, involving the metaphysis, had documented histopathological assessment including the physis, and had complete PET/CT and MRI data available. All of these appeared to have physeal involvement on PET/CT (two false positives) compared to five children on MRI (one false positive). The child without physeal involvement on MRI had concordant histopathological assessment. An example of PET/CT and MRI imaging in evaluation of physeal involvement is shown in Fig. 6.

Table 7 PET/CT and MRI for detection of physeal involvement by metaphyseal tumors. R Right, L left
Fig. 6
figure 6

Example of a PET false positive, MRI true negative for detecting epiphyseal involvement. The images were performed at diagnosis of a 12-year-old boy with right proximal fibular ES. a FDG PET/CT shows intense tracer uptake in the primary tumor appearing to breach the growth plate medially. b Corresponding MRI scan shows abnormal low signal in the tumor extending up to but not breaching the growth plate. Post operative histopathology did not show epiphyseal tumor involvement

Discussion

Our study of children with primary bone tumors showed that FDG-PET/CT had higher specificity but lower sensitivity in detection of malignant lesions when all lesions were considered. The reduced sensitivity of PET/CT was most apparent in the assessment of lung lesions, with the three out of six false negative PET/CT-lesions being lung lesions < 10 mm in diameter. CI (here, chest CT), while being more sensitive, had a considerably lower specificity with a higher rate of false positive findings in the lungs. Of the 16 false positive lesions seen on CI, 10 were small parenchymal lung lesions. Our findings are consistent with studies of osteosarcoma in adults and with studies of solitary pulmonary nodules in general [13, 14]. There is likely a combination of explanatory factors, including lung nodules below the limit of resolution of PET/CT, partial volume effects and respiratory movement. High-resolution chest CT remains the most sensitive tool for small nodules. The higher specificity of FDG PET/CT reinforces the complementary roles of these imaging modalities in the assessment of lung lesions (Fig. 7).

Fig. 7
figure 7

Characterising lung lesions in children with primary bone tumors remains challenging on both FDG PET/CT and dedicated diagnostic CT. Four cases of lung lesions with variable findings on CI (diagnostic CT) and FDG PET/CT are presented (ad). In each case the upper image is the diagnostic CT, the middle image is the low dose attenuation CT component of PET/CT, and the lower image is the corresponding attenuation corrected FDG PET. Note that the diagnostic CT studies were performed during an inspiratory breath hold with arms raised, while the FDG PET/CT was performed during normal tidal respiration with arms at the patient’s side. a Concordant true positive lung lesions. At diagnosis, several nodular opacities were seen in the superior segment of the right lower lobe apparent on both the diagnostic CT (arrow) and the low dose attenuation CT component of the PET/CT (arrow) and demonstrated increased FDG uptake (arrow). b Discordant CI false positive lung lesion. At diagnosis, a 3-mm subpleural nodule was identified on diagnostic CT and considered suspicious for metastasis (arrow). The lesion was not apparent on either the low dose attenuation CT or FDG component of the PET/CT study. Diagnostic CT hookwire and thorascopic guided biopsy revealed a benign subpleural lymph node. c Concordant false positive lung lesions. During surveillance there was a new 5-mm nodule identified in the superior segment of the left lower lobe seen on both diagnostic CT (arrow) and the low dose attenuation CT component of the PET/CT (arrow). Although there was no apparent corresponding increase in FDG uptake, it was considered suspicious for recurrent malignancy. The lesion resolved on subsequent imaging without further cancer directed therapy and was regarded as a benign inflammatory nodule. d Discordant PET/CT false negative lung lesion. During surveillance, a new 7-mm sub-pleural nodule was identified in the anterior aspect of the left lower lobe on diagnostic CT (arrow). The lesion appeared more diffuse on the low-dose attenuation CT component of the PET/CT (arrow) without increased FDG uptake and was not identified as suspicious for recurrent metastasis. The lesion was proven by biopsy to be recurrent ES. The diffuse FDG accumulation seen in a location more medial to the lung lesion localised to physiologic brown fat (open arrow)

With the exclusion of lung lesions, PET/CT was both more sensitive and specific than CI for malignant lesions. Positive and negative predictive values have an intimate reliance on the disease prevalence in the study population and therefore may not be applicable to populations with a different underlying risk of disease. PLRs are a way to express the overall usefulness of a diagnostic test to confirm the presence of a disease that combines both test sensitivity and specificity, and is independent of the disease prevalence in the underlying study population [15]. In general, a PLR > 10 indicates a conclusive increase in the probability of having the disease [16]. Our results show very high PLRs for both PET/CT (44.7) and CI (24.2) in evaluating lesions outside of the lungs. For lung lesions, CI had a lower PLR (7.4) in the “moderately useful” range [16]. This was due to the high false-positive rate. PET/CT however maintained a very high PLR (19.2) for assessing lung lesions despite the lower sensitivity, indicating that when positive, it is a more useful than CI.

Morphological assessment by plain radiographs and MRI following neoadjuvant chemotherapy is insufficiently correlated to histological response, and should therefore not form the sole basis for decisions to escalate preoperative treatment [4]. FDG-PET/CT may become an accurate non-invasive test of response evaluation, analogous to current mid-treatment assessment in children with lymphoma [17].

FDG PET/CT combined with MRI in osteosarcoma has been demonstrated to be potentially useful in predicting histological response. It may therefore prove helpful for planning of surgery and tailoring of treatment [18]. The most important prognostic factor is tumor necrosis following neoadjuvant chemotherapy [19].

Early studies, predominantly adults with osteosarcoma and Ewing sarcoma, have looked at reduction in tumor/non-tumor tracer uptake before and after neo-adjuvant chemotherapy. These have showed promising results for FDG-PET in providing an accurate in vivo method for evaluating tumor response [20, 21]. We used SUVmax as the FDG-PET/CT parameter in our response assessment. SUVmax is more reliable than tumor-to-background ratios for semi-quantitative measurement of FDG-uptake and has low inter-observer variability in the characterization of sarcomas, which often demonstrate considerable heterogeneity in metabolic activity [22].

Our study raises the possibility, but does not confirm, that a high initial SUVmax may predict a better response. If true, then children with low initial tumor SUVmax would have a worse prognosis or be at higher risk of relapse. This finding conflicts were previous findings that sarcomas with higher pretherapy SUVmax had significantly poorer prognosis and that osteosarcomas with higher tumor/non-tumor uptake had poorer outcome [23, 24]. Both these studies were, however, primarily in adults.

A more recent study involving nine patients (five children) with osteosarcoma [25] showed a larger mean percent reduction in SUVmax after chemotherapy in patients with >90% necrosis compared to those with <10% necrosis: 74% reduced compared to 26% reduced, respectively. This supports our findings that the relative reduction in SUVmax (expressed in the current study as SUVmax2:SUVmax1 ratio and representing the proportion of metabolic activity remaining after chemotherapy) rather than the absolute value of SUVmax following chemotherapy may be a stronger predictor of tumor response. Using a cut-off point of SUVmax2:SUVmax1 < 0.5 to indicate a good response, Hamada et al. [25] determined the positive and negative predictive values were 80% and 100%, respectively, albeit with a very small dataset. Of the five children in this study, only one had poor response to therapy.

The largest study looking at response assessment in osteosarcoma by FDG-PET analysed 40 patients up to the age of 31 years (median, 15.1 years) with extremity tumors. Although they found that an SUVmax cut-off following chemotherapy of < 2.5 was a significant predictor of improved outcome, there was only a weak correlation when using a cut-off of > 50% reduction in SUVmax as a marker for good response (> 90% necrosis) [26]. The same group in an earlier study analysed 34 patients (up to 46 years old) with ES with similar findings [27]. The discrepancy in the effectiveness of relative reduction in SUVmax predicting tumor response between these reports and the study by Hamada et al. [25] may be due to a larger proportion of older patients in their cohorts.

Children with primary bone tumors probably constitute a biologically different group, distinct from adults, potentially rendering the translation of evidence from adult studies to pediatric practice invalid. Larger pediatric studies are warranted.

MRI has been shown previously to be accurate in the assessment of physeal tumor involvement [28]. Our review of six children supports this finding. Although we had a very small number of children in the comparison of PET/CT and CI, both modalities were very sensitive to detect physeal tumor involvement; however, MRI appeared to be more specific, with one false positive compared to two false positive results with PET/CT.

A major limitation of our study is the small number of eligible children included in the analysis, especially in the evaluation of tumor response parameters and assessment of physeal involvement. Correspondingly, there is overlap of the 95% confidence intervals in the measures of diagnostic test performance between PET/CT and CI; and the difference in the PET/CT and MRI parameters in predicting tumor response were not statistically significant. Larger studies with greater statistical power would be required to prove the trends seen in our study.

The reports of both PET/CT and CI were generated with full knowledge of all other imaging and clinical information available at the time of the respective scan. There was the potential for the reported findings of one modality to bias the reporting of the other. A blinded re-assessment of all imaging may have produced more valid results but would not reflect everyday clinical practice.

For obvious ethical reasons, not all lesions detected in our study were biopsied. We therefore had to rely in part on (clinical) follow-up as a reference standard. This incorporated the change (resolution or progression) of each lesion as seen on follow-up imaging. The application of the reference standard is therefore not independent of the imaging modalities being evaluated, giving the potential for systematic reference standard misclassification. As this methodological limitation is unlikely to be satisfactorily solved, it is of particular importance that studies evaluating diagnostic performance clearly and thoroughly explain how the reference standard (in addition to classification of a “positive” and “negative” scan result) is classified, as has been done in our report. In fact, there are suggested guidelines on the reporting of studies of diagnostic accuracy that should be adhered to wherever possible [29].

Conclusion

In children with primary bone tumors, PET/CT appears to be more sensitive and specific in evaluating lesions outside of the lungs. While less sensitive for detecting malignant lung lesions, the higher specificity of PET/CT renders it more useful than CI in confirming lung metastases when positive. On PET/CT, SUVmax reduction may be able to predict response to neo-adjuvant chemotherapy. Change in tumor size on MRI does not appear to be predictive of tumor response. Both PET/CT and MRI may be very sensitive but poorly specific in predicting physeal involvement of primary metaphyseal tumors.