Introduction

Classical Hodgkin lymphoma (cHL) accounts for approximately 8% of newly diagnosed lymphomas, with a rather stable incidence [1]. It is nowadays a highly curable disease, with lasting remissions in more the 70–80% of patients after first-line therapy [2]. Treatment strategy is based on Ann Arbor stage and risk classification, so the pre-therapeutic work-up is crucial to avoid over-treatment in limited-stage patients and under-treatment of advanced-stage patients.

The Ann Arbor classification represented a cornerstone in cHL staging [3]; in 1989, staging was reviewed in the Cotswold meeting in order to limit invasive procedures, thanks to the improvement of imaging diagnostics [4]. Bone marrow involvement in cHL is not frequent, accounting for 4–11% of new patients [5] and is usually associated with the presence of B symptoms [6, 7]. Iliac crest bone marrow biopsy (BMB) is an invasive procedure for staging of patients with newly diagnosed cHL according to the 2010 ESMO guideline [8]. However, BMB investigates only a limited volume of bone marrow, and so it could miss partial, localized lymphomatous infiltration [9]. Moreover, BMB is associated with a small risk of complications, including hemorrhages and infections in the bioptic site (in up to 0.12% of patients) [10], and it may cause anxiety and pain to the patient [11, 12].

18F-fluorodeoxyglucose (18F-FDG) positron emission tomography/computed tomography (PET/CT) is today recommended in the initial staging of cHL [13], response assessment after treatment [14], and for prognostic assessment as an interim scan [15]. A whole-body, non-invasive diagnostic instrument, PET/CT exhibits impressive accuracy in the identification of nodal and extranodal disease, including skeletal lesions [13, 16], leading frequently to an upstaging of patients when compared to CT scan alone.

Several studies have evaluated the role of BMB in the PET/CT era [17,18,19,20,21,22,23,24,25,26,27]. The largest one, by El-Galaly et al., has retrospectively reported data on the diagnostic and prognostic value of routine BMB in 454 treatment-naive patients with cHL undergoing PET/CT staging, showing that in light of the results of BMB, no patient was upstaged or had a change in treatment [21]. Recently, a meta-analysis evaluated the diagnostic performance of PET/CT in detecting bone marrow involvement in cHL patients, concluding that PET/CT could appropriately substitute BMB in the staging of cHL [28]. With the aim of increasing the knowledge of this issue, we performed a multi-centric, retrospective study on consecutive cHL patients referring to hematology centers of the Fondazione Italiana Linfomi, aimed at establishing the concordance between BMB and PET/CT in the detection of bone marrow involvement in cHL and to evaluate the impact of BMB in the staging of patients with cHL.

Methods

Patients

Consecutive cHL patients diagnosed from January 2007 to December 2013 referring to 16 hematological centers of the Fondazione Italiana Linfomi were included in this retrospective study. The study was approved by the ethical committee of the coordinating center (Florence, Italy) and was conducted according to the Helsinki Declaration. Informed consent was obtained from all individuals participants included in the study. Data were collected from electronic database and patient charts. All data were collected in an anonymized format, in compliance with institutional and national requirements.

Patients more than 14 years old were included in the study if both PET/CT and BMB had been performed at baseline. The exclusion criteria were (i) diagnosis of nodular lymphocyte-predominant HL, (ii) HIV positivity, (iii) other previous or concomitant malignancy, (iv) and a follow-up of less than 3 months after the end of therapy.

All patients were evaluated according to the Ann Arbor staging system by the combination of both PET/CT and BMB and by the use of PET/TC only [2]. Risk stratification was performed according to the categories identified by the German Hodgkin Study Group (GHSG) and, for patients in advanced stage, to the Hasenclever score [29, 30]. Response to treatment was defined according to the International Working Group criteria [31].

Evaluation of BMB

BMB had been performed unilaterally or bilaterally according to local policies and were evaluated by local pathologists without a central review. Minimal requirements for immunohistochemical staining were pan-B, pan-T, CD30, and CD15. BMB was repeated at the end of the treatment if positive at baseline [31].

Evaluation of PET/CT

PET/CT were performed according to local protocols and scanner procedures. PET/CT were performed by co-registration of PET and multislice CT in a single, whole-body session; a low-dose CT was employed in most cases. Scans were performed from skull base to mid-tights and were registered 60–90 min after the intravenous injection of 18F-FDG; different doses of 18F-FDG were administered depending on the type of PET/CT scanner. PET/CT was repeated at the end of treatment [31] and, in most cases, after two cycles of treatment with ABVD as prognostic interim evaluation [14].

Interpretation of skeletal lesions

Due to the retrospective nature of this study, investigators could decide to further evaluate focal skeletal lesions according to their internal policies. In absence of a histological verification via guided biopsies or supplementary radiological studies, focal skeletal lesions were considered as a disease involvement if a focal 18F-FDG uptake above liver 18F-FDG uptake disappeared or persisted, according to other involved sites, in PET/CT studies performed after chemotherapy [32]. Focal skeletal lesions were categorized as unifocal (a single PET-positive site), bifocal (two PET-positive sites), and multifocal (three or more PET-positive lesions). A widespread marrow 18F-FDG uptake above liver 18F-FDG uptake in absence of focal skeletal lesions defined as above was not considered as a disease involvement. PET/CT scans of patients with focal skeletal uptakes were locally reviewed by a different nuclear medicine physician; no central review was performed.

Statistical analysis

Measures of diagnostic performance, such as positive predictive value (PPV), negative predictive value (NPV), sensitivity, and specificity, were estimated with the corresponding confidence intervals, based on the obtained results and on the number of false negatives (FN) and false positives (FP). Lastly, the agreement in the definition of the Ann Arbor stage between PET/CT plus BMB and PET/CT alone was assessed, evaluating also whether the omission of BMB could modify the patient management.

Results

We retrospectively collected data from 1244 patients with cHL diagnosed in 16 hematology centers of the Fondazione Italiana Linfomi; 159 patients were excluded because of a lack of baseline BMB or PET/CT. Patient characteristics are reported in Table 1. The median age of the 1085 evaluable patients was 33 years (range 14–89 years), and 567 patients (52.3%) were male. Only 47 patients were less than 18 years old, with a median age of 17 years. As expected, nodular sclerosis was the most common histological subtype (769 patients, 70.9%); 455 patients (41.9%) reported B-symptoms at presentation. One hundred ninety-one patients (17.6%) underwent a bilateral BMB, while in the large majority of patients (890 patients, 82%), BMB was unilateral. A positive BMB was reported in only 55 patients (5.1%), without differences between patients who underwent unilateral or bilateral BMB. All analyzed patients reached the minimum required follow-up of 3 months.

Table 1 Baseline patient characteristics

The analysis of skeletal 18F-FDG uptake is summarized in Table 2. Overall, the PET/CT scans featured at least a focal skeletal lesion in 169 patients (15.6%); in particular, 56 patients showed a single focal lesion, 32 had two focal lesions, and 81 presented three or more focal lesions. Considering the patients with a negative BMB, 48 (4.7%) showed a single bone lesion, 30 (2.9%) had two lesions, and 57 patients (5.5%) displayed three or more lesions. Of the 55 patients with a positive BMB, eight (14.6%) had a unifocal skeletal 18F-FDG uptake, two (3.6%) a bifocal involvement, and 24 (43.6%) a multifocal localization. A diffuse and homogeneous marrow 18F-FDG uptake was observed in 53 patients (5.1%) with a negative BMB and in nine patients (16.3%) with a positive BMB. Nonetheless, all of them have been considered a stage IV with PET/CT only because of other extranodal 18F-FDG uptake sites (seven lung lesions, two liver lesions).

Table 2 Correlation between skeletal lesions identified by PET/CT and bone marrow biopsy results

The sensitivity and specificity of focal skeletal PET/CT lesions for detection of positive versus negative BMB were 61.8% (CI 95%, 47.7–74.6) and 86.9% (CI 95%, 84.7–88.9), respectively. PPV and NPV of focal skeletal PET/CT lesions for detection of positive BMB were 20.1% (CI 95%, 14.3–27.0) and 97.7% (CI 95%, 96.5–98.6), respectively.

We also evaluated the sensitivity, NPV, and the diagnostic accuracy of both BMB and PET/CT under the assumption that both of them are separately sufficient to reveal true bone or marrow disease. For BMB, the sensitivity was 28.9% (CI 95%, 26.3–31.8), NPV was 86.9% (CI 95%, 84.7–88.8), and the diagnostic accuracy was 88% (CI 95%, 84.5–89.0%). For PET/CT, the sensitivity was 86.5% (CI 95%, 84.3–88.5), NPV was 97.8% (CI 95%, 96.7–98.5), and the diagnostic accuracy was 98% (CI 95%, 96.0–98.5%).

A surgical- or CT-guided bone biopsy was performed in only four out of 169 patients with at least a focal skeletal lesion (2.4%); in three of them, the biopsy confirmed a disease involvement, while one patient underwent a CT-guided needle biopsy of a rib, which did not reveal a lymphomatous localization. This patient was considered a stage III according to the results of PET/CT.

The results of the staging considering the PET/CT scan alone or the PET/CT and BMB are represented in Table 3. With the PET/CT scan alone, 55 patients (5.1%) were in stage I, while 532 (49%), 258 (23.7%), and 240 (22.1%) resulted in stages II, III, and IV, respectively. The addition of BMB to PET/CT led to the upstaging of nine patients (0.8%). In particular, none of the patients in stage I with PET/CT alone was upstaged, while one out of 532 patients in stage II (0.1%; CI 95%, <0.0001–0.0117)) and eight out of 258 in stage III (3.1%; CI 95%, 0.0099–0.0521) with PET/CT alone were upstaged to stage IV with the addition of BMB.

Table 3 PET/CT versus PET/CT and bone marrow biopsy staging and risk assessment

In the 1059 patients assessed according to the GHSG risk score, none of the 184 patients in the early group and only one out of the 291 patients in the intermediate group (3.4%; CI 95%, <0.0001–0.0212) shifted to the advanced group after the incorporation of BMB to PET/CT.

Discussion

In the last years, new imaging techniques have become available in routine clinical practice, greatly modifying the diagnostic approach of many hematological diseases. PET/CT is currently employed in the baseline assessment of cHL as well as in the treatment response evaluation [13, 14] and for prognostic assessment as an interim scan [15].

With the widespread availability of high-quality PET/CT scanners, which co-registrate PET and multislice CT scans in a single whole-body examination, the possibility to detect extranodal sites of disease has become realistic, thanks to the high sensitivity of PET/CT in the detection of bone marrow involvement in cHL, as reported in many series [17,18,19,20,21]. BMB is the classical procedure to detect bone marrow involvement in lymphomas, but it is an invasive, possibly painful procedure, which samples only a limited part of bone marrow. In the past years, efforts were done in order to develop scores constituted of laboratory and clinical data to evaluate the probability of marrow involvement in cHL [5], but they never entered the routine daily practice. Recently, considering the very low positivity rate of BMB in patients with cHL [9] and the astonishing results of PET/CT, the role of BMB as a mandatory staging procedure has been debated. Several authors reported their data in relatively small cohorts of patients [21,22,23,24,25,26,27]; the largest population was described by El-Galaly et al. [21] who reported data on 454 newly diagnosed patients with cHL. With a negative predictive value of focal skeletal lesions of 99% and no patient upstaged with the addition of BMB, the authors considered that omission of BMB would have not affected the treatment choice [21]. This issue was further evaluated in a meta-analysis of nine studies performed by Adams et al., which included 955 patients from 9 different studies. Pooled sensitivity and specificity of PET/CT ranged were 96.9 and 99.7%, respectively, and even with considering the moderate quality of the studies included, the evidence was that PET/CT may replace BMB in the staging of cHL [28].

We present in this paper the results of a large retrospective, multi-center study including consecutive cHL patients referring to centers of the Fondazione Italiana Linfomi. To our knowledge, this represents the largest cohort of consecutive patients analyzed with the aim to address these issues. In our large population, a histological confirmation of disease involvement trough BMB was observed in 55 out of 1085 patients (5.1%), while 169 patients (15.5%) featured at least one focal skeletal lesion at PET/CT.

A diffuse, homogeneous marrow 18F-FDG uptake was observed in 62 out of 1085 patients (5.7%), a similar proportion when compared to other populations [21]. Nine of these 62 patients had a positive BMB, a different finding compared to El-Galaly’s data, which otherwise showed an absence of BMB positivity in patients with a diffuse marrow 18F-FDG uptake. However, these nine patients could already be considered as stage IV after PET/CT alone because of other extranodal 18-F FDG uptake sites at PET/CT.

The sensitivity and specificity of focal skeletal lesions at PET/CT for identification of positive versus negative BMB were 61.8 and 86.9%, respectively. Most importantly, despite the low PPV of focal skeletal lesions (20.1%), we found a very high NPV (97.7%), confirming that the absence of focal skeletal lesions at PET/CT could be roughly associated to a negative BMB. These results are even more meaningful considering that a thorough bioptic evaluation of extranodal and skeletal sites of 18F-FDG uptake is unfeasible, either for technical and ethical reasons and to avoid a treatment delay. The lower sensitivity showed in our study compared to El-Galaly’s data [21] could possibly be explained with the different reference level used to determine the disease involvement of bone lesions, which were considered positive if showed a focal 18F-FDG uptake above liver 18F-FDG uptake, while in the El-Galaly’s study, the mediastinal blood pool was used as reference. We chose a more selective reference in order to minimize the number of false positives; considering the lower sensitivity, the NPV was particularly high, and we could then consider the liver reference as a suitable reference to evaluate bone lesions.

A striking finding in our study, however, is than even if the addition of BMB to PET/CT led to an upstage in nine patients, only one patient (0.1%) had a change in the risk group allocation and treatment choice, while the remaining eight patients could have been already considered as an advanced stage with PET/CT only.

Undoubtedly, the major limits of our study are represented by the lack of a central review of the PET/CT scans and the absence of a standardization method for the evaluation of PET/CT scans, which were performed according to local procedures. Despite their importance, these limitations give a “real-life” perspective to our results and could be applied to the majority of patients treated outside a clinical trial.

Notwithstanding a partial level of evidence, recent recommendations indicate that a BMB is no longer required for the routine evaluation of patients with cHL undergoing a PET/CT at baseline [33], as well as the latest ESMO guidelines, which suggested with a grade IIIB level of evidence that BMB could be omitted if patients are staged with PET/CT [34].

Our data strengthen the aforementioned evidence on safely omitting BMB as a staging procedure in the initial assessment of cHL patients when undergoing a PET/CT scan. However, BMB could provide information not only about disease involvement but also on marrow cellularity and the presence of dysplastic features; these data could help the treating physicians to evaluate the hematopoietic reserve in a more accurate fashion, which may be useful in elderly patients and in those patients who present with cytopenias or abnormalities of blood smear. Although BMB does not add information about the actual stage, we therefore suggest evaluating clinical and laboratory data for single patients before definitely discarding BMB (or a marrow aspiration, which could also give information about hematopoietic lineages) from the diagnostic work-up.