Introduction

HIV infection causes acquired immunodeficiency syndrome (AIDS) via defective cell-mediated immunity secondary to progressive depletion of CD4+ T lymphocytes, immune dysregulation and sustained viral production [1]. Several malignancies are increased in patients with HIV infection, including non-Hodgkin lymphoma, which is an AIDs-defining cancer as well as Hodgkin lymphoma [2]. There has been a rising incidence of AIDS-related cancers, possibly because antiretroviral therapy (ART) prolongs patient survival and causes dysfunctional cell-mediated immunity [3, 4].

18F-FDG PET/CT can demonstrate abnormal foci of increased FDG accumulation in HIV-infected individuals with suspected malignancy or fever of unknown origin (FUO) by localizing to malignant or inflammatory cells such as neutrophils and macrophages [59]. Activated lymphocytes exhibit increased glucose utilization and HIV-infected individuals have a greater accumulation of FDG in their lymph nodes than patients without HIV infection [6, 10]. The amount of HIV RNA in the peripheral blood is correlated with the amount of viral replication in lymph nodes [11]. 18F-FDG PET/CT is also valuable in the management of HIV-infected patients with lymphoma [6, 12]. In lymphadenopathy due to lymphoma the nodes are often enlarged with intense FDG uptake in comparison to those in reactive lymphadenopathy without malignancy [13], although these entities can overlap in appearance. There are no rigorously defined quantitative PET methods to differentiate these entities.

Differentiating lymphoma from reactive adenopathy in HIV-infected patients with 18F-FDG PET/CT can be challenging, especially in the setting of high plasma HIV RNA in which FDG-avid nodes commonly occur. In this retrospective study we investigated the diagnostic accuracy of quantitative FDG PET/CT metrics and compared their performance with those of qualitative PET metrics and CT. Our overall goal was to better differentiate between nonmalignant reactive adenopathy and lymphoma in HIV-infected patients.

Material and methods

Study population

The study was HIPAA-compliant and received institutional review board approval with a waiver of written informed consent for a retrospective study. Patients who had undergone a whole-body 18F-FDG PET/CT study between 2006 and 2010 were identified from the Radiology Information Systems and the local HIV clinic database. We identified 19 patients (47 ± 11.7 years) with untreated biopsy-proven HIV-associated lymphoma (16 large B-cell lymphoma, 3 Hodgkin lymphoma) who had undergone PET/CT prior to chemotherapy (7 in whom the primary diagnosis of lymphoma was made following PET/CT without a significant delay between PET/CT and subsequent biopsy). We also identified a suitable control group of 22 patients (50 ± 9.7 years) with HIV without any known underlying malignancy who had undergone PET/CT with identical protocols over the same period comprising 12 patients for evaluation of solitary pulmonary nodules (SPN) and 10 patients with FUO (Table 1). Prescan plasma viral load (single determinant), CD4+ count, pathology, culture and results of follow-up imaging were obtained. None of the SPN group had malignancy on follow-up.

Table 1 Characteristics of HIV-infected patients

18F-FDG PET/CT study and image analysis

18F-FDG PET/CT studies were performed using either a Discovery VCT (RX) system in 3-D or a Discovery LS PET/CT scanner in 2-D (both GE Healthcare, Waukesha, WI) using standard protocols with a 60-min target uptake period [14, 15]. CT scans were used for attenuation correction of the PET emission data and lesion size measurements.

All 41 PET/CT studies were retrospectively analyzed qualitatively and semiquantitatively in a blinded manner by a single independent investigator with CT and PET training, with verification by a second trained reader. The pattern and intensity of FDG accumulation were also assessed in bilateral cervical, axillary, mediastinal, hilar, periaortic, iliac and inguinal nodal regions, and in extranodal sites of lymphoma. Qualitative visual glycolytic scores were determined for PET/CT for each nodal region using a five-point scale from low to high uptake (0–4): 0 no nodal activity, 1 activity < descending aortic blood pool, 2 activity similar to blood pool, 3 activity ≤ liver but > blood pool, 4 activity > liver uptake. The highest single qualitative visual nodal score in the whole body and the summed qualitative score of all the nodal groups were determined for each patient. The utility of a visual symmetry score for differentiating lymphoma from reactive adenopathy in HIV-infected patients was also evaluated. A simple binary score was used to assess the symmetry of nodal FDG uptake (0 asymmetry, 1 symmetry) based on the rotating maximum intensity projection (MIP) images. FDG uptake in patients without apparent adenopathy but extranodal sites of lymphoma was also scored as asymmetrical.

The maximum standardized uptake value at lean body mass (SUL-Max) was recorded manually for the hottest nodes in each of the nodal regions using an AW workstation (GE Healthcare) and a summed nodal (and/or extranodal) SUL-Max score was determined for each patient. The single hottest nodal SUL-Max value in the whole body was also recorded. The summed CT nodal size from all the described nodal regions on the noncontrast low-dose CT component was calculated measuring the maximum short axis diameter of the node. The highest SUL-Max score was also determined for the nasopharyngeal region as described by Liu [16]. A semiautomated software tool (Auto-PERCIST™: A PET Image Volume Viewer with Integrated PERCIST-based Computer Assisted Analysis [software]. Baltimore, MD: The Johns Hopkins University; 2013) was used to obtain the total lesion glycolysis (TLG), metabolic tumor volume (MTV), SUL-Max and SUL-Peak of the single hottest lesion, and sum SUL-Peak of nodes and extranodal lesions as defined in PERCIST 1.0 [17]. Disease was defined as abnormal metabolic uptake above a set baseline threshold of 1.5 × liver mean + 2 standard deviations (SD) [17]. Liver background was determined according to PERCIST 1.0. Areas of normal physiological activity such as the heart, brain, kidneys and bladder were manually excluded from the analysis.

Subgroup analysis of the patients with FUO against those with lymphoma was also performed to determine whether FDG PET performs similarly in the febrile and the nonfebrile HIV population.

Statistical analysis

The continuous variables in HIV-infected patients with lymphoma and those with reactive adenopathy were compared using the Wilcoxon test. Correlations between viral load and the sum SUL-Max, sum SUL-Peak, TLG, MTV, and CT nodal size were determined with Spearman’s correlation coefficients (r). Univariate analysis was performed for each imaging marker using logistic regression to assess its performance in differentiating lymphoma from reactive adenopathy. Multivariate logistic regression analysis was used to identify a subset of relevant PET parameters that could potentially be used in routine practice for differentiating lymphoma and reactive adenopathy in HIV-infected patients by applying a stepwise backward elimination based on the Akaike information criterion (AIC). Receiver operating characteristic (ROC) analysis using univariate logistic models was used to quantify the discriminatory ability of TLG, SUL-Max, SUL-Peak, MTV and CT nodal size metrics for differentiating lymphoma from reactive adenopathy. The cut-off values and corresponding sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were determined. DeLong’s method was applied to compare the areas under the curve (AUCs) among the different metrics.

We further evaluated the performance of the quantitative metrics stratified by the presence of viremia. Our definition of viremia was based on a single prescan HIV plasma RNA measurement of >200 copies/mL as per the latest US Department of Health and Human Services guidelines [18, 19].

Kappa statistics were used to evaluate the interreader agreements for visual symmetry, and 95 % confidence intervals were calculated by assuming a normal distribution of the kappa coefficient with variance [20]. Univariate analysis was performed to assess the diagnostic performance of the symmetry score. Spearman’s rank correlation was used to evaluate the association between the symmetry score and quantitative computer-aided detection analysis. All tests are two-sided and p values <0.05 were considered to indicate statistical significance. The statistical software R (version 2.14.1) was used for the analysis.

Results

The single hottest SUL-Max, summed nodal SUL-Max, semiautomated single hottest SUL-Peak, summed SUL-Peak, TLG and MTV were significantly higher in patients with lymphoma than in those with reactive adenopathy (Fig. 1, Table 2), and CT nodal size trended towards a difference (p = 0.06). There was no significant difference in the viral load or CD4 counts between the lymphoma and the nonlymphoma groups (p = 0.41 and 0.51, respectively; Table 1).

Fig. 1
figure 1

Values of metabolic imaging markers in relation to lymphoma status in all patients. PET metrics and CT nodal size were significantly higher in HIV patients with lymphoma than in those with reactive adenopathy. The bottom and top of the box represent the 25 % and 75 % quartiles; the whiskers indicate the range of data when excluding outliers (defined as 1.5 times the interquartile range, i.e. difference between 75 % and 25 % quartiles)

Table 2 Qualitative and quantitative FDG PET/CT metrics (values are means ± SD unless stated otherwise)

Univariate logistic models for determining the accuracies of each described quantitative metabolic metric and CT nodal size for differentiating lymphoma from reactive adenopathy revealed AUCs of 0.97 for the single hottest SUL-Max, 0.905 for sum SUL-Max, 0.96 for the single hottest SUL-Peak, 0.94 for sum SUL-Peak, 0.964 for TLG, 0.96 for MTV, and 0.67 for CT nodal size (Fig. 2). The PET metrics performed significantly better than CT nodal size (p < 0.001). A TLG cut-off of 173 yielded a sensitivity of 89 % and specificity of 100 % for differentiating lymphoma from reactive adenopathy. A summed SUL-Peak cut-off of 23.8 yielded a sensitivity and specificity of 84 % and 95 %, whereas a summed SUL-Max cut-off of 28.4 yielded a sensitivity of 84 % and specificity of 82 % (Table 3). In the multivariate analysis the AUC for the single SUL-Max and sum CT nodal size parameters combined was 0.974. This was not significantly higher than the individual AUCs from the univariate analysis for the single SUL-Max, single SUL-Peak, TLG or MTV, and therefore did not provide significant improvement in the ability to differentiate lymphoma from reactive adenopathy in HIV-infected patients.

Fig. 2
figure 2

ROC curves of the imaging markers in all patients. Sensitivity and specificity at each threshold value were computed on the basis that patients whose marker value was larger than the threshold were classified as having lymphoma. The AUC of CT nodal size is significantly smaller than those of the other markers (p < 0.05). This indicates that FDG PET metrics are highly accurate in differentiating lymphoma from reactive adenopathy in HIV-infected patients

Table 3 FDG PET/CT metrics and cut-off points for predicting lymphoma in HIV-infected patients

The differences in qualitative nodal visual intensity scores between HIV patients with lymphoma and those with reactive adenopathy were not statistically significant (Table 2).

Stratification of metabolic metrics by viremia (HIV plasma RNA >200 copies/mL) revealed 21 aviremic patients, 10 with lymphoma and 11 without lymphoma. In the aviremic patients, sum SUL-Max, sum SUL-Peak, TLG, MTV, single hottest SUL-Max and SUL-Peak performed best with AUCs of 1, with CT nodal size having an AUC of 0.905. In viremic patients with plasma HIV RNA levels >200 copies/mL, single SUL-Max and SUL-Peak of the hottest lesion and TLG performed best with AUCs of 0.93, 0.91 and 0.86, respectively; MTV (AUC 0.84), sum SUL-Peak (0.80), sum SUL-Max (0.74), and CT nodal size (AUC 0.463, p = 0.95) performed less well (Fig. 3).

Fig. 3
figure 3

ROC curves for viremic patients (plasma HIV RNA >200 copies/mL). PET metrics performed better than CT in differentiating lymphoma from reactive adenopathy, but less well than in aviremic patients (plasma HIV RNA <200 copies/mL) with AUCs of 1 for all the PET metrics and 0.91 for CT nodal size

In the HIV control group, a positive correlation was found between viral load and the following variables: summed and single hottest nodal SUL-Max (r = 0.58 and 0.56, p = 0.005 and 0.007, respectively), sum and single hottest SUL-Peak (r = 0.68 and 0.50, p = 0.001 and 0.02, respectively), sum TLG (r = 0.59, p = 0.004), CT nodal size (r = 0.63, p = 0.019), and MTV (r = 0.63, p = 0.002). However, in the subgroup of HIV-infected patients with lymphoma, none of the imaging parameters showed a significant correlation with viral load (data not shown).

Visual qualitative analysis of the pattern of lymph node involvement using the MIP qualitative symmetry score (where 0 indicates asymmetry and 1 indicates symmetry) was performed (Table 4). Reader 1 scored 86.4 % of the patients without lymphoma as score 1, and 5.3 % of the patients with lymphoma as score 1, giving a sensitivity of 94.7 %, a specificity of 86.4 %, a PPV of 85.7 %, and a NPV of 95 % for predicting lymphoma. Reader 2 scored 90.9 % of the patients without lymphoma as score 1, and 10.5 % of the patients with lymphoma as score 1, giving a sensitivity of 89.5 %, a specificity of 90.9 %, a PPV of 89.5 % and a NPV of 90.9 % for predicting lymphoma. The two readers had almost perfect agreement using the qualitative MIP symmetry score (kappa coefficient 0.90, 95 % CI 0.77 – 1.00) [20]. Overall, asymmetrical FDG uptake had an accuracy of 90.4 % for differentiating lymphoma from reactive adenopathy in HIV-infected patients for both readers.

Table 4 MIP symmetry scores in relation to disease status for readers 1 and 2

HIV patients with lymphoma showed asymmetrical uptake, significantly higher PET metrics, as well as the presence of abdominal lymph node involvement and other extranodal sites of disease. This extensive disease was not seen in HIV patients with reactive adenopathy (Figs. 4 and 5). The mean splenic metabolic uptake was higher in patients with lymphoma than in those with reactive adenopathy (SUL-Max 5.25 ± 6.17 vs. 2.41 ± 0.81, p = 0.04).

Fig. 4
figure 4

Whole-body FDG PET/CT MIP images demonstrating symmetrical and asymmetrical uptake. a Relatively symmetrical bilateral cervical, axillary and mediastinal adenopathy in a patient with reactive adenopathy. b Asymmetrical FDG uptake greater in the right cervical, left axillary and retroperitoneal nodes, with splenic and marrow involvement of the spine and left femur in a patient with biopsy-proven HIV-associated lymphoma. Assessment of nodal symmetry aids in differentiating lymphoma from reactive adenopathy in patients with HIV infection

Fig. 5
figure 5

Whole-body FDG PET/CT images in HIV-infected patients, one (a–d) with reactive adenopathy, a high level of viremia and increased nodal metabolic activity and the other (e–g) with a low level of viremia and low nodal metabolic activity. a–d The MIP image (a) and the fused axial PET/CT images (bd) show symmetrical bilateral axillary (blue arrows), mediastinal (yellow arrows) and inguinal adenopathy (red arrows). The sum SUL-Max was 27.1, and the plasma HIV RNA was 98,203 copies/mL. e–g The MIP image (e) and the fused axial PET/CT images (f, g) show mildly FDG avid bilateral axillary (blue arrows) and inguinal (red arrows) nodes. The sum SUL-Max was 9.6, and plasma HIV RNA was 64 copies/mL. The level of viremia correlated with the intensity of nodal metabolic uptake in HIV-infected patients with reactive adenopathy

The separate analysis comparing HIV patients without lymphoma (10 patients) and those with lymphoma showed no significant difference in CT nodal size (p = 0.36). ROC analysis for differentiating lymphoma and reactive adenopathy revealed AUCs of 0.95 for the single hottest SUL-Max, 0.88 for sum SUL-Max, 0.94 for the single hottest SUL-Peak, 0.91 for sum SUL-Peak, 0.95 for TLG, 0.94 for MTV, and 0.61 for CT nodal size. Again, all the PET metrics performed significantly better than CT nodal size (p < =0.001).

There was no significant difference in the mean SUL-Max scores of the nasopharyngeal region between patients with lymphoma the those with reactive adenopathy (p = 0.54, Table 2). There was no correlation with viral load and the SUL-Max of nasopharyngeal lesions in either patient group (r = 0.011, p = 0.96, and r = 0.04, p = 0.86, respectively). When stratified by viremia status, FDG uptake in the nasopharyngeal region differentiated poorly between lymphoma and reactive adenopathy, with an AUC of 0.46 in aviremic patients and 0.41 in viremic patients.

Discussion

Differentiating reactive adenopathy from malignant nodes due to lymphoma in HIV-infected patients is challenging as both conditions can be associated with increased FDG uptake. In this retrospective study we evaluated several quantitative and qualitative methods which could aid in the differentiation of lymphoma and reactive adenopathy in HIV-infected patients. Our major findings were that quantitative metabolic PET metrics performed better than CT nodal size in differentiating lymphoma from reactive adenopathy. Qualitative assessments of symmetry of uptake were also more robust than CT nodal size assessments. Nodal and extranodal qualitative and quantitative PET metrics were higher and CT nodal size larger in HIV-infected patients with lymphoma than in those with reactive adenopathy, but CT nodal size was similar between the patient groups. The MIP symmetry score showed an asymmetrical pattern of FDG uptake in lymphoma in contrast to the pattern in reactive adenopathy. SUL increased with increasing viral load in HIV-infected patients with reactive adenopathy, but quantitative FDG PET metrics were high in HIV-infected patients with lymphoma irrespective of the viral load. Our data are consistent with previous studies that have shown increased maximum standardized uptake value (SUV-Max) in HIV-infected patients with lymphoma [21, 22]. We evaluated the performance of quantitative semiautomated PET/CT data by including SUL-Peak and the volumetric TLG and MTV metrics.

All the quantitative PET metrics performed well in differentiating lymphoma from reactive adenopathy in HIV-infected patients particularly in aviremic patients, and were correlated with the serum viral load in patients with reactive adenopathy. Multivariate analysis did not show a significant improvement in the ability of the combined quantitative PET metrics to differentiate lymphoma from reactive adenopathy in HIV-infected patients when compared with each individual PET metric. Thus each of the individual quantitative PET metrics appears to perform well enough to be used alone. Previous studies have shown a good correlation between the net lymphoid uptake and plasma viremia in chronic and acute infection in asymptomatic HIV-infected patients with long-term nonprogression and stable viremia [23].

Our study found that TLG and SUL metrics were high in HIV-infected patients with lymphoma irrespective of viral load. HIV-infected patients without viremia or those on ART with suppressed viral load typically demonstrate minimal or absent FDG lymph nodal uptake as opposed to patients with viremia [24]. Studies have shown a positive linear relationship between viral load and SUV of the whole-body lymphoid tissue in HIV-infected patients. A more variable relationship is seen between viral loads when regions of interest are manually drawn to obtain total lymph node SUV [25].

While quantitative FDG PET was more accurate in differentiating lymphoma from reactive lymphadenopathy in HIV-infected patients, the diagnostic accuracy appeared greater in aviremic patients. . We also found that splenic SUL was higher in lymphoma than in reactive adenopathy. Only occasionally was mildly increased splenic uptake seen with high viremia in HIV-infected patients with reactive adenopathy. This is similar to the findings of previous studies [26]. Controlling viremia prior to PET imaging by reimaging after initiation of ART when viremia is suppressed to <200 copies/mL may potentially aid interpretation of indeterminate findings in patients in whom the pattern of FDG uptake is not helpful. Practically, targeted biopsy of a hypermetabolic area would likely be performed in patients in whom malignancy is strongly suspected, rather than delay the diagnosis of lymphoma while ART therapy is being performed.

Goshen et al. similarly showed a high sensitivity of PET/CT in diagnosing lymphoma in seven HIV-infected patients with known non-Hodgkin lymphoma and one patient with suspected lymphoma and palpable lymphadenopathy, although their results showed a high false-positive rate (25 %) [27]. They suggested that in patients with normal-sized nodes with discordant CT findings, correlation with CD4 and viral load may support a benign diagnosis and avoid further investigations. This view is supported by our data. However, the false-positive rate in our study was much lower, which may have been due to accounting for both the pattern and quantitative extent of metabolic uptake. Our data concur with the findings of previous studies showing that the degree of FDG uptake is related to viral load in reactive adenopathy [3, 28], and we demonstrated a significant correlation between viral load and SUL uptake in HIV-infected patients with reactive adenopathy. This differs from the findings of Brust et al. [25], which may have been due to differences in the patient populations, levels of viremia and SUL-Max threshold. We also found lower metabolic uptake in aviremic HIV-infected patients with reactive adenopathy [25].

Qualitative scoring is a typical method used for scoring PET images. Using our scoring system, the qualitatively hottest node did not allow differentiation of lymphoma from reactive adenopathy. Quantitative PET metrics performed significantly better than qualitative visual scores in our population, although qualitative symmetry assessment was valuable.

A possible potential limitation of our study was that our population of HIV-infected patients with lymphoma had relatively advanced disease (stage III or IV disease in 16 of 19 patients). Most lymphomas in HIV-infected patients are typically high-grade large-cell, immunoblastic type with extranodal involvement common. Our patients were likely quite representative of the HIV-positive NHL patient population. Another potential limitation of our study was that we used a low-dose CT protocol without contrast enhancement which may have been of suboptimal diagnostic quality for analyzing CT nodal size. This may have biased the study results in favor of PET.

It is encouraging that the PERCIST-derived SUL-Peak uptake and TLG were quite accurate for diagnostic purposes, but they did not perform any better than the simple SUL-Max value. Due to stability of liver SUL, we used a TLG threshold based on the mean liver SUL [29]. TLG and SUL-Peak performed comparably in our patients (Tables 2 and 3). Another potential limitation of our study was the retrospective design. However, our patients were undergoing clinical PET/CT studies in relevant clinical situations. Patients with reactive adenopathy were considered to possibly have lymphoma or infection in their differential diagnosis and showed similar CT nodal size compared to the patients with lymphoma. A prospective study using the quantitative techniques described to validate their clinical significance in a larger patient population is warranted.

Conclusion

Quantitative PET metabolic metrics as well as the qualitative assessment of symmetry of nodal uptake appear to be valuable tools for differentiating lymphoma from reactive adenopathy in HIV-infected patients using FDG PET. These parameters appear more robust in aviremic patients.