Introduction

Immunotherapy has recently emerged as an important advance in cancer treatment. To date, the main immunotherapeutic approach that has been translated into survival benefit and is currently used in practice is the immune checkpoint inhibitors (ICIs). Broadly, the two most effective classes of agent are directed, alone or in combination, towards cytotoxic T lymphocyte-associated protein 4 (CTLA-4) or the programmed cell death protein 1 (PD1) or the PD1/programmed cell death protein ligand 1 (PD1/PD-L1) axis, which are negative regulators of T cell immune function. The CTLA-4 inhibitor, ipilimumab, has been shown to improve survival rates in melanoma patients [1]. PD1/PD-L1 inhibitors (of which the first validated agents were pembrolizumab and nivolumab) have been shown to improve survival rates among patients with various tumour types [2, 3].

Response assessment with PET/CT during immunotherapy can be rather challenging, since 18F-FDG avid inflammatory reactions can occur during the treatment and may be associated with pseudoprogression, leading to misinterpretation of 18F-FDG PET images [4,5,6]. The PET community has been very active in the last decade, coming up with various response criteria including PERCIMT (PET Response Evaluation Criteria for Immunotherapy). PECRIT (PET/CT Criteria for Early Prediction of Response to Immune Checkpoint Inhibitor Therapy), iPERCIST or PERCIST (PET Response Criteria in Solid Tumours)-adapted criteria such as imPERCIST5 [7].

On top of these PET response criteria created or modified to meet the challenges raised by immunotherapy,, several groups have reported baseline prognostic factors of response such as the metabolic active tumour volume (MATV) [8] and signs of immune activation [9]. However, while inter-observer variability in MATV delineation is known [10] and percentage change in SUV used to discriminate between responders and non-responders when using PERCIST or PERCIST-adapted criteria are based on test-retest studies [11], little data is available on immune activation signs [12]. The first sign of immune activity to be checked is spleen enlargement and/or increased uptake leading to an inversion of the spleen to liver ratio (SLR) [13, 14]. Some groups also proposed other signs such as the bone marrow to liver ratio (BLR) [15] and uptake in the ileocecal valve [16].

The biological counterparts of PET signs of immune activation are absolute lymphocyte count (ALC) and neutrophils to lymphocytes ratio (NLR), the latter having been shown to be a prognosticator in melanoma patients [17]. Also correlated to a worst survival is serum lactate dehydrogenase (LDH), a biochemical marker of tumour load [18].

The main objective of the present study was to seek the most clinically useful signs of immune activation on 18F-FDG PET images to predict the prognostic of melanoma patients’ ongoing ICI therapy. Ideally, these signs should be highly reproducible between observers, simple and fast to compute and should be independent factors of response, especially compared with widely available, cost-effective and established biological parameters.

Materials and methods

Study design

We conducted a retrospective monocentric observational study in melanoma patients aged over 18 and who were referred for a 18F-FDG PET/CT for therapy monitoring of immunotherapy. After searching in our database, out of 63 patients screened, 36 patients met the criteria and were included. Inclusion criteria were: (i) immunotherapy treatment; (ii) availability of baseline 18F-FDG PET/CT, followed by an early monitoring PET/CT scan (iPET) after 1 month of treatment and follow-up studies at 3rd (M3PET) and 6th month (M6PET). Seven patients were excluded based on the following criteria: (i) baseline whole-body metabolic active tumour volume (WB-MATV) or TLG (WB-TLG) at 0 and (ii) missing data. PET/CT acquisitions dated from 11/08/2017 to 04/12/2019. Institutional review board approval was not required in accordance with European regulation. French observational studies without any additional therapy or monitoring procedure do not need the approval of an ethics committee. Nonetheless, in accordance with the European General Data Protection Regulation, we sought approval to collect data for this work from the national committee for data privacy, with the registration no. 2081250 v 0.

18F-FDG PET/CT protocol

Patient’s preparation in the PET unit and PET acquisition and reconstructions were performed as per the European Association of Nuclear Medicine (EANM) guidelines for PET tumour imaging [19, 20], our PET unit being EANM research Ltd (EARL) accredited since 2015. 18F-FDG was injected after glucose level had been checked to be < 200 mg/dL in patients who had been fasting for at least 4 h. Patients were tentatively scanned 60 min after tracer injection. Patients were scanned from the base of the skull to mid-thigh, except for patients with primary melanoma of the lower limb, for whom a whole-body acquisition was performed. Two different PET/CT scanners were used: a Vereos system (Philips Medical Systems. Cleveland OH) and a Biograph TrueV with extended field of view (Siemens Medical Solutions). On the Biograph system, the injected dose was 4.0 MBq/kg with time per bed position of 2 min and 40 s for body mass index (BMI) < 25 kg/m2 and 3 min and 40 s and for patients with BMI ≥ 25 kg/m2. To meet the EARL harmonizing standards, PET/CT raw data were reconstructed with a 3D-OSEM (ordered subsets expectation-maximization) reconstruction algorithm with point spread function (PSF) modeling (3 iterations and 21 subsets) and a 6.3-mm post-reconstruction Gaussian filter, using a 128 × 128 matrix size. Concerning the Vereos system, the injected dose was 3.0 MBq/kg with a time per bed position of 2 min whatever the patient’s weight. To fulfill the EARL accreditation, PET/CT raw data were reconstructed with a 3D-OSEM reconstruction algorithm with PSF modeling (2 iterations and 10 subsets) with a 5-mm post-reconstruction Gaussian filter, using 288 × 288 matrix size.

FDG PET/CT analysis

PET readers were blinded to the results of any other imaging tests, clinical outcomes, or biological data. The analysis was carried out using LIFEx, an open source software [21]. All quantitative analyses were made using EARL-compliant PET images [20].

Fourteen PET variables involving the spleen, bone marrow and the ileocecal valve were reviewed by two PET readers as potential immune activation signs, at baseline, iPET and M3PET, for reproducibility analysis purposes. Other biomarkers (PET response and WB-MATV/TLG) were assessed by a single nuclear medicine physician. Absolute values were considered for the baseline PET and relative values expressed as percentage of variation (%var) for iPET and M3PET.

Evaluation of immune activation and immune-related side effects

First, spleen immune activation signs were analysed as follows:

  • The maximum standardized uptake values (SUVmax), the SUVmean, from a volume of interest (VOI) of 20 mm of diameter placed on the central part of the spleen.

  • Three SUVpeak values were recorded in anterior, median and posterior part of the spleen.

  • The spleen length was recorded as recommended as per the American Roentgen Ray Society on the CT [22] part of the PET/CT scan, using the greatest splenic measurement in the cranio-caudal axis from a coronal view.

  • SLR was calculated from a central VOI of 20 mm of diameter on spleen and a 30 mm VOI on the right liver lobe using SUVmax and SUVmean values, referred to as SLRmax and SLRmean, respectively [15].

  • The metabolic volume (spleen MV) and total lesion glycolysis (spleen TLG) from the whole spleen PET volume from interpolated regions of interest (ROI), using the 2D pencil tool around the spleen from LIFEx Software [21].

Secondly, other potential lymphocyte activation signs were recorded from the whole ileocecal valve using an adaptive VOI [16] and the bone marrow using SUVmax and SUVmean. Osteomedullary activation was measured from a 20 mm VOI located on the right iliac bone. Then, BLR were calculated and referred to as BLRmax and BLRmean still using a 30 mm VOI on right liver lobe.

Finally, all potential signs of immune-related adverse effects (irAEs) were recorded based on existing literature [13, 23,24,25]. Corticosteroid use for more than 2 months was also recorded.

Evaluation of MATV and TLG

The MATV protocol from LIFEx was used [26]. Each lesion was completely by-passed by an adapted VOI by hand; then, a threshold of 41% of SUVmax was applied as recommended by the EANM for solid tumours [19]. Thus, each MATV and TLG (TLG = SUVmean × MATV) per lesion was summed to obtain WB-MATV and WB-TLG. Any sign of potential immune infiltrate organs was excluded. Figure 1 displays the methodology for MATV computation and immune organ evaluation.

Fig. 1
figure 1

Methodology for measurement of MATV and uptake in immune organs. 18F-FDG-PET/CT monitoring in a 50-year-old woman treated for a metastatic melanoma under Nivolumab a MIP imaging and b PET transverse planes showing the spleen metabolic volume (blue) and the whole body MATV (green). c First line from the left to the right. The spleen MV and TLG recording process is shown in the PET transverse plane, beginning by drawing the spleen contours at different levels followed by interpolation to obtain the spleen MV. Second line from the left to the right showing the 3 three spleen SUVpeak VOIs (blue) then the 2 VOIs (purple) used to compute the SLRmax and SLRmean. Finally, the CT in the coronal plane to record the spleen length (red arrow) d

Evaluation of PET response

Tumour response was assessed at 3 and 6 months. Recent immunotherapy-modified PERCIST using five lesions, imPERCIST5, was used [7].

Briefly, the recommendations of PERCIST5 were followed: up to five hottest target lesions were recorded to be summed (maximum of two per organ). The appearance of new lesions alone did not result in progressive metabolic disease (PMD); only an increase of the sum of SULpeak by 30% was considered as PMD, with or without new lesions. Response to immunotherapy was classified as complete metabolic response (CMR), partial metabolic response (PMR), stable metabolic disease (SMD) or PMD. The responders concerned the patients classified as CMR or PMR and non-responders as SMD and PMD, as follows.

  • CMR: complete resolution of 18F-FDG uptake in the tumour volume, with tumour uptake lower than liver SUL and background blood pool, and disappearance of all lesions if multiple.

  • PMR: at least 30% reduction in tumour uptake.

  • SMD: less than 30% increase, or less than 30 decrease in tumour uptake

  • PMD: greater than 30% increase in tumour uptake.

Biological biomarkers

The values of LDH (normal value 100–190 UI/L)), reactive protein C (CRP, normal value < 10 mg/L), white blood count (WBC, normal value 4000–10,000/mm3), absolute neutrophils count (ANC, normal value 1500–7000/mm3) and ALC (normal value 1500–4000/mm3) were recorded. Each value comes from the biological assessment done systematically in approved laboratories 48 to 72 h before immunotherapy. Methods may differ depending on the laboratory.

The NLR and the derived neutrophils to lymphocytes ratio (dNLR) were calculated as follows:

$$ NLR=\raisebox{1ex}{$ ANC$}\!\left/ \!\raisebox{-1ex}{$ ALC$}\right. $$
$$ dNLR=\raisebox{1ex}{$ ANC$}\!\left/ \!\raisebox{-1ex}{$\left( WBC- ALC\right)$}\right. $$

Statistical analysis

Quantitative variables are presented as mean (SD).

To assess the inter-observer reproducibility and compare all PET quantitative measurements made by the two observers, intraclass correlation coefficients, Bland-Altman analyses as well as non-parametric Wilcoxon tests were used.

For the identification of PET and biological variables of interest to determine the 1-year OS status of patients, a univariable analysis by means of non-parametric Mann-Whitney test was used to compare quantitative variables of patients alive at 1 year (1 year OS 0 patients) and patients dead at 1 year (1 year OS 1 patients). For qualitative variable, Fischer exact tests were used. The landmarked specific overall survival (OS) was defined as the time from the first immunotherapy cycle to death from melanoma. The correlations between PET and biological variables of interest were studied using Spearman correlations. Afterward, ROC analyses were used to determine their optimal cut-off to discriminate between 1 year OS 0 and 1 year OS 1 patients based on the optimisation of the Youden index and 1 year OS analyses were performed using univariable Kaplan-Meier survival analyses with log-rank test to compare survival curves.

The evolution biological and PET variables of interest during immunotherapy from baseline to M3PET were assessed using Friedman tests. A post hoc test was performed with the two-stage linear step-up procedure of Benjamini, Krieger and Yekutieli test for multiple comparisons.

To confront the importance of biological PET variables in the prediction of the 1 year OS status, a multivariable random forest prediction model was used incorporating all PET and biological variables previously identified. The importance of each PET variable in the classification process was determined by the measurement of its mean decrease accuracy of class prediction. The method implemented classification and regression trees (CART, n = 100) and bootstrapping aggregating (bagging) method proposed by Breiman [27]. Moreover, a CHAIN algorithm classification tree was performed with a maximum tree depth set to 3 to avoid a too complex tree.

For all statistical tests, a two-tailed P value of less than 0.05 was considered statistically significant. Graphs and statistical analysis were performed on XLSTAT Software (XLSTAT 2017: Data Analysis and Solution for Microsoft Excel, Addinsoft, Paris, France (2017)).

Results

Population characteristics

Twenty-nine patients were included. Baseline patients’ characteristics are displayed in Table 1. Fifteen patients (51.7%) had more than 2 metastatic sites. BRAFV600 mutation was found in 11 patients (38.0%), RAS mutation in 3 patients (10.3%) and NRAS mutation in 4 patients (13.8%). No KRAS mutation was observed. The median follow-up was 16.6 months and the 1 year OS was 65.5 % in the entire population. There were 10 recorded events in the entire population over the first year of follow-up. Only one patient was lost to follow-up at 6.4 months (Table 1).

Table 1 Baseline patients’ characteristics

Comparison and assessment of reproducibility of PET measured by the two observers

ICC values for all explored variables are displayed in Table 2. Strong reproducibilities (ICC > 0.90) were observed for spleen anterior SUVpeak, spleen MV, spleen TLG, spleen length and BLRmean. Focusing on SLR, best performance was observed using mean values with an ICC equal to 0.86 as compared with 0.66 using maximum values. In the same trend, although ileocecal valve SUV ICC performances appeared lower, judged as moderate, better results were observed using SUVmean values. Bland-Altman analyses of these seven PET variables, performed on all time points mixed together, are displayed on Fig. 2. Largest bias and 95%CI were observed for ileocecal valve SUVmean (7.45, 95%CI = 64.81–49.9) and BLRmean (2.59, 95%CI = 25.47–20.29). Moreover, Wilcoxon tests found that anterior spleen SUVpeak values and ileocecal valve SUVmean values were significantly different between observers: Obs1 = 1.9 versus Obs2 = 1.89 (p = 0.02) and Obs1 = 1.53 versus Obs2 = 1.35 (p < 0.001), respectively. Descriptive analysis for others variables can be found in Table 3.

Fig. 2
figure 2

Bland and Altman analysis between two observers for variables displaying the best ICC

Overall, the more robust signs of immune activation were therefore SLRmean, spleen MV and TLG, spleen length and BLRmean.

Identification of PET variables of interest to predict the 1-year OS status

Evaluation of immune activation

Concerning quantitative PET variables, interim %var splenic length, M3%var SLRmean and M3%var ileocecal valve SUVmean were found to be significantly different between 1 year OS 0 and 1 year OS 1 patients. Interim %var splenic length was significantly lower in 1 year OS 1 patients [− 3.1 (14.3) versus 15.9 (31.4); p = 0.01] whereas M3%var SLRmean and M3%var ileocecal valve SUVmean were higher in 1 year OS 1 patients: − 19.1 (24.4) versus 3.4 (14.3); p = 0.03 and 2.38 (26.85) versus 106.91 (159.24); p = 0.03, respectively.

Evaluation of MATV and TLG

Characteristics of all quantitative PET variables as well as Mann-Whitney results between 1 year OS 0 and 1 year OS 1 patients can be found in Table 4. Of note, baseline WB-MATV almost reached significance with a p value equal to 0.06 with and values seemingly higher in 1 year OS 1 patients: 37.23 (55.08) versus 79.9 (78.5).

Correlation with biological biomarkers

The exploration of correlations between PET variables of interest and all biological variables (Fig. 3) found that SLRmean was positively correlated to M3%var LDH (ρ = 0.533, p = 0.005) and negatively correlated to M3%var ALC (ρ = − 0.415, p = 0.035). Of note, there were also significant positive correlations of M3%var SLRmean with M3%var WB-MATV (ρ = 0.382, p = 0.049) and with M3%var BLR (ρ = 0.610, p = 0.001).

Fig. 3
figure 3

Correlation matrix using the non-parametric spearman method. a Among the best ICC, the 5 variables displayed a significative difference after Mann and Whitney tests according to the 1-year OS status. Spearman rho values are displayed in each cell. Those displaying a significative p value are marked as *. b The two significative correlations are shown

Evaluation of PET response

Concerning qualitative variables, Fischer tests showed that 1 year OS 1 patients were mostly patients classified as non-responders on M6PET (p = 0.027) but no significant relationship with imPERCIST M3PET response classification was found (p = 0.098).

Occurrence of irAEs

Overall, 33 patients displayed irAEs (44.8%) mostly on iPET (61.5%). Endocrine and digestive systems were the most affected. Overall, 5 colitis, 4 thyroiditis, 3 gastritis, 3 enthesopathies, 2 hypophysitis and 2 myositis were observed, all time point considered. Five patients had multiple irAEs (≥ 2). There was no treatment interruption and only four patients received immunosuppressants (30.8%) with long-term corticosteroid therapy (n = 3) or cyclosporine (n = 1). No relation between patient’s outcome and i-irAEs, M3-irAEs or long-term use of corticosteroid was found with p value equal to 0.201, 0.398 and 0.128, respectively.

Exploration of established biological variables of interest

Mean (SD) time (days) between biological samples and PET baseline, iPET, M3-PET and M6-PET was 18 (18), 5 (4), 4 (3) and 5 (5), respectively.

Baseline LDH values were higher in 1 year OS 1 patients [272.8 (84.3) versus 226.1 (85.1); p = 0.04] as well as i%var NLR [98.7 (100.9) versus 29.5 (56.5); p = 0.049]. Other biological variables were not significantly different between 1 year OS 0 and 1 year OS 1 patients (Table 3).

The exploration of correlations between biological variables of interest and all PET variables found a single significant positive correlation between baseline LDH and baseline WB MATV (ρ = 0.445, p = 0.01) (Fig. 3).

Figure 4 b shows the evolution of the main biology data. In 1 year OS 1 patients, a significative increase of NLR was found at M6-PET vs baseline PET (p = 0.03).

Fig. 4
figure 4

Evolution of PET (a) and biological (b) variables during treatment with immune checkpoint inhibitors. Results are split according to the survival status at 1 year: 1 year OS as OS0 (survivors) on the left side and OS1 (deceased) on the right side of each graph. Friedman tests were used to compare each variable at different time points

Change in imaging biomarkers across the treatment sequence

The evolution of PET variables according to 1 year OS is displayed in Fig. 4a. In the 1 year OS 1 group, from baseline, SLRmean tended to increase at each time point to achieve a significant difference at M6-PET (p = 0.019). The same trends were observed with spleen SUVpeak anterior and spleen length. In the 1 year-OS 0 group, a significative increase of spleen length was found at iPET, as compared with baseline PET (p = 0.014) and M3-PET (p = 0.0239).

Survival analysis

Prediction of survival status at 12 months

Interim %var spleen length, M3%var SLRmean, M3%var SUVmean ileocecal valve, baseline LDH and i%var NLR ROC analyses were all statistically significant and showed that optimal cut-off values to predict an event were < 5%, > 25%, > 47%, > 196 UI/L and > 35%, respectively. The variable displaying the best performance was i%var spleen length: area under the curve = 0.800, sensitivity = 80.0%, specificity = 84.2%, accuracy = 82.8% (p = 0.001). All ROC analyses results can be found in Table 5. Of note, an extra-ROC analysis was conducted on baseline WB-MATV and was found to be significant (AUC = 0.718, p = 0.026). With an optimal cut-off value > 29 cc, sensitivity was equal to 70.0%, specificity to 73.7% and accuracy to 72.4%.

Kaplan-Meier survival analysis

Univariable Kaplan-Meier survival analyses were run using these previously described cut-off values and found that i%var spleen length, M3%var SLRmean, baseline LDH and i%var NLR were all predictors of 1 year OS. Kaplan-Meier curves for all variables are shown on Fig. 5. The parameter that seemed the most discriminant between 1 year OS 0 and 1 year OS 1 patients was M3%var SLRmean. Patients with high M3%var SLRmean (> 25%) had a 1 year OS of 0% whereas patients with low M3%var SLRmean (≤ 25%) had a 1 year OS of 85.9% (p < 0.0001). Moreover, M6-PET imPERCIST response was also predictive of 1 year OS with poorer prognostic in patients classified as non-responders on M6-PET. Of note, the same analysis conducted on baseline MATV was also significant with poorer 1 year OS observed in patients displaying the highest values (> 29 cc): 41.15% versus 80% (p = 0.018).

Fig. 5
figure 5

Landmarked Kaplan-Meier estimates of 1-year overall survival (1 year OS) according to a 3 PET variables reported as the %var and b 2 biology variables

Random forest prediction model

The multivariable random forest prediction model including i%var spleen length, M3%var SLRmean, M3%var SUVmean ileocecal valve, baseline LDH, i%var NLR and M6-PET response showed that the variable displaying the highest importance expressed as mean decrease accuracy was M3%var SLRmean followed by M3%var SUVmean ileocecal valve. Respective mean decrease accuracies were 5.34, 8.54, 6.71, 4.65, − 1.92 and 2.51. Moreover, a CHAID classification tree including all previously mentioned PET and biological variables of interest found that 96.3% of patients were well-classified when using first M3%var SLRmean > 25% and then ileocecal valve M3%var SUVmean > 47% as criteria of classification. Only one 1 year OS 1 patient was misclassified. The representative case of a well-classified patient can be seen in supplemental Figure 1.

Discussion

ICIs have transformed the management and the prognostic of patients with metastatic melanoma. However, some of them developed resistance to ICIs [2] and the majority is still refractory [28]. Therefore, achieving a gain in survival while also accounting for health economics implications [29] requires biomarkers to predict the outcome of ICI treatment. Amongst these potential biomarkers are PET metrics, which can be split into three categories: PET response criteria adapted to ICIs in order to account for pseudo-progression patterns, MATV and signs of immune activation.

To the best of our knowledge, this study is the first to report inter-observer variability in a wide range of signs of immune activation on 18F-FDG PET. Strong reproducibility was observed for spleen and bone marrow measurements and showed that the SUV mean metric should be preferred. Ileocecal valve SUV performances appeared lower, with ICCs judged as moderate. In addition to harbouring a low inter-observer variability, computation of signs of immune activation has to be user-friendly if one wants to use them for daily routine in busy PET centres. Therefore, despite an excellent ICC, spleen MV and TLG are unlikely to be adopted. Overall, SLR, BLR and spleen length appear the most suitable.

While understanding the baseline host anti-tumour immune response on baseline scan is important, its evolution under treatment has not yet been studied. Our study focused on the longitudinal evolution of several potential biomarkers extracted from 18F-FDG PET/CT imaging, at baseline, after the 2 first injections of ICIs, and after 3 and 6 months of treatment.

The main finding of our study is that M3%var SLRmean was higher in patients who died during the first year of treatment and was the variable displaying the highest importance in a multivariable random forest prediction model including PET and biological variables significantly different in survivors and non-survivors on univariable analysis (i%var spleen length, M3%var SLRmean, M3%var SUVmean ileocecal valve, baseline LDH, i%var NLR and M6-PET response). While the attention of the PET community has been mainly focused on the capability of SLR to predict immune activation [13, 30] (an increased spleen uptake being considered to reflect “unleashed” T lymphocytes with an expected better outcome), several findings of our study show that the detrimental increase in SLR on M3PET may be related to inflammation. The spleen, being the largest lymphoid organ in the human body, is the site of immune cell activation and maturation [31]. The mechanism for high splenic avidity is not well understood but high splenic uptake or high SLR has been observed in patients with infections, granulomatous diseases as well as patients with cancers associated with an inflammatory state such as Hodgkin’s lymphoma [16, 32,33,34,35,36,37]. Increased splenic avidity was also reported in GCSF injections and in the post-chemotherapy period [38]. In our study, SLR correlated well with the evolution of tumour burden WB-MATV at M3 and displayed a negative correlation with ALC at the same time point, known to be correlated with inflammation (Fig. 4).

Immune inflammatory cells can be actively tumour promoting and be associated with tumour aggressiveness and immune suppression, resulting in failure of T cells to protect from cancer [39]. Inflammation promotes tumour through many mechanisms [40]. Currently, it is suggested that myeloid-derived suppressor cells (MDSCs) and maybe other inflammatory processes, which induce expression on MDSCs, could lead to a resistance to ICIs. MDSCs are a heterogeneity population of cells [41] and could explain the increase of SLR. A major mechanism recently discovered is that inflammation induces the production of MDSCs which inhibit both adaptive and innate immunity to suppress T cells, leading to ICI failure [41]. MDSCs accumulated in blood, bone marrow lymph nodes, spleen and tumour sites. Preclinical studies have shown that MDSCs represent up to 20% of splenocytes in cancer disease vs 3% in healthy mice [42].

Also important as a prognosticator was M3%var ileocecal valve SUVmean, with a CHAID classification tree including PET and biological variables of interest demonstrating that 96.3% of patients were well-classified when using first M3%var SLRmean > 25% and then M3%var ileocecal valve SUVmean > 47% as criteria of classification.

In addition to signs of immune activation and conventional or ICI-adapted PET response criteria, the third piece of information assessable on PET imaging is the occurrence of irAEs. While several studies have shown that patients experiencing irAEs may have better survival [43], studies on PET irAEs are scarce [44, 45]. Recently, Iravani et al. showed in melanoma patients receiving a combination of two ICIs that 18F-FDG PET/CT often could detect relevant irAEs which may precede clinical diagnosis [46]. According to our study, most of irAEs appeared early in the course of treatment but were not found to predict 1 year OS.

This warrants validation of %varSLR as a prognosticator with prospective studies or larger series by pooling our EARL-compliant data with those of other EARL-accredited PET centres.

Our study has several limitations. First, its retrospective nature and its relatively small sample size, except for the reproducibility study where 14 variables involving immune organs were studied at 3 time points. Secondly, we did not use time-dependent ROC curves [47]. Thirdly, a random forest method was used on a small amount of data and including multiple variables. Given the risk of over fitting, these results should be considered as a proof of concept requiring prospective evaluation.

Conclusion

Based on its good reproducibility and practicality, SLRmean is recommended as a prognosticator in melanoma patients under immunotherapy: its increase greater than 25% at 3 months, compared with baseline, was associated with poor outcome after ICIs. This study provides a proof of concept concerning this parameter and warrants prospective evaluation.