Introduction

Patients with localised renal cell carcinoma (RCC) receiving curative surgery, either radical nephrectomy (RN) or partial nephrectomy (PN), have been shown in contemporary studies to develop recurrence within 5 years in 20–30% of cases [1,2,3,4,5,6,7]. For these patients, a potential cure or prolonged survival is dependent on the possibility of resection of the local recurrence or metastases [8, 9]. Post-operative follow-up (FU) imaging, therefore, plays a crucial role in detecting recurrent or metastatic disease. To date, no superior strategy has been found to address optimal post-operative surveillance of these patients. Major guidelines such as the European Association of Urology (EAU), American Urological Association (AUA) and National Comprehensive Cancer Network (NCCN) use risk-stratified approaches for their FU recommendations and have been compared in several publications [10,11,12]. These guidelines all have 5 year FU recommendations, but differ in number and type of imaging procedures, and none take non-RCC-related factors such as patient age and performance status into account [13,14,15]. Furthermore, the 5-year FU time frames in each guideline have been based on the cumulative incidence of recurrence shown in previous studies [16,17,18]. In this review, we focus on the oncological aspects of FU and discuss different known FU protocols and their limitations.

Methods

A non-systematic literature search was conducted using Medline. The reference lists of selected manuscripts were checked manually for eligible articles. Furthermore, major guidelines (EAU, AUA, NCCN) were reviewed and assessed.

Results

Prediction of recurrence

For predicting risk of RCC recurrence, several contemporary risk stratification models based on both tumour node metastasis (TNM) classification and histopathological factors have been developed. Capogrosso et al. summarised the available models and analysed their utility for follow-up [19]. The University of California-Los Angeles Integrated Staging System (UISS) incorporates TNM stage, Fuhrman Grade, and Eastern Cooperative Oncology Group (ECOG) status to stratify patients according to their probability of RCC recurrence and survival [7]. Kattan et al. (Memorial Sloan-Kettering Cancer Centre; MSKCC) have developed a 5-year recurrence-free survival predictive nomogram based on symptoms at presentation, histologic subtype, tumour size and pT stage [20]. According to Cindolo et al. who compared both the Kattan and UISS model together with two other preoperative models in 2404 patients, the Kattan model was the most accurate risk assessment tool [21]. The Kattan model has later been modified with microvascular invasion as an additional parameter, but stratifies patients with clear cell RCC only [22]. Finally, the Leibovich score developed at the Mayo Clinic predicts 1-, 3-, 5-, 7- and 10-year risk of recurrence in patients treated for localised RCC [23]. It is based on TNM stage, Fuhrman Grade and presence of tumour necrosis albeit only aimed at clear cell RCC subtypes. A variant of the Leibovich risk score is the tumour stage, size, grade and necrosis (SSIGN score) estimating cancer-specific survival rather than the risk of recurrence. In addition, the Sorbelini nomogram has been developed making use of similar risk factors. Nevertheless, the Kattan nomogram, UISS score and Leibovich risk model are most frequently used. In addition, these three models have been externally validated having a concordance index (C-index) of 74–84% [24].

Follow-up protocols

Prior to combining both TNM and histopathological factors in predictive models, some FU protocols based on TNM alone have previously been proposed although these studies have a low level of evidence [25, 26]. More contemporary recurrence risk models, such the UISS, MSKCC and Leibovich score, have led to newer risk-stratified FU protocols.

Lam et al. used the UISS model to stratify 559 patients surgically treated for localised RCC into four risk groups and suggest a FU protocol based on time to recurrence (TTR) and recurrence sites [7]. They suggested low-risk patients to be followed for 5 years with annual chest computed tomography (CT) and abdominal CT at 2 and 4 years after primary surgery. This was based on low-risk patients having recurrence most frequently in the lungs with one-third of these within the first year, while abdominal recurrence developed later (median TTR 32 months). For intermediate-risk patients with a higher recurrence rate (32%), especially within the first year, recurrences in the lung and abdomen were found in 74.4 and 58.1%, respectively. Therefore, chest CT every 6 months for 3 years and annually thereafter was proposed together with abdominal CT at 1 year after surgery and every other year thereafter. For high-risk patients, recurrence rates were the highest, recurrences occurred frequently within 6 months and were more aggressive. As such, chest CT every 6 months for the first 3 years and annually thereafter was suggested with the option of alternating CT and plain chest X-ray after 3 years. For high-risk patients, abdominal CT was recommended every 6 months, the first 2 years and annually thereafter. Finally, a forth risk group consisting of patients with positive lymph nodes at primary surgery (i.e. TanyN1-2M0) was defined. These patients presented with more aggressive and early recurrence lead the authors to suggest a vigorous surveillance with both chest and abdominal CT at 3, 6, 12, 18, and 24 months followed by annual CTs thereafter.

Also, based on the UISS, a long-term FU protocol by Antonelli et al. in 2007 is taking into account the risk of late recurrence [27]. In the study, 814 patients were stratified based on UISS into three risk groups: low, intermediate and high risk and followed for a mean time of 6.3 (range 0.33–24.6) years. Based on recurrence sites and TTR, they concluded a more simplified FU imaging protocol for low, intermediate and high-risk patients. For low-risk patients, recommendation was chest CT every 30 months and annual abdominal CT the first 5 years followed by chest and abdominal CT every 30 months until the tenth year and only abdominal CT every 5 years thereafter. For intermediate-risk patients, the recommendation was chest and abdominal CT every 6 months the first 5 years, chest CT annually and abdominal CT every 30 months the following 5 years and finally abdominal CT every 5 years thereafter. For high-risk patients, chest and abdominal CT was suggested every 6 months for the first 5 years, annually the following 5 years and only abdominal CT thereafter.

Siddiqui et al. proposed tailored FU protocols for clear cell, papillary and chromophobe RCC subtypes based on 2339 patients treated for localised RCC at the Mayo Clinic [28]. Using a Cox multiple regression model, they identified independent pathological features (surgical margin, tumour stage and size, lymph node status, Fuhrman grade and tumour necrosis) determining recurrence risk in the abdomen and thorax, respectively. Based on the severity of these features, patients were scored and stratified into low, intermediate and high-risk groups. For clear cell RCC, if receiving 0 points in the thoracic score algorithm, a minimal risk subgroup was available. For papillary and chromophobe RCC, only tumour stage and grade stood out as independent prognostic factors. Finally for each subtype, a tailored and detailed imaging surveillance algorithm for chest and/or abdominal imaging was recommended depending on patient’s risk score. Imaging frequency was between 3 and 18 month intervals for the first 3 years depending on risk score and subtype.

More recently, a publication by Stewart-Merrill et al. highlighted a novel approach to FU by recognising age and co-morbidity, in a competing risk analysis model, as important co-factors in determining length and necessity of FU [29]. In their retrospective analysis, 2511 patients with localised RCC treated with curative intent were stratified based on TNM stage (T1Nx-0M0, T2Nx-0M0, T3/4Nx-0M0 and TanyN1M0), age (< 50, 50–59, 60–69, 70–79, ≥ 80 years), Charlson co-morbidity index (CCI ≤ 1 or ≥ 2) and site of recurrence (chest, abdomen, bone, other). A Weibull distribution model, which contrary to Cox’s proportional hazard regression model, allows for both estimation of hazard ratio and relative increase or decrease of risk of event over time, was used to determine risk of recurrence but also risk of non-RCC death. Recurrence rates for each TNM stage and site were in accordance with previous published data [1]. They concluded that when age and co-morbidity are accounted for in determining length of FU, risk of non-RCC death exceeds that of risk for RCC recurrence at different time points, suggesting shorter durations of surveillance the older and the higher CCI the patient has. Based on the above, the authors suggest a novel approach for individualising duration of FU and they argue that for some patients (e.g. T2N0 patients with CCI ≥ 2 and over 80 years old) the risk of non-RCC death exceeds risk of recurrence as early as 30 days after primary surgery making any FU redundant.

Follow-up protocols endorsed by major guidelines

The FU protocols of the EAU, AUA and NCCN are been summarised in Table 1 [10]. The EAU guideline FU protocol recommends risk stratification of patients based on prognostication scoring tools such as the UISS to stratify patients into low, intermediate and high-risk score groups. The NCCN and AUA FU recommendations are based solely on TNM stage. The most recent and comprehensive assessment of these guidelines has been done by Lobo et al. [11]. Using a simulation model, they evaluated cost, radiation exposure and recurrence detection accuracy of each guideline protocol and concluded that for local recurrence and distant metastases, when adhering to AUA or NCCN recommendations on FU, 59 and 63% of low-risk patients’ local and distant recurrence, respectively, would be missed due to FU duration less than 3 years. In the EAU guidelines, adherence to protocol would render 95% detection of low-risk recurrences. For high-risk patients, the detection rate for all types of recurrence was > 92% for all three guideline FU recommendations.

Table 1 Imaging follow-up protocols endorsed by different guidelines

Duration of FU is discussed within each guideline, and a minimum of at least 5-year duration based on recurrence risks is recommended [17, 25]. Although most recurrences occur within this time frame, several studies have shown late RCC recurrences [18, 30, 31]. Stewart et al. have assessed the detection of recurrence based on either the AUA or NCCN recommendations and found that discharging patients from surveillance after 5 years or earlier may result in a rate of up to 33% of non-detected recurrences depending on the risk profile [32].

As another example, Kim et al. showed that between 5 and 15% of patients had late recurrence detected 10 years after the initial 5-year FU [18]. It has been indicated that these patients may have a less aggressive tumour and more likely are asymptomatic [33]. Therefore, detection of these patients is still perhaps warranted as some studies indicate that metastasectomy of late recurrences may prolong the overall survival or even provide cure [8].

The optimal duration of follow-up remains to be elucidated. As Stewart-Merrill et al. point out in their Weibull model, there is a strong need to not only address FU based on TNM and histopathological parameters but also address patient age and co-morbidities [29]. Indeed, life expectancy should play a major role in determining the frequency and duration of a FU protocols, and also is a determinant in reducing health care costs in countries where health services are an integral part of the state system. In addition, certain patient populations among low-risk groups may have an extremely low likelihood of recurrence. Further investigation is warranted, comparing guideline recommendations to a more evidence-based approach with age and co-morbidity standardised protocols.

Choice of follow-up imaging modality

Regarding type of imaging performed, chest and abdominal CT is today’s gold standard, especially for thoracic imaging as cross-sectional topography allows for more detailed and earlier detection of metastases compared to conventional X-ray [34]. Ultrasound (US) and magnetic resonance imaging (MRI) are also recommended in all three guidelines, but play an auxiliary role to reduce radiation or are used when CT with intravenous contrast is not suitable. Brain and bone metastases are symptomatic in most cases, and FU imaging should be tailored if symptoms arise but is not recommended on a routine basis [6]. Fluorodeoxyglucose positron emission tomography (FDG-PET) combined with CT is showing some promise in the detection of RCC recurrence as both spatial and functional aspects of tumours are visualised. However, in a recent meta-analysis of FDG-PET for detection of RCC malignancy, the pooled sensitivity and specificity were 79 and 90%, respectively [35]. FDG-PET has been shown to have limitations in detecting lesions smaller than two centimetres but has a higher potential for detecting more aggressive tumours [36, 37]. Novel approaches with immuno-PET are showing promise with the girentuximab chimeric antibody to carbonic anhydrase IX in detecting localised clear cell RCC, occult regional lymph node metastases and distant metastases [38,39,40].

EAU guidelines advise cross-sectional imaging (CT or MRI) as the imaging modality of choice for follow-up of RCC and discourage the use of conventional X-ray. Beisland et al. [41] recently analysed the pattern of follow-up imaging in 1612 patients and explored if more frequent use of cross-sectional imaging improves survival. The patients were from the RECUR database setup to develop comparators between pattern of recurrence and type of FU applied for the EAU RCC guidelines [42]. A particular feature of the protocol used was to distinguish recurrences according to how they were managed. Potentially, curable recurrences were defined as single-, local- or oligo-recurrences at a single site amenable to local therapies such as metastasectomy or radiation, whereas probably incurable recurrences were multiple and at multiple sites treated by systemic therapy or observation. Of the 17,333 FU imaging procedures performed, 7953 were CT (46%), 6540 (38%) were conventional X-rays, 183 (1%) were MRI and 2651 (15%) were US. In general, cross-sectional imaging was more frequently used in high-risk patients. However, there was no significant difference in the overall survival between patients with potentially curable or probably incurable recurrences based on type of imaging with which the recurrence was detected.

Impact of follow-up on patient outcomes

The rationale for surveillance of patients curatively treated for localised RCC is to detect recurrence within a window of opportunity in which surgery, radiotherapy or ablative techniques are possible either to cure or prolong the patient’s life. Current studies, which have been reviewed above, have not yet shown that stratification into risk groups and follow-up based on risk-adapted protocols help us find more potentially curable patients or prolong survival. A major reason for this lack of data is the fact that most studies used detection of metastasis or recurrence as their final objective to investigate the quality of the respective follow-up protocols. However, without investigating if further management or intervention after detection changes the natural course of the disease this question cannot be answered. To this effect, the EAU RCC guideline panel has set up the RECUR protocol to retrospectively capture data and develop comparators to guide future FU recommendations. A first analysis confirms that FU should be tailored to risk of recurrence and age [42]. Another way of addressing this issue would be to stipulate evidence-based FU protocols that could then be compared to each other in terms of overall- or cancer-specific survival or delay of systemic therapy in a randomised controlled trial (RCT) setting. Alternatively, the link between optimal FU protocol and association of any potential survival benefit could be addressed by cohort studies which allow comparisons of patterns and time of recurrence stratified by risk groups in patients with different imaging frequencies, recurrence burden and intended recurrence treatment.

Although cross-sectional imaging improves recurrence detection compared to conventional X-ray, there may be a subgroup of patients who would benefit from avoiding CT imaging during their FU. Age, kidney function, RCC risk group and co-morbidities perhaps render cross-sectional imaging redundant in a subgroup of patients. Without data supporting a change of the natural course of the disease by cross-sectional imaging, we cannot rule out that better resolution imaging simply leads to earlier diagnosis without changing prognosis and survival. Also for patients with risk of late recurrence, perhaps a post 5-year FU period clinical appraisal (including physical examination and laboratory blood tests) instead of perfunctory annual imaging is more suitable to avoid excessive radiation exposure and costs. Retrospective comparative (between risk groups, age groups, co-morbidity levels etc.) analysis of prognostic risk factors associated with benefit of avoiding cross-sectional imaging may be performed alternatively; the same could be setup in a RCT setting where physician discretion (control) could be compared to a risk-factor-based algorithm where patient is randomised either to cross-sectional imaging, conventional imaging or to clinical follow-up. In addition, patients’ preference for attributes of FU schedules and quality of life can be explored in the trials.