The incidence of renal cell carcinoma accounts for 3% to 5% of malignant tumours in adults and is second only to prostate and bladder cancers among male urological malignancies.1 The incidence of renal cell carcinoma increases every year in most regions.2 Surgery is the treatment of choice for limited renal cell carcinoma, and the current surgical approach, for limited renal cell carcinoma are partial nephrectomy (PN) and radical nephrectomy (RN).3 To predict the complexity of PN and the possibility of complications, surgeons use RENAL, PADUA, and C-index nephrometry scoring systems, as well as other renal measurement scoring systems to quantify the possibility of related renal tumors and complications.4 The Renal Tumour Surgery Scoring System is based on the deconvolutional characteristics of renal tumours and consists of Radius (tumor size as maximal diameter), Exophytic/endophytic properties of the tumor, Nearness of tumor deepest portion to the collecting system or sinus, Anterior/posterior descriptor, and the Location relative to the polar line.5,6 It is now generally accepted that PN better preserves renal function, reduces the risk of renal insufficiency and associated cardiovascular events, and has an OS benefit in some populations.7,8,9 Partial nephrectomy is indicated for patients with renal tumours at stage T1, located on the surface of the kidney, and amenable to surgical manipulation.10 However, the efficacy of PN for the treatment of complex renal masses (RENAL score ≥ 7) remains controversial. Although several recent studies have compared the perioperative and oncological outcomes of PN and RN for the treatment of complex renal tumours, most of them have been limited to a single medical centre.

Therefore, we conducted a systematic review and meta-analysis of articles using the RENAL score to assess perioperative and oncological outcomes and to assess whether anatomical complexity of renal tumours affects perioperative and oncological outcomes.

Methods

Literature Search

We performed a systematic review and cumulative meta-analysis of the primary outcomes of interest according to the PRISMA criteria, following AMSTAR guidelines for quality assessment.11 This systematic review has been registered with PROSPERO.

The literature search and screening process is carried out independently by two researchers, and if there is a disagreement and no agreement can be reached, a third examiner participates in the decision. Four databases were searched: Embase, PubMed, Cochrane Library, and Web of Science. The search period was from the creation of each database to November 2023. Search terms included: ("complex renal tumours" or complex) and ("partial nephrectomy" or PN) and ("radical nephrectomy" or RN), and synonyms of these terms. Hand-searching reference lists of relevant studies broadened the scope of the search.

Eligibility Criteria

Reports were included in our systematic review if they met the inclusion criteria: (1) study subject diagnosed with renal carcinoma and undergoing PN or RN; (2) comparison was made according to RENAL score; and (3) contain at least one outcome, including operation time (OT), length of stay (LOS), estimated blood loss (EBL), complications, overall survival (OS), cancer-specific survival (CSS), relapse-free survival (RFS). Exclusion criteria included: (1) inability to extract applicable data; (2) research in the form of editorials, meetings, expert opinions; (3) the overlapping study population reported the same results; (4) the study subject is nonhuman; and (5) not grouped according to surgical approach.

Data Extraction

Two independent reviewers independently selected articles for inclusion and extracted the data according to a preestablished data collection form. Extracted data included: author, year of publication, sample size, age, document type, surgical approach, operative time, length of hospital stay, estimated blood loss, complications, transfusions, OS, CSS and RFS.

Study Quality Assessment

Retrospective studies were assessed using the Newcastle-Ottawa (NOS) scale.12 The NOS scores range from 0 to 9; more than 6 was considered high quality.

Risk of Bias Assessment

The risk of bias assessment in the included studies was conducted by the same two authors independently. The ROBINS-I tool was used to assess the risk of bias in nonrandomized studies. The ROBINS-I tool includes seven domains: confounding bias, selection bias, intervention measurement classification bias, bias due to deviations from the intended intervention, bias due to missing data, outcome measurement bias, and reported outcome selection bias.13 Because fewer than ten studies were included, we did not perform a risk of bias assessment.

Data Analysis

We used Stata version 16.0 tool for data analysis. We use log RR (relative risk), SMD, and the variance as the summary outcome measure from all trials in the meta-analysis. For each trial, HR (hazard ratio) with the 95% confidence intervals (CI) of the survival rate was derived and calculated using either the fixed effects model or the random-effects model.14 Statistical significance was defined as P < 0.05. Chi-square and q-test were used to verify the heterogeneity among the included studies, such as I2 > 50% or P < 0.10, indicating significant heterogeneity between studies, and a random-effects model was selected.

Results

Description of Study

The authors searched 251 records from four databases and manually searched ten records from the reference lists of relevant studies. Ninety-five duplicate articles were eliminated using document management software; 149 articles were excluded from reading titles and abstracts; 17 studies were included in careful reading, excluding three studies with no outcomes of interest, three systematic reviews, one meta-analysis, and four incomplete data. A total of 261 studies were included, and six studies were finally included for meta-analysis.15,16,17,18,19,20 The screening process is shown in Fig. 1, and the baseline characteristics of the included studies are shown in Table 1. Six publications were published from 2016 to 2023, including 2602 patients. In addition, the sample size was 118–969. Six of the studies were retrospective.

Fig. 1
figure 1

Selection process

Table 1 Baseline data for studies included in the meta-analysis

Quality Assessment

The quality of the cohort studies was evaluated by using the modified Newcastle-Ottawa Scale, NOS score was 6 to 8 points in Table 2. 21 Six studies of high quality were included, all with a score of 6 or more in Table 2.

Table 2. Quality score of included studies based on the NOS scale

Operation Time

Four studies reported OT. Because of high heterogeneity (I2 = 89.4%, =0.000), we used a random-effects model. The pooled meta-analysis demonstrated significant difference between RN and PN (SMD = −0.59, 95% CI [−1.14, −0.04], P < 0.05; Fig. 2).

Fig. 2
figure 2

Forest plot and meta-analysis of OT between RN and PN scores

Length of Stay

Four studies reported LOS. Because of high heterogeneity (I2 = 98.2%, P = 0.000), we used a random-effects model. The pooled meta-analysis demonstrated no significant difference between RN and PN (SMD = −0.61, 95% CI [−1.60, 0.39], P > 0.05; Fig. 3).

Fig. 3
figure 3

Forest plot and meta-analysis of LOS between RN and PN scores

Estimated Blood Loss

Three studies reported EBL. Because of high heterogeneity (I2 = 73.1%, P = 0.024), we used a random-effects model. The pooled meta-analysis demonstrated significant difference between the RN and PN (SMD = −0.51, 95% CI [−0.90, −0.12], P < 0.05) (Fig. 4).

Fig. 4
figure 4

Forest plot and meta-analysis of EBL between RN and PN scores

Complications

Four studies reported complications. Because of high heterogeneity (I2 = 80.7%, P = 0.001), we used a random-effects model. The pooled meta-analysis demonstrated no significant difference between the RN and PN (RR = 0.72, 95% CI [0.34, 1.50], P > 0.05) (Fig. 5).

Fig. 5
figure 5

Forest plot and meta-analysis of complications between RN and PN scores

Four studies reported major complications (Clavien- Dindo score ≥3). Because of low heterogeneity (I2 = 0.0%, P = 0.728), we used a fixed-effects model. The pooled meta-analysis demonstrated significant difference between the RN and PN (RR = 0.46, 95% CI [0.26, 0.80], P < 0.05) (Fig. 6).

Fig. 6
figure 6

Forest plot and meta-analysis of major complications between RN and PN scores

Blood Transfusion Needs

Four studies reported blood transfusion needs. Because of low heterogeneity (I2 = 13.8%, P = 0.323), we used a fixed-effects model. The pooled meta-analysis demonstrated no significant difference between the RN and PN (RR = 0.84, 95% CI [0.43, 1.64], P > 0.05) (Fig. 7).

Fig. 7
figure 7

Forest plot and meta-analysis of blood transfusion need between RN and PN scores

Decreased Glomerular Filtration Rate

Three studies reported decreased glomerular filtration rate (GFR). Because of high heterogeneity (I2 = 77.9%, P = 0.024), we used a random-effects model. The pooled meta-analysis demonstrated significant difference between the RN and PN (SMD = 0.73, 95% CI [0.44, 1.01], P < 0.05; Fig. 8).

Fig. 8
figure 8

Forest plot and meta-analysis of decreased GFR between RN and PN scores

Overall Survival

Four studies reported OS. Because of low heterogeneity (I2 = 0.0%, P = 0.550), we used a fixed-effects model. The pooled meta-analysis demonstrated significant difference between the PN and RN (HR = 1.97, 95% CI [1.38, 2.81], P < 0.05; Fig. 9).

Fig. 9
figure 9

Forest plot and meta-analysis of OS between PN and RN scores

Relapse-Free Survival

Two studies reported RFS. Because of high heterogeneity (I2 = 76.6%, P = 0.039), we used a random-effects model. The pooled meta-analysis demonstrated no significant difference between the PN and RN (HR = 1.91, 95% CI [0.72, 5.05], P > 0.05; Fig. 10).

Fig. 10
figure 10

Forest plot and meta-analysis of RFS between PN and RN scores

Cancer-Specific Survival

Three studies reported CSS. Because of low heterogeneity (I2 = 0.0%, P = 0.392), we used a fixed-effects model. The pooled meta-analysis demonstrated significant difference between the PN and RN (HR = 1.86, 95% CI [1.11, 3.14], P < 0.05; Fig. 11).

Fig. 11
figure 11

Forest plot and meta-analysis of CSS between PN and RN scores

Sensitivity Analysis

We used sensitivity analyses to track sources ofheterogeneity for each outcome measure. The results showed a stable source of OT, LOS, EBL, complications, blood transfusion needs, GFR, OS, RFS, CSS, and heterogeneity.

Discussion

Renal cell carcinoma (RCC) is the sixth most common cancer in men and the tenth in women, accounting for 5% and 3% of all cancer diagnoses, respectively.22 The incidence of RCC is increasing.22,23

Treatment of renal tumours should ensure satisfactory perioperative, functional and oncological outcomes. Previous studies have explored different surgical approaches to PN for the treatment of complex renal tumours,24,25 but comparing PN and RN in the treatment of complex renal masses remains controversial. There were no significant differences in complications, length of hospital stay, and blood transfusion. In terms of oncological outcomes, the PN group had longer OS, CSS, and no significant difference in RFS.

Four studies reported OT. The heterogeneity test showed high heterogeneity between studies (P < 0.10). This difference may be due to differences in medical level and surgeon’s proficiency in surgery in different regions. The pooled meta-analysis demonstrated significant difference between RN and PN. This reflects the fact that PN is a more complex procedure than RN, because it includes all the steps of RN in addition to tumour resection and renal reconstruction.

Four studies reported LOS. The heterogeneity test showed high heterogeneity between studies (P = 0.000). This difference may be due to differences in medical level in different regions. The pooled meta-analysis demonstrated no significant difference between RN and PN. Although PN had a higher EBL than RN, the difference between the groups was less than 100 ml and therefore unlikely to be clinically significant. This can also be supported by transfusion requirements.

Three studies reported EBL. The heterogeneity test showed high heterogeneity between studies (P = 0.024). The pooled meta-analysis demonstrated significant difference between the RN and PN. It is not hard to understand that more OT and more surgical steps lead to an increase in EBL.

According to the Clavien-Dindo score, there were no significant differences in complications, while there was significant difference in high-grade complications. PN is a more complex procedure than RN and the potential risks may be greater for larger and more complex masses, thus requiring more extensive parenchymal resection and reconstruction. The reason for the difference between overall and major complications may be uncounted minor complications (Clavien-Dindo score). In Zhang et al.17 study, minor complications were, between the two groups, not significantly different.

Four studies reported blood transfusion needs. The heterogeneity test showed low heterogeneity between studies (P = 0.323). The pooled meta-analysis demonstrated no significant difference between the RN and PN (P > 0.05). PN increased EBL but did not increase the patient’s blood transfusion needs.

Three studies reported decreased GFR. The heterogeneity test showed high heterogeneity between studies (P = 0.004). The pooled meta-analysis demonstrated significant difference between the RN and PN (P < 0.05). 4 studies reported OS. The heterogeneity test showed high heterogeneity between studies (P = 0.004). The pooled meta-analysis demonstrated significant difference between the RN and PN (P < 0.05). Two studies reported RFS. The heterogeneity test showed high heterogeneity between studies (P = 0.039). The pooled meta-analysis demonstrated no significant difference between the RN and PN (P > 0.05). Three studies reported CSS. The heterogeneity test showed high heterogeneity between studies (P = 0.0392). The pooled meta-analysis demonstrated significant difference between the RN and PN (P < 0.05). PN preserves renal function better than RN, leading to a lower incidence of CKD, and decreased renal function is associated with increased severe serious cardiovascular disease, OS, and CSS. This was confirmed in a study by Weight et al.26

The standardized nephrometry scoring system (R.E.N.A.L. Nephrometry Score) is used to quantify the anatomical characteristics of renal masses on computerized tomography/magnetic resonance imaging.

Our research is not without limitations. First, the source of publications is limited, and we have not been able to retrieve unpublished research, so it may inevitably introduce publication bias. Second, because of the small number of studies available, we cannot predict outcomes based on different scoring stages. Third, because there are not many studies, and most of the included studies have a small sample size, the reliability of this finding needs to be further confirmed.

Conclusions

For complex renal tumours, PN requires more operative time and has a higher chance of complications in the short term. However, in long-term follow-up, PN has a small decrease in renal function with longer OS and CSS.