Introduction

Men undergoing radical prostatectomy (RP) as primary treatment for prostate cancer (PCa) may experience treatment-related sexual dysfunction and urinary incontinence [1, 2]. These functional outcomes may be determined by the quality of surgical care [3].

In the United Kingdom, surgical services in the National Health Service (NHS) have been reorganised, concentrating RP to fewer centres following national guidance requiring that major urological pelvic cancer surgery is carried out in specialist centres performing more than 50 cases per year [4]. Centralisation has gathered pace since further guidance stipulated that robotic-assisted radical prostatectomy (RARP) is concentrated in centres performing at least 150 procedures per year [5].

There is evidence, mainly from the United States, that outcomes are better in hospitals with higher RP volumes [6, 7]. However, the effect of hospital volume is likely to depend on the outcome of interest as well as on the characteristics of the health system that provides the procedure [8].

In this study, we tested the hypothesis that two key measures of outcome post-prostatectomy, notably, long-term urinary incontinence and sexual function reported by the patients themselves at least 1 year after surgery, are better in hospitals with larger volumes, both for RP of any type and for RARP only. We used data from the National Prostate Cancer Audit (NPCA), a population-based study that evaluates the care and outcomes of all men diagnosed with PCa in the NHS in England and Wales [9].

Methods

Study design and participants

All patients who were diagnosed with non-metastatic PCa between 1 April 2014 and 30 September 2016 (the study period) according to the English Cancer Registry and who subsequently underwent RP (OPCS-4 code ‘M61’) were eligible for inclusion in the study. The NPCA patient survey was designed to record their personal outcomes in a structured manner following surgery. Patients were identified using NPCA data, which includes English Cancer Registry data linked to Hospital Episode Statistics (HES) at patient level [10, 11]. The NPCA patient survey methods are described in detail elsewhere [2].

Outcome measures

Men were invited to complete a questionnaire at least 18 months after diagnosis (Appendix 1). Time from surgery to completion of the questionnaires was at least 12 months. The patient-reported outcome (PRO) questionnaire comprised items from EPIC-26, a validated instrument to measure function following radical PCa treatment across five domains including sexual function and urinary incontinence. The validated summary score for each domain ranges from 0–100, with higher scores representing better function [12]. Thresholds for a minimal clinically important difference (MCID) have been estimated for each domain, representing changes considered to be meaningful for patients [13].

The questionnaire also included two adapted EPIC-26 questions: “Overall, how big a problem was your urinary incontinence function or lack of sexual function for you immediately before you were diagnosed with PCa?”

Hospital-level characteristics

Hospital RP volume was derived from the number of patients diagnosed during the study period according to the Cancer Registry data who subsequently underwent a RP according to HES data. For each hospital RP volume was calculated as the average annual number of procedures.

For this study, a ‘hospital’ is defined as an NHS Hospital Trust, the organisational unit that provides secondary care in the English NHS in a local area [14]. Overall, 52 hospitals performing at least 10 RPs in each year of the study period were included. Two hospitals not meeting this minimum number of RPs each year, in total treating 50 patients, were excluded. Hospital volume was modelled in ‘volume groups’ (up to 60 RPs per year, from 61 to 100 RPs, from 101 to 140 RPs, and 141 RPs or more). The volume groups were chosen in order to create, as much as possible, categories that are equal in terms of both the number of hospitals and the number of patients. Hospital volume was also modelled as a continuous variable.

Patient-level characteristics

10,487 men were sent a survey questionnaire and 10,089 men were eligible for inclusion in the final analysis in the study cohort after exclusions (Fig. 1). Questionnaire responses were linked to the NPCA database. Cancer Registry records provided information on age at diagnosis, tumour characteristics according to the TNM classification [15], Gleason biopsy score, and pre-treatment serum prostate-specific antigen. A modified D’Amico risk stratification algorithm [16] categorised each patient’s cancer into low, intermediate or high risk/locally advanced disease.

Fig. 1
figure 1

Patient flow diagram illustrating the cohort selection of men undergoing radical prostatectomy (RP) in the study.

HES records of hospital admissions provided information on each patient’s ethnicity, socioeconomic status (measured in quintiles by the Index of Multiple Deprivation) and number of comorbidities in the year preceding diagnosis according to the RCS Charlson score [17, 18]. Patients who had a code for a robot-assisted procedure (OPCS-4 Y753, Y765) in their HES records were classified as having had a RARP.

Statistical analysis

We used multilevel multivariable linear regression to model EPIC-26 domain scores as a function of hospital volume, included as ‘volume groups’ or as a continuous variable. RP volume was included as a hospital-level characteristic. We modelled the volume-outcome relationship including all men undergoing a RP of any type as well as those undergoing a RARP. The models were adjusted for patient-level characteristics (age, number of comorbidities, ethnicity, cancer risk group, and socioeconomic status) and hospital-level characteristics (radiotherapy centre, university hospital). P values were derived from Wald tests.

When modelling hospital volume as a continuous variable, we tested whether the relationships between hospital volume and the outcomes were linear by adding a quadratic term for hospital volume in the model. Missing patient-reported data to individual questions were handled in accordance with guidelines for EPIC-26 [19]. Multiple imputation accounted for missing values of the patient-level characteristics and the PROs so that regression models included all patients [20]. Missing values were replaced with 30 sets of plausible values and Rubin’s rules [21] were used to obtain estimates and 95% confidence intervals (CI).

All reported p values are two-sided and 0.05 was the significance level. Negative differences represent poorer outcomes compared to the reference group. Data analysis was undertaken using Stata version 15 [22].

Results

Descriptive analysis

The hospital RP volume during the study period varied from 37 to 597 (median of 225.5). About one-fifth of hospitals carried out 60 or fewer procedures annually and one-sixth more than 140 (Table 1). The proportion of men undergoing RARP increased during the study from 54.3% of men diagnosed in 2014 to 71.1% of men diagnosed in 2016. RARP was the surgical modality most frequently performed (72.7% of RPs) in the highest volume group of hospitals (>140 RPs per year) compared with 35.0% of RPs carried out in the lowest volume group (≤60 RPs per year; Table 1). Approximately three-quarters of hospitals in the higher volume groups were university hospitals (78.6% in the 101–140 RPs per year group and 75.0% in the >140 RPs per year group) compared with 45.5% in the lowest volume group (≤60 RPs per year; Table 1).

Table 1 Hospital characteristics stratified by annual hospital RP volume per year.

Of the 10,089 men in the study cohort, 7,702 (76.3%) responded to the questionnaire. All men underwent RP less than 6 months after diagnosis. Overall, 5529 of the men who responded (71.9%) had a RARP. On average, responders were older, more frequently of white ethnicity, had fewer comorbidities and had a more affluent socioeconomic status compared to non-responders (Appendix 2).

There were only small differences in the characteristics of the responders across volume groups (Table 2). Men in the highest volume group tended to be younger, were less often of white ethnicity and more often had locally advanced disease compared with the lowest volume group. We did not find differences between volume groups in the proportion of men who indicated that immediately before the time of diagnosis they had a big problem with their urinary function (6.2%, 6.6%, 7.0%, 6.5% with increasing volume) or their lack of sexual functional (7.8%, 9.5%, 8.7%, 9.3%).

Table 2 Patient and tumour characteristics by hospital volume of radical prostatectomies per year.

Outcomes

The differences in EPIC-26 urinary continence scores between the four volume groups were small (70.4, 69.5, 71.6, and 72.6 with increasing volume) and none were statistically significant with adjustment for differences in patient-level and hospital-level characteristics (p = 0.08; Table 3). When modelling hospital volume as a continuous variable, we found no evidence of a volume-outcome relationship. For each increase in hospital volume of 100 procedures, there was a non-significant increase of 0.70 (95% CI: −0.41 to 1.80; p = 0.22) in the urinary continence score. Adding hospital volume as a quadratic term did not improve the fit of the model significantly.

Table 3 Relationship between EPIC-26 domain scores (urinary incontinence and sexual function) and hospital volume of radical prostatectomies per year.

The differences in EPIC-26 sexual function scores between the four volume groups (18.7, 24.2, 24.1 and 26.6 with increasing volume) were slightly bigger than the corresponding differences in urinary continence scores but they did not reach statistical significance with adjustment for differences in patient-level characteristics (p = 0.20; Table 3). However, when modelling hospital volume as a continuous variable we found that each increase in hospital volume of 100 procedures was associated with an increase of 1.54 (95% CI: 0.62–2.45; p = 0.001) in the sexual function score. We did not find that adding hospital volume as a quadratic term improved model fit significantly.

The same pattern of results was observed when we included only men who underwent RARP (Table 3). There were no significant differences in the volume groups either in urinary continence (p = 0.12) or sexual function scores (p = 0.17). When modelling hospital volume of RARPs as a continuous variable we did not find evidence of a statistically significant increase of urinary continence scores (0.99 for each 100-procedure increase in hospital volume, 95% CI: −0.18 to 2.17; p = 0.10). However, we did find evidence of an increased sexual function score with higher hospital volumes (1.10 for each 100-procedure increase in hospital volume, 95% CI: 0.07–2.12; p = 0.04). Again, adding hospital volume as a quadratic term did not lead to significant improvements of fit of the models.

Discussion

Main findings

To our knowledge, this is the first study to explore the relationship between hospital RP volume and patient-reported urinary continence and sexual function at least 12 months after surgery. We did not find significant differences in the EPIC-26 domain scores between the four defined volume groups for these long-term outcomes. However, when hospital volume was modelled as a continuous variable, there was some evidence that the sexual function score increased with higher hospital volumes. The increase in sexual function (a 1.5 increase in sexual function score for a 100-procedure increase in hospital volume) is unlikely to be clinically significant given that the threshold for a MCID is 10–12 points [13].

These results need to be interpreted in the context of the ongoing process of centralisation of RP services, in the English NHS since 2002, that has gained further impetus since the introduction of RARP [4, 5]. Four in five English NHS hospitals carried out more than 60 RPs per year during the study period and these hospitals carried out more than 90% of all RPs.

Relationship to previous research

A systematic review of the volume-outcome relationship for RP, mainly including studies carried out in the United States, concluded that there is consistent evidence of an association between hospital volume and short-term outcomes (surgical complications, blood loss and length of stay) [6]. An assessment of in-hospital outcomes after all RPs performed in Germany between 2006 and 2013 (221,331 procedures) reported that hospital volume is the most important factor for improved in-hospital outcomes (mortality, blood transfusion and length of stay) [23]. A recent, large database study of over 100,000 patients also reported a volume-outcome relationship between hospital RARP volume and short-term outcomes (perioperative complications and oncological outcomes) [7]. However, the evidence on associations with longer-term functional outcomes is less clear. Our results address this important evidence gap with respect to long-term urinary continence and sexual function.

Some studies carried out in high-volume centres reported better PROs after RP than population-based studies [24, 25]. However, our results do not support the explanation that the superior functional outcomes seen in these high-volume centres can be explained merely by the fact that they have a higher than average volume of procedures. Other quality-related factors, for example specific quality assurance programmes or differences in patient selection or referral are more likely to be evident in expert centres, which may explain the differences in functional outcomes [26].

We found similar results when analysing the volume-outcome relationship in all men and in men who had RARP. This is in line with the emerging evidence that RARP is likely to have better short-term outcomes but similar long-term outcomes compared to other RP modalities [2, 27, 28].

Strengths and limitations

A key strength of our study is that we report outcomes for a recent cohort of patients from all English hospitals that provide RP. Men were identified on the basis of routine cancer registry data including every man diagnosed with PCa in the English NHS. As such, it presents a highly representative population. Given that less than 5% of healthcare expenditure in England covers procedures outside the NHS provided by the private sector, our study cohort also represents a near-complete cross-section of the population of men with PCa undergoing surgical treatment [29].

Patient selection and survey administration were independent of surgeons and other healthcare professionals, eliminating the possibility of selection and reporting bias. Furthermore, we had a robust sample size (7700 men) and a high response rate to the survey (73.4%). We observed some differences between responders and non-responders, but it is unlikely that these have affected the volume-outcome relationship that we report, as the response rate did not vary according to volume group, with only small differences in the men’s characteristics between the volume groups. Neither did we find differences in baseline function based on patients indicating whether their urinary incontinence or lack of sexual function was a big problem immediately before the time of diagnosis. The comparisons of the functional outcomes were adjusted for a range of patient characteristics, which further reduces the possible effect of confounding. Finally, the study benefitted from the use of a validated instrument that is widely used to determine sexual and urinary function after PCa treatment (EPIC-26).

Our results provide a snapshot of the functional outcomes collected at least 18 months after diagnosis. This implies that we were not able to explore whether hospital volume had an impact on the speed of functional recovery after surgery or whether there is a trade-off between functional outcome and cancer cure. Data on nerve-sparing technique were unavailable in this study.

A recent systematic review reported that increasing surgeon experience (>50 RPs/year) is associated with better urinary incontinence recovery rates, although the authors highlight key methodological limitations of this research with respect to variability in the definition of both surgeon experience and urinary incontinence, with inconsistent use of validated measures [30]. We did not investigate a potential relationship between surgeon volume and PROs for a number of reasons. First, the administrative information available identifies the experienced urologist who is ‘responsible’ for the care episode but not the ‘operating’ surgeon. In hospitals with a team-based approach and those with a relatively large number of trainee surgeons, the responsible urologist and the operating surgeon are not necessarily the same. Thus, observed volume for an individual surgeon may not be truly accurate. Second, there is evidence that short-term outcomes are more affected by the overall surgical management, including the surgeon’s experience and skill, whereas longer-term outcomes may be more affected by the support provided by the wider multidisciplinary team, including the provision of support services. These factors support consideration of hospital rather than individual surgeon volume [6].

Implications

Our study demonstrates that it is unlikely that a “volume-based policy” will lead to further improvements of functional outcomes after RP in the English NHS, where most hospitals providing PCa surgery already carry out at least 60 procedures per year. Volume-based policies are commonly implemented either by decreasing the number of low-volume providers by setting a minimum threshold or by increasing the number of high-volume providers through centralisation of care. Our findings demonstrate that it can be assumed that hospitals performing more than 60 RPs per year produce acceptable urinary continence and sexual function and further centralisation is unlikely to lead to additional improvements of clinical significance but it may have a negative impact on access, especially for patients from disadvantaged groups [31]. It is also important to note that the volume-outcome relationship may vary according to the complexity of the surgical procedure that is studied [31].

Volume-based policies follow the idea that “practice makes perfect” [31]. However, a recent study suggested that selective referral or patient choice may have had an impact on the current configuration of the English NHS hospitals that provide PCa surgery [32]. Hospitals adopting robotic surgery early and employing experienced urologists with a strong media reputation were particularly attractive to patients given that the volumes of RPs increased in such centres [33, 34]. These findings re-emphasise the need to be cautious about inferring causation when interpreting the relationship between hospital volume of RPs and outcomes.

Conclusions

The results from this study have important implications for PCa services in many countries. We conclude that it is unlikely that there will be clinically significant improvements in urinary continence and sexual function with further centralisation of RP services beyond the level observed in the English NHS, where four in five hospitals providing PCa surgery undertake at least 60 procedures per year.