Introduction

Bladder cancer is the seventh and 17th most commonly diagnosed cancer in males and females, respectively, with age-standardized incidence rates of 9.5 and 2.4 per 100,000 person/year, respectively [1]. Approximately 75% of patients with bladder cancer are diagnosed with non-muscle-invasive bladder cancer (NMIBC) [1]. Due to the high risks of disease recurrence and progression after transurethral resection of bladder tumor (TURBT) under simple white-light (WL) guidance in NMIBC patients, additional management strategies, including intravesical instillation therapy, second TUR, and photodynamic diagnosis (PDD) under blue-light guidance, have been developed to improve the prognostic outcomes of these patients, and these advantages have been widely recognized in recent years.

Several systematic reviews and meta-analyses demonstrated that PDD-TURBT using 5-aminolaevulinic acid (5-ALA) or hexaminolevulinic acid increased detection rate of tumors, particularly that of CIS [2] and reduced the risk of disease recurrence more than WL-TURBT in patients with NMIBC [3, 4]. Furthermore, progression to muscle invasive disease was effectively inhibited by PDD-TURBT [4]. However, the oncological benefits of PDD-TURBT reported in several randomized control studies differed [5,6,7,8]. In addition, PDD is known to be accompanied by several adverse events, such as photosensitivity, liver injury, and hypotension [9, 10], and the cost-effectiveness of PDD-TURBT remains controversial [8, 11, 12]. Accordingly, PDD-TURBT is not strongly recommended or is only recommended for limited cohorts in major clinical guidelines [13, 14].

Based on these findings, it is important to preoperatively identify patients expected to benefit from PDD-TURBT. Although previous studies identified factors associated with oncological outcomes in NMIBC patients receiving PDD-TURBT, most included an insufficient number of patients and/or conducted assessments based on factors unavailable prior to surgery. Therefore, the present study included 1174 patients with NMIBC who were treated with PDD-TURBT or WL-TURBT and subsequently observed for ≥ 6 months, and the significance of baseline parameters before TURBT was evaluated as predictors of recurrence-free survival (RFS) according to the type of TURBT.

Patients and methods

Patients

This retrospective multicenter study was approved by each Ethics Committee of the participating institutions (No. 21-058). Informed consent was obtained from the participants or deceased patients’ families through posters and/or websites using the opt-out method.

After the exclusion of patients with an observation period < 6 months following TURBT and/or missing data for analyses, the present study examined 1174 consecutive patients with primary NMIBC, consisting of 385 and 789 who underwent PDD-TURBT with oral 5-ALA under blue-light guidance (the PDD group) and non-PDD-TURBT under WL guidance (the WL group), respectively, between February 2006 and June 2021 at Nara Medical University Hospital, Saitama Medical University International Medical Center, Hamamatsu University Hospital, Kochi Medical School Hospital, and Yamaguchi University Hospital. The following clinicopathological data of the patients included were obtained from their medical records at each hospital: age, sex, previous or concomitant history of upper urinary tract urothelial cancer (UTUC), preoperative cytology, tumor multiplicity, tumor size, pathological T stage, concomitant carcinoma in situ (CIS), second TURBT, intravesical bacillus Calmette–Guérin (BCG) therapy, including both induction and maintenance therapies, and intravesical chemotherapy, including both single immediate and adjuvant instillation therapies, adverse events, and follow-up data.

Surgical procedure, postoperative therapy, and follow-up protocol

In the PDD group, approximately 3 h (range, 2–4 h) before surgery, patients orally received 5-ALA hydrochloride (SBI Pharmaceuticals Co., Ltd., Tokyo, Japan) in water at a dose of 20 mg/kg. The procedures and instruments used for anesthesia and the two types of TURBT were described in detail in our previous studies [15,16,17]. Each physician decided whether the following postoperative management approach was performed: the intravesical adjuvant administration of anthracyclines, the intravesical administration of BCG, and second TUR. Patients were followed-up with routine cystoscopy combined with urine cytology and imaging examinations at regular intervals according to contemporary clinical practice guidelines.

Statistical analysis

Statistical analyses were performed using SPSS software Version 28.0 (IBM Corp., Chicago, IL, USA). Clinical characteristics between the PDD and WL groups were compared using the chi-square test for categorical variables. To evaluate the efficacy of PDD-TURBT, RFS was calculated as the length of time from initial TURBT to pathologically diagnosed intravesical recurrence or overall death by the Kaplan–Meier method, and differences in RFS between the PDD and WL groups were compared using the Log-rank test. A propensity score matching analysis was performed using a 1:1 and 1.2 nearest neighbor matching algorithm without replacement with distances assessed by logistic regression. The analysis was performed based on age, sex, previous or concomitant history of UTUC, preoperative cytology, tumor multiplicity, tumor size, pathological T stage, concomitant CIS, second TURBT, and intravesical BCG therapy, including both induction and maintenance therapies. Covariate distributions between patients undergoing PDD-TURBT or WL-TURBT were balanced after conditioning on the propensity score, with a difference of < 0.1 in the standardized difference after matching being considered to indicate a good balance. After propensity score matching, each preoperative baseline factor that contributed to longer RFS in the PDD group than in the WL group was analyzed in subgroups using Cox’s proportional hazards regression model. P < 0.05 (two-sided) were considered to be significant.

Results

Among the 1174 patients included in the present study, 385 and 789 were classified into the PDD and WL groups, respectively. The clinicopathological characteristics of these patients are shown in Table 1. Although significant differences were only observed in the percentage of patients undergoing second TUR between the PDD and WL groups, the unbalanced distribution of some factors was noted between the two groups.

Table 1 Characteristics of patients in the whole cohort

Median observation periods were 19.9 and 29.7 months in the PDD and WL groups, respectively. During the observation period, disease recurrence or overall death occurred in 71 (18.4%) and 293 (37.1%) patients in the PDD and WL groups, respectively. Median RFS was not reached and was 79.0 months in the PDD and WL groups, respectively, and was significantly longer in the former than in the latter (P = 0.006) (Fig. 1a).

Fig. 1
figure 1

Kaplan–Meier curves of recurrence-free survival (RFS) and the P value calculated by the Log-rank test between PDD and WL groups. a A comparison of 1174 patients in the entire cohort before propensity score matching. b A comparison of 776 patients after propensity score matching population

To adjust for imbalances in baseline characteristics, including age, sex, previous or concomitant history of UTUC, preoperative cytology, tumor multiplicity, tumor size, pathological T stage, concomitant CIS, second TURBT, and intravesical BCG therapy, between the two groups, 1:1 propensity score matching was performed. Following propensity score matching, 383 patients were included in both groups, and backgrounds between the two groups were statistically balanced (Table 2).

Table 2 Characteristics of the propensity score matching population

In the propensity score matching cohort, RFS was not reached and was 65.5 months in the PDD and WL groups, respectively. As shown in Fig. 1b, RFS was significantly longer in the PDD group than in the WL group (P < 0.001). Subgroup analyses of RFS were presented as forest plots in Fig. 2. In the subgroup with tumors ≥ 30 mm, no significant difference was noted in RFS between the two groups (HR 0.68, 95% CI 0.38 to 1.23, P = 0.21); however, in the remaining subgroups, RFS was significantly longer in the PDD group than in the WL group. Figure 3 shows Kaplan–Meier curves according to tumor sizes after propensity score matching. Among patients with tumors < 30 mm, RFS was significantly longer in the PDD group than in the WL group (P < 0.001), but was not in those with tumors ≥ 30 mm (P = 0.20). In addition, according to European Association of Urology (EAU) NMIBC risk group based on 2021 EAU NMIBC scoring model, PDD group had significantly longer RFS than WL group in the high-risk group, but not in the remaining 3 groups (Fig. 4).

Fig. 2
figure 2

Recurrence-free survival (RFS) according to subgroups of preoperatively evaluable factors. The number of events/the number of patients and forest plots according to subgroups. The hazard ratio (HR) (median, range) and P value were analyzed by Cox’s proportional hazard analysis

Fig. 3
figure 3

Kaplan–Meier curves of RFS and the P value calculated by the Log-rank test between PDD and WL groups in the propensity score matching population. a Patients with tumors < 30 mm. b Patients with tumors ≥ 30 mm

Fig. 4
figure 4

Kaplan–Meier curves of RFS and the P value calculated by the Log-rank test between PDD and WL groups in the propensity score matching population according to EAU risk group. a Low risk. b Intermediate risk. c High risk. d Very high risk

As common adverse events associated with PDD, the incidences of liver injury and hypotension were investigated as follows: liver injury occurred in 5.5% (n = 21) and 0.6% (n = 5) in the PDD group and WL groups, respectively (P < 0.001), and hypotension in 4 patients (1.0%) and 2 patient (0.5%), respectively (P = 0.69). In addition, one patient (0.3%) in PDD group developed photosensitivity.

Discussion

The combination of PDD with TURBT is generally regarded as a beneficial approach for patients with NMIBC. Veeratterapillay et al. recently conducted a systematic review and meta-analysis including randomized control trials that evaluated the postoperative outcomes of WL- and PDD-TURBT, and found that PDD-TURBT increased recurrence-free rate at both 12 and 24 months [3], while an abridged Cochrane review showed that PDD-TURBT reduced the risk of both disease recurrence and progression more than WL-TURBT [4]. However, several issues associated with the combination of PDD and TURBT have been reported, such as inconsistent prognostic benefits in prospective studies [5,6,7,8, 18,19,20,21] and false-positive findings [2]; therefore, PDD-TURBT is recommended under limited conditions in major guidelines [13, 14]. Based on these findings, we conducted a retrospective multi-center study that included 1174 NMIBC patients who underwent TURBT and were followed up for ≥ 6 months in order to assess the prognostic impact of PDD-TURBT and identify the cohort most likely to benefit from PDD-TURBT.

A previous systematic review reported that 2-year RFS rates were between 40.0 and 89.6% and between 28.0 and 72.0% in the PDD and WL groups, respectively [3]. In the entire cohort of this study before propensity score matching, 2-year RFS rates were 82.5 and 73.2% in the PDD and WL groups, respectively, suggesting similar or even superior postoperative disease control to those in previous studies. In this series, RFS was significantly longer in the PDD group than in the WL group before matching. Furthermore, after matching, which balanced all of the parameters examined, RFS was also significantly longer in the PDD group than in the WL group. Therefore, the present results suggest that PDD-TURBT significantly prolonged RFS because the effects of imbalanced baseline parameters on RFS were markedly reduced by propensity score matching from those in a simple observational study.

A number of disadvantages of PDD-TURBT have been reported [2, 9, 10, 22, 23]. For example, the oral administration of 5-ALA rarely induced severe hypotension and liver toxicity [9, 10]. In fact, the incidence of liver injury was significantly higher in the PDD group and one patient developed photosensitivity after administration of 5-ALA in this series. Furthermore, regarding the oral administration of 5-ALA, patients need to be tolerant of an acidic taste, avoid phototoxic drugs, and reduce their exposure to sunlight or room light in order to prevent photosensitivity [24]. Accordingly, it is important to preoperatively select optimal candidates who will achieve prognostic benefits by PDD in combination with TURBT; however, few studies have investigating these issues [21, 25]. The present assessment of subgroups classified according to several preoperatively evaluable factors after propensity score matching revealed that PDD-TURBT was effective for the majority of NMIBC patients, except for those with tumors ≥ 30 mm. Kobayashi et al. reported similar findings to the results of our subgroup analyses; recurrence within < 500 days after PDD-TURBT was not significantly reduced in NMIBC patients with tumors ≥ 30 mm [25]. Although it was characterized in detail, the lack of prognostic benefits in patients with tumors ≥ 30 mm after PDD-TURBT may be associated with a long operative time during the management of large tumors because the stability of protoporphyrin IX induced by 5-ALA is known to be dependent on the wavelength and intensity of light used during surgery, while its elimination (photobleaching) is accelerated during WL cystoscopy [2].

There are several limitations that need to be addressed. Initially, adjuvant therapy and the follow-up protocol were not standardized, because data were retrospectively acquired from multiple institutions for a long period. In particular, detailed information, regarding intravesical instillation therapy with BCG and chemotherapeutic agent was not available. Secondly, sample in each subgroup was comparatively small to conclude the effectiveness of PDD-TURBT, especially, in patients with tumors ≥ 30 mm. Furthermore, the prognosis of patients may have been affected by differences in the skill of surgeons; however, multiple surgeons, even in each institution, were involved. Moreover, due to the study period between 2006 and 2020, tumor grades were excluded from statistical analyses and the cytology-reporting system was not standardized as The Paris System. Thirdly, variables for propensity score matching were not selected based on subjective criteria in this study. For example, it may be controversial to include CIS as a variable of propensity score matching, since detection of CIS is one of the advantages of PDD-TURBT. However, a systematic-review and meta-analysis indicated the incidence of CIS was not always high in PDD group [3], and concomitant CIS is a well-recognized major factor influencing RFS. Accordingly, to balance the background involving risk of recurrence, we included CIS as the variables of matching, like a previous report [25]. Lastly, the retrospective design of this study may lead to limited ability to accurately capture AEs; thus, it should be considered that adverse events could be under-reported.

In conclusion, RFS was significantly longer in the PDD group than in the WL group in both the entire cohort and propensity score matching cohort. However, subgroup analyses showed no significant differences in RFS between patients with tumors ≥ 30 mm who underwent PDD-TURBT or WL-TURBT.