Introduction

Non-muscle-invasive bladder cancer is treated with endoscopic transurethral resection of bladder tumours (TURBT); however, it is highly recurrent and sometimes progresses into muscle-invasive bladder cancer (MIBC) [1, 2]. The advent of photodynamic diagnosis (PDD), such as oral 5-aminolevulinic acid hydrochloride (5-ALA) and intravesical hexaminolevulinate, has improved the detection of tiny tumours and flat lesions that would otherwise be missed by white light (WL) observation alone. Previous reports have shown that PDD with oral 5-ALA increases the tumour detection rate by approximately 20% compared with WL, and the diagnostic benefits of PDD have resulted in improved recurrence-free survival [3,4,5,6,7].

Although both bladder urothelial carcinoma (BUC) and upper urinary tract urothelial carcinoma (UTUC) are derived from the urothelium and thus share many characteristics [8], the diagnostic and therapeutic strategies for UTUC differ from those for BUC. Ureteroscopy (URS) is used to observe the upper urinary tract, which is somewhat inaccessible, and the acquisition of significant pathological specimens is often challenging. Although the feasibility of PDD for UTUC diagnosis has been investigated for decades [9,10,11,12,13,14,15,16], PDD has not been clinically approved for UTUC.

To the best of our knowledge, this is the first report to investigate the difference in diagnostic performance safety of oral 5-ALA for UTUC and BUC conducted in a single institution.

Materials and methods

Ethics statements

Administration of oral 5-ALA for URS was performed as a prospective single-arm clinical trial [9]. Data were collected retrospectively from a prospective trial database. TURBT data were retrospectively collected from electronic medical charts, and the use of the data was approved by the ethics board of Kansai Medical University (IRB No. 2018,036).

Patient selection and data collection

We reviewed 20 patients with suspected UTUC who were enrolled in a prospective clinical trial conducted between May 2018 and January 2019 and underwent URS diagnosis with pretreatment with oral 5-ALA (the PDD-URS cohort). Two patients were excluded because of a lack of sufficient information for analysis in the present study. We also reviewed the data of 114 consecutive patients with suspected BUC who underwent TURBT with oral 5-ALA between January 2019 and March 2023 (the PDD-TURBT cohort). Two patients who did not undergo tissue biopsy and two patients who had a history of radiation or systemic therapy for advanced BUC were excluded from the study. All procedures were performed at the Kansai Medical University Hospital. The recorded clinicopathological characteristics of the patients included age, sex, tumour appearance, history of URS for suspected UTUC, TURBT, intravesical UC treatment for BUC, preoperative urine cytology results, and laterality of the suspected upper urinary tract. Regarding tumour appearance, “irregular urothelium” referred to a thick, irregular, or reddened epithelium. Considering that a history of intravesical Bacillus Calmette-Guérin (BCG) treatment and/or TURBT has a negative effect on the diagnostic performance of PDD [17, 18], the PDD-TURBT cohort was further subdivided into PDD-TURBT-naïve (patients without previous TURBT), PDD-TURBT-recurrent (patients with a history of TURBT), and PDD-TURBT-BCG (patients with a history of intravesical BCG treatment) subgroups for assessment of diagnostic performance of WL and PDD.

Surgical procedures

Each patient was orally administered 20 mg/kg of 5-ALA (ALAGLIO®; SBI Pharmaceuticals Co., Tokyo, Japan) dissolved in 50 mL of water 2 to 4 h before URS or TURBT. In all cases, surgical procedures were performed under spinal anaesthesia, except for five cases of TURBT performed under general anaesthesia. The details of the URS system used in this study have been described in our previous study [9]. A D-Light C® (Karl Storz, Tuttlingen, Germany) with a removable protoporphyrin IX excitation eyepiece filter permitting blue-violet light (BL) (380–430 nm) was used for PDD.

Pathological assessment

All specimens were assessed by pathologists at the Department of Pathology, Kansai Medical University. One of the authors, C. O., reassessed the false-positive specimens of PDD regarding the existence of potential precancerous findings, such as suspicious lesions for malignancy (but not decisive for UC) and dysplasia.

Statistical analysis

Significant differences between the two groups were estimated using Student’s t-test for parametric data and the Mann–Whitney U test for nonparametric data. Fisher’s exact test or the chi-squared test was used to compare the contingency groups. All analyses were performed using GraphPad Prism version 9.4.1 (GraphPad Software, San Diego, CA, USA). Calculations of all diagnostic accuracy parameters, the area under a receiver operating characteristic curve (AUC), and statistical comparison of the AUC were performed using EZR. All reported P values were two-sided, and statistical significance was set at p < 0.05.

Results

Patients’ characteristics

The characteristics of the 18 and 110 patients who underwent PDD-URS and PDD-TURBT are shown in Table 1. Of the 18 patients with suspected UTUC, 11 (61.1%) showed visible tumour lesions on contrast-enhanced computed tomography (CT) scan or magnetic resonance imaging (MRI), whereas 48 (43.6%) showed papillary tumour lesions on cystoscopy in 110 patients with suspected BUC. In the PDD-URS cohort, seven patients were suspected to have UTUC in the right upper urinary tract and 11 patients in the left upper urinary tract. In the PDD-TURBT cohort, 30 patients (27.3%) received intravesical BCG treatment. Random biopsy was not performed in the PDD-URS cohort, whereas it was performed in most patients whose urine cytology results were suspicious or malignant in the PDD-TURBT cohort; 66 patients (60.0%) underwent random biopsy.

Table 1 Baseline characteristics of patients who underwent PDD for suspected UTUC and BUC

Overall pathological results per patient and sample

Eight of the 18 patients (44.4%) in the PDD-URS cohort did not show malignancy, another eight showed Ta tumours, and the other two each showed carcinoma in situ (CIS) and T1 tumours. Twenty-two of the 110 patients (20.0%) in the PDD-TURBT cohort did not show malignancy, 37 and 24 showed Ta tumours and CIS, respectively, and the remaining 27 patients showed invasive T1 or T2 tumours. Concomitant CIS was found in 18 patients.

In total, 63 biopsy samples were obtained in the PDD-URS cohort, 38 of which were positive for PDD (Table 2a). Twenty-two of 25 PDD-negative samples (88.0%) showed no malignancy, whereas only seven of 38 PDD-positive samples (18.4%) exhibited no malignancy. Out of 25 samples negative for PDD, only three samples were positive for malignancy. All 25 tumours visible with WL were PDD-positive, and only three samples (12.0%) did not show malignancy. When the urothelium appears irregular with the WL, PDD-negative samples showed no malignancy in six out of seven samples (85.7%), and PDD-positive samples showed malignancy in six out of eight samples (75.0%), suggesting the usefulness of PDD for irregular urothelium. Out of the five samples that appeared normal with WL but positive for PDD, three showed malignancy, whereas out of 18 PDD-negative samples from the normal urothelium, only two samples exhibited malignancy (Table 2a).

Table 2 Pathological results per biopsy samples in the PDD-URS and PDD-TURBT cohorts

In total, 708 biopsy samples were obtained from the PDD-TURBT cohort, of which 309 were positive for PDD (Table 2b). Of the 399 PDD-negative samples, 317 (79.4%) showed no malignancy, whereas 104 of the 309 PDD-positive samples (33.7%) demonstrated no malignancy. Ninety-four of the 101 visible tumours with WL (93.1%) were PDD-positive, and among them, 85 samples (90.4%) showed malignancy. PDD was negative in seven visible tumours (6.9%), and five samples (71.4%) were malignant. In the irregular urothelium with WL observation, as many as 47 of 113 PDD-positive samples (41.6%) were negative for malignancy, and 10 of 34 PDD-negative samples (29.4%) were malignant. These findings suggested inferior diagnostic performance of PDD for irregular urothelium in the PDD-TURBT cohort compared to that in the PDD-URS cohort. Sixty-five of the 358 lesions that appeared normal and PDD negative (18.2%) were positive for malignancy, suggesting the importance of random biopsy. Moreover, of the 102 samples from the normal urothelium with PDD positivity, 48 (47.1%) showed no malignancy, indicating a high incidence of false PDD positivity in the normal urothelium, as was also observed in the irregular urothelium (Table 2b).

Diagnostic performance of PDD in suspected BUC and UTUC

The diagnostic accuracies of WL and PDD in 63 biopsies from 18 patients in the PDD-URS cohort estimated with AUC were 0.74 and 0.84, respectively (p = 0.011). The sample sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) for WL and PDD were 85.3% and 91.2%; 62.1 and 75.9%; 72.5 and 81.6%; and 78.3 and 88.0%, respectively (Table 3).

Table 3 Diagnostic accuracy of WL and PDD in the PDD-URS and PDD-TURBT cohorts

The diagnostic accuracies of WL and PDD in 708 biopsies from 110 patients in the PDD-TURBT cohort estimated with AUC were 0.69 and 0.73, respectively (p = 0.00028). The sample sensitivity, specificity, PPV, and NPV values for WL and PDD were 57.8%, 71.4, 80.5%, 75.3, 66.9%, 66.3, 73.7%, and 79.4%, respectively. As expected, PDD biopsy for patients in the PDD-TURBT-naïve subgroup showed the highest AUC, followed by that for patients in the PDD-TURBT-recurrent subgroup and then for patients in the PDD-TURBT-BCG subgroup. Among the estimated parameters, the specificity and PPV showed the same trend as the AUC values (Table 3).

All estimated diagnostic parameters of PDD were better in the total PDD-URS cohort than in the total PDD-TURBT cohort, and the AUC value of PDD biopsy in the total PDD-URS cohort was higher than that in the total PDD-TURBT cohort. In particular, the sensitivity (91.2 vs. 71.4%) and PPV (81.6 vs. 66.3%) were much better in the total PDD-URS cohort than in the total PDD-TURBT cohort. Next, we compared the AUC values of PDD between the total PDD-URS cohort and the PDD-TURBT-naïve subgroup, which showed the best diagnostic performance among the PDD-TURBT subgroups, and found that the AUC value in the total PDD-URS cohort was slightly higher than that in the total PDD-TURBT-naïve subgroup (0.84 vs. 0.76) (Table 3). Thus, the diagnostic performance of PDD in the PDD-URS cohort can be considered comparable to that in the PDD-TURBT-naïve subgroup.

Pathological findings of false-positive samples

In the PDD-URS cohort, four out of seven false-positive samples of PDD (57.1%) had features of potential precancerous lesions (one suspicious for malignancy and three for dysplasia). In contrast, only eight of 101 false-positive samples of PDD (7.9%) in the PDD-TURBT cohort showed precancerous features (four suspicious for malignancy and four for dysplasia). Thus, false PDD-positive samples in the PDD-URS cohort showed a significantly higher incidence of precancerous findings than did those in the PDD-TURBT cohort (p = 0.0027).

Discussion

Although several studies have demonstrated the potential clinical benefits of PDD for UTUC [9,10,11, 13,14,15,16, 19, 20], there are no previous reports comparing the diagnostic performance of PDD between URS and TURBT. This study provides valuable basic information regarding differences in the diagnostic performance of PDD depending on the location within the urinary tract.

The diagnosis of UTUC is often challenging because of the limited accessibility to the upper urinary tract and potential modification of the mucosa caused by the manipulation of guidewires and ureteroscopes. Consequently, diagnostic accuracy is influenced by the skill and experience of operators. When malignant lesions in the upper urinary tract are not visible, the diagnosis becomes extremely challenging because a random biopsy of the upper urinary tract, as performed in the bladder, is not routinely feasible. PDD for UTUC appears to offer a promising solution for addressing these issues.

In the present study, all estimated diagnostic parameters of PDD were better in the PDD-URS cohort than in the PDD-TURBT cohort (Table 3). The AUC values in the PDD-URS cohort appeared to be equivalent, even when compared to the PDD-TURBT-naïve subgroup, which showed the best diagnostic performance among the PDD-TURBT subgroups. Moreover, the incidence of precancerous findings in false PDD-positive samples in the PDD-URS cohort was much higher than that in the PDD-TURBT cohort (57.1% vs. 7.9%, p = 0.00028). Although a direct statistical comparison of the diagnostic performance of PDD between the PDD-URS and PDD-TURBT cohorts is not possible practically, our findings support the efficacy of PDD in the diagnosis of UTUC.

The sensitivity of PDD for BUC was lower than that reported in previous studies (Table 3) [21, 22]. This is probably because as many as 60% of the patients in the present study underwent random biopsy, which often detected malignancy negative for PDD. A high false-positive rate has been recognised as one of the issues with PDD for BUC [18, 23]. The false-positive rate of PDD for BUC was also high in the present study. However, when the PDD-TURBT cohort was subclassified into the PDD-TURBT-naïve, PDD-TURBT-recurrent, and PDD-TURBT-BCG subgroups, the false-positive rate of the PDD-TURBT-naïve subgroup was only 18.9%, whereas those of PDD-TURBT-recurrent and PDD-TURBT-BCG subgroups were approximately 50% (Table 3), suggesting that false positivity is caused by reactive changes in previous TURBT and intravesical BCG treatment.

Our study had some limitations. First, we could not conclude the advantages of applying PDD for UTUC to clinical standards because of the retrospective design and small number of patients, especially in the PDD-URS cohort, in the present study. Second, the procedures were conducted at different time periods in the PDD-URS and PDD-TURBT cohorts, which may have affected the diagnostic performance, even though most surgeons overlapped between the time periods.

Conclusion

PDD with oral administration of 5-ALA improved the diagnostic performance of BUC and UTUC. Among the subgroups in the PDD-TURBT cohort, the PDD-TURBT-naïve subgroup showed the best diagnostic performance, and a history of TURBT and intravesical BCG considerably decreased diagnostic accuracy. The diagnostic performance of PDD in the PDD-URS cohort was equivalent to that in the PDD-TURBT-naïve subgroup, supporting the efficacy of PDD for UTUC.