Introduction

Most cases of bladder cancer do not exhibit detrusor invasion at first diagnosis befitting the definition of non-muscle-invasive bladder cancer (NMIBC) for which optimal transurethral resection of the bladder tumor (TURBT) is crucial to risk stratification and treatment [1, 2]. However, the results of TURBT are highly heterogeneous [3] with suggestion that early recurrence might relate to imperfect visualization and incomplete excision of the tumors [4].

Since Folkman, the ability to elicit angiogenesis from the host microvasculature is considered one hallmark of cancer [5]. Narrow-Band Imaging (NBI) was developed to take advantage of this using for illumination 415 and 540 nm wavelengths that are selectively absorbed by hemoglobin and facilitate the visualization of the sub-mucosa microvasculature [6]. In addition, while the normal bladder mucosa is thin and transparent under White-Light Imaging (WLI), the increased scattering of the light by accumulated nuclei and organelles may reveal areas of increased thickness or disorganization of the urothelium. This phenomenon is digitally reinforced and shifted to the red in NBI that thereby also provides insights into the structure of the mucosa. Improved detection with NBI was consistently observed across all fields of endoscopy, notably for bladder tumors [7, 8].

However, the thorough analysis of the entire bladder is necessary before engaging resection with the 12° or 30° lenses that are needed to visualize the resection loop but are restrained in field and depth of view and in access to the entirety of the bladder lining [9, 10]. The objective of this study was to evaluate on a lesion-by-lesion basis the contribution of NBI obtained by flexible videoscopy (NBI-FV) to the detection of cancer in patients due for primary TURBT, compared to WLI flexible videoscopy (WLI-FV).

Patients and methods

Patients and study design (Fig. 1a)

Patients scheduled for TURBT were enrolled (6/2013-9/2014) in a multicentre study after informed consent. All investigators had over 10 years of experience in NMIBC detection and treatment. Following the National Institute of Health [11], this study that was designed to validate the diagnostic performances of a CE-marked diagnostic instrument did not qualify as a clinical trial and was therefore not registered. The Institutional Review Board of each institution approved the study.

Fig. 1
figure 1

a Flow chart of the prospective matched pair assessment of the relative true-positive and relative true-negative rates of cancer detection by narrow-band imaging, compared to white-light imaging. b Cancer distribution of white-light imaging and narrow-band imaging findings in 147 lesions detected during flexible videoscopy (LS Likert 5-score scale)

Patients were sequentially imaged in WLI and NBI with the Olympus CYF-VH flexible HD video scope and high-definition TV monitor (Olympus Europa SE & Co, Hamburg). The same investigator performed the two procedures to control inter-operator variability and to follow the real-life nature of the study. Suspicious findings were positioned on bladder diagrams. Under WLI-FV, the lesions were considered as either suspicious when they were found by the investigator to warrant biopsy or resection, or non-suspicious. On the contrary, for NBI-FV the suspicion was scored on the Likert 5-score scale (LS) from not (score 1), to highly suspicious (score 5). Representative pictures organized along the alterations observed in the structures of the mucosa and microvasculature are presented in Fig. 2a. All suspicious findings were individually resected and processed for pathological examination by the Institution pathologist, blinded to the techniques used for detection. Additional resections or biopsies could be taken at the discretion of the investigators. As opposed to the only previous report on flexible cystoscopy in the context of primary NMIBC [12], no biopsies were taken in areas found unsuspicious on both WLI and NBI-FV.

Fig. 2
figure 2

A Representative cases of NBI-FV findings organized along increasing abnormalities in the urothelium structure and microvasculature and graded according to Likert 5-score scale. LS1-not suspicious: Well-organized microvasculature organized along a geometric pattern underneath the transparent (non-visible) mucosal layer. LS2-mildly suspicious: Isolated alterations of microvasculature or epithelium. (a) Reddish area that demonstrates mucosal thickening above a well-organized, non-suspicious microvasculature. (b) Dense microvasculature but no tortuosity. LS3-moderately suspicious: (a) Raised urothelium that obscures the vision of the capillaries. (b) Reddish patch of thick mucosa with dense microvasculature. (c) Dense and tortuous microvasculature but no definite abnormalities of the mucosa. LS4-suspicious: (a) Micro-papillary epithelium with dense microvasculature. (b) Large veins and thick mucosa in the vicinity of a papillary tumour. LS5-highly suspicious: Exophytic lesion above a dense and tortuous capillary network. B Distribution of positive samples in relation to Likert 5-score scale

2002 AJCC-TNM staging system and 1973 WHO manual on histological typing of urinary bladder tumors were used for reporting stage and grade [13].

Statistical methodology

As highlighted by the photodynamic diagnosis literature [14], WLI considered as the diagnostic standard is far from optimal. It is, therefore, not possible to ascertain the presence or absence of cancer on the basis of WLI endoscopy alone whose unbiased estimates of the diagnostic characteristics remain elusive.

Such limitations are classical when individuals negative for a diagnostic test are not verified. However, when comparing two systems of diagnosis, unbiased estimates of the relative true-positive and relative true-negative rates of one system in relation to another can be obtained [15]. Therefore, contrary to previous reports cancer detection under NBI-FV was not assessed in terms of sensitivity or specificity but of relative True-Positive Rate (rTPR), with relative True-Negative Rate (rTNR) as the second endpoint [15]. 95% confidence intervals (95%CI) of rTPR and rTNR were computed according to the modified Wald method [16]. Detection figures according to NBI-FV Likert scores were compared using the Kruskall–Wallis test.

Sample size was obtained from the confidence interval theory [17, 18]. Detection rate with WLI-FV was approximated to 80% and it was hypothesized that to have clinical significance, NBI-FV would have to detect at least 20% more cases than WLI-FV (rTPR-NBI/WLI > 1.19). With a unilateral type I error rate of 2.5%, 120 lesions were required to obtain a power of 90%. Considering that most cancers are multifocal, no more than 80 patients were necessary to ascertain whether NBI-FV improved detection relative to WLI-FV.

Results

147 specimens were obtained in 68 patients [Mean and (95% CI); age: 70.8 years (63.4–74.2), % of male patients: 80.9% (69.9–88.5), current smokers: 51.5% (39.8–62.9), Fig. 1b]. Eighty lesions were confirmed positive at pathology (80/147; 55.8%, Table 1). In eleven patients, NBI-FV detected fourteen cancer lesions that had been overlooked by WLI-VF with upgrading or upstaging in six patients (Fig. 1b; Table 1). Of the seven additional lesions found suspicious in the trigon and bladder neck areas, two were positive for cancer. Flexible videoscopy findings in relation to pathology are summarized in Table 2. A significant increase (p < 0.0002) in the proportion of samples confirmed positive for cancer was observed with increments in the Likert scale (Fig. 2a).

Table 1 Lesions characteristics in relation to white-light and narrow-band imaging and details of the situations in which narrow-band imaging upstage or upgraded the findings obtained by white-light imaging
Table 2 Two-by-two tables of relative true-positive (rTPR) and relative true-negative rates (rTNR) of narrow-band imaging in reference to white-light imaging

The relative true-positive rate was 1.22 (95% CI 1.12–1.39) that is that NBI-FV detected 22% more cancer lesions compared to WLI-FV. The relative false-positive rate was 1.35 (95% CI 1.19–1.59), which indicated a 35% increase in false-negative specimens with Narrow-Band Imaging, compared to White-Light Imaging (Table 2). Positive predictive values for WLI-FV and NBI-FV were 63.4% (95% CI 53.6–72.1) and 59.1% (95% CI 50.6–67.1), respectively. Negative predictive values were 65.2% (95% CI 50.7–77.4) and 86.7% (95% CI 60.9–97.5).

WLI-FV sensitivity and specificity were 80.0% (95% CI 69.6–88.1%) and 44.8% (95% CI 32.6–57.4%), respectively, while the corresponding NBI-FV values were 97.5% (95% CI 91.3–99.7%) and 19.4% (95% CI 10.8–30.9%) demonstrating higher sensitivity but lower specificity for NBI-FV (both p < 0.001).

Discussion

In the best-case scenario where the limitations of rigid cystoscopy were overcome by the wide field and multiple angles of view allowed by flexible endoscopy and the optimized vision provided by chip-on-tip technology and high-definition monitor, white-light imaging failed to detect at least one cancer lesion out of five (16/80, 20.0%). Alternately, NBI-FV detected 22% more cancer lesions compared to WLI-FV, meeting the primary objective of the study (relative true-positive rate: 1.22). It also corrected the misallocation of grade or stage in a minority of patients (6/68, 8.8%).

This observation was in keeping with a recent meta-analysis, where compared to WLI, NBI was shown to carry additional detection rates of 9.9% and 18.6% in per-patient and per-lesion analysis, respectively [19]. In view of such limitations, the recent NICE guidelines concluded that at the time of TURBT, WLI should not be used as single modality but complemented by NBI, photodynamic diagnosis or a urinary biomarker test [2]. Here, we extended to the operating theater the clinical relevance of NBI flexible videoscopy already reported in the fields of primary diagnosis [7] and follow-up [20].

However, NBI-FV specificity (19.4%) was degraded compared to the already imperfect specificity of WLI-FV (44.8%). By design, the specimens were taken only when any one of the two imaging modalities was found suspicious. The number of negative findings was, therefore, not inflated with negative protocol biopsies and the specificity figures, that is the proportion of specimens found negative for cancer in non-suspicious areas, were lower than those reported in series that integrated random biopsies of non-suspicious mucosa [12, 21]. We believe that the present figures offered a better reflection of the clinical real life where inflammation and metaplastic conditions are not uncommon and random biopsies hardly recommended. In spite of that, NBI-FV entailed a 35% increase in false-negative specimens, which in the specific context of patients already due for TURBT would constitute an acceptable trade-off for optimized surgery.

NBI may highlight only two of the six distinct micro-environmental barriers that are overcome by premalignant lesions to establish cancer, that is overcoming contact inhibition and the resulting increase in thickness and angiogenesis to control acidosis [22]. Although investigators were not given recommendations to characterize their increments in suspicion, representative cases of non-papillary lesions drawn from the intraoperative records collected at the time of surgery were retrospectively organized along increasing degrees of abnormalities in the urothelium and microvasculature (Fig. 2a). By allowing the structured analysis of NBI findings, this instrument would constitute a valuable tool in the translation of NBI into the clinical routine of TURBT. Indeed, one major limitation in endoscopy comes from the fact that while the detailed analysis of the lesions may translate into a large spectrum of suspicions, the ultimate decision of the surgeon remains dichotomous to resect or leave untouched any given area of interest. While this decision is quite straightforward for typical cases of large exophytic tumors, it may be difficult in small or non-papillary abnormalities. We observed that the Likert score of NBI findings was significantly correlated to the probability of cancer in the biopsy or resection specimens (Fig. 2b) and could be used to balance the benefit of demonstrating cancer to the risk of false-positive result.

The demonstration of relevant clinical outcomes in the photodynamic diagnosis [14] or narrow-band imaging [19, 23] literatures required prolonged and costly trials. We offer that the present methodology may be better suited to the constraints of modern endoscopy where ameliorations in scopes, light sources, software and monitors drive a rapid evolution in manufacturer’s catalogs. While conformity with safety standards is certified for all new products (CE marking in Europe, FDA approval in USA), clinical relevance is more often inferred than proven. The present evaluation addressed detection, a crucial prerequisite in all urology guidelines [2, 4], taking advantage of the methodology of relative true-positive and negative rates that was designed to address the uncertain nature of a negative diagnostic test when it is not verified [15]. Moreover, this design did not constitute a clinical trial according to the most recent National Institute of Health definition [11], a key advantage to streamline clinical evaluation and accelerate the introduction of modern endoscopy technology into the clinics.

Although efforts were paid to control imprecision in design and analysis, some limitations must be acknowledged.

First, to follow the clinical routine where it is considered as an add-on to WLI, the same operator conducted NBI-FV after WLI-FV, thereby taking advantage of WLI-FV to inform on NBI-FV detection of cancer lesions. To address this bias of sequence, Shen randomly allocated the rank order of imaging between two separate surgeons. He still observed a significant 15% increase in sensitivity for NBI-FV, compared to WLI-FV (92.9% vs. 77.7%). Another recent multicentre study using the same design confirmed similar improvement [24].

Second, the complex biology of NMIBC where cancer-primed cell may show little morphological changes [22, 25] supports a continuum of endoscopic presentations from a normal looking mucosa to minute alterations in thickness and microvasculature to self-evident exophytic tumors. Conversely, pathologists must dichotomize their understanding of the specimens as either cancer or non-cancer, a demanding decision as shown by the large inter-observer variability coefficients reported in most studies [26]. This is most exacting for flat urothelial lesions that are being dismembered into a wide array of entities, some with clear premalignant potential [27]. In keeping with the real-life nature of the study, neither central review nor recommendations as to the use of immunochemistry for select cases were set, which may have influenced in either way the correct allocation of lesions of minute pathological alterations and impacted sensitivity and sensitivity performances of both modalities.

Last, the benefit on recurrence and progression of improved detection may be debated. This was not addressed here, although two meta-analyses confirmed NBI as a valuable adjunct to WLI to control recurrence rates at 3 and 12 months after TURBT [19, 23] while reduction in recurrence rates was observed in the subset of low-risk patients in the sole randomized controlled trial comparing NBI to WLI [28].

Conclusion

Researching alterations in mucosa and microvasculature by narrow-band imaging flexible videoscopy augmented by 22% the detection of cancer foci and contributed to the objective of complete resection of all visible lesions. Conversely, it entailed a 35% increase in false-positive results compared to white-light imaging, although the structured analysis of narrow-band imaging findings might be used to grade suspicion according to the Likert scale and balance the risk of a false-positive result to the benefit of demonstrating cancer.