Introduction

Currently, robot-assisted systems allow the performance of the majority of complex surgeries, traditionally conducted with either an open or a laparoscopic approach. Surgical experience is maximally enhanced by utmost movement precision, excellent ergonomics, and minimal patient cosmetic consequences [1, 2].

To date, multiple robotic systems have emerged as innovative solutions in the field of minimally invasive surgery and numerous technical refinements have been progressively implemented to the daily robotic practice [2]. Among the latter, intraoperative image guidance emerges as needful support during robotic procedures as, for instance, in the distinction of malignant vs benign tissues, and in the identification of specific anatomic structures (e.g., blood or lymphatic vessels, lymph nodes, or glands) [3,4,5,6]. Image guidance includes a heterogeneous variety of tools, among which near-infrared fluoresce (NIRF) [7], intraoperative ultrasound (IOUS) [8], and augmented reality (AR) [9, 10] represent some of the most widespread.

Fluorophore-based NIRF (e.g., indocyanine green [ICG]-NIRF) provides deep photon penetration (near infrared region, between 650 and 900 nm), thus magnifying optical contrast. NIRF tracers are not visible using white light and their use does not obscure the routine surgical field. Therefore, dedicated NIRF cameras allow to instantaneously switch from white light vision to NIRF vision upon request of the operator, and vice versa [11].

When the proper understanding of anatomy is a matter of crucial relevance, either, for example, concerning the relationship of a tumor with the surrounding healthy parenchyma, or the stream of arteries and veins, IOUS and AR provide further assistance. These types of image guidance increase the meticulousness of surgical gestures, mainly enhancing precision in tumor resection, as well as vascular isolation and clamping. These are all aspects involved in improving patient surgical outcomes, as intraoperative adverse events, surgical margins, ischemia time, and overall procedural time, among others [12].

The current literature regarding image-guided surgery (IGS) is flourishing [13,14,15]. However, the real impact of such heterogeneous technologies applied to different surgical fields on everyday practice has never been deeply addressed. In consequence, the real additional benefit of IGS during robot-assisted procedures remains an object of debate, and the magnitude of such benefit may be different according to the various surgical fields where IGS is used.

The objective of the current study consists in assessing the quality of the contemporary body of literature concerning IGS in robotic surgery throughout different surgical specialties. Additionally, the current study aims to provide meta-analytic data concerning the actual surgical benefits of the adoption of IGS during robotic procedures.

Materials and methods

Study identification and evaluation

A systematic review of the literature was conducted using the PubMed, Cochrane Central Register of Controlled Trials (CENTRAL), EMBASE, MEDLINE, and Scopus databases. We searched from inception of the databases up to March 1, 2023. All the references for key reviews on IGS were also screened. Keywords used for the research were as follows: “((robotic surgery OR robot-assisted) AND (image-guided surgery OR radio-guided surgery OR molecular imaging OR molecular trac* OR hybrid trac* OR bimodal trac* OR fluorescence imaging OR magnetic particles OR ultrasound guidance OR augmented reality OR virtual reality OR 3D) AND (staging accuracy OR diagnosis OR metastases OR complications OR oncological outcomes OR functional outcomes OR recurrence OR survival OR mortality).” This systematic review is reported in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-analyses Protocols (PRISMA-P) guidelines [16] and is registered within the international prospective registry of systematic reviews (PROSPERO, CRD42023480670).

Initial screening, eligibility criteria, and risk of bias assessment

After identifying all eligible studies, eleven independent reviewers screened all titles and abstracts (or full text, for further clarification) for inclusion in the study. Literature reviews, editorials, comments, and studies that did not answer the review question were excluded at the eligibility evaluation (Fig. 1). Also, studies including small sample size (less than ten patients) were excluded. Lastly, only comparative studies that tested performance of an intraoperative guidance vs control were included for the final quantitative synthesis, yielding a final cohort of 29 assessable studies. Disagreements regarding eligibility were resolved by discussion with third parties (EM, PD) until consensus was reached.

Fig. 1
figure 1

PRISMA flow diagram

Methodological quality of the included studies was graded using modified Risk Of Bias in Non-Randomized Studies-of Intervention (ROBINS-I) checklist [17]. Three investigators (AM, FC, ST) independently assessed the risk of bias for all studies. In case of disagreements, a discussion with an additional experienced investigator (EM) was carried out.

Intervention and comparison arms

For meta-analytic evaluation, the intraoperative surgical guidance was considered the experimental arm. The group which received standard surgery represented the comparison arm (Table 1). Of the 29 assessable studies, 18 used ICG-NIRF [18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35], nine used 3D/VR guidance [36,37,38,39,40,41,42,43,44], and two used US guidance [45, 46].

  • ICG-NIRF

Table 1 Characteristics of studies included in the quantitative meta-analysis

Freshly prepared ICG is diluted to 2.5 mg/ml with a maximum dosage of 2 mg/kg [21]. After the administration of the dye, the light can be switched to NIRF. The pass of ICG can be observed in the surgical field of interest (i.e., kidney [18,19,20,21], lymph nodes [23,24,25,26,27], ureteral anastomosis[22], uterus [28], thyroid [29, 30, 32], stomach [33,34,35], rectal sphincter [31]) after approximately 2 min. An exact discrimination between vascularized and de-vascularized areas can be achieved.

ICG can also be used for Sentinel Node Biopsy procedure. In this case, free-ICG (5 mg in 2 mL of sterile water) or combined with 99mTc-nanocolloid (0.5 mg of albumin, 0.25 mg of ICG, and 240 MBq of 99mTc in 2 mL of saline) can be injected in the organ of interested (i.e., prostate [47]) and used for sentinel node biopsy via lymphatic mapping.

  • 3D/VR guidance

The three-dimensional virtual model of the anatomy of interest (i.e., kidney [36, 38,39,40, 42], prostate [41,42,43]) is originated from computed tomography or magnetic resonance images, using a specific software among those nowadays available on the market. Typically, a rendered colored virtual-segmented 3D model can be rotated in all dimensions, and each extracted volume (e.g., tumor, pyelo-caliceal system, renal cortex, seminal vesicles) can be made opaque, translucent, or hidden, separately.

  • US

A drop-in US probe inserted by the table assistant and maneuvered by the console surgeon allows real-time visualization of the surgical target (e.g., renal tumor, edge between healthy parenchyma and tumor [45], vascular structures, ureter, perineal pelvic floor [46]).

Outcomes definition

For pooled meta-analysis, we evaluated outcomes that were assessed in at least three studies. Specifically, the following outcomes were assessed among studies of all specialties considered: operative time, length of stay (LOS), estimated blood loss (EBL), surgical margins, postoperative complication rate, overall number of nodal retrievals, number of positive nodes identified, ischemia time, and estimated reduction of glomerular function (eGFR). Additional single-studies outcomes were described within qualitative synthesis and reported in Table 1.

Data synthesis and statistical analysis

Data not suitable for meta-analytic evaluation was presented in a narrative fashion (qualitative analysis). Reported results for continuous outcomes were pooled using bias-corrected ratio of means (ROM) according to previous established methodology [48, 49]. Thus, the bias-corrected odds ratio (OR) were used to compare dichotomous variables, respectively. All results were reported with 95% confidence intervals. Pre-planned subgroups analyses were performed in studies after stratification according to the type of intraoperative guidance (3D/VR vs UltraSound vs NIRF-ICG vs Hybrid Nanocolloid vs Carbon nanoparticles).

Heterogeneity between studies was measured using the I2 statistic [50] and the between-study variance (t2) from the random-effect analyses. I2 values > 50% indicate large inconsistency. Unless otherwise indicated, all models have allowed for different effect sizes (random effects). In case of large heterogeneity, random effect models (using the DerSimonian and Laird approach [51]) were prioritized. For the assessment of small study effects and publication bias, values of the ROM or OR were plotted against their standard error in a contour-enhanced funnel plot. The latter bias represents the error in connection with whether a study is published or not depending on the characteristics and result of individual studies [52]. This error is caused because statistically significant study results generally have a higher likelihood of being published. Furthermore, Eggers asymmetry test [53] was used to explore statistically the presence of publication bias. Statistical significance for all analysis was defined as two-sided p < 0.05. Statistical analysis was performed with the R software (version 3.6.3; http://www.r-project.org/) [54].

Results

Study selection flowchart

Figure 1 shows the flow of studies through the screening process. Overall, 13,626 papers were blindly screened by eleven reviewers (AD, AL, AM, EP, FA, FB, FC, ML, MLP, RD, ST), with 3247 of these records included for further evaluation based on pre-defined eligibility criteria. Of these, 330 studies were considered eligible for final inclusion in qualitative analysis. At this point, final evaluation for the inclusion in the quantitative synthesis was carried out by three reviewers (EM, GF, PD). At the end of the process, 29 manuscripts were included for the quantitative meta-analysis (Table 1).

Study quality and risk of bias

The Supplementary Material Appendix 1a-b summarizes the quality criteria assessed for each study using the ROBINS-I tool. The overall methodological quality of the studies was moderate, with most of the studies having moderate or low risk of bias.

Evidence synthesis

Table 1 summarizes general and design characteristics of the selected studies. Primary analysis included 29 papers for quantitative synthesis. The final screened manuscripts reported outcomes based on 11 different surgical procedures across the included surgical specialties (general surgery, gynecology and urology). Overall, 18 studies used NIRF with ICG [18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35]. Nine studies used 3D/VR guidance [36,37,38,39,40,41,42,43,44], while two implemented US guidance [45, 46]. According to the outcomes, 24, 16, and 18 studies assessed operative time [18,19,20,21,22, 25,26,27,28, 30,31,32,33,34,35,36,37, 39,40,41, 43,44,45,46], LOS [18, 20, 22, 26, 28, 30, 32,33,34, 36, 37, 39, 41, 44,45,46], and EBL [18,19,20,21,22, 26, 28, 30, 33, 35,36,37, 39,40,41, 44,45,46], respectively. Additionally, eight, seven, and nine assessed total number of nodes retrieved [23,24,25,26,27, 32, 33, 35], number of positive nodes [24,25,26,27, 32, 33, 35], and rate of surgical margins [9, 24, 25, 38, 39, 41, 42, 44, 45], respectively. Lastly, eight and five tested the impact of IGS on ischemia time [18, 20, 21, 36,37,38,39, 45] and eGFR reduction [18,19,20, 39, 45]. These last concerned nephron-sparing surgery.

In quantitative synthesis testing for operative time, a pooled meta-analysis on 3987 patients was conducted (Fig. 2a), using random-effects models. Overall, intraoperative guidance did not impact operative time (ROM 1.00, 95% CI 0.94; 1.06; p = 0.9). Funnel plot and Eggers’ linear regression estimates (bias 1.36, p = 0.3) both showed absence for potential publications bias (Supplementary Material Appendix 2). In subgroup analyses according to the guidance used, intraoperative guidance was not associated with different operative time in any of the evaluated groups (Fig. 2b).

Fig. 2
figure 2

Quantitative synthesis concerning operative time, a regardless of tracer type; b according to tracer type

In quantitative synthesis testing for LOS, a pooled meta-analysis on 1261 patients was conducted. Overall, at ROM analysis, use of intraoperative guidance was associated with 12% reduction in LOS compared to standard surgery (ROM 0.88, 95% CI 0.77; 0.99; p = 0.03) (Fig. 3a). Funnel plot and Eggers’ linear regression estimates (bias − 1.60, p = 0.4) recorded absence for potential publications bias (Supplementary Material Appendix 3). When testing type of guidance, LOS reduction was significant only in those studies implementing NIRF-ICG (ROM 0.87, 95% CI 0.80; 0.95; p = 0.02) (Fig. 3b).

Fig. 3
figure 3

Quantitative synthesis concerning length of stay, a regardless of tracer type; b according to tracer type

In quantitative synthesis testing for EBL, a pooled meta-analysis on 1496 patients was conducted. Overall, patients treated with intraoperative guided-surgery had lower EBL compared to standard surgery (ROM 0.87, 95% CI 0.77; 0.99, p = 0.03) (Fig. 4a). Funnel plot and Eggers’ linear regression estimates (bias − 1.31, p = 0.4) demonstrated absence of potential publications bias (Supplementary Material Appendix 4). However, in subgroup analyses, none of the evaluated guidance approach was singularly associated with reduced EBL (Fig. 4b).

Fig. 4
figure 4

Quantitative synthesis concerning estimated blood loss, a regardless of tracer type; b according to tracer type

In quantitative synthesis testing for number of nodes retrieved, a pooled meta-analysis on 2854 patients was conducted. Overall, patients treated with intraoperative guided-surgery had an estimated 44% increase in mean number of removed nodes at surgery compared to standard surgery (ROM 1.44, 95% CI 1.18; 1.77, p < 0.001) (Fig. 5a). Funnel plot and Eggers’ linear regression estimates (bias − 7.14, p = 0.03) demonstrated presence of potential publications bias (Supplementary Material Appendix 5). Subgroups analyses showed that most of the studies included used NIRF-ICG with only one study evaluating the role of hybrid nanocolloid (99mTc-ICG) (Fig. 5b).

Fig. 5
figure 5

Quantitative synthesis concerning the number of nodes retrieved, a regardless of tracer type; b according to tracer type

In the quantitative synthesis testing for the rate of metastatic nodal involvement, a pooled meta-analysis on 2920 patients was conducted. Overall, patients who underwent IGS had significantly higher rate of pN1 disease identified (OR 1.82, 95% CI 1.49; 2.21; p < 0.001 using a common effect model), as confirmed in subgroup analyses according to tracer type (Fig. 6a, b). Funnel plot and Eggers’ linear regression estimates (bias − 1.10, p = 0.9) demonstrated absence of potential publications bias (Supplementary Material Appendix 6).

Fig. 6
figure 6

Quantitative synthesis concerning the rate of metastatic nodal involvement, a regardless of tracer type; b according to tracer type

In the quantitative synthesis testing for the rate of positive surgical margins, a pooled meta-analysis on 1488 patients was conducted. Overall, patients who underwent IGS had significantly lower rate of surgical margins at final pathology compared to standard surgery (OR 0.62, 95% CI 0.46; 0.85; p < 0.001 using a common effect model) (Fig. 7a, b). Funnel plot and Eggers’ linear regression estimates (bias − 0.5, p = 0.1) demonstrated absence of potential publications bias (Supplementary Material Appendix 7).

Fig. 7
figure 7

Quantitative synthesis concerning positive surgical margins, a regardless of tracer type; b according to tracer type

In the quantitative synthesis testing for the rate of postoperative complications, a pooled meta-analysis on 4432 patients was conducted. Overall, no differences in rate of postoperative complications were recorded between IGS and standard surgery (OR 0.93, 95% CI 0.79; 1.10; p = 0.4 using a common effect model) (Fig. 8a). Funnel plot and Eggers’ linear regression estimates (bias − 0.25, p = 0.3) demonstrated absence of potential publications bias (Supplementary Material Appendix 8). The results were consistent even after stratification according to type of guidance used (Fig. 8b).

Fig. 8
figure 8

Quantitative synthesis concerning postoperative complications, a regardless of tracer type; b according to tracer type

In quantitative synthesis testing for ischemia time during nephron sparing surgery, a pooled meta-analysis on 745 patients was conducted (Fig. 9a), using random-effects models. Overall, intraoperative guidance did not impact on ischemia time (ROM 0.89, 95% CI 0.76; 1.04; p = 0.5). Funnel plot and Eggers’ linear regression estimates (bias 4.13, p = 0.4) both showed absence for potential publications bias (Supplementary Material Appendix 9). In subgroup analyses according to tracer used, intraoperative guidance was not associated with different ischemia time in the evaluated group including multiple studies (Fig. 9b).

Fig. 9
figure 9

Quantitative synthesis concerning ischemia time during nephron sparing surgery, a regardless of tracer type; b according to tracer type

In quantitative synthesis testing for eGFR reduction after nephron sparing surgery, a pooled meta-analysis on 479 patients was conducted (Fig. 10a), using random-effects models. Overall, intraoperative guidance was associated with lower magnitude of eGFR reduction (ROM 0.37, 95% CI 0.22; 0.62; p = 0.002). Funnel plot and Eggers’ linear regression estimates (bias − 3.92, p = 0.09) both showed absence for potential publications bias (Supplementary Material Appendix 10). Subgroup analyses confirmed the efficacy of IGS on eGFR reduction regardless the type of guidance used (Fig. 10b).

Fig. 10
figure 10

Quantitative synthesis concerning renal function reduction after nephron sparing surgery, a regardless of tracer type; b according to tracer type

Discussion

Imaging is increasingly employed as a tool to enhance the precision of surgical procedures. This comprehensive systematic review and meta-analysis synthesized published evidence on IGS across various surgical specialties and 11 distinct surgical procedures. Despite the inclusion of different IGS modalities, our findings indicate that the utilization of IGS led to significant improvements in several perioperative outcomes compared to standard surgery. Notably, these improvements were observed in LOS, EBL, positive surgical margins rate, number of nodes retrieved, rate of pN1, and renal function deterioration (in the case of partial nephrectomy). Conversely, no statistically significant difference was found for operative time and perioperative complications. Irrefutable reasons for such observations are currently not available. Nonetheless, from a surgical standpoint, it is reasonable to hypothesize that the usage of intraoperative guidance might increase the accuracy of a procedure to an extent where the more accurate visualization of the targets, the greater distinction of healthy and malignant tissues, and the more reliable identification of noble structures (e.g., blood vessels, nerves, organs) translate in better overall patient outcomes. Therefore, a clearer view of the surgical field may restrain the blood loss, which can foster the postoperative recovery and decrease the hospitalization time. Similarly, it may provide better oncologic and functional outcomes. For instance, IGS-based selective or super-selective arterial clamping during partial nephrectomy may favorably impact postoperative renal function preservation [18].

These results warrant careful consideration for several reasons. First, the widespread adoption of IGS in recent years underscores its clinical potential [55]. Ensuring the complete removal of tumors without positive margins and preserving healthy tissue is crucial for enhancing patient survival and improving functional outcomes. Therefore, incorporating intraoperative guidance may be essential for surgeons. However, it is undoubtable that the current level of evidence in favor of IGS is limited, and this work provides some proofs of that. It is of note that out of more than 13,000 studies, only three compared a specific IGS vs control, in a randomized controlled fashion [31, 44, 49]. Specifically, these studies only concerned the field of urology, with overall 304 patients being recruited, either submitted to partial nephrectomy for kidney cancer or prostatectomy for prostate cancer. Additionally, in all three cases, the outcomes of interest were potential surrogates of cancer-control (e.g., surgical margins, positive lymph nodes), with no mention to overall or cancer-specific survival [31, 44, 49]. Hence, these data are still immature, not easily generalizable, and invariably incapable of demonstrating any certain advantages of IGS implemented to robotic surgery. However, despite the aforementioned limitations, to the best of our knowledge we are the first to undertake the mission to provide the first contemporary systematic review that evaluate the quality of the available literature on IGS in robotic surgery and to assess the potential surgical benefit of the adoption of IGS during robotic procedures across different surgical specialties.

While intraoperative ultrasound is the most commonly used, other imaging techniques such as X-ray, CT, MRI, or nuclear imaging are primarily utilized for surgical planning but are not able to provide real-time intraoperative guidance [56]. Indeed, regarding nuclear imaging, novel drop-in gamma probes can intraoperatively assist the surgeon. However, these lack from image, which may be provided by additional intraoperative tools (e.g., fluorescence) [57, 58]. Therefore, emerging techniques like NIRF with ICG and 3D/VR guidance are gaining traction in various settings.

Fluorescence imaging relies on a camera capturing light emitted by a fluorescent dye visible when excited with an appropriate light source [56]. These imaging tools can be integrated into laparoscopic or robotic instruments. Various fluorescent contrast agents, typically emitting in the near-infrared region (between 650 and 900 nm), can penetrate several millimeters into tissue [59]. For instance, ICG and methylene blue are two FDA-approved near-infrared fluorophores widely used in numerous surgical applications [60]. In other settings, 5-aminolevulinic acid hexyl ester is employed for malignant gliomas, and non-fluorescent dyes like hexyl ester are widely used for bladder cancer visualization [61, 62]. Over recent years, more specific fluorescent agents have been increasingly used, in particular the use of multimodality imaging [63]. For instance, 99mTc-nanocolloid associated with ICG is a novel contrast agent implemented across various clinical fields [63, 64]. Another approach involves activatable fluorescent tracers that become visible upon enzymatic cleavage [56]. Other groups are currently evaluating antibodies or fragments labeled with NIRF tracers in preclinical settings, targeting alternative biomarkers for IGS applications [65,66,67,68,69]. Some of these novel molecular biomarkers have been tested in clinical setting into phase I or II studies [70,71,72,73,74,75]; on the other hand, RCTs have been planned to test the efficacy on oncological outcomes of integrating molecular targets with intraoperative fluorescence guidance [76]. Similar developments involve small molecules or peptides [60, 77].

Unfortunately, lack of proper structured pipeline for implementing these novel techniques is one the main limiting factor for bridging the gap between the pre-clinical discovery of a novel biomarker or imaging modality and their formal evaluation into clinical practice, thus preventing a timely application after the initial pre-clinical conception. Moreover, other key factors are the technical requirements for surgical manufacturers to adapt and integrate their systems with novel technologies. In this direction, there is an impelling need for a constructive dialogue between researchers and industries with the goal of defining common areas of research and optimizing resources allocation.

Additionally, despite these advancements, the issue of cost–benefit needs to be taken into consideration. The costs associated with novel imaging modalities can be substantial without clear clinical benefits [78]. Unfortunately, accurate discussions on the topic cannot be easily done, based on the paucity of formal cost-analyses [79] and since capital costs, including intraoperative guidance tools, vary widely according to institutional and amortization practices [80]. For instance, as previously reported, the cost of adding NIRF to a robotic system is approximatively $100,000 [81], with a per vial cost of ICG ranging $80–100 [80, 81]. Therefore, whether the added cost of such technologies is justified may depend on the institution, on the surgeon, and on the expected quality of outcomes without using IGS, as previous authors have already stated [81].

Beyond the challenge of creating new tracers, identifying new potential targets poses a significant hurdle. The ideal biomarker should exhibit high expression in tumors and lower expression in healthy tissue, preferably being extracellular for being targeted by non-penetrating molecules but able to be internalized for a lasting signal. The biomarker should also demonstrate specificity across different types and subtypes of cancers.

Unfortunately, the available data did not permit a specific analysis of individual operations or IGS modalities. The results are generalized to IGS as an innovative surgical approach, integrating state-of-the-art surgery with novel imaging modalities, rather than specifying which imaging modality improves perioperative outcomes in specific surgeries. Thus, each IGS should be evaluated independently before clinical implementation. Moreover, the vast majority of the included comparative studies relied on retrospective data. Therefore, the reported results must be interpreted within the boundaries of such limitation, since selection biases could be operational, and heterogeneity in key factors (e.g., inclusion criteria, template of lymph node dissection, surgical technique such as for prostatectomy, partial nephrectomy and pyeloplasty, surgical expertise) could undermine results interpretability. This being said, the risk of bias assessment revealed that the overall quality of evidence in this meta-analysis was moderate, as most studies demonstrated moderate or low risk of bias (Supplementary Fig. 1a-b).

In conclusion, IGS is still in its early stages. Large, multicenter, randomized controlled trials are imperative to determine the benefits of IGS for patients. However, the absence of a general consensus on standardized protocols for the clinical evaluation of new techniques complicates the conduct of multicenter trials and the comparison between clinical studies. Nevertheless, our meta-analysis has demonstrated that the implementation of IGS has the potential to enhance surgical outcomes across various specialties and operations.

Conclusions

The current study suggests that the performance of robot-assisted surgery might be consistently enhanced by intraoperative image guidance. This is especially true when pathologic outcomes are considered. The usage of IGS might increase the detection of metastatic lymph nodes, and simultaneously it can boost the precision of tumor resection, as testified by the significant reduction of positive surgical margins at final pathology. Therefore, IGS has the potential to impact patient prognosis, besides surgical conduct.