Introduction

Pedicle screws have revolutionized the spinal surgery since their introduction to clinical application [1]. The success of robust fixation relies on the perfect insertion of the screws inside the pedicle. However, the procedure for accurate pedicle screw placement is challenging because of the complex anatomical structures surrounding the spine and widely morphologic variations of individual pedicles [2,3,4,5]. By means of conventional techniques, pedicle screws were implanted according to anatomical landmarks and intraoperative fluoroscopy. But the screw misplacement cannot be neglected, a literature had reported the percentage of misplaced screws ranging from 8.3 to 50.3% with conventional fluoroscopy [6]. Even though severe complications including vascular, neurological or visceral injuries just occur to a small number of patients after pedicle screws’ malposition, the complications can be potentially life and limb threatening.

To address the issue of pedicle screws’ malposition, image guidance system was explored and applied to spine surgery [7, 8]. Theoretically, the new technique can prevent complications by providing more precise anatomic guidance, but Verma et al.’s meta-analysis based on former studies did not favor computer-assisted navigation over conventional techniques [9]. Besides, with the more widespread application of image-guided techniques, issues such as reduction of radiation exposure, the operating time and cost-effectiveness need to be addressed as well [8]. The unsatisfactory issues in spine surgeries inspired the emergence of robot-guided pedicle screw insertion. Recently, a miniature robot has been devised for the purpose of improving the accuracy of pedicle screw placement, decreasing potential complications and reducing intraoperative radiation exposure [10,11,12]. Since 2006, a number of studies have reported the results of robot-assisted and conventional freehand pedicle screw placement techniques. However, the results remain inconsistent.

By far, two meta-analyses [13, 14] comparing robot-assisted and conventional freehand pedicle screw implantation have been published, but neither was based on RCTs, which will lead to less powerful results compared to study purely based on RCTs. Furthermore, in these two previously published studies, only accuracy of pedicle screws was combined using meta-analysis, other outcomes such as intraoperative radiation exposure, surgical duration and proximal facet joint violation were not combined. Thus, it is increasingly necessary to perform an updated meta-analysis based on RCTs. Our systematic review and meta-analysis were performed with the aim of investigating the difference of pedicle screw positioning accuracy, radiation exposure, surgical time and proximal facet joint violation between robot-assisted and conventional freehand technique based on RCTs.

Methods

This systematic review and meta-analysis were conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses statement [15].

Literature search strategy

Six electronic databases including PubMed, EMBASE, CENTRAL, ISI Web of Science, CNKI and WanFang were systematically searched to identify potentially eligible articles. All the above databases were searched from inception to the latest issue (May 2017), without language restriction. Medical Subject Headings (MeSH) and free text words were combined to retrieve all the potential studies. MeSH were modified based on the specifications of each database. The following search strategy was used for the literature search in PubMed, CENTRAL and ISI Web of Science: (“Robotics”[Mesh] or robot or robotics or robotic) and (“Pedicle Screws”[Mesh] or “Bone Screws”[Mesh] or “Spine”[Mesh] or pedicle or screw or spine or bone screws) and (randomized controlled trial or randomized controlled trial or RCT or RCTs). For CNKI and WanFang, search terms were “ji qi ren” and “zhui gong gen”. In addition, the bibliographies of relevant systematic reviews were manually searched.

Inclusion criteria

We established the inclusion and exclusion criteria based on the PICOS principle. (1) Types of participants: patients presenting with lumbar spinal stenosis, degenerative lumbar disorders or other lumbar diseases that scheduled to undergo pedicle screw placement were included in our study, no specific indications were imposed. (2) Interventions: patients in experimental groups received robot-assisted pedicle screw implantation surgery. (3) Control: patients in control groups received pedicle screw surgery with the conventional freehand approach. (4) Outcome measurements: the primary outcome of our systematic review was the accuracy of pedicle screw replacement, assessed by Gertzbin–Robbins classification [16], Rampersaud scale [17] along with other well-established criteria. Secondary outcomes included radiation exposure, surgical time and proximal facet joint violation between robot-assisted and conventional freehand technique. (5) Types of publication: only randomized controlled trials (RCTs) could be included in our systematic review. Cohort studies, reviews articles, case reports or expert experience reports were excluded.

Data extraction

Two investigators (Z. Lv and S. Gao) screened each article independently and were blinded to the findings of the other reviewer. According to the predetermined inclusion criteria, two reviewers performed a strict screening to identify qualified articles independently, and they extracted data from these eligible articles using a standardized data collection form, which included first author, year of the publication, number of patients and number of screws implanted in different groups, robot type, assessment of screw accuracy, clinical outcome assessments. Any disagreement between the two reviewers was resolved through discussion until a consensus was reached. The third review author (H. Fang) was consulted if a consensus could not be reached.

Quality assessment

The Cochrane Collaboration’s tool was utilized to assess the risk of bias in the selected RCTs, which was based on seven items: random sequence generation, allocation concealment, blinding of participants and personnel, blinding of outcome assessment, incomplete outcome data, selective reporting, and other sources of bias. Two reviewers assessed the risk of bias among studies independently, the results were compared afterwards. Disagreements regarding the risk of bias assessment were settled by discussion and consensus between reviewers.

Data synthesis and analysis

Risk ratio (RR) and the associated 95% confidence intervals (CIs) were calculated for accuracy of pedicle screw implantation and proximal facet joint violation. The mean difference (MD) or standard mean difference (SMD) in the continuous variables was calculated using the same methodology. The Chi-squared test and the Higgins I 2 test were used to assess the heterogeneity among studies (P > 0.1 and I 2 indicate acceptable heterogeneity). We used fixed-effect model if no obvious heterogeneity existed. Otherwise, a random-effect model was utilized. Forest plot and funnel plot were generated via RevMan 5.3 (Copenhagen: The Nordic Cochrane Centre, The Cochrane Collaboration, 2014).

In case that significant heterogeneity existed, subgroup-analysis by risk of bias of included studies was conducted. Begg’s rank correlation test and Egger’s linear regression test were used to evaluate the publication bias (StataCorp LP, USA).

Results

Literature search

The literature selection process is presented in Fig. 1. The initial literature search yielded 155 potentially eligible records including 17 from PubMed, 21 from EMBASE, 10 from ISI Web of Science, 9 from CENTRAL, 44 from Wanfang and 54 from CNKI. 54 records were removed because they were duplicated for retrieval, the remaining 101 studies were screened with titles and abstracts. After the stage of title and abstract screening, eight articles were downloaded for full-text screening. One study [18] was removed because it was an editorial, one study [19] was deleted because it was duplicated. Finally, 6 RCTs [20,21,22,23,24,25] were deemed eligible and included in the meta-analysis.

Fig. 1
figure 1

Flowchart of literature selection

Main characteristics of included studies

Six studies involving 158 patients (688 pedicle screws) in robot-assisted groups and 148 patients (672 pedicle screws) in conventional freehand groups were included in our study. Each study was conducted in a single center, three studies [20,21,22] were performed in South Korea, two [23, 24] were conducted in Germany, the remaining one study [25] was performed in China. All the RCTs used post-operative thin-cut CT to evaluate the accuracy of pedicle screw implantation, and the scale used to assess the accuracy of pedicle screw was consistent across studies. The main characteristics of included RCTs are summarized in Table 1.

Table 1 Main characteristics of included studies

Quality assessment

The Cochrane’s tool was employed for appraisal of risk of bias for included RCTs. All the studies claimed randomization, but two studies [23, 24] failed to report the method of random sequence generation. Allocation concealment was well conducted in four studies [20,21,22, 25]. Regarding the blinding of personnel and participants, it was unfeasible to blind the spinal surgeons in our included studies so it was judged to high risk of bias in all selected RCTs. Two articles [20, 24] failed to report the blinding of outcome assessment, and they were judged to unclear risk of bias. When it comes to the incomplete outcome data and selective reporting, the Roser et al. study [24] was judged to high risk of bias because Roser et al. reported their preliminary results of 37 patients, with fewer patients undergoing fluoroscopy-guided (n = 10) than robot-assisted techniques (n = 18), and Roser et al. did not perform statistical analysis due to limited statistical power. The reviewers’ judgments about each risk of bias item for each included RCT is presented in Fig. 2.

Fig. 2
figure 2

Risk of bias summary: review authors’ judgements about each risk of bias item for each included study

Meta-analysis results

Pedicle screw placement accuracy

The comparison of accuracy of pedicle screw implantation between robot-assisted and conventional freehand according to Gerztbein–Robbins Classification (Grade A, 0 mm) is shown in Fig. 3. The heterogeneity between studies were significant (P = 0.07, I 2 = 51%), thus the random-effect model was used for statistical analysis. The combined results showed that robot-assisted pedicle screw placement was not significantly different from conventional freehand technique (τ 2 = 0.00, χ 2 = 10.28, df = 5; RR 1.02, 95% CI 0.98, 1.06). Fig. 4 presents the comparison of accuracy according to the Gertzbein–Robbins Grade A + B criteria. Random-effect model was employed due to obvious between-study heterogeneity (P = 0.002, I 2 = 73%), pooled results indicated that robot-assisted pedicle screw placement was not prior to freehand approach (τ 2 = 0.00, χ 2 = 18.69, df = 5; RR 1.00, 95% CI 0.97, 1.02).

Fig. 3
figure 3

Forest plot of robot-assisted versus conventional freehand technique: accuracy of screw according to Grade A criteria

Fig. 4
figure 4

Forest plot of robot-assisted versus conventional freehand technique: accuracy of screw according to Grade A + B criteria

Proximal facet joint violation

Proximal facet joint violation was reported by two RCTs [20, 22], and the facet joint violation was evaluated using post-operative fine-cut CT. The combined results suggested that robot-assisted pedicle screw placement was associated with significantly fewer proximal facet joint violation than conventional freehand technique (χ 2 = 1.12, I 2 = 0%; RR 0.07, 95% CI 0.01, 0.55) (Fig. 5).

Fig. 5
figure 5

Forest plot of robot-assisted versus conventional freehand technique: proximal facet joint violation

Radiation exposure

Three studies reported intraoperative radiation exposure as secondary endpoints, but only two studies [20, 24] provided detailed data on radiation exposure time and radiation dosage. The combined intraoperative radiation time showed that robot-assisted pedicle screw placement could significantly reduce the radiation time than conventional freehand pedicle screw implantation (MD − 12.38, 95% CI − 17.95, − 6.80; I 2 = 84%) (Fig. 6). Furthermore, robot-assisted pedicle screw placement was associated with significantly fewer intraoperative radiation dosage (SMD − 0.64, 95% CI − 0.85, − 0.43; I 2 = 10%) (Fig. 7).

Fig. 6
figure 6

Forest plot of robot-assisted versus conventional freehand technique: radiation time

Fig. 7
figure 7

Forest plot of robot-assisted versus conventional freehand technique: radiation dosage

Surgical duration

Three RCTs [20, 22, 25] mentioned the overall surgical time from skin to skin, the findings of these studies were pooled using meta-analysis, with only one study favoring robot-assisted procedure. The combination of surgical time suggested that the overall surgical duration from skin to skin differed significantly between robot-assisted and conventional freehand pedicle screw implantation (MD 20.53, 95% CI 5.17, 35.90; I 2 = 10%) (Fig. 8).

Fig. 8
figure 8

Forest plot of robot-assisted versus conventional freehand technique: surgical time

Subgroup-analysis and publication bias

In the comparison of accuracy of pedicle screw implantation, obvious heterogeneity was detected regardless of using Gertzbein–Robbins Grade A criteria or Gertzbein–Robbins Grade A + B criteria. We performed subgroup analysis according to the methodological quality of included studies. Ringel et al. and Roser et al. study failed to report the random sequence generation and allocation concealment, so the aforementioned studies were assigned into the “without random sequence generation and allocation concealment” subgroup. According to the Gertzbein–Robbins Grade A (0 mm) criteria, there was no obvious heterogeneity between studies that reported random sequence generation and allocation concealment (P = 0.90, I 2 = 0%), and the combined results favored robot-assisted pedicle screw placement (RR 1.03, 95% CI 1.00, 1.06; P = 0.04). Similarly, according to the Gertzbein–Robbins Grade A + B criteria (2 mm), no obvious between-study heterogeneity existed in RCTs that reported random sequence generation and allocation concealment (P = 0.71, I 2 = 0%). The meta-analysis results showed that there was no significant difference between robot-assisted and conventional freehand pedicle screw placement (RR 1.01, 95% CI 0.99, 1.02; P = 0.29).When removing the studies without random sequence generation and allocation concealment, the heterogeneity in comparison of accuracy of pedicle screw placement was not significant, thus we removed Roser et al. study and Ringel et al. study from our meta-analyses.

Begg’s test (Grade A: z = 0.34, P = 0.73; Grade A + B: z = − 0.34, P = 1.00) and Egger’s test (Grade A: t = 0.99, P = 0.43; Grade A + B: t = 2.36, P = 0.14) revealed no significant publication bias in the accuracy of pedicle screw placement.

Discussion

To the best of our knowledge, this is the first meta-analysis and systematic review to compare robot-assisted and freehand pedicle screw implantation purely based on RCTs with the consideration of heterogeneity across included studies. In two previously published meta-analyses [13, 14], no definite conclusions could be drawn to unequivocally recommend one surgical technique over the other. The major methodological concern in above studies was the inclusion of both RCTs and cohort studies for meta-analysis, which could lead to less powerful results. The way screw accuracy was defined and measured was also noted to vary among their included studies. Another methodological flaw to be considered was the absence of an investigation of publication bias, publication bias might arise because only articles published in English were included by aforementioned meta-analyses.

In our current study, only RCTs that featured to compare robot-assisted and conventional freehand pedicle screw placement were included for meta-analysis. The definition of accuracy of pedicle screw was consistent across all the selected studies, namely the Gerztbein–Robbins Classification [16]. Roser et al. [24] did not perform statistical analysis on their preliminary results due to limited statistical power, although a trend towards increased pedicle screw accuracy in robot-assisted group was observed. Ringel et al. [23] for the first time compared the screw position accuracy between robot-assisted and conventional freehand technique using a randomized controlled design; however, they failed to report the method of random sequence generation and details about allocation concealment. Regardless of using Grade A or Grade A + B, the between-study heterogeneity was statistically significant, thus subgroup-analysis according to methodological quality was conducted. The combination of data from studies that reported adequate randomization and allocation concealment suggested that robot-assisted pedicle screw placement was superior to freehand technique when using Grade A (test for heterogeneity: P = 0.90, I 2 = 0%), but was similar to conventional freehand approach using Grade A + B (test for heterogeneity: P = 0.71, I 2 = 0%).

Pedicle screw position deviations up to 2 mm can be accepted by clinicians since minor deviations rarely injury vital vessels and nerves [26]. The included studies, with the exception of Ringel et al’s [23], had a higher accuracy of screw insertion in the robot-assisted group. The poorly positioned implants were partially attributed to the bed mount by Ringel et al. [23], but Roser et al. [24] did not encounter a similar inaccuracy with the same bed mount. When using grading A as the standard of accuracy, robot-assisted technique tends to overmatch freehand technique (RR 1.03, 95% CI 1.00, 1.06; P = 0.04). While using Grade A + B, robot-assisted technique is not superior to freehand technique (RR 1.01, 95% CI 0.99, 1.02; P = 0.29). The explanations could be summarized as below: first, spine surgeon relies upon a steady hand and meticulous skills during surgery. The surgical procedures are tedious and laborious, predisposing the surgeon to both physical and mental exhaustion, which may decrease the accuracy [27]. But robots are capable of performing the repetitive tasks and holding tools for long periods with accuracy and precision devoid of the limitations of human manual errors [28]. Besides, a preoperative CT scan of the surgical region is beneficial to select optimal screw size and plan an ideal entry position and trajectory. The robot-assisted system guides the surgeon to the desired position but leaves the actual execution of the surgical procedure in the surgeon’s hands, which also help the surgeon improve accuracy. What is more, the robot-assisted setting fixed firmly to the spine, monitoring and coordinating with the patient’s movements, reinforces its reliability. Last but not the least, the solid floor-flexible base and firm robotized arm of the robot device could help reduce the risk of secondary motion, thus guaranteeing the dependability of the entry point and trajectory [29]. Less than 2 mm deviation can be achieved by well-trained and experienced surgeons, but the 0 mm deviation which needs more accurate and precise movements enable the robot-assisted system to perform better than their human counterparts.

Reducing radiation exposure is especially important because large doses of radiation can contribute to an increasing risk of delayed effects such as malignancy [30, 31]. Ringel et al. [23] reported that total intraoperative radiation in freehand group was similar to robot-assisted group but freehand group required an additional radiation exposure for planning. Roser et al. [24] found the radiation in freehand group is nearly two times than robot-assisted group. Hyun et al.’s study [20] showed fluoroscopy time for per screw in freehand group was almost four times than the robot-assisted group. A latest study led by Alaid et al. also indicated a significant reduction of radiation exposure using robotic guidance for the treatment of pyogenic spondylodiscitis with pedicle screw fixation [32]. One valuable advantage of robot-assisted system is to lessen reliance on intraoperative fluoroscopy. With robot-assisted technique, simply anterior–posterior and lateral fluoroscopy images are necessary for image matching between patient and robot. C-arm can be taken away after registration, surgeons insert the screws along previously planned trajectories [33], thus limiting additional radiation exposure. Besides, robot-guided procedures require significantly shorter imaging sessions, exposing the patients and staffs to less radiation. Keric et al. reported that surgeons with more experience in conventional spinal instrumentation procedures might suffer less radiation exposure using the same robot-assisted technique [34].

Shah et al. initially reported the incidence of proximal facet joint violation, which might accelerate the occurrence of adjacent segment degeneration [35], was more than 30% of involved patients [36]. Factors including body mass index (BMI), age, type of pedicle screw and fusion procedure, number of fused segments may contribute to proximal facet joint violation [37]. A latest study led by Levin et al. reported that through no obvious impact on reoperation rate or post-operative quality of life at 1-year follow-up, the reoperation rate was considerably higher in the proximal facet joint violation patients compared with non-PFJV subjects at 2- and 3-year follow-up [38]. Among the included studies, Hyun et al. and Kim et al. mentioned proximal facet joint violation, the pooled data indicated that robot-assisted technique allowed surgeons to reduce proximal facet joint violation. Freehand technique largely relies on intraoperative C-arm fluoroscopy images for screw insertion, requiring the surgeon’s own judgment to harmonize the trajectory in three dimensions, while the robot-assisted system guides the surgeon to the preoperatively planned trajectory directly. By selecting optimal screw trajectory on the planning scheme, it was possible to keep screws away from the facets. The robot-assisted system implements the plan precisely during the operation, regardless of the surgeon’s proficiency [22].

Hyun et al. [20] reported the overall surgical time had no difference in robot-assisted and freehand group. But Ringel et al. [23], Tian et al. [25] and Kim et al. [22] reported a longer surgical time in robot-assisted group. The robot-assisted technique is in its early stage of clinical application. And a new technique always involves a learning curve before reaching to a proficient and steady state. The increase in surgical time can be partly attributed to the learning curve. Besides, the intraoperative preparation phase may also contribute to the longer surgical time with robot-assisted technique.

Several limitations to our study should be taken into account. First, although Begg’s test and Egger’s test indicated no significant publication bias in the accuracy of pedicle screw placement, the number of included studies was still limited, additional RCTs with larger sample size within different cultural contexts are needed. Second, as most studies used the Mazor robot, the findings of our present study may not represent the surgical robots as a whole. Additionally, all of our included studies restricted their participants to patients undergoing lumbar pedicle screw placement, considering the anatomical differences between the lumbar and thoracic vertebrae, whether or not robot-assisted technique is superior to conventional freehand remains to be addressed by future studies. Despite these aforementioned limitations, our present meta-analysis represents the highest level of evidence.

Conclusion

Taken together, the findings of the present meta-analysis suggested that robot-assisted pedicle screw implantation was as accurate as conventional freehand techniques, robot-assisted approach was associated with fewer proximal facet joint violation, less intraoperative radiation exposure but longer surgical duration than freehand technique. Although robotics in spinal surgeries holds promise for future spinal instrumentation, the effectiveness of robotics has been researched less than might be anticipated. To validate the beneficial role of robotics in spinal surgery, more RCTs with rigorous design and larger sample size are encouraged.