Introduction

Many factors contribute to successful repair of the rotator cuff. Anatomic factors are one of the key considerations, which include restoration of the footprint contact area [1], tendon and bone quality [2], appropriate compression of the tendon on the footprint to facilitate healing [3] and minimized motion at the bone–tendon interface [4]. Likewise, intrinsic factors such as decreased vascularity, hypoxia, atrophy, fibrocartilaginous changes and fatty infiltration of the torn rotator cuff muscles have also been associated with inferior outcomes [2].

Initial fixation plays an important role and may be critical in achieving successful healing [2, 5]. Numerous surgical techniques have been described for the repair of rotator cuff injuries, including fixation using single- or double-row suture anchor techniques, transosseous tunnels or newer configurations such as transosseous-equivalent (TOE) or knotless techniques [1]. Several biomechanical studies have demonstrated that double-row suture repair recreates the native footprint more closely resulting in significantly higher loads to failure and decreased gap formation under tensile loading [6,7,8]. In contrast, other basic science studies have not shown any significant differences between these two repair techniques [9,10,11].

Despite these results, debate continues as to whether double-row repair results in better clinical outcomes. A recent systematic review of overlapping meta-analyses suggested that double-row repair resulted in higher rates of structural healing when compared to single-row repairs [12]. Unfortunately, these putative advantages do not seem to correlate with clinical observations. For example, a prospective randomized level 1 study by Burks et al. [13] could not demonstrate any radiological or clinical differences between patients repaired with either single- or double-row techniques.

Given the lack of high-level evidence from both clinical and basic science studies, the purpose here was to perform a systematic review and meta-analysis comparing single- to double-row biomechanical studies with respect to load to failure, mode of failure and gap formation. We hypothesized that there would be no differences between these two techniques.

Methods

The research was conducted according to the methods described in the Cochrane Handbook [14]. The results are reported according to the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines statement [15].

Eligibility criteria

Ex vivo basic science and biomechanical published studies that compared single- to double-row suture anchor techniques (Figs. 1, 2) between January 1990 and November 2016 were identified and included if they fulfilled the following inclusion criteria: complete documentation of ultimate load to failure including standard deviation (SD) or range, description of failure mode for all specimens and documentation of elongation or gap formation including SD or range; complete description of the biomechanical testing set-up, surgical techniques and methodology in human cadaveric studies. Studies comparing the more contemporary suture bridge or transosseous-equivalent (TOE) rotator cuff repair to single-row repairs were also included. Park et al. [16, 17] demonstrated that both techniques were similar with the exception of ultimate load, which was higher in the TOE group.

Fig. 1
figure 1

Single-row supraspinatus repair with two anchors placed at the lateral edge of the footprint

Fig. 2
figure 2

Double-row supraspinatus repair with a medial row typically placed just lateral to the articular margin and the lateral row placed just medial to the drop of the greater tuberosity

Studies were excluded if the study protocol used ovine, bovine or other non-human specimens. Pietschmann et al. [18] demonstrated that ultimate load and primary stability of suture anchors under load depends mainly on cortical thickness and subcortical trabecular bone quality; hence, bovine and ovine humeri were deemed unsuitable for testing. Furthermore, clinical outcome studies, research theses, conference proceedings or abstracts, and articles comparing other surgical techniques such as the transosseous Millett technique, or studies that have tested a combination of either one medial and two lateral, or two medial and one lateral anchor, were also excluded.

Literature research

A systematic review of the literature was performed to identify all publications in the English and German literature dealing with biomechanical evaluation of rotator cuff repair. The databases MEDLINE, Embase, Scopus and Google Scholar were systemically searched using the terms and Boolean operators: ‘rotator cuff’ AND ‘biomechanical’ AND/OR ‘cadaveric/cadaver’; ‘suture anchor’ AND ‘arthroscopic’ AND/OR ‘open’. Three reviewers conducted independent title and abstract screening. All eligible articles were manually cross-referenced to ensure that other potential studies were included. Disagreements were resolved by consensus; if no consensus was achieved, they were carried forward to the full-text review and then resolved by consensus. The search was performed on 5 November 2016.

Data extraction and quality assessment

For studies that met the inclusion criteria, an electronic data extraction form was used to obtain the following information from each article: author, journal and year of publication, any conflicts of interest, surgical technique, sample size, type of suture anchor and suture material, biomechanical testing details such as cyclic loading conditions, load to failure testing, failure modes, ultimate load to failure and SD, stiffness and SD, elongation and SD. Two authors independently completed data extraction, and the third reviewer and senior author verified the data.

Risk of bias was assessed adapting the Cochrane Collaboration’s risk of bias tool [14]. Cochrane’s Handbook does not describe specifically the assessment of risk of bias for basic science and biomechanical studies, and the assessment tool described in chapter 8 addresses clinical studies only. The first four assessment items (random sequence generation, allocation concealment, blinding of participants and personnel and blinding of outcome assessment) specifically relate to clinical studies only and were not utilized in the analysis. Therefore, risk of bias assessment for this meta-analysis consisted only of the following three items: incomplete outcome data, selective reporting and other bias. Assessment of these three items was based on the recommendations by Hartling et al. [19], as described in appendix B of their publication. According to Hartling et al. and the recommendations in the Cochrane Handbook, the risk of bias was categorized as ‘low risk’, ‘unclear risk’ and ‘high risk’ [14, 19].

The GRADE system was used by the senior author to assess the quality of the evidence for each outcome measure; the second reviewer verified the assessments. The recommendations from the Cochrane Handbook were followed, and studies were downgraded if there were limitations in the design, indirectness of evidence, unexplained heterogeneity, imprecision of results and high probability of publication bias. All institutional and author information was concealed to the second reviewer to reduce reviewer bias. Any disagreement between reviewers was resolved by a consensus and/or by arbitration between the two senior authors.

Statistical analysis

Interobserver differences for study eligibility and risk of bias were measured using Cohen’s kappa coefficient. Heterogeneity of the data was assessed using χ 2 and I 2 statistics. Outcomes were pooled using a fixed effect model if the I 2 statistic was < 25%; however, if it was > 25%, then a random effect model was utilized. If standard deviations were not reported, the standard deviation was calculated using the following formula: SD = max–min/4. Hozo et al. [20] showed that this formula provides a good estimation for standard deviation. The results of the categorical outcome recurrence were pooled and analysed using a contingency table and a Chi-square test, or Fisher’s exact test with Yates corrections if the sample size in any category was < 5. All tests of significance were two-tailed, and an α of less than 0.05 was considered significant. Publication bias was assessed using funnel plots. If publication bias was present, Egger’s test of intercept was performed to test for asymmetry. Funnel and forest plots, as well as all statistical analyses, were performed using Stata SE (version 12.0; StataCorp, College Station, Texas, USA) for Windows and the comprehensive meta-analysis software package (CMA), version 3 (Biostat Inc, Englewood, NJ, USA).

Results

Study selection and characteristics

The literature search identified 128 studies for consideration; however, 78 studies were excluded following abstract screening. For the remaining studies, examination of the full-text manuscripts was conducted, and only eight of the 50 articles met all eligibility criteria and were included in the analysis (Fig. 3) [7, 8, 10, 11, 21,22,23,24]. Overall, agreement between the two reviewers regarding final eligibility was excellent (kappa value 0.87, 95% CI 0.82–0.90). All eight studies were published in English between 2006 and 2015, with a cumulative total of 106 specimens tested. The study characteristics and results are summarized in Tables 1 and 2.

Fig. 3
figure 3

PRISMA flow diagram. From the initial 360 records, 8 studies were included

Table 1 Characteristics of the included studies
Table 2 Characteristics of the included studies including the test results

Risk of bias

The findings of the risk assessment bias are summarized in Table 3. Of the eight studies, only one study, by Lorbach et al. [24], was found to have a high risk of bias. The authors failed to report details for load to failure testing and humerus position during testing. Failure modes were also not included in their publication. Additionally, an unclear risk of bias was assessed in the study by Barber et al. [21] where the preload was mentioned, but not its magnitude. While the preload could be assumed to be 10 N based on the reported cyclic loads, it was not clear whether the initial cyclic loading force was equivalent to the preload (Table 1). Visual inspection of the funnel plot was clearly asymmetric, suggesting publication bias (Fig. 4). Moreover, Egger’s intercept value was significant (p = 0.04 two-tailed) and was calculated to be 7.5 (95% CI: 0.42–14.48, t = 2.59), confirming publication bias was present.

Table 3 Risk of bias assessment
Fig. 4
figure 4

The asymmetrical funnel plot suggests publication bias confirmed by a significant Egger’s intercept value of p = 0.04

Using the GRADE quality assessment, the quality of evidence was double downgraded for the following two factors: imprecision of results with wide confidence intervals and publication bias. This resulted in a low-quality rating according to the GRADE criteria [14].

Load to failure

Load to failure was reported in all studies. The pooled estimate for all eight studies utilizing a random effect model demonstrated significant differences in the load to failure (p = 0.006) between single- and double-row repair techniques (SMD 1.228, 95% CI: 0.055–5.226, p = 0.0001, I 2 = 60.47%) (Fig. 5).

Fig. 5
figure 5

The pooled estimated for all studies suggested higher load to failure loads for double-row repairs

Failure mode

Lorbach et al. [24] was the only study to not report the failure mode (Table 2). The failure modes were divided into two groups. In group one, the suture was either failing at the anchor interface or the anchor was pulling out of bone. In group two, the suture failed at the tendon–suture interface by cutting through the tendon or by suture breakage. Statistical analysis using a 2 × 2 contingency table revealed that there was no significant difference between single- and double-row repairs in either of the two groups, respectively (χ = 0.01, p = 0.92, χ = 0.02, p = 0.65).

Elongation/gap formation

Elongation or gap formation was also reported in all studies. The pooled estimate for all eight studies utilizing a random effect model demonstrated significant differences in gap size (p = 0.006) between single- and double-row repair failures (SMD 0.783, 95% CI: 0.169–1.398, p = 0.012, I 2 = 58.8%) (Fig. 6).

Fig. 6
figure 6

The pooled estimated for all studies from demonstrated smaller gap sizes for double-row repairs

Discussion

The results of this meta-analysis of basic science studies comparing the biomechanical properties of single- versus double-row rotator cuff repair demonstrated that double-row repair resulted in significantly higher ultimate load to failure values and decreased elongation/gap formation. Failure modes between these two groups were not significantly different. To our knowledge, this is the first systematic review with meta-analysis comparing single- and double-row repair techniques with respect to the resulting biomechanical properties, such as load to failure, gap formation and failure mode.

Studies have shown that stable fixation at the bone–tendon interface is one of the key factors in the early stages of healing [5]. Hughes and An [25] have calculated forces for the supraspinatus tendon during abduction, adduction, internal rotation and external rotation. They found that the calculated forces for maximal abduction exertion were 117 N, for adduction 59 N and for maximal internal and external rotation, 115 and 175 N, respectively. Interestingly, the maximal load to failure in all eight included studies was above these calculated values for both single- and double-row repairs. These findings might explain the lack of any significant differences in clinical outcome scores between the two repair techniques [26]. In contrast, for larger tears above 3-cm double-row repair demonstrated superiority compared to the single-row technique [27]. This difference has been attributed to greater compression of the repair and a larger footprint area, which has been shown to increase the healing potential [1, 24]. However, it is also possible that the absolute number of sutures, rather than the repair technique, is the common denominator for tendon healing. For instance, Jost et al. [28] in an ovine rotator cuff model demonstrated that a higher number of sutures decreased cyclic gap formation and increased load to failure. However, single- and double-row repairs were biomechanically equivalent when the number of sutures was kept constant [28]. While the findings of this meta-analysis clearly favour double-row repair with regard to failure load, the testing methods of the included studies were heterogeneous and should not be generalized. In fact, the various methods utilized introduced bias, and these results should be interpreted with great caution.

Gap formation can potentially compromise tendon–bone healing and was found to be one of the predictors for poor healing outcomes [1, 29]. However, there is no clear consensus as to what is the exact value for gap formation or elongation to define a failure [29]. Several authors have suggested a gap of 5 mm as failure [30, 31], whereas others have used values ranging anywhere from 3 to 10 mm [29]. Five of the eight studies that used single-row repair, and three of the eight studies using double-row repair technique [7, 11, 22,23,24], demonstrated values above the suggested threshold of 5 mm. Interestingly, when using meta-analysis tools, double-row repair was superior to single-row repair with significantly lower gap formation. Similar phenomenon was also observed in load to failure; however, the testing protocols used were heterogeneous making comparisons between these studies difficult and may have potentially introduced bias. Despite these significant findings, the results of these studies should be interpreted with caution. In addition, several authors have shown that the numbers of sutures passing through the tendon were the most important determinant of gap formation regardless of whether a single- or double-row repair was performed [1, 24, 28]. This suggests that surgical technique was only one of the factors determining a successful outcome following surgery. Age, tissue quality, the size of the tear, retraction and muscle atrophy and other patient-related factors such as smoking, bone quality, chronic vitamin D deficiency and diabetes were also shown to influence the healing outcome [5, 32].

Failure modes were not significantly different between the two groups and were equally distributed. For this measure, fifteen compared to thirteen specimens failed at the anchor–suture interface or via anchor pull-out, and 33 (69% for single row) compared to 36 (73% for double row) specimens failed within the suture-through-tendon mechanism. The majority of failures occurred at the suture–tendon interface, which has previously been reported as the weakest link of the reconstruction [23, 24]. Increasing the number of sutures and the suture configuration, rather than the number of suture anchors, may strengthen rotator cuff repairs and obviate the need for double-row repairs. However, if the number of sutures rather the number of anchors was the more important factor to reduce suture–tendon failure, the failure mode distribution should have been different between these two techniques. Unfortunately, the sample size of this study was insufficient to come to a valid conclusion (using G*Power 3.1.9.2, Universität Düsseldorf, Germany), and the calculated post hoc power was only 0.65 for failure mode.

The clinical relevance of this meta-analysis is unclear. In a recent meta-analysis investigating the clinical and radiological outcomes after arthroscopic single-row versus double-row repair, Perser et al. [33] demonstrated that double-row repair does not result in a significant clinical improvement in both clinical and radiographic outcomes. These findings were confirmed by Sheibani-Rad et al. [34] and DeHaan et al. [35] demonstrating similar functional and clinical outcomes for both single-row and double-row repairs. Millet et al. [26] could not demonstrate any detectable differences between single-row and double-row repairs, but showed a significantly higher retear rate of single-row (26%) versus double-row (14%) techniques. However, patients who underwent double-row repair experienced significantly more pain with failure of repair [36].

Meta-analysis uses a quantitative approach to systematically assess the results of previous research as well as the results in a more precise estimate of the treatment effect [37]. The inclusion of non-randomized observational studies may be subject to sources of bias and in fact may show larger treatment effects [38]. Pooling of these studies may also not lead to more definitive outcomes [37]. These are potential limitations with all non-randomized studies, and this may also apply to our meta-analysis. Moreover, when using the GRADE quality assessment, the quality was double downgraded for wide confidence intervals and publication bias. In fact, there was a huge discrepancy for testing methods applied between the included publications. Cyclic loading protocols ranged from 50 to 3000 cycles, different cyclic loads and applied rates were used, and cadavers were tested at different humerus abduction angles. One study applied different loading protocols for single- and double-row testing, which the authors justified by using the tension differential between the normal anatomic footprint and medialized repairs, an argument that seems flawed [22]. Given these substantial differences, gap formation and elongation may be a direct result of rate of load rather than construct stability, and these values must be viewed with caution as outlined earlier.

For load to failure, the rate of displacement ranged from 1 to 33 mm/s. Parimi et al. [39] showed that load to failure was rate dependent, with higher strain rates resulting in higher breaking strength. However, a major limitation of Parimi et al. was the large variance and confidence intervals with comparatively large loading rates. Loading rates below 33 mm/s are all considered slow, and one could assume that the results of load to failure testing would not be related to loading rate and should therefore be a good estimate of failure with minimal bias [39].

Despite these inherent limitations, the analysis has confirmed superiority of the double-row repair with regard to load to failure. The lack of a uniform testing protocol is a major concern, and it is critical that the orthopaedic and biomechanical community develops and adheres to a single standard testing protocol to reduce confounding factors, which in turn would allow for more reliable comparisons between studies. For example, Burkhart et al. [30] suggested cyclic loading from 10 to 180 N at a rate of 33 mm/s based on the previous work of Ikai and Fukunaga [40], where the load of 180 N represented approximately two-thirds of the maximal contraction force that could be delivered [30, 40]. Mazzocca et al. [10] suggested a load of 100 N, which represents approximately 25% of the maximum rotator cuff contraction. However, Mazzocca et al. [10] based these figures on loads that would occur during early rehabilitation with passive motion, whereas Burkart’s parameters appear to simulate genuine clinical conditions more accurately [30]. There is currently no published evidence to best define the preferred loading conditions.

Limitations

As outlined earlier, the limitations of this meta-analysis are directly related to the limitations of the included studies. Double downgrading of study quality was necessary because of the inherent biases and study heterogeneity, substantially decreasing the external validity of both the included studies and this meta-analysis. Therefore, it is important to again emphasize these results should be interpreted with caution. The estimation of standard deviation as outlined by Hozo et al. [20] could be seen as a limitation. However, they demonstrated the estimation error is within 4% of the sample mean using this approach, suggesting its influence here would be insignificant [20]. Research theses and conference proceedings were excluded for this study, and the omission of these ‘grey data’ could have resulted in or contributed to publication bias. Publication bias was assessed with meta-analysis tools and was found to be present. However, the presence of publication bias is mainly caused by small sample size studies, the tendency of journals to reject negative studies, researchers deciding not to submit results or sponsorships which prevent publication of non-favourable results [41].

Conclusion

The results of this systematic review and meta-analysis suggest that double-row repair is able to tolerate a significantly greater load to failure. Gap formation was also significantly lower in the double-row repair group.