Introduction

Articular cartilage lesions are a common pathology of the knee joint. In a retrospective analysis of 25,124 knee arthroscopy patients chondral lesions were found in 60% of the patients [1]. A total of 7% of all patients analysed under the age of 40 and 9% under the age of 50 years showed one to three localized grade III or IV cartilage defects according to Outerbridge et al. [2]. Full-thickness articular cartilage defects only have limited regenerative potential. Thus, spontaneous healing is unlikely [35]. Untreated full-thickness cartilage lesions are usually associated with significant pain and swelling. Moreover, patients have an increased risk of subsequent osteoarthritis [610] which is a major cause of disability and represents a significant socioeconomic burden [9, 11, 12]. Conservative therapy includes physical measures (physiotherapy, weight loss, and bracing) as well as medical treatment of cartilage lesions, e.g. oral administration or intra-articular injection of hyaluronic acid or chondroprotective agents (such as D-glucosamine sulphate, chondroitin sulphate, diacerein) [1316]. Unfortunately, none of these therapies can heal cartilage defects, but only relieve symptoms and improve knee function [14, 17, 18]. Surgery is primarily indicated for patients with grade III or IV cartilage defects according to Outerbridge. Surgical techniques include reparative marrow-stimulation techniques, i.e. migration of bone marrow cells into the cartilage defect, as well as restorative techniques using autografts, allografts, or synthetic material [19, 20]. These techniques have been shown to significantly relieve symptoms and improve function [13] and generally provide better results in the treatment of defects in the femoral condyles rather than in the patellofemoral compartment [16]. However, up to now no treatment option has proven to be superior to others in terms of efficacy and safety [21].

Microfracture treatment is a single-stage arthroscopic procedure that has gained popularity over the past two decades due to its minimally invasive approach, technical simplicity, limited surgical morbidity, and low costs [6, 23]. Because of these benefits and the fact that this technique does not rule out other cartilage repair procedures that might be needed in future [23], microfracture has become the treatment of choice for patients with knee cartilage defects of grade III or IV according to Outerbridge [24, 25]. Out of 150,000–200,000 US Americans who undergo knee surgery due to cartilage lesions every year, an estimated 60,000 are treated with this technique [26]. Moreover, microfracture has been shown to be the preferred method for the treatment of articular cartilage defects in recreational and professional athletes (including high-impact sports such as basketball, American football, soccer, and rugby) [27].

Numerous reviews about the microfracture technique have already been published, but to our knowledge no quantitative analysis of this technique has been performed so far. Being able to establish an average difference of the pre- and post-operative score values for this technique would be helpful for patients and clinicians. Therefore, the objective of our study was to systematically review medical literature in order to find controlled studies about microfracture in the treatment of patients with full-thickness cartilage lesions of the knee and statistically combine these studies in order to determine a best estimate of the average treatment effect. Furthermore, information was gathered to detect cartilage-specific and patient-specific factors that might have an influence on the clinical outcome.

Methods

We searched the electronic databases MEDLINE, EMBASE, CINAHL, and Cochrane Central Register of Controlled Trials. Our literature search was completed on 1 May 2011. As “microfracture” has been the generally accepted term for this technique since its introduction by Steadman [28] in 1980, we decided to use this search term. No language restrictions were applied. For the evaluation of clinical outcomes we used clinical scores, because their subjective variables referring to symptoms and function are highly associated with patient satisfaction [29, 30]. Our eligibility criteria are presented in Table 1.

Table 1 Eligibility criteria

First of all, two clinicians separately analysed the studies derived from our database research on the basis of the study title or abstract, respectively. Studies failing to meet the selection criteria (original paper, microfracture of the knee, outcome evaluation) and duplicates were excluded. Subsequently, full-text versions of the remaining 38 papers were obtained for detailed evaluation. Two reviewers then independently extracted all relevant information about outcome-related demographic data. Furthermore, they determined all eligible studies and independently evaluated their internal validity (risk of bias). For controlled trials we assessed the method of randomization, concealment of allocation, blinding, drop-out rates, as well as method of statistical analysis. For observational studies we used the criteria proposed by Deeks et al. [31]. Studies with a high risk of bias were excluded from further analyses and discrepancies were resolved by consensus. Finally, data on study characteristics and study design, patient age, defect size, etiology and localization, duration of symptoms, previous and concomitant surgeries, clinical follow-up, as well as pre- and postoperative clinical score values were extracted.

Statistical analysis

Because control procedures across included studies were heterogeneous, we could not perform a meta-analysis of the treatment effects of microfracture as compared to the control procedures. In order to determine a best estimate of the average expected treatment effect, we pooled the before/after-data of study arms using microfracture. Due to the fact that studies used different scales in the measurement of functional improvements, we standardized treatment effects using paired standardized mean differences. We assumed a pre/post-test correlation of 0.5.

Results

Figure 1 shows the selection process for eligible studies. Out of 1,030 citations obtained from electronic literature search, 38 original papers were suitable to determine cartilage- and patient-specific factors that might have an influence on the clinical outcome.

Fig. 1
figure 1

Selection process of eligible studies

Reparative microfracture does not induce growth of normal hyaline cartilage. The quality of the repair tissue consisting of type-I, type-II, and type-III collagen [32] varies from fibrocartilage [33] alone to a mixture of fibrocartilage and hyaline-like cartilage [3439]. Bae et al. [34], Gill et al. [40, 41], Miller et al. [42], Pässler et al. [43] and Steadman et al. [38, 44, 45] presume that the original defect filling will become stable over time and perform the function of hyaline cartilage, whereas Gudas et al. [33] and Gobbi et al. [35] express the opinion that the hybrid tissue does not support weight bearing in the long term. A decrease in score values was observed after 18 [46, 47] and 24 months [36, 37, 48, 49], respectively. However, according to Blevins et al. no deterioration of results was observed within three years after surgery [28].

Several studies conclude that microfracture is not effective for the treatment of large lesions [38, 48, 50, 51]. Miller et al. [42] reported that significantly less improvement could be achieved after treatment of defects larger than 400 mm² as compared to smaller lesions. According to Gudas et al. [33, 52], patients with small lesions (< 200 mm²) had a significantly better treatment effect than those with large defects (≥ 200 mm²). Asik et al. [53] reported equivalent results, as their relevant Spearman coefficient between defect size and Lysholm score was −0.538 (p < 0.001).

Asik et al. [53] and Steadman et al. [38] reported significantly better treatment effects for patients under the age of 35 years as compared to older patients and showed that Lysholm score values decrease with age. Asik et al. [53] observed a Spearman coefficient of −0.623 (p < 0.001). Steadman et al. [38] found a Pearson coefficient of −0.28 (p < 0.018). Furthermore, a multivariate analysis (R² = 0.193) by Steadman et al. [38] revealed age as the sole moderator with a regression coefficient of −0.299 (p = 0.011). According to Gudas et al. [33] and Knutsen et al. [37], patients younger than 30 years of age, and according to Kreuz et al. [46], patients younger than 40 years of age, respectively, had a significantly better clinical and functional outcome than older patients. However, no significant correlation was found between clinical scores and patient age in the studies of Miller et al. [42] and Kon et al. [51].

While Gill et al. [41, 54], Miller et al. [42], Mithoefer et al. [49] and Steadman et al. [38] found no influence of defect localization on the clinical outcome, Gobbi et al. [35] and Kreuz et al. [47] observed the best results in chondral lesions on the femoral condyles and consistently poor results at the patella. Poorer outcomes were observed in patients with degenerative defects as compared to traumatic lesions [35] and patients with traumatic lesions had significantly better clinical results than those with osteochondrosis dissecans [52]. In patients with higher preoperative activity levels (Tegner score [55] >4) significantly better clinical results were observed [28, 37]. Professional athletes had a higher repair cartilage fill volume [28] which was found to play a critical role in the postoperative functional improvement [56]. Furthermore, shorter preoperative duration of symptoms (<12 months) also significantly affected the clinical outcome [48]. Athletes without prior surgical intervention were more likely to return to high-impact sports than those who had undergone previous knee surgery [49]. Finally, it has been shown that it is more difficult to regenerate cartilage of hyaline-like quality when the patient has had previous knee surgery or a chronic defect [57]. Surprisingly, no statistically significant differences were observed between patients with and without anterior cruciate ligament [28, 35] or meniscus repair [35].

Five papers, i.e. four randomized controlled trials and one cohort study (the allocation of patients was determined by health care and insurance policy), met our eligibility criteria for statistical analysis. These papers are presented in Table 2. They include data of 187 patients between 15 and 60 years of age with chondral defects of 1–10 cm² and a follow-up period of two to five years. Concomitant surgery refers to anterior cruciate ligament reconstruction and surgical repair of meniscal tears. Osteoarthritis and valgus or varus deformities of more than 5° were exclusion criteria for all five papers. Thus, neither subgroup analysis nor regression analysis could be performed due to the small number of controlled studies.

Table 2 Demographic data of eligible studies

All patients were treated with the microfracture technique as described by Steadman et al. [58]. After surgery, they followed similar rehabilitation protocols, i.e. full weight-bearing was not allowed for four to eight weeks post-operatively (see Table 3). For the evaluation of the surgical outcome the Lysholm Score [59] was applied by Knutsen et al. and Basad et al. However, Gudas et al. and Kon et al. used the IKDC score [60] (2000 IKDC Subjective Knee Evaluation Form, Patients’ Part), whereas Saris et al. used the KOOS [61] questionnaire. Saris determined the overall KOOS rating as the average of sub-scores “function in daily living”, “pain”, “symptoms/stiffness”, and “quality of life”. Sub-score “sports” was excluded from the overall KOOS analysis, as patients were significantly limited in sports activities so that hardly any data were available for this sub-domain. Clinical knee scores evaluate comparable parameters, but rate them differently. Therefore, score values of one particular patient can vary considerably depending on the system chosen [6265]. For a better comparability of the individual treatment effects, the individual paired standardized mean differences were calculated and combined to the overall best estimate of 1.700 with [0.889; 2.511] as 95% confidence interval measured in units of standard deviation of the difference scores [66]. All data are presented in Fig. 2.

Table 3 Rehabilitation program
Fig. 2
figure 2

Forest plot of paired standardized mean difference

Due to the fact that the results of Gudas et al. differ significantly from those of Basad et al., Knutsen et al., Kon et al. and Saris et al. we excluded them from our further analysis, resulting in an overall best estimate of 1.106 with [0.566; 1.646] as 95% confidence interval (see Fig. 3).

Fig. 3
figure 3

Forest plot of paired standardized mean difference (without paper of Gudas et al.)

In any case, both analyses report a mean standardized effect size of more than 0.8, which has to be considered as “large” according to Cohen [67].

As we wanted to convert the summarized effect back into the respective scales of the numerical scores, we contacted the authors of the five relevant papers and asked for the standard deviations referring to the differences in the pre- and postoperative score values. Unfortunately, we only received one reply by D.B.F. Saris so that we could not calculate a clinically meaningful best estimate of the treatment effect achieved by microfracture.

Discussion

Our statistical analysis summarized five controlled studies about the microfracture technique. The forest plot in Fig. 2 shows that the paired standardized mean differences of four studies are comparable, whereas the results of Gudas et al. are significantly better. Notably, their patients belong to a clearly defined homogenous target group. All patients were either highly competitive sportsmen (40%) or well-trained and athletic people (60%) under the age of 40 years with condylar lesions of less than 4 cm². Including these athletes in the analysis of a diverse patient population would unjustifiably increase the mean expected treatment effect resulting in higher expectations that might not be fulfilled by the microfracture technique. Without the patients of Gudas et al., the best estimate of the paired mean standardized difference decreases from 1.700 to 1.106, which in our opinion, seems to be the relevant value for a representative patient population.

In order to provide an approximate clinically significant value of the mean expected treatment effect by microfracture, we performed a rough calculation by multiplying the standard deviation of the difference scores (20.16) measured as overall KOOS points with the standardized best estimate (1.106). According to the definition of the paired standardized mean difference [66], 22 overall KOOS points with a 95% confidence interval of [11; 33] represent a rough estimate of the mean expected treatment effect. As eight to ten KOOS points are suggested for a minimal perceptible clinical improvement (the difference on the measurement scale associated with the smallest change in the health status noticeable by the patient) [68], our rough estimate of the lower limit of the 95% confidence interval suggests at least a clinically significant improvement is achieved by the microfracture technique.

Limitations of our meta-analysis include the fact that failures (patients requiring revision surgery due to persistent or recurrent symptoms using the same or another surgical technique) were weighed differently by the authors. While Basad et al. excluded data of failures from the analysis a priori, they were considered by Gudas et al., Knutsen et al., Kon et al. and Saris et al. Their last recorded clinical follow-up score value before revision surgery was considered as their final clinical score value. Moreover, only Knutsen et al. and Saris et al. started their evaluation with all patients randomized to microfracture and did not exclude those who did not return for post-operative evaluation (see Table 2). Because deterioration of initial functional improvement has been reported between 18 and 24 months after microfracture [36, 37, 4649], the fact that the final outcome evaluation of the five controlled studies was not performed at the same time point must also be considered a limitation, as well as the similar but not identical rehabilitation protocols. Interestingly, Steadman’s [45] physiotherapy protocol includes the use of a continuous passive motion machine (any impact on the outcome could not be proven [69]) already in the recovery room. For lesions on the femoral condyle Steadman insists on crutch-assisted touch-down weight bearing for six to eight weeks with only exceptional bracing. In contrast, weight bearing as tolerated is allowed for patients with patellofemoral lesions treated with the microfracture technique. However, these patients have to wear a brace for at least eight weeks to prevent excessive shear forces on the maturing marrow clot.

We are aware that our rough calculation should be interpreted cautiously. Comprehensive, well-designed, long-term multicenter studies are required to evaluate clinical outcomes across study populations, including subgroup analysis to identify the predicting factors that can lead to functional deterioration after microfracture. Clearly defined indications would help to find out which patients will benefit most from this technique and which patients will probably not achieve permanent improvement. By allocating the latter to an alternative treatment option early failures would be avoided.

Conclusions

A calculation of 1.106 as the appropriate mean standardized treatment effect to be expected by means of meta-analysis of 187 patients (between 15 and 60 years of age with full-thickness chondral defects of 1–10 cm²) revealed a clinically relevant improvement of the post-operative clinical status as compared to the pre-operative status. An increase of 22 overall KOOS points may provide a rough estimate for the mean expected treatment effect achieved by microfracturing.