Introduction

Femoroacetabular impingement (FAI) is a condition increasingly being recognized as a cause of hip pain and disability in the active patient, as well as one of the risk factors of osteoarthritis of the hip in non-dysplastic patients [13]. The concept of FAI, as initially described by Ganz [3, 4], involves the abnormal contact between the femoral head and the acetabulum. Thus, a primary bone lesion predisposes to soft tissue injury, such as labral tearing or chondral delamination. Three different types of FAI have been described: CAM, Pincer, and mixed impingement. CAM impingement involves a non-spherical femoral head abutting the acetabular rim. Pincer impingement involves a prominent acetabular rim abutting a normally shaped femoral head [3, 4]. Mixed impingement involves varying degrees of both CAM and Pincer impingement in the same patient. This mixed impingement is the most common pattern seen in FAI patients [5].

FAI is a complex diagnosis involving pathology at various locations within the hip joint. Several radiographic measurements have been described to quantify the severity of the disease in symptomatic hips. These include the alpha angle, center edge angle, femoral offset, presence of a crossover sign, etc. To make the diagnosis, the clinician must combine several radiographic parameters with findings on clinical examination.

Numerous surgical options have been described to treat symptomatic FAI. These can be classified into three main categories: (1) open surgical dislocation, (2) arthroscopic assisted combined with an open component, and (3) arthroscopy alone. Hip arthroscopy is increasingly being utilized, given the potential advantages of reduced surgical trauma and decreased morbidity. A recent review of the American Board of Orthopaedic Surgeons database showed an eightfold increase in the number of hip arthroscopies performed between 1999 and 2004 [6].

Reported short-term clinical outcomes after FAI surgery are generally shown to be favorable in most clinical series [79]. Unfortunately, radiographic outcomes are not as consistently reported in the literature [10•]. The purpose of this systematic review was to identify reported radiographic outcome measures used in arthroscopic FAI surgical studies. We hypothesized that there is high variability among surgical studies in reporting of radiographic outcomes.

Methods

Search strategy

The online databases PubMed, EMBASE, and Medline were used to search for literature addressing radiographic outcomes after FAI correction reported in clinical studies from database inception until January 7, 2016. The search terms “femoral acetabular impingement”/“femoroacetabular impingement”, and “arthroscopy” were used (Appendix A).

Study screening

Two reviewers (J.K. and M.M.) independently screened the titles, abstracts, and full-text articles. Any disagreements were discussed between the reviewers until consensus was reached, and a senior author was included for consultation when necessary. The references of the eligible, included studies were then screened to capture any additional articles that may have eluded the initial search strategy.

Assessment of study eligibility and quality

The research questions and inclusion and exclusion criteria were determined a priori. The inclusion criteria were studies on human adults 18 years or older, English-language studies, studies on arthroscopic FAI surgery, and studies with reporting of postoperative radiographic data. All levels of evidence were included. The exclusion criteria were studies reporting on patients that had arthroscopic surgery with no bony work (i.e., no change in radiographic parameters from preoperative values), non-human studies, cadaveric studies, conference papers, book chapters, review articles, and technical reports. The methodological index for nonrandomized studies (MINORS) checklist was used by two independent reviewers to assess the quality of the methodology used by the included studies [11]. Non-comparative studies are assessed out of a maximal MINORS score of 16, while comparative studies are assessed out of a maximal MINORS score of 24.

Data abstraction and statistical analysis

Relevant data were abstracted and recorded in a Microsoft Excel spreadsheet (Version 2007, Microsoft Inc, Redmond, WA, USA). Recorded data included details on the characteristics of the included studies such as authors, country, year of publication, sample size, and study design. Specifics regarding the reported radiographic outcomes were then extracted from each study. Descriptive statistics (including means, medians, standard deviations, and ranges) were used to summarize the data.

Assessment of agreement

In order to assess the inter-reviewer agreement, a kappa (k) statistic was calculated for the title, abstract, and full-text screening stages. Agreement was categorized a priori as follows: k of 0.61 or greater was considered substantial agreement; k of 0.21 to 0.60, moderate agreement; and k of 0.20 or less, slight agreement. Agreement for quality assessment using the MINORS score was determined using an intraclass correlation coefficient (ICC). An ICC less than 0.25 was considered to indicate little to no agreement, 0.25 to 0.50 fair agreement, 0.50 to 0.75 moderate agreement, and greater than 0.75 was considered to indicate high agreement [12].

Results

Search strategy

The initial search of three databases resulted in 1304 total studies. Three-hundred thirty-nine duplicates were removed. A systematic screening approach resulted in 23 available full-text articles for review (Fig. 1). There was substantial agreement among reviewers at the title (k = 0.817; 95 % CI, 0.785–0.849), abstract (k = 0.876; 95 % CI, 0.824–0.928), and full-text (k = 1.00) screening stages.

Fig. 1
figure 1

Literature search outlining reasons for exclusion of identified articles

Study characteristics

All included studies were conducted between 2007 and 2015. Table 1 lists descriptive characteristics of the studies included for analysis. The articles included in this review involved a total of 1348 patients. Twenty of the studies involved arthroscopic management, while three of the studies compared arthroscopic management with open surgical treatment [28, 30, 31]. Of the studies included, 15 were conducted in the USA and three in Switzerland. The remaining five studies were conducted in France (two), Chile (one), China (one), and Mexico (one). The mean sample size was 57.8 patients.

Table 1 Characteristics of included studies

Study quality

We identified five level II studies (two prospective comparative studies, two lesser-quality randomized controlled trials, and one prospective matched-pair comparative study) (21.7 %), five level III studies (three cohort studies, one retrospective comparative study, and one retrospective case series) (21.7 %), and 13 level IV studies (nine case series, three therapeutic case series, and one prospective cohort study) (60.9 %). There were no level I studies identified. The level of inter-rater agreement in the assessment of quality using the MINORS score was high with an ICC of 0.93 (95 % CI 0.83–0.97). The median MINORS score of the non-comparative studies was 10/16 (range 7/16 to 13/16). For comparative studies, the median MINORS score was 17.5/24 (range 12/24 to 21/24). The factor most consistently reported was a clearly stated aim, which was reported by 20 (87 %) of the studies. The factor that was least consistently identified was prospective calculation of sample size, which was reported by only 5 (22 %) of the studies (Table 1).

Radiographic outcomes

Across all the studies, 15 different radiological outcomes were reported postoperatively (Table 2). The alpha angle was the most frequently reported postoperative radiographic outcome (16 studies, 69.6 %). For these studies that reported postoperative alpha angles, seven studies indicated that lateral images were used to calculate alpha angle (with frog-leg lateral, cross-table lateral, and Dunn view all being mentioned). Two studies mentioned that anteroposterior (AP) images were used to calculate alpha angle. For the remaining studies, there was no mention on what view was used. Center edge angle on postoperative radiographs was used in eight studies (34.7 %). Femoral offset was measured in four studies (17.4 %). Tonnis classification was mentioned in three studies (13 %). Tonnis angle and resection depth ratio (%) was mentioned in two studies (8.7 %). Other radiographic outcomes measured included ischial prominence size (mm), presence of crossover sign, anterior rim angle, anterior wall angle, anterior margin ratio, resection depth (mm), beta angle, sector of resection, and acetabular bony impingement angle.

Table 2 Reported radiographic outcomes for included studies

Eight studies reported multiple measures in their results [20, 21, 23, 25, 26, 28, 29, 31]. Of these, five reported outcomes for both CAM and Pincer radiographic outcomes in the same study [20, 25, 26, 29, 31]. Three studies included magnetic resonance imaging (MRI) as part of their postoperative imaging protocol [22, 26, 31], while three other studies utilized computed tomography (CT) scan as part of their postoperative imaging protocol [27, 29, 32]. Table 2 shows radiographic outcomes for the included studies.

CAM-type FAI

Alpha angle was the most commonly reported radiographic outcome for CAM deformities, reported in 16 studies. The pooled mean ± standard deviation for postoperative alpha angle was 45.6° ± 8.2 (range 36.4–68.5°) [8, 1416, 1820, 22, 23, 2532]. The second most commonly reported radiographic outcome for CAM deformities was femoral offset reported in four studies. The pooled mean ± standard deviation for postoperative femoral offset was 8.8 mm ± 0.9 mm (range 7.8–9.56 mm) [20, 23, 28, 31]. The resection depth ratio reported in two studies to be 13.2 % (osteochondroplasty group) versus 21.8 % (femoral neck fracture group) [23], and 6 % [31]. Mean change from preoperative values to postoperative values could not be calculated due to wide variation in how the data was presented.

Pincer-type FAI

Center edge angle was the most commonly reported radiographic outcome for Pincer-type FAI, reported in eight studies. The pooled mean ± standard deviation for postoperative center edge angle was 34.1° ± 6.2° (range 30.3–35.9°) [13, 17, 21, 25, 26, 29, 31, 33]. The second most commonly reported radiographic outcome for Pincer impingement was the Tonnis angle reported in two studies to be 8.2 ± 3.4° [21] and 3.1° [25]. Mean change from preoperative values to postoperative values could not be calculated due to wide variation in how the data was presented.

Discussion

Interest in FAI has increased over the last decade, which has led to an increase in the number of hip arthroscopy procedures being performed [6]. Unfortunately, there is a paucity of strong level 1 or 2 evidence studying clinical outcomes after FAI surgery. There is even less literature studying postoperative radiographic parameters and how these relate to clinical outcomes in the long term [10]. Reporting postoperative radiographic outcomes has been shown to be important as it may predict outcome following the index surgical procedure, while under-correction of bony pathology has been identified as a leading cause of failure in these patients [35•, 36•].

A full comprehensive radiographic assessment of FAI involves a true AP pelvis, false-profile, cross-table lateral, frog-leg lateral, and a 45° lateral Dunn view in order to extrapolate spherical pathology from one-dimensional imaging [37]. In addition, the pathology can be quite complex involving abnormal bony changes on the femoral head, acetabulum, or both. This systematic review explores the consistency of radiographic outcomes after arthroscopic management of FAI. Given the heterogeneity of pathology seen with FAI, it is not surprising that studies report a wide variety of radiographic outcomes postoperatively.

There was substantial variability among the studies reviewed when comparing postoperative radiographic outcomes reporting of both CAM and Pincer-type FAI. Fifteen different outcomes were described among the studies included (Table 2). The alpha angle was the most commonly reported outcome. Even with the alpha angle, there was heterogeneity in how the measurement was calculated. Two studies calculated the alpha angle on AP radiographs. Seven studies indicated that lateral images were used to calculate the alpha angle, with frog-leg lateral, cross-table lateral, and Dunn views all being mentioned as views used to calculate the alpha angle. It is difficult for a surgeon to compare their results with the literature when there is such heterogeneity in reporting postoperative images. Given that FAI is a complex morphological alteration of the hip joint, it is not possible to have one postoperative radiographic parameter that can be inclusive of all the correction that has been performed. Although there was variability in how alpha angle was calculated, this measure was consistently used in papers that reported on CAM impingement. Of 16 papers that report on CAM impingement, all the 16 included alpha angle as part of their analysis. Similarly for Pincer-type, eight of ten papers that report on this form of impingement use center edge angle as part of their analysis. There is not the same heterogeneity in calculating center edge angle compared to alpha angle. This angle was consistently calculated on AP radiographs of the pelvis.

There were limitations with this study. This review is limited both by the number of studies available and the quality of these studies. This review looked solely at studies published in the English language. As such, there may be a publication bias. No attempt was made to contact the authors of the included studies to ensure data accuracy. It is possible that patients were included in more than one publication and therefore counted more than once in this study.

There are several strengths to this review. Our search strategy was comprehensive in its review of the literature. We used focused and clear inclusion and exclusion criteria to define our research question. We included multiple reviewers when screening, evaluating, and abstracting the data from the literature. This study provides a novel focus on radiographic outcomes following arthroscopic management of FAI relative to the increasing body of literature on clinical outcomes. This review is thorough, containing literature from North America, Europe, South America, and Asia.

Future studies should aim to standardize radiographic outcome reporting and combine these with clinical outcomes. Large-scale clinical trials that attempt to explore clinical outcomes combined with radiographic outcomes should be used to determine successful FAI management. Ideally, outcomes would be reported using plain radiographs so as to minimize cost and radiation exposure to the patient. For this to be possible, correlation of plain imaging to advanced imaging (3-D CT scan, MRI, etc.) would need to be established. Computer templating software would be an instrumental tool for a surgeon to accurately plan the amount and location of resection in order to maximize clinical outcomes.

Conclusions

There is significant variation in the reported radiographic outcomes after arthroscopic FAI surgery, with the vast majority of studies not reporting these values. Alpha angle was the most commonly reported outcome for correction of femoral-sided deformity, while center edge angle was the most commonly reported outcome for acetabular deformity. No single radiographic parameter is sufficient in diagnosing FAI. The clinician must use a combination of radiographic values in conjunction with clinical examination to make an accurate diagnosis. This study highlights the need for consistent radiographic outcome reporting in conjunction with clinical outcomes after arthroscopic FAI surgery.