Introduction

Tibial plateau fractures are complex articular fractures that usually require surgical treatment. The ideal classification for such a fracture should describe its localization, its characteristics, should have high inter-observer reliability, should guide prognosis and assist in surgical planning.

Marchant first described tibial plateau fractures and divided them into: separated, depressed and combined fractures [1]. In 1960, Duparc and Ficat published one of the most complete classifications considering the coronal, sagittal and axial planes [2]. This classification was simplified in 1990 [3]. The Schatzker classification, published in 1979, divides fractures into six types and is the most widely used classification worldwide [4]. It places more attention on the lateral plateau and includes both anterior and posterior tibial plateau fractures. In 1987, the AO group presented an alphanumeric classification for these fractures that is similar to the Schatzker classification but which is more extensive [5]. In 2013, Gicquel compared the Schatzker, AO and revised Duparc classifications, concluding that computerized tomography (CT) scans improve inter-observer agreement, and they found similar results for the Schatzker and the revised Duparc classifications.

In the past few years, the CT scan has widened the scope of understanding of tibial plateau fractures and has favoured the development of column classifications. In 2010, Luo published the three-column concept for tibial plateau fractures, including a medial, a lateral and a posterior column [6]. This, and other research, has shown that the Schatzker classification might be insufficient to describe some types of fractures, such as those compromising the posterior zone of the tibial plateau [7, 8]. In 1974, Postel published a series comprising 279 cases of posteromedial fracture of the tibial plateau caused due to motorcycle accidents with the knee in flexion [9]. The frequent compromise of the posterior tibia has been identified in other series and this has motivated the use of posterior surgical approaches for treatment [10]. It is estimated that 65% of tibial plateau fractures are compromised of the posterior columns [6, 11, 12]. More recently, Chang et al. (2014) divided tibial plateau fractures into four columns: anteromedial, anterolateral, posteromedial and posterolateral [13], as shown in Fig. 1.

Fig. 1
figure 1

Classification of tibial plateau fractures into four columns: anterolateral (AL), anteromedial (AM), posteromateral (PL) and posteromedial (PM). A fracture might have one, or several, affected columns

We note that posterior fractures of the tibial plateau have, historically, been ignored, underestimated and mistreated. Classic classifications (such as AO and Schatzker) do not specifically evaluate the posterior part of tibial plateau fractures, and, in other cases (such as Duparc classification), they assess it incompletely. A four-column classification includes the whole articular surface of the proximal tibia. Classic classifications, moreover, have shown important inter-observer variability [14,15,16,17,18]. The aim of this study was to compare the intra-observer and inter-observer variability of a four-column classification compared to classic classifications.

Materials and methods

An inter-observer reliability study was performed in a level 1 trauma centre, the Fundación Valle del Lili in Cali, Colombia. Prior to the start of the research, the protocol was evaluated and approved by the Ethics Committee and Institutional Review Board of the Fundación Valle del Lili. The study included patients with tibial plateau fracture that were admitted to the hospital between January 2013 and September 2015. Patients were older than 14 years old, and received both x-rays and computed tomography (CT) of the knee. Fractures were excluded if they were associated with a bone tumour or with previous bone infection.

Once the fractures were deemed to have met the eligibility criteria, demographic data was registered and four orthopaedic surgeons were asked to classify each fracture according to four different classifications: AO, Schatzker, Duparc and 4-column.

Four-column classification system

Using the four columns described by Chang [13], a new classification system was developed in which every column received a letter: anteromedial column (A), anterolateral column (B), posteromedial column (C) and posterolateral column (D). Each fracture can be assigned a minimum of one letter and a maximum of four letters according to the degree of compromise in one or multiple columns. For example, a fracture in the anterolateral column would be represented as B (Fig. 2a and b); a fracture of the four columns as ABCD (Fig. 3a and b); a fracture of the anteromedial, posteromedial and posterolateral columns as BCD (Fig. 4), and so on.

Fig. 2
figure 2

a Case 1. Tibial plateau fracture Schatzker II, four-column type B; anterolateral column fracture with split and separation pattern. CT scan coronal view. b CT scan axial view

Fig. 3
figure 3

a Case 2. Tibial plateau fracture Schatzker V, four-column type ABCD; compromise of the four columns. CT scan axial view. b CT scan axial view

Fig. 4
figure 4

a Case 3. Tibial plateau fracture Schatzker V, four-column type BCD; the anterolateral, posteromedial and posterolateral columns are fractured. CT scan axial view. b CT scan coronal view

Intra-observer and inter-observer reliability

The four orthopaedic surgeons received the X-rays and CT scan for every case without any extra data or information about the patient and were asked to classify each case. They were instructed in the four classification systems and they received each classification with their proper illustration, to be able to check the classification as necessary. Two were senior orthopaedic surgeons with more than ten years experience in practice and the other two youth orthopaedic surgeons with less than ten years of practice experience. They were blind to demographic data from the patient, to postoperative x-rays, to the radiologists report and to the classifications done by the other participating orthopaedic surgeons. For the intraobserver reliability, one of the senior surgeons was asked to complete the classification of the fractures twice with a two-week interval between them and assigning a different order to the images the second time.

An independent typist registered the results. Demographic descriptive data was analysed calculating means and proportions. Genre, age, aetiology of the fracture, compromise of a posterior column and use of a posterior surgical approach were calculated. Kappa, intra and inter-observer concordance were calculated for the reliability analysis by an independent biostatistician. Stata 13.0 was used for the analysis. Firstly, the intra and inter-observer concordances were calculated by dividing the number of tests that were in agreement by the total number of tests. Chi-squared test was used to calculate differences between percentage agreements. Next, the kappa coefficient was calculated in order to take into account the effect of chance in the degree of agreement between observers. When interpreting kappa, values of 0–0.20 indicate slight agreement, 0.21–0.40 fair agreement, 0.41–0.60 moderate agreement, 0.61–0.80 good agreement, and 0.81–1.00 very good agreement [19].

Results

During the study period, 49 patients fulfilled the eligibility criteria and were included in the study. The mean age of the 49 patients was 39 ± 14.2 years, with a similar proportion of males (51%) and females (49%). Table 1 shows demographic data, including aetiology, showing that motor vehicle accidents were the principal cause of all fractures (69%).

Table 1 Demographic characteristics of patients with tibial plateau fracture

After the observers classified the 49 cases using the four-column classification, it was found that at least one of the posterior columns was affected in 67% of all fractures (n = 33). Nevertheless, only eight cases involved a posterior approach in the surgical management of the fracture.

Intra-observer reliability

The four-column classification showed very high percentage agreement (84%), with significant difference compared to the other classifications (p = 0.001). It was followed by Schatzker (60%), AO (50%) and Duparc (48%). Kappa values were also led by the four-column classification, the only one to show a good agreement (0.61), followed by Schatzker with moderate agreement (0.48), and Duparc (0.370 and AO (0.34), both with fair agreement.

Inter-observer reliability

The inter-observer results showed an overall good degree of agreement across the four observers for the four classifications. The four-column classification showed the highest percentage of agreement (79%), followed by Schatzker (71%), AO (59%) and Duparc (58%), with a statistically significant difference between them (p = 0.003). Meanwhile, kappa values showed a fair agreement for the Schatzker, four-column and Duparc classifications, while AO had only a slight agreement between observers (Table 2).

Table 2 Intra-observer and inter-observer reliability for the four orthopaedic surgeons according to different tibial plateau fracture classifications

Discussion

These results show that the four-column classification has the highest percentages of agreement for both intra and inter-observer reliability. It also has the highest kappa value for intra-observer evaluation and is second, after Schatzker, for inter-observer kappa. This shows that the four-column system is a very reliable classification, especially for the results among a same observer. This can be related to the cumulative experience of using it and the easy system of only choosing the affected columns, without having to decide if there is a split, a depression, or both. Therefore, it is a good complement for a classic classification. It has the advantage of being the most reliable and showing how compromised the different topographical areas from the articular surface of the tibial plateau are. Meanwhile, classic classifications are very useful in describing the fracture pattern but are less reliable.

Four-column and Schatzker were the most reliable among the four observers, showing the least variability between them and the lowest probability of chance affecting the results. These are important findings for clinical practice, as practitioners need to be able to speak the same language with colleagues when assessing a tibial plateau fracture, as this affects reliability between different observers.

New classification systems emerge as an attempt to better explain fracture patterns. A new classification, such as the four-columns classification, should be reliable but should also offer additional advantages in comparison to classic classifications. The way of dividing tibial plateau fractures, in the four-column classification, into two anterior and two posterior columns in the articular surface of the proximal tibia, and then assigning one letter to each column, is an easy way to classify these fractures. It also has the advantage of highlighting the posterior columns, which might be important to consider a posterior surgical approach. In this study, there were a high percentage of fractures in the posterior columns (67%), but only 24% of this group of patients were surgically treated through a posterior surgical approach. It is possible that, with the implementation of this new classification system, a higher percentage of cases would receive treatment using a posterior approach.

The main limitation of this classification system is that it provides information only regarding the compromise of the proximal tibia in the articular surface. It can be assumed that as more columns are fractured, the fracture status and prognosis are worse, but there are other important aspects that also need to be considered, such as the fracture pattern, metaphyseal/diaphyseal extension and soft tissue status. These could be assessed using a classic classification, such as Schatzker, and a soft tissue classification such as Tscherne [20]. The clinical practitioner, when classifying a tibial plateau fracture, might need to use the four-column classification in conjunction with a classic classification and a Tscherne classification, in order to make a complete assessment.

For future studies it is important to show the clinical relevance of using this four-column classification. There are some questions that may be answered: does increasing awareness in the posterior tibial plateau correlate with an increase in the use of posterior surgical approaches? Does the use of posterior surgical approaches in those cases correlate with better clinical, functional and radiological results in the short and long-terms?

To our knowledge, no previous studies have described a four-column classification system and compared it to the classic classification systems for tibial plateau fractures. This four-column classification has the advantage of showing the highest percentage of agreement between observers and among the same observer, with the best intraobserver and second best interobserver kappa value. Unlike the other classifications, it also includes the whole articular surface of the tibial plateau and highlights fractures of the posterior column. We therefore recommend its use in the clinical practice.

Conclusion

This new four-column classification offers a reliable system for classifying tibial plateau fractures. It showed the highest intra and inter-observer agreement. When taking into account the agreement that occurs by chance, Schatzker classification had the best interobserver kappa value, but again the four-column had the highest intraobserver kappa value. It should be noted that the four-column classification is a more inclusive classification for posteromedial and posterolateral fractures. We suggest that it should be used with one of the classic classifications in order to better understand fracture patterns, to allow more attention to be paid to the posterior columns, to improve surgical planning and to provide a more systematic way of choosing the best surgical approach.