Introduction

Vertebral fracture (VF) is the osteoporotic fracture most frequent, although it prevalence varies widely depending on the population studied [13]. It is a potentially severe complication of osteoporosis because of acute and chronic back pain, impaired quality of life and increased mortality [47]. Vertebral body fractures result in pain and functional restrictions, and provoke a marked decrease in quality of life [5, 7, 8]. Therefore, early prevention of spinal fractures and accurate diagnosis and treatment are crucial. In clinical practice, osteoporotic VFs are usually identified by characteristic changes in vertebral shape on a lateral spine radiograph. For research consensus, VFs are defined by a reduction in anterior, middle, or posterior vertebral height, although the minimum required reduction (e.g., 15 or 20 % of vertebral height) varies among definition schemes [9]. Despite its common severity, and its value to predict further osteoporotic fracture, often VF is underdiagnosed [10].

Hip fracture (HF) is the most serious complication of osteoporosis. Patients suffering from HF have a high morbidity, mortality and a lower quality of life [7, 1113]. There is a close relationship between previous VF and the incidence of new HF [14], but fewer studies have been performed about the prevalence of previously undiagnosed VF recognized at the moment of being admitted into a hospital due to a HF [15, 16].

We have performed this study in patients suffering from HF attended in an Orthopedic Ward on whom we have looked for previously undiagnosed VF, and the objective was to compare the concordance in the diagnosis of VF by three different observers: (1) an orthopedic surgeon, (2) a radiologist and (3) a bone mineral metabolism expert.

Materials and methods

The study was performed on 246 patients who were admitted into the Orthopedic Surgery Ward of the Hospital University Insular, Gran Canaria, Spain, suffering from a HF, from January 1, 2012 to December 31, 2012. We studied the prevalence of previously undiagnosed VF in these patients by performing an X-Ray study. 177 patients completed the study.

The patients were informed of the objectives of the study and their written consent was requested. For all subjects a questionnaire, previously validated and used in other similar clinical studies, was completed to gather clinical data on osteoporosis [17, 18]. A basic physical examination was also conducted. Lastly, a lateral thoracic-lumbar X-ray was carried out on the subjects.

All the X-rays were brought together and studied by three different observers: one was the Orthopedic Surgeon who attended most of the patients, another was a radiologist and the third was an expert on bone metabolic diseases. They were all instructed to use the Genant criteria for the diagnosis of VF [9]. According to this, the existence of vertebral deformity was stated when there was a reduction in the vertebral height higher than 20 %.

We excluded from the study those patients who where not living in the Canaries (mainly tourists), those who had a high-energy trauma and those suffering from cancer, with or without bone metastases (See flow chart, Fig. 1).

Fig. 1
figure 1

Flow chart of the patients included in the study. *5 patients were diagnosed of cancer and referred to Oncology

The data collected were entered into a database already set up in the statistical program SPSS (Statistical Package for the Social Sciences), for which we had the necessary legal licenses. For the analysis of the data, the Kolmogorov–Smirnoff test was applied to establish the goodness of fit to normality for the variables studied. For each group studied, the variables categorized were summarized in frequencies and percentages, and the numericals in averages and standard deviations.

Interobserver variation was measured by means of concordance, which was calculated in our study calculating the degree of agreement among several observers using the Kappa index, published by Cohen [19]. Cohen’s kappa coefficient is a statistical measure of inter-rater agreement or inter-annotator agreement [1] for qualitative (categorical) items. It is generally thought to be a more robust measure than simple percent agreement calculation, since κ takes into account the agreement occurring by chance. Applying this index, independent observers have a value = 0, while if the agreement is perfect, Kappa index reaches to 1.

Results

The flow chart of the study is shown in Fig. 1. From 246 patients who were admitted to the Orthopedic Ward because of a hip fracture, we excluded 69 because of different reasons. 177 patients completed the study, 108 women (61.1 %) and 69 men (38.9 %). Table 1 shows the characteristics of the population studied. The mean age of the whole patients was 78.6 ± 10.5 years old, 80.5 ± 8.8 years old for females and 75.5 ± 12.3 years old for males, p < 0.01. More than one third of the patients (34.1 %) had suffered from a previous fracture.

Table 1 Characteristics of the studied population

Table 2 shows the prevalence of vertebral fractures in the population studied depending on the observer. The differences vary from 41.8 %, observer 3 (Specialist in Bone Metabolic Diseases) to 47.5 %, observer 2 (Radiologist). Observer 1 (Orthopedic Surgeon) found a prevalence of 46.3 %, nearer to observer 2.

Table 2 Prevalence of vertebral fractures depending on the observers

Table 3 shows the agreement found among these three different observers in the diagnosis of VF. The lowest concordance was found between observer 3 (Specialist in Bone Metabolic Diseases), compared to the other observers, 72.3 % in both cases, while the highest concordance was obtained between observer 1 (Orthopedic Surgeon) and observer 2 (Radiologist), 80.1 %.

Table 3 Interobserver concordance in the diagnosis of vertebral fracture

Discussion

VF are the most frequent fragility fractures and its prevalence is thought to be 117 people per 100,000 in the general population [20], and the lifetime risk of spinal fracture in women over the age of 50 is about 15 % [21]. The data from the EVOS study have allowed a more precise assessment of radiographic vertebral fractures in a broad population.

Between the age of 75 and 89, the incidence of vertebral fractures is as high as 13.6 per 1000 inhabitants/year in men and 29.3 per 1000 people/year for women [1]. Recent studies indicate than the incidence of VF is even higher than previously described [2, 3, 7].

As a powerful predictor of future fracture risk, the identification of VF helps target individuals who will benefit from anti-fracture therapy, but this identification is problematic because: (a) “normal” radiological appearances in the spine vary greatly both among and within individuals; (b) “normal” vertebrae may exhibit misleading radiological appearances due to radiographic projection error; (c) “abnormal” appearances due to non-fracture deformities and normal variants are common, but it can be difficult to differentiate from true VF [22], and finally, (d) there is no consensus regarding the definition of osteoporotic VF [23].

Genant’s semi-quantitative assessment on standard radiographs may offer useful information about the severity and prognosis of osteoporosis and it is the standardized visual method for the diagnosis of vertebral fracture, commonly applied in research studies as a surrogate gold standard, based on its good reproducibility and ability to differentiate fractures from other deformities [23]. Underdiagnosis of vertebral fracture is a worldwide problem [10]. In a multicenter, multinational study, the authors assessed radiographic diagnoses of vertebral fracture in 2451 postmenopausal women with osteoporosis and comparison between local and central readings yielded a false-negative rate of 34 %.

Indeed, in patients suffering from HF, underdiagnosis of vertebral fracture is still a problem. Some studies performed to look for VF previously undiagnosed in different populations have shown high rates of VF, either in patients attended in primary care with chronic back pain [5], in outpatients attended at Internal Medicine [24], in patients with osteopenia [25] or in patients admitted in hospitals because of a HF [15].

In our study, depending on the observer who made the diagnosis, the prevalence of previously undiagnosed VF in patients with HF varied from 41.8 to 47.5 %. Irrespective of these differences, the figures are high enough to be a major cause of concern, since none of these patients had been diagnosed with VF or treated for this problem. Accordingly, we selected three different observers to assess the possible differences in the diagnosis of vertebral fractures among these patients. Actually, the same method was used, i.e., Genant’s semi-quantitative assessment [9]. Surprisingly, there was little concordance among the three of them. Cohen’s kappa coefficient varied from 0.432, the lowest, to 0.598, the highest, being the perfect concordance 1. Our results show that the diagnosis of VF may differ substantially depending on the observer who performs the study.

To avoid this variability, some other methods have been used; for example, DXA, when it is done for bone mineral density measurement, may also detect VF in asymptomatic patients [23]. Many other reports have stated that magnetic resonance imaging (MRI) has a high degree of accuracy for the definite diagnosis of incident spinal fracture, and it continues to be used as the most useful tool. However, due to limitations in access to the required equipment and considerations that must be given to the economics of medical treatments, it is not possible to use MRI with all patients [23, 26, 27].

Our results suggest that a gold standard for the diagnosis of VF is needed because, in some cases, one patient could be diagnosed with a vertebral fracture or not, depending on the physician who sees the X-Ray, and, certainly this situation should be avoided. The problem would be even bigger if instead of using a unique method for the diagnosis of vertebral fracture, Genant’s method, some other criteria were used [1, 2832]. Then, the concordance would be even lower.

One limitation of the study is the lack of a gold standard to diagnose vertebral fracture, which would allow us to know which observer was nearer to the right diagnosis of vertebral fracture.

In conclusion, patients suffering from HF have a high prevalence of previously undiagnosed and untreated VF. There is a low concordance among three different observers; so further studies are required to look for a gold standard that allow to diagnose accurately VF.