Introduction

Vertebral compression fractures are a common problem of osteoporosis, with an estimated incidence of half a million symptomatic fractures sustained in Europe each year [1, 2] affecting 1.1% women and 0.6% men in the age group 50–79 years [3]. Incidence of osteoporotic vertebral compression fracture (OVCF) increases as people age, although at all ages, more women than men are affected [3]. Symptomatic OVCF can cause significant pain and decrease a patient’s mobility with substantial impact on a patient’s quality of life (QoL) [4], both in the short and long term [5].

Short-term treatment goals for patients who suffered an OVCF are pain relief, restoration of mechanical stability and mobility improvement. To this end, patients may receive either non-surgical (conservative) management (NSM) in terms of narcotic analgesics, bed rest and bracing, or they may undergo vertebral augmentation [6, 7]. As the acute fracture heals, the initial pain caused by the fracture subsides in a majority of patients, usually within a couple of weeks [8, 9]. Still, up to one third of patients experience insufficient response and/or intolerance to conservative treatment [10]. Also, in the elderly patient population, NSM increases the risk for adverse outcomes associated with the use of narcotics and prolonged inactivity. Hence, vertebral augmentation procedure (VAP) has become a widespread treatment option for many OVCF patients. As such, both vertebroplasty (VP) and balloon kyphoplasty (BKP) are minimally invasive procedures where cement is injected into the affected vertebra in order to stabilise the vertebra and reduce pain caused by the fracture. The procedures differ in the way that VP uses needles to deliver the cement to the vertebra while BKP first expands the vertebra with a balloon, before cement is injected. Utilisation data show that in the USA, more than 300,000 inpatient VAPs were performed between 2005 and 2010, with BKP accounting for almost three in four procedures [11].

Because published evidence about the effectiveness and safety of VAP in the treatment of OVCF is limited (see e.g. [12,13,14]), more research is needed to fully understand the benefits and harms of VAP. Further, for more than a decade, the scientific and clinical community has been stressing the need for “establishing which patients are most likely to benefit and at which point after their fracture PVA [percutaneous vertebral augmentation] becomes appropriate” [15]. After many years of study, there is still no consensus pertaining to these questions. Appropriateness is commonly defined as a treatment where the expected benefits outweigh the expected harms by a sufficiently wide margin [16]. The term is often seen within other disease areas and has leveraged the development and dissemination of so-called Appropriate Use Criteria (AUC) — sometimes referred to as Appropriateness Criteria —, most prominently in the treatment of cancer (e.g. [17, 18]) and cardiovascular disease (e.g. [19, 20]). In the area of OVCF, two AUC documents [21, 22] and one study on the applicability of AUC have been published to date [23]. However, guidance on appropriateness of treatment may also be found in other types of publications such as clinical practice guidelines, systematic reviews and position statements. AUC definitions as used by professional associations concerned with the diagnosis and treatment of diseases of the spine are summarised in Table 1. Using these definitions as guidance, appropriateness criteria was defined as literature making treatment recommendations based on available evidence and collective judgement from experts. This paper seeks to systematically review the available literature on appropriateness criteria for treatment of OVCF.

Table 1 Definitions of Appropriate Use Criteria

Material and methods

Review protocol

The review methodology was predefined in a full protocol in accordance with the requirements of the Preferred Reporting Items for Systematic Review and Meta-Analysis Protocols (PRISMA-P) 2015 statement [28]. Only minor amendments were made to the protocol. However, after study selection, we abandoned the idea of performing a quality assessment of included studies using the Appraisal of Guidelines for Research and Evaluation (AGREE) II [29]. This decision was based on the fact that most identified publications were not clinical practice guidelines, for which this tool was developed and validated.

Study identification

A comprehensive, systematic search of the databases PubMed and EMBASE was run on January 18, 2017, using the search termsFootnote 1: osteoporo*, vertebral, spinal, compression, fracture*, appropriate*, criteria, guideline, guidance, “position statement”, standard*, consensus, recommend*, “patient selection” and indication. Only publications reported in English between the years 2000 and 2016 were included. In addition, websites linked to professional associations, patient organisations, health technology assessment (HTA) bodies and databases of practice guidelines were searched during January 2017. This grey literature search was restricted to Europe’s “Big Five” (Germany, the UK, France, Italy and Spain), North America and Australia.

Selection of studies

After removal of duplicates, all references were screened for inclusion based on their title and abstract. The following inclusion criteria were applied: the publication (1) is an overview article such as an Appropriateness Criteria document, clinical practice guideline, consensus/position statement, meta-analysis or HTA, (2) is concerned with the treatment of vertebral compression fractures caused by osteoporosis (not by malignancy or trauma) and (3) makes recommendations on the appropriateness of one or more treatments in this patient group (not merely reviewing efficacy, safety or treatment indications). Subsequently, the full-text manuscripts of eligible publications were reviewed for inclusion by two independent reviewers (SL and JW).

Data extraction

Relevant data from the publications were extracted by both reviewers (SL and JW) using a standardised data extraction form, with any discrepancy resolved by discussion. The primary outcome measure was appropriateness criteria for the treatment of OVCF. Other extracted data was scope, method, patient population, treatment options, strength of recommendation, country/region for recommendation, publication type and publisher. Definition of appropriateness in the publications and evidence base used to support conclusions drawn was extracted when readily available. This information is provided in Table 2.

Table 2 List of publications included in the review

Data analysis

Appropriateness criteria were grouped by treatment option and publication type and summarised narratively in text form. No statistical analyses were performed.

Results

Study identification and selection

Details of the flow of studies included at each step of the review are specified in the PRISMA flow diagram (Fig. 1). A systematic search of the PubMed and EMBASE databases yielded 374 and 339 hits, respectively. Of the total of 713 references, 289 duplicates were removed. Based on title and abstract review of the remaining 424 records, 321 references were excluded. Additional references (n = 18) of potential relevance for this review were identified from websites of professional associations, patient organisations, HTA bodies and practice guideline databases. In total, 121 records were screened for eligibility. Of these, eight publication studies could not be obtained. Following detailed examination of the remaining 113 articles, 102 citations were excluded for not meeting the eligibility criteria. Most of these references were excluded because they were lacking statements on the appropriateness of a certain treatment. Eleven studies met the eligibility criteria and were included for data extraction.

Fig. 1
figure 1

Study flow diagram

List of included articles

A complete list of articles and their study characteristics, including publishers’ assessed level of evidence when readily available, is provided in Table 2. We included two recently published documents regarding Appropriate Use Criteria [21, 22], two position statements or consensus papers [31, 39] and six guidelines or technology assessments [40,41,42, 47,48,49]. Further, we included one systematic review [50], which also performed a meta-analysis on the data. No apparent biases linked to type of publication or publisher have been identified.

Appropriateness of non-surgical management

Five publications made treatment recommendations on NSM [21, 39, 40, 48, 49] (Table 3). As regards place in therapy, conservative therapy has been recommended as first-line treatment option in a range of patient groups, starting from all patients with OVCF [48, 49], to those with low level of pain, disability, or deformity [39] or negative magnetic resonance imaging (MRI) [21]. In a panel study, Anselmetti and colleagues assessed the appropriateness of different treatment options for OVCFs in 128 hypothetical patient profiles, which were derived by permutations of seven clinical factors (time since fracture, MRI findings, mobility limitation, severity of pain, spinal deformity, proof of ongoing fracture process, presence of pulmonary and/or gastrointestinal dysfunction) [21]. The authors conclude that NSM is usually appropriate in patients who lack MRI evidence of fracture and MRI-positive patients without other unfavourable factors, but recommend against NSM in patients with proof of ongoing fracture process and two or more other unfavourable factors [21]. Specific recommendations for subtypes of conservative therapy are available from the American Academy of Orthopaedic Surgeons’ (AAOS) clinical practice guideline for OVCF treatment [40]. For patients with acute OVCF, the guidelines suggest calcitonin treatment (AAOS strength of recommendation: moderate) and consider L2 nerve block as an option (AAOS strength of recommendation: limited), but are unable to recommend for or against other types of conservative therapy such as bed rest, analgesics and bracing (AAOS strength of recommendation: inconclusive) [40]. Across the majority of publications, NSM is considered an appropriate first-line treatment option for OVCF.

Table 3 Recommendations made for non-surgical management

Appropriateness of vertebral augmentation procedures

Two publications made statements on the appropriateness of VAP (either VP or BKP) in general [31, 39], without differentiating between the two techniques, and one publication made the same recommendations for both procedures [42] (Table 4). In fact, motivated by a lack of evidence for the superiority of one procedure over the other, some even regard “kyphoplasty and vertebroplasty as generally interchangeable techniques for the performance of percutaneous vertebral augmentation” [31]. Across the included publications, vertebral augmentation is considered as an appropriate treatment option for OVCF patients with high level of pain, who do not respond to non-operative therapy. Uncertainty remains, however, what constitutes a failure of NSM. Barr and colleagues propose to define failure of conservative medical therapy based on the patient’s pain level, response to analgesic drugs and functional status [31], acknowledging that individual differences in these variables preclude stringent cut-off values. Likewise, there is no hard-and-fast time period for a fracture to heal with NSM before VAPs may be offered. Even if Brunton and colleagues suggest to wait up to 6 weeks for symptoms to improve [39], Barr and colleagues found no support for the concept of a mandatory waiting time [31], and the National Institute for Health and Care Excellence (NICE) [42] does not discuss time since fracture in their recommendations. In general, VAP is the recommended treatment option in patients who have severe pain despite optimal pain management.

Table 4 Recommendations made for vertebral augmentation procedures

Appropriateness of vertebroplasty

In total, seven publications included statements on the appropriateness of VP in treating OVCF patients [21, 22, 40, 42, 47, 48, 50] (Table 5). A recent Cochrane review by Buchbinder and colleagues of 11 RCTs and one quasi-RCT of VP in patients with OVCF concludes that “current literature does not support the likelihood of identifying subsets of patients who would benefit from vertebroplasty” [50]. Further, Buchbinder concludes that their review does not support a role for VP in routine practice and that no demonstrable clinically important benefits of VP compared with a sham procedure were identified [50]. Similarly, the American Academy of Orthopaedic Surgeons (AAOS) makes only recommendations against the procedure. Based on five studies [32, 34, 36, 46, 55], they conclude that “By making a strong recommendation against the use of vertebroplasty we are expressing our confidence that future evidence is unlikely to overturn the results of these trials.” [40].

Table 5 Recommendations made for vertebroplasty

Others express a slightly less stringent approach towards VP. The Ontario Health Technology Advisory Committee (OHTAC) base their recommendations on the same studies as AAOS ( [32, 34, 36, 46]) except for one ([45]) instead of [55]) and recommend only against VP as first-line treatment [48]. Similarly, in the guidelines developed by the ACR based on analysis of current literature and expert opinion, McConnell and colleagues conclude that VP should be used for patients who have failed or cannot tolerate conservative or traditional management [22]. This recommendation is also supported by the NICE guidelines [42, 47]. Across the included publications, VP is considered an appropriate treatment option in patients who have failed NSM or cannot tolerate NSM.

Appropriateness of balloon kyphoplasty

Five publications made statements about the appropriateness of BKP [21, 22, 40,41,42] (Table 6). Based on five studies [38, 45, 56,57,58], the AAOS recommend BKP as an option for patients with an OVCF on imaging (limited strength of recommendation) [40]. This recommendation was downgraded from “moderate” to “limited” due to lack of evidence for VP and the technical similarities between VP and BKP. Recommendations by Karliner [41], published by the California Technology Assessment Forum (CTAF), are mainly based on results from one study [38]. Karliner suggests that it is important that patients considering BKP start with a trial of NSM given that the procedure carries some risk and that the mean age of the fracture was 5–6 weeks in the analysed RCT [38]. Most patient selection criteria are similar between VP and BKP in the reviewed publications. BKP has, however, been suggested to be advantageous in complex cases (e.g. several unfavourable factors or burst fractures).

Table 6 Recommendations made for balloon kyphoplasty

Discussion

Despite the significant burden of OVCFs, there are currently no universally accepted treatment pathways for this condition. Of the two main treatment options, which are NSM (conservative) and VAP, conservative therapy is considered by many as the first-line treatment option, see e.g. [22, 48, 49, 59]. This is mainly due to the fact that conservatively treated OVCF patients had sufficient pain relief within a couple of weeks [8, 9]. However, conservative treatment fails in a significant number of patients who continue to experience severe pain [10], which is when VAP may become a relevant treatment alternative. This paper reviewed and summarised the available literature on appropriateness criteria in the treatment of OVCFs to assess whether patient groups that may benefit the most from either NSM or VAP can be identified. To give an idea of possible conflict of interest, we have looked at type of publication and publisher. For example, one could presume that spine societies might have a subconscious bias to have a more favourable view toward spine interventions. However, we did not identify any noticeable bias linked to either type of publication or publisher.

Overall, there is a consensus that the majority of patients who have sustained an acute OVCF should be treated with NSM before other treatments are initiated. This is because the majority of patients with OVCF who are managed conservatively gradually improve. Primary candidates for conservative therapy include patients with negative imaging evidence of fracture and those who are not impaired by either severe pain, immobility or vertebral deformity [21, 39]. Controversy remains, however, about the length of a qualifying period of NSM before considering alternative treatment such as VAP, with suggested waiting time up to 6 weeks [39] or that “the appropriate timing in relation to the age of the fracture could be left for clinicians to judge” [42] . Although NSM is recommended in the first weeks after fracture, some evidence suggests that VP and BKP are most effective when completed within a few weeks of fracture [60].

The findings emerging from this review should be considered along with some limitations. The primary limitation is the lack of publications concerning treatment recommendations or appropriateness criteria for OVCF, possibly leading to heterogeneity among included studies meeting the pre-defined eligibility criteria. The study results could probably be improved by including more studies, e.g. non-English publications that may have been overlooked.

Another main limitation is the heterogeneity of patients and the heterogeneity of treatment recommendations in the included publications. Since the analysis is dependent on the quality of each of the individual studies, the accuracy and reliability of the pooled results in the source papers may have been influenced. This can probably be explained by limited clinical evidence of VAP, making the impact of expert judgement in each publication more important. An improved clinical evidence base for VAPs would be required to reduce the impact of opinion bias and provide better consistency between treatment recommendations and guidelines. One of the main reasons for variability in treatment recommendations is because the clinical data on the benefits and harms of VAPs compared to NSM or with each other is an area of controversy. For example, the two randomised controlled trials, comparing VP with a sham procedure involving the paraspinal injection of an anaesthetic, found no difference in terms of the primary outcome measure — pain relief [32, 34]. These studies, contradicting previous literature which was biased to favour VAP because of a placebo effect, spurred debate and concerns among both physicians and payers regarding the effectiveness of VP and why these two studies differed from the vast majority of the previous literature [61,62,63,64,65,66]. Thus, improving the understanding of the effectiveness of VAP is the most important area for further research in this field.

Beyond the uncertainty related to the clinical effectiveness of VAP, another important area for further research is the appropriate timing of VAP and the appropriate patient selection. Currently, there is a clear gap between the study populations in the main VAP clinical trials (patients with recent fracture) and treatment recommendations (e.g. patients that fail NSM). Also, optimal treatment for OVCF may differ between subgroups (e.g. elderly, comorbidities, level of spine deformity).

There will undoubtedly be a continued role for NSM for most patients and the future emphasis will likely include a refined pathway that uses the best clinical evidence to provide treatment guidelines. The production and collection of high quality clinical data analysing the treatments for OVCFs is of importance and this should include well-designed RCTs as well as on-label studies and an OVCF treatment registry. The assimilation of a larger body of data will promote appropriate treatment of patients with OVCFs.

In conclusion, there is reasonably good agreement in the scientific community that NSM is considered appropriate for patients with low level of pain and during the first weeks since fracture, and that VAP is appropriate for patients suffering severe pain who do not tolerate medical treatment or who do not respond to conservative therapy in a given time frame. However, the evidence to support consensus about use of VAP is limited and it is still very difficult to predict which patients will fail pain management. More research is needed to adequately identify the appropriate patient that would benefit the most from VAP.