Introduction

Approximately 10 million US adults aged 50 or more have osteoporosis, and an additional 33 million have low bone mass based on femoral neck bone mineral density [1]. The prevalence of osteoporosis-related fractures during the lifetime is 15–30% in white Caucasian males and 30–50% in white females [2]. In Taiwan, an analysis from Nationwide Health Insurance Research Database (NHIRD) revealed the average prevalence of osteoporosis in those aged ≧50 years was underestimated to be 1.63% for men and 11.35% for women [3]. Another study recruiting adults undergoing health examination showed the prevalence of osteoporosis was 18.5% [4]. However, in Asia–Pacific regions, osteoporotic hip fractures are projected to increase 2- to threefold between 2018 and 2050 [5], and Taiwan leads the Asia–Pacific region in the incidence of osteoporotic hip fractures [6].

FRAX® (fracture risk assessment tool) is an algorithm that can be used to calculate age- and gender-specific fracture probabilities from obtained clinical risk factors (CRFs) and bone mineral density (BMD) [7]. In fact, the CRFs for fracture, including age, sex, body mass index, history of fragility fracture, family history of fracture, smoking status, alcohol intake, use of oral glucocorticoid, rheumatoid arthritis, and other causes of secondary osteoporosis, were identified based on serial meta-analyses of substantial population-based cohorts [8, 9]. The country-specific Taiwanese FRAX model was derived from incorporating the National mortality data and hip fracture incidence in 1996–2002 with the backbone FRAX model of 12 cohorts and launched since 2010 [10]; however, large prospective cohort studies are needed to validate this country-specific model. The aim of this study is to evaluate the predictive performance of the Taiwan FRAX calculator using longitudinal fracture data.

Methods

Participants

After excluding one subject with incomplete follow-up data, we studied 1975 subjects (M/F = 932/1043), aged 40 years old and over, from 2 concomitant cohorts established in different regions of Taiwan during the period 2009–2010 [11]. One cohort recruited subjects aged 40 and above from Yunlin County in middle Taiwan [12]. Another cohort in Tianliao District of southern Taiwan recruited men and woman aged 65 or more [13,14,15,16,17]. The inclusion criteria are as follows: subjects who (1) aged 40 or more at baseline BMD assessment; (2) had complete data of FRAX CRFs and probability estimates (with and without BMD); (3) had medical coverage from Taiwan’s National Health Insurance Research Database (NHIRD) during the observation period from 2008 to 2016.

Among the 1200 subjects in the Yunlin cohort (males = 524; females = 676, average age = 59.6 ± 11.4 years old), the prevalences of osteoporosis diagnosed using DXA (dual-energy X-ray absorptiometry) in men and women were 40.2% and 41.0%, respectively [12]. In another elderly cohort, the Tianliao cohort recruiting 775 subjects (male = 408; female = 367, average age = 74.2 ± 6.1 years old), the prevalences of osteoporosis were found to be 18.1% in men and 60.4% in women [13, 14]. This study was approved by the Institutional Review Board of the National Cheng Kung University Hospital.

Measures and questionnaires

During enrollment, the lumbar and hip BMD of the study participants were measured using dual-energy X-ray absorptiometry (DXA, Prodigy, GE Lunar, USA, in Yunlin and Explorer, Hologic, USA, in Tianliao) [11]. Structured questionnaires were administered at the baseline to collect information on CRFs [15, 16]. Habitual smoking and high consumption of alcohol were defined as stipulated by the FRAX [17]. Information on rheumatoid arthritis and secondary osteoporosis were obtained by taking histories. Glucocorticoid exposure was documented according to the FRAX definitions [7].

The National Health Insurance (NHI) of Taiwan, a globally unique program in force since 1995, manages more than 99.9% of the medical claims data of all citizens, keeping records in the National Health Insurance Research Database (NHIRD) [18]. This database currently provided by Health and Welfare Data Science Center, Ministry of Health and Welfare (HWDC, MOHW), has been used as an important source of data for evidence-based studies in Taiwan. All of the participants in the present study were included in Taiwan’s NHIRD at some point during our observation period (2008 to 2016). We merged the basic demographic data of the study participants enrolled in the Yunlin and Tianliao cohorts, and subsequently confirmed the fracture rates using Taiwan’s NHIRD. The fracture outcomes observed up to December 31, 2016, were documented according to the ICD codes from Taiwan’s NHIRD. We identified all claims records of hospital admissions of patients from 2009 to 2016 in the NHIRD data after exclusion of trauma-relevant events (ICD-9-CM E800-E879 and E890-E99). Specific ICD-9 codes between 2009 and 2015 and ICD-10 codes after 2016 were used to identify osteoporosis-related fractures, including hip, distal radius, humerus, spine and osteoporotic fractures [11]. The accuracy of diagnosis of fracture had been endorsed [19]. The cumulative incidences of osteoporotic hip fracture and other major osteoporotic fractures (MOFs) at the end of follow-up were obtained. Patients were followed from the examination date in 2009 ~ 2010 until the fracture or until December 31, 2016.

Statistical methods

Data were analyzed using SAS 9.4 software. Intergroup comparisons of basic demographic data for the two cohorts were carried out using a Pearson’s chi-square analysis for categorical variables and Student’s t-test for continuous variables. The primary outcome was the first diagnosis of an osteoporotic hip fracture or other major osteoporotic fractures (including hip, vertebral, upper arm, or wrist). In an attempt to redefine the high-risk group for osteoporotic fracture prediction, the predictive ability of FRAX was assessed by analyzing calibration and discrimination. Predictive ability was assessed by comparing estimated risk fractures with incidence fractures (observed to expected fracture ratios, O/E). Expected fractures were estimated as the sum of the 10-year probability of fracture, referred from a validation study of the Spanish FRAX model conducted on a retrospective cohort of 1231 women aged 40–90 years [20]. Observed fractures were defined as the total number of fractures observed at the end of follow-up (until December 31, 2016). Subjects with recurred fracture at same site were analyzed at the first event. The cumulative incidence and incidence rates of osteoporotic fracture were calculated accordingly [21,22,23,24]. An AUC-ROC (area under the curve of the receiver operating characteristic) analysis was carried out to compare fracture discrimination based on the FRAX score with and without BMD and at different cutoff points. The Youden Index will be used for the suggestion of the optimal cutoff [25].

Results

Basic demographic analysis

In Table 1, compared with the subjects without incident fractures, those with incident fractures were older, shorter, and leaner, and comprised more women. Rationally, subjects with incident fractures had a lower bone mineral density but higher FRAX MOF and hip fracture risk scores and higher T-scores.

Table 1 Comparison of clinical parameters between participants with and without incident osteoporotic fracture

Osteoporotic fracture rates

Of the 1975 subjects followed up for an average 6.8 ± 1.1 years, 160 incident MOFs were identified (including 36% hip, 17% distal radius, 7.5% humerus, 45% clinical vertebra and osteoporotic fracture). A total of 18 and 142 fractures occurred in participants aged < 65 and ≥ 65 years, respectively. There were 58 incident hip fractures observed, most of which occurred in elderly individuals 65 years old and above. In this study, the cumulative incidence of all osteoporotic fractures was 8.1%. The incident osteoporotic fracture rates (per 100,000) were 921.2 for men and 1503.5 for women. The incidence rates of hip fracture for Taiwanese men and women (per 100,000) were 403.1 and 466.4, respectively.

The calibration and discrimination statistics

Table 2 shows the calibration statistics for the Taiwan FRAX model. In terms of major osteoporotic fractures, the predictive ability assessed using the O/E ratio calculated with FRAX with BMD was 0.81 (95% CI 0.69–0.94), and for hip fractures, it was 0.70 (95% CI 0.52–0.88). However, the 10-year FRAX model seemed to overestimate the risk of osteoporotic fracture when utilizing the fracture data with incomplete follow-up time. We further assumed that fracture rates are relatively stable over time, and the estimated hip fractures and MOFs of 6.8 years follow-up time would be 54.02 and 133.95 by interpolation. Consequently, at 6.8 years, the adjusted calibration ratio for hip fracture and MOF would be 1.07 (95%CI 0.82–1.39) and 1.19 (95%CI 1.02–1.39), respectively. Therefore, the calibration performance was improved and even better when referring to hip fracture (Supplementary Table 1). However, in the younger subgroup aged 40–65 years, the FRAX model tended to overestimate the risk of osteoporotic fracture (unadjusted O/E 0.42, 95%CI 0.22–0.61; adjusted O/E 0.61, 95%CI 0.36–0.97) (Table 3 and Supplementary Table 2).

Table 2 Calibration statistics for Taiwan FRAX 10-year major osteoporotic fracture/hip fracture
Table 3 Calibration statistics for Taiwan FRAX 10-year major osteoporotic fracture model by age subgroup

In the ROC analysis, the AUC for the prediction of major osteoporotic fractures using FRAX without BMD was 0.75 (95%CI 0.71–0.79), and using FRAX with BMD was 0.77 (95%CI 0.74–0.80). The results were similar in terms of hip fracture prediction (AUC 0.75 without BMD and 0.77 with BMD). Adding the BMD information in the FRAX model improved the AUC for fracture prediction. Table 4 shows the sensitivity and the specificity of the FRAX score with BMD at different cutoff points for identifying incident osteoporotic fractures. With the BMD information, the optimal cutoff value was 9.5% of the FRAX score for all major osteoporotic fractures, based on the Youden Index, where good sensitivity (76.9%) and specificity (65.3%) were obtained. For hip fractures, the optimal cutoff point for the FRAX probability with BMD was 4.0% based on the Youden Index, and the sensitivity and specificity were 74.4% and 68.3%, respectively. Our study further showed that for the FRAX scored with BMD, the sensitivity/specificity for estimating the risk of MOFs were 76.9%/65.3%, 73.8%/67.3%, 50.0%/82.4%, and 33.1%/91.4%, with cutoffs of 9.5%, 10%, 15%, and 20%, respectively. The sensitivity/specificity for estimating the risk of hip fracture with cutoffs of 1.0%, 1.5%, 3.0%, and 4.0% were 95.6%/27.7%, 91.9%/38.3%, 83.1%/58.5%, and 74.4%/68.3%, respectively. However, when the BMD was not included, the optimal cutoff values were 8.0% for the FRAX score for all major osteoporotic fractures and 4.0% for hip fractures using the Youden Index (Supplementary Table 3).

Table 4 Sensitivity and specificity of the FRAX score with BMD at different cutoff points for identifying incidence osteoporotic fracture

Discussion

The present study is the first large Taiwanese cohort study to validate the Taiwan FRAX model. The cumulative incidence of all osteoporotic fractures is similar with that of Japan (5.8%), Korea (11.2%), and Hongkong (9.4%) [21,22,23]. The incidence rates of hip fracture for Taiwanese men and women are also consistent with those of Asian populations [24].

The transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) initiative developed a set of recommendations for the reporting of studies developing, validating, or updating a prediction model. The present report is a prognostic prediction model study regarding prognosis following the rules set forth in the TRIPOD statement [26]. Our results showed that the Taiwan FRAX tool has good discriminatory/predictive ability for detecting both osteoporotic fractures and hip fractures in a Taiwanese population, but may have overestimated in individuals younger than 65 years of age. This phenomenon is contradictory to the Spanish FRAX validation studies, which showed underestimation of fracture probability in younger Spanish subgroups [20, 27]. In addition to validating the present prognostic model, investigations of the impact on clinical practice were also considered under the PROGRESS framework [28].

The majority of publications have identified an intervention threshold probability of 20% for major osteoporotic fracture and 3% for hip fracture [29]. A cross-country meta-analysis including 7 longitudinal cohorts was conducted to assess the diagnostic accuracy of FRAX in predicting incident osteoporotic fractures. When using 20% as the fixed intervention probability for major osteoporotic fractures, the sensitivity and specificity were 10.25% (95%CI 3.76–25.06) and 97.02% (95%CI 91.17–99.03), respectively. For hip fracture prediction, using 3% as the fixed probability threshold, the sensitivity and specificity were 45.70% (95%CI 24.88–68.13) and 84.70% (95%CI 76.41–90.44), respectively [30]. The Osteoporosis Society of Hong Kong documented the NOF (National Osteoporosis Foundation) 20% intervention threshold stated in its current guideline, but no practical instruction on the use of FRAX was given [31]. A Hong Kong Osteoporosis Study applied the NOF (National Osteoporosis Foundation, USA), NOGG (National Osteoporosis Guideline Group, UK), and Taiwanese guidelines to a cohort of 2266 postmenopausal women. During the mean follow-up of 4.5 years, a total of 106 incident osteoporotic fractures were observed, of which 21 were hip fractures. The optimal cutoff point of the 10-year probability of a major fracture was identified as 9.95%, with a sensitivity of 62.3% and a specificity of 73.5% [32]. In China, it was proposed that the intervention thresholds may be 4% for the 10-year probability of major osteoporotic fracture and 1.3% for hip fracture. According to the 37.5% prevalence of osteoporosis found in an epidemiology study of 778 postmenopausal women, the 62.5th percentiles of major osteoporotic fracture and hip fracture probability were 4.0% and 1.3%, respectively [33]. The Japanese guideline adopted 15% as a treatment probability threshold, considering that the 10-year probability of major osteoporotic fracture in the patients receiving pharmacological treatment in clinical settings was observed to be approximately 15% [34]. According to the aforementioned literature, the fixed probability thresholds have ranged from 4 to 20% for a major fracture and from 1.3 to 5% for a hip fracture [35], which is consistent with this report.

When cost-effectiveness was taken into consideration, a Taiwanese study revealed the cutoff of FRAX probability for intervention should be determined as 7% for women and 6% for men for hip fractures and 15% for women and 12.5% for men for MOFs, respectively [36]. In this study, the MOF and hip fracture intervention thresholds were estimated using the Markov model, in which willingness to pay was set at twice the gross domestic product per capita of Taiwan [36]. The relatively higher threshold than in the present study may be a reflection of low medical costs under Taiwan’s NHI reimbursement. Medical costs after fracture are relatively cheap in the consideration of GDP. Thus, the treatment thresholds were elevated concerning cost-effectiveness. This discrepancy may also result from the limitation of the Markov model due to not considering the BMD. On the other hand, discrimination statistic refers to the ability of the risk prediction model to distinguish between patients who experience a fracture and those who do not. The proposed cutoffs in our study were 9.5% and 4.0% for predicting major osteoporotic fracture and hip fracture determined by sensitivity/specificity of AUC-ROC analysis. Our results also revealed that adding the BMD into the FRAX model can improve the predictive performance related to incident fractures. In addition, the present report is the first large Taiwanese prospective cohort study recruiting a target population with fracture outcomes from the national databank after nearly 7 years of follow-up and was intended to validate the Taiwan FRAX model more reliably. There are 215 participants who received anti-osteoporosis treatment during the observation period. The results were proven to be consistent even after adjustment of anti-osteoporotic treatment and physical therapy during the follow-up period [11].

This study still has several limitations. First, all study participants were Taiwanese, which limits generalizability. However, the findings reconfirmed the usefulness of country-specific FRAX model for predicting incident fracture. Second, due to the fact that ICD-10-CM just has been implemented instead of ICD-9 in Taiwan since 2016, ICD-9-CM diagnosis codes were mainly utilized in the present study, which are relatively unspecified in terms of disease classification stakeholders [37]. Therefore, the mechanism of fractures in our study remained unclear using the ICD-9-CM code. Finally, a 6.8-year follow-up may not be able to adequately reflect the 10-year fracture risk precisely. However, a trend in fracture risk was proven and can be reconfirmed in the near future. In addition to validating the Taiwan FRAX model with the data used in the present cohort study, further reconstruction of the Taiwan FRAX model could be considered by either incorporating other important risk factors, such as a history of falls [11, 38, 39], the trabecular bone score [40], spine–hip discordance [41], or reducing the number of existing CRFs.

Conclusions

Our data revealed that the Taiwan FRAX® calculator was validated to predict incident fractures preliminarily. Cutoffs of FRAX probability were proposed as 9.5% and 4.0% for predicting major osteoporotic fracture and hip fracture in Taiwanese individuals, respectively.