Introduction

Osteoporosis is a disease characterized by low bone mass and structural deterioration of bone tissue, leading to bone fragility and an increased susceptibility to fractures [1]. As osteoporosis and related fractures occur primarily because of aging, they are a growing health care burden in the aging population [2]. However, only 40 % of women with osteoporosis in Europe [2] and 32 % in the USA [3] reported being treated. In Korea, also, osteoporosis is an important public health problem. According to the 2009 Korean Health Statistics, among participating women aged 50 years or older, the prevalence of osteoporosis was 38.7 % [4]. According to the National Health Insurance database, which includes almost all prescription and treatment claims with diagnostic codes in Korea, the number of patients being treated for osteoporosis substantially increased by 26.7 % between 2005 and 2008 [5]. However, osteoporosis is currently under-diagnosed and under-treated. Among women with osteoporosis, 30.3 % were diagnosed with osteoporosis by a physician, and only 14.3 % reported being treated [4]. Accordingly, it is critical that further effort is given to find more effective methods for the prevention and early detection of osteoporosis in both men and women.

At present, dual-energy X-ray absorptiometry (DXA) is the most widely accepted method of diagnosing osteoporosis and of monitoring changes in bone density [6]. However, due to its relatively high cost, DXA is not recommended as a routine screening test for the general population [7]. Most guidelines limit the use of DXA for postmenopausal women aged 65 years or older, with the exception of younger adults with known risk factors [8]. Therefore, several osteoporosis risk-assessment models have been developed for pre-screening using simple variables. However, most tools have been developed and validated for the Western population [916]. For Asian women, the Osteoporosis Screening Tool for Asians (OSTA) was developed in community-dwelling participants who visited 21 clinics in eight Asian countries, and was validated in a Japanese population [17]. OSTA is based only on age and weight and had a sensitivity of 91 % and a specificity of 45 % in the original development dataset. OSTA has been replicated in various ethnic populations including Korean postmenopausal women [18], Filipino women [19], and even in Caucasian [20] and African–American women [21]. However, the subjects used in the Korean OSTA studies were limited to patients in a few clinics. Meanwhile, the Korea National Health and Nutrition Examination Survey (KNHANES) included DXA tests from the second half of 2008, which is the first nationwide bone mineral density (BMD) dataset for the Korean population. Therefore, this study aims to develop and validate an osteoporosis risk-assessment model to identify high-risk Korean women who may benefit from further evaluation of osteoporosis based on a nationwide dataset.

Materials and methods

Participants

The KNHANES, conducted by the Korea Centers for Disease Control and Prevention and the Ministry of Health and Welfare, is a nationwide survey to assess the health and nutritional status of a non-institutionalized representative sample of the Korean population. Participants from residential districts throughout Korea were selected using a stratified, multi-stage, clustered probability sampling design; for the 2009 survey (KNHANES IV-3), household units were selected using the 2005 census in Korea [4]. For the 2010 survey (KNHANES V-1), sampling was either based on the registered market value of apartment building complexes or a registered database of the Korean government system that includes all registered citizens [22]. For development of the osteoporosis risk-assessment model, 1,917 postmenopausal women aged 50 years or older who participated in the 2009 KNHANES were included. Of them, 708 participants were excluded from the present analysis due to at least one of the following reasons: absence of BMD measurement (n = 149), previously diagnosed osteoporosis or treatment for osteoporosis (n = 473), missing blood tests (n = 199), and being in a bed-ridden state (n = 36). Finally, 1,209 women were eligible for this study. For validation of the model developed, the 2010 KNAHNES dataset was used. Of 1,657 postmenopausal women aged 50 years or older, 611 participants were excluded in the same manner as the development dataset: absence of BMD measurement (n = 128), previously diagnosed osteoporosis or treatment for osteoporosis (n = 359), missing blood tests (n = 196), and being in a bed-ridden state (n = 27). Finally, data from 1,046 participants were used for validation of the model developed. All survey participants signed an informed consent form. This study was approved by the Institutional Review Board of Korea Centers for Diseases Control and Prevention (2009-01CON-03-2C, 2010-02CON-21-C) and Yonsei University Health System (4-2011-0222) and was monitored by the Human Research Protection Center of Severance Hospital, Yonsei University Health System.

Measurements

The survey consisted of a health interview survey, a health behavior survey, a nutrition survey, and a health examination survey. Household interviews and self-reported questionnaires were used to assess health behavior, past or current history of disease, and family history. Smoking status was categorized as either current smoker or non-smoker (past or never). Drinking status was classified into either current drinker or non-drinker (past or never). Regular exercise was defined as moderate-to-high intensity of physical activity at least three times per week. Trained examiners in specially equipped mobile examination centers performed anthropometrics, blood tests, and BMD measurements. Body mass index (BMI) was calculated as an individual’s weight in kilograms divided by their height in meters squared. Blood samples were obtained after a minimum fasting time of 8 h and handled according to standard procedures. All samples were analyzed within 24 h after arriving at the Central Testing Institute in Seoul, Korea. Serum 25-hydroxyvitamin D [25(OH)D] levels were measured using a gamma counter (1470 Wizard, Perkin-Elmer, Finland) with a radioimmunossay (Diasorin, Still Water, MN, USA) [23]. Low vitamin D was defined as a serum 25(OH)D level of less than 20 ng/mL [23]. Serum parathyroid hormone (PTH) levels were assessed using a chemiluminescence assay (Diasorin) for the measurement of intact PTH. Elevated PTH was operationally defined as a serum PTH level of 80 pg/mL or greater. Elevated alkaline phosphatase (ALP) was operationally defined as a serum ALP level of 300 IU/L or greater. BMD was measured using a QDR Discovery fan beam densitometer (Hologic Inc., Bedford, MA, USA) at total femur, femoral neck, and L1–L4 spine. The results of DXA measurements were analyzed at the Korean Society of Osteoporosis using Hologic Discovery software (version 13.1) [24]. The stability of the DXA measurements was maintained by daily calibration [25]. T scores were calculated using sex-specific normal values for young Japanese women; the reference means (standard deviations) of BMD at the femoral neck and lumbar were 0.803 (0.107) and 1.006 (0.115), respectively. Osteoporosis and low BMD were defined as a T score less than or equal to −2.5 and −2.0, respectively, at either the femoral neck or lumbar spine.

Statistical analyses

Potential risk and protective factors for osteoporosis were selected based on previous studies and statistical investigation of the development dataset. Age [917], body weight [913, 1517], BMI [14], hormone replacement therapy (HRT) [1016], previous (low impact) fracture [10, 1215], smoking [15], and history of rheumatoid arthritis [15] were evaluated because they were components of previously developed osteoporosis risk-assessment models [917]. Additionally, all components of the WHO Fracture Risk Assessment Tool (FRAX®) [26] except corticosteroid use, secondary osteoporosis, and parent fractured hip were evaluated. There were no relevant data available to evaluate these variables in the KNHANES. In addition, regular exercise [27], history of type 2 diabetes [28], depression [29], and biomarkers [30, 31] including 25(OH)D, PTH, and ALP were evaluated. Simple linear regression analyses were performed to detect variables that achieved borderline statistical significance (P < 0.15). Among the potential risk factors identified, covariates for the multiple linear regression model were selected by tenfold cross-validation. In detail, the development dataset was randomly divided into 10 subsamples. Nine subsamples were used to select significant covariates using stepwise addition and deletion (P < 0.15), and the remaining one subsample was used for validation. This process was repeated 10 times with different subsamples to determine the optimal number of covariates [32]. Based on the results of the multiple linear regression analysis with the selected covariates, variables that did not reach a statistical significance (P ≥ 0.05) were excluded. Multicollinearity among the investigated variables was assessed by computing a variance inflation factor. Subsequently, a final multiple linear regression model with the selected covariates was computed. The regression coefficient of each covariate was used to calculate its index weight. To standardize the effect of each variable, a ratio using a coefficient for each covariate divided by the reference value, the absolute value of the coefficient for age (per 10 years), was calculated. Each standardized coefficient was then multiplied by an integer that was able to discriminate the effect of each variable and the final value was rounded off as an integer.

To develop a simple and effective model, three candidate models were tested; Model 1 included age and weight, Model 2 added health behavior, and Model 3 added blood test(s). Even though blood tests are invasive, costly and complicated to apply to a general population, Model 3 was tested to confirm whether blood tests add to the prediction of osteoporosis. In addition, these three models were compared to OSTA, which was a model available in the Korean population. Correlations of the scores from the three candidate models and OSTA with actual BMD T scores (lower values at either the femoral neck or lumbar spine) were evaluated using Spearman’s correlation analyses. The goodness of fit of each model was assessed using the Hosmer–Lemeshow test [33]. The ability of each model to discriminate those with osteoporosis from those without osteoporosis was compared using area under the receiver operating characteristics curves (AUC) with sensitivity on the y-axis and (1 – specificity) on the x-axis for all possible cut-off values. Next, a cut-off score was chosen which yielded 90 % sensitivity or greater for detecting those with osteoporosis in each model [10, 11, 17, 20]. The cut-off score was applied for each model and the final model was selected based on sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), positive likelihood ratio, and negative likelihood ratio with their exact binomial confidence intervals. Additionally, false negatives (i.e., the number of missed cases that represents the number of undetected osteoporotic subjects per 1,000 subjects), false positives (i.e., the number of unnecessary DXA tests that represents the number of subjects without osteoporosis referred for DXA testing per 1,000 subjects), and AUC were compared. The model that showed the best performance using the fewest variables was selected and named the Korean Osteoporosis Risk-Assessment Model (KORAM) for postmenopausal women. KORAM was then validated using an independent dataset, KNAHNES V-1. Sensitivity analyses were performed with an outcome of low BMD T score of −2.0 or lower at either the femoral neck or lumbar spine [11].

According to the KORAM scores, three risk categories were created operationally: low, intermediate, and high risk of having osteoporosis. We defined low risk as having a less than 10 % probability of osteoporosis, high risk as having more than a 70 % probability, and intermediate risk as in between these values. The net reclassification improvement (NRI) [34] was then calculated to evaluate whether the KORAM risk categories provided a benefit in discriminating participants with osteoporosis from those without osteoporosis over the risk categories of OSTA. NRI was calculated by constructing 3 × 3 tables according to the risk categories of KORAM and OSTA, separately in participants with or without osteoporosis. Any upward movement in categories for participants with osteoporosis implied improved reclassification, and any downward movement indicated poor reclassification. The interpretation was opposite for participants without osteoporosis [34].

$$ {\text{NRI}} = \left[ {P\left( {{\text{up}}\left| {D = 1} \right.} \right) - P\left( {{\text{down}}\left| {D = 1} \right.} \right)} \right] - \left[ {P\left( {{\text{up}}\left| {D = 0} \right.} \right) - P\left( {{\text{down}}\left| {D = 0} \right.} \right)} \right] $$

where D represents osteoporosis, 1 for osteoporosis, and 0 for normal

A P value less than 5 % was considered significant. All statistical analyses were performed using the SAS software package (version 9.2.1; SAS Institute, Cary, NC, USA).

Results

Characteristics of study participants

Baseline characteristics of Korean postmenopausal women in the development dataset (2009 KNHANES) and the validation dataset (2010 KNHANES) are summarized in Table 1. A total of 1,209 and 1,046 postmenopausal women in the development dataset and the validation dataset, respectively, were eligible for the present study. Selected potential risk factors of osteoporosis were old age, low body weight, short height, current smoking, diabetes, low serum 25(OH)D, elevated ALP, and elevated PTH. Meanwhile, protective factors were current drinking, regular exercise, and currently taking hormones. The mean age was 63.5 years in the development dataset and 62.3 years in the validation dataset. However, the percentage of participants who were 75 years or older was higher in the development dataset (13.8 %) than in the validation dataset (8.9 %). The prevalence of osteoporosis was 33.9 % in the development dataset and 29.6 % in the validation dataset (Table 1).

Table 1 Baseline characteristics of participants in the development and the validation dataset

Development and validation of KORAM

According to tenfold cross-validation, seven variables associated with BMD T scores were selected: age, weight, HRT, regular exercise, current smoking status, current drinking status, and serum ALP levels. After adjustment for covariates, age, weight, HRT, and serum ALP levels had significant and independent associations with BMD T scores. No significant multicollinearity was observed among those variables. Regression coefficient, standard error, and index weight of each variable in the final multiple regression model are shown in Table 2. Based on the selected variables and their index weights, three candidate models were developed as follows:

Table 2 Regression coefficients and index weights in the final multiple regression model
$$ {\text{Model}}\;1 = \left[ {\left( {{{{\text{age}}\;{\text{in}}\;{\text{years}}} \mathord{\left/ {\vphantom {{{\text{age}}\;{\text{in}}\;{\text{years}}} {10}}} \right. \kern-0pt} {10}}} \right) \times \left( { - 4} \right) + \left( {{{{\text{weight}}\;{\text{in}}\;{\text{kilograms}}} \mathord{\left/ {\vphantom {{{\text{weight}}\;{\text{in}}\;{\text{kilograms}}} {10}}} \right. \kern-0pt} {10}}} \right) \times 3} \right] $$
$$ {\text{Model}}\;2 = \left[ {\left( {{{{\text{age}}\;{\text{in}}\;{\text{years}}} \mathord{\left/ {\vphantom {{{\text{age}}\;{\text{in}}\;{\text{years}}} {10}}} \right. \kern-0pt} {10}}} \right) \times \left( { - 4} \right) + \left( {{{{\text{weight}}\;{\text{in}}\;{\text{kilograms}}} \mathord{\left/ {\vphantom {{{\text{weight}}\;{\text{in}}\;{\text{kilograms}}} { 1 0}}} \right. \kern-0pt} { 1 0}}} \right) \times 3 + \left| {\left( {{\text{if}}\;{\text{no}}\;{\text{HRT}}} \right)} \right| \times \left( { - 3} \right)} \right] $$
$$ {\text{Model}}\;3 = \left[ {\left( {{{{\text{age}}\;{\text{in}}\;{\text{years}}} \mathord{\left/ {\vphantom {{{\text{age}}\;{\text{in}}\;{\text{years}}} {10}}} \right. \kern-0pt} {10}}} \right) \times \left( { - 4} \right) + \left( {{{{\text{weight}}\;{\text{in}}\;{\text{kilograms}}} \mathord{\left/ {\vphantom {{{\text{weight}}\;{\text{in}}\;{\text{kilograms}}} {10}}} \right. \kern-0pt} {10}}} \right) \times 3 + \left( {{\text{if}}\;{\text{no}}\;{\text{HRT}}} \right) \times \left( { - 3} \right) + \left( {{\text{if}}\;{\text{elevated}}\;{\text{ALP}}} \right) \times \left( { - 2} \right)} \right] $$

The ranges of scores in Model 1, Model 2, and Model 3 in postmenopausal women were from −23 to 9 (median −8), from −26 to 9 (median −10), and from −27 to 9 (median −11), respectively. In the development dataset, all models for postmenopausal women showed reasonable fitness according to the Hosmer–Lemeshow test (P for OSTA = 0.541, P for Model 1 = 0.773, P for Model 2 = 0.791, and P for Model 3 = 0.504). When we analyzed correlations between the predicted scores and actual BMD T scores, Spearman’s correlation coefficients were 0.635 for OSTA, 0.650 for Model 1, 0.660 for Model 2, and 0.665 for Model 3 (P for all <0.001). In terms of discriminative performance, Model 2 (AUC = 0.835) was superior to Model 1 (AUC = 0.828, P = 0.017) or OSTA (AUC = 0.816, P < 0.001), but not inferior to Model 3 (AUC = 0.836, P = 0.638).

We then compared the performance of OSTA and the candidate models with the selected cut-off scores to yield 90 % or greater sensitivity based on sensitivity, specificity, PPV, NPV, false negative, false positive, positive likelihood ratio, negative likelihood ratio, and AUC (Table 3). In the case of OSTA, the predefined cut-off score of −1 showed a relatively low sensitivity (79.5 %). Therefore, a cut-off score of 0 to yield 90 % or greater sensitivity was used as a reference. Compared with the other models, Model 2 with a cut-off score of −9 showed improved specificity, PPV, false negative, false positive, positive likelihood ratio, and negative likelihood ratio. Additionally, the AUC of Model 2 was significantly higher than those of OSTA, with a cut-off score of 0, and Model 1, with a cut-off score of −6, but comparable to that of Model 3, with a cut-off score of −9. Similar findings were shown in the validation dataset and in sensitivity analyses of both the development and validation datasets. Therefore, Model 2 with a cut-off score of −9 was finally selected and named the Korean Osteoporosis Risk-Assessment Model (KORAM) for postmenopausal women.

Table 3 Discriminatory performance of OSTA and the candidate models to identify postmenopausal women with osteoporosis and low BMD

Performance of KORAM by risk category

To define the clinical implications of KORAM, three risk categories were created with cut-off values of −9 and −15: greater than −9 for low risk, between −15 and −9 for intermediate risk, and less than −15 for high risk. In the development dataset, 36.4, 43.1, and 20.5 % were classified into low-, intermediate-, and high-risk categories, respectively. Among women in the low-, intermediate-, and high-risk categories, 8.2, 35.7, and 75.8 %, respectively, had osteoporosis. In the validation dataset, the percentage of women in the low-, intermediate-, and high-risk categories was 40.8, 45.4, and 13.8 %, respectively. The prevalence of osteoporosis was 11.0, 34.1, and 70.1 % in the low-, intermediate-, and high-risk categories, respectively (Table 4).

Table 4 Performance of KORAM to predict osteoporosis according to its risk categories

The net improvement of KORAM compared to OSTA with the suggested cut-off scores (−4 and −1) was 28.5 % (117 out of 410) for participants with osteoporosis, −21.2 % (169 out of 799) for those without osteoporosis, and 7.4 % [95 % confidence interval (CI) 1.1–13.6] overall. Further, the NRIs of KORAM compared to OSTA showed improved classification from 7.4 to 41.7 % across the different cut-off scores of OSTA: −4 and −1, −3 and 0, −4 and 0, and −5 and 0 (Table 5).

Table 5 Net reclassification improvement of KORAM compared to OSTA risk categories with the different cut-off values

Comparison of KORAM with the current Korean NHIC guidelines

Currently, Korean National Health Insurance Corporation (NHIC) guidelines for osteoporosis screening using DXA tests are limited to women aged 65 years or older, with the following exceptions: younger women with low body weight (BMI < 18.5), early menopause (<40 years), surgical menopause, and a past history or family history of non-traumatic fracture. KORAM was developed using each individual’s age, weight, and HRT status; thus, it may provide an efficient targeting guideline for the use of DXA tests. According to current NHIC guidelines, 64.1 % of women in the development dataset would be candidates for the measurement of BMD by DXA. With NHIC guidelines, 85.1 % of osteoporosis cases could be detected. The number of missed cases (false negatives) was estimated to be 149 per 1,000 subjects, and the number of unnecessary DXA tests (false positives) was estimated to be 534 per 1,000 subjects. With KORAM, 91.2 % of patients with osteoporosis could be detected. Moreover, the number of missed cases and unnecessary DXA tests per 1,000 subjects would be reduced to 88 and 495 cases, respectively (Table 6).

Table 6 Comparison between the current Korean NHIC guidelines and KORAM

Discussion

In this study, the Korean osteoporosis risk-assessment model for postmenopausal women was developed and validated in a Korean population based on a nationally representative BMD and health examination dataset: KORAM = [(age in years/10) × (−4) + (weight in kilograms/10) × 3 + (if no HRT) × (−3)].

To develop KORAM, we investigated clinically or statistically significant factors that were associated with BMD: age, body weight, height, current smoking status, currently drinking status, regular exercise, HRT, diabetes, depression, low serum 25(OH)D, elevated ALP, and elevated PTH. Of these factors, four variables in women including age, weight, HRT, and ALP levels were selected as potential components of the risk-assessment models. To form a simple and effective model, we first defined a baseline model for age and weight, which were core variables in the previously developed models [1013, 17]. We then assessed the incremental effect of adding HRT status to predict osteoporosis. Currently, HRT is a protective factor for osteoporosis and has been applied in previous models including simple calculated osteoporosis risk estimation (SCORE) [10], osteoporosis risk assessment instrument (ORAI) [11], osteoporosis index of risk (OSIRIS) [12], osteoporosis risk assessment by composite linear estimate (ORACLE) [14], and age, body size, no estrogen (ABONE) [16]. Additionally, serum ALP levels were negatively correlated with BMD in this study. Increased levels of bone-specific serum ALP, a biomarker of bone turnover, have been reported to be associated with an increased risk of osteoporotic fracture in community-dwelling postmenopausal women of Japanese ancestry [35]. Thus, we evaluated whether an invasive laboratory test such as serum ALP levels further improved the prediction of osteoporosis, even though serum ALP is less specific than bone-specific ALP [36]. When comparing the sensitivity, specificity, PPV, NPV, false positive, false negative, positive likelihood ratio, negative likelihood ratio, and AUC of each model, the model with age, weight, and HRT showed better performance than the baseline model. However, the addition of elevated ALP levels did not improve performance. Finally, KORAM was developed based on simple variables including age, weight, and HRT. Therefore, KORAM can be easily used in a primary care setting for pre-screening to decide whether to use DXA testing as well as in the general population for self-screening purposes.

In the present study, we evaluated both linear and logistic regression models to calculate an index weight of each variable to create a scoring system to predict osteoporosis. However, when we compared the performance of the models, the scoring system using the weights from linear regression analyses slightly outperformed those from logistic regression analyses (data not shown).

Additionally, all candidate models were compared with OSTA, which has been validated in many countries [1721]. Overall, KORAM showed better performance for the detection of osteoporosis than OSTA. The discriminative performance of KORAM was significantly superior to that of OSTA (AUC = 0.835 and 0.816, respectively, P < 0.001). In addition, KORAM demonstrated improved net reclassification from 7.4 % to 41.7 % over OSTA across different cut-off scores. Furthermore, when compared with current Korean NHIC guidelines, KORAM provided a more efficient targeting guideline for the use of DXA tests because it considers each individual’s age, weight, and HRT status (Table 6). Osteoporosis itself presents no specific symptoms, but the burden of osteoporotic fracture continues to grow with age. Therefore, the potential benefits and risks of DXA testing should be investigated. However, no clinical trials are available to evaluate the effectiveness of osteoporosis screening or any potential risks that can result from screening [7]. In false-negative cases, a diagnosis of osteoporosis can be overlooked and further treatment can be delayed. Conversely, false-positive cases can lead to unnecessary DXA tests, unnecessary exposure to radiation, and increased health care costs. However, even if participants who underwent DXA testing did not have osteoporosis (false-positive cases), the result of the DXA test will still assess the status of their bone health and help to estimate the proper interval for BMD testing. According to a recent study in white postmenopausal women aged 65 years or older, the estimated BMD testing interval for 10 % of participants to develop osteoporosis was over 15 years in women with normal BMD or mild osteopenia (T scores ≥ −1.5) [37]. However, this interval would be shortened to 5 years in women with moderate osteopenia (−1.99 ≤ T scores ≤ − 1.5) and even to 1 year in women with severe osteopenia (−2.49 ≤ T scores ≤ − 2.0) [37].

This study has several limitations. First, the cost-effectiveness of KORAM was not considered in the present study. Although more than 90 % of people with osteoporosis could be detected by KORAM, approximately 50 % would undergo unnecessary DXA tests. Therefore, to use KORAM in clinical practice, its potential benefits and drawbacks should be further evaluated. Second, despite using a nationwide representative dataset, the KNHANES is a cross-sectional study. Therefore, KORAM is limited to estimating prevalent cases of osteoporosis. Considering that the ultimate goal for improving bone health is to prevent osteoporotic fracture and that BMD scores provide only a marginal benefit for predicting osteoporotic fracture [32, 38], the role of KORAM in preventing osteoporotic fracture should be further evaluated. Currently, the Korean version of FRAX® to predict the 10-year probability of hip fracture and major osteoporotic fracture (clinical spine, humerus or wrist fracture) is available. Since KORAM and FRAX share two major components, age and weight, studies on correlations of a calculated risk of osteoporosis by KORAM with an estimated risk of osteoporotic fracture by FRAX and prevalent cases of osteoporotic fracture are needed in well-designed cohorts with the Korean population. Third, to calculate sex-specific T scores at the femoral neck and lumbar spine, site and sex-specific reference means and standard deviations were adopted from the Japanese data because no Korean reference data are currently available. Thus, KORAM should be further adjusted when Korean reference BMD data are available.

To our knowledge, this is the first development of an osteoporosis risk-assessment model for Korean postmenopausal women using a nationally representative dataset that includes BMD measurements and other relevant risk factors of osteoporosis. This study suggests that KORAM is a useful pre-screening tool for screening osteoporosis by DXA in the Korean population. Since KORAM is easy to calculate with simple variables, it can be used in either a primary care setting or in general use as a self-screening tool. However, prior to using KORAM in these settings, its cost-effectiveness, especially compared to current NHIC guidelines, should be investigated. In addition, replication studies using other Korean BMD datasets are recommended. Finally, further adjustment of KORAM using BMD data as a reference in the Korean population is necessary.