Introduction

The number of osteoporosis patients is increasing in the super-aged society in Japan. Osteoporotic fragility fractures are also increasing [1]. Although Imai et al. reported that the age-specific incidence of hip fractures in Niigata Prefecture was lower in 2015 than in 2010 [2], the total number of hip fractures in Japan continues to rise.

Among osteoporotic fragility fractures, hip fractures and vertebral fractures have a great impact in clinical practice. These fractures significantly degrade patients’ quality of life (QOL) [3] and are related to an increased mortality rate [4]. In addition, hip fracture treatment is expensive because it usually requires surgery. Hip fracture is a major factor leading to the need for long-term care and also increases nursing care expenses [5]. The increased expense associated with treating these fragility fractures is an urgent issue for health economics in Japan.

Osteoporosis treatment aims to prevent fragility fractures by raising bone mineral density (BMD). Japanese guidelines for the prevention and treatment of osteoporosis (2015 edition) provide standard treatment guidance [6]. The evidence for various osteoporosis treatment drugs is provided in the guideline; doctors use their experience and knowledge to select the most appropriate drug according to the patient’s age and fracture risk.

The Japanese guideline recommends alendronate, risedronate, and denosumab for patients with a high risk of hip fracture [6]. Major risk factors for hip fracture include female sex, low BMD, older age, and a history of previous fragility fractures. Additional risk factors include smoking, alcohol use, family history of hip fracture, exercise, BMI, and low calcium intake [6].

In recent years, the importance of health economic analysis has been recognized; results of economic analysis are required to be used in clinical decision-making and in medical policy development in Japan. A trial of cost-effectiveness evaluation was implemented beginning in fiscal year 2016; the results will be adapted to adjust the price of drugs and medical devices in the future [7]. In 2017, an official guideline for economic evaluation was developed in response to the request of the Central Social Insurance Medical Council (Chuikyo) [7]. Because osteoporosis treatment is associated with health economic problems in Japan, it is very important to analyze treatment from the viewpoint of health economics.

Osteoporosis treatment is generally considered to be cost-effective when used in patients with a high risk of fracture [6]. The purpose of this study was to analyze the cost-effectiveness of 5 years’ treatment with denosumab or alendronate in elderly Japanese women with low BMD and existing previous vertebral fracture, who were considered to have a high risk of fragility fractures.

As mentioned above, the guideline recommends alendronate, risedronate, and denosumab for patients at high risk of hip fracture [6]. In Japan, alendronate has the largest market share among bisphosphonates, so we chose alendronate as the representative bisphosphonate [8]. The relative risk reduction for fracture occurrence reported in a network meta-analysis was nearly equal for alendronate and risedronate [9]. In addition, drug adherence, side effects, and the offset time effect of the two bisphosphonate are thought to be almost the same. Because risedronate and alendronate produce very similar results, this study only included weekly alendronate in analysis.

Materials and methods

Target population

In Japan, osteoporosis is defined as a T-score lower than − 2.5 or BMD lower than 70% of the young adult mean (YAM), measured with dual-energy X-ray absorptiometry (DXA) [6]. In addition, patients with previous hip or vertebral fracture are diagnosed with osteoporosis, regardless of T-score or BMD [6]. Risk factors for hip fracture include female sex, old age, low BMD, and history of vertebral fracture [6].

The reference value for BMD in the proximal femur of Japanese women aged 20 to 29 years is 0.790 ± 0.090 g/cm2. Thus, 70% of the YAM for BMD is 0.553 g/cm2, 65% of YAM is 0.5135 g/cm2, and 60% of YAM is 0.474 g/cm2. T-scores were calculated using NHANES III reference value of 0.858 ± 0.120 g/cm2; 70% of YAM was equivalent to a T-score of − 2.54, 65% of YAM was equivalent to a T-score of − 2.87, and 60% of YAM was equivalent to a T-score of − 3.2 [10].

The target population in this study was individuals at high risk for fragility fractures, especially hip fractures. Specifically, the population was 75-year-old Japanese women with a BMD of 65% of YAM (T-score, − 2.87) and a history of previous vertebral body fracture. We used the population of women with a BMD of 65% of YAM (T-score, − 2.87), which is 5% lower than the diagnostic criteria, as a population at higher risk of fractures. Because Japanese epidemiological data indicate that hip fractures markedly increase when patients reach their 70s [11], we regarded the high-risk population for hip fracture as Japanese women aged 75 and older.

Markov model

Markov model analysis was performed to estimate the cost-effectiveness of osteoporosis treatment, comparing 5-year treatment with subcutaneous denosumab every 6 months versus weekly oral alendronate or no treatment.

In this model, the cohort was transitioned between health states in a 1-year cycle and was followed from the time of treatment initiation until death or 100 years of age, which was defined as a lifelong time horizon.

The model consisted of 12 health states: “event free,” “hip fracture,” “post-hip fracture,” “second hip fracture,” “post-second hip fracture,” “vertebral fracture,” “post-vertebral fracture,” “second vertebral fracture,” “post-second vertebral fracture,” “bed ridden,” “drop out,” and “death” (Fig. 1).

Fig. 1
figure 1

Markov model used in this study. Patients start in event-free state and move to other states with a certain transition probability in a 1-year cycle. Dropout may occur until the third year after start of drug treatment. Patients can move from all states to the state of “dead”; the arrows representing this movement are omitted

In this model, all patients began in the “event-free” state and were exposed to the possibility of an event occurring with a certain probability every year. Major events were “hip fracture” and “vertebral fracture.” In a cycle in which fracture did not occur, the patient remained in the “event-free” state. A patient who had hip fracture or vertebral fracture moved to the “post-hip fracture” or “post-vertebral fracture” state. We set the model so that hip fracture and vertebral fracture could occur up to twice. Models in previous reports [12] commonly allowed patient to have multiple vertebral fractures. Because the target population of this study was 75-year-old Japanese women, we considered that it would be rare for a patient to receive treatment for multiple vertebral fractures before death. Therefore, we set our model so that patients could have a maximum of two clinical vertebral fractures. Our intention was to set the number of vertebral fractures slightly conservatively.

Patients who had vertebral fracture could have hip fracture, but patients who had hip fracture could not have vertebral fracture, as in a previous report [12]. After hip fracture, patients had a certain probability of becoming bedridden and entering a nursing home. We assumed that patients who entered a nursing home stayed there until death without experiencing a new event. Therefore, patients in the state “bedridden” had only two possibilities: death or remaining bedridden. “Death” could occur in all states. Patients could discontinue osteoporosis treatment up to 3 years after treatment initiation; patients who continued treatment until 3 years continued treatment thereafter [13].

This model did not consider fragility fractures other than hip fracture and clinical vertebral fracture (e.g., wrist fracture or pelvic fracture) because data derived from the Japanese population are insufficient. We considered clinical vertebral fractures in this model; asymptomatic morphometric vertebral fractures were not considered. Thirty percent of all vertebral fractures are considered clinical vertebral fractures [14].

This model was constructed and analyzed with TreeAge Pro 2016 (TreeAge Software, Williamstown, MA, USA).

Outcome estimation

Transition probabilities

The annual incidence rates of hip and vertebral fracture were estimated with the equations of Moriwaki et al. [14] (Supplemental Table 1). These equations included patient age and BMD and epidemiological data from post-menopausal Japanese women with osteoporosis and osteopenia [14]. The transition probability (p) of events occurring within a certain time (t) was calculated using the incidence rate (r), according to the following formula: p = 1 − exp(−rt) [14]. In this study, the transition probability of clinical vertebral fractures was derived from the incidence rates of patients with previous vertebral fractures (Supplemental Table 1).

The relative risk of hip fracture in patients with previous vertebral fracture was derived from the report of Kanis et al. [15]. The relative risk of subsequent hip fracture in Japanese patients with previous hip fracture was derived from the report of Hagino et al. [16].

The age-specific mortality rate was derived from the 22nd Life Table published by the Ministry of Health and Welfare of Japan [17].

The odds ratio of death after hip fracture was determined with the formula reported by Moriwaki et al. [4, 14]. The odds ratio was calculated from the first to the tenth year; each odds ratio was incorporated into the model. We assumed that patients would continue to have an increased mortality rate after more than 10 years after a hip fracture.

The probability that a patient would become bedridden and enter a nursing home after hip fracture was derived from the report of Hayashi et al. [18].

Clinical efficacy

To use data close to the conditions that we established, we chose the data on fracture-prevention effect. The relative risks of hip and vertebral fracture in patients receiving alendronate were derived from data on the fracture-prevention effect of alendronate in patients with previous vertebral fracture in the sub-analysis of the FIT trial [19]. The relative risk of hip fracture in patients receiving denosumab was derived from data on patients aged over 75 years with T-scores below − 2.5 reported by Boonen et al. [20]. The relative risk of vertebral fracture in patients receiving denosumab was derived from data on patients aged over 75 years reported by McClung et al. in the sub-analysis of the FREEDOM trial [21] (Table 1).

Table 1 Parameters used in this study

In our model, the post-cessation drug effects continued for the same period as the treatment period; this period was called the offset time [12]. The effects of 5-year treatment with alendronate or denosumab were assumed to linearly approach zero at 5 years after stopping treatment, as in past reports [12, 23, 32].

Drug adherence consists of “persistence,” which is duration of treatment; “compliance,” which is evaluated with the medication possession ratio (MPR); and “primary non-adherence” [13]. We considered “persistence.” Strom et al. assumed that patients were at risk of dropping out during the first 3 years of treatment [13]; we used a similar setting. The persistence rate of alendronate use until the third year was derived from the prescription data of 13 Japanese university hospitals reported by Kishimoto et al. [22]. Because there is no literature on the 3-year persistence rate for denosumab use in Japan, we used the 2-year persistence rate of 2315 patients in Sweden reported by Karlsson et al. [24]. Because there were no data for the third year in that report, we estimated the 3-year persistence rate from the data of Karlsson et al., using the ratio of second-year to third-year alendronate persistence rates of Kishimoto et al. [22, 24] (Table 1).

We considered the reduction in fracture prevention effect resulting from partial drug compliance with the following equation: partial efficacy of fracture prevention effect = 1 − (relative risk of fracture by drug) × (MPR of drug).

For alendronate compliance, we used the MPR of weekly bisphosphonate in the first year from the report of Kishimoto et al. [22]. However, because there are no data on denosumab compliance in Japan, denosumab compliance information was obtained from the report by Hadji et al., which analyzed denosumab adherence in the first 12 months in routine practice in four European countries [25]. That study used the term “adherence” to described compliance.

The drug effect in patients who dropped out in the first year was assumed to be half of the original fracture-preventing effect in the first year. Patients who dropped out in the third year were assumed to have the original fracture-prevention effect in the first and second years and half the drug effect in the third year. After the third year, the drug effect decreased linearly, becoming zero in the fifth year.

The general strategy of osteoporosis treatment in this model was as follows. At the time of the initial hospital visit, the patient’s BMD was measured with DXA. Blood tests and X-ray of the lumbar spine were performed to confirm the presence or absence of secondary osteoporosis or vertebral fracture. After measuring biochemical markers of bone turnover, the doctor started osteoporosis treatment.

The patient visited the hospital once every 3 months and markers were measured within 6 months from the start of treatment. Patients treated with denosumab received daily calcium, magnesium, and natural vitamin D3 supplementation. Calcium and creatinine levels were checked every 3 months.

Hip fracture patients were assumed to receive osteoporosis treatment with denosumab or alendronate until their deaths.

The “no-treatment” group visited the hospital and continued follow-up with blood tests or DXA, but received no osteoporosis treatment. After fracture, they received fracture treatment and follow-up observation, but did not receive osteoporosis treatment. The “no-treatment” group was set for reference only for comparison with other treatments.

Costs

To estimate the opportunity cost of resource use from a societal perspective, we aggregated the direct medical costs borne by patients and third-party payers including public medical expenses and public nursing care expenses.

All costs were calculated using US$1.0 for Japanese Yen 100, for convenience. Costs are shown in Table 1.

Drugs costs were derived from a 2018 Japanese price list [26]. The cost of denosumab included the cost of daily calcium, magnesium, and natural vitamin D3 supplements.

The annual costs of hospital visits and examinations were calculated according to the Japanese tariffs in 2016 [33]. Treatment costs for hip fractures were calculated using the average values at a university hospital and two private hospitals reported by Kondo et al. [27]. Treatment costs for vertebral fractures were calculated using average values of treatment costs, including inpatient and outpatient treatment excluding surgery, from a questionnaire survey in Japan reported by Konno et al. [28]. The annual cost of level 5 nursing home care was calculated from the 2015 Japanese benefit for nursing home care [29]. We analyzed this model using a 3% discount rate for all costs.

Utilities

We used quality-adjusted life years (QALYs) as a measure of the treatment effect. Each utility value is shown in Table 1.

We used the utility values according to age for the health condition of elderly Japanese women reported by Nawata et al. [30]. We calculated the rate of change of the utility value resulting from fragility fractures in elderly Japanese women using the report of Hagino et al. [3]. The utility value after fracture was calculated by multiplying the utility value for each age in the event-free state by the rate of change due to fragility fracture. The utility value of the bedridden state was obtained from data reported by Imai et al. [31]. Outcomes were discounted at a rate of 3%.

Base-case analysis

In base-case analysis, we estimated the lifetime costs and QALYs of 5-year treatment with denosumab, alendronate, or no treatment in 75-year-old Japanese women with a BMD at 65% of YAM (T-score, − 2.87) and with a history of previous vertebral body fracture. The main purpose of this study was to compare the ICER of denosumab treatment versus alendronate treatment. The ICER was defined as follows:

ICER = incremental cost/incremental effectiveness = cost (denosumab treatment) − cost (alendronate treatment) / effectiveness (denosumab treatment) − effectiveness (alendronate treatment).

In addition, we calculated the ICERs for various combinations of age and BMD in the target population. Three levels of the YAM value (70% [T-score, − 2.54], 65% [T-score of − 2.87], 60% [T-score, − 3.2]) and three ages (75 years, 80 years, 85 years) were set; each combination of parameters was analyzed.

Sensitivity analysis

To clarify the robustness of the results of this study, deterministic analysis was performed. For the data range used in deterministic sensitivity analysis, the confidence interval shown in the literature was used. Costs were verified in the range of ± 30%. Changes in utility values were verified in the range of ± 10%.

Probabilistic sensitivity analysis was also performed with a 1000-times Monte Carlo simulation. The effects of drugs, utility values, dropout ratio, MPR of drugs, and odds ratio of death after hip fracture were randomly sampled from beta distribution and gamma distribution. Cost items were sampled from a triangular distribution.

Results

Validation

The number of fractures occurring in 10 years in this model was calculated and compared with the predicted fracture incidence according to fracture risk assessment tool: FRAX® over a 10-year periods [34]. In the no-treatment group of the base case, the incidence of hip fracture was 6.4% over 10 years and that of vertebral fracture was 28.1%. In other words, 10-year probability of major fracture was above 34.5%. In contrast, if we input BMD and age of our base case, using the average height and weight of 75-year-old Japanese women [35], FRAX predicted that the 10-year probability of hip fracture was 9.2% and that of the major fracture was 29%. Thus, the 10-year probability of vertebral fracture according to FRAX was below 20%.

Although the incidence of hip fractures in this model was slightly more conservative than that of FRAX, the incidence rates are similar. In contrast, the 10-year probability of vertebral fracture in this model was 28.1%, whereas the probability was less than 20% according to FRAX. In this model, vertebral fractures occurred somewhat more frequently than predicted by FRAX. We believe that the validity of this model was confirmed. If we set this model so that vertebral fracture could occur three or more times, the probability of vertebral fracture would further increase.

Cost-effectiveness according to base-case analysis

Table 2 shows the results of cost-effectiveness analysis of osteoporosis treatment for 75-year-old Japanese women with a BMD of 65% of YAM (T-score, − 2.87) and a history of previous vertebral fracture. Compared with alendronate treatment, the additional lifetime cost of 5-year treatment with denosumab was US$1770; the additional lifetime effect was 0.04 QALY. Thus, the ICER was US$40,241/QALY.

Table 2 Detailed results of base-case analysis

Table 3 shows the ICERs associated with different ages and BMDs (T-scores) in the target population. For 75-year-old women with a BMD of 60% of YAM (T-score, − 3.2), the ICER was US$22,759/QALY; for those with 70% of YAM (T-score, − 2.54), the ICER was US$63,941/QALY.

Table 3 ICERs associated with different ages and BMDs (T-score) in the target population

In the target population with a BMD less than 65% of YAM (T-score, − 2.87), which is an extremely low value, when willingness to pay was set at US$50,000 [36], denosumab was cost-effective in all age groups.

Deterministic sensitivity analysis

The results of deterministic sensitivity analysis are shown in Fig. 2. The parameters that had a significant effect on the outcome were the relative risks of hip fracture with denosumab and alendronate treatment, the compliance of denosumab and alendronate, the cost of denosumab, and the offset time of denosumab.

Fig. 2
figure 2

Results of deterministic sensitivity analysis. The dotted lines in the graph represent willingness-to-pay and the numerical value of the ICER in base-case analysis

Probabilistic sensitivity analysis

The acceptability curve and the scatter plot obtained from probabilistic sensitivity analysis is shown in Fig. 3. In the base-case analysis, the probabilities that the ICER of 5-year treatment with denosumab or alendronate will be less than US$50,000 were 52.3% and 47.7%, respectively.

Fig. 3
figure 3

Acceptability curve (a) obtained from probabilistic sensitivity analysis and scatter plot representing probabilistic sensitivity analysis comparing denosumab with alendronate (b)

Discussion

When willingness-to-pay was set at US$50,000, osteoporosis treatment with denosumab was cost-effective compared with weekly alendronate treatment for 75-year-old Japanese women with a BMD of 65% of YAM (T-score, − 2.87) and a history of previous vertebral body fracture. This study clarifies how the ICERs of denosumab treatment to alendronate treatment change with different combinations of age and BMD values. In the target population with a BMD of 65% or less of YAM (T-score, − 2.87 or lower), ICERs were lower than US$50,000. Our findings also suggest that for the same BMD, treatment of 80-year-old women is more cost-effective than treatment of the other two age groups. When the target treatment population was set at 85 years, the overall mortality rate increased, the number of prevented fractures decreased, and the efficiency deteriorated. Within each age group, the cost-effectiveness of denosumab tended to be better for populations with lower BMD.

In deterministic sensitivity analysis, the relative risk of hip fracture in patients taking denosumab versus alendronate had the greatest influence on results. This finding is similar to that of a previous cost-effectiveness analysis [32, 37]. To improve the robustness of cost-effectiveness analysis for osteoporosis treatment, it is important that the values of relative risk for hip fracture in patients receiving each drug be more specific.

The model was sensitive to denosumab compliance in deterministic sensitivity analysis. Jonsson et al. did not take compliance into consideration in their study because of lack of data, but proposed that not only a cost of drug and its fracture-prevention effects but also a country’s own persistence and compliance data are important [12]. Because there is little difference in the fracture prevention effect of alendronate versus denosumab for hip fracture, the high persistence rate and better compliance rate of denosumab are greatly related to its cost-effectiveness. In this study, the model was sensitive to the persistence rates of both denosumab and alendronate.

The parameter with the next greatest influence in sensitivity analysis was the combined cost of annual denosumab and daily calcium, magnesium, and natural vitamin D3 supplements. Karnon et al. conducted a cost-effectiveness analysis of denosumab, zoledronic acid, and alendronate in Australia [38]. In Australia, the price of alendronate has declined by approximately 65%, while the prices of denosumab and zoledronic acid have not declined. That study reported that denosumab was not cost-effective because its price had not declined. We conducted this analysis using the drug price revised in 2018 [26]; however, the result of cost-effectiveness analysis is affected by increases in drug prices. In this study, if the cost of denosumab was lower than US$820, which is currently 89.0% of the price, the ICER was always below the threshold in base-case analysis. To evaluate cost-effectiveness, more discussion of the cost of denosumab may be necessary.

The model was also sensitive to the offset time of denosumab in deterministic sensitivity analysis. In this study, the offset time of denosumab was set similarly to that of alendronate, as in previous reports [12, 32, 37]. It has been suggested that BMD may decrease quickly after discontinuation of denosumab, resulting in vertebral fractures [39]. Therefore, we need to verify in future studies whether it is reasonable to determine the offset time of denosumab in the same way as for alendronate. In this study, deterministic sensitivity analysis verified the effect of the offset time of denosumab in the range of 0 to 10 years. Assuming no offset time with denosumab, the ICER was US$60,906, which exceed the threshold. Offset time of denosumab has a large influence on the results, so careful consideration is necessary.

Several cost-effectiveness analyses of denosumab and alendronate have been conducted in other countries. Many of these studies have found that the cost-effectiveness of denosumab was better than that of alendronate [12, 37, 40]. The study reported by Karnon et al. is the only study reporting that denosumab was not cost-effective, because of its price [38].

Mori et al. performed the first cost-effectiveness analysis comparing denosumab with alendronate, and their study is the only prior research on the subject in Japan [32]. In an analysis of four age groups and multiple factors, that study found that denosumab was more cost-effective than alendronate. A comparison between that study and the present study is shown in Supplemental Table 2 [41, 42].

Our model was essentially not very different from that of Mori et al. Patients who had hip fracture could not have vertebral fracture. Other fragility fractures (e.g., wrist or pelvic fracture) were not considered. Mori et al. also set their model so that patients could have a maximum of two hip fractures, as in our model. The major difference between the models is that patients could have a maximum of two clinical vertebral fractures in our model, as discussed above.

Another difference between the structure of our model and that of Mori et al. is the determination of the transition probability for hip and vertebral fracture. To calculate transition probability, we used a formula to calculate the annual incidence rate of fracture according to age and BMD in accordance with the method of Moriwaki et al. [14]. In contrast, Mori et al. calculated the annual fracture rates by multiplying the age- and sex-specific fracture risk in the general population by the relative risk based on the presence of osteoporosis [32].

The advantage of the mathematical formula used in this study to determine the fracture transition probability is that it is possible to classify conditions according to age and BMD. Using this formula, it is possible to verify how drug efficacy changes with BMD and age. Evaluating efficacy with a combination of age and BMD is useful when making decisions in actual clinical situations and may also be useful for policy-making decisions. The validity of this formula can be discussed, but it has been sufficiently verified in previous studies [14]. In this model, we confirmed the validity using FRAX.

Mori et al. used the fracture incidence rate of each age and multiplied it by the relative risk of osteoporosis. One great advantage of that method is that it is very concise and is based on epidemiological data. However, it is difficult to classify conditions based on BMD, and there is the problem that the “degree” of osteoporosis cannot be verified.

There are also differences in the data on the drug effects of denosumab and alendronate. Mori et al. used data from the network meta-analysis of 2012 [9, 32]. The data used in the present study were derived from sub-analysis of large clinical trials of denosumab and alendronate use [19,20,21]. The reason for using these data was to match the subject of the present study, which was elderly women with a high risk of fragility fracture and a history of previous vertebral fracture.

Although there were some differences in the data used in the two studies, the cost-effectiveness of osteoporosis treatment using denosumab was better than that of alendronate treatment in both studies.

A strength of this study is that the analysis was performed with conditionalization according to BMD and age group. No previous studies of denosumab have examined how the combination of age and BMD affects cost-effectiveness. In clinical practice, doctors select drugs after considering several parameters, such as patient age, BMD, and history of previous fracture. As this study indicates, it is clinically important to analyze the cost-effectiveness of osteoporosis treatment according to BMD and age.

Although the cost-effectiveness of denosumab was good for the target population in base-case analysis, the cost-effectiveness may be higher for 80-year-old patients and for those with markedly low BMD values. Our results indicate that from a health-economic viewpoint, denosumab treatment should be considered for elderly osteoporotic women with markedly low BMD. Our results can be the basis for new treatment decisions in daily practice and for determining medical policy.

Health economic analysis indicated that denosumab treatment may not be recommended for patients over 85 years of age, even those with very low BMD, because the efficiency decreased as the mortality rate increased and the total number of preventable fractures decreased for patients aged 85 years and older. It is necessary to comprehensively evaluate the indications for denosumab treatment in super-aged patients, considering their background and risks.

There are several limitations in this study. First, other fragility fractures such as wrist fractures and pelvic fractures were not considered. The main reason for this exclusion is that domestic data on other fragility fractures are insufficient.

Second, data on the persistence rate of 3 years or more of denosumab treatment in Japan are insufficient, so we used data from other countries. The data we used were 2-year persistence rates for denosumab, so we assumed a third-year persistence rate.

Third, the offset time of effect after discontinuation of denosumab is controversial. It is necessary to sufficiently evaluate how to set the offset time for denosumab in the future.

Fourth, we did not consider the side effects of denosumab or alendronate. Side effects include gastroesophageal reflux disease with alendronate and cellulitis with denosumab; common problems of bone resorption inhibitors include osteonecrosis of the jaw and atypical femoral fracture. However, because these serious side effects are very infrequent, the current consensus is that the benefits of fracture prevention with osteoporosis drugs exceed the risk of these side effects [43]. Bone fusion is difficult in atypical femoral fractures, resulting in a high rate of nonunion [44], which decreases QOL and is a clinical problem that should not be ignored. It is necessary to consider incorporating these side effects into future models.

In this study, we set the willingness to pay threshold at US$50,000. Thresholds for judging cost-effectiveness analysis remain controversial in Japan. However, in many domestic medical economic analyses to date, the threshold value has been based on the report of Shiroiwa et al. [36]. In that report, the threshold in Japan was five million JPY, which we recognize as being common in Japan [36]. The Japanese cost-effectiveness evaluation expert group of the Central Social Insurance Medical Council (Chuikyo) mentions the report of Shiroiwa et al. as a criterion, and a willingness-to-pay threshold of US$50,000 has been regarded as the present standard [36, 45]. However, careful discussion is required to establish accurate standards in Japan.

In conclusion, osteoporosis treatment with denosumab was cost-effective for 75-year-old Japanese women with a BMD of 65% of YAM (T-score, − 2.87) and a history of previous vertebral body fracture, compared with treatment with weekly alendronate, when willingness-to-pay was set at US$50,000. Denosumab treatment may be cost-effective for elderly women over 75 years of age with a BMD value of 65% or less of YAM (T-score, − 3.2 or lower).