Introduction

Osteoporosis, which is characterized by compromised bone strength and increased susceptibility to fractures, which lead to deterioration in the QOL and increased mortality, is a national burden on an aging society [1, 2]. However, recent studies indicate that treatment with a parathyroid hormone, bisphosphonates or a selective estrogen receptor modulator (SERM) [39] may decrease the risk of fractures in patients with osteoporosis.

Although bisphosphonate treatment currently represents the most powerful form of treatment available for fracture prevention in osteoporotic patients, it has not succeeded in completely preventing osteoporotic fractures [35, 79]. Therefore, concurrent treatment of osteoporosis has been frequently used by Japanese practitioners without any concrete evidence regarding fracture reduction. Since the concept of evidence-based medicine (EBM) has been introduced to clinical practice since the 1990s [10], the Japan Osteoporosis Society and the Japanese Society of Bone Mineral Research have edited the clinical guideline for treatment of osteoporosis (Chief editor: Hajime Orimo [11]). However, the writers recognized that there was a lack of evidence in the effectiveness of concurrent treatment of osteoporosis. Furthermore, it was expected that the patients who visit clinics have varying degrees of risk of fracture, which may differ from the degree of those who participated in development trials for bisphosphonates. This possibility would make it easier to obtain pragmatic evidence in general clinical practice.

Starting in 2000, the Japan Osteoporosis Society had planned to investigate the effectiveness of treatment of osteoporosis in order to provide evidence to general practitioners. Before constructing evidence, some feasibility studies were required to confirm the consensus in the diagnosis of incident fractures among the researchers and to elucidate the risk of future fractures in the patient population. In addition to these efforts in the field of osteoporosis, the Japanese government also established an ethical guideline for clinical trials [12], and the International Committee of Medical Journal Editors launched a clinical trial registry [13]. Such kinds of progress in the circumstances of clinical trials have enabled for investigator-initiated clinical trials in general practice.

The Adequate Treatment of Osteoporosis (A-TOP) study group was established in 2000 [11] in affiliation with the Japan Osteoporosis Society and organized a team for clinical trial management. The team consisted of clinical investigators (planning and analysis), foundation (funding) managers, officers from non-profit organization (data management) and several companies (data collection). This was the first joint team to create the post-making evidence for osteoporosis. In November 2003, A-TOP initiated a randomized clinical trial referred to as the Japanese Osteoporosis Intervention Trial (JOINT). The purpose of JOINT was to confirm the clinical significance of concurrent use of osteoporotic drugs. The first protocol, named JOINT-01, was initiated in 2002, but was suspended the following year due to a change in drug labeling. A second protocol, named JOINT-02, was established to clarify the effect of adding 1-alpha-hydroxycholecalciferol (alfacalcidol) to alendronate (ALN), using the incident fracture rate as the primary endpoint. In this paper, the rationale, organization and study design of JOINT-02 are introduced.

Rationale and aims

In 2002, the Japan Society of Osteoporosis sent a letter to randomly selected practitioners and enclosed a questionnaire regarding whether concurrent treatment using bisphosphonate and another drug was being utilized to treat osteoporosis. Surprisingly, 87.8% (79/90 practitioner) of the doctors who responded did have experience using concurrent treatment [14]. The most frequent drugs used in concurrent treatment with amino-bisphosphonate were alfacalcidol (93.7%), followed by calcitonin (50.6%), as there were expectations for these drugs to exhibit more potent inhibition of fracture occurrence or more significant increase in BMD, even though there was no apparent evidence. In addition to the lack of evidence related to fracture prevention, the safety profile of concurrent treatment had not been evaluated. Thus, evaluations of the effectiveness and safety of concurrent treatment were urgently required. Etidronate [15] and ALN [8, 9] were used as the drugs to confirm anti-fracture effectiveness in comparison to alfacalcidol in Japanese osteoporotic patients. However, these clinical trials were carried out at specific institutions and were initiated by experts in accordance with tight regulations. As a result, there may have been differences in the selected treatment and in the backgrounds of the patients between treatments conducted at these institutions and those conducted in general practice. In addition, the adherence of the treatment is expected to be lower in general practice than in institutions with experts who are committed to developmental trials. Thus, pragmatic study is urgently needed to evaluate whether bisphosphonates are effective to the same extent at the level of general practitioners as compared to the prior study (Phase III study).

Feasibility studies

The Japan Osteoporosis Society started discussions to execute a national clinical trial for obtaining evidence regarding the effectiveness of concurrent treatment in 2000. An executive committee of A-TOP was organized in 2002 and planned on forming a consensus regarding judgment standards for pre-existing fractures and incident vertebral fractures [16]. Morphometric criteria for incident fractures combined with a semi-quantitative assessment were thought to provide useful information on the study of clinical osteoporosis, especially for international comparisons. Next, to assume the number of participants in the clinical trial, the incident fracture rate and the risk of incident fracture were analyzed in the patient population, and the number of participants with sufficient statistical power [17] was calculated. Bone resorption marker was an independent risk factor for incident vertebral fractures in Japanese women. When the newly discovered risk factor was incorporated into the inclusion criteria in addition to conventional selection criteria such as age, prevalent fractures and bone mineral density, a reduction of about 40% in the estimated sample size was achieved. Thus, measurement of bone resorption markers is useful in reducing the sample size and the observation period in fracture-prevention studies carried out for developing drugs used to treat osteoporosis.

Materials and methods

Study design

Objective

JOINT was the first national, prospective, randomized, multicenter, open-labeled, blinded endpoints, controlled trial for osteoporosis made up mainly of practitioners of investigators in Japan. The objective of JOINT-02 was to clarify additive efficacy in terms of fracture prevention and safety, QOL and adherence in simultaneous use of alfacalcidol and ALN.

Subjects, intervention and endpoints

Confirmations regarding the patients were made by practitioners based on the inclusion and exclusion criteria (Table 1) after obtaining written informed consent. The participants were selected by the practitioners and registered by Japan Clinical Research Support Unit (JCRSU), and then randomly allocated with a modified minimization method using age, number of pre-existing vertebral fracture number, bone mineral density (BMD) and value of bone metabolic marker into the group to be administered only ALN or the group that was to be administered both ALN and alfacalcidol. Registration and allocation of the participants were carried out on the Internet. After initiation of the assigned treatment, clinical data were collected at intervals of half a year for 2 years by I’cros Co., Ltd., through their visiting data collection service. Data were input to the database using a web system developed by ING Corporation. The primary endpoint was to compare the incident vertebral fracture rate between the intervention arms. The secondary endpoints were to compare the differences in the time to first incident vertebral fracture, non-vertebral fracture rate, bone mineral density, adherence, QOL and safety (Fig. 1). In addition, sub-group analyses categorized by baseline characteristics such as age, body mass index (BMI), serum 25-hydroxyl vitamin D levels, the number of pre-existing vertebral fractures and fracture grade were candidate factors. If participants wanted to change the designated treatment because of side effects or occurrence of fractures, they were permitted to do so, and follow-up observations were continued. Please see Table 1 and Fig. 1.

Table 1 Inclusion and exclusion criteria
Fig. 1
figure 1

Study design and outcomes

Sample size

Assumptions regarding the fracture rate in the ALN group were made based on a paper by Kushida et al. (Phase III trial for alendronate), in which it is reported that there was a 12.2% fracture rate during observations conducted over 2 years [8]. Since there are not much data on concurrent use of ALN and alfacalcidol [18], the authors’ expectations were such that the effects of ALN would be added to those of alfacalcidol and that the hazard ratio of the alfacalcidol combined arm to ALN alone would be 0.64 [19]. The sample size was then estimated to be 890 cases per arm (two-sided alpha = 0.05, power = 0.8), taking account of a dropout rate of 10% referring to the value of the prior clinical trial of fracture intervention [4].

Fracture evaluation

X-ray films of conventional lateral radiographs of lumbar and thoracic vertebrae were taken and collected by I’cros Co., Ltd. After masking the patient’s information, two independent readers (orthopedist, TN and radiologist, MF) simultaneously reviewed films from T4 to L4 in chronological sequence and graded vertebral fracture based on a semi-quantitative method [20]. Before the start of the study, these two observers held meetings to make adjustments between their own criteria for grading vertebral fractures. When the diagnosis of pre-existing fractures made by the reviewers differed from those made by the practitioners, the reviewers’ diagnosis was adopted preferentially. If inconsistencies arose between the readers in diagnosing pre-existing and incident vertebral fractures, the two readers negotiated between themselves to reach a consensus. Incident bone fractures other than those of the vertebrae were comprehended from the chart, and the occurrence of fractures was confirmed based on X-ray films or a record of the operation.

Clinical data

BMD at the lumbar vertebrae, hip (proximal femur), distal radius (dual energy X-ray absorptiometry) or left-sided second metacalpal bone (microdensitometry) was measured at baseline and at 6-month intervals for 2 years at each institute. The data of BMD obtained from the different machines and from different bone sites were calculated as the percentage change in each time point from the baseline value. The statistical difference in change of BMD between the group that received combined treatment and the group that was administered only ALN was compared based on each set of data for BMD for the different bone sites. Body height was measured at baseline, 12 and 24 months. Bone turnover markers (urinary type I collagen cross-linked N-telopeptide or urinary excretion of deoxypyridinoline) were measured at baseline and again 6 months after initiating treatment. QOL was assessed by using self-administered questionnaires (JOQOL and EQ-5D) at baseline and at 6, 12 and 24 months after initiating treatment [21]. Serum samples were sent to the central laboratory (SRL Co., Japan), and 25-hydroxyl vitamin D concentrations were measured. Other routine biochemical examinations were carried out at baseline and at 2 years after initiating treatment in order to estimate biochemical adverse events. All adverse events were reported to JCRSU, coded by MedDRA, and categorized as either “known” or “unknown.” If an unknown adverse effect occurred, it was reported to the investigator and ethical committee.

Ethics and registration

Ethical issues regarding protocol were reviewed by the ethical committee for JOINT under the Declaration of Helsinki (Dr. Rikushi Morita, Chairman). If it turned out that a patient was at a disadvantage under observation, the ethical committee was given permission to stop the protocol. This study was registered at UMIN-CTR (University Hospital Medical Information Network—Clinical Trial Registry) with the number C000000001.

Statistical analysis

Analysis of the intent to treat principle was applied to the statistical analysis. Efficacy analysis uses a full analysis set (FAS), and all of the enrolled patients are applied to the analysis except for patients without efficacy data, patients who do not correspond to inclusion criteria and patients who do not receive treatment. The PPS (protocol per set) group is defined as consisting of patients without any serious protocol violation.

Recruitment of the practitioner

The explanatory meeting of the protocol and registration was held in all of Japan. The executive members of the A-TOP research group were responsible for the presentation of the protocol and for the recruitment of study institutions and practitioners. The A-TOP committee created the WEB site on the Internet so that the registration of the study could be executed directly. I’cros Co., Ltd., was also involved as a collaborator in calling practitioners.

Results and discussion

Several principles had to be considered for acceptance of concurrent treatment: firstly, the concurrent treatment should be clinically and statistically significantly more effective than the basic treatment; secondly, the concurrent treatment should be of the same level of safety as single treatment; thirdly, the concurrent treatment should be cost-effective. Although these principles should be evaluated before adopting concurrent treatment, the authors have applied concurrent treatment to osteoporosis widely, without any background evidence. Among these principles, the authors have decided to evaluate the first two issues, effectiveness and safety, in the present study. Since this type of evaluation is absolutely required by a clinician, a researcher-initiative study was considered as being the most suitable type of evaluation. This was the reason why the authors decided to use a researcher-initiative clinical trial in determining the effectiveness of concurrent treatment for osteoporosis. The JOINT-02 protocol was the first randomized, controlled trial conducted nationwide for osteoporosis in Japan initiated by researchers, and its scale was also the largest ever. It was therefore necessary for various organizations to collaborate together, and there have been no previous reports on how to manage the PROBE trial in Japan. This is why we wanted to report the design of JOINT-02. In this paper, we have presented the organization of the A-TOP research group and execution of the JOINT 02 protocol. This is because we believe that this report should help a researcher who is willing to build a new nationwide investigation that is constructed by an organization of clinical research work.

Since the primary aim of JOINT-02 was to determine whether concurrent treatment using ALN and alfacalcidol is superior to treatment using ALN alone in terms of fracture prevention, true (“hard”) endpoints such as vertebral fractures or long bone fractures were selected as the primary endpoint. The diagnosis of whether vertebral fractures were present or not on the X-ray films was made by two independent reviewers who did not have any information about the patient; when the judgment of fractures was split between the two reviewers, the reviewers negotiated with each other. Identifying vertebral fractures is more difficult than identifying long bone fractures due to some cases of new vertebral fractures not showing clinical symptoms and the shape of the vertebral body making it difficult at times to recognize whether there is a fracture. Therefore, to avoid misdiagnosis, diagnosis of pre-existing and incident vertebral fractures was made by two different observers. In cases where there was a discrepancy in the diagnosis of vertebral fractures between the reviewers and the practitioner, which occurred with regard to pre-existing fractures, the reviewers’ judgment was given priority over that of the practitioner.

Surrogate (“soft”) endpoints such as change in BMD or bone turnover markers were considered to be inadequate as primary endpoints in making conclusive statements regarding the efficacy of concurrent treatment. In this type of study, the soft endpoint (BMD or biomarkers) will connect dropout bias less effectively than the case-selected hard endpoint, because such markers are not able to be made blind to the clinicians. Furthermore, previous studies indicate that changes in BMD or bone markers do not predict future fractures [2224].

In recent literature, it has been reported that poor adherence of bisphosphonates leads to a decline in the beneficial effects of this drug on bone [2528]. It is expected that in contrast to prior developmental trials, this current study may have a higher dropout rate, since in a researcher-initiative study, the registered practitioner is not forced to maintain adherence very strictly. As a result, adherence in the present study may resemble the actual circumstances of adherence to bisphosphonate treatment by a general practitioner. It will be interesting to determine whether adherence in this study modifies fracture prevention by alendronate.

Although a careful plan for this study has been set up, the results will be applied to osteoporosis patients with the same background as the present study population, but not adapted to the entire osteoporosis population. Despite this limitation, we believe that the results will give us very important information regarding the concurrent treatment of osteoporosis.