Introduction

Osteoporotic fracture has been recognized as a worldwide public health concern since the last century. Around the world, an estimated 200 million people are affected by osteoporosis [1], and every year, there are around nine million fractures caused by osteoporosis [2]. Within the population over 50 years of age, around 20% of males will experience osteoporotic fractures, while more than 30% of females will suffer from the same condition [3,4,5]. As a consequence of the progressive aging of the global population, the incidence of osteoporotic fracture is expected to rise rapidly. In the USA alone, the number of hip fractures is estimated to triple by 2040 [6]. The economic and social burdens caused by osteoporotic fractures are already regarded as critical issues among many developed countries [7,8,9], and the social burdens caused by osteoporotic fractures have also arisen among developing countries. For instance, in China alone, around 70 million Chinese are affected by osteoporosis, which alone results in around 700,000 hip fractures in China every year [10].

Lithium is one of the most effective medications for the long-term treatment of bipolar disorder [11, 12]. It has been demonstrated to be efficient in the treatment of acute mania and depression and to be serviceable in decreasing the risk of suicide [13]. Our previous meta-analysis demonstrated that antidepressants, including tricyclic antidepressants and selective serotonin reuptake inhibitors, are associated with increased fracture risk [14, 15]. However, the effect of lithium use on fracture risk and bone mineral density (BMD) is uncertain. Some studies suggest that lithium use is related to bone loss since lithium medication usually leads to a hyperparathyroid state [16,17,18,19] that increases the risk of bone loss, while other studies suggest that lithium use is associated with an increase in BMD [20] and a decrease in fracture risk [21,22,23].

We conducted a comprehensive systematic review and meta-analysis to assess all eligible observational studies in order to examine the effect of lithium use on BMD and fracture risk, both quantitatively and qualitatively. We also investigated whether such an effect varied by sex, age, geographical location, study size, alcohol use, or adjustment for comorbidities.

Materials and methods

This systematic review and meta-analysis was guided by Meta-analyses of Observational Studies in Epidemiology (MOOSE) [24], and the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) statement [25] was used as a reference. A protocol was also preregistered in the International Prospective Register of Systematic Reviews, where the objectives, literature search strategy, inclusion criteria, exclusion criteria, methods of study selection, data abstraction, and methods of statistical analysis were elaborated upon. The protocol is available at the following URL:https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=60499.

Eligibility criteria

Observational studies that investigated the association between lithium use and fracture risk or BMD were eligible for inclusion in our systematic review, while studies were eligible for the meta-analysis if they met the following criteria: (1) observational studies, (2) human participants, (3) lithium use as exposure, and (4) risk of fracture or BMD as the outcome, with corresponding 95% CIs or other data that could be used for variance calculation. Exclusion criteria for our meta-analysis were as follows: (1) risk of osteoporosis without fracture as the outcome and (2) duplicated studies.

Literature search

A comprehensive literature search of PubMed through February 16, 2018, was conducted (Supplemental Table 2). “Lithium,” “bone loss,” “bone fracture,” “fracture,” “bone density,” and “osteoporosis” were the medical subject heading (MESH) terms for our literature search. The keyword “bone mineral density” was also used during the search. We restricted the search to the English language and to human studies only. Using similar strategies, searches of MEDLINE Web of Science and Embase database were performed as well. Three investigators (B.L., S.Z., and A.D.R.) conducted the literature search independently. Librarians were consulted to ensure the comprehensiveness of the literature search. We also searched unpublished data, including abstracts, theses, and dissertations, by using the Google search engine. The reference lists of original studies and relevant meta-analysis articles and review articles were reviewed independently by the three reviewers.

Study selection

To identify potential eligible studies, the titles and abstracts of all references obtained in the literature search were screened independently by the three reviewers. During this stage, references that were agreed upon by all three reviewers to be irrelevant to this systematic review and meta-analysis were excluded, while the remaining references were assessed for eligibility by examining the full-text contents. All three reviewers independently scrutinized the full contents to assess eligibility. Fleiss’ kappa statistic, as an adaptation of Cohen’s kappa for three or more raters, was used to assess the agreement between the three investigators [26]. Areas of uncertainty or disagreement were adjudicated by a fourth reviewer (Q.W.). The process of study selection is presented as a PRISMA flow diagram (Fig. 1).

Fig. 1
figure 1

Flow diagram of study selection

Data abstraction

The following study characteristics were extracted: author(s), study name, publication year, study population and region, study design, and participant age, sex, and race. Additional characteristics such as study size, dose of lithium use, former use of other antidepressants, and other medications were extracted for further analysis. We also extracted hazard ratios (HR), relative risks (RR), and odds ratios (OR) of fracture associated with lithium use and the corresponding standard errors. For the studies with BMD as the outcome, the mean differences between lithium users and non-users and the corresponding standard deviations or standard errors were abstracted. If the original studies contained more than one estimate for the same outcome, we chose the one that was adjusted for the largest number of confounders. All data abstraction was conducted using REDCap, which is a browser-based application designed for researchers performing data collection. Three reviewers (B.L., S.Z, and A.D.R) performed data abstraction and data entry independently. Fleiss’ kappa was calculated to examine the level of inconsistency among the three reviewers, and any inconsistency with the data abstraction and data entry was discussed and adjudicated by a fourth reviewer (Q.W). To ensure accuracy, all three reviewers examined the data at least twice. Since we were able to obtain all the necessary information from the papers alone, we did not contact the authors for more information.

Study appraisal

The Newcastle-Ottawa quality assessment scales for cohort studies and case-control studies were used to examine risk of bias and to gauge the methodology quality of the included studies. For each study, we calculated a quality score based on a prespecified questionnaire [27]. For cohort studies, the following eight criteria were used for quality assessment: representativeness of the exposed cohort, selection of non-exposed cohort, ascertainment of exposure, demonstration that outcome of interest was not present at the start of the study, comparability of cohorts on the basis of the design or analysis, assessment of outcome, follow-up duration long enough for outcomes to occur, and adequacy of follow-up of cohorts. For case-control studies, the following eight criteria were used for quality assessment: adequacy of the case definition, selection of controls, definition of the controls, representativeness of the cases, comparability of cases and controls on the basis of the design or analysis, ascertainment of exposure, the same method of ascertainment for cases and controls, and non-response rate. The score for both cohort and case-control studies ranged from 0 to 9, where a score of 9 indicates the strongest regarding methodology. Information about the Newcastle-Ottawa Scale for each study is summarized in Supplemental Table 2. As the MOOSE group [24] recommends, the quality scores were not used as weights in conducting the meta-analysis. However, the quality scores were used in the sensitivity analyses.

Statistical analysis

The primary summary measures in the meta-analysis were confounder-adjusted RRs for fracture. Since the absolute risk of osteoporotic fractures is low, OR approximates RR [28]. Additionally, HR is broadly equivalent to RR [29]. Hence, in our statistical analysis, we approximated HRs and ORs as RRs. The natural logarithmic-transformed RRs were used to stabilize the variance, and the variances of the log-transformed RRs were calculated using CIs or p values given in the original studies. The reciprocal of the variance for each study was used as the weight of the corresponding study to calculate the overall effect size. The mean difference of BMD between the lithium-treated group and control group was calculated for each study, and the variances for the mean differences in BMD were computed by pooling individual variances or using CIs.

Several sensitivity analyses‚ in which the effect of lithium use on fracture risk were analyzed under different inclusion criteria, were conducted to assess the robustness of the major findings in our meta-analysis. We also performed several prespecified subgroup analyses to determine if fracture risk associated with lithium use was influenced by age, sex, geographical location, sample size of the study, or previous fractures and if the study had been controlled for other antidepressant uses.

The heterogeneity was assessed by Cochran’s Q statistic and Higgins’ I2 statistic. We expected the existence of heterogeneity since we had combined the outcomes from both case-control studies and cohort studies. Moreover, the participants from the original studies were different in terms of sex, age, race, ethnicity, location, and the fact that the studies had distinctive settings. Nevertheless, Cochran’s Q statistic (p = 0.61) and zero Higgins’ I2 indicated no significant heterogeneity among the fracture outcomes. Although no significant heterogeneity was detected, a random effects model was employed in the meta-analysis instead of a fixed effects model since the included epidemiologic studies were conducted differently with regard to study setting, study design, and study population. Thus, the true effect size may differ from study to study [30].

To provide a visual inspection of publication bias, a funnel plot was generated, and to examine publication bias quantitatively, Egger’s regression test was employed. All data analyses were performed using R statistical software.

Results

Study characteristics

A total of 3819 relevant references were identified in our initial search, and after removing any duplicates, 3057 records were identified as potential references. We screened the titles and abstracts of all the references and identified 65 studies for full-text review, with moderate agreement among the three investigators (k = 0.75). After full review of these 65 studies, eight studies [16, 20,21,22,23, 31,32,33] met the inclusion criteria for systematic review, with high agreement among three investigators (k = 0.92). All eligible studies were published in English. The characteristics of the included studies are presented in Table 1. Six [21,22,23, 31,32,33] reported risk of fracture as an outcome, while two [16, 20] reported BMD as an outcome. For two studies that reported BMD as an outcome, both reported PTH levels. Both studies reported BMD at lumbar spine and femoral neck. However, only one study [20] reported BMD at trochanter and Ward’s triangle. Of the two studies that reported BMD as an outcome, both found that the lithium-treated groups were associated with higher BMD at the lumbar spine and femoral neck compared to the control groups. Of the six studies that reported risk of fracture as an outcome, all of them found that lithium use was associated with a decreased risk of fracture.

Table 1 Characteristics of studies included in the meta-analysis and systematic review

All eight studies [16, 20,21,22,23, 31,32,33] were eligible for the meta-analysis. However, two of the eight studies were conducted with the same population [23, 33]. From these two studies, we selected the one that defined the exposure consistently with the other four studies that reported fracture risk [33] and we excluded the other [23]. Among the seven studies eventually included in the meta-analysis, five were case-control studies [16, 20,21,22, 33] and two were cohort studies [31, 32]. In total, there were 1,134,722 individuals that comprised our systematic review and meta-analysis. The number of participants ranged from 50 in the study by Nordenstrom [16] to 498,347 in the study by Vestergaard [23]. The mean age of participants varied from 43 in the study by Su [32] to 64.2 in the study by Bolton [31]. Among all the studies included, only the study by Bolton [31] reported the proportion of postmenopausal women. In this study, 99% of women participants are postmenopausal. All the studies included both male and female participants, and all the studies controlled for both sex and age. Alcohol consumption, as a potential risk factor for fracture, was controlled for in three studies [31,32,33], and the Charlson Comorbidity Index was controlled for in three studies [31,32,33]. The Newcastle-Ottawa Scale quality scores ranged from 7 to 9, with six studies scoring greater than 7 [16, 20, 23, 31,32,33].

Meta-analysis

In the five studies that reported risk of fracture, three case-control studies reported ORs [n = 1,024,984] and two cohort studies reported HRs [n = 109,485]. The approximate RRs (95% CIs) of fracture due to lithium use for each study and the overall RR are presented in the forest plot (Fig. 2). The approximate RR was less than 1 for each of the five studies included, and overall, a 20% decrease in fracture risk (RR = 0.80, 95% CI, 0.73–0.87, p < 0.001) was associated with lithium use. Both Cochran’s Q statistic (p = 0.60) and Higgins’ I2 statistic (0%) suggested that no heterogeneity was present.

Fig. 2
figure 2

Risk of fracture associated with lithium by individual studies and by all studies combined. The horizontal lines represent 95% CIs for the effect estimates. The size of the square boxes specifies the weights of studies. The diamond stands for the overall effect size

Both studies that reported BMD as the outcome were case-control studies [16, 20]. Increases in mean BMD at the lumbar spine (0.06 g/cm2, − 0.15–0.26, p = 0.62) and femoral neck (0.06 g/cm2, − 0.15–0.26, p = 0.60) in the lithium-user group were observed.

Sensitivity analysis

All of the estimated fracture risks associated with lithium use were significant under the different study inclusion criteria (Table 2). For instance, the overall RR almost remained unchanged when only the studies with more than 100,000 participants were included in the analysis. When only the studies with less than 100,000 participants were included in the analysis, the overall RR decreased slightly to 0.74, and when only the studies from Europe were included, the overall RR increased slightly to 0.81. When excluding the two studies that did not provide the definition of exposure as “ever use of lithium” [31, 32], the overall RR varied little. Furthermore, when the analysis was limited to the studies with quality scores greater than 7 [31,32,33], the overall RR decreased slightly to 0.78. Since all the original studies included in the meta-analysis reported the number of female participants and male participants specifically, we also calculated the gender ratios of the studies, where the gender ratio is defined as the number of female participants to the number of male participants. When we only included the studies with gender ratios greater than 2 [22, 31], the overall RR decreased significantly to 0.68. When the analysis was confined to the studies that adjusted for the Charlson Comorbidity Index [31,32,33], the overall RR decreased slightly to 0.78. When the analysis was confined to studies that had adjusted for previous falls [21, 33], the overall RR was 0.74. When only the studies that adjusted for psychotic diseases [22, 31,32,33] were included, the overall RR decreased to 0.76. Finally, when the studies that did not adjust for prior use of bone active drug were excluded [22, 33], the overall RR increased slightly to 0.85.

Table 2 Risk of fracture associated with lithium in studies under different inclusion criteria

Publication bias

The funnel plot (Fig. 3) revealed no publication bias, while the result from Egger’s regression test suggested that any publication bias was insignificant (p = 0.63). Since the results from both the funnel plot and Egger’s test suggested no publication bias for this study, the trim-and-fill correction procedure was not implemented.

Fig. 3
figure 3

Funnel plot with 95% confidence limits. The horizontal axis represents the effect size. The vertical axis stands for the standard error

Discussion

A comprehensive systematic review and meta-analysis were conducted to assess the association between lithium treatment and risk of fracture, based on all eligible studies from a wide range of geographical areas and subjects. In our systematic review, results from the six studies with fracture data were consistent in finding that lithium treatment is associated with a decreased fracture risk, while the two studies with BMD data found that individuals treated with lithium had a higher BMD compared to those who were not treated with lithium. Meanwhile, the pooled overall result from our meta-analysis found that lithium use was associated with a 20% decreased risk of fracture. Such results were consistently significant in all sensitivity analyses and subgroup analyses (Table 1), which indicate that our findings are robust. Moreover, the pooled results of the two studies that reported BMD suggested that lithium use is associated with an increased BMD at the lumbar spine and femoral neck. All the results that we obtained suggest that lithium use may decrease risk of fracture through its effect on BMD.

The mechanisms through which lithium may decrease risk of fracture are likely to be complex. Lithium may have a positive effect on BMD by affecting the transport of calcium. Research on animals found that lithium activates the canonical WNT path through its effect on the enzyme named GSK-3β [34]. Lithium may also have an effect on the calcium-sensing receptor transduction system in organs such as renal tubules [35]. By affecting the transport of calcium in a positive direction, lithium creates a friendly environment for the absorption of calcium, hence leading to increased BMD. As several former studies have suggested, decreased BMD is associated with increased risk of fracture [3, 36]. Therefore, lithium use is likely to decrease the risk of fracture by increasing BMD. In addition, bipolar disorder is associated with increased fracture risk [32, 37]. As an effective and widely used medication to treat bipolar disorder, lithium may decrease fracture risk by alleviating symptoms of bipolar disorder. Moreover, bipolar disorder is found to be associated with poor health behaviors such as smoking and increased alcohol consumption [38,39,40], and such health behaviors have been observed to affect bone metabolism [41]. Thus, for patients with bipolar disorder, lithium treatment may decrease the risk of fracture by alleviating the above conditions associated with bipolar disorder. However, several studies [16,17,18,19,20] suggest that since lithium use is associated with increased PTH level, long-term treatment of lithium might be associated with hyperparathyroidism. Hyperparathyroidism may induce hypercalcemia, which leads to bone resorption. Hence, it is likely that an appropriate dose of lithium medication is associated with slight increase in PTH level, which results in anabolic effect on bones and a decrease in the risk of fracture. The appropriate dose for lithium medication, which leads to a protective effect on bone, is warranted for additional investigation.

There are some limitations in our meta-analysis and systematic review. First, since only studies in the English language were included, language bias was expected to be an issue. To address this, we performed an additional search in PubMed, Embase, and MEDLINE for non-English-language studies only. However, after reviewing all the abstracts, no additional studies met our inclusion criteria. Secondly, important sources of heterogeneity such as variation in lithium dosage, duration of lithium use, and associated indications could not be assessed, because this information either was not available or was mixed in most of the included studies. Thirdly, the existence of other diseases may have confounded the effect of lithium use. However, even with the inclusion of only the studies that had been adjusted for comorbidities, our results remained consistent. In addition, because there are more case-control studies and fewer cohort studies that met inclusion criteria for this meta-analysis research, our findings are subject to selection bias from these case-control studies, even though our analysis suggested no evidence for publication bias in this meta-analysis. Finally, because a limited number of original studies were eligible for the meta-analysis, a meta-regression could not be conducted. Instead, we conducted sensitivity analyses and subgroup analyses, using the information that was available in the original articles.

In summary, we conducted the first systematic review and meta-analysis on the association between lithium use and fracture risk and demonstrated that lithium use is associated with a significant decreased risk of fracture. As a highly prevalent condition, osteoporotic fracture has caused an elevation of social and economic burden worldwide. Thus, the discovered link between lithium treatment and risk of fracture will have a crucial impact on global public health. Future large perspective studies are warranted to explain and expand upon how lithium is associated with a decreased risk of fracture and its association with BMD.