Introduction

Dual-energy X-ray absorptiometry (DXA) is an excellent clinical tool for the diagnosis and management of osteoporosis [1]. Despite its clinical utility, however, DXA does have some limitations. Specifically, it is influenced by bone size; thus, bones with identical volumetric bone mineral density (vBMD), but differing in size, will have different areal BMD (aBMD) values, with the larger bone having a greater aBMD. This effect of bone size is the key reason men have higher aBMD values than women at all skeletal sites, despite similar (or lower) vBMD values [2, 3].

While osteoporosis was originally defined in women, relative to young normal women (the “young reference value”), as a T-score of ≤−2.5 (i.e., aBMD 2.5 standard deviations or more below the mean of young normal women) [4], the extension of this definition to men has been controversial [2, 5]. By analogy with the definition used for women, osteoporosis in men has often been defined on the basis of T-scores that use gender-specific reference values. However, some data indicate that fracture risk depends on the absolute level of aBMD [6], leading to the recommendation that the same young reference value (i.e., that for women) should also be used to diagnose osteoporosis in men [79]. In fact, the current World Health Organization fracture risk assessment tool (FRAX) requires the use of female-referenced femoral neck (FN) aBMD in both sexes for estimating 10-year fracture risks [10], with the assumption that for a given FN aBMD, fracture risk is similar in men and women. If fracture risk is directly related to the strength of the proximal femur, then this assumption suggests that the relation between FN aBMD and femoral strength may be similar in men and women.

To gain mechanistic insight into whether sex-specific young reference values should be used in assessment of fracture risk, we used FN aBMD assessed by DXA, vBMD, and bone geometrical parameters at the FN assessed by quantitative computed tomography (QCT) and femoral strength estimated from the QCT images using finite element (FE) models [1115] to define, in a population-based sample of men and women, the gender-specific interrelationships among FN aBMD, vBMD, bone size, and FE-derived femoral strength.

Methods

Study subjects

We recruited subjects from an age-stratified, random sample of Rochester, MN, USA residents who were selected using the medical records linkage system of the Rochester Epidemiology Project [16]. This community is highly characteristic of the US White population but underrepresented with respect to Blacks, Hispanics, and Asians. Reflecting the ethnic composition of the community, 96% of the men and 99% of the women were White. The sample spanned ages from 27 to 95 years and included 215 women and 182 men; since FRAX is applicable to men and women aged 40 to 90 years, we used the 192 women and 156 men in this age range from this cohort for analysis. In addition, since FRAX is not applicable to patients being treated for osteoporosis, we excluded subjects on estrogen therapy, bisphosphonates, or selective estrogen receptor modulators; none were on therapy with teriparatide. This resulted in a final cohort for analysis of 136 women and 146 men. All subjects had QCT scans of the proximal femur (see below) and had aBMD assessed by DXA.

QCT of the proximal femur

As previously described for this cohort [3], single energy scans were made at the proximal femur with a multi-detector CT scanner. For the analysis of the scan images, we assessed a single reformatted oblique section contiguous at the midportion of and orthogonal to the FN, between the superior aspect of the head of the femur and the inferior aspect of the inferior trochanter. For all scanning sites, slice width was 2.5 mm and the in-plane voxel size was 0.74 mm. Calibration standards scanned with the patient were used to convert CT numbers directly to equivalent vBMD in milligrams per cubic centimeter [17]. To validate our image processing algorithm, we made ten scans of the European Spine Phantom (ESP), which is composed of hydroxyapatite [18]. The correlation between bone density results determined by our algorithm and that of the ESP was 0.998.

To study age- and sex-specific changes in bone mineral distribution, we developed software for the analysis of bone structure, geometry, and volumetric density from the CT images [19]. The CT data are treated as a 3D volume and reformatted in an orientation that is perpendicular to the primary loading forces on the bone in the region of interest. On this plane, the software program automatically places a single image line that extends from the centroid of the bone to outside the periosteal surface, and this line is then rotated about its centroid end in 3-degree increments. From the range of gray levels in the image, the cortex is identified as the maximal brightness found on this line. The full-width half-maximal points on either side of this maximum are interpreted as the periosteal and endocortical boundaries of the cortex. The resulting points are joined to create 2D cortical and subcortical regions. The vBMD and area of the two regions are measured independently, and a circular central trabecular region is also measured. The FN total cross-sectional area is also derived from this analysis as the area within the periosteal boundary as identified above.

DXA analysis

aBMD of the FN was obtained using the Lunar Prodigy (GE Medical Systems), with a coefficient of variation <1%.

FE estimates of proximal femoral strength

This was assessed in a subset of 28 men and 28 women who were randomly sampled from the larger group. The details of the finite element modeling are described elsewhere [12, 13]. Briefly, each QCT image was rotated into a standard orientation to simulate the femoral position at impact for a sideways fall. Each image voxel was resampled and converted into a 1.5-mm cube eight-noded finite element, and material properties for cortical or trabecular bone were assigned to each element using calibrated QCT density information and empirical relations between measured bone density and tissue mechanical properties [2022], using lower strengths for tension than compression [23, 24]. Loads and boundary conditions were then applied to simulate a sideways fall. This technique has provided strong Y = X type correlations between measurements of femoral strength in elderly cadaver femora loaded in a similar sideways fall configuration (n = 76, R = 0.88) [25], and others using similar techniques have also found consistently strong correlations [14, 15]. Further, the finite element technique, as implemented in this study, was highly predictive of new hip fractures in a prospective, blinded analysis of hip fracture in elderly men [13].

Estimation of fall loads and the load-to-strength ratio (Φ)

We calculated the load-to-strength ratio for a simulated sideways fall and impact directly on the greater trochanter. The “load” is an estimate of the impact force during a sideways fall, and the “strength” is the FE-derived estimate of the femoral strength for such a fall. Higher values of the load-to-strength ratio (Φ) place an individual at higher risk of hip fracture [26]. As described in more detail elsewhere [13],we calculated the in vivo impact force on the side of the trochanter for each subject using biomechanical theory [27, 28] and patient-specific weight and height information. A uniform value of trochanteric soft tissue thickness of 25 mm was assumed for all subjects, both men and women, since patient-specific measures of soft tissue thickness were not measured from the QCT scans for this study.

Statistical analysis

Variables were summarized using means and standard deviations (SD). The Pearson correlation coefficient was used to compare continuous variables, and the unpaired t test was used in analyses B and C to test for similarities between males and females. In analysis A, linear regression models were used to test for gender differences in vBMD and bone size after adjusting for aBMD and age. Model assumptions, including normality, were checked.

Results

We initially used the entire set of 282 scans available for analysis. Table 1 (analysis A) shows the clinical characteristics of the subjects. The men and women were similar in age and BMI, but the men were taller and weighed more than the women. We adjusted for FN aBMD and age, and then compared FN total cross-sectional area and vBMD (total, trabecular, and cortical) in men versus women. As shown in Table 2, this analysis demonstrated that following adjustment for aBMD and age, male sex had a positive parameter estimate for FN total cross-sectional area (larger area). For FN total, trabecular, and cortical vBMD, following adjustment for aBMD and age, male sex had a negative parameter estimate (lower vBMD values). Thus, following adjustment for FN aBMD and age, men had larger bones and lower vBMD as compared with women.

Table 1 Descriptive clinical data on subjects used in each of the three analyses
Table 2 Results of linear regression models to test for gender differences in vBMD and bone size after adjusting for DXA FN aBMD and age

In analysis B, we next matched 114 men to 114 women for FN aBMD. As shown in Table 1 (analysis B), matching subjects for FN aBMD resulted in the mean (±SD) age of the men (66.6 ± 13.1 years) being significantly greater than the women (61.0 ± 13.0 years, P < 0.001). Consistent with analysis A, matching for FN aBMD (Fig. 1a) resulted in men having significantly greater FN total cross-sectional area than the women (Fig. 1b). By contrast, the FN aBMD matching resulted in the men having significantly lower total (Fig. 1c), trabecular (Fig. 1d), and cortical vBMD (Fig. 1e) at the FN as compared with the women. Thus, the structural basis for the identical FN aBMD in the men and women was very different. These trends are visually apparent in Fig. 2, which shows representative images of cross sections of femur necks from men and women matched for FN aBMD.

Fig. 1
figure 1

a Areal BMD (aBMD) of the femur neck (FN) by DXA; b FN total cross-sectional area by QCT; c FN total volumetric BMD (vBMD) by QCT; d FN trabecular vBMD by QCT; and e FN cortical vBMD by QCT in 116 women and 116 men matched for FN aBMD. To better reflect the relative differences between women and men for each of the parameters, all values are normalized to a value of 100 for women. Bars denote SDs. *P < 0.05, **P < 0.01, and ***P < 0.001 versus men

Fig. 2
figure 2

Representative cross-sectional images of the FN in men and women matched for a BMD by DXA

This different structural basis for aBMD between men and women extended to femoral strength. In a third analysis (Table 1, analysis C), we constructed voxel-based FE models for bone strength in a subset of 28 men and 28 women (selected randomly, due to the resources needed to construct FE models, from the group of matched 114 men and 114 women in analysis B). As shown in Fig. 3, matching for FN aBMD (Fig. 3a) resulted in similar sex-related differences in FN total cross-sectional area (Fig. 3b), total (Fig. 3c) trabecular (Fig. 3d), and cortical vBMD (Fig. 3e) as we observed when analyzing the larger group of 228 matched subjects. Further, matching for FN aBMD resulted in the men and women having statistically similar (P = 0.335) values of FE-derived femoral strength (Fig. 3f), although this was ~11% higher in the men. Of note, fall loads were also ~11% higher in the men (Fig. 3g, P = 0.041), resulting in relatively similar values for the load-strength-ratio, Φ, in men and women (P = 0.609). These trends are visually apparent in Fig. 4, which shows representative FE models for a woman and man matched for aBMD, demonstrating the similar FE-derived femoral strength in the two subjects as a result of the greater vBMD but smaller bone size in the woman.

Fig. 3
figure 3

a Areal BMD (aBMD) of the femur neck (FN) by DXA; b FN total cross-sectional area by QCT; c FN total volumetric BMD (vBMD) by QCT; d FN trabecular vBMD by QCT; e FN cortical vBMD by QCT; f FE-derived bone strength; g estimated fall loads; and h load-to-strength ratio, Φ, in 28 women and 28 men matched for FN aBMD. To better reflect the relative differences between women and men for each of the parameters, all values are normalized to a value of 100 for women. Bars denote SDs. *P < 0.05, **P < 0.01, and ***P < 0.001 versus men

Fig. 4
figure 4

Representative FE models in a male and female subject matched for FN aBMD. The color bar shows regions of relatively high to low bone material properties on a per-voxel basis

Discussion

Using DXA to measure FN aBMD, QCT to assess bone size and vBMD at the FN, and FE modeling to evaluate hip strength, our study provides a potential structural explanation for the ability of FN aBMD by DXA to predict hip fracture risk equivalently in men and women [79]. The key findings of our study are that matching men and women for FN aBMD results in the men (1) being ~6 years older than the women; (2) having larger bone size (i.e., FN total cross-sectional area); and (3) having lower vBMD. Since DXA is inherently influenced by bone size, with larger bones having higher aBMD [2], this property of DXA integrates effects of vBMD and bone size. In our cohort of men and women who were matched for DXA FN aBMD, this integrative effect resulted in quite similar measures of FE-derived femoral strength and even closer values of the load-to-strength ratio. From a clinical perspective, our findings therefore provide mechanistic support for using the same young reference values (i.e., those of women) in the T-score calculation for FN aBMD when estimating 10-year fracture probabilities in men and women, as is currently being done in the FRAX algorithm [10].

We recognize, however, that the in vivo risk of fracture is more complex than the parameters reported here, and there could be a different relationship of aBMD to fracture risk in men versus women due to such factors as risk of falls, differences in soft tissues protecting the femur, etc., as suggested by some [29] but not other [6] studies. In a recent prospective FE study of hip fracture in men and women, Keyak and colleagues [30] recently found that there were sex differences between the association of FE-derived femoral strength and hip fracture risk. However, their key finding was that incident hip fracture was associated with a greater decrement in FE strength between fracture cases versus controls in men than in women. To the extent that, as shown here, men and women having matched values of FN DXA aBMD had quite similar values for FE strength at the hip and even closer values of the load-to-strength ratio; the findings of Keyak et al. [30] would suggest that even using a female reference range to define osteoporosis in men may overestimate the risk of fractures in men identified as having osteoporosis, using a male-specific reference range would overestimate risk even more.

We estimated femoral strength using FE models that consistently provide very high degrees of correlation with measured femoral strength, although these models are limited insofar as the in vivo resolution of the QCT scans does not allow them to fully account for differences in bone microstructure at the femoral neck, including the thin cortical shell that may also play a role in determining femoral strength. We should note, however, that our FE technique has been shown recently in a prospective, blinded study to be highly predictive of new hip fractures in elderly men [13]. Moreover, since aging women lose bone at a faster rate than aging men [3, 31], matching women and men for FN aBMD resulted in the men being ~6 years older than the women. This may result in additional age-related deficits in bone microstructure or tissue material properties in the men that contribute to the overall risk of fracture, and the techniques used in this study did not account for these possible deficits. However, since there is no evidence of any appreciable sex effect in the relation between vBMD and mechanical properties of trabecular bone, and since the CT scans and FE models account for any sex differences of vBMD in men versus women, it is unlikely that the FE technique used here would not also be highly predictive of hip fracture in women. Indeed, our preliminary analysis of the AGES cohort validated the finite element implementation used in this study for prospective hip fracture prediction in both women and men [29]. Finally, due to resource limitations, we performed FE models only in a subset of the overall cohort, but the pattern of differences in bone size and vBMD between men and women observed for this sub-cohort were similar to those for the larger cohort analyzed only for DXA aBMD and QCT parameters.

Our study focused on FN aBMD, vBMD, and femoral strength, since the current FRAX tool for estimating fracture risk uses aBMD at this site rather than the spine [10]. Whether the relationships observed here are also true for vertebral aBMD, vBMD and bone strength is unclear and requires further study. Indeed, we did not explain why the differences in geometry and vBMD between the sexes just happens to largely cancel out for both aBMD and for femoral strength, and this effect may or may not extend to femoral strength for habitual gait loading, for example. In addition, it is also possible that in different populations of men and women from the one studied here, these opposing effects of femoral geometry and vBMD on femoral strength may or may not cancel each other out, leading to some differences in FE-derived bone strength and the load-to-strength ratio between men and women matched for aBMD. Clearly, further studies are needed to address these issues.

In summary, our study demonstrates that matching for FN aBMD between the sexes leads to quite similar values of both FE-derived femoral strength and the load-to-strength ratio, Φ, in men and women, but the structural basis for this integrative effect differs between the sexes. Specifically, matching men and women for FN aBMD results in the men having lower vBMD but greater bone size. These two parameters impact femoral strength in opposite directions, which at least in this cohort, resulted in relatively similar values of femoral strength and Φ between the sexes.