Introduction

High-resolution peripheral quantitative computed tomography (HR-pQCT) is a powerful clinical research tool providing non-invasive measures of bone microarchitecture as well as micro-finite element (µFE) estimates of bone strength. Bone strength estimates from HR-pQCT data predict incident fracture and improve fracture prediction beyond femoral neck areal bone mineral density (BMD) or fracture risk assessment tool (FRAX) scores alone [1, 2]. These findings suggest utility of HR-pQCT in fracture prediction; however, its use in the clinical realm is limited due to a lack of instrument availability, comparatively high data acquisition burden, and limited availability of reference data.

A number of studies have provided reference data for HR-pQCT outcomes [3,4,5,6,7,8,9,10,11,12,13,14,15,16,17]. These provide a means of expressing an individual’s outcomes relative to peers, including adolescents and young adults [6, 8], and adults from a range of racial and ethnic backgrounds [3,4,5, 7, 9,10,11,12,13,14,15,16,17]. However, the majority of studies [3,4,5,6,7,8,9,10,11,12, 14, 17] utilized data acquired with the first-generation HR-pQCT scanner, which had nominal isotropic resolution (82 µm) and required derivation of trabecular microarchitecture based on assumptions about tissue density. The more recently introduced second-generation HR-pQCT scanner has provided enhanced spatial resolution (61 μm) and allows for direct voxel-based measures of trabecular microarchitecture [18]. In addition, the second-generation scanner has a larger scan length in the axial direction enabling greater access to proximal scan positions. Assessment of cortical bone-rich diaphyseal sites may provide unique insight into cortical bone changes with aging, disease, and lifestyle and pharmaceutical interventions.

Preliminary studies have provided reference data for outcomes acquired at distal bone sites with the second-generation HR-pQCT scanner [13, 15, 16]. However, as with the previous studies using the first-generation scanner [3,4,5,6,7, 9,10,11,12, 14, 17], scan locations were offset a fixed distance proximal from the end of the respective bone (typically 9.5 mm [first-generation HRpQCT] or 9.0 mm [second-generation] for the distal radius and 22.5 mm [first-generation] or 22.0 mm [second-generation] for the distal tibia). Scanning at a fixed offset can lead to the introduction of systematic bias in cross-sectional comparisons as it does not account for the impact of bone length [19,20,21]. For example, scanning at a fixed offset in an individual with shorter bone lengths causes the volume of interest to be more proximal. The result is a systematically smaller bone size, with elevated density due to the inclusion of more cortical and less trabecular bone [22].

To address the impact of bone length on HR-pQCT outcomes and enable assessment of anatomically similar regions between individuals, the recommendation is to position volumes of interest relative to a percent offset of bone length [23]. To our knowledge, only one reference data set performed scans using percent-of-length offsets, but the data were acquired using a first-generation scanner and limited to adolescents and young adults [8]. Also, no study that we are aware of has provided reference data for scans performed at proximal/diaphyseal bone sites.

The primary purpose of the current study was to generate age- and sex-specific normative data for HR-pQCT measures of the distal and diaphyseal radius and tibia across adulthood using a second-generation scanner and percent-of-length offsets. A secondary objective was to generate regression equations to convert between μFE-estimated failure load using criteria conventionally utilized with first-generation HR-pQCT scanners (“Pistoia criteria”) [24], but still being used with second-generation scanners [15, 16], and those specifically developed for the different scan volume and resolution of second-generation HR-pQCT scanners (“Arias-Morena criteria”) [25]. Using the cumulative data, the ultimate goal of the study was to provide researchers and clinicians with practical Microsoft Office Excel-based calculators to compute subject-specific percentiles, z-scores, and t-scores for each outcome and plot outcomes on centile curves.

Methods

Participants

HR-pQCT scans were performed on 1,598 adults (aged ≥ 18 years) between 12/2017 and 6/2021 within the Musculoskeletal Function, Imaging, and Tissue (MSK-FIT) Resource Core of the Indiana Center for Musculoskeletal Health’s Clinical Research Center (Indianapolis, IN). Participants were recruited to the core by investigators seeking standardized musculoskeletal outcomes for their research subjects, as well as from the local community via self-referral from posted flyers, word-of-mouth, an online research volunteer registry (https://research.indianactsi.org/volunteers/registration/), and community presentations. The MSK-FIT core has Institutional Review Board approval from Indiana University to assess all-comers who provide written informed consent.

To be eligible for inclusion in the current dataset, participants were required to (1) be between 18 and 80 years of age; (2) self-identify as being of White ancestry; (3) be ambulatory; and (4) have no self-reported diabetes, liver or kidney disease, past or present history of cancer, thyroid disorders, cystic fibrosis, or rare bone disease. Individuals with the later conditions were excluded due to their known or potential impact on HR-pQCT outcomes [26] and their overrepresentation in the MSK-FIT Resource Core cohort resulting from investigator-initiated trials. An upper age limit of 80 years was selected because the MSK-FIT Resource Core cohort currently has a limited number of participants beyond this age, with centile curves being sensitive to low numbers of individuals at the tail of distributions. Individuals with osteoporosis were included if they were not currently taking bone active therapies. There was no criterion regarding the timing of the last dose in individuals with a history of taking these therapies.

Participant characteristics

Height (to the nearest 0.1 cm) and weight (to the nearest 0.1 kg) were measured without shoes using a calibrated stadiometer (Seca 264; Seca GmbH & Co., Hamburg, Germany) and scale (MS140-300; Brecknell, Fairmont, MN), respectively. Appendicular skeletal muscle mass relative to height (ASM/height2; kg/m2) and whole-body aBMD were assessed by whole-body dual-energy x-ray absorptiometry (DXA) (Norland Elite; Norland at Swissray, Fort Atkinson, WI). Regional DXA using the same scanner assessed total hip and spine aBMD to determine osteoporosis status.

High-resolution peripheral quantitative computed tomography (HR-pQCT)

HR-pQCT (XtremeCT II; Scanco Medical, Bruttisellen, Switzerland) was used to image participants’ non-dominant arm and leg. Scanner stability throughout the duration of data collection was confirmed by scanning phantoms with inserts of known density and volume, as per manufacturer instructions. The non-dominant leg was defined as the leg contralateral to the participants’ non-dominant arm, as per the concept of crossed symmetry [27, 28]. Bone length was measured in triplicate using a segmometer (Realmet Flexible Segmometer, NutriActiva, Minneapolis, MN) and as described by Bonaretti et al. [19]. Per convention, ulna length was measured as a surrogate for radial length. The skin overlying the distal apex of the subject’s ulnar styloid was marked and the subject’s elbow placed on a rigid surface. The distance between the surface and styloid mark was measured. Tibial length was measured as the distance between skin marks placed at the distal tip of the medial malleolus and the medial joint line of the knee. Short-term precision for repeat mark placement and length measures of the ulna and tibia in 15 individuals tested on two consecutive days showed root mean square standard deviations of 2.8 mm (1.1%) and 6.3 mm (1.7%), respectively.

Scans were acquired as previously described [29] by one of three similarly trained operators. Subjects laid supine on a moveable treatment plinth and the limb to be imaged immobilized using a padded, anatomically formed carbon fiber cast. The scanner operated at 68 kVp and 1.47 mA to acquire 168 slices (10.2 mm of bone length) with a voxel size of 60.7 µm. After performance of scout scans, reference lines were placed at the medial edge of the distal radius articular surface and center of the tibia joint surface [23]. Scan stacks were centered 4% and 30% of bone length proximal to the radius reference line and 7.3% and 30% of bone length proximal to the tibia reference line, as recommended when employing the percent-of-length offset approach [23]. Scans were scored for motion artifacts on a standard scale of 1 (no motion) to 5 (significant blurring of the periosteal surface, discontinuities in the cortical shell) [30]. Scans scoring ≥ 3 were repeated when time permitted. Scans with a motion artifact of 4 or 5 were excluded from analyses.

Reconstructed images were analyzed according to the manufacturer’s standard protocol. The outer periosteal and inner endosteal surfaces of the bone were identified automatically, and segmentations checked for accuracy and manually modified when needed. To assess microarchitectural outcomes, images were filtered using a low‐pass Gaussian filter (sigma 0.8, support 1.0 voxel) and fixed thresholds used to extract trabecular and cortical bone (320 and 450 mgHA/cm3, respectively).

The following outcomes were recorded at distal sites: total vBMD (Tt.vBMD, mgHA/cm3) and area (Tt.Ar, mm2); trabecular vBMD (Tb.vBMD, mgHA/cm3), area (Tb.Ar, mm2), bone volume/total volume (Tb.BV/TV, %), thickness (Tb.Th, mm), number (Tb.N, 1/mm), and separation (mm); and cortical vBMD (Ct.vBMD, mgHA/cm3), area (Ct.Ar, mm2), thickness (Ct.Th, mm), porosity (Ct.Po, %), and porosity diameter (Ct.Po.Dm, mm). The following outcomes were measured at diaphyseal sites: Tt.Ar (mm2), Ct.vBMD (mgHA/cm3), Ct.Ar (mm2), Ct.Th (mm), Ct.Po (%), and Ct.Po.Dm (mm).

μFE analysis (Scanco Medical FE software version 1.13) was used to estimate stiffness (kN/mm) and failure load (N) at both the distal and diaphyseal sites. Each voxel within the segmented HR-pQCT images was assigned a modulus of 10 GPa and Poisson’s ratio of 0.3 [24]. Axial compression was applied and failure load estimated using two sets of criteria. The first set (“Pistoia criteria”) were those conventionally utilized with first-generation HR-pQCT scanners and estimated failure load when 2% of elements exceeded 0.7% strain [24]. The second set of criteria (“Arias-Morena criteria”) were developed for the different scan volume and resolution of second-generation HR-pQCT scanners and estimated failure load when 5% of elements exceeded 1% strain [25].

Short-term precision on duplicate scans with repositioning, but using the initially obtained limb length, in 15 individuals showed root mean square coefficients of variation (RMS-CV) of < 0.6% for bone density (Tt.vBMD, Ct.vBMD, Tb.vBMD) and < 0.8% for bone size (Tt.Ar, Ct.Ar, Tb.Ar) at both distal and diaphyseal sites and 1.8–2.9% and < 1% for estimated failure load at distal and diaphyseal sites, respectively. RMS-CVs for trabecular microarchitecture outcomes (Tb.BV/TV, Tb.N, Tb.Th, Tb.Sp) at distal sites were 0.7–3.2%, whereas for Ct.Po they were 10.3–15.4% and 4.6–6.7% at distal and diaphyseal sites, respectively. The impact of a second measure of limb length prior to repositioning on precision was not assessed.

Statistical analyses

All statistical analyses were performed for females and males separately. Sex-specific participant characteristics were described according to decade stage of life (18–29, 30–39, 40–49, 50–59, 60–69, and 70–80 years). Categorical variables were summarized by frequencies. Continuous variables were summarized by medians and interquartile ranges. After checking assumptions of normality, linearity, homoscedasticity, and absence of collinearity using IBM SPSS Statistics (v27; IBM Corporation, Armonk, NY), linear regression was used to model the relationship between the two sets of criteria (Pistoia and Arias-Morena) for estimating failure load.

Sex‐specific reference centile curves for each outcome were generated using the LMS method [31] with R package GAMLSS (version 5.2.0) [32]. The LMS method uses Box-Cox transformation to achieve normality at a given age (Box-Cox Cole and Green [BCCG] distribution). Nonparametric smooth curves are fit to the parameter values across the age range using penalized likelihood with penalty on the second derivatives. Additionally, non-negativity of biological measurements is incorporated by adopting truncated normal distributions, where the truncations are negligible in most cases.

The GAMLSS package provides more flexibilities other than normal and BCCG distributions. Additional distributions include Box-Cox power exponential (BCPE) and Box-Cox t (BCT), which model kurtosis in addition to the skewness modeled by BCCG. We used Akaike information criterion (AIC) to compare the different possible models and determine which best fit the data. However, we report results solely based on BCCG (and normal) distributions for reasons detailed in the results section.

Centile curves, z-scores, and t-scores were calculated from the estimated parameter curves. As per convention, z-scores represent the number of standard deviations the subject-specific outcome varies from age- and sex-matched median outcomes, whereas t-scores represent the number of standard deviations the subject-specific outcome varies from the best sex-specific median outcome. The age when peak outcome was achieved was determined as the highest point on the fitted median curve. Ct.Po, Ct.Po.Dm, and Tb.Sp were exceptions as lower values were considered more desirable. Peak for these outcomes was the lowest point on the median curve.

Because the LMS transformation method to achieve normality constrains maximum obtainable z- and t-scores, LMS-derived z- and t-scores are not suited for identifying extreme values. To address this, modified z- and t-scores are provided for scores greater than + 2. In modified z-scores, the HR-pQCT outcome is expressed relative to the sex- and age-matched median in units of half the distance between 0 and + 2 z-scores, as per the approach used for growth charts [33]. Modified t-scores are expressed in units half the distance between 0 and + 2 z-scores at the age of best sex-specific median outcome in the database.

Results

Participant and scan characteristics

A total of 1,184 participants (867 females, 317 males) were included (Table 1). The sex discrepancy resulted from the self-referral pattern of local community members volunteering to be tested. Reasons for exclusion included (1) age > 80 years (n = 25); (2) race not White (n = 226); and (3) self-reported ineligible disease or illness (n = 163). The final cohort included females and males ranging in age from 18.0 to 79.9 years and 18.4 to 80.0 years, respectively. Total hip and/or spine aBMD t-score was − 1 to − 2.5 and ≤  − 2.5 in 252 (29.1%) and 20 (2.3%) females, respectively. One hundred eight (34.1%) and 21 (6.6%) males had a total hip and/or spine aBMD t-score − 1 to − 2.5 and <  − 2.5, respectively.

Table 1 Participant characteristics stratified by decade of age

HR-pQCT reference data in females was generated from 831, 850, 753, and 748 scans of the distal radius, radial diaphysis, distal tibia, and tibial diaphysis, respectively. Lower scan numbers than participants was due to: (1) scan not performed due to a lack of subject time (n = 10, 10, 64, and 67 participants at the distal radius, radial diaphysis, distal tibia, and tibial diaphysis, respectively); (2) participant too large for the carbon fiber cast or presence of a local tomography artifact on the scan due to the limb being too large and absorbing tissue being outside of field of view (n = 30 and 39 participants at the distal tibia and tibial diaphysis, respectively); and (3) excessive motion artifact (n = 26, 7, 20, and 13 scans at the distal radius, radial diaphysis, distal tibia, and tibial diaphysis, respectively).

Male HR-pQCT reference data was generated from 310, 302, 283, and 248 scans of the distal radius, radial diaphysis, distal tibia, and tibial diaphysis, respectively. Less scans than participants was due to: (1) scan not performed (n = 10, 23, and 33 participants at the radial diaphysis, distal tibia, and tibial diaphysis, respectively); (2) participant too large for the carbon fiber cast or presence of a local tomography artifact on the scan due to the limb being too large and absorbing tissue being outside of field of view (n = 8 and 24 participants at the distal tibia and tibial diaphysis, respectively); and (3) excessive motion artifact (n = 7, 5, 3, and 12 scans at the distal radius, radial diaphysis, distal tibia, and tibial diaphysis, respectively).

Percentile, z-score, and t-score calculator, and centile curve plotter

Excel calculators based on the normative curves fitted using the LMS approach were developed for both distal and diaphyseal scan locations within the radius (Supplemental file 1) and tibia (Supplemental file 2). The AIC procedure indicated BCCG or normal distributions best fit the data for 34 and 35 of the variables in females and males, respectively (Supplemental file 3). BCPE or BCT best fit the data for the other 16 and 15 variables in females and males, respectively. We solely report results based on BCCG and normal as: (1) the estimated centiles were very similar regardless of the AIC differences; (2) the kurtosis modeled by BCPE or BCT presents only in the extreme tails of distributions and, as such, affects the distribution only beyond the 1st and 99th centiles [34]; and (3) we wanted to use Excel to create downloadable normative data calculators. It is straightforward to incorporate either BCCG or normal distributions in Excel using built-in functions, but the same cannot be said for BCPE or BCT distributions.

Entry of basic demographic information (sex, date of birth, and scan date) and one or more HR-pQCT outcome into the Excel calculators (Fig. 1A) results in plotting of sex-specific centile curves and computation of subject-specific percentiles, z-scores, and t-scores (Fig. 1B). For Ct.Po, Ct.Po.Dm, and Tb.Sp higher values were considered less desirable and, thus, yield lower z- and t-scores. Representative images of reconstructed and segmented scans in females and males are provided in Figs. 2 and 3, respectively.

Fig. 1
figure 1

Screenshots of the Excel-based calculator for the radius available in Supplemental file 1. Following entry of basic demographic information and one or more HRpQCT outcome (A), subject-specific percentiles, z-scores, and t-scores are computed, and centile curves plotted (B)

Fig. 2
figure 2

Representative HRpQCT images of the distal radius and tibia and radial and tibial diaphyses in young (18–20 years) and older (75–80 years) females. Scans were chosen based on failure load (using Pistoia criteria) being within ± 2% of the respective percentile (10th, 50th, and 90th)

Fig. 3
figure 3

Representative HRpQCT images of the distal radius and tibia and radial and tibial diaphyses in young (18–20 years) and older (75–80 years) males. Scans were chosen based on failure load (using Pistoia criteria) being within ± 2% of the respective percentile (10th, 50th, and 90th)

Centile curves at distal bone sites

Centile curves for Tt.vBMD, Tb.BV/TV, Tb.Th, and failure load (using Pistoia criteria) are presented for the distal radius (Fig. 4) and tibia (Fig. 5). Centile curves for other outcomes, including failure load estimated using Arias-Morena criteria, are provided in Supplemental files 4 (distal radius) and 5 (distal tibia). For most measures, the fitted median centile curve was highest at the youngest age in the cohort. Exceptions at the distal radius were Tt.vBMD (peak at 35.3 and 31.8 years in females and males, respectively), Ct.vBMD (peak at 41.3 and 35.9 years in females and males, respectively), Tt.Ar (peak at maximum database age in males [80.0 years]), Ct.Ar (peak at 39.8 years in males), Tb.Ar (peak at maximum age in females [79.9 years] and males [80.0 years]), and Tb.Th (peak at maximum age in females [79.9 years]). Exceptions at the distal tibia were Ct.vBMD (peak at 36.3 and 33.2 years in females and males, respectively), Tt.Ar (peak at maximum age in females [79.9 years] and males [80.0 years]), Ct.Ar (peak at 30.1 years and 44.1 years in females and males, respectively), Tb.Ar (peak at maximum age in females [79.9 years] and males [80.0 years]), Ct.Th (peak at 35.8 years in females), and Tb.Th (peak at maximum age [79.9 years] in females).

Fig. 4
figure 4

Centile curves for Tt.vBMD (A,B), Tb.BV/TV (C,D), Tb.Th (E,F), and estimated failure load (using Pistoria criteria) (G,H) at the distal radius for females and males using a 4% percent-of-length offset

Fig. 5
figure 5

Centile curves for Tt.vBMD (A,B), Tb.BV/TV (C,D), Tb.Th (E,F), and estimated failure load (using Pistoria criteria) (G,H) at the distal tibia for females and males using a 7.3% percent-of-length offset

Centile curves at diaphyseal bone sites

Centile curves for Ct.vBMD, Ct.Th, Ct.Po, and failure load (using Pistoia criteria) are presented for the radial (Fig. 6) and tibial (Fig. 7) diaphysis. Centile curves for other outcomes, including failure load estimated using Arias-Morena criteria, are provided in Supplemental files 6 (radial diaphysis) and 7 (tibial diaphysis). The fitted median centile curve was highest at varying ages depending on the specific HR-pQCT outcome. Peak Ct.vBMD values occurred in females and males aged 36.9 to 46.5 yrs at both the radial and tibial diaphysis. Tt.Ar at the radial diaphysis in females and males was greatest at the maximum age in the database. Tibial diaphysis Tt.Ar in males was also greatest at the maximum database age. Ct.Ar, Ct.Th, stiffness, and failure load at the radial and tibial diaphysis had highest values in females and males aged < 50 years. Ct.Po at both the radial and tibial diaphysis was low in females and males aged < 50 years and increased thereafter.

Fig. 6
figure 6

Centile curves for Ct.vBMD (A,B), Ct.Th (C,D), Ct.Po (E,F), and estimated failure load (using Pistoria criteria) (G,H) at the radial diaphysis for females and males using a 30% percent-of-length offset

Fig. 7
figure 7

Centile curves for Ct.vBMD (A,B), Ct.Th (C,D), Ct.Po (E,F), and estimated failure load (using Pistoria criteria) (G,H) at the tibial diaphysis for females and males using a 30% percent-of-length offset

Impact of criteria on failure load

There was a strong linear relationship (R2 > 0.99) between failure load estimated using Pistoia [24] and Arias-Morena [25] criteria (all P < 0.001). Site-specific regression equations to convert estimated failure load using the different criteria are:

$$\begin{array}{cc}\mathrm{Distal}\;\mathrm{radius}&\mathrm{Arias-}\mathrm{Morena}\;\mathrm{failure}\;\mathrm{load}=1.581\ast\mathrm{Pistoia}\;\mathrm{failure}\;\mathrm{load}+170.24\end{array}$$
$$\begin{array}{cc}\mathrm{Radial}\;\mathrm{diaphysis}&\mathrm{Arias-}\mathrm{Morena}\;\mathrm{failure}\;\mathrm{load}=1.526\ast\mathrm{Pistoia}\;\mathrm{failure}\;\mathrm{load}-59.12\end{array}$$
$$\begin{array}{cc}\mathrm{Distal}\;\mathrm{tibia}&\mathrm{Arias-}\mathrm{Morena}\;\mathrm{failure}\;\mathrm{load}=1.589\ast\mathrm{Pistoia}\;\mathrm{failure}\;\mathrm{load}-2.91\end{array}$$
$$\begin{array}{cc}\mathrm{Tibial}\;\mathrm{diaphysis}&\mathrm{Arias-}\mathrm{Morena}\;\mathrm{failure}\;\mathrm{load}=1.555\ast\mathrm{Pistoia}\;\mathrm{failure}\;\mathrm{load}-164.17\end{array}$$

Discussion

The current study provides unique data and resources which complement and further recently published HR-pQCT reference datasets. The unique data include the generation of reference data: (1) at anatomically standardized regions; (2) using a second-generation HR-pQCT scanner; and (3) at diaphyseal (in addition to distal) bone locations. The unique resources are the: (1) generation of regression equations to convert between μFE-estimated failure load using the original Pistoia [24] and newer Arias-Morena [25] criteria and (2) development of practical Excel-based calculators (Supplemental files 1 and 2) based on the normative curves fitted using the LMS approach. The regression equations enable conversion and cross-study comparisons of data acquired using the two sets of failure load criteria. The Excel-based calculators can plot an individual’s outcomes on the centile curves presented here and compute subject-specific percentiles, z-scores, and t-scores.

The generation of reference data at anatomically standardized regions by using percent-of-length offsets is novel. To our knowledge, percent-of-length offsets have only been used once previously to generate HR-pQCT reference data, with the data limited to distal bone sites in adolescents and young adults and acquired using a first-generation scanner [8]. Recent HR-pQCT guidelines recommend the field transition from performing HR-pQCT scans at conventional fixed offsets to using percent-of-length offsets [23]. A percent-of-length offset controls for differences in bone length which can result in inclusion of differing proportions of cortical and trabecular bone when using fixed offsets [19,20,21]. The latter may lead to the generation of imprecise percentiles and z- and t-scores which may be important when determining bone status within an individual or population of interest.

The distal scan sites in the current cohort (4% distal radius and 7.3% distal tibia) were chosen as, on average, they reportedly approximate the scan locations while using fixed offsets [19, 23]. If scan locations when using percent-of-length and fixed offsets do generally approximate, centile curves and peak median values on the population level should be similar when using the two different offset approaches. However, there are noticeable differences when comparing our distal radius data to those reported by Whittier et al. [15] who also used a second-generation HR-pQCT scanner to generate reference data in a mostly White population.

Peak median values in young adults for distal radius Ct.vBMD, Tb.vBMD, and trabecular microarchitectural outcomes (Tb.BV/TV, Tb.N, Tb.Th, Tb.Sp) are similar (< 5% difference) between the current study and that of Whittier et al. [15]. However, both females and males in our cohort had ~ 15% lower distal radius Tt.vBMD with ~ 17% greater Tt.Ar, ~ 30% greater Tb.Ar, and more than 10% lower Ct.Th. These differences are consistent with systemically scanning a more distal location in the current cohort, with the distal radius increasing in cross-sectional size and its cortex thinning as you move distally [22].

One reason for a systemic difference in scanning location may be related to subtleties associated with using a different reference line location. We placed our reference line at the medial edge of the distal radius articular surface (as recommended when using percent-of-length offsets [23]), as opposed to the more distal central ridge between the scaphoid and lunate fossae typically used with a fixed offset. A 4% offset from our reference line location corresponded with average fixed offsets of 5.1 mm (IQR = 4.7–5.4 mm) and 6.1 mm (IQR = 5.7–6.6 mm) in females and males, respectively. These locations may not accurately correspond with that when scanning 9.0 mm proximal from the more distal reference line placed when using the fixed offset approach.

In comparison to at the distal radius, reference line positioning at the distal tibia is consistent when using either a percent-of-length or fixed offset allowing for better assessment of comparability. A 7.3% offset in our cohort corresponded with an average offset of 21.7 mm (IQR = 20.6–22.7 mm) and 24.1 mm (IQR = 22.8–25.6 mm) in females and males, respectively. The average offset in females approximates the 22.0 mm offset typically used when scanning with a fixed offset approach when using second-generation HR-pQCT. Accordingly, peak median values at the distal tibia in younger females in the current study and that of Whittier et al. [15] are strikingly similar. However, the larger average offset in males in the current study highlights the limitation of using fixed offsets.

Using the conventional 22.0 mm fixed offset in males would have, on average, shifted scan location distally approximately 20% (2.1 mm) of total scan length (10.2 mm) because of the longer bone lengths in our male participants. While the rate of change of bone properties at the distal tibia is lower than that at the distal radius [22], comparing the current data to that reported by Whittier et al. [15] suggests that the later study did indeed scan a more distal location in males. In particular, Whittier et al. [15] reported peak median outcomes at the distal tibia in younger males that were larger for Tt.Ar, and lower for Tt.vBMD (~ 5%), Ct.Ar (~ 15%), and Ct.Th (~ 13%) than in the current study.

Peak median values (using the Pistoria failure criteria [24]) for estimated failure load in younger adults in the current study were ~ 15–20% and ~ 25–30% higher at the distal radius and tibia, respectively. In addition to the aforementioned potential differences in scanning location and subsequent density and morphological outcomes, a reason for our higher peak median values for estimated failure load is our use of a higher modulus (10 GPa vs. 8.7 GPa). A higher modulus linearly increases estimated failure load [35].

The centile curves generated with the current cross-sectional data allow for comparisons between ages, sexes, and sites and affirm previous observations regarding bone changes with aging. At each site and in both sexes, the fitted median centile curve for Ct.vBMD progressively increased between ages 18 and ~ 40 years before declining thereafter. The decline was more precipitous in females at both the distal and diaphyseal sites and was coupled with a transition in the rate of centile curve change of median Ct.Po in females aged 50–60 years—likely corresponding with changes associated with perimenopause. Fitted median centile curves for Ct.Ar and Ct.Th progressively declined with age in females at all sites. As Tt.Ar remained relatively constant, the decline in cortical properties in females most likely occurred near the cortical/trabecular transitional zone. These changes in cortical bone properties were tempered in males.

Preferential loss of bone near the cortical/trabecular bone transition may explain some of the trabecular observations in females with age. In particular, the fitted median centile curve for Tb.Th at both the distal radius and tibia in females began to increase after reaching a nadir around 50 years of age. The apparent increase in thickness may be due to our inclusion of a relatively lower number of older adults who may have been on the healthier end of the bone health spectrum. LMS curves are sensitive to individuals at the tails of the distribution. However, the hypothesis of higher than average bone health in our older females is not supported by the curves for Tt.vBMD and estimated failure load which continued to decline with age, and the observation that females aged 70–80 years in our cohort had an average total hip aBMD t-score in the osteopenic range (mean ± SD =  − 1.1 ± 0.8).

An increase in Tb.Th at the distal radius with age in females has been shown in a previous prospective HR-pQCT study [36] and is likely due to trabecularization of cortical bone adjacent to the trabecular compartment. Endochondral resorption and intracortical remodeling during aging leaves cortical bone fragments in the cortical/trabecular transitional zone [37]. As the standard HR-pQCT image-processing algorithm automatically segments cortical and trabecular bone based on density thresholds, cortical remnants present in older individuals can be assigned to the trabecular compartment resulting in the underestimation of trabecular and overestimation of cortical bone changes [38]. In our case, the result was a rise in the fitted median curve for Tb.Th in older females at distal sites coupled with an apparent reduction in the decline of the fitted median curves for Tb.vBMD and Tb.BV/TV.

The assignment of cortical bone fragments to the trabecular compartment possibly also contributes to the pattern of change in Ct.vBMD at the diaphyseal sites. As mentioned above, there was a transition to a larger rate of median Ct.vBMD loss at diaphyseal sites after reaching a peak at age 40 years. This was followed by a second transition in the rate of median curve change around 60 years of age wherein Ct.vBMD loss appeared to plateau with increasing age. The reduction in Ct.vBMD loss beyond 60 years may be due to slowing of menopause-related changes. Alternatively, it may be due to bone loss in the cortical/trabecular transitional zone resulting in a greater proportion of lower-density cortical remnants being counted as trabecular bone. The net result is an apparent reduction in the loss of Ct.vBMD.

HR-pQCT provides the ability to separate the cortical and trabecular compartments; however, parameters potentially influenced by transitional zone changes require careful interpretation. In particular, t-scores for parameters that showed an apparent reversal in trends with increasing age (e.g., Tb.vBMD, Tb.BV/TV, and Tb.Th at the distal radius and Tb.Th at the distal tibia in females) may not be useful for detailing an individual’s bone status (in fact, t-scores for Tb.Th at distal sites in females are not calculatable as the peak on the fitted median curves occurred at the oldest age in the dataset). Instead, it may be that whole bone parameters (e.g., Tt.vBMD, stiffness, estimated failure load) provide the most robust outcomes for assessing general bone status, particularly at distal skeletal sites. Supporting this, estimated failure load at distal sites has been found to have the greatest association with the risk of fracture [2]. The utility of estimated mechanical properties at diaphyseal sites remains to be established as the μFE models utilize axial or uniaxial compressive loads, whereas resistance to bending will also be relevant at diaphyseal sites.

Our study has a number of strengths, including the development of reference data for second-generation HR-pQCT scans at distal and diaphyseal sites in a self-reported racially homogenous population. However, our study is not without limitations. Data were obtained at a single center and variability in machine performance at other centers may influence outcomes and percentiles. Our normative curves were generated using cross-sectional data, which may not represent an individual’s trajectory over time [39]. We used the HR-pQCT manufacturer’s standard software to segment bones and did not use alternative segmentation approaches (e.g., StrAx software [40]) to potentially improve localization of the cortical/trabecular transitional zone. We did not assess the appropriateness of our sample size and sample composition, as recently recommended by Cole [34]. It remains unclear whether the regions assessed correspond with where fractures occur clinically, particularly at the distal radius [41]. Finally, we had more limited inclusion of males and older (age > 70 years) adults, our data are specific to White individuals living in the Midwest of the United States, and we did not collect data on medication use (beyond excluding those currently taking bone active therapies).

In summary, the current study provides age- and sex-specific normative data for HR-pQCT measures of the distal and diaphyseal radius and tibia acquired using percent-of-length offsets. The generated centile curves and calculators enable HR-pQCT outcomes in an individual or population of interest to be expressed relative to the reference cohort. Comparison to a reference cohort can be of benefit as it can negate the need to establish a control group in cross-sectional studies and can indicate the magnitude of difference from expected values. However, careful consideration needs to be given to the suitability of the reference population in terms of race and ethnicity, age, sex, and other demographic characteristics. Also, consideration needs to be given to the HR-pQCT scanning and analysis procedures utilized, including how limb lengths are measured and where reference lines are placed. Ultimately, the expectation is that these data will be expanded over time and the wealth of information HR-pQCT provides becomes increasingly interpretable and utilized.