Introduction

Quantifying muscle mass is particularly relevant not only to assess and manage muscle atrophy during aging (Narici et al. 2003; Morse et al. 2005a) but also to determine the impact of muscle quantity changes on muscle performance. Nuclear magnetic resonance (NMR) imaging is acknowledged as one of the best methods to assess muscle volume (MV). MV is generally computed from several contiguous muscle anatomical cross-sectional areas (ACSAs), depending on the measurement precision required. An estimate of strength efficiency may also be computed as maximal strength divided by muscle ACSA and is often used as an indicator of muscle quality (Delmonico et al. 2009).

In order to reduce the time and the cost of image processing, the authors have proposed to estimate MV from a single slice. Indeed, complete segmentation of all the muscles of an entire thigh may take between 3 and 4 h. Morse et al. (2007) showed that only one quadriceps femoris muscle (QUAD) ACSA taken at 60 % of the femur length (distal to proximal) could estimate QUAD volume with a standard error of estimate of 9.9 ± 5.7 %. These authors used a third-order polynomial regression model to estimate the ACSA at 60 % that was only validated in young men and confined to the QUAD. Cotofana et al. (2010) showed that the best correlation coefficients between ACSA and MV calculated at different femur levels varied according to the muscle studied. Best correlations were found at 20–40 % (proximal to distal level) for the extensor muscles, 70 % for the flexor muscles, 30 % for the adductor muscles, and 70 % for the sartorius muscle. These authors mentioned that the best correlation was found nearly at the maximal ACSA of each muscle, keeping in mind that the different thigh muscles do not have their maximal ACSA at mid-length. In both of these studies, the population tested was very restricted (18 young men and 41 perimenopausal women respectively) and probably not representative of the whole healthy population. Moreover, only results on muscle groups and not on individual muscles were provided.

Previous reports have observed an accumulation of fat within muscles during aging (Forsberg et al. 1991; Frantzell and Ingelmark 1951; Hasson et al. 2011), which also results in a loss of functional muscle mass. Using MRI, Kent-Braun et al. (2000) have reported a similar trend of intramuscular fat (IMF) accumulation in the ankle dorsiflexor muscles with age in both genders. Hasson et al. (2011) have recently shown that IMF within the dorsi- and plantar-flexor muscles was muscle-dependent and distributed nonlinearly along the leg. These authors described that absolute amounts of adipose tissue infiltration within the muscle were constant along the muscle length with consequently the lowest proportion near the maximal ACSA.

Emerging evidence suggests that IMF accumulation is associated with muscle atrophy, increased risk of physical limitation, or poor habitual physical activity level (Visser et al. 2002; Hasson et al. 2011; Leskinen et al. 2009) and that it is relevant to investigate lipid metabolism in skeletal muscle (Stein and Wade 2005). This is why estimating IMF accumulation recently gained attention. Although the impact and mechanism of IMF in sarcopenia or dynapenia are still unclear, it follows that IMF infiltration in the muscle has direct consequences in the estimation of muscle quantity and muscle quality.

The objectives of this study were (1) to assess MV and IMF for all the muscles of the thigh in a large population of healthy individuals, (2) to test the effect of gender and age on MV and IMF estimates, (3) to determine the best representative slice for the estimation of MV and IMF for the individual muscles of the thigh, the corresponding muscle groups and the whole thigh, and (4) to establish and validate prediction equations to estimate MV and IMF from this single slice.

Methods

Participants

One hundred five healthy French volunteers (35 young subjects aged between 20 and 30 and 70 older subjects aged between 70 and 80) participated in this study as subjects for the MyoAge project (McPhee et al. 2013). Eligible participants were free from contraindications for MRI; neurologic, cardiovascular, or muscle disease; and medications that may affect the study variables. Subjects signed a written informed consent after confirmed eligibility. This research was approved by the local ethics committee (CPP Paris-Ile de France VI–IB number: 2010-A006114-35). Five subjects did not accept to undergo the full MRI protocol due to the length of the exam or claustrophobic feeling. One hundred subjects were thus analyzed. This population was then split into two groups: 72 subjects were used to establish predictive equations for MV and 71 for IMF (one subject was discarded due to the presence of artefacts during the reconstruction process). Twenty-eight subjects aged between 70 and 80 were used as supplementary subjects to test the validity to estimate MV and IMF from a single slice.

Image acquisition

Bilateral MRI scans of the thighs (from iliac crests to the articular surface of the tibia) were acquired on each subject placed supine in a 3.0 Tesla MRI scanner (Tim Trio, Siemens Healthcare, Erlangen, Germany). In order to avoid movements and minimize compression of the legs, their heels were fixed on a non-metallic support. Muscle ACSA were measured on 5-mm-thick contiguous axial out-of-phase images using a three-dimension three-point Dixon gradient echo sequence (Glover and Schneider 1991) (TE = 2.75/3.95/5.15 ms, one excitation, 448 × 224 × 64 matrix, 448 × 224 × 320 mm3 field of view) to differentiate lean and fat mass. A 3° flip angle and a 10-ms repetition time were used to obtain a proton density weighting. All sequences were performed with a quadrature bird cage body coil for transmission and sets of phased-array receiver coils surrounded the thigh, with 3 × 6 flexible coils covering the segment and 3 × 6 coils embedded in the patient table. Neither acceleration nor parallel imaging techniques were used. Image analysis software (Radionet v.2.2.10., Scito, Paris, France) was used for manual segmentation of the thigh muscles. Image acquisition lasted around 6 min for the Dixon sequence.

Dixon techniques rely on the water/fat chemical shift difference and consequently on their phase difference in the NMR signal to separate them with post-processing. The three-point Dixon technique acquires three images with three selected echo time such that the fat and water are, respectively, in phase, opposed phase, and in phase (it can be also opposed phase, in phase, and opposed phase). Contrarily to the two-point Dixon and the extended two-point Dixon methods, the three-point Dixon method brings in better robustness since it deals with B0 inhomogeneities that are assumed to be directly proportional to the phase difference of images acquired at the first and third echo times. Once B0 inhomogeneities are calculated, fat and water signals are obtained through simple linear system solving. It is important to point out that an accurate estimate of B0 inhomogeneities is fundamental for correct fat and water separation and to prevent fat and water swaps. This can be achieved through phase unwrapping techniques such as the one presented by Abdul-Rahman et al. (2007) that was considered in this work to unwrap the phase difference between the first and third echo time.

MRI measurements

Depending on the stature of the subjects, about 98 MRI slices (±7 slices) were analyzed to assess all the individual thigh muscles, which, in turn, takes generally 3 to 4 h. Muscles were manually segmented as presented in Fig. 1. Muscle volume was computed using the cylinder method as given by:

$$ \mathrm{M}\mathrm{V}={\displaystyle \sum_{i=1}^{n-1}h\frac{\left(\mathrm{ACSA}i+\mathrm{ACSA}\left(i+1\right)\right)}{2}} $$

where n is the number of slices used and h is the inter-slice distance.

Fig. 1
figure 1

Illustration of segmented MRI ACSAs of the thigh at a 20 %, b 40 %, c 50 %, d 60 %, and e 80 % femur length (from distal to proximal). Group muscles are color-coded: QUAD = white; HAMST = red; ADDUC = yellow; SA = green. Note the exclusion of non-muscular elements (blood vessels, nerves) in the manual segmentation

To avoid the influence of subject height as a possible confounding variable, MV was normalized by the femur length. MVn is noted in the following and expressed in cm2.

The muscles selected for segmentation were the four heads of the QUAD (rectus femoris (RF), vastus lateralis (VL), vastus intermedius (VI), and vastus medialis (VM)), the four muscles composing the hamstrings muscles (HAMST) (biceps femoris short head (BFB), biceps femoris long head (BFL), semitendinosus (ST) and semimembranosus (SM)), adductors (ADDUC: adductor longus, brevis, magnus and pectineus together (AD), and gracilis (GR)), and the sartorius (SA). Because delineations between adductors muscles were not so obvious at proximal part, all adductors muscles were segmented together, except for the GR which was easily recognizable (Fig. 1). Consequently, by summing all the muscle ACSAs at a precise level along the femur length, an estimate of the whole thigh ACSA was also obtained. Visible fat at the muscle periphery, aponeurosis, blood vessels, nerves, and femur bone were excluded as much as possible.

The muscle ACSAs selected for correlation analyses were chosen at 20, 30, 40, 50, 60, 70, and 80 % of femur length (the distal point was taken as 0 %). Femur length was measured as the distance between the bottom of the lateral condyle of the femur and the top of the greater trochanter. Considering these landmarks, some muscles may thus terminate proximally at more than 100 %.

IMF was computed as already described (Glover and Schneider 1991) in all the individual muscles using the same inter-slice distance as for MV. Its estimate represents the fraction of NMR signal that is attributable to fat.

Statistical analyses

Statistical analyses were performed using SPSS (version 19.0). The effects of gender and age on MV and IMF and interaction between both factors were assessed using a two-factor ANOVA. Spearman rho was computed to test the relationship between MV and IMF and between BMI and IMF. Linear regressions between ACSA and MV and between IMF computed on all the slices and a single slice were carried out to determine their degree of association and, therefore, the best level to estimate MV and IMF from a single slice, respectively. Estimated MV and IMF were obtained by using linear regression equations. This was done for each muscle, each muscle group, and the whole thigh. Number of subjects, adjusted R 2, and standard error of estimate (SEE) were used to refine the choice of the slice level. A population of 28 supplementary subjects aged between 70 and 80 was used to validate the prediction equations for the quadriceps femoris muscles for both MV and IMF. SEE was used to quantify the fitting error in the supplementary subjects. The level of statistical significance was set to 0.05.

Results

Participant characteristics

Participant characteristics stratified by sex and age groups are given in Table 1 (n = 72). Women presented with significantly smaller height, weight, and BMI than men (all P < 0.001). Older subjects exhibited smaller height and higher BMI than their younger counterparts (all P < 0.001). However, their weight was not significantly different. BMI was much more increased in men than in women with aging (interaction between age and gender, P = 0.005).

Table 1 Characteristics of the control population

Effect of age and gender on muscle volume and intramuscular fat

Figure 2 illustrates a typical example of the effect of aging on thigh muscles where it can be clearly seen in both the atrophic process and the intramuscular fat infiltration. Results showed that all the thigh muscles taken individually were significantly smaller in older subjects compared to the youngest in both genders (all P < 0.001) (Table 2). Men also exhibited significantly higher MV than women for all the muscles. Statistical analyses also indicated significantly higher IMF percentages (between nearly 1.5 to 2.5-fold increase) in each thigh muscle in older subjects compared to the youngest for both genders (all P < 0.001) (Table 3). IMF was significantly correlated with BMI for all individual muscles or muscles groups (rho ranging between 0.324 and 0.567; all P < 0.01). Although relative changes in fat infiltration were generally higher in men, there was not significant age–gender interaction.

Fig. 2
figure 2

Comparison of typical axial MRI slices obtained from a young active woman (a aged 28 years) and an older active woman (b aged 80 years) at 40 % femur length. Note that accumulation of intramuscular fat worsens significantly with aging (along with extramuscular fat)

Table 2 Effect of age and gender on muscle volume of the individual muscles of the thigh
Table 3 Effect of age and gender on intramuscular fat of the individual muscles of the thigh

Individual muscles and muscle groups presented various percentages of atrophy and intramuscular fat. Comparing older and young subjects, the most atrophied muscles were the muscles of the quadriceps femoris, VM apart (between −37 and −25 % in both genders), and the gracilis muscle (−31 % for men and −25 % for women). Concerning the hamstrings, ST was the most atrophied in men (−28 %), while SM was the most atrophied in women (−24 %).

In young subjects, the most fat-infiltrated muscles were the BFB, GR, and SA in both genders. The largest relative changes were observed for VI, ST, and SM in men and for BFL and ST in women. The BFB was the most preserved muscle with respect to both muscle volume and fat infiltration with aging.

For each individual muscle or muscle group, a significant correlation between MVn and IMF was found (Table 4).

Table 4 Correlation between MVn and IMF for each muscle, muscle group, and whole thigh

Muscle volume and intramuscular fat estimates from a single slice

A typical example of ACSA profiles for each muscle and each muscle group is illustrated in Fig. 3. An estimate of MV and IMF was computed in ACSA of the thigh at 20, 30, 40, 50, 60, 70, and 80 % of femur length. The correlations between MV (respectively IMF) estimated on all the slices and MV (respectively IMF) estimated from a single slice were dependent on the femur level (Supplementary Tables 1 and 2). The best linear fit was not always found at the same femur level for both MV and IMF, depending on the muscle. Taking into account all of the linear fitting data (merging data from Supplementary Tables 1 and 2), the best estimate of both MV and IMF from a single slice occurred at 50 % for the muscles of the QUAD, 40 % for the muscles of the HAMST, and 60 % for the muscles of the ADDUC. At these thigh levels, Table 5 provides the prediction equations for MVn (in cm2) and IMF (in %) for each muscle, each muscle group and the entire thigh.

Fig. 3
figure 3

ACSA profiles. a Evolution of the anatomical cross-sectional areas (ACSAs) of the quadriceps muscles and of the whole quadriceps at levels of percentage of the femur length (from distal to proximal). b Evolution of the ACSAs of the hamstrings muscles and of the whole hamstrings at levels of percentage of the femur length. c Evolution of the ACSAs of the adductors muscles and sartorius at levels of percentage of the femur length

Table 5 Prediction equations for MV and IMF from the optimal slice. Muscle volume can be deduced by the following equation: MV (cm3) = femur length x (intercept + ACSA (cm2) × slope). Intramuscular fat within the whole muscle can be computed as: IMF (%) = intercept + IMF (% within the slice) x slope

Validation of predictive equations on supplementary subjects

The prediction equations for MVn and IMF were applied to a group of 28 supplementary subjects for RF, VI, VL, VM, and QUAD. SEE was close to the SEE values observed for the control population both for MV (RF: 0.59, VI: 0.68, VL: 0.62, VM: 0.86, QUAD: 1.26) and IMF (RF: 0.73, VI: 0.44, VL: 0.52, VM: 0.30, QUAD: 0.30) (see Supplementary Tables 1 and 2). This result proves that the prediction equations for MV and IMF were also robust for another population than the control population.

Discussion

The present study is the first to provide a complete analysis of individual muscles of the thigh in terms of muscle volume and intramuscular fat infiltration in young and elderly healthy subjects. This paper gives reference values for each individual muscle of the thigh and consequently to the muscle groups and the whole thigh. It also provides prediction equations to estimate MV and IMF from a single MRI slice.

Our results suggest that age-related loss of thigh muscle mass and increase in intramuscular fat is muscle-dependent, probably due to the fiber-type difference or the type of physical activity which may more or less involve the different thigh muscles depending on the action performed. However, one must keep in mind that our study was cross-sectional. As in previous studies (Macaluso et al. 2002; Maden-Wilkinson et al. 2013; Ogawa et al. 2012), we found that the proportion of QUAD muscle in the total thigh was reduced with aging because QUAD is more susceptible to muscle atrophy with aging than the other muscle groups. Trappe et al. (2001) suggested that individual muscles of the quadriceps were similarly affected by aging; however, our data support another report that shows the RF was most affected by aging (Maden-Wilkinson et al. 2013), while the VM was better preserved. Atrophy of the QUAD was substantial both in men and women (26 and 27 %, respectively). Data on other muscles are very scarce and this study provides first data on all the individual muscles of the thigh.

Nilwik et al. (2013) found a similar QUAD ACSA as in the present study, but a smaller difference (14 %) between young and old subjects. These differences may be explained by differences in muscle segmentation since we excluded inter-muscular fat in the segmentation process. ACSAs for QUAD and for entire thigh estimated at 50 % were similar to the ACSA at mid-thigh observed in previous studies (Ogawa et al. 2012; Maden-Wilkinson et al. 2013). Atrophy of quadriceps femoris with aging is mainly attributed to reduction in type II muscle fiber size (Nilwik et al. 2013), probably linked to a lack of repairing ability due to a reduced satellite cell content in type II fibers (Verdijk et al. 2007).

Kent-Braun et al. (2000) have selected a single slice corresponding to the largest ACSA using MRI to estimate the percentage of non-contractile area of the ankle dorsiflexor muscles. Non-contractile area, which can be assumed to be mainly fat, was observed to be increased by 132 % in men and 160 % in women with aging. In the muscles of the thigh, intramuscular fat infiltration was found to be highly significant with aging, but variable depending on the muscle. We found similar IMF differences for QUAD (+72 %) and HAMST (+116 %) between young and older men as the ones reported by Overend et al.(1992). Interestingly, IMF was correlated to BMI. Thus, a significant amount of BMI may be explained by IMF, which suggests that IMF plays a non-negligible role in BMI increase in men.

To our knowledge, this is the first study which tests if intramuscular fat estimated from a single MRI slice is representative of fat infiltration computed over entire muscles or muscle groups. Depending on the muscles or muscle groups studied, the distal-to-proximal level of the slice to be analyzed varied. The optimal slice for each muscle was roughly situated where it was the largest, as already observed (Morse et al. 2007; Cotofana et al. 2010). For the entire thigh, the best estimate for MV and IMF was obtained at 50 % femur length. Therefore, the best way to estimate muscle volume atrophy and intramuscular fat infiltration of the thigh with a single MRI slice consists in segmenting all thigh muscles at the mid-femur level slice which takes less than 10 min. Femur length has also to be estimated since predictive equations for MV used this variable as a normalization factor.

It is important to note here that the three-point Dixon presents several limitations. First, the echo times that were used are not optimal as explained in (Reeder et al. 2005). In order to optimize SNR within the images, the authors recommended a set of asymmetric echo times as well as an iterative algorithm called IDEAL to compute fat and water signals. In spite of his good performances in terms of SNR, this algorithm assumes very low B0 inhomogeneities and converges to local minimum when this condition is not verified which resulted also in a bad fat and water separation. Many solutions were proposed to tackle this problem (Hernando et al. 2008; Hernando et al. 2010; Sharma et al. 2012), but all of them work only for 2D images with a high computation time and they are not available on clinical machines. Furthermore, in the case of our application, the fat content is very low compared to the water one which means that the impact of the selection of echo time on the SNR is minor as shown in (Reeder et al. 2005).

The second limitation of the standard three-point Dixon approach is the oversimplified fat modeling that ignores the actual spectrum of lipids. In fact, this technique underestimates the fat signal because it accounts only for the methylene component in the triglyceride molecules that account for most of body fat. In the studies that introduced more accurate fat spectral models (Hines et al. 2009; Reeder et al. 2009), the authors showed linear relationship between multi- and single peaks fat models using in vivo data and phantoms. Using simulations, it was also shown that a linear relation exists between actual fat signal and the estimated one using three-point Dixon technique (Azzabou et al. 2015). From these studies, one sees that standard Dixon techniques provide fat values that are directly proportional to the actual amounts of fat.

The present study presents other limitations. First, the subjects were all healthy, active, and socially involved and the older group were therefore a homogeneous group of robust, older individuals. Muscle volume, intramuscular fat, and other variables may be much more altered when considering diseased or truly sedentary older people. Our data are therefore indicative of healthy aging. Second, the results of this study rely on the hypothesis that thigh muscles are homogeneous in terms of fat infiltration, which may not be the case in some disorders (for instance in type 2 diabetes (Karampinos et al. 2012)). Third, the design of the study was cross-sectional which limits the power of the conclusions that can be inferred due to the possible social and secular factors that might influence the direct comparison of older and young subjects. Finally, there are subtle inter-individual variations in muscle shape that may lead to an over- or under-estimate of muscle volume from a single slice measurement.

In conclusion, the present study provides novel quantitative data for all individual thigh muscles in terms of muscle volume and fat infiltration using MRI. The results suggest that a single MRI slice can be representative of a whole muscle, muscle group, or thigh to estimate muscle volume and intramuscular fat infiltration. This provides a simple and practical way to assess muscle quantity and quality, which can also have applications in diseases such as cancer (Argiles et al. 2006), HIV infection (Roubenoff 2000), chronic heart failure (Piepoli et al. 2006) or chronic obstructive pulmonary disease (Vermeeren et al. 2006), bed rest (Kawakami et al. 2000; Kouzaki et al. 2007), or training (Morse et al. 2005b; D’Antona et al. 2006). Finally, because IMF is increased with aging, we suggest considering intramuscular fat infiltration when estimating muscle volume to assess sarcopenia; otherwise, this may underestimate the loss of muscle mass with aging. More work is also needed to determine the reasons why aging leads to a shift in metabolic processes to increase the deposition of fat in muscle and why some muscles are more affected than others.