Abstract
Objective
To quantify individual muscle volume in rat leg MR images using a fully automatic multi-atlas-based segmentation method.
Materials and methods
We optimized a multi-atlas-based segmentation method to take into account the voxel anisotropy of numbers of MRI acquisition protocols. We mainly tested an image upsampling process along Z and a constraint on the nonlinear deformation in the XY plane. We also evaluated a weighted vote procedure and an original implementation of an artificial atlas addition. Using this approach, we measured gastrocnemius and plantaris muscle volumes and compared the results with manual segmentation. The method reliability for volume quantification was evaluated using the relative overlap index.
Results
The most accurate segmentation was obtained using a nonlinear registration constrained in the XY plane by zeroing the Z component of the displacement and a weighted vote procedure for both muscles regardless of the number of atlases. The performance of the automatic segmentation and the corresponding volume quantification outperformed the interoperator variability using a minimum of three original atlases.
Conclusion
We demonstrated the reliability of a multi-atlas segmentation approach for the automatic segmentation and volume quantification of individual muscles in rat leg and found that constraining the registration in plane significantly improved the results.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
Muscle quality is commonly defined as the ratio between muscle strength or power and muscle mass [1–3]. Muscle mass can significantly change in a variety of conditions such as growth [4, 5], aging [6, 7] or training [8]. It can also decrease in pathological conditions including neuromuscular disorders, cardiovascular diseases and diabetes [1, 9]. In this context, muscle size has to be accurately quantified in order to properly investigate the corresponding functional changes. Moreover, given that these changes can be muscle-specific [10–13], the corresponding quantification must ideally be performed individually for each muscle.
Magnetic resonance imaging (MRI) has been recognized as the method of choice because of its noninvasive nature and capacity to distinguish fat, connective and muscle tissue. Considering the high contrast difference between muscle and fat signals, several automatic methods based on the signal intensity threshold have been successfully developed in order to quantify fat-free muscle area [14, 15]. However, muscle size has been considered as a whole so that individual muscle changes such as those occurring in aging [11, 16] or neuromuscular disorders [12] have not been investigated. The automatic segmentation of individual muscles within a limb or within the same muscle group is technically impossible using the same approach considering the poor contrast between connective and muscle tissues. In this context, the gold standard method is based on the manual delineation of regions of interest on contiguous individual slices, which is tedious and time-consuming and has questionable reproducibility. The corresponding results are largely operator-dependent, and it has been reported that measurement of a specific muscle volume using MRI with manual delineation in the forearm would not detect changes below 7 % in healthy adults [17]. In humans, a few alternative methods allowing a selective segmentation of muscles have been proposed in the last few years. A ‘pseudo muscle volume’ has been quantified on the basis of the manual segmentation of a limited number of slices by interpolating the missing data using geometrical [18] or mathematical models [5, 7]. Although these methods can reduce the processing time, they still involve extensive manual work so that a robust automatic method is highly desirable. However, automatic segmentation of individual muscles from MR images is technically challenging given that individual muscles share the same MR properties and that fasciae display very poor contrast. So far, very few methods targeting automatic segmentation of individual muscles have been reported, and to our knowledge none of them have been applied to small animal models despite their increasing use in the investigation of functional changes in skeletal muscle in various healthy and pathological conditions. The fully automatic method based on mesh modeling, which has been proposed recently, still requires integration of manual constraints [19]. Other methods based on the utilization of surface landmarks [20] or principal component analysis [21] have provided promising results, but no information related to the volume error has been reported. Interestingly, tissue segmentation methods based on an atlas database have been successfully used in brain, bee brain, heart and prostate as wellas whole body fat/water separated MR images [22–27]. In this context, the specific hurdles related to individual muscle segmentation are mainly related to the high voxel anisotropy, i.e., a different spatial resolution in at least one of the three dimensions.
The purpose of the present work was to develop an original segmentation method based on a multi-atlas database and allowing a complete automatic quantification of individual muscles in rat leg.
Materials and methods
Animals
Sixteen Wistar Han rats (2.5 months old, 8 males/8 females) were investigated according to the guidelines of the National Research Council Guide for the Care and Use of Laboratory Animals and the French law on the Protection of Animals. Animals were initially anesthetized in an induction chamber with 4 % isoflurane (Forene®; Abbott France, Rungis, France) mixed in 33 % O2 (0.4 l/min) and 66 % N2O (0.8 l/min) and then positioned supine in a home-built cradle designed to be accommodated within the MR superconducting magnet to investigate functional changes throughout a muscle exercise session [28].
MRI experiment
MRI of the right leg was performed using a fast spin echo sequence (RARE) at 4.7 T on a Biospec 47/30 superconductor magnet (Bruker, Germany). Eighteen images were recorded in order to cover the entire length of the leg from the knee to ankle using the following parameters: rare factor = 6; repetition time = 2000 ms; echo time = 16 ms; field of view = 30 × 32 mm2, matrix size = 256 × 256 pixels; slice thickness = 1 mm, interslice space = 0.5 mm; slice orientation = axial; 8 accumulations for a total acquisition time of 8 min 32 s. Throughout the procedure rats were continuously anesthetized by gas inhalation of 2 % isoflurane mixed in 33 % O2 and 66 % N2O using a home-designed facemask. Body temperature was maintained at 37 °C using an electric blanket controlled by a temperature unit (Harvard Apparatus, MA, USA) connected to a home-made rectal probe [28].
Image analysis
Each set of images was both manually and automatically segmented.
Manual segmentation
Gastrocnemius and plantaris manual delineation was performed on each slice and for each data set (subject) using the FSL software (FMRIB Software Library, Analysis Group, FMRIB, Oxford, UK). The corresponding results were used as the gold standard measurements and were referred to the reference segmentation database (atlases). In order to assess the reproducibility of this process, manual delineation was performed twice by the same operator and once by two different operators, and the corresponding global muscle volume was compared between each process. The first version of the first operator manual segmentations was used as the ground truth segmentation of the atlas, resulting in 18 ground truth atlas images.
Automatic segmentation-general pipeline
The present automatic segmentation method has been tested using either one (Fig. 1) or multiple atlases (Fig. 2) for both muscles.
As indicated in Fig. 1, the set of images to be segmented (i.e., subject images) was initially corrected for bias inhomogeneities as previously described [29], and the background was removed [30]. Then, the images were successively registered to each atlas (one or multiple) using an affine transformation (12 degrees of freedom) [31] followed by a nonlinear procedure [32]. Each step of this standardized procedure was optimized in order to account for the high voxel anisotropy (see below). When using N atlases (Fig. 2), the initial segmentation process provided N intermediate segmentation maps, which were combined using a vote procedure in order to provide the final segmentation map.
Each step of the segmentation process was optimized throughout different options of the nonlinear registration (more specifically the model of deformation used), the image upsampling, vote procedure and artificial atlas addition.
Automatic segmentation: specific implementations
Nonlinear registration
The standard full 3D registration method (3D) previously described by Sdika [32] was modified using a slice-by-slice nonlinear registration (SBS) or using an additional constraint, i.e., the deformation along Z was zeroed (2Dc), both constraining the deformation in the XY plan. Acquisition and post-processing constraints were related to the fact that the image resolution was larger in the XY plane as compared to the Z direction, thereby compromising the interpolation process in the Z direction. Both methods assume that the linear registration is reliable for a proper slice alignment and that nonlinear deformation does not occur along Z. These assumptions seem reasonable in our context given that structures in the limb are mostly tubular and do not usually present any drastic morphological changes along the limb length.
Image upsampling
Alternatively, we addressed the voxel anisotropy issue by upsampling the image along Z before the registration processes. This process aimed at improving the resolution in the Z direction and thus the interpolation process. In this case, images were upsampled by a factor of three using a windowed cardinal sine kernel. This rescaling process has been combined with each of the previously described nonlinear registration models, i.e., with the standard 3D (R-3D), 2D constrained (R-2Dc) and slice-by-slice (R-SBS) models.
Vote procedure
The final segmentation of a given subject was obtained by combining the N segmentation maps produced by each atlas using a vote procedure. The final label was the one with the largest number of votes (Fig. 2). A simple majority vote (mv) and a vote weighted by the registration residual (wv) as previously described [24, 33] have been implemented and evaluated.
The weighting function used was:
where r is the residual of the registration at a given pixel, \(G_{{\sigma_{\text{s}} }}\) is a Gaussian function, * is the convolution operator, and σ s and σ r are parameters to tune the weighting.
Artificial atlas addition
Given that the quality of a multi-atlas segmentation procedure is an increasing function of the number of atlases used, we tested an original approach of artificial atlas creation by shifting the original atlas slices up to ± one (z1) (i.e., slice one became slice two, …) or ± two (z2) slices just before the nonlinear registration. As a result two (with z1) or four (with z2) artificial atlases were added to the database for each original atlas. This translation should compensate for linear registration errors along Z, especially when the nonlinear registration is 2Dc or SBS.
Automatic segmentation evaluation
The different settings proposed in this study were evaluated using a leave-one-out procedure, i.e., each atlas was sequentially used as the subject image and automatically segmented using each of the other subjects as atlases. A final segmentation was obtained from each possibility, and the results presented are related to the corresponding average. The standard deviations were not included in the figures because they were too small.
The segmentation quality for each method was quantified using the relative overlap (RO) defined as:
where \(L_{\text{g}}^{\text{s}}\) is the mask of the structure s (gastrocnemius or plantaris muscles) in the ground truth (i.e., manual segmentation) and \(L_{\text{a}}^{\text{s}}\) is the mask of the structure s in the automated segmentation. V is the volume of voxels (in mm3) inside the binary mask, i.e., the number of voxels multiplied by the voxel size (0.12 × 0.12 × 1.5 mm). As a result, a high RO is indicative of a high spatial overlap of the two given segmentations. We also calculated the RO values between the manual delineations performed by the same operator (intra) and two different operators (inter) in order to evaluate the accuracy of the present automatic methods with respect to with the accuracy of the reference method.
Muscle volume quantification
Gastrocnemius and Plantaris muscle individual volumes were quantified from both the manual delineation (reference measurements) and fully automatic segmentation results using the setup resulting in the highest RO values. The relative corresponding error was calculated as the volume difference between the two segmentations and expressed in %.
Statistical analysis
The intra- and interoperator variability of the manual segmentation was quantified from the global volume measurements using the coefficient of variation (CV) and intraclass correlation coefficient (ICC) for both gastrocnemius and plantaris muscles specifically.
Results
Representative transverse images of the rat leg (Fig. 3a) and the corresponding manual (Fig. 3b) and fully automatic segmentations (Fig. 3c) of the gastrocnemius and plantaris muscles are presented in Fig. 3.
Reproducibility of the reference measurements
Manual delineation of both muscles by a trained expert required a minimum of 15 min per subject (i.e., 18 slices) and has been performed twice by the same operator and an additional time by a second operator. The corresponding volumes for each subject are presented in Table 1. The intraoperator coefficient of variation (CV) was 1.1 ± 0.8 % and 5.1 ± 4.2 % for the gastrocnemius and the plantaris muscles, respectively. The corresponding ICC was 0.99 (gastrocnemius) and 0.87 (plantaris). While measurements performed by two different operators provided comparable results for the gastrocnemius muscle as indicated by the small CV (2.3 ± 1.4 %) and high ICC (0.97), the corresponding measurements for the plantaris muscle were found to be less reproducible, with a CV of 18.7 ± 7.3 % and a poor ICC (0.41).
Accuracy of the automatic methods
We calculated the relative overlap index (RO) in order to evaluate the performance of each automatic procedure for both gastrocnemius and plantaris muscle segmentations and for different numbers of atlases, from 1 to 14. Processing time to complete both muscle segmentations for one subject ranged from 10 to 15 min, depending on the initial settings when one atlas was used. This time increased linearly with respect to the number of atlases (e.g., a minimum of 1 h was required to perform the same segmentation with 6 atlases). For the sake of comparison, the RO calculated for the manual segmentations performed by the same operator and two different operators were 88.3 and 83.2 for the gastrocnemius and 81.5 and 68.1 for the plantaris muscle segmentations.
Vote procedure
Table 2 presents the RO values obtained using the different nonlinear registration models (i.e., SBS, 2Dc and 3D) combined with either the majority or weighted vote with respect to the number of atlases for both muscle segmentation. Our results showed that, compared to the simple majority vote, the weighted vote improved the RO when the images were registered using the 2Dc or the SBS model and so regardless of the number of atlases used or the muscle segmented. However, the corresponding improvement was globally larger for the plantaris (3.3 ± 2.6 % and 2.2 ± 1.8 % using the 2Dc and SBS models, respectively) as compared to the gastrocnemius (1.1 ± 0.4 % and 0.6 ± 0.2 %) muscle. Using the 3D model, no effect of the vote procedure was observed for the gastrocnemius segmentation while the RO was globally slightly improved (by 1.2 ± 1.5 %) for the plantaris segmentation by the weighted vote procedure. According to these results and for the sake of clarity, only the weighted vote was used for the remaining comparisons.
Images upsampling
We found a beneficial effect of the upsampling along the Z direction when the images were registered with the 3D model. Using this specific setting, the upsampling improved the RO associated with the gastrocnemius (0.6 ± 0.03 %) and plantaris (1.5 ± 0.07 %) muscle segmentation using 1–14 atlases. This indicates that the image anisotropy is a limiting factor for automatic segmentation using atlases and that the upsampling method can take this issue into account. When only an in-plane registration was performed, i.e., using the 2Dc and SBS models, this beneficial effect no longer existed. More specifically, combined with the SBS model, this procedure resulted in a decreased RO by 0.3 ± 0.06 % for both muscles regardless of the number of atlases used. Combined with the 2Dc model, we found a slight improvement for the gastrocnemius segmentation (RO improved by 0.3 ± 0.09 %), while this setting was globally detrimental for the plantaris muscle (−0.1 ± 0.11 %). Accordingly, we discarded the R-SBS and R-2Dc of the nonlinear registration model comparison.
Nonlinear registration models
As illustrated by the higher RO values, our results clearly showed a higher reliability of the automatic segmentation for the gastrocnemius muscle as compared to the plantaris regardless of the initial setting. However, the accuracy levels differed according to the registration methods independently of the muscle segmented (Fig. 4).
According to the RO values, we found a lower performance of the SBS model compared to the other nonlinear registration models in our experimental conditions. More specifically, while RO increased from 80.2, 80.8 and 80.2 to 86.7, 87.3 and 87.4 with respect to the number of atlases using the 3D, R-3D and 2Dc models, respectively, for the gastrocnemius muscle, the corresponding values using the SBS model were significantly reduced to 75.8 and 84.5 % (Fig. 4a). The same difference was observed for the plantaris segmentation (Fig. 4b). For this specific muscle, the RO values ranged from 64.5, 65.9 and 66.1 to 73.2, 74.8 and 76.8 when the images were registered using the 3D, R-3D and 2Dc models, respectively, and from 57.7 to 70.7 using the SBS model (Fig. 4b). Finally, considering both gastrocnemius and plantaris muscle segmentation, the 2Dc model outperformed the other models in our experimental conditions.
Overall, with the exception of the SBS model, our results showed that a minimum of three atlases was necessary to automatically segment either muscle with a level of accuracy similar to the reference method when two operators were involved. However, despite an increased RO with respect to the number of atlases, none of the automatic methods reached the intraoperator threshold whatever the muscle (Fig. 4). In addition, our results showed that the corresponding improvement was significantly reduced beyond size atlases; from 1 to 6 atlases the RO was improved by 6.4, 6.3, 7.0 and 9.0 % for the gastrocnemius and by 10.0, 10.0, 11.6 and 16.1 % for the plantaris segmentations using the 3D, R-3D, 2Dc and SBS models, respectively, whereas the addition of eight more atlases resulted in a further RO improvement lower than 1.5 and 3 % for the gastrocnemius and plantaris segmentation, respectively (Fig. 4).
Artificial atlas addition
According to the previous results, we evaluated the effect of the addition of artificial atlases on the automatic segmentation quality using the weighted vote combined with the 2Dc nonlinear registration model. As illustrated in Fig. 5, the addition of artificial atlases translated by ± one (z1) or ± two slices (z2) improved the RO when a low number of original atlases was used regardless of the muscle segmented. However, the z1 setup resulted in a larger improvement compared to z2 for both muscles (Fig. 5). In addition, using the ‘z2’ atlases we observed a degradation of the RO when more than 4 and 8 original atlases were used, while the ‘z1’ atlases improved the RO up to 8 and 14 original atlases for the gastrocnemius (Fig. 5a) and plantaris (Fig. 5b) segmentation, respectively, compared to the results obtained without (i.e., z0).
With respect to computation, the shift of up to k slices of the atlases and the addition to the vote data set multiply the number of atlases as well as the computation time by 2k + 1. So, when z1 was used, the computation time was three times greater than when only the original atlases were used.
Muscle volume quantification
The relative volume error (expressed in % of the reference measurement) resulting from the automatic segmentation of both gastrocnemius and plantaris muscles with respect to the number of original atlases and using the following settings: the 2Dc nonlinear registration and weighted vote using (2Dc-z1) or not (2Dc) artificial atlases are displayed in Table 3. The corresponding absolute volumes in mm3 when using six atlases as well as the percent error are presented in Table 4. We observed a systematic overestimation of the gastrocnemius muscle when the segmentation was performed automatically compared to the reference measurements (Table 4). Regarding the plantaris muscle, the absolute volume errors ranged from 0 to 19.5 % and 0.7 to 18.9 % using 2Dc or 2Dc-z1 procedures. However, despite this larger variability, the average volume calculated from the automatic measurements did not significantly differ from the reference method as a result of individual volumes over and underestimations (Table 4). These errors are in agreement with the volume differences calculated from manual delineation performed by two different operators, i.e., 3.0 ± 2.2 % and 22.9 ± 8.6 % for the gastrocnemius and plantaris muscles, respectively, but did reach the intraoperator variability (1.6 ± 1.1 % and 6.8 ± 5.3 % for the gastrocnemius and plantaris, respectively).
Finally, it is worth noting that in contrast to the improvement we observed for the RO values, the addition of artificial atlases did not improve the accuracy of the volume estimation for both muscles.
Discussion
In this study, we described for the first time a multi-atlas-based segmentation method that can be efficiently used to automatically quantify individual muscle volumes within the limb from rat MR images. We have investigated the effects of a number of modifications to the standard method in terms of accuracy (determined by the RO index) with a particular attention given to the voxel anisotropy of the images and reliability of individual muscle volume quantification.
We showed that an automatic process including a 2Dc nonlinear registration model, weighted vote and minimum of three atlases provided a better RO value than the value obtained from a comparison between manual segmentations performed by two different operators. On that basis, we demonstrated that the accuracy of the automatic segmentation of both gastrocnemius and plantaris muscles outperformed the interobserver manual segmentation variability. This result highlighted the robustness of the method, which only needs a very limited amount of manual work, i.e., on three atlases. The corresponding global muscle volume error was 4.2 and 8.0 % for the gastrocnemius and plantaris muscles, respectively. Interestingly, this error was lower than the interoperator variability for the reference method illustrating the reliability of the present fully automatic method. Finally, we showed that the quality of the automatic segmentation was improved by the addition of artificial atlases especially when a low number of original atlases were used. Of interest, this improvement did not affect the global muscle volume quantification.
The reference segmentation of both gastrocnemius and plantaris muscles was performed three times manually for each slice and each data set, twice by the same operator and an additional time by a new operator. The measurement variability was determined from the global volume quantification using the CV and ICC coefficients. Our results clearly disclosed a muscle-dependent reproducibility with CV much larger (18.7 ± 7.3 %) and ICC (0.41) and RO (68.1) much lower for the plantaris as compared to the gastrocnemius muscle. This inconsistency is likely related to the muscle size given that in our context the plantaris muscle volume was more than six times smaller than that of the gastrocnemius regardless of sex. Such a difference has already been suggested in humans. A <1 % interoperator variability was reported for a large muscle group (i.e., quadriceps) [18], whereas a much larger variability has been reported for smaller muscle groups such as the flexor (3.3 %) and extensor carpi ulnari (5.7 %) muscles [17]. This difference can also be related to the unclear distinction of the surrounding plantaris muscle fascia, especially on the distal part of the leg, making the segmentation on the corresponding images highly operator-dependent. On this basis, it is important to keep in mind that the reproducibility of the manual delineation and as a matter of consequence the performance of the automatic segmentation method is closely linked to the ability to clearly distinguish individual muscles on the MR image, which is itself dependent on the inherent contrast and image resolution.
To our knowledge, no data related to the reproducibility of the manual delineation from small animal MR images have been reported so far. However, both intra- and interoperator CV (1.1 ± 0.8 % and 2.3 ± 1.4 %, respectively) and ICC (0.99 and 0.97, respectively) reported for the gastrocnemius muscle volume are in agreement with values previously reported in humans for different muscles [17, 18, 34]. Although frequently used for the segmentation of soft tissues [22, 24, 25, 33, 35], only one study so far has used a multi-atlas-based approach in order to automatically segment a single muscle [36]. More specifically, the authors reported an automatic method dedicated to the delineation of the pectoral muscle from breast MR images using a standard 3D registration procedure but did not target the muscle volume quantification. The accuracy of this automatic segmentation [36] illustrated by a dice similarity coefficient (DSC) of 0.74 ± 0.06 [i.e., an RO of 0.59, DSC, RO = DSC/(2-DSC)*100] was actually inferior to the accuracy of our results. In addition, further supporting our initial conclusion, these authors reported a low reproducibility of the pectoral muscle manual delineation (RO = 0.54) and indicated that this poor reproducibility might be linked to the difficult distinction among the pectoral muscle, surrounding tissues and intercostal muscles on MR images [36].
Our results clearly showed that, regarding voxel anisotropy, a nonlinear registration constrained in two dimensions (i.e., 2Dc) was more beneficial to the automatic segmentation quality than the standard 3D model, especially given that the image resolution did not allow a clear distinction of a small muscle like the plantaris muscle Indeed, in the context of an atlas-based segmentation, voxel anisotropy should result in a biased image interpolation along Z, thereby affecting the segmentation mask and deformation itself. Several factors can explain the superiority of the 2Dc registration as compared to the other models. In the 3D model, the parameters for the deformation are estimated in all three directions, whereas the 2Dc model restrained the deformation within the atlas slices where the resolution is the highest while the Z component is set to zero providing a de facto regularization beneficial to the registration. As a result, this model still takes advantage of the 3D nature of the data and is able to generate a more coherent and robust deformation during the registration as compared to the SBS model for which no regularization is imposed slice to slice. Following the same idea, image upsampling along Z before the registration was, as expected, beneficial when it was combined with the standard 3D model. This improvement clearly indicates that image resolution and more specifically voxel anisotropy are limiting factors for the segmentation process using atlas-based approaches. Interestingly, this limitation can be artificially reduced using an upsampling procedure, though at the expense of a larger processing time. Another possibility would be the acquisition of fully isotropic 3D data sets as previously described for the automatic quantification of fat-free muscle volumes [26, 27]. The ICCs between automatic and manual segmentation reported by these authors were above 0.89 for all muscle groups and showed that under those acquisition conditions age and BMI (from lean to obese) differences did not affect the multi-atlas method performance [27]. Interestingly, using full isotropic voxels, Karlson et al. [26] reported a similar accuracy regarding automatic muscle volume estimates as compared to the manual segmentation, and so despite an eight-fold resolution difference when data were acquired at 1.5 T (voxel dimensions = 3.5 × 3.5 × 3.5 mm3) or at 3 T (voxel dimensions = 1.75 × 1.75 × 1.75 mm3). Although this approach might provide a more accurate registration and, on that basis, an improved segmentation quality, it would be at the expense of both post-processing and acquisition times. In addition, the resulting gain in terms of volume quantification accuracy should only be marginal given that no drastic change in anatomical shape is expected between two consecutive slices. The improvement resulting from the upsampling procedure combined with the standard 3D model was actually not better than the automatic segmentation using the 2Dc model when considering both muscles together.
Based on the RO index, the 2Dc nonlinear registration also provided the most accurate segmentation for both muscles when it was associated with a weighted vote procedure and no upsampling. The corresponding RO values increased from 80.2 to 87.4 and from 66.1 to 76.8 for the gastrocnemius and plantaris segmentation with respect to the number of atlases used (from 1 to 14). The RO values reported for the gastrocnemius segmentation were in agreement with the results reported by Andrews et al. [21] in much larger muscle groups (i.e., knee extensor and flexor muscles) in humans. On the basis of a principal component analysis, they reported good reliability of the automatic segmentation illustrated by an average DSC value of 0.92 ± 0.03 (i.e., RO = 0.85) [21]. However, it has to be kept in mind that the level of accuracy expected from the automatic method has to be considered with respect to the reproducibility of the standard method. Unfortunately, these authors did not report the variability of their manual delineation process [21]. In the present study, although the accuracy of the automatic segmentation was largely lower for the plantaris than for the gastrocnemius muscle, the RO values achieved for both muscles outperformed the interoperator RO value using a minimum of three atlases. Similarly, the DSC values from the automatic segmentation reported by Gubern-Merida et al. [36] for the pectoral muscle also outperformed the DSC value calculated from the computed interobserver variability (DSC = 0.70 ± 12), thereby highlighting the reliability of the multi-atlas approach for the individual muscle segmentation. It is worth pointing out that in spite of an increased RO value with respect to the number of atlases included in the database, the addition of further atlases beyond six resulted in a minor improvement of the RO values in both muscles, thereby suggesting that the manual work can be minimized to the delineation of six subjects for a high-quality level according to our data (i.e., RO = 86.2 and 74.8 for the gastrocnemius and plantaris segmentation, respectively).
From a physiological point of view, the accuracy of a segmentation method can also be evaluated on the basis of the error related to the muscle volume measurement. Here, the global volume error was 4.2 and 8.0 % and slightly decreased to 3.5 and 7.4 % using 3 or 6 original atlases for the gastrocnemius and plantaris muscles, respectively. Similarly to what we observed for the RO values, the benefit associated with the addition of more than six atlases was negligible for the volume quantification of both muscles. More specifically, using 14 original atlases the global volume error was 3.5 and 7.0 % for the gastrocnemius and plantaris muscles, respectively. The present errors were consistent with those previously reported from the commonly used alternative methods based on a reduced number of slices manually delineated [7, 18, 37, 38]. In these models, the truncated cone formula has been particularly used to describe the limb shape between two consecutive, manually delineated slices (as the limb describes a global conic shape). However, it has been shown that the related bias significantly increased with the gap between slices. Tracy et al. [38] showed in humans that a minimum of 5–6 slices (depending on the thigh length) evenly distributed along the quadriceps muscle was necessary to estimate the corresponding volume within a 4 % error range independent of age or sex. Bias reduction to a value of <2 % [38] or to a point lower than the variability of the reference measure [18] required the manual delineation of 12 slices. This was reduced to seven slices using the Cavalieri method (assuming a cylindrical shape between two consecutives slices) for this specific muscle group. Although considerably reduced compared to the reference method (i.e., manual delineation from ~100 contiguous slices), the corresponding manual work remains tedious, especially for large cohorts. In this respect, Morse et al. [37] reduced the manual work to a single slice segmentation by developing a regression model describing the evolution of the cross-sectional area specific to each muscle composing the quadriceps as a function of muscle length in humans. Depending on the location of the segmented slice, the global error ranged from 10 to 27 %. Although these errors have been considered acceptable for the purpose of individual muscle size quantification, their use in a different context is highly questionable. The investigation of any new population would require a new model validation and would lose the benefit of any manual work previously performed. The present automatic method provides a high level of accuracy despite the large morphological heterogeneity of the population, thereby strongly suggesting the possibility of using a given database of atlases for the automatic segmentation of individual muscles from new populations. However, it should be noted that the present method, similarly to the previous automatic work [21, 36], does not identify intramuscular fat so that any comparison of populations with a different relative proportion of intramuscular fat such as in aging [11, 39] and in various pathological conditions [12] would require an additional thresholding process [14].
Conclusion
We reported in the present study an original multi-atlas-based automatic segmentation process dedicated to the skeletal muscle offering high accuracy and reliability. We demonstrated that automatic quantification of individual muscle volume in rat leg can be performed in a population characterized by a wide range of muscle size, thereby opening up promising opportunities in humans.
Abbreviations
- 3D:
-
Standard 3D nonlinear registration process
- SBS:
-
Slice-by-slice nonlinear registration process
- 2Dc:
-
2D constrained nonlinear registration process
- R-:
-
Rescaled along Z
- RO:
-
Relative overlap index
- CV:
-
Coefficient of variation
- ICC:
-
Intraclass coefficient
- mv:
-
Majority vote procedure
- wv:
-
Weighted vote procedure
- z0:
-
No slice translation
- z1:
-
1 slice translation
- z2:
-
2 slice translation
- DSC:
-
Dice similarity coefficient
References
Park SW, Goodpaster BH, Strotmeyer ES, de Rekeneire N, Harris TB, Schwartz AV, Tylavsky FA, Newman AB (2006) Decreased muscle strength and quality in older adults with type 2 diabetes: the health, aging, and body composition study. Diabetes 55:1813–1818
Barbat-Artigas S, Rolland Y, Zamboni M, Aubertin-Leheudre M (2012) How to assess functional status: a new muscle quality index. J Nutr Health Aging 16:67–77
Metter EJ, Talbot LA, Schrager M, Conwit R (2002) Skeletal muscle strength as a predictor of all-cause mortality in healthy men. J Gerontol A Biol Sci Med Sci 57:B359–B365
Tonson A, Ratel S, Le Fur Y, Cozzone P, Bendahan D (2008) Effect of maturation on the relationship between muscle size and force production. Med Sci Sports Exerc 40:918–925
Morse CI, Tolfrey K, Thom JM, Vassilopoulos V, Maganaris CN, Narici MV (2008) Gastrocnemius muscle specific force in boys and men. J Appl Physiol 104:469–474
Jubrias SA, Odderson IR, Esselman PC, Conley KE (1997) Decline in isokinetic force with age: muscle cross-sectional area and specific force. Pflüg Arch Eur J Physiol 434:246–253
Hasson CJ, Kent-Braun JA, Caldwell GE (2011) Contractile and non-contractile tissue volume and distribution in ankle muscles of young and older adults. J Biomech 44:2299–2306
Hug F, Marqueste T, Le Fur Y, Cozzone PJ, Grélot L, Bendahan D (2006) Selective training-induced thigh muscles hypertrophy in professional road cyclists. Eur J Appl Physiol 97:591–597
Kalyani RR, Corriere M, Ferrucci L (2014) Age-related and disease-related muscle loss: the effect of diabetes, obesity, and other diseases. Lancet Diabetes Endocrinol 2:819–829
Brown M, Hasser EM (1996) Complexity of age-related change in skeletal muscle. J Gerontol A Biol Sci Med Sci 51:B117–B123
Frontera WR, Reid KF, Phillips EM, Krivickas LS, Hughes VA, Roubenoff R, Fielding RA (2008) Muscle fiber size and function in elderly humans: a longitudinal study. J Appl Physiol 105:637–642
Kesper K, Kornblum C, Reimann J, Lutterbey G, Schröder R, Wattjes MP (2009) Pattern of skeletal muscle involvement in primary dysferlinopathies: a whole-body 3.0-T magnetic resonance imaging study. Acta Neurol Scand 120:111–118
Akagi R, Takai Y, Ohta M, Kanehisa H, Kawakami Y, Fukunaga T (2009) Muscle volume compared to cross-sectional area is more appropriate for evaluating muscle strength in young and elderly individuals. Age Ageing 38:564–569
Mattei JP, Fur YL, Cuge N, Guis S, Cozzone PJ, Bendahan D (2006) Segmentation of fascias, fat and muscle from magnetic resonance images in humans: the DISPIMAG software. Magn Reson Mater Phy 19:275–279
Positano V, Christiansen T, Santarelli MF, Ringgaard S, Landini L, Gastaldelli A (2009) Accurate segmentation of subcutaneous and intermuscular adipose tissue from MR images of the thigh. J Magn Reson Imaging 29:677–684
Klein CS, Rice CL, Marsh GD (2001) Normalized force, activation, and coactivation in the arm muscles of young and old men. J Appl Physiol 91:1341–1349
Smeulders MJC, van den Berg S, Oudeman J, Nederveen AJ, Kreulen M, Maas M (2010) Reliability of in vivo determination of forearm muscle volume using 3.0 T magnetic resonance imaging. J Magn Reson Imaging 31:1252–1255
Nordez A, Jolivet E, Südhoff I, Bonneau D, de Guise JA, Skalli W (2009) Comparison of methods to assess quadriceps muscle volume using magnetic resonance imaging. J Magn Reson Imaging 30:1116–1123
Gilles B, Magnenat-Thalmann N (2010) Musculoskeletal MRI segmentation using multi-resolution simplex meshes with medial representations. Med Image Anal 14:291–302
Wang C, Teboul O, Michel F, Essafi S, Paragios N (2010) 3D knowledge-based segmentation using pose-invariant higher-order graphs. Med Image Comput Comput-Assist Interv 13:189–196
Andrews S, Hamarneh G, Yazdanpanah A, HajGhanbari B, Reid WD (2011) Probabilistic multi-shape segmentation of knee extensor and flexor muscles. Med Image Comput Comput-Assist Interv 14:651–658
Sdika M (2010) Combining atlas based segmentation and intensity classification with nearest neighbor transform and accuracy weighted vote. Med Image Anal 14:219–226
Rohlfing T, Brandt R, Menzel R, Maurer CR (2004) Evaluation of atlas selection strategies for atlas-based image segmentation with application to confocal microscopy images of bee brains. NeuroImage 21:1428–1442
Isgum I, Staring M, Rutten A, Prokop M, Viergever MA, van Ginneken B (2009) Multi-atlas-based segmentation with local decision fusion–application to cardiac and aortic segmentation in CT scans. IEEE Trans Med Imaging 28:1000–1010
Klein S, van der Heide UA, Lips IM, van Vulpen M, Staring M, Pluim JPW (2008) Automatic segmentation of the prostate in 3D MR images by atlas matching using localized mutual information. Med Phys 35:1407–1417
Karlsson A, Rosander J, Romu T, Tallberg J, Grönqvist A, Borga M, Dahlqvist Leinhard O (2015) Automatic and quantitative assessment of regional muscle volume by multi-atlas segmentation using whole-body water-fat MRI. J Magn Reson Imaging 41:1558–1569
Thomas MS, Newman D, Leinhard OD, Kasmai B, Greenwood R, Malcolm PN, Karlsson A, Rosander J, Borga M, Toms AP (2014) Test-retest reliability of automated whole body and compartmental muscle volume measurements on a wide bore 3T MR system. Eur Radiol 24:2279–2291
Giannesini B, Izquierdo M, Le Fur Y, Cozzone PJ, Fingerle J, Himber J, Künnecke B, Von Kienlin M, Bendahan D (2005) New experimental setup for studying strictly noninvasively skeletal muscle function in rat using 1H-magnetic resonance (MR) imaging and 31P-MR spectroscopy. Magn Reson Med 54:1058–1064
Sled JG, Zijdenbos AP, Evans AC (1998) A nonparametric method for automatic correction of intensity nonuniformity in MRI data. IEEE Trans Med Imaging 17:87–97
Zhang Y, Brady M, Smith S (2001) Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm. IEEE Trans Med Imaging 20:45–57
Jenkinson M, Smith S (2001) A global optimisation method for robust affine registration of brain images. Med Image Anal 5:143–156
Sdika M (2008) A fast nonrigid image registration with constraints on the Jacobian using large scale constrained optimization. IEEE Trans Med Imaging 27:271–281
Artaechevarria X, Munoz-Barrutia A, Ortiz-de-Solorzano C (2009) Combination strategies in multi-atlas image segmentation: application to brain MR data. IEEE Trans Med Imaging 28:1266–1277
Elliott MA, Walter GA, Gulish H, Sadi AS, Lawson DD, Jaffe W, Insko EK, Leigh JS, Vandenborne K (1997) Volumetric measurement of human calf muscle from magnetic resonance imaging. Magn Reson Mater Phy 5:93–98
Cabezas M, Oliver A, Lladó X, Freixenet J, Cuadra MB (2011) A review of atlas-based segmentation for magnetic resonance brain images. Comput Methods Programs Biomed 104:e158–e177
Gubern-Mérida A, Kallenberg M, Martí R, Karssemeijer N (2012) Segmentation of the pectoral muscle in breast MRI using atlas-based approaches. Med Image Comput Comput-Assist Interv 15:371–378
Morse CI, Degens H, Jones DA (2007) The validity of estimating quadriceps volume from single MRI cross-sections in young men. Eur J Appl Physiol 100:267–274
Tracy BL, Ivey FM, Jeffrey Metter E, Fleg JL, Siegel EL, Hurley BF (2003) A more efficient magnetic resonance imaging-based strategy for measuring quadriceps muscle volume. Med Sci Sports Exerc 35:425–433
Kent-Braun JA, Ng AV, Young K (2000) Skeletal muscle contractile and noncontractile components in young and older women and men. J Appl Physiol 88:662–668
Author’s contribution
Michael Sdika: project development, data management and data analysis; Anne Tonson: project development, data collection and data analysis; Yann Le Fur: data collection; Patrick J. Cozzone: data analysis; David Bendahan: data analysis.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical standards
All procedures involving animals were performed according to the guidelines of the National Research Council Guide for the care and use of laboratory animals and the French law on the Protection of Animals.
Additional information
Michael Sdika and Anne Tonson have contributed equally to this work.
Rights and permissions
About this article
Cite this article
Sdika, M., Tonson, A., Le Fur, Y. et al. Multi-atlas-based fully automatic segmentation of individual muscles in rat leg. Magn Reson Mater Phy 29, 223–235 (2016). https://doi.org/10.1007/s10334-015-0511-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10334-015-0511-6