Introduction

In forensic anthropology, living stature is a crucial component in the process of human identification, and various bones and body parts have been examined for their utility in stature estimation [1,2,3,4,5,6,7,8,9,10,11,12,13]. The principal purpose of estimating living stature is to narrow the potential list of presumptive identifications, especially in natural or man-made disasters, wars, and crimes [14,15,16]. Equations for estimating stature are usually based on univariate or multivariate regression models in which one or several bone dimensions have been used to build equations with the highest accuracy. In this context, researchers have analysed general and specific equations in different populations [17,18,19]. Generic equations refer to a single formula that can be applied to all individuals, regardless of differences such as sex, age, and ancestry. In contrast, group-specific equations do not treat multiple subgroups as a single entity but are based instead on the use of different equations for subgroups of the population. The accuracy rates of generic formulae versus specific equations have resulted in contradictory findings. In some studies, generic equations obtained higher accuracy than specific ones [20, 21], whereas others yielded higher estimation accuracy using specific equations [22,23,24,25,26,27].

One of the suggested ways to estimate living stature with minimal error is to use regression equations specific to stature groups, using different equations for short, medium, and tall people. This method was first introduced using tibial length in a modern Turkish sample and was shown to be more accurate than nonspecific equations, especially for tall and short individuals [22, 23]. In a later study, Duyar et al. [24] analysed the tibia and ulna, both individually and in combination, and demonstrated that stature-specific equations lead to more accurate stature estimations. This was further assessed by Sargın et al. [25] who tested this procedure on a relatively small group of male and female cadavers and reported that it provides less erroneous estimates than generic equations. Recently, Saco-Ledo et al. [28] tested the stature-group regression equations on modern Spaniards and found lower estimation errors than for nonspecific equations. However, because all of the aforementioned studies, except one, were performed on males, the accuracy of this new stature estimation procedure in females is unknown. Considering this limitation, the aim of the present study is to develop stature-specific equations for estimating living stature based on tibial length in a large female sample.

Materials and methods

Study design

This cross-sectional study was conducted on a sample of Spanish females. The participants were randomised into two groups: a calibration group and a validation group. Three stature sub-groups (Group ST1, stature ≤ 158.9 cm; Group ST2, stature between 159.0 and 171.1 cm; Group ST3, stature ≥ 171.2 cm) were also designated using the 15th and 85th percentiles as cut-off points in both groups [22, 28]. In the Gaussian distribution, the area below the curve between the ± 1 standard deviation (SD) mark constitutes approximately 68.27% of the total area, which we accepted as 70%. We have placed the body height values between − 1 SD and + 1 SD into the “medium”, the values below − 1 SD (i.e. 15th percentile) into the “short”, and those above + 1 SD (i.e. 85th percentile) into the “tall” stature categories. The use of the 15th and 85th percentiles as cut-off points is common in studies focusing on aspects of public health (e.g. National Health and Nutrition Examination Survey, NHANES, in the USA), particularly those focusing on body weight [29,30,31,32].

The study consists of two phases. In the first phase, the stature-group-specific equations for three sub-groups were developed using data from the calibration group. In the second phase, the tibial length values of individuals in the validation group were tested using regression equations derived from the calibration group.

The project followed the recommendations of the Declaration of Helsinki on Human Rights [33] and was approved by the Clinical Research Ethics Committee of the Sports Administration of Catalonia (12/2015/CEICEGC).

Participants

The sample consisted of 351 Spanish adult female living participants who were randomized into two groups (the calibration group with 185 participants and the validation group with 166 participants) using the Statistical Package for Social Sciences (SPSS). The calibration group was categorized according to stature into three sub-groups, short, stature ≤ 158.9 cm (n = 27); medium, stature between 159.0 and 171.1 cm (n = 130); and tall, stature ≥ 171.2 cm (n = 28). The validation group was also divided into three groups according to the same stature categories as the calibration group in order to test the equations and included n = 18, n = 127, and n = 21 participants in the short, medium, and tall stature groups, respectively. The inclusion criteria for study participants included being between 18 and 55 years of age and being of Spanish nationality.

Anthropometric measurements

Two authors of this paper (GSL, JP), who are anthropometrists certified by the International Society for the Advancement of Kinanthropometry (ISAK), performed the anthropometric measurements. Stature and tibial length were measured following the protocol of a previous study on Spanish males [28]. The tibial length of the right side was measured as the distance between tibiale mediale and sphyrion tibiale, and stature was measured as the distance between the vertex and the bottoms of the feet. The technical error of measurement (TEM) was acceptable (TEM ≤ 1.0%, intra-observer; TEM ≤ 1.5%, inter-observer) based on 20 individuals. Stature and tibial length were measured using a Harpenden Anthropometer (Holtain Model 601) and a Harpenden Stadiometer (Holtain Model 603) in the laboratory. Other anthropometric instruments, such as an anthropometric bench, a square with an attached level, and a segmometer (UWA), were used in the evaluations in the fieldwork.

Statistical analysis

Descriptive statistics were calculated to permit comparisons between stature groups. Normality of the data and the homogeneity of variance between groups were tested using the Kolmogorov–Smirnov test or the Shapiro–Wilk test and Levene’s test. Group comparisons were performed with parametric or nonparametric tests. For group comparisons, we employed the t test for independent samples and the Mann-Whitney U (Table 1), and Kruskal-Wallis and Scheffe (Table 2) tests for nonparametric and parametric comparisons. Using known stature and tibial length, we verified the association between these two variables and developed multiple equations for stature estimation. We analysed correlations, regression equations, coefficients of determination (R2 and AdjR2) and standard errors of estimation (SEE). Normality and homoscedasticity of the residual were also verified. All statistical analyses were performed with SPSS software (IBM Corp. released 2011, IBM SPSS Statistics for Windows, Version 20.0, Armonk, NY). The figures were produced using the SigmaPlot software (Version 12.0, Systat Software Inc., San Jose, CA, USA). The level of significance was established at p < 0.05.

Table 1 Descriptive statistics of the study group and the validation group
Table 2 Descriptive statistics of the study group based on stature groups

Results

Anthropometric variables

The general characteristics of the study sample are shown in Tables 1 and 2. We did not find significant differences (p > 0.05) between the calibration and the validation groups (Table 1). Statistical analysis supported the assumption of normality of the anthropometric variables (p > 0.05) in both groups. The variance within groups was homogeneous for age, stature, and relative tibial length, but not for absolute tibial length.

The scatterplots (Figs. 1 and 2) show significant high correlations (p < 0.001) between stature and tibial length in both the calibration and validation groups—the dotted lines indicate the 95% prediction intervals for the calibration group (Fig. 1) and the validation group (Fig. 2). These figures clearly demonstrate the close relationship between tibial length and stature, and by extension, the value of tibial length for estimating stature.

Fig. 1
figure 1

Scatterplot with 95% prediction intervals (dotted lines) based on tibial length and stature in the calibration group

Fig. 2
figure 2

Scatterplot with 95% prediction intervals (dotted lines) based on tibial length and stature in the validation group

Table 2 reports significant differences (p ≤ 0.001) between the stature sub-groups for almost all of the anthropometric variables, except between the medium and tall stature groups for relative tibial length. The anthropometric variables showed a normal distribution for tibial length and relative tibial length (p ≥ 0.05) but not for stature groups or ages (p < 0.05). The variance was homogeneous across groups for the different variables (p ≥ 0.05). Tibial length and relative tibial length have higher values in the tall and medium stature groups than in the short stature group. Therefore, the results indicate that a piecewise regression technique could be an innovative approach for estimating stature, since relative tibial length increases as stature increases in Spanish adult females.

Regression equations

The ordinary least square (OLS) regression formulae were generated for each stature group (short, medium, and tall) using the measurements of the calibration group (Table 3). Both SEE and R2 values for the equations of all stature groups were lower than those of the general formula. Although the low R2 values of the equations for the stature-specific groups are considered a negative result, the low SEE values indicate that these equations provide more accurate estimations of stature. The main factor underlying the lower R2 values in the group-specific regression equations is the smaller number of participants in those groups.

Table 3 Regression equations by stature groups in the calibration group

Differences between stature estimates using the equations of the calibration group in the validation group

Table 4 shows the difference between statures estimated by the general equation and the group-specific regression equations using data from the validation group. Overall, the equations specific to stature groups yield more accurate stature estimates. The most significant difference between the two calculation methods (i.e. general or group-specific equation) was observed in the short and tall stature groups. The equations for short and tall groups produce underestimates (mean 0.34 cm), while the equation for the medium group yields overestimates (mean 0.18 cm) when applied to the short and tall stature groups. However, in the medium-stature group, little difference was found between the two estimation procedures.

Table 4 Differences between measured and estimated stature in the validation group

Discussion

The findings of this study clearly demonstrate that, as stature increases, the ratio of tibial length to stature does not remain constant but instead increases. While the average tibial length-to-stature ratio is 21.9 in the short sub-group, it rises to 22.6 in the medium group, and 22.9 in the tall group, an increase that is statistically significant between the short and tall stature groups. Relative tibial length has also been shown in previous studies to be larger in taller males [23, 28]. Therefore, the results presented here demonstrate that relative tibial length increases steadily, in a positively allometric fashion, as stature increases in both females and males. The proportional allometric increase is approximately equivalent in both sexes. These findings suggest that separate regression equations should be created for short, medium, and tall people when calculating stature from tibial length.

Group-specific (sex, age, population, or ancestry) formulae are often used in anthropological studies and forensics to estimate stature. In recent years, however, some studies have claimed that generic formulae yield less erroneous estimates than group-specific ones [20, 21]. These studies have recommended using general equations for estimating stature, because these equations can estimate the stature regardless of age, sex, ancestry, or population. However, the SEE values of these generalized equations are remarkably higher than we found in our study, suggesting that calculations specific to stature groups yield more accurate estimates than generic equations. The findings of other studies also support these results [22, 24,25,26,27,28]. Furthermore, while another study in Spanish individuals reported low SEE for stature group regression equations based on multiple long bone lengths [8], our stature-group-specific equations have even lower SEE. Thus, the simple and multiple regression equations of previous work [8] still show a higher SEE than the simple regression stature-group-specific equations of this study, emphasizing the utility of our equations and the use of tibial length to estimate stature. In this context, the specific equations based on stature groups obtained higher accuracy than has been reported in other studies that developed equations for estimating stature in Spanish females [5, 8, 34].

While stature estimation equations specific to stature groups have previously been developed from a sample of Spanish males [28], the present study is one of the first to investigate stature estimation in females using this method. A previous study conducted by Sargın et al. [25] on the cadavers of 62 females and 78 males from Antalya, Turkey, estimated female stature based on stature-group-specific regression equations. The equations specific to the stature groups yielded more accurate cadaver height estimations, but these findings had not previously been tested in a larger sample, or in living populations. The conclusions of the present study on Spanish females for estimating living stature support the results of the previous investigation on cadavers [25].

As this is only the second study investigating stature estimation in females with stature-group-specific regression equations, when formulating this study, we wondered about the differences in SEE for equations based on tibial length between Spanish females and males. In males [28], the stature-group-specific equations were found to produce accurate estimates (SEE = 2.12–2.66 cm), similar to the analogous equations in Spanish females (SEE = 2.35–2.66 cm). However, we found some differences when the equations were applied to the validation group. The results reveal differences primarily in the medium stature groups—the equations of all Spanish females and the medium stature-group obtained similar accuracy when we used the data of the validation group; but the equation with all Spanish male participants obtained much lower accuracy than the specific equation for medium stature when the data of the validation group were used.

Since the tibia is one of the long bones that is most highly correlated with stature, it has been used frequently in studies aimed at reconstructing living stature in forensic, archaeological, and paleontological contexts. Thus, the tibia has been the subject of numerous studies across various populations from different geographical regions [1,2,3,4, 7, 9, 13, 19, 35,36,37,38,39,40,41,42,43,44,45,46]. Some of these studies were based on percutaneous measurements of the tibia of living people, while some were based on radiographic images and others on the measurements of cadavers or dry bones [2,3,4]. We collated the SEE values calculated from females of these studies, listed from lowest to highest (Table 5). Studies which were conducted on dry bones, and stature estimations based on anatomical reconstructive techniques, are not included. In almost all cases, the SEE values reported are considerably higher than those we report here. This indicates that we have reached a relatively low SEE value compared to the range of these previous studies. Furthermore, for short (2.66 cm), medium (2.57 cm), and tall (2.35 cm) individuals, we found even lower values than all previously reported SEE values. These findings clearly demonstrate that formulae specific to stature groups return more accurate estimates than generic equations.

Table 5 Standard errors of estimate (SEE) in estimating stature based on different tibial length measurement techniques in females from different geographical regions

In forensic anthropological contexts, recovered remains may include the entire skeleton, or may be limited to isolated bones or dismembered body parts. Cases where only isolated bones or dismembered body parts are present pose a potential hurdle for determining which stature-group-specific equation to use. However, it is possible to overcome this issue. It is well known that there is a strong correlation between the main long bones of the limbs and the body height [1,2,3,4, 22, 28]. This is further shown in the present study, where the correlation between stature and tibia length was found to be high for the whole sample (r = 0.785, p < 0.001). This strong correlation supports the use of the length of the tibia as an indicator of stature, which permits placement of the individual into a stature group. Thus, mirroring our stature groupings, we categorized tibiae as short (length < 34.5 cm), medium (length between 34.6 and 39.5 cm), and long (length > 39.6 cm), using the 15th and 85th percentiles as cut-off points. Based on these values, the forensic practitioner is able to estimate stature using the most appropriate stature-specific formula even in cases where they are limited to working with an isolated tibia.

We have considered several limitations of the present study, including the fact that our short and tall stature groups have few participants, and we do not yet have enough data available for large-scale comparisons, especially for females. On the other hand, it is important to emphasize the considerable strengths of this study, in that we have produced stature estimation equations for different stature groups based on tibial length and have therefore made available highly accurate stature-specific equations for use in forensic anthropology.

Conclusions

The regression equations specific to stature groups led to more accurate estimates than generic formulae. These findings suggest that this calculation procedure yields similar standard error of estimation in Spanish males and females, and that generating regression equations specific to stature groups appears to be an effective technique for estimating living stature. Thus, this procedure, piecewise regression, could be used in osteological, archaeological, and forensic cases, where isolated bones or body part(s) can be evaluated in order to decide which stature-specific equation is best applied. In future research, we recommend the analysis of stature-group-specific equations for estimating stature in other human populations.