Keywords

1 Introduction

Most scientific research is about people’s intuitive physical data, indicating height and weight. However, research on details, such as the study of the hands and feet, is more conducive in other disciplines. For example, reference [1] mentions the improvement of suggestions and opinions on the production of Chinese shoes by analyzing the data of the collected feet. The measurement of the hand has different measurement methods. Reference [2] gives two methods in detail. Overall, there are little literatures discussing the hand characteristics. [2] also gave some comparison of hand anthropometry of females in three ethnic groups: Western Europeans, Indian and West Indian. Additionally, appreciation of the gender differences in hand shape is essential to the proper design of both men’s and women’s hand tools, gloves. Traditionally, women’s gloves are often made using a small version of a men’s glove with all dimensions proportionally scaled according to hand length. However, if women’s hands differ in shape from men’s hands, this is an inappropriate model and could lead to improper gloves fit in women. Courtney [3] carried out an anthropometric study of hand dimensions of Hong Kong Chinese female workers compared to other ethnic groups. Twenty-three hand dimensions were measured and compared with data from the United Kingdom, Japan and the United States of America. Other research referred to hand anthropometric study, please see literatures [4,5,6].

The data information we analyzed is from ISO/TR 7250-2:2010 “Basic human body measurements for technological design—Part 2: Statistical summaries of body measurements from national populations”. This technical report is intended to serve as a continually updated repository of the most current national anthropometric data. It also provides statistical summaries of body measurements together with database background information for working age people in the national population.

2 Method

At first, we conduct the direct comparison among these countries based on analysis of variance (ANOVA). It is because we want to know whether hand length are significantly different among these countries as well as between male and female in these countries. We take hand length as response variable, gender and country as two factors. Analysis of variance is used to analyze the effects of factors, which may have impact on the result of an experiment.

Secondly, we performed factor analysis on the five variables that affect the hand length to extract more intuitive data features. The five variables are the vertical length of the palm, the metacarpal, the length of the index finger, the width of the proximal end of the index finger, and the width of the distal end. We want to know if there are any internal dependencies between these five variables.

Thirdly, we construct a multiple regression analysis of the above five variable opponents, and study how these five variables affect the human hand length, because the stepwise regression method is adopted, resulting the fifth independent variable removed from the model.

At last, we also want to know how much influence the gender has, so we set gender as a dummy variable. The study found that men’s hands are about 2% taller than female hands.

3 Results

3.1 Results of ANOVA for Hand Length

The general linear model procedure is used to do analysis with the help of SPSS software. The general linear model procedure has the following advantages: (1) it could be used in unbalanced designs; (2) it can evaluate differences between individual level means; (3) it allows covariates, random factors and nested factors.

As we know, ANOVA based on three assumptions: (1) the dependent variables of each population are normally distributed, (2) the variance of each population is equal, and (3) the samples of each population are randomly sampled independently. Then, first we perform homogeneity tests through Levene’s method. The results are in Table 1. From the p-value we can conclude that the data has equal variance.

Table 1. Levene’s test of equality of error variances

In Table 2, DF stands for degree of freedom. Without loss of generality we take nominal significance level as 0.05. That means if P-value of one factor is less than 0.05, we think it is significant; otherwise it is not significant.

Table 2. Tests of between-subjects effects

Table 2 provides the results of the analysis of variance for the model. The first line (Corrected Model) is the overall test of all the effects of the model. The significance of two factors, country and gender, is 0.399 and 0.001, respectively. Gender is significant, but country is not significant. However, the test result of the interaction effect between the two factors (country * gender) is extremely insignificant. The table also provides that determination coefficient (R Squared) of the model is equal to 0.367, indicating that the main effect and interaction effect of the two factors in the model together account for 36.7% of the total variation of the dependent variable. However, although the sample analysis shows that the effects of the model have some explanatory ability for the dependent variable, the statistical test of the effects shows that only one effect is beyond the random fluctuation range but that cannot be inferred to the whole.

Figure 1 is the interaction plot for hand length. From Fig. 1, we can obtain the following information: (1) Overall, the hand length of men is larger than hand length of women in all ten countries. (2) The ten countries fall into about three groups: The Netherlands and United States had significantly larger hand length than the other eight countries; The India had the smallest hand length. (3) On average, the hand length of male in the Netherlands and the United States are largest and the hand length of female in United States and Kenya are the largest. The hand length of people in Germany and Italy is similar.

Fig. 1.
figure 1

Interaction plot for hand length

3.2 Results of Factor Analysis

As the original data has some missing values, especially the United States, we do some basic data pretreatment under the help of R software. At first, in order to make sure the accuracy of the data, we delete the information about USA’s hand data, then, we make use of R software to multiple interpolate missing values. Furthermore, we try to find some common factors which effect the length of hand in nine countries (except the United States) no matter women or men. Aiming at above purposes, we take the method of Factor Analysis followed.

The purpose of factor analysis is to simplify the data or find out the basic data structure. Therefore, the premise of factor analysis is that there should be a strong correlation between observed variables. If the correlation between variables is small, they cannot share a common factor. Therefore, we produced the correlation coefficient matrix of the five variables in the table and conducted Bartlett’s test of sphericity and KMO measures.

The correlation analysis of five variables (Table 3) shows that there is a strong correlation between Index finger length and Index finger breadth, distal, which reaches about 0.957. The other three variables have a mutually strong correlation as well. All of these correlation values reach above 0.65. Looking at Table 4, the KMO measured value reaches above 0.8, and the significance level of Bartlett spherical test is extremely high, which indicates that factor analysis can be conducted on this data.

Table 3. Correlation matrix
Table 4. KMO and bartlett’s test

According to the eigenvalues criterion, only one factor can be selected for the hand data including nine countries (except USA). In order to illustrate explicitly, we have selected two factors. From Table 5, it shows the concrete information. The first column “Component” indicates the different kinds of factors, the following three columns are the initial eigenvalues, “Total” column refers to the eigenvalues, “% of Variance” is the percentage of Variance, and “Cumulative %” is the cumulative percentage of variance. For instance, the first two factors explain the data’s variance 83.902% and 8.425% respectively. While these two factors explain the vast majority of the total data variance accumulatively, reaching more than 90%. The middle three columns show the two factors selected, as well as their explanatory variance ratio and cumulative ratio. The last three columns are the variance explanations of each selected factor after factor rotation.

Table 5. Total variance explain

Communalities is also one of the important judgment principles for determining the number of factors. Table 6 provides information in this respect. In the table, “Initial” is the Initial value, while “Extraction” is the communalities of each variable. The results show that the communalities of the five variables is very high, and the information of each variable contain more than 84%.

Table 6. Communalities

According to the order in which factors are extracted, we portray the screen plot about the change of factor eigenvalues (vertical axis) with the number of factors (horizontal axis). Actually, the number of factors is also judged according to the shape of the figure. It can be seen from the scree plot: from the first factor, the curve goes down rapidly, then becomes gentle, and finally becomes an approximately straight line. The factor load after rotation with the “varimax” method shows in Fig. 2 as well. We can conclude that “Index finger breadth, proximal” (x4) is close to the second factor, and “Index finger length” (x3) and “Index finger breadth, distal” (x5) are closer to the first factor.

Fig. 2.
figure 2

Screen plot and component plot

3.3 Results of Multiple Regressiom

In order to study how the five variables in the data affect the hand length including nine countries, we build a linear regression model with the stepwise method. Since more than one variable is included in the independent variable, we take multiple linear regression model. The model structure is as follows:

$$ y = b_{0} + b_{1} x_{1} + b_{2} x_{2} + b_{3} x_{3} + b_{4} x_{4} + b_{5} x_{5} + \varepsilon $$
(1)

Among them, y indicates hand length, \( x_{1} \) = Palm length perpendicular, \( x_{2} \) = metacarpals, \( x_{3} \) = Index finger length, \( x_{4} \) = Index finger breadth, proximal, \( x_{5} \) = Index finger breadth, distal. \( b_{1} \), \( b_{2} \ldots \ldots b_{5} \) are model parameters, \( b_{0} \) is intercept value.

Table 7 shows the interpreting proportion of each model by means of stepwise regression. The order of taking independent variables into the model is Index finger length (x3), Palm length perpendicular (x1), metacarpals (x2), Index finger breadth, proximal (x4). The last model has accounted for hand length more than 95%, but the Durbin-Watson test value of the model is 1.192, which cannot support the premise of error independence, so the regression model is only for reference.

Table 7. Model summary

Table 8 shows the coefficients of each model after stepwise regression. In the first step, there is only an independent variable, and the regression coefficient is a large positive value (2.039), reflecting that the larger “Index finger length” (\( x_{3} \)) is, the larger hand length is. The second step regression model increase the independent variable “Palm length perpendicular” (\( x_{1} \)), whose standardized coefficient is 0.448, that is similar to “Index finger length” (\( x_{3} \)). The third step takes “metacarpals” (\( x_{2} \)) into regression model, the fourth step increases the “Index finger breadth, proximal” (\( x_{4} \)).

Table 8. Coefficients
$$ y = 0.384x_{1} + 0.303x_{2} + 0.43x_{3} - 0.103x_{4} $$
(2)

The above formula is the standardized performance of stepwise regression in the last model. From the standardized coefficients respectively, “Index finger length” \( (x_{3} ) \) has the greatest influence on the hand length, each unit it increases, the hand length increases by 0.43 unit. In addition, both “Palm length perpendicular” (\( x_{1} \)) and “metacarpals” (\( x_{2} \)) have positive effects on hand length. From their standardized coefficients (0.384 and 0.303, respectively), the effects of the two variables are similar. Finally, “Index finger breadth, proximal” (\( x_{4} \)) has a negative effect on hand length, whose coefficient is -0.103.

Then, we draw four-in-one residual plot (Fig. 3), we explain the figures in the following:

Fig. 3.
figure 3

Residual plots for hand length

Normal Probability Plot – Because the points on the normal probability plot roughly follow a straight line, we can assume that the residuals do not deviate substantially from a normal distribution.

Histogram – Use the normal probability plot to make decisions about the normality of the residuals. With a reasonably large sample size, the histogram displays compatible information.

Versus Fits – The constant variance assumption does not appear to be violated, because the residuals are randomly scattered about zero.

Versus Order – The plot of the residuals versus order does not show any pattern. Therefore, there is no time dependence in the residuals.

3.4 Results of Adding Dummy Variable

Next, considering the impact of gender on hand length in the data of these nine countries, we take gender as a dummy variable in the regression equation, and the other independent variables do not change. Building a multivariate regression equation with dummy variables, Table 9 provides a global test of the regression equation, showing that the F value of the regression equation is 199.989, which is statistically significant at the 0.000 level, indicating that there is at least one independent variable which owns a very significant linear relationship with dependent variable in the equation.

Table 9. ANOVA

Table 10 provides the regression coefficients for the respective variables after adding dummy variables, including unstandardized coefficients and standardized coefficients. Among them, the last two columns of the table also provide collinearity test items for multiple regression. When the independent variables are completely linearly related, the estimation of the regression model no longer has a unique solution, so the least squares method is no longer applicable, neither. It can be seen from the test that the tolerance of \( x_{3} \) and \( x_{5} \) is less than 0.1, which means their variance inflation factors are all greater than 10 but less than 25, indicating that there is a certain degree of correlation between independent variables, and it is necessary to carefully apply multiple regression equations.

Table 10. Coefficients

We give the following regression equation for the dummy variable according to the column labeled the standardized coefficients in Table 9:

$$ y = - 0.022gender + 0.377x_{1} + 0.289x_{2} + 0.451x_{3} - 0.1x_{4} - 0.014x_{5} $$
(3)

From the degree of significance of the coefficient, the t-statistic value of gender is -0.624, whose p-value is equal to 0.536 greater than 0.05, rejecting the null hypothesis. The index finger breadth, distal (\( x_{5} \)) is also not significant, rejecting the null hypothesis at the level of 0.9, so we should be careful when interpreting variables with the multiple regression model results. When constructing dummy variables, we use 0 for men and 1 for women. Therefore, the regression coefficient of gender shows that on average, men’s hand length are 0.022 units higher than women, which is also a common sense that men’s hands are larger than women’s.

4 Conclusion

This paper mainly discussed the characteristics of body hand in ten countries based on the statistical data in ISO/TR 7250-2:2010 “Basic human body measurements for technological design—Part 2: Statistical summaries of body measurements from national populations”. Based on the average data and percentile values of hand length we can conclude that the ten countries fall into about three groups: The Netherlands and United States had significantly larger hand length than the other eight countries, the India had the smallest hand length, other countries are in a same condition. We do a factor analysis for five variables, which indicates that we can extract two main factors. “Index finger breadth, proximal” is close to the second factor, “Index finger length” and “Index finger breadth, distal” are closer to the first factor. Finally, we do a multiple regression to find out what extended gender and other variables’ impact on hand length. The results are “Index finger length” has a great significance and on average, men’s hands are about 2% taller than female hands. The obtained results could be a reference for designing the export gloves to these countries. What’s more, the obtained regression equations have a great reference value for forensic science and anthropology.