Introduction

Physical growth is one of the cardinal features of the developing child. The primary goal of all who care for children is that every child will be provided the opportunity to achieve their maximal growth and developmental potential. The early years of post-natal life are foundational for child and adult health, with nutritional status often associated with increased risk of non-communicable risk factors later in life [1,2,3]. Many factors influence nutritional status during early life including birth weight [4] feeding practices [4, 5], nutrient composition of the diet [6], genetics [7] and environmental factors [8, 9].

Anthropometry is typically used for assessment and results compared with standards or references to determine appropriateness. However, many anthropometric measures are problematic. Firstly, weight-for-length (WFL), and the BMI-for-age are commonly used indices to predict adiposity, but do not distinguish between fat mass (FM) and fat-free mass (FFM). Secondly, anthropometric cut-offs for these measures have been determined based on population distribution of the parameter rather than on biological or functional relevance. Thirdly, references have commonly been based on selected populations and/or populations of high-income countries resulting in inaccuracies in determining nutritional status in certain ethnic groups, most commonly, an underestimation of overnutrition and an over-estimation of undernutrition [10]. This is especially the case in populations with low birth weights who are relatively short and have low weight-for-age.

While it is understood that body composition, rather than anthropometry, is a better indicator of an infant’s nutritional status, body composition assessments may not be suitable for clinic settings or large scale epidemiological studies. Most anthropometry-based approaches are relatively easy to use, quick to produce results, inexpensive, portable and involve minimum discomfort to the child. It is therefore important to identify the best anthropometric predictive measures to reflect body composition. A number of anthropometry-based prediction equations have been developed to assess body composition in infancy [11, 12] and it has been shown that combining anthropometric measures [11, 13,14,15], gestational age [16], sex [17, 18], and ethnicity [9] improves predictability. However, many equations are not generalizable to populations other than those they were developed in and equations developed in multi-ethnic populations are scarce. This study aims to develop anthropometry-based prediction equations for body composition during the first two years of life in infants from diverse socioeconomic settings and ethnic groups.

Materials and methods

Study design and recruitment

Data are from the Multi-center Infant Body Composition Reference Study (MIBCRS), a longitudinal, prospective, multinational study, that followed infants from birth to 24 months in lower-middle (India, Pakistan and Sri Lanka), upper-middle (Brazil and South Africa) and high-income (Australia) countries and details of the recruitment process is described elsewhere [4, 19,20,21]. Each participating country conducted the study adhering to International Ethical Guidelines for Biomedical Research Involving Human Subjects and obtaining approval from their respective review committee. Informed written consent was obtained from the enrolled mothers and data were collected from 2013 to 2019. The main cohort comprised of 3–24 month data from Brazil, Pakistan, South Africa, and Sri Lanka from 708 mother infant pairs to assess body composition used for the development of equations (training data) and validation of the developed equations (validation data). An independent cohort comprised of 250 infants (3–6 months) from Australia, India and South Africa was used for external validation of the developed equations (test data). During follow-up, children were fed according to Infant and Young Child Feeding (IYCF) guidelines.

The sample size for study sites was calculated to have a power of 90% to detect FM and FFM for boys and girls less than one standard deviation away from a reference study, that found a mean FM of 3.10 ± 0.5 kg and 3.05 ± 0.46 kg, and mean FFM of 9.13 ± 1.06 kg and 8.99 ± 1.1 kg for boys and girls, respectively [22].

Body composition assessed using the Deuterium Dilution (DD) technique

DD was utilized to calculate FM and FFM of infants at 3, 6, 12, 18, and 24 months of age in the development and validation group, and at 6 months of age in the test group. Details of the technique are provided elsewhere [23].

Anthropometry

Anthropometric data for this analysis were used from the visits at 3, 6, 9, 12, 18 and 24 months in the development and validation group and from the 6 month visit in the test group. Standardized protocols for anthropometry were developed based on the WHO Multicentre Growth Reference Study (WHO-MGRS) protocol [24].

Infant weight was measured naked, using a paediatric electronic scale (Seca 376; Hamburg, Germany) and length using a Harpenden infantometer (300–1100 mm, accurate to 1 mm; Holtain Ltd, Crymych, Wales, UK) in all countries except for India and Sri Lanka (Seca 417; Hamburg, Germany). A detailed protocol has been published [23].

Triceps skinfold thickness (TSFT) and subscapular skinfold thickness (SSFT) were measured using a Holtain Tanner skinfold calliper to the nearest 0.2 mm on left arm (Holtain Ltd, Crymych Wales, UK). Each skinfold thickness was read after 2 s, consistent with the WHO-MGRS methodology with MAD of 2 mm [24]. Mid upper arm and head circumference were measured with a non stretchable flexible tape to the nearest 1 mm and MAD was 5 mm (Seca 212; Hamburg, Germany).

Quality control in data collection

Anthropometry protocol training was undertaken in Johannesburg, South Africa and subsequently, anthropometry standardization sessions were undertaken locally at three monthly intervals. Intra- and inter-observer technical errors of measurements were calculated and compared to the measurements obtained by the anthropometry supervisor (gold standard). A training workshop on the DD technique was held at St John’s Research Institute, Bangalore, India. The IAEA organized an inter-laboratory comparison for the analysis of deuterium enrichment among the laboratories responsible for analyzing samples.

Statistical analysis

Data were collated and captured on the REDCap system [25], hosted at the University of the Witwatersrand, Johannesburg.

Sample split for model development

Participants from the 3 to 24-month cohort were split randomly into training (two-thirds) and validation (one-third) groups. The training data consisted of 942 sets of anthropometric and body composition measurements (collected from 310 girls) and 954 sets of measurements (collected from 340 boys). Observations corresponding to the 24-month visit were included only up to 26 months of age. The validation data set consisted of 441 sets of measurements (collected from 154 girls) and 500 sets of measurements (collected from 170 boys) from the same four country cohorts. Test data for external validation of the fitted model consisted of participants from three birth cohorts in Australia (21 girls, 30 boys with one set of measurements each at 6 months), India (44 girls, 46 boys, with one set of measurements each at 6 months) and South Africa (120 sets of observations, collected from 59 girls and 88 sets of measurements collected from 50 boys, who provided data during at least one visit between 3 to 6 months). Additional information on the sample selection is provided in Supplementary Fig. 1. We describe the characteristics of our training, validation, and test data separately by sex using median and range (Table 1 and 2).

Table 1 Comparison of age, anthropometric and body composition measurements between the training and validation samples between 3 and 24 mo of age for each sex, pooled and for each country.
Table 2 Comparison of age, anthropometric and body composition measurements in the test samples between 3 and 6 mo of age for each sex, pooled and for each country.

Developing the prediction equations

A priori power analysis was not conducted for developing the prediction equations and therefore included longitudinal data of all infants (n = 650, observations = 1896) for whom the data was available. Previous studies that estimated body composition using anthropometry had similar or smaller sample sizes [11]. Joint distribution of all variables was examined using scatterplots and Pearson correlations.

A linear mixed model on the training data separately for girls and boys, using linear splines with knots at 9 and 18 months, was used to develop the prediction equations for FM (kg) and FFM (kg). We used linear mixed models with random intercepts to account for clustering of observations among individuals. The knots for age were selected based on visual inspection of trajectories for FM and FFM. Additionally, the models were adjusted for length (m), WFL (kg/m), TSFT, SSFT, and Asian ethnicity. Adjusting for head and arm circumference did not influence the results, and as such, were not included in the final prediction equations. South Asian ethnicity was ascribed based on the country from which the participant was recruited. Estimation of the 95% prediction interval was attained by incorporating uncertainty in random effects (for training data only), uncertainty in fixed effects, and residual variance of outcome variable. A detailed description of the methodology is provided in Supplementary Note 1.

Subsequently, the fitted model was internally validated on the ‘validation’ data and evaluated for the quality of predictions using: (a) error metrics (root mean squared error - RMSE; root mean square percentage error – RMSPE; mean absolute error – MAE; and mean absolute percentage error - MAPE) for the predicted values, and (b) the number of instances for which true values were outside the prediction interval. A summary of the different error metrics used is provided in Supplementary Note 2.

Validation of prediction equations

The fitted model was externally validated on the test data and evaluated for quality of predictions using the same error metrics as above. Additionally, systematic error in predictions were explored (on test data with observed and predicted values) using Bland-Altman plots.

Sensitivity analysis

First, model misspecification was assessed by repeating the analysis with models fitted (a) using quadratic term for age, (b) using natural cubic splines for age, and (c) using natural cubic splines for all predictors. Natural splines are a family of piecewise cubic polynomials and were fitted with four degrees of freedom. These models were then compared using error metrics and the conditional Akaike Information Criterion [26]. Second, prediction error was assessed against the logarithmic transformation of the outcome variables for the four model specifications due to observed right skew in the outcome variables. Third, prediction error was assessed after substituting WFL as a predictor with (a) BMI (kg/m2) and (b) ponderal index (kg/m3) for the linear spline model. All analyses were executed using R 3.6.1 using lme4 (v1.1-23), merTools (v0.5.2) and cAIC4 (v1.0) packages.

Results

The distribution of FM and FFM by age at each visit stratified by cohort for each participant in the training data set of the four countries in the main cohorts are depicted in Fig. 1. The variance of FFM increased with age. We also provide the distribution of FM and FFM by cohort for training and validation data (Supplementary Fig. 2) and for test data (Supplementary Fig. 3).

Fig. 1: The trajectories of fat mass and fat free mass in children aged 3–24 months from low and middle income countries according to sex.
figure 1

All are in kg; A male fat mass, B male fat-free mass, C female fat mass, D female fat-free mass. The thick lines are loess smoothed trajectories fit using penalized cubic splines for all participants (N) in the Training and Validation data.

Association of anthropometry with FM and FFM

Unadjusted non-linear associations of age with FM and FFM were observed in both boys (Supplementary Fig. 4) and girls (Supplementary Fig. 5). Further, linear associations of length and weight-for-age with FM and FFM, were also observed. All anthropometric variables were positively correlated with FM and FFM (except skinfold thickness measures with FFM). The associations of skinfold thickness with FM and FFM were non-linear at higher values, potentially due to high leverage points.

After adjusting for other covariates, both length and WFL were positively associated with FM and FFM in boys and girls (Table 3). Both measures of skinfolds were independently and positively associated with FM, while they were negatively associated with FFM in boys and girls. The association of age with FM was non-linear from 9 to 18 month with a rapid reduction in slope (Boys: −0.04 kg per month, 95%CI: −0.07, −0.01; Girls: −0.05 kg per month, 95%CI: −0.08, −0.02). The association was also non-linear after 18 months, with an offset in the slope reduction from an increase in slope (boys: 0.03 kg per month, 95%CI: 0.00, 0.06; girls: 0.06, 95%CI: 0.03, 0.09). Similarly, the association of age with FFM after adjusting for other covariates was non-linear, with a negative slope up to 9 months, a rapid increase in slope from 9 to 18 months and a subsequent rapid reduction in the slope.

Table 3 Coefficients for linear spline model.

The equations to predict FM and FFM for each sex are provided in Table 4. RMSE was higher for FM and FFM among older children in both boys (FM: 0.66; FFM: 0.66) and girls (FM: 0.51; FFM: 0.53). RMSE was also similar between training and validation for older age. In the assessment of FM and FFM of the 3−9-month age group, the RMSE was also similar between validation and test data (Table 4). Most observations were within the prediction intervals which incorporated uncertainty in fixed effects and residual variance, for validation and test data (Table 5). RMSPE and MAPE were higher for validation data while predicting FM in males and females, compared to test data. RMSPE and MAPE were also higher in test data for predicting FFM in males and females, compared to validation data. RMSE was under 510 g for prediction of FM in validation group, but RMSPE was very high. MAE in the prediction of FFM was in the range of 350 to 460 g in all three groups, but the MAPE was low (under 10%).

Table 4 Prediction equations for fat and fat-free mass in children 3 to 24 months.
Table 5 Estimates of uncertainty in prediction of fat and fat-free mass in training, validation and test groups.

When the RMSPE for assessment of FM predictions were evaluated by country, Sri Lanka showed a higher value (boys 47.1, girls 55.2) (Supplementary Table 1) compared to the mean value for training group (boys 35.2, girls 35.6) and validation group (boys 42.9, girls 40.8) (Table 5). Similarly, MAPE values for Sri Lanka (boys 33.7 girls 36.3) were higher compared to the mean value for training group (boys 21.4, girls 21.8) and validation group (boys 27.1, girls 24.3). For FFM, Sri Lanka RMSPE values (boys 7.4, girls 9.1) were similar to mean training group values (boys 7.1, girls 7.9) and mean validation group values (boys 7.5, girls 8.6). Similarly, MAPE values for Sri Lankan children (boys 5.7, girls 6.4) were comparable with mean training group values (boys 5.2, girls 5.8) and mean validation group values (boys 6.0, girls 6.4).

Of the three countries in the test group for assessment of FFM, RMSPE for South Africa (boys 9.9, girls 10.1), Australia (boys 10.0, girls 6.2) and India (boys 14.3 girls 18.2) (Supplementary Table 1) showed good agreement with the validation group (boys 7.5, girls 8.6) (Table 5). Similarly, MAPE for South Africa (boys 7.8, girls 8.2), Australia (boys 7.9, girls 4.9) and India (boys 12.5, girls 15.4) also showed a similar distribution as the validation group (boys 6.0, girls 6.4).

The Bland-Altman plot for predictions on the test data (Fig. 2) suggested no proportional bias for FFM in males. The model for FFM in males underestimated the outcome by 0.33 kg. The plot was suggestive of proportional bias (|r| ≈ 0.3) for FM in males and females, and for FFM in females.

Fig. 2: The Bland–Altman plots for prediction of fat mass and fat free mass on test data from Australia, India and South Africa children.
figure 2

All are in kg; the difference (d) of mean line (solid black) from zero is the systematic difference between measurements (i.e., estimated bias). Points outside the limits of agreement (±1.96 SD; dashed black) lines are outlier predictions. Correlation (r) in the Bland–Altman plot between average and difference suggests proportional bias such that the prediction model doesn’t agree over the range of outcome values. A Male fat mass (n = 164; d = 0.29, r = 0.24, p < 0.01), B male fat-free mass (n = 164; d = −0.33, r = −0.09, p = 0.21), C female fat mass (n = 185; d = 0.30, r = 0.27, p < 0.01), D female fat-free mass (n = 185; d = −0.32, r = −0.26, p < 0.01).

Sensitivity analysis

Comparison of alternate model specifications against the linear spline model showed similar error in predicting the outcomes in the training and validation data for both FM and FFM in males and females (data not shown). The conditional AIC for the linear spline model was better than the model with quadratic age (Supplementary Table 2). However, the models with natural splines fit for all predictors had the best fit for both outcomes in males and females. Log-transforming the outcome yielded similar errors in prediction as the model without transforming the outcome. Similarly, using an alternate version of weight adjusted for length (BMI, ponderal index) did not yield different results from using WFL in the linear spline model (data not shown).

Discussion

This research enabled the generation of four sets of anthropometry-based equations for the prediction of FM and FFM in male and female infants from diverse ethnic and socio-economic settings, in three different age categories.

To the best of our knowledge, this is the first multi-ethnic equation developed on a population representing different regions of the world, to evaluate body composition in the first 24 months of life. Validation of the equations showed good correlation between actual and predicted values. The study group was divided broadly into South Asian and non-South Asian ethnicity and is likely to be a logical division as most prior data have shown a significant difference between South Asian ethnicity and others.

Apart from validating the equations on a subsample of the same cohort, the 3–9-month age range equation was applied on a different cohort of children derived from three socioeconomically and ethnically diverse populations. RMSE and MAE for test data were quite similar to the training and validation data, and the numbers of observations that were seen outside the prediction interval were also similar. Equations had a better prediction value for FFM compared to FM. Thus, these equations might be better used to predict FFM, with calculation of FM by subtracting FFM from weight, rather than by directly assessing FM.

Regression models with diverse variables can be used as predictors of body composition during the first two years of life. The value of this approach has been shown previously by Gopalakrishnamoorthy and co-workers based on multi-ethnic population derived anthropometry-based equations for the assessment of body fat at 3 days and 15 weeks for both girls and boys, and a sex combined equation at 54 weeks of age using weight, length, head circumference and skinfold thicknesses, with a high coefficient determination (R2) (0.82–0.92) [27]. As sex and age during the first two years affect body composition, our study also provided sex-specific equations for the assessment of FM and FFM for three different age categories. Length, WFL, TSFT, SSFT and Asian ethnicity showed clear association with FM and FFM, although some studies have shown skinfold thicknesses not to have predictive value of total body fat [28]. We used both TSFT and SSFT as variables, which improved the predictability of FM, and the predictability was higher for males. Length and head circumference are determined more by genetics than by nutrition [29, 30]. Our prediction model did not improve by adjusting for head circumference and arm circumference.

A systematic review revealed that age, ethnicity and socioeconomic status affect the body composition of infants [31]. With physical growth, body composition changes from birth through infancy to childhood [32]. With age, the non-subcutaneous FM increases, with higher values seen in females [28]. These tissue changes underline the importance of using prediction equations suitable for different ages and sex [14]. Furthermore, ethnic differences in body composition have been noted [31, 33]. In the London mother and baby study, South Asian ethnicity had less FFM and more FM compared to European ancestry at 6–12 weeks of age and authors concluded that it is a reflection of ethnicity or maternal physiology during intrauterine life rather than dietary or behavioural factors [33, 34]. Developing reference data accounting for age and ethnic diversity provides a more meaningful understanding and assessment of body composition.

As stipulated in the Developmental Origins of Health and Diseases (DOHaD) hypothesis, the first 1000 days of life is closely linked with prospective health outcomes in later life. Details of body composition provide a much better understanding of physical growth related to health than anthropometry alone. Higher birth weight, a rapid gain in weight, especially accretion of fat during the first 1000 days of life, predisposes an individual to develop obesity in childhood, and assessment of body composition would help in getting a better assessment and taking steps in prevention [35]. Anthropometry-based equations have gained popularity over time due to their reasonable accuracy and ease of application and cost-effectiveness in low resource settings. Indirect body composition assessment techniques are not freely available for the first twenty-four months of life and the sex-specific prediction equations provided by this study for children from 3 to 24 months, developed on data from several countries of different socioeconomic backgrounds and ethnicities, would be a useful tool.

Many factors affect the results of prediction equations, and one of the most important is to apply a prediction equation on populations that are similar to the ones used in the original development. Even if age and sex are matched, they will still not produce accurate results if socioeconomic and ethnic background are not matched [14]. The test group was multi-ethnic and South Africa was part of both training and test groups. After desegregation of test data by individual country, test results showed good agreement with validation data for assessment of FFM. This affirms the notion that equations developed on the similar population are more applicable than those introduced from a different population.

A major strength of this study is that it used a multi-ethnic population of different socio-economic strata and equations have been cross validated on an independent group of infants with different socio-economic backgrounds. In conclusion, this study provides anthropometry-based equations for the prediction of FM and FFM for three age categories within the first 24 months of life for each sex. Predictability of FFM was higher than predictability of FM. Due to its wide representation, these equations would be better than previously developed equations and help in the assessment of body composition during early years of life easily and accurately.