Introduction

Normalization of the sizes of cardiovascular structures allows for comparison of measurements among children with different body sizes. In addition, it accounts for the physiologic effect of somatic growth and highlights the pathologic effect of disease processes on cardiovascular sizes [1]. The Pediatric Heart Network (PHN) recently calculated Z-scores based on body surface area (BSA) for common echocardiographic measurements to account for the effects of somatic growth on the sizes of cardiovascular structures in a large group of healthy and racially diverse children [2]. The PHN Echo Z-Score Project also determined that age, sex, race, and ethnicity did not have a significant effect on the relationship between cardiovascular measurements and BSA.

Many investigators have suggested the use of height instead of BSA to normalize cardiovascular measurements, because BSA does not characterize the individual effects of lean body mass and fat mass on cardiovascular growth, particularly in the overweight and obese populations [3].

Cardiovascular structures are sized to meet stroke volume requirements, but the effect of fat mass on stroke volume requirements is less than that of lean body mass [4]. In addition, visceral and subcutaneous fat have different effects on the cardiovascular system, with possibly pathologic impact by visceral fat on cardiovascular sizes [5]. This issue has been particularly highlighted in studies looking at normalized left ventricular (LV) mass to determine LV hypertrophy [6, 7].

Normalized aortic and LV measurements are often used to make clinical decisions related to medical and surgical interventions in children. The aim of this study was to develop and compare models for height-based and BSA-based normalization of aortic and LV sizes in a subset of the PHN study population of normal children. In addition, the impact of other demographic and anthropometric variables on both models was assessed. We hypothesized that height-based normalization models would result in a similar correlation coefficient as BSA-based normalization models, and the residual relationships of the models with BMI Z-score would be the same.

Methods

This study was a secondary analysis of children with normal echocardiograms collected from 19 North American centers as part of the PHN Echo Z-score Project. The retrospective study design assured adequate enrollment across the entire pediatric age range with equal numbers of boys and girls across a broad range of race and ethnicity categories, using specific inclusion and exclusion criteria as outlined in the methodology [2]. As previously reported because all submissions were de-identified, most children were retrospectively enrolled under a waiver of consent after Institutional Review Board (IRB) or Research Ethics Board approval. Race/ethnicity information was not routinely obtained at one center and was collected prospectively for eligible subjects after local regulatory approval. Some centers were able to perform research echocardiograms without charge and prospectively enrolled healthy children after IRB approval. For this analysis, the study population included healthy, non-obese children [body mass index (BMI) < 95th percentile for age and sex] over 2 years of age from the PHN Echo Z-Score Project. Subjects were categorized into 3 weight groups based on BMI: underweight (BMI < 5th percentile), normal weight (BMI in 5th to < 85th percentile), and overweight (BMI in 85th to < 95th percentile). Children younger than 2 years old from the original PHN cohort were excluded because the definition of underweight and overweight based on BMI is limited to children over 2 years old [8]. The study parameters for the analysis included the aortic annulus, aortic root, ascending aorta, LV end-diastolic volume, and LV mass, all measured using standard pediatric echocardiographic methods [9].

Statistical Analysis

The study parameters were normalized based on height using the same methodology based on BSA in the PHN Echo Z-Score Project [2]. In order to determine the best exponential transformation of height (heightα) for each parameter (X), several values of α were explored by nonparametric locally weighted scatterplot smoothing (LOESS) curve fitting [10], histograms, and linear regression plots of indexed parameter and height. The best height transformation was chosen when (1) there was a linear relationship between X and heightα, (2) the indexed parameter (X/heightα) was normally distributed, and (3) there was no clinically significant residual dependence of the indexed parameter on height (with a zero slope when the indexed parameter is plotted against height). Published reproducibility thresholds have reported that measurement variability is responsible for at least 5% of measurement differences for primary measurements such as aortic diameters and up to 10% of measurement differences for calculated parameters such as LV volume and mass [11]. Therefore, clinical significance for aortic measurements was defined as a difference of at least 5% between actual and predicted measurement values using models with and without the statistically significant effects. For LV volume and mass, a threshold of 10% was used to determine clinical significance.

The PHN Echo Z-Score Project determined the best exponential transformations of BSA (BSAα) for the study parameters and calculated BSA-based z-scores based on the mean and standard deviation for the indexed values from the full cohort [2]. Due to exclusion of subjects < 2 years of age from the current analysis, the full PHN cohort was not included in these analyses, and the mean and standard deviation for the BSA-based indexed values were recalculated for this subset of the study population. Height-based and BSA-based Z-scores for this study population were calculated using the following equation:

$$Z=\frac{\left[\left(\mathrm{parameter}\right)/({\mathrm{height}}^{{\alpha }}\, \mathrm{ or }\,{\mathrm{BSA}}^{{\alpha }})-(\mathrm{mean}\, \mathrm{value}\, \mathrm{of}\, \mathrm{indexed}\, \mathrm{parameter})\right]}{\mathrm{SD}\, \mathrm{of}\, \mathrm{indexed }\,\mathrm{paramater}}$$

After the height transformations were chosen for the height-based models, multivariable regressions were performed on the indexed parameters to assess for statistically and clinically significant linear and nonlinear effects of age, sex, race, and ethnicity as well as their interactions. A p-value of ≤ 0.05 was used for significant main effects and ≤ 0.01 for significant interaction effects. Higher order interactions were considered first and removed from the model if not significant. Lower order interactions and main effects were kept in the model even if not significant if the effect was part of a significant higher order interaction. Age was assessed both linearly and piece-wise linearly with plotting and nonparametric LOESS curve fitting [10]. To determine whether statistically significant main effects and interactions were clinically significant, predicted values from the two models with and without effects were tested to see if the predicted raw echo parameters from the two models differed by more than 5% for the aortic measurements and 10% for the LV calculations. This was assessed with a t-test of the absolute proportion difference between the models.

To assess for a residual relationship with BMI, the relationships between BMI Z-score and the indexed parameters based on height and on BSA were evaluated via linear regression and tested for statistical and clinical significance. Demographics and height-based and BSA-based Z-scores were compared among the 3 weight groups with analysis of variance (ANOVA), while the effect of BMI Z-score was assessed via linear regression modelling.

Results

Among the 3215 subjects in the PHN cohort, 2299 were greater than 2 years old with demographic and clinical characteristics as listed in Table 1. The chosen height transformation exponents (α) as well as the published BSA transformation exponents (α) for each parameter (X) are listed in Table 2. All indexed parameters (X/heightα and X/BSAα) were normally distributed based on visual inspection. There was a nearly linear relationship between all parameters and transformed height and BSA with correlations that ranged from 0.87 to 0.93 (p < 0.001 for all). The relationships of the indexed parameters with height and with BSA were statistically significant with non-zero slopes, but none of these differences were considered clinically significant as defined for this analysis (Table 2). Z-scores based on height were then calculated from (1) the mean indexed parameter values, (2) the values for α, and (3) the standard deviations for each parameter as listed in Table 2.

Table 1 Participant demographics: overall and by BMI category
Table 2 Height and BSA transformations: correlations with cardiovascular parameters and residual relationships

Multivariable regressions revealed statistically significant residual relationships of the indexed parameters with age, sex, race, and ethnicity, but none of the residual relationships were clinically significant (Supplementary Table A). There were also statistically, but not clinically, significant relationships between the indexed parameters and BMI Z-scores. Interestingly, when models that accounted for the effect of BMI Z-score were compared with models that ignored BMI Z-score, the percent differences between models increased from the 1st to the 3rd BMI quartile for the height-based models and decreased for the BSA-based models (Supplementary Table B). The median study parameter Z-scores based on height and BSA and their interquartile ranges for the underweight, normal weight, and overweight groups are depicted in Fig. 1. In the height-based normalization models, the median Z-scores were lower in the underweight group compared to the overweight group. In other words, the underweight group appeared to have smaller height-adjusted aortic and LV sizes than the overweight group with this model. On the other hand, in the BSA-based normalization models, the median Z-scores were increased in the underweight group compared to the overweight group. The relationship between body size parameter and BMI is further depicted in Fig. 2, showing that study parameter Z-scores based on height tended to increase with increasing BMI Z-score and those based on BSA tended to decrease with increasing BMI Z-score. Interestingly, for LV volume and mass, the absolute values for the slopes of the linear relationships were higher for Z-scores based on height compared to those based on BSA, indicating that normalization for height was less successful than normalization for BSA at eliminating residual dependence on body size in this non-obese population.

Fig. 1
figure 1

Cardiovascular parameter z-scores based on height vs. BSA, by BMI category

Fig. 2
figure 2

Cardiovascular parameter z-scores based on height vs. BSA, by BMI Z-score

Discussion

This study calculated Z-scores based on height for aortic and LV measurements and compared them to Z-scores based on BSA in a large group of healthy, non-obese children. The models for normalization based on height and BSA are similar in terms of heteroscedasticity and absence of clinically significant residual relationships with age, sex, race, and ethnicity. Importantly, BMI Z-scores have a positive residual relationship with height-based Z-scores, which are higher for the overweight population, and a negative residual relationship with BSA-based Z-scores, which are lower for the overweight population.

This study used height to normalize aortic diameters and LV volumes in a large group of children. Previous studies used height to normalize LV mass as calculated from M-mode measurements in children and adults [12,13,14,15]. In contrast, the PHN study used 2-dimensional echocardiographic measurements and the area-length method to establish LV mass Z-scores based on BSA in children [2]. Studies comparing height-based and BSA-based normalization models in the adult population have resulted in variable conclusions, likely related to the variable prevalence of obesity in the study populations [3, 7, 16,17,18]. Krysztofiak and colleagues studied a group of adolescent athletes and, like our study, also found that height-based LV mass Z-scores were higher and BSA-based LV mass Z-scores lower in the overweight athletes [19].

Models based on BSA normalize measurements by assuming that the effects of lean body mass and fat mass are equivalent, whereas models based on height place more emphasis on the effects of lean body mass. Both models tend to adequately account for the effect of body size for individuals with normal weight and a more predictable ratio of lean body mass to fat mass, but both fail to account for potential differential effects of lean and fat tissue on cardiac output. If the goal is to normalize the potential effects of fat mass and assess the effects of other pathologic factors on the heart, then some have suggested that normalization based on BSA may be more useful [20, 21]. On the contrary, if the goal is to preferentially evaluate the effect of lean body mass over fat mass on cardiovascular growth, specifically when comparing normal weight to overweight subjects, then normalization based on height may be more useful. It is important to remember, however, that height, like BSA, does not provide information on the actual ratio of lean body mass to fat mass, limiting its ability to fully distinguish between the effects of lean versus fat mass on cardiovascular sizes.

With the increasing prevalence of obesity and obesity-related risk factors such as hypertension that affect cardiovascular size, assessment of the effect of fat mass in addition to other pathologic factors may be more relevant when caring for obese patients. Interestingly, Foster and colleagues found that, in obese children, normalization based on lean body mass was more concordant with normalization based on BSA than with normalization based on height [22]. In addition, Mahgerefteh and colleagues found that normalization by lean body mass removed the effect of blood pressure on LV mass [6].

The PHN Echo Z-Score Project excluded obese children from the study population, precluding a comparison of the normalization approaches in this group. When overweight children are compared with normal weight and underweight children, there are significant differences in aortic and LV Z-scores among the 3 groups. Cardiovascular structures seem bigger in the overweight group with height-based normalization and smaller with BSA-based normalization in the same population. An illustrative example of these differences can be provided by 2 children, one underweight and the other overweight, with a LV mass of 75 g. If both children had a height of 1.4 m, the height-based LV mass Z-score is 0.6 for both subjects but the BSA-based LV mass Z-score would be + 2.2 for the underweight child and − 0.4 for the overweight child. If we perform the same analysis for 2 children with a BSA of 1.15 m2, the BSA-based LV mass Z-score is + 1 for both subjects, but the height-based LV mass Z-score would be − 0.6 for the underweight child and + 2.8 for the overweight child (Supplementary Table C).

In fact, when looking specifically at the relationship between BMI Z-scores (as a measure of adiposity) and LV mass Z-scores based on both approaches (Fig. 2), the downward slope of the BSA-based relationship is less steep than the upward slope of the height-based relationship. In other words, height-based Z-scores were more likely to demonstrate the difference between underweight and overweight children than BSA-based Z-scores. The persistent relationship between height-based LV mass Z-scores and BMI Z-scores may provide some insight into the actual effect of adiposity on LV mass in underweight children.

Limitations

This was a retrospective study with the known biases associated with this type of analysis. Furthermore, this study was limited to non-obese children, so one cannot fully extrapolate the findings to obese children who usually have other comorbidities that may affect cardiovascular growth. Moreover, patients with systemic hypertension and other systemic disorders affecting the heart were also excluded from this evaluation. Simultaneous blood pressure and measurement data were not collected to assess the impact of blood pressure on aortic and LV sizes. The threshold of 5% and 10% for clinical significance of residual relationships for aortic and LV calculations, respectively, were based on previously published inter-observer variability. In order to decrease the complexity of the analysis, we limited the analysis to aortic and LV measurements, and the results cannot be generalized to other echocardiographic measures. Finally, BMI as an index of adiposity is limited, and evaluating the individual effects of lean body mass and adiposity on cardiovascular growth is beyond the scope of this study.

Conclusion

Normalization of aortic and LV sizes based on height is feasible and comparable to normalization based on BSA in healthy, non-obese children. The use of height results in higher cardiovascular Z-scores in heavier children, whereas the use of BSA results in higher cardiovascular Z-scores in lighter children. The performance of normalization based on height and BSA in the obese population warrants further studies.