Introduction

While eating disorders (ED) onset occurs commonly during adolescence [1, 2], first signs of disordered eating attitudes and behaviors (DEAB) can be encountered in children as young as 6 and 7 years old [3]. DEAB include a broad selection of attitudes and behaviors with regards to weight, shape, and eating [4], like image distortion, dissatisfaction with body weight and shape, fear of weight gain, as well as food restriction, fasting, bulimic behaviors, vomiting, and binge eating [5,6,7]. Moreover, a high level of DEAB in childhood have been prospectively associated with ED development during adolescence [8], as well as with a higher risk of obesity [9], which stresses the importance of early detection of DEAB. Therefore, a reliable and sensitive scale for DEAB that is adapted for children is more than necessary.

Although there is a wide number of measures to assess DEAB among adult population that can be used with adolescents, few are adapted to children [10]. Among available measures for elementary school children, the most used are the Eating Disorder Examination Questionnaire adapted for children (ChEDE-Q [11, 12]), the Eating Disorder Inventory for children (EDI-C [13, 14]), and the Children’s Eating Attitudes Test (ChEAT [15]). Of these three measures, the ChEAT is undoubtedly the preferred one to screen for DEAB in non-clinical populations because it provides a total score reflecting global severity of difficulties related to eating and weight which is simple and quick to calculate [16]. Also, its popularity arises from its wide accessibility, as the ChEAT has been translated and validated in numerous languages and countries such as the USA (English [4, 15, 17]), Spain (Spanish [18, 19]), Portugal (Portuguese [20]), Belgium (Dutch [21]), Japan (Japanese [22]), Sweden (Swedish [23]) and Finland (Finnish [24]). However, to our knowledge, the psychometric properties of a French version of the ChEAT have not been reported yet. Given that French is spoken in Canada as well as in many European (e.g., France, Belgium, Switzerland, Luzembourg) and North-African (e.g., Algeria, Morocco, Tunisia) countries, a French version of this questionnaire is needed. Such version will facilitate cross-linguistic comparisons of children’s DEAB.

The ChEAT is a language-simplified version of the Eating Attitudes Test (EAT [25, 26]), a self-reported questionnaire that assesses DEAB in adults and older adolescents. The most widely used version is the EAT-26 [26]; its 26 items can be summed up for a total score or divided in 3 subscale scores (“Dieting”, “Bulimia and Food Preoccupation”, and “Oral Control”). Both the EAT-26 and the ChEAT demonstrated good basic psychometric properties, making them useful screening tools for DEAB in general populations [16]. Mostly, previous validation studies of the ChEAT have reported good convergent validity with diverse measures of body image [4, 19,20,21]. However, the original 3-factor structure of the EAT-26 [26] have never been replicated with the ChEAT. Instead, most of the validation studies of the ChEAT proposed models with 4 to 6 factors, and some of them excluded items from the original 26-item version based on poor factor loading [27, 28]. Recently, Murphy et al.[29] reviewed 13 studies that included factorial analysis of the ChEAT and found no consistency among the proposed factorial structures. Therefore, they developed a new 5-factor structure keeping only the 14 items that fulfilled at least one of these 2 criteria: (1) in more than one third of the reviewed studies, the item loaded on the same factor with a factor loading > 0.70, or (2) in more than two thirds of the reviewed studies, the item loaded on the same factor with a factor loading > 0.30. They were the first to propose a short version of the ChEAT. The 5 factors they identified with their 14-item version were “Dieting”, “Weight preoccupation”, “Food preoccupation”, “Social pressure (to eat or to gain weight)” and “Vomiting (or purging)” with three items representing each subscale, except for the last factor, “Vomiting (or purging)”, that was only represented by two items. Using a large non-clinical sample of 13,674 boys and girls aged from 10 to 15 years old, they compared their 5-factor structure with the original 3-factor structure [26] and a single-factor structure. They found that their shorter (14-item) 5-factor version had a better fit than the original (26-item) 3-factor or the single-factor structure [29]. To provide support for the good convergent validity of their new version, Murphy et al. [29] demonstrated that their five-factor model could better explain adiposity than the three-factor or the single-factor model.

Murphy and colleagues’ shorter version of the ChEAT is of particular interest for screening DEAB in children, since this 14-item version is quicker to administer than the original 26-item version. However, considering that their sample was a mix of children and adolescents aged from 10 to 15 years old, it is impossible to determine with certainty whether this version is suitable for children only. The aim of the present study was to confirm the factorial structure proposed by Murphy et al. [29] and to examine the internal consistency of a French version of the 14-item ChEAT among a French-Canadian sample of children aged from 8 to 12 years old. A second aim was to test the measurement invariance by sex to determine whether the factorial structure applies to both boys and girls.

Methods

Participants

The sample consisted of 1092 children aged between 8 and 12 years old recruited from 27 public elementary schools located in two urban areas in the province of Quebec, Canada. Among participants, 468 were boys (43.1%) and 617 were girls (56.9%). Their mean age was 10.31 (SD = 1.18). Every body mass index (BMI) categories were represented: 5.3% of the children could be classified in the underweight category, 73.9% in the normal weight category, 16.3% in the overweight category and 4.5% in the obese category. Almost all of the children were born in Canada (95%) and came from a family where their parents were either married or living in a common-law relationship (83%). In general, children came from a wide range of social status, although most came from educated and well-off households. Nearly a third of the families had an annual family income of CAD $100,000 or more, and almost half of the children had a parent with a university degree.

Procedure

The children were invited to take part in a large study focusing on eating habits, body weight, and body image. They were informed of the study in class and received an envelope containing questionnaires on various themes related to eating and body image (many of them were not used in this study), and a consent form they could bring home to their parents. For those interested to participate, parental written consent was obtained. Children also provided written assent to participate. They had to complete questionnaires at home without the help of their parents, and to return the completed questionnaires to their teacher. Parents also had to complete questionnaires aside, including demographic information. Completion took approximatively 30 min for children and 45 min for parents. Height and weight were collected individually at school and out of sight of the children’s peers. Measures were took by trained research assistants. Children had to take off their shoes and to remove unnecessary clothes (e.g., hat, hoody, and coat). A metric scale and a numeric weighing scale were used, and height was measured to the nearest 0.1 cm and weight to the nearest 0.2 lb.

Measures

Disordered eating attitudes and behaviors (DEAB). The children’s version of the Eating Attitudes Test (ChEAT [15]) was adapted into French using standardized translation back-translation techniques [30]. Accordingly, an iterative process of independent forward- and back-translation was realized. Two independent professional translators were involved in the translation process. Moreover, an expert committee, including members of the research team and the translators, was formed to discuss any inconsistencies between the original and back-translated versions of the questionnaires until a consensus was obtained, as well as to supervise the process and make sure that the translated items were adequate and understandable by the participants. The ChEAT is a 26-item self-report questionnaire, measuring disordered eating attitudes and behaviors through a 6-point Likert scale ranging from 1 (never) to 6 (always). Its reliability and concurrent validity have been demonstrated previously [4, 31]. In the present study, participants completed the original 26-item version, but only 14 items were retained for the present analyses, according to the 14-item version of Murphy et al. [29]. The items were scored following the original procedure [25]; the three lowest frequency choices (never to sometimes) were scored as 0, and the next three were scored 1 (often), 2 (very often), and 3 (always). The total score derived from the 14 items ranges from 0 to 42, with a higher score reflecting a higher presence and frequency of DEAB.

Body mass index (BMI). Height and weight were measured two times to ensure the validity of the measurement. The mean of the two measures was used to compute BMI (kg/m2). For weight, measurements in pounds were transformed into kilograms. The children’s BMI was classified into four categories (underweight, normal weight, overweight, or obese) according to childhood age- and sex-specific BMI cut-offs proposed by Cole et al. [32,33,34].

Statistical analysis

Group comparisons tests (t-test and ANOVA) were run to examine potential differences on ChEAT total score between sexes and BMI categories. Post Hoc comparisons with Bonferroni correction would be computed to target specific group differences in the case of a significant ANOVA. Confirmatory factorial analyses (CFAs) were performed with Mplus [35] using weighted least square mean and variance (WLSMV) adjusted estimators because normality could not be assumed. Full information maximum likelihood (FIML; maximum likelihood estimation under the missing at random assumption) was used to manage missing data. The sample size reached recommendations of 5–10 observations for each item included in a factor analysis [36]. To measure the fit of the model to the data, the following indices were used: Normed χ2 statistic (χ2/df), Comparative Fit Index (CFI) and Root Mean Square Error of Approximation (RMSEA). A χ2/df ratio of 3 or under, and a value over 0.90 for the CFI indicate an acceptable fit, whereas values of 0.08 and under are acceptable for the RMSEA indices [37, 38]. To test whether the model differed across sexes, measurement invariance with regard to sex was verified using multigroup tests. A first multigroup model was tested with all the factor loadings and intercepts allowed to load freely for both sexes, then a second multigroup model with the factor loadings constrained to be equal across sexes was tested, χ2 statistics of both multigroup model were then compared. To measure internal consistency, ordinal alpha was used to be consequent with the ordinal nature of the data [39, 40]. Ordinal alpha can be interpreted like Cronbach’s alpha, 0.70 being the acceptable lower bound [41].

Results

Descriptive statistics and group comparisons tests according to sex and BMI classification are presented in Table 1. Results showed that girls reported significantly higher scores on the ChEAT than boys, and that overweight and obese groups reported significantly higher scores than underweight and normal weight groups. Also, the obese group reported significantly higher scores than the overweight group. Frequency of responses for the 14 items of the ChEAT is presented in Fig. 1. The 5-factor structure with 14 items proposed by Murphy et al. [29] was first tested and revealed a poor fit to the data (normed χ2 (χ2/df) = 11.32; CFI = 0.866; RMSEA = 0.101, 90% CI(0.095 to 0.108)) because factor loadings for items 9 and 26 were very low due to the near absence of variance (only 10 out of 1092 participants did not score 0 on those items). Those two items, which represented the factor “Vomiting (or purging)”, were thus excluded. The analyses were re-conducted with the 12 remaining items and the structure that was tested was composed of the 4 remaining factors. This 4-factor structure with 12 items (henceforth called the ChEAT-12) provided a good fit to the data (normed χ2 (χ2/df) = 2.10; CFI = 0.987; RMSEA = 0.033, 90% CI (0.024 to 0.042)), with factor loadings ranging from 0.61 to 0.97 (see Table 2). Internal consistency of this 12-item version was good (α = 0.86), and each of the 4 subscales reached an acceptable level (α from 0.69 to 0.95; see Table 2). The four-factor structure was proved to be invariant across sexes as no difference were found between the multigroup model with all the factor loadings and intercepts allowed to load freely and the multigroup model with constrained factor loadings (χ2 = 9.61, p = 0.57).

Table 1 Group comparisons according to sex and BMI classification
Fig. 1
figure 1

Frequency (%) of responses to ChEAT-14 items

Table 2 Factor loadings and internal consistency of the ChEAT-12

Discussion

The purpose of the present study was to confirm the factorial structure of a short version of the ChEAT, previously proposed by Murphy et al. [29], among a large sample of French-speaking children aged from 8 to 12 years old. Our results provided support for a 12-item version that is suitable for elementary school children.

In our non-clinical children population, most of the items of the ChEAT had a low but acceptable endorsement, excepted the items 9 (“I vomit after I have eaten”) and 26 (“I have the urge to vomit after meals”) that turned out to be irrelevant, because less than 1% of the sample endorsed them. Consequently, when computing the CFA, every item loaded properly on their respective factor except for items 9 and 26. After the removal of those items, a second CFA with the 12-item version provided a good fit for the data, which supported the removal of the 2 “vomiting (or purging)” items. Using the original version of the ChEAT, previous studies have reported similar issues with those items. For instance, many validation studies conducted among children samples excluded those two items, either because authors believed they were not suitable for children [19, 42] or because children uniformly scored zero [43, 44]. In their study, using a large community sample, Murphy et al. [29] reported only 4–5% of endorsement for those 2 items, but they kept them in their 14-item version because response frequency was high enough to estimate the parameters in their CFA. Although, we cannot exclude that other factors, linguistic or cultural, may be partly responsible for those difference, the most plausible explanation for this divergence between studies is that the sample of Murphy was older, with a sample ranging from 10 to 14 years old (Mage = 11.6). Indeed, vomiting and purging behaviors, in general, represent extreme behaviors for weight control that may reflect a degree of severity rarely observed in younger children. For instance, with a community sample of over 13,000 children aged from 9 to 14 years, 1-year incidence of purging was around 1% [45], while in a community sample of 5812 adolescents, around 7% reported purging behaviors at age 16 [46]. Taken together, these results support a 12-item version for elementary school children, without items referring to vomiting or purging behaviors, but did not invalidate the use of the 14-item version with adolescents.

This study allows to fill a gap in the present literature by providing a French short-version of the ChEAT that is suitable for children. First, neither the psychometric properties of the ChEAT nor any other DEAB screening tool in French had been previously validated. Therefore, there was no validated assessment for children’s DEAB that was available for French-speaking population. Currently, French is the 5th most spoken language on earth and it is the official language in 29 countries. This represents almost 300 million people and many children [47]. Second, even if a 26-item version of the ChEAT may already seem short, this questionnaire may most likely be used in a context where overall completion time is an issue (e.g., questionnaire battery, along with an interview). In a meta-analysis, the questionnaire length has been targeted as one factor that contributes to the response burden (i.e., the effort required by a person to answer a questionnaire) and that may raise the number of missing answers [48]. Therefore, reducing the questionnaire length and then, the response burden, may prevent methodological problems with children like missing data [48]. Completion time may even be more critical with children as their attentional capacities are more limited when they are under 13 years old [49]. Third, the sample of the current study included more than 1000 young boys and girls, and participants were distributed equally across sexes. Most of the previous validation studies did not include children under 10 or 12 years old, and some of them focused on girls only. In this study, girls showed a higher score of DEAB than boys, which is consistent with previous studies that explored sex differences in children of 10 years of age and up [50]. However, the measurement invariance showed that the factorial structure of the 12-item version did not differ across sexes. Therefore, although there is a difference in DEAB severity across sexes, boys and girls tend to endorse the same DEAB. Moreover, in order to provide a parsimonious tool while maintaining high validity, Murphy et al. [29] rigorous work was taken as a starting point. The fact that the 12-item version proposed in this study has already been validated among adolescents in the almost same version (except for the 2 “vomiting (or purging)” items), give further confidence in its validity. Finally, in this study, children classified in the overweight or obesity categories showed a higher severity of DEAB than those classified in the normal or underweight categories. Therefore, with the 12-item version, the DEAB severity difference between BMI classification observed in previous studies were replicated [51, 52].

The current study has limitations to consider. First, although the sample did offer an adequate representation of boys and girls, it would have been interesting to have more ethnic and socioeconomic diversity. Replication with diverse communities is very important as eating attitudes and behaviors may differ depending on cultural and socioeconomic background [53]. Similarly, the present sample was mostly composed of children without clinical eating or weight problems. Further validation on the 12-item version with a sample of children presenting with more severe DEAB, including children presenting an eating disorder or obesity, would be relevant. Second, other important psychometric properties like test–retest reliability were not examine with this French version and should thus be considered in future research. Third, for the moment, the French version of the ChEAT was proven to fit well a French-Canadian children sample, but the extent to which this factorial structure could be generalized to other French-speaking populations, or to other linguistic groups, remains to explore.

Conclusion

This study is the first to provide psychometric properties of a French assessment of DEAB in elementary school children. The French version of the ChEAT-12 represents a quick and reliable assessment for DEAB with non-clinical population and could be used as a screening tool with elementary school children, even though no cut-off is established yet. Early detection of attitudes and behaviors in regard to weight, shape, and eating is crucial because it can contribute to prevent the development of serious mental and physical health problems like eating disorders and obesity in adolescence.

What is already known on this subject?

  • The ChEAT is one of the most popular measuring tools for disordered eating attitudes and behaviors in children and a short version have been proposed.

What your study adds?

  • This study is the first to provide a French assessment of disordered eating attitudes and behaviors in elementary school children and bring support for a short 12-item version.