Introduction

Eating disorders are serious mental illnesses that usually begin in adolescence [1], and in many instances recovery requires intensive professional treatment and support [2]. Ideally, these disorders are diagnosed by a professional multidisciplinary healthcare team after a comprehensive physical and psychological assessments against established diagnostic criteria of mental disorders. The two commonly used diagnostic systems are The International Classification of Diseases (ICD), currently in its 10th edition [3], and the Diagnostic and Statistical Manual of Mental Disorders (DSM), currently in its 5th edition) [4]. The latter is used much more often for research purposes, such as epidemiological studies. Anorexia Nervosa (AN), Bulimia Nervosa (BN), Binge Eating Disorder (BED), Avoidant/Restrictive Food Intake Disorder (ARFID), and Other Specified Feeding and Eating Disorders (OFSED) are the five major eating disorders listed in the DSM-5 [4]. Each eating disorder has its own set of criteria based on extensive research and clinical experience [4].

The causes of eating disorders are unknown, although there is general agreement that a variable and complicated combination of biological, psychological, social, and cultural risk factors increase the probability of eating disorder [2]. According to Global Burden of Diseases, Injuries, and Risk Factors Study (GBD) 2019 eating disorders impact about 42 million people globally [5], and they are frequently misdiagnosed and undertreated [6]. Moreover, their prevalence worldwide is well-documented and appears to be increasing [7,8,9].

The prevailing view of eating disorders is that they are categorically distinct patterns of maladaptive eating habits linked to profound cognitive modification centered on the overvaluation of weight and shape as determinants of one's identity and worthiness [1, 2]. In this regard, many experts in the field believe that “disordered eating” (defined and discussed below) is qualitatively different from the eating disorders [10,11,12]. For this reason, screening tests aimed at measuring patterns of disordered eating are thought not to be a good proxy for estimating the prevalence of eating disorders based on large samples. Screening tests are best used in a typical two-stage design, in which people who were found positive for the screening test criteria then participate in semi-standardized or standardized interviews to ascertain the correspondence of those interview data with internationally agreed-upon diagnostic criteria [13] For example, consider the findings of a recent systematic review and meta-analysis of the prevalence of eating disorders and disordered eating in Western Asia [14]. The prevalence of disordered eating, as measured by three widely used screening tools (see Table 1) was: Eating Attitudes Scale 26 (EAT-26) and Eating Attitudes Scale 40 (EAT-40) = 22.1%; Sick, Control, One Stone, Fat, Food (SCOFF) questionnaire = 22.3%; and the Eating Disorder Examination-Questionnaire (EDEQ) was 8.0%. On the other hand, from those studies using semi-structured interviews against established criteria ICD/DSM, the estimated prevalence of anorexia nervosa was 1.6%, while the figures for bulimia nervosa and eating disorder not otherwise specified (EDNOS [BED + OSFED]) were 2.4% and 3.5%, respectively [14].

Table 1 Detailed description of the clinical measures involved in the systematic review and meta-analysis of disordered eating among university students, psychometric properties, cut-off points and full citation

Disordered eating

By comparison with the criteria for the eating disorders, the widely used phrase "disordered eating" is a broad construct that encompasses unhealthy (i.e., disordered) relationships with food, exercise, body weight, and one's body/body image. It sometimes refers to the presence of individual features of the clinical syndromes, and other times to unhealthy attitudes and practices such as negative body image and calorie-restrictive dieting. Typically, it is assumed that the constituents of disordered eating are less severe than the syndromes codified by DSM and ICD. Nevertheless, as is the case for negative body image, disordered eating is in and of itself a public health problem in many countries because it is associated with a number of negative health consequences, including depression, anxiety, and binge drinking [15,16,17].

Yet, “disordered eating” is used loosely in the literature, and rarely is it defined theoretically. Smolak and Levine have argued that disordered eating is defined by (a) “subclinical” but unhealthy, maladaptive, and misery-inducing levels of negative body image, weight and shape concerns, and calorie-restrictive dieting and/or binge eating [10,11,12]; plus (b) at least several of the following: individual eating disorder symptoms such as self-induced vomiting after eating; abuse of laxatives, diuretics, diet pills, and exercise; unrealistic beauty standards, including an idealization of thinness; irrational and maladaptive beliefs about body fat and fat people, often coupled with a high drive for thinness; and harsh self-surveillance and self-criticism, often in transaction with low and unstable self-esteem [10,11,12].

Longitudinal risk factor research consistently shows that negative body image and disordered eating are perhaps the best predictors of the development of full-blown eating disorders, at least in adolescent girls and adult women [13]. Another way of looking at disordered eating as an “at risk” status is the fact that its components constitute many of the items making up the measures used to screen people to determine, relatively quickly and at low cost, who is “at risk” for actually having an eating disorder upon closer examination using a structured diagnostic interview. Table 1 presents a list of these measures in alphabetic order. The principal purpose of this meta-analysis is to examine the prevalence of disordered eating, as assessed by these measures; hereafter we refer to this concept as screen-based disordered eating, using the acronym SBDE.

Risk and the transition to college or university

Eating disorders can emerge at any age, ranging from early childhood to older adulthood [18, 19]. However, given what is known about risk factors and the modal ages of onset, one high-risk period is late adolescence and emerging adulthood, that is, roughly ages 17 through 22 [20]. Within that developmental period, one general context associated with increased risk for eating disorders is the transition, for some people, from high school to college or university. University students are under increased pressure to conform to body and appearance ideals because romantic/sexuality expectations and peer comparisons, intensified by social media, grow alongside parental and academic demands for competitive excellence, if not perfection [21,22,23]. The confluence of these pressures, along with the distinct possibility of specific stressors such as parental divorce or sexual harassment/assault at college, amplify general sociocultural risk factors for disordered eating and eating disorders.

The transition to college or university also increases the probability of initiation or more frequent use of cognitive enhancers and psychostimulants to boost weight/shape/stamina management and cognitive capabilities. Drugs with potential connections to disordered eating, as well to purported success at (or at least coping with) college and its social life, include nicotine, caffeine in coffee and energy drinks, alcohol, stimulants, and dietary and ergogenic supplements [24,25,26,27]. Moreover, pressures contributing to disordered eating and eating disorders are greater for university students who fall into one or more of the following categories: identify as female; are LGTBQ + ; are involved in the performative arts (e.g., dance) or certain competitive sports; or are overweight or obese [12, 28, 29].

The reported prevalence of SBDE among various, different types of samples of university students in the literature varies from 3.1% (Liao et al. 2006) to 74.5% [30]. Variations between samples are likely due to variability in sampling methodology, sex, age, BMI, measure, and country, but this has not been investigated in a systematic way. A recent systematic review and meta-analysis by our group showed that among medical students higher BMI, Westernized culture, and the research tool used were the main confounders [31].

The explicit purpose of this meta-analysis is to provide an overall or 'absolute' estimate of the prevalence of SBDE among university students as a population at risk for eating disorders and for the distress and comorbid problems attendant to disordered eating itself.

A search of the literature and other registration platforms yielded no previous global meta-analytic review of SBDE among university students in general. Thus, this meta-analysis extends previous reviews by our research team of SBDE in medical students [31,32,33] by evaluating the prevalence and several potential confounders of disordered eating in more general and diverse samples of undergraduate university students across the world. Specifically, the event rate was categorized using pre-defined cut-off scores from validated screening devices, that is, continuous measurements of eating disorder risk such as the EAT-26 and SCOFF (Table 1).

To examine the sources of the expected heterogeneity in disordered eating estimates, confounder analyses, adjusting for age and BMI, will be also conducted for country, percentage of the sample that is female, culture (Western vs non- Western), measure, and timeframe/year. Results of these meta-analyses should be useful in determining allocation of resources in the development and dissemination of prevention programs for undergraduates.

Methods

This study's protocol was registered on 2021–09-19 at Open Science Framework (OSF; https://doi.org/10.17605/OSF.IO/MB74E), an open-source platform that allows researchers to share their findings with others and get assistance throughout their research. To make the review visible and avoid duplication the protocol was also entered into the PROSPERO International prospective register of systematic reviews (CRD42022303882).

This study was reported using Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA2020; [34]. Statistical analyses were conducted and presented according the Meta-analysis of Observational Studies in Epidemiology (MOOSE) protocol [35].

Search strategy

During September 2021 two authors (DS and MK) did an electronic search of the literature using nine databases: PubMed/MEDLINE, American Psychological Association PsycINFO, ScienceDirect, Springer, EBSCOhost, Embase, Cumulative Index to Nursing and Allied Health Literature (CINAHL), Scopus, and Web of Science. The full-text search was conducted according to the following keywords and lists: List A: university student [OR] tertiary student [OR] college student [AND] List B: eating disorder* [OR] eating behavior/behaviour* [OR] feeding disorder* [OR] eating symptom* [OR] eating attitude* [OR] eating problem*. The * ensures that the search term covers both the singular noun forms, as well as the reverse order of the words in the phrase. For example, searching for “eating disorder” encompasses “disordered eating” and “eating disorders”.

To verify that we included all relevant publications, we also examined the reference lists of selected articles to identify other potentially relevant articles and reviews. Meta-analyses that do not include grey literature are more likely to inflate effect size estimates, and produce less exact effect size estimates than those that do [36]. Consequently, while examining the reference sections we looked for organizational reports, unpublished studies, and studies published outside of widely known journals.

Three team members (DS, MJ, and SH) then independently assessed the initial set of articles identified by screening the titles, abstracts, and full-text articles according to the inclusion and exclusion criteria. All duplicate studies were eliminated. Initial data extraction and quality assessment were conducted independently by two team members (DS and OA). Any disputes regarding the suitability of a study for inclusion in the review were resolved by a conversation with the senior reviewer/expert clinician (ML or HJ), followed by consensus of the research team.

Eligibility inclusion and exclusion criteria

This meta-analysis included the full text of original English-language articles, published before 1 October 2021, related to SBDE among university (all countries) or college (in the USA) students. The population was defined as undergraduate students from different disciplines and majors.

To cast the net widely, we included studies that met the following criteria: (1) were published in an English-language journal; (2) the entire sample or a distinct subset of the sample consisted of university or college (in the U.S. sense of the term, i.e., not a private high school) undergraduate students; (3) participants completed one of the screening measures (see Table 1) for determining who is at-risk for an eating disorder, such that scores could indicate endorsement of the extremes of attitudes and behaviors that may exist in many cultures [10, 37, 38]; and, given the preceding criterion, (4) participant responses to the screening measure (e.g., the EAT-26) were scored and reported such that the percentages of participant falling above and below established cut-off points could be determined.

The following sources of data were excluded: (1) studies of students in post-baccalaureate programs (e.g., those pursuing masters or doctoral degrees); (2) investigations of mental health issues other than the prevalence of SBDE; and (3) studies for which we were unable to get the necessary data even after contacting the authors. Figure 1 shows the PRISMA 2020 [39] flow diagram for study selection.

Fig. 1
figure 1

PRISMA 2020 flow diagram for study selection

Procedure

ASReview, a free online tool that combines digital technologies (e.g., natural language processing) with artificial intelligence and machine learning, was used to screen and code the 89 studies selected for systematic review [40]. The Abstrackr semi-automated abstract screening tool for systematic reviews was used to increase the precision of abstract screening [41]. When necessary, data were extracted from plot images using WebPlotDigitizer v4.5, an open-source web-based tool [42].

To standardize data description the following variables, in addition to the key result of the event rate of screen-based disordered eating among university students, were independently extracted by three members of the research team (DS, MK, and SH): Author names, year of publication, country in which the data were collected, sample size, mean age (years), sex (percentage of female participants), mean body mass index (kg/m2), and measure used to determine presence or absence of SBDE. This meta-analysis study included samples from 40 countries, which were further coded into two categories, Western and non-Western countries, according to regional groups of member states defined by the United Nations [43].

Consensus among the aforementioned three reviewers was used to settle disagreements. If consensus could not be reached, a fourth author (ZS) was involved in resolving the issue by discussion. If relevant data were missing from a publication, the corresponding author of the article was contacted.

Assessment of study quality and risk of bias

The Newcastle–Ottawa Scale (NOS) was used by two authors (ZS and HJ) independently to evaluate the quality of the studies included [44]. The NOS checklist consists of three items: participants selection (sampling), comparability, and outcome and statistics. The NOS is based on a rating system [44] in which each item receives 1 to 3 (or 4) stars. This means that the maximum score for each study is either nine (cross-sectional and cohort studies) or 10 stars (randomized controlled trials and case–control studies). A study with 8 stars has good quality and low risk of bias, a study with a score of 5–7 stars has the moderate quality and moderate risk of bias, and a study with a score of 0–4 stars has low quality and high risk of bias.

Data analysis and data visualization

A classical meta-analysis based on the random-effects model was used, with the assumption that actual effects will vary over time [45]. We used the general inverse variance method with the logit transformed [PLO] proportions [46], and the DerSimonian-Laird method was used to estimate and adjust for the between-study variance in effects [47]. Random-effects modelling was used because it assumes that, in using different measures (e.g., EAT and SCOFF; Table 1), different sets of studies are estimating different, yet conceptually related, effects. For each study the pooled prevalence and the 95% confidence interval are reported.

A forest plot was used to display data [48]. It is a disadvantage that forest plots may display only confidence intervals at a significance level, such as p < 0.05. Confidence intervals should also be used to determine whether a research effect is substantial and therefore results are reproducible, so drapery plots were also used [49] and analysis of the p curve was also reported [50]. The drapery plot depicts the p value function as curves that provide the prediction range for a single future study for all individual studies and pooled values in a meta-analysis [49].

To further strengthen the results of the classical meta-analysis, Bayesian meta-analysis was also conducted and reported. Meta-analysis using Bayesian methods has three principal advantages over many classical methods [51]. First, they account for the imprecision of the between-study variance estimates [51]. Second, Bayesian methods take into account “priors”, that is, what is previously known on the topic [52]. Finally, Bayesian methods include external evidence, such as information about the effects of interventions or likely differences between studies [51, 52].

Our prior distribution focused on eating disorder prevalence in the absence of screen-based estimates and was proposed to be µ = 10%, τ = 2%, and η = 5% according to previous global estimates [53, 54]. The large eta was postulated in our review due to the amount of "error" that could be obtained using a screen-based self-reported tool, as indicated by on our comprehensive review of the clinical measures commonly used (Table 1).

Bayesian meta-analysis uses the Bayesian hierarchical model, which relies on the same basic assumptions underpinning the conventional random-effects model [55]. The difference is that in the Bayesian model the prior distribution (informative, weakly informative, or uninformative) is assumed for µ and τ2. The prior distribution describes the uncertainty surrounding a particular effect measure within a meta-analysis, such as the odds ratio or the mean difference [55]. Uncertainty may be attributable to the researchers’ subjective beliefs about the size of the effect or to sources of evidence excluded from the meta-analysis. Quantity uncertainty is reflected by the width of the prior distribution [56]. It is possible to use a non-informative prior when there is little or no available information, such that all values are equally likely [56]. A credible interval (CrI) in Bayesian statistics is a range of values where an unobserved parameter value is likely to occur [57]. In our analyses we reported the [95% CrI].

We assessed between-study heterogeneity using the I2 statistic; a value between 75 and 100% represents a high degree of heterogeneity [52]. We also evaluated heterogeneity using Cochran's Q statistics [58], and tau22) and tau (τ) [52]. The H statistic [59] is the square root of the following: Cochran’s χ2 heterogeneity statistic divided by the degree of freedom [52]. To visualize heterogeneity we used a simple form of the Galbraith radial plot [60] in which the inverse of standard errors (horizontal axis) is plotted against observed effect sizes or outcomes standardized by their corresponding standard errors (vertical axis). On the right-hand side of a full-scale Galbraith plot, an arc shows the corresponding effect sizes or outcomes [61, 62].

Meta-analysis' validity and robustness may be compromised by the inclusion of outliers. Whenever the study's confidence interval does not align with the pooled effect's confidence interval, the study is classified as an outlier and can be addressed by the sensitivity analysis [63]. Therefore, using a Jackknife sensitivity analysis, we eliminated one study at a time to make sure we did not have any inordinate influence from any single study [64]. This analysis involves repeating the main meta-analysis as many times as the number of studies included, discarding one different study each time [64].

A publication bias occurs when the odds of research being published are influenced by its findings [65]. To investigate publication bias, funnel plots were employed as a preliminary visual tool [66]. To correct for funnel plot asymmetry owing to probable publication bias, the trim and fill approach [67] was used to generate adjusted point estimates. To conduct a more rigorous analysis of publication bias, Peters’ correlations [68] and Egger's regression [66] were also utilized as gold standards.

Subgroup meta-analyses were used to investigate heterogeneous outcomes and to answer specific queries regarding distinct populations or study characteristics [69]. Subgroup analyses were performed on categorical variables including country, culture (Western vs. non-Western), and some of the clinical measures/scales used in various studies. To investigate the effect of time as a confounder the studies were clustered into 5-year intervals: 1985–1989, 1990–1994, 1995–1999, 2001–2004, 2005–2009, 2010–2014, 2015–2019, and 2020 onwards. The subgroup meta-analyses addressed any subgroup of five studies or more, and all results are reported graphically using forest plots.

Meta-regressions are, in essence, regression models in which the values of one or more explanatory factors are used to predict the outcome variable [70]. A meta-regression analysis' regression coefficient will indicate how the outcome variable changes as the explanatory variable (the possible moderator/effect modifier/confounding variable) is increased by one unit [70]. A term for the interaction between age, sex and BMI was tested. In statistically significant meta-regression models effect size was reported using R2, and percent of variance explained of 1–8%, 9–24% and ≥ 25% were regarded as small, medium and large effect size, respectively [71].

R software for statistical computing was used to analyze all data [72]. The packages ‘meta’ [73] and ‘metafor’ [74] were used to perform all classical meta-analytics. Package ‘bayesmeta’ was used to perform Bayesian random-effects meta-analysis [75]. Using the package ‘robvis’, risk-of-bias plots were generated for quality assessment [76]. For all investigations, a summary plot (weighted) was generated to show the proportion of information inside each judgment for each domain [76]. Summary of all studies' risk of bias assessments. The risk of bias in each domain, as well as the overall risk, is depicted by a traffic light plot.

Results

Descriptive

The initial literature search, conducted during September to October 2021, yielded 1523 studies, of which 89 independent studies [7, 30, 77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120,121,122,123,124,125,126,127,128,129,130,131,132,133,134,135,136,137,138,139,140,141,142,143,144,145,146,147,148,149,150,151,152,153,154,155,156,157,158,159,160,161,162,163] (across all times and measures, K = 105 data points for analyses; N of participants = 149,629) met the inclusion and exclusion criteria. Details of the studies included are shown in Table 2. Of the 89 studies only two (2.2%) were grey literature: [117] and [121]. They were of a similar quality compared to the published studies.

Table 2 Selected descriptive results of the studies included in this systematic review and meta-analysis of disordered eating among university students

Most of the studies analyzed were cross-sectional (95%), although a few used cohorts (3%) or other methodology (2%), and 11% presented data collected during the COVID-19 pandemic. Furthermore, the Eating Attitudes Test-26 (EAT-26) and Sick, Control, One Stone, Fat, Food (SCOFF) measures were the most common scales, making up 64% of total studies (see Table 3). The mean percentage of participants self-identifying as female was approximately 70% [95% CI = 66–75%], while, as expected, the average respondent's age was 21 years old [95% CI = 20–22; range 18–29 years), with a median sample mean BMI of 22 kg/m2 [95% CI = 21–24]; range 17–26 kg/m2).

Table 3 A meta-analysis of disordered eating among university students

The studies represented samples from 40 countries and territories, and the majority (54%) of studies reported data from non-Western countries. The countries and territories were: Argentina, Australia, Austria, Bangladesh, Brazil, Canada, China, Egypt, Finland, France, Germany, Greece, Hungary, India, Iran, Italy, Japan, Kuwait, Lebanon, Malaysia, Mexico, Morocco, Norway, Pakistan, Palestine, Poland, Portugal, Puerto Rico, Republic of Korea, Romania, Saudi Arabia, South Africa, Spain, Thailand, Tunisia, Turkey, United Arab Emirates, United States, Vietnam, and West Indies.

Seven countries accounted for 42% of the studies qualifying for this meta-analysis: Saudi Arabia (8%), United States of America (7%), Spain (7%), India (7%), France (5%), Malaysia (4%), and Turkey (4%).

Quality assessment of included studies

Figure 2 presents summary plots for the quality and risk of bias of the studies qualifying for this meta-analysis. The overall risk of bias in this sample of studies was moderate, as the categorization of bias was low (17%), moderate (81%), and high (2%). Figure 3 provides a summary of all studies' risk of bias assessments the risk of bias in each domain, as well as the overall risk, is depicted by a traffic light plot.

Fig. 2
figure 2

Summary plot of the assessment of the risk of bias

Fig. 3
figure 3

Traffic light plot of the assessment of the risk of bias

Meta-analysis of the overall prevalence of disordered eating

Figure 4 presents the raw prevalence data for each study and the results of the basic meta-analysis. The prevalence of SBDE among university students (k = 108, N = 146,210) using random-effects meta-analysis was (K = 105, N = 145,629) was [95% CI] = 19.7% [17.9%; 21.6%], I2 = 98.2%, τ [95% CI] = 0.6 [0.64; 0.94], τ2 [95% CI] = 0.34 [0.41; 0.88], H [95% CI] = 7.39 [7.1; 7.7], Cochran's Q p-value = 0.001. Bayesian meta-analysis, shown in Fig. 5, yielded disordered eating odds of 0.24 95% CrI [0.20, 0.30], τ = 92%. An odd estimate of 0.24 equals an SBSE prevalence of to approximately 1: 4 or more simply 24–25%.

Fig. 4
figure 4

Classical random-effects meta-analysis of disordered eating in university students

Fig. 5
figure 5

Bayesian meta-analysis of disordered eating in university students

There is no publication bias in our data, as evidenced by visual examination of the funnel (Fig. 6) and Galbraith radial plots (Fig. 7), as well as Egger's regression test at 0.90 and Peter's test at 0.05. The Jackknife sensitivity analysis showed that excluding one study at a time from this meta-analysis did not affect the prevalence of SBDE in university students by more than 0.5% (Fig. 8), suggesting that our weighted prevalence findings are robust and relatively insensitive to outliers. Another indication that the results from all studies are reproducible is seen in a drapery plot based on p values (Fig. 9), which eliminates the need to rely on the p < 0.05 significance threshold when interpreting the results of any given study.

Fig. 6
figure 6

Funnel plot of disordered eating in university students

Fig. 7
figure 7

Galbraith radial plot of disordered eating in university students

Fig. 8
figure 8

Sensitivity plot of disordered eating in university students

Fig. 9
figure 9

Drapery plot of disordered eating in university students

Confounder analyses

Age, BMI, and sex

Meta-regression analysis (Figs. 10 and 11) showed that BMI and sex are statistical confounders, p = 0.001 and p = 0.04, respectively, for the prevalence of SBDE in university students. As the mean BMI of the sample increased, or as the percentage of the sample that was female increased, so did the weighted prevalence of SDBE. The effect size was large for BMI (R2 =  ~ 0.50), but small for the proportion of the sample that was female (R2 =  ~ 20). There was no statistically significant relationship between mean age of the sample and level of disordered eating (p = 0.49; see Table 3). The study concerned university college students; thus, it is highly unlikely that age could have any impact on the estimates because of the small age range.

Fig. 10
figure 10

Meta-regression between sex and disordered eating in university students

Fig. 11
figure 11

Meta-regression between BMI and disordered eating in university students

The interaction term between age, sex and BMI yielded a statistically significant result, p = 0.01, but the interaction was not explored further because the effect size was negligible, R2 = 0.10.

Country and culture

Figure 12 shows the weighted prevalence levels as a function of country in which the data were collected. These varied tremendously, and, as noted above, the number of studies (k) was very low for many countries. Lebanon (k = 1, N = 627) reported the highest SBDE prevalence of 74.5% [70.1; 77.8], while Argentina (k = 1, N = 472) and China (k = 3, N = 2,301) reported the lowest percentages of 3.8 [2.4; 6.0], and 4.0 [3.2; 5.0], respectively. A subgroup meta-analysis, conducted for the eight countries with at least 5 studies (see Table 3), yielded evidence of statistically significant heterogeneity (p = 0.001) in the prevalence of SBDE. Visual inspection of Table 3 suggests that, at the very least, a greater percentage of university students in the USA are reporting SBDE than their counterparts in India (which has a low heterogeneity index), France, and Saudi Arabia. The difference between different countries was statistically significant, p = 0.001.

Fig. 12
figure 12

Subgroup meta-analysis by Country

Table 3 shows that non-Western countries (k = 55, N = 29,663) have a slightly higher weighted mean prevalence of SBDE, 20.9% [17.8; 24.5], than Western countries (k = 50, N = 115,966), 18.4% [16.4; 20.6], but the difference is not statistically significant (p = 0.52; see Fig. 13).

Fig. 13
figure 13

Subgroup meta-analysis by Culture.

Measure of screen-based disordered eating

There was significant heterogeneity across the various measures (Table 3) used by the sample of this studies in this meta-analysis, I2 = 98.2%, τ2 = 0.36, p = 0.001 (Fig. 14). Considering the measures as 15 categories, the BEDS-7 (k = 1, N = 154) yielded the highest SBDE prevalence at 50.0% [42.2; 57.8], while the EDE-Q (k = 1, N = 503) yielded the lowest prevalence at 7.0% [5.04; 9.54]. In descending order, the prevalence of SBDE for the measures used in at least 5 studies were SCOFF (k = 22, N = 100,638) = 27.6% [24.1; 31.5], EAT-26 (k = 45, N = 23,821) = 16.9% [13.9; 20.3], EDE-Q (k = 6, N = 2255) = 18.1% [8.4; 35.0], EDI (k = 10, N = 6,394) = 16.9% [9.6; 28.2], EAT-40 (k = 6, N = 4355) = 10.6% [7.5; 14.9].

Fig. 14
figure 14

Subgroup meta-analysis by disordered eating measure

Timeframe/years

Ninety-three percent of the studies were published after 2009, while 23% were published in 2020, 2021, or 2022. No studies meeting the inclusion and exclusion criteria were published between 1995 and 2004. Results of subgroup meta-analysis showed a statistically significant (p = 0.001) increase in the prevalence of SBDE among university students (see Fig. 15). Specifically, there appears to be an increase between 2005–2014 and 2015–2022; for the categories 2005–2009, 2010–2014, 2015–2019 and 2020 onwards the weighted pooled prevalence of screen-based disordered eating was 10.6% [07.3; 15.1], 13.0% [8.3; 19.8], 23.8% [20.7; 27.2] and 20.8% [17.6; 24.5], respectively. To further illustrate the effect of year on the prevalence of SBDE among university students, a meta-regression showed that time of publication is a statistically significant predictor (p = 0.001) of increased prevalence rate of SBDE among university students (see Fig. 16).

Fig. 15
figure 15

Subgroup meta-analysis by Timeframe/Year

Fig. 16
figure 16

Meta-regression between year and disordered eating in university students

Discussion

This meta-analysis of 89 studies (total N = 145,629) from 40 countries suggests that the prevalence of screen-based disordered eating among university undergraduate students is 19.7%. We also found that increasing BMI is a strong statistical confounder, while female sex is a statistically significant but weak confounder. Age, which typically does not vary a great deal for undergraduates, had an insignificant effect. Non-Western countries have a slightly higher risk prevalence of screen-based disordered eating (20.9%) compared to Western countries (18.4%), but the difference is not statistically significant. Although slight asymmetry to the right was apparent in the funnel plot, using Egger’s regression test we ruled out significant heterogeneity.

The overall screen-based prevalence rate of disordered eating (nearly 20%) is approximately twice the global prevalence of eating disorders estimates of around (approximately 10) [53, 54]. This is perhaps due to the margin of error of screen-based measurement tools combined with the fact that disordered eating focuses on the presence of individual symptoms while an eating disorder focuses on meeting a group of symptoms for a minimum period of time to meet established diagnostic criteria. However, a figure of 15–20% is also what would be expected if a number of those sometimes co-occurring symptoms were normally distributed within the populations that have been studied to date.

At 19.7%, our overall prevalence rate of screen-based disordered eating corresponds with perfectly Levine and Smolak’s (2021) conclusion based on their narrative review [12]. Furthermore our results are consistent with the findings of the two studies with the largest sample sizes: 20.3% and 20.7% in the studies by Falvey, Hahn, Anderson, Lipson, and Sonneville (2021; N = 77,193) and Tavolacci et al. (2015; N = 3457), respectively [107, 153]. We acknowledge that further research is needed, because the prevalence may be lower. If we assume that the sensitivity of measures such as the EAT-26 and the SCOFF is around 85% [6], this means that at least 15% of 20%, or at least 3%, have disordered eating beliefs, anxieties, and behaviors that are correlated with a wide variety of health problems and that put them at risk for a possible eating disorder [12, 164]. However, regardless of the psychometrics of the screening measures, epidemiological studies indicate that it is highly unlikely that 20−3% = 17%, that is, 1 in 6, university undergraduates have a diagnosable but not yet diagnosed eating disorder. If we place the point prevalence of DSM-5-defined eating disorders at a conservative estimate of 8–10% [8], then our meta-analytic findings suggest, again conservatively, that 10–12%, or at least 1 in 9, of university undergraduates meet our criteria for disordered eating.

The statistic of 1 in 9, let alone 1 in 5—or possibly 1 in 4, based on the Bayesian estimate—undergraduates scoring above the at-risk cut-off on various validated measures that screen for eating disorders and represent disordered eating deserves the attention of researchers, clinicians, public health officials, and mental health advocates for two reasons. First the rationale for, and validity data in support of, the instruments used to measure screen-based (at-risk) disordered eating strongly suggests that there are a meaningful number of university undergraduates who have an eating disorder that is currently undiagnosed and presumably untreated. Although this has been pointed out by many researchers and clinicians, it remains alarming [32] because early detection and treatment have been shown to decrease symptoms to a greater extent and improve the chance of recovery [165].

This aspect of our findings is supported by a very recent study that sought to determine whether established disparities in ED prevalence and receiving mental healthcare for marginalized groups within this population have widened or narrowed over time for different cognitive and behavioral ED symptoms, current probable EDs, lifetime ED diagnoses, and mental healthcare use among college students across the United States [166]. This study found that between 2013 and 2020 there were non-linear increases in ED symptoms and mental health care among young adults in the United States. Moreover, consistent with the data from this meta-analysis, young adults in the United States with higher BMIs had more ED burden with time, as did bisexuals, homosexual, lesbian, or queer people [166].

There has been a strong sense among clinicians in some parts of the world (e.g., the USA and Canada) that “the problem of eating disorders and disordered eating” has been worsened by the COVID-19 pandemic. In a recent study of five cross-sectional samples of French university students, surveyed at 3-year intervals between 2009 and 2021, Tavolacci et al. (2021) found that the prevalence of screen-based disordered eating for the female and male samples remained fairly stable at 26%-31% and 7–13%, respectively, between 2009 and 2018. However, for both females and males the prevalence increased dramatically (+ 20% and + 18%, respectively) between 2018 and 2021, perhaps reflecting the stressors of the COVID-19 pandemic.

Our global data, which incorporates a set of studies of French undergraduates by Tavolacci et al. (2015, 2018, 2020), paints a different picture. We also found evidence of stability—in our study, between 2005 and 2014—but the increase we noted was in the period 2015–2022. Moreover, we found a slight decrease in the weighted pooled prevalence of screen-based disordered eating from the pre-COVID period of 2015–2019 (23.8%) to the post-COVID period (2020 onwards; 20.8%). Our data are consistent with several other recent reviews [167, 168] in suggesting that a COVID-19 effect is accurate in some places and for some vulnerable people—and, in particular, people who already have an eating disorder—but not in general. Future research and meta-analytic reviews are needed to clarify the moderating and mediating variables at work here.

Over the past 10 years the prevalence and seriousness of disordered eating and eating disorders in males has received considerable attention [169]. Nevertheless, our confounder analysis indicated that, as the preponderance of females in a sample of university undergraduates increased, to a small but significant degree so did the prevalence of screen-based disordered eating. This finding, in the context of the very significant sex difference seen in, for example, the Tavolacci et al. (2021) samples, is a reminder that, while males certainly have disordered eating and eating disorders, as do those who do not identify as either female or female, there is still something about the construction, policing, and sociopolitical status of femininity that places females, including female undergraduate students, across the globe at greater risk for screen-based disordered eating, as well as eating disorders [170]. Further research, including meta-analysis, should continue to investigate risk and protective factors that moderate and mediate this disparity.

As noted previously, our confounder analyses also found that, even though the range was narrow, the greater the mean BMI of the sample, the higher the prevalence of screen-based disordered eating. Burnette et al. (2018) reported a similar finding for a sample of U.S. female undergraduate students, but not male undergraduates [28]. Our finding is also consistent with cross-sectional and longitudinal epidemiological studies of U.S. adolescents and emerging adults showing that greater levels of disordered eating and poor psychosocial health among overweight youth [171,172,173]. In this regard Yoon et al. (2020) reported that BMI and disordered eating behaviors rose in a correlated fashion across 15 years of 4 follow-ups of a community sample that was 11 through 18 at baseline [174].

In some cultures at least, it is likely that there is a reciprocal relationship between disordered eating and body mass gain which is mediated by internalized weight stigma and a dieting mentality, and shaped by other sociocultural factors that promote the well-established risk factors of body dissatisfaction and weight and shape concerns [174]. However, further cross-cultural research on the confounders and mediators of the relationship between BMI and screen-based disordered eating is necessary to test particular path models that acknowledge cultural variability. For example, a recent survey of Chinese female undergraduates found that the relationship between body shame and scores on a dietary restraint measure was stronger for those with lower BMI scores [175]. Moreover, a recent meta-analysis of the relationship between disordered eating and use of social networking sites found that there was a small positive relationship for university students, but regardless of sample that relationship was weaker for those with higher BMIs [176].

We did not find the age was associated with disordered eating. This is in line with the results of a study of 680 U.S. female undergraduate students who were screened to eliminate those who would probably qualify to an eating disorder diagnosis. This study found a trivial, nonsignificant association of 0.05 between age and scores on a semi-structured interview assessing screen-based disordered eating [177].

It is possible that the truncated range of the mean sample ages accounts for this null finding. Further research on the relationship between age in undergraduates and postbaccalaureate (e.g., graduate, medical, law students) and screen-based disordered eating is needed. Preliminary epidemiological data from a community sample in Cyprus indicates that a higher percentage of both men (12.3%) and women (23.2%) in the 25–45 age range met or exceeded the EAT-26 cut-off score of 20 than their counterparts in the 12–18 and 46–60 ranges [178]. Of course, age is embedded in the transition from older adolescence to emerging adulthood that is a foundation for the interest in screen-based disordered eating in undergraduates, so longitudinal designs beginning in early or mid-adolescence (see, e.g., Project EAT; Yoon et al., 2020) are needed.

Although our confounder analysis did not find a general difference in screen-based disordered eating between Western and non-Western countries, broad sociocultural factors are likely to be relevant to developmental phenomena, so cross-cultural replications will also be necessary. In this regard a cross-sectional study of over 3,200 males and females in South Korea, using the same EAT-26 cut-off score we did in our meta-analysis, found that the prevalence of disordered eating varied only between 6.7% and 7.2% for age categories 10–12, 13–14, 15–17, and older in 900 undergraduates ages 18 through 24 [179]. Yet, a previous meta-analysis by our team [33] of over 3200 pre-medical undergraduate students from Brazil, China, India, Malaysia, Pakistan, and the UK found that the prevalence of EAT-26-based disordered eating in females was moderated by higher BMI and older age, whereas this was not the case for males.

One strength of our meta-analysis with undergraduates in general is that nearly two thirds of the individual studies used the EAT-26 and SCOFF screening measures. The EAT-26 measure is well validated in a variety of clinical and non-clinical populations from different cultural backgrounds [180]. Likewise, the SCOFF measure appears to be a very practical, highly effective screening tool for detecting risk for eating disorders [181]. In contrast, we recommend against further use of the EAT-40 in studies of screen-based disordered eating, due to a very low sensitivity that results in a large rate of false-negatives [182], which probably accounts for the fact that in our meta-analysis the six studies (published between 1998 and 2019) using the EAT-40 yielded a prevalence of disordered eating (10.62%), half that of the remaining studies.

Study strengths and limitations

To our knowledge this is the first meta-analysis of the prevalence of screen-based disordered eating in the population of university undergraduate students across the world. Other strengths are the large sample size (135,454 participants from 91 studies) and the categorization of event rates using cut-off scores from well-validated measures of FEDs. Nevertheless, the findings should be interpreted with consideration of several limitations. First, the self-report nature of the data from the studies included may be confounded by shame and/or social desirability, and diluting the power of anonymity and leading to underreporting of ED symptomatology. Second, most of the studies considered by this meta-analysis had a cross-sectional design, so that the direction of the causality remains unclear. Third, we exclusively examined English Language articles, which may have led to omission of some relevant non-English articles. Finally, another limitation is inherent in one of the strengths: the inevitable heterogeneity of the numerous studies selected.

Conclusion and implications

The pooled prevalence of screen-based disordered eating in university undergraduate students in 40 countries appears to be 20%. As a number of studies eliminated people with ongoing eating disorders from their sample, this finding supports many previous studies indicating that far too many students have an eating disorder and are not accessing accurate diagnosis and available treatment (see, e.g., Falvey et al., 2021). Moreover, as probably only a 10–15% of that 20% have a diagnosable eating disorder, our data indicate that a large percentage of undergraduates are struggling with disordered eating as a biopsychosocial health problem. It is important to develop means of identifying these students and offering them original or culturally appropriate versions of the Body Project [183] or other effective preventive interventions for high-risk undergraduate students [12]. Finally, our review indicates that prospective studies, using sensitive and specific screening measures such as the EAT-26 and the SCOFF, are urgently needed to illuminate the interactions between the risk factors and to use the information to construct or improve prevention programs for students with disordered eating attitudes and behaviors.

What is already known on this subject?

  • Individuals with eating disorders have disordered eating attitudes and behaviors, but not everyone who engages in disordered eating has, or will be diagnosed with, an eating disorder. The difference is in the frequency and severity of the behaviors, as well as the amount of distress they cause the person.

  • Previous research has reported that university students have a higher prevalence of disordered eating than the general population, indicating that more research is needed.

What does this study add?

  • About 20% of university students exhibit a high level of screen-based disordered eating behavior and can be classified 'at risk' of developing a clinical eating disorder.

  • Results of Bayesian analyses confirm the results of a prevalence-based approach. This is the first time Bayesian statistics are used to compute odds of disordered eating.

  • Strong evidence suggests that screen- based disordered eating is increasing among university students in recent years; thus, planning access for preventive interventions and for supporting those who need outreach and treatment is urgently needed.