Introduction

Obesity is a serious public health problem across the world and, despite increased efforts of prevention and treatment, its prevalence has risen substantially [2], sustaining the need for development of novel etiological hypotheses and strategies for management of this condition [3]. While the importance of exposure to calorie-rich environments is relatively consensual, the factors explaining differential susceptibility for obesity among individuals exposed to such environments are controversial [4]. Factors similar to those seen in substance dependence and addiction have been proposed to underlie the individual risk of obesity, in a process characterized by food cravings, loss of control over consumption of calorie-dense foods, increased intake over time, and unsuccessful efforts to eat less [5, 6]. Such process of “food addiction” (FA) is thought to lead to overeating, which in turn contributes to weight gain [7, 8]. Nevertheless, FA is only one among several etiological factors underlying obesity, and it is neither ubiquitous among obese patients, nor sufficient for the occurrence of obesity [9].

Evidence for validity of the FA construct, both at a behavioral and a neurobiological level, has been progressively supported by research with humans and with animal models [10]. Nevertheless, the concept remains controversial, and it is clear that further research is warranted to better define and understand the behavioral and neurobiological profile of FA [11]. This growing area of research has been supported by development of the Yale Food Addiction Scale YFAS; [12], a self-report measure of addiction-like eating behaviors, based on the diagnostic criteria for substance dependence, as defined in the DSM-IV-TR [13]. This instrument allows for an examination of the psychological, behavioral, cognitive and physiological indicators of addiction-like responses to food [14]. Even considering recent changes in the criteria for substance use disorders, proposed in the DSM-V, the overlap with previous criteria supports its continued use [15].

The YFAS is currently available in many languages [12, 1620], and most studies suggest that it is psychometrically sound for the study of FA. Support for validity of the YFAS has also emerged in a variety of relevant populations, namely patients with eating disorders [19] or who are overweight or obese [21], including those in weight-loss surgery programs [22, 23]. According to a recent meta-analysis, the mean prevalence of FA as assessed by the YFAS was 11.1% in normal-weight individuals, while in patients with overweight/obesity, it increased to 24.9%, and was even higher (57.6%) among those with binge eating disorder (BED) and bulimia nervosa [24]. However, there are also data that question the relevance of FA in the context of obesity and weight-loss treatment. In fact, while Burmeister et al. [1] confirmed that elevated scores on the YFAS were related to less success after 7 weeks of weight-loss treatment, other studies have found only a trend [23] or no effect, at all, of FA on weight loss [25]. Further research is, thus, necessary to clarify the predictive validity of FA in the context of weight-loss treatments, namely through use of the YFAS. This study was aimed to validate the YFAS for use in Portuguese-speaking adult populations.

Methods

Participants

This study was conducted using two separate samples. The non-clinical sample consisted of 278 participants, recruited from three educational institutions using non-probabilistic sampling. The clinical sample, on the other hand, was composed of 190 participants with obesity, recruited using sequential sampling at weight-loss surgery clinics in three Portuguese hospitals (Centro Hospitalar de São João, Hospital do Espírito Santo, and Hospital São Bernardo-Centro Hospitalar de Setúbal). Participants younger than 18 years of age were not eligible. Those with self-reported active neurological, psychiatric, gastrointestinal, hepatic or pancreatic diseases, illicit substance use or alcohol abuse and dependence, prior major gastrointestinal surgery, intra-gastric balloon or history of food allergies, as well as those who were pregnant, breastfeeding, illiterate or did not understand instructions for study, were excluded.

Material

Yale Food Addiction Scale (YFAS). The YFAS [12] consists of 25 self-report items measuring addiction-like eating behaviors of high fat/sugar foods, occurring over the prior year. The seven symptoms of substance dependence, as outlined in the DSM-IV-TR (American Psychiatric Association, 2000), were revised to apply to eating behavior: consumption in greater quantity and for longer than intended; desire or unsuccessful effort to cut down or quit consumption; large amount of time and effort to obtain, consume, and recover from consumption; important social, occupational, or recreational activities given up or reduced; continued behavior despite knowledge of adverse physical and psychological consequences; tolerance; and withdrawal. In addition to 20 items distributed across the seven criteria (symptoms), 2 items are designed to assess clinically significant impairment, and 3 items serve as primers for other questions. Each symptom is satisfied when one or more of the respective items are endorsed. The YFAS provides two scoring options, a continuous version of the scale (YFAS symptom count) that indicates the number of dependence symptoms that have been met (scores range from 0 to 7), and a dichotomous version that provides a diagnosis of FA (i.e., FA status yes/no) when the respondent presents at least three symptoms and reports clinically significant impairment and/or distress.

Other self-report instruments. The Portuguese Power of Food Scale P-PFS [26] is a Portuguese version of the PFS [27], a 15-item questionnaire that assesses psychological impact of an environment with high availability of palatable foods. It rates three dimensions of proximity to food: food available but not physically present, food physically present but not yet tasted, and food tasted but not yet consumed. For this study, Cronbach’s alpha (α) of the P-PFS was .88 for the non-clinical sample and .93 for the clinical sample. The Portuguese Dutch Eating Behavior Questionnaire P-DEBQ [28] is a Portuguese version of the DEBQ [29], a 33-item questionnaire assessing different aspects of eating behaviors, distributed in three subscales: restrained eating, emotional eating and external eating (non-clinical sample—α = .93; clinical sample—α = .90). The Portuguese Eating Disorder Inventory P-EDI [30] is a Portuguese version of the EDI [31], a 64-item measure designed to assess psychological and behavioral traits associated with eating disorders, across eight subscales: drive for thinness, bulimia, body dissatisfaction, ineffectiveness, perfectionism, interpersonal distrust, interoceptive awareness and maturity fears (non-clinical sample—α = .88). The Portuguese Binge Eating Scale P-BES [32] is a Portuguese version of the BES [33], a 16-item scale to assess severity of binge eating using behavioral, cognitive and affective symptoms (non-clinical sample—α = .89). The Newest Vital Sign (NVS) [34] is a screening test designed to assess health literacy. A Portuguese version (P-NVS) is currently being developed [35] (non-clinical sample—α = .78; clinical sample—α = .89).

Structured Clinical Interview for DSM-IV Axis I Disorders (SCID-I). The SCID-I [36], a semi-structured clinical interview for diagnosis of psychiatric disorders according to DSM-IV Axis I, translated and adapted to Portuguese by Tavares [37], was used to assess lifetime and current BED diagnosis in the clinical sample. Different raters applied this interview in the North and South of Portugal, allowing for assessment of the association between rater and SCID/BED diagnosis. The Chi-squared test for independence (with Yates’ continuity correction) indicated that there was no significant association between these variables (χ 2 (1) = .81, p = .276, φ = .10).

Body mass index (BMI). The BMI (kg/m2) of the non-clinical participants was calculated according to self-reported height and weight values provided by participants. In the clinical sample, height and weight were obtained by direct measurement with a SECA digital scale and stadiometer.

Procedures

Permission for translation and validation of a Portuguese version of the YFAS was granted by the original authors (Ashley Gearhardt). The translation process was based on the back-translation technique [38]. The study protocol was approved by Ethics Committees at the Champalimaud Foundation, Faculdade de Medicina da Universidade de Lisboa, Universidade de Évora, Centro Hospitalar de São João and Hospital São Bernardo-Centro Hospitalar de Setúbal. Written informed consent was obtained from all subjects.

A pilot study was carried out with a preliminary version of the Portuguese YFAS (P-YFAS), administered to 7 individuals with obesity and 10 normal-weight individuals, of both genders. Pilot participants were interviewed to assess their reactions to item difficulty, wording and meaning, as well as to the instructions for administration, leading to minor adaptations on wording of several items and a final version of the P-YFAS.

In the non-clinical and clinical samples, self-administered questionnaires were applied using a paper-and-pencil format, completed after socio-demographic information (gender, age, and education) and anthropometric data were collected. The assessment protocol differed slightly between samples: YFAS, PFS, DEBQ and NVS were applied to both samples, the EDI and BES only to the non-clinical sample, and the SCID-I (BED diagnosis) only to the clinical sample. To assess test–retest reliability, the YFAS was re-administered 4 weeks later in a subgroup of 104 participants of the non-clinical sample. This interval was chosen to allow for comparisons with existing data [18]. Temporal stability of the scale was also verified in 30 participants of the clinical sample, with time intervals of up to 8 months, which are closer to customary clinical reassessment time points.

Data analysis

Data analysis was focused on exploring the psychometric properties of the P-YFAS. Based on the one-factor model found in previous studies e.g., [12, 17, 20, 22], a confirmatory factor analysis for dichotomous data (not including the primer items) was conducted on Mplus version 6.0, using a mean- and variance-adjusted weighted least square (WLSMV) estimator with a polychoric correlation matrix. To assess model fit, several criteria were considered: root mean square error of approximation (RMSEA) <.06 [39], comparative fit index (CFI) >.90 [40], weighted root mean square residual (WRMR) <1.0 [41], non-significant Chi-squared (χ 2) test (p < .05) [42], and normed Chi-squared (χ 2/df) <3.0 [43]. Internal reliability was assessed using the Kuder–Richardson’s alpha (KR-20) and, in the non-clinical sample, test–retest reliability was estimated using the two-way mixed, single-measure intraclass correlation coefficient (ICC). To test temporal stability of P-YFAS symptom count in the clinical sample, two subsamples were considered according to test–retest time intervals, namely short (6–8 weeks) and long (3–8 months) intervals. Paired-samples t tests were conducted to analyze whether there were differences in the means of these subsamples at the two time points.

Distributions of continuous measures (total scores and subscales) used to analyze YFAS validity were examined for normality, independently in clinical and non-clinical samples, and found to be normally distributed, as per analysis of kurtosis, skewness and comparison of mean and median. Association between P-YFAS diagnosis and age, education (years of schooling) and BMI was examined using two-tailed Student’s t tests (with Cohen’s d to provide a measure of effect size), and association with gender using independence χ 2 tests (with Yates’ continuity correction). Convergent validity was assessed using Pearson correlation coefficients (r) between P-YFAS symptom count and other instruments measuring eating behaviors, specifically instruments most linked to binge eating and/or reward-related eating. The P-PFS and selected P-DEBQ subscales (emotional eating and external eating) were used in both samples, while the P-BES total score and selected P-EDI subscales (bulimia, body dissatisfaction, and drive for thinness, representing the best screen for BED according to the recent literature [44]), were used in the non-clinical sample. Divergent validity was assessed using Pearson correlations between P-YFAS symptom count and related but distinct constructs, following the same procedure adopted in previous psychometric studies on the YFAS. Here, we considered the dietary restraint (DEBQ restrained eating subscale) and health literacy (NVS) constructs. In the non-clinical sample, three P-EDI subscales (maturity fears, perfectionism, and interpersonal distrust, chosen based on their low screening properties for BED [44]) were additionally used to test divergent validity. To assess discriminative validity, we performed a one-way analysis of covariance (ANCOVA) to compare the mean scores of the P-YFAS symptom count in clinical and non-clinical samples, while adjusting for age and education.

With the exception noted above, data analyses were performed using SPSS version 22.0. All analyses were two-tailed, with significance considered at p < .05.

Results

In the non-clinical and clinical samples, participation rate was 97.8 and 88.8%, respectively. See Table 1 for a summary of socio-demographic and clinical characteristics for participants of both samples.

Table 1 Socio-demographic and clinical information of the non-clinical and the clinical samples

P-YFAS factor structure and reliability

Confirmatory factor analysis with P-YFAS dichotomous items achieved satisfactory goodness of fit for a one-dimensional solution (data not shown). Internal reliability was good (KR-20 = .80) and, with one exception, all items presented a significant coefficient. Item #24 presented a non-significant coefficient and a very low item-total correlation (.06), prompting its exclusion and repetition of analyses (see supplementary material, Table S1). After excluding item #24, fit statistics improved [χ 2 (189) = 456, p < .001, χ 2 /df = 2.41; RMSEA = .06, 90% CI .05/.06, p (RMSEA < .05) = .06; CFI = .90; WRMR = 1.41] with satisfactory fit achieved for all indices with the exception of the χ 2 test. The latter was significant possibly due to the fact that the χ 2 statistic is sensitive to sample size. The KR-20 internal reliability coefficient was .82 and all items loaded significantly on the common factor (p < .001), with item-total correlations between .20 and .60. Five items (#4, #11, #21, #22, and #25) had low item-total correlations (.20 to .30), but KR-20 did not decrease when these items were deleted (see supplementary material, Table S1). Thus, a one-factor model without item #24 was the factorial structure proposed for the P-YFAS. To verify differences between the P-YFAS and the original YFAS, all the following analyses were performed with both versions. Since differences between the two versions were not significant, only the results regarding the final 21-item P-YFAS are presented from here on.

Test–retest reliability in the non-clinical sample was adequate (ICC = .64, 95% CI .51–.74). In the clinical sample, there were no significant differences in the YFAS mean scores for test and retest, after both short (t (12) = −0.39, p = .701, η 2 = .01) and long (t (14) = 0.61, p = .551, η 2 = .03) reassessment intervals.

FA diagnosis: prevalence and associated socio-demographic and clinical features

FA diagnosis was ascertained using the P-YFAS in all subjects from the non-clinical sample, and in all but 12 subjects from the clinical sample. For those 12 subjects, this was not possible due to missing data. FA was, thus, diagnosed in 2.5% (n = 7) of the non-clinical sample and 25.8% (n = 46) of the clinical sample. Similar FA rates were found using the 22-item version of the P-YFAS (2.9 and 25.8%, respectively). Endorsement rates of each symptom assessed by the P-YFAS are available as supplementary material (Table S2). Associations between FA diagnosis and gender were not significant in the non-clinical (χ 2 (1, n = 278) = .02, p = .898, φ = −.03) or the clinical sample (χ 2 (1, n = 178) = .42, p = .515, φ = −.07). FA diagnosis was also not associated with years of schooling and BMI (Table 2). Differences in FA diagnosis were found only according to age in the non-clinical sample, where individuals with FA diagnosis were found to be younger (M = 20.29, SD = 1.98) than the remaining sample (M = 23.13, SD = 6.31; t Welch (9.8) = 3.37, p = .007, d = .59). Given the small number of individuals with FA diagnosis in the non-clinical sample, this finding should be considered with caution.

Table 2 Differences in age, education, and BMI according to P-YFAS diagnostic threshold

Convergent, divergent, and discriminative validity

Table 3 shows correlations between P-YFAS symptom count and clinical measures selected to test convergent and divergent validity. When compared with other measures relevant to eating behavior, the P-YFAS achieved adequate convergent validity in the non-clinical sample (.216 < r < .487, p < .001), and good convergent validity (.412 < r < .727, p < .001) in the clinical sample. Regarding divergent validity, correlations were non-significant, with exception of the DEBQ restrained eating subscale, which correlated slightly, but significantly, with P-YFAS symptom count in both samples.

Table 3 Convergent and divergent validity

Discriminative validity of the scale was supported by finding of a significant difference between the two samples on P-YFAS symptom count scores (non-clinical sample: M = 1.55, SD = 1.31; clinical sample: M = 2.81, SD = 1.87), even after adjusting for age and education (F (1, 452) = 25.25, p < .001, \(\eta_{p}^{2}\) = .05; ANCOVA). Among the clinical sample, the proportion of individuals with an FA diagnosis (25.8%) was also significantly higher than what was found for the non-clinical sample (2.5%; χ 2 (1, n = 456) = 55.23, p < .001, φ = .36; χ 2 test).

Discussion

This study describes the translation of the YFAS [12] into Portuguese (P-YFAS), with assessment of its psychometric properties, in terms of validity (construct, convergent, divergent, and discriminative) and reliability (internal consistency and test–retest), measured in two adult samples (non-clinical and clinical). We confirmed the previously proposed one-factor structure of the YFAS after excluding item #24 (unsuccessful effort to cut down), given its non-significant coefficient and very low item-total correlation. Exclusion of item #24 has been reported in other versions of the scale [1620, 22], with suggestions towards its removal from the questionnaire [16, 17, 20]. Both in our study and those mentioned above, item # 24, along with others that compose the criterion for “persistent desire or unsuccessful attempts to quit” (items #4, #22, #25), allow only for a slight differentiation between food-addicted and non-addicted individuals (see Table S2). We believe that the low discriminative power of these items could result from the prevalence of dieting in the general population [45]. Important next steps in FA research include further refinement of the YFAS, with selection of the more specific items associated to FA.

The P-YFAS exhibited good internal reliability, and test–retest analysis also supported stability of the scale. As yet, only two studies have investigated the stability of the YFAS across time [18, 46]. Both were conducted with non-clinical populations and achieved an ICC slightly higher than what was found here, also for the non-clinical population (~.70). Here, we also report, to our knowledge for the first time, preliminary data for temporal stability of the YFAS in a clinical population. In this population, even though we tested a smaller group of individuals (n = 30) over longer time intervals (up to 8 months), the P-YFAS was found to be relatively stable at both shorter (6–8 weeks) and longer (3–8 months) intervals.

Using the P-YFAS, we found an FA prevalence of 2.5% in the non-clinical sample and 25.8% in the clinical sample. To date, three studies analyzed the prevalence of FA in weight-loss surgery candidates and reported considerably higher rates: 31.8% [47], 41.7% [22], and 53.7% [23]. Comparing the samples of these studies, we did not find striking differences regarding age, and there are insufficient data to allow for comparisons of BMI. The FA rate in our non-clinical sample was also lower than in previous studies using student [16, 18] and population-based samples [17], with prevalence rates around 9%. Only one study [48] found a rate similar to ours (1.6%), but in participants classified as underweight or normal-weight. In our sample, these BMI classes represented 80.2% of the participants, which may have had some impact in FA assessment. However, given the limited evidence for differences in FA prevalence according to weight or age categories [9], this hypothesis should be considered cautiously, and the impact of these and other clinical and socio-demographic variables on the expression of FA symptoms should be further explored. In interpretation of our findings, we are more inclined to argue that the FA assessment can be sensitive to cultural eating habits and food preferences. Culture can affect attitudes towards food, differences in accepted portion sizes, and personal feelings of embarrassment and distress towards overeating [49]. It is possible that in Portugal, a country with rich gastronomic tradition, people view larger food portions as more acceptable than in other countries. The question regarding cultural differences in FA symptomatology is, thus, in our view, essential to extend validation of the YFAS.

The P-YFAS symptom count score was shown to have adequate convergent and divergent validity, with moderate to high correlations between the P-YFAS score and measures of eating pathology. Contrary to our expectation, in divergent validity analysis, we observed small yet significant correlations between P-YFAS and P-DEBQ restrained eating subscale scores. In fact, these two constructs, apparently with opposite expressions (restrictive eating vs. overeating), can be related to the extent that intense dieting may result in persistent hunger and limit cognitive control, making dieters susceptible to disinhibited eating. Dietary restraint may, thus, represent an “intent to diet” rather than caloric restriction per se [29]. Discriminative validity of the P-YFAS was also supported given that, after adjusting for age and education, the symptom count score was able to differentiate between clinical and non-clinical samples, and the P-YFAS diagnostic also differed significantly according to sample group (clinical vs. non-clinical).

The results of our study must be interpreted in the context of the study design. The main findings are based on self-reported measures, implying risks regarding accuracy [50]. Generalizability of our findings is also a concern, with the clinical sample predominantly female and comprising a specific group of patients with obesity: candidates for weight-loss surgery. Confirmation of the psychometric properties of the P-YFAS should, thus, be performed with larger samples of individuals, including broader profiles of patients with obesity. Specifically with regards to short- and mid-term stability of the scale, future research should extend current findings in both samples, since the mid-term stability in non-clinical samples has been analyzed only once [46], and the data we provide regarding stability in clinical samples are preliminary due to the small number of participants that were re-assessed. Finally, the unbalance in sample sizes in this study, resulting from the smaller size of our clinical sample, limited the testing of whether the same P-YFAS configuration holds across groups. In future studies, it would be pertinent to test measurement invariance between groups, namely between clinical and non-clinical samples.

Conclusions

Here, we have translated and adapted the Portuguese version of the YFAS in non-clinical and clinical samples, and found it to have adequate psychometric properties. Nevertheless, our data raised questions related to conceptual definition of the symptom “unsuccessful effort to cut down”, which seems to be a broad and unspecific symptom. Furthermore, the lower percentage of FA diagnosis found in our samples suggests that the YFAS is sensitive to cultural food habits and preferences. Future directions for extending YFAS validation should include the clarification of the impairments specific to FA condition and their relationship with clinical and socio-demographic variables, distinctively in clinical and non-clinical samples.