Introduction

Eating disorders (EDs), notably anorexia (AN) and bulimia nervosa (BN), are characterized by self-inflicted weight loss and recurrent episodes of bingeing and purging, respectively. An irrational overvaluation of the importance of controlling food, weight, and body shape represents the specific clinical features [1]. Severe EDs impair quality of life and interpersonal relations [2], and increase the number of productive years lost to disability [3]. The standardized mortality rate is about five times higher than in the general population [4, 5], and it takes six to nine years before 70% of the patients no longer meet the diagnostic criteria for an ED [6, 7].

Comorbid personality disorders (PDs) are frequently encountered in the treatment of EDs, and may become as protracted and impairing as the EDs. Previous studies [810] show that a comorbid borderline, avoidant, or obsessive-compulsive PD may worsen the long-term treatment-outcome of EDs. Moreover, a comorbid PD may complicate treatment challenges by increasing the risk of premature treatment termination due to a fragile therapeutic alliance [9, 11], prolonging treatment for non-therapeutic reasons [1214] or resulting in insufficient focus on alleviating ED symptoms due to the need to address the PD. However, there are inconsistent findings from studies [15, 16] and reviews [17, 18] as to whether a concurrent PD predicts a poor outcome of an ED, whether PDs improve at the same rate as the ED or tend to persist after the alleviation of ED symptoms [1922].

In order to develop and examine comprehensive treatment models in terms of their cost-effectiveness, ability to overcome treatment challenges, and to prevent an unfavourable ED outcome, it is essential to determine how frequently or likely comorbid PDs are expected to appear in EDs. Research to date has shown that a comorbid PD among ED patients may be rather common, yet with large variations in the comorbid proportions, ranging from 27 to 93% across all PDs. However, 2–50% of the variation in comorbid borderline PD seems related to heterogeneity in samples, with higher PD proportions among inpatients compared to outpatients or community samples [2325], and methods for generating PD diagnoses (e.g., higher PD proportions when using self-report instruments compared to clinical interviews) [2629].

In contrast to qualitative reviews [25], meta-analyses provide proportion estimates, and thereby a more precise summation of the field. In addition, moderator analyses may identify variables explaining the true variation between studies. A moderator of particular interest is age of onset of an ED. While a higher age of onset is related to a lower proportion of PD [30], an early teenage onset is related to more severe general psychopathology [22] that may impair a normal personality development.

Three meta-analyses have addressed the comorbidity of EDs and PDs, comprising 17 studies from 1995 to 2004 [23], 19 studies from 1989 to 1997 [31], and 28 studies from 1983 to 1998 [29]. These studies yielded an overall mean proportion of PDs ranging between .19–.50 for AN and .25–.59 for BN. The only study [29] providing confidence intervals showed large and overlapping intervals for the comorbidity proportions, and hence no statistically significant difference between the mean PD proportions of AN (.50) and BN (.59). All the three studies report an equal proportion (.18) of cluster A PDs for AN and BN, respectively, and higher proportions of cluster B disorders for BN (.27) compared to AN (.19). Cluster C PDs were more prevalent in AN (.46) than in BN (.35) in two of the studies, while the study by Rosenvinge et al. [29] found negligible differences (i.e., .45 and .44, respectively). The latter study did not report estimates for specific PD diagnoses in contrast to the other two meta-analyses [23, 31]. These indicated that the avoidant PD was prevalent in both AN and BN (.35 and .25, respectively). The obsessive-compulsive PD was more prevalent in AN (.26) than in BN (.17), while the opposite was the case for the cluster B borderline PDs (.36 and .29, respectively). Moderator analyses, conducted in the Rosenvinge et al. [29] study, found higher PD proportions in studies using self-report instruments rather than structured clinical interviews, and in inpatients compared with outpatient samples. However, these analyses were limited to PD cluster levels. Similar analyses were not conducted at all in the other two meta-analyses, and all studies failed to include a healthy comparison group.

In the current meta-analysis, we included a healthy comparison group and the largest number of studies to date. As such we were able to overcome limitations of previous meta-analyses, and to test the impact of several moderator variables such as age, age of ED onset, and BMI on PD comorbidity. Finally, a larger pool of studies could facilitate discussions about the theoretical and clinical implications of how clusters and specific PD diagnoses are distributed across the two ED diagnoses AN and BN.

Methods

Search strategy

PsychINFO, Embase, and Medline were searched for empirical studies published in English and German between January 1980 and January 2016, using the following search terms: “eating disorder(s)” or “anorexia” or “bulimia” and “personality disorder(s)” and “comorbidity”. Using this search strategy, a total of 1168 articles were located.

An additional sample of 62 articles were found in a predecessor of OVID (K = 19), and in the reference lists of six meta-analyses and review articles (K = 43) [18, 23, 25, 29, 32, 33].

This study was undertaken as part of a project of meta-analytic investigations into the comorbidity between PDs and several symptom disorders, from which papers have been published on anxiety disorders [34], mood disorders [35], and binge eating/eating disorder NOS [36], respectively. A total of 17 studies were found during the search for papers in relation to the mood and anxiety disorder projects, leading to a total of 1247 articles.

Study selection

To be included in the analysis articles had to be (1) empirical studies (2) published between 1980 and January 2016, (3) in German or English, with (4) patients of at least 18 years of age, with (5) a primary diagnosis of an ED. In addition, the study had to report (6) diagnostic information about the proportion of comorbid personality disorders. Reading the title and the abstract of the 1247 articles, 1091 papers were removed because patients had been diagnosed with another axis I diagnosis than an ED, or with a comorbid substance-use disorder in addition to the ED, or because the participants had recovered from their ED. Thus, 156 papers remained. An additional 69 articles were removed for various reasons, including sample overlap (K = 20; the most recent and complete article was selected), failure to report the proportion of PDs or statistics that could be converted into proportions (K = 32), studies that had PDs as an inclusion criteria (K = 9) or subjects below 18 years of age (K = 1).

A total of 87 studies comprised the final database, and of these, 25 studies reported information on AN and 32 on BN. The list of included studies and an overview table (A1) are both available from the first author.

Data extraction

The following variables were coded: year and country of publication, type of ED and PD diagnosis, comorbid events of PD (the proportion of the sample having eating disorders, total and for different sub-types of EDs), sample size, percentage of female participants in the sample. Moderator variables included average age and age of onset, type of sample (inpatient, outpatient or recruited), diagnostic system (i.e., DSM-III, DSM-III-R, DSM-IV, ICD-9 or ICD-10), method for diagnosing PD (interview, self-report questionnaire or clinical assessment), rater blind to the ED diagnosis (yes/no), and weight (BMI and weight classification; underweight, normal, overweight).

Coding and estimation of coder reliability

Coding was performed by three graduate students in psychology, who were trained and supervised by the first, second, and last author. Further details on the estimation of coder reliability may be found in Friborg et al. [36].

Statistical analyses

A meta-analysis was conducted using the Comprehensive Meta-Analysis V3 program [37]. SPSS (v 23.0) was used for descriptive statistical analyses. The mean weighted event rate (number of PD cases/sample size) was used as an effect size measure. A random effects model was estimated for all meta-analysis calculations as this model assumes effect sizes in a population not to be constant, and that other factors than sampling error can contribute to the observed variation in effect sizes (for example, study design, patient characteristics and measurement methodology) [38]. Studies were weighted by the inverse of the variance components comprised of both random variation (sampling error) and variation between studies [39], resulting in more equal weights between the studies compared to weights assigned when using the fixed effect model. To examine variation between studies, a Q-statistic was calculated [40] in addition to I 2 (percentage of observed variance that is real) [39]. A significant result indicates heterogeneity and the need to further examine moderators that may explain the true variance between studies. The analyses were conducted separately for AN combined (anorexia nervosa, anorexia nervosa restrictive type, and bulimic type), for BN (purging and non-purging subtype), and for any type of eating disorder. The category “any type of eating disorder” included AN, BN, binge-eating disorders, eating disorders not otherwise specified (EDNOS), in addition to eating disorders in general where the sub-type was not reported in the article or reported for BN and AN combined. To ensure independent effect sizes, the study was used as unit of analysis, and multiple effect sizes from a study were combined before the meta-analysis calculations were conducted when needed. Some articles reported the proportion of PDs in healthy controls, and these studies were combined for comparison purposes.

Moderator variables were analyzed if at least three studies were available for each sub-group. Categorical variables were examined by comparing groups with a mixed effects analysis, which uses a random effects model to combine studies within each sub-group and a fixed effect model across sub-groups. The BMI was infrequently reported, and thus the three categories underweight, normal, and overweight were used in the moderator analyses. Continuous moderators (age and age of onset) were analyzed using meta-regression with age and age of onset as independent variables and the logit event rate as the dependent variable. The regression parameters were estimated using a random effects model with full maximum likelihood estimation [39]. As the CMA-program performs meta-regression on logit event rates instead of event rates directly, the results have to be transformed back (anti-log) for interpretation purposes. Meta-regression was conducted for the combined category of eating disorders and for any type of personality disorders. Only significant individual moderators were included in the meta-regression to ensure a sufficient sample of studies. Categorical moderators were dummy-coded before entered in the meta-regression analysis. For the moderators weight, age, and age of onset, the analyses could be based on sub-groups from a study if different weight and age groups were reported separately.

Results

Sample characteristics

A total of 87 published studies were included with a mean publication year of 1998 (SD = 6.5). The studies were conducted in 18 different countries. The largest group consisted of American studies (46%), followed by UK (9%), and German studies (7%). A total of 79% studies included only women, whereas the rest had a small proportion of men, and one study included men only. The mean age was 27.3 years (SD = 6.6), and the mean age of onset was 18.2 years (SD = 1.8).

Comorbidity of PDs for any ED and healthy controls

Meta-analysis results for any type of ED and healthy controls are presented in Table 1. The mean proportion of any PD was .52 in any EDs, and significantly different from the PD rate (.09) in healthy controls due to non-overlapping confidence intervals. For the three clusters, the mean PD proportion increased from .12 (cluster A), to .28 (cluster B), and to .38 (cluster C) for any ED, and were significantly larger than the proportions found in healthy controls (.02, .04, and .08, respectively). For the specific PDs the proportions of paranoid, borderline, avoidant, dependent, and obsessive-compulsive PD were significantly higher for patients with any ED compared to healthy controls. Borderline and avoidant PDs had the highest prevalence in any ED with a mean proportion of .22 and .20, respectively (Table 1). The heterogeneity statistics (Q value) were significant for all analyses in the any ED group, inviting for moderator analysis. The corresponding Q-statistics for healthy controls were non-significant except for any PD and dependent PD.

Table 1 Mean proportions of PD diagnoses for any eating disorders and comparison groups

Comorbidity for AN and BN

Meta-analysis calculations were also performed for AN and BN separately (Table 2). The mean proportion of any PD was .49 for AN and .54 for BN. The pattern of the estimated mean proportions was similar between AN and BN, and the confidence intervals were overlapping between the two groups. Both disorders yielded high proportions of borderline and avoidant PDs (between .19 and .25). For patients with AN, obsessive-compulsive PD was also relatively frequent (.23) and significantly higher than in BN (.12). The calculated Q-statistics indicated significant variation among studies for both AN and BN, except for antisocial and narcissistic PDs in AN (Table 2).

Table 2 Mean proportions of PD diagnoses for anorexia nervosa and bulimia nervosa

Moderator analyses

The heterogeneity statistics indicated significant variation for almost all analyses (Tables 1, 2) indicating a need for examining moderators (Table 3). In order to maximize the number of studies available for moderator analyses, any type of PD was first examined for the combined category of any type of ED. There were no significant differences in PD proportions between the three diagnostic systems (DSM-III, DSM-III-R, and DSM-IV) for determining a comorbid PD diagnosis. However, using the DSM-III system for diagnosing EDs resulted in substantially higher mean PD proportions than with DSM-IV (.69 vs .45). The type of assessment for diagnosing PD also revealed significant differences, showing higher proportions for self-report questionnaires (.71), compared to clinical assessments (.45), and interview (.50). Whether or not the clinician was blind to the ED diagnosis did not affect the mean PD proportion. However, the number of studies in each group was relatively small (K = 3 and K = 4) providing low statistical power. Patient weight was a significant moderator, and samples classified as overweight had generally lower mean PDs compared to normal and underweight samples. Type of sample (outpatient, inpatient, both, and recruited) was not a significant moderator.

Table 3 Moderator analyses results: mean proportions of any type of personality disorder for the combined category of eating disorders

The moderator analyses were also conducted for AN and BN separately (Table 4). Not all comparisons could be conducted, as many sub-groups included fewer than three studies. The assessment method for PDs for BN patients was significant and similar as reported above. Furthermore, BN outpatients and patient samples that were recruited yielded higher comorbidity rates compared to inpatient samples and samples that consisted of both in- and outpatients.

Table 4 Moderator analyses results: mean proportions of any type personality disorder for anorexia and bulimia nervosa

The two continuous moderators, mean age and age of onset, had to be examined in a meta-regression model using logit transformed event rates for any PD as the dependent variable. Sub-groups of EDs were treated as independent observations for studies which reported mean age and age of onset for different sub-groups. The meta-regression was significant for age (K = 84, β 0 = 1.25, β = −0.04, p < .001), indicating a decline in PDs with increasing age for any ED. The corresponding analyses for AN and BN resulted in non-significant findings.

Moderator analyses for age of onset with any PD as the dependent variable were not significant for the combined ED category and for AN and BN separately.

To explore the combined effect of multiple moderators, the significant individual predictors from the previous moderator analyses for any ED were included in a meta-regression. The variables were age and three categorical variables (patient weight, diagnostic system for ED, and Method for assessing PD). Each categorical variable consisted of three groups resulting in a total of six dummy variables. The total model with seven variables explained 37% of the between-study variance (Table 5). Two of the predictors were significant, including the dummy variables Weightoverweight (β = −1.33, p < .05) and MethodPDinterview (β = −1.39, p < .05) indicating lower rates of PDs in samples classified as overweight compared to normal weight, and using interview for diagnosing PDs.

Table 5 Meta-regression analysis results of any type of personality disorder for the combined category of eating disorders based on a random effects model

Discussion

Summary of main results

Personality disorders (PDs) are highly comorbid in both anorexia nervosa (AN) and bulimia nervosa (BN) as more than half of the patients have comorbid PD diagnoses. For both disorders, cluster C PDs are most frequent, followed by clusters B, and A in descending order. For AN, the proportion of cluster C PDs was significantly higher compared with cluster A and B, while there was a non-significant trend towards a higher proportion of cluster B than cluster A PDs. For BN, the clusters B and C PDs were equally prevalent, and both occurred significantly more often than the cluster A PDs.

No difference across the specific cluster A and cluster B diagnoses were observed across AN and BN, but in both groups borderline was the most frequent cluster B specific PD. Within cluster C PDs, a statistically significant difference was detected only for obsessive-compulsive PD, which was more prevalent in AN than in BN.

Summary of moderator results

We examined several variables as potential moderators. The diagnostic system used to diagnose EDs showed that the DSM-III, DSM-III-R, and the DSM-IV systems in declining order yielded lower comorbid PD estimates.

The use of questionnaires to diagnose PDs revealed higher comorbidity estimates than structured clinical interviews. We also observed a difference in comorbid PDs between the different weight groups, in which the overweight group had a considerably lower rate of PDs than the underweight and normal weight groups. Mean age showed a significant relation with PD comorbidity by being lower in older groups. Age of onset of ED did not explain a significant variation in the proportions of PDs between the studies. The meta-regression model with the predictors patient age and weight, diagnostic system for ED, and assessment method for PD explained a total of 37% of the between-study variance.

Models of understanding and clinical implications

The common factor model of understanding the comorbidity between PDs and AN or BN, respectively, posits that a common trait ranging from constriction/perfectionism to impulsivity may result in specific associations between AN and cluster C PDs, notably the obsessive-compulsive PD, as well as BN and cluster B, notably the borderline PD [1, 8, 41, 42]. To some extent, this prediction was supported in the sense that the obsessive-compulsive PD proportion was almost twice as high among AN patients (.22) than among BN patients (.12). There was also a trend towards a higher proportion of a comorbid borderline PD among BN patients (.25) compared to AN patients (.19). The frequent observation of diagnostic crossovers between EDs [43, 44] fits with a common factor model for PDs. Such crossovers are expected to be more likely if the PD pathology underpinning both AN and BN are of a comparable rather than of a dissimilar nature. If EDs share a common underlying personality pathology, we would expect the PD comorbidity differences between AN and BN to be negligible both on the PD clusters and the specific PD levels. Such a prediction was partly supported, as the most frequent specific PD diagnoses across the two ED groups (i.e., the borderline and the avoidant PDs) showed proportions of comparable rates (±.05). Although the differences were somewhat larger on cluster B and C levels (±.10 and ±.07 for AN and BN, respectively), they still were within a comparable window.

For obvious reasons, the present study cannot address whether a comorbid PD complicates the outcome of EDs. Still, the high proportion of PDs in any ED and in AN (i.e., .49), and BN (i.e., .54) bring forward a question of whether comorbid PDs might complicate the treatment of EDs. In particular, the comparable proportions (i.e., of the borderline PD and the obsessive-compulsive PD in AN, and between the avoidant and the borderline PDs in BN) may serve to undermine a prototypal picture of these EDs and, rather, to draw the clinician’s attention to a broader clinical approach covering a spectrum of obsessionality and impulsivity. In the present study some comorbid PDs (i.e., the schizotypal and the antisocial PDs, and to some extent the schizoid PD) occurred equally rarely in patient samples as in healthy control samples. This finding suggests that the complication model may need modifications. Summarized, assessment of PD comorbidity particularly related to borderline, avoidant, and obsessive-compulsive PDs should be a routine in ED treatment.

Strengths and limitations

This is the largest meta-analytic review of comorbid PDs in EDs to date. The “file-drawer” problem that may often be the case for non-significant psychotherapy trials is less relevant for studies on comorbidity as low or high comorbid prevalences are of equal concern and interest. In addition, we observed considerable heterogeneity in the reported proportions, which normally yields asymmetrical funnel plots compatible with non-publication bias. Moreover, as the comparisons between blinded and non-blinded studies yielded comparable proportion estimates, the validity of the present findings is considered to be good.

Despite the large number of studies included, the moderator analyses were still somewhat restricted, due to scattered missing data prohibiting moderator analyses on sub-group levels, and resulting in meta-regression analyses only for a sub-set of samples and moderator variables.

Given the mean onset of AN in the early teens, it may seem like a limitation to exclude those few studies comprising patients below 18 years of age. In adolescence personality and its deviations are not fully developed. Hence, we would argue that to include the few studies diagnosing PDs among adolescents would yield data with highly disputable validity as well as more heterogeneity in the results, and thus introducing a more serious limitation. The small proportion of men in the primary studies implies that the comorbidity estimates may not be generalizable to men with EDs. However, given the dominance of women with EDs this is a minor limitation.

Conclusions

The present study adds to the literature and the clinical contention that PDs occurs frequently among patients with AN and BN, as has been found with respect to other eating disorder diagnoses [36]. It is noteworthy that the methodological quality of studies has improved over the years, particularly with respect to the use of standardized methods for determining comorbidity. However, future studies should further improve the methodology by establishing a standardization protocol describing what kind of information all clinical studies should minimally report.