Introduction

Anxiety and depression are frequently observed in patients suffering from physical diseases, with an enormous burden on the patients and on the society [1]. These symptoms, however, often remain undetected by physicians, with detection rates below 50 % [2, 3]. Several screening instruments have been developed to identify patients with increased distress, e.g., the Hospital Anxiety and Depression Scale (HADS). Developed by Zigmond and Snaith [4], this questionnaire has been translated into many languages and used in many clinical studies, especially in cardiology and oncology. Compared with multiple other questionnaires on anxiety, depression, and general distress, applied to cancer patients, the HADS obtained the best overall evaluation in a review with 132 studies [5], while another meta-analysis with 41 studies found out that the HADS has a good (but not superior to other screening instruments) case-finding ability. A large Norwegian study [6] established an association between HADS depression and all-cause mortality rate. Many studies have been performed to test the psychometric properties of the instrument [7], especially the dimensional structure. A recent review [8] found that exploratory factor analyses generally confirmed the bidimensional structure, while confirmatory factor analyses (CFAs) in most cases support three-dimensional solutions, e. g., [912]. However, these three-dimensional solutions are not identical. In clinical practice, the original scale structure (anxiety and depression) is maintained in most cases. However, sometimes a summarizing score over all 14 items is calculated, indicating general mental distress [13]. The HADS was also tested with item response theory [1416].

The interpretation of individual HADS scores requires the availability of normative data obtained from a sample of the general population. Such normative studies have been performed in several countries: Germany [17], UK [18, 19], Norway [20] (depression subscale only), Sweden [21], Hong Kong [22], and South Korea [23]. There are some remarkable differences between the countries. Anxiety mean values range from 3.6 [22] (Hong Kong) to 6.4 [19] (UK), while the depression means were found between 3.3 [22] (Hong Kong) and 6.6 [23] (Korea), with the European countries in between. Age and sex differences were also investigated in these studies. Most investigations found increasing depression mean scores with increasing age, especially in the largest study with more than 60,000 people in Norway [20], while the age dependency of anxiety was less pronounced. Sex differences were also not identical in the investigations. Women generally reported more anxiety than men in most studies, with differences ranging from 0.5 [24, 25] to 2.1 [21] points, whereas depression differences between males and females were small. We cannot give reasons for the differences in the study results. However, these discrepancies, even between European countries with similar social structures, indicate the limited degree of generalizability and the need for normative studies in other countries.

Though the HADS has been used in Latin America in several clinical investigations (e.g., [2629]), normative data has yet to be made available for this part of the world with about 500 million inhabitants. Anxiety and depression can be considered components of quality of life. Latin American normative values have been published for the quality of life instruments WHOQOL-BREF [30] (Brazil), SF-36 [31] (Mexico), and SF-6D [32] (Brazil). An international study [33], including Argentina and Brazil, compared mean values of the general population with the WHOQOL-BREF. Generally speaking, these studies did not detect great differences between Latin America and Europe or Northern America with respect to mean scores of the mental health components of the quality of life instruments. However, to date, HADS norms for Latin American countries are missing, and it is unknown whether age and sex differences are similar to those found in Europe and Northern America.

The aim of this study was to provide HADS normative values for the Colombian general population, to test age and sex differences, and to explore psychometric properties of this questionnaire.

Methods

Study design

The study sample consists of adult people (18 years and above) of the general population of Colombia. The Ethics Committee at the Universidad de los Andes approved the study, and informed consent was obtained from all participants. The research market company “Brandstrat Inc.” conducted the interviews in eight main cities of Colombia: Bogotá, Cali, Medellín, Barranquilla, Bucaramanga, Pereira, Cartagena, and Manizales. Trained interviewers performed the survey. They asked the eligible participants to take part in the study, and in case of affirmation they gave them a booklet with several questions and questionnaires and asked them to fill them in. After finishing, the interviewers controlled the booklet for missing data and asked the people to complete the questionnaires in case of missing data (except household income). The total duration was about 45 min. Each Colombian city is divided into barrios (quarters) with different mean socioeconomic strata (SES) of the inhabitants (SES ranging from 1 = very low to 6 = very high), which are characterized by the mean socioeconomic level of the inhabitants. The sampling procedure adopted in this survey assured that each stratum (with corresponding barrios) was representatively included in the sample. Within each barrio, the participants were randomly selected. In case of non-response, another eligible participant from the same stratum was asked. This technique yielded a stratum distribution in the study sample identical with that of the general population. Due to this procedure, the resulting sample can be assumed to be representative of the population of Colombia, living in private houses. A total of 2,372 people were contacted, and finally 1,500 people responded with complete data sets, resulting in a response rate of 63 %. The interviewers did not obtain data in case of non-participation. Therefore, we have no data on reasons of non-participation. As an incentive for collaboration in the study, a brochure with information about healthy lifestyles was given to the participants.

Questionnaires

The HADS consists of 14 items, seven items indicating anxiety and seven items indicating depression. The answer format offers four options, scored with values from 0 to 3. This results in scale values between 0 and 21 for each scale. Three ranges were defined by the original test authors: 0–7 (non-cases), 8–10 (doubtful cases), and 11–21 (cases). Several studies also use a HADS total score, simply summing up the anxiety and depression items. Several cut-offs for this total score have been proposed by different researchers, based on sensitivity and specificity calculations. In the review article [7], 16 papers are listed with 11 different cut-off scores. Singer and colleagues [13] recently recommend the cut-off score of 13+ (“medium model”), which is nearly in the middle of the cut-off scores recommended in the literature. We used the Spanish version of the HADS [34], but made a minor cultural adaptation after performing a pretest. We replaced the wording “aleteo en el estómago” with “mariposas en el estómago” which is an expression commonly used in Colombia and is, therefore, easily understood by our population.

To assess construct validity, the survey also included several other questionnaires:

GHQ-12: General Health Questionnaire [35]. This 12-item questionnaire is a validated indicator of psychological distress. In this study, we used the one-dimensional Likert scaling (0–1–2–3). Points were summed to a global score ranging from 0 to 36.

MFI-20: Multidimensional Fatigue Inventory [36]. The MFI-20 covers five dimensions of fatigue: general fatigue, physical fatigue, reduced activity, reduced motivation, and mental fatigue. We calculated a sum score for all 20 items.

LOT: Life Orientation Test [37]. This questionnaire is a six-item scale (with four filler items), containing three items each for positive and negative general life expectations. As in the original conceptualization, in this study the LOT was used as a uni-dimensional scale with the poles pessimism and optimism.

SF-8: short form health survey (eight items) [38]. The shortest member of the SF family is the SF-8. It generates a health profile of eight scores describing health-related quality of life, which is summarized into physical component and mental component summary scores.

FLZ: Questions on Life Satisfaction [39]. This questionnaire assesses general life satisfaction in eight dimensions: friends/acquaintances, leisure activities/hobbies, health, income/financial security, occupation/work, housing/living conditions, family life/children, and partner relationship/sexuality. In addition, the subjective importance of each of the dimensions is assessed. Finally, the total FLZ score is calculated as the sum of the satisfaction scores of the eight dimensions, weighted by their importance ratings.

Data analysis

The influence of age and gender on anxiety and depression was tested with two-factorial ANOVAs. The 75 and 90 % percentiles are those scores for which 75 or 90 % of the respondents reach this or a lower score. Pearson correlation coefficients were used to calculate the association between the HADS scores and the scales of the other questionnaires. Internal consistency was determined with Cronbach’s alpha [40].

Confirmatory factorial analysis was performed with Mplus. We examined three models: the two-dimensional standard model (anxiety and depression), the one-dimensional model (mental distress, no segmentation into subscales) [41], and the so-called bifactorial model, including a broad general factor, onto which all items load, and two conceptually narrower factors onto which items with related content (anxiety or depression) load (cf. [12], Fig. 2, model 9). We used the following criteria: bayesian information criterion (BIC), standardized root mean square residual (SRMR), root mean square error of approximation (RMSEA) [42], comparative fit index (CFI) [43], and tucker lewis index (TLI) [44]. RMSEA <0.05 indicates a “close fit”, RMSEA between 0.05 and 0.08 a “reasonable fit”, and RMSEA >0.10 an “unacceptable fit”. TLI and CFI are coefficients that describe how well a model fits the data relative to a model that assumes only sampling errors to explain the covariance of the observed variables. According to Hu and Bentler [45], CFI and TLI should be greater or equal 0.95.

Results

Description of the sample

Sociodemographic characteristics of the final sample are given in Table 1. The sample is roughly representative of the adult Colombian population concerning age, gender, and civil status. In the right part of Table 1, census data of the total Colombian population are given, provided by the Departamento Administrativo Nacional de Estadística (DANE, Colombian Statistical Administrative Office) [46]. The percentages for the sex and age categories obtained in our sample are similar to those of the general population. The DANE data sets concerning education and employment status cannot be used here, because they also include children. With the exception of household income, there were no missing data since the interviewers controlled the completeness of the questionnaires and asked the participants to complement the questionnaire in the case of missing values.

Table 1 Sociodemographic characteristics of the sample

Normative data

The HADS mean scores for the total sample are 4.61 (SD = 3.64) (anxiety) and 4.30 (SD = 3.91) (depression). Table 2 presents the mean scores for age groups and both genders separately, together with 75 and 90 % percentiles. Females are more anxious (diff = 0.4) and more depressed (diff = 0.5) than males. There is a roughly linear age trend for depression (both genders), but not for anxiety. Table 3 shows the results of the two-factorial ANOVA. The greatest effect (p < 0.001) is the influence of age on depression.

Table 2 HADS mean scores and percentiles
Table 3 ANOVA results. Effects of sex and age group on anxiety and depression

An alternative way to express age and gender differences of anxiety and depression is presented in Table 4. The figures indicate the percentages of subjects whose score is equal or greater than the cut-off given in the left column. The values of cut-off 8+ represent all subjects with 8+ and not only those between 8 and 10.

Table 4 Percentages of subjects above cut-off (anxiety 8+, depression 8+, total 13+)

Influence of sociodemographic variables on anxiety and depression

People with high educational level, employment, and high income show lower rates of anxiety and depression. Depression mean score group differences are more pronounced than anxiety differences. Living with/without a partner has nearly no influence on anxiety and depression. Among the sociodemographic variables, education has the highest impact, with an effect size of d = 0.43 in the depression subscale. Though anxiety and depression mean scores increase with increasing numbers of chronic diseases, due to small sample sizes of the groups with chronic conditions the group differences are not statistically significant (Table 5).

Table 5 Anxiety and depression, broken down by sociodemographic variables and the number of chronic diseases

Psychometric properties

The correlations between the HADS scales and other scales (construct validity) are presented in Table 6. HADS anxiety and depression are positively correlated with questionnaires on fatigue (MFI-20) and mental disorders (GHQ-12), and negatively correlated with quality of life, life satisfaction and optimism. There are only marginal differences between anxiety and depression in the pattern of the correlations.

Table 6 Correlations between HADS and other questionnaires

The following coefficients of internal consistency (Cronbach’s alpha) were obtained: anxiety (α = 0.76), depression (α = 0.78), and total score (α = 0.86). The part-whole corrected item-scale correlations for the anxiety (A) and depression (D) items were as follows: r = 0.52 (A1), r = 0.59 (A3), r = 0.50 (A5), r = 0.44 (A7), r = 0.30 (A9), r = 0.49 (A11), r = 0.57 (A13); r = 0.56 (D2), r = 0.61 (D4), r = 0.49 (D6), r = 0.40 (D8), r = 0.54 (D10), r = 0.54 (D12), and r = 0.38 (D14). Anxiety and depression were correlated with r = 0.67.

The CFA yielded the following results: two-dimensional standard model (df = 76): BIC = 46778, SRMR = 0.047, RMSEA = 0.071, CFI = 0.899, and TLI = 0.880; one-dimensional model (df = 77): BIC = 46923, SRMR = 0.050, RMSEA = 0.079, CFI = 0.873, and TLI = 0.850, and bifactorial model (df = 63) [12] BIC = 46,682, SRMR = 0.039, RMSEA = 0.062, CFI = 0.937, and TLI = 0.909.

Discussion

The main aim of this study was to test psychometric properties of the HADS questionnaire in a Latin American country and to provide normative values. The mean scores of anxiety and depression were within the range obtained in studies from other countries. The anxiety mean score (M = 4.6) was nearly identical with that of the German [17] (M = 4.7) and the Swedish [21] (M = 4.5) study, while the mean score of Hong Kong [22] (M = 3.6) was lower, and the scores of Korea [23] (M = 5.3) and the UK [18] (M = 6.4) were higher. Concerning depression, the mean score of this Colombian study (M = 4.3) was higher than those of Hong Kong (M = 3.3), Norway (M = 3.5), UK (M = 3.7), and Sweden (M = 4.0), but lower than those of Germany (M = 4.7) and Korea (M = 6.6). In contrast to Korea, where higher levels of mental distress were reported, the values obtained in Colombia were generally in the range of the European studies.

Age and sex effects were only partially identical with those obtained in the other studies. Women reported significantly more anxiety (p < 0.05) and depression (p < 0.01) than men, while in most other studies there is only a sex difference in anxiety. As in other studies, depression markedly increases with age, for both sexes. This justifies presenting normative values separately for sex and age groups. In contrast to the German study, linear models did not seem to be appropriate, since in anxiety and depression there were non-linear age relationships. Therefore, it was not appropriate to present beta coefficients for the description of age dependency.

Researchers who prefer to express the burden of anxiety and depression in terms of percentages of subjects exceeding a certain cut-off score can see in Table 4 that about 25 % of the males and about 30 % of the females suffer from heighted mental distress (total score, cut-off 13+). Since there is no convention about an appropriate cut-off point, it is difficult to compare these results with other studies that used different cut-offs. However, Table 4 can help translate the mean scores (given in Table 2) into percentages of “cases” (given in Table 4).

HADS scores correlated negatively with quality of life (SF-8), life satisfaction (FLZ) and optimism (LOT), while the correlations were positive for fatigue (MFI-20) and mental distress (GHQ-12). There were only marginal differences in the correlation pattern between anxiety and depression. The highest coefficients were obtained between HADS and MFI-20 (fatigue), which may be, at least in part, due to the higher number of items of the MFI-20, compared to the other questionnaires. The correlation coefficients of the HADS total score were generally higher than those of the single scales (anxiety and depression). Together with the high correlation between anxiety and depression (r = 0.67) and the high internal consistency (Cronbach’s alpha = 0.86) of the total scale we conclude that the use of the total score, albeit not recommended by the original test authors, is a useful measure for mental distress and can by applied in clinical practice. CFA results only partially supported the two-dimensional structure of the HADS. The model fit was best for the bifactorial model, including both subscales and the general factor. The one-dimensional model showed the lowest fit coefficients, and the fit of the two-dimensional standard model was in between. While the RMSEA criterion for this model was acceptable, the values for the TLI and CFI were below the criterion of 0.95.

All items contributed to the reliability of the scales, with two exceptions, items A9 and D14. These two items neither enhanced nor diminished Cronbach’s alpha. Taken together, the results of the CFA and reliability analyses justify the use of the HADS in the original way, applied to the population under study.

The study has several limitations. A standard diagnostic interview for anxiety and depression was not included. The questionnaires used to assess validity of the HADS have not been validated yet for the Colombian population. Though in Colombia most people live in cities, the underrepresentation of rural areas might cause a bias. However, the sampling method assured that the barrios with their different socioeconomic levels were representatively included in the examination. The generalizability of the study results may be a matter of debate. Even in Western European studies there were considerable differences in mean HADS scores and in age and gender dependencies. Therefore, it is difficult to assess to what degree the results reported here can be generalized to Latin America. Though some psychometric properties of the HADS in the Colombian sample were not convincing, we believe that it is useful to maintain the internationally accepted scale structure. If we had created new subscales according to factorial analyses, the results (mean scores, age and gender differences, normative values, correlations with other questionnaires) could not be compared with those reported in other investigations.

Clinicians in Colombia can use the reference data presented here to assess the degree of psychological distress of their patients. The psychometric analyses showed that the standard use of the HADS is also appropriate in Colombia. Normative and psychometric studies in other Latin American countries are needed to assess the generalizability of the study results.