Introduction

Anxiety refers to a general unpleasant state due to excessive worry or fear [1]. Among patients diagnosed with breast cancer, anxiety symptoms typically arise from the fear of deterioration in condition and the uncertainty in treatment outcomes [1]. Anxiety symptoms can be broadly classified into physical and emotional components. Physical symptoms include sweating, trembling, and numbness; whereas emotional symptoms include nervousness, irritability, and fear of the worst happening [1, 2]. Previous studies had also shown that approximately half of the newly diagnosed breast cancer patients were afflicted with varying degrees of anxiety symptoms, with greater prominence noted among patients receiving chemotherapy [35]. Anxiety symptoms are often associated with poorer quality of life and greater impairment in daily functioning [68]; hence, accentuating the need to alleviate this undesirable adverse effect. However, anxiety is often under-diagnosed and hence under-treated [2, 9]. Appropriate instruments that measure anxiety levels are then vital to aid in the study of anxiety among breast cancer patients.

In the literature, numerous patient-reported outcome tools have been developed to measure the severity of anxiety. Among them, the State-Trait Anxiety Inventory (STAI) and the Hospital Anxiety and Depression Scale (HADS) are most commonly used [10]. While the former is limited by its ability to make a distinction between depression and anxiety symptoms [11], the latter is more suitable for assessing subjective anxiety symptoms [10]. On the contrary, the Beck Anxiety Inventory (BAI) is a tool dedicated to assess self-reported anxiety. Although some of the somatic symptoms in the BAI may overlap with the physical aspects of breast cancer, this inclusion in addition to the subjective aspects is valuable to comprehensively and holistically evaluate a patient’s anxiety level.

The validity and psychometric properties of BAI have been well established in both healthy and diseased populations characterized by various psychiatric disorders [1217]. Besides the English version of BAI, the psychometric properties of other language versions like the Spanish version were evaluated to confirm the instrument’s cross-cultural validity [18]. In particular, the English and Chinese versions of BAI are available to the Singaporean community since these are the two major spoken and written languages. However, there is a lack of psychometric data at present for both the English and Chinese versions of BAI to support its usage among breast cancer patients in Singapore.

Hence, validation studies on the English and Chinese version of BAI are not only valuable to establish their clinical significance as reliable instruments, but also to address the frequently overlooked issue of equivalence of translated scales. By demonstrating that the two BAI language versions are equivalent, the scores from both versions can be subsequently combined with a satisfactory level of reliability to produce a more robust and representative dataset for future analysis. Measurement equivalence between different language versions of the same patient report tool can facilitate the pooling of data from multinational studies into a single analytical framework in clinical trials and/or observational research. Therefore, this study was designed (1) to evaluate the cross-cultural validity of the BAI instrument among the English-speaking and Chinese-speaking breast cancer patients in Singapore, and (2) to assess the measurement equivalence of the two BAI language versions.

Methods

Study setting and population

This was a prospective study conducted at the National Cancer Centre Singapore and KK Women’s and Children’s Hospital where approximately 70 % of all cancer patients are treated. This study was approved by the Singhealth Institutional Review Board.

All eligible patients must be (1) diagnosed with breast cancer by a medical oncologist, (2) above 21 years of age, (3) able to read and understand English or Chinese, (4) scheduled to receive chemotherapy, and (5) chemotherapy and/or radiotherapy naïve. Patients were excluded if they were documented with brain metastasis; had a history of brain injury, evidence of delirium, dementia, or aphasia; previously diagnosed psychiatric illness, alcohol or drug abuse, or central nervous system diseases; were physically or mentally incapable of giving written consent; or were symptomatically ill. Eligible patients were categorized into English-speaking and Chinese-speaking groups based on their indicated mother tongue or preferred language used in daily communications.

Study procedures

Using the consecutive sampling approach, all eligible patients were recruited at the outpatient clinics by a team of research assistants at the two study sites. Patients’ demographic and clinical information such as age, education level, and cancer stage were collected from existing in-house electronic databases and patients’ interviews. English and Chinese versions of the Beck Anxiety Inventory and the European Organization for Research and Treatment of Cancer Quality of Life Core 30 (EORTC QLQ-C30) questionnaires were completed during each 30-min interview. These questionnaires were administered to the English-speaking and Chinese-speaking patient groups respectively by trained bilingual interviewers to ensure administration coherency.

Tools

The BAI contains 21 items that assess the severity of clinical anxiety symptoms experienced by patients in the past month. It has four domains: autonomic, neurophysiological, panic, and subjective [11]. Patients will rate each symptom on a four-point Likert scale in increasing severity, from 0 (not at all) to 3 (severe). The global score is an arithmetic summation of the ratings across all 21 symptoms and ranges from 0 to 63. Higher global score indicates higher anxiety level. The Chinese version of BAI was adapted from a previous study conducted on patients with anxiety disorders in Singapore [19, 20].

The EORTC QLQ-C30 is designed specifically to assess cancer patients’ health-related quality of life for the past week [21, 22]. The questionnaire consists of 30 items that are classified into the following: five functional scales (physical, role, emotional, cognitive, and social); three symptom scales (fatigue, nausea/ vomiting, and pain); a global quality of life scale, and six single items (dyspnea, insomnia, appetite loss, constipation, diarrhea, and financial stability). Both the English and Chinese versions have been validated within the Singapore cancer population and have demonstrated satisfactory psychometric properties [23]. For this study, specific EORTC QLQ-C30 scales are utilized to establish the validity of BAI and to provide determinants for known-group categorization.

Data analysis

The data collected were analyzed using the general principles of scale development according to the classical test theory. All statistical analyses were performed using the SPSS Statistics Version 23. Demographic and clinical characteristics of the sample were described by descriptive statistics, and a comparison of these characteristics was made between the English-speaking and Chinese-speaking groups. Chi-square test was used for categorical measures. Independent samples T test was used for continuous measures that can be approximated to be normally distributed; while a non-parametric test, Mann-Whitney U test, was used for continuous measures without normal distributions. All two-tailed significance tests were conducted at a significance level of 0.05.

Validity

To test the adequacy of BAI in measuring a patient’s self-reported anxiety symptoms, the EORTC QLQ-C30 was used as a standard for validation because it has been previously validated in the Singapore cancer population [2226]. Due to the non-normality nature of the data obtained, Spearman’s correlation was performed for all correlation analyses [27]. A significant correlation is indicated by a P value less than 0.05. The magnitude of correlation values will denote the corresponding correlation strength: less than 0.4 being poor; 0.4 to 0.7 being moderate; and greater than 0.7 being strong.

Foremost, the criterion validity was assessed by the correlation between the BAI total score and the emotional functional scale of EORTC QLQ-C30 that measures the similar construct of anxiety. We then hypothesized that a higher BAI total score would correlate with patients exhibiting a lower state of emotional functioning. Next, convergent validity analysis of the BAI domains was performed to demonstrate that the BAI and its domains correlate appreciably with their known related constructs. Studies among breast cancer patients have suggested that anxiety is associated with a myriad of emotional factors including depression, fatigue, psychological disturbances and pain. These are collectively known as a symptom cluster [28]. Thus, we hypothesized that a higher BAI subjective domain score would also correlate with a lower state of emotional functioning. Additionally, fatigue is highly related to anxiety especially in breast cancer patients undergoing chemotherapy [9, 2931]. Hence, by using the fatigue symptom scale of EORTC QLQ-C30, we hypothesized that higher BAI total and neurophysiological domain scores would correlate with a greater level of fatigue. Furthermore, anxiety and its associated symptoms would affect quality of life negatively [28, 32]. Consequently, we hypothesized that a higher BAI total score would correlate with a lower global quality of life score. The EORTC QLQ-C30 dyspnea item is a panic symptom characterized by breathlessness [33]. Therefore, we would expect a higher BAI panic domain score to correlate with a greater degree of the dyspnea symptom.

The discriminant validity of BAI was assessed on the basis that scales purported to measure different aspects of quality of life would be poorly correlated. Therefore, we hypothesized that the BAI total and domain scores would have poor correlations with the EORTC QLQ-C30 constipation and diarrhea items.

Known-group validity was performed to ascertain whether the BAI scores were able to distinguish groups with different known characteristics. Statistically significant difference in scores between the identified groups is denoted by a P value less than 0.05. First, we hypothesized that the patients diagnosed with an advanced stage of breast cancer (stage 0–2 vs. 3–4) and possessing a higher ECOG performance status (ECOG 0 vs. ECOG ≥ 1) would report higher BAI total and domain scores [6]. Second, religiosity has been proposed to be a moderator of anxiety levels in cancer patients [1]. Hence, we hypothesized that patients with religious beliefs would report lower BAI total and domain scores. Third, anxiety is associated with a cluster of emotional symptoms. Patients who scored higher in the EORTC QLQ-C30 items (item score 1–2 vs. 3–4) under the emotional functional scale were expected to report higher BAI total and domain scores. Finally, research has suggested that patients implicated with cognitive problems often experience anxiety [34, 35]. We then hypothesized that patients who encountered greater memory-related difficulties would have higher BAI total and domain scores. Mann-Whitney U test was used to compare the scores between the identified groups.

Reliability

Internal consistency of the BAI domains was evaluated using Cronbach’s alpha coefficients (α). A Cronbach’s alpha coefficient value of 0.7 and above will represent satisfactory consistency [36]. In order to identify inappropriately placed items of a particular domain, an item-to-domain correlation analysis was performed. For each item, the corrected item-to-domain correlation was tabulated after removing its contribution to its domain score.

Measurement equivalence

The clinical importance of the score differences observed between the English and Chinese versions of BAI was investigated using the method employed for establishing therapeutic equivalence in clinical trials [37, 38]. The degree of measurement equivalence between two language versions is the extent of similarity in their psychometric properties [39]. The differences in BAI scores might not be exclusively attributable to the language variations, but instead, arise from the differences in other characteristics underlying the English-speaking and Chinese-speaking groups. Thus, it was important to adjust for the significant demographic, clinical, and EORTC QLQ-C30 scores differences. Univariate analysis was performed to identify these statistically significant variables that attained a P value of less than 0.05. These variables were included in the multiple regression analysis to tabulate the 95 % confidence intervals (95 % CI) of the adjusted mean difference. This 95 % CI had a 95 % probability of containing the real difference in mean scores that were solely attributable to the language’s influence.

We adopted the equivalence margin construction from an equivalence study conducted by Cheung et al. [40]. Aligned with Cohen’s proposition that an effect size of 0.2 to 0.5 S.D. (standard deviation) is small, we defined the equivalence margin to be ±0.25 S.D. [41]. Therefore, measurement equivalence would be established if 95 % CI of the adjusted mean difference fell within this equivalence margin. Else, the equivalence could still be possible if it did not exceed the ±0.5 S.D. range which represented the threshold for detecting a small difference [39, 41].

Results

Patient characteristics (Table 1)

A total of 244 breast cancer patients receiving chemotherapy were recruited. Among them, 144 (59.0 %) were English-speaking. Patients were predominantly Chinese (80.3 %) diagnosed with early breast cancer stages 1 or 2 (66.4 %). When compared to the English-speaking patients, the Chinese-speaking patients were found to be older (53.7 ± 8.4 vs. 48.9 ± 9.2 years, P < 0.001) and had fewer years of education (8.3 ± 3.8 vs. 12.3 ± 3.0 years, P < 0.001). Additionally, a higher proportion of Chinese-speaking patients were not working (58.0 vs. 34.0 %, P < 0.001) and majority of them stayed in public housing (92.0 vs. 80.6 %, P = 0.03). There were also more post-menopausal Chinese-speaking (67.0 vs. 41.7 %, P < 0.001) than English-speaking patients. No significant differences were observed for the other demographic and clinical characteristics.

Table 1 Baseline demographic and clinical information of patients

Validity

In the assessment of criterion validity of both the English and Chinese versions of BAI, the EORTC QLQ-C30 emotional functional scale had a negative and moderate correlation with the BAI total score (r = −0.655 and −0.601, respectively) (Table 2). Regarding the correlations with related constructs, the EORTC QLQ-C30 emotional functional scale was similarly observed to be negatively and moderately correlated with the BAI subjective domain score (r = −0.648 and −0.645, respectively). Also, the EORTC QLQ-C30 fatigue symptom scale had a positive and moderate correlation with the BAI total (r = 0.606 and 0.554, respectively) and neurophysiological domain scores (r = 0.521 and 0.438, respectively). In terms of the impact of anxiety on patients’ quality of life, the global quality of life scale demonstrated a negative and moderate correlation with the BAI total score (r = −0.475 and −0.456, respectively). Finally, the dyspnea scale showed a positive and moderate correlation with the BAI panic domain (r = 0.572 and 0.518, respectively). The EORTC QLQ-C30 constipation and diarrhea items measure constructs unrelated to anxiety. The correlations of these items with the BAI total and domain scores of both language versions were all either weak (|r| = 0.183–0.356) or insignificant at a significance level of 0.05 (two-tailed).

Table 2 Spearman’s correlations between the BAI domains and EORTC QLQ-C30 subdomains for the English and Chinese versions of BAI

Known-group validity analysis revealed that both language versions of BAI were unable to discriminate between patients of different breast cancer stages and religious beliefs (Table 3). Nevertheless, it was observed that the English version was able to differentiate patients with different ECOG status by their BAI total scores (ECOG = 0, 8.00 ± 7.74 vs. ECOG ≥ 1, 11.50 ± 12.09, P = 0.02). Furthermore, the BAI total and individual domain scores of the English version demonstrated the ability to distinguish patients who were experiencing more tense feelings, worries, irritability, depression symptoms, and difficulties in recalling past events. On the other hand, the individual BAI domain scores of the Chinese version had a smaller range of discriminating ability. In particular, the autonomic domain score could only discern patients who were facing greater challenges in events recollection.

Table 3 Comparison of the BAI total and domain scores between groups categorized by ECOG performance status, breast cancer stage, and EORTC QLQ-C30 items 21–25 for the English and Chinese versions of BAI

Reliability assessment (Table 4)

In the reliability analysis, satisfactory Cronbach’s alpha values of greater than 0.7 were obtained for all domains except for the panic domain (α = 0.61). In both language versions, satisfactory Cronbach’s alpha values were obtained for the subjective (α = 0.85 and 0.87, respectively) and neurophysiological domain (α = 0.83 and 0.74, respectively). However, for the autonomic domain, satisfactory Cronbach’s alpha value was attained only in the English version (α = 0.81) and not in the Chinese version (α = 0.62). For the panic domain, unsatisfactory Cronbach’s alpha values were obtained in both language versions. In general, the Cronbach’s alpha values for all the domains in the Chinese version were lower than that in the English version, with the exception of the subjective domain. Furthermore, in the item-to-domain correlation analysis, it was observed that the items “indigestion,” “feeling of choking,” and “fear of dying” had poor correlations with the other items in their corresponding domains.

Table 4 Internal consistency of the BAI domains and the corrected item-to-domain Spearman’s correlations (by total and language versions of questionnaire)

Measurement equivalence of the English and Chinese versions of BAI

Demographic variables that were shown to be statistically different between the two language versions were age, years of education, race, working status, and residential type; while the menopausal status was shown to be the significant clinical variable (Table 1). Also, significant differences in the EORTC QLQ-C30 subdomain scores between the two language groups were detected for the following: role functioning (P = 0.004), fatigue symptom (P = 0.01), and dyspnea item (P = 0.001) (Table 5). Hence, these variables were adjusted for in the multiple regression model. No significant differences were detected in the BAI total and domain scores between the two language versions (Table 6). All 95 % CIs of the adjusted mean difference fell within the ±0.5 S.D. margin, but did not completely lie within the ±0.25 S.D. equivalence margin. Hence, this suggests possible measurement equivalence between the English and Chinese versions of BAI.

Table 5 Comparison of the EORTC QLQ-C30 subdomain scores between the English-speaking and Chinese-speaking patients using Mann-Whitney U Test
Table 6 The BAI total and domain scores (by total and language versions of questionnaire) and measurement equivalence between the English and Chinese versions of BAI

Discussion

To our knowledge, this is the first study to evaluate the validity, reliability and measurement equivalence of the English and Chinese versions of BAI among breast cancer patients in Singapore. Results revealed that both language versions are valid and reliable to a fairly satisfactory extent in assessing patients’ perceived anxiety levels and are possibly equivalent.

As hypothesized, the validity analysis affirmed the adequacy of BAI in measuring patients’ self-reported anxiety levels. The known-group validity analysis yielded results that deviated from our original hypotheses. Notably, the BAI total and domain scores were unable to distinguish patients with advanced breast cancer stage and poorer performance status. This could be attributed to the disproportionate sample size of each known group. Majority of the patients were diagnosed with breast cancer stage 2 and below (66.4 %) and possessed good performance status. Thus, the studied sample may not be fully representative of the entire spectrum of breast cancer patients in Singapore. Furthermore, the BAI total and domain scores were unable to discern patients based on their religious beliefs due to two reasons. Firstly, patients’ religious beliefs do not indicate their engagement level in the religious activities. The mere presence of religious beliefs may not give an accurate reflection of its role in helping patients cope with their cancer states [42]. Secondly, some studies have demonstrated that anxiety levels did not correlate directly with the religious beliefs or activities, but with the interpretations of the meaning of life that were delivered through these beliefs [43, 44]. However, information on this spirituality facet could not be isolated from the sheer presence of religious beliefs. Hence, our data did not allow for a classification based on patients’ perceptions of the meaning of life. Overall, the task to separate the patient pool into distinct groups by religious beliefs appears to be more complex than expected due to the underlying factor of spirituality [45]. Therefore, a more meaningful assessment will warrant the usage of additional tools like Spiritual Well-Being Scores to assist in the identification of known groups [44].

Satisfactory internal consistency was observed for all domains except for the panic domain. Furthermore, item-to-domain correlations indicated that majority of the items in the BAI were well related to the constructs of their corresponding domains. However, the items “feeling of choking” and “fear of dying” were highlighted to be problematic within the panic domain since they had poor correlation values of less than 0.40. The poor correlations are likely a consequence of problems in the original items rather than translation errors. For “feeling of choking,” we postulate that patients may not fully understand the term “choking.” For instance, patients may associate it to obstructions in the respiratory airways or gastrointestinal tract, instead of viewing it as a panic component. This case is further exemplified in patients with respiratory diseases, as they are more prone to identifying this familiar symptom as an implication of their illness and not relate it to a state of panic [46]. We have also identified “fear of dying” as a problematic item that deviated from its underlying construct of panic. We posit two reasons for this observation. Firstly, the understanding of “fear of dying” may be overlapped with “fear of worst happening” since death is essentially the worst outcome. Thus, the item could be more related to the subjective domain in measuring the emotional aspect instead of the panic aspect as expected [47]. Moreover, the “fear of dying” among these early stage breast cancer patients may be milder and subtler since treatment options are readily available with optimistic success rates. Consequently, their ability to comprehend this fear as a panic symptom may be diminished.

We have also evaluated the measurement equivalence between the English and Chinese versions of BAI. This assessment is necessary before the data from the two language versions can be combined in a single analytical framework. After adjusting for the statistically significant demographic, clinical, and EORTC QLQ-C30 variables, the 95 % CIs derived from the multiple regression model reflected the contribution of language variation to the score differences. As all 95 % CIs partially overlapped with the pre-defined ±0.25 S.D. equivalence margin, measurement equivalence cannot be confirmed. Nonetheless, these results are encouraging evidence to support the equivalence of the English and Chinese versions of BAI among breast cancer patients in Singapore. An extensive study is required to verify the measurement equivalence definitively.

There are several limitations in this study. As this study was only conducted at one time point, only internal consistency was evaluated in the reliability assessment, and test-retest reliability could not be examined. In addition, while we acknowledge that the emotional functioning subscale of the EORTC QLQ-C30 is not conventionally accepted as a “gold standard” measure of anxiety, we have proceeded to validate the BAI tool using this subscale due to the nature of the study population concerned. The breast cancer population differs from the usual psychiatric and community samples, as the anxiety symptoms experienced by our target population manifest from the debilitating nature of cancer and chemotherapies that is not ascribable to psychiatric disorders. For example, patients could be plagued by anxiety from unknown treatment status as well as worrying about the recurrence of cancer [9]. Furthermore, as BAI encompasses somatic symptoms that are grounded in a particular disease state, we find the emotional functioning subscale of the EORTC QLQ-C30 to be the most appropriate in providing a context within cancer patients for a meaningful comparison. In other similar studies, EORTC QLQ-C30 has also been utilized as a reliable instrument for correlation analysis with other anxiety-assessing instruments [48, 49].

Conclusion

The BAI is a valid and reliable instrument for assessing the perceived anxiety levels among breast cancer patients in Singapore. Both language versions of the BAI have also demonstrated possible measurement equivalence. However, potentially problematic items may have compromised the internal consistency of the panic domain for both language versions. Thus, one should interpret the score of this panic domain with caution. Despite so, this study has provided some preliminary supportive evidence for the validity, reliability, and measurement equivalence of the two BAI language versions to establish their significance as clinical research tools in Singapore.