Arthritis is the leading cause of disability for people over age 15 in the United States (e.g., O’Donnell et al., 2010) as well as a common source of emotional distress among the affected (Margaretten et al., 2011; Murphy et al., 2012). Osteoarthritis (OA), a disease characterized by degeneration of cartilage in joints, is the most common subtype with a lifetime prevalence of about 9 % in the U.S. (Johannes et al., 2010; Neogi, 2013) and an estimated annual cost of $185.5 billion in healthcare expenditures alone (Kotlarz et al., 2009; White et al., 2007). OA is particularly widespread among adults over age 60 and those who are overweight or obese (Garver et al., 2014). Rheumatoid arthritis (RA), an autoimmune disease involving extreme joint tissue inflammation (Chang & Wei, 2011), is far less pervasive and affects 0.80–1.10 % of U.S. adults, typically with an onset at around age 40 (e.g., Silman & Hochberg, 2009). Regardless, RA is among the most potentially painful and crippling arthritis subtypes and can spread eventually to all body sites, including the heart, blood vessels, and lungs (Chang & Wei). Despite differences in symptoms, prevalence, and causes, OA and RA have shared psychological costs.

In line with World Health Organization QOL group (1993) recommendations regarding the assessment of health-related quality of life as well as key areas of functioning evaluated on popular multidimensional measures of disability (e.g., Kerns et al., 1985; Lurie, 2000; Nanda et al., 2003), pain severity, suffering (i.e., emotional distress), and interference with daily activity are widely-recognized, overlapping psychological consequences of ongoing pain that have been emphasized in pain assessment protocols. For example, following the onset of OA, substantial numbers of patients experience pain-related disability (Neogi, 2013), ongoing pain (Dray & Read, 2007; Jakobsson & Hallberg, 2002), and depression or anxiety (Memel et al., 2000; Sale et al., 2008). Similarly, aside from pain, long-term effects of RA include severe disability and reduced life expectancy (Villeneuve, 2013) as well as elevations in emotional distress. For example, 13–42 % of RA patients are estimated to experience comorbid depression (Bruce, 2008; Margaretten et al., 2011).

Notwithstanding the critical role that biological factors have for onsets and courses of OA and RA (e.g., Kourilovitch et al., 2014; Lafeber & Van Spil, 2013), cognitive social learning models have highlighted the importance of social relationships, interactions of people’s behavior with situations in their lives, and subjective expectancies, interpretations, and beliefs as factors that affect functioning. Regarding the latter, various authors have underscored the critical role of conceptually-overlapping beliefs reflecting locus of control (LOC), self-efficacy (SE), and/or helplessness as key influences on outcomes of OA and RA (e.g., Hewlett et al., 2008; Lorig et al., 1989; Stein et al., 1988a; Wallston et al., 1994).

To elaborate, research on locus of control (LOC) emerged from Rotter’s (1966) view that people rely on general beliefs about their ability to influence or control events, particularly when they are confronted with new situations. As applied to health and illness, Wallston and colleagues (e.g., Wallston, 2005; Wallston et al., 1994) contended that patients who believe good health results from taking care of themselves typically have stronger internal or personal health locus of control (HLOC) orientations and do what they can to maintain health and/or recover from illness. Conversely, patients who believe their health is mainly due to luck or efforts of health professionals often have stronger external HLOC orientations and may see little reason to make personal efforts vis a vis their health. Prospective studies of general population samples have linked stronger internal HLOC to better health (Gale et al., 2008) and reduced risk for heart attack and cancer (Sturmer & Hasselbach, 2006) at long-term follow-up.

In relation to OA and RA, patients with weaker internal LOC or HLOC orientations have been found to report more interference with activities of daily living (e.g., Cross et al., 2006; Graves et al., 2009; Lopez-Olivo et al., 2011; Zyrianova et al., 2011), depression (e.g., Rezaei et al., 2014; Smith et al., 1991; Treharne et al., 2005; Wallston et al., 1994; Zyrianova et al., 2011), and pain (e.g., Baker et al., 2008; Cross et al., 2006; Lopez-Olivo et al., 2011; Olchowska-Kotala et al., 2014; Rezaei et al., 2014; Tennen et al., 1992; Wallston et al., 1994) than peers who have stronger internal LOC orientations. However, modest or non-significant associations have been observed between internal LOC and impairment (e.g., Olchowska-Kotala et al., 2014; Tennen et al., 1992), depression (Ziarko et al., 2014) or other forms of distress such as anxiety (Treharne et al., 2005), and reported pain severity (Lopez-Olivo et al., 2011; Treharne et al., 2005) in some samples. While early research also found correlations between stronger external LOC orientations and elevations in pain and depression (Wallston et al., 1994), subsequent work suggested high scores on external LOC indices have weak associations with impaired daily functioning (Cross et al., 2006; Lopez-Olivo et al., 2011) and reported pain (e.g., Baker et al., 2008; Cross et al., 2006; Lopez-Olivo et al., 2011). In explaining non-significant effects of HLOC, Wallston (1992) pointed out that simply believing one’s actions affect health is not enough to maintain or improve health. Rather, good health and quality of life must also be highly valued for related efforts to be made.

Related social learning accounts contended that repeated experiences of unpredictable, uncontrollable aversive events result in learned helplessness, a dispositional tendency to believe highly desirable outcomes will not occur or highly aversive outcomes will arise, regardless of what one does (e.g., Abramson et al., 1978; Seligman, 1975). Such beliefs are typically accompanied by emotional, motivational, and cognitive deficits reflecting passivity (Stein et al., 1988a, b). While LOC beliefs and helplessness beliefs are moderately correlated (Wallston et al., 1994), the former reflect appraisals of where control over outcomes reside while the latter also tap expectations that one’s efforts will be ineffective. Learned helplessness models were applied initially to depression but later extended to the seemingly unpredictable and uncontrollable waxing and waning of RA symptoms (e.g., Nicassio et al., 1985). Within RA samples, elevations in helplessness beliefs have been linked to functional impairment (e.g., Bhat et al., 2010; Cadena et al., 2003; Covic et al., 2000, 2003; Englbrecht et al., 2012; Schoenfeld-Smith et al., 1996; Sinclair & Wallston, 2010; Smith et al., 1990, 1994), depression (e.g., Anderson et al., 1988; Cadena et al., 2003; Sinclair & Wallston, 2010; Smith et al., 1994) or general psychological distress (e.g., Englbrecht et al., 2012; Schoenfeld-Smith et al., 1996; Smith & Wallston, 1992), and reported pain (e.g., Bhat et al., 2010; Cadena et al., 2003; Covic et al., 2003; Smith & Wallston, 1992). Within OA samples, moderate positive associations have also been observed between helplessness beliefs and measures of reported pain (e.g., Creamer et al., 1999; Engel et al., 2004) and impairment (e.g., Creamer et al., 2000). Despite relatively consistent results, select arthritis studies have reported mixed or non-significant associations between helplessness beliefs and impaired daily functioning (e.g., Engel et al., 2004) as well as depression (Smith et al., 1990) and other forms of emotional distress (e.g., Anderson et al., 1988; Engel et al., 2004).

Bandura’s (1982, 1997) cognitive social learning model is a third, highly influential perspective that posits behavior can be affected by outcome expectations or beliefs that certain actions will lead to particular outcomes. However, self-efficacy (SE) expectations—confidence in one’s capacity to successfully perform specific actions that produce desired outcomes in given situations—are even stronger influences on behavior according to Bandura. While early work indicated SE beliefs are an important determinant of effort and perseverance shown in the face of stressors including pain, Bandura (1997) also hypothesized SE beliefs influence the performance of actions needed to manage or control pain itself and situations associated with pain. Extending Bandura’s “situation-specific” formulation of SE, pain researchers, including those whose work focuses on arthritis patients (e.g., Lorig et al., 1989), developed multidimensional domain-specific measures of SE beliefs related to the capacity to control pain, maintain daily activities despite pain, and manage experiences that accompany pain such as emotional distress.

Among patients with RA or OA, SE beliefs have had negative correlations with functional impairment (e.g., Baruth et al., 2013; Brekke et al., 2001; Cadena et al., 2003; Cross et al., 2006; Harrison, 2004; Marcum et al., 2014; Morone et al., 2009; Pells et al., 2008; Shelby et al., 2008; Somers et al., 2014), sometimes over and above the contributions of illness-related variables (e.g., Cadmus et al., 2010). Within some samples, patients who report weaker SE beliefs also report more severe pain (e.g., Brekke et al., 2001; Cadena et al., 2003; Harrison, 2004; Pells et al., 2008; Shelby et al., 2008; Somers et al., 2014) and heightened emotional distress (e.g., Barlow et al., 1997; Brekke et al., 2001; Cadena et al., 2003; Cross et al., 2006; Lowe et al., 2008; Pells et al., 2008; Rayahin et al., 2014; Shelby et al., 2008). Once again, however, modest and/or non-significant correlations with impaired functioning (e.g., Maly et al., 2006a, b; McKnight et al., 2010; Regan et al., 1988; van den Akker-Scheek et al., 2007; Wright et al., 2008), pain severity (e.g., Barlow et al., 2002; Lefebvre et al., 1999; Lopez-Olivo et al., 2011; Wylde et al., 2012) and/or emotional distress (e.g., Barlow et al., 1997; Lefebvre et al., 1999; Lopez–Lopez et al., 2008) have been documented in the SE studies of OA and RA.

Each cognitive social learning account outlined above was developed first as a general explanation of behavior and later applied to illuminate individual differences in functioning within chronic pain samples. In contrast, fear-avoidance (F-A) paradigms are pain-specific approaches designed to explain how cognitive factors including attentional focus and beliefs about pain as a signal for current bodily harm or potential future tissue damage influence the development and course of chronic pain (e.g., Leeuw et al., 2007; Vlaeyen and Linton 2000; Waddell et al., 1993). F-A models contend, in part, that pain-fearful people appraise pain sensations as signals for potential injury or ongoing harm (Vlaeyen & Linton, 2000) that can result in pain catastrophizing or exaggerated interpretations of pain as an extreme threat, even after healing is evident (Leeuw et al., 2007). Beliefs about pain as a source of threat, harm or impending catastrophe can exacerbate avoidance of physical movement. In turn, the resulting inactivity may contribute to physical deconditioning, long-term disability, depression, and increased pain (Vlaeyen & Linton, 2000). F-A models were used initially to explain how acute back pain can become chronic but subsequent extensions focused on other pain conditions, including arthritis (e.g., Somers et al., 2009; Sullivan et al., 2011).

Within RA and OA samples, patients who strongly endorse beliefs that pain is a source of actual or potential issue damage may be more susceptible to functional impairment (e.g., Heuts et al., 2004; Morone et al., 2009; Scopaz et al., 2009; Sullivan et al., 2011), emotional distress (e.g., Downe-Wamboldt & Melanson, 1995; Regan et al., 1988; Schiaffino & Revenson, 1995a; Scopaz et al., 2009; Somers et al., 2009; Sullivan et al., 2011), and more severe pain (e.g., Regan et al., 1988; Sullivan et al., 2011). Similarly, patients who report more pain catastrophizing can report more interference with daily activities (e.g., Costa et al., 2014; Edwards et al., 2010; Marcum et al., 2014; Morone et al., 2009; Shelby et al., 2008; Somers et al., 2009, 2014; Sullivan et al., 2011), distress (e.g., Edwards et al., 2010; Keefe et al., 2004; Lopez–Lopez et al., 2008; Rayahin et al., 2014; Shelby et al., 2008; Somers et al., 2009), and pain (e.g., Costa et al., 2014; Edwards et al., 2010; Keefe et al., 2004; Shelby et al., 2008; Somers et al., 2009, 2014; Sullivan et al., 2011) than peers who catastrophize less about pain. Nonetheless, within at least some samples, responses on measures of threat/harm/F-A beliefs or catastrophizing have had weak correlations with impaired functioning (e.g., Dekker et al., 1993; Downe-Wamboldt & Melanson, 1995; Rayahin et al., 2014; Regan et al., 1988; Schiaffino & Revenson, 1995a; Somers et al., 2009, 2014) and pain severity (e.g., Dekker et al., 1993; Kwissa-Gajewska et al., 2014; Schiaffino & Revenson, 1995b).

Taken together, the preceding overview underscores specific belief constructs based on cognitive social learning perspectives and F-A models as potentially important correlates of disturbances in functioning among patients with OA and RA. Regardless, each of these perspectives has also generated at least some conflicting evidence of weak associations between belief constructs and functioning. Hence, it is not clear whether these belief categories have strong overall strengths of relation with problems in functioning, nor is it evident that beliefs based on a particular cognitive social learning model or the F-A account have more value in informing problematic functioning among arthritis patients. Finally, it is not clear whether particular content themes of belief categories that cut across theoretical perspectives have stronger links with disturbances in OA and RA samples than other themes do. Meta- analysis, a procedure that relies on synthesized data from multiple studies instead of potentially idiosyncratic results of specific studies (Chan & Arvey, 2012), may clarify each of these issues. Given variable results in this literature, meta-analysis might also illuminate moderators that explain why belief-functioning relations have been robust in some studies and weak in others.

Regarding possible methodological moderators, indirect evidence suggests certain types of pain beliefs are more relevant than others in predicting arthritis outcomes. Recent meta-analyses on general chronic benign pain samples have found small to medium average effect sizes between measures of threatening pain appraisals or F-A beliefs and functional impairment (Jackson et al., 2014a; Zale et al., 2013) while a large mean effect size was reported for SE- impairment relations (Jackson et al., 2014b). These general patterns do not necessarily apply to OA and RA but imply effect sizes might be larger in studies of helplessness beliefs or SE beliefs rather than those that examine fear, threat, or harm beliefs salient to F-A models. On the other hand, exaggerated pain catastrophizing beliefs also appear to have robust associations with poorer functioning in narrative reviews (e.g., Leeuw et al., 2007). To date, meta-analyses on health/illness/pain LOC beliefs have not been conducted within chronic pain or arthritis samples. In contrast, following Wallston (1992), overall effect sizes for health or illness LOC beliefs could be comparatively weak because such measures are less typically task- or pain-specific than SE scales or those reflecting features of F-A models.

Operationalizations of functional impairment, content of emotional distress indices, and control of measures in prospective research can also influence effect sizes as well. Within general chronic pain samples, threat appraisals of pain and SE beliefs have had stronger associations with self-report than behavior performance indices of impairment (Jackson et al., 2014a, b). Moreover, beliefs related to threat or F-A should also have stronger conceptual links with fear or anxiety than other forms of emotional distress such as depression. Finally, within longitudinal studies, both logically and empirically, effect sizes between baseline beliefs and follow-up outcomes are more likely attenuated when baseline levels of the corresponding outcome have been statistically-controlled rather than left uncontrolled (Jackson et al., 2014b, 2016). However, significant average effect sizes under more stringent conditions of first controlling for baseline responses on an outcome would support the status of initial pain beliefs as a risk/protective factor for, not merely a correlate of, subsequent responses on that outcome.

Regarding possible sample characteristics that moderate belief-functioning relations, OA and RA differ in symptoms, prevalence, onset, causes, and severity (e.g., Axford et al., 2008; Silman & Hochberg, 2009); hence, it is possible that effect sizes also vary between these conditions. Past meta-analyses produced contradictory conclusions regarding moderating effects of pain site/condition on belief-outcome associations (Hinrichs-Rocker et al., 2009; Jackson et al., 2014a, b, 2016) though only small subsets of arthritis samples were assessed in these reviews. In terms of demographics, recent meta-analyses found SE beliefs and challenge appraisals had comparatively stronger relations with impairment or distress in older chronic pain samples, those having longer pain durations, and those comprising proportionately more women than men (Jackson et al., 2014a, b). Nonetheless, this pattern did not extend across all outcomes and analyses were based on numerous pain conditions, not just OA or RA. Hence, because details related to diagnosis, age, gender, and pain duration are typically reported in arthritis research, evaluating their moderating effects may help to identify specific arthritis patient subgroups in which beliefs correspond more strongly to outcomes.

Based on the preceding overview, this meta-analysis assessed the overall impact of belief constructs related to cognitive social learning models (i.e., health/illness LOC, helplessness, SE) and F-A models (i.e., pain as a source of threat, harm, or catastrophe) on problems in functioning that are directly relevant to health-related quality of life of OA and RA patients (i.e., functional impairment, affective distress, pain severity). While beliefs were expected to have significant overall associations with each area of functioning, significant effect size heterogeneity was also anticipated. Therefore, we explored the extent to which relations of beliefs with problems in functioning were moderated by methodological factors (i.e., belief measure content, impairment index content, affective distress measure content, research design) and sample characteristics (age, gender composition, pain duration, arthritis subtype).

Materials and methods

Search strategy

To identify relevant studies, PubMed, PsychINFO, Web of Science, Google Scholar, and Science Direct database searches were performed between dates of inception and March, 2015. Search terms were “arthritis” OR “rheumatism” AND “beliefs” OR “pain beliefs” OR “attitudes” OR “health” OR “illness” OR “appraisal” OR “control” OR “locus of control” OR “helplessness” OR “self-efficacy” OR “fear” OR “threat” OR “challenge” OR “harm” OR “catastrophizing”. Pain/health beliefs measures including the Arthritis Self-Efficacy Scale (ASES; Lorig et al., 1989), Rheumatoid Arthritis Self-Efficacy Scale (RASE; Hewlett et al., 2008), Multidimensional Health Locus of Control Questionnaire (MHLC; Wallston et al., 1994), Illness Perception Questionnaire (IPQ; Weinman et al., 1996), Survey of Pain Attitudes (SOPA; Jensen et al., 1987), Beliefs About Pain Control Questionnaire (BPCQ; Skevington, 1990), Rheumatology Attitudes Index (RAI; Callahan et al., 1988), Arthritis Helplessness Index (AHI; Nicassio et al., 1985), Tampa Scale for Kinesiophobia (TSK; Vlaeyen et al., 1995), Fear Avoidance Belief Questionnaire (FABQ; Waddell et al., 1993), and Pain Catastrophizing Scale (PCS; Sullivan et al., 1995) were also search terms. The broad search field, “anywhere” was used to identify citations. Reference lists of obtained articles were also reviewed to identify relevant papers.

Selection criteria

Abstracts of all potentially eligible studies were independently screened by the authors to exclude papers whose content was not salient. Subsequently, full-text versions of relevant papers were retrieved and reviewed to determine if they met the following 11 inclusion/exclusion criteria:

  1. 1.

    Studies included adult human participant samples with an average age of 18 years or older.

  2. 2.

    Studies were comprised principally of patients with ongoing OA and/or RA. Studies of laboratory pain, osteoporosis, other arthritis conditions such as fibromyalgia and gout, chronic pain unrelated to arthritis, and cancer pain were excluded.

  3. 3.

    Belief measures had to reflect the domains of health, illness, or pain and tapped LOC, SE, helplessness, or F-A model belief constructs. Studies that assessed “general” LOC orientation, general SE, general competence, physical SE, or other pain beliefs related to other issues (e.g., Jensen et al., 1987) were excluded.

  4. 4.

    Included studies reported at least one association between a measure of health, illness, or pain beliefs and an index reflecting disability/functional impairment, affective distress, or pain severity.

  5. 5.

    Both cross-sectional and longitudinal studies were eligible for inclusion. When prospective studies only reported baseline assessment findings in sufficient detail, effect size calculations were limited to the initial assessment, per cross-sectional studies. Longitudinal studies that reported only associations between change scores for beliefs and change scores in outcomes were excluded. However, baseline belief-follow-up problems in functioning effect sizes were assessed in the moderator analyses examining effects of analytic approach within longitudinal study subsets (i.e., baseline functioning levels controlled or not controlled) to clarify the status of initial beliefs as correlates versus risk/protective factors for later problem outcomes, independent of corresponding baseline levels of problem functioning.

  6. 6.

    Studies were included when belief measures were treated as independent variables or predictors rather than dependent measures.

  7. 7.

    Regarding pain severity as an outcome, only studies that used continuous intensity measures or continuous measures re-coded into groups (e.g., median splits) were included for analyses. Conversely, research using less sensitive dichotomous categorical indices such as “presence of pain” (yes or no) or “presence of moderate to severe pain” (yes or no) was excluded.

  8. 8.

    Reliability and validity data for all measures of pain beliefs and problems in functioning were reported or available.

  9. 9.

    A minimum sample N of 50 was required because mean scores are more easily distorted by extreme responses in small samples.

  10. 10.

    Only papers from peer-reviewed, English-language journals were retained for analyses. Reviews and studies that reported results based on secondary analyses of data in other articles were excluded.

  11. 11.

    Because large-scale meta-analyses are time-consuming, resource-depleting, and labor intensive, we followed other published pain meta-analyses (Grossman et al., 2004; Jackson et al., 2014b) by including only papers that provided effect size information in a transparent manner or sufficient data for effect size calculations.

The formal assessment of study quality was also forgone due to the lack of consensus about criteria that best define study quality (Crombez et al., 2013a, b). However, several inclusion criteria (i.e., use of reliable, valid, sensitive measures, minimum sample N’s, publication in a peer-reviewed journal) ensured that all included studies had key methodological strengths and received favorable external editorial responses. All potentially relevant papers were reviewed independently by the authors. Disagreements in study selection and coding were resolved through discussion until consensus was achieved.

Search results

Possible studies were found in databases by key words (n = 7050), and health/pain belief questionnaire names (n = 136), emails to authors who published in the area (n = 12), and reference lists of obtained papers (n = 54). After removing duplicates, 6483 studies were retained for initial consideration. Figure 1 summarizes the screening process. Ultimately, 106 studies comprising 111 samples (N = 17,365 patients) fulfilled all selection criteria. Rates of inter-rater agreement were satisfactory for study inclusion in the meta-analysis (k = .89) and coding of variables used in the meta-analyses (k = .91).

Fig. 1
figure 1

Diagram of study selection process

Coding study characteristics

For each included study, data were extracted regarding first author, year of publication, sample size, gender composition (percentage of women per sample), mean age, arthritis type (RA vs. OA vs. mixed RA and OA), pain duration in years, research design (cross-sectional vs. longitudinal), health/illness/pain belief measure content (internal/personal LOC vs. external LOC vs. SE vs. fear/threat/harm vs. helplessness vs. catastrophizing) and problem area (i.e., disability/impairment vs. affective distress vs. pain severity). Impairment indices were coded as either self-report or behavioral performance measures in moderator analyses. Affective distress scales were coded as measures of depressive symptoms, anxiety/fear, or general emotional distress. Levels of inter-rater agreement in classifying measures of pain beliefs and functioning based on these categories were 100 %.

Characteristics of included studies

Table 1 summarizes features of studies included for meta-analysis. Papers published from 1988 to March, 2015 were included for analysis; more than half (53.77 %) were published after 2005, thus highlighting pain beliefs as a pertinent focus in recent arthritis studies. On average, samples included 156.44 patients (SD = 120.68, range 50–815), were late-middle-aged (M = 59.94 years, SD = 7.08, range 44.12–75.00 years) and had more women than men (M = 75.16 %, SD = 16.50 %, range 0–100 %). For arthritis type, 58 samples comprised RA patients, 38 included OA patients, and 15 evaluated both RA and OA patients. The mean pain duration of samples was 11.28 years (SD = 5.25 years, range 1.92 years to 25.9 years). Regarding longitudinal study subsets, follow-up impairment effect sizes were reported in 25 studies, an average of 10.21 months after baseline impairment assessments (SD = 8.29 months). Follow-up affect distress effect sizes were presented in 24 papers, an average of 10.10 months after initial distress evaluations (SD = 10.04 months). Follow-up pain severity ratings were reported in 18 studies, an average of 8.27 months after baseline assessments (SD = 5.77 months). In relation to belief categories of focus based on cognitive social learning models, 18 studies assessed internal and/or external health/illness LOC beliefs, 16 examined helplessness beliefs, and 62 evaluated pain SE beliefs based on measures listed in Table 1. Regarding belief constructs reflected in F-A models of pain, 10 studies assessed fear/threat/harm beliefs related to pain or illness and 18 studies evaluated pain catastrophizing beliefs. Finally, functional impairment, affective distress, and pain severity were assessed in 84, 74, and 61 samples, respectively (Table 1).

Table 1 Overview of studies included in meta-analysis on associations between pain beliefs and problems in functioning

Effect size computations

Comprehensive Meta-Analysis Version 2.0 (CMAV2; Borenstein et al., 2005) was used for analyses. Bivariate correlations (r) represented individual effect sizes of included studies. Papers that reported regression coefficients within multiple regression models rather than bivariate associations were retained for analyses. However, such correlations may be attenuated within multiple regression models that include multiple predictors. More occasionally, measures in multiple regression models act as “suppressors” of error variance, artificially inflating effect sizes of other predictor measures. Therefore, where relevant, overall effect sizes for each outcome were also computed and presented in the text, after excluding data from multiple regression studies.

Effect sizes were weighted by the inverse of a study’s variance. Cochran’s Q test evaluated effect size heterogeneity for each outcome. When Q values reflected significant heterogeneity, subgroup analyses and meta-regression assessed sources of variability in effects. I square values represented amount of observed heterogeneity. Following from Higgins and Thompson (2002), I square values of 25, 50 and 75 % were low, moderate, and high, respectively. Overall effect sizes were based on random effects models as recommended by Hoffman et al. (2007). Drawing upon Cohen (1992), effect sizes of r = .10, r = .30, and r = .50 were interpreted as small, medium, and large, respectively.

Outlier analysis

Outlier analyses were conducted on the overall data set based on effect size magnitudes and sample sizes (Avolio et al., 2009). Hunter and Schmidt (2004) contended that extreme values can cause significant within-group heterogeneity of individual effect sizes that do not reflect reality. In the process of considering outliers, values more than three standard deviations from the mean were operationalized as outliers (Kline, 1998).

Moderator analyses

Categorical moderators operationalized above, including (a) pain/health/illness belief measure content (b) arthritis subtype, (c) analytic approach used within longitudinal studies of each outcome, (d) type of impairment scale, and (e) content of affective distress were assessed with Cochran’s Q values with effect sizes based on correlation coefficients. Moderating effects of continuous measures of sample age, gender composition, and pain duration were assessed via method of moments analyses (Higgins & Thompson, 2002).

Evaluation of publication bias

Publication bias was estimated by inspecting effect size funnel plots for asymmetrical distributions around mean effect sizes via trim and fill analyses (Duval & Tweedie, 2000). Typically, larger samples (top of effect size plot) provide the most accurate estimates of effect size; the spread should increase symmetrically with smaller samples toward the bottom of the plot. Hence, when there is no evidence of publication bias, the plot should resemble an inverted funnel. Bias against the publication of studies having non-significant effects is suggested by funnel plots that include fewer effects sizes on the left than the right side of the mean effect size for an outcome. This method provides an estimate of the nature and number of studies missing from a distribution and adjusted effect sizes based on estimated contributions of missing studies.

Results

Outlier analysis

One study with a very large sample (i.e., Brekke et al., 2001) was identified as an outlier but retained for analyses because overall effect sizes did not change as a function of its inclusion or exclusion. However, three effect sizes identified as outliers in the belief-affective distress analyses—Kwan (r = −0.900), Rezaei (r = 0.380), Zyrianova (r = −0.896)—were omitted to be conservative because their retention resulted in a slightly stronger average effect size (r = -.377, p < .001, 95 % CI −.319 to −.432) compared to their omission.

Overall associations between pain beliefs and outcomes of arthritis

Highly significant, medium average effect sizes were found for overall relations between pain beliefs and each facet of problem functioning. Specifically, beliefs reflecting elevations in SE or internal/personal control beliefs as well as reductions in external control, helplessness, fear/threat/harm, and catastrophizing were related to moderately less functional impairment, affective distress, and pain severity in OA and RA samples (Table 2). Omitting studies that reported odds-ratios, standard regression or partial correlation coefficients within regression models, average effect sizes for belief-impairment relations (r = −.358, P < .001, 95 % CI −.317 to −.399), belief-distress associations (r = −.368, P < .001, 95 % CI −.329 to −.405), and belief-pain severity relations (r = −.349, P < .001, 95 % CI −.303 to −.393) were slightly larger than those obtained from retaining regression coefficient data. Table 2 also indicates medium to high heterogeneity was evident in each overall effect size analysis. In light of such variability, moderator analyses assessed effects of select methodological factors and sample characteristics on belief-functioning effect sizes.

Table 2 Overall effect sizes and heterogeneity tests for associations between pain beliefs and outcomes related to arthritis

Moderator analyses for belief-impairment associations

Table 3 presents subgroup analyses of potential categorical moderators of belief-impairment relations. The significant moderating effect of pain belief subtype revealed significant, medium average effect sizes in studies of associations between SE, helplessness, or catastrophizing beliefs and impairment. In contrast, small, though significant average effect sizes were found for studies that assessed internal/personal health LOC and threat/fear/harm beliefs. Finally, small, non-significant effect sizes were observed in studies that assessed beliefs reflecting external LOC (e.g., powerful others, chance, luck). Effect size heterogeneity was substantially lower after controlling for belief subtype but was still considerable and highly significant (Q within value = 573.637, p < .001).

Table 3 Effects of categorical moderators on associations between pain beliefs and functional impairment

Overall belief-impairment relations did not differ much as a function of having RA or OA or use of self-report versus behavior performance measures of impairment (Table 3). Within the prospective study subset, belief-impairment relations were not moderated by analysis approach: a medium average effect size was observed for prospective studies that did not control for initial impairment levels but a small, significant average effect size was also found for longitudinal studies controlling for baseline impairment, supporting the status of pain beliefs as risk/protective factors for impairment. Finally, age (point of slope = .00371, p = .239), pain duration (point of slope = .00438, p = .408), and gender composition (point of slope = .00072, p = .566) did not moderate belief-impairment relations.

Moderator analyses for belief-affective distress associations

Belief subtype also moderated belief-distress effect sizes (Table 4). Effect sizes for catastrophizing beliefs had medium to large confidence intervals while those for SE, helplessness and/or threat/fear-avoidance beliefs were medium in strength. Conversely, studies assessing LOC beliefs had small, significant mean effect sizes. Effect size heterogeneity remained highly significant after controlling for belief subtype (Q within value = 250.126, p < .001). Moderating effects of arthritis subtype and affective distress measure content were not significant (see Table 4). However, in the longitudinal study subset, analysis approach was a significant moderator; effect sizes were small and medium, respectively, for studies that controlled and did not control for baseline distress levels. Heterogeneity was substantially lower but remained significant after controlling for analytic approach in longitudinal studies (Q within values = 43.530, p < .01). Sample age (point of slope = .00255, p = .426), gender composition (point of slope = .00095, p = .506), and pain duration (point of slope = .00004, p = .993) did not moderate belief-distress effect sizes.

Table 4 Effects of categorical moderators on associations between pain beliefs and affective distress

Moderator analyses for belief-pain severity associations

Once again, belief subtype moderated belief-pain severity effect sizes (Table 5). Significant, medium effect sizes were found in studies that assessed SE, helplessness, and catastrophizing beliefs, small to medium effect size confidence intervals emerged in studies that tapped internal/personal health LOC or threat/fear beliefs, and a small, non-significant mean effect size was observed in studies of external LOC beliefs. Substantial heterogeneity remained after controlling for belief subtype (Q within values = 293.604, p < .001). Belief-pain severity effect sizes did not differ as a result of arthritis subtype (Table 5). However, in the prospective study subset, medium and small average effect sizes emerged, respectively, for studies that did not control for baseline pain severity and those that did; covarying initial pain severity levels, the effect size was reduced but significant, indicating pain beliefs served as risk/protective factors related to follow-up pain severity levels. Belief-pain severity effect sizes did not vary due to sample age (point of slope = .00220, p = .551), gender (point of slope = -.00024, p = .855), or pain duration (point of slope = .00297, p = .548).

Table 5 Effects of categorical moderators on associations between pain beliefs and pain severity

Effects of publication bias

Trim and fill analyses indicated that symmetrical effect sizes distributions were observed in study plots assessing relations of pain beliefs with each facet of functioning (see Supplementary Figures 2-4). Consequently, no studies were estimated to be missing from any of the three distributions and no effect size adjustments were made.

Discussion

In this meta-analysis, we assessed overall associations of beliefs about pain, health, and illness with functional impairment, affective distress, and pain severity in published accounts of RA and OA samples. Highly significant, medium, average effect sizes highlighted how belief/appraisal constructs based on select cognitive social learning perspectives as well as F-A models of pain have robust links with key problems in functioning in these groups. Overall findings bolstered recent meta-analyses implicating SE beliefs (Jackson et al., 2014b), F-A beliefs (Zale et al., 2013), and threat appraisals of pain (Jackson et al., 2014a) as correlates of functioning in samples having diverse pain conditions. However, in line with variable results between individual studies, overall belief-outcome effect sizes were accompanied by medium to high levels of heterogeneity.

Towards informing the potential utility of different theoretical models, the nature of pain beliefs assessed was a significant moderator across all three problem areas and partially explained effect size variability for each analysis. The longstanding recognition of SE beliefs as important influences on functioning in OA and RA samples is reflected in the development of arthritis-specific SE measures (Hewlett et al., 2008; Lorig et al., 1989) and interventions to increase SE in these groups (e.g., Barlow et al., 2002; Smarr et al., 1997). Dovetailing with reviews linking high SE levels with beneficial health outcomes in non-clinical samples (Holden, 1992) and those with various chronic benign pain conditions (Jackson et al., 2014b), medium effect sizes for SE in this meta-analysis underscored how confidence in one’s capacity to perform particular actions corresponds to reduced impairment, distress, and pain intensity among RA and OA patients. While analyses could not demonstrate causal effects of SE beliefs on these problems, in the context of evidence showing how treatment-based improvements in SE contribute to reductions in disability, reported pain, and/or emotional distress (e.g., Lorig & Holman, 2003), graded mastery experiences, vicarious learning of mastery, physiological arousal reinterpretation, and social persuasion may enhance SE (Bandura, 1997; Lorig et al., 1989) and reduce negative psychological consequences of OA and RA.

Aside from SE, medium average effect sizes for helplessness beliefs indicated patients who believed they were incapable and/or unable to influence the course of their illness were especially prone to elevations in impairment, affective distress, and pain severity. Such results may not elucidate underlying mechanisms but some authors have argued helplessness beliefs are part of a vicious cycle that includes passive coping and impairment which, in turn, perpetuate helplessness (e.g., Covic et al., 2003; Smith & Wallston, 1992). Respectively, helplessness beliefs and SE reflect dispositional tendencies and highly specific actions. Nonetheless, beliefs that one is ineffective in or incapable of undertaking actions that affect functioning are a common element of helplessness and low SE. Consequently, interventions designed to increase SE (e.g., graded mastery) may help to reduce helplessness beliefs as well.

Average effect sizes were more modest and/or not significant in studies drawing from alternate social cognitive paradigms that evaluated relations between health, illness, or pain LOC beliefs and functioning. More precisely, small, statistically significant average effect sizes were found for relations between internal LOC and each problem in functioning while mean effect sizes for powerful others’ and chance/luck LOC were small to very small and not reliably significant. One potential limitation of using health LOC scales in groups with pain conditions is the possibility that LOC orientation related to one’s general health status is not the same as LOC beliefs about one’s illness (Wallston et al., 1994). Relatedly, patients with chronic pain may not know how to interpret or respond to health LOC queries such as, “If I do the right things, I can stay healthy”. While the development of illness- or pain-specific LOC scales has helped to address these limitations, a strong internal LOC orientation towards health or illness may have more limited benefits for people who feel incapable of carrying out behaviors that improve functioning (Schoenfeld-Smith et al., 1996). On this basis, assessments and interventions that focus on increasing SE and/or reducing helplessness may be more useful in working with OA and RA patients than evaluating and/or attempting to change LOC beliefs.

F-A models emphasize beliefs about pain as a threat for potential future tissue damage or ongoing harm and pain catastrophizing as precipitants of activity avoidance, physical disuse, disability, and increases in pain (Vlaeyen & Linton, 2000). Even though avoidance can occur both in the presence and absence of fear or threat appraisals (Wideman et al., 2013), significant small to medium average effect sizes for relations between problems in functioning and beliefs reflecting F-A, threat, and/or harm supported conclusions of meta-analyses on sundry pain conditions (Jackson et al., 2014a; Zale et al., 2013) and contentions that F-A model features apply to OA and RA patients (Heuts et al., 2004; Steultjens et al., 2002). Medium average effect sizes found for catastrophizing and each focus of problem functioning were even stronger than those based on F-A, threat or harm beliefs. Pain catastrophizing scales have substantial associations with F-A belief and threat appraisal indices but these constructs are also somewhat distinct (Jackson et al., 2014a). For example, content reflecting pain as a potential threat is present in the two most popular pain catastrophizing scales used in this meta-analysis (Rosentiel & Keefe, 1983; Sullivan et al., 1995), yet both scales include items that tap perceived helplessness. As noted above, beliefs about one’s incapacity or ineffectiveness in making changes have powerful repercussions for functioning among RA and OA patients.

In contrast to pervasive moderating effects of pain belief content, there was no evidence that belief-problem area effect sizes were moderated by age, gender composition, arthritis subtypes, or pain durations. Jackson et al. (2014b) reported SE-impairment relations were comparatively stronger in older samples and those having pain for longer periods of time. However, samples assessed in that meta-analysis were over 12.5 years younger on average, had a substantially shorter mean pain duration, and were far more variable in relation to pain conditions assessed compared to samples included in this review. Hence, beliefs about pain and health may have similar implications for difficulties in functioning regardless of age, pain duration, gender composition or arthritis subtype of samples. The non-significant moderator effect for impairment measure type highlighted how average strengths of relation between pain beliefs and self-reported impairment did not differ markedly from those based on performance-based impairment indices. Consequently, methodological concerns such as common method variance (i.e., exclusive use of self-report measures) do not provide a plausible explanation for belief-impairment findings. Similarly, effect sizes for belief-affective distress relations were comparable regardless of whether distress content reflected anxiety/fear, depression, or general emotional distress. Hence, at least when certain types of appraisals are considered, subject matter overlaps in the measurement of pain beliefs and emotional distress (e.g., fear) do not account for associations.

Finally, moderator analyses of longitudinal study subsets related to follow-up levels of impairment, distress, and pain severity indicated, as expected, that average effect sizes were stronger when baseline levels of impairment, distress, and pain severity were not controlled. Nonetheless, all average effect sizes for relations between baseline pain beliefs and follow-up adjustment difficulties were significant even when corresponding baseline responses on measures of adjustment problems had been statistically controlled. Significant effect sizes based on this more stringent analytic approach supported the proposition that beliefs about pain, health and illness are risk/protective factors for, not mere correlates of, later problems in functioning in these patient groups. As such, interventions that change less adaptive pain beliefs would seem to have important implications for later outcomes. While several prospective studies featured follow-up assessments 2–4 years after baseline (e.g., Brekke et al., 2001; Miller and Cronan (1998); Rayahin et al., 2014; Smith et al., 1994), these results were based on average baseline to follow-up intervals of 8–10 months. Consequently, additional long-term longitudinal research is needed to gauge the impact of initial pain beliefs on problems in functioning over more extended periods.

A particular strength of this meta-analysis was the evaluation of a large number of relevant studies, all of which featured methodological strengths including reliable, valid, sensitive measures, minimum sample sizes, and candid reporting of effect size information. Nonetheless, several limitations of the review must also be acknowledged. First, due to the focus on RA and OA samples, generalizations cannot be made to other types of arthritis such as fibromyalgia or gout or non-arthritic chronic pain syndromes. Second, findings may not apply to pain beliefs (e.g., pain as a cause of disability, others’ solicitude when one is in pain) and facets of functioning (e.g., anger as the expression of distress, fatigue, medication use, health care visits) outside the scope of this review. Third, given that effect size variability was significant after moderating effects of pain belief content were controlled, other factors may have affected heterogeneity. While age and gender were nearly always reported in selected studies, other key demographics including employment status, socioeconomic status (SES), and relationship status were examined less often; assessing these factors as a matter of course within individual studies would facilitate evaluations of possible moderating effects of SES indicators in future meta-analyses. Another plausible methodological influence on heterogeneity was the wide variety of measures used to assess impairment, distress, and pain severity. For example, 44 different impairment indices were used including those that reflected perceived physical functioning, psychosocial functioning, and global quality of life. While the inclusion of a broad array of impairment measures bolstered the external validity of studies from this domain, it likely contributed to effect size heterogeneity as well.

Fourth, some included studies, particularly those that relied on stepwise regression analyses, selectively reported only statistically significant effect sizes and excluded such details for non-significant predictors. This practice is problematic because it can inflate risk for type I errors and distort the scientific record to seem more robust than it is in reality. In light of this concern, we urge researchers and those who evaluate their work to ensure all relevant effects are reported transparently and in sufficient detail, including those that are not statistically significant. Doing so can help to ensure sober conclusions are drawn about this and other literatures. Finally, causal effects of pain beliefs on problems in functioning could not be tested because neither experimental manipulations of pain beliefs nor random assignment to groups was characteristic of included studies. As intervention-based studies accumulate, causal effects of modifying pain beliefs on changes in functioning may be clarified.

Conclusion

In sum, this meta-analysis indicated beliefs about health, illness and/or pain have significant moderate overall associations with impairment, distress and pain severity in RA and OA samples. Across all three problem areas, effect sizes were comparatively larger in studies that assessed beliefs related to SE, pain catastrophizing, and/or helplessness. Given personal incapacity or ineffectiveness is a content theme that each of these belief categories has in common, the use of belief indices that reflect this focus within assessment and interventions that increase subjective appraisals of effectiveness would seem to be critically important in these patient groups. In contrast to the impact of pain belief content of results, overall effect sizes were not moderated by arthritis subtype, pain duration, age, and gender composition of samples, use of subjective versus behavior performance measures of impairment or content of affective distress indices (i.e., depression, anxiety-fear, or general psychological distress). However, moderator analyses of longitudinal study subsets established beliefs about health, illness, and pain not only as correlates of subsequent functioning but also risk/protective factors that may predict the likelihood that adjustment will be more or less disturbed an average of 8–10 months later. Future work should assess the relative impact of beliefs reflecting one’s own ineffectiveness in dealing with pain and/or accompanying concerns versus other types of pain beliefs on problems in functioning within other chronic pain conditions. Furthermore, prospective studies with follow-up durations beyond 1 year would help to clarify the long-term impact of baseline pain beliefs on functioning of chronic pain samples.