Introduction

Chronic musculoskeletal pain has a substantial negative impact on quality of life and the ability to engage in meaningful activities and participation in the society, including work [1, 2]. In the Netherlands, musculoskeletal disorders such as back, neck and shoulder pain constitute about 35 % of all sickness absence and long-term disability compensations [35]. Medical care utilization and sickness absence due to musculoskeletal pain are associated with high economic burden to society similar to other western countries worldwide [6]. The majority (~80 %) of the costs are related to the inability to work [5]. One of the preferred interventions to promote return to work for patients with chronic musculoskeletal pain is vocational rehabilitation, because it has been proven to be effective in reducing disability and improving work participation, and it appears to be cost-effective [7, 8]. To further improve the effectiveness of vocational rehabilitation to optimize work participation, it is recommended to intervene as soon as possible, perhaps even as soon as the sub-acute phase of musculoskeletal pain [9, 10]. A network of 14 rehabilitation centers in the Netherlands has been established to deliver evidence-based vocational rehabilitation for workers with sub-acute and chronic musculoskeletal pain.

Vocational rehabilitation (VR) in its broadest form has recently been defined in a position paper as ‘a multidisciplinary evidence-based approach that is provided along a continuum of services and activities to working age individuals with health-related impairments, limitations, or restrictions with work functioning, and whose primary aim is to optimize work participation [11]. The authors of the position paper proposed the use of the International Classification of Functioning, Disability and Health (ICF) within the VR field (regardless of health condition). On one hand, the ICF Core Set for Vocational Rehabilitation has been developed with two versions: (1) the 90 ICF categories of the comprehensive version is intended for multidisciplinary setting and (2) the 13 ICF categories of the brief version is intended for single discipline encounter or clinical trials. The brief version due to less number of ICF categories is doable for practical application and feasible in VR-related patient evaluation and assessment [12]. On the other hand, the Initiative on Methods, Measurement, and Pain Assessment in Clinical Trials (IMMPACT), has provided recommendations for improving the design, execution, and interpretation of clinical trials of treatments specific to patients with pain [13, 14]. IMMPACT has proposed a core set of outcome measures for patients with pain. These recommendations were also broad, because they were intended to encompass the wide field of pain care, which extends far beyond the context of vocational rehabilitation.

So, here are two sets of recommendations or sets of domains, ICF Core Set and IMMPACT, which broadly address VR and pain, respectively. For clinical use and research of pain within the context of VR, developing a specific core set is needed, while learning from the two existing sets. The recommendations of the brief ICF Core Set for Vocational Rehabilitation and those of IMMPACT cover two broad areas, but should be merged and made applicable to a specific context, patient group and setting (pain and VR in the Netherlands in this study). However, we did not find papers relevant to the Netherlands, or anywhere else, describing the process and outcome of an ICF-IMMPACT core set, let alone the operationalization of those domains. The aim of the present study was to develop a core set of diagnostic and evaluative (clinical and economic) measures specifically for vocational rehabilitation of patients with sub-acute and chronic musculoskeletal pain, while using the ICF as the reference framework in VR, and IMMPACT in the outcome measurements around pain. In this study, the context is situated in the Netherlands, including its health care and social security policies as of the year 2012. As part of integrating our knowledge on the ICF, work, and pain, our research question is: how can the brief version of the ICF Core Set for Vocational Rehabilitation and the IMMPACT recommendations be best applied in one blended VR-Pain Core Set for patients with sub-acute and chronic musculoskeletal pain in the Netherlands?

Methods

Three main steps were taken. The first step was to remove irrelevant and duplicate domains of the brief ICF Core Set for Vocational Rehabilitation and the IMMPACT recommendations. Irrelevant domains were defined as those domains that do not apply or only apply to an estimated 1 % of the target population (as judged by the authors and the expert panel). Duplicate domains were defined as domains that cover overlapping, equal or very similar content or concept. Additionally, the remaining domains were checked to see whether they could be used for economic evaluations also. If not, this was added. The second step was to match the remaining domains with existing instruments or measures. Instruments were proposed based on availability and its proven use in Dutch practice and peer reviewed literature. Existing instruments were included based on proof of sufficient reliability (test–retest reliability: Intra Class Coefficient (ICC) >0.90 (preferred), Kappa >0.60, Pearson correlation coefficient >0.80; internal consistency: Cronbach’s alpha >0.80 [15]; construct validity (yes/no/not applicable (na)); responsiveness to change (yes/no/na; relevant for outcome measures only); existence of a validated version in Dutch language (yes/no; relevant for questionnaires only); and feasibility (acceptable patient and practitioner burden: yes/no). The second step was not performed to provide a systematic review of the psychometric properties of all instruments available, but to check whether the psychometric properties of the proposed instruments of the preliminary VR-Pain Core Set were acceptable.

The result of step 1 and 2 was a preliminary version of what we would call the VR-Pain core set. In step 3, to be informed by input from relevant people, the preliminary VR-Pain core set was presented to 3 expert or user panels: Dutch VR centers (proposed users (management and clinicians); n = 13), Dutch pain rehabilitation development centers (pain rehabilitation experts; n = 4), and members of the VR-Pain Core Set consensus group (VR experts; n = 23) [12]. Participants were sent the introduction to, methods and results of steps 1 and 2, including the preliminary VR-Pain core set. They were asked whether they agreed with the taken steps and the proposed core domains of the preliminary VR-Pain Core Set, and whether they agreed with the proposed instruments. In case of non-agreement, they were asked to explain their disagreement and to suggest improvements. In case the comments were unclear, the first author contacted the responder. All participants had 3 weeks to respond. Participants were sent a reminder after 2 weeks. The authors of this paper then synthesized the comments of the responders into a final VR-Pain core set.

Results

Step 1

The domains of the brief VR-Pain Core Set and the IMMPACT recommendations are presented in Tables 1 and 2. The results of Step 1, the selection of irrelevant domains and reduction of duplicates, are also presented in Tables 1 and 2.

Table 1 ICF categories of the brief ICF Core Set for Vocational Rehabilitation and relevance of the domains to the proposed VR-Pain core set
Table 2 IMMPACT recommendations and supplemental domains and relevance of the domains to the proposed VR-Pain core set

Step 2

Results of the process of matching core set domains to instruments, including its quality appraisal, are presented in Table 3. Additions as described in step 3 were also incorporated in Table 3. Domains from the IMMPACT recommendations were provided with ICF codes, with exception of personal factors which are not currently coded in the ICF.

Table 3 Quality appraisal of VR-Pain core domains classified per ICF category

Step 3

The preliminary VR-Pain core set was emailed to members of the expert panels in February 2012. Overall response was n = 18 (response rate 45 %); proposed users n = 11 (85 %), pain rehabilitation experts n = 4 (100 %), VR experts n = 3 (13 %). Of the VR experts, an additional n = 3 responded that the specific nature of the subject of this study was out of their field of expertise. One of the VR experts was contacted by phone, because the answers and comments were ambiguous. Eleven (61 %) respondents agreed with the proposed domains of the preliminary core set, while five disagreed, and two did not answer. Ten (55 %) respondents agreed with the proposed instruments of the preliminary core set, five disagreed, and three did not answer or indicated to have insufficient knowledge to judge. ‘Disagreements’ were most often accompanied by a short explanation and/or suggestion. The project members have decided unanimously that some comments should not be regarded as disagreements with the proposed domains or instruments, but rather as an item that a single expert proposed to add to the preliminary set. However, because not single experts, but rather the brief ICF Core Set for Vocational Rehabilitation and the IMMPACT recommendations formed the basis of this new and specific core set, it was decided that items proposed by single experts were not added to the definitive set, unless the project team decided otherwise based on the underlying core sets.

Based on the responses of the participants, the following domains were added to the VR-Pain Core set: adverse effects that has not lead to discontinuation of the program (adherence to the intervention; treatment records) and personal problems unrelated to work (Work Reintegration Questionnaire; WRQ). With regard to the instruments, the following measurements were changed or added: energy and drive functions (ICF code b130) will be measured with numerical rating scale (NRS) for fatigue; physical functioning will not be measured with the Pain Disability Index only, but also with RAND-36 scale physical functioning; Astrand or Bruce submaximal ergometry will be used to measure exercise tolerance functions; assessment of functioning at home or in unpaid work will be added as part of the demographic questionnaire.

Description of Instruments of the Final VR-Pain Core Set

The EuroQol-5D (EQ-5D) is a 6-item questionnaire to investigate quality of life. The EQ-5D categories measure 5 dimensions: mobility, self-care, activities of daily life, pain and anxiety/depression. Five questions are categorical (1–3 scale) and one question assessing general health status is on interval level (VAS 0-100). A Dutch language version of the EQ-5D is available [16, 17]. The EQ-5D is a widely employed instrument to assess health related quality of life (QoL), is used in cost effectiveness research based on quality adjusted life years (QALY) and is recommended by the Dutch Healthcare Insurance Board [18]. Lower levels of QoL are associated with productivity loss in patients with low back pain [19].

A single item of the Work Ability Index (WAI) will be used to assess self-reported work ability. Current work ability compared to lifetime best can be scored on a 0–10 response scale, where 0 represents ‘completely unable to work’ and 10 ‘work ability at its best’. A strong association between the single item and the complete WAI was observed (r = 0.87) [20].

The PROductivity and DISease Questionnaire (PRODISQ) [21] will be used to assess employment status, absenteeism and presenteeism. Absenteeism refers to time off from work. Presenteeism refers to productivity loss while at-work. Both may be associated with a health condition. Absenteeism is measured with a three-month recall period, and will be measured specifically related to pain condition. The number and duration of a maximum number of three absenteeism periods are collected. Presenteeism is measured with two items on a 11-point scale, also known as the QQ-index (quantity and quality). The first item measures quality of work done in the last day at work, ranging from 0 (I couldn’t do anything) to 10 (I could do the same as normal). The second item measures quantity of work done in the last day at work, ranging from 0 (the quality of my work was dramatic) to 10 (the quality of my work was normal).

The Pain Disability Index (PDI) is a 7-item questionnaire to investigate the magnitude of the self-reported pain related disability, independent from region of pain or pain-related diagnosis. The questionnaire is constructed on a 0-10 numeric rating scale in which 0 means no disability and 10 maximum disability. Total scores can range from 0 to 70, with higher scores reflecting higher interference of pain with daily activities. The PDI measures family / home responsibilities, recreation, social activity, occupation, sexual behaviour, self-care and life support activity [22, 23].

The RAND-36 scale physical functioning will be used to measure self-reported physical functioning independent of (pain) diagnosis [24]. The RAND-36 has been used widely across health conditions (www.rand.org, accessed August 2012). The physical functioning scale consists of 10 questions with 3 possible answers on a Likert scale: ‘yes, strongly limited’, ‘yes, a bit limited’, and ‘no, not limited’. The total score can range from 0 to 100, with higher scores indicating better physical functioning. The validity and reliability of the Dutch version are good [25].

The Work Reintegration Questionnaire (WRQ) is an instrument for assessing the most important psychosocial factors in the delay of recovery and work resumption. The questionnaire consists of 78 items distributed among 8 scales; ‘Distress’, ‘Illness behaviour’, ‘Job strain’, ‘Job dissatisfaction’, ‘Control’, ‘Avoidance’, ‘Perfectionism’ and ‘Stressful home situation.The Work Reintegration Questionnaire (WRQ) measures the following dimensions: distress, interference, work stress, work satisfaction, insecurity / avoidance, perfectionism / persistence, home situation [26]. The questionnaire was developed in Dutch (VAR: vragenlijst arbeidsreintegratie). A validated translation in English is currently in development (personal communication with author).

Pain intensity and fatigue can be assessed using an 11-point NRS (NRS-pain and NRS-fatigue), ranging from 0 (no pain / fatigue) to 10 (worst possible pain / fatigue), requiring patients to rate their current and average intensity of the last 7 days [9].

Exercise tolerance functions will be assessed with standardized lifting capacity tests from the Workwell functional capacity evaluation (FCE): lifting low and overhead lifting. Procedures are described in detail elsewhere [27]. These tests were found to be predictive of functional capacity performance in general in patients with back pain and neck / upper extremity pain [28]. A standardized submaximal Astrand bicycle test [29, 30] or Bruce treadmill test [31] will be used to assess exercise tolerance functions as well as energy and drive functions.

The Trimbos iMTA questionnaire for measuring Costs of Psychiatric Illnesses (TiC-P), module 1, will be used to assess health care utilization. The questionnaire has a recall period of 4 weeks. Visits and consultations of the following health care providers were measured: general practitioner, physiotherapist, manual therapist, exercise therapist, occupational therapist, psychologist, insurance physician, medical specialists in hospitals, hospitalization (number of days), occupational physician, social worker, and dietician. Further items were alternative care, home care, medication use, and job related care like job coaches, ergonomic changes at the work site and re-integration specialists [32]. Slight adaptations in the context and scope of health care practitioners were made to better fit TiC-P to the target population (i.e. from psychiatry to pain and work).

Global perceived effect (GPE) can be measured with a 7-point Likert scale ranging from 1 to 7 (1; ‘extremely worsened’, 2; ‘much worsened’, 3; ‘little worsened’, 4; ‘unchanged’, 5; ‘little improved’, 6; ‘much improved’, 7; ‘completely improved’). Two GPE questions are proposed: how much did your treatment change your pain compared to pre-treatment level, and how much did your treatment change your work status compared to pre-treatment level?

Treatment records will be used to assess diagnosis, adherence to the treatment program, adverse effects that has not lead to discontinuation of the program, and reasons for premature withdrawal.

Discussion

Sub-acute or chronic musculoskeletal pain can lead to a decrease in work participation up to the point where VR becomes essential. It is important to understand and address musculoskeletal pain in the context of VR because if we can mitigate the burden of work disability, we can facilitate early and sustained return to work. To do so, in this study, we attempted to blend two sources of domains around pain and VR, listed the instruments by which we can operationalize the domains, and developed a VR-Pain core set that may benefit clinical and research application in the VR-pain field in the Netherlands and potentially in other countries as well. To this end, the final VR-Pain core set consisted of 12 instruments that covered 18 domains.

As presented in Tables 1 and 2, domains and items of the underlying sets were removed by the authors because they were not deemed relevant for the (vast majority of) the target population. While this was not or incidentally challenged by members of the expert groups, this still needs elucidation. Acquiring skills (ICF code d155 Acquiring skills) for example, was excluded, because this it is not primarily affected (or core) in people with pain, and therefore not a goal in VR. Acquiring skills was deemed very relevant for the brief ICF Core Set for VR, because VR in its broadest form is provided to workers with a range of disabilities reaching far beyond pain, including workers with neurological and intellectual problems. Because of the specific setting for which the VR-Pain core set was developed, other items that were initially included to accommodate the wide application of both ICF and IMMPACT sets did not make to the core and final VR-pain set like unpaid work, acquiring, keeping, and terminating work, end-of-life issues, and higher level cognitive functions. To exclude the latter, however, may be subject to debate, because workers with pain often report challenges with concentration and memory. These concomitant complaints are regarded as related to pain and fatigue (which are already included in the final set), and perhaps symptoms related to central sensitization. Non-specific pain does not directly affect the brain and higher neurological functions as captured in ICF code b164 Higher level cognitive functions. For similar reasons biological markers were also excluded. Finally, while environmental issues such as insurance and social security systems are considered relevant [4] and vary across jurisdictions, they do not vary across the workers in the specific setting for which this VR-Pain core set was developed for. For generalizations beyond the Dutch borders, we advise researchers to describe the issues in future reports within the context or controlling for insurance and social security systems.

Most instruments proposed to make up the VR-Pain core set all comply with basic clinimetric properties as presented in Table 3. Because the properties included validated Dutch language versions of questionnaires and feasibility, the set of instruments proposed is likely to differ from core sets for different countries and languages. Additionally, we have attempted to choose instruments that could be used for clinical as well as for research purposes, including economic evaluations. Even though EQ-5D was not recommended by either one of the underlying core sets (ICF Core Set or IMMPACT), it was added because this instrument can be used for economic evaluations. Additionally, it captures an important secondary aim of vocational rehabilitation, which is to contribute to increase quality of life.

New core sets that apply to specific groups may thus be developed based on existing core sets. The exercise of developing a new core set based on two established ones has not been presented previously. Thus, the methodology described in this paper is new. We have aimed to describe this methodology transparently, to enable readers to either replicate these steps when developing or validating other core sets specific to their setting, or to use it as a basis for further development of this methodology. By asking Dutch experts in the pain rehabilitation field, prospective users and international VR experts, we aimed to test the content validity of the newly developed core set. However, this paper may also be regarded as external validation of the underlying core sets. In choosing the instruments, we aimed to combine sound psychometric properties with the options for future cost-effectiveness studies or intervention trials. This will enable future users to study clinical and economic outcomes in the (Dutch) usual care setting, which should make a significant contribution to the field of VR and pain.

While the response rates of the Dutch pain rehabilitation experts and prospective users was high, response of the international experts was low. Some international experts responded that this exercise was specifically not in their sub-field of expertise (e.g. cognitive vocational rehabilitation), the majority of this group did not respond at all, which may be attributed to lack of time availability or were unable to follow up on the electronic invitation and reminder. The relevance of this non-response is unknown. Because based on the responses only small changes were made to the final core set, and no differences in response patterns between expert groups were observed, we assume that the relevance of the non-response to be limited. Patients were not invited to participate in this specific exercise, because patient involvement was already incorporated in the development of the two underlying core sets. Both the ICF Core Set for Vocational Rehabilitation and the IMMPACT recommendations are in principle experts-based. Even though the VR-Pain core set was agreed upon by most experts (‘externally validated’), it is in its essence also an experts-based core set. Future use will discover whether the set of instruments is deemed too extensive for routine clinical use, and where and why this core set should be adapted to new developments in the VR-Pain field.

In line with the recommendations underlying the ICF VR expert group [12], the lack of classification of the personal factors in the ICF which can play a crucial influence on work functioning, will need careful consideration in the future. Although some performance-based instruments are included in the final VR-Pain core set, the majority of the instruments are self-report based. Apart from its strengths (outcomes are judged by the patients, not by or interpreted by others), this may also introduce a risk of bias, particularly in the estimation of absenteeism and presenteeism. Additionally, while clinimetric properties of the individual instruments in the VR-Pain core set were checked, they were not systematically reviewed.

The clinical relevance of using this VR-Pain core set is that it will provide a firm base for routine clinical use and evaluation of services in vocational rehabilitation settings with pain-related cases. Clinicians can, based on their clinical expertise or professional guidelines, add diagnostic instruments to this core set as needed. Moreover, the VR-Pain core set should not replace clinical expertise, but rather should complement it. The methodology described in this paper may be generalizable to develop other setting-specific core sets or a combination thereof. Additionally, most of the instruments in the VR-Pain core set are used internationally, which will address generalizability and comparability. Costs calculations underlying the EQ-5D, PRODISQ and TiC-P questionnaires, however, are based on Dutch guidelines which are expected to be different from other countries. While the VR-Pain core set is developed for the Netherlands, the burden of pain and work disability in the Netherlands is similarly high as in other industrialized countries [1]. Therefore, it is recommended that similar core sets are to be developed and tested for different countries. To enable generalization across countries, facilitate common language and stimulate future developments, we recommend that whenever possible, the same instruments are used.

Conclusion

A VR-Pain core set with content that covers relevant domains for pain and VR and with validated corresponding instruments that measure these domains has been developed. The VR-Pain core set may be used for clinical purposes, and (cost) effectiveness research in the field of vocational rehabilitation and pain. Caution is warranted for direct use outside of The Netherlands, because differences in cultural and service and political systems exist, hence the basis for costs calculations may be different. Additionally, for use and generalization beyond that of The Netherlands, it is recommended that environmental factors (ICF e580 and e590, Table 1) be considered and examined.