Introduction

Neck pain, which often causes great pain, disability and financial loss, is a very common condition worldwide [1, 2]. It is a very common disease of the musculoskeletal system [3]. The annual incidence of neck pain in the general population is about 15%, with a 1-year prevalence ranging from 4.8 to 79.5% [4, 5]. In addition to a patient's functional status, cervical pain and impaired mobility can negatively impact work activities and quality of life postpartum. Thus, it is critical that clinicians use valid and reliable measures early on to assess patients' disability and outcomes [6, 7].

A patient-based, self-administered Health-Related Quality of Life (HRQoL) questionnaire was developed in order to gain a better understanding of the severity of their illness and to identify more appropriate treatment options [8].

Scoring systems are often used in multicenter studies conducted in different countries and cultures, and there is a growing need for this to increase the statistical power of evidence-based experiments [9, 10]. However, prior to the use of a reliable and valid HRQoL questionnaire in different cultural contexts, in addition to translating the questionnaire, it is necessary to maintain the content validity of the tool in order to avoid cultural bias [11, 12].

In 1998, Deyo et al.[13] recommended a simple and effective tool for assessing outcomes in lumbar spine disease, a set of six core questions known as the Core Outcome Measurement Index (COMI). These questions assessed pain (axial and radiating), functioning, symptom health and disability (social and occupational). The COMI is the European Spine Society's (EUROSPINE) primary patient-reported outcome scale for international spine surgery [14]. The psychometric properties of the COMI-neck Scale, developed to complement the COMI Back Scale, have been validated [15]. COMI has been shown to be reliable and valid in patients with chronic neck pain and those who have undergone disk replacement for degenerative disorders of the neck [15, 16]. Cross-cultural adaptations of the COMI for neck pain have been done in various languages including Turkish [17], Polish [18] and Dutch [19]. However, there is no Chinese version of the COMI-neck scale.

The aim of our study was to cross-culturally translate and adapt COMI into COMI-SC, and assess its reliability and validity in native Chinese with neck pain.

Materials and methods

Patients and data collection

A total of 122 patients were enrolled from seven departments. The inclusion criteria were as follows: (1) patients complaining of chronic neck pain (more than 3 months); (2) aged > 18 years; (3) able to read and understand Chinese;(4) patients who signed an informed consent and agreed to participate in this study. Exclusion criteria were patients with spinal tumor, trauma (including fractures) or whiplash, central and peripheral neurological problems, systemic diseases and psychiatric disorders.

In order to test the upper or lower limit effect, validity and reliability of the scale, Terwee et al. [10] determined that the study should include at least 50 patients in order for the sample size to meet the criteria. Informed consent forms were signed by all recruited patients. The Clinical Research Ethics Committee of PLA General Hospital, Beijing, China, approved this study.

Patients were recruited and completed the COMI-SC, Neck Disability Index (NDI), Neck Pain and Disability Scale (NPDS), Visual Analogue Scale (VAS) and Short Form Health Survey (SF-36) after first providing sex, age, height, weight, marital status, pain duration and education. Two weeks later, the COMI-neck was administered again to 122 patients. Test–retest reliability was determined. (Table 1).

Table 1 Demographic characteristics of participants

Translation and cross-cultural adaptation questionnaires

We translated and adapted the English version of the COMI in five steps, based on previous guidelines and research and with the consent of the original authors [11, 20]. The steps included forward translation, translation synthesis, backward translation, summarization of the prefinal version and finalization (Table 2). After pretesting the patients, all researchers discussed the issues in the above five steps, the Chinese version of COMI was developed.

Table 2 Steps of translation and trans-cultural adaptation

Instruments

The core outcome measures index for the neck (COMI-neck)

COMI-neck is a short, self-administered complete scale developed by White et al. [15] in 2004. It can be used to assess neck pain. The COMI-neck consists of 7 items. Five dimensions are assessed: pain, neck-related function, symptom-specific well-being, general quality of life and disability (social and work). Questions 1–5 ask how the patient felt in the previous week, and the last two questions ask about disability in the patient's first four weeks. A 5-point adjective scale was used for the remaining questions except for pain, which used a 0–10 graphical rating scale (GRS). For questions 1 and 2, pain is represented by the higher of the two pain scores (''COMI high pain'') (0–10). The average of the last two items represents the disability dimension. COMI-neck is therefore divided into five dimensions: pain, function, symptom-specific wellness, QoL and disability (social and occupational). The final COMI-neck score is averaged across the five dimensions and the score for each dimension is converted to a 0–10 scale. A higher score indicates a worse patient status [15, 16].

Neck disability index (NDI)

The NDI is a self-administered 10-item scale that measures pain intensity, personal care, weight lifting, reading, headaches, concentration, work, driving, sleep and recreation [21, 22]. Each item is scored from 0 to 5, and an overall score is given for each dimension. The NDI has proven reliable and valid in patients with neck pain.

Neck pain and disability scale (NPDS)

The NPDS is a 20-item questionnaire designed to assess neck pain and disability. It is based on the One Million Visual Analogue Scale [23]. A 10 cm visual analogue scale (VAS) is used for each item. Scores range from 0 (no pain or disability) to 50 (maximum pain or disability). There are 6 vertical lines and 5 grids on each line. The vertical grid is worth half a point, and a mark between the vertical line and the grid increases the score by a quarter. The lowest score is 0 points. The highest score is 100 points.

The short form health survey (SF-36)

SF-36 is a self-reported health questionnaire that assesses patients' overall quality of life [24]. The questionnaire is divided into 8 dimensions, namely physical function (PF), social functioning (SF), physical role (RP), emotional role (RE), mental health (MH), vitality (VT), physical pain (BP) and general health (GH) [25].

Visual analog scale (VAS)

VAS is a visual analogue scale of a patient's perception of pain levels, with a score ranging from 0 (no pain at all) to 10 (worst possible pain) [26].

NDI, NPDS, VAS, and SF-36 have been translated into Simplified Chinese and proven with good reliability and validity [22, 27].

Psychometric assessments and statistical analysis

The questionnaire has both floor and ceiling effects when more than 15% of patients score 0 (lowest) or 10 (highest) [28]. Retest reliability and internal consistency are standard methods for testing COMI reliability [29]. Internal consistency is measured using Cronbach's alpha. Terwee et al. suggest a value of 0.70–0.95 for good internal consistency [10]. To determine test–retest reliability, the intra-group correlation coefficient (ICC) derived from a two-way ANOVA in a random effects model is used. ICC > 0.8 and > 0.9 represent better and worse reliability, respectively. The same patient was asked to complete the COMI-SC questionnaire again two weeks after completing it for the first time. Bland–Altman plots are used to assess intra-object variability and inter-object systematic error [30, 31].

Assessing the content and construct validity of the COMI-SC is also important. Pearson correlations were calculated between the COMI-SC and the NDI, NPDS, SF-36 and VAS. Content validity was assessed by two rehabilitation specialists and three orthopedic surgeons who analyzed the relevance of each item to the patient's condition. Construct validity was assessed by calculating Pearson correlation coefficients between the COMI-SC and the NDI, NPDS, SF-36 and VAS. By comparing the correlation of the data with the calculation, the construct validity of the COMI-SC was assessed. The correlation coefficients were poor ( − 0.2 < r < 0.2), fair (0.2 < r < 0.4 or  − 0.4 < r < -0.2), moderate ( − 0.4 < r < − 0.6 or − 0.6 < r < − 0.4), good (0.6 < r < 0.8 or  − 0.8 < r < − 0.6) or almost perfect (0.8 < r < 1.0 or − 1.0 < r < − 0.8) [32].

To further test the structural validity of the questionnaire, exploratory factor analyses (EFA) and confirmatory factor analysis (CFA) were performed [33].

All statistical analyses were performed using the Statistical Package for Social Science (SPSS) for Windows version 27.0 (IBM Company, Armonk, NY) and IMS SPSS AMOS 24.0 (IBM Company, Armonk, NY).

Results

Cross‑cultural adaption of the COMI-neck

There was no semantic or linguistic ambiguity in the forward and back translation of COMI-neck into Simplified Chinese. All projects were approved by the expert group. It was found that the patients had no ambiguity about the topic after testing the first version of COMI-neck.

Floor and ceiling effects

For each questionnaire, the percentage of floor effect (worst condition) and ceiling effect (best condition) is shown in Table 3. No floor or ceiling effect was observed in any of the individual COMI-neck projects. Nor do the NDI, NPDS and VAS projects show an obvious floor or ceiling effect. There were high floor effects at baseline (81.5% and 87.1%) for some of the individual items of the SF-36 (physical role and emotional role).

Table 3 Absolute values of all scores

Reliability

The Cronbach's alpha of the overall COMI-neck was 0.861. This indicates good internal consistency between the items. Table 4 shows the ICC values for the five dimensions of the COMI-neck. There was no significant difference between the retest scores. The Bland–Altman plot (Fig. 1) showed no systematic error. Both results are an indication that the COMI-neck has excellent retest reliability.

Table 4 Test–retest analysis of COMI-neck
Fig. 1
figure 1

The Bland–Altman plot for test–retest agreement of COMI-neck. The differences between scores for COMI-neck from test and retest were plotted against the mean of the test and retest. The line indicates mean difference value of the two sessions and the 95% (mean ± 1.96 standard deviation) limits of agreement

Validity

The content of the questionnaire was judged to be valid enough to assess the impact of neck pain in the enrolled participants by rehabilitation specialists and orthopedic specialists in COMI-neck.

Table 5 shows the correlation coefficients of COMI-neck with NDI, NPDS, VAS and SF-36 subscales, indicating good construct validity. COMI-neck includes 5 dimensions (pain, neck function, symptom-specific well-being, QoL and disability (social and work)) and a summary score. The results showed that the COMI-neck was moderately correlated with the NDI (0.890, P < 0.001). It had a very good correlation with the VAS (0.845, P < 0.001). It correlated moderately with neck dysfunction and disability of the NPDS (0.420, P < 0.001), pain intensity during movement (0.416, P < 0.001), static neck pain intensity (0.437, P < 0.001). And it was also moderately correlated with the physical function of SF-36 ( − 0.989, P < 0.001), role physical ( − 0.597, P < 0.001), bodily pain of SF-36 ( − 0.639, P < 0.001), general health ( − 0.563, P < 0.001), vitality ( − 0.702, P < 0.001), social functioning ( − 0.764, P < 0.001), role emotional ( − 0.675, P < 0.001) and mental health ( − 0.507, P < 0.001). The symptom-specific well-being of COMI-neck was poorly correlated with Neck Dysfunction and Disability of NPDS (0.083, P = 0.366), Pain Intensity with Movement (0.078, P = 0.394) and Static Neck Pain Intensity (0.055, P = 0.545).

Table 5 Construct validity of COMI-neck

To further explore construct validity, we did factor analysis and found that 3-factor loading explained 71.558% of the total variance [Kaiser–Mayer–Olkin (KMO) = 0.780, C2 = 502.82, P < 0.001]. Bartlett’s test of sphericity was significant (df = 21, P < 0.001) (Fig. 2).

Fig. 2
figure 2

Scree plot of eigenvalues from the 7-item COMI-neck questionnaire

We get the CMIN/DF = 1.813, Tucker–Lewis Index (TLI) = 0.966 (> 0.9), Comparative Fit Index (CFI) = 0.982 (> 0.9), Normed Fit Index (NFI) = 0.961, Root Mean Square Error of Approximation (RMSEA) = 0.082 (< 0.5), which indicates that the model fits well. The AVE and CR values also reflect the validity (Table 6).

Table 6 AVE and CR of COMI-neck

Discussion

We have successfully cross-culturally adapted the COMI for use with patients with neck pain in the Chinese Simplified language. As HRQoL questionnaires play an increasing role in modern healthcare, hospitals, government agencies and research institutions are conducting more outcome studies. Short questionnaires such as COMI-neck are ideal for longitudinal assessment of treatment outcomes [34].

Translation and cross-cultural work are carried out in strict accordance with standardized procedures. There are no systemic problems. We have found that the Simplified Chinese version of the COMI-neck has a high degree of compatibility with the original COMI-neck.

The absence of floor and ceiling effects in the Simplified Chinese version of COMI-neck indicates acceptable internal consistency. This is consistent with previous findings [17, 18]. The physical and emotional dimensions of the SF-36 scale showed a large floor effect, both greater than 80%. This may have been due to the following reasons. Firstly, most of the patients included in the study suffered from mild to moderate neck pain and were treated conservatively in an outpatient setting, and secondly, most of the patients had a chronic course with no acute or recurrent symptoms, so they take this for granted. In life or at work, they therefore take it for granted.

The internal consistency of COMI-neck was nearly good (Cronbach’s alpha = 0.861), it is better than the Turkish version [17], but the original version [16], Polish version [18], Italian version [35], Cronbach's alpha is not described in the article. The authors measured ICC values for 5 questions in the original version of COMI-neck. The average ICC value was 0.768 [21]. The ICC value for the Polish version [18] is 0.878. The ICC value for the Turkish version [17] is 0.959 and the ICC value for the Italian version [35] is 0.87. The Simplified Chinese version of COMI-neck has a higher ICC value than the other versions, with the exception of the Turkish version. The original version was retested after 3 days, the Polish version took on average 8 days, and the remaining versions took 7 days. It is explained that the test and retest interval is an important factor that has a significant impact on the ICC value.

We correlated the COMI-neck with the NPDS, NDI, VAS, and SF-36 and found it to be consistent with what we had hypothesized. The higher the scores of COMI-neck, NPDS, NDI and VAS, the worse the patient's function; the lower the score of SF-36 in each dimension, the better the patient's function or quality of life. Therefore, the correlation coefficient between COMI-neck and SF-36 is below 0. The correlation coefficient between the three NPDS dimensions and the symptom-specific well-being dimension in COMI-neck is low. This may be because the COMI-neck questions are about the rest of the patient's life, most patients are optimistic, and the NPDS questions are about current physical condition. In our study, the correlation coefficients between the COMI and the NDI, VAS, and physical subscales (bodily functioning, bodily role, bodily pain, and general health) of the SF-36 were relatively higher. Exploratory factor analysis confirmed COMI-neck's robustness and unidirectional structure.

This study showed the validity of COMI-neck. It discriminated between people with neck pain and other factors that affect daily activities. This finding is in line with the results of the COMI-neck in other languages as well.

The CFI, NFI, CMIN/DF, TLI and RMSEA values indicate a good model fit in the confirmatory factor analysis. The values of AVE and CR on the dimension of disability did not reach the expected values, while the other values indicate a good convergent validity and a good portfolio reliability of the collected questionnaires. This may be due to the small size of the sample used in the model.

This study has several limitations. Firstly, the responsiveness of the COMI-neck was not assessed, which could be done in the future. Secondly, although Simplified Chinese is the official language of China, China is a multiethnic country and each ethnic minority has its own language. Therefore, the issue of ethnic minorities should be considered. In addition, the sample size of patients is relatively small and does not represent the general population of mainland China. In addition, this review only examined patients with non-specific neck pain and did not include acute or subacute conditions or patients with specific neck diseases.

Conclusions

This study shows that COMI-neck has been successfully translated and cross-culturally adapted into Simplified Chinese and has sufficient reliability and validity to be used for standard assessment of patients with non-specific chronic neck pain.