Introduction

Patients diagnosed with cancer often experience sleep disturbance in the form of poor sleep patterns or insomnia, which can start on diagnosis and continue until the end of the individual’s life [1, 2]. It is estimated that sleep disturbance affects 50–75% of cancer patients [3, 4]. Sleep disturbance in cancer patients can have a negative impact on their quality of life (QoL), which includes their physical and psychological functioning [5, 6]. In addition, sleep disturbance has been associated with cancer recurrence [7, 8], and may result in poor healing [9], cognitive dysfunction [10] and reduced work activity [11].

One of the most widely used instruments that measures sleep quality in both clinical and non-clinical populations is the Pittsburgh Sleep Quality Index (PSQI) [12]. The scale is easy to understand, can be completed within 5 to 10 min, and contains 19-items that are combined into seven subscales: subjective sleep quality, sleep latency, sleep duration, habitual sleep efficiency, sleep disturbances, use of sleep medications and daytime dysfunction. The original PSQI was used to assess sleep quality in three groups: a healthy sample, patients with depression, and those with sleep disorders. The instrument exhibited good validity and test–retest reliability [12].

The PSQI has been used across a variety of clinical populations, such as those with diabetes [13], dementia [14], fibromyalgia [15] and cancer [16]. Mollayeva et al. [17] undertook a systematic review of 37 articles that evaluated the weighted mean difference of the PSQI for clinical and non-clinical samples. The review found that the PSQI’s mean global score was significantly higher in clinical samples compared to non-clinical ones.

The PSQI has been translated into different languages and cultures, including Greek [18], German [19], Chinese [20] and Spanish [15], all versions exhibited adequate psychometric properties. In addition, the PSQI has been translated and validated with an Arabic sample living in the United States [21]. However, whilst it is acknowledged that Arabic countries share a common language with this American sample, there may be cultural differences that need to be taken into consideration. In addition, the American-Arabic version of the PSQI was tested in a non-clinical sample [21].

To the best of our knowledge, the PSQI has not been validated in cancer, clinical populations in Arabic countries [22]. Therefore, the aim of this study was to examine the validity and reliability of the Arabic PSQI with a heterogeneous sample of cancer patients who have completed surgical intervention, chemotherapy and/or radiotherapy treatment. Validation of the Arabic PSQI in an Arabic country will provide a suitable and acceptable cultural fit instrument for the screening and assessment of sleep disturbance in clinical practice. The original PSQI was specified as a single factor model [12]. In cancer populations, it has been found that the one-factor model fits the PSQI global score [23], whereas Otte et al. [24] have suggested a two-factor model. Therefore, a one factor and two-factor model were examined in this study.

Methods

Participants

A cross-sectional survey was used and participants were recruited from the National Oncology Centres outpatient clinic in Oman. Data was collected from November 2018 to January 2019. Inclusion criteria for participation were as follows: adult patients over 18 years of age, able to speak and write in Arabic, no known psychiatric or neurological disorders that could interfere with study participation, a diagnosis of any type of cancer and completion of surgical intervention, chemotherapy and/or radiotherapy treatment at least one month prior to recruitment. Participants were excluded if they were undergoing chemotherapy or radiotherapy treatment or were newly diagnosed cancer patients awaiting surgery.

Ethical permission was sought from the Research and Ethical Review and Approval Committee in the Directorate General of Planning and Studies at the Ministry of Health, Oman.

Instrument

The PSQI is a self-completed questionnaire that assesses sleep quality over the previous month [12]. The scale has 19-items, five of which should be answered by bed mates or roommates. These five are not calculated in the global score of the PSQI, but rather are only used as clinical information. They were not included in this study. The 19-items are categorised into seven components: subjective sleep quality, sleep latency, sleep duration, habitual sleep efficiency, sleep disturbances, use of sleep medications and daytime dysfunction. The scores of each of the seven components range from 0 to 3 and a global score can be calculated by summing the seven components, giving a range from 0 to 21; higher scores indicate poor sleep quality. Buysse et al. [12] suggested that those with global scores of more than five should be classified as poor sleepers: those with scores of less than or equal to five should be classified as good sleepers. Demographic data was collected on age, gender, educational level, marital status, employment status, type of cancer, stage, type of treatment undergone and time since diagnosis.

Translation of PSQI

The PSQI Arabic version was translated and validated by Suleiman et al. [21] in a sample of healthy Arab Americans. To ensure the applicability of the instrument, in the current study, the PSQI Arabic version was compared with the original English version by two independent and bilingual translators, both of whom were aware of the study objective. This procedure was to ensure that the scales were translated into Modern Standard Arabic, which is employed for official use in all Arabic countries.

In order to examine the clarity and comprehensibility of the PSQI Arabic version in this study, a pilot study was carried out with ten patients diagnosed with cancer. This highlighted some difficulties regarding the layout of the questionnaire. Accordingly, the structure of the PSQI Arabic version was amended: all the questions were placed in a table so they were easy to identify and would not be missed by participants.

Statistical analyses

Data were analysed using the Statistical Package for the Social Sciences (SPSS) 25.0. The internal consistency of the PSQI was evaluated using Cronbach’s alpha coefficient for each subscale and the overall scale, with an alpha of 0.70 or higher considered as acceptable [25]. Floor and ceiling effects were calculated by the number of individuals in the total sample who achieved the lowest or highest levels of the scores in the scales; these were deemed of importance if more than 15% of the respondents achieved the lowest or highest possible scores [26].

Discriminative validity of the PSQI evaluated the ability of the scale to differentiate defined groups according to patient demographic and clinical characteristics. One-way analysis of variance (ANOVA) and independent sample t-tests were used to test the differences in the PSQI’s mean scores between gender, cancer site and comorbidities.

Construct validity was tested with Confirmatory Factor Analysis (CFA). The PSQI was originally specified as a single factor model, which was proposed as a unidimensional structure [12]. In cancer populations, Ho and Fong [23] and Fontes et al. [27] provided supported for the one-factor model, however, Otte et al. [24] have suggested that a two-factor model is the best fit in breast cancer populations. Therefore, CFA was conducted using AMOS 25 to exam the fit statistics of the two models. The one-factor model, as given by the original publication [12], included all seven components. In the two-factor model as Otte et al. [24] suggested, the first factor encompassed sleep quality to include subjective sleep quality, sleep latency, sleep disturbances, use of sleep medication and daytime dysfunction. The second factor encompassed sleep efficiency and included sleep duration and habitual sleep efficiency.

The following criteria were used to evaluate the model fit according to Hair et al. [28]; non-significance of chi-square (χ2–test); chi-square per degree of freedom (χ2/df) of less than three; root mean square error of approximation (RMSEA) of less than or equal to 0.08; comparative fit index (CFI) greater than 0.95; the goodness of fit index (GFI) and adjusted goodness of fit index (AGFI) of more than or equal to 0.90 to indicate good fit; and the Tucker-Lewis index (TLI) greater than 0.90.

Results

Of the 400 patients eligible to participate, 369 completed and returned valid questionnaire booklets, giving a response rate of 92.2%. The demographic and clinical characteristics of the participants are presented in Table 1. The majority of participants were female (66.4%) and three quarters were married (74%). Over 55% of the participants completed secondary level of education or held a degree. Approximately 37% of the participants were diagnosed with breast cancer and 14% with colorectal cancer. The global PSQI scores ranged from 2 to 19 with a mean of 9.22 (SD 4.21). Using the cut off criteria of total PSQI score of more than five, as suggested by Buysse et al. [12], 288 (78%) participants were identified as poor sleepers.

Table 1 Demographic and clinical characteristic of participants (N = 369)

Factorial validity

A one-factor model was tested (χ2 (df = 14) = 178.035, P < 0.001, GFI = 0.88, AGFI = 0.76, CFI = 0.76, TLI = 0.65, RMSEA = 0.18. and χ2/df = 12.71), which indicated the poor fit of the model. Based on the modification indices, a path of covariance was then added between error terms for component three (sleep duration) and component four (habitual sleep efficiency). The refined one-factor model fit indices of the sample were χ2 (df = 13) = 35.18, P < 0.001, GFI = 0.97, AGFI = 0.94, CFI = 0.97, TLI = 0.95 RMSEA = 0.06 and χ2/df = 2.7, which indicates an acceptable fit for the data (Fig. 1). The two-factor model performed better fit (χ2 (df = 13) = 35.18, P < 0.001, GFI = 0.97, AGFI = 0.94, CFI = 0.97, TLI = 0.95, RMSEA = 0.06. and χ2/df = 2.7) (Fig. 2), which shows a similar acceptance of fitness indices of the refined one-factor model.

Fig. 1
figure 1

Confirmatory factor analysis for the refined one-factor model of PSQI Arabic cancer patients (n = 369) with correlation between e4 and e5. χ2 (df = 13) = 35.18; P value < 0.001; χ2/df = 2.7; Goodness of Fit Index (GFI) = 0.97; adjusted goodness of fit index (AGFI) = 0.94; Comparative Fit Index (CFI) = 0.97, Tucker-Lewis Index (TLI) = 0.95; Root Mean Square Error of Approximation (RMSEA) = 0.06

Fig. 2
figure 2

Confirmatory factor analysis for the two-factor model of PSQI Arabic cancer patients (n = 369) with correlation between e4 and e5. χ2 (df = 13) = 35.18; P value < .001; χ2/df = 2.7; Goodness of Fit Index (GFI) = 0.97; adjusted goodness of fit index (AGFI) = 0.94; Comparative Fit Index (CFI) = 0.97, Tucker-Lewis Index (TLI) = 0.95; Root Mean Square Error of Approximation (RMSEA) = 0.06

Reliability

The global PSQI score did not exhibit floor or ceiling effects for the Arabic cancer patients; none of the participants had the lowest possible score of 0 or the highest possible score of 21. However, some of the seven components did demonstrate floor and ceiling effects. Ceiling effects were observed in subjective sleep quality, sleep latency, sleep duration and habitual sleep efficiency; more than 15% of participants achieved the highest score. Floor effects were apparent in the use of sleep medication and daytime dysfunction, as more than 15% of the respondents achieved the lowest score.

Internal consistency was assessed using Cronbach’s alpha. The PSQI Arabic version score was 0.77, which was considered to be acceptable. All seven components appeared to measure a particular aspect of the PSQI. The largest component-total correlation coefficient was found for subjective sleep quality (0.62), whereas the smallest was found for sleep disturbances (0.38). The items’ total correlations and Cronbach’s alpha if item deleted of the seven components and total score compared with the original validation of the PSQI (Table 2).

Table 2 Internal consistency. Cronbach’s α and ceiling-floor effect for PSQI component and total score (N = 369)

Discriminative validity

Differences between the mean of the global PSQI according to the gender, cancer site, cancer stage and comorbidities are shown in Table 3. The mean of the global PSQI was highest among patients with comorbidities compared to patients without (P < 0.001). Gender showed no statistically significant differences on PSQI global score (P = 0.53). Patients with lung (M = 12.95, SD = 3.53), gastrointestinal (M = 12.05, SD = 3.96) and brain (M = 11.84, SD = 4.89), cancer had significant differences in reporting the higher mean scores on global PSQI (P < 0.001). The global PSQI was able to discriminate between cancer stages (P = 0.006). Post hoc analyses using Turkey’s HSD indicated that cancer stage four had higher mean global scores (M = 10.69, SD = 4.29) than other cancer stages. This indicated that the PSQI Arabic scale has discrimination validity.

Table 3 Global PSQI score according to the patient demographic and clinical characteristics

Discussion

Assessment of sleep problems for Arabic cancer patients is necessary, however, a valid and reliable sleep instrument is needed to support clinical decision making for interventions that can improve sleep. To our knowledge, this is the first study to examine the psychometric properties of the PSQI in patients diagnosed with cancer in Arabic populations. This study demonstrated that the Arabic version of the PSQI tested in a cancer population demonstrates adequate reliability as well as discriminative and construct validity. Confirmatory factor analysis supported a one factor and two factor model, although the one-factor model performed best. Although PSQI components had floor or ceiling effects, the global PSQI score did not show either floor or ceiling effects. This indicates that the item analysis supported the content validity of the overall score of the scale [26].

The study found the prevalence of poor sleepers (PSQI > 5) was 78%, which is higher than findings by George et al. [29] who reported poor sleeper as 64%, but consistent with Wu et al. [30], who reported 77.3% of poor sleeper in cancer survivors.

There was an acceptable degree of internal consistency between the total score of the Arabic version of the PSQI and the seven components: Cronbach’s alpha was 0.77, which was considered to be acceptable. All seven components appeared to measure a particular aspect of the PSQI. Previous studies have reported similar results in cancer populations [16, 27, 31]. Mollayeva et al. [17] conducted a systematic review of 37 studies that evaluated the measurement properties; they found Cronbach’s alpha to be between 0.70 and 0.83. The original internal consistency proposed by Buysse et al. [12] was 0.83. The Arabic version of the PSQI in non-clinical populations reported an alpha reliability of 0.65 [21]. Thus, the findings related to internal consistency from this study were consistent with the published literature.

Adequate items’ total correlations provided further support for internal reliability consistency. Six components of the PSQI reported the items’ total correlation coefficients levels between 0.66 (subjective sleep quality) and 0.42 (habitual sleep efficiency). Only sleep disturbance components had 0.38 (Table 2). This indicates that Arabic PSQI had good internal consistency. Similarly, Buysse et al. [12] have reported that the item total correlation of sleep disturbance was 0.35. The sleep disturbance component was answered by nine questions, which could result in variations between participants [12]. The items’ total correlation needed to be above 0.3, which was set as the minimum level [25].

The PSQI appeared to be capable of discriminating between groups that differed in cancer site, prognosis and comorbidities. Similar discriminative validity has been noted in psychometrics studies among patients with cancer [16, 18].

The current study found that a refined one-factor model showed acceptable fit with the residual covariance between component three (sleep duration) and component four (habitual sleep efficiency); this can be explained as these two components were answered in the PSQI by question one, ‘usual bed time at night’, question three, ‘usual getting up time in morning,’ and question four, ‘hours of sleep per night’ as both components were derived from the same questions. In this study, the participants might have miscalculated both sleep duration and sleep efficiency. Several studies have reported low accuracy or difficulty for participants to determine the time of falling asleep [32, 33]. In accordance with previous studies, this correlation was found between component three (sleep duration) and component four (habitual sleep efficiency) [23, 24, 27, 34, 35].

None of the refined one and two-factor model structures had absolute fit as chi-square (χ2) had significant p-value. Despite this, both the refined one and two-factor models had few cut-offs of the fit indices in GFI, AGFI, CFI and RMR, which indicted an acceptable fit. Three studies have investigated the CFA of the PSQI among cancer patients and found similar results to the present study, but their conclusions were different. Otte et al. [24] suggested that a two-factor model was a better fit than the one-factor model of the original PSQI. Whereas two other studies supported the one-factor model of the original PSQI [23, 27]. This study supports the original single factor structure of the PSQI that Buysse et al. [12] proposed.

There are several limitations in this study. The study employed a cross-sectional design. The stability of the instrument was not reported using test–retest reliability, therefore further assessment of test–retest reliability is needed. This study did not test the concurrent validity, further research study should assess sleep quality by PSQI and another self-reported tools such as Insomnia Severity Index (ISI) [36] to identify the correlations between them. In addition, the study did not have control groups of non-cancer samples who also complain of sleep disorders. The data were collected after treatment was completed and so the results are not generalizable to patients currently receiving treatment.

The findings demonstrate that the Arabic version of the PSQI demonstrates acceptable psychometric properties in cancer patients and that it can be used to measure sleep quality. The current results support the view that the PSQI is best represented as being a unidimensional scale in an Arabic sample.

Conclusion

Arabic patients with cancer may affected by sleep disturbance, during or after treatment of cancer. A validated tool to assess sleep disturbance is needed to address the sleep problem in Arabic patient with cancer. This study found that Arabic version of the PSQI has acceptable psychometric properties of internal consistency reliability as well as discriminative and construct validity. Therefore, the Arabic version of the PSQI is a reliable and validated instrument to assess sleep quality in Arabic cancer populations. Further research on the Arabic version should evaluate test–retest reliability.