Introduction

Prostate cancer (PCa) is the most common cancer among men in Germany, with 57,370 newly diagnosed cases in 2014 [1]. As the survival rates are increasing [2] and the 5-year as well as the 10-year overall survival rates are at a high level (91% and 90%, respectively in 2014) [1], patient-reported outcome (PRO) measurements of PCa-related symptoms and functions are an important part of clinical outcome measurement. PCa patients often describe specific cancer-related and therapy-related adverse effects (AEs). Disease-related symptom measurement in PCa patients should therefore differentiate between illness-specific domains such as urinary incontinence, bowel function, sexual function, and endocrine function [3,4,5]. Measurement of these PCa-related symptoms is of interest not only in relation to investigation of individual patients’ AEs during PCa treatment, for example, but also for comparing patient outcomes across providers.

A widely established instrument for health-related quality of life (HRQoL) assessment in men with PCa is the Expanded Prostate Cancer Index Composite with 26 items (EPIC-26). The EPIC-26 questionnaire is an abbreviated version of the EPIC-50 questionnaire [6], which is used in many completed and ongoing studies. The original EPIC included a set of 50 questions. As there was a need to facilitate HRQoL measurement to allow more comprehensive research and clinical approaches, Szymanski et al. developed the EPIC-26 [7]. Subsequently, Schmidt et al. identified the EPIC-26 as being the HRQoL instrument with the best characteristics for managing change in PCa care [8]. Psychometric test results are available for several translations of the EPIC-26 (e.g., into Chinese, Norwegian, and Canadian French) [9,10,11].

A German translation of the EPIC-26 was published by Beyer et al. [12]. Umbehr et al. [13] validated a German version of the EPIC questionnaire with 50 items (including the items in the shorter EPIC-26) using a translation different from that used by Beyer et al. as the authors were focusing more on a cultural-adaptational approach resulting in a questionnaire applicable not only in Germany, but also in the German-speaking parts of Austria and Switzerland. No results from psychometric testing of the validity and reliability of the German version of EPIC-26 as translated by Beyer et al. have yet been published. The aim of the present study was therefore to test the German version of EPIC-26 for reliability and validity using responses from patients with PCa who answered the questionnaire before and 1 year after definitive treatment. Data from the ongoing prospective, multicenter Prostate Cancer Outcomes (PCO) study (www.pco-study.com) [Kowalski et al. submitted] were used. The German PCO study is part of an ongoing international study (the TrueNTH Global Registry) that was initiated by the Movember Foundation. The aim of the study is to compare clinical and patient-reported outcomes internationally for patients with localized PCa [14]. Fourteen countries—Austria, Australia, Canada, the Czech Republic, Denmark, Finland, Germany, Ireland, Italy, Spain, the Netherlands, New Zealand, the UK, and the USA—are currently participating in the TrueNTH Global Registry.

Ethical approval

The PCO study was approved by the local ethics committee in Berlin (Eth-12/16). All of the patients included provided written informed consent.

Materials and methods

Patients

Patients with localized or advanced PCa (any T, any N, M0) are asked to complete the EPIC-26 questionnaire before and 1 year after definitive treatment. Patients treated with active surveillance (AS) or watchful waiting (WW) are asked to complete the follow-up questionnaire 12 months after being included in the study. PCa-specific clinical parameters (e.g., cancer stage according to the German Guideline for Prostate Cancer presented in Appendix 4, type of therapy, comorbidities) as well as patient-reported outcomes (PROs) measured using the EPIC-26 were collected in a setting of consecutive inclusion of patients between July 2016 and March 2018, in accordance with the reference guide of the International Consortium for Health Outcomes Measurement (ICHOM) standard set for localized prostate cancer [15, 16]. The study includes PCa patients receiving radical prostatectomy including cystoprostatectomy (RPE), radiation therapy (RT), watchful waiting (WW), active surveillance (AS), and other treatments (OT, such as androgen deprivation or focal therapy). All risk groups (low, medium, high, T3/T4, N1) are represented.

Questionnaire validation was conducted on the basis of the responses given by these patients in the participating German prostate cancer centers. Since this ongoing study started in July 2016, post-treatment data (12 months after study inclusion) were already available for patients who were recruited early during the study.

The German version of the EPIC-26

The EPIC-26 is a set of 26 items, which are categorized into one single item (overall urinary problems) and five subdomains consisting of four to six items: incontinence (four items), irritative and obstructive urinary problems (four items), bowel function (six items), sexuality (six items), and vitality/hormonal function (five items). Responses are measured on a Likert-type scale. The original and German versions of the EPIC-26 are included in the appendixes (Appendixes 2 and 3).

The German version of the EPIC-26 published by Beyer et al. [12] is used in this study. The process of translation and the evaluation of the translation are described in that paper.

Statistical analysis

Reliability was measured using the responses before definitive treatment by calculating interdomain correlations and Cronbach’s alpha coefficients [17] as an indicator of internal consistency. Item-to-scale correlations for each subdomain were carried out as described by Cohen [18]. Sensitivity to change was examined by calculating correlations between EPIC-26 scores before and 12 months after treatment for those patients with available data. Paired Student’s two-sample t test was performed.

Construct validity was analyzed on the basis of responses before definitive treatment by using a confirmatory factor analysis to examine the five-subdomain structure and the scales’ dimensionality in the EPIC-26. Beforehand, an exploratory factor analysis was performed to analyze whether a five-subdomain structure is statistically tenable. Maximum likelihood with robust correction was used.

The open-source software program RStudio 1.1.383 was used for all statistical analyses.

Results

Sample

A total of 3094 PCa patients from 44 German PCa centers were included. At the time of data transfer (April 2018), post-treatment data were already available for 521 of these patients (within the 1-year follow-up time frame, 839 patients were asked to answer the post-treatment questionnaire). The mean age at diagnosis in the total patient sample was 66 years. In all, 2608 patients were treated with RPE alone; 221 with RT; 172 patients with a combination of RPE and RT; 79 patients with AS or WW; and 14 patients received other treatment options, including radical cystectomy (cf. Kowalski et al. submitted) (Table 1).

Table 1 Characteristics of the patients (n = 3094)

Statistics

Reliability

Reliability was investigated before treatment (T0) on the basis of the total patient sample. Item-to-scale correlation ranged from 0.32 to 0.95, with the lowest coefficients for the items “bloody stools (#6d)” (r = 0.37) and “breast problems (#13b)” (r = 0.32) (Table 2). Internal consistency was assessed by calculating Cronbach’s alpha for the EPIC-26 subdomains, where a Cronbach’s alpha > 0.65 was considered sufficient [19]. This was possible for all but one subdomain; for the urinary irritative/obstructive subdomain score alone, Cronbach’s alpha was 0.64 (Table 3). Table 2 provides an overview of the characteristics of the subdomains (in comparison with selected findings from Szymanski et al. [7]).

Table 2 Item-to-scale correlation coefficients, mean scores, and factor loading for the German and American versions of the EPIC-26 (before treatment)
Table 3 Characteristics of the domains in the German version of the EPIC-26 questionnaire in comparison with selected American findings (T0: n = 3094, T1: n = 521)

Table 4 shows the interdomain correlation matrix. The low correlation scores (range between 0.11 and 0.32) indicate sufficient discrimination between the single scores and thus confirm the five-dimension structure of the EPIC-26 (all P < 0.001).

Table 4 Interdomain correlation matrix, P < 0.001

Sensitivity to change

Sensitivity to change was investigated in 521 patients for whom data were available for questionnaire responses 12 months after treatment. For all subdomains, significant changes over time were found (P < 0.001), with the greatest difference being seen in the sexual domain (Table 3).

Construct validity

Construct validity was investigated before treatment using the total patient sample. After factor extraction using Kaiser’s criterion (all factors with an eigenvalue greater than one are retained), solutions with up to six factors were possible. Exploratory factor analysis was therefore performed with four, five, and six latent variables. After extraction of five factors and factor rotation using the varimax method, the results showed a fit at χ2 = 1664.44 (n = 3094, degrees of freedom 185, P < 0.001), a standardized root mean square residual (SRMR) of 0.02, and a Tucker–Lewis index of 0.94. This five-factor solution was superior to the four-factor and six-factor solutions (cf. Appendix 1). For all exploratory factor analyses, item #5 was not taken into account, as it is not used in the original EPIC-26 or any validated translation of it. A confirmatory factor analysis with five latent variables (the five domains of the EPIC-26) was thus estimated using the robust maximum likelihood estimator. After 68 iterations, the results showed a fit at χ2 = 991.72 (n = 3094, degrees of freedom 265 after 60 iterations, P < 0.001), a comparative fit index (CFI) of 0.95, a root mean square error of approximation (RMSEA) of 0.04, and an SRMR of 0.04. Factor loadings ranged between 0.12 and 0.95, with the weakest association with their latent variable for the items “hematuria (#4c)” (B = 0.22), “bloody stools (#6d)” (B = 0.19) and “breast problems (#13b)” (B = 0.12) (Table 2).

Discussion

The aim of this study was to validate the German version of the EPIC-26 questionnaire. In general, the results of the psychometric tests described are similar to those for the original American questionnaire [7]. As a result, the findings suggest that the German version of the EPIC-26 can be used as a reliable and valid instrument to analyze symptoms and function in PCa patients.

The mean scores for the different time points were comparable to previously published results [7, 10, 11] (Table 3). However, differences between the study samples and timing of measurements in comparison with validations carried out for other languages need to be carefully discussed, as some of the published studies report results of questioning before treatment and some after it. Eighty-five percent of the patients included in the present sample, compared with 76% in the source population for these patients, were treated with RPE. Common AEs consequently include erectile dysfunction, for example, which is less prevalent in patients who undergo RT. The relatively low score for the sexual domain 12 months after treatment (mean = 24.28) in the sample therefore needs to be interpreted with these circumstances being taken into account. However, it may be assumed that the over-representation of patients with RPE in the study does not impede the quality of the questionnaire validation presented here.

The findings showed that the German version of the EPIC-26 questionnaire has sufficient reliability. The internal consistency of the scales (with Cronbach’s alpha ranging between 0.64 and 0.93) and item-to-scale correlation coefficients (all ≥ 0.30) were acceptable. A Cronbach’s alpha > 0.65 was considered satisfactory. Only the urinary irritative/obstructive subdomain had a Cronbach’s alpha below this threshold (α = 0.64). This has also been reported for other versions of the EPIC-26 [10, 11], suggesting weaker internal consistency for this subdomain in general that is mainly due to the item “hematuria.”

Psychometric testing commonly requires item-to-scale correlation coefficients to be over 0.40, or at least not less than 0.30, to confirm adequate reliability. As reported above, item-to-scale coefficients of between 0.30 and 0.40 were found for the items “bloody stools” and “breast problems.” This is consistent with previously published results [10, 11]. For the “breast problems” item, this low correlation with the vitality/hormonal domain was also reported for the original EPIC-26 questionnaire. As these findings have now been confirmed by several studies, it may be suggested that the construction of the vitality/hormonal domain should be revised, and providers may be advised to interpret the single items rather than the total score.

An acceptable level of sensitivity to change was found, analyzed by comparing the responses of patients before and 12 months after treatment with only the bowel and vitality/hormonal subdomain, showing low mean differences which are still statistically significant. This corresponds to clinically known AEs in PCa patients after treatment [4, 20]. Since only 33 of the 521 patients included in the analyses of sensitivity to change were treated with RT, the differences in EPIC-26 reported mostly involve RPE-related side effects.

Four-dimension, five-dimension, and six-dimension structures in EPIC-26 were compared using exploratory factor analysis. Kaiser’s criterion suggested six factors. However, Kaiser’s criterion often overestimates the number of factors [21]. After comparison of the three possible solutions, and without taking into consideration the item “overall urinary problem” (item #5), which is not used for domain construction in any of the existing translations, extracting five factors appeared to be the most convincing approach in relation to factor loadings, discriminatory power, and the authors’ intention to cover the five domains in the instrument. Exclusion of the item “overall urinary problem” was based on the questionnaire’s construction process. If the item is permitted for the factor analysis, a four-factor solution combing items #1 to #5 (all urinary items) is also statistically approved. The five-dimension structure of the EPIC-26 was also confirmed using confirmatory factor analysis. The weakest loading of an item and its postulated factor (e.g., subdomain) were found for the items “hematuria” and “breast problems.” As these findings are also reported in other studies, the importance of these two items for the domain may be questionable.

Limitations

Test–retest reliability and convergent validity were not examined in the present study. The large sample size (n = 3094) and the multicenter approach may be regarded as strengths of the study.

Conclusion

Measuring patient-reported outcomes is attracting increasing interest in oncological research and also in health-care provision. Valid and reliable psychometric instruments are therefore needed to evaluate disease-related symptoms in patients with prostate cancer. An international consensus group has strongly recommended using the EPIC-26 questionnaire for comparable and standardized evaluation of PROs and AEs in patients with localized prostate cancer [15]. The psychometric findings presented here mean that the German version of EPIC-26 is now available as a well-validated and reliable instrument for further PRO research and can be recommended for German-speaking patients with prostate cancer.