Introduction

The prevalence of kidney stones (KS) has been rising worldwide but is more noticeable in developing countries. Moreover, KS has a high recurrence rate of 30% at 10 years and often triggers the need for invasive procedures [1,2,3]. Despite Spanish language ranks among the three most spoken languages around the world, the impact of KS over Spanish-speaking patients’ lives has not been widely reported. In 2002, Medina-Escobedo et al., found a self-reported KS prevalence of 5.5% in southeast Mexico. Furthermore, KS was the leading indication for a nephrectomy (~ 60%) in the health system database [4, 5].

Patients with KS have a lower health-related quality of life (HRQOL) when compared to non-stone formers. Likewise, patients with stones and/or stone-related symptoms have lower HRQOL than patients who are free from stones and/or symptoms. KS affects patients’ quality of life (QoL) in several spheres, including work, financial expense, and lifestyle. Therefore, it has been suggested that HRQOL should be an important outcome of KS management [7, 8]. When evaluating the impact of KS disease, related complications, and the procedures performed for its diagnosis and treatment, urologists usually rely on several outcomes or markers to determine the success of treatment. (i.e., stone-free rate). However, the patient’s own perception of his/her illness as an indicator of success should be considered, and the HRQOL assessment is an objective and accurate way to do it.

Several tools to assess patients’ HRQOL have been validated in Spanish, such as the widely used Short Form (SF36) health survey, but some particularities among diseases can interfere in how HRQOL is measured [9]. Therefore, a disease-specific evaluation of HRQOL is a better approach. Recently, the Wisconsin Stone Quality of Life Questionnaire (WISQOL), which contains 28 items grouped into 4 domains, was developed and validated to assess HRQOL in patients with KS. This questionnaire has proven to be a better assessment tool measuring HRQOL associated with KS than the SF36.[10, 11]. Our objective was to develop and internally and externally validate a Spanish version of the WISQOL (S-WISQOL) in a Mexican population. A secondary objective was to assess the HRQOL of patients with different profiles of KS at two medical centers using the validated instrument.

Methods

Translation

After approval from authors of the original WISQOL, a double-back translation was performed by two different certified translators. Then a panel of urologists (MBM, DOP, CMP, JPFT) discussed wording discrepancies between the two translated versions and a first draft was obtained. It was administrated to 5 subjects for feedback. Writing and syntactic corrections were made resulting in the final version of the S-WISQOL (Online Resource 1). The Fernández-Huerta method was used to assess the readability of both questionnaires, the S-WISQOL and SF36 [12].

Inclusion criteria and data selection

All participants prospectively included were native Spanish-speakers and stone formers living in Mexico. Participants with at least elementary school education (6 years of schooling), without any visual or cognitive impairment, and without acute illness were invited to enroll at the 2 medical centers involved in the study (Merida and Monterrey). Patients who did not complete the questionnaire were excluded.

Additionally, sociodemographic and clinical data were obtained through an interview. Stone-related issues such as years with KS, hospitalizations and procedures due to KS, JJ status, and single kidney status, were documented to evaluate the overall impact of KS. Every participant signed an informed consent, and the study protocol was approved by the ethics and research committee from each of the institutions that recruited subjects.

Statistical analysis

Internal consistency of the S-WISQOL was obtained with Cronbach’s α overall and between centers. External validity was assessed by comparing S-WISQOL results between both centers. The Spearman test was used for item and domain correlations and for S-WISQOL total score. Correlation of total scores of the S-WISQOL and SF36 was assessed to determine convergent validity. Correlations between clinical variables and results from the S-WISQOL and SF36 were analyzed to assess the effects of these on results. In addition, S-WISQOL total scores were compared between subjects from different clinical settings to determine any impact on HRQOL.

For categorical variables the χ2 test was used. The U Mann–Whitney or T test were used for quantitative data, according to its distribution as assessed by the Kolmogorov–Smirnov normality test. Data were analyzed with IBM® SPSS™v.21, p values < 0.05 were considered statistically significant. Cronbach’s α and correlation coefficients results were interpreted as in the original WISQOL development study [11].

Results

A total of 87 subjects completed the S-WISQOL and SF36. Subject’s baseline characteristics are summarized in Table 1. The Fernandez–Huerta readability test found the S-WISQOL as “easy”, and the SF36 as “fairy easy”, which means that 6 and 5 years of schooling, respectively, are needed for understanding. A good correlation between SF36 and S-WISQOL total scores was observed (r = 0.75, p < 0.001) as shown in Fig. 1.

Table 1 Baseline characteristics of subjects who completed the S-WISQOL and SF36 at both centers
Fig. 1
figure 1

​Correlation between S-WISQOL and SF36 total scores in stone-formers

S-WISQOL internal consistency overall and at each center is presented in Table 2. Spearman correlation coefficients ranged from 0.60 to 0.89 (mean 0.73) and were, thus, considered as good. Interdomain and domain–total score Cronbach’s α coefficient went from acceptable to excellent. In the Merida sample, domain 1–2 and 2–4 coefficients were lower, confirming that they are measuring different constructs.

Table 2 Internal consistency comparison. Overall and at both centers

Domains comparison between centers is presented in Table 3.

Table 3 WISQOL’s total scores and domains; overall and comparison between centers

A weak correlation of years with KS and S-WISQOL score (− 0.283, p = 0.008) was found, but this was not true for the SF36 (p = 0.058). Similarly, the number of hospitalizations due to KS and S-WISQOL total score (− 0.254, p = 0.018) were correlated, but neither was correlated with the SF36 (p = 0.153).

There were no differences for S-WISQOL total scores between subjects with and without an indwelling JJ-stent at enrollment (p = 0.255). Other comparisons were also not significant, such as between those receiving renal replacement therapy and not (p = 0.787), those with and without type 2 diabetes mellitus (p = 0.788), those with and without hypertension (p = 0.818), or subjects with single-kidney status and those with both kidneys intact (p = 0.314).

Discussion

This comprehensive study shows that the S-WISQOL is a valid tool to assess the HRQOL of Spanish-speaking patients with KS. Spanish is the second most spoken language worldwide; thereby, this work provides a reliable and easy-to-read instrument to assess the disease-specific HRQOL in Spanish-speaking countries or Spanish-speaking immigrants.

Convergent validity was shown by a good correlation between S-WISQOL and SF36 total scores (Fig. 1). The SF36 can be influenced by other comorbidities; whereas, the S-WISQOL focuses solely on the effects of KS on HRQOL. Similar results were found in the original WISQOL and in a recently validated Turkish version, showing a moderate and acceptable correlation, respectively. Moreover, we found that readability between both tests was similar and all subjects enrolled in this study met the required years of schooling for proper comprehension; hence, the differences found between the S-WISQOL and SF36 questionnaires are not attributable to its understanding [9, 13].

We found that internal consistency was comparable between centers. In addition, interdomain correlations were all acceptable, and total score–domain measures ranged from good to excellent. Spearman correlation coefficients were considered good (Table 2). These results show the strong impact of each domain on S-WISQOL scores. Correlations between domains 1 (social) and 2 (emotional), and between domains 2 and 4 (vitality) were moderate. Likewise, psychometric analysis of the Turkish version of the WISQOL found a moderate correlation between domains 2 and 4, and so did the analysis of the original WISQOL. In agreement with authors, it is considered that higher correlations would have suggested insufficient differences between subscales [9, 13].

External validity analysis showed that S-WISQOL scores were different between centers (Table 3). This finding supports that HRQOL impairment is different for each patient, as subjects from Merida had a longer duration of time with KS, and higher numbers of procedures and hospitalizations due to KS. All these features showed a correlation with total S-WISQOL scores. Thus, the lower scores of the S-WISQOL among these patients are reasonable. Nevertheless, these variables did not affect SF36 results, indicating that the S-WISQOL is better at detecting stone-related conditions affecting HRQOL than the SF36. These findings, altogether with the easy readability results, confirm the external validity and reliability of the S-WISQOL. Likewise, S-WISQOL score was not influenced by other medical conditions that are prone to affect HRQOL, like the presence of a JJ-stent, renal replacement therapy, type 2 diabetes mellitus, and hypertension, enhancing further the fact that the S-WISQOL is an effective instrument to assess disease-specific HRQOL of patients with KS. In fact, evidence supports that HRQOL is impaired in asymptomatic stone patients, suggesting that the WISQOL may be sensitive to underlying effects of KS that patients may not sense or report as “symptoms” [14].

Domain score comparisons between centers showed differences in all domains but 3 (stone-related symptoms). This suggests that baseline differences between patients from each center, as previously mentioned (Table 1), may have impacted the social, emotional, and vitality domains and might explain the differences found for S-WISQOL scores.

This observation is supported by a recent review showing that most patients with KS have worse HRQOL, as assessed by the SF36, the most affected domains were bodily pain and general health, concluding that the way KS affects HRQOL might have a role in the decision-making for treatment [8].

Contrary to the development of the original WISQOL, in this work, all patients had stones at enrollment. There is evidence supporting that the stone itself may cause worst HRQOL. A recent publication found that asymptomatic stone formers with current stones, whether they were aware or not, had lower HRQOL than asymptomatic patients without stones at the time they were surveyed [10].

Donnally et al., conducted a follow-up study to assess changes in HRQOL over time using the SF36. Although low scores were obtained, no changes between baseline and follow-up were found. This is perhaps because the SF36 was not the most suitable tool for detecting changes in KS status at follow-up [15]. We believe that S-WISQOL is a better tool for assessing follow-up changes in HRQOL among stone formers. Other studies have demonstrated the impact on HRQOL of certain treatments for KS; Pérez-Fentes et al., found an improvement in the bodily pain domain of the SF36 after stone treatment [16]. Using the EQ-SD index, also a generic HRQOL instrument, Eryildirimir et al., found that patients who received medical expulsive therapy had better HRQOL than those treated for colic [17]. The Turkish WISQOL found an improvement in WISQOL total score and within the four WISQOL domains four weeks after percutaneous nephrolithotomy (p < 0.001). Likewise, using the WISQOL, Jiang et al., found that using a nephrostomy tube rather than a JJ-stent after a percutaneous nephrolithotomy leads to better HRQOL; and de Sousa Morais et al., supported this finding, as they reported a higher negative impact on HRQOL in patients undergoing retrograde JJ-stent placement than those treated with percutaneous nephrostomy placement for urgent urinary diversion due to KS [18, 19]. Other studies have showed that stone formers who use potassium citrate or thiazides, and stone formers with cystinuria undergoing tiopronin, scored better in all WISQOL domains vs. those who were not prescribed these medications [20, 21].

Further studies assessing HRQOL in KS patients using the S-WISQOL are warranted in the next few years, and the resulting data could lead to better and yet more informed decision-making. The self-reported perceptions of patients about their own disease is a tangible and objective way to assess HRQOL. This is important as everything that is measured might be improved.

Some limitations are identified. First, the validation was conducted in patients from northeast and southeast Mexico. To assess the broader generalizability of the S-WISQOL within a Spanish-speaking population, further studies with wider cultural backgrounds and involving patients from diverse regions are needed. Second, a relatively small number of patients was included. Finally, larger-scale external validations are needed as are assessments of the predictability of the S-WISQOL in capturing changes over time in a patient’s stone and/or stone-related symptom status.

Conclusions

KS impairs patients’ HRQOL. The recently developed S-WISQOL is a valid and reliable instrument for assessing disease-specific HRQOL. With this instrument, the impact of KS overall and of related diagnostic and therapeutic procedures can be objectively assessed. Here, we provide a valid, reliable, and easy-to-read Spanish version of the WISQOL questionnaire which will be useful to both urologists and patients in Spanish-speaking settings as we move forward to improving our patients’ HRQOL.