Introduction

In Japan, the cancer incidence by head and neck region was estimated to be nearly 16,000 in 2009. The number of head and neck cancer (HNC) patients tends to increase yearly [1]. Dysphagia is common in patients with a history of HNC [2]. Dysphagia can lead to not only malnutrition but also aspiration pneumonia [3]. As a result, dysphagia causes discomfort and loss of quality of life (QoL) [4, 5].

In recent years, patients’ reported outcomes, especially using QoL questionnaires, in addition to the improvement of therapeutic outcomes, have played an important role [6]. There are some health-related QoL questionnaires for patients with HNC: the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire-Core 30 (EORTC QLQ-C30) and the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire-Head and Neck (EORTC QLQ-H&N), the Swallowing Quality-of-Life Questionnaire (SWAL-QOL), and the MD Anderson Dysphagia Inventory (MDADI) [7,8,9]. The EORTC QLQ-C30 is a general measure of health-related QoL for all patients with cancer. The EORTC QLQ-H&N is a disease-specific QoL instrument for patients with HNC. The Japanese versions of the EORTC QLQ-C30 and EORTC QLQ-H&N have already been validated [10]. The EORTC QLQ consists of one questionnaire including two sections: a core questionnaire 30 and an additional module for each tumor site. The combination of the two instruments, the EORTC QLQ-C30 and EORTC QLQ-H&N, provides a better measure of QoL as affected by HNC treatment side effects including dysphagia [7]. The SWAL-QOL is generally used for dysphagia-related psychological characteristics of HNC patients in the research field [11]. On the other hand, the SWAL-QOL is a QoL instrument for all patients with dysphagia, which is not specific to HNC. Therefore, the EORTC QLQ-C30, EORTC QLQ-H&N, and SWAL-QOL are commonly used for measuring QoL for HNC patients. In recent years, the scope of QoL questionnaires has widened. For example, health-related QoL utilizes outcome measures of therapeutic interventions, predictors of outcomes, and screening tools for diseases [12]. In fact, health-related quality of life (EORTC QLQ-H&N and SWAL-QOL) can predict subsequent survival using cutoff scores for screening as an example [13]. However, the EORTC QLQ-C30, EORTC QLQ-H&N, and SWAL-QOL have many questionnaire items. Therefore, there is a problem of the feasibility of completing these questionnaires [13]. The MDADI (containing 20 items and 4 subscales) was developed to assess dysphagia’s effects on the QoL of patients with HNC. The MDADI has two notable points. First, it is easy to use for clinical practice because this questionnaire has a small number of items. The developer of the MDADI also commented that information from the MDADI can be useful in treatment decision making by patients and physician [14, 15]. Second, the content validity of the MDADI was ensured using patients with HNC and their families, whereas the content validity of the SWAL-QOL was not specific for HNC patients. As a screening tool for HNC patients with dysphagia, the MDADI may be able to identify patients with high mortality risk with high accuracy [16]. The MDADI was considered useful for patients’ reported outcomes and as a screening tool in clinical practice. Nevertheless, there is no Japanese version of the MDADI.

On the other hand, swallowing function can be objectively assessed by various methods. Gold standards for examination and diagnosis of dysphagia include fiber optic endoscopic evaluation of swallowing (FEES) and videofluoroscopic swallowing study (VFSS) [17]. However, FEES and VFSS have some problems such as being time consuming, costly, and a burden for patients with dysphagia [18]. The Functional Oral Intake Scale (FOIS) is an objective and noninvasive dysphagia assessment tool. It has a seven-point scale reflecting the dietary intake of patients with dysphagia. The evaluator can assess swallowing function by comparing the seven severities of FOIS. In addition, the FOIS requires no training for the assessor, no VFSS is required for scoring, and has high inter-rater reliability. The validity and ratings of FOIS are found to be associated with dysphagia severity [19]. Past research showed that VFSS findings are correlated with QoL scores [20]. In addition, some studies have examined whether the FOIS was correlated with the MDADI [21]. However, little has been reported on the stepwise relationship between the severity of FOIS and MDADI scores, that is, whether or not there is a discrepancy between the severity of FOIS and MDADI scores.

From the above, the purpose of this study was to translate and validate the Japanese version of a dysphagia-specific quality of life questionnaire, the MD Anderson Dysphagia Inventory (MDADI-J). Our secondary purpose was to determine the stepwise correlation between the MDADI and FOIS by trend testing in Japanese patients with HNC.

Materials and Methods

Translation Process

We followed internationally accepted guidelines [22,23,24] (Fig. 1). First, two bilinguals (whose native language is Japanese and second language is English) translated the MDADI into Japanese with the permission of the original author (forward translation) [24]. The two experts compared the two versions, and an iterative consensus was obtained. Second, the consensus version of the Japanese translation was sent to two additional bicultural experts (whose native language is English and second language is Japanese), who translated it into English (back-translation). This back-translated version was subsequently compared to the original English version to ensure that the translations were accurate. We repeated the translation and back-translation procedures until the discrepancies between the original and back-translated versions were resolved.

Fig. 1
figure 1

Process of cross-cultural adaptation

Data Collection

Patients who fulfilled the following criteria were included in our study: (1) patients treated for head and neck cancer, (2) outpatients treated for HNC in Dentistry and Oral Surgery in NTT Medical Center Tokyo (Tokyo, Japan) and Oral Surgery and Oral medicine in Kyushu Dental University Hospital (Fukuoka, Japan), (3) aged 20 years or older, and (4) able to respond to a self-administered written survey. The patients were excluded by the following criteria: receiving treatment for a mental disorder or had a prior history of mental disorder.

During the survey period from October 2015 to August 2016, we recruited participants using a sequential sampling method in NTT Medical Center Tokyo and Kyushu Dental University. After that, we selected participants based on a random sample for test–retest reliability. This study was conducted with the approval of the institutional ethics committee of Tokyo Medical and Dental University (approval number 1256). Written informed consent was obtained from each of the participants prior to their participation in the study.

Survey Instruments Scoring

MDADI

The MDADI consists of 20 items containing four subscales: global (a single item), emotional (eight items), functional (five items), and physical (six items). Respondents were asked which response best reflected their experience in the past week, and response options were as follows: “strongly agree,” “agree,” “no opinion,” “disagree,” or “strongly disagree”; these were scored on a scale of one to five. On the other hand, E7 (I do not feel self-conscious when I eat) and F2 (I feel free to go out to eat with my friends, neighbors, and relatives) were scored as five points for strongly agree and one point for strongly disagree. The average score for each question of a subscale was multiplied by 20 to calculate subscale scores [14].

EORTC QLQ-C30 and EORTC QLQ-H&N35

The EORTC QLQ-C30 (version 3.0) is a cancer-specific health-related QoL questionnaire. The EORTC QLQ-C30 consists of 30 items containing five functional subscales (physical, role, emotional, social, and cognitive), eight symptoms (fatigue, pain, nausea/vomiting, appetite loss, constipation, diarrhea, insomnia, and dyspnea), global health/QoL, and financial impact. The scores of the EORTC QLQ-C30 are linearly transformed to a scale of 0–100, with a higher score indicating a higher level of functioning or global health-related QoL, or a higher level of symptoms or problems [25].

The EORTC QLQ-H&N35 is a head and neck cancer-specific health-related QoL questionnaire. The EORTC QLQ-H&N consists of 35 items containing seven subscales: pain (four items), swallowing (five items), senses (two items), speech (three items), social eating (four items), social contact (five items), and sexuality (two items). There are 11 single items covering problems with teeth, dry mouth, sticky saliva, cough, feeling ill, opening the mouth wide, weight loss, weight gain, use of nutritional supplements, feeding tubes, and painkillers. The scores of the EORTC QLQ-H&N35 are linearly transformed to a scale of 0–100, with a higher score indicating a higher level of symptoms or problems [25]. We expected that swallowing function-related subscales of the EORTC QLQ-H&N would be correlated with the overall MDADI-J and with all subscales.

FOIS

Oral intake and nutritional status were scored with the FOIS (range from one to seven with score), with a higher score indicating a higher level of swallowing function [26].

Statistical Analysis

Reliability was evaluated by internal consistency (Cronbach’s α coefficients), average inter-item correlation, and test–retest reliability (intra-class correlation coefficient, ICC) over 2 weeks [27]. Concurrent validity was assessed using Pearson’s correlation coefficients between the MDADI-J and EORTC QLQ-C30, and the EORTC QLQ-H&N referring to the method of a previous study [28]. Discriminant validity was assessed by Student’s t test between clinical tumor stages I–II and III–IV. The trends between the MDADI-J total score and FOIS were analyzed using the Jonckheere–Terpstra test. Each mean number was shown by average ± standard deviation. Statistical analysis was performed using SPSS (version 22; SPSS Japan Inc., Tokyo Japan). We calculated two-tailed P values in all the analyses. The alpha level of significance was set at 0.05.

Results

Patient Characteristics

A total of 75 participants enrolled in the study. When we analyzed the data, 3 (4.0%) were excluded because of failing to meet the inclusion criteria. Nineteen (26.4%) of the patients completed a second survey 2 weeks later. The patient characteristics are summarized in Table 1. This survey was completed by 72 patients treated for head and neck cancer, 39 of whom (54.2%) were male and 33 of whom (45.8%) were female. The mean age of the patients was 64.1 ± 11.4 years, with a range of 34–82 years. For the 72 patients who received treatment, the mean (SD) time since last treatment was 11.7 (24.8) months, with a range of 1.0–168 months. The oral cavity was the most frequent primary tumor site, and some patients had advanced stage or recurrent cancer. Dysphagia status was assessed using the FOIS (Table 2). There were no patients with grade 1 of FOIS: none of the patients had unstable medical conditions because of severe saliva aspiration.

Table 1 Demographic and clinical characteristics (N = 72)
Table 2 Distribution of patients for the Functional Oral Intake Scale (FOIS)

Reliability and Validity

Internal Consistency

Cronbach’s α coefficients were calculated for each subscale of the MDADI-J (Table 3). The Cronbach’s α was 0.67 for MDADI-J emotional, 0.81 for MDADI-J functional, 0.83 for MDADI-J physical, and 0.92 for MDADI-J total score. The global subscale was excluded because it consists of a single question. The average inter-item correlation coefficients for all question items were 0.42, and for the 3 subscales it ranged from 0.39 to 0.49.

Table 3 Cronbach’s α coefficients and average inter-item correlation coefficients for total scores on the subscales of the Japanese version of the MD Anderson Dysphagia Inventory (N = 72)

Test–Retest Reliability

Reproducibility, as assessed by the ICC, was 0.580 for global, 0.78 for emotional, 0.79 for functional, 0.81 for physical, and 0.84 for the total score (Table 4).

Table 4 Intra-class correlation coefficient (ICC) of the Japanese version of the MD Anderson Dysphagia Inventory (N = 19)

Concurrent Validity

Pearson’s correlation coefficients between the MDADI-J and EORTC QLQ-C30, and the EORTC QLQ-H&N were calculated (Table 5). The style of Table 5 was adopted, referring to the existing literature [28]. The EORTC QLQ-C30 and EORTC QLQ-H&N were associated with each factor of the MDADI-J.

Table 5 Correlation coefficients between the Japanese version of the MD Anderson Dysphagia Inventory (MDADI-J) and the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire-Core 30 (EORTC QLQ-C30) and the EORTC Quality of Life Questionnaire-Head and Neck 35 (EORTC QLQ-H&N35) (N = 72)

Discriminant Validity

Discriminant validity was compared with the independent variables using the Student’s t test between stages I–II and III–IV. There were significant differences for each factor (Fig. 2).

Fig. 2
figure 2

Comparison of each subscale of the Japanese version of the MD Anderson Dysphagia Inventory between early and advanced stages. *P < 0.05, **P < 0.01. aClinical tumor stages I–II, bclinical tumor stages III–IV

Relationship Between MDADI and FOIS

Total scores all increased along with improvement of the FOIS (trend test) (Fig. 3).

Fig. 3
figure 3

Relationship between the total score of the Japanese version of the MD Anderson Dysphagia Inventory and Functional Oral Intake Scale (trend test). aFunctional Oral Intake Scale

Discussion

We translated the MDADI into Japanese from English and confirmed its validity and reliability. The Cronbach’s α coefficients of other language versions (Dutch, Korean, Portuguese, and Swedish) range from 0.79 to 0.86 (0.67 in Japanese) for the emotional subscale, from 0.68 to 0.82 (0.81 in Japanese) for the functional subscale, from 0.74 to 0.89 (0.83 in Japanese) for the physical subscale, and from 0.81 to 0.94 (0.92 in Japanese) for the total score [13, 24, 29, 30]. The Cronbach’s α coefficients of the Japanese version that ranged from 0.67 to 0.92 were almost on the same level with other language versions of the MDADI. The average inter-item correlation coefficients had similar values to those previously reported [31]. For test–retest reliability, the ICC of other languages (Dutch, Korean, Portuguese, and Swedish versions) range from 0.82 to 0.83 for the global subscale, from 0.88 to 0.93 for the emotional subscale, from 0.84 to 0.97 for the functional subscale, from 0.84 to 0.94 for the physical subscale, and from 0.80 to 0.96 for the total score [13, 24, 29, 30]. Therefore, the ICCs of the Japanese version for all subscales, which ranged from 0.78 to 0.84, were comparable to the MDADI of other language versions, except for the global subscale (ICC = 0.58). The ICC of the global subscale also has the lowest score of all subscales in the original, Korean, and Swedish versions (Dutch and Portuguese versions were not shown). This result may be because the global subscale was constructed with only a single item. However, the ICC of the total score in Japanese is of the same degree compared with studies using other languages [13, 29]. Therefore, the ICC of the total score was acceptable (with the highest ICC = 0.84).

The relationship between the MDADI-J and EORTC QLQ-C30, and the EORTC QLQ-H&N resulted in reasonable values. We expected that swallowing function-related subscales of the EORTC QLQ-H&N (Swallowing, Trouble with social eating, Teeth, Opening mouth, Dry mouth, and Sticky saliva) would be correlated with the overall MDADI-J and with all subscales, because the MDADI-J was developed for HNC patients with dysphagia. Our hypothesis that the swallowing function-related subscales of the EORTC QLQ-H&N would correlate with all the MDADI-J subscales was correct, whereas there were no correlations between the MDADI-J subscales and the swallowing function-unrelated subscales of the EORTC QLQ-H&N (Painkillers and Nutritional supplements). These results were consistent with earlier studies showing that dysphagia-specific questionnaires such as the SWAL-QOL were more closely related to the MDADI than comprehensive questionnaires such as the Short Form-36 [14, 30]. Therefore, we considered that the concurrent validity of the MDADI-J was verified.

According to a review of previous research, discriminant validity has only previously been examined and verified in the Swedish version of the MDADI [29]. Equally, the discriminant validity of the MDADI-J was supported, as statistically significant differences were observed between distinctly different groups in clinical practice (clinical tumor stages I–II and III–IV). These results support the fact that several factors including clinical tumor stage have been identified that may predict worse QoL outcomes [32]. From the above, these results suggested that the reliability (internal consistency and reproducibility) and validity (concurrent and discriminant validity) of the MDADI-J have been confirmed, so we consider the MDADI-J to be useable.

In addition, we confirmed the monotonic increase between the total score and the FOIS by the trend test. The total MDADI-J score increased with the progress of the FOIS [33]. This finding raises the possibility that self-rated perception agrees with an objective evaluation in oropharyngeal dysphagia. This result corroborated previous reports that VFSS findings are correlated with MDADI scores [20]. The above results suggest that patients’ complaints about oropharyngeal dysphagia as well as objective assessments can play an important role in clinical practice. Future studies could set MDADI cutoff scores for the classification of dysphagia severity with the aim of applying them to self-assessment tools for swallowing function in patients with HNC.

Our study has some limitations. First, sample size and characteristics of the sample may have influenced relationships among the variables: the study population may not be fully representative of the total population of patients with HNC, and thus further research is required. Second, the majority of patients included in the study received surgical treatment. However, a variety of patients with different treatment protocols were included. Therefore, this is a limitation that findings of a wider population may have been different. Third, the sample size of some groups was too small in the trend test, and therefore the accuracy of the statistical analysis was insufficient. However, the result of this trend test might remain the same even with larger samples. Fourth, interpretation of the MDADI-J global scale needs to be considered carefully because the ICC of the global subscale had the lowest score of all subscales. Fifth, in this cross-sectional study, the possibility of confounding by the patients’ characteristics, such as the mean time since last treatment, cannot be excluded, and this requires further evaluation in longitudinal studies.

Conclusion

We have two major conclusions:

  1. (1)

    The validity and reliability of the Japanese version of the MDADI were confirmed.

  2. (2)

    The present study suggests that MDADI-J scores tend to constantly decrease as FOIS scores deteriorate.