Introduction

The physiological profile of dysphagia associated with head and neck cancer (HNC) is well established in the literature [1]. Dysphagia is a multifactorial condition that can have serious medical, psychosocial and existential effects on the everyday lives of HNC survivors [1,2,3,4]. As a result, measuring dysphagia-related outcomes throughout the patient journey is a mainstay of both research and clinical practice in HNC [5,6,7]. An array of validated and reliable tools have been developed to measure dysphagia and its impact. However, different tools may fit different purposes and often more than one tool is required to measure the multifactorial and complex nature of dysphagia in HNC.

Numerous tools have been developed to address swallowing outcomes [8,9,10], and therefore, selecting the most appropriate tool or group of tools specifically for HNC care can be an arduous and time-consuming process. Furthermore, systematic reviews of the tools currently used to measure and monitor dysphagia (without any specific population focus) have noted that many have low levels of validity and reliability [8,9,10]. Whilst selecting measures based on their psychometric properties is a critical consideration, the selection of the most appropriate tools for a particular clinical or research context is heavily influenced by the content or aspect of dysphagia that is under investigation. In HNC care, it is recognised that tumour presence and treatment factors have differential impacts on swallowing specifically [1, 11,12,13] and the individual as a whole. Hence, research of the past decade has highlighted the importance of considering dysphagia and its impacts from more than simply an impairment focus [4, 14, 15].

It is also well accepted now that measures of dysphagia in HNC should include clinician-rated measures of swallow function and patient-reported outcomes [16]. It is to be anticipated then that the tools used to assess swallowing and its impact specifically in the area of HNC care will differ with regard to the nature of administration, as well as the depth and breadth of concepts targeted. Instrumental assessments such as videofluroscopic swallowing studies and fibreoptic endoscopic swallowing studies directly observe anatomy and physiologic function during swallowing and numerous tools have been developed and validated in order to report and interpret instrumental assessments. These instruments are completed by clinicians and are often considered gold standard as they do not include subjective factors related to patient-reported functions. Numerous validated instruments are also available to assess patient functioning following HNC treatment. These measures can be either clinician-rated and based on either non-instrumental assessments (e.g., clinical assessment) or global indicators of functional status (e.g., diet level and gastrostomy dependence), or patient-rated to assess an individual’s perspective of their functioning post-treatment.

To accurately compare the contents of various tools used in HNC care, a framework that provides a comprehensive and standardised terminology is required. The International Classification of Functioning, Disability and Health (ICF), endorsed by the World Health Organization (WHO) [17], was developed from a bio-psycho-social model and provides a universal language of functioning and health and serves as a common reference in which various health and health-related concepts can be linked [17]. The ICF describes functioning in two parts, namely (1) functioning and disability; and (2) contextual factors. Functioning and disability is comprised of two components including body functions and body structures, as well as activities and participation. Contextual factors are made up of environmental factors, which refer to the physical, social, and attitudinal environment, as well as personal factors, which refers to the background of an individual’s life and living and are made up of features of the individual that are not part of a health condition [17].

Each of these components are comprised of several domains and within these domains are categories that can be used to describe, in detail, an individual’s health and health-related states using specific category codes [17]. The ICF consists of over 1400 categories that are represented by alphanumeric codes in which the letters b, s, d, and e are used to indicate body functions, body structures, activities and participation, and environmental factors, respectively [17]. These letters are then followed by a numeric code that represents the chapter or domain followed by additional coding. In the case of dysphagia, difficulties with chewing would be represented by the body functions code b5102chewing, reduced social participation would be coded to activities and participation d9205socializing, and the need to modify food and fluids would be coded as an environmental factor e1100food. Personal factors include features such as gender, race, profession, and coping style. Personal factors are not currently coded in the ICF [17].

It has previously been established that the ICF can be used successfully to represent the complex and multifaceted impact of dysphagia in HNC [18]. Furthermore, the ICF has been used successfully in content comparisons of tools used for measuring other health states, including quality of life in HNC [19]; obesity [20], musculoskeletal disorders [21] and hearing impairment [22]. A content comparison of outcome tools being used to measure and monitor dysphagia in HNC is necessary to assess the current scope of available tools and identify any gaps in the ICF concepts not covered by these existing dysphagia tools. In addition, such analysis can assist clinicians and researchers to undertake more considered decisions regarding the selection of the most appropriate tool for a particular research or clinical context. Therefore, the aim of the current study is to comprehensively identify the tools commonly used to measure swallowing in the HNC patient population and examine the contents of these tools using the ICF as a reference.

Methods

This study was conducted in two stages. The first stage involved conducting a literature audit to identify and select tools commonly used in the published literature to measure swallowing in HNC patients. In the second stage, the contents of the tools identified were examined and mapped to the ICF framework to register an inventory of available tools and identify any gaps in content specific to swallowing.

Stage 1: Literature Audit

Search Strategy

The literature audit of 11 databases (PubMed, Medline, CINAHL, Web of Science, Scopus, Embase, PsycINFO, the Cochrane Central Register of Controlled Trials, Joanna Briggs Institute of Evidence-Based Practice, Allied and Complementary Medicine Database, and the Database of Abstracts of Review of Effects) was conducted by two members of the research team. The purpose was to identify articles that were published in peer-reviewed journals between January 2004 and June 2017. The main search terms included head and neck neoplasms and deglutition disorders. Additional search terms are listed in the supplementary material.

Study Selection

A rigorous consensus process was conducted in which all titles, abstracts, and full-text were reviewed at least twice by two different members of the research team. The research team consisted of a panel of seven internationally recognised experts in dysphagia management in HNC, each with a track record of publications in this field. Any discrepancies in ratings were resolved by consensus with a third-rater. Citations were included if they presented first-hand data concerning people with HNC and included any measure of swallowing or swallowing related outcomes. Citations were excluded if they were: (1) not related to HNC (i.e., citations related to esophageal, thyroid or skin cancer); (2) review papers, case reports (single participant), case studies (< 10 participants), educational papers, conference abstract, economic evaluations; (3) involved pediatric patients (< 18 years old); (4) an animal study; (5) published prior to January 2004; (6) not explicitly related to swallowing outcomes; (7) in a language other than English.

Tool Selection

A full review of each accepted article was conducted by all members of the research team and information regarding tools used to measure swallowing outcomes were extracted. Tools were initially accepted if they met the following criteria: (a) tools that were specific to swallowing in HNC; (b) tools that were specific to swallowing but not necessarily HNC; and (c) tools that were specific to HNC and contained a minimum of one question related to swallowing. Once this initial set of tools was identified, a secondary set of inclusion criteria were applied. To give meaningful information on swallowing and swallowing-related outcomes, it is important that the psychometric properties of a tool have been examined [10]. Hence, tools were only accepted if they had at least some published data supporting their psychometric properties. In addition, in order to be considered “common”, tools were required to have been cited by at least two different author groups, demonstrating that groups other than those who developed the tool have commenced using it in research. This methodology and criteria for inclusion is consistent with previous studies that have examined the contents of outcome measures against the ICF [19]. An audit of 20% of the abstracts and full-text data extraction was conducted by a second rater and discrepancies were resolved by consensus.

Stage 2: ICF Coding Process

Meaningful concepts within each tool that specifically addressed swallowing and/or eating were identified and mapped to the ICF using the established coding rules [23, 24]. Some key points from the coding rules include: (1) link meaningful concepts to the most precise ICF category (2) if a single item contains more than one concept, each concept should be linked (3) do not use so-called ‘other specified’ or ‘unspecified’ ICF categories (identified by the final code 8 and 9), and (4) personal factors are assigned the code pf. As an example, item one of the Eating Assessment Tool (EAT-10) [25] was coded as follows: ‘My swallowing problem has caused me to lose weight’, the meaningful concepts ‘swallowing problem’ and ‘lose weight’ would be linked to b5105 ‘swallowing’ and b530 ‘weight maintenance’, respectively. In addition, if an item within the tool did not relate directly to swallowing, then the meaningful concepts within that item were not mapped. For example, the first item of the Functional Assessment of Cancer Therapy—Head and Neck 4 (FACT H&N) [26] ‘I have a lack of energy’ does not explicitly relate to swallowing and therefore the meaningful concepts within this item were not mapped. The ICF codes representing the concepts contained in each tool formed the basis of the content comparison and analysis.

This coding process was conducted independently by two members of the research team experienced with the ICF and conducting ICF coding. The reliability of the mapping process was evaluated by computing percentage exact agreement and by calculating kappa coefficients based on the two independent mapping versions of each tool. The kappa analysis was performed with SPSS version 24 (https://www.ibm.com/au-en/products/spss-statistics). Where any discrepancies were noted in the coding of the two-raters, these were discussed and a consensus decision reached.

Results

Stage 1: Literature Audit

The electronic literature audit yielded 17,461 citation hits (Fig. 1). After removing duplicates, 7863 citations remained. As a result of the title screen, a further 6247 citations were removed. The remaining 1616 citations were evaluated based on their abstract and full-text against the inclusion/exclusion criteria and relevant tools were identified in the accepted articles. A total of 502 studies were included and 65 tools extracted. Each tool was then screened against the secondary criteria and 27 met the study’s full set of inclusion criteria. Table 1 provides an overview of the characteristics of each tool. Of the 27 tools included, 14 were clinician-rated tools and 13 were patient-rated tools.

Fig. 1
figure 1

PRISMA flow diagram detailing search strategy and selection criteria

Table 1 Summary of selected tools

Stage 2: ICF Coding Process

Coding Agreement

Table 2 shows the evaluation of the coding procedure by percent exact agreement and kappa coefficients. Percent exact agreement and estimated kappa values ranged from 52.94 to 100 and 0.5 to 1.00, respectively, indicating a wide range in agreement between the two coders, depending on the instrument. The main reason for this finding was that the established coding rules were not always sufficient when coding swallowing measures and allowed too much individual interpretation. There were also ambiguities regarding descriptions of specific category codes in the ICF.

Table 2 Percent exact agreement and Kappa coefficients for the coding procedure

Identification of Meaningful Concepts within Tools

A total of 898 meaningful concepts related to dysphagia were identified across the 27 tools. Out of the 898 meaningful concepts, 836 (93%) were linked to 60 different ICF categories. Twenty-seven of the 60 ICF categories were represented by the ‘body functions’ component, five by the ‘body structures’ component, 19 by the activities and participation component, and nine by ‘environmental factors’. The specific codes listed by component (body functions, body structures, activities and participation and environmental factors) are detailed below and in Tables 3, 4, 5, 6, 7 and 8. A small number (62 concepts, 7%) of all meaningful concepts could not be linked to the ICF. Of these 32 items were identified as personal factors and 30 were other meaningful concepts pertaining to swallowing/eating including ‘control’, ‘enjoyment’, ‘pleasure’, and ‘under strictly defined conditions’. As these concepts cannot be classified in the current version of the ICF, this information was not able to be coded.

Table 3 Frequency of ICF categories from the component “body structures” and “body functions” represented in the clinician-rated tools
Table 4 Frequency ICF categories from the component “body structures” and “body functions” represented in the patient-rated tools
Table 5 ICF categories from the component “activities and participation” represented in the clinician-rated tools
Table 6 Frequency of ICF categories from the component “activities and participation” represented in the patient-rated tools
Table 7 Frequency of ICF categories from the component “environmental factors” represented in the clinician-rated tools
Table 8 Frequency of ICF categories from the component “environmental factors” represented in the patient-rated tools

ICF Coding: Body Functions and Body Structures Component

Body functions refer to the physiological functions of the body systems and body structures refer to the anatomical parts of the body [17]. Concepts related to body functions were represented in the majority of tools (n = 23), whereas relatively few tools covered concepts related to body structures (n = 4) (Table 3 and 4). The most frequently occurring body functions codes across tools included b5105—swallowing (n = 23), b450—additional respiratory functions (n = 11), b5104—salivation (n = 11), b5102—chewing (n = 10), and b530 weight maintenance functions (n = 9). The MASA-C was found to cover the largest number of body functions and body structures codes for clinician-rated tools (n = 14), whereas the VHNSS covered the largest number of body functions and body structures codes for patient-rated tools (n = 13). Tools which did not cover any components pertaining to body functions were the FOIS, PSS-HN, and EORTCQLQ-C30.

ICF Coding: Activities and Participation Component

The ICF defines activities as the execution of a task and participation as an individual’s involvement in a life situation [17]. Twenty-four tools included at least one concept that was able to be mapped to the activities and participation component. In total, 19 activity and participation codes were utilised across tools. The most frequently represented activities and participation codes included d550—eating (n = 23), d560—drinking (n = 6), and d9205—socializing (n = 5). The AusTOMs (n = 7) [25] and the SWAL-QOL (n = 11) provide the greatest coverage of activities and participation for clinician and patient-rated tools, respectively. The activities and participation component was not covered at all in a number of tools such as the MBSImP.

ICF Coding: Environmental Factors Component

Environmental factors include the physical, social, and attitudinal environment in which people live and conduct their lives [17]. The mapping exercise was only able to link nine environmental factor codes with the meaningful concepts identified across tools. Despite the limited number of codes, 23 tools included at least one unique concept that was mapped to environmental factors. The most common environmental factor codes across tools included e1100—food (n = 22) and e1151 (n = 14) assistive products and technology for personal use in daily living. The RBHOMS (n = 4), the EORTC QLQ-HN35 (n = 4) and the MDADI (n = 4) capture the largest number of environmental factors. Environmental factors were not covered by MBSImP, OPSE, PAS, or WST.

Discussion

As per prior systematic reviews of general dysphagia tools [8,9,10], the results of the current study demonstrate that a large number of tools are being used in research to measure and describe dysphagia in HNC care. Such diversity has important implications for research and the ability to combine and compare outcomes across studies. Further to this, this study demonstrates that the majority of tools used to measure and describe dysphagia in HNC focus largely on the physiological aspects of swallowing, followed by activities and participation, with only a few categories of environmental factors represented. This is consistent with the state of the current literature which has a greater focus on describing the assessment and management of the physiological changes to the swallow mechanism as a result of HNC and its treatment. Over the last decade, however, there has been an increasing interest in studying the broad-ranging impact of dysphagia in HNC beyond the physiological changes to the swallow. Therefore, to accurately capture the profile of dysphagia and its associated impact on the lives of our patients, we need to consider which tools or combination of tools will provide a more holistic picture of patient functioning.

It is beyond the scope of the current study to make recommendations regarding which tool or combination of tools should be used to measure dysphagia outcomes in HNC. However, it is apparent from the large number of tools identified in this study that the research regarding dysphagia in HNC lacks a uniform approach to outcome measurement. As a result, the research outcomes across studies will continue to be difficult to compare and combine, thus limiting the potential to strengthen treatment evidence through meta-analysis and data pooling. Previously, Chera et al. [52] recognised the need to establish a core set of patient-reported symptoms in HNC treatment trials. Twelve HNC core symptoms were recommended including swallowing, oral pain, skin changes, dry mouth, dental health, opening mouth/trismus, taste, excess/thick saliva, shoulder disability/motion, voice/hoarseness, social domain, and functional domain [52]. Whilst this is an important first step regarding consistency of outcome measurement across trials, limited information is provided regarding dysphagia and its impact on patient functioning. To determine whether dysphagia treatments in HNC are effective, a consensus regarding a minimum core set of outcome measures must be established.

Hutcheson and colleagues raised this issue in their development of the Dynamic Imaging Grade of Swallowing Toxicity (DIGEST) [53]. Recognising the issues of combining swallowing outcome measures into simple toxicity gradings for use in large scale clinical trials, the DIGEST was developed as a companion to the widely used CTCAE and provides a modified barium swallow (MBS) graded measure of pharyngeal dysphagia [52]. At the time the current study was conducted, this tool was not published by other groups, however, this type of simple, singular measure may help address the challenges of comparing dysphagia outcomes across clinical trials in the future. Naturally though, selecting a single measure such as the DIGEST may help assist researchers to track and compare impairment-based changes, with impacts to activity and participation and environmental impacts failing to be captured.

Research has found that dysphagia has a pervasive effect on the everyday lives of HNC patients affecting activities of daily living, changes to social networks and support as well as the need to develop coping strategies and seek out professional support [54]. To better understand the functional impact of dysphagia in HNC from the patients’ perspective, Nund et al. mapped the patients’ perspectives to the ICF [18]. Though that study was limited to people treated non-surgically, the results indicated that body functions, activities and participation, and environmental factors were almost equally important to people with dysphagia in HNC. Therefore, selecting tools which only focus on the physiological and anatomical assessment of the swallow are likely to inadequately capture the impacts and concerns of this population [18]. It is, therefore, suggested that clinicians and researchers consider selecting tools that will provide pertinent information on body functions, activities and participation and environmental factors as they relate to swallowing.

However the challenge, as highlighted by the current study findings, is that very few tools cover more than one ICF domain equally. Therefore, if we are to embrace a more holistic perspective of dysphagia, more than one tool is needed. Even with a multi-tool approach, there are current gaps in the aspects of dysphagia appropriately captured by current existing tools. Whilst there are tools that exist which cover body functions and activities and participation equally (e.g., MDADI and SWAL-QOL), there are a lack of tools available that evaluate environmental factors related to dysphagia [55]. To address this gap, Chan et al. recently developed and validated the Head and Neck Cancer Survivor’s Assessment of Mealtimes (HNSAM) [15]. The HNSAM was developed from the HNC survivors perspective and classified against the ICF framework. Environmental factors are clearly identified in the tool and incorporate a number of areas, such as support and attitudes of family, friends, and health professionals, that are missing from the current range of patient-reported outcome measures. Though the HNSAM showed initial validity and reliability, the tool was developed in Chinese and the authors of the HNSAM have identified the need for psychometric testing in English [15].

Heterogeneity in outcome measurement is not unique to dysphagia in HNC. The Core Outcome Measures in Effectiveness Trials (COMET) initiative, launched in 2010, brings together researchers interested in the development and application of ‘core outcome sets’. The COMET databases (see http://www.comet-initiative.org) currently lists 1125 references of planned, ongoing and completed work in core outcome sets, however, there are no listed projects relating to dysphagia. As highlighted by the results of the current study, the number of tools in use, and the diversity of concepts addressed by each tool highlights that leaders in this area also need to embrace the concepts of developing a core outcome measure set for HNC. Whilst the existence of a core set does not preclude the use of other tools for other research purposes, it will help establish a common language through which researchers and clinicians can document the widespread and long-term impacts of dysphagia on patient survivorship. This is an area for future work.

It is acknowledged that limitations exist in the current study. First, it is acknowledged that the inter-rater reliability of ICF coding was quite variable. Though the majority of ratings demonstrated moderate to almost perfect agreements, a small number of ratings had weak levels of agreement. As reported in previous studies, the coding rules were not always sufficient when coding the swallowing measures and allow for too much individual interpretation [22]. In addition, there was some discussion between the two researchers regarding the latent interpretation of several statements in order to be correctly linked. Latent interpretation is common in content analysis and refers to the underlying meaning of a concept [20]. This factor was especially relevant for the concepts swallowing, eating, and drinking. For example, item 13 on the MDADI states “I cough when I try to drink liquids”. In this case, one researcher coded the concepts “cough”—b450, “drink” d560, and “liquids” e1100 whilst the other coded “cough”—b450, “drink” b5105 (swallowing), and “liquids” e1100. The two researchers interpreted the meaning of “drink” differently and had further discussions regarding whether this item on the MDADI was really asking about the “impairment” or the “activity”. Ultimately the researchers met on several occasions to discuss the latent interpretations of these items and consensus was reached during those meetings. It is further acknowledged that only a minimum standard was set for the psychometric properties of the tools included in this review. The authors are very aware of the differences between the level and rigor of psychometric evaluations conducted across the various tools included in this review and encourage researchers and clinicians to also consider the psychometric evaluations when selecting tools for their context. Finally, it is acknowledged that newer tools have been established in the last few years, particularly in the domain of image processing for instrumental assessments. Though these tools did not meet the criteria for inclusion in the current study, they should be considered when defining a core set of outcomes in the future.

In conclusion, this study has confirmed the diversity in outcome measures currently in use to document dysphagia and its impacts following HNC. It has provided insights regarding the depth, breadth, and precision of concepts covered, and demonstrated areas that few tools currently capture, such as environmental factors. This information can be used by clinicians and researchers when selecting instruments for use in dysphagia management to ensure that the instrument chosen covers the area of most concern/interest. Given the large number of measures, there is a need to establish a core outcome set of outcome measures for dysphagia in HNC. In the meantime, researchers and clinicians need to consider not only the psychometric properties, but also the content covered by the tools, to determine whether the tools will provide an accurate and holistic picture of patient functioning post HNC care.