Introduction

Dysphagia is conceptualized as difficulty in any stage of swallowing, which is commonly accompanied by a broad spectrum of clinical conditions [1,2,3]. In children, oropharyngeal dysphagia negatively interferes in growth and development, and may result in dehydration and malnutrition, aspiration pneumonia, enteral feeding, food refusal and aversion, as well as individual and family negative psychosocial effects [1, 4, 5]. Therefore, making early identification and proper management is crucial [4].

The prevalence of dysphagia is increasing in the pediatric population according to a search fulfilled in the USA, where 61.2 million children aged 3 to 17 years, 569 thousand children suffered from a swallowing problem lasting greater than 1 week in the past 12 months during of the year 2014, and that over 100,000 pediatric patients were discharged from intensive care hospitals diagnosed with feeding and swallowing difficulties [6]. Increase in the diagnosis of pediatric dysphagia was considered statistically significant between the years 1997 and 2012 through the Kid's Inpatient Database search in which more than six million pediatric admissions were analyzed, and a total of 83,711 cases with dysphagia ICD-9 codes were recorded [7]. From 5,107 weighted cases in 1997, this number increased to 27,464 cases in 2012 [7]. Diagnosis of these disorders occurs primarily through the non-instrumental swallowing assessment which aims, but not limited to analyze the presence or absence of swallowing difficulties and propose an appropriate management of impairment [1, 3, 8]. Instrumental evaluation is known, to complement clinical swallowing assessment aiming to analyze swallowing biomechanics, physiological abnormalities, and to help determine the disorders etiology [2, 3, 9]. The evaluator's experience as well as the use of protocols may be determining factors in the precise interpretation of radiological or endoscopic findings of swallowing components [1, 3, 5].

In regard to the use of these protocols to guide instrumental assessment of swallowing in the pediatric population, there are two lines of thought in the scientific literature: the first states that protocols help standardize evaluation and reproducibility of results [10], and the second line of thought states that protocols weaken individuality of each patient’s findings, and it suggests that there are no protocols that are truly representative of individual’s typical swallowing function, this line of thought is relevant, because the qualitative assessment swallowing allows a broad view of the deglutition process and elaboration of the specific therapeutic approaches, in view of the possibility to identify peculiarities, case-by-case, in the exam [4, 8, 9, 11]. However, the use of standardized protocols is recommended to decrease the discrepancy of instrumental assessment interpretation of the results. Hence, it may increase reliability, precision, and accuracy of this assessment [10, 12]. In addition, they contain specific components of the oral and pharyngeal phases, thus avoiding discrepancies and inconsistencies in test results that undermine clinical decision making [3, 4, 8, 13, 14].

Analysis in evidence level of the protocols available for instrumental swallowing assessment in the neonatal and pediatric populations is of considerable clinical relevance, and they may help support therapeutic decision making. Thus, the first aim of this systematic review of literature is to identify and analyze any studies that used a protocol, including both validated and unvalidated protocols to assess swallowing biomechanics using instrumental evaluations through VFSS or FEES in the neonatal and pediatric populations. The secondary aim is to determine if among included protocols there are some which were validated and to assess the risk of bias of these protocols.

Methods

This review was performed according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement (Fig. 1) [15]. The PRISMA flowchart can be found in the results section.

Fig. 1
figure 1figure 1

Checklist items to be included in the report of systematic review or meta-analysis

Literature Search

A broad search strategy was conducted using acronym PEO (population, exposure and outcome); being population: neonatal and pediatric, exposure: VFSS or FEES, and outcomes: validated or unvalidated protocol to assess swallowing biomechanics. The search was conducted in the following online databases: PubMed (1950–August 22, 2019), Scopus (1823—August, 22, 2019), CINAHL (1961–August 22, 2019), Cochrane Library (1996–August 22, 2019), and Biblioteca Virtual de Saúde (BVS) (1967–August 22, 2019), this last one having several databases, of which were selected the LILACS, IBECS e SciELO. The electronic search strategy was designed and tailored for each database with support of a librarian. The search used combined key terms and Medical Subject Headings as shown in Table 1.

Table 1 Search strategy for the following online databases: PubMed, Cochrane Library, CINAHL e Scopus in August, 22, 2019

Additionally, a search in the gray literature on the subject was performed on sites that provide unpublished studies, such as the registry base for Clinical trials protocols and CAPES theses and dissertations. Furthermore, reviews and reference lists of included articles were reviewed for other potential publications and articles indicated by experts in the field. There was no restriction of languages or publication dates.

When the full texts were unavailable, assistance from the library system of the Universidade Federal do Rio Grande do Sul was requested to retrieve the full articles. Studies were excluded from the research if it was impossible to get the full text even after attempting all the sources. All databases were searched using keywords selected according to the controlled descriptors for Medical Subject Headings (MeSH), Embase Subject Headings (Emtree) and Descritores de Ciências da Saúde (DeCS), combined with the uncontrolled descriptors, representing the textual words and their synonyms (Table 1).

Study Eligibility

The studies included in the systematic review that used protocols to assess swallowing biomechanics through VFSS and FESS in the neonatal and/or pediatric populations, spanning the age group from 34 weeks gestational age (GA) to 19 years [16].

Exclusion criteria included case reports and case series designs, book chapters, studies that did not aim to assess swallowing with food, and studies that included only the non-instrumental swallowing assessment protocols. Moreover, studies that did not describe in detail the swallowing findings analyzed in the instrumental evaluation were also excluded, as well as if the protocol used in the study investigated only esophageal dysphagia, if psychogenic dysphagia was studied or if the study was conducted in a non-human population.

Study Selection

Two independent reviewers (CAC and PPM) assessed all selected titles and abstracts and determined the eligibility of the study by listing it as “included”, “excluded”, or “not clear”. When there was disagreement among the researchers about the eligibility of any abstract, consensus was reached between them. When an article was not elected after reading its full text, the justification was based on consensus according to exclusion criteria. Disagreements in the final ratings of full-text studies were resolved with the assistance of a third reviewer (DSL) who defined whether or not to comprise the study in the systematic review.

Data Extraction

Data extraction was completed by a single reviewer (CAC) for full articles that met all inclusion criteria outlined above. Data extraction included the following: (1) authors, (2) year of publication, (3) country of origin, (4) language, (5) characteristics of the study population (sex, dysphagia etiology), (6) average or median age, (7) sample size, (8) study design, (9) objective (s), (10) protocol used for instrumental examinations, (11) protocol characterization, and (12) research results. When statistical data regarding accuracy values, positive predictive value, negative predictive, sensitivity, and specificity were available, the instruments were also collected.

Although initially was planned to perform a meta-analysis, it was not possible to carry out because of the low quality of the study’s methodology and population heterogeneity.

Methodological Quality and Risk of Bias Assessment of Studies

Two reviewers assessed the quality of the selected studies using the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) criteria, which cover four key domains for assessing risk of bias and applicability of the study results [17]. These domains are patient selection, index test, reference standard, and flow and timing of samples/patients through the study and the first three domains are also assessed in terms of applicability concerns. In cases in which there was disagreement regarding the risk of bias analysis, two reviewers (DM and LS) met and discussed their ratings until they achieved consensus. Cohen’s Kappa was calculated to evaluate the level of agreement between both raters for titles/abstract and full-text review.

The QUADAS-2 is applied in four phases: summarizes the review question, tailors the tool to the review and produces review-specific guidance, constructs a flow diagram for the primary study, assesses risk of bias and concerns regarding applicability [17]. Risk of bias is judged as “low”, “high”, or “unclear”. If all signaling questions for a domain are answered “yes” then risk of bias can be judged “low”. If any signaling question is answered “no” this flags the potential for bias. Review authors then need to use the guidelines developed in phase 2 to judge risk of bias. The “unclear” category should be used only when insufficient data are reported to permit a judgment. If a study is judged as “low” on all domains relating to bias or applicability then it is appropriate to have an overall judgment of “low risk of bias” or “low concern regarding applicability” for that study. If a study is judged "high" or "unclear" on one or more domains then it may be judged “at risk of bias” or as having “concerns regarding applicability” [17] (Fig. 2).

Fig. 2
figure 2figure 2

Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) at Bristol Medical School [17]

Results

Figure 3 shows the overview of the selection process for included studies. The initial survey identified a total of 1020 studies, of which 30 were duplicates. After the duplicates were removed, 990 abstracts were screened from the selected databases. Of these, 17 studies were left to read their full texts and defined by the eligibility criteria if they would be included. At the end of this process, six articles remained. Regarding the search in the gray literature, seven studies met the inclusion and exclusion criteria and were included in the research. Ultimately, a total of 13 articles met the inclusion criteria for review and data extraction (Table 2 summarizes the characteristics).

Fig. 3
figure 3

PRISMA flowchart for selection and inclusion of articles in the systematic review of instrumental assessments of swallowing in the pediatric population

Table 2 Characteristics of inserted studies (n = 13)

Thus, most of the included articles were developed in North American countries. Cross-sectional and retrospective designs were the most frequent, with VFSS being the predominant instrumental assessment exam (Table 2). The reasons for exclusion of the articles were as follows: a study that did not report the use of an instrumental evaluation protocol, conducted in adult population or a scientific meeting abstract. Only one study currently in progress was included; it was selected from Clinical Trials, scheduled for completion in December 2021 [18], and the other study was selected from Clinical Trials but it was withdrawn before participants were enrolled [19].

At abstract screening, the inter-rater agreement was 99.4% with a Cohen’s Kappa statistic of 0.85 (p < 0.0001) demonstrating almost perfect agreement between raters. When examining ratings at the full-text level, levels of inter-rater agreement was 82.3% with a Cohen’s Kappa of 0.59 (p = 0.027) indicating moderate agreement.

Protocols for Instrumental Evaluation and Risk of Bias

From all researched literature, four studies refer to the construction of a protocol for VFSS in the pediatric population [20, 21, 26, 27]. Only the BaByVFSS Impairment Profile underwent a process of content construction and validation [20]. However, studies describing FEES protocols in the neonatal or pediatric populations were not found. The study of the FEES included in the systematic review, only mentions swallowing components analysis focusing on pharyngeal phase [24]. Therefore, publications aiming to develop a protocol for VFSS or FEES to analyze swallowing components in different age groups in the neonatal or pediatric populations were not found.

Analysis of swallowing components parameters were mentioned in four studies [19, 23,24,25] and seven studies cited the reference used to create the protocol. They were adapted from protocols suggested in book chapters [27, 28], built on the routine of care [29], adapted scientific articles such as the papers by [26], [30] and [22], respectively [21, 23, 24], or constructed from non-instrumental assessment of swallowing adding laryngeal and pharyngeal mechanism phases viewed only through radiological exams [22]. Other protocols did not mention any reference used for their conception [18,19,20, 25, 26, 33]. One study included patients with a specific clinical condition, Cerebral Palsy; however, it was focused to evaluating FEES results in the diagnosis of oropharyngeal dysphagia in these neurologic patients [24]. Table 3 describes the characteristics of each protocol inserted.

Table 3 Characteristics of selected protocols (n = 13)

Risk of bias evaluated through the QUADAS-2 tool was performed for all 11 studies. The DiPerna study is currently under way, and therefore there is no description of its results, making it impossible to perform bias risk analysis [18]. Another study, selected from the search in Clinical Trials was excluded from the evaluation, because although it has been already completed, its results and the authors’ contact information are not available [19]. A summary of QUADAS-2 assessment of included trials is described on Table 4.

Table 4 Risk of bias and applicability of QUADAS-2 (n = 11)

In the assessment of risk of bias, in the first domain of “patient selection”, only two studies were considered to be at low risk [21, 22], while for the second domain, “index test”, all studies inserted in the analysis were evaluated at low risk. The third domain, “reference test”, was not applicable in five studies, four of which were evaluated with low risk of bias [21, 22, 25, 29]. The fourth domain, "flow and time", received the low-risk classification in three studies [21, 22, 29]. Regarding the questions of applicability, six studies were considered "unclear" in the selection of patients, but the whole sample received a low risk of bias for applicability concerns about text index. The reference standard was not applicable in five studies, with the remaining six classified as low risk of bias.

Characteristics of the Target Population

The characteristics of the patients included in the studies can be found in Table 5. The etiology of the patients’ dysphagia comprises neurological, respiratory, cardiac, systemic, gastrointestinal, syndromic, social and psychological, anatomical and functional diseases, among others. Neonatal and pediatric populations ranged from 34 weeks GA to 19 years old. And there were no healthy participants in the sample of the studies included.

Table 5 Characteristics of inserted patient populationss (n = 13)

Discussion

This systematic review provides an overview of the 13 studies that propose protocols for instrumental assessment of swallowing by VFSS and FEES in the neonatal and pediatric populations. These studies demonstrate variability regarding evaluation of swallowing components, neonatal and pediatric diseases, as well as the proposed methodology for instrumental evaluation. Variations between these protocols reflect diversity of instrumental assessment for the target population, generating variability in swallowing assessment by the professionals who perform them [10, 12]. Therefore, swallowing assessment becomes prone to divergences, which may impact the diagnosis of these difficulties and therapeutic intervention measures [10, 12].

Lack of high-evidence level studies conducting standardized instrumental assessment of swallowing protocols in the neonatal and pediatric populations reflects the complexity of conducting robust research. The main problem is the fact that it is not possible to obtain a control group for these studies, since children without dysphagia should not, for ethical reasons, be exposed to VFSS because of radiation exposure or to the more invasive FEES [3, 5, 31]. It has commonly been assumed that due to swallowing components heterogeneity in different age groups, it is challenging to use a single protocol that aims to analyze their particularities at all stages of food transition and that also contemplates different stages of the children’s neurodevelopment [34].

Challenges Found in Researched Studies

Besides the difficulties to perform controlled studies for instrumental in the neonatal and pediatric populations as mentioned in the paragraph above, it is relevant to consider that the conception of an adequate protocol for instrumental assessment encounters some challenges, such as difficulties in the process of validation for cut of point where can be found the characterization of differences between the health and the impairment population [10]. Thus, it is considered unethical to expose health children to any of the instrumental exams, an invasive procedure such as FEES or an evaluation that exposes to radiation such as VFSS [3,4,5].

The difficulties to perform instrumental assessment through FEES in the neonatal population refer mainly to the discomfort while performing this procedure [4, 5]. Because this exam uses an endoscope that passes through the infant’s nose in to the larynx and it requires right afterward that the infant accepts some volume by mouth, so that the evaluator is able to assess the infant’s swallowing biomechanics [4, 5]. In some situations it is necessary to use topical analgesic as an auxiliary resource during the performance of FEES, and other use nonnutritive sucking to help calm the neonate during the exam [3,4,5, 35].

However, there are some other challenges when elaborating a validated protocol for VFSS in the neonatal and pediatric populations. Besides the aspects that have been already discussed previously, it must be highlight the limited financial resources of some institutions to access to this exam. It is crucial to consider the radiation that this exam exposes the infant or child [38]. The evaluator must follow recommendations to perform this instrumental assessment according to the guiding principle of radiation safety – ALARA principles (As Low As Reasonable Achievable) [36, 37]. In addition to this, it is necessary that the infant accepts enough volume to allow for oral feeding ability for diagnostic purposes [38].

At last, it must be acknowledged that any of the instrumental exams, FEES or VFSS, are deemed to be complimentary to the clinical assessment, because there are not truly representative of a routine meal given the atypical context that both exams expose the infant or the child and their parents/caregivers [38].

Risk of Bias

QUADAS-2 is designed to assess the quality of primary diagnostic accuracy studies [17]. The main objective of the tool used is to assess the risk of bias and the applicability of accuracy tests, so whether the overall methodological quality of the studies generates a satisfactory classification with low risk of bias in most measurement properties, the use of these questionnaires in daily practice and research can be justified. On the other hand, without a satisfactory improvement in the measurement properties of the inserted studies and without an analysis of psychometric properties it is not possible to recommend them for clinical or research. New protocols for analysis of swallowing function of the neonatal and pediatric population need to be developed using and reporting pre-established psychometric criteria, as recommended in the literature.

Identification of studies that performed validity and reliability analysis of protocols used for instrumental exams is restricted. Only one study performed construction analysis of its instrument and the risk of bias was considered low [20]. This study proposed a protocol for swallowing assessment by VFSS, called the Baby VFSS Impairment Profile, in bottle-fed infants [20], and the first steps for the validation were published in 2018 [32].

Considering that the FEES articles included in this study did not perform sample size calculation, randomization of evaluations, or blinding of evaluators [24], this systematic review did not find any study with a standardized and validated protocol to evaluate instrumental analysis through FEES. The study that compares safety of FEES assessment in the neonatal population through VFSS is fragile in its methodology, although it does not get to the point of invalidating its results (Table 3) [29]. This study concludes that FEES is a safe method for analyzing laryngeal penetration and aspiration, in line with current findings in the literature for the neonatal population [29, 34, 35].

Due to small number of studies that propose to perform a reliability and validity analysis of the existing instrumental evaluation protocols, it would be favorable to prioritize the development of studies that analyze psychometric characteristics of existing evaluation protocols to provide these instruments with more robust scientific evidence, promoting more effective clinical reasoning, and therefore stronger clinical decisions. Selecting appropriate protocols based on validated and reliable assessment scores gives greater confidence in documenting test findings.

Recommendations for Future Studies

Therefore, for future research with instrumental assessment for swallowing in the neonatal and/or pediatric populations should use sample size calculation for validation stage of protocols, evidences of validity, and internal structure as reliability (internal consistency and Test–retest) as well as adequate training to swallowing experts should be giver to perform and analyze the exams.

For the conception of protocols, researchers should consider the differences in swallowing biomechanics physiological and anatomic between neonatal and pediatric populations considering variances in consistencies and utensils used for each group, so that levels of oral feeding performance are taken into consideration according to the infant or child swallowing skills. These recommendations should be taken into consideration to minimize the risk of bias in the research, escalate methodology levels of these studies and allow better reproducibility and accuracy results between evaluators.

Limitations

Due to the heterogeneity of the included studies, their low methodological quality, and the absence of sufficient statistical data to perform a meta-analysis, it was not possible to perform a quantitative analysis of the data that would allow recommendation of protocols with higher level of evidence.

Conclusion

This systematic literature review identified and analyzed protocols available for instrumental assessment of VFSS and FEES in the neonatal and pediatric populations. Our literature search found 13 studies of these, eleven were assessed through QUADAS-2, and two classified with low risk of bias. One study found is in the process of standardization and validation of an instrumental assessment protocol for swallowing in bottle-fed infants through VFSS. No study that proposed a standardized and/or validated protocol for FEES was identified. It is relevant to mention that the variability of the methodology described in the articles did not allow accomplishment of a meta-analysis that could certainly contribute to more robust evidences.

Furthermore, this review identifies a lack of validated protocols used for instrumental exams that allow evaluation of swallowing components in different phases of swallowing using various consistencies and utensils throughout the feeding transition period within different age groups in the neonatal and pediatric populations. Therefore, development of research with rigorously controlled methodologies aiming to create a protocol to evaluate swallowing through VFSS and FEES for pediatric purposes is essential, as well as to consider factors such as sample size calculation for validation studies, evidences of construct validity and reliability interobserver for swallowing components with prior adequate training for researchers. All these steps may help heighten methodological quality of research using instrumental assessments in the neonatal and/or pediatric populations.