Introduction

The frequency of swallowing disorders (dysphagia) is increasing in young children secondary to advances in medical and life sustaining measures that improve the survival of infants born premature, with low-birthweight and with complex medical conditions [1,2,3,4]. The highest prevalence rates are reported for children with cerebral palsy and other neurodevelopmental disorders [5,6,7]. Significant morbidities associated with dysphagia include respiratory and nutritional compromise, the development of chronic feeding problems, and stressful child and caregiver interactions [8,9,10,11]. In addition, childhood dysphagia may signal the development of future speech and language delays [12, 13], poorer developmental outcomes [14], and the adult onset of disease and serious health conditions [15,16,17]. Early detection and prompt interventions are imperative to reduce the consequences of dysphagia and have resulted in the increased use of the videofluoroscopic swallow study (VFSS) in children.

The VFSS is the primary method used to evaluate feeding difficulties and swallowing disorders in bottle-fed babies. To date, VFSS procedures and the interpretation of images have not kept pace with the advances in our understanding of the swallowing mechanism. Despite efforts by clinicians to adopt best practices based on the existing evidence, VFSS procedures in children are variable and largely determined by the skills of the examining clinician, institutional wisdom, and information derived from experience with adults [18]. Correspondingly, there are no broadly applied standardized approaches to characterize the type and severity of the swallowing impairments in the bottle-fed babies assessed with videofluoroscopic imaging.

Herein, we describe the development of a standardized VFSS assessment tool, the BaByVFSS Impairment Profile (BaByVFSSImP©), for the quantification of a range of physiologic swallowing impairments in bottle-fed babies. The current investigation builds upon the years of experience from the adult Modified Barium Swallow Impairment Profile (MBSImP™) [19] now field tested by over 7000 clinicians in 27 countries [19, 20]. Similar to the MBSImP, the BaByVFSSImP goes beyond observations of aspiration and residue and aims to capture impairment in swallowing physiology. Although many components of swallowing are similar to those tested on the adult tool, several components and their scoring schema were modified based on the varying dynamics of swallowing in bottle-fed babies.

Our previous study demonstrated validation through expert consensus and rater-reliability for 24 components of swallowing function hypothesized as critical to swallowing in bottle-fed babies [21]. One component on the adult tool, esophageal clearance, was eliminated in the current project. This component was not consistently assessed during VFSS to adhere to practice mandates to keep radiation exposure “as low as reasonably achievable (ALARA)” by not unnecessarily repeating examinations of the esophagus [22, 23]. ACR-SPR Guidelines for examination of the esophagus and upper gastrointestinal tract specify that fluoroscopic studies be “optimally performed” while children are lying down for visualization of the nasopharynx to the gastric fundus [24, 25]. The primary purposes of the present study were to: (1) establish the construct validity of 23 physiologic components of swallowing and establish their functional groupings hereby referenced as ‘domains’, (2) organize the components and domains into a clinically logical tool (BaByVFSSImP), and (3) assess the clinical relevance of the BaByVFSSImP through association of domain scores with airway protection [penetration-aspiration scale (PAS)] and specific external indicators of well-being, including feeding recommendations and caregiver burden [26].

Method

Participants

Caregivers provided written informed consent for in- and out-patient bottle-fed babies referred for clinically indicated VFSS at Johns Hopkins University School of Medicine (JHH) and the Medical University of South Carolina (MUSC) between 2012 and 2016. Institutional Review Board approval was obtained at both sites. Babies were referred by primary physicians or subspecialists based on signs/symptoms of dysphagia or aspiration, or because they were considered high risk for dysphagia and aspiration based on their medical history [6]. Exclusion criteria included babies whose caregivers were non-English speaking and without adequate interpreter services, in foster care, under the care of the department of social services or wards of the state, whose caregivers refused to consent, or children who were not accompanied by an adult qualified to provide consent for study enrollment.

Data Collection

Caregivers provided demographic information and completed surveys about the health and medical status and feeding/swallowing histories as per standard clinical procedures. They also completed the Feeding/Swallowing Impact Survey (FS-IS), a validated instrument designed to measure the impact of children’s feeding/swallowing difficulties on caregivers [2, 8, 27]. Feeding status prior to each VFSS and the level recommended by the examining speech-language pathologist (SLP) after each VFSS were recorded. Eight possible levels of feeding status were identified from clinical experience and the literature, and rank-ordered on a scale from 0 to 7, with 0 representing the lowest score (no restrictions) to 7 being the highest or most severe restrictions. (Fig. 1).

Fig. 1
figure 1

Feeding level recommendations

Standardization of Radiation Imaging

Dose and image quality of the fluoroscopic equipment were standardized by pediatric radiologists at each site. The lowest level of magnification (approximately 6.7 FOV) needed for visualization of space between the laryngeal surface of the epiglottis and arytenoids was used. The fluoroscopy acquisition rate was continuous, and images were recorded at the standard resolution of 30 frames/s in digital format.

Radiologic Protocol

Each VFSS took place in a standard radiology fluoroscopy suite and was conducted jointly by a pediatric SLP and pediatric radiologist. Babies were positioned in their typical or optimal feeding position (head midline and neck in a neutral position, or pre-determined “best” position) in a Multiple Application Multiple Articulation seating system (MAMA Systems Inc., Oconomowoc, Wisconsin, USA), a specialized chair for pediatric VFSS. The visualization field included the lips anteriorly, nasal cavity superiorly, cervical spinal column posteriorly, and the pharyngoesophageal segment inferiorly.

Commercially available standardized preparations of room temperature liquid barium (Varibar® thin and nectar) were presented as clinically indicated. Contrast was delivered by nipples and bottles that were chosen and adapted according to the clinical characteristics and needs of the baby. Fluoroscopic images were acquired during the first series of suck-swallow sequences and then randomly during periods of subsequent sequences—a measure used to limit exposure to radiation [18, 28, 29]. Outside the scope of this study, the SLP proceeded with clinically indicated compensatory strategies (e.g., bolus or nipple modifications or positional changes) for optimization of airway protection and bolus clearance. Consistent with the MBSImP scoring, therapeutic modifications were recorded for clinical purposes but not included in the scored observations [30, 31].

Data Capture, Storage and Sharing

VFSS data were recorded using digital video imaging [digital swallowing workstation™ (DSW), Model 7200 KayPENTAX, Lincoln Park, New Jersey, USA] for signal acquisition, digital storage, and retrieval of the swallowing data. Each VFSS exam was downloaded to computer storage media, saved on the DSW, de-identified, converted from native format to universal digital video format (.mpg) for scoring, and shared between the two institutions using the password protected Research Electronic Data Capture (REDCap) database system.

Component Scoring

Seven raters were ASHA (American Speech-Language Hearing Association) certified SLPs with 2–28 years (median: 7 years) of experience, were like-trained, and achieved ≥ 80% scoring reliability as requisite for study participation [21]. SLP rater training included didactic sessions led by MPIs, independent practice and testing as previously detailed in a prior publication [21]. Five of the raters had experience with pediatric dysphagia, which included conducting and interpreting VFSS examinations in bottle-fed infants and young children. Two of the raters had worked exclusively with adult dysphagic patients and completed training with the MBSImP approach [21].

Raters, who were blinded to the clinical information and all identifiers, scored the exams using slow motion and frame by frame playback. Components were scored using a rank-ordered numeric scale (highest number representing the worst performance) ranging from 3 to 5 possible scores per component with each representing a distinguishable observation and ambiguity eliminated as much as possible based on results from the previous reliability study [21]. Consistent with our clinical scoring approach for quantifying observations of swallowing impairment in adults [19, 31, 32] and our reliability testing in bottle-fed children [21], clinicians reviewed all images from each swallow study to identify the most impaired components. The most impaired component was assigned the highest (i.e., the most severe) score and termed the Overall Impression Score (OI). Raters also scored the maximum Penetration/Aspiration Score (PASmax) across all swallows in the series [26] (Table 1).

Table 1 Domains with components and score variants

Statistical Analysis

Construct Validity

Data available for all 300 babies were included in computing a polychoric matrix because the scores were not multivariate normal. Only the thin liquids swallows were analyzed in this investigation. Components with missing data on greater than half of the sample were not included in the matrix. The correlation coefficients from the matrix were input into a Confirmatory Factor Analysis (CFA). The authors proposed a priori that the 23 swallowing components would aggregate into 5 factors or ‘domains’ based on their purportedly close association and shared roles in swallowing function as described in the literature. Minimal threshold to demonstrate loading of a component onto a factor (domain) was established as a loading ≥ 0.5. The hypothesized factors (or domains) to explain the desired latent construct, swallowing impairment, included: Nutritive Sucking/Oral Containment and Clearance, Pharyngeal Swallow Initiation, Pharyngeal Containment and Clearance, Airway Protection, and Esophageal Entry and Clearance.

Exploration of Clinical Relevance

Factor scores could not be calculated for each infant because the polychoric correlation matrix was used to generate the factor loadings rather than standardized scores. Therefore, we summed the component scores that loaded on a particular domain for each individual to derive domain scores. We examined the relationship of summative domain scores with: PASmax, recommendations for oral feeding intake status following the VFSS, and FS-IS. Given the ordinal nature of the score data, Spearman Rank Order correlation was used to measure the strength and direction of association between the ranked variables.

Demographic and clinical characteristics were summarized using medians and interquartile ranges (IQRs) or counts and percentages and compared across sites using Wilcoxon rank-sum tests or Fisher’s exact tests as appropriate. Analyses were performed using SAS version 9.4 (SAS Institute Inc., Cary, NC). All tests were two-sided and significance was set at P < 0.05.

Results

A total of 300 consecutive in- and out-patients referred for a VFSS met criteria for initial analysis. There were 121 females and 179 males with a median age of 3 months 1 day (interquartile range [IQR]: 1 month 4 days–7 months 4 days), and a relatively equivalent sex distribution between the two sites (Table 2). One site enrolled a significantly greater number of Hispanic and non-white participants, and children from lower income families. Children presented with one or more diagnostic conditions with equal representation at both centers, except for a significantly higher number of children with pulmonary diagnoses from one site and more cardiac diagnoses at the other site.

Table 2 Demographic and clinical characteristics of all infants and stratified by institution (N = 300)

Fluoroscopy Time and Barium Dose

The mean (SD) fluoroscopy time across both sites was 106 (51) s. The median consumption of thin liquid barium by participants during the swallowing series was 15 mL, with a maximum consumption of 80 mL.

Component Scores and Functional Domains

From the 300 examinations included in computing the polychoric matrix, the effective sample size was reduced to 241 based on the component that had the least amount of non-missing data (Initiation of Nutritive Sucks component). The score for the Oral Residue component was missing in 52% of the sample and therefore, was not included in the factor analysis. Twenty-one of the 22 scored components factor loaded at ≥ 0.5 onto the five factors shown in Table 3. The Lip Closure component failed to load onto any component. Of interest, the Late Laryngeal Vestibular Closure component loaded onto two factors (henceforth, domains). It is grouped with the Airway Invasion/Laryngeal Closure domain because of its more robust association with other components loading in this domain.

Table 3 Confirmatory factor analyses for 22 swallowing components

The five-factor solution was confirmed from the CFA, however, specific components comprising the domains differed from those that were hypothesized. (Table 3) As such, the domains were renamed to reflect the latent construct or functional domain that the observed variables explained: Lingual Motion/Pharyngeal Swallow Initiation, Palatal-Pharyngeal Approximation, Airway Invasion/Laryngeal Closure, Aspiration, and Pharyngeal Transport and Clearance. The resulting BaByVFSSImP was comprised of 21 components of swallowing impairment.

Domain Score Associations with Airway Invasion (PASmax), Feeding Recommendations, and Impact on Caregiver (FS-IS)

In accordance with the theory that measures more proximal (physiology) to the intended function (swallowing) and those more distal to it (quality of life) are expected to be related but not in a robust manner [32], moderate or clinically modest correlations were set at approximately 0.20 for associations between the domain scores and the external indicators of well-being (i.e., feeding status/recommendations and caregiver impact) [33, 34]. Strong positive correlations were found between Airway Invasion/Laryngeal Closure domain (rS = 0.66) and Aspiration domain (rS = 0.88) with PASmax scores. Pharyngeal Transport and Clearance (rS = 0.21) showed a modest relationship with PASmax. Clinically modest correlations were also found between feeding recommendations (made by the examining clinicians following the VFSS and blinded to scores) and Aspiration domain (rS = 0.34), and Pharyngeal Transport and Clearance domain (rS = 0.22). There were no clinically significant relationships between domain scores and subscales on the FS-IS. (Table 4).

Table 4 Association of domain summative scores, PASmax, VFSS intake recommendations, and FS-IS

Discussion

This study represents the first known prospective study focused on the quantitative assessment of swallowing observations from VFSS in bottle-fed babies directed toward minimizing unnecessary variation in current clinical practice. The BaByVFSSImP was developed to promote valid and reproducible examination results between clinicians and clinical settings to facilitate consistent communication among providers and to guide interventions that appropriately target swallowing impairment across the continuum of care. As such, the clinical contributions of this work are highly significant. Our previous work established the content validity of the tool through expert consensus and confirmed the reliability of the measures when scored by like-trained SLPs [21]. This investigation is an extension of that work and set out to test the construct validity of the tool and explore the clinical relevance of the measures.

Through the use of conventional methods of validity testing, 21 components of swallowing grouped into 5 functional domains. One component was eliminated by the CFA (Lip Closure) as it did not load on any factor (domain) because it either measured something different from the other factors or it was not within the imaging field on VFSS recordings. Modification of the VFSS to include lip closure should be dependent upon information necessary for clinical decision-making. Even though oral residue was shown to be a reliable observation on our original reliability study [21], scores for this component were missing on more than half of the VFSS samples rated in this investigation. Interviews with raters indicated this component was very difficult to discern because the rapid sequential swallowing characteristic of bottle-feeding and/or the presence of the nipple within the baby’s mouth obscured the raters’ view of oral residual between swallows. It should be noted that the results of the factor analysis are limited to the cohort under investigation, and the omitted components may have clinical relevance in a larger set of patients. We will evaluate the clinical relevance of these two components as additional clinicians are trained and data are acquired across multiple sites.

The grouping of the physiologic swallowing components into functional domains demonstrates that when one of the components in a domain is impaired, it is highly likely that the other components in that domain are also impaired. Although a hypothesized five-factor solution emerged from the CFA, the combination of components loading onto factors was different from those originally proposed. As such, the domains were re-labeled and logically reordered to reflect the swallowing function accomplished collectively by the components contained therein.

The modest to strong associations of the BaByVFSSImP domain scores with PASmax scores point to the relevance of the airway closure and pharyngeal clearance components within those domains and airway invasion. However, the lack of redundancy between physiologic scores and penetration/aspiration measures speak to the notion that one observation is neither necessary nor sufficient to explain the other [32]. Clinicians not only need to identify the aspiration event during a VFSS, but as importantly determine its physiologic source that should become the target of feeding/swallowing intervention.

Three of the BaByVFSSImP summative domain scores showed clinically significant associations with feeding recommendations made by the examining clinicians who were blinded to the scores of the rating SLPs. This result is not surprising because mechanisms of pharyngeal clearance and airway invasion likely impact the feeding recommendation and support the inclusion of the items in the domain. On the other hand, the BaByVFSSImP summative scores are not sufficiently robust to test the association with swallowing impairment and caregiver burden. Future studies will test the associations of individual component scores, and other co-morbidities that may be better able to detect the impact of swallowing impairments on caregivers.

Study Limitations

While the distribution of the component scores demonstrated a fairly good range, they were predominately non-normal and may support further data reduction by collapsing the range of scores for some components. That is, some score variants for a given component rarely occurred and therefore may not sufficiently contribute meaningful information regarding impairment of that particular component. Further field testing in larger numbers of patients and subsequent refinements are predicted as has been seen in the adult model of the MBSImP. Also, larger numbers of patients will be necessary to identify meaningful cut-points that classify patients into severity levels based on a combination of scores and functional measures.

In addition, the current investigation included only thin liquid boluses. Components may load differently onto the factors (domains) when thicker liquids are compared to the thin liquids results reported herein. Future planned investigations will explore the differences in component scores between thin and thick liquid boluses during bottle-feeding.

Finally, heterogeneity of the patient sample is both a limitation and a strength. Our intention was to develop a tool for assessing swallowing physiology regardless of patient diagnosis for clinical translation with all bottle-fed babies, however, the patterns of impairment profiles are likely to differ by patient diagnosis [35].

Conclusion

In summary, the BaByVFSSImP is a novel tool that promotes standardized assessment of physiologic swallowing impairment using visual observations of VFSS recordings in bottle-fed babies. The content and construct validity of the tool have been established. The tool holds promise for the identification of physiologically based treatments for dysphagia intervention, aiding in guiding clinical decision-making regarding enteral feeding, and tracking the trajectory of swallowing disorders through development in young children. The quantification of physiologic swallowing impairment afforded by this novel instrument has potential to serve as a performance metric in clinical trials, identify phenotypic profiles of swallowing impairment, and to predict feeding and swallowing outcomes and associated communication-cognitive development.