Introduction

The safety and efficiency of swallowing is dependent upon the complex, rapid sequential contraction and relaxation of 30 pairs of bilaterally innervated muscles of the head and neck coordinated by five cranial and three peripheral nerves [1]. Establishing norms for not only the duration of events in the healthy swallow, but the sequence in which these events occur, allows investigators and clinicians to make comparisons between populations or to longitudinally track changes within an individual [2]. For example, one could examine whether airway closure is achieved prior to the point at which the bolus reaches the upper esophageal sphincter (UES). Failure to adhere to this sequence (laryngeal vestibule closure prior to arrival of the bolus at the UES) may result in compromised swallowing safety, given that the bolus would be adjacent to an open airway. In 2007, Mendell and Logemann conducted a review of studies examining temporal sequencing in the healthy swallow. They reported a great deal of variability in the use of measurement protocols. Importantly, they determined that several studies examined sequence in relation to a reference event—however, the reference event selected was not consistent across studies [2].

Kendall et al. [3] described an alternative approach to investigating swallow sequencing by quantifying the frequency with which 12-paired events occurred during the pharyngeal phase of a healthy swallow. The paired events were derived from the temporal events listed below (described in Kendall’s original publication on page 87). For ease of interpretation, we have provided our own variable names in square brackets and these will be used in the remainder of this manuscript.

  • BP1—Arrival of bolus head at UES [bolus at UES].

  • AEstart—Beginning of superior arytenoid movement [laryngeal elevation].

  • AEclose—First frame depicting laryngeal vestibule closure [laryngeal closure].

  • H2—Maximum anterior–superior hyoid displacement [hyoid max].

  • Pop—Beginning of UES opening [UES opening].

  • PESmax—Point of maximum pharyngoesophageal segment distention [UES max].

  • HL—Closest approximation of hyoid and larynx [hyolaryngeal approximation].

  • PAmax—Point of maximum pharyngeal constriction [max PC].

In their sample of 60 healthy individuals (30 male) aged 18–62, they found a significant degree of variability in the order in which paired sequences occurred. This variability reportedly increased with smaller bolus sizes. However, four obligatory sequences (regardless of bolus volume) were identified:

  1. 1.

    Laryngeal elevation prior to UES opening.

  2. 2.

    UES opening prior to (or simultaneously with) bolus at UES.

  3. 3.

    UES opening prior to hyolaryngeal approximation.

  4. 4.

    UES max prior to max PC.

They also identified the most commonly occurring sequence of events as follows: laryngeal closure < UES opening  bolus at UES < hyoid max < UES max < hyolaryngeal approximation < max PC. This sequence was found to occur 25% (45/180) of the time [3].

In 2014, Molfenter, Leigh, and Steele set out to replicate this study, in order to confirm the findings in a new sample restricted to young healthy adults (< 45 years old). Their study expanded on the original Kendall study given that, in addition to bolus size, barium viscosity and barium concentration were manipulated as well. Further, their study included three swallows per bolus condition in order to investigate sequence consistency across repeated trials. Molfenter and colleagues confirmed only two of Kendall et al.’s [3] obligatory sequences:

  1. 1.

    Laryngeal elevation prior to UES opening.

  2. 2.

    UES opening prior to hyolaryngeal approximation.

Additionally, Kendall et al.’s [3] most common sequence was only observed on 4/293 trials. In fact, the Molfenter study [4] identified 214 different event sequences, with only 3 sequences occurring 4 or more times. Neither bolus volume nor viscosity were found to influence the degree of variability in the sequence of swallow events. One exception was that smaller volumes resulted in increased variability for the UES opening prior to hyolaryngeal approximation sequence.

Ultimately, Molfenter and colleagues [4] concluded that a young healthy swallow is characterized by variability in the sequence of temporal events and hypothesized that this allows for flexibility in the face of unexpected demands. They identified variability in the swallow sequence of individuals with dysphagia as an area for future research. Specifically, they proposed that reduced variability in this population may impact their ability to adapt to different ways in which the bolus might travel through the pharynx [4].

It is widely accepted in the field that changes to swallowing occur as a normal part of the aging process. Specific age-related changes that may impact the temporal sequence of swallow events include increased oropharyngeal transit time [5], delayed initiation of the pharyngeal swallow [6,7,8,9,10,11], reduced tongue driving force and pharyngeal contractions [12, 13], reduced muscle strength and coordination [14,15,16], and reduced hyolaryngeal elevation [7].

Both the Kendall et al. [3] and Molfenter et al. [4] studies examined the variability of paired sequence occurrence in healthy adults under age 62. However, there is a significant gap in the literature regarding the impact of aging on the swallow sequence. Thus, this study specifically addresses the following questions:

  1. (1)

    What proportion of healthy older individuals adheres to Kendall’s original 4 obligatory sequences?

  2. (2)

    Do obligatory sequences differ by age category?

  3. (3)

    Do the latencies between event pairs in obligatory sequences differ by age category?

  4. (4)

    What is the most common overall event sequence in a healthy older population?

  5. (5)

    Does variation in swallow sequencing (represented by the number of unique overall sequences) differ by age category?

This work has important clinical ramifications. Presently, it is unknown how swallow sequence changes in the context of aging. Examining this is crucial, given that disordered swallowing typically occurs in the second half of life (secondary to stroke, cancer, degenerative disease etc.). Examining normal swallowing sequences in healthy aging adults will provide a normative reference for distinction between age-related changes to sequence variability, and changes that are seen in older dysphagic populations.

Materials and Methods

Participants

This IRB-approved study represents secondary analyses of two videofluoroscopic (VF) datasets: 20 healthy young adults (10 male, ages 22–45 with a mean age of 31.5) from the Molfenter and colleagues 2014 study and 23 healthy older adults (10 males and 13 females, ages 65–90, with a mean age of 74.7). Inclusion criteria for this analysis required the confirmation of safe and efficient swallowing on all boluses, using the Penetration–Aspiration Scale (PAS) [17] and Normalized Residue Ratio Scale (NRRS) [18]. Swallows with PAS scores of 1 or 2 were considered safe [19] and swallows with no significant residue (NRRSv < 0.082 and NRRSp < 0.067) were considered efficient [20]. Exclusionary criteria included a history of dysphagia, neurological insult/injury, and/or head and neck cancer or surgery. Swallow sequence data from the healthy young dataset have been previously published [4]; however, given that the healthy older dataset was collected with fewer swallow conditions, this analysis required that we identify and exclude swallow conditions (10 mL thin liquid at 20% w/v and the 5 mL thin liquid at 40% w/v) from the original healthy young dataset and recalculate proportions of events and sequences as required. Thus, it should be acknowledged that the data reported here for the healthy young dataset are different than those reported in the 2014 publication.

VF Procedure

For the healthy older dataset, VF was collected on a GE Advantix digital fluoroscope (GE Healthcare) at 30 pulses per second, and captured on a KayPENTAX digital swallowing workstation at 30 frames per second. Nine swallows per participant were included in this analysis: 3 × 5 mL thin liquid barium, 3 × 20 mL thin liquid barium, and 3 × 5 mL nectar thick barium. Barium stimuli were standard Varibar™ (Bracco Imaging); however the thin liquid barium was prepared to match the 20% w/v concentration of ‘ultra-thin’ stimuli used in the Molfenter [4] study. This preparation has been shown to improve detection of penetration–aspiration [21]. The order of stimuli was intentionally not randomized to minimize risk of potential aspiration of large volumes (5 mL prior to 20 mL) and to minimize contamination of post-swallow residue to later occurring swallows (which is more likely with nectar thick liquids). The procedures for data collection in the healthy young dataset are consistent with above and have been previously published [4].

Sequence and Latency Analysis

Individual swallows were spliced out of the full-length study for randomized analysis and identification of each of the following swallow events: laryngeal elevation, laryngeal closure, UES opening, bolus at UES, hyoid max, hyolaryngeal approximation, UES max, and max PC. These events were identified by the first author using frame-by-frame viewing of each swallow in ImageJ software (National Institutes of Health, Bethesda, MD) as per the guidelines laid out by Kendall et al. [3] using the operational definitions and modifications described by Molfenter and colleagues [4]. In the healthy older dataset, a total of 14 swallows were excluded from the analysis: seven due to inability to visualize the hyoid, two due to poor image quality, and five due to piecemeal deglutition. In the healthy young dataset, a total of 7 swallows were excluded due to piecemeal deglutition. As a result, 193 swallows from healthy older adults and 173 swallows from healthy young adults were included in the final analysis. Once the frame of each swallow event was identified, the order in which paired events occurred was determined. Data were then analyzed to quantify the proportion of swallows that obey Kendall’s four original obligatory sequences (Question 1), and compared across age groups (Question 2). Events occurring 97% of time or greater were considered obligatory [4]. For latency analysis, the frame on which the first event in a pair occurred was subtracted from the frame of the second event and converted to milliseconds by dividing by 30 (given the data were collected at 30 frames per second) and multiplying by 1000 (Question 3). Finally, the order of events was characterized for each swallow and tabulated (Questions 4 and 5).

Statistical Analysis

All data were analyzed using SPSS version 24. Questions 1 and 4 were answered using descriptive statistics. Chi-square statistics were used to compare proportions of obligatory sequencing by age (Question 2), as well as to compare the proportion of unique swallow sequences by age (Question 5). Repeated measures ANOVAs were used to compare the event latency by age category while controlling for repeated boluses per condition (Question 3). To control for the multiple comparisons problem, Bonferroni adjustments were applied and two-tailed p values < 0.0125 were considered statistically significant.

Results

Twenty percent of the data for healthy older adults, randomly sampled across participants and stimuli, was subject to inter- and intra-rater reliability ratings. Inter-rater reliability was conducted by a trained graduate student with experience in biomechanical analysis of swallowing. Reliability was examined in two ways. First, reliability of the adherence to Kendall’s sequences [3] was examined using Cohen’s Kappa scores [22]. These results are reported in Table 1. Next, the reliability of event latencies was examined using laryngeal elevation as a reference point. Latencies were calculated for all measured events, with the exception of laryngeal elevation, which by definition had a fixed latency. Reliability was established using two-way mixed intra-class correlation coefficients and the results are reported in Table 2. All latency measures achieved reliability scores of ‘good–excellent’ (> 0.75) [23]. Adequate reliability for the healthy young dataset was established and reported in the original Molfenter study [4].

Table 1 Intra- and Inter-rater agreement for adherence to obligatory sequences
Table 2 Intra- and inter-rater intra-class correlation coefficients (ICCs) and corresponding 95% confidence intervals (CI) for event latencies from laryngeal elevation

Question 1. What proportion of healthy older individuals adheres to Kendall’s original 4 obligatory sequences? Table 3 presents the frequencies with which Kendall’s obligatory sequences occurred, broken down by bolus volume and viscosity. In accordance with Molfenter and colleagues’ [4] definition, a sequence was said to be upheld in each cohort if it occurred at least 97% of the time, across conditions. The following sequences were found to hold true: laryngeal elevation prior to UES opening, UES opening prior to hyolaryngeal approximation, and UES max prior to max PC. The event pair UES opening before/with bolus at UES was not confirmed as obligatory. No clear patterns with respect to sequence differences by bolus volume or viscosity were noted. Additional figures that incorporate swallow trial, bolus volume, and bolus viscosity can be found in the Appendix.

Table 3 Frequency (%) distribution of obligatory sequences across bolus viscosity and volume

Question 2. Do obligatory sequences differ by age category? Adherence to obligatory sequences was then compared between healthy young and healthy older datasets using Chi-square statistics (Table 4). Significant differences between age groups were found for two sequences: UES opening before/with bolus at UES (χ2 = 193.154, p < 0.001) and UES max before max PC (χ2 = 35.137, p < 0.001). For completeness, these comparisons were then re-tested in each bolus condition (5 mL thin, 20 mL thin, and 5 mL nectar), and confirmed to be significant for all comparisons.

Table 4 Frequency distribution of obligatory sequences across age groups

Question 3. Do the latencies between event pairs in obligatory sequences differ by age category? Latencies (in milliseconds, ms) between the first and second event in an obligatory sequence are displayed in Table 5 below. Once again, significant differences were discovered between age groups for UES opening before/with bolus at UES and UES max before max PC. Younger subjects had longer (negative) latencies between UES opening and bolus at UES and older subjects had significantly prolonged (positive) latencies between UES max and max PC. Note that UES opening before/with bolus at UES had noticeably shorter latencies regardless of age compared to the other three event pairs.

Table 5 Event latencies of obligatory sequences

Question 4. What is the most common overall event sequence in a healthy older population? Our analysis revealed marked variability in the sequence of swallow events. Kendall’s most common event sequence (Laryngeal closure < UES opening  bolus at UES < hyoid max < UES max < hyolaryngeal approximation < max PC) occurred on only two occasions (1%) in the healthy older dataset. Only five sequences were found to occur more than five times, accounting for between 2.5 and 6.2% of the sample. The most frequently occurring event sequence in this dataset was UES opening/bolus at UES < laryngeal closure < UES max < hyolaryngeal approximation < hyoid max < max PC, which was observed 12 times. This same pattern was found to occur only once in the healthy young dataset. The most frequently occurring sequence in that dataset was bolus at UES < laryngeal closure < UES opening < UES max < hyolaryngeal approximation/max PC < hyoid max, which occurred four times. This sequence was not seen in the healthy older dataset.

Question 5. Does variation in swallow sequencing differ by age category? Overall, 105 unique sequences were identified in 193 analyzed swallows (60.7%) from the healthy older dataset. This represents a significant decrease in variation when compared to the healthy young sample. There were 142 unique sequences identified in the relevant swallows from the healthy young dataset out of 173 swallows (82.1%). This difference is highly significant (χ2 = 31.8472, p < 0.0001). Proportions of unique sequences were also tested by swallow condition, as detailed in Table 6 below. Significant reductions in variation in the older group were observed for the 5 mL nectar and 20 mL thin conditions but not the 5 mL thin condition. Finally, the proportion of unique swallows at the individual participant level was compared descriptively across age groups. Visual inspection of this data reveals a trend toward decreased variation at the participant level as well (Table 7).

Table 6 Percentage of unique sequences by swallow condition
Table 7 Percentage of unique sequences per participant by swallow condition

Discussion

In this study, we examined four swallow event pairs that were previously identified by Kendall [3] as obligatory and later tested by Molfenter [4] in a healthy young dataset. The present study contributes novel data from healthy older adults (> 65 years old). The strength in this design is that the parameters, methods, and measures were nearly identical to those used in the healthy young dataset [4]. The exception is that more swallow conditions were collected in the healthy young dataset (3 × 10 mL and 3 × 5 mL at 40% w/v barium). These extraneous swallows were identified and excluded from the comparison analyses.

Two event pairs were found to be obligatory in both the healthy young and healthy older populations: laryngeal elevation before UES opening, and UES opening before hyolaryngeal approximation. Both of these results are expected, given the strong physiological ties between each event pair. Regarding laryngeal elevation before UES opening, this result is expected in healthy populations, given that (1) hyolaryngeal elevation is a key precipitating factor of UES opening [24], and (2) the laryngeal elevation event is a component of hyolaryngeal elevation. It remains to be seen whether this sequence remains obligatory in certain dysphagic populations. Regarding UES opening before hyolaryngeal approximation, in their definition of hyolaryngeal max Kendall and colleagues [3] specifically state that larynx-to-hyoid approximation occurs while the UES is open. This makes the sequence in question obligatory by definition, and of limited interest from a clinical standpoint.

One event pair, UES max prior to max PC, was confirmed in the healthy older sample, but not in the healthy young, with a statistically significant difference in adherence observed. A corresponding significant increase in event latency was observed in older versus younger datasets as well. This finding is consistent with literature on aging, which shows that pharyngeal contraction interval (onset-to-peak pharyngeal contraction) has been found to increase with age, while UES relaxation interval (onset-to-peak UES opening) has been found to decrease [25, 26]. Post hoc analysis of the current datasets revealed that maximum pharyngeal contraction was the last event to occur 99% of the time in healthy older adults vs. 36% in healthy young. This finding may have important implications related to post-swallow residue in dysphagic populations, particularly those with UES dysfunction.

Finally, UES opening before/with bolus at UES, was not confirmed in either population. According to the operational definition used in both the Molfenter [4] study and the current study, this sequence requires that UES opening occurs prior to or simultaneously with the arrival of the bolus at the base of the pyriform sinuses. Molfenter and colleagues noted that differences in the definition of bolus at UES may account for some of the inconsistency in the findings [4]. However, this explanation does not sufficiently explain the significant difference found between healthy young and healthy older populations. The directionality of this finding was particularly surprising (86% adherence in healthy older vs. 12% adherence in healthy young), given the reports that initiation of the pharyngeal swallow is delayed in healthy older adults, compared to healthy young [6,7,8,9,10,11, 27]. However, examination of the latencies for these paired events reveals less than a 80 ms difference between the two events. These findings are similar to reports by Logemann et al., who found a 30–40 ms difference between the point at which the bolus arrives at the level of the upper pyriforms and the point of UES opening, in both healthy young and healthy older populations [7, 25]. These findings confirm that bolus arrival at the pyriforms and UES opening are highly coupled events in healthy individuals, and lead us to question the clinical relevance of examining this sequence. Clinical utility is further brought into question when considering the fact that this sequence had the weakest reliability of the four, which may likely be attributed to the fact that these two events usually occur within one to two frames of each other. It remains to be seen whether this event pair, can meaningfully distinguish functional from impaired physiology in patients with dysphagia, especially in those patients with sensory deficits. Presumably, when an individual has reduced pharyngeal sensation, the bolus may pool at the base of the pyriforms for a prolonged period prior to UES opening.

While not one of our primary research questions, this dataset allowed us to look at trial-to-trial variability within a swallow condition by obligatory sequence. This is possible because each participant swallowed three-repeated boluses in each condition. These results are outlined in the Appendix for the healthy older dataset and appear in the original publication for the healthy young dataset. Interestingly, for UES opening before/with Bolus at UES, the least amount of variation was seen for the 20 mL condition. Similarly, the greatest decrease in overall sequence variability was noted in the 20 mL condition. This finding corroborates both Molfenter and Kendall’s findings that smaller bolus volumes appear to have greater variation. It is possible that this decrease in variation is explained by the fact that the 20 mL condition most closely approximates natural drinking behaviors [28]. While there appear to be differences by bolus trial number within bolus conditions for the UES opening before/with Bolus at UES sequence, no clear patterns of order affect can be elucidated across conditions. No notable differences were observed for the remaining three sequences, given the high degree of adherence across bolus conditions.

In regards to overall sequence, while a most common sequence was identified in the healthy older population, this sequence did not occur frequently enough to be deemed clinically relevant. Of interest, is the finding that significantly less variation occurred in the healthy older population, when compared to healthy young–a finding which is consistent with previous research [7]. This appears to hold true across swallow conditions at both the individual and group level. A possible explanation for this lies in increased pharyngeal transit times (PTT) observed in aging. While a recent systematic review examining swallow timing in aging notes that findings of increased PTT with aging are sparse, a lack of direct comparisons between age groups within reviewed studies was also acknowledged [27]. A direct comparison between datasets used in this study reveals a trend towards increased PTT in aging, for both the 5 and 20 mL thin conditions. In the healthy young dataset, average PTT was reported to be 471 ms and 528 ms for 5 and 20 mL thin liquid boluses, respectively [29]. In the corresponding healthy older adults, we found that the average PTT was 614 ms for 5 mL thin liquid boluses and 699 ms for 20 mL thin liquid boluses. The shorter PTT in younger individuals requires that swallowing events happen more rapidly, with shorter latencies, thus increasing the likelihood that the order of events will vary.

This study is not without limitations. First, this study is limited by the narrow range of volumes, viscosities, and textures tested. While notable variation was not observed between 5 mL thin and 5 mL nectar boluses, it is plausible that greater variation may have been seen, given a wider range of stimuli. It remains to be seen whether sequence variation differs between liquid and solid stimuli. An additional limitation is that while the sample was sex-balanced, sex differences were not directly tested. Finally, this study could be strengthened with the identification and inclusion of bolus past mandible (BPM). The BPM event is incorporated into variables that quantify the onset of pharyngeal swallowing. Given that pharyngeal swallow trigger is known to be delayed in aging populations [6,7,8,9,10,11], this may have served as an interesting point for comparison.

Further testing of certain event pairs in dysphagic populations, specifically UES opening before/with bolus at UES and UES max prior to max PC, appears to be warranted. It is the opinion of these authors that future research examining event pairs should be expanded to pairs of events that have direct implications on the safety and efficiency of the swallow. One such example is laryngeal elevation before/with BPM. This sequence would capture the beginning of airway closure in relation to the point that the bolus enters the pharynx. Changes in this sequence may directly impact swallow safety. Another event pair warranting exploration is laryngeal vestibule opening from UES opening and/or UES max as this too may yield important information regarding swallow safety. Lastly, it is the opinion of these authors that future research should include a ‘naturally-occurring sip’ condition, as this study has provided anecdotal evidence that in healthy populations, variability may be induced by our manipulation of bolus volume.

Conclusions

Our findings suggest that paired swallow event sequences may not be robust across the age continuum and that variation in overall swallow sequences appears to decrease with aging. Findings regarding obligatory sequence adherence and, perhaps more importantly, latency, provide normative references that may be used as a basis of comparison for individuals with dysphagia. This study has proposed relevant sequences for future studies, added support to a body of evidence that indicates increased pharyngeal transit times with aging, and perhaps most importantly, provided preliminary evidence for the impact of bolus volume on swallow variability.