The coordination of events during swallowing is likely to have a major impact on swallowing function. For example, if swallowing gestures occur out of sequence or without regard for the location of the bolus, poor airway protection and ineffective pharyngeal transit of the bolus may result. Previous studies have established a range of normative values for gesture timing and bolus movement relative to the onset of bolus pharyngeal transit, but the sequence of swallowing gestures demonstrated by individual subjects has not previously been evaluated [1,2,3]. In particular, there are no studies of the sequence of bolus gestures in which the location of the bolus is incorporated into the analysis.

The UC Davis Dysphagia Team has developed a quantitative method of swallowing evaluation using videofluoroscopy [1,4]. The timing of 14 swallowing gestures and 4 points of bolus transit are measured in each patient examined by the Team and plotted against a set of normative data developed from the evaluation of 60 normal volunteers. Both the timing and the sequence of swallowing gestures and bolus transit are graphically depicted by this method (Fig. 1). Occasionally, a patient's values fall inside the single standard deviation from the normal mean for a given parameter (see inserts in Fig. 1), but the sequence of swallowing gestures in that individual does not mirror the general sequence determined by the mean timing of gestures from the normative data. This finding causes concern that the dysphagia complaints in such a patient could be due to a disorder of gesture coordination rather than to a disorder of gesture timing. The question remains, however, as to the amount of sequence variability that is normal and whether or not certain event sequences are preserved across changes in subject and bolus characteristics.

Figure 1
figure 1

Patient timing data measured from the dynamic swallow study can be plotted against the mean and standard deviation of data compiled from swallow studies performed in 60 normal adult volunteers.

This study proposed to evaluate variability in the sequence of specific events that occur during deglutition and establish normative data for comparison to patient data. Sequence analysis was under- taken of the dynamic swallow studies performed in our population of 60 normal controls. Included in the analysis is an evaluation of events occurring during hypopharyngeal bolus transit defined as the time between the arrival of the bolus in the vallecula and the arrival of the tail of the bolus into the esophagus.

Materials and Methods

Dynamic videofluoroscopic swallow studies were performed on 60 consecutive normal adult volunteers. The study group consisted of 30 males and 30 females with an age range of 18 to 62 years. None of the study subjects had any complaints of dysphagia or a history of central nervous system (CNS) or craniofacial abnormalities.

The radiographic studies were conducted in the Voice–Speech–Swallowing Center at UCDMC in accordance with the routine radiographic protocols approved by the institution. Equipment used included a properly collimated Phillips Radiographic/Fluoroscopic unit that provides a 63 kV, 1.2 mA type output for the full field of view mode (9 in. input phosphor diameter) (Phillips Medical Systems North America Company, Shelton, CT). Fluoroscopy studies were recorded on high-quality videotape for playback and analysis using a Sony Model 1380 Videocassette VHS Recorder/Player (Sony Corporation of America, New York, NY). A graphic time display provided by an RCA character generator (RCA, Indianapolis, IN) and JVC AC adapter Model C412 (JVC, Wayne, NJ) was included on the videotape so that timing information at 0.01-s intervals was recorded. The swallow studies were recorded at 30 frames per second. Measures were made during the swallowing of a 1-, a 3-, and a 20-cc liquid bolus (Liquid Barosperse Barium Sulfate Suspension, catalog No. 179312, Lafayette Pharmaceuticals, Inc., Anaheim, CA). Swallowed material was presented to the subject by cup.

The studies from each subject were analyzed without information about the identity of the subject by the primary author. The videotaped images were captured on a Cardio-Loop (Eigen, Inc., Grass Valley, CA) which allowed for image enhancement and manipulation when needed. Timing measures were reported in seconds. A list of timing measurement definitions and corresponding abreviations provided in Table 1.

Table 1 Definitions and abbreviations

The onset of bolus pharyngeal transit (B1) is defined as the first movement of the head of the bolus from a stable or “hold” position that passes the posterior nasal spine and results in all or part of the bolus entering the oropharynx. The posterior nasal spine is located at the end of the hard palate and is a good landmark for the anterior border of the oropharynx. The timing of all subsequent swallowing events is determined relative to the onset of bolus pharyngeal transit.

The completion of bolus pharyngeal transit (BP2) is defined as the moment the tail of the bolus leaves the upper esophageal sphincter and the bolus has fully entered the esophagus. This time corresponds to closure of the upper esophageal sphincter (Pcl). Bolus pharyngeal transit time can be divided into an oro- and a hypopharyngeal phase by the arrival of the bolus in the vallecula (BV). If the bolus bypasses the vallecula, the time when the bolus passes the level of the base of the vallecula is designated as the onset of hypopharyngeal transit. The events included in this study were limited to those that occurred during hypopharyngeal transit.

The hyoid bone moves during a swallow as a result of suprahyoid muscle contraction [2,3,5]. The first superior–anterior movement of the hyoid that results in a swallow is designated as H1. H2 is defined as the time when the hyoid reached its maximum displacement during the swallow.

The outlines of the epiglottis and the arytenoid cartilages are visible against the relative hypodensity of the pharynx in the lateral projection on the X-ray. As a result, it is possible to identify movement of these structures, in particular, when the most superior part of the arytenoid cartilages begins to elevate and again when it makes contact with the down-folding epiglottis [6]. The onset of arytenoid cartilage elevation relative to the start of the swallow (AE-start) and the point of vertical supraglottic closure (when the most superior part of the arytenoid cartilages makes contact with the down-folding epiglottis, designated as AE-close) were therefore measured.

The onset of upper esophageal sphincter (UES) opening relative to the onset of bolus pharyngeal transit was measured. Pop designates the time when the UES opens. This time is identified as the moment the narrowest part of the upper esophagus between C4 and C6 opens. We use this definition of the location of the UES because we feel that the narrowest point of opening is the most functionally significant. It is also very reliably identified on a dynamic swallow study as opposed to a measurement made from the often poorly defined top of the air column in the trachea. The arrival of the head of the bolus at the UES is designated as BP1. PESmax time reflects the time at which the UES has reached its widest opening (in lateral view).

While the UES is open, larynx-to-hyoid approximation occurs, presumably as a result of thyrohyoid muscle activity. This event brings the larynx further anterior and under the tongue base. The timing of this event is designated as HL.

As the bolus is propelled into the upper esophagus, the pharynx is typically completely obliterated by the tongue pushing against the contracting posterior pharyngeal wall. This gesture is crucial to a successful swallow and the timing of it is designated as Pamax time. Incomplete contact between the contracting pharynx and tongue base results in pharyngeal residue after the swallow that may be aspirated [7].

Interobserver reliability among four judges was established on measures made from 15 of the normal swallow studies and was found to be greater than 90% for each measurement parameter reported, except for larynx-to-hyoid approximation (HL) which had a 75% interobserver reliability.

The sequence of various sets of two swallowing events, including bolus transit events and swallowing gestures, was determined by comparing the relative timing of these events during the swallow. In other words, it was determined in what order the two events occurred in each individual swallow study. For example, did event A occur prior to event B, or vice versa? The frequency of each of the two possible sequences was then calculated for the study population of normal subjects by bolus size category. In some individuals, certain measurements could not be made secondary to movement artifact on the swallow study. The number of individuals included in each analysis is reported with the analysis data. Differences in the amount of variability for a given two-event sequence between bolus categories was compared using a chi-squared analysis. To account for multiple chi-squared tests, the alpha was adjusted to 0.007 (0.05/7).

Results

The results of the analysis are reported in general sequence order. Events that occur in proximity to one another were compared with one another. The analysis began with the first events to occur after the bolus had arrived in the vallecula, that is, the closure of supraglottic structures and maximal hyoid bone elevation.

The superior portion of the arytenoid cartilages had usually made contact with the down-folding epiglottis by the time the hyoid reached maximum elevation (AE-close usually occurred prior to H2; see Table 2). Reversal of that timing relationship was occasionally observed with the smaller bolus sizes but was not seen in the larger bolus category. (H2 before AE-close: 1 cc = 15%, 3 cc = 12%, 20 cc = 0%) However, differences in the amount of sequence variability across bolus sizes was not significant with stringent adjustment of alpha (p = 0.0143). Vertical closure of the laryngeal vestibule was previously found to occur earlier relative to the onset of the pharyngeal phase of the swallow with increases in bolus volume while the timing of maximal hyoid bone elevation is not as sensitive to changes in bolus size [1]. That is not to say that hyoid bone movement did not respond to changes in bolus volume. On the contrary, we know from previous studies that the hyoid bone moves farther with larger bolus volumes [8].

Table 2 AE-close before H2

The onset of UES opening also occurred most commonly prior to the time when the hyoid bone reached maximum elevation (Pop usually occurs before H2; see Table 3). In other words, the hyoid had reached maximum elevation after the onset of UES opening. However, with the smaller bolus sizes, that timing relationship is reversed 49% of the time (Pop after H2: 1 cc = 49%, 3 cc = 30%, 20 cc = 3%; Simultaneous Pop and H2: 20 cc only = 3%). Again, the percentage of sequence variability between Pop and H2 was significantly related to the bolus size category (p <0.0001).

Table 3 H2 after Pop

As both the onset of arytenoid cartilage elevation (AE-start) and the onset of UES opening (Pop) generally occurred prior to maximum hyoid elevation (H2), we looked at the relationship of these two events to each other. The onset of arytenoid cartilage elevation always occurred prior to the onset of UES opening (Table 4). Pop always occurred after AE-start, irrespective of the size of the bolus swallowed. Usually, the arytenoid cartilages had also completed closure against the epiglottis prior to the onset UES opening (Pop occurs after AE-close; see Table 5). In about 17% of cases overall, the UES began to open before the supraglottic structures were completely closed (Pop before AE-close: 1 cc = 22%, 3 cc = l2%, 20 cc = l6%). Variability of the timing relationship did not differ significantly with changes in the bolus size category (p = 0.3524).

Table 4 AE-start before Pop
Table 5 AE-close before Pop

Although less common, the UES was found to open (Pop) before the arytenoid cartilages achieved complete closure against the epiglottis (AE-close) in some cases. The location of the bolus in the pharynx at that time and the proximity of the bolus to the supraglottic structures during these events was of interest in terms of airway protection. The analysis of bolus arrival at the UES sphincter (BP1) relative to the timing of sphincter opening (Pop) and the timing of supraglottic closure (AE-close) was done. The bolus did not arrive at the UES and sit there waiting for the UES to open. BP1 never occurred prior to Pop. The UES was already open when the bolus arrived or opened simultaneously with bolus arrival (Table 6). This timing relationship did not change with changes in bolus size.

Table 6 Pop before or with BP1

Such was not the case with vertical supraglottic closure, however. Usually, but not always, the arytenoid cartilages had closed against the epiglottis (AE-close) before the bolus arrived at the sphincter (BP1), but in every bolus category there were some instances where the bolus arrived at the sphincter prior to complete closure of the supraglottic structures. (BP1 before AE-close: l cc = 7%, 3 cc = 12%, 20 cc = 14%; see Table 7) Bolus arrival at the sphincter before complete supraglottic closure was not significantly different with changes in bolus size category.

Table 7 AE-close before BP1

After closure of the supraglottic structures, the laryngeal skeleton was seen to elevate and approximate the hyoid bone (HL). This gesture pulled the larynx under the tongue and further protected the airway. Our analysis found that only rarely did HL occur before AE-close, irrespective of bolus size category (p = 0.3994). The supraglottic structures were closed before the larynx approximated the hyoid in by far the majority of cases. The time between these events became even greater with a larger bolus where the supraglottic structures closed earlier (HL before AE-close: l cc = 2%, 3 cc = 4%, 20 cc = 0; see Table 8).

Table 8 AE-close before HL

Both larynx-to-hyoid approximation (HL) and the onset of UES opening (Pop) generally occurred after closure of the arytenoid cartilages against the epiglottis (AE-close). The proximity of the timing of these events leads one to wonder if they are physiologically related. In other words, does the elevation of the larynx towards the hyoid initiate opening of the UES? The analysis of larynx-to-hyoid elevation versus the onset of UES opening revealed that HL always occurred after Pop (Table 9). The UES was already open when the larynx approximated the hyoid bone, making it unlikely that larynx-to-hyoid approximation resulted in opening of the sphincter. Because larynx-to-hyoid approximation occurs after the sphincter starts to open, it is possible that it is related to maximum opening of the UES. Our analysis, however, revealed that usually PESmax time occurs prior to HL (Table 10). There is variability in this relative timing relationship, which is reversed in some cases, especially with a smaller bolus size (HL prior to PESmax time: 1 cc = 26%, 3 cc = 21%, 20 cc = 10%). Differences in variability between bolus size categories were not significant (p = 0.0854). A direct relationship of HL with PESmax time could not be established by this method of analysis.

Table 9 HL before Pop
Table 10 PESmax time before HL

There are likely multiple factors that lead to maximum distension of the UES including intrabolus pressures. Adequate hyoid bone elevation has been shown to be needed for UES opening [7]. We found that the hyoid usually reached maximal elevation (H2) prior to maximal distension of the UES (PESmax time) (Table 11). Like the majority of hypopharyngeal events, there was variability in that relationship. However, unlike the majority of the timing relationships evaluated, greater variability was found during swallowing of a larger bolus size (p = 0.0003) (PESmax time prior to H2: 1 cc = 7%, 3 cc = 10%, 20 cc = 32%).

Table 11 H2 before PESmax time

Another contributor to maximum distension of the UES may be the timing of maximum constriction of the pharynx, driving the bolus from the pharynx into the esophagus. The comparison of the relative timing of these events revealed that PAmax time always occurred after PESmax time (Table 12). Based on this finding, it follows that the timing of UES opening (Pop) also always occurred prior to maximum pharyngeal constriction (PAmax time).

Table 12 Pop vs. PESmax time vs PAmax time

Both maximum pharyngeal constriction (PAmax time) and larynx-to-hyoid approximation (HL) were found to occur after maximum distension of the UES (PESmax time). The analysis of the relationship of these two events to each other revealed that usually HL happens before PAmax time (Table 13). A reversal of that timing relationship occurred on about 14% of cases, and the amount of variability did not change with changes in bolus size (p = 0.9078) (PAmax time prior HL: 1 cc = 16%, 3 cc = 14%, 20 cc = 12%).

Table 13 PAmax time prior to HL

In summary, certain bolus and gesture timing relationships were found to exist during hypopharyngeal bolus transit that did not vary between subjects or across bolus categories. The onset of arytenoid cartilage elevation (AE-start) always occurred prior to opening of the UES (Pop). The sphincter always opened prior to the arrival of the bolus at the sphincter (BP1). Larynx-to-hyoid approximation (HL) always occurred after the onset of UES opening. Maximum pharyngeal constriction (PAmax time) always occurred after maximal distension of the UES (PESmax time). Some variability in the relative timing of all other event sets analyzed by this study was found.

Given the above findings, the most common sequence of events should have been: AEclose/Pop/BP1/H2/ PESmax time/HL/PAmax time. When we evaluated individual swallowing sequences, we found 7 individuals who demonstrated the sequence when swallowing a 1-cc bolus, 25 individuals who demonstrated that sequence during a 3-cc bolus swallow, and 23 individuals who demonstrated the sequence during a 20-cc bolus swallow. A total of 45/180 (25%) swallows contained that sequence.

Discussion

This study revealed substantial variability in the sequence of events during hypopharyngeal bolus transit. The amount of variability in general was found to be greater during smaller-bolus-size swallows. In only two instances was sequencing variability found to be greater during the larger-bolus swallows. Vertical supraglottic closure (AE-close) typically occurred prior to the arrival of the bolus at the UES (BP1) but a reversal of that sequence occurred more frequently in the largest-bolus category. Maximal UES distension (PESmax time) typically occurred prior to maximum elevation of the hyoid bone (H2) but reversal of that sequence occurred more commonly during the swallow of the large-bolus size. These findings may be the result of faster pharyngeal transit of the bolus during swallowing of a larger-bolus size and the arrival of the bolus at certain locations in the pharynx before gestures could catch up [1]. Generally, it appeared that faster transit of the bolus, such as occurs during a larger-bolus swallow, results in less opportunity for variation and a more “reflexive” pattern of swallow event sequencing.

There are currently two theories relating to how the central nervous system controls the sequence of events during the pharyngeal phase of swallowing [9,10]. The reflex chain theory holds that each motion of the pattern stimulates peripheral receptors whose afferent impulses trigger the next motion. In other words, each gesture in the swallowing sequence triggers the onset of the next gesture. The central pattern generator theory holds that a central neural circuit exists in the brainstem that is capable of generating the pattern of muscular activation without sensory feedback. The central pattern generator responds to some sensory input to “finely tune” the pattern but otherwise can function without sensory feedback. Both of these theories are based on the notion that the pharyngeal phase of deglutition is primarily “reflexive” in nature and occurs as the result of a predictable sequence of neuromuscular activity. There is evidence from human and animal studies to support both theories [9,10,11,12]. However, with the advent of sophisticated brain imaging techniques it has become increasingly clear that deglutition is an extremely complex activity that generally involves input from several higher CNS structures as well as sensory feedback [13]. As a result, it is likely that the sequence of events during the pharyngeal phase of swallowing varies in response to multiple sensory and higher CNS stimuli.

The findings of this study support the possibility of multiple sources of input into swallow sequence control based on the amount of variability identified in normal individuals. Variability around certain events was found to be the norm. Unfortunately, the evaluation of event coordination in individual patients becomes difficult in this situation because any sequence of these events falls within a normal range. Certain other event sequences were preserved across individuals and bolus size categories, however. These few event sequences can be employed in the analysis of patient swallowing coordination and, if found to be reversed in patients, the event sequences will confirm a diagnosis of poor swallowing coordination.

In addition to providing information about normal variability in swallow sequencing, this study provided information about general swallowing physiology. By evaluating the relationship of swallowing events to one another and to bolus transit, this study supports the concept that the onset of arytenoid cartilage elevation and closure against the epiglottis appears to occur by a mechanism that is independent of hyoid elevation and larynx-to-hyoid approximation. The timing of vertical supraglottic closure was more responsive to changes in bolus size than any of the other swallowing gestures.

Further study of event coordination is warranted. Intrasubject variability should be analyzed and will provide information about changes in swallow sequences in the same subject at the same sitting and at different times. Perhaps with further study, the ability to identify subtle swallowing coordination abnormalities in individual patients will emerge.