Introduction

Balance impairments are common in older adults and lead to falls with subsequent injuries, reduced mobility, and decreased quality of life (Burns et al. 2016). In neurodegenerative disorders such as Parkinson’s disease (PD) and normal pressure hydrocephalus (NPH), balance impairment—or postural instability—is a key feature of the disease process and can be defined as an inability to maintain an upright posture in response to perturbations or changes in the environment. Neurologists and neurosurgeons commonly assess postural instability using the pull test (UPDRSPT) (Goetz et al. 2008) or push and release test (Valkovič et al. 2008), while physiatrists and physical therapists may also use the Berg Balance Scale (Berg et al. 1989), Mini-BEST(Yingyongyudha et al. 2016) or a combination of all these assessments. While useful to individual providers, these assessments are based on semi-objective ordinal ratings and are, therefore, subject to variability in both execution and interpretation (Munhoz et al. 2004; Nonnekes et al. 2015; Smith et al. 2016). The variety of testing and subjective scoring can make effective communication between providers and specialties difficult when discussing a particular patient’s condition.

Quantitative biomarkers for postural instability have historically required a large, dedicated setup and are time-consuming, costly, and not widely available to clinicians. Thus, identifying easily implemented biomarkers for the diagnosis, monitoring and therapeutic assessment of postural instability in the clinical setting is a priority (Horak and Mancini 2013). Such a biomarker should be able to accurately communicate a patient’s condition between providers and standardize clinical trial outcomes by negating any dependence on inter-rater reliability. Improvements in wearable technology have led to the emergence of these devices as potential tools in the development of quantitative biomarkers for gait and postural instability (Erb et al. 2020; Dorsey et al. 2020). Metrics from wearable sensors have been developed for almost all aspects of balance including quiet standing (Mancini et al. 2012; Rucco et al. 2018), anticipatory postural adjustments (Mancini et al. 2009), and balance during gait (Zampieri et al. 2010; Weiss et al. 2010, 2013). The major challenges with wearable technology are relating these developed metrics to clinically meaningful results such as fall risk and identifying which metrics are specific to different disease processes. For example, patients with PD will likely have a different pattern of deficits on quantitative wearable metrics compared to patients with NPH or cerebellar ataxia. This will depend on both the underlying disease process and the metrics chosen to evaluate patients.

In this study, we present a method of using kinematic measurements from the pull test in NPH patients to create a quantitative, reliable, and clinically useful measure of postural instability. We have chosen to quantify the UPDRSPT because it is the neurological gold standard and most widely used clinical test available for postural instability. The UPDRSPT can be simply described kinematically as a pull backwards at the shoulders manifested by an increase in the patient’s center of mass (COM) acceleration backwards, followed by a latency period prior to taking a step backwards in recovery. During one pull test trial, the patient’s COM velocity begins at zero, rises to a peak value and falls back to zero as he recovers. We hypothesized that when separated into groups by their pull test score, COM velocity profiles would significantly differ within hydrocephalus patients and that patients would significantly differ in their step length and reaction times by pull test score. We further hypothesized that within a subset of patients with “normal” pull test scores, patient COM velocity profiles would significantly differ from healthy, age-matched controls.

Methods

Participants

Twenty-one NPH patients and 20 age-matched healthy control participants were consecutively prospectively enrolled over a period of 2 years from the Minneapolis VA Health Care System (MVAHCS) and University of Minnesota. Control participants had unimpaired gait and balance and no prior clinical diagnoses of a neurological disorder affecting movement or perception of movement. NPH participants first see a movement disorders neurologist and are referred for neurosurgical consultation if they have clinical symptoms typical of NPH (gait/balance dysfunction, cognitive difficulties + urinary incontinence) along with ventriculomegaly out of proportion to cortical atrophy. They were excluded if they did not have the capacity to consent as identified by the University of San Diego Brief Assessment of Capacity to Consent (UBACC). We collected relevant demographic data from each patient as well as their responses to a number of questionnaires related to their balance and falls. Patients underwent kinematic, neuropsychological and physical therapy assessment prior to placement of a lumbar drain. They then underwent CSF drainage for 3 days and were re-assessed immediately after removal on the morning of the 4th day. The treating neurosurgeon (RM) used these assessments to decide whether to offer treatment with ventriculoperitoneal shunt (VPS) placement. Regardless of the surgical decision, all patients were prospectively followed with continued assessments. This study was approved by the MVAHCS and University of Minnesota Institutional Review Boards, and all participants provided informed consent for participation.

The NPH patients participated in 103 sessions with a mean of 15.1 pull test trials per session, for a total of 1555 trials. The age-matched controls participated in 20 sessions with a mean of 15.0 trials per session for a total of 299 trials. Demographic data can be found in Table 1. There were no statistically significant differences between groups in terms of age, as expected, but there were differences in sex and weight as most of the NPH patients were veterans, and therefore men. Control participants walked faster than NPH patients and had better subjective and semi-objective measures of balance (ABC, UPDRSPT scores, respectively). We did not find any differences in kinematic outcomes based on sex within UPDRSPT groups (Supplemental Fig. 1). Within NPH participants, stride velocity decreased with increasing UPDRSPT score due to decreases in step length (Supplemental Table 1).

Table 1 Participant demographics

Task details

Participants were equipped with a set of 15 inertial measurement units (IMUs; Xsens, Enschede, The Netherlands) recording at 60 Hz during each pull test session (Supplemental Fig. 2). Body limb segments for each patient were measured at their baseline testing session. The pull test was executed for each patient by a trained clinical examiner. The examiner followed the instructions on the MDS-UPDRS form for conducting the pull test (Goetz et al. 2008) and conducted between 10 and 20 pull test trials for each patient. The examiner used clinical discretion to determine the force of the induced perturbation during the recorded trials, but they were instructed to use a variety of intensities throughout the trials as able. The first trial after the instructional trial was scored in the standard manner on the MDS-UPDRS scale. This is scored as “0—normal”: recovers with 1–2 steps; “1—slight”: 3–5 steps but recovers unaided; “2—mild”: More than 5 steps but recovers unaided; “3—moderate”: stands safely but with absence of postural response, falls if not caught by examiner; “4—severe”: very unstable, tends to lose balance spontaneously or with just a gentle pull on the shoulders. Although there were 10–20 trials during each session, the patient’s session was scored solely on the first trial after the instructional trial as required by the MDS-UPDRS scale (Goetz et al. 2008).

Data extraction

A three-dimensional biomechanical model is created for each patient based on limb segment inputs and sensor orientation (Schepers et al. 2018). Center of mass (COM) and foot position data were exported from the recorded motion capture analysis file and imported and analyzed within Igor Pro 6.00 (Wavemetrics, Oregon, USA) to calculate velocity and acceleration for the COM, feet and other body segments. Custom functions were then used to identify the relevant points of interest (Supplemental Methods, Supplemental Fig. 1). Once relevant points were extracted from the trials, they were exported into R (v 3.6.2) (R Core Team 2018) and further analyzed. Plots were created using the R package “ggplot2.” Supplemental Table 2 contains a description and interpretation of the relevant kinematic variables while Supplemental Fig. 3 demonstrates the relevant points of interest for each individual trial (McGovern et al. 2020).

Statistical analysis

The purpose of this study was to identify kinematic outcomes specific to the scored UPDRSPT group and to look for differences between patients scored as “normal” or “0” on the MDS-UPDRS scale and age-matched controls also scored as “0.” As a result, we did not group NPH patients by treatment status, visit or any other surgically related variable as the only a priori factors we were interested in were the UPDRSPT score and disease vs. control. Therefore, once the relevant kinematic points were identified and variables calculated as described above, we grouped patients by UPDRSPT score and performed the statistical testing described below. We also took a subset of NPH patients scored as “0” and compared them to age-matched controls using the same set of statistical testing.

Statistically significant differences in center of mass velocity (VCOM) plots were considered to be non-overlapping 95% confidence intervals of timeframes 50 ms or greater. Linear models were created to analyze the interaction between step length and pull intensity as well as reaction time and pull intensity for each UPDRSPT group. To test for an overall effect, we first performed ANCOVA. If the effect was significant, we then compared differences between the slope of each of these groups using the estimated marginal means of linear trends (package “emmeans” in R) with a Tukey HSD correction. To compare the y-intercept of each of these groups, we used the estimated marginal means of the intercept values, again using a Tukey HSD correction. To assess whether there was a learning effect associated with repeated pull test measurements, we included trial number and its interaction with pull intensity in the linear models described above. Trial number did not show any statistically significant effects on step length or reaction time and so it was not considered in the analyses (Supplemental Table 3). For standardized perturbations, differences in step length, reaction time or pull intensity were evaluated with two-sided t tests. p values less than 0.05 were considered statistically significant.

Results

NPH patient kinematic profile

V COM differentiates between UPDRSPT scores within NPH patients

We first sought to describe an objective, kinematic profile for each pull test trial using the patient’s center of mass velocity (VCOM). In its simplest terms, each pull test trial can be described by the patient’s VCOM as it begins at zero when the patient is at rest, rises as the patient is pulled backwards, continues to rise as the patient steps backwards and then declines and approaches zero as the patient recovers from the perturbation (middle panel, Supplemental Fig. 3). When we stratified each patient session by UPDRSPT score and averaged VCOM for all trials, we found statistically significant differences in VCOM between all groups of patients within the first 500 ms after pull onset. Each group’s VCOM diverged within the first 300 ms of pull test initiation as increasing UPDRSPT score correlated with decreases in peak VCOM values (Fig. 1).

Fig. 1
figure 1

Kinematic pull test outcomes in NPH patients. Mean VCOM vs. time grouped by UPDRSPT score

We considered, however, that there are both clinician and patient-related factors that could lead to the kinematic differences seen between groups. First, differences in how hard patients in each group were pulled could lead to some of these differences as harder pulls would lead to higher peak VCOM values. Figure 2 shows that patients scored as “0” (198.8 + 66 cm/s) or “1” (202.8 + 74 cm/s) were pulled at similar intensity to each other overall but were pulled harder on average than patients scored as “2” (161.6 + 69 cm/s; 0 vs. 2, p < 0.001, 1 vs. 2, p < 0.001 Tukey HSD test) or “3” (137.3 + 56 cm/s; 0 vs. 3, p < 0.001, 1 vs. 3, p < 0.001, Tukey HSD test).

Fig. 2
figure 2

Mean pull intensity grouped by UPDRSPT score. Shaded area represents the 33rd to 66th percentile of all pulls

NPH patients’ response to a “Standardized Perturbation”

Therefore, we next restricted our analysis to a narrow range of pull intensities (33rd to 66th percentile, shaded area in Fig. 2), as measured by peak ACOM prior to step onset (see Supplemental Methods and Supplemental Table 2 for details). Since ACOM is measured solely prior to step onset, it should purely reflect the kinematic effect of the pull on the COM. When we examined the pull intensity of each group after restricting the data, we saw that our intended effect occurred with almost identical mean pull intensities (Supplemental Fig. 4) between each group.

Within the “standardized” perturbations, a very similar pattern of VCOM outcomes emerged for all groups with a few key differences when compared to all perturbations. First, because there were fewer trials included, all groups demonstrated wider confidence intervals with more overlap between groups (Fig. 3). Patients scored as “1”, “2” and “3” were all statistically indistinguishable within the first 300–400 ms until the patients scored as “3” diverged at 300 ms to a lower peak VCOM value. Patients scored as “1” and “2” had very similar VCOM responses, only separating from each other at peak VCOM values at 500 ms. Statistically significant decreases in peak VCOM remained associated with increasing UPDRSPT scores for all groups. Since statistically significant differences remained between patient groups even after standardizing the perturbation, we considered that patient-related factors must also be important in their distinct outcomes. As VCOM of all patients except those considered “normal” (“0”) were all quite similar within the first 350 ms of the “standardized” perturbation, we hypothesized that reaction time (time from pull onset to step initiation; Supplemental Fig. 3) may be faster for the patients scored as “0” but similar for all other groups. Since VCOM distinguished between all groups at 350–650 ms, we also hypothesized that the patient’s initial step length backwards would differ between all groups as this was the when the first step backwards was typically taken (Supplemental Fig. 3).

Fig. 3
figure 3

“Standardized” kinematic pull test outcomes in NPH patients. Mean VCOM vs. time for a “standardized” perturbation grouped by UPDRSPT score

Reaction time of NPH patients in a “Standardized” perturbation

To examine this, we first looked at reaction time over the standardized range of perturbations (Fig. 4A). Our original hypothesis was confirmed in that reaction time was faster in patients scored as “0” (227 ms + 74 ms) compared to all other groups. However, reaction time was also faster in patients scored as “1” (297 ms + 130 ms) compared to the “2” (380 ms + 180 ms) and “3” (358 ms + 170 ms) groups despite similar appearing VCOM in the initial 400 ms time period between these three groups. We then restricted VCOM exclusively to the time from step onset until the end of the trial to eliminate the effect of the patient’s reaction time on VCOM (Fig. 4B). All groups started with a nearly identical VCOM, indicating that the initial divergence seen between “0” and all other groups were likely related to differences in reaction time. From the initial step onset, VCOM quickly diverged within the first 300–400 ms (approximately the duration of the first step) and then converged again at the end of the second step at 600–700 ms. Statistically significant differences between similarly scored groups (i.e., “0” vs “1”, “1” vs “2”) occurred at or near peak VCOM values while profiles for dissimilar groups (i.e., “0” vs. “2”, “1” vs. “3”) remained easily differentiated throughout the duration of the first step. This indicated to us that differences in the first step backwards were likely responsible for this subsequent divergence in VCOM.

Fig. 4
figure 4

Reactive postural response to a “standardized” perturbation of NPH patients. A Reaction time to a standard perturbation grouped by UPDRSPT score. B Mean VCOM vs. time for a standardized perturbation starting at the beginning of the first step, grouped by UPDRSPT score

Initial step length of NPH patients in a “Standardized” perturbation

We, therefore, looked at the first step length over the standardized range of perturbations. Step length decreased with increasing UPDRSPT score with statistically significant differences between each group (Fig. 5A). When we restricted VCOM exclusively to the time from the end of the first step until the end of the trial to eliminate the effect of the patient’s first step, we saw no further differences between groups (Fig. 5B). This indicated to us that the vast majority of differences in VCOM between groups for a “standardized” perturbation was the result of reaction time and first step differences.

Fig. 5
figure 5

Reactive postural response to a “standardized” perturbation of NPH patients. A Initial step length to a standard perturbation grouped by UPDRSPT score. B Mean VCOM vs. time for a standardized perturbation starting at the end of the first step, grouped by UPDRSPT score

While a standardized perturbation provides insight into how patients with varying postural instability respond differently, in the real-world setting, patients frequently experience many different unexpected perturbations of a variety of intensities. To simulate this, we pulled patients backwards at a variety of intensities and examined the same parameters discussed above to examine the patients’ responses throughout the entirety of pull intensities.

Reaction time and step length scaling to perturbation intensity

Given the differences in reaction time and step length between groups for a standardized perturbation, we hypothesized that there may be differences in the relationship of reaction time and/or step length to pull intensity between groups as well. When examining reaction time, all groups except patients scored as “3” could scale their reaction time to increases in pull intensity (Fig. 6A). Thus, patients scored as “0”, “1”, and “2” all demonstrated quicker reaction times for increasing pull intensities (slope < 0) while patients scored as “3” demonstrated no significant change in reaction time with increasing pull intensity (slope = 0). Groups were mostly statistically indistinguishable when comparing reaction time slope and intercept values with the sole exception of the comparison between patients scored as “2” and all other groups (Supplemental Table 4). In general, the relationship between reaction time and pull intensity did not differentiate groups as well as the relationship between step length and pull intensity.

Fig. 6
figure 6

Reactive postural response scaling of NPH patients. A Reaction time as a function of pull intensity grouped by UPDRSPT score. B Initial step length as a function of pull intensity grouped by UPDRSPT score

Similar to reaction time, all groups except patients classified as “3” increased their initial step length for increases in pull intensity (non-zero positive slope, Fig. 6B, Supplemental Table 5). Groups could be stratified according to both their overall step length for a given pull intensity (y-intercept) and their ability to scale the initial step length to pull intensity (slope). Patients scored as “0” had the highest intercept and slope values with statistically significantly larger slope than all other patient groups and statistically significant larger intercept values than all other groups except “1” patients. Patients scored as “1” and “2” had similar slope values but could be statistically distinguished by their y-intercepts (smaller for “2” patients). Patients scored as “3” had similar intercept values to the other groups, but could be easily statistically differentiated by their slope values as they were the only group unable to scale their step length to increases in pull intensity (i.e., slope = 0).

Kinematic outcomes in “Normal” NPH patients and age-matched control participants

We wanted to compare whether NPH patients with a “normal” score on the UPDRSPT scale could be differentiated from age-matched control participants using VCOM. Overall, both NPH patients and age-matched controls had very similar VCOM profiles with control participants diverging from NPH patients at approximately 180 ms and reaching a slightly higher peak VCOM (59.1 + 2 vs 54.6 + 2.1 cm/s) with slightly later peak VCOM onset (533 vs. 500 ms; Fig. 7A). Control participants were pulled harder on average with mean peak ACOM values statistically significantly higher (268.4 + 83 cm/s2 vs. 198.8 + 66 cm/s2, p < 0.001 Tukey HSD test; Fig. 7B). To normalize for pull intensity, we next examined only responses to a “standardized” perturbation in a similar process to that described above for NPH patients. Control and “normal” NPH patients demonstrated an almost identical VCOM response to a “standardized” perturbation with no statistical differences throughout the entire response (Fig. 7C). There were no significant differences in either step length (37.9 + 7 vs. 37.2 + 12 cm, control vs. “normal” NPH, p = 0.62) or reaction time (202 + 50 vs. 213 + 90 ms, control vs. “normal” NPH, p = 0.31) responses to the “standardized” perturbation between NPH patients and controls (Fig. 7D, E). This indicated to us that the small differences seen in VCOM were likely due to the differences in pull intensity as controls were pulled harder.

Fig. 7
figure 7

Comparison of reactive postural response in control and “normal” NPH patients. A Mean VCOM vs. time for controls and “normal” NPH patients. B Mean perturbation pull intensity for controls vs. “normal” NPH patients. C Mean VCOM vs. time for controls and “normal” NPH patients for a “standardized” perturbation. D Initial step length and E reaction time responses to a “standardized” perturbation for controls and “normal” NPH patients

Reaction time and step length scaling distinguishes NPH patients from controls

Interestingly, although control and “normal” NPH patients had nearly identical kinematic outcomes in response to a standardized perturbation, their responses to the entire pull intensity range differed significantly (Fig. 8). Control participants had faster reaction times for a given pull intensity (lower y-intercept, 244 + 14 ms vs 323 + 21 ms, p = 0.006, Tukey HSD; Fig. 8A) but demonstrated less ability to scale their reaction time to pull intensity (lower slope − 1.33 × 10–4 vs. − 4.6 × 10–4, p = 0.006, Tukey HSD). Similarly, for step length, control participants demonstrated longer step lengths for a given pull intensity (increased y-intercept, 25.6 + 1.7 cm vs. 17.4 + 2.2 cm, p = 0.009) but less ability to scale their step length to a given pull intensity (lower slope; 0.049 + 0.01 vs. 0.083 + 0.009, p = 0.009).

Fig. 8
figure 8

Reactive postural response scaling of control and “normal” NPH patients. A Reaction time as a function of pull intensity. B Initial step length as a function of pull intensity

Discussion

We have described a kinematic method of measuring the UPDRSPT in an effort to establish a quantitative, clinically meaningful outcome measure that can be reliably used in the clinical setting and further developed as a biomarker of postural instability. The simplest description of the pull test’s outcome, the patient’s VCOM throughout the duration of the test, can reliably differentiate between UPDRSPT scores within NPH patients (Fig. 1). This indicated to us that VCOM discriminated between clinically meaningful criteria.

When limited to a narrow range of perturbations to approximate a standardized pull, differences between groups are reduced (Fig. 3), indicating that some of the differences in VCOM profiles may be attributed to how hard patients are pulled. Indeed, this is typically seen as one of the disadvantages of using the UPDRSPT in the clinical setting as it is difficult to standardize both between patients and providers leading to poor inter-rater reliability (Munhoz et al. 2004; Munhoz and Teive 2014). As we purposefully pulled at a variety of intensities, patients and controls with better balance would likely be pulled harder on average than those with postural instability. In fact, patients scored as “0” or “1” were—on average—pulled harder than those scored as “2” or “3” (Fig. 2), although there was significant overlap between the groups. This is likely reflective of better postural control in patients with lower scores as they were able to tolerate higher intensity pulls. Some patients actually demonstrated increases in mean pull intensity as their balance improved post-operatively (Supplemental Fig. 5). Thus, tracking this metric over time may be investigated in the future as an indicator of clinical improvement or worsening. Our use of “standardized” pulls also demonstrates that this process could be used to examine different patients’ responses to a similar perturbation.

In our experience, most providers tend to pull quite lightly, perhaps due to concern over causing the patient to fall backwards. This will lead to overestimation of the patient’s performance as even patients with poor balance can frequently recover from a light pull in a few steps. Performing the test with the examiner’s back against a wall can mitigate this concern and allow the examiner to put the patient through the entire range of pull intensities although this does require an examiner comfortable and skilled in the technique. Since changes in pull intensity are associated with differences in both patients and controls’ postural response, pull intensity needs to be quantified in some manner. Quantifying differences in pull intensity between providers could also serve as a possible mechanism to standardize evaluations in the future if desired.

While trying to standardize a perturbation has obvious advantages in comparing reactive postural stability between patients, our data demonstrate the value in purposefully inducing variability into the pull test. Patients’ responses vary according to the intensity of the perturbation, allowing the clinician to characterize the entirety of the patient’s response (Figs. 6, 8). While clinicians often try to “standardize” their perturbation within and between patients, from a practical perspective, this task is impossible (especially when patients are seen by multiple providers) (Munhoz et al. 2004). Even if the pull test (or any test of postural instability) could be standardized, our data demonstrate that a patient’s response at one standardized perturbation cannot necessarily be used as an approximation for the response at other perturbations without knowing how their response varies. In addition, depending where along the continuum of pull intensity the test was standardized, the same patient’s response could be considered adequate (not falling for a low intensity perturbation) or inadequate (falling after a high intensity perturbation). This would not change their underlying postural response, only the arbitrary manner in which we have decided to categorize them. For example, while all patient groups demonstrate similar responses in the lower range of pull intensities, they vary considerably for high intensity pulls. In addition, “normal” NPH patients and control participants looked identical in the middle range of standardized perturbations but their responses are statistically significantly different in the lower or higher ranges of pull intensity. Without testing the full range of pull intensities, it would be impossible to distinguish between “normal” NPH patients and controls. As a result, quantifying the inherent variability present in the test is much more likely to be fruitful in terms of both understanding and classifying postural instability in movement disorder patients.

While the gait of NPH patients has been investigated in detail (Stolze 2001), postural instability has typically not been very well studied and its underlying mechanisms are poorly understood. The neural control of postural stability depends on integration of somatosensory, vestibular, and visual inputs and maintaining equilibrium requires coordinating these systems to produce motor strategies that maintain one’s COM in a stable range to prevent falls (Horak 2006). Deficits in any of these systems impair coordination and the ability to maintain an upright posture. NPH patients have a larger static sway area and higher backwards directed COM velocity during upright stance (Blomsterwall et al. 2000). Some studies have suggested that NPH patients rely less on visual input than healthy controls (Bäcklund et al. 2017), but others have shown typical improvements in balance with eyes open vs. closed (Lundin et al. 2013). NPH patients whose motor performance improves after VPS demonstrate bilateral increases in supplementary motor area (SMA) activation (Lenfeldt et al. 2008), an area known to be important for postural control planning and execution (Slobounov et al. 2005; Richard et al. 2017; Fujimoto et al. 2017) though this study used a simple measure of finger tapping to measure motor performance. Thus, investigating the role of SMA in the pathogenesis of postural instability in NPH represents a potential area for future mechanistic studies.

No dynamic studies have examined NPH patients’ postural step response in detail prior to this study. When examining their underlying postural responses by UPDRSPT scores, NPH patient groups differed in both their ability to scale step length to increasing pull intensity and their overall step length for a given intensity (Fig. 6B). These differences were much more robust than differences in reaction time, signifying that planning and execution of the initial step backwards is likely the most important kinematic parameter for determining postural control and for clinically differentiating between NPH patients. Studies examining the postural step response using standardized perturbations in PD patients have generally shown that PD patients take smaller steps than healthy subjects, (King et al. 2010; Smulders et al. 2014; Di Giulio et al. 2016; Schlenstedt et al. 2017) with slower reaction times that may be due to an increase in the number (King et al. 2010) and size (Schlenstedt et al. 2017) of anticipatory postural adjustments prior to stepping. This suggests that the kinematic deficits captured within these parameters are not necessarily disease specific. With their step length kinematic deficits now well-defined, there is much mechanistic work in future studies to determine whether these deficits are due to NPH affecting visual, vestibular, somatosensory systems or some combination of all.

Age-matched controls and NPH patients who scored “0” on the UPDRSPT had identical kinematic responses to a “standardized” perturbation (Fig. 7), but had differing step length and reaction time scaling responses when examining the entirety of the pull intensity spectrum (Fig. 8). Interestingly, NPH patients with normal UPDRSPT scores demonstrated improved ability to scale their initial step length and reaction time to increasing pull intensity as compared to controls (Fig. 8). This kinematic response is typically seen in younger control subjects with increasing age reducing the ability to scale step size or reaction time (McGovern et al. 2020). As all these patients had surgical intervention, this suggests that surgery may fundamentally impact the reactive postural response in NPH patients. Further studies that specifically compare age- and UPDRSPT-matched controls to movement disorder patients with varying disease processes (e.g., PD and NPH) and interventions such as medication or surgery will need to be performed to identify age-related, disease-specific and interventional phenomenon.

This study has limitations related to its sample size, kinematic approach, technology and patient selection. Given the small number of patients and controls in this study, it is possible that the study is underpowered and, therefore, may be unable to detect more subtle differences between groups, particularly given the heterogeneity of patients with NPH. Kinematics is focused on the end result of the body’s motion in space, and therefore, it gives limited insight into the underlying control mechanisms producing that result. To gain insight into the involved motor control pathways, kinematics need to be combined with other techniques such as intracranial local field potential recordings, EMG, EEG, and/or fMRI. While easily incorporated into our clinical visits, our motion capture system uses 15 IMUs and is expensive. Future work needs to be focused on reducing the relevant set of sensors to 5 or fewer that can be cheaply purchased. With regards to patient selection, this study may be limited in its generalizability as we only included patients with NPH. It is possible that patients with Parkinson’s disease will demonstrate different kinematic patterns based on their UPDRSPT score. In addition, as our NPH patients consisted mostly of veterans, almost all our patients were men, meaning that our results need to be confirmed in more women.

The strengths of this approach lie in its practical advantage of incorporating the most widely used clinical test of postural instability and its analysis of the patient’s response throughout a range of perturbation intensities. Therefore, as a bridge between the clinic and laboratory, an easily adopted quantitative UPDRSPT should have significantly more impact on diagnosis, monitoring, and clinical trial assessment than a test that can only be applied in certain laboratories or less widely applicable settings.