Introduction

The global population of older adults (≥ 60 years) was 600 million in 2000 and is expected to reach two billion by 2050 [1]. Daily physical activity (DPA) gait analysis, in a natural unsupervised environment outside the lab, has become increasingly popular for the assessment of frailty and fall-risk [2,3,4,5] and has been employed for a variety of purposes, including aiding diagnostics, predicting outcomes, and serving as research endpoints. In addition to quantitative gait measures such as total number of steps and walking duration, DPA gait performance measures, representing the quality of walking, may provide additional information about age-related motor function deficits. Moreover, the amount of daily activities, like the number of steps, can be influenced by several factors other than neuromuscular functionality, such as individual (daily schedule and exercise routines) and environmental (weather, transportation, and living conditions) factors. It can be hypothesized that, gait performance parameters extracted from DPA, may not substantially be affected by these factors. Nevertheless, it is important to explore this hypothesis by assessing the repeatability of DPA gait parameters between different days of measurements.

In the clinic, repeatability of time-domain gait parameters is assessed for several purposes. Previous studies have reported excellent repeatability of time-domain gait parameters like stride-time, stride-length, step-length, and step-width in different scenarios of multiple walking sessions on the same day and walking sessions recorded on two different days [6, 7]. Earlier studies have also examined the repeatability of supervised gait parameters extracted from the three axes of a tri-axial accelerometer, and have reported that the vertical acceleration provides the best repeatability in time-domain gait parameters, such as step-time, step-length, cadence and velocity [8]. While these studies provide insights regarding the reliability of sensor-based gait measures, to the best of our knowledge, no study has examined the repeatability of gait parameters in an unsupervised environment.

In this study, we evaluated the repeatability of DPA gait performance parameters and quantitative measures between two separate days among older adults. The gait performance parameters extracted here are temporal gait parameters (step- and stride-time representing walking speed), time-domain gait variability, frequency-domain gait variability, and gait irregularity. Furthermore, gait variability measures were defined to assess the stride-to-stride fluctuation in walking cycles, since it has been associated with high risk of falling and cognitive impairments in elders [9,10,11]. Also, gait studies conducted in the clinic have shown that neurological and neuromuscular diseases, as well as frailty can influence the predictability of walking cycles, which can be described by gait irregularity measures [12,13,14,15]. In addition to all the above gait performance measures, gait quantitative parameters including total number of steps and total walking duration in 2 days were also studied here for comparison.

We hypothesized that unsupervised DPA gait performance parameters, irrespective of the frailty status, would be more repeatable between days than the sensor-based quantitative walking parameters like number of steps and duration of walking. In addition, we explored how frailty, as a common syndrome among older adults, can affect the DPA repeatability. Frailty is primarily indicated by reduction or impairment of physical function [4, 16], and gait performance parameters extracted from DPA are known to provide a more accurate frailty assessment when compared to quantitative parameters [2]. We hypothesized that with frailty gait performance parameters become more repeatable, due to a decline in physiological reserve and incapability of the neuromuscular system to provide a wide range of motor function execution [17,18,19].

Methods

This is an observational cross-sectional study, performed at the Arizona Center on Aging, Tucson, AZ. Study participants were recruited from primary, secondary, and tertiary health-care settings within our academic network and also from community providers and aging service organizations. DPA was recorded from eligible volunteers for two consecutive days (48 h), and the walking data was processed to study the repeatability of gait performance and gait quantitative parameters.

Eligibility for the study included older adults aged 65 years or older, without severe mobility disorder, and the ability to walk at least 10 meters with or without an assistive device. The exclusion criteria for the participant recruitment included dementia identified by a Mini-Mental State Examination (MMSE) [20] score of < 23 or terminal illness. All the recruited participants signed a written consent according to the principles expressed in the Declaration of Helsinki [21], approved by the Institutional Review Board of the University of Arizona.

Demographic and clinical measures

The MMSE, Mobility-Tiredness Scale [22], Center for Epidemiologic Studies Depression Scale (CES-D) [23], Falls Efficacy Scale-International (FES-I) [24], and Barthel Activity of Daily Living (ADL) Scale [25] were part of the interviewer administered questionnaires. Self-reported history of falls and use of assistive device were included as additional clinical measures.

Frailty was assessed using the five criteria proposed by Fried et al. [16]: self-reported weight loss, exhaustion and low energy expenditure, weakness measure by the grip strength, and slowness measure by the walking test. A score of one point was assigned for each of the five criteria recorded, totaling a score in the range of 0–5. Based on this score, frailty was categorized as non-frail (score 0), pre-frail (score 1–2) and frail (score 3–5).

Sensor-based DPA assessment

DPA was quantified for 48 h using a tri-axial accelerometer sensor (PAMSys, BioSensics Cambridge, MA, USA) attached to a tee-shirt, inserted in a device pocket located at the sternum. PAMSys is a small (5.1 cm × 3 cm × 1.6 cm) and light (24 g) recording system containing inertial sensors [26]. Vertical acceleration data was used for analyses here. A previously validated algorithm was employed to identify the walking bouts (PAMWare, BioSensics Cambridge, MA, USA). Briefly, walking bouts were defined by a minimum of three successive steps [26], where steps were estimated by the detection of an acceleration peak beyond a predefined threshold after filtering (Butterworth filter cut-off frequency of 2.5 Hz) [27]. Gait quantitative outcomes including walking bout times (duration of each walking episodes) and number of steps per walking bout were derived for all the walking bouts longer than three steps. Furthermore, gait performance parameters were extracted by defining continuous walking bouts. Continuous walking bouts, longer than 60 s were used to extract gait performance parameters [2, 28, 29]. Gait performance parameters including time- and frequency-domain gait variability and gait irregularity were extracted from continuous walking bouts that lasted 60 s or longer with no pause longer than 1.7 s between gait cycles [3]. Based on the stride-time duration average and standard deviation previously observed in frail participants, we conservatively selected an allowable pause of 1.7 s between gait cycles. Gait variability and irregularity were assessed for each of the continuous walking bouts [2] as defined below.

Gait variability Two methods were used to assess gait variability: (1) step- and stride-time variability using time-domain [2]; and (2) power spectral density (PSD) using frequency-domain analyses [2, 30, 31]. For each continuous walking bout, step- or stride-time variability was calculated as the coefficient of variation of the series of step- or stride-times. For the frequency-domain analysis, PSD of the acceleration data was calculated using Welch’s averaged modified periodogram method [32] to represent the frequency components of the acceleration signal [33]. A window size of 512 samples and a fast Fourier transform (FFT) length of two-times the next higher power of the window size was used [32]. A 50% overlap was considered between the windows and 0.5–3.0 Hz locomotion band was analyzed [32]. The PSD components extracted from the raw acceleration signal, included: PSD slope (PSD width to the peak) and dominant walking frequency. A larger slope (shorter and wider PSD peak) was used to identify a higher variability in walking.

Gait irregularity We used the Sample Entropy (SampEn) approach to determine gait-irregularity [34,35,36]. Mutual information method [37] was used to calculate the time-delay of the signal, and the average time-delay of all the continuous walks for each participant was used to calculate the SampEn [2]. We computed the sample entropy of the gait signal using embedding dimension m = 3, and tolerance r = 0.2 times the standard deviation of the signal [34,35,36].

Statistical analysis

Statistical analyses were carried out using JMP (Version 11; SAS Institute Inc., Cary, NC, USA) and MATLAB (Version R2019b; MathWorks Inc., Natick, MA, USA); statistical significance was concluded when p < 0.05. Separate analysis of variance (ANOVA) models were performed to compare sociodemographic parameters between the frailty groups. The DPA gait performance measures were grouped between day-1 and day-2 (i.e., the first and the second 24 h). Repeatability (test–retest reliability) of the parameters between the 2 days was analyzed using Intraclass Correlation Coefficient (ICC), calculated using a two-way mixed effects model with an absolute agreement definition [38]. Repeatability was calculated with a null hypothesis of ICC < 0.6 within a 95% confidence interval (CI). To interpret the results, ICC above 0.75 was considered excellent repeatability, values between 0.6 and 0.74 was considered good, 0.4–0.59 was fair, and 0–0.39 was poor repeatability [39]. Bland–Altman plots [40] were used to visualize between-day repeatability of gait performance and quantitative measures, as well as the between-day repeatability between frailty groups (non-frail vs. pre-frail/frail). Calculated values of agreement and reliability from measurement error studies are estimates of parameters, and as such we report estimates with CIs to indicate the uncertainty with which they have been estimated. All outcomes were normalized using logarithmic transformation before Bland–Altman analyses.

Results

Demographic and clinical measures

We collected DPA measurements from 63 participants: 29 non-frail, 29 pre-frail, and 5 frail according to the Fried frailty criteria [16]. Owing to the small sample size of the frail population, we combined pre-frail and frail participants for all analyses. Demographic and clinical characteristics of the participants are presented in Table 1. Pre-frail/frail participants were significantly older than non-frail participants, used more assistive devices, and had higher BMI (p < 0.05). Compared to non-frail participants, pre-frail/frail participants had significantly higher levels of perceived tiredness and fear of falling (p < 0.05).

Table 1 Demographic and clinical characteristics

Between-day repeatability of gait parameters

For all the participants, the gait performance parameters including PSD slope and dominant frequency showed an excellent repeatability with ICC > 0.75 and significant pICC values (pICC< 0.01), while the quantitative measures showed a poor repeatability with ICC < 0.30 and non-significant pICC values (pICC> 0.90, Table 2). The average ICC for gait performance parameters and quantitative measures were 0.72 ± 0.12 and 0.25 ± 0.03, respectively (Table 2).

Table 2 Between-day repeatability of gait performance and quantitative measures

Bland–Altman visualizations of stride-time, PSD-slope, and dominant frequency showed that mean differences of gait performance parameters were close to zero (mean ± SD: 0.00 ± 0.01) with majority of the between-day difference in participants within the upper (mean + 1.96 × SD) and lower (mean–1.96 × SD) limits of agreements (LOA) (Fig. 1), demonstrating high between-day repeatability. On the other hand, Bland–Altman visualization of the total number of steps showed that the mean differences of quantitative measures were not close to zero (mean ± SD: 0.54 ± 0.03) (Fig. 1), revealing poor between-day repeatability. Supplementary Table S1 shows mean of between-day differences and the 95% (upper and lower) LOA of all the parameters listed in Table 2.

Fig. 1
figure 1

Bland–Altman plots for stride-time, PSD slope, dominant frequency, and total number of steps among all participants

Between-day repeatability between frailty groups

Overall, the gait performance parameters showed higher repeatability among pre-frail/frail (average ICC: 0.79 ± 0.19) compared to non-frail participants (average ICC: 0.38 ± 0.24, Table 3). However, the quantitative measures showed poor repeatability in both non-frail (average ICC: 0.27 ± 0.03) and pre-frail/frail (average ICC: 0.14 ± 0.01) participants (Table 3).

Table 3 Between-day repeatability of gait performance and quantitative measures between frailty groups

Discussion

The usefulness of gait performance parameters for the assessment of frailty, as well as fall and neuromuscular diseases (e.g., Parkinson’s disease and low back pain) in a home-based environment has been previously established [2, 3, 41, 42]. However, to the best of our knowledge, this is the first study assessing the repeatability of gait performance parameters between multiple days of DPA recording, especially among frailty groups. As hypothesized, we observed that several sensor-based gait performance parameters show excellent repeatability between the 2 days of measurement.

Advantages of gait performance parameters

Compared to quantitative gait measures, gait performance parameters are more advantageous as they reveal information pertaining the functional performance [2, 3, 28,29,30,31,32,33,34,35,36,37,38,39,40,41]. Previous studies have shown that, gait performance parameters extracted from continuous walks were able to significantly predict frailty with a sensitivity and specificity of 76.8% and 80% [2], while on the same dataset quantitative parameters could not significantly discriminate between frailty groups when adjusted with age [3]. Gait performance measures play a crucial role not only in characterizing gait performance in frailty groups [2, 3], but also to assess fall-risk [29] and motor performance in general [41]. Weiss et al. established that 3-day accelerometer-based gait performance parameters provided a more accurate fall-risk identification compared to performance-based tests such as dynamic gait index, the Berg balance scale, the timed up and go test, and the four square step test [29]. Furthermore, Toosizadeh et al. demonstrated strong discriminatory power in detecting impaired motor function in Parkinson’s disease using in-home DPA measures, especially those related to walking [41].

Repeatability of gait performance parameters

Test–retest reliability or repeatability of gait tests are commonly assessed within clinical settings. In agreement with our findings, results from other supervised studies showed excellent repeatability in gait performance parameters including stride-time, stride-length, step-time, and step-width in between-session (average ICC: 0.92 ± 0.03), within-day (average ICC: 0.97 ± 0.01) and between-day (average ICC: 0.93 ± 0.03) recordings [6, 7]. Overall, higher ICC scores are expected to be achieved in in-clinic supervised studies compared to unsupervised DPA-based studies, as in-clinic studies facilitate a more controlled study environment, whereas DPA-based studies are not controlled. Specifically among older adult participants, Hamacher et al. examined the between-day test–retest reliability of temporal gait variability and observed that repeatability increases in gait variability between days after one familiarization trial has been conducted [43]. Another relevant study performed by Byun et al. used a tri-axial accelerometer for in-clinic acceleration recording, and established excellent repeatability (average ICC: 0.86 ± 0.04) in gait performance parameters including step-time, step-time variability and step asymmetry extracted from the vertical acceleration compared to anteroposterior and mediolateral accelerations [8]. Of note, vertical acceleration in tri-axial accelerometer data contains higher amount of information pertaining to steps and strides associated with walking compared to other directions, and therefore, vertical acceleration has been used for feature extraction and analysis in our study. To the best of our knowledge, currently there are no studies exploring the repeatability of gait performance in a home/community-based environment. We obtained excellent between-day repeatability values in gait performance parameters comparable to other studies with average ICC: 0.82 ± 0.00 (temporal gait parameters), 0.76 ± 0.00 (frequency-domain gait variability) and 0.77 ± 0.10 (gait irregularity).

Gait performance repeatability and frailty

Among older adults, frailty is an increasingly recognized geriatric syndrome, which is associated with decline in function and reserve across multiple physiologic systems, and high vulnerability to adverse health outcomes such as disability, falls, hospitalization, institutionalization, and mortality [4,5,6,7,8,9,10,11,12,13,14,15,16]. Frailty is primarily indicated by reduction or impairment of physical function [4, 16]. Within the current findings, we observed higher repeatability in gait parameters among pre-frail/frail older adults compared to non-frail individuals. One explanation for this observation may be the reduction in physiologic complexity due to frailty [19]. Lipsitz elaborated on the loss of complexity as a pathway to frailty [18], suggesting that loss of complexity in many physiologic mechanisms eventually lead to frailty in old age. Accordingly, since many of the physiologic inputs and the corresponding connectivity are diminished with aging, this would result in a less complex physiological behavior among frail older adults. As a result, low physiologic complexity in the dynamics of a system, especially under basal conditions, leads to a compromised capacity of the system to respond to unexpected stressors [19]. Therefore, attributing to the reduced physiologic complexity and adoption of a conservative strategy to encounter environmental stressors, pre-frail/frail older adults may adapt a lifestyle with routine and repeatable activities [17, 19, 44]. Our findings showing higher gait performance repeatability among pre-frail/frail compared to non-frail older adults are in agreement with the theory of loss of complexity with aging. While promising, further research is required to more systematically measure overall loss of DPA complexity with frailty, using longer duration of natural activity data collection.

Limitations and future direction

There are limitations to consider in the interpretation of current findings. The repeatability in step- and stride-time variabilities were noticeably lower in the non-frail group compared to the pre-frail/frail group. This was because 5 of the 29 non-frail participants had comparatively fewer walking bouts on day-1 than day-2, and also these walking bouts had a high range of gait-variability causing the step- and stride-time variabilities in the non-frail group to be poorly repeatable between the 2 days. However, the repeatability of frequency-domain variability ranged from fair to excellent in both the non-frail and pre-frail/frail groups. We speculate this pattern could be because frequency-domain parameters are more reliable to measure changes in walking compared to temporal approaches. This is in agreement with previous studies, where it was established that frequency-based measures can serve as objective markers of gait variability in real-settings among patients with PD [32]. When the data from these five volunteers was eliminated, the between-day repeatability in the non-frail group improved from poor to fair [stride-variability(%)—day-1: 7.92 ± 2.20, day-2: 8.67 ± 2.31; ICC: 0.44, pICC: 0.86]. In addition, our sample was predominantly women (80.6% female). Reliability depends on the study population, and not just on the measurement errors of the measurement method. Although we did not observe a gender-specific difference in gait performance parameters, there might be a limited generalizability to a population with a more balanced gender composition. Also, using only the accelerometer data, a very poor distinction of stair climbing from level walking have been reported previously [45]. Due to configuration of the motion sensor and the lack of built-in gyroscope system in the device, we were, therefore, not able to distinguish between level walking and stair ascending/descending; the data representing walking in the current study incorporated both level walking and stair climbing together. Finally, in future studies, several other gait performance measures can be extracted and tested for repeatability within DPA assessment, including gait initiation and dynamic stability (Lyapunov exponents).

Summary and conclusions

From our study, it is evident that gait performance parameters including average step- and stride-time and frequency-domain gait variability parameters provided higher test–retest reliability compared to quantitative measures. Furthermore, gait performance parameters showed higher repeatability among pre-frail/frail participants between 2 days compared to non-frail, which may be attributed to a lack of functional capacity among frail individuals for performing more intense and variable physical tasks. As there is an excellent repeatability observed, it may be sufficient to record DPA for a 24-h duration to assess frailty using gait performance parameters in future research.