Introduction

The elderly population (people above the age of 65 years) is steadily growing in the USA. According to the US Census Bureau, the population of people above the age of 65 years is expected to more than double by 2050 [1]. The expansion in the elderly population is expected to increase the number of hospitalizations by 67% [2]. Chronic diseases are increasingly prevalent in the elderly [3]. One of the main risk factors for chronic disease is physical inactivity (in addition to tobacco alcohol use, poor diet) [4]. Since physical inactivity is a modifiable factor, there has been an increasing interest in exploring strategies to increase physical activity (PA) in the elderly, with the aim of reducing the burden of chronic disease.

Wearable sensor technology has emerged as an important tool that can objectively measure PA and through feedback, increase PA in the elderly [5•, 6•]. The purpose of this review is to provide an overview of the importance of PA in the elderly, examine models of PA, and provide a review of recent studies on the use of wearable devices to measure PA in the elderly.

Importance of PA in the Elderly

There is a wealth of data on the importance of health status as a marker of quality of life in the elderly. PA predicts health status in the elderly in general, and mobility limitations, in particular [7]. This is particularly true for the oldest elderly, in whom reduced levels of PA and mobility is an important predictor of reduced quality of life [8, 9]. Some authors suggest that mobility limitations are part of normal aging [10]. However, others suggest that reduced PA and the resulting reduced overall energy expenditure is associated with progression of brain atrophy, particularly in the frontal lobes, which may explain the basis for impaired mobility [11].

Impaired mobility and reduced PA increases the risk of dependence and disability in activities of daily living [12]. Among elders with impaired mobility, there was an increased risk for inflammatory disease and increased risk for death [12]. These findings have been replicated in population-based studies [13]. Reduced PA increases the risk of falls [14], and risk for institutionalization [15].

Models of Physical Activity in Aging

It is universally accepted that PA and mobility reduce with aging. Several models have been proposed to explain changes in PA and mobility seen with aging. Models of PA describe the ability to perform specific physical activities, the timing and intensity of PA, or the spatial extent of PA. Wearable devices, for the most part, have been designed to measure the timing and intensity of PA, though recently complex systems of wearable devices have begun to measure how well people perform specific physical activities.

The most commonly used model of PA is the WHO model on International Classification of Function, Disability and Health [16]. The WHO model describes the ability to perform PA and its components. This model identifies determinants of mobility disability, including age, sensory, physical, cognitive, or psychosocial impairments, and environmental or financial barriers. PA is seen as an important modifier of mobility disability in the elderly [16]. With aging, several abilities are known to decline, for instance, impairments in motor, cognitive, and sensory systems result in reduction in mobility and overall PA. Environmental barriers may also result in limited mobility [17]. In the past few years, complex systems of wearable devices have been developed to measure an individual’s ability to perform PA (see Table 1) [54, 61, 65, 66]. Some of these devices (such as the APDM® and Dynaport McRoberts®) allow assessment of PA included in standardized clinical assessments such as the Timed Up and Go Test and the 6-min walk test.

Table 1 The most commonly reported sensors in each category

Changes in PA with aging have been explained using a model of energy expenditure (EE) [67•]. According to this model, total energy expenditure (TEE) is comprised of four major components including resting energy expenditure (REE), thermic effect of feeding (TEF), and physical activity energy expenditure (PAEE, which is further divided into non-exercise activity thermogenesis (NEAT) and volitional exercise thermogenesis (VET)). Total energy expenditure (TEE) describes an inverted U shaped function with age: beginning in late middle age TEE decreases with each decade [68]. Resting energy expenditure constitutes 60–80% of TEE and is responsible for organismal homeostasis. With aging, resting EE (dotted line in Fig. 1) decreases and is closely linked with reduction in whole body fat-free mass and reduction in PA [69]. PAEE is highly variable from one individual to the next, and is much less dependent on whole body fat-free mass. With advanced age, decreases in PAEE (dashed line in Fig. 1) are largely due to reduction in the amount of PA and intensity of movements such as locomotion [70]. Reduction in TEE (solid line in Fig. 1) may explain the fact that older adults, on average, reduce the amount of moderate-to-vigorous PA. Thus, age-related reduction in time and intensity of PA has an influence on reduction in PAEE. The change in EE with age is shown in Fig. 1.

Fig. 1
figure 1

Change in energy expenditure and spatial extent of physical activity with age

In addition to the ability to perform PA, and the timing and intensity of activity, mobility also consists of the spatial extent of travel within and outside the home environment. One approach to measuring the spatial extent of PA is through the concept of life-space, defined as the area in which a person moved in a given period of time [71•]. In this model, space is conceptualized in expanding concentric zones beginning with the bedroom, and extending to the residence, neighborhood, and town and beyond. Change in life-space in the context of changing energy expenditure with age is shown in Fig. 1. The life-space assessment provides an index of the magnitude of space in which a person habitually lives [71•]. The life-space questionnaire assesses the zone in which a person habitually moved within the past month, the frequency of that movement and whether assistance was required [72]. Given that the life-space assessment is highly correlated with functional independence in the elderly, its inclusion in routine clinical assessment is meaningful. Presently, there are no wearable devices that measure the spatial extent of PA. Devices can be designed to provide a remote assessment of life-space mobility through a combination of GPS and inertial sensors to define the extent of mobility from a defined “home” position such as the bedroom, since it constitutes the innermost zone in the assessment (Fig. 1).

Measurement of Physical Activity in the Elderly

  1. 1.

    Subjective Measurement: The most common method to assess PA in the elderly has been through self-administered or investigator-administered questionnaires [73]. Questionnaires have the obvious advantage of being administered to large numbers of subjects at relatively little cost, which is important for large-scale epidemiological studies [74]. Questionnaires are able to capture the breadth of PA over a period of time (such as a week or a month). Despite these benefits, PA questionnaires have well-documented limitations including recall bias, particularly in older adults. In addition, questionnaires typically assess leisure time PA, which does not include functional activities of daily living. This is particularly important since older adults spend a majority of time in light intensity activities that are likely not captured by questionnaires [74]. Moreover, most self-report questionnaires have not been validated for distinct cultural and ethnic groups [73]. Finally, subjective questionnaires of PA overestimate estimate activity levels. A comprehensive study measured PA levels with a questionnaire and a wearable device: in adults between the ages of 60 and 69 years, time spent in moderate physical activities was 348 min per week and time spent in vigorous activities was 52.9 min per week [75]. However, objectively measured moderate PA was 32.7 min per week and vigorous PA was 1.4 min per week [75]. These limitations underscore the importance of objective measurement of PA.

  2. 2.

    Objective Measurement: Several wearable sensors have been developed to objectively measure PA in free-living conditions. Depending on their type and complexity, sensors are able to provide information on number of steps, PA intensity, duration and type, energy expenditure during PA, heart rate, kinematic measures (joint excursions), and quantitative measurement of gait (speed, step length, cadence, etc.). Some complex systems have been designed to instrument standardized clinical assessments of balance and mobility. The major types of sensors that measure PA are listed in Table 1 (from simple to complex):

  1. a)

    Pedometers: Pedometers are one of the simplest devices for the measurement of PA. Pedometers are lightweight, portable, and inexpensive devices that are designed to measure the number of steps, though some pedometers also estimate distance traveled and energy expenditure [76]. The recommended placement for most pedometers is on the waist (proximity to center of mass). Pedometers are relatively inexpensive, which enhances their applicability.

The simplest pedometers (Yamax Digiwalker for example) have a spring-loaded lever arm that moves in response to vertical movement of the body center of mass [18,19,20]. Some pedometers (such as the Omron HJ) use a magnetic switch mechanism to count steps [23]. The third major type of pedometer uses accelerometers with a horizontal beam and a piezoelectric crystal to determine steps based on the number of zero crossings of the acceleration curve over time. New pedometers use microelectromechanical (MEMS) technology and algorithms to process the signal to calculate steps. The number of steps per day is used as a criterion for classifying the amount of PA (sedentary to active). However, these criteria were developed for younger people without disability or chronic illness [77]. Applying these criteria to older individuals may underestimate PA during activities of daily living (which accounts for a larger percentage of PA in the elderly).

The appeal of pedometers is that they are inexpensive, and provide immediate feedback to subjects, which may enhance motivation to engage in PA. However, pedometers have several limitations: they are inaccurate at slow walking speeds. Since most pedometers do not have an internal clock, it is not possible to compute the time spent in each PA. Finally, pedometers do not record the intensity of vertical displacement at each step. Thus the number of steps taken during running and walking the same distance will be the same, despite the differences in energy expenditure between walking and running [78]. Given these limitations, pedometers may not be suitable for use in research studies. They may be appropriate for clinical use to record the number of steps in individuals who walk at a minimum speed of 0.9 m/s.

  1. b)

    Accelerometers: Accelerometers, like pedometers, are small, non-invasive, lightweight, and portable sensors. They measure acceleration in gravitational units, either along one axis (uni-axial), two axes (bi-axial), or along three axes (tri-axial). Accelerometers are capable of recording data over extended periods of time.

Most accelerometers contain an integrated chip with a mass placed on a piezoelectric crystal. Vertical accelerations compress the crystal, which generates a voltage proportional to the magnitude of acceleration [79•]. The acceleration data are then calibrated against known criterion measures such as energy expenditure, metabolic equivalents (METs), or oxygen consumption [79•]. Given the moderate-to-strong correlation between accelerometer counts and oxygen consumption, acceleration counts have been used to categorize PA as sedentary, moderate, or intense based on published thresholds [80].

  1. b.1)

    Uni-axial and bi-axial accelerometers: All uni-axial and bi-axial accelerometers measure step count (see Table 1 section ii), and most of them also provide information on either energy expenditure (such as the Caltrac) or intensity of PA (such as the Actillume and Kenz Lifecorder). The Actillume sensor is unique since it provides information on sleep-wake time by measuring ambient light, which may be very useful for older people with sleep difficulty. As seen in Table 1, most uni-axial and bi-axial accelerometers are fairly accurate at step counts but not as accurate for computing intensity of PA and energy expenditure. Accelerometers (such as New Lifestyles and Tractivity) are not accurate at slow walking speeds or as accurate as tri-axial accelerometers. The stepwatch activity monitor is the most accurate at step counts at a variety of speeds [32, 33]. In addition, the Stw?pwatch activity monitor is waterproof which may be important if people forget to take the sensor off while bathing. Most accelerometers are capable of prolonged recording (7 days or more) which is often an important consideration for research studies.

Limitations

One challenge is that algorithms for processing accelerometer data are often proprietary so end-users do not have access to the criteria by which PA is classified. In some cases, sensors enable access to the raw acceleration data, but these data requires programming in order to compute magnitude and intensity of PA. Most accelerometers are not accurate at identifying light PA, and none of them is able to measure non-ambulatory activities such as cycling and weight lifting [79•].

  1. b.2)

    Tri-axial accelerometers: The most common type of accelerometer used to measure PA is the tri-axial accelerometer, listed in Table 1, Section iii. These sensors measure acceleration in three planes, which enable a more accurate computation of movement than sensors with uni-axial accelerometers [81]. While most tri-axial accelerometers are small and lightweight, the TriTac sensor is much larger, resulting in reduced subject adherence [22]. In fact, the TriTac sensor was not rated well by elderly subjects or nursing staff in terms of comfort, ease of application, size, weight, and appearance [22]. None of the accelerometers provide immediate feedback to the subject, which makes them ideal for use in research studies. This may be important in studies that are designed to accurately measure routine PA not contaminated by feedback. In contrast, these sensors may not be suitable for studies that aim to improve PA in older people through the use of feedback.

As seen in Table 1, most accelerometers have accompanying software that process the raw data and provide aggregate measures. The one exception is the Logger Technologi, which only provides raw data that require substantial processing and computation [50]. The most common aggregate data that are provided include step count and PA intensity. A few sensors (Actigraph [37•, 38•], Actical [41, 42, 43•], and PAMsys [46]) provide energy expenditure data as well. While most sensors measure overall magnitude of PA, only two systems (Activpal [44•, 45] and PAMsys [46]) classify type of posture (sit, stand, walk). A few of the tri-axial accelerometers have been extensively used in large-scale epidemiological studies. The Actigraph is a sensor that provides summary data on PA and also provides access to the raw data. The Actigraph sensor has been used in a number of large-scale studies including the Women’s Health Study [38•] and the National Health and Nutrition Examination Survey-NHANES [39•]. In the Women’s Health Study, accelerometer counts per minute were used to define sedentary behavior (defined as < 100 counts per minute). Based on this criterion, older participants were observed to spend two-thirds of their time in sedentary activity [38•]. In contrast, in the NHANES study, accelerometer counts were used to classify PA intensity (sedentary, light, moderate-to-vigorous). These data showed that non-sedentary activity decreased with age, though men increased sedentary behavior to a greater extent than women [39•]. Most studies attached the Actigraph on the hip, though more recently some studies have begun using the sensor on the wrist.

The Actical sensor is waterproof and can be used for prolonged data collection. This sensor has been used in the Framingham Heart Study (FHS), in which 78% participants (N = 2672) wore the sensor on their hip for 8 days. The intensity of PA was defined using accelerometer counts from published data. The Activpal sensor is directly attached to the skin on the thigh and often covered with transparent tape to ensure waterproofing [45]. The Activpal was used in the Maastricht study, a population-based cohort study in which participants wore the sensor for 8 days. The software accompanying the sensor is accurate in step count and calculating PA intensity. In addition, the software is able to classify postures, which enables a more accurate calculation of sedentary versus non-sedentary activity. However, large-scale studies used additional programming to compute PA intensity. An additional limitation was that the thigh location of the sensor was not optimal for adherence [45].

The PAMsys is another sensor with similar capability in terms of classifying postures. The sensor is lightweight, worn on the sternum, and has accompanying software that computes PA with proprietary algorithms [46]. The Geneactiv is a newer sensor that is waterproof and can be worn on the wrist in an attempt to improve adherence. The sensor has been used in the Whitehall study, a longitudinal study of British civil servants. Participants (N = 3749) wore the sensor on their wrist for 9 days. Acceleration counts were used to classify activity as sedentary, mild, or moderate-to-vigorous [47•]. The accompanying software uses proprietary algorithms to compute PA intensity, and the raw data are also made available. Tri-axial accelerometers are extremely accurate at step counts and PA intensity, energy expenditure and classification of postures. Since none of the sensors provide immediate feedback, they are optimal for research studies but may not be useful for improving PA.

  1. c.

    Combined Accelerometer and Heart Rate sensor: A major limitation of accelerometers is that they do not capture non-ambulatory physical activities such as weight lifting, cycling, yoga, etc. Since heart rate is strongly associated with intensity of PA, combined measurement of heart rate and acceleration may provide a more accurate measurement of non-ambulatory activity. The Actiheart is the only sensor that measures heart rate and PA [51•, 52, 53]. The sensor is positioned horizontally on the chest, which may be challenging for continuous wear. However, the Actiheart has been successfully used in large-scale studies such as the Baltimore Longitudinal Study of Aging [51•, 52, 53]. The waterproof sensor can continuously record data for 21 days and provides data on step counts, PA intensity, energy expenditure, heart rate, and inter-beat interval. As with many accelerometers, the Actiheart software uses proprietary algorithms to classify PA.

  2. d.

    Complex multiple sensor systems: Recent developments in technology has enabled researchers to create complex inertial measurement units (IMU), which contain accelerometers (measure linear acceleration), magnetometers (establish orientation of body in space), and gyroscope (measure rotation) in a single unit. Some of these IMUs, such as the MTx [54, 55], the Physiolog sensor [59], and the iPhone [60] measure raw acceleration data which require significant filtering, processing, and subsequent computational analysis to provide meaningful clinical measures. The only single IMU system that provides clinically relevant measures is the Dynaport [61, 62•], which has been tested for validity and reliability in measuring PA, energy expenditure, body sway, and gait analysis in older adults [82]. Other complex systems use multiple sensors to provide a more accurate measurement of spatio-temporal measures of posture and gait [65, 66]. The major advantages to these complex systems is that they have undergone extensive reliability and validity testing, and have instrumented standardized outcome measures (such as the Timed Up and Go test, the 10-m walk test), which makes them clinically relevant. The disadvantages of these systems are that they are significantly more expensive than single accelerometers and require training in administration and interpretation of the data. In addition, most of the complex systems use proprietary algorithms to compute spatio-temporal measures. Since some of these systems use multiple sensors, they may not be optimally suited to long-term data collection in community settings.

Consumer Grade Wearable Sensors

Within the past decade, the consumer grade wearable industry has grown exponentially. In 2015, the revenue for the consumer wearable industry was $5 billion [83]. The most popular brands of consumer wearable sensors include FitBit, Nike Fuel, Garmin, Polar, Apple iwatch, and Samsung gear fit [84•]. These devices are typically smaller and sleeker, and some of them are less expensive than research grade devices. Consumer grade devices commonly measure step count, distance walked, and energy expenditure [85••]. One feature that distinguishes them from research grade sensors is that they provide immediate feedback to the user, which may be beneficial in providing motivation to older adults for improving PA.

Among commercial trackers, FitBit One, FitBit Charge, FitBit Flex, and the Polar trackers have been tested against research grade sensors [84•]. While these trackers demonstrate a high correlation with research grade sensors for step counts, their accuracy varies with placement [86]. For instance, the FitBit tracker is more accurate at step count when placed at the ankle as compared with the waist, as seen in a recent study in post-surgical patients [87]. Our pilot unpublished work confirmed this finding in older adults walking at slow gait speed (< 1.0 m/s). The major limitations of consumer grade trackers are that they are often released on the market without extensive validity and reliability testing. Moreover, the algorithms used to calculate steps, energy expenditure are proprietary, making it difficult to validate these devices against research grade devices. Most often, these devices do not provide access to raw data, which makes it difficult to conduct validity testing. Perhaps the biggest challenge of consumer grade wearable devices is management of the large amounts of data to ensure security of personal health information, and ensuring adequate processing power to summarize large amounts of data collected from each individual.

While they are fairly accurate at step counts and PA duration for gait speeds above 1.0 m/s, they overestimate step counts and duration of PA for people with slow gait speed and gait impairments [84•]. Since slow gait speed and gait impairments are extremely common in older adults, careful consideration must be used for selecting the most appropriate tracker. Commercial grade trackers also undergo significant changes from 1 year to the next, and sometimes go off the market based on sales rather than utility. These factors are important in choosing the most appropriate tracker. At this time, the most frequently used tracker (FitBit) copyright has been tested for validity and reliability. While the FitBit may overestimate PA when compared with research grade trackers, it may be useful for comparing data within a subject, given its high test-retest reliability [84•]. Other wearable devices such as the Apple iWatch and Samsung Gear are also becoming popular. However, these newer devices have not been validated against research grade sensors.

Conclusions

The past decade has seen an exponential increase in the use of research grade and consumer grade wearable devices in older adults. As discussed in this review, there are numerous sensors ranging from simple pedometers to highly complex multiple sensor systems that use a combination of accelerometers, magnetometers, and gyroscopes. Given the number of sensor options, the choice of sensor depends on addressing the following questions:

What is the purpose of measurement? Is the purpose to monitor PA or is the purpose to use wearable sensors to improve PA? In addition, it is important to consider whether measurement of PA is for research or clinical practice.

What aspect of PA is of interest? Step count, intensity and type of PA intensity, spatio-temporal measures of mobility or gait. If step counts are the primary measure of interest, pedometers or simple accelerometers may be adequate. If PA intensity or type of PA is of interest, tri-axial accelerometers may be the better option, particularly for research studies. If more complex mobility measures or gait is of interest, complex systems may be the best option.

What is the desired length of monitoring?

Are there cost constraints?

Considerations for research: Since there are a wide variety of sensors available and a variety of methods, clear standards for data collection, data processing and computation of PA measures need to be established. If PA is being measured for research, it may be important to consider sensors that do not provide feedback to the user. As discussed earlier, researchers use cut points for acceleration or step count data in order to classify intensity of PA. Recently, the use of cut points has been challenged as these may be inaccurate in predicting energy expenditure [80]. New methods of pattern recognition using machine-learning algorithms need to be explored further [88]. Sensors should be tested for validity against known standards and tested for reliability. This is particularly true for determining type of PA, since there are only a few sensors that currently provide such information.

There is also an important need to establish protocols for non-ambulatory activities such as cycling, swimming, yoga, and weight lifting. Capturing non-ambulatory activity may require integration of accelerometers and heart rate monitor—currently, there is only one sensor (Actiheart) that measures such integrated data. The Actiheart and any newly developed sensors need to be validated for determining intensity of PA during non-ambulatory activity. For research studies, it may be best to use research grade sensors because they have been extensively tested for validity and reliability. For studies aimed at improving PA in older adults, researchers may consider using consumer grade sensors to make use of feedback to provide additional motivation.

Considerations for clinical practice: Reliability, validity, and cost may be important considerations for choosing appropriate wearable sensors. For older adults, it is important to consider sensors that are easy to wear, and require minimal user input for logging data. In addition, clear standards need to be established for minimal wear time in order to capture the desired amount of PA. Clinical practitioners may also consider using consumer grade sensors or using existing sensors in smartphones in order to improve PA in older adults. Data on adherence and unit failures should be carefully recorded.

Considerations for Future: In the future, in addition to development of clear standards for data collection and analysis, sensor developers need to provide access to raw data so that researchers can conduct independent validity testing. Moreover, it is important for hardware and software developers to provide access to algorithms for computation of PA intensity. For sensors that use acceleration data to compute measures related to mobility and gait, clear standards are needed for determining minimum amount of data required for such computations [89•].

An important need in the field is the development of sensors or sensor systems that are capable of measuring the spatial extent of travel within and outside the home environment, in addition to standard measures such as step counts, type and intensity of PA, and energy expenditure. In addition, there is a need for wearable systems to integrate collection of acceleration and heart rate data so that non-ambulatory PA can be captured. Finally, sensitive sensors need to be developed that are able to capture light intensity activity, such as performance of activities of daily living, which constitutes a majority of PA in the elderly. One excellent example of comprehensive measurement of acceleration, heart rate, and GPS data was recently published [90••]. In this study, a system of sensors comprised of tri-axial accelerometers, global positioning system (GPS), and heart rate monitor were wirelessly connected with a smartphone that received and compressed the data and communicated the data to a remote server. The authors noted several challenges including loss of GPS data, high battery consumption, loss of data due to poor wireless communication, and discomfort of the heart rate sensors [90••]. Solutions to these challenges will lead to the development of the next-generation wearable devices that have greater capabilities and provide data that are more comprehensive and clinically meaningful [91•].