Introduction

Measures of mobility in older adults are often used as indicators of health and functioning; yet, the role of mobility in the causal pathways between functional status and health-related quality of life (HRQoL) remains poorly understood. Some studies attempt to understand how demographic, biomedical, psychological, sociological, or environmental variables predict mobility [1, 2], while others have suggested the role of mobility as a predictor of other functional measures [1, 3].

Self-reports of difficulty with activities of daily living (ADLs) are often used to assess functional ability. These classic, clinically relevant indicators reflect what individuals report they are capable of doing. In terms of affecting daily living and quality of life, self-reported functional ability is one important variable that undoubtedly affects mobility, or what individuals actually do [2], which may, in turn, affect standard measures of HRQoL. Indeed, perceived difficulties in performing ADLs have been shown to be related to mobility [1, 2] and mobility has also been shown to be correlated with the physical and mental component summary scores of the SF-12 [1], a commonly used measure of health-related quality of life [4]. However, multivariate conceptual models of functional ability, mobility, and subsequent quality of life are rarely articulated and then evaluated empirically.

Wilson and Cleary [5] developed a theoretical framework to link the biomedical model of health with a quality of life model. Their model focuses on five levels of patient outcomes (i.e., biologic and physiologic variables, symptoms, functioning, general health perceptions, and overall quality of life) and proposes mediating relationships among these variables (e.g., the relationship between symptom status and overall quality of life is mediated by functional status and general health perceptions) in addition to explanatory roles of several personal and environmental variables. Parts of this model have been applied to a variety of patient populations and some have expressly examined the mediated effects implied by the model [610].

It is important to note that the Wilson–Cleary model was not intended to completely describe all relationships important to individuals’ health [11]. The model has been adapted, modified, and extended [1214], and relationships not directly specified in the model have been tested and supported. For example, social support, as an environmental variable in the Wilson–Cleary model, is posited to have direct effects on functional status, general health perceptions, and overall quality of life. However, there is evidence that social support mediates the functional status and quality of life relationship [15]. Others have explicitly examined the mediating role of personal variables proposed to have direct effects in the original Wilson–Cleary model [14]. Given this literature as well as past research suggesting their bivariate relationships [1, 2], an adaptation of the Wilson–Cleary model appears to provide a useful conceptual framework for examining the multivariate relationships among functional status, mobility, and quality of life.

The purpose of this study was to assess whether the relationship between functional status (i.e., ADLs or what individuals report they are capable of doing) and HRQoL is mediated by mobility (i.e., what people actually do in terms of mobility). The general conceptual overview is summarized in Fig. 1. By examining the predictive pathways from functional abilities to HRQoL through mobility over time, this study subjected the testing of this mediation hypothesis to the more rigorous longitudinal analysis methods that are becoming more common to test mediation relationships [1625]. In the case of true longitudinal mediation, Maxwell and Cole [18] have demonstrated that cross-sectional analyses provide biased estimates of the indirect effect (or mediated effect) in two different models of change, the autoregressive model and a random effects model.

Fig. 1
figure 1

Basic mediation model for the present study. Indirect effect = ab, direct effect = c’, total effect = ab + c’, proportion mediated = ab/(c′ + ab)

Most previous analyses of mediated relationships based on the Wilson–Cleary model have relied on cross-sectional data [68]. Some evaluations of the Wilson–Cleary model used longitudinal data, but did not test for mediated effects [2628] or used only two waves of data [9]. Although with additional assumptions it is possible to provide evidence of mediation with just two waves of data [17], three or more waves of data are usually necessary to subject each leg of the mediated effect to a temporal sequence that would more strongly support a causal interpretation [25, 29].

Methods

Setting and participants

This analysis is based on the data from participants in the University of Alabama at Birmingham (UAB) Study of Aging, a population-based, longitudinal study of mobility among community-dwelling older adults, randomly sampled from a list of Medicare beneficiaries living in one of five counties (two urban and three rural) in central Alabama. African Americans, men, and rural residents were oversampled to provide a balanced sample in terms of race, gender, and rural/urban residence (the study was designed to examine racial differences in mobility among older adults). Individuals in nursing homes and people unable to set their own appointments were excluded. Participants were recruited by mail and followed up by telephone. Trained interviewers conducted baseline in-home interviews between November 1999 and February 2001 with n = 1,000. Follow-up interviews were conducted by telephone. Minimal proxy input for factual questions was available and used for approximately 15 % of the participants [2]. For the primary variables used in the present study, responses did not come from proxy respondents (i.e., they were self-reported). Additional study details can be found elsewhere [2, 30].

For the present study, data from four waves equally spaced approximately 18 months apart (baseline, 18, 36, and 54 months) were analyzed. Participants who did not survive at least one year beyond the 54-month assessment were excluded. Thus, by design, we restricted our analysis to changes over time free of the terminal decline shown to occur in the last year of life on the variables of interest [31]. Of the 1,000 initial participants, 250 died prior to month 54 and 61 died after month 54 but before month 66. An additional 12 participants had no data other than baseline data and were excluded. Thus, 677 participants were used in the present analyses. As expected, compared to excluded individuals, participants in this analysis were younger at baseline, had fewer verified comorbidities, higher life-space mobility scores, less ADL difficulty, and higher HRQoL scores (p < 0.0001 for all comparisons). The original study and the present analysis were approved by the UAB Institutional Review Board, and all participants gave written informed consent.

Measures

Functional status

Physical functional status was measured by self-reported difficulty with ADLs. Participants were asked whether they had difficulty with specific ADLs, and a latent functional status variable was extracted from four dichotomous (yes vs. no) ADL items—transferring, bathing, dressing, and toileting. The factors at each time point were identified by fixing the factor loading of the first ADL (transferring) to be one. Responses to mobility-related ADLs (i.e., using stairs, walking, and getting outside) were also obtained, but these data were excluded from the latent functional status variable due to their conceptual overlap with the LSA. Data on another ADL (eating) were collected but excluded from the analysis because difficulty with this ADL was rarely reported. At some waves, no respondents endorsed this item, constraining the variance and complicating the extraction of a latent functional status variable that included this item at those waves.

Life-space mobility

The UAB Study of Aging Life-Space Assessment (LSA) measures mobility based on the distance each participant reported moving during the 4 weeks preceding the assessment [1, 2, 30, 32]. The instrument assesses mobility in terms of the extent and frequency of movement across five levels (i.e., within one’s home outside the bedroom, areas immediately outside of one’s home, within one’s immediate neighborhood, within one’s town, and outside one’s town), as well as the use of assistance in performing such movements. For each life-space level, participants were asked whether they traveled to that area, how frequently they traveled to that area, and whether they needed help from another person or assistance device to travel to that area. As described previously [1], the life-space composite (LSC) score is calculated by combining life-space level attainment, the frequency of attaining each level, and the degree of independence in achieving each level. Scores ranged from 0 (mobility limited to the room where one sleeps) to 120 (traveled out of town every day without assistance). The two-week follow-up test–retest reliability of the LSC was high (intraclass correlation = 0.96) [1]. This composite score was used as the mobility variable in the analyses. Given its high reliability and its calculation as a composite variable, the observed LSC score was analyzed as the mediator rather than extracting a latent mobility variable. The LSC has been used in this manner in other studies [2, 3, 33].

HRQoL

Version 1 of the SF-12 was used to assess HRQoL. Mental component summary (MCS) and physical component summary (PCS) scores were calculated using procedures described by Ware et al. [4]. These scores are transformed and scaled such that the general US adult population has a population mean of 50 and standard deviation of 10. Test–retest reliabilities of 0.89 and 0.76 have been reported for the SF-12’s PCS and MCS, respectively [4]. Given established, strong measurement properties, and frequent use within the field, MCS and PCS scores were treated as observed variables in all analyses. This is consistent with common practice, and prior research has shown very high correlations with these summary scores and latent variables based on individual items (0.97 for PCS and 0.96 for MCS) [34].

Covariates

The following variables, collected at baseline, were used as covariates in all analyses: sex, race (African American or white), age, recent smoking status (whether the participant smoked in the past year), education, and a verified comorbidity score. Education was divided into year categories: 1 ≤ 6; 2 = 7–11; 3 = 12; 4 ≥ 13.

The Charlson Comorbidity Index reflects the number and severity of comorbidities present in hospitalized patients and depends upon medical record reviews [35]. We used the Charlson comorbidity categories as a guide to calculate a comorbidity score appropriate for community-dwelling older adults. The comorbidity score reflected the number of comorbidities present, but not their severity. The comorbidity score was based on self-reported medical conditions verified either by the use of a prescription medication, physician report, or a hospital discharge diagnosis recorded in the previous 3 years. Verification procedures for these medical conditions have been fully described elsewhere [3, 33].

Verified conditions were put into twelve comorbidity categories based on the work of Charlson et al. [35]. The disease categories included: (1) congestive heart failure; (2) angina pectoris or heart attack; (3) valvular heart disease; (4) peripheral arterial disease; (5) hypertension; (6) diabetes mellitus; (7) chronic obstructive lung disease or asthma; (8) kidney disease; (9) liver disease; (10) cancer other than skin cancer; (11) any neurological disorder including stroke, Parkinson’s disease, seizures, or Alzheimer’s disease; and (12) gastrointestinal disease including stomach, bowel, or digestive problems. The number of verified medical condition categories present was used to calculate the comorbidity score. Thus, the comorbidity score could range from 0 to 12.

Sex, race, and recent smoking status were dichotomous variables while age, education, and comorbidity score were treated as continuous variables in all analyses.

Statistical analysis

Statistical mediation suggests that the causal effect of one variable on another is transmitted through a third variable called a mediator or intervening variable [36]. A variety of statistical methods for assessing mediation have been proposed [37]. The presence of longitudinal data provides additional data analysis options [17, 18, 20, 25], and the present analysis evaluated the proposed mediated effect utilizing an autoregressive approach.

The basic premise of autoregressive models is that the values of variables at future time points depend in part on values at earlier time points. The simplest of these models involves one variable measured at multiple time points for a set of individuals. This basic model (the univariate simplex model [38, 39]) was extended to an autoregressive mediation model [17, 40], involving the variables: X (the independent variable), M (the mediator), and Y (the outcome).

A basic framework for the analysis can be found in Fig. 2. For clarity, only structural paths (i.e., no measurement paths, no residual covariances) for the main variables (i.e., no covariate paths) are included in Fig. 2. A modified set of steps for autoregressive mediation modeling was utilized [17, 20]. Prior to fitting any longitudinal models, a cross-sectional assessment of mediation was evaluated using baseline data. Next, an initial autoregressive mediation model (i.e., Model 1’s) was fit separately for MCS and PCS as outcome variables based on the paths outlined in Fig. 2.Footnote 1 Although measurement invariance over time for the latent functional status variable based on ADLs was not assumed (i.e., factor loadings and thresholds were free to vary over time), these models did include covariates as control variables as well as covariances between residual terms of downstream endogenous variables within the same wave. This last model component is recommended by Anderson and Williams [41] and the presence of significant and meaningful covariation among these residuals implies the possibility of confounding due to variables missing from the model. Failing to specify such covariances may lead to biased estimates [17]. Following this step, a test of measurement invariance over time for the functional status variable was performed by constraining both thresholds and loadings to be equal across waves.

Fig. 2
figure 2

Autoregressive mediation model framework for the present study. Measurement paths (i.e., for functional status), covariate paths, and covariances between residual terms of downstream endogenous variables within the same wave are estimated but have been omitted for simplicity. FS functional status, LSC composite measure of life-space mobility, and MCS and PCS mental component summary score and physical component summary score of the SF-12, respectively

After these autoregressive measurement modeling options were finalized, the indirect effect (or mediated effect) of functional status on HRQoL through life-space mobility was estimated by the product of coefficients method and evaluated for statistical significance using both the Sobel test [42] and a bias-corrected bootstrap confidence interval with 1,000 draws [43]. In addition, the proportion-mediated effect was calculated as a measure of mediation effect size. This measure consists of the ratio of the mediated effect to the total effect and represents the proportion of the total effect that is mediated by the intervening variable [37].

With four waves of data, it is possible to estimate both time-specific indirect effects and the overall (or total) indirect effect [17, 23, 40]. As outlined in Fig. 2, there are three time-specific indirect effects (with the estimate of each indirect effect based on the product of coefficients method):

  1. 1.

    FS1 → FS2 → LSC3 → PCS4 (or MCS4) = x 1 × a 2 × b 3

  2. 2.

    FS1 → LSC2 → LSC3 → PCS4 (or MCS4) = a 1 × m 2 × b 3

  3. 3.

    FS1 → LSC2 → PCS3 (or MCS3) → PCS4 (or MCS4) = a 1 × b 2 × y 3

The overall indirect effect of FS1 → PCS4 (or MCS4) is the sum of these three time-specific indirect effects. This addresses the question of whether LSC mediates the effect of FS1 on PCS4 (or MCS4) at any time between waves 1 and 4 (rather than at some specific point), a question usually of greater interest to researchers [17, 23]. Thus, both time-specific and overall indirect effects were calculated and evaluated for significance.

In addition to the measure of indirect effect, the fit of the models to the data was evaluated using various fit indices including the chi-square goodness-of-fit statistic, the root mean square error of approximation (RMSEA), comparative fit index (CFI), and the weighted root mean square residual (WRMR). Good model fit is indicated by a small chi-square value (this test is quite sensitive to sample size), an RMSEA less than 0.05 [44], a CFI above 0.95 [45], and a WRMR of less than 1.0 [46].

Following the test of the initial mediation models (Model 1’s), the potential for omitted paths was evaluated using modification indices and expected parameter change values. Adjustments evaluated were those that could be justified theoretically, including: (1) covariances between measurement errors for the same ADLs measured at several time points, (2) wave-skipping autoregressive paths (e.g., FS4 predicted by FS3 and FS2), (3) cross-lagged relations (e.g., FS2 predicted by LSC1), (4) autocorrelation residuals (e.g., residual variance of FS3 with the residual variance of FS2), (5) contemporaneous paths (e.g., LSC2 predicted by FS2), and additional direct effects (e.g., PCS4 (or MCS4) predicted by FS2). Effects going backward in time were not considered. Elaborated models (Model 2’s) based on the results of these specification searches were then estimated separately for MCS and PCS. This approach allowed for the inclusion of fit-based model enhancements, some of which might have the potential to alter the size and/or significance of the indirect effect, so the tests of the mediated effects were repeated in the modified models.

All analyses were conducted in Mplus 6.11 (Muthén & Muthén, Los Angeles, CA). To account for the categorical functional status indicators (i.e., ADLs), a robust weighted least squares estimator (i.e., WLSMV) was used [47]. There were 677 complete cases at wave 1, 651 at wave 2, 626 at wave 3, and 611 at wave 4. Rather than listwise deletion (i.e., analyzing only those with complete data at all four waves), the WLSMV estimator in Mplus used the total available sample for analysis. Such estimates are consistent under the assumption of missing at random with respect to covariates (MARX) and more efficient than listwise deletion [48].

Results

Table 1 provides means and standard deviations or frequencies for the main study variables and covariates. In addition, the numbers of cases for each variable at each time point are provided. The functional status variable, as well as the SF-12’s PCS and MCS, demonstrated good internal consistencies.Footnote 2 Cronbach’s alpha for the ADL measure was 0.72, 0.85, 0.87, and 0.92 at waves 1, 2, 3, and 4, respectively. For PCS, Cronbach’s alpha was 0.85, 0.86, 0.88, and 0.90 at waves 1, 2, 3, and 4, respectively. For MCS, Cronbach’s alpha was 0.76, 0.81, 0.83, and 0.84 at waves 1, 2, 3, and 4, respectively. For all models tested, all factor loadings for the latent functional status variable were significantly different from zero (p < 0.0001) and all standardized loadings were greater than 0.75, with most near 0.9.

Table 1 Descriptive statistics

Because functional status was conceptualized as a latent variable represented by difficulty with multiple ADL items over time, it was possible to assess factorial invariance across waves. Initial models based on Fig. 2 (with separate models for PCS and MCS as outcome variables) were estimated allowing functional status factor loadings and thresholds to freely vary over time and compared to models where thresholds and loadings were constrained to be equal across waves. The WLSMV estimator in Mplus precludes the use of traditional chi-square difference testing to evaluate invariance. Thus, the DIFFTEST option in Mplus was used [49] in which a significant finding indicates a lack of measurement invariance. For both the PCS and MCS models, this test for the lack of invariance was statistically significant (Δχ2 = 37.65, df = 21, p = 0.0142; Δχ2 = 42.21, df = 21, p = 0.004 for PCS and MCS models, respectively). However, for relatively large samples, the test for invariance can be statistically significant (i.e., indicating a lack of invariance) even when there are only minor differences in model parameters [50, 51]. Examination of fit indices that take model parsimony into account indicated that the value of the RMSEA was actually reduced from the unconstrained to the invariant models (0.041 → 0.038 and 0.039 → 0.037, for PCS and MCS models, respectively) and the value of CFI was either increased or remained unchanged (0.981 → 0.982 and 0.983 → 0.983, for PCS and MCS models, respectively) with the addition of invariance constraints. These findings support an interpretation that the deviations from perfect measurement invariance were quite small and of no practical significance.Footnote 3 Thus, all subsequent models were tested using the constrained invariant factor loadings and thresholds over time for the functional status variable.

Results of the longitudinal mediation models with measurement invariance constraints imposed for functional status are summarized in Table 2. The fit indices for Model 1’s based on the framework in Fig. 2 indicated good model fit for both PCS and MCS. For PCS, only one time-specific indirect effect was statistically significant (FS1 → LSC2 → PCS3 → PCS4), but the total indirect effect was statistically significant using both the Sobel test and the bias-corrected bootstrap method (95 % CI: −1.282, −0.062). About 13 % of the total effect of functional status on PCS was mediated through LSC. With MCS as the outcome variable, all three time-specific indirect effects were statistically significant as well as the total indirect effect (using both the Sobel test and the bias-corrected bootstrap method [95 % CI: −1.17, −0.235)]. About 44 % of the total effect of functional status on MCS was mediated by LSC. However, the direct effect of functional status on MCS was only marginally significant (p = 0.051), suggesting the possibility of complete mediation of the functional status and MCS relationship by LSC.

Table 2 Model fit indices and unstandardized path coefficients for longitudinal mediation models

Modification indices were used to suggest additional paths, and these results can be found in Table 2. In addition, completely standardized path coefficients for the original paths as well as the newly added paths can be found in Figs. 3 and 4 for the PCS and MCS models, respectively. Newly added paths consisted of either wave-skipping autoregressive paths (e.g., FS4 predicted by FS2) or cross-lagged relations (e.g., FS2 predicted by LSC1). Modification indices suggested no other paths that would lead to substantial improvements in model fit after these paths were included. Thus, no additional paths were added for covariances between measurement errors for the same ADLs measured at several time points, autocorrelation residuals, contemporaneous paths, or additional direct effects. As expected, model fit improved significantly with the addition of the paths listed in Table 2. More importantly, the significance of the functional status → life-space mobility → HRQoL total indirect effect was maintained for both PCS and MCS. According to the Model 2’s, almost 20 % of the total effect of functional status on PCS was mediated through LSC, while it appears that all of the relationship between functional status and MCS is mediated by LSC (i.e., p = 0.691 for the direct effect of functional status on MCS).

Fig. 3
figure 3

Completely standardized parameter estimates from the final model (i.e., Model 2) with PCS as the outcome variable including fit-based model enhancements. Measurement paths (i.e., for functional status), covariate paths, and covariances between residual terms of downstream endogenous variables within the same wave are estimated but have been omitted for simplicity. Parameter estimates are located above the corresponding path. FS functional status, LSC composite measure of life-space mobility, and MCS and PCS mental component summary score and physical component summary score of the SF-12, respectively

Fig. 4
figure 4

Completely standardized parameter estimates from the final model (i.e., Model 2) with MCS as the outcome variable including fit-based model enhancements. Measurement paths (i.e., for functional status), covariate paths, and covariances between residual terms of downstream endogenous variables within the same wave are estimated but have been omitted for simplicity. Parameter estimates are located above the corresponding path. FS functional status, LSC composite measure of life-space mobility, and MCS and PCS mental component summary score and physical component summary score of the SF-12, respectively

As a basis for comparison to the longitudinal autoregressive models, the mediating role of life-space mobility in the relationship between functional status and HRQoL was evaluated cross-sectionally using baseline data. This cross-sectional analysis was conducted on both the 677 cases used in the longitudinal analysis and the complete sample at baseline (n = 1,000). The latter analysis was conducted because this is the approach that would have been used to evaluate the mediated relationship had no longitudinal data been available. Model fit in all cross-sectional models was acceptable, but the proportion-mediated effects were generally smaller in the cross-sectional models (PCS: about 5 %; MCS: about 21 %) compared to the longitudinal models, and in one case, the mediated effect was not statistically significant (for PCS with the complete sample at baseline—results available upon request).

Discussion

The association between functional status and quality of life in older adults is well established [15, 52, 53]; this study sought to investigate a potential reason for this relationship, namely the role of life-space mobility. Longitudinal analyses supported the predicted mediating role of life-space mobility in the relationship between functional status and HRQoL, a set of relationships adapted from the Wilson–Cleary model of patient outcomes. Results suggest that functional status limitations defined by ADL difficulty predict lower levels of life-space mobility which in turn predicted diminished HRQoL. Given these relationships, interventions designed to maintain or enhance ADL independence with the ultimate goal of preserving or improving quality of life should consider the mediating role of life-space mobility in this relationship. This variable may itself serve as a potential target for intervention or serve to provide greater insight into the functional status and quality of life relationship, thereby providing more information concerning the development and testing of interventions. Others have suggested the usefulness of the addition of life-space mobility measures to standard ADL assessment strategies [32].

The mediating role of life-space mobility appears to be more significant with the SF-12’s MCS as the outcome compared to PCS. The final level of the Wilson–Cleary model, overall quality of life, has been represented by different measures. Some have used HRQoL measures (generic or disease-specific) [8, 5456], while others have used life satisfaction measures [9, 27] or measures of mental health [6] and psychological distress [7, 14]. The items in the SF-12 most strongly related to PCS are closely related to physical functioning. Höfer and colleagues [54] used the physical functioning scale of the SF-36, which is a large part of PCS scores, as a measure of functional status. Given the conceptual overlap between functional status, life-space mobility, and PCS, it is not surprising that the mediating role of life-space mobility is less significant with PCS relative to MCS. MCS is more consistent with the end level in the Wilson–Cleary model.

Although not tested as an a priori hypothesis, the presence of cross-lagged relationships between functional status and life-space mobility as originally indicated by modification indices and subsequently tested in elaborated models suggests that functional status levels not only influence changes in life-space mobility, but also show effects in response to changes in mobility. This finding is consistent with previous suggestions that mobility limitations may precede difficulties with specific ADL tasks [1], in addition to being an outcome of functional status changes [2]. The potential reciprocal relationship offers additional support for including both traditional methods of functional assessment and life-space mobility measures in a comprehensive assessment strategy for older adults [32]. Other evaluations of the Wilson–Cleary model have examined such reciprocal effects. For example, Mathisen and colleagues [27] note reciprocal causal effects over time between general health perceptions and overall quality of life. Wilson and Cleary [5] note the possibility of such effects, and other researchers have called for an examination of these bidirectional relationships [7, 9]. Further research is required to support the view that the Wilson–Cleary model be modified to explicitly incorporate bidirectional causal effects.

Estimates of mediation effects from autoregressive models were only partially consistent with estimates from cross-sectional analyses. For example, researchers might have concluded that there was no significant mediation of the functional status and PCS relationship using all available cases at baseline. Longitudinal models revealed larger indirect effects as a percentage of total effects when compared to cross-sectional models, even suggesting complete mediation with MCS as the outcome.

To our knowledge, this is the first test of mediated relationships implied by the Wilson–Cleary model to use longitudinal data with three or more waves of data. There are several benefits of using longitudinal rather than cross-sectional data to evaluate mediation processes [20, 25, 40]. First, more information regarding the temporal sequentiality of the independent variable, the mediator, and the outcome variable is provided with longitudinal data, a critical underlying assumption of the causal relationships involved in mediation. Similarly, causal effects often take time to develop, and variables measured at the same time may not allow for the necessary development time. Second, associations between variables in cross-sectional models capture both time-varying and time-invariant covariances between participants, but longitudinal models allow these two sources of covariance to be separated and tested more directly in line with the causal assumptions of the mediation model. Finally, individuals may serve as their own control in assessing some relationships, thereby potentially controlling for static differences among individuals. Variables often have effects on themselves, such that variables at later times are related to the same variable at earlier times. Cross-sectional models implicitly assume these autoregressive effects are zero, and substantial bias can be introduced as a result. Autoregressive effects adjust for these relationships, and accordingly adjust the time-varying associations that are being tested as part of the causal model.

There are several limitations of the present study to consider. Although longitudinal analyses provide advantages over cross-sectional views of mediation, the data analyzed in the present study are still considered observational. Longitudinal structural equation modeling cannot prove causality in these kinds of designs, but it can test whether a proposed causal model is plausible and whether it provides good fit to the observed data. In the present study, 18 months separated each wave. The timing and spacing of measurements can influence the detection and estimation of effects in longitudinal studies [57], and the 18-month interval may not have been optimal to detect more rapidly transmitting causal effects. Missing data are usually problematic in any longitudinal study. It is not surprising that a substantial number of participants did not survive the entire study period. These individuals were excluded as the intent of the study was to examine the mediated effect of interest in a population not experiencing the terminal decline in functional status and mobility that occurs in the last year of life [31]. Different mechanistic models may need to be developed to assess relationships occurring in the period of the terminal drop. A smaller number of individuals failed to complete certain assessments during the study period. We assumed that the missing data mechanism in this data set was missing at random with respect to covariates [48]. For other missing data mechanisms, other approaches such as multiple imputation [58] or pattern-mixture modeling [59] would be more appropriate. Finally, although the model modification through specification searches only considered theoretical plausible paths, it is important to note that such modifications can lead to model misspecifications.

Conclusion

The present longitudinal analysis, based on an autoregressive model, supports the mediating role of life-space mobility and suggests that this role is more significant with the MCS of the SF-12 as the outcome than the PCS. Evidence for a reciprocal relationship over time between functional status and life-space mobility was suggested by the modification indices, which is consistent with other modifications of the Wilson–Cleary model. These elaborations to the model did not alter the substantive meaning of the mediation effects. Several authors have advocated for the use of longitudinal designs when evaluating mediation relationships. The present study provides an example of one approach for testing mediation with longitudinal data.