Introduction

Fatigue is a common and disabling symptom of multiple sclerosis (MS) [1]. Fatigue can negatively affect quality of life, including ability to perform self-care activities, maintain employment, and participate in social roles [2,3,4,5,6]. Unfortunately, the multifactorial etiology of fatigue remains unclear, and its treatments are suboptimal, with medications and behavioral interventions having modest effects on reducing fatigue [7,8,9,10,11,12]. Studies examining the multifactorial causes of MS fatigue have yielded inconsistent results [13], and there is significant heterogeneity in how participants with MS respond to fatigue interventions [14,15,16]. Identifying subgroups of participants who experience fatigue in the same way may help provide novel insights into the factors causing fatigue and how to develop personalized fatigue interventions that reduce response heterogeneity.

Studies have yielded inconsistent results on the relative importance of factors that influence MS fatigue [13, 17]. One possible explanation is that the factors that influence fatigue may vary overtime depending on the characteristics of the individual, which other symptoms are present, and social and environmental determinants, such as accessibility barriers, social support, and weather. Thus, identifying subgroups of fatigued participants who have similar characteristics and experience similar symptoms may help identify the conditional circumstances in which a factor may or may not significantly influence fatigue.

Differences in how fatigue is operationalized and measured also make it difficult to compare the inconsistent results and examine response heterogeneity in fatigue interventions [18]. Fatigue is most commonly measured using self-report questionnaires, which ask participants to recall the severity or impact of their fatigue over a specified period of time ranging from 1 week to a few months [18]. These questionnaires operationalize and measure fatigue by severity and impact, as well as its mental and physical components. The severity of mental and physical fatigue can be characterized by intensity, frequency, and duration [19], while their impact can be characterized by the extent to which they negatively affect daily activities [20]. Whether differences in the severity and impact of mental and physical fatigue need to be accounted for in studies remains unknown.

Accounting for the severity and impact of mental and physical fatigue is further complicated by their relationship with other symptoms [21, 22]. Research shows that fatigue can coexist or cluster with symptoms of depression, pain, anxiety, and insomnia [23,24,25,26]. This clustering may indicate common underlying mechanisms that an intervention could address. It may indicate a need to target multiple symptoms within a fatigue intervention. However, it is unclear how symptom clusters that include fatigue vary across individuals and under which circumstances individuals would benefit from a fatigue intervention that addresses other symptoms as well.

A promising approach to explain the heterogeneous results among MS fatigue studies and develop personalized fatigue interventions is to identify and characterize fatigue phenotypes [27, 28]. Here, we define a phenotype as a measurable, quantifiable, and distinct pattern of how fatigue is manifested and experienced overtime in relation to other symptoms and that can explain variations in clinically relevant outcomes [29, 30]. We can operationally define a phenotype as a cluster or subgroup of individuals who experience fatigue and other symptoms in the same way. Identified phenotypes can be characterized by comparing environmental and psychological characteristics within and between subgroups. To the best of our knowledge, no studies to date have identified and characterized fatigue phenotypes in people with MS.

The purpose of this study was threefold: (1) to identify phenotypes or subgroups of individuals with MS who experience fatigue in the same way in relation to symptoms of depression, anxiety, pain, and insomnia; (2) to examine membership stability within a phenotype over a 3-month period; and (3) to characterize phenotypes by health and function, social and environmental determinants, psychosocial factors, and engagement in healthy behaviors. To identify phenotypes and determine the probability of transitioning into a different phenotype overtime, we used latent profile and transition analyses with measures of mental and physical severity of fatigue, mental and physical impact of fatigue, depression, anxiety, and sleep quality.

Methods

We conducted a longitudinal observational study in which we administered online questionnaires on demographics, health and disability status, environmental factors, and the utilization of healthcare and wellness services at baseline, after 6 weeks from baseline, and after 3 months from baseline. Valid and reliable questionnaires were selected to help minimize biases. A recruitment goal of 300 participants was based on a power analysis that assumed moderately correlated data and moderate effect sizes in a structural equation model. Data were collected from July, 2017 to May, 2019. We note that the baseline data from this study were previously published to identify factors that predict fatigue self-management behaviors, as well as to explore the associations between fatigue, depression, and cognitive impairment [31, 32]. The participants were recruited by asking neurologists and rehabilitation clinicians to distribute flyers, as well as by asking advocacy organizations, such as the National Multiple Sclerosis Society, to distribute flyers via email listservs and at events and support groups. Interested individuals were screened over the phone. The inclusion criteria were 18–65 years of age, a self-reported diagnosis of MS by a physician, and at least mild MS fatigue (> 2 on the Quality of Life in Neurological Disorders short form for fatigue) [33]. Individuals who mostly used a wheelchair (> 6.5 on self-reported Expanded Disability Status Scale [34]) and had severe cognitive deficits (< 12 on the short Orientation-Memory-Concentration Test [35]) were excluded.

Measures

Identifying phenotypes

The questionnaires selected to classify the participants’ fatigue phenotypes were the Chalder Fatigue Scale (CFS) [36], Fatigue Scale for Motor and Cognitive Functions (FS) [37], Quality of Life in Neurological Disorders (Neuro-QOL) short forms for depression (DEP) and anxiety (ANX) [33; 38], Pain Effects Scale (PAIN) [39, 40], and Pittsburgh Sleep Quality Index (SLP) [41]. The questionnaires’ Cronbach’s alpha (α) and test–retest reliability (intraclass correlation coefficient, ICC) statistics that are reported below were calculated from the baseline and 6-week data. Baseline and 6-weeks were selected to include as many participants as possible in the reliability calculations while minimizing risks of capturing changes in health status.

The 11-items CFS yields two composite scores on the physical (CFS-P; α = 0.99; ICC = 0.68) and mental (CFS-M; α = 0.99; ICC = 0.73) severity of fatigue. The 20-item FS generates two composite scores on the physical (FS-P; α = 0.95; ICC = 0.90) and mental (FS-M; α = 0.99; ICC = 0.91) impact of fatigue. The eight-item Neuro-QOL DEP assesses feelings of hopelessness and negative affect over a 1-week period (α = 0.94; ICC = 0.90). The 8-item Neuro-QOL ANX measures unpleasant thoughts related to fear, worries, and hyperarousal (α = 0.93; ICC = 0.84). The six-item PAIN assesses ways in which pain interferes with daily activities and enjoyment (α = 0.97; ICC = 0.88). The SLP measures sleep quality and disturbances by generating a composite score of seven subscale scores on subjective sleep quality and duration and daytime dysfunction (ICC = 0.85).

Characterizing phenotypes

The health and function indicators used to examine the differences in phenotypes were the Community Participation Indicators Activity Ratio (CPI-AR) [42,43,44,45], Patient Determined Disease Steps (PDDS) [46, 47], and Self-Report Comorbidity Questionnaire for Multiple Sclerosis (CQMS) [48]. The 20-item CPI-AR assesses meaningful participation in specific social roles and daily activities (ICC = 0.85). A ratio is calculated using the number of important activities engaged in often enough or too much (numerator) over the number of important activities (denominator). The PDDS is a valid and reliable measure of disability in people with MS [46, 47]. The scale ranges from minimal MS impact with normal ambulatory function and no activity limitations to bedridden (ICC = 0.96). The CQMS consists of 36 co-morbidities for which participants indicate their presence or absence based on a physician diagnosis (ICC = 0.93).

Indicators of social and environmental determinants were the Craig Hospital Inventory of Environmental Factors Short-Form (CHIEF-SF) [49], utilization of health and wellness services [50,51,52], and the Modified Social Support Survey (MSSS) [39, 40]. The CHIEF-SF measures perceived barriers in the physical and social environment over the past 12 months. A composite score is calculated based on how frequently participants experience the environmental barrier and whether it posed a big or small problem (ICC = 0.85). The health and wellness services (H&W) indicator was a composite score of three separate questions on the utilization of wellness services (e.g., exercise class or nutritional counseling) and rehabilitation services (e.g., physical or occupational therapy), as well as the use of medications to manage fatigue. These three questions (yes/no) were tallied to generate a composite score ranging from 0 to 3. Finally, the five-item MSSS measures tangible and emotional social support (α = 0.88; ICC = 0.87).

Psychosocial factors were the Self-Efficacy to Manage Chronic Disease Scale (SMCD) [53] and Fatigue Catastrophizing Scale (FCS) [54]. The six-item SMCD assesses confidence in managing symptoms, emotional distress, daily activities, and medications (α = 0.88; ICC = 0.81). The 10-item FCS measures how frequently individuals have pessimistic evaluations or negative outcome expectations about the fatigue they experience (α = 0.98; ICC = 0.88).

Healthy behaviors were the Godin Leisure-Time Exercise Questionnaire (GLTEQ) [55, 56], the Meats, Eggs, Dairy, Fried foods, Baked goods, Convenience food, Fats added at the Table, and Snacks (MEDFICTS) questionnaire [57], and the Energy Conservation Strategies Survey (ECSS) [58]. The three-item GLTEQ yields a composite score based on the frequency of engagement in light leisure time activities, moderate activities, and strenuous exercises ≥ 15 min. The MEDFICTS measures adherence to the American Heart Association’s recommendations for the prevention of cardiovascular disease based on eight food categories that are weighted by weekly consumption and serving size (ICC = 0.84). The 14-item ECSS measures the frequency of engaging in fatigue self-management behaviors (α = 0.92; ICC = 0.86).

Analysis

This study’s primary objective was to identify subgroups of people with MS based on self-reported measures of mental fatigue severity, physical fatigue severity, mental fatigue impact, physical fatigue impact, depression, anxiety, and sleep using Latent Profile Analysis (LPA). LPA is an application of mixture modeling with the goal of identifying subgroups of individuals based on the joint distribution of a designated set of continuous observed variables as a function of a finite and mutually exclusive number of components [59]. LPA generates non-overlapping latent subgroups as one unobserved categorical variable, with its value representing which profile an individual belongs to with a certain degree of probability [60]. Several statistical indices and tests, including the Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), sample-size adjusted BIC, posterior probability assessment, entropy, and bootstrap likelihood ratio test (LRT) were used to determine how well the models performed and the optimal number of subgroups [61, 62]. Posterior probabilities are computed using Bayes’ theorem as the probability of belonging to a subgroup conditional to the observed data and parameters estimated in the model [59]. A full information maximum likelihood procedure (as well as the MLR option in Mplus to request robust standard errors [63]) were used to include all available data in the analysis [59]. All available data were included in the analysis and data were assumed to be missing at random.

The study’s secondary objective was to examine stability in subgroup membership over a 3-month period using latent transition analysis (LTA). LTA is an extension of LPA that analyzes longitudinal data [59]. The LTA model is a type of latent Markov model in which the predictive portion is comprised of a multinomial logistic model used to estimate latent status membership probabilities and transition probabilities. Transition probabilities are the probabilities associated with subgroup changes (i.e., the probability of staying in a particular subgroup at the next time point).

The tertiary objective was to explore the differences in the latent subgroups of people with MS using descriptive statistics and ANOVA models. We examined whether there were differences in their health and function, social and environmental determinants, psychosocial factors, and healthy behaviors. A one-way ANOVA with a Bonferroni post-hoc test was conducted for this purpose. Given our expectation that there would be several significant differences between the latent subgroups (especially if we identified several subgroups), rather than rely only on the p-value, we used effect sizes to help gauge whether differences between subgroups were meaningful. Specifically, we used eta squared in SPSS version 28, which measures the proportion of variance associated with each effect in an ANOVA model. Eta squared ranges from 0 to 1, with a value closer to 0 indicating a low proportion of variance as explained in the model. We considered a small effect < 0.01, a medium effect 0.06, and a large effect ≥ to 0.14 [64].

All statistical tests were two-tailed. LPA and LTA were conducted using Mplus (version 8.6). R program in the R studio environment was used for graphical displays, and the Mplus Automation package was used to automate LPA estimation and interpretation [65,66,67]. Descriptive and post-hoc analyses of the latent subgroups were done in SPSS version 28.

Results

A total of 394 individuals were screened for eligibility; 17 individuals were deemed ineligible and 77 were not enrolled due to delays in returning the informed consent or were no longer interested; 289 individuals who returned the baseline questionnaire packet were included in the analyses. Table 1 shows the demographic characteristics of participants included in the analysis. The research sample was mostly representative of the population with MS, given our research sample ratio of about 1:4 male to female (85.5%), the majority of participants having relapsing–remitting MS (73.3%), and the research sample’s mean age of 52 years [68, 69]. The median PDDS score was 3, which indicates a gait disability that does not require regular use of an assistive device.

Table 1 Sociodemographic and health characteristics of research sample

Latent profile and transition analysis

The best fitting model included 6 subgroups of participants. The entropy for this model was at near maximum (0.83), and the average posterior probability of membership for each latent subgroup ranged from 0.83 to 0.92, denoting that the model classified participants relatively well [59]. The BIC was minimized for the 6-subgroup model. The model also achieved a near minimum across the AIC and sample-size adjusted BIC, as well as statistical significance in the bootstrap LRT. There was no practical advantage for seven subgroups compared to six subgroups with these model selection measures (seven subgroups had a group with only two participants). Participants were assigned to the six classes based on the highest posterior probability of membership, which aforementioned was high on average.

Figure 1 illustrates the pattern and manifestation of symptoms for the 6 subgroups using standardized scores. Subgroup #6 had the highest fatigue, depression, anxiety, pain, and insomnia symptoms; thus, we named this subgroup the Severe Phenotype. In turn, we named Subgroup #1 the Mild Phenotype for having the lowest fatigue, depression, anxiety, pain, and insomnia symptoms. Subgroup #4 was named the Mild-to-Moderate Phenotype for having slightly elevated fatigue, depression, anxiety, pain, and insomnia symptoms compared to the Mild Phenotype. Subgroup #3 became the Moderate-to-Severe Phenotype given its higher symptoms compared to the Mild-to-Moderate Phenotype and lower symptoms compared to the Severe Phenotype. Subgroup #5 was named the Fatigue-dominant Phenotype for having moderate to severe fatigue, but lower symptoms of depression, anxiety, pain, and insomnia. Lastly, Subgroup #2 was named the Mental Health-dominant Phenotype for having relatively low fatigue, pain, and insomnia, but higher depression and anxiety symptoms. The LTA indicated that subgroup membership was highly stable over the 3-month period, that is, the latent transition probabilities for remaining in the same subgroup at 6-weeks compared to the baseline ranged from 0.84 to 0.93 and at 3-months compared to 6-weeks from 0.90 to 1.00.

Fig. 1
figure 1

Latent profile analysis of the seven self-reported measures of fatigue, depression, anxiety, pain, and sleep. Key: Class 1 = Mild Phenotype; Class 2 = Mental Health- dominant Phenotype; Class 3 = Moderate-to-Severe Phenotype; Class 4 = Mild-to-Moderate Phenotype; Class 5 = Fatigue- dominant Phenotype; Class 6 = Severe Phenotype; CFS-P = Chalder Fatigue Scale-physical; CFS-M = Chalder Fatigue Scale-mental; FS-P = Fatigue Scale for Motor and Cognitive Functions-physical; FS-M = Fatigue Scale for Motor and Cognitive Functions-mental; ANX = Neuro-QOL Anxiety; DEP = Neuro-QOL Depression; SLP = Pittsburgh Sleep Quality Index

Table 2 presents the participants’ characteristics in each phenotype along with the results of the ANOVA and effect size calculations, and Supplementary Table 1 shows the results of the post-hoc analysis. The ANOVA revealed that most participants exhibited significant differences across the six subgroups, with the only variable not significantly different across groups was physical activity (i.e., GLTEQ) and utilization of health and wellness services (i.e., H&W). The variables that had a large eta squared effect (> 0.14) on phenotypic membership were community engagement (i.e., CPI-AR), number of comorbidities (i.e., CQMS), environmental barriers (i.e., CHIEF-SF), self-efficacy (i.e., SMCD), and fatigue catastrophizing (i.e., FCS). Using the Mild Phenotype as a reference, the Bonferroni post-hoc analysis showed that community engagement was significantly worse in the Moderate-to-Severe and Severe Phenotypes; the number of comorbidities was significantly higher in the Moderate-to-Severe, Severe, and Fatigue-dominant Phenotypes; environmental barriers were significantly worse for the Moderate-to-Severe, Severe, and Fatigue-dominant Phenotypes; and self-efficacy and fatigue catastrophizing were significantly worse for all phenotypes compared to the Mild Phenotype.

Table 2 Characterization of phenotypes, Mean (Standard deviation)

Discussion

In this study, we proposed a classification of self-reported symptom manifestations based on the identification of distinct subgroups of fatigued individuals with MS. To the best of our knowledge, this is the first study to examine the clustering of mental and physical fatigue severity and impact with symptoms of depression, anxiety, pain, and insomnia using LPA and LTA. Notable findings include homogeneity in how the six different phenotypes experienced mental and physical fatigue severity and impact; the identification of the Fatigue- and Mental Health-dominant Phenotypes; and the large effect size differences between phenotypes in community engagement (i.e., CPI-AR, environmental barriers (i.e., CHIEF-SF), self-efficacy (i.e., SMCD), and fatigue catastrophizing (i.e., FCS). Our findings provide insight on the patterns and clustering of different types of fatigue and their relationship with other symptoms, which may have implications for the development and testing of personalized fatigue interventions.

There were negligible differences between the phenotypes concerning the mental and physical severity of fatigue and the mental and physical impact of fatigue. None of the phenotypes were characterized as having a distinct type of fatigue, such as mental or physical fatigue, and were rather characterized by having low, moderate, or high levels of all types of fatigue. These results indicate that the magnitude or extent of fatigue experienced may be a more important factor to consider than the type of fatigue experienced. Research is varied on the similarities and distinctions between the severity and impact of mental and physical fatigue. Pust et al. [70] concluded from a factor analysis that only total fatigue composite scores should be used rather than the subscales of mental and physical fatigue. Other studies have identified two-factor solutions or a bi-factor model with one general fatigue factor and two subfactors of severity and impact of fatigue or mental and physical fatigue [36, 37, 71]. Further research is thus needed on the dimensions of self-reported measures of fatigue.

Nonetheless, the identification of the Fatigue-dominant and Mental Health-dominant Phenotypes provides support for distinguishing between fatigue, anxiety, and depression. The distinctions between these phenotypes may have implications for selecting the most appropriate fatigue intervention based on an individual’s characteristics. For example, future research can explore whether those in the Mental Health-dominant Phenotype benefit more from an intervention focused on managing emotions, while those in the Fatigue-dominant Phenotype could benefit more from an intervention focused on energy management. Although it may be difficult to clinically distinguish between fatigue and depression, evidence-based approaches have been identified using adjusted depression screening scales based on differential item functioning [72].

There were no significant differences between the Fatigue-dominant and Mental Health- dominant Phenotypes on indicators of insomnia, health and function, social and environmental determinants, psychosocial factors, and engagement in healthy behaviors. This is consistent with prior research showing a moderate correlation between the impact of fatigue and the impact of depression on daily activities [73,74,75]. However, we did find that participants in the Fatigue-dominant Phenotype had significantly higher levels of pain compared to those in the Mental Health-dominant Phenotype. Together, these findings have several implications for further research. First, they provide a rationale for conducting clinical trials that combine evaluations of fatigue and depression interventions using a common distal outcome, such as participation in social roles. Second, they may indicate a need for further research to test whether those in the Fatigue-dominant Phenotype may benefit from additional intervention strategies to manage pain. Lastly, these findings offer support to further explore the supposition that fatigue arising from interoceptive networks may be modulated by pain pathways [18].

In contrast to the Fatigue- and Mental Health-dominant Phenotypes, latent Subgroups #1, #3, #4, and #6 were categorized into phenotypes based on low, moderate, and high levels of fatigue, depression, anxiety, pain, and insomnia symptoms. Individuals in the Moderate-to-Severe and Severe Phenotypes had the worse indicators of health and function, social and environmental determinants, psychosocial factors, and engagement in healthy behaviors, whereas individuals in the Mild and Mild-to-Moderate Phenotype had the most positive indicators of these concepts. Despite the clear need for individuals in the Severe Phenotype to utilize health and wellness services, they reported the lowest service utilization. Thus, individuals in the more severe phenotypes likely experience more bodily impairments due the pathophysiology of MS, which is further aggravated by environmental barriers, inaccessible health and wellness services, low self-efficacy, high fatigue catastrophizing, and difficulties engaging in self-care behaviors.

Interestingly, environmental barriers, self-efficacy to manage symptoms, and fatigue catastrophizing provided the greatest overall effect on explaining group membership. These 3 variables consistently showed the sharpest declines or decrements between the Mild-to-Moderate and Moderate-to-Severe Phenotypes. Both behavior change theories and interventional studies have indicated that it is possible to reduce the impact of environmental barriers by improving problem solving, decision making, and systems thinking skills; increase self-efficacy by fostering modeling and mastery experiences using verbal persuasion and teaching appropriate emotional attributions; and decrease fatigue catastrophizing by helping individuals reframe how they think about fatigue [76,77,78,79]. A possible strategy for future research is to tailor interventions specifically to environmental barriers, self-efficacy, and fatigue catastrophizing to reduce the likelihood of transitioning to a more severe phenotype.

There are limitations to this study that should be noted. The study solely relied on self-report measures, used convenience sampling methodology, did not obtain verification of MS and depression diagnoses, and did not confirm the identified fatigue phenotypes in other research samples of people with MS. Although we confirmed that membership in the phenotypes was stable over a 3-month period, future research conducted over a longer period of time in different research samples is necessary. The low number of participants in the Severe Phenotype also prevented the examination of categorical variables that could provide additional insights. Consistent with other MS studies, people with more severe disabilities and from historically marginalized backgrounds were not adequately represented in our research study either. Future research is therefore required to develop strategies to recruit a more diverse sample to fully understand the determinants of individuals who are categorized into the Mild, Moderate, and Severe Phenotypes. Future research is also needed on examining the influence of disease modifying therapies and other medications on phenotypes and using performance-based measures of fatiguability and disability as well as state measures of perceived fatiguability to classify participants into phenotypes [18].