Background

Multiple sclerosis (MS) is a neurodegenerative, autoimmune disease of the central nervous system affecting near 900,000 Americans [1, 2]. The MS disease course is highly variable, and most affected individuals will variably accrue neurological disability across multiple functional domains [3]. Approximately 85% of persons with MS (PwMS) present with relapsing–remitting (RR) MS, and experience cycles of symptom exacerbation and remission that can last for weeks or months for a time, with modest disability accrual upon relapse [4]. The majority of persons with RRMS will transition to secondary-progressive (SP) MS, a more debilitating form of the disease, where in lieu of relapses PwMS steadily accrue of neurological disability, as well as impairments across an increasing number of functional domains [5].

Anticipating the transition to SPMS, and therefore identifying factors conferring risk for transition, has substantive value as it would grant PwMS the opportunity to plan for possible changes in their independence and care, which would contribute to optimizing long-term quality of life. This is particularly salient considering most federally approved disease-modifying therapies modulate RR disease activity driven by inflammation and offer modest benefit to those with SPMS [6, 7]. To date, a few factors have been shown to consistently influence the transition from RR to SPMS across several studies, and they include being male and developing MS later in life [8,9,10,11,12,13]. Furthermore, prior studies have not investigated the potential impact of comorbid health conditions nor have they comprehensively explored the influence of MS genetic risk factors on the risk for transitioning to SPMS.

We conducted a retrospective cross-sectional study based on 1,295 non-Hispanic, white individuals who had RRMS at onset and whose data were available through an open-access repository from the United States. We constructed time-to-event prediction models for three censored SPMS outcomes (ever transitioning, transitioning within 10 years, and transitioning within 20 years) using sociodemographic and self-reported clinical measures that would be available at/near the onset of MS, with specific considerations for MS genetic risk factors, including a genetic risk score (GRS), HLA-DRB1*15:01, and HLA-A*02:01. These models are amongst the first to investigate the relationships between MS genetic risk factors and self-reported comorbid health and risk of SPMS.

Methods

Study population

The study population was derived from participants in the Accelerated Cure Project for MS (ACP), a repository of biological samples and epidemiological data for persons with demyelinating diseases, including MS. ACP recruited PwMS from the communities surrounding ten MS specialty clinics across the United States. All participants gave informed consent and completed an extensive survey administered by trained study staff or a neurologist. All non-genetic data used in this study were based on the participant-reported responses to this survey instrument. A diagnosis of MS was confirmed by a neurologist using the standard diagnostic criteria at the time of enrollment [14, 15]. Inclusion and exclusion criteria have been previously described [16]. For the current study, we restricted to ACP participants with an age of MS onset of ≥ 18 years, who had RRMS at onset, and were non-Hispanic white (N = 1320). There were multiple PwMS from several extended families, we randomly included one participant from each family to ensure independence amongst observations. The final sample consisted of 1295 PwMS.

Transition to SPMS

Time to SPMS was the difference between participant-reported age of onset of RRMS (defined as age of “first symptom or exacerbation”) and age of onset of SPMS. 15.3% of study subjects had transitioned to SPMS at the time of the interview (N = 197; Table 1). This information was used to construct three censored outcomes for SPMS: ever transitioning, transitioning within 10 years, and transitioning within 20 years.

Table 1 Study population characteristics by disease state (quantitative variables are described by medians and interquartile ranges; categorical and binary variables are described by counts and percentages)

Predictors

Predictors of interests included sociodemographic and participant-reported clinical variables. Sociodemographic variables included age of MS onset, sex, years of education, a history of infectious mononucleosis prior MS onset, and whether the participant smoked tobacco products within 5 years prior to MS onset. Clinical variables included binary variables that captured several comorbid conditions existing prior to the onset of MS. We included individual variables for obesity, high cholesterol, high blood pressure, and type II diabetes, and variables for specific categories for other comorbid diseases, including cancer, neurological disease, physical disease, psychological disorders, and other autoimmune diseases (see Supplementary Methods). Clinical variables relevant to MS included 13 binary variables capturing impaired functional domains at onset (as previously described [17]), and categorical variables for time to second relapse (TT2R; ≤ 1, 2–5 and ≥ 6 years) and the number of relapses experienced in the first 2 years after MS onset (NR2Y; ≤ 1, 2–3, and ≥ 4 relapses). Predictors with ≤ 10% missingness were imputed using random forest imputation (package MissForest, R v3.5). NR2Y had > 10% missingness; therefore, it was recoded as an indicator variable with a category for missing values to retain observations.

Finally, genotypic information for MS risk variants was available for a subset of the study population. Genetic data for 1,036 study participants were available as a part of an international effort to identify susceptibility variants by the International MS Genetics Consortium (Supplementary Methods) [18]. These samples were genotyped using the Illumina iSelect platform consisting of approximately 200,000 variants from the Illumina Exome Core content and 90,000 custom-selected variants prioritized from earlier MS genetic studies [18]. Genetic data quality control, including the removal of genetic outliers and cryptically related samples for this sample has been outlined previously [17]. HLA-A*02:01 and HLA-DRB1*15:01 allele counts were determined by the tagging SNPs rs2975033T and rs3135388A, respectively. Also available were 200 autosomal, non-MHC, MS risk variants [18], whose risk alleles were summed to create a genetic risk score (GRS) that was normally distributed in this study population (Supplementary Fig. 1) [17]. Genotypic data for HLA-A*02:01 and HLA-DRB1*15:01 were available for an additional 130 participants who were genotyped on the Illumina MEGAEx BeadChip and met standard quality control criteria (Supplementary Methods). Thus, genetic data for HLA variants were available in 90% (N = 1166) of the study population, while the GRS was available in 75% of the study population (N = 972).

Statistical analysis

Cox proportional hazards modeling was used to identify early predictors (measurable at or near MS onset) of risk for ever transitioning to SPMS irrespective of time, risk for transitioning within 10 years, and risk for transitioning within 20 years. Since the proportional hazards model assumes a continuous hazard function, we used the Breslow method to handle tied failures (PHREG procedure, SAS v15.1). We considered three censored outcomes due to the fact MS presentation varies with age of onset, and that increasing age of MS onset and disease duration confers increased SPMS risk [17, 19, 20]—thus, we hypothesized that the risk for transitioning to SPMS within specific timeframes would be influenced by similar and different combinations of predictors. Amongst the non-genetic variables there were uneven underlying correlations (Supplementary Fig. 2), therefore we employed a forward stepwise variable selection algorithm based on Akaike Information Criterion (AIC; unlike approaches based on a p value threshold for individual predictors). For each outcome, we selected the model consisting of a subset of the non-genetic predictors that minimized AIC. The primary genetic predictors of interest (HLA-DRB1*15:01, HLA-A*02:01, and the non-MHC GRS) were iteratively included in the final models, adjusting for variation in genetic ancestry. We also conducted exploratory analyses for 199 of the 200 non-MHC risk variants that had an allelic variation, similarly iteratively considering them in each of the three final non-genetic models.

The proportional hazard assumption for predictors within the final models was assessed by calculating the proportion of 1000 simulations that contain a maximum cumulative Martingale residual larger than the observed maximum cumulative residual (Supremum tests, SAS v15.1). The goodness-of-fit and variation explained were assessed for each of the final models using the Gronnesby and Borgan test (stcoxgof function, STATA v13.1) and a Royston’s modification Nagelkerke’s R2 statistic (str2ph function, STATA v13.1), respectively [21]. Adjusted Kaplan–Meier curves were generated (sts graph function, STATAv13.1) and nomograms were constructed to visualize how the weights of the predictors in the final models could be used to predict transitioning to SPMS over the disease course, and within 10 and 20 years after MS onset (nomocox function, STATA v13.1) [22].

Ethics approval

This study was approved by the Case Western Reserve University Institutional Review Board (Protocol No. IRB-2016-1583).

Results

There were 1295 non-Hispanic white PwMS who contributed to these analyses (Table 1). At the time of data collection, 15.2% of subjects reported having SPMS. As expected, there was a female preponderance (78.1%) and the median age of MS onset was 32 years. Approximately 30% and 34% of subjects had a history of infectious mononucleosis or had smoked within 5 years of MS onset, respectively. The median years of education were 16.

The time-to-event models of non-genetic predictors that minimized AIC for each of the three outcomes are presented in Table 2. Increases in an age of MS onset increased the hazards for transitioning to SP irrespective of when observations were censored (HREver 1.07, 95% confidence interval [CI] 1.06–1.09, p < 0.001; HRyear10 1.06, 95% CI 1.03–1.09, p < 0.001; HRyear20 1.07, 95% CI 1.05–1.09, p < 0.001). Males also had increased hazards of transitioning to SP compared to females; the hazard was near threefold greater for converting by year 10 compared to twofold at year 20 or at any time point on average (HREver 1.97, 95% CI 1.46–2.66, p < 0.001; HRyear10 2.96, 95% CI 1.77–4.97, p < 0.001; HRyear20 1.87, 95% CI 1.27–2.57, p < 0.001; Supplementary Fig. 3). There was also a consistently decreased hazard for transitioning to SP irrespective of when observations were censored for those whose TT2R was ≥ 6 years compared to 0–1 year (HREver 0.65, 95% CI 0.45–0.94, p = 0.021; HRyear10 0.32, 95% CI 0.14–0.74, p = 0.008; HRyear20 0.56, 95% CI 0.36–0.86, p = 0.010).

Table 2 Multivariable Cox proportional hazard effect estimates for retained non-genetic predictors per outcome

Specific comorbid conditions and impaired functional domains at onset variably contributed to predicting SP risk across the models. Risk for ever transitioning to SPMS was lower amongst those reporting a neurological disorder (HREver 0.58, 95% CI 0.38–0.88, p = 0.009) or spasticity symptoms (HREver 0.57, 95% CI 0.35–0.93, p = 0.024) at MS onset. The risk for transitioning to SPMS within 10 years was threefold higher for the 2.8% of PwMS reporting a history of cancer by MS onset (HRyear10 3.11, 95% CI 1.20–8.10, p = 0.02); while the hazard was decreased for those reporting brainstem/bulbar symptoms. The hazards for transitioning to SPMS within 20 years were also decreased for PwMS who reported histories of obesity and other neurological disorders. Across the time-to-event models, having a comorbid cardiovascular, physical, psychological, or other autoimmune diseases at MS onset did not contribute to predicting the transition to SPMS.

The genetic variables were iteratively considered in each of the prediction models. Interestingly, HLA-DRB1*15:01 and the non-MHC GRS were not associated with SPMS risk; however, HLA-A*02:01 significantly reduced the hazard of SPMS which is similar to its effect on MS susceptibility (HREver 0.73, 95% CI 0.56–0.97, p = 0.030; HRyear10 0.60, 95% CI 0.35–1.04, p = 0.067; HRyear20 0.56, 95% CI 0.39–0.80, p = 0.001). Of the 199 autosomal, non-MHC risk variants explored, only one risk variant (rs6072343; 1.3 Kb upstream of LPIN3) was associated with increased hazards of SPMS across the three models (HR range 1.5–2.0; Supplementary Table 1); however, results were not significant after adjusting for multiple testing.

The three final models of non-genetic predictors with and without HLA-A*02:01 and genetic ancestry had good fit (data not shown), and all predictors in their respective models met proportional hazard assumptions (Supplementary Table 2). The variation explained by these final models was high, with the models for ever transition explaining 28–32%, transitioning within 10 years explaining 50–56%, and transitioning within 20 years explaining 34–40% (Supplementary Table 2). We, therefore, used these models to generate nomograms that could be used to predict the probability of transitioning to SPMS at various points in time (Fig. 1; approximate nomogram points assigned to each predictor per model is presented in Supplementary Table 3). Adjusted Kaplan–Meier survival curves for HLA-A*02:01 allele counts based on the ever transitioning to SPMS model are presented in Fig. 2.

Fig. 1
figure 1figure 1

Nomograms illustrating the probability of surviving the transition to SPMS based on baseline predictors for the three outcome models presented in Table 2 that also included HLA-A*02:01. The points assigned to each predictor is determined by their value in relation to the scoreline. Points are then added across predictors to generate the total score which is compared to the probability of surviving the transitioning to SPMS (thus, the probability of not transitioning). For example, in Panel B, the total points for a PwMS with an onset age of 45 years (7 points), who is male (2.5 points), who experienced his second relapse within 1 year (~ 2.6 points), who did not have a history of cancer (0 points), who presented with brainstem/bulbar symptoms at onset (0 points), and who had no copies of HLA-A*02:01 (~ 2.8 points) would be 14.9 points; therefore, the probability of not transitioning to SPMS in year 1 is 99%, year 2 is ~ 98%, in year 5 is ~ 96%, and in year 10 is ~ 83%

Fig. 2
figure 2

Kaplan–Meier curve for HLA-A*02:01 genotypes in the multivariable adjusted model for ever transitioning to SPMS

Discussion

We investigated three censored SPMS outcomes (ever transitioning, transitioning within 10 years, and transitioning within 20 years) and constructed time-to-event prediction models using non-genetic variables measurable at/near the onset of MS, with specific considerations for MS genetic risk factors, including a non-MHC GRS, HLA-DRB1*15:01, and HLA-A*02:01. We also conducted exploratory analyses investigating 199 of 200 non-MHC risk variants. Collectively, these models are amongst the first to comprehensively investigate the relationships between MS genetic risk factors and the risk of SPMS. We observed a consistently increased hazard for SPMS for an older age of MS onset and male sex, and decreased hazard for a longer interval between first and second relapse (TT2R), while comorbid conditions and onset symptomatology variably contributed to the prediction models. As for genetic factors, HLA-A*02:01 which confers a decreased risk for MS also contributed to a decreased hazard for SPMS.

Sociodemographic factors

Age of RRMS onset was a significant predictor of the transition to SPMS, with a younger age of onset associated with a longer latency to SPMS—consistent with multiple survival analyses of at least 150 SPMS cases conducted in the last 2 decades [8, 9, 12, 19, 23]. Age of onset is also a strong predictor of other aspects of the MS phenotype, including increased relapse activity, TT2R, and reaching disability milestones at a younger age [17, 23,24,25,26,27]. Only a few studies have sought to characterize factors that contribute to variation in the age of onset of MS, which includes older onset for males and those who report a history of obesity [17, 28]. There is also strong evidence that the genetic risk component for MS, specifically HLA-DRB1*15:01 and the non-MHC GRS, is associated with an earlier onset age [17]. This latter observation is interesting, considering that these genetic factors were not associated with SPMS risk in the current analyses.

We observed that males with MS had a shorter latency to SPMS, which is consistent with several prior studies [8, 9, 12, 29, 30]; however, there have also been null findings [19]. Prior studies have reported a 25–50% increased hazard for SPMS amongst males compared to females from adjusted survival regression models [12, 30, 31]. Our HR for risk of SPMS across the time period was twofold greater for males, which is greater than the prior effect estimates generated for Canadian, European and international (< 4% were from the US) study populations; interestingly, there is some evidence that disability may vary across continental populations which may explain a portion of this difference [32]. We also observed that the hazards for transitioning to SPMS within 10 years were threefold greater for males—this finding emphasizes the importance of modeling distinct censored outcomes, which prior study has not done. Thus, even though the proportional hazard assumptions were not violated for any model, the risk for SPMS in males is elevated for the earlier transition.

Other demographic variables were not retained in the final models, including being a smoker within 5 years of onset, a history of infectious mononucleosis, and years of education. Smoking, measured as ‘ever versus never’ or ‘current, former, or never’, has been shown to contribute to disability accrual in PwMS [33,34,35]; however, the relationship with risk for SPMS is not definitive. A meta-analysis of four studies suggest that there may be an association for ‘ever versus never’ and SPMS risk (risk ratio = 1.88, 95% CI: 0.98, 3.61), but there was significant heterogeneity between studies [36]. Being a smoker confers increased risk for MS, but risk diminishes after 5–10 years after quitting [37], and we, therefore, considered an analogous variable, which has been shown to also influence MS presentation near onset [17]. No other study of MS progression has considered this operational definition of smoking.

Variation in MS presentation near onset

Several studies (including at least 150 SPMS cases conducted within the last 2 decades) have investigated the predictive potential of various measures of the presentation of MS near onset and subsequent transition to SPMS, including early relapse activity, symptom burden, impaired functional domains, disability measures, radiological observations, and TT2R [8, 9, 12, 19, 30, 31, 38, 39]. Amongst the studies that focused on evaluating multivariable models for ever transitioning to SPMS, many of the promising results await independent replication (i.e., rapid worsening measured by annualized disability trajectory [19]). The only predictor with consistent associations in two prior studies, and now our own, was TT2R, where a longer interval between the first two relapses was predictive of a longer SPMS latency [30, 38]. We also showed that TT2R was even more predictive (smaller HR) of transitioning to SP within 10 years. Considering TT2R strong effect on SPMS, it has the potential to be an informative prognostication measure, since it can be readily obtained early in the disease course.

MS symptomatology at/near onset has also been investigated across several studies; however, no specific impairment pattern has demonstrated consistent predictive associations—however, this may be due to the fact that different symptom definitions have been investigated [8, 9, 12, 19, 30]. We found that spasticity and brainstem/bulbar symptoms at MS onset were associated with longer latency to SPMS over the disease course and within 10 years, respectively. Spasticity has not been investigated in other studies, while brainstem alone or brainstem/cerebellar impairments have not been associated with ever transitioning to SPMS [8, 12, 30]. This latter observation is consistent with our findings, as we only observed an association for transitioning within 10 years. However, further investigations are warranted to confirm these associations.

We do note that the number of relapses in the first 2 years was not retained in any of our models after variable selection. A post hoc χ2 assessment of the non-missing distribution of our categorical variable between RRMS who did and did not transition to SPMS (Table 1) was not significant (data not shown). Prior studies have reported mixed findings, with a trend toward increased SPMS risk for those with increased relapses [12, 38].

Comorbid health conditions

Comorbidity is common in PwMS; they also contribute to adverse outcomes, including the accrual of physical and psychological impairments [34, 40]. To the best of our knowledge, no prior study of SPMS has considered the potential impact of comorbidities, much less comorbidities existing at the onset of MS. We observed unexpected associations. First, cardiovascular conditions, such as hypertension, were not informative in our models. There is an increasing evidence that these traits, specifically hypertension, contribute to adverse MS outcomes [34, 41, 42]. Possible reasons for the lack of associations in the current study are that we assessed the predictive nature of comorbid conditions occurring at MS onset, and not across the life-span; thus, it is possible future studies of these traits occurring at any point in time may be associated with SPMS risk. Similarly, not informative were a history of autoimmune diseases, psychological disorders, and other physical disorders.

We did observe that those reporting having other neurological diseases had a ~ 50% decreased hazards for ever transitioning to SPMS and transitioning within 20 years. For our analyses, the comorbid neurological disease was a binary variable of having at least one of a list of neurological diseases (Supplementary Methods). Interestingly, 91.3% (284 of 311) of the PwMS with a comorbid neurological disease was due to reporting a history of migraines. Post hoc analyses demonstrated that the decreased hazards were driven by a history of migraines and not other neurological conditions (data not shown). Migraines were originally considered an uncommon symptom of MS [43]; however, the relationship with MS is currently unclear. There is evidence which suggests that migraines may be associated with increased MS risk, a greater number of impaired domains, and that it might in fact be a common comorbidity [44,45,46,47]. However, there is also a growing number of studies, showing that migraineurs may be misdiagnosed with MS, and thus we can speculate that some of our study population might have also been misdiagnosed, which would contribute to the decreased SPMS hazards [48,49,50]. We also observed that a history of obesity by MS onset was associated with a longer latency for SPMS. This finding may be due, in part, to sampling variability, since only 8.4% (N = 109) of our study population had a history of obesity and only 6.4% (N = 7) of these had transitioned to SPMS. Our last unexpected finding was that having had cancer by MS onset was the strongest predictor of transitioning to SPMS within 10 years (HR = 3.11). Only 2.8% (N = 36) PwMS had had cancer and 13.9% (N = 5) of these individuals had transitioned to SPMS, and thus, it is possible that sampling variation may be a factor; however, it is worth noting that both cancer and MS have substantial immune system components [51], and that PwMS may have an increased cancer risk [52].

MS genetic risk component

There have been several genetic studies of various aspects of MS phenotype, yet no robust associations have been identified, which may be largely due to variation in how the outcomes were measured. Currently, we do know that MS risk variants, specifically HLA-DRB1*15:01 and the non-MHC GRS, are associated with earlier age of onset of MS, but not with the other measures of MS presentation (i.e., TT2R), relapse activity, or disease severity (i.e. MS Severity Scale) [53,54,55,56]. A few studies have investigated the relationships between HLA-A*02:01 and MS outcomes, and have reported mixed findings of either null or protective effects for HLA-A*02 on disease severity (i.e., Expanded Disease Status Scale) and radiographic outcomes [56,57,58]. No prior study has investigated the contribution of genetic factors on the risk for transitioning to SPMS. In our study, we observed HLA-A*02:01 conferred a ~ 30–45% decreased risk for transitioning to SPMS, analogous to its relationship with MS risk [18]. While this novel finding warrants replication, it adds to the limited literature, suggesting that this protective genetic factor contributes to both MS risk and progression.

Study strengths and limitations

The current study has several strengths, including the fact that this study was the first to incorporate genetic and comorbidity information in evaluating risk for transitioning to SPMS. It is also one of the first studies limited to a modestly sized study population of PwMS from U.S. We also had detailed information on impaired functional domains at onset. In addition, we applied a robust variable selection approach to identify key predictors of SPMS transition, for three censored time points. This is unlike prior studies which assumed that a single model would be effective in predicting outcomes at various time points, even though there were no violations to the proportional hazards assumptions. Furthermore, we present nomograms that can be readily used to inform prognostication efforts based on information measured at/near onset.

There are several limitations to acknowledge. The first is that this study utilized a cross-sectional study design and may be vulnerable to length-sampling bias which refers to the possibility that those with a longer disease duration were more likely to have been sampled. This is a concern considering disability accrues with time, and thus, it is possible that participants with a longer disease duration may have a less severe manifestation than their counterpart non-participants. Another limitation is that all the non-genetic variables were based on self-reported responses and, therefore, subject to recall and measurement error biases. We would assume that these biases would result in non-differential misclassification of the exposures and outcome, and bias the associations toward the null for binary measures. Considering the strong magnitude of associations observed, it is unlikely that residual confounding could explain the observed associations. Furthermore, a validation study of self-reported comorbidity in PwMS has reported moderate or better levels of agreement with medical records [59]. The last limitation relates to the generalizability of the study population beyond those of other ethnicities and racial populations.

Conclusions

We constructed time-to-event prediction models using non-genetic variables measurable at/near the onset of MS, with specific considerations for MS genetic risk factors. Our results show established effects of age of disease onset and biological sex, as well as indicators of early relapse activity, which were predictive of SPMS across time points. We notably demonstrated for the first time that key protective genetic variants for MS risk may also be protective against the transition to SPMS. Collectively, these findings and the included nomograms have the potential to advance prognostication for PwMS using information that would be available at/near onset, with the hope of improving care and quality of life for those living with MS.