Introduction

The trajectory of affective disorders is characterized by specific lifetime epochs, including early lifetime traumata (ELTs), recurrent depressive and manic episodes, suicidal behaviors alternating with euthymic phases, and a residual stage with neurocognitive deficits, functional impairments, and brain alterations [1,2,3,4]. Both major depression (MDD) and bipolar disorder (BD) are progressive disorders whereby the number of episodes is associated with increased symptomatic severity, functional decline in daily task performance, higher risk of future episodes, and increased risk for impairments in neurocognitive functions, including memory and executive functions [3, 5,6,7]. Nevertheless, the current nosological classifications of affective disorders are mainly based on symptoms and polarity course characteristics of the disorder and do not consider the preceding risk factors, including ELTs, severity of the recurring pattern of the disease, suicidal behaviors, and deteriorated outcome characteristics, including disabilities, lowered health-related quality of life (HR-QoL), and neurocognitive deficits, which are all associated with the course of the illness [7].

As such, affective disorders are not stage-defined, although diagnostic criteria based on staging may provide more appropriate and targeted treatments to prevent further episodes and progression into the later stages, which are characterized by functional and cognitive decline [8]. Recently, some authors proposed staging models by dividing bipolar disorder or major depression into relevant stages based on risk factors, such as family history of mental illness, prodromal patterns, relapses of episodes, refractory states, and interepisodic functioning including cognitive impairments, comorbidities, and functional impairments [9,10,11]. Nevertheless, these staging models were not based on predictive mathematical algorithms derived from biomarkers and clinical data including ELT, recurrent episodes and suicidal behaviors, neurocognitive deficits, and functional impairments.

There is now evidence that depressive and (hypo)manic episodes have detrimental effects on neuronal functions in the brain as exemplified by changes in neurocognitive functions, overall psychological disabilities, and brain imaging techniques [3]. There is also evidence that major depression and bipolar disorder are neuro-immune and neuro-oxidative disorders with increased levels of pro-inflammatory cytokines and acute phase proteins, such as haptoglobin and C-reactive protein (CRP) [12, 13]. These disorders are also characterized by lowered lipid-associated antioxidant defenses, in particular lower high-density lipoprotein (HDL) cholesterol and paraoxonase 1 (PON1), by increased lipid peroxidation, as indicated by increased lipid hydroperoxides (LOOH) and aldehyde formation (e.g., malondialdehyde, MDA), by increased production of nitric oxide and hypernitrosylation, and by increased protein oxidation as indicated by increased levels of advanced protein oxidation products (AOPPs) [12,13,14,15,16]. The working hypotheses are that (a) activated neuro-immune and neuro-oxidative pathways cause detrimental effects on brain circuitry through the effects of pro-inflammatory cytokines, reactive oxygen and nitrogen species (ROS/RNS), and hypernitrosylation; and (b) lowered lipid-associated antioxidant defenses, including lowered levels of HDL-cholesterol, coenzyme Q10, omega-3 polyunsaturated fatty acids and vitamin E, lowered activities of lecithin cholesterol acyltransferase (LCAT) and PON1, reduce neuroprotection and increased vulnerability to develop nitro-oxidative stress [17,18,19,20,21,22,23,24,25,26,27,28,29]. These adverse effects comprise cytotoxic, excitotoxic, and neurotoxic effects that cause dysfunctions in neurogenesis, apoptotic mechanisms, receptor expression, neuroplasticity, and neuronal functions [3, 15, 16]. Interestingly, there is some evidence that increased numbers of relapses are accompanied by activated neuro-immune and neuro-oxidative pathways, as indicated by increased levels of neopterin, pro-inflammatory cytokines, and malondialdehyde [30, 31]. Most importantly, we showed that the number of depressive and (hypo)manic episodes could be combined with lowered PON1 activity and HDL-cholesterol levels into a reliable underlying construct (latent vector), which further mediated the effects of ELTs on oxidative stress biomarkers and affective phenomenology, including lowered HR-QoL, increased disabilities, neurocognitive deficits, current suicidal ideation, socioeconomic status, use of medication, and severity of illness [7]. Nevertheless, this study examined the staging effects of recurrent patterns of depressive and (hypo)manic episodes and PON1/HDL-cholesterol, but not a comprehensive staging index, including ELTs, recurrent patterns of episodes and suicidal behaviors, increased disabilities, lowered HR-QoL, neurocognitive deficits, and oxidative and inflammatory biomarkers. Finally, there is also some evidence that staging characteristics of both affective disorders may be accompanied by metabolic disorders including increased body mass index (BMI), the metabolic syndrome (MetS), changes in thyroid-stimulating hormone (TSH), and increased insulin resistance [32, 33].

Hence, the aim of the present study was to construct a predictive staging index/model based on ELTs, recurrent patterns of episodes and suicidal behaviors, increased disabilities, lowered HR-QoL, neurocognitive deficits, and immune, oxidative, and metabolic biomarkers.

Subjects and Methods

Participants

In this study, 172 participants were recruited, including 105 patients with affective disorder, 68 patients diagnosed with BD and 37 patients with MDD, and 67 healthy controls. All patients with affective disorders were in partial remission or remission, while the index episode was not of (hypo)manic polarity. All participants were of Brazilian nationality, were of both genders, and were aged 20 to 63 years. We included all self-declared ethnicities. All patients were outpatients admitted to the University Hospital of the State University of Londrina (UEL), Parana, Brazil. Controls were recruited by word of mouth from the same catchment area. In addition, 81 of the participants showed anxiety disorders according to DSM-5 criteria, namely social phobia, simple phobia, generalized anxiety disorder (GAD), or panic disorder. Diagnoses of MDD, BD (types 1 and 2), the four anxiety disorders and tobacco use disorder (TUD) were made using the diagnostic criteria of the Diagnostic and Statistical Manual of Mental Disorders, 4th Edition, Text Revision (DSM-IV-TR).

Exclusion criteria were (a) any other axis-1 DSM-IV-TR disorder, including schizophrenia, schizo-affective disorder, autism, psycho-organic syndromes, and substance abuse disorders; (b) participants with neurodegenerative/neuroinflammatory disorders such as Parkinson’s disease, Alzheimer’s disease, and multiple sclerosis; (c) (auto)immune disorders including inflammatory bowel disease, psoriasis, cancer, chronic kidney disease, rheumatoid arthritis, type 1 diabetes, chronic obstructive pulmonary disease, systemic lupus erythematosus, HIV, and hepatitis B/C virus infection; (d) treatments with immune-regulatory drugs, nonsteroidal anti-inflammatory drugs, glucocorticoids, interferon, and antioxidant and omega-3 polyunsaturated fatty acid supplements; and (e) pregnant women. All participants provided written informed consent to take part in the study, whose experimental procedures were previously approved by the Research Ethics Committee at UEL (protocol number: CAAE 34935814.2.0000.5231).

Methods

A research psychiatrist made the clinical assessment of axis I diagnoses employing a validated Portuguese version of the structured clinical interview for DSM-IV interview (SCID) axis I [34, 35] and the DSM-IV-TR diagnostic criteria. The research psychiatrist used a structured interview to assess sociodemographic (marital status, education, income, employment status, and ethnicity) and clinical (number of previous depressive, hypomanic and manic episodes) data. Marital status was registered as married + live together versus widowed + single, education was registered as number of education years, employment status was registered as working versus not working + pension + disability payment, and income was computed on a US dollar/month scale and classified as follows: 0 = up to US$ 300, 1 = $300 to $600; 2 = $600 to $1000; 3 = $1000 to $1500; and 4 = over $1500 a month. Treatment with psychotropic medications was registered showing that some participants were treated with antidepressants (n = 44), lithium (n = 26), atypical antipsychotics (n = 32), and anticonvulsant mood stabilizers (n = 33), Previously, we showed that using the same study sample there were no significant effects of these psychotropic drugs on the biomarkers used here [12].

We used the Hamilton Depression (HAM-D) and Anxiety (HAM-A) Rating Scales to assess severity of depression and anxiety, respectively [36, 37] and we used the Young Mania Rating Scale (YMRS) to assess severity of hypomanic symptoms [38]. Disabilities were assessed using the Sheehan scale [39], which is a self-administered rating scale with scores of 0–10 in domains including occupational, social life, leisure, family life, activities, and household activities, whereby a score of 0–3 indicates mild disabilities, 4–6 indicates moderate disabilities, and 7–10 indicates severe disabilities. HR-QoL was assessed using the WHO Quality of Life Instrument-Abbreviated Version (WHOQOL-BREF) in a validated Brazilian Portuguese translation [40,41,42]. This instrument comprises 26 items and measures 4 HR-QoL subdomains, namely physical health, psychological health, social relationships, and environment. In the current study, we employed the sum of the raw scores on these 26 items as an overall index of HR-QoL. We used the Clinical Global Impressions (CGI) scale to measure overall severity of illness, which scores the subject status on a scale from 1 to 7 [43]. The Columbia–Suicide Severity Rating Scale (C-SSRS) was employed in a validated Portuguese translation to measure current suicidal ideation, number of prior suicide attempts, and life-time suicidal ideation [44]. A subset of the participants (n = 84) also completed the verbal fluency test (VFT), animal category, and the trail-making test, part A, which probes cognitive processing speed (TMT_A), and part B, which probes executive functioning (TMT_B) [45, 46].

The Childhood Trauma Questionnaire (CTQ) is a self-rated scale for adults and adolescents that measures severity of ELT with scores on a five-point Likert scale reflecting the frequency of reported events in childhood. The CTQ comprises 28 items divided into 5 domains, including sexual abuse, physical abuse, emotional abuse, emotional neglect, and physical neglect [47]. In our study, we used the validated Brazilian Portuguese version [48]. BMI was computed according to weight (in kg) divided by square of height (in m2). The diagnosis of MetS was made employing the criteria of the International Diabetes Federation; i.e., three criteria of the following should be present: (a) abdominal obesity (waist circumference ≥ 90 cm for men and ≥ 80 cm for women in South Asian and South Americans and ≥ 94.0 cm for men and ≥ 80.0 cm for women in Caucasians); (b) low HDL-cholesterol (< 40 mg/dL in men and < 50 mg/dL in women) or use of hypolipidemic drugs; (c) hypertriglyceridemia (triglycerides > 150 mg/dL) or use of a hypolipidemic agent; (d) increased fasting glucose (> 100 mg/dL) or use of oral antidiabetic medications; and (e) increased average blood pressure (130/85 mmHg) or currently taking antihypertensive medication.

Assays

Blood was sampled after an overnight fast at 8 a.m. on the same day that we completed the structured interview, including severity rating scales and clinical diagnoses. We measured the activities or levels of superoxide dismutase (SOD), uric acid, LOOH, NO metabolites (NOx), MDA, AOPP, HDL-cholesterol, PON1, thyroid secreting hormone, high sensitive C-reactive protein (hsCRP), insulin, and glucose. The HOMA2-insulin resistance index (the HOMA2IR index) was calculated according to Levy et al. [49]. The assays of the first eight biomarkers were described previously [12]. Briefly, SOD1 activity in erythrocytes was determined using the pyrogallol method described by Marklund and Marklund [50]. This technique is based on the inhibition of pyrogallol self-oxidation by SOD in aqueous solution. The assay was conducted in a spectrophotometer Helios (Thermo Spectronic, Waltham, MA, USA) at 420 nm and 37 °C. Variation in optical density (OD) was recorded every minute for 5 min. The level of SOD that inhibited 50% of the pyrogallol oxidation was defined as one unit of enzymatic activity. The results were expressed in U/mg of hemoglobin (Hb). LOOH was assayed by chemiluminescence (CL-LOOH) [51, 52]. This method uses the compound tert-butyl hydroperoxide to initiate a lipid chain reaction that can be detected by photon emission during the formation of lipid hydroperoxides. Readings were performed in a Glomax luminometer (TD 20/20 Turner Designers, USA) over 1 h at 1 reading per second. The results are expressed as relative units of light (RLU). NOx levels were assessed indirectly by determining the plasma nitrite concentration using an adaptation of the technique described by Navarro-Gonzalvez et al. [53]. This method is based on the reduction of the nitrate present in the sample to nitrite by oxidation-reduction reactions mediated by the system cadmium-copper reagent. Then, Griess reagent was added to induce diazotization, forming a colored complex and subsequent detection at 540 nm. The quantification of NOx was made in a microplate reader Asys Expert Plus, Biochrom (Holliston, MA, USA). AOPP was quantified using the method described by Hanasand et al. [54] in a microplate reader, Perkin Elmer, model EnSpire (Waltham, MA, USA) at a wavelength of 340 nm. AOPP concentration was expressed in: M of equivalent chloramine T. Based on these four nitro-oxidative measurements, we have computed a z unit-weighted composite score reflecting the path from superoxide production to AOPP production, namely zSOD + LOOH + NOx + AOPP [12]. This index was computed as z transformation of SOD (zSOD) + z LOOH + zNOx + zAOPP. We also used two other z unit-weighted composite scores, namely zSOD + LOOH and zLOOH + AOPP as indices of superoxide and hydryxyperoxide formation, and lipid + protein oxidation, respectively [12].

MDA levels were measured through complexation with two molecules of thiobarbituric acid (TBA) using MDA estimation through high-performance liquid chromatography (HPLC Alliance e2695, Waters’, Barueri, SP, Brazil) [55]. Experimental conditions included the use of a column Eclipse XDB-C18 (Agilent, USA); the mobile phase consisted of 65% phosphate buffer (50 nM pH 7.0) and 35% HPLC grade methanol; the flow rate was 1.0 mL/min; the temperature was 30 °C; the wavelength was 532 nm. MDA concentration in the samples was quantified based on a calibration curve and are expressed in mmol of MDA/mg proteins. HDL cholesterol was measured with an enzymatic colorimetric method in an automated clinical chemistry system (Dimension RXL, Siemens, USA). PON1 status was measured using different kinetic assays conducted in a microplate reader (EnSpire, Perkin Elmer, USA) [26]. The substrate phenylacetate (PA, Sigma, USA) under a high-salt condition and 4-(chloromethyl)phenyl acetate (CMPA, Sigma, USA) were used. PON1 total activity was determined by the rate of hydrolysis of phenyl acetate under a low salt condition at 270 nm and 25 °C. The activity was expressed in U/mL based on the phenyl acetate molar extinction coefficient of 1.31 mmol/L cm−1. In the present study, we adjusted PON1 and CMPAase activities for the PON1 genotypes and used the residualized values in subsequent analyses.

Uric acid (UA) was measured with an UV enzymatic method (direct equilibrium uricase method) in an automated clinical chemistry system (Dimension RXL, Siemens, USA). hsCRP was assayed using a nephelometric assay (Behring Nephelometer II, Dade Behring, Marburg, Germany). The interassay coefficients of variability were less than 10%. The ARCHITECT TSH assay is a Chemiluminescent Microparticle Immunoassay (CMIA). The analytical measuring range for the Abbott Architect TSH assay was 0.01 to 80.00 μIU/L and the intra-assay coefficients of variation (CV) values were < 2.6%. HDL-cholesterol was assayed by automated methods using the DimensionRxL (Deerfield, IL, USA). Plasma insulin levels were determined with a microparticle enzyme immunoassay (MEIA) (AXSYM, Abbotts Laboratory, Germany). Glucose was assayed using an automated method in a clinical chemistry system, Dimension® RXL (Siemens Healthcare Diagnostics Inc., USA). The interassay CV values for all analytes were < 10%.

Statistics

We used analysis of contingency tables (χ2-test) to check associations between nominal variables. Analysis of variance (ANOVA) was employed to assess differences in scale variables between groups. Pearson’s product moment or Spearman’s rank order coefficients were used to examine associations between scale variables, and the point-biserial correlation coefficients were used to examine associations between binary (e.g., sex) and scale variables. All tests were two-tailed and an alpha level of 0.05 was considered to be statistically significant. All analyses were performed using the IMB-SPSS software version 24 for Windows.

Partial least squares analysis (SmartPLS) was used as a path modeling technique with PLS-structural equation modeling algorithms [56]. We examined the effects of new staging constructs or latent vectors (LVs) (used as input variables) on affective phenomenology (output variables). Some variables were introduced in the path model as single indicators (e.g., income, age or sex), while others were used as LVs extracted from a set of indicator variables in a reflective model (e.g., total scores on the WHO-QoL and Sheehan reflecting QoL and disabilities). PLS path analysis was only performed when the model and constructs complied with specific quality criteria, namely (a) model fit standardized root mean residual (SRMR) < 0.08; (b) composite constructs had good discriminant validity and reliability as indicated by average variance extracted (AVE) > 0.45 and composite reliability > 0.700; and (c) construct cross-validated redundancies and communalities were checked [56]. Consequently, we performed consistent PLS bootstrapping with 2000 bootstraps and computed path coefficients with t values and exact p values for the inner model and t values and loadings on the LVs for the outer model. In addition, we interpreted the total effects, total indirect and specific indirect effects. Finally, a newly constructed LV was only accepted as valid when it showed a composite reliability > 0.800, Cronbach’s alpha > 0.75, rho_A > 0.8 and AVE > 0.450. Indicators were only included in LVs when the factor loadings were > 0.45 with p < 0.01.

Results

Construction of a Recurrence Index Based on the Recurrence of Episodes and Suicidal Behaviors

Figure 1 shows a first attempt to construct a new relapse index based on the recurrence of episodes and suicidal behaviors. Table 1 shows that the reliability characteristics of the first LV extracted from the number of all episodes, number of depression episodes, number of suicide attempts, and lifetime suicidal ideation (the “recurrence-LV”) performed well as a composite score with an adequate composite reliability, Cronbach’s alpha, and rho_A and AVE. The number of all episodes (0.849), number of depression episodes (0.917), number of suicide attempts (0.693), and lifetime suicidal ideation (0.721) loaded highly on this recurrence LV. The number of (hypo)maniac episodes could be considered as a valid fifth indicator of this recurrence LV, but because its use caused considerable collinearity in the subsequent analyses, we did not incorporate this indicator variable in the “recurrence LV.”

Fig. 1
figure 1

Results of consistent partial least squares (PLS) bootstrapping (2000 bootstraps) showing the “recurrence staging index” and predicting outcome measurements including a “QoL-Disability” latent vector (LV) extracted from health-related quality of life (whoqolin) and the Sheehan total score (shetotal) and a “cognition” LV extracted from the trail making test (TMT) and verbal fluency test (VFT) scores. Education and age are additional explanatory variables. The recurrence staging index is predicted by an “early lifetime trauma” (ELT) LV extracted from 5 ELT scores, namely emotional abuse (emo.abu), emotional neglect (emo.neg), physical neglect (phy.negl), physical abuse (phys.abu), and sexual abuse (sexabuse), which, in turn, is predicted by sex. The model shows path coefficients and exact p values for the inner model and t values for the outer model

Table 1 Construct reliability and validity of the five staging indices constructed in the current study

Figure 1 shows the first PLS path model using a “Qol-disability LV” (extracted from the total Sheehan and WHO-QoL scores as indicators) and a “cognitive LV” (extracted from TMT and VFT as indicator variables) as output variables. Input variables were the “recurrence LV,” an “ELT LV” (extracted from the five ELTs as indicators), and sex (male = 1, female = 0). Sex and the ELT LV were introduced in the model as explanatory variables for the recurrence LV, while sex was also an explanatory variable for the ELT LV. Thus, this model also examined whether the effects of ELT on the outcome variables were mediated by the recurrence LV. Figure 1 shows the outcome of consistent PLS bootstrapping (2000 bootstraps) with path coefficients and exact p values for the inner model and t values for the outer model. Since the cognitive tests were performed in a subset, we carried out the analysis using the mean replacement method, although casewise deletion showed essentially the same results. The model quality data were very adequate with a model fit SRMR = 0.052, composite reliabilities (for ELT LV = 0.856 and QoL-disability LV = 0.831), Cronbach’s alpha (for ELT LV = 0.848 and QoL-disability LV = 0.817), rho_A values (for ELT LV = 0.881 and QoL-disability LV = 0.856), and AVE values (for ELT LV = 0.551 and QoL-disability LV = 0.714).

We found that 52.6% of the variance in the QoL-Disability LV was explained by the recurrence LV and ELT LV, and 25.1% of the variance in the cognitive LV was explained by the recurrence LV, sex, and education. We found that 30.0% of the variance in the recurrence LV was explained by the ELT LV, which in turn was associated with female sex. There were significant total effects of sex on ETL-LV (t = − 2.57, p = 0.011), Qol-disability LV (t = − 3.18, p = 0.002), recurrence LV (t = − 3.06, p = 0.002), and cognitive LV (t = − 2.53, p = 0.012). There were significant total effects of ELT LV on QoL-disability LV (t = 5.81, p < 0.001) and there was a significant specific indirect effect of sex on the recurrence LV mediated by ELT-LV (t = − 2.07, p = 0.039). There were significant indirect effects of ELT-LV on QoL-disability LV (t = 2.52, p = 0.012) and cognitive LV (t = 2.50, p = 0.013).

Associations Between the Recurrence Index and Affective Phenomenology

Figure 2 shows a second PLS path model using the recurrence LV as an input variable and different characteristics of affective phenomenology as output variables, including income, a drug LV (extracted from use of antipsychotics, lithium, and anticonvulsant mood stabilizers as indicators), severity of current suicidal ideation, YMRS, a severity LV extracted from the HAM-D and HAM-A scores, BMI-MetS LV (extracted from BMI and MetS as indicators), an antioxidant LV (extracted from CMPAAse, HDL-cholesterol, and PON1 values), an oxidative and nitrosative stress (O&NS) LV (extracted from zLOOH + AOPP, zSOD + LOOH, and zSOD + LOOH + NOx + AOPP) and UA, hsCRP, basal TSH, and HOMA-IR as single indicators. The BMI-MetS LV was introduced as an explanatory variable for all biomarker data. Thus, this model also examined whether the effects of the recurrence LV on biomarker data were (partly) mediated by the BMI-MetS LV.

Fig. 2
figure 2

Results of consistent partial least squares (PLS) bootstrapping (2000 bootstraps) showing the “recurrence staging index” latent vector (LV) (see legends to Fig. 1) predicting income, a “drugs” LV (extracted from use of antipsychotics (antipsy), lithium, and mood stabilizers (moodst)), current suicidal ideation, Young Mania Rating Scale (YMRS), a “severity” LV (extracted from the Hamilton Depression (HAMD) and anxiety (HAMA) Rating Scale scores), an “antioxidant” LV (antiox) (extracted from PON1 activities and HDL-cholesterol), and a “body mass index” (BMI) LV (extracted from BMI and metabolic syndrome (METS)). The latter predicts biomarkers, including uric acid (UA), C-reactive protein (CRP), thyroid stimulating hormone (TSH), and a HOMA-insulin resistance index (HOMAIR). The antiox LV and UA predict the oxidative and nitrosative (O&NS) LV (extracted from the three O&NS biomarker scores used in the present study). Sex predicts UA, CRP, and O&NS LV. The model shows path coefficients and exact p-values for the inner model and t values for the outer model

Figure 2 shows the outcome of consistent PLS bootstrapping (2000 bootstraps). The model quality data were very adequate with model fit SRMR = 0.065, composite reliabilities that were > 0.803 for all LVs (antioxidant, O&NS, BMI-MetS, drugs, severity LVs, and of course the recurrence LV) and with AVE values that were > 0.577 for all LVs. We found that the recurrence LV explained a large part of the variance in the severity LV (38.5%), YMRS (7.2%), the drug LV (50.7%), income (14.4%), and suicidal ideation (70.8%), while 5.8% of the variance in the BMI-MetS LV and 3.5% of the variance in the antioxidant LV was explained by the recurrence LV. There were no significant direct effects of the recurrence LV on UA, hsCRP, basal TSH, and HOMA-IR index. There were significant associations between the BMI-MetS LV and hsCRP, HOMA-IR index, basal TSH, and UA. There were significant specific indirect effects of the recurrence LV on hsCRP (t = 2.06, p = 0.040) and HOMA-IR (t = 2.15, p = 0.032), which were mediated by the BMI-MetS LV. There were no significant effects of the recurrence and BMI-MetS LVs on the O&NS LV, while O&NS was affected by the antioxidant LV and UA and predicted an increased severity LV. There were significant direct effects of the antioxidant LV and UA on the O&NS LV, which in turn showed direct effects on the severity LV. There were significant total effects of sex on hsCRP, O&NS and UA.

Associations Between a Staging Index and Affective Phenomenology

Figure 3 shows the results of a third PLS analysis with the construction of a second staging index, based on the four indicators used in the recurrent LV, combined with the outcome variables in Fig. 1, namely Sheehan total and WHOQoL scores as well as cognitive test results, namely VFT and TMT_A (TMT_B could not be combined). Table 1 shows that the LV extracted from these indicators had very good performance with excellent composite reliability, Cronbach’s α, rho_A, and AVE. All loadings of the eight indicators were > 0.484, all at p < 0.001. The PLS model in Fig. 3 examines this second staging index as an output variable using the ELT LV as direct explanatory variable and sex as an explanatory variable for both the ELT and staging LV. Figure 3 shows the path model computed using consistent PLS bootstrapping (2000 bootstraps) with casewise deletion. The ELT LV quality data were very adequate with composite reliability of 0.903, Cronbach’s α of 0.866, rho_A of 0.908, and AVE of 0.655. We found that 39.6% of the variance in the staging LV was explained by the regression on the ELT LV (sex was not significant) and that sex explained 8.3% of the variance in the ELT LV. There was a significant specific indirect effect of sex on the staging LV mediated by ELT LV (t = − 2.23, p = 0.026). This second staging index was strongly correlated with the recurrence LV (r = 0.915, p < 0.001, n = 172) and education (r = − 0.316, p < 0.001, n = 171), income (r = − 0.452, p < 0.001, n = 170), employment status (r = − 0.376, p < 0.001, n = 171), severity LV (r = 0.635, p < 0.001, n = 172), YMRS (r = 0.283, p < 0.001, p = 172), severity of current suicidal ideation (r = 0.491, p < 0.001, n = 172), CGI score (r = 0.531, p < 0.001, n = 169), the drug LV (r = 0.514, p < 0.001, n = 172), and the BMI-MetS LV (r = 0.187, p = 0.014, n = 172).

Fig. 3
figure 3

Results of consistent partial least squares (PLS) bootstrapping (2000 bootstraps) showing the construction of a “staging index” latent vector (LV). This staging index consists of number of previous suicide attempts (@#attempt), lifetime suicidal ideation (ColLife_2gr), number of depressive (@#depres) and total number of episodes (allepisono), Sheehan total score (shetotal), WHO-QoL total score (whoqolin), trail making test (TMT), and verbal fluency test (VFT) scores. This staging index is predicted by an “early lifetime trauma” (ELT) LV extracted from five ELT scores, namely emotional abuse (emo.abu), emotional neglect (emo.neg), physical neglect (phy.negl), physical abuse (phys.abu), and sexual abuse (sexabuse), which, in turn, is predicted by sex. The model shows path coefficients and exact p values for the inner model and t values for the outer model

Construction of the ELT-Staging Index

Figure 4 shows how we made a higher order construct (HOC) (named ELT-staging HOC), based on the five ELTs and all indicators of the staging LV presented in Table 3, by combining all indicators of the ELT LV and the staging LV into one HOC comprising 13 indicators. The AVE was, however, only 0.424 and the loading of TMT_A on this LV was only 0.345. Therefore, we deleted TMT_A from this analysis and reran the PLS analysis. Table 1 shows that the LV extracted from those 12 indicators (thus without TMT_A) performed well and showed good composite reliability, Cronbach’s α, and rho_A, with an AVE of 0.454. All variables loaded significantly on this latent vector, namely, all > 0.451 and all p < 0.001.

Fig. 4
figure 4

Results of consistent partial least squares (PLS) bootstrapping (2000 bootstraps) showing the construction of an “ELT staging HOC” (higher order construct) using all indicators of the early lifetime trauma (ELT) latent vector (LV) (see Fig. 1 for description) and the “staging index” LV (see Fig. 3 for description). This ELT staging HOC index consists of number of previous suicide attempts (@#attempt), lifetime suicidal ideation (ColLife_2gr), number of depressive (@#depres) and total number of episodes (allepisono), Sheehan total score (shetotal), WHO-QoL total score (whoqolin), verbal fluency test (VFT) score, and five ELT scores, namely emotional abuse (emo.abu), emotional neglect (emo.neg), physical neglect (phy.negl), physical abuse (phys.abu), and sexual abuse (sexabuse). The model shows path coefficients and exact p values for the inner model and t values for the outer model

For reasons of simplicity, we calculated a final model without TMT_A and VFT. Table 1 shows that the LV extracted from these 11 indicators (Final ELT-staging index) had very good performance with excellent composite reliability, Cronbach’s α, and rho_A and a sufficient AVE, namely 0.460. All indicators loaded highly on this LV; all loadings were > 0.512 and all p < 0.001. There was a very strong association between this index and the ELT-staging-HOC (r = 0.995, p < 0.001, n = 172). There were significant direct effects of sex on the recurrence-staging (t = − 3.52, p < 0.001) and ELT-staging HOC (t = − 3.39, p < 0.001) indices.

Characteristics of the Staging Groups

We used the final ELT staging index as a reference staging index and applied the 66.6666 and 33.3333 percentile values to divide the patients into three groups with equal numbers, thus reflecting three different stages of illness. Table 2 shows the staging indices computed here (namely, recurrence index, staging index, ELT-staging HOC, and the final ELT staging index), as well as the recurrence indicators, total WHOQoL and Sheehan scores, TMT and VFT values in these three staging groups compared with controls. The staging indices were significantly different among the four groups and increased in the order of controls → stage 1 → stage 2 → stage 3. The number of depressive episodes and total number of episodes were significantly different between the three staging groups and increased in the order of stage 1 → stage 2 → stage 3. The number of suicidal attempts and lifetime suicidal ideation were significantly greater in stage 3 compared with all other groups, while lifetime suicidal ideation was also significantly more prevalent in stages 1 and 2 than in controls. The Sheehan and WHO-QoL scores were significantly different between stages 2 and 3 and the other two groups. VFT was significantly lower in the mood staging groups compared with controls, while TMT_A was significantly higher in stages 2/3 than in controls, and there was a trend toward higher TMT_B in stage 3 compared with controls. All five ELTs showed significant differences between the groups with higher values in stage 3 than in all other groups and higher values in stage 2 than in stage 1.

Table 2 Measurement of the indicator variables in controls and three staging groups based on the final ELT-staging index

Table 3 shows the sociodemographic and clinical data of the participants in the staging groups. There were no significant differences in age, marital status, duration of illness, metabolic syndrome, and use of lithium and antipsychotics among the groups. All the other variables showed significant differences with a lowered income in stages 2 and 3 and higher levels of HAM-D, HAM-A, YMRS, and CGI in stages 2 and 3. Current suicidal ideation was significantly higher in stage 3 compared with all other subjects, while stages 1 and 2 also showed higher levels than controls. BMI was significantly higher in stage 3 compared with stage 1 and controls. Significantly more subjects in stage 3 used mood stabilizers than subjects in stage 1. The prevalence of BP1 disorder was significantly higher in stages 2 and 3 than in stage 3.

Table 3 Socio-demographic and clinical characteristics of the three staging groups based on the final ELT-staging index

Computation of a Clinically Useful Staging Algorithm

To compute an algorithm that may be used in clinical practice, we carried out a multiple regression analysis with the final ELT staging index as the dependent variable and its indicator variables or total ELT, and HR-QoL, Sheehan, and CGI scores as explanatory variables. We found that 95.6% of the variance in the final ELT staging index could be explained by the regression on number of depressive and (hypo)maniac episodes and suicide attempts, and total ELT and CGI scores (Χ2 = 684.50, df = 5/159, p < 0.001). Table 4 shows the results of the bootstrapped regression coefficients (1000 bootstraps). As such, the B values can be used to compute the unstandardized predicted values, which are an adequate index of the final ELT staging index. The latter is significantly associated with the BMI-MetS LV (r = 0.215, p = 0.006, n = 165), CMPAase activity (r = − 0.261, p = 0.002, n = 144), the nitro-oxidative stress index zSOD + LOOH + NOx + AOPP (r = 0.318, p < 0.001, n = 134), cognitive LV (r = 0.332, p < 0.001), VFT (r = − 0.385, p < 0.001, n = 80), TMT_A (r = 0.382, p < 0.001, n = 81), and TMT_B (r = 0.298, p = 0.007, n = 80).

Table 4 Results of bootstrapped multiple regression analysis with the Final-ELT-staging index as outcome variable

Construction of a Biological-Staging Index

Finally, we have also examined whether the biomarkers could be reliably combined with the ELT-recurrence-outcome data. Table 1 shows the construct reliability of a fifth LV (named PON1-staging LV) with CMPAase activity (loadings = 0.595), PON1 activity (0.524), number of depressive episodes (0.793), total number of episodes (0.753), HR-QoL (0.781), and Sheehan total score (0.698) as indicator variables. This LV showed good construct reliability with an AVE of > 0.50 and adequate composite reliability, Cronbach’s α, and Rho_A values. This PON1-staging index was significantly associated with the recurrence index (r = 0.731, p < 0.001, n = 172), staging index (r = 0.725, p < 0.001, n = 172), ELT-staging HOC (r = 0.793, p < 0.001, n = 172) and final ELT staging index (0.794, p < 0.001, n = 172), ELT LV (r = 0.507, p < 0.001, n = 172), the nitro-oxidative index (r = 0.258, p = 0.001, n = 172), VFT (r = − 0.374, p < 0.001, n = 83), and TMT_A (r = 0.416, p < 0.001, n = 84), but not CRP (r = 0.061, p = 0.440, n = 160). No latent constructs could be extracted from the indicators of the ELT-relapse LV and other biomarkers.

Figure 5 shows the association between the five indices and the clinical diagnoses into MDD and BD type 1 and type 2. There were significant differences in the five staging indices between the groups (F = 11.65, df = 15/453, p < 0.001, partial eta squared = 0.259). The staging indices were significantly different between controls, major depression, and bipolar disorder (no significant differences between type 1 and type 2).

Fig. 5
figure 5

This figure shows the five new staging indices (mean values with SE of the z transformations) that were computed in the current study in normal controls (00), patients with bipolar type 1 (1.00) and type 2 (2.00), and major depression (3.00). Shown are the recurrence index (recurrence), staging index (Staging), ELT-staging HOC (ELT-Staging), final ELT-staging index (ELT-staging), and PON1-staging index (PON1-staging).

Discussion

The main finding of this study was that, based on real data and consistent PLS bootstrapping (2000 bootstraps), we were able to construct staging indices, which showed excellent construct validity as indicated by adequate Cronbach’s α, composite reliability, and AVE values. As such, our staging indices reflect relevant dimensions of affective phenomenology that are useful in clinical practice to assess severity of staging in both major depression and bipolar disorder. A first reliable staging index comprised relapse data as defined by the number of depressive and (hypo)manic episodes as well as lifetime suicide attempts. This relapse-staging index was significantly associated with female sex, effects of multiple ELTs, and outcome variables including greater disabilities, lowered HR-QoL, and impaired neurocognition, as well as socioeconomic data, use of psychotropic medications, current suicidal ideation, severity of illness, BMI, and reduced antioxidant defenses, thus indicating excellent concurrent validity. Moreover, this relapse-staging index could be combined with ELTs, increased disabilities and lowered HR-QoL, as well as impairments in verbal fluency into an ELT-staging latent construct. A mathematical algorithm based on the latter may be used in clinical practice to score staging severity and to classify patients into three progressive stages of illness. Table 5 summarizes the clinical characteristics of these three stages. An early stage includes patients with few reoccurring episodes, increased lifetime and current suicidal ideation, and lowered verbal fluency, but no significant disabilities or changes in HR-QoL. The transition of stage 1 to stage 2 (the relapse-regression stage) is largely related to increased recurrent episodes and is characterized by increased disabilities, lower HR-QoL and lower socioeconomical status, and cognitive processing speed. Stage 3 shows the same characteristics as stage 2 but, in addition, more ELTs in childhood, a higher recurrence of depressive, but not (hypo)manic, episodes, a trend toward executive dysfunctions, and highly increased suicidal behaviors, which is the hallmark of this suicidal-regression stage.

Table 5 Characteristics of the three stages of affective disoders as delineated in the present study

Our staging models show significant differences from previous models proposed by Berk et al. [2], Kapczinski et al. [10], and Ferensztajn et al. [11]. For example, these models did not take into account the existence of a suicidal-regression phase (our stage 3) and did not consider ELT to be part of the model criteria. Nevertheless, our model and that of Kapczinski et al. [10] show some similar criteria including the use of relapses of episodes, interepisodic functioning, and cognitive impairments. For example, there is a strong similarity between our stage 2/3 and Kapczinsky’s stage 3, which use functional impairments as a key criterion. Major differences are that our model is based on statistical results of supervised machine learning (PLS path analysis) applied on real data from affective disorder patients, while previous models were based on common knowledge. Moreover, it should be noted that our ELT-staging model represents a key dimension of affective phenomenology, which is shared by both major depression and bipolar disorder and encompass ELTs, the relapsing-remitting course, suicidal behaviors, progressive deterioration with functional deficits, cognitive impairments, and disabilities. As such, this stage-defined index captures the trajectory of affective phenomenology from childhood to later retrogression stages.

The role of ELT in the development of affective disorders is now well established. Childhood traumatic events and especially their accumulation in childhood increase risk for developing both depression and bipolar disorder later in life [57,58,59,60,61,62,63,64,65,66,67,68,69,70,71]. Relapses of episodes (and by inference the relapse-remitting course), including recurrent suicidal behaviors, constitute the key component of all our staging indices. The number of depressive episodes increases from stage 1 → stage 2 → stage 3 and the number of (hypo)manic episodes from stage 1 to stage 2/3. Furthermore, a highly increased number of suicidal attempts is the hallmark of stage 3. It is well known that both depression and bipolar disorder are highly recurrent disorders with an increased risk for suicidal attempts and behaviors [72,73,74,75]. In the present study, we found that the staging indices significantly increased from controls → major depression → bipolar disorder, while there were no significant differences between type 1 and type 2 bipolar disorder. Previously, it was shown that bipolar 1 individuals had a higher number of lifetime depressive episodes than bipolar 2 and major depressed patients [76, 77] and that bipolar 1 and 2 patients had a higher number of lifetime depressive episodes compared with major depressed patients [31].

The current study showed that functional deficits, including increased disabilities, decreased quality of life, and a decreased socioeconomical status, are hallmarks of the transition from stage 1 to stage 2/3. There is now evidence that major depression and bipolar disorder are among the leading sources of disability among younger adults worldwide [78, 79], whereby a significant part of bipolar patients show impairments in their employment status and interepisode dysfunctions [80]. Both disorders are not only accompanied by increased functional disabilities but also by lowered HR-QoL [80,81,82,83].

Our findings showed that the components of the ELT-staging index share a large part of their variance, indicating that the indicators of this latent vector should be functionally associated. It is known that ELTs may cause more recurring episodes [65, 84, 85], and increased risk for subsequent suicidal behaviors [86, 87]. The latter findings may also explain why ELTs are more prominent in stage 2 and especially in stage 3, which is characterized by increased ELT and suicidal behaviors. In this regard, it should be noted that the effects of female sex on the staging indices are mediated by higher ELT scores in females, suggesting that females with ELT are at increased risk to experience increased staging severity compared with their male counterparts [7]. Moreover, it is known that multiple ELTs are associated with increased severity of illness [61, 69, 71, 88, 89], decreased HR-QoL, and increased disabilities in later life [90,91,92,93], delays in neurocognitive development [94, 95] and higher BMI and obesity [96, 97].

In the current study, we found that the cumulative effects of multiple ELTs, which appear to drive most outcome variables, are in fact mediated by the recurrence-staging index. Such findings may be explained by the knowledge that recurrent episodes and especially reoccurring suicidal attempts and suicidal ideation may affect work productivity and functions, thereby contributing to lower socioeconomic status [33, 98,99,100]. Moreover, ELTs and recurrent patterns and suicidal behaviors may affect the severity of illness, which in turn is accompanied by decreased HR-QoL [33, 101].

The second major finding of this study was that lowered lipid-associated antioxidant defenses could be reliably combined with relapsing episodes, disabilities, and HR-QoL into a PON1-staging index, which showed adequate composite reliability and that the final ELT-staging index was significantly associated with increasing nitro-oxidative stress. These findings suggest that lower antioxidant defenses and increased nitro-oxidative stress are important factors underpinning the pathophysiology of the ELT-staging process. Previous literature has shown that most, if not all, components of the ELT-staging index are associated with nitro-oxidative stress, lowered antioxidant defenses, and related neuro-immune pathways.

Recently, we reported that, in affective disorders, physical neglect is associated with nitro-oxidative biomarkers reflecting increased production of ROS/RNS, nitro-oxidative stress, and lipid and protein oxidation [102]. Importantly, interactions between IL-6 and CRP single-nucleotide polymorphisms and ELT increase risk toward major depression [103]. There is now evidence that, in humans, psychosocial stressors may cause nitro-oxidative stress and decreased antioxidant defenses [review: 102]. Animal models, including those that model chronic mild stress, chronic social isolation, swimming and restraint stress, are characterized by increased lipid and protein oxidation and lowered antioxidant levels in the brain [review: 102]. The second component of the ELT-staging index, namely recurrence of episodes and suicidal behavior, was also significantly associated with lowered antioxidant defenses and neuro-oxidative and neuro-immune biomarkers [26, 30, 31, 104]. For example, Sowa-Kucma et al. [31] reported that the number of affective episodes was significantly associated with increased lipid peroxidation and signs of cell-mediated immune activation, while Moreira et al. [26] established that lowered lipid-associated antioxidant defenses are associated with increased numbers of depressive and (hypo)manic phases. Celik et al. [104] and Maes et al. [30] found positive associations between the number of lifetime depressive episodes and immune biomarkers including neopterin and cytokines. These neuro-immune data are relevant to our current results because, in affective disorders, neuro-immune and neuro-oxidative pathways are interrelated phenomena [105]. Interestingly, lowered PON1 activity is already present in patients with a first episode of psychosis of whom a considerable percentage will develop bipolar disorder [106], while lowered PON1-related antioxidant defenses are significantly associated with activated neuro-oxidative and neuro-immune pathways in affective disorders [26].

In the current study, suicidal behaviors are a key component of the staging models and a hallmark of stage 3. Previously, it was shown that current suicidal ideation is associated with increased signs of lipid peroxidation and that a history of lifetime suicidal attempts is associated with increased lipid peroxidation and signs of immune activation [31]. Vargas et al. [107] reported that prior suicidal attempts are accompanied by increased lipid hydroperoxides and lower total radical trapping antioxidant parameters, suggesting that lipid peroxidation and lowered antioxidant defenses are biomarkers of suicidal attempts. Additionally, the outcome variables measured here were strongly related with lipid-associated antioxidant defenses, including income, CGI, HR-QoL, and disabilities. Recently, we reported that lowered HR-QoL and its subdomains in affective disorders are associated with increased levels of malondialdehyde, advanced oxidation protein products, and lowered CMPAase PON1 activity [33]. Finally, there is now some evidence that increased neuro-oxidative stress and neuro-immune pathways may affect neurocognitive functions in affective disorders [108]. It is interesting to note that the significant effects of the recurrence-staging LV on hsCRP and insulin resistance were in fact mediated by the effects of recurrence-staging on BMI. These findings may explain why both hsCRP and insulin resistance are biomarkers of affective disorders [32, 109], while controlling for BMI shows that both biomarkers do not always discriminate patients with depression or bipolar disorder from controls [32, 70]. The other two biomarkers examined here, including uric acid and basal TSH levels, were not significantly associated with the staging indices.

The results of our study should be interpreted with respect to its limitations. First, we did not include patients with chronic depression or treatment resistant depression (TRD), although they were important indicators in previous staging models [9, 11]. It remains to be elucidated whether TRD belongs to the same ELT-staging index or is a separate entity that should be treated differently. Second, we excluded patients with comorbid conditions (apart from DSM-5 anxiety disorders) including psychosis and medical disorders, although in clinical practice these comorbidities are very common and probably deteriorate staging. For example, cardiovascular disorders are common comorbidities with affective disorders and are accompanied by lowered PON1 activities and increased neuro-oxidative stress [26]. Third, our study was a cross-sectional study and, therefore, no causal inferences can be made. Fourth, it would have been more interesting if we had measured a panel of cytokines to compute the inflammatory/anti-inflammatory indices [13]. Finally, although the model herein proposed was validated using the bootstrap method, the model deserves validation in independent study samples consisting of patients originating from different regions of the world and with a different genetic and cultural background.

Importantly, here, we propose an evidence-based and statistically validated approach, which may have consequences for the diagnosis of affective disorders [110, 111]. Indeed, we showed that patients with mood disorders may be (a) scored on a severity scale based on a psycho-bio-phenomenological dimension (psycho pointing toward ELT; bio to biomarkers and phenomenological to the clinical and staging characteristics) that encompasses the lifetime trajectory of mood disorders (from ELTs, a relapsing course to progressive disabilities and suicidal behaviors); and (b) classified into three stages of that trajectory as well as as unipolar or bipolar. Phrased differently, here, we have constructed a trans-diagnostic psycho-bio-phenomenological dimension, which allows a quantitative and qualitative approach of mood disorders. This approach is much more powerful than the American Psychiatric Association (APA) approach applied in the DSM, which lacks statistical and biological validation [112,113,114], and the trans-diagnostic phenotypes proposed by the NIMH Research Domain Criteria (RDoC) [115, 116]. In addition, our statistically validated HOC (dimensional) and classification (categorical) of mood disorders combines the continuity versus discontinuity approach and the bipolar spectrum approach [117]. Moreover, the abovementioned theories [117], the consensus-based APA classifications, and the RDoC system miss our point that the delineation of phenomenological dimensions and classifications of psychiatric phenomenology should be based on results of pattern recognition methods, which are validated statistically [118].

Figure 6 summarizes our findings. We constructed an ELT-staging index that reflects a key dimension of affective phenomenology, shared by both major depression and bipolar disorder and encompassing ELTs, reoccurring episodes and suicidal behaviors, and a progressive functional retrogression with increased disabilities, impairments in verbal fluency, cognitive processing speed and executive functions, and lower HR-QoL. Female sex has a significant impact on the staging indices, and this effect is mediated by increased ELT scores in females. The ELT-staging index is significantly associated with increased nitro-oxidative stress, while lowered serum paraoxonase 1 activities combines with recurrent episodes, disabilities, and HR-QoL into a relevant biological dimension. Moreover, based on the ELT-staging index, which reflects severity of staging, we defined three stages: (1) an early stage, (2) a relapse-retrogression stage, and (3) a suicidal-retrogression stage. This staging model may help to tailor interventions in the early stage to prevent the transition from stage 1 to stages 2/3. Our results suggest that treatments that target lowered lipid-associated antioxidant defenses may be useful to this end.

Fig. 6
figure 6

New staging model of affective disorders showing a key dimension of affective phenomenology, shared by both major depression and bipolar disorder, and which encompasses early lifetime traumata (ELT), reoccurring episodes and suicidal behaviors, and a progressive functional retrogression with increased disabilities, impairments in cognitive processes and lower HR-QoL. Female sex had a significant impact on this staging index which was mediated by increased ELT scores in females. Lowered serum paraoxonase 1 activities (PON1) may be combined with recurrent episodes, disabilities, and HR-QoL into a latent construct with adequate composite reliability, while the staging index is also significantly associated with increased nitro-oxidative stress. Three stages of progressing severity were defined: (1) an early stage, (2) a relapse-retrogression stage, and (3) a suicidal-retrogression stage