Introduction

Neurofilaments are structural proteins classified as intermediate filaments with an intermediate diameter (10 nm) between actin (6 nm) and myosin (15 nm), and solely expressed in neurons as the name indicates (Khalil et al. 2018). These proteins consist of several subunits, one of which is neurofilament light chain (NfL), an attractive candidate biomarker for neurodegenerative disorders. NfL is consistently released from axons into the extracellular space in an age-dependent manner. Axonal damage induces the increased release of NfL into the extracellular space (Gaetani et al. 2019). Thus, widespread neurodegeneration in amyotrophic lateral sclerosis (ALS) and atypical parkinsonian disorders (APD) such as multiple system atrophy (MSA) and progressive supranuclear palsy (PSP) causes axonal damage, which subsequently increases the levels of NfL in the cerebrospinal fluid (CSF).

Accumulating evidence has demonstrated that CSF NfL can be helpful for the differential diagnosis of ALS or APD (Sako et al. 2015; Gaetani et al. 2019), and that moreover, the diagnostic power of blood NfL is comparable to that of CSF NfL (Hansson et al. 2017; Feneberg et al. 2018; Marques et al. 2019; Lin et al. 2019; Verde et al. 2019). Owing to the invasiveness and contraindications of obtaining CSF NfL, the use of blood NfL is expected to be extended in the clinical setting. However, blood NfL per se cannot serve as a specific diagnostic tool because NfL is only a marker for axonal damage. Although diagnostic performance of blood NfL should be tested in a clinical setting for first medical examination at the initial stage of diagnosis, only one study evaluated the diagnostic performance of blood NfL for ALS in the aforementioned setting (Feneberg et al. 2018), in addition, no study for parkinsonian disorders.

Chitinase 3-like 1 (CHI3L1), also named as YKL-40, is the other attractive candidate biomarker. CHI3L1 is produced by a variety of cells, such as astrocytes, neutrophils and monocytes/macrophages, and is involved in the activation of macrophages/microglia, the migration of macrophages, and the initial innate immune responses (Rosa et al. 2016; Sanfilippo et al. 2017). CHI3L1 is overexpressed in the primary motor cortex of patients with ALS (Sanfilippo et al. 2017), and CSF CHI3L1 was reported as useful, to some extent, in the differential diagnosis of ALS and parkinsonian disorders (Olsson et al. 2013; Thompson et al. 2018; Gille et al. 2019). Furthermore, no significant difference was observed in blood CHI3L1 levels between patients with ALS and normal controls (Vu et al. 2020). However, these studies measured the concentration of CHI3L1 in the CSF using enzyme-linked immunosorbent assay or proteomics, which are unsuitable for detecting subtle differences in proteins in the blood between the groups.

Highly sensitive methods including electrochemiluminescence (ECL) and single molecule array (Simoa) in the blood CHI3L1 are expected to allow for a good diagnostic performance similar level to CSF like NfL.

We assessed the diagnostic power of serum NfL and CHI3L1 using samples from patients with suspected ALS or parkinsonian disorders at the initial stage of diagnosis based on the findings of neurological examination, magnetic resonance imaging, single photon emission computed tomography, nerve conduction study and electromyography, to confirm the clinical importance of these blood biomarkers in the process of diagnosis.

Methods

Participants

Figure 1 shows the flow diagram for the present study. A total of 331 independent subjects with available serum samples were identified. They had visited Tokushima University Hospital between July 2016 and August 2019, and were assessed for eligibility, based on the following criteria: (1) healthy controls, suspected of having ALS or parkinsonian disorders, and (2) an age of ≤ 80 years old. Among these, 157 subjects were deemed eligible for this study including 58 subjects with suspected ALS [40 men and 18 women, median age 68 years, interquartile range (IQR) 59.3–72.8], 58 subjects with parkinsonism (35 men and 23 women, median age 69 years, IQR 64–73), and 41 healthy controls (17 men and 24 women, median age 67 years, IQR 53–70). We used the International Consensus Criteria for the diagnosis of ALS (Costa et al. 2012), PD (Hughes et al. 1992), multiple system atrophy (MSA) (Gilman et al. 2008), progressive supranuclear palsy (PSP) (Hoglinger et al. 2017), and corticobasal syndrome (CBS) (Armstrong et al. 2013). All patients met the proposed criteria for “probable,” “probable laboratory-supported,” or “definite.” The control group included subjects without neurological deficits or psychiatric disorders.

Fig. 1
figure 1

A flowchart of the diagnostic process. ALS amyotrophic lateral sclerosis; APD atypical parkinsonian disorders; CBS corticobasal syndrome; Ctr healthy controls; MMN multifocal motor neuropathy; MSA-P multiple system atrophy with predominant parkinsonism; PD Parkinson’s disease; PLS primary lateral sclerosis; PSP-P progressive supranuclear palsy with predominant parkinsonism; PSP-RS progressive supranuclear palsy with Richardson’s syndrome

Of 58 patients with suspected ALS, 18 patients satisfied the criteria for ALS at the initial stage of diagnosis [initially diagnosed with ALS (ID-ALS)]. Among the remaining 40 patients, 32 patients fulfilled the criteria later [late-diagnosed with ALS (LD-ALS)], and 8 patients were diagnosed with other diseases (ALS mimics). The details of the ALS mimics were as follows: three cases of cervical spondylosis; one case of multifocal motor neuropathy; two cases of neuropathy of unknown cause with electrodiagnostic findings; one case of primary lateral sclerosis, and one case of probable PSP with Richardson’s syndrome (PSP-RS). We combined the ID-ALS group and the LD-ALS group into one group (LD/ID-ALS group), based on the final diagnosis.

Of 58 patients with parkinsonism, 17 patients were diagnosed with APD at the initial stage of diagnosis [initially diagnosed with APD (ID-APD)] including 12 cases of MSA with predominant parkinsonism and 5 cases of PSP-RS. Among the remaining 41 patients with suspected PD, 29 patients were diagnosed with PD (PD) and 12 patients received a diagnosis of APD later [PD mimics, late-diagnosed with APD (LD-APD)]. The PD mimics consisted of eight cases of PSP-RS, two cases of PSP with predominant parkinsonism, and two cases of CBS. We combined the PD mimics group and the ID-APD group into one group (PD mimics/ID-APD group), based on final diagnosis.

We evaluated age at blood sampling, sex, disease duration, follow-up period, Mini-Mental State Examination (MMSE) score, frontal assessment battery (FAB) score, and disease severity. Disease duration was the time interval from motor symptom onset to blood sampling. Disease severity in ALS and in parkinsonism were assessed using the revised ALS Functional Rating Scale (ALSFRS-R) (Cedarbaum et al. 1999) and the Hoehn–Yahr stage (HY), respectively.

Sample collection and measurement of serum concentration

Blood samples were collected between 9:00 am and 3:30 pm and centrifuged within 1 h of collection at 780 g for 10 min at 4 °C. Serum samples were aliquoted into microfuge tubes, and stored at − 80 °C until analysis. An ECL immunoassay was applied to measure the serum concentrations of NfL and CHI3L1 using the MESO QuickPlex SQ 120 (Meso Scale Discovery, Rockville, MD, USA). In this study, we used the R-PLEX Human Neurofilament L Antibody Set (Meso Scale Discovery, Rockville, MD, USA) for the analysis of NfL and the U-PLEX Human YKL-40 Assay (Meso Scale Discovery, Rockville, MD, USA) to measure the concentrations of CHI3L1 according to the manufacturer’s protocol. These kits included 96-well plates with a biotinylated capture antibody that binds to streptavidin on the plate surface. Serum samples were diluted two-fold for NfL and ten-fold for CHI3L1 with supplied assay diluent before analysis. The diluted samples were reacted with the biotinylated capture antibody and incubated for 1 h. After washing, the detection antibody conjugated with an electrochemiluminescent label (MSD GOLD SULFO-TAG) was added and incubated for 1 h. The MSD GOLD Read Buffer was added after final wash, and the MESO QuickPlex SQ 120 allowed for measurement of serum concentration of NfL and CHI3L1 by voltage-induced chemiluminescent emission. All analyses were conducted in duplicate, and the results were determined as the average. Values below the detection limit were set to zero.

Statistical analysis

The clinical characteristics and concentrations of NfL and CHI3L1 were compared between groups using the Kruskal–Wallis nonparametric analysis of variance, followed by a Dunn’s post hoc test corrected for multiple comparisons with Bonferroni adjustment for continuous variables, or Fisher’s exact test for categorical variables. Spearman correlation coefficient was assessed between serum concentration of NfL and CHI3L1 in the all subjects and each subgroup. Receiver operating characteristic (ROC) curves were generated and the area under the curve (AUC) was measured to assess the diagnostic accuracy. We used multivariate logistic analysis to combine concentration of NfL and CHI3L1. Diagnostic performances were compared on the basis of AUC using DeLong’s test. Findings were significant for a value of p < 0.05. All analyses were conducted with R software (version 3.5.2; http://www.r-project.org/).

Availability of supporting data

Anonymized data are available on request after ethical approval and permission from the responsible committee.

Results

Subject characteristics

No between-group differences existed in age at blood sampling, sex, follow-up period, or score on the ALSFRS-R, MMSE, or FAB. The median (IQR) disease durations were 15 (9–47.8) months in the ALS mimics group, 8 (5–12) months in the LD-ALS, and 14 (7.5–27) months in the ID-ALS group, 43 (24–72) months in the PD group, 29 (9–67) months in the PD mimics group, and 33 (13–50) months in the ID-APD group. Disease duration was significantly different between the groups (p < 0.001, Kruskal–Wallis test).

The median (IQR) HY scores were 2 (2–3) in the PD group, 3.5 (2–4) in the PD mimics group, and 4 (3–4) in the ID-APD group. The HY stage was significantly different between patients with parkinsonism (p < 0.001, Kruskal–Wallis test). The HY stages were higher in the PD mimics group (p < 0.01, Dunn’s test) and ID-APD group (p < 0.001, Dunn’s test) than in the PD group. Subject characteristics were summarized in Table 1.

Table 1 Characteristics of included subjects

Serum NfL and CHI3L1 concentrations

A significant difference in the serum NfL concentration existed between subjects with suspected ALS, including the control group (p < 0.001, Kruskal–Wallis test; Fig. 2A, left). Compared to the control and the ALS mimics groups, the LD-ALS group (p < 0.001 and p < 0.05, respectively, Dunn’s test) and the ID-ALS group (p < 0.001 and p < 0.05, respectively, Dunn’s test) had higher serum concentrations of NfL (Fig. 2A, left). A significant difference was observed in the serum NfL concentration between subjects with parkinsonism including the control group (p < 0.001, Kruskal–Wallis test; Fig. 2A, right). Post hoc Dunn’s test revealed significantly higher serum concentrations of NfL in the PD mimics group (p < 0.05 and p < 0.05, respectively, Dunn’s test) and ID-APD (p < 0.001 and p < 0.001, respectively, Dunn’s test) than in the control group or PD group (Fig. 2A, right).

Fig. 2
figure 2

Serum concentration of NfL and CHI3L1. A In the group with suspected ALS, the serum NfL concentration is increased in the LD-ALS and ID-ALS groups, compared to the control group or ALS mimics group. In the parkinsonism group, the serum NfL concentration is decreased in the PD group, compared to the PD mimics group and ID-APD group. B No significant difference in CHI3L1 exists between any groups. CHI3L1 chitinase 3-like 1; Ctr healthy control; ID-ALS initially diagnosed with amyotrophic lateral sclerosis; ID-APD initially diagnosed with atypical parkinsonian disorders; LD-ALS late-diagnosed with amyotrophic lateral sclerosis; LD-APD late-diagnosed with atypical parkinsonian disorders; NfL neurofilament light chain; NS not significant; PD Parkinson’s disease. The significant levels are *(p < 0.05), **(p < 0.01), and ***(p < 0.001)

No significant difference existed in the serum CHI3L1 concentrations between subjects with suspected ALS (p = 0.14, Kruskal–Wallis test; Fig. 2B, left) or parkinsonism (p = 0.44, Kruskal–Wallis test; Fig. 2B, right).

Serum concentration of NfL was correlated with that of CHI3L1 in parkinsonism (parkinsonism, rho = 0.46, p < 0.001; PD, rho = 0.48. p = 0.01; LD-APD, rho = 0.64, p = 0.03; ID-APD, rho = 0.53, p = 0.03; Figure S1G–J), whereas serum concentration of NfL was not correlated with that of CHI3L1 in other groups (all subjects, rho = 0.16, p = 0.04; Ctr, rho = 0.00, p = 0.99; suspected ALS, rho = 0.12, p = 0.37; ALS mimics, rho = 0.05. p = 0.93; LD-ALS, rho = 0.04, p = 0.84; ID-ALS, rho = 0.40, p = 0.10; Figure S1A–F).

Diagnostic accuracy of serum NfL and CHI3L1 concentration

Serum NfL had high AUC for differentiating ALS mimics from LD-ALS at the initial stage of diagnosis (AUC = 0.90, p < 0.001; Fig. 3A), as well as from ALS (LD/ID-ALS; AUC = 0.89, p < 0.001; Fig. 3B). The diagnostic accuracy of serum NfL for PD versus PD mimics was almost high at the initial stage of diagnosis (AUC = 0.76, p < 0.01; Fig. 3C), and the serum NfL was good for distinguishing PD from APD (AUC = 0.80, p < 0.001; Fig. 3D).

Fig. 3
figure 3

The ROC curve analysis results of serum NfL. A, B The AUCs are significant and approximately 0.9 for ALS mimics versus LD-ALS or ALS (LD/ID ALS). C, D The AUCs are significant and approximately 0.8 for PD versus PD mimics (LD-APD) or APD (PD mimics/ID-APD). AUCs area under the curves; ID-ALS initially diagnosed with amyotrophic lateral sclerosis; ID-APD initially diagnosed with atypical parkinsonian disorders; LD-ALS late-diagnosed with amyotrophic lateral sclerosis; LD-APD late-diagnosed with atypical parkinsonian disorders; NfL neurofilament light chain; PD Parkinson’s disease; ROC receiver operating characteristic

Serum CHI3L1 had a low AUC for differentiating ALS mimics from LD-ALS (AUC = 0.55, p = 0.33; Figure S2A) and ALS (LD/ID-ALS) (AUC = 0.50, p = 0.50; Figure S2B). Furthermore, CHI3L1 had a low AUC for differentiating PD from PD mimics (AUC = 0.64, p = 0.08; Figure S2C) and APD (PD mimics/ID-APD) (AUC = 0.59, p = 0.12, Figure S2D). These results reflected no significant differences in the serum concentrations of CHI3L1 between subjects with suspected ALS or parkinsonism.

The AUCs of the serum NfL and the combination of serum NfL and serum CHI3L1 were 0.90 and 0.89 for discriminating ALS mimics from LD-ALS at the initial stage of diagnosis (p = 0.48, Figure S3A). The AUCs of the serum NfL and the combination of NfL and CHI3L1 were 0.89 and 0.88 for discriminating ALS mimics from ALS (p = 0.57, Figure S3B). The AUCs of the serum NfL and the combination of NfL and CHI3L1 were 0.76 and 0.80 for discriminating PD from PD mimics at the initial stage of diagnosis (p = 0.36, Figure S3C). The AUCs of the serum NfL and combination of NfL and CHI3L1 were 0.80 and 0.83 for discriminating PD from APD (p = 0.39, Figure S3D).

Discussion

Our findings suggested that serum NfL can be helpful in the differential diagnosis of ALS and to some extent of parkinsonian disorders at the initial stage of diagnosis. By contrast, serum CHI3L1 did not show a difference between clinically mimicking and true ALS/PD.

In studies comparing of PD with MSA or PSP, the AUC of the ROC curve exceeded at least 0.8, albeit not mimicking PD (Hansson et al. 2017; Marques et al. 2019; Lin et al. 2019), which corresponded to our finding. Likewise, in a study comparing patients with ALS onset within 6 months and those with an ALS-mimicking disease, the AUC of the ROC curve was more than 0.9 (Feneberg et al. 2018), which agreed with our finding. In addition, the AUC exceeded 0.85 in another study that compared ALS with other disorders (Verde et al. 2019), and the diagnostic power of our study was not inferior to that of the previous studies. Serum NfL is increased at least 6 months before phenoconversion in inherited ALS (Benatar et al. 2018, 2019). Given these results, serum NfL allows for an early differential diagnosis between ALS and other mimicking disorders. However, to the best of our knowledge, no reports exist in which the diagnostic accuracy of serum NfL in patients with APD was tested at the initial stage of diagnosis, which was found to be almost good in this study. Serum NfL may help differentiate parkinsonian syndromes, even in the early stage of the diseases.

Plasma NfL has been associated with progression of cognitive impairment in PD (Lin et al. 2019). Patients with the postural instability and gait difficulty subtype have a higher concentration of plasma NfL than those with the tremor-dominant subtype (Ng et al. 2020). Cognitive impairment and the postural instability and gait difficulty subtype are recognized risk factors for mortality (Kempster et al. 2010; Forsaa et al. 2010), and thus, an increased level of blood NfL may increase mortality because blood NfL was associated with these two factors (i.e., cognitive impairment, postural instability and gait difficulty). An increased level of blood NfL is related to poor survival in patients with ALS (Lu et al. 2015). Patients with typical ALS or progressive bulbar palsy exhibit a higher level of CSF NfL than those with a frail arm, leg syndrome or progressive muscular atrophy in a subtype-dependent manner (Gaiani et al. 2017), and tend to have a poor prognosis. Collectively, blood NfL can allow us to predict the prognosis in PD and ALS, and the patients enrolled in this study will be followed up to determine the usefulness of serum NfL for predicting a prognosis.

The CSF CHI3L1 concentration has been reported as decreased or unchanged in patients with PD (Olsson et al. 2013; Hall et al. 2018; Magdalinou et al. 2014; Wennström et al. 2015), and increased in patients with MSA, PSP (Magdalinou et al. 2014; Alcolea et al. 2017; Hall et al. 2018), and ALS (Illán-Gala et al. 2018; Thompson et al. 2018; Gille et al. 2019; Gray et al. 2020; Vu et al. 2020); however, the blood CHI3L1 concentration was not significantly different between any groups in this study-even with ECL, which is a highly sensitive method. This result was in agreement with that of a previous study (Vu et al. 2020). Using difference in blood CHI3L1 to reflect pathology in the central nervous system may be difficult because various types of peripheral cells contribute to the serum concentration of CHI3L1 and decrease the signal-to-noise ratio (Rosa et al. 2016; Sanfilippo et al. 2017). Patients with ALS nevertheless tended to have a slightly high concentration of serum CHI3L1 compared to patients with ALS-mimic diseases. Further studies are required to determine the significance of the increased level of serum CHI3L1 in ALS.

Serum NfL was found to be significantly correlated with serum CHI3L1 not in ALS but in parkinsonian disorders, suggesting neuronal damage might be associated with innate immunity in parkinsonian disorders. Toll-like receptors, involved in innate immunity, were thought to play an important role in pro-inflammation in the presence of alpha-synuclein (Kouli et al. 2019). In contrast, recent study demonstrated that Toll-like receptor 4 deficiency facilitates alpha-synuclein propagation and neurodegeneration in a mouse model of prodromal PD (Venezia et al. 2021). Collectively, it remains controversial how innate immunity contributes to pathology of parkinsonian disorders. Further studies need to be performed to confirm our findings.

A major limitation of this study is the small number of subjects at the initial stage of diagnosis due to the single-center design. In addition, single-molecule assay (Simoa) allows to capture single protein molecules using antibody-coated beads, and has been accepted as an optimal method to measure concentration of NfL in the CSF compared to ECL (Alagaratnam et al. 2021). Simoa could improve sensitivity to detect NfL in serum and plasma (Kuhle et al. 2016; Rissin et al. 2010; Wilson et al. 2016). To the best of our knowledge, this is the first study to assess the discrimination ability of the serum NfL concentration for parkinsonism at the initial stage of diagnosis, and additional studies are needed to confirm our findings.

In conclusion, the serum NfL concentration had an excellent and an almost good diagnostic performance for patients with ALS and PD, respectively, at the initial stage of diagnosis, whereas no significant difference existed in serum CHI3L1 between any groups. These findings contribute to development of a diagnostic biomarker for ALS and parkinsonian disorders in the early stage of the diseases.