INTRODUCTION

Intermediate filaments (IF) represent an important component of cytoskeleton, along with microtubules and microfilaments. About 70 genes encoding IF proteins have been described in humans [12]. All proteins of the IF family have similar structural features, in particular, presence of the central α-helical domain (the so-called rod domain). The IF structure is characterized by common structural elements: three α-helical domains (1A, 1B, and 2), which are involved in the formation of coiled-coil domains, are separated by α-helical linkers, which are not involved in the coiled-coil formation [34]. Rod domain of IF is a constituent of the coiled-coil structure and its primary structure is organized as heptad repeats in which amino acid residues are denoted by the letters a-g. The positions a and d often contain hydrophobic residues involved in formation of the hydrophobic core, which allows α-helical rod domains to form a coiled-coil structure. IF dimers interact in antiparallel manner and form a tetramer [34]. Assembly of IF from dimers to mature filaments has three stages [5-7]. At the first stage, rapid formation of IF tetramers occurs, next lateral interactions trigger assembly of ULFs (Unit Length Filaments) or protofibrils whose diameter exceeds 10 nm, and, finally, at the last stage, filaments are elongated, compacted, and their diameter is reduced to 10 nm [7]. Principles of the organization of proteins in the IF family are generally universal, but its individual representatives are characterized by unique properties. For example, lamins are not capable of lateral polymerization. Due to complicated organization of the IF protein complexes, point amino acid substitutions even in one domain could lead to serious changes, up to the loss of ability of the mutant protein to self-assembly or to selective blocking of polymerization at the ULF stage (mutation Y117L in vimentin [8]). The most studied members of the IF family are vimentin, desmin and keratins. For them, atomic structures of the coiled-coil domains, including sites responsible for interaction of the dimers and tetramers with each other, have been obtained [3, 4, 9]. Based on the homology between the IFs, it can be assumed that the similar principle of organization would be also characteristic for neurofilaments.

In mature neurons, five IF proteins are expressed – a triad of neurofilament proteins: the neurofilament light (NFL), medium (NFM), and heavy (NFH) chains, as well as peripherin and α-internexin. The main difference between the three neurofilament proteins from each other is determined by their C-terminal domains, which differ both in length and in the presence of important phosphorylation sites. The NFM and NFH proteins are characterized by the presence of long C-terminal domains that contain KSP (Lys-Ser-Pro) repeats that are phosphorylation sites (for example, for Erk 1,2 kinases) [1011]. In early studies of the neurofilament structure in neurons, it was shown that the ratio of proteins in the cells is 7/3/2 per NFL/NFM/NFH monomer [10]. Structurally, the proteins are represented by NFL/NFL homodimers, as well as NFL/NFM and NFL/NFH heterodimers [1213]. Compared with other IFs, neurofilaments are distinguished by their heterooligomeric structure (heterodimeric), as well as a significant extension along the entire length of the axon, which requires a developed system of their transport. However, in vitro, only NFL can self-assembly, while NFM and NFH are not able to polymerize. Formation of the filamentous structures produced by the NFL protein in vivo has been shown in the SW13 Vim cells that do not express endogenous vimentin [14]. At the same time, NFM and NFH proteins are necessary for the normal expression of NFL in neurons [15].

In this paper, we focused on studying physicochemical properties of the main representative of neurofilaments – the NFL protein. A large number of mutations leading to the development of hereditary Charcot–Marie–Tooth neuropathy (CMT) were found in the NEFL gene encoding this protein [16]. This disease includes a group of pathologies that are divided into two types: demyelinating and axonal forms [1617]. The first type of neuropathy (CMT 1) is characterized by the decrease in conductivity of the nerve fiber, and the second type (CMT 2) is characterized by the degenerative processes in the axon without reducing the conductivity [18]. The known mutations in the NEFL gene can be divided into several groups according to (i) their localization in the structure of the NFL protein or (ii) type of Charcot–Marie–Tooth disease that they cause: demyelinating (1F) or axonal (2E) [17, 19, 20]. Aggregation was demonstrated for a number of the mutant NFL proteins (P8Q, E90K, N98S, A149V) in the cell culture, which in most cases could be eliminated by co-expression of NFL with NFM [21-24]. At the same time, expression of three intermediate filament proteins (NFL, NFM, and peripherin) was necessary to prevent aggregation of the three mutant proteins (N98S, Q332P, E397K) [22]. For the NFL protein with N98S replacement, disruption of assembly and formation of aggregates was shown both in the cell cultures and in the mouse neurons [23]. Moreover, a pronounced tremor and accumulation of aggregates in the cerebellum and spinal cord were observed in the mouse heterozygous N98S/WT model. At the same time, pathological aggregation of the mutant proteins inside the cells was not observed in some studies, but a noticeable decrease in the axon myelinization was detected [25]. Contradictions in the accumulated data raise a number of questions. So, it is not obvious whether formation of the NFL aggregates is a consequence of improper protein folding or is it caused by the changes in its interaction with the protein partners?

Proteins containing amino acid substitutions in the coil 1A (E90K and N98S) and in the coil 1B (A149V) were selected for study in this work (Fig. 1). For these mutant forms of NFL, disruption of assembly (formation of very short filaments or protein aggregates) was previously shown in vivo [21, 22, 24]. To study the causes of pathogenic properties of these NFL mutations, it is necessary to determine effects they have on the protein structure. Position of amino acid residues in the heptads determines structure and properties of the coiled-coil proteins, including stability of the coiled-coil structure, which in turn could affect properties of the NFL oligomers. Considering that the mutations chosen for this study are located in the heptad repeats at positions d (E90K), e (N98S), and g (A149V), i.e., in those positions, which can affect stability of the coiled-coil structure, it can be suggested that the mutations E90K, N98S, and A149V (Fig. 1) may lead to the changes in the coiled-coil structure of the NFL. This assumption is supported by the previously obtained data that the mutations not only at positions a and d of the heptad repeats, but also at positions e and g can affect significantly stability (thermal stability) of the coiled-coil of the tropomyosin molecule [26-31]. Verification of this assumption was the main aim of this work. For that, we investigated effects of these amino acid substitutions on thermal denaturation of the NFL protein using circular dichroism (CD) and differential scanning calorimetry (DSC).

Fig. 1.
figure 1

Structure of the NFL molecule. Arrows mark positions of mutations E90K, N98S, and A149V in the coils 1A and 1B.

MATERIALS AND METHODS

Proteins expression and purification. All NFL protein used in this study were recombinant proteins, products of the human NEFL gene (UniProt P07196). Wild-type NFL (WT) and mutant forms were cloned into the EV vector (Cloning Facility, Russia). For introduction of mutations, a Q5-site directed mutagenesis kit (NEB, USA) and the following sets of primers were used. Primers F1 and R1 were used to introduce the N98S substitution; primers F2 and R2 were used to introduce the E90K substitution; primers F3 and R3 were used to introduce the A149V substitution. Sequences of the primer (Evrogen, Russia) are presented in Table 1.

Table 1 Sequences of primers used in this work

All obtained constructs were used for bacterial expression of proteins in Escherichia coli strain C41. Isolation and purification of all recombinant NFL proteins was carried out from the fraction of inclusion bodies. After ultrasonic disintegration of the cells, inclusion bodies were isolated by centrifugation at 18,000g. A pellet containing wild type NFL protein (NFL WT) protein or its mutant forms was dissolved in a 20 mM Tris/HCl buffer, pH 8.0, containing 8 M urea, 2 mM EGTA, 15 mM β-ME, 0.1 mM PMSF (buffer A) and subjected to ultracentrifugation at 105,000g. Supernatant was loaded onto a HiTrap Q column (GE Healthcare, USA) equilibrated with buffer A. Elution was carried out with a linear NaCl gradient (0-0.7 M NaCl). Protein concentration was determined spectrophotometrically at 280 nm by using extinction coefficient E1% of 5.9 cm–1. Purified proteins were stored at –80°C. Purity of the obtained preparations was no less than 95% (Fig. 2).

Protein renaturation was performed before the experiments by overnight dialysis against a 5 mM Hepes/NaOH buffer, pH 8.0, containing 0.5 mM EGTA and 2 mM DTT at 4°C.

Fig. 2.
figure 2

Analysis of purity of WT NFL and mutants preparations performed by SDS-PAGE.

Analytical ultracentrifugation (AUC). Sedimentation velocity (SV) experiments were carried out in an analytical ultracentrifuge, model E (Beckman, USA), equipped with absorption optics, photoelectric scanner, monochromator, and online computer. Analytical ultracentrifugation was performed at a protein concentration of 1 mg/ml in a 5 mM Tris buffer, pH 8.0, containing 0.5 mM EGTA and 2 mM DTT. An An-G Ti titanium rotor and two-sector cells were used in the experiments. Sedimentation profiles were recorded by measuring optical density at 280 nm. All cells were scanned simultaneously with an interval of 2.5 min. Differential distributions of sedimentation coefficients [ls-g*(s) vs s] were determined at 25°C using the SEDFIT program [32].

Circular dichroism spectroscopy (CD). Circular dichroism spectra of NFL proteins were recorded in the range of 190-280 nm at 5°C with a Chirascan CD spectrometer (Applied Photophysics, England) using cuvettes with an optical path length 0.02 cm. Signal registration time for each wavelength was 5 s. At least five CD spectra were recorded for each sample. All experiments were carried out at a protein concentration of 1 mg/ml in a 5 mM Hepes buffer, pH 8.0, containing 0.5 mM EGTA and 2 mM DTT. Temperature dependences of thermal denaturation of proteins were recorded under constant heating rate of 1°C/min in the range 5-85°C at a wavelength of 222 nm.

Differential scanning calorimetry (DSC). DSC experiments were carried out by using a differential scanning microcalorimeter MicroCal VP-Capillary (Malvern Instruments, USA) with capillary measuring cells. Samples were heated at a constant rate of 1°C/min from 10 to 85°C. All experiments were carried out using protein concentration of 2 mg/ml in a 5 mM Hepes buffer, pH 8.0, containing 0.5 mM EGTA and 2 mM DTT. At least three recording were performed for each protein. To study reversibility of denaturation, protein preparations were subjected to two consecutive heatings. Deconvolution procedure, i.e., decomposition of heat absorption curves into separate thermal transitions (calorimetric domains) was carried out with the Origin 7.0 program by fitting data to a model for analysis of multi-domain proteins (a non-two-state model) [33]. This mathematical approach, based on the classical work of Freire and Biltonen [33], was incorporated into the software Origin 7.0 as a “DSC module”. This module allows not only deconvolution of DSC curves obtained for the reversibly denaturing proteins, but also a number of other important data processing procedures. For example, it allows to construct (and subsequently subtract from the heat capacity curve) the so-called chemical baseline, connecting the states of native and fully denatured protein. It also makes it possible to accurately determine maximum temperature (Tm) for each thermal transition, and its calorimetric enthalpy (ΔHcal), which represents the integral of excess heat capacity over temperature within the denaturation temperature range of one calorimetric domain or an entire molecule.

RESULTS AND DISCUSSION

NFL protein belongs to the IF family of proteins and has coiled-coil regions which are characteristic for the proteins of this family. However, a distinctive feature of IF proteins is their ability to self-assembly and form various oligomers. For the bovine NFL protein, formation of low molecular weight oligomers (tetramers) was previously shown in a buffer solution with low ionic strength [34]. Sedimentation coefficient of the human NFL WT protein oligomers, obtained under similar conditions by the analytical sedimentation method, was 7.3 S (Fig. 3). Previously, using equilibrium ultracentrifugation, molecular weights of the oligomers of other IFs, vimentin and desmin, obtained under similar conditions [35], were determined. It was shown that at low ionic strength (5 mM Tris-HCl) molecular weight of the oligomers corresponds to tetramers of these proteins. However, sedimentation coefficients of vimentin and desmin tetramers significantly depend on pH, which indicated a different degree of rigidity, size, or shape of the formed oligomers. At pH 8, the sedimentation coefficient of vimentin tetramers was 4.7 S, while at pH 7.5 it was 5.5 S [35]. It was shown in the subsequent work by same authors that the changes in experimental conditions (in particular, increase of protein concentration to 0.5 mg/ml and the addition of EDTA), caused increase of the vimentin sedimentation coefficient up to 7.2 S [36]. This value almost did not differ from the value of 7.3 S determined in our work for the NFL at concentration of 1 mg/ml (Fig. 3).

Fig. 3.
figure 3

Analysis of oligomeric state of the NFL WT (1 mg/ml) by analytical ultracentrifugation. Distribution of sedimentation coefficients ls-g*(s) is presented. The rotor speed was 48,000 rpm.

Taking into account predominance of the main peak 7.3 S in the distribution of ls-g*(s) of NFL WT (Fig. 3), it can be concluded that under the used conditions, this protein was predominantly in the form of oligomers corresponding to tetramers of the NFL protein. Presence of minor amounts of other oligomeric forms with sedimentation coefficients of 5.2, 5.8, or 13.2 S (each form <2%) was also observed. At present, we can only assume that the tetrameric state is also preserved for the mutant forms of NFL (convincing proof of this assumption requires additional experiments, which are planned to be carried out in future).

Nevertheless, we can confidently state that under the used conditions neither NFL WT nor its mutants formed high-molecular-weight oligomers (ULF or filaments), which was confirmed by the complete absence of pellets after high-speed (at 105,000g) centrifugation (data not presented). It is important to note that the tetrameric form of IF can exist not only in vitro, but also in vivo [37], which allows us to consider it as a minimal structure for studying the properties of NFL.

Recorded CD spectra of the WT NFL and all mutant proteins showed presence of the negative maxima of molar ellipticity at 208 and 222 nm, which are characteristic of α-helical proteins (Fig. 4). According to the DichroWeb program (http://dichroweb.cryst.bbk.ac.uk/html/home.shtml), the NFL structure is represented by approximately 52-56% of α-helical regions.

Fig. 4.
figure 4

Circular dichroism spectra of NFL WT and its mutants with substitutions E90K, N98S, and A149V. All spectra were recorded at 5°C.

First of all, we studied thermal denaturation of the whole NFL WT molecule and its mutant forms, by monitoring the change in molar ellipticity at 222 nm in the temperature range of 5-85°C (Fig. 5a). It turned out that melting of the NFL WT molecule occurs gradually and has several pronounced thermal transitions (Fig. 5, a-c). By using first derivative of the thermal dependence presented in Fig. 5a, we identified the presence of a thermal transition at ~38°C and a higher temperature “shoulder” (Fig. 5c). Thermal denaturation of the mutant protein with the N98S mutation is very similar to that of the wild-type protein, and also has a pronounced transition at ~39°C. For the mutant protein with the E90K substitution, we found thermal transitions at ~42°C and ~55-57°C. For the NFL protein with the A149V mutation, thermal transitions were detected at ~35°C and ~42°C (Fig. 5c). At the same time, all NFL proteins are characterized by the reversible denaturation. The CD spectra of these proteins after cooling showed the same negative maxima at 208 and 222 nm characteristic for α-helical proteins.

Fig. 5.
figure 5

Thermal denaturation of the NFL WT protein and its E90K, N98S, and A149V mutants, monitored via CD spectroscopy. a) Temperature dependences of molar ellipticity of NFL WT and mutant proteins recorded at 222 nm. b) Temperature dependences of the α-helix content in the NFL molecule. Helicity of the NFL molecule at 5°C, corresponding to ~50% of the protein helicity, was taken as 100%. c) First derivatives of the temperature dependences of ellipticity presented in panel (a).

In order to accurately identify thermal transitions in the NFL molecule, we used the DSC method, which allows recording temperature dependences of the excess heat capacity of the protein molecule during its thermal denaturation. NFL melting was completely reversible, which allowed computer-assisted deconvolution of the excess heat capacity curves of the NFL WT and its mutant forms, i.e., decomposition of these curves into separate thermal transitions (calorimetric domains), reflecting denaturation of various protein regions, which melt cooperatively and independently of each other. The obtained data are shown in Fig. 6 and in Table 2.

Fig. 6.
figure 6

Temperature dependences of excess heat capacity of the NFL WT protein and its point mutants obtained by DSC. Solid lines show experimental curves of the excess heat capacity, and dotted lines represent calorimetric domains identified by deconvolution of these curves. Numbering of the domains in the figure corresponds to their numbers in Table 2.

Table 2 Calorimetric parameters obtained by DSC for individual thermal transitions (calorimetric domains) of the wild-type NFL protein and its mutant forms*

Based on the results of deconvolution for the WT NFL, 4 calorimetric domains with half-transition temperatures at 33.2°C, 41.7°C, 53.6°C, and 63.2°C were determined. The first derivative graphs of the change in molar ellipticity at 222 nm (Fig. 5c) and the temperature dependences of the excess heat capacity of the WT NFL protein and its mutants (Fig. 6) are, in general, very similar and demonstrate the presence of identical thermal transitions. The CD method makes it possible to register denaturation of the NFL α-helices, while the DSC method detects denaturation of the coiled-coil structure. Similarity of the temperature dependences obtained by these two methods indicates that the destruction of the coiled-coil structure occurs simultaneously with the melting of α-helices.

For all the mutant proteins, except for NFL E90K, 4 calorimetric domains were also identified. The NFL E90K protein did not show any calorimetric domain 1 (Fig. 6, Table 2). It is important to note that, despite the fact that all amino acid substitutions studied in this work are located in coils 1A and 1B (Fig. 1), the changes of calorimetric domains occurred only in the case of one amino acid substitution, E90K. Since this substitution is located in the coil 1A (Fig. 1), it can be assumed that the calorimetric domain 1 reflects denaturation of this region of the NFL molecule. Introduction of other amino acid substitutions (N98S and A149V) changed enthalpy of the individual calorimetric domains, but did not change their number. The smallest changes in the half-transition temperature were observed for the calorimetric domains 2 and 4. For the NFL WT protein and its mutants, the Tm value of calorimetric domain 2 is in the range of 40.7-41.7°C, and for the domain 4 – in the range of 62.5-63.2°C. At the same time, introduction of the mutations caused noticeable changes in the calorimetric domain 3. In particular, the Tm value of this domain in the case of the wild-type protein is 53.6°C, while for the mutant protein with the A149V substitution it is much lower, 47.5°C.

The main changes in the thermal denaturation of NFL caused by mutations affect calorimetric domains 1 and 3. To explain possible reasons for the structural changes that may occur upon amino acid substitutions in NFL protein, we examined the structure of IF.

Primary structure of the coil regions of the IF family proteins demonstrates distribution of amino acid residues in the heptad repeats that is typical for all such proteins [26]. In addition, each amino acid residue plays a specific role in stabilization of the coiled-coil structure [2638]. For the vimentin protein atomic structures of the individual fragments were previously obtained and distribution of amino acid residues in the coils 1A and 1B was determined (Fig. 7) [3]. Alignment of the primary structures of vimentin and NFL showed that there is high homology between their coils 1A and 1B. Hence, we assumed that the amino acid residues in the heptads of NFL coiled-coil regions would correspond to the same positions (a-g) as in vimentin. If this is true, then E90 is at position d, N98 – at position e, and A149 – at position g (Fig. 7). The E90K substitution, found in the patients with Charcot–Marie–Tooth neuropathy, leads to disappearance of the calorimetric domain 1 on the NFL thermogram (Fig. 6, Table 2), which may indicate changes in the structure of coil 1A, for example, its higher stabilization or, conversely, destabilization. It was shown, that amino acid positions a and d of the coiled-coil proteins should contain hydrophobic amino acid residues that stabilize the coiled-coil structure. However, for many intermediate filament proteins (vimentin, glial fibrillar protein, peripherin, desmin), the position homologous to E90 in the NFL also contains an amino acid residue E [3]. At the same time, for the remaining residues in the coil 1A, the positions a and d contain hydrophobic amino acid residues (Fig. 7a). Despite the fact that the E90 amino acid residue is not hydrophobic, it also seems to play an important role in stabilization of the coiled-coil structure of the coil 1A due to electrostatic interactions, since it corresponds to R87 at the position d in the hydrophobic core (Fig. 7b). Introduction of the E90K substitution should lead to a local gap between the two α-helices due to appearance of the additional positive charge K90, which causes destabilization of this region in the protein structure. It is important to note that the members of the IF family exhibit capacity to polymerization, and it has been shown in a number of studies that the coil 1A plays an important role in the longitudinal polymerization of IF (i.e., formation of extended filaments). For this coil region of the IF proteins, the lowest melting temperature, ~30°C, was found, and it was suggested that low thermal stability of the 1A coil domain is important for IF polymerization [39-41]. For vimentin, a single amino acid substitution located at the beginning of the linker 1 (K139C) was previously found to cause temperature dependent polymerization. In particular, the protein with the K139C substitution quickly formed ULF both at 21°C and 37°C, however, at 21°C it was not able to form extended fibrils [35]. In this case, it can be expected that the amino acid substitutions that affect thermal stability of NFL could also affect its ability to polymerize. Studies of the NFL E90K mutant polymerization in the SW13 Vim cells showed that this protein forms thin short filaments, in contrast to the wild-type protein, which forms long extended filaments under the same conditions [21]. Comparison of these literature data [21] with our results (Fig. 6, Table 2) indicates that the changes in thermal stability induced by the E90K mutation apparently occur in that domain (coil 1A, Fig. 1), which is responsible for NFL polymerization and filament formation.

Fig. 7.
figure 7

Analysis of the position of amino acid residues in the heptad repeats of the NFL protein. a) Alignment of primary structures of vimentin (UniProt P08670) and NFL proteins (UniProt P07196). Gray color indicates mutations in the NFL protein that are associated with the development of Charcot–Marie–Tooth neuropathy. Position of amino acids in the heptads is denoted by letters a-g and was determined based on the article by Chernyatina et al. [3]. b) Analysis of the position of amino acid residues in the NFL WT and NFL E90K heptads made by using the Waggawagga program [43]. Numbers 1-5 denote various heptads.

However, it should be noted, that in the case of IFs, it is also important to take into account their ability for lateral polymerization, which is achieved due to antiparallel interaction of the dimers. For IF, several types of interactions of dimers in the tetramer were found: A11, A12, and A22 [3638], which differ in the interacting coils (for example, in the case of A11, the dimers interact antiparallel through the coil 1B). Despite the fact that all three types of interaction are present in the mature IF filament, at the initial stages of polymerization, up to ULF, interaction of the A11 type is predominant [35].

Coil 1B plays the main role in the formation of IF tetramers; however, coil 1A and the beginning of coil 2 are also important for the formation of tetramers [342]. This fact can explain the changes in the Tm value of the calorimetric domains 1 and 3, as well as the changes in the enthalpy (ΔHcal) of all calorimetric domains caused by mutations. N98S and A149V mutations led to the changes in the NFL thermal denaturation, however, in both these cases, all four calorimetric domains were retained. This apparently indicates that these mutations do not have such a serious effect on stabilization of the NFL coiled-coil structure. This can be explained by to the fact that these amino acid residues are located in the e and g positions of the heptad repeats, which are on the surface of coiled-coil structure. Pathogenic properties of these NFL mutations seem to involve disruption of the interaction between the tetramers during formation of ULF and filaments. In particular, aggregation of the NFL N98S protein was previously shown in the SW13 Vim cells [21].

CONCLUSION

It can be concluded that point mutations in the NFL protein found in the patients with peripheral Charcot–Marie–Tooth neuropathy do not affect the ability of this protein to form a coiled-coil structure. For all studied mutant proteins with E90K, N98S, and A149V substitutions, presence of the α-helical structure was determined by CD analysis. At the same time, these mutations affect stability and thermal denaturation of the NFL molecule, leading either to the decrease in the number of calorimetric domains detected by DSC, as in the case of the E90K substitution, or to the noticeable changes in the transition temperature (Tm) of the calorimetric domain 3, or to the changes in the calorimetric enthalpy (ΔHcal) of the domains.

In general, the obtained data indicate that the E90K mutation apparently leads to destabilization of the coil 1A in the NFL molecule. As for the N98S and A149V mutations, it can be assumed that they do not affect the coiled-coil structure but might affect other levels of the NFL molecules organization.