Keywords

2.1 Introduction

In eukaryotic cells, ER is the site for synthesis, maturation and quality control of secreted, membrane-bound and ER-resident proteins (Halperin et al. 2014; Kepp and Galluzzi 2020). This process involves translation of mRNAs on ER membrane-bound ribosomes, translocation of the nascent polypeptide across the ER membrane and co-translational folding to the native three-dimensional structure. To assist in these processes, the ER contains elaborate systems involved in post-translational modification and quality control of newly synthesized proteins (Vincenz-Donnelly L and Hipp MS 2017; Halperin et al. 2014). An important protein quality control system in the ER is constituted by the chaperones calreticulin (Calr) and calnexin (Canx), which act in concert with protein disulfide isomerases (e.g. Pdia3/ERp57) to facilitate the synthesis and maturation of proteins, and to prevent accumulation of misfolded proteins (Bergeron et al. 1994; Michalak et al. 1998; Coe and Michalak 2010; Halperin et al. 2014; Matsusaki et al. 2020). Canx is a transmembrane protein, with its major part in the ER lumen, while Calr is an ER luminal protein (Bergeron et al. 1994; Wada et al. 1995; Michalak et al. 1998; Danilczyk et al. 2000). In humans, these proteins are encoded by separate genes with 9 (Calr) or 20 (Canx) exons (McCauliffe et al. 1992; Tjoelker et al. 1994). In mammals, Calr and Canx furthermore play important roles in the loading of peptides on MHC class I (Blees et al. 2017). This chapter reviews current knowledge on the structure of human Calr (hCalr).

2.2 Primary Structures of Calr and Related Proteins

Analyses of Canx and Calr amino acid sequences have revealed that they are evolutionarily conserved proteins encoded by single genes. In the testicles a Canx homologue, calmegin (Clgn) and a Calr homologue, calsperin (Calr3) are also present (Ohsako et al. 1994; Watanabe et al. 1994; Ikawa et al. 2011). In plants, several Calr isoforms are encoded by separate genes (Jia et al. 2009; Del Bem 2011; Wasąg et al. 2019). In yeasts, a Canx homologue is present (de Virgilio et al. 1993; Parlati et al. 1995).

Figure 2.1 shows an alignment of amino acid sequences of selected Calr/Canx family members. Common to both Canx and Calr is an N-terminal domain of approximately 200 amino acid residues, followed by a proline-rich domain of approximately 100 (Calr and Calr3) or 145 (Canx and Clgn) amino acid residues, which is followed by a C-terminal domain. In Calr (and Calr3) the C-terminal (C) domain is highly acidic and consists of approximately 100 amino acids ending in an ER retrieval sequence of four residues (KDEL in hCalr). In Canx (and Clgn) the C-terminal domain has a transmembrane stretch of 20 amino acids and a cytoplasmic “tail” of approximately 90 amino acids.

Fig. 2.1
figure 1figure 1

Alignment of Calrs and related proteins from various species. (a) Calr E. histolytica XP_655241; (b) Calr D. discoideum Q23858; (c) Calr M. musculus P14211; (d) Calr R. norvegicus P18418; (e) Calr H. sapiens P27797; (f) Calr C. elegans P27798; (g) Calr3 H. sapiens Q96L12; (h) Calr T. cruzi XP_812571; (i) Canx C. lupus P24643; (j) Canx H. sapiens P27824; (k) Clgn H. sapiens O14967. The alignment was carried out using Clustal 2.1 and default parameters. Coloring: core region—blue, P-domain—yellow. C-domain—green

Relatively few residues are invariant among all Calr and related proteins (Fig. 2.1) and these mainly reside in the N-terminal (N) domain and the middle proline-rich (P) domain. Conserved stretches/clusters of amino acids in the N-domain are found in the vicinity of the conserved disulfide bridge (C106-C137 in Calr) and in the P-domain in the neighborhood of conserved W residues, being part of characteristic repeat sequences. A third interesting residue (C163 in hCalr) is conserved in all known mammalian and some other Calrs.

2.3 Higher Order Structures of Calr and Related Proteins

Table 2.1 shows a list of Calr and Canx proteins from different species with solved three dimensional (3D) structures. Figure 2.2 shows two partial structures for human Calr, plus one view of its full-length elongated structure in the context of the MHC-I peptide-loading complex. The 3D structures of Canx and Calr show close similarity but also some important differences. Both proteins have a globular core consisting of a β-sheet sandwich, where the strands are arranged in a “jelly roll” fold, and an α-helix. The sandwich is composed of two β-sheets, where the N-terminal 200 amino acids contribute 6 strands to the first, “upper” β-sheet and 6 strands to the second “lower” β-sheet. Intertwined in the two sheets are two β-strands derived from the C-terminal domain of the molecules, the first constituting the Calr β19 strand of the upper β-sheet and the second constituting the Calr β20a/b strand of the lower β-sheet (Fig. 2.2a). From the core extend the proline-rich stretches as β-hairpins (P-domain, Fig. 2.2c), stabilised by hydrogen bonds between four (Canx) or three (Calr) sets of repeat sequences. The α-helix of the globular core is longer in Calr, where it continues into the acidic tail (Fig. 2.2c), while in Canx the α-helix is followed by a transmembrane sequence and the cytoplasmic domain.

Table 2.1 Structural data available for Calr and Canx
Fig. 2.2
figure 2

H. Sapiens (Hs) Calr structures. (a) XRC core structure (Chouquet et al. 2011). The structural Ca2+ ion is shown in yellow. (b) XL-MS structure (Boelt et al. 2016). (c) HsCalr (red) in complex with MHC-I (yellow), Pdia3 (blue), tapasin (cyan) and ß2-microglobulin (green) (Blees et al. 2017). (d) Zoom on the cluster2 residues common to Calr (cyan) and Canx (green) proteins

There are several interesting characteristic features of the conserved structure of Calr. A carbohydrate-binding site on the upper concave β-sheet capable of accommodating a tetrasaccharide and having affinity for terminal glucose residues. This site contains several conserved residues (cluster 1) of importance for carbohydrate binding (Kozlov et al. 2010a, b; Chouquet et al. 2011; Moreau et al. 2016).

An exposed disulphide bridge (C106–C137 in hCalr) is located on the edge of the carbohydrate-binding site. This bridge is unusual due to its surface location and its geometry, which reveals an almost parallel, strained conformation with a sulphur atom distance of 2.8–3 Å (in contrast to normal unstrained disulphide bonds, which have a torsion angle of approximately 90° and a sulphur atom distance of 2 Å (Thornton 1981). The disulphide bridge is important for thermodynamic stability and reduceable by DTT but is not affected by alkylating agents under conditions, where the free Cys163 in hCalr can be alkylated (100 mM sodium phosphate, pH 7.2, room temperature), showing that it is stable under near-to-physiological conditions (Houen and Koch 1994; Højrup et al. 2001; Jørgensen et al. 2005), in accordance with its surface-exposed localisation at the edge of the carbohydrate-binding site. Interestingly, the disulphide bridge seems to be important for lectin activity (Kozlov et al. 2010a).

The lower β-sheet has a conserved structural Ca2+ ion-binding site (Fig. 2.2a) and also contains several residues conserved in published structures of Calr and Caln (cluster 2) and in primary sequences of Calr and related proteins (Fig. 2.1), including a triad composed of D166, H170 and D187 (DHD triad). Finally, the C-terminal α-helix contains two conserved residues including W347, close to the DHD triad (Fig. 2.2d) (Kozlov et al. 2010a, b; Chouquet et al. 2011; Moreau et al. 2016).

Chemical cross-linking in solution in combination with mass spectrometry (MS) of human Calr has confirmed the overall structure of the globular domain and has added some structural information on the parts of Calr absent in the crystallised “amputated” Calr (Fig. 2.2b). In the presence of Ca2+, the P-domain hairpin folds back on itself and associates with the C-domain, which itself forms an α-helix extending “upwards” to meet the P-domain. Together, they cover the lectin site, however, in the absence of Ca2+, they are more flexible and do not sterically limit access to the lectin site to the same extent. X-ray studies of Trypanosoma cruzi and Entamoeba histolytica Calrs have also shown for the first time a possible hinge motion at the basis of the Calr P-domain, as a switch between an open and a more compact conformation. This part of the Calr molecule differs in Canx (Moreau et al. 2016).

Structural studies of full-length Calr proteins have been hindered by its flexibility. However, its overall ordered structure could be trapped by cryo-electron microscopy in the context of the MHC-I peptide-loading complex (Blees et al. 2017). In this complex, Calr still shows substantial flexibility, as compared to its structurally stable binding partners. This flexibility remains despite the fact that it interacts with an MHC-class I glycan through its carbohydrate-binding site, with Pdia3 through the tip of its P-arm and with tapasin through its acidic tail.

The overall structure of Calr and Canx indicated by the structural data for various amputated forms of the molecules has been confirmed by several studies with other methods including electron cryo-microscopy (ECM), small angle X-ray scattering (SAXS) and nuclear magnetic resonance (NMR) spectroscopy (Table 2.1). Both molecules have a stable globular beta sandwich core with an α helix at the bottom. From the core protrudes a β-hairpin P-domain and a C-domain, which in the presence of Ca2+ make contacts and cover the lectin site.

2.4 Calr Structure-Function Relationships

The thermodynamic stability of Calr is moderate and depends on the general packing of the core region, the conserved, surface-exposed disulphide bridge and Ca2+ association (Jørgensen et al. 2005; Duus et al. 2013). Calr and Canx are chaperones for glycoprotein synthesis and several of the conserved residues in cluster 1, including the disulphide bridge, have been shown to be involved in lectin activity of Calr/Canx (Kozlov et al. 2010a, b). These residues are also strikingly conserved in the parasite Calrs (T. cruzi, E. histolytica) (Moreau et al. 2016). A putative polypeptide binding site has been located at the edge of the carbohydrate-binding upper β-sheet and Calr has been shown to interact with several non-glycosylated proteins (Duus et al. 2009; Pocanschi et al. 2011; Chouquet et al. 2011; Møllegaard et al. 2011; Moreau et al. 2016).

Calr has been implicated in several diseases including cancers, autoimmune diseases, and others (Tesniere et al. 2008; Gold et al. 2010; Zamanian et al. 2013; Wiersma et al. 2015; Eggleton et al. 2016; Schcolnik-Cabrera et al. 2019; Houen 2019). Most evident in relation to structure are somatic alterations seen in myeloproliferative neoplasms (MPNs), including polycythemia vera, myelofibrosis and essential thrombocythemia (Klampfl et al. 2013; Nangalia et al. 2013; Pietra et al. 2016; Imai et al. 2017). These alterations are mainly insertions and deletions (INDELS) in exon 9 leading to a common polybasic stretch of amino acids in the C-terminus, in close conjunction to the α-helix in the C-domain. Since these INDELS have only been detected in MPNs, it can be speculated that the frame-shifted Calr forms allow the affected myeloid precursor cells to remain viable, to grow and differentiate to some degree and to exhibit neoplastic properties. The structural polybasic alterations of the C-terminus can be speculated to mimic the binding of Ca2+ to the acidic C-terminus of wild type Calr, allowing the protein to remain stable and viable, but imparting some new properties on the molecule (i.e. an oncogenic alteration). In agreement with this, the frameshifted Calr has been shown to interact with the thrombopoietin receptor, which itself can cause MPN upon mutation and which signals through a pathway involving the Janus kinase (JAK), mutations of which is the most prominent cause of MPN (Vainchenker et al. 2019).

In relation to cancer in general, Calr has been shown to translocate to the cell surface in response to various forms of cellular stress, where it may serve as a signal for immune recognition (Tesniere et al. 2008; Wiersma et al. 2015; Eggleton et al. 2016; Schcolnik-Cabrera et al. 2019). However, it is presently unclear, whether the various involvements of Calr in cell transformation and immune surveillance are related to structural alterations or to changes in concentration and subcellular localization.

2.5 Summary Points

  • Calr and Canx are conserved Ca2+-binding molecular chaperones in eucaryotes.

  • Both Calr and Canx have a globular β-sandwich/α-helix core with a structural Ca2+ ion. From the core extends a characteristic β-hairpin P-domain and a C-domain. In Calr, the C-domain is acidic and has a large capacity for Ca2+ binding. In Canx, the C-domain has a transmembrane stretch and a cytoplasmic domain.

  • The upper β-sheet of the globular core sandwich has a carbohydrate-binding site and a disulphide bridge at the edge, which both are involved in carbohydrate interactions. A polypeptide binding site is located at the edge of the carbohydrate-binding site but has not been precisely mapped yet.

  • The lower β-sheet of the globular sandwich has a conserved D166-H170-D187 triad, a free C163 and several other conserved residues, including Y172 (hCalr numbering) (Fig. 2.2d).

  • The C-terminal α-helix at the bottom of the α-helix core has a conserved W347 residue facing the D166-H170-D187 triad (hCalr numbering) (Fig. 2.2d).

  • The structure of Calr/Canx proteins reflects their multiple functions and interaction partners, including nascent (glycosylated) proteins, cyclophilin, Pdia3, and others, as illustrated by the role of Calr and Canx in the peptide loading complex.

  • Characteristic mutations in the exon coding for the extreme C-terminus of hCalr changes the polyacidic Ca2+-binding sequence to a polybasic sequence.

2.6 Future Issues

  • Detailed structural analysis of complete Calr/Canx proteins in complex with substrates and binding partners.

  • Detailed analysis of the role of individual amino acids in the thermodynamic stability and functions of Calr and Canx.

  • Detailed understanding of the role of the polybasic sequence in frameshifted Calr variants involved in myeloproliferative neoplasms.

  • Detailed understanding of cell surface functions of Calr.