Keywords

1 Introduction

Basement membranes (BMs) are evolutionary ancient and highly conserved sheet-like structures of extracellular matrix (ECM) that coordinate cells, direct their polarization, and play a formative role in shaping tissues (Morrissey and Sherwood 2015). In adult as all animals are multicellular, BMs serve structural and protective functions by providing signaling stimuli for cell behaviors and tissue regeneration (Jayadev and Sherwood 2017; Morrissey and Sherwood 2015). Prominent cases of highly specialized BMs are glomerular basement membrane (GBM), which provides ultrafiltration in the kidney (Gunwar et al. 1998), lens capsule basement membrane (LBM), which acts as anchor points for lens cells during migration and proliferation (Lu et al. 2008), and seminiferous tubule basement membrane, which plays a role in spermatogenesis (Kahsai et al. 1997). Type IV collagen is a structural scaffold in all BMs that integrates other ECM components, such as laminins, nidogens and proteoglycans (Fidler et al. 2017).

Genetic variants in collagen IV genes, COL4A1-6, cause BM pathologies in various tissues. Central nervous system and cerebral vasculature are primary targets of pathogenic variants in COL4A1 and COL4A2 genes, while pathogenic variants in COL4A3-5 genes almost exclusively lead to kidney diseases, hearing defects and ocular abnormalities (Cosgrove and Liu 2017; Meuwissen et al. 2015; Stokman et al. 2016; Wang et al. 2018). Whereas connection between gene variant and resulting abnormalities is firmly established in most cases, the mechanism of pathogenesis remains unclear. To provide better insight into why and how these diseases develop, it is essential to understand molecular mechanisms of collagen IV scaffold assembly.

2 Structural Organization of Collagen IV

Collagen IV was discovered in kidney GBM over 50 years ago in the seminal works of Spiro and Kefalides groups (Spiro 1967; Kefalides 1968) as a novel crosslinked molecule with unusual properties. Structural organization of the collagen IV scaffold was later analyzed using acid extraction and limited proteolysis of tumor BM, which led to a discovery of hexamer and dodecamer oligomers (Fig. 1a) (Timpl et al. 1981). Subsequently, two key structural elements responsible for forming each of these complexes, the 7S dodecamer and the NC1 hexamer (Fig. 1a), were isolated using proteolytic enzymes (Weber et al. 1984; Risteli et al. 1980; Timpl et al. 1979). These and other functional domains of collagen IV can be extracted and purified to homogeneity from different tissues, tumors, and matrix deposited by cultured cells (Boudko et al. 2018). The type IV collagen family has six genetically distinct α-chains designated α1 to α6 (Fig. 1b). Each α-chain contains an N-terminal 7S region, collagenous domain, and C-terminal NC1 domain. Initially, three α chains form collagen IV protomer that further assembles into the scaffolds composed of the hexameric and dodecameric assemblies. Studies of the NC1 hexamers extracted from tissues by treatment with collagenase (Fig. 1c) led to discoveries of three compositions of NC1 hexamers: α121, α345, and α121/α556 as well as a new type of covalent cross-link, the sulfilimine bond, stabilizing these hexamers. (Fig. 1d) (Hudson et al. 2003; Hudson et al. 1994; Borza et al. 2001; Boutaud et al. 2000; Vanacore et al. 2009).

Fig. 1
figure 1

Collagen IV. (a) Two key assemblies, NC1-to-NC1 hexamer formation and 7S dodecameric assembly drive the formation of collagen IV scaffolds that are further re-enforced by lateral interactions. (b) The six α-chains, α1-α6, of collagen IV found in humans. (c) The non-collagenous (NC1) domain can be solubilized from tissues and matrix deposited by cultured cells after collagenase treatment. (d) The three reported types of collagen IV scaffolds based on analysis of solubilized NC1 hexamers. The NC1 hexamers are often re-enforced by sulfilimine bonds between two protomers

3 Approaches to Overcome Hurdles to Study NC1 Hexamer Assembly

Collagen IV is notoriously insoluble in tissue due to extensive cross-linking, though this hurdle can be partially resolved by applying β-aminoproprionitrile, a lathrytic agent that prevents the formation of cross-links, during animal development or tumor growth. Although full-length collagen IV protomers can be extracted from animal tissues or cell cultures, they are quite challenging to study under physiological conditions as they naturally tend to aggregate (Bachinger et al. 1982). These challenges make full-length protomers prohibitively difficult to use to study NC1 hexamer assembly. Currently, there are three approaches available to bypass full-length protomers when studying hexamer formation (Fig. 2). The first approach (Fig. 2a) utilizes either recombinantly produced NC1 monomers (Casino et al. 2018) or collagenase liberated native NC1 from bovine LBM or PFHR9 cell culture (Bhave et al. 2012). These hexamers dissociate into monomers upon removal of Cl ions using dialysis or desalting columns. These monomers can then be analyzed in hexamer assembly assays by addition of Cl. This approach though does not reflect natural hexamer formation as it combines two steps, trimerization and hexamerization, into one process. The second approach (Fig. 2b) utilizes recombinant constructs consisting of the NC1 domain and an N-terminal stretch of 28 GXY repeats essential for forming a stable triple helix (Cummings et al. 2016). These constructs form trimeric molecules in the absence of Cl ions and thus, can be used for direct hexamer assembly experiments. The third approach (Fig. 2c) utilizes single-chain recombinant technology in which all three chains of the NC1 domain are tied together in a desired domain composition and registry to form a stable, single polypeptide chain NC1 trimer (NC1sc) (Pedchenko et al. 2019). All three approaches bear unique advantages and have certain limitations, but collectively provide tools for studying assembly of the NC1 hexamer.

Fig. 2
figure 2

Experimental models of collagen IV hexamer assembly. (a) NC1 monomers from either recombinant expression or from collagenase liberated NC1 extracted from bovine lens capsules or murine PFHR9 cell culture matrix. (b) Recombinant truncated protomers containing 28 GXY repeats necessary to form the triple helix and the whole NC1 domain. (c) Recombinant single-chain NC1 trimers. Three NC1 chains are linked by artificial 3-residue linkers (shown in cyan)

4 Role of Cl in NC1 Hexamer Assembly

The initial clue for how NC1 domains assemble into hexamers was provided by crystal structures of human and bovine NC1α121 hexamers that presented acetate, Br, Cl, Ca2+, and K+ coordinated by residues from two opposite trimers (Sundaramoorthy et al. 2002; Than et al. 2002; Vanacore et al. 2004). These and chemically similar ions were extensively studied for their abilities to trigger or facilitate the hexamer assembly (Cummings et al. 2016). Neither Ca2+ or K+ were able to induce the hexamer formation without presence of negatively charged ions. In contrast, halides were able to induce the hexamer assembly with the following relative efficiency: Br > Cl > > I > > F, where F had negligible effect. Whereas Br was the most efficient in initiating hexamerization, Cl was the only ion that could efficiently assemble NC1 hexamers at its physiologically relevant extracellular concentration (Cummings et al. 2016). Interestingly, addition of 1 mM Ca2+ in the presence of physiologically relevant (100 mM) Cl further increased efficiency of hexamerization suggesting complementary effect of divalent cations in this process (Cummings et al. 2016).

The role of Cl in assembly and stability of the hexamer can be illustrated in several ways. Size-exclusion chromatography demonstrates assembly (Fig. 3a, left) and disassembly (Fig. 3a, right) of the hexamer by the presence of hexamer or trimer peaks in elution profiles upon addition or removal of Cl (Pedchenko et al. 2019). Atomic force microscopy was found to be a very illustrative technique to assess the presence of hexamers in a sample (Fig. 3b) (Pedchenko et al. 2019). When a sample with NC1 trimers was imaged, individual randomly dispersed particles were seen (Fig. 3b, left). Surprisingly, when a sample with NC1 hexamers was imaged, pairs of individual particles were seen (Fig. 3b, right). Explanation of this phenomenon can be found in the method of sample preparation. After absorbing the NC1 hexamers onto the mica surface using the working solution (with enough Cl concentration), the surface is briefly washed with water and immediately air dried for subsequent imaging. This short washing step removes Cl and thus, initiates dissociation into trimers, which remain non-covalently absorbed to the surface.

Fig. 3
figure 3

Cl concentration affects collagen IV. (a) Elution profiles from size-exclusion chromatography demonstrate that single-chain NC1 trimers (red line) associate into hexamers (blue line) at high Cl concentration (left) but dissociate back to trimers when Cl are replaced with acetate (right). (b) Atomic force microscopy of single-chain NC1 trimers (left) or hexamers (right). Hexamers dissociate into pairs of trimers after washing the sample with water, which initiates Cl removal from the NC1. (c) Effect of Cl on the quality of deposited basement membrane-like matrix by cultured PFHR9 cells as visualized by transmission electron microscopy. Left, cells were grown using normal medium with physiological Cl concentration for 10 days. Right, cells were grown in the medium with low Cl concentration for 10 days. (d) Effects of medium changes on the matrix. Left, initially cells were grown in the medium with low Cl concentration for 5 days and then in the normal medium for additional 5 days. Right, initially cells were grown in the normal medium for 5 days and then switched to low Cl concentration for additional 5 days. Overall, the matrix deposited under low Cl concentration is less dense and a boarder is noticeable between two types of matrix. When cells were initially grown in the low Cl concentration the matrix was partially restored (patches of dense matrix) after switching to the normal medium. Arrows indicate direction of basement membrane-like matrix deposition over time

Depletion of Cl had drastic effect on quantity and quality of the growing BM as visualized in cell culture experiments with varying Cl concentration in the medium (Cummings et al. 2016). At low Cl concentration (~5 mM) cells deposit disorganized matrix with lesser density compared to normal conditions (Fig. 3c). Switching from low to normal Cl concentration partially repairs the abnormal BM (Fig. 3d, left panel). Meanwhile, switching from normal to low Cl concentration seems to not significantly disrupt BM deposited under normal conditions (Fig. 3d, right panel), suggesting stabilization of the NC1 hexamer through sulfilimine cross-linking (Bhave et al. 2012; Vanacore et al. 2009) and possibly complexation with other macromolecules in the BM. Nevertheless, Cl depletion can disrupt NC1 hexamer assembly, and may disrupt yet unknown macromolecular complexes within BM.

The development of a single-chain NC1 trimer as a tool to study the assembly of the NC1 hexamer (Fig. 2c) allowed quantitative analysis of this process and establishment of a kinetics model (Pedchenko et al. 2019). Plotting hexamer formation versus Cl concentration results in a sigmoidal curve (Fig. 4a). Cl concentrations below 10 mM (intracellularly relevant) appear to prohibit hexamer formation, whereas at 100 mM (extracellular concentration), the fraction of hexamer formed approaches a plateau. This titration curve surprisingly fits the two extremes of the Cl step gradient across the cell membrane (e.g. 7 mM inside the cell and 100 mM outside the cell (Andersen 2013; Armstrong 2003)) and suggests a major role for Cl in collagen IV scaffold assembly outside the cell. Experimentally measured kinetics of the hexamer assembly at high Cl concentration and protein concentration dependence revealed that this Cl ion driven process fits a simple bimolecular reaction model with the rate constant ka = 3.45 ± 0.12 M−1 s−1 (Pedchenko et al. 2019). Projections of the hexamer assembly demonstrate slow kinetics with half-times ranging from 20 min to 7 h at 25 to 1 mg/ml protein concentration (Fig. 4b). The correlation between this in vitro data and the in vivo processes remains unknown.

Fig. 4
figure 4

Cl titration and kinetics of the hexamer assembly. (a) Cl titration curve, adapted from (Pedchenko et al. 2019). The amount of the hexamer formed depends on Cl concentration. The red dot indicates approximate intracellular Cl concentration, while the green dot indicates approximate extracellular Cl concentration under normal physiological conditions. (b) Calculated kinetics of hexamer formation at various protein concentrations, based on experimentally defined mechanism (Pedchenko et al. 2019)

5 Cl Ions Are Structural Components of the NC1 Hexamer

Recently reported crystal structures of Cl bound hexamers NC1sc-α121 (Fig. 2c), NC1α111, NC1α333, and NC1α555 finally revealed the number and positions of Cl within each hexamer (Casino et al. 2018; Pedchenko et al. 2019). The hexamers were specifically held at high Cl concentration prior and during crystallization to assure complete saturation of the structure with the Cl ions (Pedchenko et al. 2019). Each of these hexamer structures revealed 12 Cl ions at the trimer-trimer interface as shown for the α121 composition (Fig. 5). These 12 ions form a Cl ring, composed of two groups of 6, based on their structural environment.

Fig. 5
figure 5

Cl at the hexamer interface of NC1α121. Cartoon representation of the crystal structure of the NC1α121 hexamer. Coloring for α1 and α2 chains are yellow and magenta, respectively. Spaced-out image reveals 12 Cl (shown as spheres) sandwiched between NC1 trimers. Rotation of the hexamer presents an interfacial view that reveals the distribution of Cl resembles a ring around the interface. The Cl can be further split into Group 1 (colored cyan) and Group 2 (colored blue) depending on the nature of their coordination. The figure was generated using coordinates of the crystal structure of NC1sc-α121 (PBD code: 6MPX)

In the following detailed descriptions of Cl environment, NC1sc-α121 and NC1α555 structures will be used as they were solved at the highest resolution. Group 1 Cl intercalate into the base of the NC1 trimers, where they are centered in loops formed by amino acids 74–78 and coordinated by these residues’ backbone atoms (Fig. 6, top panel). In addition, an arginine residue from the opposite trimer forms a salt bridge to each of these ions. Molecular dynamics simulations also suggest that binding Group 1 ions causes the re-arrangement of the R76 side chain to break intra-trimeric salt bridges in order to form inter-trimeric salt bridges that contribute to hexamer formation (Cummings et al. 2016).

Fig. 6
figure 6

Cl coordination at the interface between NC1 trimers. The top panel shows coordination of Group 1 Cl (shown as cyan spheres), while the bottom panel depicts coordination of Group 2 Cl (shown as blue spheres). The α1, α2, and α5 NC1 domains are colored yellow, magenta, and green, respectively. Water molecules are shown as red spheres. PEG molecules found near Group 2 ions are shown as gray carbon chains with red oxygens. Interactions between Cl and either water or PEG are denoted by red dashes. Blue dashes denote Cl interactions with one NC1 trimer, while black dashes denote Cl interactions with the opposite NC1 trimer. Residue numbering follows the convention used for non-single chain structures with single-chain structure numbering in parenthesis. The figure was generated using coordinates of the crystal structures of NC1sc-α121 and NC1α555 hexamers (PBD codes: 6MPX and 5NAZ)

Group 2 Cl are located closer to the equatorial plane of the hexamer (Fig. 5). Instead of intercalating into the base of the NC1 trimers, these ions sit directly at the interface and act like a bridge that connects two protomers by accepting hydrogen bonds from each NC1 trimer (Fig. 6, bottom panel). A unique aspect of the Group 2 Cl that will be discussed further in the subsequent section is the fragility of this coordination. Aliphatic C-H groups from alanine and aromatic residue side chains serve as donors for the Cl interaction. In addition, the solution outside the hexamer can donate hydrogen bonds not only from water, but also from polyethylene glycol (PEG) molecules included in the crystallization solution. Combined, these interactions represent a weak interaction network that could allow for relatively easy exchange with solvent Cl ions suggesting highly dynamic nature of Group 2 Cl coordination within the NC1 domain.

6 Surface Environment Accessibility of Cl Ions

Surface analysis of ions at the center of the NC1 hexamer presents a putative binding order and differentiation of roles between Group 1 and Group 2 Cl. In both the NC1α121 and NC1α555 hexamers, Group 1 Cl are visible only when one of the trimers is removed (Fig. 7). Hence, it appears the Group 1 Cl must bind to the NC1 before Group 2 Cl and thus, play a signaling role in assembly. It appears that Group 1 Cl ions in the NC1α555 trimer are embedded deeper into the NC1 surface compared to equivalent Cl in the NC1α121 trimer. Whether this indicates NC1α555 bind tighter to this Cl group or is also relevant to NC1α345 or NC1α121/α565 hexamers is unknown currently.

Fig. 7
figure 7

Interfacial view of the surface of NC1α121 and NC1α555 trimers as part of a hexameric assembly. As Group 1 Cl (cyan) are embedded into the NC1 trimer, they are accessed only through channels leading into the interface. Group 2 Cl, meanwhile, lie in the interface itself, appearing as though they “sit” on the NC1 surface. The triple helical collagen segments in this figure are added to orient the reader to the interface surface and are not a part of the crystal structures. Arrows extending from each box point to an expand out of the interfacial region within the box. The figure was generated using coordinates of the crystal structures of NC1sc-α121 and NC1α555 hexamers (PBD codes: 6MPX and 5NAZ)

Whereas the Group 1 Cl are sequestered inside the core of the NC1 hexamers, Group 2 Cl are readily accessible to solvent through portals in the hexamer (Fig. 8). Both structures contain polyethylene glycol (PEG) fragments, though their location in the structure leads to a curious observation. While both PEG and water molecules are found near the portals in odd-numbered NC1 alignments, only water was identified near the portals for NC1α2 (Fig. 8). Conclusions cannot be drawn from the NC1α2, α3, α4 homo-oligomer structures (PDB IDs: 5NB2, 5NB0, and 5NB1) as none were solved at sufficient resolution to observe PEG molecules included in the crystallization drop and NC1 α2, α4 structures form unusual non-hexameric assemblies. Meanwhile, the NC1α1 homo-hexamer was not crystallized in the presence of PEG molecules and no structures including NC1α6 are available. Nevertheless, observation of the position of PEG molecules raises the possibility that the surface topography of odd-numbered NC1 pairings is distinguished from the surface topography of even-numbered NC1 pairings in that it could accommodate environmental perturbations near Cl that are necessary for NC1 assembly.

Fig. 8
figure 8

Group 2 Cl are solvent accessible in the NC1 hexamers. Solvent accessible portals on the surface of the NC1 hexamers reveal Group 2 Cl (blue spheres) and unique portal environments. Group 1 Cl are not directly solvent accessible. Structured water molecules are shown as smaller red spheres, while structured PEG molecules are shown with gray carbon chains. Coloring for each NC1 domain is yellow for α1, magenta for α2, and green for α5. Only one NC1α121 trimer is found in the asymmetric unit of the crystal structure and hence, the two α1 centered faces of the hexamer are similar due to the twofold crystallographic symmetry rotation necessary to complete the hexamer. Meanwhile, only one NC1α5 domain is found in the asymmetric unit of the corresponding structure. Thus, each α5 centered face is identical due to the threefold and twofold crystallographic symmetry rotations necessary to complete the hexamer. The figure was generated using coordinates of the crystal structures of NC1sc-α121 and NC1α555 hexamers (PBD codes: 6MPX and 5NAZ)

Whereas Group 1 is solvent-accessible only in the trimer configuration (Fig. 7), Group 2 remains exposed to solvent upon hexamer assembly (Fig. 8). Thus, each Group 2 Cl, positioned in a pocket communicating to the outside through a portal, can therefore be in a dynamic equilibrium with free ions in the solution. Remarkably, the nature and geometry of this arrangement seems to be also conserved for all reported types of NC1 hexamers (Fig. 1d) suggesting a common sensing mechanism to Cl concentration.

7 Conclusions

Assembling of collagen IV structures has distinct stages both inside and outside of the cell (Fig. 9). The NC1 domain plays dual intra- and extracellular roles. Inside the cell, the domain is responsible for selecting three α chains, bringing them together and nucleating folding of triple helix in a zipper-like fashion (Soder and Poschl 2004). Outside the cell, trimeric NC1 domains from two protomers are connected by Cl to form hexamers, which can be then covalently connected by sulfilimine cross-links with the help of the peroxide reductase, peroxidasin (Fig. 1d). (Brown et al. 2017; Bhave et al. 2012; Vanacore et al. 2009). Until recently, the exact molecular mechanism of the NC1-driven collagen scaffold assembly remained unclear. Biochemical and structural data have now demonstrated that NC1 domains of collagen IV are equipped with a Cl sensing mechanism, which is capable of triggering conformational switches essential for hexamer assembly. It also appears that this mechanism has emerged early in evolution (Cummings et al. 2016; Pedchenko et al. 2019). Structural organization resulting in two groups of Cl ions suggests distinct steps take place during hexamer assembly. Group 1 ions bind to the trimer and re-organize the surface for trimer-trimer docking. Group 2 Cl bind to the formed hexamer and ultimately stabilize it (Fig. 9) (Pedchenko et al. 2019). The hexamer structure is sufficiently dynamic to require high Cl concentration in the surrounding environment as depletion of free Cl causes dissociation of the hexamer.

Fig. 9
figure 9

Collagen IV assembly steps driven by the NC1 domain. Once collagen IV chains are translated into the ER lumen the NC1 domain drives chain selection and trimerization event. Trimerization of the NC1 domain subsequently nucleates the formation of the triple helix which propagates along the molecule from the NC1 domain to the N-termini of the chains in a zipper-like fashion. After multiple post-translational modifications and quality control events, the collagen IV protomers are secreted outside the cell into a high Cl concentration environment. In the current model, once in the extracellular space the NC1 trimers undergo conformational transitions upon binding Cl (Group 1 (shown as cyan spheres) first, followed by Group 2 (shown as blue spheres)). Subsequently, physiologically high extracellular Cl concentration is critical for maintaining the hexamer structure, as Cl depletion results in dissociation of the hexamer. This hexamer is a critical step towards collagen IV scaffold assembly

Thus far there is limited structural information on the nature of the extracellular Cl binding sites, and therefore, no consensus motifs or even essential residues for Cl binding have emerged to allow searching for similar sites in other proteins (Luscher et al. 2020). Establishing Cl binding sites within the NC1 hexamer of collagen IV provides a new brick for building our understanding of role of Cl in structural organization, functional aspects, and signaling events happening in the extracellular milieu. The step gradient of Cl concentration between intracellular and extracellular space can also be exploited by other ECM proteins, yet to be discovered, as a common mechanism. Involvement of Cl concentration in control of assembly, function and signaling can also be considered for development of new types of therapies.

There are several interesting questions to be addressed in the future studies of collagen IV scaffold assembly and function related to the NC1 hexamer and Cl: (1) Does the sequence similarity between α1-α6 chains lead to the same Cl-dependent mechanism for NC1 hexamer formation for other known compositions, i.e. α345 and α121/α565? (2) Will the Cl titration curve and kinetics observed for α121 apply to other assemblies? (3) What is the mechanism of discrimination between α121 and α121/α565 hexamers? (4) What cofactors or helper proteins could accelerate the remarkably slow rate-limiting step of the hexamer assembly (Sect. 4) and/or reduce the high protein concentration necessary for hexamer formation? (5) What mechanisms, besides sulfilimine cross-linking (Bhave et al. 2012; Vanacore et al. 2009), could protect the NC1 hexamer in the event of a Cl concentration drop? (6) How much variability in BM Cl concentration exists in health and disease? (7) Is there a cell signaling role for the NC1 domain that utilizes BM Cl concentration sensing?