Entangled Proteins: Knots, Slipknots, Links, and Lassos

Sulkowska, Joanna I.; Sułkowski, Piotr

doi:10.1007/978-3-319-76596-9_8

Joanna I. Sulkowska⁷ &
Piotr Sułkowski^8,9

Part of the book series: Springer Series in Solid-State Sciences ((SSSOL,volume 189))

2589 Accesses
4 Citations

Abstract

In recent years the studies of entangled proteins have grown into the whole new, interdisciplinary and rapidly developing field of research. Here we present various types of entangled proteins studied within this field, which form knots, slipknots, links, and lassos. We discuss their geometric features and indicate what biological and physical role the entanglement plays. We also discuss mathematical tools necessary to analyze such structures and present databases and servers assembling information about entangled proteins: KnotProt, LinkProt, and LassoProt.

Access provided by CONRICYT-eBooks. Download chapter PDF

GLN: a method to reveal unique properties of lasso type topology in proteins

Article Open access 16 September 2020

Sequence and structural patterns detected in entangled proteins reveal the importance of co-translational folding

Article Open access 10 June 2019

Knot theory for proteins: Gauss codes, quandles and bondles

Article Open access 01 July 2020

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

Entanglement of geometric objects is an important and fascinating phenomenon. On one hand it leads to deep mathematical problems, which are studied within branches of mathematics such as topology; several Fields medals have been awarded for such work, in particular in knot theory. On the other hand entanglement is common in Nature and plays a role in physical, chemical, and biological systems. Nobel prizes related to topology have been awarded in 2016 in physics and chemistry. Topology, and entanglement in particular, are important, because they take into account not just local, but global properties of systems under consideration. This sheds new light on such systems and often requires developing new tools to analyze them.

The study of topology and entanglement is particularly interesting when it concerns real physical systems, and at the same time poses theoretical and mathematical challenges. This is the case of entangled proteins, whose studies we summarize in this work. On one hand, it is unbelievable that such complicated structures would evolve accidentally, so there must be some physical and biological role of entanglement. On the other hand, the study of entangled proteins poses new mathematical challenges, for example a description of knots on open chains. Entangled proteins have been very actively studied in recent years, along with the development of new mathematical tools that enable their characterization.

In this work we summarize studies of entangled proteins, which in recent years have gained the status of a new branch of interdisciplinary research, involving biophysics, biochemistry, mathematics, and computer science. Historically knotted proteins attracted attention first, so – after a brief introduction on what we mean by entanglement in general – we discuss first their properties. Subsequently we discuss properties of other entangled structures identified in proteins: slipknots, links, and lassos. We present both theoretical and mathematical tools to describe such structures, as well as their biological role and physical features.

2 Entanglement in Proteins

Proteins are long chains made of amino acids. They are often described in terms of primary, secondary, and tertiary structure. However such a description does not take into account an important feature of a protein chain that we refer to as entanglement. To describe it, it is sufficient to represent a protein as a one-dimensional chain, or a polygonal shape made of a series of segments spanned between consecutive C\(_{\alpha }\) atoms. From a mathematical perspective, when such a one-dimensional chain is embedded in a three-dimensional space, it can non-trivially wind around itself. By the entanglement we understand a pattern of such winding.

There is a branch of mathematics called knot theory, which studies properties of entangled chains. However the most important feature of chains studied in knot theory is that they are closed, i.e. they don’t have loose ends. Such entangled closed chains are called knots, and – apart from closing the chain – they resemble knots that we know from our daily life. Similarly as in daily life, one can also consider knots in open chains, especially if the ends of such chains are long enough – however such knots are not uniquely defined, and they can be untied by a sequence of smooth manipulations (which do not involve cutting of the chain). On the other hand, knots on closed chains cannot be untied without cutting a chain – therefore they can be uniquely defined, and a given type of a knot can be assigned to a given entangled configuration. In this sense knots, as understood mathematically, are referred to as topological objects.

It turns out that protein chains can also be knotted [5, 29, 32, 42, 43, 71, 76] – at least in the imprecise sense of knotting that we use in daily life. As proteins do have loose ends, it is not possible to assign types of knots uniquely to their configurations. Nonetheless, it often happens that termini of a protein are long enough, so that a type of a knot can be assigned to a protein, at least approximately. For this reason one can take advantage of various tools and techniques from knot theory when studying entangled proteins. On the other hand, these tools should be used carefully and they are often insufficient to characterize configurations of open chains – some additional information should also be provided in this case, and some new techniques are necessary to study knotted proteins. As within the last decade it turned out that knotted proteins are much more common than originally believed, such techniques have been developed, as we will discuss in what follows. Knotted proteins are the first class of entangled structures that we discuss in this work – a simple example of a knot formed in an open chain (which could be a protein backbone) is shown in Fig. 8.1.

One interesting feature of entangled proteins is that they can form configurations which are trivial from the topological and knot theory viewpoint, however – in appropriate sense – they are still entangled. One example of such a configuration is called a slipknot [32, 63]. It consists of two loops, one threaded through the other, as shown in Fig. 8.1. If two termini of a slipknot are pulled, the two loops disentangle; equivalently, after connecting two termini of a slipknot in the simplest possible way (e.g. by extending them far away), the configuration represents mathematically trivial knot. As in knot theory a slipknot configuration would be regarded as trivial, some novel tools are necessary to describe its geometry – and this is an important task, because it turns out that the entanglement of the two loops of a slipknot has interesting consequences and affects proteins’ properties. Proteins with slipknots are the second class of entangled structures that we discuss in this work.

So far we have briefly explained what we mean by entanglement of open chains. However the pattern of entanglement can be more involved once additional linkages, such as disulfide (or other) bridges are taken into account. When such bonds are present, then proper loops in a protein chain can be also considered (i.e. closed loops formed by only a part of the whole chain). In principle such proper loops might themselves be knotted, which however has not been observed to date. Nonetheless, when two or more loops are present in a given chain, they can be entangled and form structures called links in knot theory [11], see Fig. 8.1. Such links are non-trivial topological objects and can be classified using knot theory tools. Links that are formed on chains with disulfide linkages we call as deterministic, because they are made of closed loops and their topology is uniquely determined. In addition, one can also consider links made of several chains that form a dimer, trimer, etc., after their termini are connected (e.g. closed on a large sphere) – we call such links as probabilistic. Proteins with links are the third class of entangled structures discussed in what follows.

Once disulfide bridges are taken into account, one can also consider configurations where one or two termini of a protein chain pierce through a loop closed by such a bridge. Such configurations are called lassos [13, 45], see Fig. 8.1. Even though they can be regarded as topologically trivial and – similarly to knots in open chains – cannot be uniquely defined, they also have interesting biological and physical consequences. Lassos are the last class of entangled structures that we discuss in this work.

One important conclusion of the work summarized here is that, first of all, entangled structures exist in proteins – even though for a long time it was believed that they are too complicated to form [42]. Another very important result is that the entanglement has certain biological and physical role. Moreover, studies of entangled proteins pose new challenges in mathematics, computer science, and other related disciplines. For all these reasons the new, interdisciplinary field of entangled proteins is rapidly growing, and despite many fascinating results found in last years, we have no doubts that a lot more is still to be discovered.

3 Proteins with Knots and Slipknots

The first class of entangled proteins that we discuss are proteins with knots. As mentioned above, because proteins have loose ends, it is not possible to identify uniquely knots formed by their chains. It is however possible to identify such knots using probabilistic methods. In this section we present basic properties of knots in mathematical sense, discuss how to describe knots and slipknots in proteins using this language, and summarize which (families of) knotted proteins have been identified to date. We also briefly discuss folding mechanisms and how the presence of knots affects the function of proteins.

3.1 Classification and Description of Knots

To present mathematical knots on closed chains it is useful to project them on a two-dimensional plane. A type of a knot is then encoded in a pattern of crossings formed by the projection of the chain. It is always possible to minimize the number of such crossings by smooth transformations of the chain, which do not involve cutting it. On the level of the two-dimensional diagram such transformations can be reduced to three elementary operations, referred to as Reidemeister moves, see Fig. 8.2. Various topological types of knots can be classified in terms of equivalence classes (with respect to Reidemeister moves) of their diagrams. It is also very useful to introduce so called knot invariants – various mathematical objects, such as numbers or some functions, which can be uniquely assigned to a given knot type (so that they do not change under Reidemester moves). Knot invariants are used to distinguish and classify knots: they can be computed for two given knots, and if the results are different, it means that these knots have different topological type. One ultimate goal of research in knot theory is to construct an invariant which would distinguish all knots, i.e. if it would take the same value for two knots, it would mean that these two knots are of the same type. Such an invariant is still not known, however invariants that we know these days are sufficient to distinguish relatively simple knots found in proteins.

The simplest knot invariant is the minimal number of crossings in a two-dimensional knot diagram obtained from the projection of a knot; this is simply called the number of crossings. For a trivial knot (i.e. unknotted loop), which is denoted \(0_1\), this number is zero. There is no knot with one or two crossings, i.e. any such configuration can be smoothly reduced to, and represents the unknot. There is one knot with three crossings, called the trefoil and denoted \(3_1\), and one knot with four crossings, called the figure-eight knot and denoted \(4_1\). There are two different topological types of knots with five crossings, denoted \(5_1\) and \(5_2\), and three types of knots with six crossings \(6_1\), \(6_2\) and \(6_3\). Some of the knots mentioned here (in fact those identified in proteins to date) are shown in Fig. 8.3. The number of different knots with fixed number of crossings grows very rapidly; for example there are 21 knots with eight crossings, and 165 knots with ten crossings. In the notation we just mentioned the main number denotes the number of crossings, and the subscript labels different knots (with a given crossing number) – so that, for example, \(3_1\) denotes a unique (the only one) knot with three crossings.

In addition, each knot may arise in two forms, which differ only by the mirror image. Sometimes we distinguish the mirror image by adding plus or minus sign in the notation of the knot; for example two versions (mirror images) of the trefoil knot are denoted \(3_1^+\) and \(3_1^-\). For some knots their mirror image is identical to the original knot – for example \(4_1\) knot has this property.

The number of crossings is not a strong invariant – it identifies uniquely only the unknot, trefoil, and figure-eight knot, and for more crossings there are many different, inequivalent topological types of knots with the same number of crossings. For this reason more powerful knot invariants are introduced. There is a large family of knot invariants known as knot polynomials, which are much stronger, i.e. they have different values for more knots, and thus they enable to distinguish more knots. Such polynomials may depend on one or more variables. For example, the so called Alexander polynomial \(\Delta (q)\) and Jones polynomial J(q) depend on one variable q. More intricate HOMFLY-PT polynomial P(a, q) [21, 50] depends on two variables, and reduces to Alexander and Jones polynomials respectively for \(a=1\) and \(a=q^2\). These polynomials can be determined from a two-dimensional diagram of a knot, and – as we explain in what follows – we also use them to identify knots in proteins. Alexander polynomials for knots identified in proteins are given in Fig. 8.3.

One challenge in analyzing knots in proteins is the fact that protein chains are open. To assign a type of knot to an open chain formed by a protein, one may transform it into a closed chain by joining its termini. The problem is that such an operation is not unique, and various knot types may be created depending on how two termini are connected. One simple way to form a closed loop is to connect two termini by a straight segment. However such an operation may lead to other knot than intuitively seen, especially in case when the termini are located far from each other and the entangled region. It is therefore more reasonable to connect the termini to two points – or the same point – on a large sphere surrounding a protein [18, 43], see Fig. 8.4. This gives more reasonable results, however the resulting type of knot may still depend on the details of the method and it is not unique (e.g. it may depend on the choice of such a point or points on a sphere). The best way to cope with this problem is to choose many equally distributed points on a sphere, and – connecting the termini through all those points – compute probability of forming various knots. This is the method that we often use in order to determine a type of a protein knot – for example, after connecting the termini as mentioned above, we calculate the knot polynomial which determines the knot type [29]. We also introduce the minimal probability, typically of the order of 40%, necessary to regard a given structure as knotted (in other words, if a probability of detecting each type of a knot by connecting the termini in various ways is smaller than this 40%, we regard the structure as unknotted). Unless otherwise stated, in case one particular knot type is assigned to a protein, this means that this knot is formed with the highest probability. Influence of the knot detection method on the type and probability of identified knots was analyzed among others in [14, 73].

Once we can assign knot types to open chains, we can also define geometric quantities that characterize such knots, in particular the knot core and knot tails. The knot core is the shortest subchain of a given chain, for which a given knot type is detected; it can be easily determined by cutting consecutive residues from both termini, and checking whether the remaining chain is still knotted (in the probabilistic sense described above). Knot tails, by definition, consist of all residues which do not form the knot core, i.e. these are parts of the backbone chain between either of its termini and the knot core. We also characterize protein knots as being deep or shallow; the latter ones are those which can be untied by thermal fluctuations, and typically their tails consist of not more than a few residues. Deep knots cannot be untied by thermal fluctuations and they have longer tails.

In the analysis of protein knots we also use the so called KMT algorithm [34], which simplifies a given chain without changing its topology. This algorithm works as follows. We analyze triangles determined by all triples of consecutive residues (represented by C\(_{\alpha }\) atoms). In case such a triangle is not pierced by any other segment of the protein chain, we remove the middle residue from a given triangle (and so reduce the triangle to the segment connecting remaining two residues), see Fig. 8.5. This operation simplifies the (closed) chain without changing its topological type. Such operations are performed as long as possible, leading ultimately to a simplified chain. In particular, if the chain is reduced to only three residues, this means that the original configuration represented the unknot – therefore the KMT algorithm is a simple method to check whether a given chain is not knotted (one should however be careful when interpreting the results of the KMT algorithm – there are certain peculiar unknotted configurations that cannot be reduced simply to three residues). It is also useful to use the KMT algorithm before calculation of knot polynomials – this may significantly reduce their computation time.

3.2 Proteins with Knots and Slipknots – KnotProt Server and Database

For a long time it was believed that knots cannot be formed in proteins, due to their complexity. A possibility of finding knots in proteins was discussed for the first time by Mansfield, who also found a shallow knot in human carbonic anhydrase [42]. The first example of a deeply knotted protein was found by Taylor in 2000 [71], and subsequently other examples of knotted proteins were identified [32, 39, 43, 76]. The most complicated protein knot is the Stevedore’s knot \(6_1\), identified in [5]. Proteins with slipknots were identified and analyzed e.g. in [32, 63].

These days the main source of data that facilitates detection of knots is the Protein Data Bank (PDB, or RCSB database), which stores geometric configurations of more than hundred thousands proteins. It is impossible to identify knots in such a large set of proteins without the use of mathematical and computer tools. In fact, detection of a knot even in a single protein is usually impossible just by a naked eye, due to the complicated entangled structure of the backbone chain. This is why various algorithms and mathematical techniques that we summarized above, involving in particular computation of knot polynomials, are indispensable in the search of knots in proteins. Several databases that store information about knotted proteins were constructed some years ago, for example [33, 35]. However, a modern database and server that stores up to date (and regularly updated), and much more extensive information about proteins with knots and slipknots, is the KnotProt [29], whose features we summarize in what follows.

The KnotProt database, http://knotprot.cent.uw.edu.pl, contains information about all proteins with knots or slipknots. This database contains not only the information about a type of a knot in a given protein and some of its geometric features (e.g. the length of loose ends and the size of the knotted core), but it also presents an internal geometric structure of a given protein in terms of a matrix diagram, which is called the knotting fingerprint. The form and properties of such diagrams will be introduced in the next section. Moreover, the database presents extensive information about the biological function of proteins with knots and slipknots, structural and homological similarity to other knotted or unknotted proteins, and various statistics. In addition, the KnotProt database enables users to upload protein or polymer structures, analyze whether they form knots or slipknots, and generate their knotting fingerprints.

As of today (spring 2017), there are 993 knotted chains identified in KnotProt, and 473 chains with slipknots. This is around 2% of all proteins deposited in the Protein Data Bank. Knotted proteins are found in all kingdoms of life [30]. Knots are found in globular proteins, including those from mitochondria or a ribosome, and in membrane proteins. Around 90% of knotted proteins act as enzymes, whose active site – responsible e.g. for binding ligands – is located in the knotted core. Some knotted proteins are responsible for DNA binding, and some still have an unknown function. The biggest family of proteins with a deep knot is the SPOUT family [72]. The smallest and rather deeply knotted protein, from methanocaldococcus jannaschii, consists of 92 amino acids and is known as MJ0366. The deepest knot is found in protein with unknown function from T. pallidum (PDB code: 5ijr chain A) – its tails have length of at least 100 amino acids. This protein consists of three domains and the middle one is knotted. A deep knot exists also in the family of membrane proteins, calcium exchanger protein (e.g. protein with PDB code 4k1c). A protein structure with an artificial (designed) knot is also known [31].

Table 8.1 A list of families of proteins with knots (following [30]). In the second column a representative structure of each family is given (PDB codes include the chain identificator as the last letter, in the subscript). In the third column the knot type is given, and in the last column the knot fingerprint is provided (whose meaning is explained in the text). A sign in the superscript of the knot type denotes its chirality

Full size table

Table 8.2 A list of families of proteins with slipknots. In the second column a representative structure of each family if given (PDB codes include the chain identificator as the last letter, in the subscript). In the last column the slipknot fingerprint is provided (whose meaning is explained in the text). A sign in the superscript of the knot type denotes its chirality

Full size table

We present a list of representative proteins with knots and slipknots, based on the KnotProt database, in Tables 8.1 and 8.2. In the first column of these tables representative entangled proteins are listed, grouped according to their function, and PDB codes of such representative structures are given in the second column. In Table 8.1 in the third column a type of a knot for the whole chain is provided. In the last column of both tables knotting fingerprints (whose meaning is explained below) of respective structures are given.

3.3 Knotting Fingerprint for Knots and Slipknots

As we discussed earlier, knots in proteins are formed in open chains, which is subtle from mathematical perspective and enables their identification only in a probabilistic manner. However, once a relevant definition of knots in open chains is provided, it can be used not only to determine a type of the knot for the whole chain under consideration, but also for all subchains of such a knot. Such an analysis provides much more accurate and detailed information about entangled proteins. It also enables rigorous detection of slipknots, which can be defined as chains which are unknotted as a whole, but which possess a subchain which is knotted. It is convenient to present an information about knotting of all subchains of a given chain in terms of a matrix diagram, called knotting fingerprint, see Fig. 8.6, which was introduced in [67] following [32].

More precisely, the knotting fingerprint presents data about knotting of each subchain of a given chain in a lower-triangular part of matrix of the size \(N\times N\), where N is the length of the whole chain. The position (i, j) of this matrix represents a subchain spanned between ith and jth residue of the backbone chain, and it is colored according to the type of the (most probable) knot formed by this subchain. The intensity of the color corresponds to the probability of forming such a knot. Typically such a matrix diagram consists of several approximately rectangular regions, which represent various types of knots spanned by various subchains. Such a diagram is a source of various interesting information. First, it enables identification of the minimal length of the knotted region (i.e. the size of the knotted core), the depth of a knot (i.e. the number of amino acids that can be removed from either end of the protein chain before converting it from a knot to a different type of knot or an unknot), and other geometric data characterizing a knot. Second, the location of various knots along the chain indicates their biological and physical role, especially if it is correlated with the location of active sites.

It is also useful to encode the pattern of knots on subchains of a given chain in a simplified notation, which lists all those types of knots identified in the fingerprint matrix. For example by K\(4_1 3_1\) we denote a protein which forms \(4_1\) knot, and which possesses a subchain forming \(3_1\) knot; the initial letter K means that the whole chains forms \(4_1\) knot. In case the whole chain forms a slipknot the letter S is used; for example, S\(4_1 3_1\) means that the whole chain forms a slipknot, and some of its subchains form knots of type \(4_1\) and \(3_1\). Knotting patterns (including the orientation (or mirror image) of identified knots, denoted by ± in the superscript) of some representative proteins with knots and slipknots are provided in the last column in Tables 8.1 and 8.2.

In Fig. 8.7 we present examples of the most complicated fingerprints of a knot and a slipknot found to date. The knotted chain (left) with the fingerprint K\(6_1 6_1 4_1 3_1\) is the structure of DehI (PDB code 3bjx chain A). The chain with the slipknot (right) with the fingerprint S\(3_1 4_1 3_1 3_1 3_1\) is the crystal structure of uracil transporter–uraa (PDB code 3qe7A). Fingerprints for all protein structures with knots and slipknots identified to date can be found in the KnotProt database.

3.4 Folding of Knotted Proteins

An important challenge in the study of entangled proteins is to understand and describe not only their geometric configurations which determine biological activity and function, but also folding mechanisms which lead to such configurations. Even though some experiments with knotted proteins have been conducted, in recent years their folding mechanisms have been studied mainly theoretically, using various coarse grained models. Knots identified in proteins to date are of twist type, and they can be made in one step, by threading of a tail through a twisted loop (the number of twists determines the type of the resulting knot). Current results suggest that knots are not made spontaneously along the protein backbone (and then slight to the final native location), but they are created in two main steps [64]: the first is formation of a twisted native loop, and the second is threading a shorter knot tail through this loop, as shown in Fig. 8.8. In case of proteins with more complex structure, twisted native loop flips over a core of the protein creating e.g. the \(6_1\) knot also just in one step [5]. Flipping mechanism was also observed for proteins with slipknots [64].

One of the most commonly analysed proteins are Yibk and YebA, members of SPOUT family, which possess a deep trefoil knot. Theoretical studies with an unbiased structure based model revealed that these proteins can self tie, via formation of a twisted loop and threading shorter C-terminal tail [64]. Detailed analysis showed that the tail threads through the twisted loop in the slipknot configuration and this threading is a rate limiting step. The same pathway is also observed when non-native contacts are introduced [79]. Even though these proteins have been very extensively studied, still only kinetics pathways are known [41, 54, 74].

The free energy landscape however has been uncovered theoretically for a few small knotted proteins with rather shallow knots. In particular reversible folding via a slipknot conformation was observed for the smallest knotted protein (known to date), MJ0366 [48], with a structure based model, and it was further extensively studied e.g. in [4, 44]. Furthermore, explicit all atom simulations showed that this protein can self-tie and electrostatic interactions facilitate tying of a knot [49]. The self-tying mechanism of MJ0366 was also suggested experimentally [80]. Reversible folding was also observed for artificially designed proteins [31], where both theoretical and experimental studies suggested that topological constrains are responsible for their slower folding [37, 66].

Theoretical studies of proteins with more complex knots, such as \(5_2\) and \(6_1\), also suggest that knotting happens just in one step, which in this case corresponds to flipping the twisted loop over the core of the protein [5, 83]. Moreover folding of those proteins consists of parallel pathways and intermediates steps. Experimental studies of these proteins suggest that their folding mechanism involves at least one intermediate state, see e.g. [2, 36], and proteins are prone to misfold at the final state of folding [38, 81]. Influence of knot type on folding pathway based on lattice models was also investigated, e.g. in [20, 56].

Since proteins in vivo are surrounded by crowded environment and their folding could be supported by chaperons, or they could fold cotranslationally, it was also analyzed how these aspects affect knotting mechanism [8, 40, 57]. For example, when a chaperon is approximated by a cylindrical box, it was found that knotting is still a rate limiting step, however a chaperon significantly smooths the free energy landscape for smallest knotted proteins MJ0366, VirC2 and DndE [47], as well as proteins with complex fingerprint \(5_23_13_1\) [83]. Experimental study on YibK, YebA shown that chaperon can significantly speed up the folding process [40]. Cotranslational knotting with theoretical methods was extensively studied e.g. in [8].

Even though in principle the folding pathways leading to knotted configurations have been proposed, the origin (type of interactions) of the driving force needed to overcome the topological barrier [12, 55], and the influence of chain stiffness [66], are still under investigation. Moreover it is important to mention that a certain exception in folding mechanism was observed based on a three domain protein from T. Pallidum (PDB code 5jir chain A) where the middle domain is knotted. This protein probably folds via a shallow knot formed at random position [30]. Other approach to understand free knotting mechanism and reviews discussing folding pathways of knotted proteins can be found e.g. in [19, 28, 68, 77].

3.5 Function of Knotted Proteins

Another crucial challenge, apart from characterizing folding and geometry of entangled proteins, is to understand what is the role and the function of entanglement. This question is currently actively studied. A review of all knotted proteins shows that the topological fingerprint is well conserved in proteins separated even by million years of evolution and very low sequence similarity [67]. Moreover, more than 90% of known knotted proteins are enzymes, whose active sites are located inside the knotted core. These observations suggest that knots provide some advantages for the hosting organism.

Analysis of analogous proteins having the same function – methylation of tRNA [27] – but different topology also reveals some information about the role of a knot. An example of such a pair of proteins involves a bacterial one with a trefoil \(3_1\) knot (TrmD), and an eukaryotic one which is unknotted (Trm5). As showed in [7], the knot responds to the motions of the whole protein, even though it belongs to the rigid part of the structure. The trefoil knot in the TrmD protein is capable of transferring the signal coming from the substrates further through the protein, by its internal moves. Mutations of key residues in the area of the knot suppress this motion and make the protein unable to conduce the methyl transfer. The fact that the knot plays a role in the enzymatic function suggests that knots play more profound role than just stabilizing the structure. Stabilizing role of knots however should not be neglected, and it is important e.g. in phytochromes, which are red/far-red light photoreceptors that direct photosensory responses across the bacterial, fungal and plant kingdoms [17, 78]. The binding pocket of those proteins is stabilized by a deep figure-eight knot. Some other suggestions about the role of knots are presented in [14, 53].

Additional information about knots in proteins was obtained based on mechanical manipulations, via single molecule experiment both in vitro and in silico (by computer analysis) [51, 58]. First, molecular dynamics simulations with structure based model [59, 60] showed that a knotted protein is more resistant than an unknotted one [61]. Second, it was found in [6, 62] that upon pulling proteins by their termini, knots tighten along the backbone at deterministic locations. Conditions to untie knotted proteins were established in [65] and tested experimentally in [84]. Furthermore, it was shown that knotted proteins can block or be pulled through narrow channels, depending on the applied force [69, 70, 82]. A surprising tying of a knot on the protein backbone was also observed in thermal and chemical denaturation [3, 41]. Furthermore, mechanical manipulations of proteins with slipknots showed that they form certain metastable conformations [25, 26, 63].

4 Links in Proteins

In knot theory, apart from knots, also another class of entangled structures is considered, which are called links. A link consists of several closed loops (called components of a link), such that each of those loops may be knotted, and moreover pairs, triples, etc. of loops can be simultaneously interlinked. Any knot may be regarded as a link with only one component. The simplest example of a non-trivial link is the Hopf link, formed by two loops linked in the simplest possible way. Another link that can be made out of two loops is the Solomon link. Hopf link and Solomon link are shown in Fig. 8.9 (in the middle).

Links in proteins have been identified only very recently, and they can be considered from two perspectives [11]. First, links can be identified in a single protein chain if it has additional linkages, for example disulfide bridges. Each such bridge defines a closed loop, and if several such loops are present in one protein chain, then the whole structure forms a link (which is nontrivial if those loops are interlinked). Importantly, links defined in such a way are properly defined from mathematical perspective and there is no need to consider probabilistic methods to identify them – we call such links deterministic. Once orientation of component loops is introduced – e.g. induced from the orientation of the backbone from N to C terminus – the notion of chirality of links can be defined, see Fig. 8.9.

Table 8.3 Families of proteins (following [11]) with the Hopf link (with positive and negative orientation, respectively in the upper and lower part of the table, as indicated in the last column). For each family the function and a representative chain are given respectively in the second and the third column. L1 and L2 denote the size of the two loops of the Hopf link, P1 and P2 is the signed index of a residue piercing through each of those loops. “# of hom.” is the number of homologs for a given structure, “# of loops” is the number of disulfide-based covalent loops in the structure (e.g. if 4, there are 2 covalent loops forming the Hopf link and two trivial covalent loops), “Size” is the number of residues in the structure, “Loop sep.” is the sequential distance between the loops. The dashed line separates the humanly modified protein with PDB code 3T93. Proteins are ordered according to the size of the first pierced loop

Full size table

Table 8.4 Families of proteins (following [11]) with the Solomon link. For each family the function and a representative chain are given respectively in the second and the third column. L1 and L2 denote the size of the two loops of the Hopf link, P1 and P2 is the signed index of a residue piercing through each of those loops. “# of hom.” is the number of homologs for a given structure, “# of loops” is the number of disulfide-based covalent loops in the structure (e.g. if 4, there are 2 covalent loops forming the Solomon link and two trivial covalent loops), “Size” is the number of residues in the structure, “Loop sep.” is the sequential distance between the loops

Full size table

On the other hand, links can also be made of several separate chains, which form dimers, trimers, etc. [15]. To identify such links, each component chain must be closed (e.g. by connecting its termini on a large sphere, similarly as in the case of knots), and then the HOMFLY-PT polynomial is calculated. A type of such a link may depend on details how termini of all chains are connected, and so we call such links as probabilistic.

All proteins with links identified in the Protein Data Bank are presented in the LinkProt database [15], available at http://linkprot.cent.uw.edu.pl. As of spring 2017, 124 deterministic and 8456 probabilistic links (1071 linked proteins with 30% of sequence similarity) have been identified in this database. This database is also regularly updated, so that it always contains an up to date list of proteins with links. Deterministic links identified in proteins to date form one of two simplest links – Hopf link and Solomon link – as shown in examples in Fig. 8.9. Once the orientation of two component loops is introduced, one can consider two versions of the Hopf link, and both of them are found in proteins (in this case there is a natural orientation of each loop induced by the ordering of the protein from N to C terminus). In Fig. 8.9 in the left a cartoon representation of a given protein is shown, and in the right the protein backbone is shown in a simplified way, with link components colored in blue and red, linkages (closing the loops) shown in orange, and remaining parts of the protein chain in black. Furthermore, all non-redundant structures (found to date) that form deterministic Hopf link, together with their geometric properties and the function, are listed in Table 8.3. Analogous data for non-redundant proteins that form deterministic Solomon link is shown in Table 8.4.

Probabilistic chains identified in LinkProt database form more configurations, which involve two, three or four components. One of the most interesting configurations is \(6^3_3\) (probabilistic) link, which involves three symmetrically and mutually interlinked components (note that it should not be confused with the Borromean rings). As there are many more protein configurations forming probabilistic links, to learn about their properties we encourage a reader to browse the LinkProt database.

Currently the function of links in proteins is not well understood, however they definitely provide additional topological stability [11] (in addition to stabilizing role of disulfide bridges and closed loops). Link topology is also responsible for misfolding of a hosted protein [11].

5 Proteins with Lassos

The last class of entangled proteins that we discuss are proteins with lassos, introduced systematically in [45]. Lassos can be identified in structures with disulfide (or other types of) bridges. Similarly as in (deterministic) links, such a bridge defines a closed loop. By a lasso we mean a configurations which consists of such a loop, through which some other part (or parts) of the backbone chain is threaded. The pattern of such threading may be quite complicated: a protein chain may pierce the loop several times, it can wind around the backbone chain forming a loop, etc. Note that configurations similar to lassos were studied in [75]; we also stress that lassos should not be confused with cystein knots [9, 10, 16].

In order to define lassos unambiguously, we propose to span an auxiliary surface of minimal area (analogous to a soap bubble), called the minimal surface, on the closed loop. To determine such a surface, or more precisely a triangulated approximation to such a surface, we use tools and algorithms from computer graphics [45]. The orientation of the loop (from N to C terminal) induces the orientation of the surface spanned on this loop, which enables to identify a direction of piercing of this surface by a protein terminus, as shown in Fig. 8.10.

Several lasso motifs identified so far in proteins are shown in Fig. 8.11. One class of motifs involves one protein terminus piercing the minimal surface once, or several times from opposite directions, as shown in the first row in this figure. Such configurations are called single lasso, double lasso, triple lasso, etc., and denoted respectively \(L_1\), \(L_2\), \(L_3\), etc. Another lasso motif involves one terminus winding several times around the closed loop and piercing the minimal surface each time from the same direction; we call such a configuration a supercoiling and denote it LS. Finally, a configuration where two termini pierce through the closed loop is denoted LL, or more precisely \(LL_{i,j}\), where i and j denote the number of times each terminus pierces the minimal surface. For each of those motifs an information about the direction of piercings and the piercing terminus can also be provided. More complicated lasso configurations, involving several loops, also exist, for example such as shown in Fig. 8.12.

An example of a protein forming a lasso of type \(L_2\) – an oxidoreductase protein (PDB code 2oiz) – is shown in Fig. 8.10. Cartoon representation of the protein is shown in the left, with the disulfide bridge shown in orange. A simplified protein structure is shown in the middle, with the grey triangulated minimal surface spanned on the closed loop, pierced by one terminus at two triangles (green and blue color, indicating the direction of piercing). A baricentric planar representation of the minimal surface with pierced triangles is shown in the right. The numbers 127 and 172 indicate the numbers of segments connecting consecutive C\(_{\alpha }\) atoms that pierce the surface.

Table 8.5 Families of proteins with various types of lassos (following [45]). In the first bullet groups of enzymes are listed in order of decreasing number of occurrences; in the second bullet PDB classes of non-enzymatic proteins are listed in order of decreasing number of occurrences. In case of \(L_1\) lasso only groups with more than 9 elements are listed

Full size table

All proteins with lassos identified to date are collected in the LassoProt database [13], available at http://lassoprot.cent.uw.edu.pl. As of spring 2017, there are 6446 structures with lassos in this database, which have been identified in all kingdoms of life, in globular and membrane proteins. We list families of these proteins in Table 8.5, together with the number of lasso structures in each family, and grouped into enzymes and non-enzymes. It truns out that around 18%, i.e. 376 out of 2021 protein structures with disulfide bonds (data from the year 2015), form lassos in the nonredundant set (i.e. proteins with sequence similarity lower than 30%). Furthermore, geometric properties of lassos can be analyzed using the online PyLasso server [46], available at http://pylasso.cent.uw.edu.pl.

Apart from their geometry, also functions and other properties of proteins with lassos have been studied. In [1, 52] it was shown that the lasso motif is well preserved in antimicrobial proteins, where the lasso stabilizes the entire fold. Lassos also have a therapeutic potential, similarly as proteins with cystein knots [16]. Even though the influence of lassos on proteins’ stability and biological activity have been analyzed systematically only recently, it is already known that e.g. in leptin the presence of a lasso slows down folding, but facilitates receptor binding [22,23,24].

6 Conclusions

We have presented various types of entangled structures in proteins: knots, slipknots, links and lassos. Their crucial feature is the fact that the whole protein chain needs to be considered to identify its type of entanglement; in this sense entanglement is a global property of a protein chain. Some years ago it was believed that entangled proteins, in particular knots, cannot exist, due to the complexity of their structure. However more recently many such structures have been found, and it has become clear that their existence is not accidental and they must play certain biological and physical role.

We have also presented databases and servers that assemble and regularly update information about entangled structures, as well as a plugin that facilitates analysis of lassos:

KnotProt, http://knotprot.cent.uw.edu.pl
LinkProt, http://linkprot.cent.uw.edu.pl
LassoProt, http://lassoprot.cent.uw.edu.pl
PyLasso http://pylasso.cent.uw.edu.pl

Apart from geometric structure of all protein chains, also an information on their function, various classifications, a list of similar chains, etc., are provided in the above databases. In these websites one can also upload other polymer-like structures, not necessarily proteins, and analyze if they contain knots, slipknots, links or lassos.

In last few years the studies of entangled proteins have grown into a new, rapidly developing and interdisciplinary field, which involves methods and techniques from biophysics, biochemistry, computer science, branches of mathematics such as topology and knot theory, etc. There are plenty of opportunities and outstanding questions in studying entangled proteins, which involve understanding their function, evolution, folding mechanisms, and other features. We encourage a reader to try to answer some of these questions too.

References

C.D. Allen, M.Y. Chen, A.Y. Trick, D.T. Le, A.L. Ferguson, A.J. Link, Thermal unthreading of the lasso peptides astexin-2 and astexin-3. ACS Chem. Biol. (2016)
Article Google Scholar
F.I. Andersson, D.G. Pina, A.L. Mallam, G. Blaser, S.E. Jackson, Untangling the folding mechanism of the 52-knotted protein uch-l3. FEBS J. 276(9), 2625–2635 (2009)
Article Google Scholar
B.T. Andrews, D.T. Capraro, J.I. Sulkowska, J.N. Onuchic, P.A. Jennings, Hysteresis as a marker for complex, overlapping landscapes in proteins. J. Phys. Chem. Lett. 4(1), 180–188 (2012)
Article Google Scholar
S.A. Beccara, T. Škrbić, R. Covino, C. Micheletti, P. Faccioli, Folding pathways of a knotted protein with a realistic atomistic force field. PLoS Comput. Biol. 9(3), e1003002 (2013)
Google Scholar
D. Bölinger, J.I. Sułkowska, H.-P. Hsu, L.A. Mirny, M. Kardar, J.N. Onuchic, P. Virnau, A Stevedore’s protein knot. PLoS Comput. Biol. 6(4), e1000731–e1000731 (2010)
Article MathSciNet Google Scholar
T. Bornschlögl, D.M. Anstrom, E. Mey, J. Dzubiella, M. Rief, K.T. Forest, Tightening the knot in phytochrome by single-molecule atomic force microscopy. Biophys. J. 96(4), 1508–1514 (2009)
Article ADS Google Scholar
T. Christian, R. Sakaguchi, A.P. Perlinska, G. Lahoud, T. Ito, E.A. Taylor, S. Yokoyama, J.I. Sulkowska, Y-M. Hou, Methyl transfer by substrate signaling from a knotted protein fold. Nat. Struct. Mol. Biol. (2016)
Google Scholar
M. Chwastyk, M. Cieplak, Cotranslational folding of deeply knotted proteins. J. Phys. Condens. Matter 27(35), 354105 (2015)
Article Google Scholar
D.J. Craik, N.L. Daly, T. Bond, C. Waine, Plant cyclotides: a unique family of cyclic and knotted proteins that defines the cyclic cystine knot structural motif. J. Mol. Biol. 294(5), 1327–1336 (1999)
Article Google Scholar
D.J. Craik, M. Čemažar, C.K.L. Wang, N.L. Daly, The cyclotide family of circular miniproteins: nature’s combinatorial peptide template. Pept. Sci. 84(3), 250–266 (2006)
Article Google Scholar
P. Dabrowski-Tumanski, J.I. Sulkowska, Topological knots and links in proteins. Proc. Natl. Acad. Sci. 114(13), 3415–3420 (2017)
Article Google Scholar
P. Dabrowski-Tumanski, A.I. Jarmolinska, J.I. Sulkowska, Prediction of the optimal set of contacts to fold the smallest knotted protein. J. Phys. Condens. Matter 27(35), 354109 (2015)
Article Google Scholar
P. Dabrowski-Tumanski, W. Niemyska, P. Pasznik, J.I. Sulkowska, Lassoprot: server to analyze biopolymers with lassos. Nucleic Acids Res. 44(W1), W383–W389, 2016
Article Google Scholar
P. Dabrowski-Tumanski, A. Stasiak, J.I. Sulkowska, In search of functional advantages of knots in proteins. PloS one, 11(11), e0165986 (2016)
Article Google Scholar
P. Dabrowski-Tumanski, A.I. Jarmolinska, W. Niemyska, E.J. Rawdon, K.C. Millett, J.I. Sulkowska, Linkprot: a database collecting information about biological links. Nucleic Acids Res. 45(D1), D243 (2017)
Article Google Scholar
N.L. Daly, D.J. Craik. Bioactive cystine knot proteins. Curr. Opin. Chem. Biol. 15(3), 362–368 (2011)
Article Google Scholar
L.-O. Essen, J. Mailliet, J. Hughes, The structure of a complete phytochrome sensory module in the Pr ground state. Proc. Natl. Acad. Sci. 105(38), 14709–14714 (2008)
Article ADS Google Scholar
B. Ewing, K.C. Millett, Computational algorithms and the complexity of link polynomials. Prog. Knot Theory Relat. Top. 56, 51–68 (1997)
MathSciNet MATH Google Scholar
P.F.N. Faísca, Knotted proteins: a tangled tale of structural biology. Comput. Struct. Biotechnol. J. 13, 459–468 (2015)
Article Google Scholar
P.F.N. Faísca, R.D.M. Travasso, T. Charters, A. Nunes, M. Cieplak, The folding of knotted proteins: insights from lattice simulations. Phys. Biol. 7(1), 016009 (2010)
Article ADS Google Scholar
P. Freyd, D. Yetter, J. Hoste, W.B.R. Lickorish, K. Millett, A. Ocneanu, A new polynomial invariant of knots and links. Bull. Am. Math. Soc. 12(2), 239–246 (1985)
Article MathSciNet Google Scholar
E. Haglund, J.I. Sułkowska, Z. He, G-S. Feng, P.A. Jennings, J.N. Onuchic, The unique cysteine knot regulates the pleotropic hormone leptin. PloS one 7(9), e45654 (2012)
Article ADS Google Scholar
E. Haglund, J.I. Sulkowska, J.K. Noel, H. Lammert, J.N. Onuchic, P.A. Jennings, Pierced lasso bundles are a new class of knot-like motifs. PLoS Comput. Biol. 10(6), e1003613 (2014)
Article ADS Google Scholar
E. Haglund, A. Pilko, R. Wollman, P.A. Jennings, J.N. Onuchic, Pierced lasso topology controls function in leptin. J. Phys. Chem. B 121(4), 706–718 (2017)
Article Google Scholar
C. He, G.Z. Genchev, H. Lu, H. Li, Mechanically untying a protein slipknot: multiple pathways revealed by force spectroscopy and steered molecular dynamics simulations. J. Am. Chem. Soc. 134(25), 10428–10435 (2012)
Article Google Scholar
C. He, G. Lamour, A. Xiao, J. Gsponer, H. Li, Mechanically tightening a protein slipknot into a trefoil knot. J. Am. Chem. Soc. 136(34), 11946–11955 (2014)
Article Google Scholar
Y.M. Hou, R. Matsubara, R. Takase, I. Masuda, J.I. Sulkowska. TrmD: a methyl transferase for tRNA methylation with m1G37. The Enzymes (2017)
Chapter Google Scholar
S.E. Jackson, A. Suma, C. Micheletti, How to fold intricately: using theory and experiments to unravel the properties of knotted proteins. Curr. Opin. Struct. Biol. 42, 6–14 (2017)
Google Scholar
M. Jamroz, W. Niemyska, E.J. Rawdon, A. Stasiak, K.C. Millett, P. Sułkowski, J.I. Sulkowska, Knotprot: a database of proteins with knots and slipknots. Nucleic Acids Res. 43(D1), D306–D314 (2015)
Article Google Scholar
A.I. Jarmolinska, A.P. Perlinska, R. Runkel, B. Trefz, P. Virnau, J.I. Sulkowska, Proteins? knotty problems (2017) (under review)
Google Scholar
N.P. King, A.W. Jacobitz, M.R. Sawaya, L. Goldschmidt, T.O. Yeates, Structure and folding of a designed knotted protein. Proc. Natl. Acad. Sci. 107(48), 20732–20737 (2010)
Article ADS Google Scholar
N.P. King, E.O. Yeates, T.O. Yeates, Identification of rare slipknots in proteins and their implications for stability and folding. J. Mol. Biol. 373(1), 153–166 (2007)
Article Google Scholar
G. Kolesov, P. Virnau, M. Kardar, L.A. Mirny, Protein knot server: detection of knots in protein structures. Nucleic Acids Res. 35, W425–8 (2007)
Article Google Scholar
K. Koniaris, M. Muthukumar, Self-entanglement in ring polymers. J. Chem. Phys. 95(4), 2873–2881 (1991)
Article ADS Google Scholar
Y.-L. Lai, S.-C. Yen, Y. Sung-Huan, J.-K. Hwang, pknot: the protein knot web server. Nucleic Acids Res. 35(2), W420–W424 (2007)
Article Google Scholar
Y.-T.C. Lee, C-Y. Chang, S.-Y. Chen, Y.-R. Pan, M.-R. Ho, S.T.D. Hsu, Entropic stabilization of a deubiquitinase provides conformational plasticity and slow unfolding kinetics beneficial for functioning on the proteasome. Sci. Rep. 7, 45174 (2017)
Article ADS Google Scholar
W. Li, T. Terakawa, W. Wang, S. Takada, Energy landscape and multiroute folding of topologically complex proteins adenylate kinase and 2ouf-knot. Proc. Natl. Acad. Sci. 109(44), 17789–17794 (2012)
Article ADS Google Scholar
S.-C. Lou, S. Wetzel, H. Zhang, E.W. Crone, Y.-T. Lee, S.E. Jackson, S.-T.D. Hsu, The knotted protein UCh-L1 exhibits partially unfolded forms under native conditions that share common structural features with its kinetic folding intermediates. J. Mol. Biol. 428(11), 2507–2520 (2016)
Article Google Scholar
R.C. Lua, Pyknot, a pymol tool for the discovery and analysis of knots in proteins. Bioinformatics 28(15), 2069–2071 (2012)
Article Google Scholar
A.L. Mallam, S.E. Jackson, Knot formation in newly translated proteins is spontaneous and accelerated by chaperonins. Nat. Chem. Biol. 8(2), 147–153 (2012)
Article Google Scholar
A.L. Mallam, J.M. Rogers, S.E. Jackson, Experimental detection of knotted conformations in denatured proteins. Proc. Natl. Acad. Sci. 107(18), 8189–8194 (2010)
Article ADS Google Scholar
M.L. Mansfield, Are there knots in proteins? Nat. Struct. Mol. Biol. 1(4), 213–214 (1994)
Article Google Scholar
K.C. Millett, E.J. Rawdon, A. Stasiak, J.I. Sułkowska, Identifying knots in proteins. Biochem. Soc. Trans. 41(2), 533–537 (2013)
Article Google Scholar
S. Najafi, R. Potestio, Folding of small knotted proteins: insights from a mean field coarse-grained model. J. Chem. Phys. 143(24):12B606_1 (2015)
Google Scholar
W. Niemyska, P. Dabrowski-Tumanski, M. Kadlof, E. Haglund, P. Sułkowski, J.I. Sulkowska, Complex lasso: new entangled motifs in proteins. Sci. Rep. 6, 36895 (2016)
Google Scholar
W. Niemyska, A.M. Gierut, P. Sulkowski, P. Dabrowski-Tumanski, J.I. Sulkowska, Pylasso a pymol plugin to identify lassos (2017) (under review)
Google Scholar
S. Niewieczerzal, J.I. Sulkowska, Knotting and unknotting proteins in the chaperonin cage: effects of the excluded volume. PloS one 12(5), e0176744 (2017)
Article Google Scholar
J.K. Noel, J.I. Sułkowska, J.N. Onuchic, Slipknotting upon native-like loop formation in a trefoil knot protein. Proc. Natl. Acad. Sci. 107(35), 15403–15408 (2010)
Article ADS Google Scholar
J.K. Noel, J.N. Onuchic, J.I. Sulkowska, Knotting a protein in explicit solvent. J. Phys. Chem. Lett. 4(21), 3570–3573 (2013)
Article Google Scholar
J.H. Przytycki, P. Traczyk, Invariants of links of conway type. Kobe J. Math. 4, 115–139 (1988)
Google Scholar
M. Rief, H. Grubmüller, Force spectroscopy of single biomolecules. Chem. Phys. Chem. 3(3), 255–261 (2002)
Article Google Scholar
K.J. Rosengren, R.J. Clark, N.L. Daly, U. Göransson, A. Jones, D.J. Craik, Microcin j25 has a threaded sidechain-to-backbone ring structure and not a head-to-tail cyclized backbone. J. Am. Chem. Soc. 125(41), 12464–12474 (2003)
Article Google Scholar
T.C. Sayre, T.M. Lee, N.P. King, T.O. Yeates, Protein stabilization in a highly knotted protein polymer. Protein Eng. Des. Select. 24(8), 627–630 (2011)
Article Google Scholar
E. Shakhnovich, Protein folding: to knot or not to knot? Nat. Mater. 10(2), 84–86 (2011)
Article ADS MathSciNet Google Scholar
T. Škrbić, C. Micheletti, P. Faccioli, The role of non-native interactions in the folding of knotted proteins. PLoS Comput. Biol. 8(6), e1002504 (2012)
Article ADS MathSciNet Google Scholar
M.A. Soler, A. Nunes, P.F.N. Faísca, Effects of knot type in the folding of topologically complex lattice proteins. J. Chem. Phys. 141(2), 07B607_1 (2014)
Google Scholar
M.A. Soler, A. Rey, P.F.N. Faísca, Steric confinement and enhanced local flexibility assist knotting in simple models of protein folding. Phys. Chem. Chem. Phys. 18(38), 26391–26403 (2016)
Article Google Scholar
M. Sotomayor, K. Schulten, Single-molecule experiments in vitro and in silico. Science 316(5828), 1144–1148 (2007)
Article ADS Google Scholar
J.I. Sułkowska, M. Cieplak, Mechanical stretching of proteins—a theoretical survey of the protein data bank. J. Phys. Condens. Matter 19(28), 283201 (2007)
ADS Google Scholar
J.I. Sułkowska, M. Cieplak, Selection of optimal variants of gō-like models of proteins through studies of stretching. Biophys. J. 95(7), 3174–3191 (2008)
Article ADS Google Scholar
J.I. Sułkowska, P. Sułkowski, P. Szymczak, M. Cieplak, Stabilizing effect of knots on proteins. Proc. Natl. Acad. Sci. 105(50), 19714–19719 (2008)
Article ADS Google Scholar
J.I. Sułkowska, P. Sułkowski, P. Szymczak, M. Cieplak, Tightening of knots in proteins. Phys. Rev. Lett. 100(5), 058106 (2008)
Google Scholar
J.I. Sułkowska, P. Sułkowski, J.N. Onuchic, Jamming proteins with slipknots and their free energy landscape. Phys. Rev. Lett. 103(26), 268103 (2009)
Google Scholar
J.I. Sułkowska, P. Sułkowski, J. Onuchic, Dodging the crisis of folding proteins with knots. Proc. Natl. Acad. Sci. 106(9), 3119–3124 (2009)
Article ADS MathSciNet Google Scholar
J.I. Sułkowska, P. Sułkowski, P. Szymczak, M. Cieplak, Untying knots in proteins. J. Am. Chem. Soc. 132(40), 13954–13956 (2010)
Article Google Scholar
J.I. Sułkowska, J.K. Noel, J.N. Onuchic, Energy landscape of knotted protein folding. Proc. Natl. Acad. Sci. 109(44), 17783–17788 (2012)
Article ADS Google Scholar
J.I. Sułkowska, E.J. Rawdon, K.C. Millett, J.N. Onuchic, A. Stasiak, Conservation of complex knotting and slipknotting patterns in proteins. Proc. Natl. Acad. Sci. 109(26), E1715–E1723 (2012)
Article ADS Google Scholar
J.I. Sułkowska, J.K. Noel, C.A. Ramírez-Sarmiento, E.J. Rawdon, K.C. Millett, J.N. Onuchic, Knotting pathways in proteins. Biochem. Soc. Trans. 41(2), 523–527 (2013)
Article Google Scholar
P. Szymczak, Tight knots in proteins: can they block the mitochondrial pores? Biochem. Soc. Trans. 41(2), 620–624 (2013)
Article Google Scholar
P. Szymczak, Periodic forces trigger knot untying during translocation of knotted proteins. Sci. Rep. 6 (2016)
Google Scholar
W.R. Taylor, A deeply knotted protein structure and how it might fold. Nature 406(6798), 916–919 (2000)
Article ADS Google Scholar
K.L. Tkaczuk, S. Dunin-Horkawicz, E. Purta, J.M. Bujnicki, Structural and evolutionary bioinformatics of the spout superfamily of methyltransferases. BMC Bioinform. 8(1), 73 (2007)
Article Google Scholar
L. Tubiana, E. Orlandini, C. Micheletti, Probing the entanglement and locating knots in ring polymers: a comparative study of different arc closure schemes. Prog. Theor. Phys. Suppl. 191, 192–204 (2011)
Article ADS Google Scholar
I. Tuszynska, J.M. Bujnicki, Predicting atomic details of the unfolding pathway for yibk, a knotted protein from the spout superfamily. J. Biomol. Struct. Dyn. 27(4), 511–520 (2010)
Article Google Scholar
E. Uehara, T. Deguchi, Statistical and hydrodynamic properties of topological polymers for various graphs showing enhanced short-range correlation. J. Chem. Phys. 145(16), 164905 (2016)
Article ADS Google Scholar
P. Virnau, L.A. Mirny, M. Kardar, Intricate knots in proteins: function and evolution. PLoS Comput. Biol. 2(9), e122 (2006)
Google Scholar
P. Virnau, A. Mallam, S. Jackson, Structures and folding pathways of topologically knotted proteins. J. Phys. Condens. Matter 23(3), 033101 (2010)
Article ADS Google Scholar
J.R. Wagner, J.S. Brunzelle, K.T. Forest, R.D. Vierstra, A light-sensing knot revealed by the structure of the chromophore-binding domain of phytochrome. Nature 438(7066), 325–331 (2005)
Article ADS Google Scholar
S. Wallin, K.B. Zeldovich, E.I. Shakhnovich, The folding mechanics of a knotted protein. J. Mol. Biol. 368(3), 884–893 (2007)
Article Google Scholar
I. Wang, S.-Y. Chen, S.-T.D. Hsu, Unraveling the folding mechanism of the smallest knotted protein, mj0366. J. Phys. Chem. B 119(12), 4359–4370 (2015)
Article Google Scholar
I. Wang, S.-Y. Chen, S.-T.D. Hsu, Folding analysis of the most complex stevedore’s protein knot. Sci. Rep. 6 (2016)
Google Scholar
M. Wojciechowski, À. Gómez-Sicilia, M. Carrión-Vázquez, M. Cieplak, Unfolding knots by proteasome-like systems: simulations of the behaviour of folded and neurotoxic proteins. Mol. Biosyst. 12(9), 2700–2712 (2016)
Article Google Scholar
Y. Zhao, S. Niewieczerzal, P. Dabrowski-Tumanski, J.I. Sulkowska, The exclusive effects of chaperonin on the behavior of the 52 knotted proteins (under review)
Google Scholar
F. Ziegler, N.C. Lim, S.S. Mandal, B. Pelz, W.-P. Ng, M. Schlierf, S.E. Jackson, M. Rief, Knotting and unknotting of a protein in single molecule experiments. Proc. Natl. Acad. Sci. 201600614 (2016)
Google Scholar

Download references

Acknowledgements

This work has been financed from the budget for science in the years 2016-2019 [#0003/ID3/2016/64 MNiSW – Ideas Plus], EMBO [#2057 Installation Grant], and National Science Centre [#2012/07/E/NZ1/01900] to J.I.S. The work of P.S. has been supported by the ERC Starting Grant no. 335739 “Quantum fields and knot homologies” funded by the European Research Council under the European Union’s Seventh Framework Programme.

Author information

Authors and Affiliations

Centre of New Technologies, University of Warsaw, Banacha 2c, 02-097, Warsaw, Poland
Joanna I. Sulkowska
Faculty of Physics, University of Warsaw, Pasteura 5, 02-093, Warsaw, Poland
Piotr Sułkowski
Walter Burke Institute for Theoretical Physics, California Institute of Technology, Pasadena, CA, 91125, USA
Piotr Sułkowski

Authors

Joanna I. Sulkowska
View author publications
You can also search for this author in PubMed Google Scholar
Piotr Sułkowski
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Joanna I. Sulkowska .

Editor information

Editors and Affiliations

Department of Physics and Astronomy, Western Kentucky University, Bowling Green, Kentucky, USA
Sanju Gupta
Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico, USA
Avadh Saxena

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Sulkowska, J.I., Sułkowski, P. (2018). Entangled Proteins: Knots, Slipknots, Links, and Lassos. In: Gupta, S., Saxena, A. (eds) The Role of Topology in Materials. Springer Series in Solid-State Sciences, vol 189. Springer, Cham. https://doi.org/10.1007/978-3-319-76596-9_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-76596-9_8
Published: 22 April 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-76595-2
Online ISBN: 978-3-319-76596-9
eBook Packages: Physics and AstronomyPhysics and Astronomy (R0)

Publish with us

Policies and ethics

Entangled Proteins: Knots, Slipknots, Links, and Lassos

Abstract

Similar content being viewed by others

GLN: a method to reveal unique properties of lasso type topology in proteins

Sequence and structural patterns detected in entangled proteins reveal the importance of co-translational folding

Knot theory for proteins: Gauss codes, quandles and bondles

Keywords

1 Introduction

2 Entanglement in Proteins

3 Proteins with Knots and Slipknots

3.1 Classification and Description of Knots

3.2 Proteins with Knots and Slipknots – KnotProt Server and Database