Introduction

Dust and storage mites produce allergens leading to widespread and common allergic diseases, including bronchial asthma, perennial rhinitis, and atopic dermatitis, that affect about 10–20 % of the global population [1, 2]. Two dust mite species, Dermatophagoides farinae and D. pteronyssinus, are the predominant sources of these allergens. Indeed, these species produce at least 23 groups of allergens [3]. The production of therapies against these allergens, to alleviate or eliminate allergic symptoms, hinges on our understanding of the individual allergens produced by dust mites as well as by storage mites like Blomia tropicalis. In fact, many studies have been devoted to understanding the D. farinae and D. pteronyssinus allergens from clinical, molecular, and immunological standpoints. In particular, analysis of allergen protein structure and organization has advanced the field so far as to allow the production of recombinant allergens. Importantly, allergen engineering offers the potential for large-scale production of more effective treatments, therefore such studies are indispensible in reducing allergic disease.

Studies of the structural biology of allergens, including Dermatophagoides, have established the foundations on which to explore the properties that determine allergenicity. Beginning with amino acid sequence homology, the higher-order structure and organization of the proteins can be derived to determine IgE binding capabilities or predict functions. Such information can also be used to understand potential cross-reactivity with other allergens. Thus, the contributions of structural biology approaches to studying mite allergens are crucial to advancing both clinical and immunological perspectives on these disease-causing proteins.

Structural biology: the means to an end

Structural biology studies seek to determine the physical properties of protein organization as a basis for imputing functions and interactions of a protein. These kinds of studies are commonly employed in allergen research because of the ease with which protein databases allow rapid classification and comparison of suspected allergens with known proteins, particularly other allergens. Ascertaining how the protein folds allows researchers to deduce whether/to what extent the protein binds IgE antibodies; uncovering the size, stability, and solubility of an allergen aids in calculating its ability to cross mucosal barriers. Thus, structural information for allergens helps researchers gauge both the allergenicity—whether a protein induces a clinical or immunological allergic response—and cross-reactivity—ability to bind the same epitope as other proteins—of these widely-different proteins.

A variety of techniques are used to determine the structure of a protein. From sequence homology searches and computer modeling, to X-ray crystallography and nuclear magnetic resonance spectroscopy, the availability of both virtual and experimental approaches provides a range of possible methods to gain insight to protein organization and function. Indeed, the recent windfall of novel proteins—including allergens [4, 5]—identified through large-scale proteomics studies has placed structural biology techniques at the forefront of characterizing these new proteins.

The primary structure of a protein is determined by amino acid sequencing using standard molecular biology techniques, and often includes extrapolation from the nucleic acid sequence of a gene. Additionally, advanced mass spectrometry methods are commonly used to determine the amino acid composition of proteins. Often, though, bioinformatics analyses form the foundation for further characterization. Many freely-accessible databases (e.g., NCBI, PDB) are available for sequence alignments to identify similarities to proteins from other species or those with defined functions. Alignments also aid in predicting post-translational modifications that can alter a protein’s conformation or activity.

The characterization of proteins beyond primary structure becomes gradually more complicated. Secondary structure, or the pattern of arrangement encompassing strands, helices, turns, and coils, while often predicted through computer modeling based on primary sequence, can be determined in solution using experimental approaches such as circular dichroism spectroscopy (CD). CD employs optical absorbance to determine the conformation of a protein in solution. Using this method, the arrangement of α-helices, β-sheets, turns, and coils can be determined. While low in resolution, the application of CD can quickly and reliably identify the secondary structure of a protein. For example, following primary structure determination through mass spectrometry, the secondary structure of the birch pollen allergen, Bet v 1, was deduced using CD [5].

Higher-order structures are determined by more advanced methods with higher resolution, particularly X-ray crystallography and nuclear magnetic resonance spectroscopy (NMR). These techniques resolve protein structures at the atomic level, allowing the definition of tertiary and quaternary organization (in fact, primary and secondary structure can be determined at the same time). However, each of these approaches carries disadvantages. X-ray crystallography consumes time and materials and cannot offer insight to protein dynamics. NMR, while providing information on protein dynamics, is limited by the size of the protein being studied; larger macromolecules are precluded. Additionally, the comparability of crystal structures and solution-based structures (as in NMR) remains somewhat controversial. Despite these drawbacks, both approaches are widely used and continue to offer invaluable information toward the understanding of protein structure, function, and interactions. Further, these methods have enabled the classification of many different allergens and promoted a better understanding of allergic disease.

Finally, additional virtual methods are used to uncover biophysical information about proteins like allergens. Programs like Visual Molecular Dynamics (VMD; http://www.ks.uiuc.edu/Research/vmd), and the related NAMD, allow researchers to simulate the interactions of proteins, including their abilities to bind or cross membranes. The ability to model allergens in silico can lead to discoveries regarding cross-reactivity or allergenicity, as has been done for severity of response to peanut allergens [6].

Structural biology of mite allergens

The application of structural biology to the mite allergens has advanced both clinical and immunological understandings of how these proteins promote allergic disease. Allergens from Dermatophagoides are classified based on immune response into major groups 1 and 2 and middle groups 4, 5, and 7 [7]. However, there are at least 23 groups of allergens from the two most common dust mite species, D. farinae and D. pteronyssinus, all of which seem to elicit some degree of immune response [2, 3]. The importance of structural biology approaches to classifying and understanding these allergens has been invaluable—from outlining the basic properties of the proteins, to determining the ability to bind IgE and predicting cross-reactivity with related allergens, and, finally, aiding in the development of immunotherapy against the allergens [8]. Indeed, combining what is known about the major allergen groups with what can be derived from related proteins described in existing databases, much more information will be forthcoming regarding other allergen groups.

The spatial structure has been determined for several of the allergens of Dermatophagoides and B. tropicalis. Initial studies used in the late 1980s and early 90s used traditional molecular and immunological approaches to purify the allergen proteins (affinity chromatography), determine their characteristics (SDS-PAGE), and quantify allergenicity (ELISA, radioimmunoassay) (e.g., [911]). Due to the advent of the many more advanced and specific techniques described above, a wide range of approaches have been employed to derive more detailed information about allergens. Below, we will review what is known about structures for the groups 1, 2, 5, 7, and 13 allergens, and how structural biology has contributed to the understanding of the allergenicity of these proteins.

Group 1 allergens

The group 1 allergens from D. pteronyssinus and D. farinae, Der p 1 and Der f 1, respectively, are major mite allergens, accounting for >50 % of IgE antibodies against total mite extract [12]. First isolated following cDNA cloning, these proteins are 82 % identical in sequence and belong to a family of papain-like cysteine proteases [13]. The crystal structures of both mature Der p 1 and Der f 1, as well as their recombinant prodomains, have been solved [14, 15]. Interestingly, despite extensive sequence homology, mature Der p 1 and Der f 1 do not exhibit identical structures. Der p 1 is monomeric and exhibits a cysteine protease fold and a magnesium binding site [15, 16]. Der f 1 is also monomeric, but lacks a metal binding site [15]. Structural differences are also found at the N-terminal and central loop regions, those sites that are exposed for surface binding. Additionally, Der f 1 appears to have fewer polymorphisms (five known) compared to Der p 1 (23 known) [7]. The prodomains of these two allergens differ from the mature forms with respect to secondary structure [14].

The structural differences between Der p 1 and Der f 1 likely result in differences in IgE binding. While these allergens exhibit high cross-reactivity in humans [17], antibody binding demonstrates a high degree of species specificity [9, 10, 17]: monoclonal antibodies raised against either species are ~95 % species-specific. Four surface patches differ between Der f 1 and Der p 1, and areas these proteins have in common are the putative basis for cross-reactivity between the two allergens. The Der f 1 and Der p 1 structures show the presence of a conserved water binding site at one end and in the catalytic site at the other. The binding of water molecules could be biologically significant and may serve to catalyze the cysteine protease activity of the allergen [14].

The proforms of both Der p 1 and Der f 1 exhibit less allergenicty than do the mature and recombinant forms. The sites where the prosegments are anchored likely reflect the major IgE epitopes of the two allergens [14]. Based on the crystal structures of other cysteine proteases, molecular modeling of pro-Der f 1 was performed to demonstrate information on the location of IgE-binding sites blocked by the propeptides. To be more exact, the Der f 1 prosegment consists of an N-terminal domain-structure followed by a C-terminal extended loop and is tightly anchored at two regions of the mature enzyme. The prosegment binding loop, the substrate binding cleft surrounding catalytic residue Cys35, and Asn53, the potential N-glycosylation site in mature Der f 1, is located on the surface distant from the catalytic site and the prosegment. These two regions on the molecular surface of the mature portion were blocked by the prosegments as major conformational IgE epitopes. The allergenicities of the proforms are less than those of the mature forms in a broad population of patients. Further, recombinant mature Der p 1 and Der f 1 are similar to the natural types in their structures and allergenicities and retain proteolytic activities [14].

Group 2 allergens

The Der p 2 and Der f 2 allergens belong to the ML domain lipid binding protein family, which also includes the Neimann-Pick type C2 (NPC2) proteins [7]. The crystal structures of these proteins have been determined, revealing anti-parallel beta strands and a hydrophobic cavity [18, 19]. Similarities with the structures of NPC2 proteins suggest a potential role in lipid binding or transport.

The overall tertiary fold of Der f 2 is that of two anti-parallel β-pleated sheets overlying each other. This fold is characteristic of the immunoglobulin superfamily. There are three disulfide bonds in Der f 2, forming covalent bonds between residues 8 and 119, 21 and 27, and 73 and 78 [20, 21], which are critical to the IgE-binding capacity of Der f 2. Indeed, Takai et al. [20] extended initial findings of the importance of these disulfide bonds to the allergenicity of Der f 2 [2224] by studying engineered recombinant proteins with mutations in these bonds. Alterations in the disulfide bonds of Der f 2 led to conformational changes that reduced its IgE binding capacity. Therefore, loosening of the rigid tertiary structure by elimination of key intramolecular interactions in the allergen molecule would be an effective strategy in allergen engineering for safe and effective allergen-specific immunotherapy [20].

The similarities between Der f 2 and Der p 2 explain the extensive cross-reactivity between these allergens [11]. Indeed, the majority of the residues [86 % homology between Der f 2 and Der p 2 [21] on the binding surface of the group 2 allergens is conserved, allowing for similar IgE binding capacities [19]. NMR and structural modeling studies indicate two surface patches representing the IgE binding epitopes [25].

Group 5 allergens

The group 5 allergens from D. pteronyssinus and D. farinae show similarity to the group 5 allergen from another house dust mite species, B. tropicalis. Der p 5 and Blo t 5 are 42 % identical in amino acid sequence [26]. Additionally, the crystal structure of Der p 5 was recently solved, revealing a monomeric protein that appears similar to the solution structure of Blo t 5 [27]. Further, Der p 5 has a helical structure that seems to interact with itself in an anti-parallel manner and a hydrophobic cavity. However, the function and activity of Der p 5 and other group 5 allergens remain unknown.

The group 5 allergens represent an important allergen group because of high responsiveness of patients to dust mite extracts containing these proteins. However, while some cross-reactivity exists among the group 5 allergens, in general IgE binding appears to be species-specific [26]. The Blo t 5 allergen appears to elicit IgE binding in 90 % of asthmatic and allergic patients [28]. However, further work is needed to understand the function and activity of this group of allergens.

Group 7 allergens

Group 7 allergens are fairly abundant in mite extracts. Indeed, these are the 44th most abundant protein produced by mites, ranking not far behind Der p 2 (41st) [29]. These allergens share sequence similarity to the prenylcysteine lyase proteins predicted to exist in insects [30]. Analyses also predict at least three isoforms exist for Der p 7; these may represent different levels of glycosylation [31]. A more recent characterization of Der p 7 indicates a structure comprising anti-parallel β sheets. Additionally, this allergen appears to bind a bacterial lipid product and resembles proteins in the Toll-like receptor pathway [27].

Some earlier work demonstrated that 50 % of dust mite-allergic patients react to Der p 7. This protein also appears to elicit a high T cell proliferation response [32, 33]. This specificity may be related to the similarity of Der p 7 to Toll-like receptors, which are involved in innate immunity [34]. However, further work is necessary to uncover the reason for this specificity.

Group 13 allergens

Group 13 allergens are considered fatty acid binding proteins (FABP) and are fairly abundant in mite extracts [7]. NMR studies of Der f 13 revealed anti-parallel beta strands preceeded by a helix-turn-helix motif [35]. Several surface residues are conserved between group 13 allergens and other proteins involved in the transport of fatty acids, with the highest homology to human brain FABP.

Epitope mapping and mutational analysis indicate that the surface residues are critical to IgE binding. However, despite high abundance in mite extracts and good capacity to bing IgE, the group 13 allergens do not elicit a severe immune response; only some patients produce IgE against these allergens [35].

Immunotherapy against mite allergens

Given the high allergenicity of the groups 1 and 2 allergens from house dust mites, effort has been devoted to developing specific immunotherapies (SIT) against these proteins. Additionally, advances in generating large amounts of recombinant proteins have led to animal-model studies of potential vaccines. For example, the changes induced in IgE binding capacity by altering the disulfide bonds of Der f 2 suggests potential avenues for immunotherapy against the group 2 allergens. Studies of human cells in vitro and of immunized mice indicate that the production of a hypoallergen—an allergenic protein less capable of binding IgE but retaining potent T cell reactivity—through genetic techniques may produce immunity to group 2 allergens [36]. A vaccination was recently developed by creating hypoallergens using recombinant Der p 1 and Der p 2; testing in rabbits demonstrated good levels of efficacy and tolerability [37]. Recent work also suggests that expressing the dominant T cell epitopes of Der p 1 in transgenic rice seed may produce a viable vaccination against group 1 allergens [38].

Several clinical trials have been conducted in recent years to test the efficacy and safety of SIT against the dust mite allergens, albeit with mixed results. Two types of SIT have been used: subcutaneous (SCIT) and sublingual (SLIT). Some have used purified mite extracts; others have used mite extracts combined with adjuvant therapies. In general, it appears that these SIT against mite allergens are effective, but the magnitude of effect varies between studies and types of SIT [3941]. A search of government-sponsored clinical trials in the U.S. (clinicaltrials.gov; August 2012) returned 14 active trials investigating the efficacy of house dust mite SIT. A windfall of findings from these studies—and many more worldwide—can be expected within the next several years.

While SIT still have far to go before widespread use in humans, they offer some hope for eventual reduction of allergic disease. Continued advances in the field will rely heavily on the information generated by structural biology studies of each allergen. The work reviewed here has established strong foundations for progress in both understanding the structural biology of allergens and developing immunotherapies against them.

Conclusions

Proteins produced by mite species, especially the Dermatophagoides, are responsible for allergic disease worldwide. Knowledge gained from structural biology studies of these allergens has established a foundation for development of new therapies to treat and prevent allergic disease. Such studies will continue to contribute greatly to clinical advances, particularly in this era of high-throughput genomics and proteomics.