Abstract
Dual Oxidase 1 (DUOX1) is a prominent immune system component primarily expressed in esophagus, lungs, skin, and urinary bladder including others. DUOX1 is involved in lactoperoxidase-mediated innate immunity at mucosal surfaces by generation of antimicrobial hypothiocyanite at the apical surface of epithelial lining. Upon detection of bacterial pathogens mainly Pseudomonas aeruginosa, DUOX1 is activated in bronchial epithelial cells. Both the host and pathogen enter a redox dual with DUOX1 and hypothiocyanite from host and Pyocyanin (PCN) as a redox active virulence factor from P. aeruginosa. The synergy of the both enzymes permanently oxidizes PCN and thus holds the potential to prevent PCN-induced virulence, which otherwise paves the way for establishment of persistent chronic infection. In this study, we structurally and functionally annotated the DUOX1, predicted its 3d structure, physio-chemical properties, post-translational modifications, and genetic polymorphism analysis with subsequent disease-associated single-nucleotide variations and their impact on DUOX1 functionality by employing in silico approaches. DUOX1 holds greater homology with gorilla and chimpanzee than other primates. The localization signal peptide was present at the beginning of the peptide with cleavage site at 22 aa position. Three distinct functional domains were observed based on homology: An_peroxidase, FRQ1, and oxido-reductase domains. Polymorphism analysis revealed > 60 SNPs associated with different cancers with probable damaging effects. No cancer-associated methylated island was observed for DUOX1. Three-dimensional structure was developed via homology modeling strategy. The proper annotation will help in characterization of DUOX1 and enhance our knowledge of its functionality and biological roles.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The Innate immune system serves as first line of defense against the invading pathogens and restricts the infections by neutralizing the invading potential immunological threats [1,2,3]. This defense system constitutes of various specialized components performing specific tasks assigned to them across the human body. Innate immune components tasked with early detection and clearance of an invading pathogen are triggered within hours of detection of an immunogen in the body [4]. The Dual oxidase 1 (DUOX1) is a prominent membrane protein expressed in airway epithelial cells and functions as NADPH oxidase enzyme [5, 6]. DUOX1 is an important innate immune component respiratory tract. Through generation of hydrogen peroxide (H2O2), it is involved in lactoperoxidase-mediated innate immunity at mucosal surfaces. The generated H2O2 is necessary for thyroid peroxidase and lactoperoxidase (LPO) activities [7, 8]. It has a peroxidase-like domain at N terminal which is potentially responsible for its intrinsic peroxidase activity [6, 9]. The NADPH oxidase activity is calcium dependent with two potential Ca++ binding domains at 828–839 and 864–875 positions [10].
The DUOX1 plays a role in generation of hypothiocyanite, an antimicrobial oxidant in respiratory surface liquid along with LPO, thiocyanate by generation of H2O2 at the apical surface of epithelial lining [11]. Being a membrane protein, DUOX1 potentially plays its part in tracheal epithelial wound repair when activated by ATP released from injured cells [11]. DUOX1 has been reported to be induced by inflammatory cytokines [12, 13] and bacterial stimuli [14]. DUOX1 activation occurs in bronchial epithelial cells when challenged by Pseudomonas aeruginosa infection. Bacterial flagellum and LPS along with Type Three Secretion System components indirectly elicited DUOX1-mediated H2O2 production via TLR-dependent signaling [7, 15].
During respiratory infection, P. aerginosa virulence factor Pyocyanin (PCN) inhibits H2O2 generation by DUOX1, thus improving pathogen’s survival in the respiratory cavity by actively reducing the availability of its substrate NADPH2 in the cytoplasm [12]. PCN is a redox active, cytotoxic tricyclic metabolite considered responsible for immune modulatory, pro-inflammatory, enzyme inactivation, and pro-apoptotic activities during P. aeruginosa infection [12, 16,17,18,19,20]. PCN is reported to induce apoptosis in neutrophils via interfering with the mitochondrial respiratory chain, releasing reactive oxygen species and cytochrome c into cytoplasm and activation of mitochondrial acid sphinglomyelinase in cytoplasm [19, 21]. Peroxidases (Microperoxidase-11, LPO, and myeloperoxidase) in the presence of adequate quantities of H2O2 are capable of permanently oxidizing PCN, thus neutralizing it cytotoxicity [12, 22]. The DUOX1 serves as an important source of H2O2, in the extra-cellular environment [9]. The synergistic action of both DUOX1 and peroxidase enzymes holds the potential protective shield to prevent PCN-induced P. aeruginosa virulence, which otherwise usually paves the way for establishment of persistent chronic infection. Thus, the outcome of this redox warfare between the host and the pathogen determines the course of infection (Fig. 1).
The increase in antibiotic resistance and emergence of Pan-drug resistant P. aeruginosa strains, the need of the hour is to explore unconventional antibacterial approaches [23]. Being a frontline immune component, DUOX1 holds a vital position protecting the pulmonary cavity from invading pathogens and thus is subjected to immune evasive and modulatory assaults from them. Comprehensive knowledge of pulmonary infection dynamics, primarily host–pathogen interaction (the way pathogen manipulates host responses and escape clearance simultaneously inflecting injuries to host) is astutely vital for designing any sort of therapeutic and/or preventive interventions. This requires a better understanding of DUOX1 [5]. Without proper annotation, DUOX1 cannot be used as therapeutic option. In this study, we endeavored to structurally and functionally annotate the DUOX1 gene and protein using servers and tools available in public domain. The contributions of this study are prediction of protein’s three-dimensional structure, physio-chemical properties, post-translational modifications, and genetic polymorphism analysis with subsequent disease-associated single-nucleotide variations and their impact on DUOX1 functionality by employing in silico approaches. Human DUOX1 was more homologous with the gorilla and chimpanzee than other mammalian species. The protein harbors the 21 aa long localization signal peptide at the beginning of the peptide. Three distinct functional domains were observed based on homology: An_peroxidase, FRQ1, and oxido-reductase domains. Polymorphism analysis revealed > 60 SNPs associated with different cancers with probable damaging effects. No cancer-associated methylated island was observed for DUOX1. Three-dimensional structure was developed via homology modeling strategy. The proper annotation will help in characterization of DUOX1 and enhance our understanding of its functionality and biological roles ultimately leading to its potential therapeutic applications.
This paper is organized as follows. Section 2 contains the adopted methodology for annotation enlisting all the tools and servers employed. In Sect. 3, the results are presented encompassing the DUOX1 annotation, while in Sect. 4, the results are discussed and Sect. 5 concludes the findings.
2 Methodology
2.1 Sequence Retrieval, Homology, and Phylogenetic Analysis
DUOX1 localization on the genome was determined by searching within human genome draft sequence on NCBI (http://www.ncbi.nlm.nih.gov/). Phylogenetic analysis provides vital information regarding gene variation among the included species. Peptide sequences of Human DUOX1 were obtained from Uniprot (http://www.uniprot.org/) along with other available species, i.e., Caenorhabditis elegans, Canis lupus familiaris, Sus scrofa, Rattus norvegicus, Danio rerio, Mus musculus, Poecilia Formosa, Oreochromis niloticus, Lepisosteus oculatus, Xiphophorus maculatus, Pan troglodytes, Bos Taurus, Callithrix jacchus, Ictidomys tridecemlineatus, Sarcophilus harrisii, Latimeria chalumnae, Oryzias latipes, Cavia porcellus, Pongo abelii, Nomascus leucogenys, Oryctolagus cuniculus, Otolemur garnettii, Papio Anubis, Felis catus, Ovis aries, Equus caballus, Loxodonta africana, Takifugu rubripes, Gorilla gorilla gorilla, Myotis lucifugus, Monodelphis domestica, Alligator mississippiensis, Callithrix jacchus, Macaca mulatta, Xenopus tropicalis, Bactrocera cucurbitae, and Lygus hesperus. Online multiple sequence alignment tool Clustal Omega was used to construct homology (Neighbour-joining) tree for determining DUOX1’s evolutionary trend with default parametric settings [24].
2.2 Sequence-Based Physico-chemical Characterization
Protparam Server (http://web.expasy.org/protparam/) was employed to calculate overall physical and chemical parameters and structural/functional motifs for human DUOX1 peptide sequence [25]. Signal peptide prediction for DUOX1 was done using online tool SignalP v4.1 (http://www.cbs.dtu.dk/services/SignalP/) [26]. Protscale Server (http://web.expasy.org/ protscale/) was used to calculate physio-chemical properties, i.e., polarity, % buried residues, average flexibility, bulkiness, hydrophilicity, and relative mutability for human DUOX1 [27].
2.3 Functional Domain Assessment and Polymorphism Analysis
Functional domains for DUOX1 proteins were studied using Domain Mapping of Disease Mutations (DMDM) database (http://www.bioinf.umbc.edu/dmdm/) [28]. DMDM employs Hidden Markov Model-based sequence alignment tool (HMMer) to map genetic polymorphisms (mutations/variations) on the protein at domain and visualized. Default parametric settings were used. Disease association with genetic polymorphism was also checked with BioMuta v3.0 database (https://hive.biochemistry.gwu.edu/biomuta). BioMuta is a curated single-nucleotide variation (SNV) and disease association database in special reference to oncogenesis [29, 30]. PROVEAN/SIFT and PolyPhen2 Prediction tools were used for studying the impact of variations on the protein’s functionality. SIFT tool predicts the impact of amino acid substitution on the proteins function. The prediction is made from analyzing conservation of amino acids via PSI_BLAST in the sequence alignments derived from closely related sequences [31]. PROVEAN tool provides a swift computational approach to predict effects caused by amino acid substitution or indel on biological function of the protein. This tool identifies nonsynonymous and/or indel variants predicted to be important for proteins functionality by developing pairwise sequence alignments with generation of precomputed predictions for all single amino acid substitutions and deletions at every position in the query protein [32]. PolyPhen tool was used to annotate coding nonsynonymous SNPs. It utilizes high-quality multiple sequence alignment pipeline based on machine learning method optimized for high-throughput analysis of the next generation sequencing data [33].
2.4 Sub-cellular Localization
Sub-cellular localization is important in respect of better understanding the functionality of any biomolecule in a biological system. Sub-cellular localization for DUOX1 was evaluated via CELLO server [34], while TMHMM, HMMTOP servers [35], and UniProt [36]. Topological information was used to calculate transmembrane structural domains with in the protein structure inferred from primary sequence of the protein.
2.5 Assessment of Methylation Sites
DNA methylation plays vital role in epigenetic regulation of genetic information. Methylation sites in DUOX1 were analyzed using MethyCancer (http://methycancer.psych.ac.cn/MethyCancer.do) provided by MethyCancer Database of Human DNA Methylation and Cancer [37]. DUOX1 gene (45145823–45149381) was analyzed for CpG sites. MethyCancer tool utilizes six data sources BIG/UHN [38], Columbia University [39], Human Epigenome Project HEP [40], MethDB [41], CpG130 CGI [42], and UCSC CGI [43]. Default settings were used for the analysis.
2.6 Prediction of Post-translational Modifications (PTMs)
Center for Biological Sequence Analysis CBS (http://www.cbs.dtu.dk/services/) provides a rich set of servers to predict post-translational modifications. The available servers NetCGlyc 1.0 [44], NetCorona 1.0 [45], NetGlycate 1.0 [46], NetNGlyc 1.0 [47], NetOGlyc 4.0 [48], NetPhos 3.1 [49], and ProP 1.0 [50] were used to predict sites for glycosylation, mannosylation, and phosphorylation for DUOX1 protein from its sequence.
2.7 Functional Protein Association Networks
Interacting partners of a biomolecule in a biological system influence its functionality. Without properly understanding the causes and consequences of these interactions, the complete comprehension of the biological role played by a protein could not be implicated. Search Tool for Retrieval of Interacting Genes and proteins (STRING v 10) provides protein–protein interaction and cluster of orthologs analysis servers using its database resources [51]. It was used to identify the interactions and associations involving DUOX1.
2.8 Protein Structure Building
Standard practice for protein structure prediction via homology modeling requires availability of suitable template for structure with good sequence identity and coverage (query sequence identity > 40% and query coverage > 50%). Protein Databank (PDB) was searched via BLAST for suitable template with above-mentioned parameters. Lack of such template in PDB was a major issue in predicting the DUOX1 3d structure. Other factors, i.e., the transmembrane nature and size of the protein (1551aa), complicated the structure prediction, thus limiting the usefulness of Ab initio structure building. Lack of template issue was overcame by relaxing PDB search parameters with focus on identifying domainwise similarity among available PDB structures (query sequence identity ≥ 25%). This approached yielded similar domains and hence were used for structure prediction via Swiss Model online protein prediction server [52]. The structure was built in domain-by-domain manner. The resultant structure was analyzed by Rampage server for Ramachandran plot to evaluate the predicted structure quality [53]. Predicted protein structure was visualized via Biovia Discovery Studio v 4.5 [54].
3 Results
3.1 Sequences’ Retrieval and Homology Tree Construction
DUOX1 is located on q arm of human chromosome 15 (15q21.1) precisely between 45,129,994 and 45,165,578 position. The gene spans a total of 35,585 bp and contains 35 exons, encoding 1551 amino acids. The chromosomal location, neighbouring genes along with their respective position and exon structure of DUOX1 gene, is shown in Fig. 2a–c. Peptide sequence of Human DUOX1 (Q9NRD9) along with various other species was subjected to multiple sequence alignment using Clustal Omega, and the alignment was used to generate homology tree (Fig. 3). Human DUOX1 was clustered together with the great apes Gorilla (G3R9G4) and Chimpanzee (H2Q9C8), while closely placed with Sumatran Orangutan (H2NN45) as members of Hominidae family. Gibbons despite member of Hominoidea as lesser apes was clustered with Tasmanian devil and opossums potentially due to geographical distribution of these mammals.
3.2 Prediction of Signal Peptide and Structural Domains of DUOX1
SignalP Server predicted a cleavage site between position 21 and 22, as shown in Fig. 4. Server computed C-, S-, and Y-scores for DUOX1. Amino acid 22 exhibited maximum C- and Y-scores, while max S score was calculated for position 3 all greater than the standard value (0.5). The signal peptide was estimated to be 1–21 segment of the protein. The Uniprot database provided curated and reviewed topological information regarding the transmembrane domains of DUOX1. The protein had 7 transmembrane helical domains with extra-cellular N terminal and cytoplasmic C terminal domains, since the information obtained from Uniprot was manually reviewed and was thus preferred over the predictions servers. Both the domain prediction servers stumbled on complex arrangement of the transmembrane domains, as amino acid 1248 was only a single amino acid spanned by two transmembrane domains. The two servers failed to consider this lone extra-cellular residue and assumed it as continuation of a large transmembrane domain instead of two separate domains.
3.3 Functional Domain Analysis
DMDM provided a centralized portal for studying functional domains of DUOX1, as depicted in Fig. 2d and Table 1. Three major domains were observed in the protein, namely, An_peroxidase (pfam03098) on N terminus 29–557 aa exposed on the extra-cellular surface, FRQ1 domain (COG5126) 769–929 aa hidden beneath plasma membrane in cytoplasm and third domain with oxido-reductase activity was COG4097 1054–1494 aa. The An_peroxidase domain (Animal haem peroxidase) is a member of the superfamily cl14561. The FRQ1 domain contained EF-hand motifs which provide the Ca++ binding sites and is a member of the superfamily cl25352. The c terminus domain had overlapping motifs for FAD- and NAD-binding pockets along with oxido-reductase domains. This domain was classified as member of the superfamily cl06868.
3.4 Single-Nucleotide Polymorphism of DUOX1
The DMDM database provided three polymorphisms for DUOX1 p.CYS1026ARG, p.ILE962THR, and p.LEU1178PHE, but could not associate such variations with any disease condition. The BioMuta provided a comprehensive overview of DUOX1 genetic polymorphism with cancer type and frequency. The observed SNPs for DUOX1 gene for each cancer type were: Uterine cancer 128, Skin cancer 106, Kidney Cancer 102, Breast cancer 98, Stomach cancer 96, and Lung cancer 82 (Fig. 5a). While top three most frequent SNPs were occurring at 297 aa observed in 18 patients, 1393 in 16 patients, and 975 in 14 patients (Fig. 5b). All single-nucleotide polymorphs and somatic cancer single-nucleotide variants are shown in supplementary Tables 1 and 2, respectively. Based on PROVEAN/SIFT and PolyPhen2 predictions, somatic cancer single-nucleotide variants’ (SNVs) functional impact of the DUOX1 SNPs is depicted in Fig. 6a, b. PROVEAN/SIFT predicted deleterious/damaging 62, deleterious/tolerated 12, neutral/damaging 11, and neutral/tolerated 44, while PolyPhen2 predicted benign 43, possibly damaging 19, and probably damaging 67 genetic variants. Both the tools predicted the same 70 SNPs as lacking any functional association. Domainwise SNV frequency distribution showed higher degree of SNVs in An_peroxidase (66: damaging 23, neutral 14 and NA 29) followed by COG4097 (46:30, 9, 7) and FRQ1 (17:9, 2, 6), and the signal peptide also had a SNV at position 4, although this peptide segment does not play any direct role in functioning of the protein, but it is vital, as it marks the sub-cellular location of the protein, i.e., plasma membrane.
3.5 Prediction of Hydrophilicity, Accessibility, Polarity, Flexibility, Mutability, and Bulkiness of DUOX1
DUOX1 was subjected physio-chemical analyses (polarity, % buried residues, average flexibility, bulkiness, hydrophilicity, and relative mutability) using Protscale Server available at expasy platform. Higher the generated score greater is the probability for each parameter. The average flexibility values are in the range of 0.36 (1209 aa) to 0.512 (160 aa). The bulkiness are between 10.671 (592 aa) and 20.67 (1232 aa), while accessibility is calculated to be 3.3 (1471 aa) and 7.589 (1268 aa). The least buried residue is 269 with calculated values of 2.044 and most buried one is 1388 aa valuing 11.8. The values for polarity are calculated with highly polar residue to be 1215 aa with 34.841 and least polar one 1388 aa having value 0.072. Hydrophobicity is estimated using Hopp and Woods method, and the values ranged from − 2.011 (1186 aa) to 1.733 (853 aa). The relative mutability calculations showed a patch of highly mutable region comprising of three residues (354, 355, and 356 with values 104.111, 103.111, and 103.556, respectively), while least mutable residue was 252 aa with values 44.444. The highest and lowest scores computed for each parameter of DUOX1 are shown in Table 2.
3.6 Methylation Sites in DUOX1
MethyCancer tool predicted a 100 bp CpG Island type III at position 45,144,265–45,144,364, but no methylation site or island was observed in DUOX1 gene segment. This upstream island was predicted employing BIG and UHN data sources.
3.7 Post-translational Modifications of DUOX1
CBS prediction server predicted multiple post-translational modifications for DUOX1. Cutoff for each parameter was set at 0.5 by default, and higher values indicated higher probability. The propeptide signal cleavage site was calculated to be between 21 and 22 aa using ProP 1.0 prediction server. Net Corona 1.0 server estimated two Proteinase cleavage sites at positions 245 and 376 aa (Fig. 7a). C-mannosylation sites were calculated using NetCGlyc 1.0 server which yielded lack of any mannosylation site in the protein. NetGlycate 1.0 server estimated a total of 13 glycation sites, while 5 N-linked glycosylation sites and 15 O-linked glycosylation sites were calculated by NetNGlyc1.0 and NetOGlyc 4.0 servers, respectively (Fig. 7b, c). Overall 123 phosphorylation sites were predicted by NetPhos 3.1 server. Serine residue was to host highest number of phosphorylation site (74), while Threonine had 41 and Tyrosine harbored only 8 sites (Fig. 6d). Sites with their respective kinases are shown in Table 3.
3.8 Physiological Parameters of DUOX1
Physiological parameters are predicted using ProtParam server. DUOX1 has molecular weight around 177235.09 Da with speculated formula C8023H12364N2212O2233S53 and the total number of atoms is 24,885, while the theoretical pI is 8.12. Half-life estimated in mammalian reticulocytes, in vitro for DUOX1 was 30 h and it was > 20 h in yeast in vivo. The protein was slightly instable with calculated instability index to be 47.83 and aliphatic index 88.56. The estimated grand average of hydropathicity (GRAVY) is − 0.190.
3.9 Homology Modeling and Evaluation of Model Stability
PDB search with relaxed parameters yielded four templates Human Myloperoxidase (3fgp), EF-hand protein (2q4u), Transmembrane domain of Cylindrospermum stagnale NADPH Oxidase 5 (5o0t), and Dehydrogenase domain of Cylindrospermum stagnale NADPH Oxidase 5 (5o0x). DUOX1 protein was divided into four functional domains based on structure-sequence alignment obtained from the second PDB search. Human myeloperoxidase had 28% sequence similarity with first domain of DUOX1 protein (1–700 aa), EF-hand protein 25% similar to second domain (701–1000 aa), transmembrane domain of NADPH Oxidase 5 was 25% similar to third domain (1001–1260 aa), and dehydrogenase domain of NADPH Oxidase 5 was 35% similar to the last domain (1261–1551 aa). The structure was built in domain wise manner using Swiss Model web server (Fig. 8a, b). The figure shows the predicted structures in domainwise manner in accordance with structure building scheme. Three segment (signal peptide 1–21 aa, gap I 581–751 aa and gap II 931–1086 aa) were not modelled due to lack of any structural domain and availability of template. Ramachandran plot was used to evaluate the model quality (Fig. 8c). Being a membrane embedded protein, the stability indicators are relatively lower than those of non-membrane proteins.
3.10 Protein–Protein Interaction and Cluster of Orthologs Analysis for DUOX1
Protein interactions at cellular level are calculated using STRING Database. The protein–protein interaction analysis yielded a multiple node interaction network map (Fig. 9). The resultant interaction network analysis contained a total number of nodes to be 11 and 29 edges with 5.27 average node degree. The clustering coefficient was 0.786, and PPI enrichment value was 2.47e-06. DUOX1 is classified into non-supervised orthologous group (NOG19510) (Fig. 9a). DUOX1 is involved in biological processes: reactive oxygen species metabolic process (GO:0072593), superoxide metabolic process (GO:0006801), hydrogen peroxide metabolic process (GO:0042743), positive regulation of immunoglobulin-mediated immune response (GO:0002891), positive regulation of interleukin-10 production (GO:0032733), superoxide anion generation (GO:0042554), positive regulation of defense response to bacterium (GO:1900426), regulation of thyroid hormone generation (GO:2000609), regulation of toll-like receptor 2 signaling pathway (GO:0034135), hydrogen peroxide biosynthetic process (GO:0050665), regulation of inflammatory response (GO:0050727), cytokine production involved in immune response (GO:0002367), regulation of defense response (GO:0031347), respiratory burst (GO:0045730), response to organic cyclic compound (GO:0014070), metabolic process (GO:0008152), regulation of reactive oxygen species metabolic process (GO:2000377), regulation of response to external stimulus (GO:0032101), and innate immune response in mucosa (GO:0002227). DUOX1 was also involved in Molecular Function (GO); oxido-reductase activity, acting on NAD(P)H, oxygen as acceptor (GO:0050664), oxido-reductase activity, acting on NAD(P)H (GO:0016651), superoxide-generating NADPH oxidase activity (GO:0016175), Cellular Component (GO); NADPH oxidase complex (GO:0043020) and KEGG Pathways; Leishmaniasis (5140), Osteoclast differentiation (4380), Leukocyte transendothelial migration (4670), Phagosome (4145), and Inflammatory bowel disease (5321). All the associated functions and pathways require DUOX1’s peroxidase activity and H2O2 generation.
4 Discussion
DUOX1 is one of a major receptor dependent source of oxidants in cells [55]. Its prominent role is in innate airway host defense via production of inflammatory mediators and promotion of cell migration and wound healing [11, 16, 55,56,57,58]. It has recently been reported to be involved in multiple processes ranging from innate immunity at mucosal surfaces to Regulation of Epidermal Growth Factor Receptor Signaling [59]. Platelet-derived growth factor directs wound healing and tissue regeneration processes via DUOX1-generated H2O2 [55]. Being localized at epithelial surface of airway cavity, it forms first line of defense against invading pathogens, and consequently, it is subject to immune evasion and immunomodulation by invading pathogens [60, 61]. Hence, its regular ambient functionality is essential for innate protection [62]. In this study, we have analyzed DUOX1 protein employing in silico approaches for acquiring better understanding of its structure and functionality. Human DUOX1 sequence was subjected to structural analysis. The structure prediction was complicated due to three factors: sheer size of the protein, its transmembrane nature, and lack of template. These factors resulted in relaxation of template search criteria and employment of homology modeling approach with templates having lower sequence similarity (< 40%). Proteins with lower sequence similarity were considered (which were previously discarded) and the structure was developed in domain-by-domain manner. Due to shorter query coverage of the selected templates, three segments could not be modelled. The modelled domains were subjected to Ramachandran plot analysis. Being a membrane protein and having seven transmembrane domains, the overall quality of the predicted structure was lower and also caused difficulty in structure building as well. Two of the transmembrane domains 1227–1247 and 1249–1269 were separated by a single extra-cellular amino acid at position 1248. This caused problem in transmembrane domain prediction and all the used prediction servers failed to consider this lone amino acid as extra-cellular and hence considered 1227–1269 as a continuous transmembrane domain. The sensitivity of the prediction servers needs to be enhanced to avoid such problems. ProtParam computations predicted the protein to be unstable with lower aliphatic and stability indexes as a massive membrane protein. Prominent kinases with number of their respective phosphorylation are sites: PKC 45, cdc2 26, PKA 22, and CKII 13, while 69 potential sites were predicted for non-specific kinases. Collectively in the enzyme structure, the two terminal domains peroxidase and NOX (oxido-reductase) domains are linked through transmembrane and EF-hand Ca++ binding sites. Ferredoxin (i.e., FAD and NADPH)-binding domains are characteristic of flavoprotein modules. The electron transfer begins from NADPH to FAD ending to oxygen via heme resulting in generation of H2O2 [28]. All the interaction proteins obtained from STRING database analysis show DUOX1’s involvement in NADPH oxidation along with other oxidases, i.e., NOX and respective activation factors, e.g., DUOXA1, DUOXA2 along with cytokines, and other associating proteins.
Components of cigarette smoke, i.e., acrolein, have been reported to inhibit DUOX1 [63]. DUOX1 gets silenced in metastatic cancer, and also leads to resistance against epidermal growth factor receptor tyrosine kinase inhibitors [64]. Thus, one could assume that hypermethylation of DUOX1 promoter and/or gene could serve as potential marker for oncogenesis in lungs [65]. However, our in silico results did not yielded any potential methylation sites in DUOX1 gene region on chromosome 15. The observed upstream CpG Island might act like a cis-regulatory element and effect the gene expression. Another potential reason for lack of such sites and silencing of DUOX1 in tumors could be the sequence variations arising as SNPs and SNVs. While its higher expression levels have been reported to be associated with better survival in patients [66]. The improved survival might be attributed to prolong exposure to oxidative stress after receiving ionizing radiation therapy [67]. The SNP analysis by DMDM provided us three point mutations when observed in BioMuta data, all these three polymorphisms were present in tumor samples. The silencing of DUOX1 in tumors might be associated with the SNPs observed rather than epigentic. DUOX1 has been reported to play a significant role in keratinocytes differentiation in kidneys [68]. As mentioned in the results, kidney cancers exhibited third highest observed SNPs of DUOX1 gene; hence, there could be a probable association between the two phenomena. A similar role played by DUOX1 in human fetal lung epithelial cells [11, 69], while again, lung cancer among others had higher SNP frequency for DUOX1 gene as per SNVs’ analysis data. This high SNP frequency could also be related to DUOX1 expression levels as observed in NCBI gene expression data (Gene ID: 53905). DUOX1 is highly expressed in esophagus, skin, lungs, and thyroid tissues in comparison with other tissues which also exhibit lower degree of expression [70]. These mutations might serve as prognostic biomarkers.
As discussed previously, the potential involvement of DUOX1 in immune cell recruitment, wound repair, and tissue regeneration and above all antimicrobial defense in respiratory cavity signify its importance in the combating infections with special reference to pulmonary infections caused by P. aeruginosa. Pulmonary microenvironment in cystic fibrotic lungs provides a good case study example for studying infection establishment and immune evasion by P. aeruginosa. PCN secreted by the pathogen inhibits DOX1 and up regulates pro-inflammatory response thus causing injuries. Ironically enough over expressed DUOX1 alone aggravates the cystic fibrotic pathogenesis by pouring abundance of ROS in pulmonary cavity and has been suggested as potential drug target [12, 71]. Over expression of DUOX1 coupled with peroxidase enzymes in cases of P. aeruginosa infection could potentially enable the immune system to detoxify PCN before its entry in to epithelial cells and neutrophils, thus avoiding its severe cytotoxic effects otherwise. At least site directed delivery of microperoxidase 11 could potentially reduce the PCN mediated pathogenicity in pulmonary infections and could positively manage cystic fibrotic pulmonary disease progression. Detailed studies are required to fully understand the potential therapeutic and prognostic aspects of DUOX1 gene.
5 Conclusion
We report employment of in silico approaches for rapid and reliable structural and functional annotation for human Dual Oxidase 1 protein. The adopted approach, based on DUOX1 sequence, yielded structural as well as functional information, i.e., motifs and domains, evolutionary significance, potential post-translational modifications, and its sub-cellular localization. The disease-associated genetic polymorphism might be potentially employed as biomarkers for the studied disease conditions and hence needs further investigations comprising both clinical as well as therapeutic aspects. DUOX1 is an important component of innate immunity against pulmonary infections, and thus, the predictions reported here need to be validated in wet lab to strengthen DUOX1-based antimicrobial protection in the face of unstoppable antibiotic resistance crisis. In conclusion, DUOX1 is a stable transmembrane protein, potentially involved in multiple aspects of pulmonary health. The further studies are needed to enhance our better understanding of its potential applied role in pulmonary health conditions.
References
Medzhitov R, Janeway CA (1997) Innate immunity: the virtues of a nonclonal system of recognition. Cell 91(3):295–298
Abualigah LM, Khader AT, Hanandeh ES (2018) A combination of objective functions and hybrid Krill herd algorithm for text document clustering analysis. Eng Appl Artif Intell 73:111–125
Abualigah LM, Khader AT, Hanandeh ES (2018) Hybrid clustering analysis using improved krill herd algorithm. Appl Intell 48:1–25. https://doi.org/10.1007/s10489-018-1190-6
Bruce Alberts AJ, Lewis J, Raff M, Roberts K, Walter P (2002) Molecular biology of the cell, 4th edn. Garland Science, New York
Damiano S et al (2012) Reactive oxygen species regulate the levels of dual oxidase (Duox1-2) in human neuroblastoma cells. PLoS One 7(4):e34405
Ameziane-El-Hassani R et al (2005) Dual oxidase-2 has an intrinsic Ca2+-dependent H2O2-generating activity. J Biol Chem 280(34):30046–30054
Rada B, Leto TL (2010) Characterization of hydrogen peroxide production by Duox in bronchial epithelial cells exposed to Pseudomonas aeruginosa. FEBS letters 584(5):917–922
Strengert M et al (2014) Mucosal reactive oxygen species are required for antiviral response: role of Duox in influenza a virus infection. Antioxid Redox Signal 20(17):2695–2709
Edens WA et al (2001) Tyrosine cross-linking of extracellular matrix is catalyzed by Duox, a multidomain oxidase/peroxidase with homology to the phagocyte oxidase subunit gp91phox. J Cell Biol 154(4):879–892
Sigrist CJ et al (2005) ProRule: a new database containing functional and structural information on PROSITE profiles. Bioinformatics 21(21):4060–4066
Wesley UV et al (2007) Airway epithelial cell migration and wound repair by ATP-mediated activation of dual oxidase 1. J Biol Chem 282(5):3213–3220
Rada B et al (2008) The Pseudomonas toxin pyocyanin inhibits the dual oxidase-based antimicrobial system as it imposes oxidative stress on airway epithelial cells. J Immunol 181(7):4883–4893
Harper RW et al (2005) Differential regulation of dual NADPH oxidases/peroxidases, Duox1 and Duox2, by Th1 and Th2 cytokines in respiratory tract epithelium. FEBS Lett 579(21):4911–4917
Gattas MV et al (2009) Oxidative epithelial host defense is regulated by infectious and inflammatory stimuli. Free Radic Biol Med 47(10):1450–1458
Raoust E et al (2009) Pseudomonas aeruginosa LPS or flagellin are sufficient to activate TLR-dependent signaling in murine alveolar macrophages and airway epithelial cells. PLoS One 4(10):e7259
Caldwell CC et al (2009) Pseudomonas aeruginosa exotoxin pyocyanin causes cystic fibrosis airway pathogenesis. Am J Pathol 175(6):2473–2488
Britigan BE, Railsback MA, Cox CD (1999) The Pseudomonas aeruginosa secretory product pyocyanin inactivates α1 protease inhibitor: implications for the pathogenesis of cystic fibrosis lung disease. Infect Immun 67(3):1207–1212
Allen L et al (2005) Pyocyanin production by Pseudomonas aeruginosa induces neutrophil apoptosis and impairs neutrophil-mediated host defenses in vivo. J Immunol 174(6):3643–3649
Manago A et al (2015) Pseudomonas aeruginosa pyocyanin induces neutrophil death via mitochondrial reactive oxygen species and mitochondrial acid sphingomyelinase. Antioxidants Redox Signal 22(13):1097–1110
Hall S et al (2016) Cellular effects of pyocyanin, a secreted virulence factor of Pseudomonas aeruginosa. Toxins 8(8):236
Kroemer G, Reed JC (2000) Mitochondrial control of cell death. Nat Med 6(5):513
Reszka KJ et al (2004) Oxidation of pyocyanin, a cytotoxic product from Pseudomonas aeruginosa, by microperoxidase 11 and hydrogen peroxide. Free Radic Biol Med 36(11):1448–1459
Fernandes M et al (2016) Extensively and pan-drug resistant Pseudomonas aeruginosa keratitis: clinical features, risk factors, and outcome. Graefes Arch Clin Exp Ophthalmol 254(2):315–322
McWilliam H et al (2013) Analysis tool web services from the EMBL-EBI. Nucleic Acids Res 41(W1):W597–W600
Garg VK et al (2016) MFPPI–Multi FASTA ProtParam interface. Bioinformation 12(2):74–77
Emanuelsson O et al (2007) Locating proteins in the cell using TargetP, SignalP and related tools. Nat Protoc 2(4):953–971
Gasteiger E et al. (2005) Protein identification and analysis tools on the ExPASy server. Springer, New York
Peterson TA et al (2010) DMDM: domain mapping of disease mutations. Bioinformatics 26(19):2458–2459
Wu TJ et al. (2014) A framework for organizing cancer-related variations from existing databases, publications and NGS data using a High-performance Integrated Virtual Environment (HIVE). Database (Oxford) 2014:bau022
Pan Y et al (2014) Human germline and pan-cancer variomes and their distinct functional profiles. Nucleic Acids Res 42(18):11570–11588
Kumar P, Henikoff S, Ng PC (2009) Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat Protoc 4(7):1073–1081
Choi Y, Chan AP (2015) PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels. Bioinformatics 31(16):2745–2747
Adzhubei IA et al (2010) A method and server for predicting damaging missense mutations. Nat Methods 7(4):248–249
Yu C-S et al (2014) CELLO2GO: a web server for protein subCELlular LOcalization prediction with functional gene ontology annotation. PLoS One 9(6):e99368
Tusnady GE, Simon I (2001) The HMMTOP transmembrane topology prediction server. Bioinformatics 17(9):849–850
Bairoch A, Apweiler R (2000) The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res 28(1):45–48
He X et al (2008) MethyCancer: the database of human DNA methylation and cancer. Nucleic Acids Res 36(Database issue):D836–D841
Schumacher A et al (2006) Microarray-based DNA methylation profiling: technology and applications. Nucleic Acids Res 34(2):528–542
Rollins RA et al (2006) Large-scale structure of genomic methylation patterns. Genome Res 16(2):157–163
Bradbury J (2003) Human epigenome project-up and running. PLoS Biol 1(3):E82
Amoreira C, Hindermann W, Grunau C (2003) An improved version of the DNA Methylation database (MethDB). Nucleic Acids Res 31(1):75–77
Takai D, Jones PA (2003) The CpG island searcher: a new WWW resource. In Silico Biol 3(3):235–240
Gardiner-Garden M, Frommer M (1987) CpG islands in vertebrate genomes. J Mol Biol 196(2):261–282
Julenius K (2007) NetCGlyc 1.0: prediction of mammalian C-mannosylation sites. Glycobiology 17(8):868–876
Kiemer L et al (2004) Coronavirus 3CL pro proteinase cleavage sites: Possible relevance to SARS virus pathology. BMC Bioinform 5(1):72
Johansen MB, Kiemer L, Brunak S (2006) Analysis and prediction of mammalian protein glycation. Glycobiology 16(9):844–853
Gupta RE, Jung E, Brunak S (2004) Predicition of N-glycosylation sites in human proteins. Center for Biological Sequence Analysis at Technical University of Denmark DTU
Steentoft C et al (2013) Precision mapping of the human O-GalNAc glycoproteome through SimpleCell technology. EMBO J 32(10):1478–1488
Blom N et al (2004) Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence. Proteomics 4(6):1633–1649
Duckert P, Brunak S, Blom N (2004) Prediction of proprotein convertase cleavage sites. Protein Eng Design Se 17(1):107–112
Szklarczyk D et al (2011) The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored. Nucleic Acids Res 39(suppl 1):D561–D568
Guex N, Peitsch MC (1997) SWISS-MODEL and the Swiss-Pdb Viewer: an environment for comparative protein modeling. Electrophoresis 18(15):2714–2723
Lovell S et al (2003) Structure validation by Calpha geometry: phi, psi, and Cbeta deviation. Proteins 50:437e450
BIOVIA DS (2015) Discovery studio modeling environment, release 4.5. Dassault Systèmes, San Diego
Tyurin-Kuzmin PA et al (2016) Nox4 and Duox1/2 mediate redox activation of mesenchymal cell migration by PDGF. PLoS One 11(4):e0154157
Gingerich A et al (2016) Hypothiocyanite produced by human and rat respiratory epithelial cells inactivates extracellular H1N2 influenza A virus. Inflamm Res 65(1):71–80
Chang S et al (2013) Dual oxidase regulates neutrophil recruitment in allergic airways. Free Radic Biol Med 65:38–46
Cho DY et al (2013) Expression of dual oxidases and secreted cytokines in chronic rhinosinusitis. Int Forum Allergy Rhinol 3(5):376–383
Van Der Vliet A et al (2016) Airway epithelial DUOX1 mediates EGFR activation, mucus metaplasia, and remodeling during allergic asthma, in A13. NOVEL INNATE IMMUNE PATHWAYS IN SEVERE ASTHMA. Am Thoracic Soc A1004. Am J Respir Crit Care Med 193:A1004. https://www.atsjournals.org/doi/pdf/10.1164/ajrccm-conference.2016.193.1_MeetingAbstracts.A1004
Chai W et al (2014) Pseudomonas pyocyanin stimulates IL-8 expression through MAPK and NF-kappaB pathways in differentiated U937 cells. BMC Microbiol 14:26
Rada B, Leto TL (2010) Characterization of hydrogen peroxide production by Duox in bronchial epithelial cells exposed to Pseudomonas aeruginosa. FEBS Lett 584(5):917–922
Kwon J et al (2010) The nonphagocytic NADPH oxidase Duox1 mediates a positive feedback loop during T cell receptor signaling. Sci Signal 3(133):ra59
Danyal K et al (2016) Acrolein and thiol-reactive electrophiles suppress allergen-induced innate airway epithelial responses by inhibition of DUOX1 and EGFR. Am J Physiol Lung Cell Mol Physiol 311(5):L913–L923
Little A et al (2016) DUOX1 silencing in lung cancer promotes EMT, cancer stem cell characteristics and invasive properties. Oncogenesis 5(10):e261
Little AC et al (2016) DUOX1 expression in lung cancer disrupts pro-oncogenic activation mechanisms and localization of Src and EGFR. AACR, Philadephia
Chen S et al (2016) Dual oxidase 1: a predictive tool for the prognosis of hepatocellular carcinoma patients. Oncol Rep 35(6):3198–3208
Ameziane-El-Hassani R et al (2015) NADPH oxidase DUOX1 promotes long-term persistence of oxidative stress after an exposure to irradiation. Proc Natl Acad Sci USA 112(16):5051–5056
Choi H et al (2014) Hydrogen peroxide generated by DUOX1 regulates the expression levels of specific differentiation markers in normal human keratinocytes. J Dermatol Sci 74(1):56–63
Fischer H et al (2007) Developmental regulation of DUOX1 expression and function in human fetal lung epithelial cells. Am J Physiol Lung Cell Mol Physiol 292(6):L1506–L1514
Fagerberg L et al (2014) Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics. Mol Cell Proteom 13(2):397–406
Pongnimitprasert N et al (2008) Potential role of the “NADPH oxidases” (NOX/DUOX) family in cystic fibrosis. Ann Biol Clin (Paris) 66(6):621–629
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Rashid, M.I., Ali, A. & Andleeb, S. Functional Annotation and Analysis of Dual Oxidase 1 (DUOX1): a Potential Anti-pyocyanin Immune Component. Interdiscip Sci Comput Life Sci 11, 597–610 (2019). https://doi.org/10.1007/s12539-018-0308-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12539-018-0308-1