Abstract
Background
The aim of this study was to investigate the gene expression profile of chronic obstructive pulmonary disease (COPD) patients and non-COPD patients.
Methods
Microarray raw data (GSE29133), comprising three COPD samples and three normal controls, were downloaded from Gene Expression Omnibus. Gene expression profiling had been performed using the Affymetrix Human Genome U133 Plus 2.0 GeneChip. Differentially expressed genes were identified by Student’s t test, and genes with p < 0.05 were considered significantly changed. Up- and downregulated genes were submitted to the molecular signatures database (MSigDB) to search for possible associations with previously published gene expression signatures. Furthermore, we constructed a COPD protein–protein interaction (PPI) network and used the connectivity map (cMap) to query for potential drugs for COPD.
Results
A total of 680 upregulated genes and 530 downregulated genes in COPD were identified. The MSigDB investigation found that the upregulated genes closely resembled gene signatures of the interferon response, and that the downregulated genes resembled genes downregulated in erythroid progenitor cells from fetal livers of E13.5 embryos with KLF1 knocked out. A PPI network consisting of 814 genes/proteins and 2,613 interactions was built using the Search Tool for the Retrieval of Interacting Genes (STRING). The cMap analysis ranked helveticoside, disulfiram, and lanatoside C as the top three candidate drugs for COPD.
Conclusion
Comprehensive analysis of the gene expression profile of COPD versus control identifies helveticoside, disulfiram, and lanatoside C as candidate drugs for COPD. These findings may open a new avenue for the medical treatment of patients with COPD.
Introduction
Chronic obstructive pulmonary disease (COPD) is a major cause of chronic morbidity and mortality throughout the world. It is the third leading cause of death in the United States. This condition is characterized by airflow limitation associated with an abnormal inflammatory response in the lungs due to exposure to cigarette smoke and noxious particles or gases [1]. COPD is a slowly progressive and irreversible disorder characterized by functionally abnormal airway obstruction, which is a significant cause of morbidity, mortality, and high health-care costs [2]. Symptoms often worsen over time and can limit the patient’s ability to do routine activities. Severe COPD may prevent the patient from doing even basic activities like walking, cooking, or taking care of hygiene [3].
Therefore, understanding the pathogenesis of COPD and determining its optimal treatment are important parts of the overall management of patients with COPD. COPD is most often diagnosed in middle-aged or older adults [4]. The disease is not contagious. COPD has no cure yet, and the damage to the airways and lungs cannot currently be reversed [5]. However, treatments and lifestyle changes can help patients feel better, stay more active, and slow the progress of the disease [6]. Elderly patients with exacerbations of COPD present special challenges, including difficulties in diagnosis.
Biomedical researchers have made significant progress against COPD using molecular biology, cell biology, genetics, and other experimental biology [7, 8]. However, these researchers still face a great challenge against COPD since the methodology of classic experimental biology is based on studying individual genes and proteins and treating the organism as a simple and linear system, which is not sufficient to solve the problems of such complex diseases. Therefore, it is clear that new methodologies and techniques need to be used to analyze the molecular mechanisms of complex diseases such as COPD, and provide new solutions to prevent and cure these diseases.
Recently, Ning et al. [9] employed microarray analysis to identify differentially expressed genes (DEGs) and found a select number of genes differentially expressed between GOLD-2 and GOLD-0 smokers, which were confirmed by real-time quantitative RT-PCR. These genes encode transcription factors (EGR1 and FOS), growth factors or related proteins (CTGF, CYR61, CX3CL1, TGFB1, and PDGFRA), and an extracellular matrix protein (COL1A1). In addition, the systematic evaluation of COPD and its associated genes has provided a new direction for preventing and curing the disease. Gan et al. [10] identified various systemic inflammatory markers, such as C-reactive protein (CRP), fibrinogen, leukocytes, tumor necrosis factor-α (TNF-α), and interleukins 6 and 8, that are closely associated with COPD.
To better understand the molecular basis of COPD, we proposed a systems biology approach that integrates expression profile data to identify genes and pathways responsible for COPD. This approach consisted of three steps: First, we screened a set of DEGs using array data sets between normal and COPD samples. Next, we submitted the DEGs to the molecular signatures database (MSigDB) to search for a possible association with other previously published gene expression signatures. Finally, we constructed a COPD protein–protein interaction (PPI) network and used connectivity map (cMap) to query for potential drugs for COPD. Our research highlights the DEGs-related phenotype and the mechanism related to the pathogenesis of COPD, which may provide novel insight into the development of a therapy strategy.
Materials and Methods
Microarray Data Set
Microarray raw data (GSE29133), comprising three COPD samples and three normal controls, were downloaded from Gene Expression Omnibus (GEO). Gene expression profiling had been performed using the Affymetrix Human Genome U133 Plus 2.0 GeneChip. We recalculated the gene expression signal intensities with the Robust Multi-array Average (RMA) algorithm [12], using custom chip description files [11].
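RMA combines background correction, quantile normalization across arrays, and probe-set summarization. As a minimal sketch of the quantile normalization step only, the following Python function forces every array to share the same intensity distribution. The function name and the toy intensities are illustrative assumptions and are not taken from GSE29133; ties receive no special treatment, and in practice an established RMA implementation would be used.

```python
def quantile_normalize(arrays):
    """Quantile-normalize equal-length intensity lists (one list per array).

    Hypothetical helper for illustration; ties are broken by original order.
    """
    n_probes = len(arrays[0])
    sorted_arrays = [sorted(a) for a in arrays]
    # Reference distribution: mean of the k-th smallest value across arrays.
    reference = [
        sum(s[k] for s in sorted_arrays) / len(arrays) for k in range(n_probes)
    ]
    normalized = []
    for a in arrays:
        # Probe indices ordered from lowest to highest intensity.
        order = sorted(range(n_probes), key=lambda i: a[i])
        out = [0.0] * n_probes
        for rank, probe_index in enumerate(order):
            out[probe_index] = reference[rank]
        normalized.append(out)
    return normalized


# Toy log-intensities for two arrays and three probes (invented values).
toy_arrays = [[5.0, 2.0, 3.0], [4.0, 1.0, 6.0]]
normalized_arrays = quantile_normalize(toy_arrays)
print(normalized_arrays)
```

After normalization each array contains the same set of values, and only the assignment of values to probes differs between arrays.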
Identification of DEGs
DEGs were identified by Student’s t test, and genes with p < 0.05 were considered significantly changed. Up- and downregulated genes were submitted to the MSigDB [13] to search for possible associations with previously published gene expression signatures. The MSigDB is a collection of annotated gene sets for use with Gene Set Enrichment Analysis (GSEA) software. GSEA is a computational method that determines whether an a priori defined set of genes shows statistically significant, concordant differences between two biological states (e.g., phenotypes) [14].
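The DEG screen can be sketched as follows, assuming an equal-variance two-sample Student’s t test applied gene by gene. The gene names and expression values are invented for illustration, and the helper names are hypothetical. The two-sided p value is obtained by numerically integrating the t density with Simpson's rule so that the example needs only the Python standard library; a statistics package would normally be used instead.

```python
import math
from statistics import mean, variance


def t_pdf(x, df):
    """Probability density of Student's t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)


def two_sided_p(t, df, steps=2000):
    """Two-sided p value via Simpson's rule on [0, |t|]; steps must be even."""
    upper = abs(t)
    h = upper / steps
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central_mass = total * h / 3  # probability of 0 <= T <= |t|
    return max(0.0, 1.0 - 2.0 * central_mass)


def student_t_test(a, b):
    """Equal-variance two-sample t test; requires non-zero pooled variance."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / math.sqrt(pooled * (1 / na + 1 / nb))
    return t, two_sided_p(t, df)


# Invented log-expression values: (COPD samples, control samples).
expression = {
    "GENE_A": ([8.1, 8.4, 8.2], [6.0, 6.3, 6.1]),
    "GENE_B": ([5.0, 5.2, 5.1], [5.1, 5.0, 5.2]),
}
degs = {}
for gene, (copd, control) in expression.items():
    t, p = student_t_test(copd, control)
    if p < 0.05:
        degs[gene] = "up" if t > 0 else "down"
print(degs)
```

With three samples per group the test has only four degrees of freedom, so the sign of t is used here simply to label a significant gene as up- or downregulated in COPD.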
Construction of COPD PPI Network
DEGs were submitted to the Search Tool for the Retrieval of Interacting Genes (STRING) 9.0 [15], and PPIs between COPD signature genes were retained. All associations in STRING are provided with a probabilistic confidence score, and in our analysis only interactions with a score of at least 0.4 were retained. We further performed network clustering [16] and divided the PPI network into subnetworks. Biological annotation of the resulting subnetworks was performed with BiNGO [17] in Cytoscape [18].
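The edge-filtering step can be illustrated with a short sketch. It assumes the interactions are available as (protein, protein, score) tuples with scores on a 0 to 1 scale; the protein names, scores, and function name below are hypothetical and do not come from STRING.

```python
from collections import defaultdict


def build_ppi_network(scored_edges, signature, min_score=0.4):
    """Keep edges between signature genes whose confidence score passes the cutoff."""
    adjacency = defaultdict(set)
    for a, b, score in scored_edges:
        if score >= min_score and a in signature and b in signature and a != b:
            adjacency[a].add(b)
            adjacency[b].add(a)
    return dict(adjacency)


# Invented example data.
signature_genes = {"P1", "P2", "P3", "P4"}
scored_edges = [
    ("P1", "P2", 0.95),
    ("P1", "P3", 0.40),  # exactly at the cutoff, so retained
    ("P2", "P4", 0.35),  # below the cutoff, so dropped
    ("P3", "P9", 0.90),  # P9 is not a signature gene, so dropped
]
network = build_ppi_network(scored_edges, signature_genes)
print(network)
```

Storing the network as an adjacency mapping keeps later steps, such as counting interaction partners or clustering, straightforward.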
Drug Prediction Using cMap
The COPD gene signature was used to query cMap to find potential drugs for use in COPD patients. cMap [19] is an in silico method for predicting drugs that could reverse, or induce, the biological state encoded in a particular gene expression signature. It is a collection of more than 7,000 genome-wide transcriptional expression profiles from cultured human cells treated with 1,309 bioactive small molecules. The profiles are organized into instances, each of which represents a treatment and control pair together with the list of genes ordered by their extent of differential expression between that pair. The query gene signature is compared with each rank-ordered list to determine whether upregulated query genes tend to appear near the top of the list and downregulated query genes near the bottom (“positive connectivity”) or vice versa (“negative connectivity”), yielding a “connectivity score” ranging from −1 to 1. A high positive connectivity score indicates that the corresponding perturbagen (see Notes) induced the expression of the query signature. A high negative connectivity score indicates that the corresponding perturbagen reversed the expression of the query signature. All instances in the database are then ranked according to their connectivity scores: those at the top are most strongly correlated with the query signature and those at the bottom are most strongly anticorrelated. Gene symbols for the COPD gene signature were converted into Affymetrix probe set IDs, as cMap requires. Because a single gene can be represented by multiple probe sets and cMap accepts at most 1,000 probe sets per input, we ranked the DEGs by their p values and used the top 300 upregulated (or downregulated) genes for querying.
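The scoring logic described above can be sketched in Python. This is a simplified illustration of a Kolmogorov-Smirnov-style connectivity score in the spirit of the method of Lamb et al. [19], not the cMap implementation itself; the function names, gene labels, and instance names are hypothetical. Each ranked list is assumed to run from the gene most upregulated by the treatment to the gene most downregulated.

```python
def ks_score(ranked_genes, tags):
    """Signed KS-like statistic: positive if tags cluster near the top of the list."""
    n = len(ranked_genes)
    position = {gene: i + 1 for i, gene in enumerate(ranked_genes)}
    v = sorted(position[g] for g in tags if g in position)
    t = len(v)
    if t == 0:
        return 0.0
    a = max((j + 1) / t - v[j] / n for j in range(t))
    b = max(v[j] / n - j / t for j in range(t))
    return a if a > b else -b


def raw_connectivity(ranked_genes, up_tags, down_tags):
    """Raw score: zero when up and down tags are enriched at the same end."""
    ks_up = ks_score(ranked_genes, up_tags)
    ks_down = ks_score(ranked_genes, down_tags)
    if (ks_up > 0) == (ks_down > 0):
        return 0.0
    return ks_up - ks_down


def normalize(raw_scores):
    """Scale positive scores by the maximum and negative scores by the minimum."""
    highest, lowest = max(raw_scores.values()), min(raw_scores.values())
    scaled = {}
    for name, s in raw_scores.items():
        if s > 0:
            scaled[name] = s / highest
        elif s < 0:
            scaled[name] = -(s / lowest)
        else:
            scaled[name] = 0.0
    return scaled


genes = [f"g{i}" for i in range(1, 11)]
up_query, down_query = ["g1", "g2"], ["g9", "g10"]
instances = {
    "induces": genes,                  # query up genes at the top
    "reverses": list(reversed(genes)), # query up genes at the bottom
    "mixed": ["g1", "g9", "g2", "g10", "g3", "g4", "g5", "g6", "g7", "g8"],
}
raw = {name: raw_connectivity(r, up_query, down_query) for name, r in instances.items()}
connectivity = normalize(raw)
print(connectivity)
```

Under this sketch, an instance whose ranked list mirrors the query receives a score of 1, the reversed list receives −1, and a list in which both tag sets sit at the same end receives 0. Candidate drugs for reversing a disease signature are those with the most negative scores.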
Results
Differentially Expressed PPI Network of COPD
A total of 680 genes upregulated and 530 genes downregulated in COPD were identified (Tables 1, 2). The MSigDB investigation found that upregulated genes were highly similar to the gene signature that responded to interferon [20–22] (Table 3). Downregulated genes were similar to genes downregulated in erythroid progenitor cells from fetal livers of E13.5 embryos with KLF1 knocked out [23] (Table 3).
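The similarity between a DEG list and a published signature can be expressed as the probability of observing at least the measured overlap by chance. A minimal sketch follows, assuming a hypergeometric model. The source does not state which statistic underlies the MSigDB comparison, so this is an illustration rather than the authors' procedure, and the function name and all counts are hypothetical.

```python
from math import comb


def overlap_p_value(universe, signature_size, query_size, overlap):
    """P(X >= overlap) when query_size genes are drawn from the universe
    and signature_size of the universe genes belong to the signature."""
    total = comb(universe, query_size)
    upper = min(signature_size, query_size)
    tail = sum(
        comb(signature_size, k) * comb(universe - signature_size, query_size - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total


# Invented counts: 1,000 measured genes, a 50-gene published signature,
# a 100-gene query list, and 20 genes shared between the two.
p_overlap = overlap_p_value(universe=1000, signature_size=50, query_size=100, overlap=20)
print(p_overlap)
```

In this toy setting about five shared genes would be expected by chance, so an overlap of 20 yields a very small p value.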
Mining Network Biology of COPD
A PPI network consisting of 814 genes/proteins and 2,613 interactions was built using STRING. The ten genes/proteins with the most interaction partners were STAT1, AR, ISG15, UBE2L6, TAP1, IRF9, CREB1, XPO1, PSMB9, and YWHAZ. Network clustering identified 30 subnetworks with at least 6 members in the original network. The largest subnetwork was enriched with genes involved in the response to virus infection (corrected p = 3.13E−14; Table 4). The second largest subnetwork was enriched with genes involved in antigen processing and presentation (corrected p = 1.58E−23). The third largest subnetwork was enriched with genes involved in the regulation of the mitotic cell cycle (corrected p = 4.28E−06). The top ten subnetworks are shown in Fig. 1 and listed in Table 4.
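Ranking proteins by their number of interaction partners amounts to computing node degree in the undirected PPI network. The sketch below assumes the interactions are given as protein pairs; the pairs and the function name are invented for illustration and do not reproduce the COPD network.

```python
from collections import Counter


def rank_hubs(interactions, top_n=10):
    """Return the top_n proteins by degree, ignoring duplicates and self-loops."""
    unique_pairs = {frozenset(pair) for pair in interactions if pair[0] != pair[1]}
    degree = Counter()
    for pair in unique_pairs:
        for protein in pair:
            degree[protein] += 1
    # Sort by descending degree, then alphabetically to make ties deterministic.
    return sorted(degree.items(), key=lambda item: (-item[1], item[0]))[:top_n]


toy_interactions = [
    ("A", "B"), ("A", "C"), ("A", "D"),
    ("B", "C"), ("C", "B"),  # the same undirected edge listed twice
    ("D", "D"),              # self-loop, ignored
]
hubs = rank_hubs(toy_interactions, top_n=2)
print(hubs)
```

Treating each pair as an unordered set prevents an interaction reported in both directions from being counted twice.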
cMap Predicted Potential Drugs that May Be Used to Treat COPD
The cMap analysis ranked helveticoside, disulfiram, and lanatoside C as the top three candidate drugs for COPD (Table 5). Helveticoside, a cardiac glycoside, is an active cytotoxic constituent isolated from the seeds of Descurainia sophia and has been shown to be cytotoxic to human cancer cell lines [24]. Disulfiram is an aldehyde dehydrogenase (ALDH) inhibitor that has long been used clinically as an alcohol deterrent. In cultured prostate cancer cells, disulfiram induces oxidative stress, reduces ALDH and DNA methyltransferase activities, and inhibits DNA replication [25, 26]. Lanatoside C sensitizes glioblastoma (GBM) cells to TNF-related apoptosis-inducing ligand (TRAIL)-induced apoptosis in a GBM xenograft model in vivo. On its own, lanatoside C acts as a therapeutic agent against GBM by activating a caspase-independent cell death pathway [27]. The therapeutic effects of these predicted drugs on COPD may be worth further investigation.
Discussion
Cluster 1 was enriched with genes involved in the response to virus infection. COPD, as a chronic airway disease, is characterized by airflow obstruction that is not fully reversible and by symptoms of cough and sputum production. These symptoms can worsen with exposure to microbial infections [28]. Rhinoviruses (RVs) are the most frequently detected viruses during acute exacerbation [29], and viral infection is associated with a rapid decline in lung function and severe symptoms that often require hospitalization. In addition, we found ISG15 and MX1 in cluster 1, both of which were upregulated in COPD patients. A previous study [30] reported that an antiviral pretreatment effect was associated with increased expression of the antiviral genes IFN-stimulated gene 15 (ISG15) and Mx1, and the effect was maintained even when IFN-β levels in the supernatant of A549 cells were undetectable. IFN-γ levels are increased in COPD patients compared with healthy subjects and are further elevated during viral exacerbations. Southworth et al. [31] demonstrated that IFN-γ-induced STAT-1 signaling is corticosteroid resistant in alveolar macrophages (AMs) and that targeting IFN-γ signaling with JAK inhibitors is a potentially novel anti-inflammatory strategy in COPD. Interestingly, Bakke et al. [32] reported significant associations between the binary COPD phenotype and STAT1. We also found IRF7 and IRF9 in this cluster. It has been reported that mRNA expression of IRF7 can be induced by intact RV-1B [33].
Cluster 2, which was characterized by antigen processing and presentation, included PSMB8, PSMB9, TAP1, and TAP2, which were also reported by Fujino et al. [34]. Fujino et al. demonstrated that interferon-stimulated genes involved in the antigen processing and presentation pathway and genes involved in cell cycle progression were enriched in ATII cells of COPD patients. Using the same data as Fujino et al., our analysis recaptured their primary finding and further depicted the underlying PPI network.
Cluster 6, which was characterized by regulation of transcription, included CREB1 and CREBBP, both of which were downregulated in COPD. Activated CREB protein has histone acetyltransferase activity and increases histone acetylation and transcriptional activation of chromatin. In a study conducted by Holownia et al. [35], 21 stable COPD patients who received 12 μg formoterol b.i.d. were assayed before and after 3 months of add-on therapy consisting of 18 μg tiotropium q.d. After therapy, the mean levels of CREB and phosphorylated CREB in cytosol and nuclei had decreased by about 30 %. In addition, our analysis found that HAT1, which is involved in the rapid acetylation of newly synthesized cytoplasmic histones, was downregulated in COPD and was the hub protein of cluster 7, a cluster that was not significantly enriched with any gene ontology annotation. Compared to healthy controls, COPD patients showed low histone deacetylase (HDAC) activity in their AMs [36, 37]. The reduction of HDAC activity may be associated with smoking exposure through inflammatory pathways [38]. Our analysis suggests that, besides HDAC, the role of histone acetylases may also be worth further investigation.
Cluster 8 was characterized by the regulation of Rho protein signal transduction. Rho GTPases have been implicated in several pulmonary diseases such as pulmonary hypertension, pulmonary embolism, COPD, acute lung injury, and acute respiratory distress syndrome [39]. Findings by Richens et al. [40] advance the hypothesis that impaired efferocytosis may contribute to the pathogenesis of COPD and suggest the therapeutic potential of drugs that target the RhoA-Rho kinase pathway.
Conclusions
In our study, we performed a comprehensive analysis of the gene expression profiles of COPD versus control to screen for DEGs and submitted those genes to MSigDB to search for possible associations with previously published gene expression signatures. We then constructed a COPD PPI network and used cMap to query for potential drugs to treat COPD patients. We further discussed how biological pathways are altered in the cells of patients with COPD and identified small-molecule drugs that may counteract these changes, which may open a new avenue for the medical treatment of patients with COPD.
Notes
A perturbagen is a term used to describe an expressed peptide or protein fragment that disrupts physiological processes in mammalian cells and thereby identifies a novel target for drug discovery. The perturbagens may be introduced into the cells using viral-based libraries. This approach is part of a functional genomics approach in which the function of an unknown gene is ascertained by affecting its activity within the cell.
References
Rabe KF, Hurd S, Anzueto A, Barnes PJ, Buist SA, Calverley P, Fukuchi Y, Jenkins C, Rodriguez-Roisin R, van Weel C, Zielinski J (2007) Global strategy for the diagnosis, management, and prevention of chronic obstructive pulmonary disease: GOLD executive summary. Am J Respir Crit Care Med 176(6):532–555
Murphy TF, Sethi S (2002) Chronic obstructive pulmonary disease. Drugs Aging 19(10):761–775
Petty TL (2003) Definition, epidemiology, course, and prognosis of COPD. Clin Cornerstone 5(1):1–10
Abramson M, Matheson M, Wharton C, Sim M (2002) Prevalence of respiratory symptoms related to chronic obstructive pulmonary disease and asthma among middle aged and older adults. Respirology 7(4):325–331
Hammond T, Ford A, de Bravo BF, Cote J (2012) Chronic obstructive pulmonary disease (COPD) and lung cancer. http://www.stopcancerfund.org/p-lung-cancer/copd-and-lung-cancer/
Nazir SA, Erbland ML (2009) Chronic obstructive pulmonary disease (COPD). Drugs Aging 26(10):813–831
Barnes PJ (2002) New treatments for COPD. Nat Rev Drug Discov 1(6):437–446
Berndt A, Leme AS, Shapiro SD (2012) Emerging genetics of COPD. EMBO Mol Med 4(11):1144–1155
Ning W, Li CJ, Kaminski N, Feghali-Bostwick CA, Alber SM, Di YP, Otterbein SL, Song R, Hayashi S, Zhou Z, Pinsky DJ, Watkins SC, Pilewski JM, Sciurba FC, Peters DG, Hogg JC, Choi AMK (2004) Comprehensive gene expression profiles reveal pathways related to the pathogenesis of chronic obstructive pulmonary disease. Proc Natl Acad Sci USA 101(41):14895–14900
Gan W, Man S, Senthilselvan A, Sin D (2004) Association between chronic obstructive pulmonary disease and systemic inflammation: a systematic review and a meta-analysis. Thorax 59(7):574–580
Dai M, Wang P, Boyd AD, Kostov G, Athey B, Jones EG, Bunney WE, Myers RM, Speed TP, Akil H, Watson SJ, Meng F (2005) Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data. Nucleic Acids Res 33(20):e175
Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, Speed TP (2003) Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 4(2):249–264
Liberzon A, Subramanian A, Pinchback R, Thorvaldsdottir H, Tamayo P, Mesirov JP (2011) Molecular signatures database (MSigDB) 3.0. Bioinformatics 27(12):1739–1740. doi:10.1093/bioinformatics/btr260
Cantu E, Lederer D, Meyer K, Milewski R, Suzuki Y, Shah R, Diamond J, Meyer N, Tobias J, Baldwin D (2013) Gene set enrichment analysis of bronchial alveolar lavage fluid identifies key innate immune pathways in primary graft dysfunction after lung transplantation. J Heart Lung Transpl 32(4):S41–S42
Szklarczyk D, Franceschini A, Kuhn M, Simonovic M, Roth A, Minguez P, Doerks T, Stark M, Muller J, Bork P, Jensen LJ, von Mering C (2011) The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored. Nucleic Acids Res 39(Database issue):D561–D568. doi:10.1093/nar/gkq973
Enright AJ, Van Dongen S, Ouzounis CA (2002) An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res 30(7):1575–1584
Maere S, Heymans K, Kuiper M (2005) BiNGO: a cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics 21(16):3448–3449
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T (2003) Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 13(11):2498–2504
Lamb J, Crawford ED, Peck D, Modell JW, Blat IC, Wrobel MJ, Lerner J, Brunet JP, Subramanian A, Ross KN, Reich M, Hieronymus H, Wei G, Armstrong SA, Haggarty SJ, Clemons PA, Wei R, Carr SA, Lander ES, Golub TR (2006) The connectivity map: using gene-expression signatures to connect small molecules, genes, and disease. Science 313(5795):1929–1935
Der SD, Zhou A, Williams BR, Silverman RH (1998) Identification of genes differentially regulated by interferon alpha, beta, or gamma using oligonucleotide arrays. Proc Natl Acad Sci USA 95(26):15623–15628
Sana TR, Janatpour MJ, Sathe M, McEvoy LM, McClanahan TK (2005) Microarray analysis of primary endothelial cells challenged with different inflammatory and immune cytokines. Cytokine 29(6):256–269
Radaeva S, Jaruga B, Hong F, Kim WH, Fan S, Cai H, Strom S, Liu Y, El-Assal O, Gao B (2002) Interferon-alpha activates multiple STAT signals and down-regulates c-Met in primary human hepatocytes. Gastroenterology 122(4):1020–1034
Pilon AM, Arcasoy MO, Dressman HK, Vayda SE, Maksimova YD, Sangerman JI, Gallagher PG, Bodine DM (2008) Failure of terminal erythroid differentiation in EKLF-deficient mice is associated with cell cycle perturbation and reduced expression of E2F2. Mol Cell Biol 28(24):7394–7401. doi:10.1128/MCB.01087-08
Lee YJ, Kim NS, Kim H, Yi JM, Oh SM, Bang OS, Lee J (2013) Cytotoxic and anti-inflammatory constituents from the seeds of Descurainia sophia. Arch Pharmacol Res 36(5):536–541
Iljin K, Ketola K, Vainio P, Halonen P, Kohonen P, Fey V, Grafstrom RC, Perala M, Kallioniemi O (2009) High-throughput cell-based screening of 4910 known drugs and drug-like small molecules identifies disulfiram as an inhibitor of prostate cancer cell growth. Clin Cancer Res 15(19):6070–6078. doi:10.1158/1078-0432.CCR-09-1035
Lin J, Haffner MC, Zhang Y, Lee BH, Brennen WN, Britton J, Kachhap SK, Shim JS, Liu JO, Nelson WG, Yegnasubramanian S, Carducci MA (2011) Disulfiram is a DNA demethylating agent and inhibits prostate cancer cell growth. Prostate 71(4):333–343. doi:10.1002/pros.21247
Badr CE, Wurdinger T, Nilsson J, Niers JM, Whalen M, Degterev A, Tannous BA (2011) Lanatoside C sensitizes glioblastoma cells to tumor necrosis factor-related apoptosis-inducing ligand and induces an alternative cell death pathway. Neuro Oncol 13(11):1213–1224. doi:10.1093/neuonc/nor067
MacNee W (2005) Pathogenesis of chronic obstructive pulmonary disease. Proc Am Thorac Soc 2(4):258–266 discussion 290–291
Mallia P, Message SD, Kebadze T, Parker HL, Kon OM, Johnston SL (2006) An experimental model of rhinovirus-induced chronic obstructive pulmonary disease exacerbations: a pilot study. Respir Res 7:116
Gaajetaan GR, Geelen TH, Vernooy JH, Dentener MA, Reynaert NL, Rohde GG, Beuken EV, Grauls GE, Bruggeman CA, Stassen FR (2013) Interferon-beta induces a long-lasting antiviral state in human respiratory epithelial cells. J Infect 66(2):163–169. doi:10.1016/j.jinf.2012.11.008
Southworth T, Metryka A, Lea S, Farrow S, Plumb J, Singh D (2012) IFN-γ synergistically enhances LPS signalling in alveolar macrophages from COPD patients and controls by corticosteroid-resistant STAT1 activation. Br J Pharmacol 166(7):2070–2083. doi:10.1111/j.1476-5381.2012.01907.x
Bakke PS, Zhu G, Gulsvik A, Kong X, Agusti AG, Calverley PM, Donner CF, Levy RD, Make BJ, Pare PD, Rennard SI, Vestbo J, Wouters EF, Anderson W, Lomas DA, Silverman EK, Pillai SG (2011) Candidate genes for COPD in two large data sets. Eur Respir J 37(2):255–263. doi:10.1183/09031936.00091709
Wang Q, Nagarkar DR, Bowman ER, Schneider D, Gosangi B, Lei J, Zhao Y, McHenry CL, Burgens RV, Miller DJ, Sajjan U, Hershenson MB (2009) Role of double-stranded RNA pattern recognition receptors in rhinovirus-induced airway epithelial cell responses. J Immunol 183(11):6989–6997. doi:10.4049/jimmunol.0901386
Fujino N, Ota C, Takahashi T, Suzuki T, Suzuki S, Yamada M, Nagatomi R, Kondo T, Yamaya M, Kubo H (2012) Gene expression profiles of alveolar type II cells of chronic obstructive pulmonary disease: a case-control study. BMJ Open. doi:10.1136/bmjopen-2012-001553
Holownia A, Mroz RM, Skopinski T, Kolodziejczyk A, Chyczewska E, Braszko JJ (2013) Tiotropium increases PPAR gamma and decreases CREB in cells isolated from induced sputum of COPD patients. Adv Exp Med Biol 756:9–14. doi:10.1007/978-94-007-4549-0_2
Barnes PJ (2009) Role of HDAC2 in the pathophysiology of COPD. Annu Rev Physiol 71:451–464. doi:10.1146/annurev.physiol.010908.163257
Ito K, Ito M, Elliott WM, Cosio B, Caramori G, Kon OM, Barczyk A, Hayashi S, Adcock IM, Hogg JC, Barnes PJ (2005) Decreased histone deacetylase activity in chronic obstructive pulmonary disease. N Engl J Med 352(19):1967–1976
Chen Y, Huang P, Ai W, Li X, Guo W, Zhang J, Yang J (2012) Histone deacetylase activity is decreased in peripheral blood monocytes in patients with COPD. J Inflamm (Lond) 9:10. doi:10.1186/1476-9255-9-10
Storck EM, Wojciak-Stothard B (2012) Rho GTPases in pulmonary vascular dysfunction. Vasc Pharmacol 58(3):202–210. doi:10.1016/j.vph.2012.09.004
Richens TR, Linderman DJ, Horstmann SA, Lambert C, Xiao YQ, Keith RL, Boe DM, Morimoto K, Bowler RP, Day BJ, Janssen WJ, Henson PM, Vandivier RW (2009) Cigarette smoke impairs clearance of apoptotic cells through oxidant-dependent activation of RhoA. Am J Respir Crit Care Med 179(11):1011–1021. doi:10.1164/rccm.200807-1148OC
Conflict of interest
None.
Additional information
The Publisher and Editor retract this article in accordance with the recommendations of the Committee on Publication Ethics (COPE). After a thorough investigation we have strong reason to believe that the peer review process was compromised.
About this article
Cite this article
Bao, H., Wang, J., Zhou, D. et al. RETRACTED ARTICLE: Protein–Protein Interaction Network Analysis in Chronic Obstructive Pulmonary Disease. Lung 192, 87–93 (2014). https://doi.org/10.1007/s00408-013-9509-x
DOI: https://doi.org/10.1007/s00408-013-9509-x