Abstract
Identifying gene–gene and gene–environment interactions may help us to better describe the genetic architecture for complex traits. While advances have been made in identifying genetic variants associated with complex traits through more dense panels of genetic variants and larger sample sizes, genome-wide interaction analyses are still limited in power to detect interactions with small effect sizes, rare frequencies, and higher order interactions. This chapter outlines methods for detecting both gene-gene and gene-environment interactions both through explicit tests for interactions (i.e., ones in which the interaction is tested directly) and non-explicit tests (i.e., ones in which an interaction is allowed for in the test, but does not test for the interaction directly) as well as approaches for increasing power by reducing the search space. Issues relating to multiple test correction, replication, and the reporting of interaction results in publications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Niel C, Sinoquet C, Dina C et al (2015) A survey about methods dedicated to epistasis detection. Front Genet 6:285
Ritchie MD (2015) Finding the epistasis needles in the genome-wide haystack. Methods Mol Biol 1253:19–33
Gusareva ES, Van Steen K (2014) Practical aspects of genome-wide association interaction analysis. Hum Genet 133(11):1343–1358
Tiret L (2002) Gene-environment interaction: a central concept in multifactorial diseases. Proc Nutr Soc 61(4):457–463
Ottman R (1990) An epidemiologic approach to gene-environment interaction. Genet Epidemiol 7(3):177–185
Manolio TA, Collins FS, Cox NJ et al (2009) Finding the missing heritability of complex diseases. Nature 461(7265):747–753
Bateson W (1909) Mendel’s principles of heredity. Cambridge University Press, Cambridge
Cordell HJ (2002) Epistasis: what it means, what it doesn’t mean, and statistical methods to detect it in humans. Hum Mol Genet 11(20):2463–2468
Moore JH (2005) A global view of epistasis. Nat Genet 37(1):13–14
Ma J, Thabane L, Beyene J et al (2016) Power analysis for population-based longitudinal studies investigating gene-environment interactions in chronic diseases: a simulation study. PLoS One 11(2):e0149940
Dunham I, Kundaje A, Aldred SF et al (2012) An integrated encyclopedia of DNA elements in the human genome. Nature 489(7414):57–74
Wang K, Li M, Hakonarson H (2010) ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res 38(16):e164
Bush WS, Dudek SM, Ritchie MD (2009) Biofilter: a knowledge-integration system for the multi-locus analysis of genome-wide association studies. Pac Symp Biocomput:368–379
Price AL, Patterson NJ, Plenge RM et al (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38(8):904–909
Howie BN, Donnelly P, Marchini J (2009) A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet 5(6):e1000529
Purcell S, Neale B, Todd-Brown K et al (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81(3):559–575
Ueki M, Cordell HJ (2012) Improved statistics for genome-wide interaction analysis. PLoS Genet 8(4):e1002625
Wu X, Dong H, Luo L et al (2010) A novel statistic for genome-wide interaction analysis. PLoS Genet 6(9):e1001131
Wan X, Yang C, Yang Q et al (2010) BOOST: a fast approach to detecting gene-gene interactions in genome-wide case-control studies. Am J Hum Genet 87(3):325–340
Hahn LW, Ritchie MD, Moore JH (2003) Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions. Bioinformatics 19(3):376–382
Ritchie MD, Hahn LW, Roodi N et al (2001) Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am J Hum Genet 69(1):138–147
Calle ML, Urrea V, Malats N et al (2010) mbmdr: an R package for exploring gene-gene interactions associated with binary or quantitative traits. Bioinformatics 26(17):2198–2199
Gui J, Moore JH, Williams SM et al (2013) A simple and computationally efficient approach to multifactor dimensionality reduction analysis of gene-gene interactions for quantitative traits. PLoS One 8(6):e66545
Van der Auwera GA, Carneiro MO, Hartl C et al (2013) From FASTQ data to high confidence variant calls: the genome analysis toolkit best practices pipeline. Curr Protoc Bioinformatics 43:11.10 1–11.1033
Li H, Durbin R (2010) Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26(5):589–595
Li H, Handsaker B, Wysoker A et al (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25(16):2078–2079
McKenna A, Hanna M, Banks E et al (2010) The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20(9):1297–1303
DePristo MA, Banks E, Poplin R et al (2011) A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet 43(5):491–498
Dewan AT, Egan KB, Hellenbrand K et al (2012) Whole-exome sequencing of a pedigree segregating asthma. BMC Med Genet 13(1):95
Marchini J, Donnelly P, Cardon LR (2005) Genome-wide strategies for detecting multiple loci that influence complex diseases. Nat Genet 37(4):413–417
Calle ML, Urrea V, Vellalta G, Malats N, Steen KV (2008) Improving strategies for detecting genetic patterns of disease susceptibility in association studies. Stat Med 27(30):6532–6546
Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B 57(1):289–300
Nyholt DR (2004) A simple correction for multiple testing for single-nucleotide polymorphisms in linkage disequilibrium with each other. Am J Hum Genet 74(4):765–769
North BV, Curtis D, Sham PC (2002) A note on the calculation of empirical P values from Monte Carlo procedures. Am J Hum Genet 71(2):439–441
North BV, Curtis D, Sham PC (2003) A note on calculation of empirical P values from Monte Carlo procedure. Am J Hum Genet 72(2):498–499
Murk W, DeWan AT (2016) Exhaustive genome-wide search for SNP-SNP interactions across 10 human diseases. G3 (Bethesda) 6(7):2043–2050
Gauderma WJ, Morrison JM, QUANTO 1.1: A computer program for power and sample size calculations for genetic-epidemiology studies. http://hydra.usc.edu/gxe2006
Uzun A, Sharma S, Padbury J (2012) A bioinformatics approach to preterm birth. Am J Reprod Immunol 67(4):273–277
Uzun A, Triche EW, Schuster J et al (2016) dbPEC: a comprehensive literature-based database for preeclampsia related genes and phenotypes. Database (Oxford). https://doi.org/10.1093/database/baw006. pii:baw006
Shearer AE, Eppsteiner RW, Booth KT et al (2014) Utilizing ethnic-specific differences in minor allele frequency to recategorize reported pathogenic deafness variants. Am J Hum Genet 95(4):445–453
Murk W, DeWan AT (2016) Genome-wide search identifies a gene-gene interaction between 20p13 and 2q14 in asthma. BMC Genet 17(1):102
Ma L, Clark AG, Keinan A (2013) Gene-based testing of interactions in association studies of quantitative traits. PLoS Genet 9(2):e1003321
Wu MC, Lee S, Cai T et al (2011) Rare-variant association testing for sequencing data with the sequence kernel association test. Am J Hum Genet 89(1):82–93
Lin X, Lee S, Wu MC et al (2016) Test for rare variants by environment interactions in sequencing association studies. Biometrics 72(1):156–164
Chen H, Meigs JB, Dupuis J (2014) Incorporating gene-environment interaction in testing for association with rare genetic variants. Hum Hered 78(2):81–90
Murk W, Bracken MB, DeWan AT (2015) Confronting the missing epistasis problem: on the reproducibility of gene-gene interactions. Hum Genet 134(8):837–849
Greene CS, Penrod NM, Williams SM et al (2009) Failure to replicate a genetic association may provide important clues about genetic architecture. PLoS One 4(6):e5639
Fleiss JL (1993) The statistical basis of meta-analysis. Stat Methods Med Res 2(2):121–145
Fisher RA (1948) Combining independent tests of significance. Am Stat 2:30
Piegorsch WW, Weinberg CR, Taylor JA (1994) Non-hierarchical logistic models and case-only designs for assessing susceptibility in population-based case-control studies. Stat Med 13(2):153–162
Begg CB, Zhang ZF (1994) Statistical analysis of molecular epidemiology studies employing case-series. Cancer Epidemiol Biomark Prev 3(2):173–175
Hodgson ME, Olshan AF, North KE et al (2012) The case-only independence assumption: associations between genetic polymorphisms and smoking among controls in two population-based studies. Int J Mol Epidemiol Genet 3(4):333–360
Yang Q, Khoury MJ, Sun F et al (1999) Case-only design to measure gene-gene interaction. Epidemiology 10(2):167–170
The International HapMap Consortium (2003) The international HapMap project. Nature 426:789–796
Yang CH, Lin YD, Wu SJ et al (2015) High order gene-gene interactions in eight single nucleotide polymorphisms of renin-angiotensin system genes for hypertension association study. Biomed Res Int 2015:454091
Wu C, Zhang H, Liu X et al (2009) Detecting essential and removable interactions in genome-wide association studies. Stat Interface 2(2):161–170
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
DeWan, A.T. (2018). Gene-Gene and Gene-Environment Interactions. In: Evangelou, E. (eds) Genetic Epidemiology. Methods in Molecular Biology, vol 1793. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-7868-7_7
Download citation
DOI: https://doi.org/10.1007/978-1-4939-7868-7_7
Published:
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-7867-0
Online ISBN: 978-1-4939-7868-7
eBook Packages: Springer Protocols