Abstract
Advancements in high-throughput technologies, genomics, transcriptomics, and metabolomics play an important role in obtaining biological information about living organisms. The field of computational biology and bioinformatics has experienced significant growth with the advent of high-throughput sequencing technologies and other high-throughput techniques. The resulting large amounts of data present both opportunities and challenges for data analysis. Big data analysis has become essential for extracting meaningful insights from the massive amount of data. In this chapter, we provide an overview of the current status of big data analysis in computational biology and bioinformatics. We discuss the various aspects of big data analysis, including data acquisition, storage, processing, and analysis. We also highlight some of the challenges and opportunities of big data analysis in this area of research. Despite the challenges, big data analysis presents significant opportunities like development of efficient and fast computing algorithms for advancing our understanding of biological processes, identifying novel biomarkers for breeding research and developments, predicting disease, and identifying potential drug targets for drug development programs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Ching T, Himmelstein DS, Beaulieu-Jones BK, Kalinin AA, Do BT, Way GP, Alipanahi B (2018) Opportunities and obstacles for deep learning in biology and medicine. J R Soc Interface 15(141):20170387
Yang J, Zhang X, Liu S (2017) Big data analytics in bioinformatics: algorithms, methods, and applications. Int J Data Min Bioinform 17(2):105–124
Yang CT, Kristiani E, Leong YK, Chang JS (2023) Big data and machine learning driven bioprocessing – recent trends and critical analysis. Bioresour Technol 372:128625
Kim JH, Kim S (2019) Big data analysis in systems biology. J Microbiol Biotechnol 29(2):171–180
Ljosa V, Caie PD, Ter Horst R, Sokolnicki KL, Jenkins EL, Daya S, Carpenter AE (2013) Comparison of methods for image-based profiling of cellular morphological responses to small-molecule treatment. J Biomol Screen 18(10):1321–1329
Khan S, Khan HU, Nazir S (2022) Systematic analysis of healthcare big data analytics for efficient care and disease diagnosing. Sci Rep 12:22377
Chen Y, Palakal M, Zhou B (2017) Big data analytics in genomics. J Biomed Inform 67:1–3
Chen Y, Liu Y, Yu C (2019) Big data in computational biology: a review. Biomed Pharmacother 110:524–532
Peng Y, Tang H (2016) Big data analytics in cancer research. Curr Pharmacol Rep 2(6):305–313
Sharma A, Menon R (2019) Big data analytics in genomics: a review. Genomics 111(1):43–50
Karczewski, Konrad J, Snyder MP (2018) Integrative omics for health and disease. Nat Rev Genet 19(5):299–310
Nobile MS, Sealfon SC (2017) Big data and machine learning in neuroscience. J Neuroimmune Pharmacol 12(1):1–2
Alqurashi M, Mavromatis C (2018) Big data analytics in computational biology: a review. Curr Bioinforma 13(5):452–462
Leung MK, Delong A, Alipanahi B, Frey BJ (2015) Machine learning in genomic medicine: a review of computational problems and data sets. Proc IEEE 104(1):176–197
Chen L, Li S, Bai Q, Yang J, Jiang S, Miao Y (2021) Review of image classification algorithms based on convolutional neural networks. Remote Sens 13(22):4712
Heldens S, Sclocco A, Dreuning H, Werkhoven BV, Hijma P, Maassen J, Nieuwpoort RV (2022) litstudy: a Python package for literature reviews. SoftwareX 20:Article 101207
Kumar P, Kumar A, Panwar S, Dash S, Sinha K, Chaudhary VK, Ray M (2018) Role of big data in agriculture – a statistical perspective. Ann Agric Res 39(2):210–215
Costa FF (2014) Big data in biomedicine. Drug Discov Today 19(4):433–440
Das S, Chaudhuri S, Chatterjee R (2017) Big data analytics in healthcare and bioinformatics: a survey of the literature. J Biomed Inform 71:93–108
Langmead B, Nellore A (2018) Cloud computing for genomic data analysis and collaboration. Nat Rev Genet 19(4):208–219
Bisong E (2019) Google colaboratory. In: Building machine learning and deep learning models on Google cloud platform. Apress, Berkeley
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Kumar, P., Paul, R.K., Roy, H.S., Yeasin, M., Ajit, Paul, A.K. (2024). Big Data Analysis in Computational Biology and Bioinformatics. In: Mandal, S. (eds) Reverse Engineering of Regulatory Networks. Methods in Molecular Biology, vol 2719. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-3461-5_11
Download citation
DOI: https://doi.org/10.1007/978-1-0716-3461-5_11
Published:
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-3460-8
Online ISBN: 978-1-0716-3461-5
eBook Packages: Springer Protocols