Abstract
Nowadays, educational institutions are one of the biggest producers of data. The rise of e-Learning contents, digital libraries, webinars, learning management systems, online classes and examinations, video surveillance, sensors, and wearables devices contribute to this data explosion. Learning management systems can index millions of students’ data, their interactions, course registrations, social networks, and their Internet research results. Besides, the potential to learn from this population-scale data is massive. By building analytic dashboards using machine learning and deep learning approaches on these datasets, educational organizations can improve the learning experience, teaching skills, and learning environment and drive better teaching and learning outcomes. Some real-world examples are students’ dropouts, students’ behavior, employee and student's health, prevention fraud data and abuse, etc. In present legacy systems, the data silos from the data warehouse could not handle unstructured data. It increases the complexity and cost of transferring data between multiple disparate data systems. Also, there is a performance bottleneck with data throughput while managing multiple data copies in different locations. This paper aims to store all educational data in a central location and handle all structured and unstructured data without any performance bottlenecks. It is proposed to design an enterprise data lake solution for academic organizations using deep learning to predict the outcomes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
A. Pan, Unlocking the potential of machine learning in a Data Lake (2019). https://www.datavirtualizationblog.com/unlocking-the-potential-of-machine-learning-in-a-data-lake/
A. Varangaonkar, Top five deep learning architectures (2018)
A. Hernández-Blanco, B. Herrera-Flores, D. Tomás, B. Navarro-Colorado, A systematic review of deep learning approaches to educational data mining, complexity (2019). https://doi.org/10.1155/2019/1306039
B. Dik, Higher education predictive analytics: the future in the digital age (2019)
Capgemini, The technology of the business data lake, consulting technology outsourcing (2020)
C. Perrotta, N. Selwyn, Deep learning goes to school: toward a relational understanding of AI in education. Learn. Media Technol. 45(3), 251–269 (2020). https://doi.org/10.1080/17439884.2020.1686017
DataFlair, How is machine learning enhancing the future of education? (2020) https://data-flair.training/blogs/machine-learning-in-education/
DataRoot, Machine learning life cycle (2020). https://www.datarobot.com/wiki/machine-learning-life-cycle/
T. Doleck, D.J. Lemay, R.B. Basnet et al., Predictive analytics in education: a comparison of deep learning frameworks. Educ. Inf. Technol. 25, 1951–1963 (2020). https://doi.org/10.1007/s10639-019-10068-4
Enterprise, Enterprise data lake architecture: what to consider when designing (2020)
F. Shaikh, Ten advanced deep learning architectures data scientists should know! (2017)
F.A. Nothaft, M. Ortega, A. Kermany, Building a modern clinical health data lake with delta lake (2020)
M. Fullan, J. Quinn, M. Drummy, M. Gardner, Education Reimagined; The Future of Learning. A collaborative position paper between New Pedagogies for Deep Learning and Microsoft Education (2020). http://aka.ms/HybridLearningPaper
I. Goodfellow, Y. Bengio, A. Courvillec, Deep Learning. (MIT Press, 2016). http://www.deeplearningbook.org
H. Sarmah, Can data lakes solve machine learning workload challenges? (2019) https://analyticsindiamag.com/can-data-lakes-solve-machine-learning-workload-challenges/
H. Vatter, Reality and misconceptions about big data analytics, data lakes, and the future of AI (2019)
J. Patel, Overcoming data silos through big data integration. Int. J. Comput. Sci. Technol. 3(1), 1–6 (2019). https://doi.org/10.5121/IJDMS.2019.1301
L. Dorard, The architecture of a real-world machine learning system (2019)
M.A. Peters, Deep learning, education, and the final stage of automation. Educ. Philos. Theor. 50(6–7), 549–553 (2018). https://doi.org/10.1080/00131857.2017.1348928
M. Schedlbauer, Data Lakes—how to enable advanced analytics and machine learning? (2019)
N. Hobar, How learning analytics can make your teaching more effective (2018). https://www.classtime.com/blog/learning-analytics-make-teaching-more-effective/
O.I. Abiodun, A. Jantan, et al., State-of-the-art in Artificial Neural Network Applications: A Survey. Elsevier 4(11), 1–41 (2018)
O. Genç, Notes on artificial intelligence, machine learning, and deep learning for curious people (2019). https://towardsdatascience.com
K. Palanivel, K. Suresh Joseph, Data lake model to modern educational organizations. Int. Res. J. Eng. Technol. 7(7), 268–276 (2020)
R. Dwivedi, Step-by-step building block for machine learning models (2020)
S. Briggs, Deeper learning: what is it and why is it so effective? (2015). https://www.opencolleges.edu.au/informed/features/deep-learning/
S. Rao, How to build a data pipeline for autonomous driving (2020)
S. Digumarti, Predictive analytics has a future in education (2013). https://analyticsindiamag.com/predictive-analytics-has-a-future-in-education/
S. Tiao, Machine learning and modern data lake (2018)
S. Bhattacharya, N. Matthews, Enterprise data lake architecture: what to consider when designing (2020). https://www.cloudtp.com/doppler/how-to-guide-architecture-patterns-to-consider-when-designing-an-enterprise-data-lake/
G. Manogaran, G. Srivastava, B.A. Muthu, S. Baskar, P.M. Shakeel, C. Hsu, P.M. Kumar, A response-aware traffic offloading scheme using regression machine learning for user-centric large-scale Internet of Things. IEEE Internet Things J. 1–1 (2020).https://doi.org/10.1109/jiot.2020.3022322
T.N. Nguyen, B.-H. Liu, S.-Y. Wang, On new approaches of maximum weighted target coverage and sensor connectivity: hardness and approximation. IEEE Trans. Netw. Sci. Eng. 7(3), 1736–1751 (2020). https://doi.org/10.1109/TNSE.2019.2952369
M. Tim Jones, Deep learning architectures, the rise of artificial intelligence (2017)
S. Tsai, C. Chen, Y. Shiao, et al., Precision education with statistical learning and deep learning: a case study in Taiwan. Int. J. Educ. Technol. High Educ. 17, 12 (2020). https://doi.org/10.1186/s41239-020-00186-2
K. Warburton, Deep learning and education for sustainability. Int. J. Sustain. High. Educ. 4(1), 44–56 (2003). https://doi.org/10.1108/14676370310455332
D. Kieffer, Building an enterprise analytics and BI practice in higher education (2019)
E. Levy, Understanding data lakes and data lake platforms (2018). https://www.upsolver.com/blog/understanding-data-lakes-and-data-lake-platforms
J. Stephan, The charm of security-driven data lake architecture (2020)
J. Jablonski, Building a platform for machine learning and analytics (2019). https://www.cloudtp.com/doppler/building-platform-machine-learning-analytics/
T. Spicer, Data Lakes? Big myths about architecture, strategy, and analytics (2019)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kuppusamy, P., Suresh Joseph, K. (2022). Building an Enterprise Data Lake for Educational Organizations for Prediction Analytics Using Deep Learning. In: Manogaran, G., Shanthini, A., Vadivu, G. (eds) Proceedings of International Conference on Deep Learning, Computing and Intelligence. Advances in Intelligent Systems and Computing, vol 1396. Springer, Singapore. https://doi.org/10.1007/978-981-16-5652-1_6
Download citation
DOI: https://doi.org/10.1007/978-981-16-5652-1_6
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-5651-4
Online ISBN: 978-981-16-5652-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)