Skip to main content

Building an Enterprise Data Lake for Educational Organizations for Prediction Analytics Using Deep Learning

  • Conference paper
  • First Online:
Proceedings of International Conference on Deep Learning, Computing and Intelligence

Abstract

Nowadays, educational institutions are one of the biggest producers of data. The rise of e-Learning contents, digital libraries, webinars, learning management systems, online classes and examinations, video surveillance, sensors, and wearables devices contribute to this data explosion. Learning management systems can index millions of students’ data, their interactions, course registrations, social networks, and their Internet research results. Besides, the potential to learn from this population-scale data is massive. By building analytic dashboards using machine learning and deep learning approaches on these datasets, educational organizations can improve the learning experience, teaching skills, and learning environment and drive better teaching and learning outcomes. Some real-world examples are students’ dropouts, students’ behavior, employee and student's health, prevention fraud data and abuse, etc. In present legacy systems, the data silos from the data warehouse could not handle unstructured data. It increases the complexity and cost of transferring data between multiple disparate data systems. Also, there is a performance bottleneck with data throughput while managing multiple data copies in different locations. This paper aims to store all educational data in a central location and handle all structured and unstructured data without any performance bottlenecks. It is proposed to design an enterprise data lake solution for academic organizations using deep learning to predict the outcomes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. A. Pan, Unlocking the potential of machine learning in a Data Lake (2019). https://www.datavirtualizationblog.com/unlocking-the-potential-of-machine-learning-in-a-data-lake/

  2. A. Varangaonkar, Top five deep learning architectures (2018)

    Google Scholar 

  3. A. Hernández-Blanco, B. Herrera-Flores, D. Tomás, B. Navarro-Colorado, A systematic review of deep learning approaches to educational data mining, complexity (2019). https://doi.org/10.1155/2019/1306039

  4. B. Dik, Higher education predictive analytics: the future in the digital age (2019)

    Google Scholar 

  5. Capgemini, The technology of the business data lake, consulting technology outsourcing (2020)

    Google Scholar 

  6. C. Perrotta, N. Selwyn, Deep learning goes to school: toward a relational understanding of AI in education. Learn. Media Technol. 45(3), 251–269 (2020). https://doi.org/10.1080/17439884.2020.1686017

  7. DataFlair, How is machine learning enhancing the future of education? (2020) https://data-flair.training/blogs/machine-learning-in-education/

  8. DataRoot, Machine learning life cycle (2020). https://www.datarobot.com/wiki/machine-learning-life-cycle/

  9. T. Doleck, D.J. Lemay, R.B. Basnet et al., Predictive analytics in education: a comparison of deep learning frameworks. Educ. Inf. Technol. 25, 1951–1963 (2020). https://doi.org/10.1007/s10639-019-10068-4

    Article  Google Scholar 

  10. Enterprise, Enterprise data lake architecture: what to consider when designing (2020)

    Google Scholar 

  11. F. Shaikh, Ten advanced deep learning architectures data scientists should know! (2017)

    Google Scholar 

  12. F.A. Nothaft, M. Ortega, A. Kermany, Building a modern clinical health data lake with delta lake (2020)

    Google Scholar 

  13. M. Fullan, J. Quinn, M. Drummy, M. Gardner, Education Reimagined; The Future of Learning. A collaborative position paper between New Pedagogies for Deep Learning and Microsoft Education (2020). http://aka.ms/HybridLearningPaper

  14. I. Goodfellow, Y. Bengio, A. Courvillec, Deep Learning. (MIT Press, 2016). http://www.deeplearningbook.org

  15. H. Sarmah, Can data lakes solve machine learning workload challenges? (2019) https://analyticsindiamag.com/can-data-lakes-solve-machine-learning-workload-challenges/

  16. H. Vatter, Reality and misconceptions about big data analytics, data lakes, and the future of AI (2019)

    Google Scholar 

  17. J. Patel, Overcoming data silos through big data integration. Int. J. Comput. Sci. Technol. 3(1), 1–6 (2019). https://doi.org/10.5121/IJDMS.2019.1301

  18. L. Dorard, The architecture of a real-world machine learning system (2019)

    Google Scholar 

  19. M.A. Peters, Deep learning, education, and the final stage of automation. Educ. Philos. Theor. 50(6–7), 549–553 (2018). https://doi.org/10.1080/00131857.2017.1348928

  20. M. Schedlbauer, Data Lakes—how to enable advanced analytics and machine learning? (2019)

    Google Scholar 

  21. N. Hobar, How learning analytics can make your teaching more effective (2018). https://www.classtime.com/blog/learning-analytics-make-teaching-more-effective/

  22. O.I. Abiodun, A. Jantan, et al., State-of-the-art in Artificial Neural Network Applications: A Survey. Elsevier 4(11), 1–41 (2018)

    Google Scholar 

  23. O. Genç, Notes on artificial intelligence, machine learning, and deep learning for curious people (2019). https://towardsdatascience.com

  24. K. Palanivel, K. Suresh Joseph, Data lake model to modern educational organizations. Int. Res. J. Eng. Technol. 7(7), 268–276 (2020)

    Google Scholar 

  25. R. Dwivedi, Step-by-step building block for machine learning models (2020)

    Google Scholar 

  26. S. Briggs, Deeper learning: what is it and why is it so effective? (2015). https://www.opencolleges.edu.au/informed/features/deep-learning/

  27. S. Rao, How to build a data pipeline for autonomous driving (2020)

    Google Scholar 

  28. S. Digumarti, Predictive analytics has a future in education (2013). https://analyticsindiamag.com/predictive-analytics-has-a-future-in-education/

  29. S. Tiao, Machine learning and modern data lake (2018)

    Google Scholar 

  30. S. Bhattacharya, N. Matthews, Enterprise data lake architecture: what to consider when designing (2020). https://www.cloudtp.com/doppler/how-to-guide-architecture-patterns-to-consider-when-designing-an-enterprise-data-lake/

  31. G. Manogaran, G. Srivastava, B.A. Muthu, S. Baskar, P.M. Shakeel, C. Hsu, P.M. Kumar, A response-aware traffic offloading scheme using regression machine learning for user-centric large-scale Internet of Things. IEEE Internet Things J. 1–1 (2020).https://doi.org/10.1109/jiot.2020.3022322

  32. T.N. Nguyen, B.-H. Liu, S.-Y. Wang, On new approaches of maximum weighted target coverage and sensor connectivity: hardness and approximation. IEEE Trans. Netw. Sci. Eng. 7(3), 1736–1751 (2020). https://doi.org/10.1109/TNSE.2019.2952369

  33. M. Tim Jones, Deep learning architectures, the rise of artificial intelligence (2017)

    Google Scholar 

  34. S. Tsai, C. Chen, Y. Shiao, et al., Precision education with statistical learning and deep learning: a case study in Taiwan. Int. J. Educ. Technol. High Educ. 17, 12 (2020). https://doi.org/10.1186/s41239-020-00186-2

  35. K. Warburton, Deep learning and education for sustainability. Int. J. Sustain. High. Educ. 4(1), 44–56 (2003). https://doi.org/10.1108/14676370310455332

    Article  Google Scholar 

  36. D. Kieffer, Building an enterprise analytics and BI practice in higher education (2019)

    Google Scholar 

  37. E. Levy, Understanding data lakes and data lake platforms (2018). https://www.upsolver.com/blog/understanding-data-lakes-and-data-lake-platforms

  38. J. Stephan, The charm of security-driven data lake architecture (2020)

    Google Scholar 

  39. J. Jablonski, Building a platform for machine learning and analytics (2019). https://www.cloudtp.com/doppler/building-platform-machine-learning-analytics/

  40. T. Spicer, Data Lakes? Big myths about architecture, strategy, and analytics (2019)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kuppusamy, P., Suresh Joseph, K. (2022). Building an Enterprise Data Lake for Educational Organizations for Prediction Analytics Using Deep Learning. In: Manogaran, G., Shanthini, A., Vadivu, G. (eds) Proceedings of International Conference on Deep Learning, Computing and Intelligence. Advances in Intelligent Systems and Computing, vol 1396. Springer, Singapore. https://doi.org/10.1007/978-981-16-5652-1_6

Download citation

Publish with us

Policies and ethics